CN114807209B - Method for improving conversion efficiency of saccharomyces cerevisiae DNA fragments - Google Patents

Method for improving conversion efficiency of saccharomyces cerevisiae DNA fragments Download PDF

Info

Publication number
CN114807209B
CN114807209B CN202210322772.1A CN202210322772A CN114807209B CN 114807209 B CN114807209 B CN 114807209B CN 202210322772 A CN202210322772 A CN 202210322772A CN 114807209 B CN114807209 B CN 114807209B
Authority
CN
China
Prior art keywords
methyltransferase
dna
tube
dna fragment
fragment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210322772.1A
Other languages
Chinese (zh)
Other versions
CN114807209A (en
Inventor
杜军
谢天
卢修亮
李涛
岳方正
王文朋
王早霞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Biological Disaster Prevention And Control Center Of State Forestry And Grassland Administration
Tsingke Biotechnology Co Ltd
Original Assignee
Biological Disaster Prevention And Control Center Of State Forestry And Grassland Administration
Tsingke Biotechnology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Biological Disaster Prevention And Control Center Of State Forestry And Grassland Administration, Tsingke Biotechnology Co Ltd filed Critical Biological Disaster Prevention And Control Center Of State Forestry And Grassland Administration
Priority to CN202210322772.1A priority Critical patent/CN114807209B/en
Publication of CN114807209A publication Critical patent/CN114807209A/en
Application granted granted Critical
Publication of CN114807209B publication Critical patent/CN114807209B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/80Vectors or expression systems specially adapted for eukaryotic hosts for fungi
    • C12N15/81Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02EREDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
    • Y02E50/00Technologies for the production of fuel of non-fossil origin
    • Y02E50/10Biofuels, e.g. bio-diesel

Abstract

The invention belongs to the field of synthetic biology, and particularly relates to a method for improving the conversion efficiency of DNA fragments in a host so as to achieve the purpose of industrialized batch assembly of large DNA fragments. The method provided by the invention has the advantages that after the target fragment is subjected to methylation modification, the conversion efficiency is improved, the same amount of DNA is used, the number of transformants growing on a flat plate after conversion coating is obviously improved, the possibility of screening positive clones is greatly improved, the amount of DNA to be assembled is less, the material cost, the labor cost and the time cost of production are reduced, and the method is favorable for industrial production of large-fragment DNA synthesis.

Description

Method for improving conversion efficiency of saccharomyces cerevisiae DNA fragments
Technical field:
the invention belongs to the field of synthetic biology, and particularly relates to a method for improving the conversion efficiency of DNA fragments in a host so as to achieve the purpose of industrialized batch assembly of large DNA fragments.
The background technology is as follows:
synthetic biology is an emerging discipline developed in recent years, and the main objective is to design and construct diverse new devices, networks and biological pathways through DNA-encoded biological component modules, wherein the most critical is how to synthesize or modify genes, and thus, DNA synthesis or modification becomes a key point of synthetic biology. Microorganisms are capable of synthesizing various metabolites, and the synthetase genes of many metabolites tend to exist in clusters and tend to be large, usually several tens or even hundreds of kb, so that it is difficult to synthesize them by chemical synthesis or PCR. Currently, researchers have developed methods for preparing large DNA fragments based on Saccharomyces cerevisiae or E.coli.
Saccharomyces cerevisiae not only has wide application in industry and food, but also has wide application in molecular biology research due to the advantages of clear genetic background, high homologous recombination efficiency and the like. As a first fully sequenced eukaryotic microorganism, saccharomyces cerevisiae is of great importance in molecular biology and biosynthesis of valuable products. The large fragment assembly technology in the synthetic biology is mostly based on the homologous recombination capability of saccharomyces cerevisiae cells, and particularly, a target large fragment is designed into a plurality of small fragments of 5-8kb in advance, the 5-8kb fragments are constructed by using traditional methods of enzyme digestion, enzyme ligation, in vitro homologous recombination and the like, the homologous arms of 70-100bp are contained between each fragment, all fragments and linearization vectors are transformed into the saccharomyces cerevisiae cells, the cells can recognize the homologous arms between the fragments, and recombination is carried out by utilizing a DNA fragmentation repair system in the living cells of the saccharomyces to form an annular complete plasmid.
For example, chinese patent No. CN201310389392.0 discloses a method for rapidly assembling yeast with multi-fragment DNA, which comprises the following steps: (1) Co-transforming a plurality of DNA molecules comprising homology arms with a linearized yeast shuttle vector to yeast; (2) Eluting all transformed yeast colonies on the whole screening culture plate, centrifuging, and discarding the supernatant to obtain transformed yeast cells; (3) Extracting plasmid DNA of the saccharomycete cells obtained in the step (2), and converting competent cells of the escherichia coli; (4) screening the escherichia coli clone to obtain large fragment DNA. The method of the invention has the advantages of high speed, simplicity, feasibility, high success rate, low cost, high efficiency and easy operation, can assemble a plurality of small-segment DNA molecules into one large-segment DNA molecule, is favorable for industrial scale expansion and has wide application.
It will be appreciated that only all small fragments are transformed into the same cell, and that the yeast can assemble the fragments in sequence. It is well known that Saccharomyces cerevisiae has a very low conversion efficiency, typically only up to 10 3 -10 4 CFU/. Mu.gDNA. In our experiments, it was found that one of the important reasons for the low transformation efficiency of Saccharomyces cerevisiae is that the fragment is excised by the defense system-nuclease system of the yeast cells after transformation into the yeast cells. Therefore, it is particularly important to solve this problem so as to greatly improve the conversion efficiency of large fragments in the Saccharomyces cerevisiae system.
The invention comprises the following steps:
in order to solve the technical problems, the invention provides a method for improving the conversion efficiency of a Saccharomyces cerevisiae DNA fragment, in particular to improving the conversion efficiency of a DNA large fragment in a Saccharomyces cerevisiae system.
The method has simple operation and high success rate, is suitable for large-scale production, and can convert the fragment with 0.1-1Mb from 10 in Saccharomyces cerevisiae 3 -10 4 CFU/. Mu.g DNA was increased to 10 4 -10 5 CFU/. Mu.gDNA, the transformation efficiency is improved by about 10 times.
The invention provides a method for improving the conversion efficiency of saccharomyces cerevisiae DNA fragments, which comprises the following steps:
(1) Synthesizing target DNA fragment: dividing the DNA fragment of 0.1-100kb or the macromolecular fragment to be synthesized of 20kb-1Mb into a plurality of fragments of 0.1-100kb for synthesis;
the DNA fragment may be linearized DNA or circularized plasmid DNA;
(2) Carrying out methylation modification on the DNA fragment or the split fragment obtained in the step (1);
further, the methylation modification is obtained by methyltransferase treatment;
still further, the methyltransferase may be at least one of the following methyltransferases: enzymes having methylation-modified DNA such as CpG methyltransferase, gpC methyltransferase, dam methyltransferase, dcm methyltransferase, ecoRI methyltransferase, hpaII methyltransferase, mspI methyltransferase, hhaI methyltransferase, taq I methyltransferase, aluI methyltransferase, bamHI methyltransferase, haeIII methyltransferase, dpnM methyltransferase, human DNA methyltransferase, ecoGII methyltransferase and the like;
preferably, the methyltransferase is a methyltransferase having a recognition sequence of only 2 or 4 bases, such as CpG methyltransferase, gpC methyltransferase, taq I methyltransferase, dpnM methyltransferase, etc.;
more preferably, the methyltransferase is selected from the group consisting of 2 enzyme combinations or 3 enzyme combinations or 4 enzyme combinations of CpG methyltransferase, gpC methyltransferase, taq I methyltransferase, dpnM methyltransferase;
more preferably, the methyltransferase is combined with CpG methyltransferase, gpC methyltransferase, taq I methyltransferase, dpnM methyltransferase in an enzyme activity ratio of 4:3:1:2;
further, the methylation modification system is as follows (50 μl reaction): adding 2-5. Mu.g of DNA fragment and 1-10U of methyltransferase, adding S-adenosylmethionine with a final concentration of 80-160. Mu.M, 5. Mu.L of 10 Xreaction buffer, ddH 2 O is added to 50 mu L, and the mixture is reacted in a water bath at 37-65 ℃ for 1-5h;
furthermore, the methylation modification system is a 50 mu L reaction system, and can be amplified or reduced in a same ratio in practical application; further, the reaction buffer consists of: 200mM Tris-Ac (pH 7.9), 500mM KAC,100mM MgAc2,100mM DTT,1000. Mu.g/ml Albumin (Albumin), the remainder being water;
(3) The DNA fragments after methylation modification or the DNA fragments after resolution after methylation modification and the linearized plasmid are transformed into yeast cells together for assembly or transformation;
further, the DNA fragment is designed with a homologous arm connected with a plasmid, and homologous arms capable of being sequentially connected into full-length fragments are arranged between the split fragments;
further, the length of the homology arm is 25-150bp, the length of the preferable homology arm is 50-100bp, and the length of the more preferable homology arm is 100bp;
further, the plasmid can be any one of yeast vector or yeast and escherichia coli shuttle vector, and can be a public general plasmid or an engineered plasmid: such as pGADT7, pGBKT7, pRS415, pPICZC, pYX212 and the like;
the transformation method can be various, such as PEG-LiAc method, electrotransformation method, etc., and it is understood that the method of the invention modifies the DNA to be transformed so that the DNA is not excised by nuclease system in living cells, thereby improving the transformation efficiency of DNA fragments, and the invention can be applied to any transformation method.
The beneficial effects are that:
1. according to the technical scheme provided by the invention, the quantity of the transformants growing on the plate after the transformation plating is obviously increased due to the improvement of the transformation efficiency and the same DNA consumption, so that the possibility of screening positive clones is greatly improved (under the condition that too few transformants exist, all transformants are always screened and positive clones cannot be found);
2. according to the technical scheme provided by the invention, as methylation modification protects the transformed nucleic acid from degradation, in a single cell, the nucleic acid molecule exists for a longer time, which is more beneficial for the living cells to assemble the fragments, namely the positive rate after transformation is greatly improved;
3. according to the technical scheme provided by the invention, the conversion efficiency is improved, so that the amount of DNA to be assembled is less, the material cost, the labor cost and the time cost of production are reduced, and the industrial production of large-segment DNA synthesis is facilitated.
Description of the drawings:
FIG. 1 unmodified sequence;
FIG. 2 shows the sequence modified by DpnM methyltransferase.
The specific embodiment is as follows:
the invention is described below by means of specific embodiments. The technical means used in the present invention are methods well known to those skilled in the art unless specifically stated. Further, the embodiments should be construed as illustrative, and not limiting the scope of the invention, which is defined solely by the claims. Various changes or modifications to the materials ingredients and amounts used in these embodiments will be apparent to those skilled in the art without departing from the spirit and scope of the invention.
When a DNA sequence is transformed into a yeast cell, the yeast cell nuclease system recognizes several bases and cleaves it. After modification of DNA, the site is protected and not recognized, thus playing a role in protection.
The sites recognized by one nuclease in the yeast nuclease system are assumed to be: CTGATCTC (grey mark in fig. 1) and treatment of this DNA with DpnM methyltransferase (recognizes GATC sequence) (fig. 2) will modify the GATC sequence into grey parts of fig. 2, and the modified sequence cannot be recognized by the yeast nuclease system, i.e. cannot be excised, i.e. protection is achieved.
It is noted that yeast nuclease systems contain a variety of nucleases, the recognition sites of which have not been reported so far, and therefore the relationship between the manner of methyltransferase modification and the protection against yeast nuclease systems is not clear.
The invention provides a method for improving the conversion efficiency of a DNA fragment in a host cell, which comprises the steps of introducing the DNA fragment to be converted into the host cell for conversion after methylation modification, so that the conversion efficiency is improved, the number of transformants growing on a plate after conversion coating is obviously improved, and the possibility of screening positive clones is greatly improved;
the DNA fragment may be linearized DNA or circularized plasmid DNA; the length of the DNA fragment can be from 100bp-5000bp or 5000bp-20000bp or 20000bp-100000bp or more than 100000bp;
the method is suitable for the DNA fragments of 0.1-100kb, and is also suitable for the DNA large fragments of more than 20kb or 100kb which need to be further assembled; for large DNA fragments larger than 20kb or 100kb, the large DNA fragments can be split into a plurality of DNA fragments of 0.1-100kb, and after methylation modification, transformation and assembly are completed in host cells;
in the present invention, 50. Mu.L of methylation modification system was used as follows: adding 2-5 muThe gDNA fragment was mixed with 1-10U methyltransferase, S-adenosylmethionine was added at a final concentration of 80-160. Mu.M, 5. Mu.L of 10 Xreaction buffer, ddH 2 O is added to 50 mu L, and the mixture is reacted in a water bath at 37-65 ℃ for 1-5h;
furthermore, the methylation modification system is a 50 mu L reaction system, and can be amplified or reduced in a same ratio in practical application;
in the invention, the types of DNA fragments in a methylation system can be one or more, if the large DNA fragment to be assembled is 160kb, the large DNA fragment can be split into about 32 fragments of 5000bp, and the 32 fragments are simultaneously added into the methylation system, and then introduced into a host cell for transformation and assembly after the methylation reaction is completed;
in the present invention, the 10 x reaction buffer composition used for methylation modification is: 200mM Tris-Ac (pH 7.9), 500mM KAC,100mM MgAc2,100mM DTT,1000. Mu.g/ml Albumin (Albumin);
in the present invention, methyltransferases used for methylation modification include, but are not limited to: enzymes having methylation-modified DNA such as CpG methyltransferase, gpC methyltransferase, dam methyltransferase, dcm methyltransferase, ecoRI methyltransferase, hpaII methyltransferase, mspI methyltransferase, hhaI methyltransferase, taq I methyltransferase, aluI methyltransferase, bamHI methyltransferase, haeIII methyltransferase, dpnM methyltransferase, human DNA methyltransferase, ecoGII methyltransferase and the like;
in the present invention, the methyltransferase used for methylation modification may be a combination of one, two, three or four or more of the above methyltransferases; preferably, it is a combination of two or more of CpG methyltransferase, gpC methyltransferase, taq I methyltransferase and DpnM methyltransferase; more preferably, the CpG methyltransferase, the GpC methyltransferase, the Taq I methyltransferase and the DpnM methyltransferase are used in combination in a ratio of 4:3:1:2;
in the invention, for the target DNA fragment of 0.1-100kb, methylation modification can be directly carried out, and then the target DNA fragment and linearized plasmid are transformed into a yeast host cell together; the DNA fragment is designed with a homology arm for connecting plasmids;
in the invention, for the target DNA fragment of more than 20kb or 100kb, after methylation modification, a plurality of split DNA fragments of 0.1-100kb are transformed into yeast host cells together with linearized plasmids for transformation and assembly to obtain the target DNA fragment of more than 20kb or 100 kb; the DNA fragments are provided with homologous arms which can be sequentially connected to obtain full-length fragments, and the first and the last fragments are designed with homologous arms connected with plasmids;
the length of the adopted homology arm is 25-150bp, the length of the preferable homology arm is 50-100bp, and the length of the more preferable homology arm is 100bp;
in the invention, the adopted plasmid can be any one of yeast vector or yeast and escherichia coli shuttle vector, and can be a public general plasmid or an altered plasmid, such as pGADT7, pGBKT7, pRS415, pPICZC, pYX212 and the like;
the transformation method adopted by the invention can be various, such as PEG-LiAc method, electric transformation method and the like, and it is understood that the method of the invention modifies the DNA to be transformed so that the DNA is not excised by a nuclease system in living cells, and the transformation efficiency of the DNA fragments is improved, thus the invention can be applied to any transformation method.
The invention will be further explained by means of specific embodiments.
Example 1A method for improving the conversion efficiency of DNA fragment Yeast
The embodiment fits into a DNA fragment (shown as SEQ ID NO. 1) of about 6000bp, homologous arms 1 and 2 connected with plasmids are arranged at two ends of the fragment, and then the fragment is obtained by a chemical synthesis method;
homology arm 1:atgtgctgcaaggcgattaagttgggtaacgccagggttttcccagtcacgacgttgtaa aacgacggccag
homology arm 2:ttcctgtgtgaaattgttatccgctcgccggcggctgctaacaaagcccgaaaggaagct gagttggctgctgcca
after methylation modification of the synthetic fragment, the synthetic fragment was introduced into yeast cells together with linearized plasmid pRS415 for transformation as follows:
(1) 42 μg of the above synthetic fragment was distributed on average into 21 tubes 1.5ml centrifuge tubes, each tube being numbered: 1-21;
(2) To each of the above tubes, 10 Xreaction buffer (200 mM Tris-Ac (pH 7.9), 500mM KAC,100mM MgAc2,100mM DTT,1000. Mu.g/ml Albumin (Albumin)) 5. Mu.L, ademetionine 160. Mu.M, and ddH were added 2 O is added to 47 mu L;
(3) Add 3. Mu.L ddH to tube 1 2 O;
Add 3. Mu.L of DpnM methyltransferase to tube 2;
add 3. Mu. LCpG I methyltransferase to tube 3;
add 3. Mu.L of GpC I methyltransferase to tube 4;
add 3. Mu.L of dam methyltransferase to tube 5;
3 μ LTaq I methyltransferase was added to tube 6;
1.5. Mu.L of DpnM methyltransferase, 1.5. Mu.L of CpG I methyltransferase were added to tube 7;
1.5. Mu.L CpG I methyltransferase, 1.5. Mu.L GpC I methyltransferase were added to tube 8;
1.5. Mu.L of DpnM methyltransferase, 1.5. Mu.L of dam methyltransferase were added to tube 9;
to tube 10, 1.5. Mu.L of DpnM methyltransferase, 1.5. Mu.L of Taq I methyltransferase were added;
to tube 11, 1. Mu.LCpG I methyltransferase, 1. Mu.L GpC I methyltransferase, 1. Mu.L DpnM methyltransferase were added;
to tube 12, 1. Mu.LCpG I methyltransferase, 1. Mu.L GpC I methyltransferase, 1. Mu.L Taq I methyltransferase were added;
to tube 13, 1. Mu.L of GpC I methyltransferase, 1. Mu.L of Taq I methyltransferase were added;
to tube 14, 1. Mu.L of DpnM methyltransferase, 1. Mu.L of Taq I methyltransferase were added;
to tube 15, cpGI methyltransferase 0.75. Mu.L, gpC I methyltransferase 0.75. Mu.L, taq I methyltransferase 0.75. Mu.L and DpnM methyltransferase 0.75. Mu.L were added;
to tube 16, cpGI methyltransferase 0.3. Mu.L, gpC I methyltransferase 0.6. Mu.L, taq I methyltransferase 0.9. Mu.L and DpnM methyltransferase 1.2. Mu.L were added;
to tube 17, cpGI methyltransferase 0.6. Mu.L, gpC I methyltransferase 0.3. Mu.L, taq I methyltransferase 1.2. Mu.L and DpnM methyltransferase 0.9. Mu.L were added;
to tube 18 was added CpGI methyltransferase 1.2. Mu.L, gpC I methyltransferase 0.9. Mu.L, taq I methyltransferase 0.3. Mu.L and DpnM methyltransferase 0.6. Mu.L;
to tube 19 was added CpGI methyltransferase 0.6. Mu.L, gpC I methyltransferase 0.6. Mu.L, taq I methyltransferase 0.6. Mu. L, dpnM methyltransferase 0.6. Mu. L, dam methyltransferase 0.6. Mu.L;
to tube 20 was added CpGI methyltransferase 0.5. Mu.L, gpC I methyltransferase 0.5. Mu.L, taq I methyltransferase 0.5. Mu. L, dpnM methyltransferase 0.5. Mu. L, dam methyltransferase 0.5. Mu. L, dcm methyltransferase 0.5. Mu.L;
to tube 21 was added CpGI methyltransferase 0.5. Mu.L, gpC I methyltransferase 0.5. Mu.L, taq I methyltransferase 0.5. Mu. L, dpnM methyltransferase 0.5. Mu. L, dam methyltransferase 0.5. Mu. L, bamHI methyltransferase 0.5. Mu.L;
the enzyme activity unit of each mu L of internal methyltransferase is 1U;
(4) Shaking and mixing uniformly, and placing in a 37 ℃ water bath for 1h; cleaning and recycling the modified DNA by using a PCR cleaning and recycling kit;
(5) 10ng of each recovered product is converted into Saccharomyces cerevisiae BY4741 BY using a PEG-LiAc method; the specific transformation method comprises the following steps: adding 50%PEG3350,1.0M LiAc,10mg/mL ssDNA derived from salmon monomer and 10ng (4) recovered product and 5ng linearized plasmid pRS415 into competent cells of Saccharomyces cerevisiae, mixing thoroughly, incubating at 30deg.C for 30min, adding DMSO with final concentration of 10%, blowing and mixing, heat-shock at 42deg.C for 25min, centrifuging to collect yeast cells, adding 5mM CaCl 2 Incubating at 25deg.C for 10min, adding YPDA culture medium, culturing at 30deg.C for 90min, diluting with 10 times gradient for 5 gradients, applying SD-leu defect culture medium, and culturing in a 30 deg.C incubator; after 48 hours of cultivation, transformant statistics were carried out, and the results are shown in Table 1 below.
Total number of transformants = colony number x dilution x total volume of stock solution of conversion reaction/volume of plating bacteria solution;
conversion efficiency = total number of transformants/amount of DNA added (μg).
TABLE 1
Tube count Total number of transformants Conversion efficiency (CFU/. Mu.g)
Tube 1 (4.3±0.42)×10 2 (4.3±0.42)×10 4
Tube 2 (3.2±0.26)×10 2 (3.2±0.26)×10 4
Tube 3 (6.4±0.52)×10 2 (6.4±0.52)×10 4
Tube 4 (7.4±0.50)×10 2 (7.4±0.50)×10 4
Tube 5 (4.2±0.34)×10 2 (4.2±0.34)×10 4
Tube 6 (3.1±0.47)×10 2 (3.1±0.37)×10 4
Tube 7 (7.0±0.62)×10 2 (7.0±0.62)×10 4
Tube 8 (7.2±0.52)×10 2 (7.2±0.52)×10 4
Tube 9 (1.9±0.14)×10 2 (1.9±0.14)×10 4
Tube 10 (4.2±0.54)×10 2 (4.2±0.54)×10 4
Tube 11 (7.2±0.87)×10 2 (7.2±0.87)×10 4
Tube 12 (3.7±0.62)×10 2 (3.7±0.62)×10 4
Tube 13 (4.9±0.38)×10 2 (4.9±0.38)×10 4
Tube 14 (4.3±0.45)×10 2 (4.3±0.45)×10 4
Tube 15 (9.0±0.86)×10 2 (9.0±0.86)×10 4
Tube 16 (6.2±0.88)×10 2 (6.2±0.88)×10 4
Tube 17 (4.3±0.24)×10 2 (4.3±0.24)×10 4
Tube 18 (3.4±0.46)×10 3 (3.4±0.46)×10 5
Tube 19 (1.0±0.12)×10 3 (1.0±0.12)×10 5
Tube 20 (7.6±0.64)×10 2 (7.6±0.64)×10 4
Tube 21 (8.1±0.42)×10 2 (8.1±0.42)×10 4
From the above experimental results, it was found that the transformation efficiency was improved to some extent as compared with the unmethylated control tube 1, but the transformation efficiency was changed irregularly, which was consistent with the current situation that the yeast nuclease system contained a plurality of nucleases, but the recognition sites were not clear. Among them, it was unexpectedly found that when CpGI methyltransferase, gpCI methyltransferase, taq I methyltransferase and DpnM methyltransferase were combined in a ratio of 4:3:1:2, the conversion efficiency was significantly higher than other experimental groups, and the conversion rate was improved by nearly ten times as compared to the blank group.
Example 2A method for improving the conversion efficiency of DNA Large fragment Yeast
The present example fits into a large 120Kb DNA fragment, the fragment is split into 11 fragments of 5-20Kb (respectively fragment 1, fragment 2, fragment 3, fragment 4, fragment 5, fragment 6 and fragment … … fragment 11, and homology arms are arranged on the fragments in a pairwise sequence, the sequences are shown in SEQ ID NO.1-11, homology arm 1 is connected to the 5 'end of fragment 1, homology arm 2 is connected to the 3' end of sequence 11, the homology arm is the same as in example 1), the 11 fragments are obtained by a chemical synthesis method, and after methylation modification is carried out on the 11 fragments, the 11 fragments and linearized plasmid pRS415 are introduced into yeast cells together for transformation and assembly, and the specific procedures are as follows: (1) 84 μg of the 11 fragment mixture was distributed on average into 21 tubes 1.5ml centrifuge tubes, each tube being numbered: 1-21; (2) To each of the above-mentioned tubes, 10 Xreaction buffer (same as in example 1) 5. Mu.L, ademetionine 160. Mu.M, and ddH were added 2 O is added to 47 mu L;
(3) Add 3. Mu.L ddH2O to tube 1;
add 3. Mu.L of DpnM methyltransferase to tube 2;
add 3. Mu.L CpG I methyltransferase to tube 3;
add 3. Mu.L of GpC I methyltransferase to tube 4;
add 3. Mu.L of dam methyltransferase to tube 5;
3 μ LTaq I methyltransferase was added to tube 6;
1.5. Mu.L of DpnM methyltransferase, 1.5. Mu.L of CpG I methyltransferase were added to tube 7;
1.5. Mu.L CpG I methyltransferase, 1.5. Mu.L GpC I methyltransferase were added to tube 8;
1.5. Mu.L of DpnM methyltransferase, 1.5. Mu.L of dam methyltransferase were added to tube 9;
to tube 10, 1.5. Mu.L of DpnM methyltransferase, 1.5. Mu.L of Taq I methyltransferase were added;
to tube 11, 1. Mu.LCpG I methyltransferase, 1. Mu.L GpC I methyltransferase, 1. Mu.L DpnM methyltransferase were added;
to tube 12, 1. Mu.LCpG I methyltransferase, 1. Mu.L GpC I methyltransferase, 1. Mu.L Taq I methyltransferase were added;
to tube 13, 1. Mu.L of GpC I methyltransferase, 1. Mu.L of Taq I methyltransferase were added;
to tube 14, 1. Mu.L of DpnM methyltransferase, 1. Mu.L of Taq I methyltransferase were added;
add 0.75. Mu.L of CpG I methyltransferase, 0.75. Mu.L of GpC I methyltransferase, 0.75. Mu.L of Taq I methyltransferase and 0.75. Mu.L of DpnM methyltransferase to tube 15;
to tube 16, 0.3. Mu.L of CpG I methyltransferase, 0.6. Mu.L of GpC I methyltransferase, 0.9. Mu.L of Taq I methyltransferase and 1.2. Mu.L of DpnM methyltransferase were added;
to tube 17, 0.6. Mu.L of CpG I methyltransferase, 0.3. Mu.L of GpC I methyltransferase, 1.2. Mu.L of Taq I methyltransferase and 0.9. Mu.L of DpnM methyltransferase were added;
to tube 18, 1.2. Mu.L of CpG I methyltransferase, 0.9. Mu.L of GpC I methyltransferase, 0.3. Mu.L of Taq I methyltransferase and 0.6. Mu.L of DpnM methyltransferase were added;
to tube 19, 0.6. Mu.L of CpG I methyltransferase, 0.6. Mu.L of GpC I methyltransferase, 0.6. Mu. L, dpnM methyltransferase, 0.6. Mu. L, dam methyltransferase and 0.6. Mu.L of Taq I methyltransferase were added;
to the tube 20, 0.5. Mu.L of CpG I methyltransferase, 0.5. Mu.L of GpC I methyltransferase, 0.5. Mu. L, dpnM methyltransferase, 0.5. Mu. L, dam methyltransferase, 0.5. Mu. L, dcm methyltransferase, 0.5. Mu.L of Taq I methyltransferase were added;
to tube 21, 0.5. Mu.L of CpG I methyltransferase, 0.5. Mu.L of GpC I methyltransferase, 0.5. Mu. L, dpnM methyltransferase, 0.5. Mu. L, dam methyltransferase, 0.5. Mu. L, bamHI methyltransferase and 0.5. Mu.L of Taq I methyltransferase were added;
the enzyme activity unit of each mu L of internal methyltransferase is 3U;
(4) Shaking and mixing uniformly, and placing in a 37 ℃ water bath for 1h; cleaning and recovering the modified DNA by using a PCR cleaning and recovering kit;
(5) 10ng of each recovered product was transformed into Saccharomyces cerevisiae BY4741 BY the PEG-LiAc method; the transformation method is the same as in example 1, SD-leu defect culture medium is coated, and the substrate is placed in a 30 ℃ incubator for culture; after 48 hours of cultivation, transformant statistics were carried out, and the results are shown in Table 2 below.
Total number of transformants = colony number x dilution x total volume of stock solution of conversion reaction/volume of plating bacteria solution;
conversion efficiency = total number of transformants/amount of DNA added (μg).
TABLE 2
Tube count Total number of transformants Conversion efficiency (CFU/. Mu.g)
Tube 1 (5.0±0.48)×10 (5.0±0.48)×10 2
Tube 2 (3.3±0.42)×10 (3.3±0.42)×10 2
Tube 3 (7.2±0.59)×10 (7.2±0.59)×10 2
Tube 4 (1.0±0.13)×10 2 (1.0±0.13)×10 4
Tube 5 (1.2±0.15)×10 2 (1.2±0.15)×10 4
Tube 6 (2.4±0.33)×10 2 (2.4±0.33)×10 4
Tube 7 (6.4±0.56)×10 2 (6.4±0.56)×10 4
Tube 8 (6.8±0.67)×10 2 (6.8±0.67)×10 4
Tube 9 (2.2±0.14)×10 2 (2.2±0.14)×10 4
Tube 10 (3.4±0.24)×10 2 (3.4±0.24)×10 4
Tube 11 (7.4±0.48)×10 2 (7.4±0.48)×10 4
Tube 12 (6.2±0.52)×10 2 (6.2±0.52)×10 4
Tube 13 (5.0±0.44)×10 2 (5.0±0.44)×10 4
Tube 14 (3.8±0.37)×10 2 (3.8±0.37)×10 4
Tube 15 (8.2±0.34)×10 2 (8.2±0.34)×10 4
Tube 16 (3.0±0.18)×10 2 (3.0±0.18)×10 4
Tube 17 (4.1±0.26)×10 2 (4.1±0.26)×10 4
Tube 18 (1.1±0.16)×10 3 (1.1±0.16)×10 5
Tube 19 (9.3±0.46)×10 2 (9.3±0.46)×10 4
Tube 20 (7.0±0.38)×10 2 (7.0±0.38)×10 4
Tube 21 (3.6±0.24)×10 2 (3.8±0.24)×10 4
From the above experimental results, it was found that the conversion efficiency was improved to some extent compared with the unmethylated blank tube 1 after the methylation treatment of the DNA fragment with different methyltransferases. Among them, it was unexpectedly found that when CpGI methyltransferase, gpC I methyltransferase, taq I methyltransferase and DpnM methyltransferase were combined in a ratio of 4:3:1:2, the conversion efficiency was significantly higher than other experimental groups, and the conversion rate was improved by a factor of ten times as compared to the blank group.
Example 3A method for improving the conversion efficiency of DNA Large fragment Yeast
Methylation of A, B, C, D different large fragments (4 fragments with sequences shown in SEQ ID NO.8-11, 5 'ends of sequences 8, 9, 10, 11 connected to homology arm 1, 3' ends of sequences 8, 9, 10, 11 connected to homology arm 2, and then synthesized) was performed using the optimal combination (18 tubes) in example 2:
homology arm 1:atgtgctgcaaggcgattaagttgggtaacgccagggttttcccagtcacgacgttgtaa aacgacggccag
homology arm 2:ttcctgtgtgaaattgttatccgctcgccggcggctgctaacaaagcccgaaaggaagct gagttggctgctgcca
the above fragments were subjected to methylation modification after chemical synthesis, the modification method and system were the same as in tube 18 of example 2, while unmethylated fragments were set as controls, the remaining Saccharomyces cerevisiae transformation conditions were the same as in example 2, and the number of transformants was as shown in Table 3 below:
TABLE 3 Table 3
A A-control B B-control
Number of transformants (2.2±0.32)×10 3 (3.7±0.30)×10 2 (4.0±0.27)×10 3 (6.2±0.62)×10 2
C C-control D D-control
Number of transformants (4.4±0.28)×10 3 (8.1±0.53)×10 2 (2.4±0.45)×10 3 (5.7±0.37)×10 2
From the above experimental results, it was found that when CpG I methyltransferase, gpC I methyltransferase, taq I methyltransferase and DpnM methyltransferase were combined in a ratio of 4:3:1:2, the conversion efficiency was improved 3-7 times as compared with the unmethylated blank.
The above examples merely represent a few embodiments of the present invention, which are described in more detail and are not to be construed as limiting the scope of the patent. It should be noted that, for a person skilled in the art, the above embodiments may also make several variations, combinations and improvements, without departing from the scope of the present patent. Therefore, the protection scope of the patent is subject to the claims.
SEQUENCE LISTING
<110> Beijing Engine biotechnology Co., ltd
Biological disaster prevention and control center for national forestry and grassland bureau
<120> a method for improving the conversion efficiency of Saccharomyces cerevisiae DNA fragment
<130> 1
<160> 11
<170> PatentIn version 3.5
<210> 1
<211> 6433
<212> DNA
<213> artificial sequence
<400> 1
tagctgggtt ttccttgttg ctattttaaa aggtgattca tggagaactg gagatatgga 60
gtgtgaatgg acatgagtga gataagcagt ggatgtgtgt ggcagtttct gaccagggtg 120
tctctgtgtt tgcaggtgtc cagtgtgagg tgcagctggt ggagtccggg ggaggcttag 180
ttcagcctgg ggggtccctg agactctcct gtgcagcctc tggattcacc ttcagtagct 240
actggatgca ctgggtccgc caagctccag ggaaggggct ggtgtgggtc tcacgtatta 300
atagtgatga ctccgtgaag ggccgattca ccatctccag agacaacgcc aagaacacgc 360
tgtatctgca aatgaacagt ctgagagccg aggacacggc tgtgtattac tgtgcaagag 420
acacagtgag gggaagtcaa tgtgagccca gacacaaacc tcgctgcagg ggcatctgag 480
accacgaggg ggtgtcctgg gccctgtgaa ctgggctgct ctccgtggca gcggctggtg 540
gtgctaaagg ctgattttct ctcagcatct ggggctgatt catcaagttt cctcagagaa 600
cctttcagat ttacaattct gtacttacgt ttaatgtctc tgaatgtgac actttccttc 660
cctggtgtgt ctttgttttt gtgacaagag gacacattct cacctccaca gaagcccgag 720
tgtcactttg gggacagaaa tgaccctgcc ctggtcacca gaatcagagt cccgaggaag 780
cccaggagga cctgggaagt gtttttcaat cagactcagg gcaggcgtct ccgtgggaat 840
ctctgattgg aacaggcttt gggattcaga ttgggaccaa gagggaggct cacccagggc 900
cagggtcctt agaatcctga cagttttcac agtaacccca tcgtccttta aaactgaaca 960
tctaactcag aactgaccca tttggtcctt tctctgtaat ccattttcct tttctctagg 1020
cttcattctt acacttccct tttcaccttc attctgaaaa tggagggtgt gcttcctgtg 1080
gtctaaacca caggcctcag atgcattacc tggaactcag gtgtggctct ggctatggct 1140
cctgtggacc tggcaggctg agggatcttt ctcattccct ggtgcctgca tgcccctgct 1200
gtcttctatg cgtggatgca tttgggaaat gcaagtggac actcatagtc gtttcctcaa 1260
atgggatact gttgtagagc tgatcttgtg cttctcaccc tgtcacagag cccccactct 1320
cacttgtgga tttttgggag agctgaggat ggacacttta ttgggctgtg agctctgcat 1380
gatggcaatc gtgaggtctg ggtgggcaca gccagagcca atggagctgg ccaaaaggaa 1440
agacaggacg gaattcctgg gaagtcctac agctgctgtc taccatagag ttccattgtc 1500
ttctcctctg ctaggattaa accagaccga ccaagttcat ctaggacaat cttcctgacc 1560
tagagaagtg atgacaggct ttaataacac ctgtattata catcagagca acacctagat 1620
tagtgtttga ttgaataatt gagactatgt cctagtcaag gtgacacaca aaatcaatta 1680
ttaccatgat taatatttta tattgattaa tattttaata ctaatattaa tcaatgcaat 1740
attgatttaa tattaattaa tattttatat ttgacaatag aatcagactc atgttataaa 1800
taattttgca aatatatgta tattattatt ggtcttctga gcataaatct ccattagccg 1860
attaagtgtg catgcattgg tcgaaggagg gcatccaccc ttttgaggga aaatgcatgg 1920
agggtaaatg aggctgggaa gctgatggca tatgatggaa gcctgtccct gagtgaagga 1980
gagagggagg caggattggg tggaactttc ctacatttct gtgctgtgca aagaaactcc 2040
agattatcac tgagtctcct gcaagtcaaa gttgcccctc aggaaacccc catcactccc 2100
agcaatgact ctgccccaga tgagtgcaag gctcactcta ttcctgagaa aggagcacag 2160
gatatgggat ttagcatgag ccacgtcatg gacgtcagag agcaggagct gggtgcatga 2220
tccaagtgca ctgttctgcc tgtaggtgga gagaggaagg tgcattccca gagctaacac 2280
actgtggatt tttatacaga atacacgttt tcacctgatt tattggaagg catatgaaaa 2340
aatgtgcagt ccacaaacaa ttgtaatttc caaatttacc acaattgcat tatttcttca 2400
ttctgcatga tgtcctggaa cagagttaca tttttcatgg gtattgtttc atggattgag 2460
gacatgaatt tcacacctgc ctttttagcc atgtgcagag accttgagta gagcatgtct 2520
ggacctcatc tacactatct ctcatgcccc aagggaaaga gtggagactt gactgactgc 2580
aaactttggg gtgggagctg ctcctgcaca cctcagaggc tgcagagacc ccaagtgcag 2640
ttttattgag ttggggtgtc tttctatgtg ggggaattat gggctgcccc actcctggtg 2700
attgctggtc agctctaaag acgggttcag aatgaggtca cctggtaacc tttctcactg 2760
cagctccatt atgtaaatta ccactaccag attccaggta ataatgcaag ctctattttt 2820
gcgacctcat tttcttatct tttttgttca taagtttggt tatagaaaat aagtcccttc 2880
cccttatgag cctgtaaatt cagaagcaag ttacaagcat tgggtaaaca cacccattac 2940
aaatgacaga aattgcccat aacaaaaagg ccccatgcaa gtccaaaatc cagcagggca 3000
gtcaaatcat agagctccac aatgatcacc tgtaactccc cttctcacat ccgggacatg 3060
ttgatgcaat aggtgagttc cgatggtctt gggcagctcc acccctgtgg ctttgcaggg 3120
tacagcctcc ctcctggctg ctttcacagg ctggtgttca gtgtctgtgg attttccagg 3180
cacaaagtgc aaactgtcag tggatctacc attccagggt ctggaggata gcagccctct 3240
tctcacagct ccactaggca gtgccccagt aaggactctg tgtgggagct ctggccctat 3300
atttcctttc ctccttgccc tagcagaggt tctccatgaa tgatccacca ctgcagccaa 3360
cttatgactg gacatccagg tgttttcata catcttctga aatctaggca gaggttctca 3420
aacaccaatt cttagcactc tgtcgataat ggttcatttt tttggcctgt tcattactgg 3480
tatttttcaa aggaatctca cttgaatctt tactcttttg cattttgtct ccatgacaat 3540
gttgggaagt tttacctcca ccatcataac atgatctagt gatctcacac atttgtggca 3600
aacaatacct acaaattcag aagctctttg cttttctttc catgaaatat aattctttct 3660
gttctgtgta taagcatatc ttagcaactc tgtgcacacc cacatagatg tccacaagcc 3720
tatgaattat tctctgtaaa taaaaattta tatcaatttc cctcaatgtt cataattctc 3780
ctgagggtga ggaagctcct tctcgatctg ttcaaacaaa atgcccagaa accatctggt 3840
aggtaaggag ttcacctggc tctggtgtgg ggtctgtctc tttccctctg ttgtcacaca 3900
ggtcagccca gttgttcagg tcctaagaag aaagcccagg tttgtcctga ttttaaaaca 3960
catcaaactt ctgatgactc tcctgttacc cacatccatg gagatagatt atttattata 4020
taattcacca aactaatgtc aaatgcccaa gttgcaatac cacacatcct agggtatgtt 4080
catgcaattc aatggaggag aaagtctttc agagacagat ggatctgaaa tgataaatat 4140
gtgggtaagg actctgggct tgagtatcat tgtccagcca tgtttcacaa gtgtgtcctg 4200
tcagggaagg acaagagttc cttgtgttct cagagggaag gggtcacaga gttcctctct 4260
ggttcccagg aaagataatc gcactaatct tcatgatctt catgagacta tcctccagtg 4320
ctgacctgtt atagagtttt tgtctgaagt tctcactgca atccccaatc tacatatttt 4380
caatcagaag tgtttagagg ccaggacata tcttcacggt cacacattga gaaggatgta 4440
gatatgtccc actaccttct cctgagatct cagacagaat cccagatttc aaaaggacac 4500
agaaggacag ctctcaggtg cttttaaaaa atgacccact tccagggaca gggagcttcc 4560
ctataaccat ggtggatgtt ctgaactaca ataaacattg gatggatcca ggattgtttg 4620
aagtcactgt cattattaca ttcagctgct gtttcaatgt gtctgaagta gtaaatgaca 4680
atttagatga caatttatat gaatcttcaa gggtagaaca atattgacca tattccaaaa 4740
tctgtccttg atccatgatc acactcatct cccagaccag gtccttcagc acgtctcttt 4800
acctgaaaga agaggactct gggcttggag aggggagacc ccaagaagac aactgagttc 4860
tcaaagggca cagccagcat cctactccca gggcgagccc aaaagactgg ggcctccctc 4920
ctcctttttc acctctccgt acaaaggcac cacccacatg caaatcctta cttaagcacc 4980
cacaggaaac caccacacat ttccttaaat tcaggttcca gctcacatgg gaaatacttt 5040
ctgagagtcc tggacctcct gtgcaagaac atgaaacacc tgtggttctt cctcctcctg 5100
gtggcagctc ccagatgtga gtgtctcagg gatccagaca tgggggtatg ggaggtgcct 5160
ctgatcccag ggctcactgt gggtctctct gttcacaggg gtcctgtccc aggtgcagct 5220
gcaggagtcg ggcccaggac tggtgaagcc ttcggagacc ctgtccctca cctgcactgt 5280
ctctggtggc tccgtcagca gtggtagtta ctactggagc tggatccggc agcccccagg 5340
gaagggactg gagtggattg ggtatatcta ttacagtggg agcaccaact acaacccctc 5400
cctcaagagt cgagtcacca tatcagtaga cacgtccaag aaccagttct ccctgaagct 5460
gagctctgtg accgctgcgg acacggccgt gtattactgt gcgagagaca cagtgagggg 5520
aggtgagtgt gagcccagac acaaacctcc ctgcatggac gcggagggga ccggcgcagg 5580
tgctgctcag gaccagcagg tggcgcgcgg ggcccccaga gcatgaggcc gggtcaggag 5640
caggtgcagg gagggcgggg cttcctcatc tgctcagtgg tctccgtcct cgccagcacc 5700
tcgctgtcac cagggctcct ctttctttat tatctgtggt tctgcttcct cacattcttg 5760
tgccaggaaa gaaacgagga agacaaattt tcgtctatag ttgaagcttc accaattact 5820
aggaacttgc ctacaagttc ctgcatgacc cattataact tatcgattaa aaaatatata 5880
ttctaatgct tctcaccatc tcttgatttg tatcatcaac tgaattgtac cctctttgaa 5940
attcatatga tgaaacctta aattcaatgg atctatattg gaattttaat gaaataatta 6000
aggttaaatg tggtcataat tgtaagaccc taatgcaata gacgtgttgt ctttataaga 6060
agaggaagag acaccagaga cctctcactt ttcacgtgca ggcagagaag aggccatgtg 6120
gagacatagt gcactagaag gtggcccagt gcaagccagg aagaagccgc gccaagaacc 6180
agccctgcca gcacactaat cttcaacatt cagactgcag aattttaaga aaatcaatat 6240
ttgttgttta agccacccac tcctgttgtc ttcttatgaa gatccagaca gactaatacc 6300
acataactct gttagtgctg tcccctggat ggagaattag cctcctgagg ctgggcacat 6360
ctctgattga gtctctaggt ttcttggatg accatttgga gttgatgcct gaaggtgaga 6420
agagacaaac cgg 6433
<210> 2
<211> 8048
<212> DNA
<213> artificial sequence
<400> 2
tctgattgag tctctaggtt tcttggatga ccatttggag ttgatgcctg aaggtgagaa 60
gagacaaacc ggcccagttg ttcacgtcct aacaagaaag cccaggtttg ttctgatttt 120
aaaacacttc aaacttctga tgactctcct gttacccaca tccatggaga tagattattt 180
attatataat tcaccaaact aatgtcaaat gtccaagttg caataccaca catcctaggg 240
tatgttcatg caattcaatg gaggagaaag tctttcagag acagatggat ctgaaatgat 300
aaatatgtgg gtaaggactc tggacttgag tgtcattgtc cagccatgtt tcacaagtgt 360
gtcctgtcag ggaaggatca gagttccttg tgctctcaga gggaaggggt cacagagttc 420
ctctctggtt cccaggaaag gtaatcgcac taatcttcat gatcttcatg agactatcct 480
ccagtgctga cctgttatag agtttttgtc tgaagttctc actgcaatcc ccaatctaca 540
tattttcaat cagaagtgtt tagaggccag gacacatctt caaggtcaca cattgagaag 600
gatgtagata tgtcccacta ccttctcctg agatctcaga cagaatccca gatttcaaaa 660
ggacacagaa ggacagctct caggtgcttt taaaaaatga cccacttcca gggacaggga 720
gcttccctat aaccatggtg gatgttctga actacaataa acattggatg gatccaggat 780
tgtttgaagt cactgtcatt attacattca gctgctgttt caatgtgtct gaagtagtaa 840
atgacaattt agatgacaat ttatatgaat cttcaagggt agaacaatat tgaccatatt 900
ccaaaatctg tccttgatcc atgatcacac tcatctccca gaccaggtcc ttcagcacgt 960
ctctttacct gaaagaagag gactctgggc ttggagaggg gagaccccaa gaagacaact 1020
gagttctcaa agggcacagc cagcatccta ctcccagggc gagcccaaaa gactggggcc 1080
tccctcctcc tttttcacct ctccatacaa aggcaccacc cacatgcaaa tcctcactta 1140
agcacccaca ggaaaccacc acacatttcc ttaaattcag gttccagctc acatgggaaa 1200
tactttctga gagtcctgga cctcctgtgc aagaacatga aacatctgtg gttcttcctt 1260
ctcctggtgg cagctcccag atgtgagtat ctcagggatc cagacatggg gatatgggag 1320
gtgcctctga tcccagggct cactgtgggt ctctctgttc acaggggtcc tgtcccaggt 1380
gcagctgcag gagtcgggcc caggactggt gaagccttcg gagaccctgt ccctcacctg 1440
cactgtctct ggtggctcca tcagtagtta ctactggagc tggatccggc agcccccagg 1500
gaagggactg gagtggattg ggtatatcta ttacagtggg agcaccaact acaacccctc 1560
cctcaagagt cgagtcacca tatcagtaga cacgtccaag aaccagttct ccctgaagct 1620
gagctctgtg accgctgcgg acacggccgt gtattactgt gcgagagaca cagtgagggg 1680
aggtgagtgt gagcccagac aaaaacctcc gtgcagggag gcggagggga ccggcgcagg 1740
tgctgctcag cgccagcagg gggcgcgcgg ggcccacaga gcaggaggcc cggtcaggag 1800
caggtgcagg gagggcgggg cttcctcatc tgctcagtgg tctccctcct cgccagcacc 1860
tcagctgtcc ccaggggtcc tctttcttta ttatctgtgg ttctgcttcc tcacattctt 1920
gtgccaagaa agaaatgagg aagacaaatt ttcgtctgta gttgaagttt caccaattac 1980
taggaacttt cctagaagtt cctgcatggc ccattatagc ttactgatta aaaaatatat 2040
attctaacgc ttctcagcat ctcttgattt gtgtcatcaa ctgaattgtg ccctctttga 2100
aattcatatg cagaaacctt aaattcaatt gatgtatatt ggaattttaa tgaaataatt 2160
aaggttaaat gtggtcataa gtgtaagact ctaattcaac agacgtgtcg tctttataag 2220
aagaggaaga gacaccagag acctctcact tttcacgtgc aggcagagaa gaggccatgt 2280
ggagacgtaa tgcactagaa ggtggcccag tgcaagccag gaagaagcct caccaagaac 2340
caaccctgcc agaacattga tcttcaacat tcagactgca gaattttaag aaaatcaata 2400
tttgttgttt aagccaccca ctcctgttgt cttcttatga agatccagac agactaatac 2460
cacataactc tgttagcgct gtcccctgga tgcagaatca gcccgctggg gctgggcaca 2520
tctctcagat ttccacataa agtaggcaaa aaatagtagt tctgatataa aaatttgtca 2580
tgtccctgtt ggccaatttc tgggcaaggt cttttaaaga agccctgggg gctttgtcac 2640
aaaagttgcc ttttatcatt tattaggaca taactgatga acaatgagta ccagttggat 2700
ggagactgac cactgaccat cttctgctgt ctcctaagta tgccacagaa aaccacacca 2760
acattactct atgtcttcaa ctttctaaat ttgcactgat tggtatttaa ggcaggccca 2820
gcgttgaata actcctttag tttttgcttc tctgggaaag gtcttatcta tcctggcctt 2880
ggtcttcaag tttcagcaat tctgggaagc caaggacgcc tctatctcct cctccatgct 2940
ctgcaactca cctgagaaca gctttctcat tggaatgtct tctgtttaag gaataagagt 3000
ccctgtttca ggcttgggtg cctgagtaca cctactggat ccagcccagg attggagaaa 3060
ctttccagaa cacatcacct gagaaatgac cagtcacact gttacacttt cacaatttcc 3120
gcttcctcat gagaaaatta aaattgcaga gactttttca taagcgttgt gccatgtcct 3180
ttcttgtttt cttgcctgtt catttatgtc agaccaggtg ccacatctat gtaatcaggt 3240
tagaatcctg cctccagtaa cacatgaaaa ggacctatgg ttgtactttt ggtctttgct 3300
ccaaagtgta aagattacaa aagtcatcac cctcattctt atgccaagag tcatctgcac 3360
aatctgatct tcaatacatt ttagaatcca tcaaatgaat gaaattccat tttttaaatt 3420
accaccccaa aaactagaga gatgggcatg tccagaatag cagttgatgg ttgcttaact 3480
ggaagagaag tttcagaagc cacaagctgt tgaaggcact tacgtggtta gcactataga 3540
cgtctgcaag acagatgtgg actagggtga aatgacagtt ccagagggcc gcactctcct 3600
cagtcttctg gaatttccct ctagaaatct ccagaatcta aaaaatacaa tccaaatatg 3660
tttcctatgg gtcataactg gggaagttta attactgaaa aatatatcag gagccttctc 3720
caaaagatcc tacagggaag aacttttcca gaacctcata ctatgtgaag ggaagaaaaa 3780
tctgcccatt ccagatccct ccccacttcc tccattatta tacaaatgag taagtttagc 3840
caatagggta agatgtaagc aaatagtcca gggaaactga agccacaaaa aggagtaaag 3900
atgaaaattc agcttttccc ctggagatgc ctggtcaagg tcacagccca gaaaaggaat 3960
ctgattgagt ctctaggttt cttggatgac catttggagt tgatgcctga aggtgagaag 4020
agacaaaccg gttattagaa gacatgtatc aaaaccaaac aaggtggtaa ggacagtttg 4080
aaaaaaaatt ccaaggctgc tgacacaccc agataactgg tggctgtagt tatgcctgct 4140
aagatttggg tgcatggggc ttggctttcg ttacctccct tggacttatt ttcccaaaca 4200
aagaaacctc cgggttaggg ggaccctatt tattccagtc acctggcatg atttgcagga 4260
taattgctca gaattaaaat attcgtccag atgtttatat agcccatgcc tgtgtttctt 4320
ctgagctgca gccagagatc attggttggt tcacagcgat aagcagagtt agtctaaaat 4380
ggaggcaaat acttaaaact tatttcttct ctcagttaat ggattctata gagaaaagta 4440
gctactcggc atgggaatgt aaaaaaatga gtaaactatg atcttattct gaactcatta 4500
acaacaaacc tgaaaaacca attgaagaga ctgtaattta aagacaagtg tatgatatgt 4560
tttgaaacat aatttttctc tctccagttc tgatttttgt cagaaactaa tcattatagg 4620
actgagtgat ttgcaaaata aactttagtc ttatggttgg tctgatcatt tgcataaagc 4680
gaagccataa taattaataa taattctgta ggaaaagcct gcaagcacga ggagcttcac 4740
agtctaacac tatgagcaca tgcatcctcc agcaactcac tgaatatttt caagtcagct 4800
ggttcttagc ttaaataaca tccagttggt atctgtccca ggaacactaa tatatggttc 4860
tctctgcagg cccctttctc cacagattaa gggttttttt ttttctctgt aatatcaact 4920
cagatatgtt gaatgctttt tccttattag tggtttttca ggtttgttgt taatgatttc 4980
agaataagat cattgtttac tcattttttt taaattcccg tgccgagtag ctacttttct 5040
ctatagaatc cattaactgg gagaaaaaat aacattttct tatgggtgaa caattaaata 5100
gtttgacata tatttatgta ctggtatata atgcagcttg aaatcaaggc atgcctcaat 5160
cataaaaatc atggctaaat tctcaaagaa ttgtgctgag tgaaagaagc taaggaatta 5220
agagtaaatt ttatataatt cattgtagaa atattagaag atgccactac cataaattaa 5280
aatgaagaag acttaaattt ttctgagaaa atggtgttgg gaatgatgcg gatgtgattt 5340
aagtttcaga ggaataagaa aaagatttag ggattaattt aattattcaa aagttgattg 5400
aagtgccgag tgaatggctg caaacatagc tctacatttt tcaaatcatt ccctataaat 5460
ttgaattaat tatttatttt tatacttgaa taaagcaata acaaagaaat aaatgaatat 5520
ttttgctaaa atggagcaat aaaaagactg atattgacag aagaaatatg actgacttct 5580
gaaaatacac acacatgagc cgtggttctc tctacatatt tagataaatt acagaaagtt 5640
gtcataactg atggggaatc ctgcagactt cactaggcat agtccacact gccctggagt 5700
tgtctcaggg gagctgcctc ctccagtggt tagagcacag gcccaggtaa taggactcat 5760
ttttttagat gtgtaatttt agacacactg cacaactgct gtgttctctg tgcaaattat 5820
ctcctgtaaa atgtaacatt gaaacctgcc ttaaatatat tgtgtaaata tgtaaaaata 5880
aaatcagatt gtgagagcta aatgctaatc aaggcgcaat cacgtaatat acaattatat 5940
tttcctgaat gatggaatta ataccaatct cccccaggac acttcatctg cacggagccc 6000
ggcctctcct cagatgtccc accccagagc ttgctatata gtcggggaca tgcaaatagg 6060
gccctccctc tgctgatgaa aaccagccca gctgaccctg cagctctggg agaggagccc 6120
agcactggga ttccgaggtg tttccattcg gtgatcagca ctgaacacag aggactcacc 6180
atggagtttt ggctgagctg ggttttcctt gttgctattt caaaaggtga ttcatggaga 6240
actagagata tcgagtgtga gtgaacacga gtgagagaaa cagtggatat gtgtggcagt 6300
ttctaaccaa tgtctctgtg tttgcaggtg tccagtgtga ggtgcagctg gtggagactg 6360
gaggaggctt gatccagcct ggggggtccc tgagactctc ctgtgcagcc tctgggttca 6420
ccgtcagtag caactacatg agctgggtcc gccaggctcc agggaagggg ctggagtggg 6480
tctcagttat ttatagcggt ggtagcacat actacgcaga ctccgtgaag ggccgattca 6540
ccatctccag agacaattcc aagaacacgc tgtatcttca aatgaacagc ctgagagccg 6600
aggacacggc cgtgtattac tgtgcgagag acacagtgag gggaggccat tgtgcgccca 6660
gacacaaacc tccctgcagg aacgctgggg aaatcagcgg cagggggcgc tcaggagcca 6720
ctgatcagag tcagccccgg aggcaggtgc agatggaggc tgatttcctg tcaggatgtg 6780
ggactctgtc ttcttctgac ggttccccag ggaacctctc taagtttagc attctgtgcc 6840
tatgaacgtc ttctctaagt atttgaaaga gattatttta atatgaagag cagttctcac 6900
tcgcacaaaa tgtggattga tgcttactgg gatgaaaagt cctcaaacat ggtcaccacg 6960
atcagagtct gagtgagctc agggcttcct gctgagtctc ctcctatcag accaaggaca 7020
gggacctcag tgaggttccc cgtctagaac agtctttatg gatactgatt gtgggcggca 7080
agccacccag gtgccgacgc aagagaccga ggacacgagc tgttccagta caataaaata 7140
taaaacaaga atagttatac cagatataga tcttagatat gattatatat gaatatcatt 7200
aatcattagt tggtagcaat tactctttat tccaatatta taataatcct cactctacaa 7260
tcataaccta ggaaaagcca ggccatacag agataggagc tgaggggaca tagtgagaag 7320
tgaccagaag acaagagtgc gagccttctg ttatgcctgg acagggcgac cagagggctc 7380
cttggtctag cagtaatgcc agcatctggg aagacgcctg ttgccaagcg gaccatggtc 7440
tagtggtaga ctcagtgtca aggaaaaaca cctgctactt agcagaccag gaaagggagt 7500
ctccctttcc ccggggagtt tagagaagac tctgctcctc cacctcctgt ggagggcctg 7560
atatcagtca gacccgcccg cacttatccg gaggcctaac agtctccctg tgatgctgtg 7620
cttcagtggc cacactccta gtcctccttc gtgttccatc ctgtacacct ggctctgcct 7680
tctagatagc agtagcaaat cagtgaaagt actaacagtc tctgataagc agaaataata 7740
ttgtaagctg tttctctcct tctcctctct ctctctgcct cagctgccag gcaggaaagg 7800
gtcccctgtc cagtggacac gtgacccatg tgaccttacc tatcattgga gatggctcac 7860
actccttacc ctgtcccttt gtcttatatc caattaatat cagcgcagcc tggcattcag 7920
ggccactact agtctccgca tcttggtggt agtggtcccc cgggcccagc tgtcttttct 7980
tttatctctt tgtcttgtgt ctttatttct atgctctctc gtctccgcac acggggagaa 8040
acccactg 8048
<210> 3
<211> 6000
<212> DNA
<213> artificial sequence
<400> 3
gtcttttctt ttatctcttt gtcttgtgtc tttatttcta tgctctctcg tctccgcaca 60
cggggagaaa cccactgacc ctgtggggct ggtccctaca ctgatcacag acaatagagg 120
gtaggccagg atcagtgtca tgtaggacat cacaggtttc acctctgaac cttttcctga 180
cactaaatat gcaaatcagc atcagcactg atctggtgat tcttttgttc ctaatccatt 240
taattccttt ttcagtcgtt gttttcattt ttccgtttgc ttttcctgct ttctgcaaaa 300
ggaagatttt tccctgtggt caaaattccg gacctcaagc cctttcctga cgctcaggtg 360
ggtctcaggc tgtggctgct gcagtcacgc gggagaggct ggtgggactt tcttcactcc 420
tcgtcactca gggccctcca ctgtgttgca tggagactta tctggaaatg caagttgcga 480
ctgagaactg aaggggacaa gcttgtttgg ttaacatggg atgtggatgt gtttctaatt 540
ttgttctgat aaactttcac agagtaactt tctgcactag tcatgtgagg aagaggatgt 600
gaacgttgtc agaataaaaa tagaacaact tgtgttataa tctttacagg tgaagctgga 660
gaaggtcatg aatagagggt tctcatgcac acatccctga taacaagaac taccataaaa 720
ttactctgca caaccacaac tttcaacaaa ggctaccaca acaataagag aattaatatt 780
gtgaggatat ctgccctgca actcccagta caatcttaaa ctgattccac ccttgttatt 840
aattcttcta cccccaggat aattgcctca gaacagctca tgtaagtcct ctcatttatc 900
ctttaaaaca acctttacca acctttacta acctgacttc ctttacctac ctaaatatgc 960
agaaatcact tccaaaacat tcctcttatc tctcttaata ttgtgattat tttattagtt 1020
aaaatgtatt tttgtttgtg tttgtctgtg tctctgtgtg tgtagcctgg gtgcacgcta 1080
tactagctgt tttcataatg acttatggtg aatgcctatt agtgctcaag aggccggcgt 1140
ctgagcactg gaggtcaatt gtgcaggtgt cacacacatc tgtgatgggc ccaataaaca 1200
gccctgggca caagcactca ggtgagcttc cctggtggac aatgcttcac acatgttgtc 1260
atgcatcact gctgggagaa ctggggacaa gagactcctc agggaaagaa cacctgggag 1320
ctcatgcctg gattctcagg gccttctccc tggtgccttt gctcatttta attcatattc 1380
attcgctata ataaagctgc acccatgaga ataacagctt ttcttgtgtc ctttatttgt 1440
actgattaag gggcctgagg atggtcttgg gacactcaac acaattacat cagagttgga 1500
gaatgctaga aagttcctga ttgctgacgc atggctagag gattttttat aacatcaaag 1560
gatgagaaag tgcgggataa aagacattca atgcccagat ggctacagaa tcatatggca 1620
tgagatggca tgtacactcc aataaggagc aggaagtgaa agtaatccat ggaatttaga 1680
aacaacagat acagcttcct aggagttgtt ccctgaaaaa ggaaaagtaa aaataatgag 1740
gaacaacccg aatatgcaat tcatatctgt gatagctaaa ataatagtaa aaagatgcac 1800
atccagccct gcagtgtgcc ctgtgagccg aggctgctag actcccattc accaccaaag 1860
ggtacagttg tccagaaaca acacacaaca cataagagtg acacagaagg tagcccagag 1920
tttgggctgg ggagaggaat ccactagaca actataacaa ggaggacagc atggaaagtg 1980
gcatcatttt gtgatatgat tgatatcatt aatttatatc atcggattgt gaacaaagtt 2040
agccaatgga ctatgagagt gactgacgta gcagcttctc tggcttcgtg tgctgcagat 2100
gagaggagaa tgtttgggag gatgctggac ccacagctca caacagacat atgacagacg 2160
tagctgatcc caacctcaag taggatgttc ttgatgagac agcttccctg gtgcactgga 2220
tatcagatcc tgtatagttt cagaacttgt gtagtttgtg ttgctttggc tttttctctg 2280
gatcttttag gtgctaatga aatgagagtg cttcacaaaa aaatggggca gacaaagggg 2340
atagttaagg gcctcctcca agcaatgtgg atatcataaa ataaatgtta aaaaatgaaa 2400
gaactaaaca agaaggtgct ggggttgaca caaaggactt aaaacagcac taccaggagt 2460
tgggtgcagc aatgggcatc ccttcaggtc catcagcaag tcagagacct aaacaaatat 2520
tccctattca tgttgaattg gcagagtgta aaaatctaga agacaaaatg gccaggaaaa 2580
atgtgcgatt ttgcatgggt gaaggtctgg caagttaatc atcttgagga ctgttgaggt 2640
cctgaggtgg tcctgccctt gcttgatgcc caggccgtag tccaactcac acgaaattgc 2700
cagggaatag acagtagttt ttttagtagt tcttgttatc aggcataagt gcatttgaat 2760
tttctcttca tggcctttcc tggcactatt tctcattttt tttaacacac atagtttcaa 2820
ctagatttat caccttcaca gggtcacaga gaagggtgga agaagggagg ccctgtatgg 2880
gtctcgaaga aacatggaaa agagtggaga gggacaatag cagggtgtaa ggaattattg 2940
agaccttact ctgcccctcc caggaggctc aggccagcct ttttctgcat ttgaggttct 3000
gggttataaa cgctgtagac tcctcccttc agggcagggt gacaactatg caaatgcaag 3060
tgggggcctc cccacttaaa cccagggctc ccctccacag tgagtctccc tcactgccca 3120
gctgggatct cagggcttca ttttctgtcc tccaccatca tggggtcaac cgccatcctc 3180
gccctcctcc tggctgttct ccaaggtcag tcctgccgag ggcttgaggt cacagaggag 3240
aacgggtgga aaggagcccc tgattcaaat tttgtgtctc ccccacagga gtctgtgccg 3300
aggtgcagct ggtgcagtct ggagcagagg tgaaaaagcc cggggagtct ctgaagatct 3360
cctgtaaggg ttctggatac agctttacca gctactggat cggctgggtg cgccagatgc 3420
ccgggaaagg cctggagtgg atggggatca tctatcctgg tgactctgat accagataca 3480
gcccgtcctt ccaaggccag gtcaccatct cagccgacaa gtccatcagc accgcctacc 3540
tgcagtggag cagcctgaag gcctcggaca ccgccatgta ttactgtgcg agacacacag 3600
tgagagaaac cagccccgag cccgtctaaa accctccaca ccgcaggtgc agaatgagct 3660
gctagagact cactccccag gggcctctct attcatccgg ggaggaaaca ctggctgttt 3720
gtgtcctcag gagcaagaac cagagaacaa tgtgggaggg ttcccagccc ctaaggcaac 3780
tgtatagggg acctgaccat gggaggtgga ttctctgacg gggctcttgt gtgttctaca 3840
aggttgttca tggtgtatat tagatggtta acatcaaaag gctgcctaac aggcacctct 3900
ccaatatgac agtattttaa ttagtgaaaa ttttacacag ttcatcattg cttgcttgcc 3960
ttcctccctc ctgtccactc tcactcactc cttcttttat tttctactta attttacaaa 4020
atcatttaac ccctttttga actattaata ggttatcttt gtttggtgat tgttttcctt 4080
tcaataatat gtactgaata attcatcttt gtgccaattc ataagtattc tggtgtaata 4140
aagacttctt tcataaaaat tggataaatt aaaataaaga taaattttta aaaacatacg 4200
atctatcaaa actgaaccat aaagaaataa aaactctggg ttgggtgtgt ttgctcattc 4260
ctgtaatccc agcactttgg gaggccatgg ccggtggatc acctgaggtc aggagttcga 4320
tatcggtctg gcaaacgtgg agaaacgctg tctccactaa aaatacaaaa attagctgga 4380
catggtggtg ctcgcctgta gtcccagata cttgggagcc taaggccaga gaagagattg 4440
aacccgggag gcagaggttg aaataagccg aaatctagcc actgcattcc agactgggca 4500
acagagtgag actccatccc gaaaaaaaaa actgaacaga cctatgagta aagagattga 4560
gtcagtgatt tttcaaacat ctcaaatcaa agaaaagtca agaacttcat ggcttcacta 4620
ctgaatttta tcaaaaattt aaaaaaaaac tagaatctct atacaaattt ccaacaaaat 4680
aaagaggaaa aaatacatgc aagcttattt tggaaggtcc tatttccaaa gcaaggaaaa 4740
gacactccaa ataaataaaa ttacaggcta atatccctga ttatctagtt tcaaaaactc 4800
tcaagggtgg tgagaaacca aattcaacag cacattaaca acagaattca ccatgatcag 4860
gtgtggttta tctctaggaa gcaatgaagt ttcaacctgc agaaataaat gtgatatatc 4920
aaatgaaaat attgaaggac taaaaccata tgcaccatat gtccatgtca atagatgcag 4980
aaagagcctc tgtcgaaatc cactacactc taatttttaa aaatctgaac atattatgca 5040
taaaacatat atacctcaac ataataaaga ccacatatca caagcccaca tctaacatca 5100
tacacagtga tgaaaatttt attttcctct aagactagaa actagacaag atgcttcact 5160
atcaccaata ttattaaaca cagcactggg tggtctagac aaaacaggcc agaagaaaaa 5220
aatagaagtc atccatatag taatgaataa atataaaata tatttttaca tattacatgc 5280
tcttatttat acaaagcctt aaacactcca ccaaaaaaga ttgaaactaa tgaagaaatt 5340
caataaagtt gcagaatcca aaatcaaact tacatttcaa gatggcaggt tagaggcatt 5400
gctagtatac ctcttccact tggaaggaca aactgatgtg gagagatgaa catcgcggac 5460
ttattttcaa gaagcaatgc aggaactgaa cagaaacact gaaataatct acaaactttc 5520
tgaaaaagca ggaagctgca gcctacactg tgagtcaggt gaagggctgg gagtccccag 5580
catttgaggg agaacagcca aaaatttcag ccagtggtcc caagttgaaa gtagctctca 5640
acaggggtgt gtaatataac ctagggttgg gacgaactcc cttggccagg gcctgggtag 5700
ggaagtgtta aaagtgggct ctgcaagttt aggagccatg ggtgcaggag ctggtgccct 5760
gctttgcagc agacaggaag gggcatggca tgaaacccat ggctgcagtc tccatgggga 5820
cagcctatga ctcctggcat ttgcaggtat tgatcacaga ttgactgaaa ctcatctcac 5880
tgctgccagt ggaacaccac gggagtggat caccaagtat gtgggaactg tgtggggcct 5940
accaccacct gccatactgt tgttgtaaaa cataagtaaa atatattatt catctatgta 6000
<210> 4
<211> 9783
<212> DNA
<213> artificial sequence
<400> 4
tcaccaagta tgtgggaact gtgtggggcc taccaccacc tgccatactg ttgttgtaaa 60
acataagtaa aatatattat tcatctatgt acatttccaa gctgtgtaga agaattttta 120
ataagaccca gagtaaaaaa agaatgcaaa tatgtagggg ccagccctac agggtctgtg 180
gatctttctc cccatgtgca gagatgagag atcgtagaaa taaaggcaca agacaaagag 240
atagaagaaa aaacagccgg acccagggga ccactaccac caagacacag actagaagtg 300
gccccgaatg cctggctccg ctgttattta ttggatacaa ggcaaaaggg gaagggtaag 360
gagtgtgagt catctgcaat gattgataag gtcatgtggg tcacgtgtcc accagacaga 420
gggcacttcc ctgtttggca gccgaggcgg agagagagag aggacagctt aggtcattat 480
ttattccatt ctcttctcag aaagatcaaa gactttaata ctttcactaa ttctgctact 540
gctatctaga gggcggagcc aggtgtacag agtggaacat gaaagtgaaa caggagtgtg 600
accgctgaag cacagcatca cagagagacg tttaggcctc tggagggctg cgggcaggtt 660
tgactgatgt caggccttcc acaagaggtg gtggagcaga gtcttctcta actcccccgg 720
ggaaagggag actccctttc caggtcttct aagtaatggg taatgggtgc cttcccaggc 780
actggcgcta ccgctagacc gaggagccct ctagtggccc tgtccgggcg tgacagaggc 840
tcacactcct gtcttctggt cacttctcac tgtgtccctt cagctcctat tgctgtatgg 900
cctggttttt cctaggttat aattgtagag caaggattat tataatgttg gaataaagag 960
taatgctaca gactgatgat taatgatatt catatataat catatctata acctatttct 1020
agtacaacta ttcttatttt acatattctc ttcattacac tggaacagct tgtgccctca 1080
gtctcttgcc tcagcacctg ggtggcttgc cgcccagaca aatattgtta agcttcttaa 1140
tagaaaaaca aattatggta aatgtgttca ctggaatact acccgtcatt tataataaat 1200
taatgcctga tacacagagc aacaaggtaa aatatctaag tatttatgtt gagtaaaata 1260
agctaaacaa ataagaatat atactatgta atttcatttt tataaattct gataaataaa 1320
aatgcatctg aagtaaaata atgaagataa gtagttgcct ggggtaatgg tagaagaagg 1380
gagggggaga ggaggaggaa tacagcagaa caaggggaaa tgttgagaag aattcacttg 1440
tccactttct tggtaatgat agcagttaca tcatttttat tagttgtaca ttttaaatat 1500
gtgaagttta ttatctttca attaagcctc ataaaatgtc ttacaagcaa acaaatggaa 1560
acttagacaa ggaaagagta atagaaagac agaaaaaata agttcaatgt cagaagtacc 1620
tgaaaattaa tgtgcctgga tcctagttct ctccatattt tcagaagagt gctggagggc 1680
agcaaaacca cacatgctct tattacggaa tgtgggttct gataaaaaca ctagacacat 1740
ccagctttgt cctggagttg gtttaggggg atgtcagaga cagtgatgaa gagcacaggg 1800
ccagataccg gggttcactc atcccagaca tgagctccga gacgcataca gagccccccc 1860
atgtgtgggt ttacttccac ttctgtaaat ggagaaaata ttgtctccta cagaacatag 1920
tttacatgaa tatttaaaat gaaatagggt gattagtgca aagtgtttat cacagcacaa 1980
tttcataata agacagcata ttttccaaat gcaatcattg ccagcaaact tctacagggc 2040
accgtcgtct tatctgggta cagcctactc ctcaagggtc ccaccctaga gcttgctata 2100
tagtaggaga tatgcaaata gggccctccc tctactgatg aaaaccaacc caaccctgac 2160
cctgcagctc tcagagaggt gccttagccc tggattccaa ggcatttcca cttggtgatc 2220
agcactgaac acagaggact caccatggag ttggggctgt gctgggtttt ccttgttgct 2280
attttagaag gtgattcatg gaaaactaga gagatttagt gtgtgtggat atgagtgaga 2340
gaaacagtgg atatgtgtgg cagtttctga ccttggtgtc tctttgtttg caggtgtcca 2400
gtgtgaggtg cagctggtgg agtctggggg aggcttggta cagcctggag ggtccctgag 2460
actctcctgt gcagcctctg gattcacctt cagtagttat gaaatgaact gggtccgcca 2520
ggctccaggg aaggggctgg agtgggtttc atacattagt agtagtggta gtaccatata 2580
ctacgcagac tctgtgaagg gccgattcac catctccaga gacaacgcca agaactcact 2640
gtatctgcaa atgaacagcc tgagagccga ggacacggct gtttattact gtgcgagaga 2700
cacagtgagg ggaagtcagt gtgagcccag acacaaacct ccctgcaggg gtcccttggg 2760
accaccaggg ggcgacagca cactgagcac ggggctgtct ccagggcagg tgcaggtgct 2820
gctgagggct ggcttcctgt gatggcctgg ggcggcctca ttgtcaaatt tccccgggga 2880
acttctccag atttacaatc ctgtactaat atttgatgtc tctaaatgca accttttttt 2940
tcctttttgt gtctgttttt tttttttaaa acaggaggac acatcctcac ctccacagaa 3000
gccacagtgt cactttgggg gcagaaataa tcctttcgtg gtcaacaggg tgagagtttt 3060
gaggaatccc agggaaacct ggggaatgtt ttccaattag actcagggca gagacctcca 3120
tgggaatctc tgattagaac aggctttgag ttctgatggg agccaagaga gaggctcacc 3180
cagggtcagg gttcttaaaa cctgatggtt ttcacagcaa tcccccttca tcttgtgaaa 3240
ctgggcacat ctgactcaga ctgattcagt tgaccctctt tctgctaatc cattttcctt 3300
cccagtagac ttgattctca cagatccctt tcttcttctc tttcctgaaa acagaggatg 3360
tgttttctgt agtctaaatt ccaaggctca ggtccgcagg agctgggtag gctgaggggt 3420
cttcctcact cactattgcc tggaaaatcc tgctgtcttc tgtgcatgga ggcatttgga 3480
aaatgaagca ggcattagtc atgaagggaa taatactagt tttctccaat gggatgttga 3540
tgtagagctg atcttatgct tctcacactg tcacaaagtt tggactctca cctgtgactt 3600
tgaggagagc ttgtgatacc ttatcttgtt ttaatatgaa tagactctcc cttagctcag 3660
gaagctggaa cagactccat ttggctcctt catttgtaag acatcaaggg ctcctcaccc 3720
acccccttcc tcaaggactt aacttgttta agctgactcc cagcatctca aagagtgcga 3780
ttaactgata aggtactgtg gcaagctgtg tccgcagttc ccaggaattt ggccaggtga 3840
tggtacccta aagcccctgc atttgtgtct ggcagataac acccagagcc cccacaccta 3900
tcatcttgtg atgaatttaa agcccctgca cctggaactg tttgcctgta accatttgtc 3960
ctttcaactt ttttgcctgt tttactcctg ttagaatgct acagttaggc tccccctccc 4020
ctctctaaac caaagtataa aagaaaatct agcaccttct tcggggctga gagaatttcc 4080
agcgttagcc atctctcagt tgccagctaa taaaggactc ctgaattcat ctcaaagtgc 4140
ggcgtttctc tctaactcgc tcggttacaa caagctgagg atggacactc cattgtgcag 4200
tgagctctgg gtgacagtaa tcgtagggtc tccccgggca gcctaaggtc aatactgctg 4260
gccgtcggga aagacaggct ggaattcctg ggaagacctg catctgccat ctaccacgga 4320
gtcctatggt cttctgttac actctctttg aatcagcacc acctagatta tctaaaacac 4380
tctttgtgac ttcatgactg ggaaaaaata atggcagtct ctactaacac ctgtgttaag 4440
ccatgggagc aacacctagg ctagtgtgtg attgagtagt tgagactgtg gtctagtcaa 4500
ggtgacacat aaaattgatt gttgccatta tgatatttta ttttatattt gacaatataa 4560
tcatgctcat attataaata tttctgttac attatttgtg tcagaggctt tggaaccaga 4620
acaacttcat cttgaataag gggtaggaaa aataagactg agacctgctg ggctacattc 4680
ccagtaagct aaggcgttct tagtcacagg atgagatagg aggtctgcac aagatccagg 4740
tcataaagac cttgctaata aagtttacag taaagaaact gggctgaagc ccaccaaaac 4800
caagatggtg acaaaagtga cctctgtttg tcctcattgc tcattatatg ctaagtatta 4860
tgctttaaca ttctaaaaga cactccccac agagccatga cagtttacaa atgccatagc 4920
aacatcagga agtttcccta gaaactaaag agtggaggaa acctcagctt tgggaattgc 4980
ccggggaatt catgaataat ttataaggaa aatgggtcaa attagccaat agggtaagaa 5040
tcctataatg ttccagccag aaaagggaaa gcatcaattt gacttctgga gactctgtat 5100
ataggtcaca ttccatagaa agaagagcac taaattatta agattcaatt gtaactacat 5160
atgatacttc cgggatgcac aaattaaaga tgaaagtgtg acaatgcaca ggctctgtca 5220
gagaaggaga acgtagggaa actgaaagag aatggcagag agtaagacaa ggacagaaga 5280
gcaatatgaa gcctctgaca tcaactctgt aaggacaagg tcttggcaac tcccgcattg 5340
accacagatt ctgttaccat gggccatttg cagggttcct agctaaggaa aaagaaaaaa 5400
aaaaaaactg catgcacttc ccaagtctcc acttgtatcc tgtttttctt agatctctag 5460
gacaaaaaaa atggcataaa cctagaccta gtgtcagtgt aggaggtact ttctttatag 5520
gcaagacact aggaagaagg gaaaatgtgt gttattggac taggagatac agagatggct 5580
ttaatctgaa tagatctaca cccgcaggta ttctcgaatg caacattcaa ctacaagagc 5640
ctaatgaaga aacacgaccc tccccaaacc cctgcaagct cttgtattca ctgtgtctcc 5700
actgatttag tgcacctgga gcttcagagc actggctccc tcctgtgtcc tagaatcttt 5760
tctttgggtc tttctgcaga attcaatagg attagttagg ctaatcaatt gtttcaagag 5820
atggtgttgg cagcatatat tgtcgctgga agagtattct aagccaggac acaggcactt 5880
tatgccggac taaagactct ggagaaaatg ttttgtgagc cctgacagaa acctccttga 5940
aaggtaaggg ctgggcagga ggggcactca ggagccacgc agcacacggt ccagccctag 6000
aacaggaggc tggggaggag gtttcctctc agggcctcgg ttttccttcg tcagaaaaaa 6060
aaaaatctaa aataaccgtt caacaagttg ctgatatgcc ttcaaatatc ctgctacaat 6120
ggaacaattc atataacttc aggcaatgag aactattttt taaatcgggc ttctaggatt 6180
ataagtatct taatagtgaa aatgtgaaga ataggtatga ttttactatt tcaatggata 6240
cagaattgtg ggagtcacta tattcctatg aacaaaaaat tcagatttca gtgttaagta 6300
atgttgccta cattgtgtga gtgacggggc agtggtggat ctgagagtgt ggtgggtgca 6360
cggacataat gattcagaaa gcaatatgga aagatgagta tctatggata cgaactgaaa 6420
gtatgtaaat acttcacaaa atactaataa acggagttga atataaaacc cataattatc 6480
caaaacacaa atttcttgga agttattttg ggaacatgat ttcataaaga actccaaact 6540
cttgtttcaa cttctgactc ctcgtttctg tgatataaga aaaccatttc caattatgca 6600
tctcagggca attctgtaaa cccagagcat ttctgctgaa gatcctgggg aatcaagaca 6660
ccgggcaggt gatggagaca ctgtctcagg tgcgcccaac gaatctcaga ggaacctgct 6720
ggagagtcac gtggaacatc tacagtcagt ttctcagagt caacagtgag ctgtgttggt 6780
gcctgagggg accatgatgg ggccaaggca cgtgctcagt gtcgtggaca gtgatggtcc 6840
agaaatgatc tagatggtct tgacgctaat gaaatatggg ttcagagtga ggagcataat 6900
ctgtggggac ttgttcttca gtgaaaggat cctgtccgca aacagaaatg gagcaggaca 6960
tgcatttctt caagcaggat tagggcttgg accatcagca tcccactcct gtgtggcaga 7020
tgggacatct atcttctttc tcaacctcga tcaggctttg aggtatgaaa taatctgtct 7080
catgaatatg caaataacct tagatctact gaggtaaata tggatacatc tgggccctga 7140
aagcatcatc caacaaccac atcccttctc tacagaagcc tctgagagga aagttcttca 7200
ccatggactg gacctggagg gtcttctgct tgctggctgt agctccaggt aaagggccaa 7260
ctggttccag ggctgaggaa gggatttttt ccagtttaga ggactgtcat tctctactgt 7320
gtcctctccg caggtgctca ctcccaggtg cagctggtgc agtctggggc tgaggtgaag 7380
aagcctgggg cctcagtgaa ggtttcctgc aaggcatctg gatacacctt caccagctac 7440
tatatgcact gggtgcgaca ggcccctgga caagggcttg agtggatggg aataatcaac 7500
cctagtggtg gtagcacaag ctacgcacag aagttccagg gcagagtcac catgaccagg 7560
gacacgtcca cgagcacagt ctacatggag ctgagcagcc tgagatctga ggacacggcc 7620
gtgtattact gtgcgagaga cacagtgtga gaaaccacat cctcagagtg tcagaaaccc 7680
tgagggagga gtcagctgtg ctgagctgag aaaatgacag gggttattca gtttaagact 7740
gtttagaaaa cgggttatat atttgagaac aaagaacaat agaaacacaa tcgaattgta 7800
agagaaatat tccattcaag agccaccaca taagccaaac tgacagagtg ggaaaggcca 7860
cactcagtaa agttgataca aacataccat aaaggtgcta ctatgaacaa gtttttgaat 7920
tagatgaata aatcatttgg agcaaggtta tttggtcata tgttaagagt aagcatgatt 7980
cttacaaagt gggaaaattg tctttcaaat gtttctgtca cttcttacca taaagttcat 8040
tttagaggtt ttaggattac agtgaaattg cacagaaggt gtgagaattc ccatgaatcc 8100
ctgccccgca cggacaccgc ctcctccact acagccatcc tgccccacag tcacaaataa 8160
gtcacaatgg atgaatctac aagaactctt ggttctttct ttttctggtg atcccctaat 8220
ataacaagcc taaattatct tggaacaccc aggtattttc aatggctttc tagaagtgat 8280
attagtcaga gggaaagtga gtgaggctat tactatttga gcactttctt ccaaaatcca 8340
caaaatatat gttaatttgg agtttttctt acttctggtt tacaatgtcc cttcccagag 8400
agtaagattt ttaagctttt agtgaggctg gaaaaaaaaa tttttaaaaa agagaaataa 8460
gctttcctgt attaggctga cttatcccag cggcagcaac aagcacagcc cagacccagg 8520
aaaagtctta ataatattat ctaatgtgct ctggagactc tttcagcact ccctcaacat 8580
agggagaaga aaaacaaatt ttcctttgtc ttatgatatg agtttataga ttcttgttct 8640
ctgtaactag taacttcaag tattctgttt tatctaagaa gcacaacgaa ggtcatgaga 8700
agcctgagca ggccagaact acagctgtct aggtaccgga gtgagtgtta tgagatcaac 8760
cagtgcaagg ctctttagaa caaaacctag ataacagaca tctgggttgc atagcaatgg 8820
tcatgtgtaa tcctgagtta tgaacctgtt acaatttgat taactgtctc tgtcctgcct 8880
ccgtatccct gcttttgtgc actctaagct tgcttcaagc tagcccaccc cattttggga 8940
agtgtgtata aaagtcaagc gctctctttg ttctgtgccc agtctttggt cattgagtct 9000
gctgggtctg ggtgtactca gtaataaaaa tatcctcctg tatacacccc aagatctctc 9060
tctggtcctc cagattctgc aacatttcag gcagattcac atctctaaaa ggccagcaag 9120
ttctggtcaa tcccataatg aaaatccttt aatgagactt ggcaaacgtg acaataagag 9180
actccttgta taatgcccta gagttggatt agacacactg tgagctcttg ggtggtggtt 9240
ctgaataagg cagtttgtgc agcaaatgca aacacatgca tgggatccag gcaggaacaa 9300
aagcttccct ttacaaagtg ggtgggcatc tggagggagc cctcagaggt gggcagtggt 9360
cgtccttgct gactgcacat tagccagagg cgtgaccata attggtcttg cagggaaaga 9420
gcaccactga ggtcataggt tatgaaaatg tttgtcatcc tccagtgagc aagtccatct 9480
gcttgcttgt gggtgtcaac tccatggtgg atacactctg ggagatgaca agatgcacac 9540
aaacctcctc tcactaatta tccactacca cacactcaag accaaccttt gctccagaaa 9600
ggaatacgtg tctgtggaaa tagacagagc ttaagtattt tgtaacctgg tgaacatact 9660
gtgcaaaacc aaacgtttca ggaagattag aatacaactc tgagtaatgc atgggttttt 9720
gttcatgaaa actctctctc cctaaatcct tgatggatat aacatgacta cataaattgg 9780
gct 9783
<210> 5
<211> 15885
<212> DNA
<213> artificial sequence
<400> 5
aatacaactc tgagtaatgc atgggttttt gttcatgaaa actctctctc cctaaatcct 60
tgatggatat aacatgacta cataaattgg gctgatcatt tttatgctat aaaattaata 120
gatgacactg cactccagcc tgcacaacca agcaagtctt catctgtaaa atctaaaaaa 180
gaaaaattag taggtactga cttcgaaatt tttgataata atattttcac cacccaaatt 240
taatcacacc cacatgttac ctgcatcttc actgaaaagt tcccagtcac gatgagttcc 300
ttcaatgctc catgtgttca aatctggaca tcaagagagt ccagagaata aaacacaatg 360
acggcagtga aactgatata tattcagcac ctcttaactc aggaggactc catacaccct 420
ggcacacagc tgcttttcta aatggctcac aatgactcca gctcactcac agagctcaga 480
cagaaacctc ccttcagggt gggagctggg tggcaggggg cactcagtac ccgcagaggt 540
gaaaatgagc tttcagatgg aacttccctg tcacctcaac atggaattta ttgtttcatt 600
tcattacctc tctttccata atggttcatt tcttttggcc tgttcattac tgatattttt 660
cagagcaatc tcacttgaat ctttactctt ttgcattttg tctccttgac aatgttggga 720
agttttacct ccagcatcat aacatgatct agtgatctga cacattgtgc aaacaatacc 780
tacaaattca gaacctcttt gtttttcttt ccacaaaata taattctttc tgttctgtgt 840
atgagcatgt cttagcaacc ctgtacacac ccacatagat gtctacaagc ctatgaattg 900
ttctctgtaa ataaaaattt atctcaaatt ccttcaatgt tcataattct gagagtgagg 960
aaggtccttc tcaatctgtt caaacaaaat gcccagagac catccagtag gtaaggagtt 1020
cacctggctc tggtgtgggg tctgtctctt tccctctgtt gtcccacagg tcagcccagt 1080
tgttcacgtc ctaacaagaa agcccaggtt tgtcctgatt ttaaaacaca tcaaacttct 1140
gatgactctc ctgttaccca catccatgga gatagattat ttattatata attcagcaaa 1200
ctaatgtcaa atgcccaagt tgcaataccg cacatcctag ggtatgttca tgcaattcaa 1260
tggaggagaa aatctttcag agacagatgg atctgaaatg ataaatatgt gggtaaggac 1320
tctgggcttg agtatcattg tccagccatg tttcacaagt gtgtcctgtc agggaaggat 1380
cagagttcct tgtgctgtca gagggaaggg gtcacagagt tcctctctgg ttcccaggaa 1440
aggtaatcgc actaatcttc atgatcttca tgagactatc ctccagtgct gacctgttac 1500
ggagtttttg tctgaagttc tcactgcaat ccccaatcta catattttca atcagaagtg 1560
tttagaggcc aggacacatc ttcaaggtca cacattgaga aggatggaga tatgtcccac 1620
taccttctcc tacgatctca gacagaatcc caaatttcaa aaggacacag aaggacagct 1680
ctcaggtgct tttaaaaaat gacccacttc cagggacagt gagcttccct gtaaccatgg 1740
tggatgttct gaactacaat aaacattgga tggatccagt attgtttgaa gtcactgtca 1800
ttattacatt cagctgttgt ttcaatgtgt ctgaaagggt aaatgactat ttagatggcc 1860
tgggtgtgtg gttggtttta tatgaatctt taagggttga acagtactga ccctattcca 1920
aaatctgtcc ttgatccagg atcacactca tctctcagac cagctccttc agcacatctc 1980
tttacctgga agaagaggac tctgggcttg gagaggggag gccccaagaa gagaactgag 2040
ttctcaaagg gcacagccag cattctcctc ccagggtgag ctcaaaagac tggcgcctct 2100
ctcatccctt ttcactgctc cgtacaaacg caccaccccc atgcaaatcc tcacttaggc 2160
gcccacagga agccaccaca catttcctta aattcaggtc caactcataa gggaaatgct 2220
ttctgagagt catggatctc atgtgcaaga aaatgaagca cctgtggttc ttcctcctgc 2280
tggtggcggc tcccagatgt gagtgtttct aggatgcaga catggagata tgggaggctg 2340
cctctgatcc cagggctcac tgtgggtttt tctgttcaca ggggtcctgt cccagctgca 2400
gctgcaggag tcgggcccag gactggtgaa gccttcggag accctgtccc tcacctgcac 2460
tgtctctggt ggctccatca gcagtagtag ttactactgg ggctggatcc gccagccccc 2520
agggaagggg ctggagtgga ttgggagtat ctattatagt gggagcacct actacaaccc 2580
gtccctcaag agtcgagtca ccatatccgt agacacgtcc aagaaccagt tctccctgaa 2640
gctgagctct gtgaccgccg cagacacggc tgtgtattac tgtgcgagac acacagtgag 2700
gggaggtgag tgtgagccca gacaaaaacc tccctgcagg gaggctgagg gggcgggcgc 2760
aggtgcagct cagggccagc agggggcgtg cggagctcac ggaatacaag gccgggtcag 2820
gagcaggtgc agggtgagcg gggcttgctc atcttctcag agatcatcat ctccctcctc 2880
gccagcacct cagctttccg tagaggtcct ctttctttat tgtgtgtggt tctacttcct 2940
cacatccttg tgccaggaaa gaaaggagta aggcaaattt tcctgttaca attgaagttt 3000
caccaattac taagaacttt cctgcaagta cctgcacagc ccattatacc ttatttatat 3060
atgtatatat tctaatgctt ctcaccatct cttgatttgt gtcatcaatt taattgtgcc 3120
ctttttgaaa ttcatatgct gaaactttaa atccaatgga tctatatcgg aattttaatg 3180
gtataattaa tgttaaatgt ggtcataaat gagaccctaa tgcaatagag ctgttgtctt 3240
tataagaaga ggaagagaca ccagatacct ctcacttctc acatgcactc agagaagagg 3300
ccacgtggag acatagtgca ctagaaggtg ggcctctgca agccaggaag aagccgcacc 3360
aagaaccaac cctgccagca ccttgatctt ctacattcag actgcagaat tgtaagaaaa 3420
tcaatatttg ttgtttaagc cacccactcc ttttgtcttc ttacgaagac ccagacaggc 3480
taataccaca caactctgtt agctccatct cctggaggga gaagcagccc cctgaagctg 3540
ggcacatcgc tcagattttc acatgaatta ggcaaaaaca gtagctctca tataaaaact 3600
gtcacgtccc tgttgggaca aggtctttta aacaagccct ggggctttgt cacaaatgtt 3660
gcattttatc ctttattagg acttaattaa ttgacaatga gtaccagctg gatggaaact 3720
gaccactgac catcttctgc tgtctcctta ttatatcaca gaaaaccaca gcaacattac 3780
tctatgtctt caactttcta aatttgtact gaatctattg ctaaatgagg agctacatgg 3840
ggtctgagtt ttgttacctt cttcccagtc ttccccaatt accaagcata gaagatactt 3900
tcagtgaaat ttagctgtca atgcccccaa caccacatca tgttttaagg tccaaggact 3960
ttctttgggg ggctattgaa aaacactttt gaatggaaaa tcctaaagca tacaacagct 4020
gaaagaatgg cccctgtgca catgaaggct gaaggggtgg atgatagggt acgttcctcc 4080
aaggtgttcc tgggcatgtg atggttggat acctcatgca tacgaaaaca aggactgaac 4140
tgagatagaa acaagggcca tatcctatag gaaaacaaga aaaaaagggg catccctgaa 4200
gagagttaat tagacttggt gggtttaaga tgaaattttt tccaaagttt tttaatgttc 4260
taagctaaat atgaatattt tcaataaact gcaaattaaa aggagaatga atttagaagt 4320
ttagacagaa aaaagccatg agggtgtaga aagcaaatcc acaaaaaact gtcacatggc 4380
agaggccaga atggagctga tgcagctaca tcattattct gcagacctag ttgagaccct 4440
ttggtgttta aggcaggtcc agcagtgaag aactccttta gtttttgtct gggaaagttc 4500
taatctaatc ttcatttctg aaaaggattt acatggtaaa gattcgtctt ggcagggttt 4560
tatttatttt tagtcccccc caacccacgt caatttgagt aaagtcatgt gttacgtaac 4620
aatgctgcag tcaacagtag accacatata tgtttgtggc cacataatgt tgtaatacca 4680
tattttcact aaactttttc catgttttga tgtgtttaga tagagagata cttaccatgg 4740
tgtttccatt tcctgcagtg ttcagtgcag taacacgctg tacaggttag ttgtctggga 4800
accctgcgct aatccacata ccctgtgtgt agtagggtat tccttgtagg tttgtgaaaa 4860
ttcacccttg atgcttgcac aatgatgaaa ttgcccaggg acacatttct caaaaagaac 4920
ctctgtcaat aaaccacaca ccattgaata caatccctgt gccttctgaa ctgaaaacct 4980
tctgctgaga aatctgttcc ttttctttct gttttctaaa gcctctttta tttccagcaa 5040
tgtggtttta aactgtgtca catcctgcat ttgcaacagc aggcatgact ctagaggacc 5100
atatgtgaag ttccacaggc aggcgcagaa gacaagtaat tcgcgatttc actgtagttt 5160
gtttaaatgg tagactcaca gaagtagagt ggagtgttgg ttaccagggg ctggaggggt 5220
ggactggaaa aggcgagatt ttggtcaaag ggtgaaatgt tccagttaga cagaaggaga 5280
aagttacagt gttctaatgc acaagatgga aaatatggct aataatgcat tatatatttc 5340
aaaatggcta aaaatgcaaa tgttaaaaat tttctcacta agaaatgata cagcctgggc 5400
gcggtggctc acacctgtaa tcccggcact ttgggaggct gaggcgggcg aatcacaagg 5460
tttggagttc aagaccagcc tggccagatg gcaaaacccc gtctccacta aaaacacaaa 5520
aaatagccag gtgtggtggc acgcggttgt aatgccagct actcgggagg ctgatgcagg 5580
agaattgctt gaacccggga gacagaggtt gcagtgagct gagatcatgc cactgcactc 5640
cagcctgggt gacagtgaga ctccaactca aaaaaaaaaa aaagaatgaa atgatagctg 5700
tgtgaggtga tggatatgtc aaatagcttg attaacaatt ccataatata tacatgtatc 5760
atagcatgtg tgaggtgacc tgtatgaagg tgaagtgacc tgggggctgt gtaagattac 5820
ctgggggtat gtgtgaggtg aacacggcac atatgaggtg accagagaca catgtgaggg 5880
gacatgggag atatgtcggg taagccaggg ggatttgtga ggtgacacag agcaagtgtg 5940
aggtgacgtg gatgcatgta aggtgatttg gggcacatgt gaggtgacgt gggggatatg 6000
tgtgagctga cctcgggtga gtgtgaggtg atccaggatg gatgtgaggt gacctggggg 6060
tgtgtgtgag gtgacttggg cacatgtcag gtgacatcgg ggattggtgg tgatgtgggt 6120
gtgtgtgagg tgacctaagg aggtatgagg tcacctgggg catctgtggg gacctggaaa 6180
tgtgtgaggt gggtacaggt gaagtcacct agggcacatg tgaggtgacc tgtggaatgt 6240
gtatggtgac ctgggaccgt ttgaggtgac atggggaatg cgtgaggtga cctggggcat 6300
gtgtgaggtg acaaagggca tgtgtgcagt gacctgggtg catgtgaggt gacctggtgc 6360
atgtgttagg ttacctgggg gatgtgtgag gtgaacaggg gcaggtgtga agtgcctgag 6420
gaatgtgtga ggtgaactgg tgcatgtatg tggtgacctg ggggatttac gaggtgacct 6480
gcgggatgtg tgaggtgacc ttgggtgagc gtgaggtgac ctgacgtatg tatgaggtga 6540
ccagggtgtg tctgaggtga tgtggggtat atgtgcggtg acctgggaaa tgtttgaggt 6600
gacctggtgg atgtgtgagg tgacctgcag gatgtgtaag gtgacactgg ggatgtgtga 6660
ggtgactcag ggcacatgtg aggtgaaatt ggggatgcgt gaggtgattc agggcatatg 6720
tgaggtgacc ttggggatgt gtgaggtcac cttgggtgag tgtgtggtca tctgggatat 6780
ttcccaactt cgtaaggaaa gcctgcaaca acacatggca ttagaagctt cacactctag 6840
caccattaat gcgtacatcc tcaaggaact ctaaatgttt tgaggttagc ttaaatgcca 6900
tggagcggca ccttccccag gcacactcat atatgcatcc tgggtctccc tgccaagcta 6960
tatctcctca aatttacagt ttatgtttgc tctgtgacat caactctgat atgttcaagt 7020
gtgttttttt ttctttattt gtagttgttc aggcttgttg tttcactctc attactctgg 7080
gctcagtcct ctcctcaggt gtcccacttc agagctcgct gtgaaatagg agacatgcaa 7140
atagggcccc ccctttcctg ataaaaagca gcccagtcct gaccctgcag ccctgggaga 7200
gaagcaccag ccctgggatt ctcaggtgtt tccactttgt catcagcaac aaacaaatta 7260
ccatggaatt tgggctgagc tgggtttttc ttgctgctat tttaaaaggt gattcatgaa 7320
gaactaagga tattgagtga gtggacatga gtgagagaaa cagtggattt gtgtggcagt 7380
ttctgaccag ggtgtctctg tgtttgcagg tgtccagtgt gaggtgcagc tggtggagtc 7440
tgggggaggc ttggtacagc ctgggggatc cctgagactc tcctgtgcag cctctggatt 7500
caccttcagt aacagtgaca tgaactgggt ccatcaggct ccaggaaagg ggctggagtg 7560
ggtatcgggt gttagttgga atggcagtag gacgcactat gcagactctg tgaagggccg 7620
attcatcatc tccagagaca attccaggaa caccctgtat ctgcaaacga atagcctgag 7680
ggccgaggac acggctgtgt attactgtgt gagaaacact gtgagaggtc ggaagtgtga 7740
gcccagacac aaacctcctg caggaacgtt gggggaaatc agctgcaggg ggcgctcagg 7800
acccactcat cagagtcaac cccagagcag gtgcacatgg aggctggggt ttgtttcctg 7860
tcaggatttg ggacttcctc tgcttctgac agtttctcta gggaaactct ttaattttag 7920
atttctgtgc ccaccaatgt catctctaca tttttttaat cattgtatat gaggactcgt 7980
tctcacatgc acaatatgta tattgccacc tatgggaatg aaaggtcctc aaccatggtc 8040
accagcatca gagtcgtgag gaagctcagg ggtgcctggt gagtcttctc cagtcagact 8100
caggacagta acctcaaggg gattcccttg tgagaactca cacattttca tgagaacagc 8160
accaggagtc agttctaaac cattcatgaa agacccactc catgacccag tcacctccca 8220
ccaggtcacc ccttcacaac tggggattat aatacaacat gagatttggg gcaggacaca 8280
aatccaaacc atatcagata cacattgtga aatacgcatg gtggtcaggt aatttgtatt 8340
tctatcacct cacagcgcta ccattttatt atttttttta atttactgca tgagtgagtg 8400
tctgatgaga acacctaaga tctacccttt cagcaacaat catttttaca atacagtatt 8460
aactatagga ccattgctgt acgttagatc tccagaactc atccaacctg cacaactgaa 8520
actctgtaca atttaacaca catcacccaa tttccctcac ctcccaggtc ctgggaccca 8580
ctattctact ctctgctttc aagagcttga atattttaga tcccacatgt aaatgagatc 8640
atgcagcatt tgtctttctg catctggctt attcaactca gcatcatgtc ctccaggccc 8700
atccgtgttg ttgcaaatgt cagaatttcc ctcttttcaa agccaaacaa aatacagatg 8760
tatgtataca cattttcttt atacattcat ccatttacag tcattaaatt attttacaaa 8820
tctacactat tattaataat cttgcaatga acatgtcttt ggcaaagtaa ttttatttcc 8880
tttgcatata taacaagaag tgggatcacc agattatatg atagctttat ttttaactta 8940
tcaagtaacc aatcctacca caggatattt ccctttcccc cacattcttg ccaacatttg 9000
tcatctgtta catttctgat aatagccata ttaactagtg tgagttgata ttgcattgtg 9060
cttctcattt gaattcctct gataattagg aatgttgaga actttttcgt tttctgtttg 9120
ccatgcatgt atcttctgaa aaaaattatc caggtttttg ccctttttta tcagttcatt 9180
tgttatttgc tgttgaggtg tatgggttat ttatacattt ggacagaact tcttgtcaga 9240
tccataattg cacatagttt ttcctgtgct ttgttattaa attcaaagaa atcagttccg 9300
aattaatggc aggaattttt tttgctccat gtcatttatg agtttatggc ttcaggtatt 9360
atgtccattt ttagttgatt tttttttttt ttgggagacg gagtcttgct ctgtagccca 9420
ggctggagtg cagtggcgcg atctcggctc actgcaagct ccgcctcccg agttcacgcc 9480
attctcctgc ctcagcctcc gagtagctgg gactacaggc gctggccacc gcgcctagct 9540
aatttttttt tgtattttta gtagagacgg ggtttcactg tggtctccat ctcctgacct 9600
cgtgatccgc ccgcctcggc ctcccaaagt gctgggatta caggcgtgag ccaccgcgcg 9660
cggcctatta gttgattttt ttatatggag ttagagaagg cctaatttta tttcttttgc 9720
atatgaatgc cagttttaca ccattattga aaagactgtc ctttctctac tgtgtgctct 9780
tggcacacaa aatcaggaga gacataatga caaaaaaaat atgagatcaa tagtcctgat 9840
gaacatagac ctgaaagtcc tcaacaaaat accatcaaat tgaatccaga agcactttaa 9900
aatgtgatac atcatggtca ggtgggcttt acccctggga tgcaaggctc gttcaatatc 9960
cacagtaact ctgattcaca atgtaaacag aataaaagca aaaagcatat gattatggac 10020
tacgtaaatt ggactgatcg tttttatgct gttaaattaa taggtgagtc tgcactccag 10080
cctgggcaac agaataatct tgtctgtaaa atacaaaaga aagataaatt aatagatact 10140
gactttgaca tttcggataa taatattttc ataaaccgaa tttaattata cccacattgt 10200
tacctacacc ttcactgaaa agttcctagt tatgttgagt tccatcaaca ctccacatgt 10260
tcaaatctgg acatccaaga gagtctagag aataaaacgc aatgagggca gtgaaacttg 10320
cgtatattca gcacctctta actcaggagg actcaataca ccctggaaca ctctgctttt 10380
ctgaatggct cacaatgact ccagctcact ctccaacctc ctcaaacatc tggcctctgt 10440
ttgccctaag ttcacgctct gctcttagtc tatgttctga agtctttgta gaggtgaaaa 10500
tgagctgtca gatggatctt ccttctcact gcaacatgga atttgctatt tcacttaatg 10560
accactcttt ccacaatggt tgatttcttt tggcctgttc attactggtg attttcaagg 10620
gaatctcagt tgaatcttta ctgttttgca ttttgtctcc atgacaatgt tgggaagttt 10680
ttcttctagc agcataacat gatctagtga cctgacacat ttgcagcaaa caatacctac 10740
aaattcagaa gctctttggt tttctttcca cgaaatataa ttcttgctct tctgtgtatg 10800
agcacatcct agcatccctg tacacaccca cgtagatgtc tacacgccga tgaaatattc 10860
cctgtaaata aaaaaagtat ctcagtttct ctcaatgttc ataattctcc tgagggtgag 10920
gaaggtactt ctgggtctgc tcaaacaaat ggcccagaga ccacctggta ggtaggtaag 10980
gagctcacct cgctctggat attgagtctg tctctttccc tctgtcgtct catagaaggc 11040
cagcccactt gttcagctcc taagaagaga gcccaggttt atccagatta tacaacacaa 11100
ccagcttctg atgactctcc tgttacaaca tccatggaga tattttgtgt attatataat 11160
tcaccaaact aatgtgaaat gcccaagttg caatactgca caccctaggg tatgttcttg 11220
caattcagcg gaggagaaat tctttcagag acagatggat ctgaattggt aaatatgtgg 11280
gtacgaattc tgggcttgag tgtcattgtc cagccatgtt tcacaggtgt gacctgtcag 11340
ggaagaacca gagttccttg ttctctcaga gggtagagct cacagaggtc ctctctggtt 11400
cccaggaaag gtaatttcac taatcttggt gatgagacta tcctccagtg ctgatgtact 11460
atagagtttt catctgaagc tgtcactgct atccccaatg tacatctttt cacacagaaa 11520
tgtttagagg tcaggccata ttctcagggt tacacattga gaaggatgga gatatattct 11580
actaccttct cctgagatct cacacacaat ctcaaatttc aaaaggtctc agaagggcag 11640
ctctcaggta ctatttaaaa ataacccact tcctgggaca ggtagcatcc ttctaaccat 11700
gatggatgtt ctgaactaca gtacacattg catggatcca ggtttgtctc aattcactgt 11760
gattattaca ctcagcagct gtttcaatat gtctgaaggg gtaaatgaca atttaggtga 11820
cctgggtgta tggttggtgt tatatgaatc tttaaatgta gaacagtatt aactgtattc 11880
caaaatctgt ctttgatcca tgatcacact tgtctcccag accagctcct tcagcacatt 11940
tcctacctgg aagaagagga ctctgggttt ggtgagggga ggccacagga agagaactga 12000
gttctcagag ggcacagcca gcatacacct cccagggtga gcccaaaaga ctggggcctc 12060
cctcatccct ttttacctat ccatacaaag gcaccaccca catgcaaatc ctcacttagg 12120
cacccacagg aaatgactac acatttcctt aaattcaggg tccagctcac atgggaagtg 12180
ctttctgaga gtcatggacc tcctgcacaa gaacatgaaa cacctgtggt tcttcctcct 12240
cctggtggca gctcccagat gtgagtgtct caggaatgcg gatatgaaga tatgagatgc 12300
tgcctctgat cccagggctc actgtgggtt tctctgttca caggggtcct gtcccaggtg 12360
cagctacagc agtggggcgc aggactgttg aagccttcgg agaccctgtc cctcacctgc 12420
gctgtctatg gtgggtcctt cagtggttac tactggagct ggatccgcca gcccccaggg 12480
aaggggctgg agtggattgg ggaaatcaat catagtggaa gcaccaacta caacccgtcc 12540
ctcaagagtc gagtcaccat atcagtagac acgtccaaga accagttctc cctgaagctg 12600
agctctgtga ccgccgcgga cacggctgtg tattactgtg cgagaggcac agtgagggga 12660
ggtgagtgtg agcccagaca aaaacctccc tgcaggtagg cagagggggc gggcgcaggt 12720
actgctcaag accagcaggt ggcgcgcggc gcccacagat cccgaggccg ggtccggagc 12780
aggtgcaagg agggcggggc ttcctcaaca gctcagtggt ctgtctcctc gccagcacct 12840
cagatgtccc caggactctc tttctttatt atctgtggtt ctgcttcctc acatccttgt 12900
ggcagggaag aaaggaggaa gacaattttt ctgtttactg ttgaggtttc accaattact 12960
agaaactttc ctacaagttc ctgcatgact cattttgcca tatatggatt ctcaccatct 13020
cttgatttgt ttcatcaatc gaattgtgcc ctatttgaaa ttaacttact gaaaccttaa 13080
atccaatgga tctatactgg aattttaatg atgtaattga ggttaaatgt ggtcaaagtg 13140
tgagacccta atgcaataaa ccgttgtctt tataagaaga ggaagagaca ccagagacct 13200
ctcacttttc acgtgcacac agagaagagg ccatgtggag acgtagtgca ctagaaggtg 13260
gccctgtgca agccaggaag aagccatact aagaaccaat tttgccagct ccttgatctt 13320
cgacattcag actgtagaat tgtaagaaaa tcaatatttg ttgtttaagc cacccaatcc 13380
tgttgtcttc ttacgaagac ccaaactgac taataccacc taactctgtt agctctgtct 13440
cctggaggga gaagcagccc cctgaggctg ggcactgcat ctctcagatt tccacgtgaa 13500
gtaggcaaaa atagtagttc tcatataaaa atgtgtcatg gctctgttgg ccatttttga 13560
gcatggtctc tgaaaccagc cctggtgtgt gtgtgtaaca aatgtccctt atcttttatt 13620
tggacataac aaatagacga taggtaccag ctggatggag attggccact gatcatcttc 13680
tgttctcctt agtatgtcac agaaaaccac accaacatca ccagcatcac tgtttttctt 13740
ctaccacctc aaacggacta cagaaatgat ccctgcagta tggtctattt cttcggcttt 13800
cgaaatttgc actgaatctc ttcctaaatg gggagctaca tggggtctga gttttgttcc 13860
tttcttccca gtcttcccca agtaccaagg acagaataga cttaaaattt ggccatcaat 13920
gcccccaaac accacatcac tttctaaaat ccacatcctg catccatcct tctctggaca 13980
cccctcatca ggctacccag gaatggccag aatctggtat cagcttatgg gctgaggcca 14040
cgagttatac acatgtgtga tttcagtcac acacactcta cttcaggacc acacctgtgt 14100
tctgaggagc tcaggcacct gctgatctca gtaattctct aataaatcac acacctctta 14160
ttaataaagg tccagatggc cccatcagct gcagagcagt ggagtaaagc tcatgggtgg 14220
gtccatcagg cagaagtcag acaatggaag ggatggctgg tcacttccat tttcactgat 14280
gttcgcaatg aattttaatg gaataaaaca atattaacta caaagagaca gaaaagaaga 14340
ctggctcgtc aagatattta aggacaagga actttatttg gggggaaagt gaaagacact 14400
tctaaatgga aatccctaaa gcacatacag cagctgacag agtggccact gtgcacatga 14460
gggctgagga gatggatggt agagtccatt cctgcaacat gtctctgggt gtgtgatggt 14520
tagactcctc atgcatatga atataagagc tggactcatg gagaaaaaaa gggccatatc 14580
ccataggaaa ggaagaaaaa aaaacagatg ggcacccctg gagagagctc attagatttg 14640
gtgtgtttaa gatgaaaatt tcttccaaaa attgtaatgt tctaagctaa atatgaatct 14700
cttcaataaa ctgagaatag gagaatggag ctaggacttg agagaggaaa caaatcttga 14760
gagagcagaa agcaaatcca caaaaactgt cacatgacag aggtcagaat ggagctgatg 14820
tagctacttc actattctgc agacttattg accatgtgga gaaggggctt gaacaaatgg 14880
agacgttctc caaccttctg aatcagcctc cttcttatgc atgagtagaa atcatagttc 14940
tgggggtgac cttcccaatt ttcctgtcca tacctcttcc cccaggggta gagtgtcttc 15000
ccaaccacaa tggttccttc cacctcagcc tcccaaagtg ctggaattat aggcgtgagg 15060
caccgaattc aggtccaact cataagggaa atgctttctg agagtcatgg acctcctgtg 15120
caagaacatg aagcacctgt ggtttttcct cctgctggtg gcagctccca gatgtgagtg 15180
tctcagggat ccagacgtga agatatggga agtgcctctg atcccagggc tcaccgtggg 15240
tttttctgtt cacaggggtc ctgtcccagg tgcagctgca ggagtcgggc ccaggactgg 15300
tgaagccttc ggagaccctg tccctcacct gcactgtctc tggttactcc atcagcagtg 15360
gttactactg gggctggatc cggcagcccc cagggaaggg gctggagtgg attgggagta 15420
tctatcatag tgggagcacc tactacaacc cgtccctcaa gagtcgagtc accatatcag 15480
tagacacgtc caagaaccag ttctccctga agctgagctc tgtgaccgcc gcagacacgg 15540
ccgtgtatta ctgtgcgaga gacacagtga ggggaggtga gtgtgagccc agacacaaac 15600
ctccctgcag ggaggctgag gggaccggcg caggtgcagc tcacggccag cagggggcgc 15660
gcggagctca cggaatacaa ggccgggtca ggagcaggtg cagggtgagc ggggcttgct 15720
catcttctca gagatcatca tctccctcct cgccagcacc tcagctttcc gtagaggtcc 15780
tctttcttta ttgtctgtgg ttctacttcc tcacatcctt gtgccaggaa agaaaggagt 15840
aaggcaaatt ttcctgtaat cctgagtaag gtggccactt tgaca 15885
<210> 6
<211> 12329
<212> DNA
<213> artificial sequence
<400> 6
tgtggttcta cttcctcaca tccttgtgcc aggaaagaaa ggagtaaggc aaattttcct 60
gtaatcctga gtaaggtggc cactttgaca gtcttctcat gctgcctctg ccaccttctc 120
tgccagaaga taccatttca actttaacac agcatgatcg aaacatacaa ccaaacttct 180
ccccgatctg cggccactgg actgcccatc agcatgaaaa tttttatgta tttacttact 240
gtttttctta tcacccagat gattgggtca gcactttttg ctgtgtatct tcatagaagg 300
ttggacaagg taagatgaac cacaagcctt tattaactaa atttggggtc cttactaatt 360
cataggttgg ttctacccaa atgatggatg atggtagaaa ccaaatagaa gaatggtctt 420
gtggcataat gtttgttgcc tagtcaatga agtctcatat tcttgtctct ggttaggatc 480
ttgggatctg gagtcagact gcctgggttc aaatcttggc tctgcccata ccatctctgt 540
tatcctgggg caagtgcctc agtttccaca tctgagaaat ggggatggta ttggtgtcca 600
tttcatagat taagtgagtt tagccttgta aaaagcttag gagggggtct gatacatagt 660
aagcactatg tacgcactag ctataattat ttgctaaagt tctgctttaa aagtaagcta 720
tttttttatg gagacagctt ttttctttta aatttccagc taggcaagaa gagcgtcaat 780
ttgatctaaa atttcataat gcttcagatt aacatagaca tggataagtc ccagaatttg 840
cagtctttta gtaaaagtag cattttctgt gtaattcttc acaagcactg attgtagttg 900
caggatgctc agtctccctc tgagatgttt tacattttta aatggttaga cttgcaggaa 960
caaaagagca gagtaactta gtaggctgtt ttgcattctt aggaaaagaa aaccatcagg 1020
acttattttg ttttcatgta ttttttcact tccactgagg agtataattg gctggtgttg 1080
acaaaatacc aatcatagat gtaaaggaga aagttgatta gttttctggc tgttcctaaa 1140
attctggatg caggaactgt ggctagaaag catctggatg attgcacttt atcagggata 1200
cttgagtgtc ctctcttagg atctggacct agaattaatg tcatgagatt tttctaacag 1260
gataaggtga ggtagtgagg gctgaagtca tccactgggt tatccaaata ttaggtttca 1320
ctgctgacaa aagagggggc ttctggtctg gttggttatt tgtgtttggc ctgatgtgct 1380
ctgtcaatca aatgtatgga cataggccta gcttctaaag gggcaatagt gacctcagtg 1440
gactgatatt taccgtacta tttacatgtg ctcttaatta cagcagaagc tgccagctaa 1500
ctgaatcttg ttttgaatct aaaaaatcta ctcttaaagc aagaaaatgg tataaaatta 1560
gttgataatg caagtgaatt ctgtacattt aattattcta agacattgga aaataaaata 1620
tcttgttact ttgaggataa aagatgattt ctttaaaaat gcaaatgttt tctacaaata 1680
ctaaagttaa aagggagaga gatgtaatta gaactcgtta actgacacat tgcaaattaa 1740
cttcttttta taaagcactg catcacaaac actaaaatga agtgggcaaa ttagctctgc 1800
agaaaactat tttctaggct gatgtttata atgaccaatc attactgaag caatgagaaa 1860
tgtgacaatt acagaatatt gctgctatag tatgttgaaa aaatatgcat tttgtagtga 1920
acatttagta gaatagctct gatttctacc tggagtttct gataacatga catcttaatt 1980
gctgtctttt atagattttt aaactgcaaa tacaaaatag caatcagcca atataataac 2040
ttattattct ccatttatgc ctgaaagtcc tcctcttgtt gatgccgtgg aaatgaatgt 2100
agaggcagat atcattagct gtattctcct tccgaatgac atttatcata tccttgttat 2160
tccaaaatag atagaagatg aaaggaatct tcatgaagat tttgtattca tgaaaacgat 2220
acagagatgc aacacaggag aaagatcctt atccttactg aactgtgagg agattaaaag 2280
ccagtttgaa ggctttgtga aggtaagcag cttaattact ggtaaaagtg tcattgaaat 2340
attttactac atttgctaga tcgggaaact gacaatgcca atgtttaaag attggttata 2400
gacacagaca cacagacaca cacacacata tatatgcatg cagatataca cacatacatg 2460
ggtgtgtgtg tgggggttaa aaaaaaaaaa cacaaagaca ctctctgggg aaaatacacc 2520
cttaggggca cagtcacaca tatttgtcag cttacatatg cagctaccac taggcaaaat 2580
gatgaagtcc accaagcttg gtttttgcat tgctgtgtct ccccatccaa accttgatgc 2640
tctcgcactg gggacccaga gtctgatccc catttcccag ggaagcaata gccgtcaaca 2700
gctgccgtgg cagcaggcca caagtgaagg gacacctgaa gactggtaac agtctctggt 2760
gcttctctga tgatggaatt ttaggtgtcc tgacagtgag atctttccct tttactgggg 2820
agagaggtgc agggaataag taatagacat tctcagtgtc gctcaaacca gactccatat 2880
aatatcactt gctcatgaag cccgcccact ctatggccgg tcatgaccag aggcacagag 2940
ggttcaaagc cttttagccc accaggctgg tagctagcat gaagtcactg cagtgactgt 3000
ggcttataac agatacctaa aacaagaatt tttagaacct ttacattaat tccatcatca 3060
cagacatagg gtctaggggc tctttctcct gaggcagaac atcaagagtt ctttctgcct 3120
atgtcccttt cagaacactg agtcaaatac ccttgggcct cggctcactt aggggtcatt 3180
tctaggaggc agcactccac attgaggaca gttctgggcc aggtgggtgg gtatctgggt 3240
aaaccaacag gaattagttc tcacatatag atgatgtgta atttaatgca ggcgtaaaag 3300
ggttaagatc ttatttctga tcttatttct gccctcctgt actgtcaccg aggtgccatt 3360
taattcatta gtgaagactc taacagctta ttcctgagtc acctacggag aacagaatgt 3420
ggctcaaatc cgctgcttgc tttcaggttc tttacactaa tctaggcttt agatgaaact 3480
cctaaaccct ttctttgcaa gactggccag ctaggaaaat gatttgagtt tcttcggttc 3540
ttcgaggatt tgggccagta ttacagagta ttggaagatg ttaccagttt aaatgtgaat 3600
aaaggcactt tcaaaacaat ggctaataat ccaaataaca gactgaatgt gcttggctat 3660
gtgactttgg gtaaataact tcacctttct gggcctcagt tttgtcatct ataacatgag 3720
aagacagatt atctgtaagg gcactatcag ctctgacatt ctacaattat gtgataagcc 3780
ttcagttccc tccaatggca gtgagagtgg cttgtcagtc cccctcgttt cttacggaga 3840
cttttacggt tgaattgtca attcctcacg tcattatttc aggttggcta tgtatgtaaa 3900
gctcccaaaa tcagctaccg aggataggag taaagaaaac agtcagtttg gcctccctgc 3960
ttatgcttgt atgaaaaaag tgacagctcc aaagtttcat attcttaaaa ggcagatctt 4020
ctcaggcatg tcagccaggg ccccagggat ctcctcctta catgcaacta aggaggctcc 4080
ttgtctctac tgcagcaggt gtggaaccct agtcaacacc acctatacct aggattacgt 4140
acaatgagta gatacaaagt cctccagcta cccaatcctc ccccaatgac ggatcccctt 4200
tccaatacgc tttcccccaa atttctcacc ctaaaacaaa attcgagact ttgaaaaaac 4260
tcaataggac aattatagaa tagctccaga ttagattcat attttcttag ctaatgttag 4320
taggctttct ttccgggcca cagtctggct gcacctaagc aacctcaagt ttgaatttgg 4380
agtctttgaa tcaggtcttg atggggtctt agaagtcatc agatccaatt ctcaatccac 4440
aacttcagtc ttctctccac ctcctgacta agtggtcatc caatctctgt ttgaacatct 4500
ctagtgacaa ggaactcatt atctctggag gcaggtagca ctaatctgtc attttggggg 4560
aaagatggta ttcagggctc aagtgagggt aagcagaggt attattttga atagtataat 4620
ttcatattaa aacttacaac ccaccacacc tctgctagat gttcagttcc atgattattt 4680
gcccaccaat gcctgcgatg cctttgagag agccaaagca tttctatttc aagttaaagg 4740
gcaacctgtc catacctgcc acatggaact cccactaaga gagaaataac ccattctgga 4800
ttttctgaaa gtccacttta aaaagtattt cagttgaggt ggggagtgaa gcaagaaaaa 4860
aaaaaggctc tggggagtgt ggttgggcga aagttcacgg aaaggctagg ctgggctcat 4920
gaaacacgag ctttgctgac ttcatgtttt catcttggcc aggcctcaac accaatgcaa 4980
caacttagcc taaaagtatc tcaaccttga tcaccacact ctactttttg aaaagacact 5040
aaatagtcat ttgtttactt gtgatctcac aaacattttc ctgtcaccac atcttcatag 5100
tgccgcgctt cagctcaaat ggaaagttga agctctgggg cccatgtgag tgttctgagg 5160
ctcaggttcc cctggaggct ctatgaacta cgcccttaaa tctggcaact gagctgggcc 5220
tacagccagc actcaacagt gacagcacaa attccttctg gaggaggaaa taaaaggaag 5280
ggtcctatag acaactgatt ccaggagtgg gaaggagcac aggactttga ttatcataag 5340
atgtgaaaat actactgtct tcttcccttg tgtgcagagg atagacagat ggaattagct 5400
aagcccagcc tatgaatgcc atctcacagt ttccactctt ggtttaaacc tcagcttctt 5460
tgggtgacct cataatgacc agttaagccc tccaggcctt ttgttcagtc tctttaaaat 5520
ggcagcaaca gcctttatca tcttccaacc tgtgttgatg gaagttcctg ttagcttctt 5580
taaatacctc tagacttcct tcagtttata agtgaaaaga aaccttttaa gaagtgtcgc 5640
acttgccttt gaacatcaac accattggga gatggcctgt gtttccgaaa tgctgattat 5700
tctaagtaaa tacagtgcaa ctatcaataa gagaatctct tcagcccatt gaaagggata 5760
gcaaaattaa aaatgtctga gggtcttttc atagtctggc atttctcccc aaggtcaaac 5820
ttactattat cttttcctac aggatttcag accaaattta ttctaataga tacacaccat 5880
gctttatgtt taataatatt ccatatacca gttcccaggg tagaatcatc tccccattcg 5940
gcattatttg tcaatatctg tcaaagccaa ggaggttgag gtcataggaa gggtcaggat 6000
cacagcctct ggtctggaga gagcactgga atggagataa taaggcctgg attttacttc 6060
cagattctcc cctgggcttt ctgggttgtt ggctcatctg tcagatccat ggactcccaa 6120
ttggcatgat ggaattaatg acaggatctg agtctatatg ataatcctca ccagaaacag 6180
acaacagagt aatgacagat gcaaaacgaa tgataatttt aaaaccccac agcagagccc 6240
ctgtcaaaat gacctcttgc aatgcttctt attttaggat ataatgttaa acaaagagga 6300
gacgaagaaa gaaaacagct ttgaaatgca aaaaggtagg tttgctattt gctaatttct 6360
atgaatgcct aaaaactaaa aggaagcttt aggctgatca tattgaacaa cccagtgttg 6420
ttgcatcagg gaacttttag ccctggaaat aaaacaggaa cacaattgtc aaattgacac 6480
cttctctggt ccctgtgatt tggaaagact ttgtacatat atatttatga aaaaaggatg 6540
tgttccttta atgccgatga taccaaatct gaagaaatcc cattatgttc aataccttaa 6600
tagaagcaac catacagcct gataccacct acagtggaat aagaagacag gaaagtcatc 6660
atttggtaac agtggcattc atcactcatt gataacagtt tttcatgggg cacagtggcc 6720
ggtggagcct ctgggatcaa ggagtgacaa tgtcacagtg ttctattatt tgcccggttc 6780
ttaaagtgag agcatcctga acatctcagg gttggaagag aacttgagag ttctcaaatc 6840
cagcaccatc cccacaacaa aaatctcctt cacaataaca ctgaccgtcc agcctctgat 6900
caaacatgtc gagggatgag gcaccttcca cctcataagg cagcctgatc cgtctttgaa 6960
tggctctaat aataccaaga ttactatact actccagaga agtctttcct cctcaagtca 7020
aactttgttc ctataatctc cactcattgg tcccagttct gctctttgag gccctagtaa 7080
acaaagtata attgctctcc tacccagcag ctgtccagat atggaagaca gcaatcatgg 7140
tggccaagcc ttgactgagc tttttcttct ccaggctaaa gatccctgat gtcttccact 7200
gtttctccta tgaccctttc caggaccttt cttctgccac tcacctcctt ttcttggaca 7260
cactaacgtt ttcctgttct tttagaatgt ggcatcgcaa accaatacaa taatgcgtga 7320
agtgacttca gcagcagatt atgggaaaga cggggtgttg ttagagagaa ttttatatca 7380
caaagttggt gaacatgatg ttatggcttc tgcaaattta atacacacaa aaacatacat 7440
acatacaggg atagagatac tattttctga ggcaaagaga gtactcagac cttgccttaa 7500
ctgttgttct ggatactaaa tggtcatccg acttccatga aggttttatc ttcagaatga 7560
ctgcaagata tgttgagtaa tagtaccacg ctgtctgtta attacagaga aatctgagga 7620
aacagtttat gtagatgctg cctagaagtc ttcagggaaa tgataatatt aaccaaactg 7680
gtcatttagg tcatgcaatt taactcaaca tttatagggc acttacaaag tgcccaatat 7740
caggctcata actggacaaa aagaaacttc cacacagtct ctgcccttag aagattgaca 7800
catctcatta gggagcaggg ctttaacaca agaaataatt aaagacagat acaatagttc 7860
agccagttgc ttgaccaatt cagaaaccat aagaatctta ctaagtgtgc agactttgga 7920
gcccactaaa atccccagtg tatggagttg ttcctaaaag caagattcac ggtatgttta 7980
atgaagacca gtgtttttag cctgtgtcaa tctatgcaaa atggaatcga gtattgatca 8040
actgttagga gaatgagacc gatggaaaca gccaattcaa ttactcagat attagaaacc 8100
aacttttcct tcagtgggag agatgtcaga ccattttatc tttcctttta tataatctat 8160
ttttgcacag tctctattac acagttgtag aactggacca gatagttttg tgggcagttt 8220
ttgcattatt ttagcctgac agtttttggt tccatttcag gtgatcagaa tcctcaaatt 8280
gcggcacatg tcataagtga ggccagcagt aaaacaacat ctggtaagtc acacagcatc 8340
tgagcggtag ccacccaagg ggaaaggctg ggatgccgaa gtcatgttac ctaatggtta 8400
aactcctctt ttcccctggg acccaattta caaacctacc cctacacttc tcctattccc 8460
ttctttgtct tcaaagtgag ttcaaatgca cagatgggac ttagagggac aaaaggaggt 8520
ggaatgcaat ctggatgttc tcattatgtt cttgctcaat ggctgattct aaatgatgaa 8580
ttactgggtg gagggaccat tgttctgaca acatagaaga aatggcatgt agtgacctcc 8640
tgactgggag catccctcct cctaacccat cttcactgtg tggaaatggg cctcatgggg 8700
tatttcctgc catctgtcaa tccctgtatg attaagctca gcctcactga ggccaacctc 8760
agggaaagta aaggtaaaat cattctgtaa agatcaatag gtcccaagac gttacatttt 8820
ccaatgaagt aacaacagac gacatattgt gatcttttca actctgaacg attttatttc 8880
catatacgtt ctgccaccat tctagccttt agatattttt tcccaaatgt gcatcttgcg 8940
ataactggtg ccaaagaata tgtcgtatct gataaatgga tggaaacatg cacgctaaca 9000
taaagtctcc catcaacata aaggcaagag cgtcagagga gtctttgaaa aattctacag 9060
agtgctccgg aatggagttc taagcagtgc atgtgtgtgt gcatatgtgt atgtgtgtga 9120
cagggagaga aagagagatg gacagagaga gaaaaaagac actgcttcat ctctgaagtg 9180
gcttgggctt ctcagtaggc gtaacacatg gacagttatc attatcatgg atcatggtac 9240
caaagtaaga gcactgaata gggagttttt gaacactggg attcaaggac catgaccact 9300
gcttgctggg tgaccttgag caagaccctt tacctatgca gcagttttct acttcaccta 9360
ctttacaggg tggctttgag catcaaatca gctaatgtgg ccgaaagtga tgctgtcgag 9420
tgctgtacaa ccgtaaggtg acactactta gtttacttca ccatggctta gatgtcaaaa 9480
gggtgacata aagcccctca ctaataccag ttagttacac aatatttaat aattttgtca 9540
agtacccctt ctctcttctg gatcagatga caacaacaga gaaatctcct agaagaatag 9600
cttcccactg gtcttttttt gcctgtatct aaacccttga tcttggatat atttcataga 9660
gctcagattc tcccaaaagg cttgtaatgg atatcagtcc tacaatatct tacagtctgc 9720
atcacaatag gtttccaggg gatcagatgg gaagacagta acattccacc cccaccccag 9780
tcccaaacct cttcttccta cctagccatg ctgctaaaat cttgccctac atcccacagc 9840
aagtactaaa attaggtaag gacgtaccaa agtaaactta ctgaactaaa agattgagaa 9900
cctgcccttt ttttctcaat aaaatggttc aaaagggcaa acattctaat gaagcattgt 9960
ttctggagtg gtctggaggg cccggatctg tcaggcattt caggatgcct ccctattagt 10020
aaagggcgag tcttaccagg tgggatcttg tgccctgata gacctaagac tatcgaatag 10080
gaattatttt ttaaaaagct caaggaagca aacacatcag tactttcact tttcctcaac 10140
cctcaccccc atcagtcagt ctagctttct gtgggagctg agatttcaag tcgggtgcac 10200
acactacttt gaacccactc aacatctcag ccgagaaaat ggcacactgt tggtgggtac 10260
tctggcttag ccacaagaat actggtactt tcaagttggt ggcgcccact acaatgggag 10320
atcaaaacat accgtgaaat gagcacacag tttattttca tacttccttg cctaatttta 10380
gtccttgctg ggggaggcag atcaggtttg caacagcatg atcaggtagg aagaaatggg 10440
gtcttttctc tgtgctgagg ctgagctagg tagactgaca actctctgac tttgtaaaat 10500
tcaaggcaag caaggtattc atggtaatat tagcaaaaat ttggtccgag taatttggta 10560
tgtataattt atgatgtcaa attttgaaat catttgtgcc ttcttaagtt caaggcaaat 10620
tggctataag aactctaacg agagaaagaa actcactgtg atctcttact ttatttaatc 10680
ttcacaagtc tctgaaatat gctccaatat gagccccgtg ttgcagatga ggaactgaag 10740
ctcatggaga tttagagact tgcccaagct taaatagagc ctagattgga acatggctct 10800
gtctgactct gaagcccatg gaaggggcct tgagaatcca tccctataca aagccaatat 10860
ccaacattaa actatatttt ttgtcagaat gtgaaccatg ctctgcttca cctcaccaca 10920
aactttccct ttctttgtaa cagtgttaca gtgggctgaa aaaggatact acaccatgag 10980
caacaacttg gtaaccctgg aaaatgggaa acagctgacc gttaaaagac aaggactcta 11040
ttatatctat gcccaagtca ccttctgttc caatcgggaa gcttcgagtc aagctccatt 11100
tatagccagc ctctgcctaa agtcccccgg tagattcgag agaatcttac tcagagctgc 11160
aaatacccac agttccgcca aaccttgcgg gcaacaatcc attcacttgg gaggagtatt 11220
tgaattgcaa ccaggtgctt cggtgtttgt caatgtgact gatccaagcc aagtgagcca 11280
tggcactggc ttcacgtcct ttggcttact caaactctga acagtgtcac cttgcaggct 11340
gtggtggagc tgacgctggg agtcttcata atacagcaca gcggttaagc ccaccccctg 11400
ttaactgcct atttataacc ctaggatcct ccttatggag aactatttat tatacactcc 11460
aaggcatgta gaactgtaat aagtgaatta caggtcacat gaaaccaaaa cgggccctgc 11520
tccataagag cttatatatc tgaagcagca accccactga tgcagacatc cagagagtcc 11580
tatgaaaaga caaggccatt atgcacaggt tgaattctga gtaaacagca gataacttgc 11640
caagttcagt tttgtttctt tgcgtgcagt gtctttccat ggataatgca tttgatttat 11700
cagtgaagat gcagaaggga aatggggagc ctcagctcac attcagttat ggttgactct 11760
gggttcctat ggccttgttg gagggggcca ggctctagaa cgtctaacac agtggagaac 11820
cgaaaccccc cccccccccc cgccaccctc tcggacagtt attcattctc tttcaatctc 11880
tctctctcca tctctctctt tcagtctctc tctctcaacc tctttcttcc aatctctctt 11940
tctcaatctc tctgtttccc tttgtcagtc tcttccctcc cccagtctct cttctcaatc 12000
cccctttcta acacacacac acacacacac acacacacac acacacacac acacacacac 12060
acagagtcag gccgttgcta gtcagttctc ttctttccac cctgtcccta tctctaccac 12120
tatagatgag ggtgaggagt agggagtgca gccctgagcc tgcccactcc tcattacgaa 12180
atgactgtat ttaaaggaaa tctattgtat ctacctgcag tctccattgt ttccagagtg 12240
aacttgtaat tatcttgtta tttatttttt gaataataaa gacctcttaa cattagtcca 12300
tacctcttcc cccaggggta gagtgtctt 12329
<210> 7
<211> 14861
<212> DNA
<213> artificial sequence
<400> 7
ttgtttccag agtgacttgt aattatcttg ttatttattt tttgaataat aaagacctct 60
taacattagt ccatacctct tcccccaggg gtagagtgtc ttcccaacca caatggttcc 120
ttccacctca gcctcccaaa gtgctggaat tataggcgtg aggcaccgca actggcctct 180
tcatttttat tcatatgttc cttcagcagc cactatgtct tcccactgat ttcttcagtt 240
tctgcctttt ccttttgaat aaggctgtta ctcctgaggg aagatgggag gtgggcctgg 300
acagggactt ggtgcattcc tctctcctgt cccagttctt attggtttct ccagtgtctg 360
tagaacagtg gttttggtgg ctttacctct gcagataatt tctcttgcaa tgtagtggtg 420
atggggaggt gtgtctggat gcatttcagc tatagttgct gttttgcttt cccagacagc 480
accatcccaa agggtagagg ctggagcatt ttgtgatgta tccccagtac tgaagaaaaa 540
ggcttcaata gcaggaggaa ttcctcaact gtatacactc tgagaattta aacaataact 600
tctctatcac actcaaattg aaaccatcca atgaatatgt ctactttaat cgtgtgctaa 660
cttaaatggc atttggcagc ctctgtccca gaaaagatta tcatctgctc ctgtttattt 720
ccctgcacgt ccttatctct cttcagattt cagatatatt gtttgtccta taacatcaaa 780
aatttgatgt atatgtgcta atttgcagat cagtaagttt agtagctgtt gtaagaataa 840
taacatattt ttatcgggtg cttacatctc caagctgaga agcacctcta tgtgtaatac 900
taagaaacta gaaatgatac aaatatcaag aagatacata gataaaaagt aatggcatgc 960
taatttactg taataacatc catcatgaga atcaatacat tgttgatgct caacatgttt 1020
gcatcataag tagttacgtg cgagaagcca cacaaataaa acacatacta tataattcct 1080
gtataataaa ttcttgaaac tcaaaactaa ggtattaaat gtgaaggact gactcagaat 1140
atggtggggg aagaaaataa ttgggaagga ggaattgtag agcaacacaa ggaaactttt 1200
aagtgtaatt tgttcattat ttggatggct tttggggatg cacaggtgag cacgagtgga 1260
attacattgt gttttgtttg tttctttttt ttttctgaag agatgtggtc cttctctgcg 1320
acccaggctg gagtgtagtg gtgggatcat agttcaatgt agcctctaac ttctggtctc 1380
caacaatact cctgcatctg ccatctaagt agctggaact accgttgtgt gccaggaggt 1440
ttggcttggc ttgagtactt attaaaccac acactttttg caatatttaa tgtatagtaa 1500
taatgtctca ataagactac tacaaatcaa tgaatgaata atttgttcag tacagattca 1560
tggaaaaata gacactaaca tgatgaatgt ctgacattta tgaaaataca actgcatgaa 1620
atgtgcttct ctttacattc attaggtaaa cacaatagtg catacacatc acaccgtgct 1680
ttcattacag gaagaaagat ctgaaaatgt cactggggtg aaccacattg tgctgggctt 1740
ggttcaggga gcagtcaggc ccagtgttgt gaccttcaac cacagaatcc ttaaaaaata 1800
ataaaagagg ctcccccaaa gtccccatca gttcccggac tcgctatgtt tctggatcgt 1860
atcagtgcat ccggagctcc ctggtggctt tagtgattcc ttgcttgcca tgctgaggtc 1920
tccctgtaga ttatgttggg ttttctgagg ccgttttgct atttaagacc cactccctgg 1980
cacagagcga ttccctctaa acctgataga ggttctgaac taaaaataac attaagtgaa 2040
tcctggtgtg tctgaactca agtgattgtt acattaagct gctgttgcaa tctgtttcct 2100
cacctgggaa aagaggagcc aggacatagt gagttgaggc cccaggaaga taactgaatt 2160
ctcagagggc acagccagca tcctcctccc agggagagtc taaaagactg gggcctccct 2220
catccctttt cacttctcca tacagaggca ccacccccat gcaaatctca cttaggcacc 2280
cacaggaaac caccacacat ttccttaaat tcagggtcca gctcacatgg gaaatacttt 2340
ctgagagtcc tggacctcct gtgcaagaac atgaaacacc tgtggttctt cctcctgctg 2400
gtggcagctc ccagatgtga gtgtctcaag gctgcagaca tggagatatg ggaggtgcct 2460
ctgagcccag ggctcactgt gggtctctct gttcacaggg gtcctgtccc agctgcagct 2520
gcaggagtcc ggctcaggac tggtgaagcc ttcacagacc ctgtccctca cctgcgctgt 2580
ctctggtggc tccatcagca gtggtggtta ctcctggagc tggatccggc agccaccagg 2640
gaagggcctg gagtggattg ggtacatcta tcatagtggg agcacctact acaacccgtc 2700
cctcaagagt cgagtcacca tatcagtaga caggtccaag aaccagttct ccctgaagct 2760
gagctctgtg accgccgcgg acacggccgt gtattactgt gccagagaca caatgagggg 2820
aggtgagtgt gagcccagac acaaacctcc ctgcagggag gcggaggggg cgggcgcagg 2880
tgctgctcag gaccagcagg gggcgcgcgg ggcccacaga gcatgaggcc gggtcaggag 2940
caggtgcagg gagggcgggg cttcctcatc agctcagtgc tctccctcct cgccagcacc 3000
tcagctgtcc ccaggactcc tctttcttta ttatctgtgg ttctgcttcc tcacatcctt 3060
gtggtaggaa aggaaggagg aaggcaaatt ttcctcttag agtcaaagtg tcactaatta 3120
ctaggaactt tcctacaagt tcctgaatgt cccatttttc cttcttaatt aaaaaaaata 3180
tatattctaa tacttctcac catctcttga tttgtgtcat cagttgaatt gtgctgtctt 3240
tgaaattcaa atgctgaaac cttaaatcca attgatctat attggaattt taaggatgga 3300
attaaggtta aatgtgatca taagtctgag attctaatgc aatagatctg ttgtctttat 3360
aagaagtgga agagtcacca gagacctctc acttttcccg tgcacgcaga gaagaggcca 3420
tgtggagaca tagtggacta gaaggtgcaa gccaagaaga agccgcacca agaaccaacc 3480
ctgccagcac ctggaccttg gacattcaga cttcagaatg gtgagaaaat caatgtttgt 3540
tgtttaagcc acccactcct gttgtcttct tatgaagacc cagacagact aataccacat 3600
aactctgtta gctcctggag ggagaagcag ctccctgagg ctgggcacat ctctcagata 3660
tccatatgaa gtaggcagaa atagtagttc tcatataaaa atgtgtcatg gccctgttgg 3720
ccattttttt ttgacggagt ctcgctctgt ggccaggctg gagtgcagtg gtgcaatctc 3780
agctgactac aaaccccgtc tcctgggttc aggcaattct ccttcctcaa cctcctgagc 3840
agctgggact acaggtgtcc accaccatgc cagcctattt tttttttgta tttttagcag 3900
agacagggtt tcactgtgtt agggtggtct caaatctcct cactgcatga tctgcctgcc 3960
tcggtttccc aaagtgttgg gattacaagt gtgagccacc gcacctggcc cctgttggcc 4020
attcttgggg caatgtctct gaaaccagcc ctggtgcctg taccacaaaa ttttctttta 4080
tctttcatgt ggatataaca aatggacaat gaggaccagc tgcatggaga ctgaccactg 4140
aacatcttct gctgtctcct aagtaagtca caggaaaaca caccaacatc accaacataa 4200
ctgttttcct tcaacttcct cgaacgaact atagaaatga tcccttaaag tatagtctat 4260
tccttcaact ttctcaattt gcactgaatc ccttcctaaa ttaggagcta catagggtct 4320
gagttttgtt ccctttctcc cagtcttccc caagtatcaa ggacagaata gatttaaatt 4380
aaatttggcc gtccatgccc ccaacaccac atcggtttct aacatccttg tcatgtaccc 4440
atccttctgt gggcacccca catcaggttg cccaggaatg gccagaaggt gtcatcacct 4500
tatgggctga ggctacaagt tatacacacg tgtgatttca gtcacacaca ctctactgca 4560
ggacacacct gtgttctgag gaactcaggc gcctgctgat ctcaggtctt ctctaataaa 4620
ttacacacct cttatgaatg aaggtccaga tggccccatc agctgcagag cagtggatta 4680
aagctcatgg gtgggtcagt caggcagaag tcagacaatg ggataataga agtcagacaa 4740
tagtcactga ggttcccaat gtattttaat ggaataaaac aatattagca acaaaggggc 4800
agtaaagaag actggctctt gaaggtattt aaggaccagg aactttattt ggggggaaag 4860
tgagagacac ttttacatgg aaagccctaa agcacataca gcagctgaca gagtggccac 4920
tgtgcacatg agggctgagg agacggatga taggttacgt tgctccaaga tgtccctggg 4980
tgtgtgatgg ttggactcct catgcatatg aaaataagag ctggactcag ggagaaacaa 5040
gggccatacc ccatagaaaa agacaagtct taactgcttc cagctgagga gggatgctgt 5100
ttggggaaga tctctcttgg aggtctaagg gaccccagga aaagggagcc attatcccag 5160
gcttcagttg catgaccatt tggagtttga tggtctgaaa atgagaagag gcaaatctgg 5220
ttattagaag acatgtatga aaaccaaaca aggtggcaag gacagcttga aagaaaattc 5280
caaggctgct gacattccta gataactgca gctgtagtta tgcctgctaa ggtttgggcg 5340
catggggctt ggcttttgtc agctccctgg gatttatttt cccaaacaaa gaaacctcca 5400
ggttaggggc accctattca ttcccatcac ctggcatgat ttaaaggata attgcttaga 5460
attaaaatat tgatccagat tttttatatt ccccatcgct ttttgtttct tctgggctgt 5520
agccagagat cattgattgg cgctcaggaa taagcagagt tagtctaaaa tgcaggcaaa 5580
tacttaaaca actgaagaga ttagaattta aagacaagtg tatgatatgt tttgaaatac 5640
aatgtttctc tttccagttt tggtttttgt cggcagcaaa taatgataag actgagttgt 5700
ttgcaaaata aactttagtc ttaaacttgg cctgattatt tgcataaagt gcagcaagaa 5760
tattaataat aattctgtag gaaaagcctg caagcaccag gagcttcaca gtctaacact 5820
atgagcacgt gcatcctcac gcaactcact gaatatttcc aagccagcct gttccgatct 5880
taaatgccat ccagtggcat ctgccccagg tacactaata catgggtcct gcttctctct 5940
gcagcctcct ctctcctcag atttcaggtt ttgtttattg tttgttttct ctctgacatc 6000
aacacagata tgttgaaggt tttctttttt ttatttgtag ttgttcagct ttgttgttaa 6060
tgaggtcaga ataagctcat agtttacaca tttttacatt cccatgccga gtagctgctt 6120
ttctctatca aatccattaa ctgagagaac aatcacattt cgttacaggt gaacagttaa 6180
atagtttggc atatatttct gtgctggaat ctaatgcagc ttgaaatcaa gtcatgcctc 6240
actcattgaa aaaaacatgg ctaaattctc aaagaattgt gctgagtgaa agaaactaag 6300
gaattaagag taaattttac atgatacatt tgtagaaatt ttagaagatg ccactattat 6360
aaattaacat ggagaagatt taaatgtttc tgagaatatg ctattgggag taatggggat 6420
gtgagttaaa tttcagagga ataagagaaa gatttaggga ttaatttttt caaaccttga 6480
ttgaagtgct gagtaaatgg ttgcaaacat aggtctacat ttttcaaatc attcaccata 6540
aatttgaatt atttattaat tacactcgaa taaagcaata aagaaactga tgagataata 6600
tttgactgaa ttgcatcaat aaatagatcg atattaacac aaggaatata actgatttcc 6660
aaaaacatac acatgaacca tggttcactc tgcgtattta gataaattac agaaagttgt 6720
cataacagat ggggaatcct gcagacttca ctaggcatgg gccatgctgc cctggagttg 6780
tctcagggga gctgcctcct ccagaggtta gagcacaggc ccaggtaata ggactaaatt 6840
tttagatgtg ttatcttaga cacactgcac aactgctgtg ttctctatgt aaattatctc 6900
ctgtaaaata taacattgaa gcctgcatta aatatattgt gtaaatatgt aagaataaaa 6960
gaaagttatg agagctaagt gttaatcaag gcacaagcat ataagatata actatatttt 7020
cctgaatgat ggaattacta ccagtctccc ccaggacact tcatctgccc tgagcccagc 7080
ctctcctcag atgtcccacc cagagcttgc tatatagtgg gggacatgca aatagggccc 7140
tccctctact gatgaaaacc agcccagccc tgaccctgca gctctgggag aggagcccag 7200
cactagaagt cggcggtgtt tccattcggt gatcagcact gaacacagag gactcaccat 7260
ggagtttggg ctgagctggg ttttcctcgt tgctctttta agaggtgatt catggagaaa 7320
tagagagact gagtgtgagt gaacatgagt gagaaaaact ggatttgtgt ggcattttct 7380
gataacggtg tccttctgtt tgcaggtgtc cagtgtcagg tgcagctggt ggagtctggg 7440
ggaggcgtgg tccagcctgg gaggtccctg agactctcct gtgcagcgtc tggattcacc 7500
ttcagtagct atggcatgca ctgggtccgc caggctccag gcaaggggct ggagtgggtg 7560
gcagttatat ggtatgatgg aagtaataaa tactatgcag actccgtgaa gggccgattc 7620
accatctcca gagacaattc caagaacacg ctgtatctgc aaatgaacag cctgagagcc 7680
gaggacacgg ctgtgtatta ctgtgcgaga gacacagtga ggggaggtca ttgtgcgccc 7740
agacacaaac ctccctgcag gaacgctggg gggaaatcag ctgcaggggg cgctcaggag 7800
ccactgatca gagtcagccc tggaggcagg tgcagatgga ggctgtttcc tgtcaggatg 7860
tgggactttg tcttcttctg acagttcccc agggaacctc ttaaatttag aaaactgtgc 7920
ctaacaatgt cttctctatg catatgagga ccttttctcc ctggcacaaa atgcagattg 7980
acgctgacac ggatgaaaat tcctcaacca tggtcacaag gatcagagtc ctgagtaacc 8040
tcagggcttc ctggtgattc ttctccaatc agacccagga cagggacctc cgtgagattc 8100
cctgactgga acagtcttta tggatcctgg tcacagacaa tagagaggct gaaccagggt 8160
cagcgtcatg tagaacgtca cagatttcac gtctgatcct tctcctgaca cgaaagtatg 8220
caaatcagta tcagcaccga tctggtgctt cttttgttcc taatccattt actttctttt 8280
ttcgtcgttt ttctcctttt tccatttgtt tttcctgctt tttgcaaaag gaagatgttt 8340
tccctgtgag atgcagggga tgacaatttt gggagatggc tggaacatcc aatatcctca 8400
gggccgacca tcagtaagtg caggctagaa gtctcagaaa gagctgaagc tgcttaatca 8460
ccgtggagtt ttaccttctc cagttctgct ctgatggaat cagggccaag caggttatca 8520
atgataatct acctaacata gagtcaactg attccagttt caataacgtc tgttaaaaat 8580
tcacaccacc acctggatta ctgttttgtc aaatcaatac acagtattgt ccagctaagt 8640
agacccaaag acggaccgtt gcccatggag aaaaacatta acctgagttc taggttctta 8700
cagtgttaaa ggtgtaaaac tgattattaa aaatgaggct atttttcttt ttgctgttga 8760
gttgtagaag tttcttttcc attttgacat gaaaactttt tcagatatat ggcatattat 8820
ccaattctgt aagttgtagt tattttgttg ctttgcagaa tctttttcat aatctattcc 8880
cacttgttca attctgcttt tttttgtagg tgatttgaat gtaaaatcct gaaaaagatt 8940
gctaattttt tgagtgttga gagttttaca attagaggta ttacagttag atatttgagg 9000
catttggagt gaatttttgt gtttattcta acctaaaatt cttaattctt ttcatgggaa 9060
aatccagttt tcataccacc ctctttggaa gacactacaa tttagccatg ttatattgat 9120
ggttctcatg ctaaaaatca gcttgtcatc aatatgtggg tttatatcta agctctatat 9180
aggtatttaa gccaaaactt tctatgtttt taataaatgt tgaggcctgg aagtgaaatg 9240
cctaaagctt tcttcttgcc ttgttacaga tattgtacca aaatattcta accttttact 9300
attgagtgta ataatagctg tggcctttct taacggcttt tattatgttc aagttgtttt 9360
cttgtcttcc tactttgttc atagttttaa taatgaaact gatttttttc aaagtctttt 9420
tctgtgtctg atgaaatgtt actgagatat tttttcttta gttttttaat gtgtaccaaa 9480
ctgattgatt tgagaatgtt gaatcaagta tgcatctcag gaagaaattt gagttggtca 9540
tggtgtatgt cttctaaaac actttggagc ttagtttact attgttgggg attaattcat 9600
gtctactaat gatattggtc tgtagtttcc tttattgtgg tgcctttgtc tattactggt 9660
aatactatca tggtagtctc atagaaagag tttagaaggt gtatggcaga ctacctttaa 9720
aatagatttt atcagtggag aaatggtgat agttttttct tcacttttct gttgggaaga 9780
attttatgtt gtttaaaaga tattcagaat gacttaacct ggtttatgag ctttcattct 9840
attcctttct tccattcttt ttgaagaact gcttaccttt cctatttatt tttaagtttg 9900
tttttagaca tatgcaatac attttgaggt gaaacctggt ggaatttttt ccaataaatt 9960
agtaaaagac aagtcttaac tgcttccagc tgaggaggga tgctgtttgg ggaagatctc 10020
tcttggaggt ctaagggacc ccaggaaaag ggagccatta tcccaggctt cagttgcatg 10080
accatttgga gtttgatggt ctgaaaatga gaagaggcaa atctggttat tagaagacat 10140
gtatgaaaac caaacaaggt ggcaaggaca gcttgaaaga aaattccaag gctgctgaca 10200
ttcctagata actgcagctg tagttatgcc tgctaaggtt tgggcgcatg gggcttggct 10260
tttgtcagct ccctgggatt tattttccca aacaaagaaa cctccaggtt aggggcaccc 10320
tattcattcc catcacctgg catgatttaa aggataattg cttagaatta aaatattgat 10380
ccagattttt tatattcccc atcgcttttt gtttcttctg ggctgtagcc agagatcatt 10440
gattggcgct caggaataag cagagttagt ctaaaatgca ggcaaatact taaacaactg 10500
aagagattag aatttaaaga caagtgtatg atatgttttg aaatacaatg tttctctttc 10560
cagttttggt ttttgtcagc agcaaataat gataagactg agttgtttgc aaaataaact 10620
ttagtcttaa acttggcctg attatttgca taaagtgcag caagaatatt aataataatt 10680
ctgtaggaaa agcctgcaag caccaggagc ttcacagtct aacactatga gcacgtgcat 10740
cctcacgcaa ctcactgaat atgtccaagt cagcctgttc cgatcttaaa tgccatccag 10800
tggcatctgc cccaggtaca ctaatacatg ggtcctgctt ctctctgcag ccgcctctct 10860
cctcagattt caggttttgt gtattgtttg ttttctctct gacatcaaca cagatatgtt 10920
gaaggttttc ttttttttat ttgtagttgt tcagctttgt tgttaatgag gtcagaataa 10980
gctcatagtt tacacatttt tacattccca tgccgagtag ctgcttttct ctatcaaatc 11040
cattaactga gagaacaatc acatttcgtt acaggtgaac agttaaatag tttggcatat 11100
atttctgtgc tggaatctaa tgcagcttga aatcaagtca tgcctcactc attgaaaaaa 11160
acatggctaa attctcaaag aattgtgctg agtgaaagaa actaaggaat gaagagtaaa 11220
ttttatatga tacatttgta gaaattttag aagatgccac tattataaat taacatggag 11280
aagatttaaa tgtttctgag aatatgctat tgggagtaat ggggatgtga gttaaatttc 11340
agaggaataa gagaaagatt tagggattaa tttattcaaa ccttgattga agtgctgagt 11400
aaatggttgc aaacataggt ctacattttt caaatcattc accataaatt tgaattattt 11460
attaattaca ctcgaataaa gcaataaaga aactgatgag ataatatttg actgaattgc 11520
agcaataaat agatcgatat taacacaagg aatataactg acttccaaaa acatacacat 11580
gaaccgtggt tcactctgcg tatttaggta aataacagaa agttgtcata acagatgggg 11640
aatcctgcag acttcactag gcatgggcca tgctgccctg gagttgtctc aggggagctg 11700
cctcctccag aggttagagc acaggcccag gtaataggac taaattttta gatgtgttat 11760
cttagacaca ctgcacaact gctgtgttct ctatgtaaat tatctcctgt aaaatataac 11820
attgaagcct gcattaaata tattgtgtaa atatgtaaga ataaaagaaa gttatgagag 11880
ctaagtgtta atcaaggcac aagcatataa gatataacta tattttcctg aatgatggaa 11940
ttactaccag tctcccccag gacacttcat ctgccctgag cccagcctct cctcagatgt 12000
cccacccaga gcttgctata tagtggggga catgcaaata gggccctccc tctactgatg 12060
aaaaccagcc cagccctgac cctgcagctc tgggagagga gcccagcact agaagtcggc 12120
ggtgtttcca ttcggtgatc agcactgaac acagaggact caccatggag tttgggctga 12180
gctgggtttt cctcgttgct cttttaagag gtgattcatg gagaaataga gagactgagt 12240
gtgagtgaac atgagtgaga aaaactggat ttgtgtggca ttttctgata acggtgtcct 12300
tctgtttgca ggtgtccagt gtcaggtgca gctggtggag tctgggggag gcgtggtcca 12360
gcctgggagg tccctgagac tctcctgtgc agcctctgga ttcaccttca gtagctatgg 12420
catgcactgg gtccgccagg ctccaggcaa ggggctggag tgggtggcag ttatatcata 12480
tgatggaagt aataaatact atgcagactc cgtgaagggc cgattcacca tctccagaga 12540
caattccaag aacacgctgt atctgcaaat gaacagcctg agagctgagg acacggctgt 12600
gtattactgt gcgaaagaca cagtgagggg aagtcattgt gcgcccagac acaaacctcc 12660
ctgcaggaac gctggcggga aatcagcggc agggggcgct caggagccac tgatcagagt 12720
cagccctgga ggcaggtgca gatggaggct gtttcctgtc aggatgtggg actttgtctt 12780
cttctgacag ttccccaggg aacctcttaa atttagaaaa ctgtgcctaa caatgtcttc 12840
tctatgcata tgaggacctt ttctccctgg cacaaaatgc agattgacgc tgacacggat 12900
gaaaattcct caaccatggt cacaaggatc agagtcctga gtaacctcag ggcttcctgg 12960
tgagtcttct ccaatcagac ccaggacagg gacctccgtg agattccctg actggaacag 13020
tctttatgga tcctggtcac agacaataga gaggctgaac cagggtcagc gtcatgtaga 13080
acctcacaga tttcacgtct gatccttctc ctgacacgaa agtatgcaaa tcagtatcag 13140
caccgatctg gtgcttcttt tgttcctaat ccatttactt tattttttcg tcgtttttct 13200
cctttttcca tttgtttttc ctgctttttg caaaaggaag atgttttccc tgtgagatgc 13260
aggggatgac aattttggga gatggctgga acatccaata tcctcagggc cggccatcag 13320
taagtgcagg ctggaagtct cagaaagagc tgaagctgct taatcaccgt ggagttttac 13380
cttctccagt tctgctctga tggaatcagg gccaagcagg ttatcaatga taatctacct 13440
aacatagagt caaccgattc cagtttcaat aacgtctgtt aaaaattcac accaccacct 13500
ggattagtgt tttgtcaaat caatacacag tattgtccag ctaagtagac ccaaagacgg 13560
accgttgccc atggagaaaa acattaacct gagttctagg ttcttacagt gttaaaggtg 13620
taaaactgat tattaaaaat gaggctattt ttctttttgc tgttgagttg tagaagtttc 13680
ttttccattt ggacattaaa actttttgag atatatggca tattatccaa ttctgtaagt 13740
tgtagttact tggttgcttt gcagaatctt tttcataatc tattcccact tgttcaattc 13800
tgcttttttt ttgtaggtga tttgaatgta aaatccagaa aaagattgct aattttttga 13860
gggttgagag ttttacaatt acaggtatta aagttagata tttgaggcat ttggagtgaa 13920
tttttgtgtt tattctaacc taaaattctt aattcttttc atgggaaaat ccagttttca 13980
taccaccctc tttggaagac actacaattt agccatgtta tattgatggt tctcatgcta 14040
aaaatcagct cgtcatcaat atgtgggttt atatctaagc tctgtatagg tatttatgcc 14100
aaaactttct atgtttttaa caaatgttaa ggcctggaag tgaaatgcct aaagctttct 14160
tcttgccttg ttacagatat tggaccaaaa tattctaacc ttttactatt gagtgtaata 14220
atagctgtgg cctttcttaa cggcttttat tatgttcaag ttgttttctt gtcttcctac 14280
tttgttcata gtttttataa tgaaactgat ttttttcaaa gtctttttct gtgtctgatg 14340
aaatgttact gaggtatttt ttctttagtt ttttaatgtg gtgtaccaaa ttggttgatt 14400
tgagaatgtt gaatcaagta tgcatctcag gaagaaattt gagttggtca tggtgtatgt 14460
cttctaaaac actttggagc ttagtttact attgttgggg attaattcat gtctactaat 14520
gatattggtc tgtagttttc ttttattgtg gtgcctttgt ctattactgg taatactatc 14580
atggtagcct catagaaaga gtttagaaga tatatggcag actaccttta aaatagattt 14640
tatcagtgga gaaatggtga tagttttttc ttcacttttc tgttgggaag aattttatgt 14700
tgtttaaaag atattcagaa tgacttaacc tggtttatga gctttcattc tattcctttc 14760
ttccattctt tttacacctg gcctcttcgt ttttattcat atattccttc agcagccact 14820
atgaagaagt gaagtcaaga tgaagaacca tttgcttttc t 14861
<210> 8
<211> 11077
<212> DNA
<213> artificial sequence
<400> 8
acacctggcc tcttcgtttt tattcatata ttccttcagc agccactatg aagaagtgaa 60
gtcaagatga agaaccattt gcttttctgg ggagtcctgg cggtttttat taaggctgtt 120
catgtgaaag gtatgtgata tttagaaaat gatcccagat caaaggaaaa atataggaca 180
gatctgtttt tcagttttaa acatttatgc ctatgctttg gttggccagt tagccagcta 240
ctttttccta acatgctttc atctttctat ggcagttgtc tttacataga taagactctt 300
tttattcttt tttctcttat cttacttact tgtttgccta ctacaaacta gagaataaga 360
aattagctaa ctacagatag tcctgcttat atgaatgtcc atattgaatc ttaaaaatct 420
ataatccttc ccttgagtct cttaaattaa tattctggag aaatgaaata ataattactg 480
taagtacaac atggaatgca ttgggatttt atttccccaa agaggctggt attggtttta 540
acagctttcc tggatctgat cccacagatt tattatgacc ctttaacaat gtttaatgtt 600
atacaggttg cctataaatt gatgactgaa cttttgttat tttgtttact gatctttcaa 660
acaacttcag aaaatggaat acattctgaa tttagaggta cagtattaaa taattttcat 720
ttagaaggac acactaaaat ttgtctggat tgttcctatc actgagtctg tggttcttat 780
ttcaagatga ataatgacag aaatgtgatt aagaataggc cttatatata tattttatac 840
taatttagca ttaagaataa ctttctaatg atttaaaata taaaatatta gtaaagcatg 900
tacaatgtaa gtatatatac atatctaaaa atatatgata tatataatgg ggctacttcc 960
ctataaaccc tataaagttg aaagttgaaa atatcatact ttcagcttac agtgacttta 1020
tagggaggta gccccattgt agctgaggag catactgaat acatatcact ttcacagcat 1080
cataaagtag aaaaatcata agtagaacca tggtaagttg gggaccatct gtatataaaa 1140
cagctatcac ttactgaata tgatattcta ggtatcatgc taattccttt acatataata 1200
ttccatttaa ttctctcagt aaccatatga gaataaatac tattattact tctgacatac 1260
agttgagcca gttgagactt acagaggtga agtaacttac caaagtcata cagcaagcaa 1320
gagtgtaagc agaacccagg caatctgacc ctatagcccg cactcctact atcctattct 1380
gccttgtgta aggcaatcac tttttagtaa actcattatg gtcatatcta gcatatgttg 1440
ttaaatctat tttcttattt gtttgaagtg aattaaattt aacaaatatt tattgaacac 1500
ctactatgtg tcagatgctc ttggaataaa gtaaggcaca gtccctaata acaggtttgt 1560
taggaaaaat actgcctgtt ggtgagagca aatgctagac tctaagtctt ggtgaattga 1620
aagtatttgt aacttttctg acagctatgt atttctgata gatattaggc tgcctaatat 1680
ttttctatat taacttccca ggtttctggt aaggaagtaa tcatatgccg taatagaata 1740
tagacatttt ctatttaaga cttaatagtc aagtcactcc tcaacaatct atatttttgg 1800
tcaggcatta tgtgctcatt tatatttcat tgagaaaaaa cttttaagag tcatggatac 1860
acaagtagga aagaggacat catttaagtc attttacctc tttcagatgt taatgagtta 1920
tttcattacc ttgaatttac tccataatgt ttctttccaa tattacacag tgtcagaaga 1980
ttcattagga acttatgaat aatggtataa taatgcatta taaagtactt caaacaatta 2040
caagtacaat aacaatggtc ttaaaatgtt tttgtagtgg ctcttatcat taagactgaa 2100
atactgcatt aggtcctctt tgctctggga atcgtctcag gaatagactt ccagttaaaa 2160
ttataactat tctgatcatt taaactttat aacttgaatt cgtatactaa atgttttttt 2220
tttagttgag gcttaatgag aggaattgtt aaaggtcact tagctaaaat tatatgaata 2280
aagcatattt aacatcattg ttgtaattat tattattatt attatttttt gagatggagt 2340
ctcgctctgt tgcccaagct ggagtggagt ggtgcagtct tggctcactg caacctccac 2400
ctcccaggtt caagagattc ttctgcctca gcctcccgag tagctgggat tacaggtgtg 2460
tgccaccatg cctggctaat ttttgtattg ttagtagaga tggagtttca ccatattggc 2520
caggctggtc tggaactcct gaactcatga tccacccgcc tcggcctccc aaagtgctgg 2580
gattacaggc gtgagccact gcacccggcc aacatcattg ttattattaa ttagcactct 2640
tatttaatag cattacttat gtaaagctat tattgaattt ttaaaatctg gtaatttgcc 2700
aggcgtggtg gctcacgccc ataatcccag cactttggga ggccgaggag ggtggatcac 2760
ctgaggtcag gagttcgaga ccagcctaac caacatggtg aaactcggtc tctactaaaa 2820
atacaaaaat tagctgggca tggtggcaca tgcctgtaat cccagctact tgggaggttg 2880
aggcaggaga atcacttgaa cccgggaggt ggaggttgca gtgagctgag atcacgccat 2940
tgcactccag cctgggcaag agtgaaactc tgtctcaaaa aataaaatta aaataaaatc 3000
tgttaattga tccaaaccca gatgatacaa aatttccttt gtttattcct tgcaatactt 3060
atcaaggtca ggtgcggtgg ctctgtaatc ccagcacttt gggaggctcg aggcaggtgg 3120
atcacttgag gtcaggagtt taagaccagc ctgggcaaca tggtgaaact ccatctctgc 3180
taaaaaaaaa tacaaagatc tagccaggtg tgatggtgca cacctgtagt cccagctact 3240
tgggaggctg agggaggaaa atcacttgaa cccaggaggc agaggttgca gtgagctgag 3300
atcgtgccac tgcactccag cctggatgac agagtgagac attctggcaa ataaacaaac 3360
aacttatcag acctctttgt ctaatattca gattagacaa taaatgttga cattcaaagt 3420
taaggcttta atggtcatat gtgtatttat tcatttattc attttttttt ctgtatacag 3480
gcatattcaa ttgagattgt acctcataat atttctaaat tgcttttgca aatttacagc 3540
actttatatc tatatatact ctatctttta aaataattct ttttttcctt ttttttttga 3600
gatgaagtct cactctgttg cccaagctgg agtgcagtgg cacgatcttg gctcactgca 3660
acctccacct cccaggttca agtgattctc ctgcctcaga ctcttgagta gctgggacta 3720
caggcatgca ccaccatgcc cggctaattt ttgtgttttt agtagagatg gggtttcact 3780
atgttggcca ggccggtctt gaactcctga ccctgtgatc tgcccgcctc ggcctcccaa 3840
agtcctggga ttacaggcat gagccactgc gccctgctta aaataattct tataagaaat 3900
atagcctttt tttaaattga atcatgtact atgataatct gattgttatt atgaaaaggc 3960
agataattgt ggtagagaat atttaaaagt cttacaaaat aaggttatct tggatgaaaa 4020
catgcctgtt cttaactata atacaattta tacaaaaaaa atttaagtac aaatgagagt 4080
ttaggaagga tgcgaaaatc agaattcttt caacccagat tcccatagac tggttcaaga 4140
aaaaagtgaa aatccctttt agagtacctt tggatttcct ttactgtgat gtaagattct 4200
gcttagagaa caaatggaga aagctgagac aaaccaacca taaaactatt tttctgcagg 4260
caagaaagat aaaataactc aatacaaaat atataaattg acctttgttt tcctttcttt 4320
ttaatacgtt tacctttcaa gcccaagaag atgaaaggat tgttcttgtt gacaacaaat 4380
gtaagtgtgc ccggattact tccaggatca tccgttcttc cgaagatcct aatgaggaca 4440
ttgtggagag aaacatccga attatgtatg tggcattcca tattcctttt tttcctgttc 4500
taagcagaga tactttctga cactgagcat tgtttgaatg taacacctgc ctctttactc 4560
ccctttttgc tgtgtagcac tttacatgag taaagaattg agtcttatga atgagcttga 4620
ggcaatttta ttgaaattag gggtaagaat gaaagattac agccaatcta aaatcagtgg 4680
gctacatcaa attataaaca aaggttgtaa tctagtcatc aacatgcact gtttatatgg 4740
agttatttct attcatttat ttttgtttgc ttgtttttga gacagggtct cactctgtca 4800
cccaggctag agtgcagtgg caagattact gcttactgca gcctcgaact cccaggctca 4860
agtgatcctc ctcagtagat gggactacag gcatgcacca ccccacacct ggataatttt 4920
ttatttttct ttagagacag ggtctcacta tgttgcccag gctgttctcg aacttctgag 4980
ctcaagcgat cctcctgcct tggcctccca gagtgctagg attacaggcc tgaaccacca 5040
cgcctggcca agttatttca atttgaatca gaactttcag ttgcctttga tcagttcaaa 5100
ctgaaacttc agataacact aaagccttat tttgtaccat atatagataa ctgagtcttt 5160
tatactgtaa ccagatgtct gcatcgagat aaggcagact gatcacatta atttgatttg 5220
atctcacttt tagattggat ttcagatatt catttcattt cacacacagt ccagagttat 5280
atgctataca gctctttttc aatagcaaga atgctaatac tatttgagaa attagatttc 5340
tattggatcc aatcagttat aagttttcct tgtaaaccac tcctacctta atagtttttc 5400
cccaatcaaa tctcctatag aaagagaaca gagcattcca ttccattcaa tcaatggaaa 5460
aagtataatg tgtttaagaa ttttgtttta agttctgtct gggacactta atagttgtgt 5520
gtttttgtat gcaagttact aatctacttt gaacttcaac tttttcatct ttaaatgagg 5580
aaaaaaatca tgcttaactc ataggattat tgtaaaaata aatagagaat attcctttta 5640
gagcttatat atttctggga catattaaaa aatatttact attatttatt cacctaatct 5700
atgtgccaat tattgtcacc atcctgctta aaagtcttgt gatggtttcc ccaacattga 5760
ccttaggata tattacaaat gccttggcct gatctacaaa gcttagcata actctcaaga 5820
tcatccattg ccacttttct tttacttcat gtttaaaaat tcagcactac tggggctggg 5880
cacggtggct catgcctgta atgccagcac tttgggaggc cgaggcgggt ggatcacctg 5940
aggttaggag tttgagacga gcctggccaa catggtgaaa ccccgtctct actaaaaata 6000
caaaaattat ccgggcgtgg tggcaggtgc ctgtaatccc agctactcgg gaggctgaag 6060
caggagaatc acttgaaccc ggaaggcgga ggttgcagtc agccaagatc atgccattgc 6120
tgcactccag cctgggggac aagagcgaga ctttctcaaa aaaaaaaaaa aaaaaaaatt 6180
cagcactgct gtagtgtctc tatttcccca cattcacagt gcttttacat ctgctagttc 6240
ctctgtttga agaactctgt cccacccctc ttagcctagt taactccttc tcaatgtgca 6300
tgtcttagtt cagccatcac ttcctctagg aggttttctt tgaccactca gatcctgccc 6360
acatagcttc gtgttaggga tagaatagct ggagtagtgg gtaattggaa agaatggcat 6420
acatgacact ggcacttaag ccatgttgaa gagtgtctat cttacgttta aacctccagg 6480
agatttttaa aatctcctcc atgatcttct aattcatctt tttattttat gaaatgattc 6540
tttttatatt acattgttta ctgaaatcag ataattcaga tatttcatga cactttgcct 6600
gatgtgttga tcttctgttc tgatttctga aagtatattg ccattgctgt gccattatgg 6660
tattattatt attattctta tcattatttt gagatggagt ctccctctgt cacccgggtt 6720
ggaatgcagt ggtgcgatct tggcttactg caacctctgc ctcctgggtt caaacaattc 6780
tcctgcctca gcctccccga gtagctggga ctacaggcac gcaccaccat gcccagctag 6840
tttttttttg tatttttagt agagatgagg tttcaccatg ttggccaggc tagtctcgaa 6900
cacttgacct caggtgatct gcccaccttg gcctcccaaa gtgctgagac tgcaggcatg 6960
agccaccgtg cctggctttt tttttttttt tttttttttt tttttttgag acagagtctc 7020
actctgttgc ccaggctgga gtgcagtagc atgatcttgg ctcactgcaa cctctgcctc 7080
ccaggttcaa gcaattcttc tgcctcagtc tccctagtag ctggaactac atatgcactc 7140
ctcatcgcct ggctaatttt tgtattttta gtagagacaa ggtttcatca tgttggccaa 7200
gctgggcttg aactcctgac ctcaagcaat ccacctgcct cagcctccca aagtgctggg 7260
attacaggcg tgagccacca tgccgagcct gccattatgg tattattccc aactccatta 7320
cttactaaaa tgatgtgact ttgggcaagt tacttaacta gcctttttcc taaactttaa 7380
aaaaagggat aattatgcat gtatgaattt atacacatat actttatagg gttgtgagga 7440
ttaaatacaa taattcataa aagaacagtg tttggcttat agtagcctct cctttttctg 7500
atattactag agtctttaca gtcttattaa gaatagtttc attcattcat tcaacgaatg 7560
tttatttagt aactgccaaa tattaagggt tggtggtggg gataaaacag tttatttaaa 7620
atatagtcca tttcttaaaa gaatttatag tctaacagga aaccgtaaac aattaaaacg 7680
gcaattacaa ttctgtattg taagtgctat gataggaatg ggcatgggtt gttataggag 7740
tactgaggag gggcacccag tccaatcctg ggggagagga gaagggtggc agtgggaggc 7800
aggaaagggt gcttggaata agtggtggga tggatgatgg ctgttagctc aataggagtt 7860
tgctggctaa atgtgtgtgg ggagtgagag aattaatagg tggggcaggt cattccagga 7920
aaaggaaaca ggattgctgt gacgattata tgagatcatc acgtaatatg cttagcatag 7980
taccaggcac actgtaacca gtattatatg ttagaactgt tattgttaag atgggagatg 8040
agaaaagcat ggtacgtttg tggaattgca agtgattgtt tacagctggc atgcattgta 8100
gatggtgaca ttgtgagaga tgaagctgga gagttagggg catattagga tacgtgcctt 8160
ctatgctatt tgaggatacc ctgttatctg tggagagaca ctagaggttt taagcagaga 8220
aggaaattaa gcttgcattt ttagaaagga tgtttggcaa taatgtggcc agtggtttga 8280
aggggtcaag acaaaaggaa gacattgtta caatctagga gaggatagtg gtgtcctgaa 8340
ttaaggcagt aataatgaga agagaggcaa gtggatggat ttcacacatg aaaataatta 8400
gaatcaactg gactgtgaga ttgattagat atgtgacagg ggaggattca ggtatgatag 8460
ccaggttttt ggcttggatg gtatttataa atactgagtt aggaaacaca aggtttttgt 8520
tgttcctttt ggtagggagg gctgcacatg agaaggaatg atgacttaga tgaaaaaaat 8580
gttaaagtta atgaaatcac ctgaactcgt tgacaatcat ctaatttttc atgggacatc 8640
acgatgaaaa agttccagct ttgctactta attgttgtgt gaacttgggt ggatcactaa 8700
cctctcctat ttatgatttc ctctcctcta aaatctggat aataatatct gtctgtccat 8760
ttcacatata ttgaacatat tccatatgcc acatacctag ctaggcactg ggaatacata 8820
aatgtttttg cctcaaggaa cttacatagc ccagggaagc aaccactctg tagacaagca 8880
actatgatac ggcatttgat ataataaata tatatgagca ttaatgagac tgcagaggaa 8940
gaagtttgtt gaagggtggg gataacatca atataattat tacagaggta aaacgtgagt 9000
tgagttttaa aggaaaagca aaaatttgcc aagtcagcaa agatagttga cagcggagtt 9060
gttgtgaggt ttattttgaa tgagaaatat ttaaaatata gcaagttact ataaaaatct 9120
gagtaacggt attactctag agtgcttata gagtactaaa tacaataagt actgttctga 9180
tgaaatagtg gccgtatact cattaccaaa aataacaata tctgcatttc attgttttaa 9240
ctttgttttc tttcttttct tttagtgttc ctctgaacaa cagggagaat atctctgatc 9300
ccacctcacc attgagaacc agatttgtgt accatttgtc tgacctgtaa gatatatttt 9360
tttccatagt aatatagatg tggaagttaa tagcttttaa ttttaacctt gttagtaaga 9420
atgtttttaa aaatatgttg gagtataaac atttacaaac ataatctgaa cttttgaata 9480
cattaattcc tatgttaatt attaggtatc ataaattcat aaaactttgt cacagataaa 9540
atttagctat acattttttc taaagaaaaa atcattggca ttcatagaaa ggccaatttc 9600
tcttaatagt tcaataagtg gatttgatct tataaaaagg caggtgtttc tttggaaatg 9660
acagactcca acatcaattt ttttaaaaat tctccctttc ttgtcactat aaataacttg 9720
tttagacaga tatacagttg ggaataagcc taacacagta gaaattgctg tatggtgtag 9780
ataaaacaat catattatca tatcattaat tatattgctt actttcaact aatatatatt 9840
aaagattgga aaatcccata agctattctg tattgtagag ctgcttatgt ctgaaaggag 9900
tcatcccttg ctgtcatgtc agagctgcaa gaactaattg attttggatt gaaatgtgta 9960
gtcacatttt gagacagcat ttgaggggat tgtctaatac atatatttgc ttttcagctg 10020
taaaaaatgt gatcctacag aagtggagct ggataatcag atagttactg ctacccagag 10080
caatatctgt gatgaagaca gtgctacaga gacctgctac acttatgaca gaaacaagtg 10140
ctacacagct gtggtcccac tcgtatatgg tggtgagacc aaaatggtgg aaacagcctt 10200
aaccccagat gcctgctatc ctgactaatt taagtcattg ctgactgcat agctcttttt 10260
cttgagaggc tctccatttt gattcagaaa gttagcatat ttattaccaa tgaatttgaa 10320
accagggctt tttttttttt ttgggtgatg taaaaccaac tccctgccac caaaataatt 10380
aaaatagtca cattgttatc tttattaggt aatcacttct taattatatg ttcatactct 10440
aagtatcaaa atcttccaat tatcatgctc acctgaaaga ggtatgctct cttaggaata 10500
cagtttctag cattaaacaa ataaacaagg ggagaaaata aaactcaagg actgaaaatc 10560
aggaggtgta ataaaatgtt cctcgcattc ccccccgctt tttttttttt ttttgacttt 10620
gccttggaga gccagagctt ccgcattttc tttactattc tttttaaaaa aagtttcact 10680
gtgtagagaa catatatgca taaacatagg tcaattatat gtctccatta gaaaaataat 10740
aattggaaaa catgttctag aactagttac aaaaataatt taaggtgaaa tctctaatat 10800
ttataaaagt agcaaaataa atgcataatt aaaatatatt tggacataac agacttggaa 10860
gcagatgata cagacttctt tttttcataa tcaggttagt gtaagaaatt gccatttgaa 10920
acaatccatt ttgtaactga accttatgaa atatatgtat ttcatggtac gtattctcta 10980
gcacagtctg agcaattaaa tagattcata agcacaccca ccaaggctcc ggatgtgttc 11040
cccatcatat cagggtgcag acacccaaag gataaca 11077
<210> 9
<211> 7426
<212> DNA
<213> artificial sequence
<400> 9
gtattctcta gcacagtctg agcaattaaa tagattcata agcacaccca ccaaggctcc 60
ggatgtgttc cccatcatat cagggtgcag acacccaaag gataacagcc ctgtggtcct 120
ggcatgcttg ataactgggt accacccaac gtccgtgact gtcacctggt acatggggac 180
acagagccag ccccagagaa ccttccctga gatacaaaga cgggacagct actacatgac 240
aagcagccag ctctccaccc ccctccagca gtggcgccaa ggcgagtaca aatgcgtggt 300
ccagcacacc gccagcaaga gtaagaagga gatcttccgc tggccaggta ggtcgcaccg 360
gagatcaccc agaagggccc cccaggaccc ccagcacctt ccactcaggg cctgaccaca 420
aagacagaag caagggctgg gctgtgaggc aacccccacc tccccctcag agcacgttcc 480
tcccccttca ccctgtatcc acccctccgg accctcccca tctcagtccc tccgctccct 540
ctctctgagg cccatctccc aatacccaga tcactttcct tccagaccct tccctcagtg 600
tgcacggagg cagcttgccc agcaaaggtg actgtctagt gggcttccca cagccaagct 660
cccaccccat gctgcggccc ctcccttctt cctgcttggc tgcctgtgcc ccccacctgc 720
ctgtccacaa cccagcctct ggtacatcca tgccctctgc cctcagcctc acctgcactt 780
ttccttggat ttcagagtct ccaaaggcac aggcctcctc agtgcccact gcacaacccc 840
aagcagaggg cagcctcgcc aaggcaacca cagccccagc caccacccgt aacacaggtg 900
agaagcccct tccctgcaca ctccaccccc acccacctgc tcattcctca gccgcctcct 960
ccaggcagcc cttcataact ccttgtctga gtctccaagt cacactttgg taaggagagg 1020
gacactgaac ggacctctaa caaacaccta ctgccagcca gccccagtct gggggccagc 1080
agatgccaaa caaccagcag actcccagag cagacctggg ccggctccct ggcccatgga 1140
cccagctctg cctcgctgag ctgaggcatg ggctctcagc gcagcctcac atagagccac 1200
cctgccgagg cagtccggct tgcagactca caggtcactt gggccgcagc agcccctccc 1260
cgtgaccctc gcctcccgcc cgccccagcc tggctctctc caagtgttgg atcttggtgg 1320
ccagcctgct tctcaccctc accctgcctg ccacctcaga atggcagggg aaagagggcc 1380
ctcaccaaga actttatctg agaagtctga ggcttgtgac tctgacctgc ctgagatgtc 1440
catgtggccg gggggacggg ttcagtgttc gggagaactc gggtacgtgc ctgactttct 1500
ctgagtaggg caggaagctg ttaggagaag cagcagtgag gtgggctgga ccaacaggca 1560
gaatgactgt ccctcagcca ccctctggga tgtgggtcaa gctctgacaa aggcatggca 1620
cagccatggt ggcccctgct tggatgagtg gccacggtgc cctcaccctg ggccagaatc 1680
tgcctccact ctgcaggtgc agaaacacga cattcccgtc tctaaacaca cctagctcct 1740
aggcttgggg tgggcctatc aaatgcaggg agatggacac agcacaaggg ccagagcttc 1800
ccatgagaaa ggtgagggca gctgctccct gacccgggca tctgcacttg tccctctcca 1860
ccctcctcat gggcagtgga gactcagcaa caaaacaagt tgagtgcatt agcagccagc 1920
tctggagcca agtcactcac cccacggcct tggctgctgg tggaggggcc ttcccctggg 1980
cagcctccaa gaagacagcc aagtgctctt actcagacca cggcgctgct tcctggcacc 2040
tcgatttccc acaacaacat ggggtgcaga caggctaggg ccccctgccc tggggcctgg 2100
acggcatcca gttaaagatg acccttcacg ggcggtgcct gaggtgtgct gacctcagca 2160
gctaagccct caggtctggt ctgcactgcc ccacctggag gacccaactg acccagacac 2220
agccagggtt atggcatgac cccgtggacg gtgacccaca ggccagatgc agccgggggc 2280
tgttttgtgt ggcctagaaa tgtctttaca gttgtagtgg gatggaggag gaagaggaag 2340
agaggagggg agaggaaagc agggaagggg aaaaagagga gttcaatgca accccaaaag 2400
ccagaacagt tttgagctga aagaacaagg caggaaacat cccagtacct gacttcaaaa 2460
catactataa agcagttgta atcaaaacag gatcataaaa acagacacac agacccatgg 2520
aacagaaaag cgagcccaga aataaatcta catgcttgca gtccattgat tttcaacaaa 2580
ggcaccagga aaacacaatg gggagaggac agtttcctca ataaatagtg ctggggaaac 2640
tggatatcca tgtgcagact aatgaaacta cacaaaaatc aattgaaaac agtctaggcc 2700
aggcgcggtg gctcatgccg gtaatcccag cactttggga ggccgagaca ggcggatcac 2760
ctgaggtcag gagttcgaga ccagcttggc caacatggcg aaacccggtc tccactaaaa 2820
atacaaaaat tagcacatgg tggcctacgt ctgttatccc agcttttcag gaggctgagg 2880
caggagaatc gcttgaatcc gggaggtgaa ggttgcaggg agccaagatt gcgccactgc 2940
attccagcct gggcaatgga gcgagactgt ctcaaaaaaa aaaaaaaaaa aaagaaaaga 3000
aaacagtcta aaggtttaac tgaacagata aagctactag aagaaaacat agggggaaaa 3060
ctccatgaca ttagtctgag caacgatttt tggatatgat cccaaaagct caggcagcac 3120
tagtcacaaa agccaagata cagaaccaac ctaagcaccc ctcagcagat gcacaggtaa 3180
agaaaatgtg gtacgtatgg ggcacaatgg aatacgattc agcctttaaa aacagtgaaa 3240
ttctgtcatt ggcaacaatg tagatgaacc tgaaggacac ttatgctaag tgaaataagc 3300
caggcacaga aggagcaata ctgcatgatt gcacttacat ctggcaggtt aaaaaggcaa 3360
actcttagag gcagacagta gagaggtggt gccagggagc gggcactggt ggctggggag 3420
atgttggtca aagggcacaa aactgcagtt gggaggaatt agttcaggac atcccttgta 3480
catggggaca gtggttagta acaacggatt gtatccttga aaaccgctaa gaaaatagtt 3540
tttaagtgtt cttgacacaa aaagtgacac gtatgtgaga tactgcatgg tcattagctg 3600
gatttagcca ttccacaatg tacacatatt tcaaacattg tgttgtatat gataaacatg 3660
tataattttt gtcaattaaa aatttttagg aagaggagga gaagagaaga agaaggagaa 3720
ggagaaagag gaacaagaag agagagagac aaagacacca ggttttttct gacccctggg 3780
ctatcaaaac acctattgcc caataactag ttggccgttg gtgccctaaa ctattgaagc 3840
gattgctgtt atgtggatgg gccccggaca cttagaaact cgtgacccct gaggaccccc 3900
acgaggacag tcagggtccc cccgaactca gggagcactg aggaaggagc tcttagaggc 3960
gtggggcccc tcaggcccct cagagggctc tgccacatgg gtcaggggca ggctgagggg 4020
gagtcccagg ctccatgccc agcctctgtg cctctgacca gggtgtcccc cacaccgcct 4080
cctccccagt gccctccact ggccacacct ggccagaagc tggggagagg agagcacagt 4140
ggttaagtca gtccctgcag ggagacggca ccagaaaaac ctggcctgtg gatgagtccc 4200
ggcctggcag ccacagagca gagagctctg gaagcaacga aggcccgagt ctgctcaggg 4260
aagagcgggc agcagcccca gggccggaca gtgaccaaga gtggcaccgc ccatggctca 4320
acgggtcttt gcccacagat cccccagccc ctggagacag ggtctgtgtg cctggccgtg 4380
caggcaggca ccacactcag ggggaggcca ctgtggagct ctgtgcagag ccccgggcgg 4440
gagcctactg ctcccgaagg tccggccaca gctgctctcg tttgctctcc cctgcagagt 4500
gtccgagcca cacccagcct cttggcgtct acctgctaac ccctgcagtg caggacctgt 4560
ggctccggga caaagccacc ttcacctgct tcgtggtggg cagtgacctg aaggatgctc 4620
acctgacctg ggaggtggct gggaaggtcc ccacaggggg cgtggaggaa gggctgctgg 4680
agcggcacag caacggctcc cagagccagc acagccgtct gaccctgccc aggtccttgt 4740
ggaacgcggg gacctccgtc acctgcacac tgaaccatcc cagcctccca ccccagaggt 4800
tgatggcgct gagagaaccc ggtgagcctg gctcccaggt ggggagacga gggtgcccac 4860
agcctgctga cccctacgcc tgccccaggg ccatgacccc agctgggccc cagcagcacc 4920
ggtcatcctc cacaggaaag gagaagggag gcaccagcac cctggccggc cccacttctc 4980
tcccagtgcc cccgtggcca gagcctgaca gcctccccca cctccccgca gctgcgcagg 5040
cacccgtcaa gctttccctg aacctgctgg cctcgtctga ccctcccgag gcggcctcgt 5100
ggctcctgtg tgaggtgtct ggcttctcgc cccccaacat cctcctgatg tggctggagg 5160
accagcgtga ggtgaacact tctgggtttg cccccgcacg cccccctcca cagcccagga 5220
gcaccacgtt ctgggcctgg agtgtgctgc gtgtcccagc cccgcccagc cctcagccag 5280
ccacctacac gtgtgtggtc agccacgagg actcccggac tctgctcaac gccagccgga 5340
gcctagaagt cagctgtgag tcacccccag gcccagggtt gggacgggga ctctgagggg 5400
ggccataagg agctggaatc catactaggc aggggtgggc actgggcagg ggcggggcta 5460
ggctgtcctg ggcacacagg ccccttctcg gtgtccggca ggagcacaga cttcccagta 5520
ctcctgggcc atggatgtcc cagcgtccat ccttgctgtc cacaccacgt gctggcccag 5580
gctggctggc acagtgtaag aggtggatac aacccctcgc cgtgccctga ggagtggcgg 5640
tttcctccca agacattccc cacggctggg tgctgggcac aggccttccc tggtgtgacc 5700
gtgaatgtgg tcaccctgaa cagctgccct ctctggggac atctgactgt ccaagaccac 5760
agtcagcacc tctgggagcc agaggggtct ccagagaccc ccagatgtca ggcttgggct 5820
cagtgcccag cgaaaggtca gccccacaca tgcccataat gggcgcccac ccagagtgac 5880
agcccccagc ctcctgccag gcccaccctt ttccgccccc ttgaggcatg gcacacagac 5940
cagtgcgccc actgcccgag catggcccca gtgggatgtg gtggccacga ggggctgtac 6000
acacagcagg aggctgtccg ccctgctcag ggcctgctgc ctatgcccca gctgtccagc 6060
caagggaggc atggaagggc ccctggtgta agctggagcc aggcacccag gcccccggcc 6120
accctgcaga gccaaggaaa ggaagacacc caagtcaaca aggggcaggg ctgagggctg 6180
tcccaggctc ttttggcccg aggggctgcc agcagccctg acccggcatg ggccttcccc 6240
agaagcgacc ctgtgaggtg gcctcacaga gaaccccctc tgaggacagt gtctgaccct 6300
gcctgcctca cacagatggg ccccacagca gtgggcaacc tggggggcag cagcccaacc 6360
tgaccctgca gggactgccc cctgcagcag cagctgcttc tcagtccccc aacctccctg 6420
tccccgccag agggtcttcc ccgaagctgc agccccaacc catggctgcc cacctggaac 6480
cgggactccc tgtccactgc cccctcccct tcggggcccc atctgtgctg gggcccaggt 6540
tcggcctaca gattcccatc attgccatgg cctcctgacc ttgcctatcc acccccaacc 6600
accggctcca tgctgaccct cccccaggct cccacgccca gctggccggc catccccagg 6660
cacagacagt ctgggatctc acaggttagc ctggaccatc cacctggcca gacctgggag 6720
aggctggaag ctgccctgcc accatgctcc agggccccag gttgcagtac tatggggtga 6780
gggtgtgtgt gcacacccgt gtgtacctag gatatccgag tgtacccttg tgcccccaag 6840
cacaagtctc cctcccaggc agtgaggccc agatggtgca gtggttagag ctgaggctta 6900
tcccacagag aaccctggcg ccttggtcaa ggaagcccct atgcctttct tgcctcgatt 6960
tcccctcttg tctgctgagc cagcaggggc cacgtcctgg gctgctgtga ggaggaagtg 7020
agttggtgct aggaggggct cctgtgtgtg catgggcggg aggggtgcag gtatctgagc 7080
accccggtct ccacttgaga gagcagggca ggagctccct gacccaccca gactacacac 7140
gctgtgtcca cgtgtctcac attatctgtg gcagaggatc cggcttcttt ctcaatttcc 7200
agttcttcac aaagcaatgc ctttgtaaaa tgcaataaga aatactagaa aaatgatatg 7260
aacagaaaga cacgccgatt ttttgttatt agatgtaaca gaccatggcc ccatgaaatg 7320
aagcagctgc aggagtcagc cctggacctg aagagcacac acttaccctc ttcacacatt 7380
acattctgct gggcctttat taaccacatt tctatgaacc tagtca 7426
<210> 10
<211> 15081
<212> DNA
<213> artificial sequence
<400> 10
agcagctgca ggagtcagcc ctggacctga agagcacaca cttaccctct tcacacatta 60
cattctgctg ggcctttatt aaccacattt ctatgaacct agtcagggtt gggtttcaag 120
tgtatggttg ccacggctat cagagttgaa gtcagcttct cctgttcaca aaaagttcag 180
gttcctccag tgatacccac ttttgtgtcc cggtttggct cttcccattt ctttctcccc 240
agagagagcc tgtctctttc atctgtggca ggtgcatcct gctgacactt ttacttggtg 300
attgtttgtg gggtgaaggg gcttggacac agggggatgt tctccaacct tctgaccaag 360
catccttctt agctatgggt ggtgagagtg gctctgaagc atggtcttcc aagcgttcct 420
gttccttccc ttccccaagt cagagtgtct ctttctagcc acagtggtct tccatcagtg 480
tcctcagctt ctgacccact gtccttactc cacagactca gggtttcatt cctcaggaaa 540
gagacgggag gtgtttctgg gtagagtctc cttggtgtcc tctgtttcct tctgttctag 600
ttgattctac cagtgcctga aggacacaag atttaataaa tgtctcccac atatcgtgaa 660
gagggattca gcattgaacg gagctactgt tcttcctccc cagtcaacac cacaggacag 720
caggtggctg agttgtctgg ggatttcccc aattctgtag gaaaagcccg caagtgccag 780
tagtttcaca ctctcacagc gttagcacac acatcctcaa caactcatga aacatttcca 840
ggttagcttt ttcctatctt caataccata caatgagtgg cacctgcccc aggtactcta 900
ataaatggac ctagttttct ctgcaggctc cccattttct cagatttcag ggtttttttc 960
tctgcaacat caactcagat atgttgaagg gttcattttt tgtagttctt caaatttctt 1020
gttaatgagg tcagaagaag atcattttct catttttttt tttacattcc cgtgcttagt 1080
atctgctttc taaataaaat tcggaaacta agacaaaata tcaaatatcc actatttggt 1140
gcattgttaa actatttgag aaatattcat gtactgaaat acaatgaaca attgaaatca 1200
aggcatgcct cagtcacata agaatgtgag gacattatca aataattgtg ctgagtgaag 1260
gaagctaaag aattcacagt aaatctcctg tgatttcttt tgtataaatt gtagaaaatg 1320
caactattct aaattaacat ggagaagatc tgaatttttc tgagaaaagt gtggtgaggt 1380
aacaagatgg tgaaataaaa ttacagagaa gtgagagaaa aaaatttgga ggttaattta 1440
attgctaata gcacgattga agtgctgatt caaaggctgc acacatatac caacattttc 1500
catatcgtac actataaatt tgaattcact attgattgaa tttttgaata aagcagtaac 1560
aaaaatgagt atattggctg aggaagagca agaaagagat gaatattgac acttgaataa 1620
tcacggactc ctgaaaatac acacatgtga acactgattg catatttcag gtaaatacta 1680
gaaaaagcaa agtcacataa agtcgttatg acaggtggga taccctgcaa atcacactag 1740
gcatgtccca cactgccctg gagctgtctc aggggagcag tctcctccag tgtttagagg 1800
cacaggcacg gataataggg ctgacgctgt ccagatgtgt gatattggac acattgcaca 1860
actgctctgt tatgtatgta attcatcttc tctaaaaatg taacattgac acttgcactg 1920
aatatattct gcaaatatgt aaacattaaa taagatgatg actgctaatt gatcatcaag 1980
tcacaatcac ataatctgaa gttatatttt cctgagagat aggattacct ccagtgtttt 2040
ccgggaccct ctcatctgct ctgggcactg ccctctcctc cagcgtccca ctagagcttg 2100
ctatatagta ggagacatgc aaatagggcc ctccctctgc tgataaaaac cagccgagcc 2160
cagaccctgc agctctggga gaagagcccc agccccagaa ttcccaggag tttccattcg 2220
gtgatcagca ctgaacacag aggactcacc atggagtttg ggctgagctg ggttttcctt 2280
gttgctatta taaaaggtga tttatggaga actagagaca ttgagtggac gtgagtgaga 2340
taagcagtga atatatgtgg cagtttctga ccaggttgtc tctgtgtttg caggtgtcca 2400
gtgtcaggtg cagctggtgg agtctggggg aggcttggtc aagcctggag ggtccctgag 2460
actctcctgt gcagcctctg gattcacctt cagtgactac tacatgagct ggatccgcca 2520
ggctccaggg aaggggctgg agtgggtttc atacattagt agtagtagta gttacacaaa 2580
ctacgcagac tctgtgaagg gccgattcac catctccaga gacaacgcca agaactcact 2640
gtatctgcaa atgaacagcc tgagagccga ggacacggct gtgtattact gtgcgagaga 2700
cacagtgagg ggaggtcagt gtgagcccag acacaaacct ccctgcaggg gtccccagga 2760
ccaccagggg gcgcccggga cactgtgcac ggggctgtct ccagggcagg tgcaggtgct 2820
gctgagggct ggcttcctgt catggcctgg ggctgcctca ttgtcaaatt tccccaggaa 2880
cttctccaga tttacaattc tgtactaaca tttgatgtct ctaaatgcaa tacttttttt 2940
gtcctttttg tttctttgtt tttttgcaac aggagtacat atcctcagct ccacagaagc 3000
cagggtgtca ctttgggggc agaaataatc ctttcatggt taccaggata agagtcctga 3060
ggaatcccag ggaaacctgg agagtgtttt ccagttagac tcagggcaga gacctccatg 3120
ggaatctctg attagaacag gccttgagct ctgacgggag ccaagagaga ggctcgccca 3180
gggtcagagt ccttaaaacc tgatggtttt cacagctatc ccccctcgtc ttgtaaactc 3240
agactgattc agttgaccct ctttctgcta atccatttcc ttctctgtag gtttgattct 3300
cacagttcgc tttcttcttc tcttccctga aaacagacga tgtgttttct gtagtcaaaa 3360
tcccagggct caggtctgca ggacctgggt aggctacggg gactttctca ctcaccattg 3420
tccggacact cttgtcttct gtgcatggag gcatttggaa aatgaagtgg acattagcca 3480
tgaagggaat aatactagtt ttctccaata ggatattgat gtagagctga tcttgtgctt 3540
ctcacactgt ctcagagttt ggactctcac ctgtgacttt gagaagagct ggggatgggc 3600
actccattgt gctgtgagct ctgggtaacg ataattgtag aatctgccta ggcagtctaa 3660
ggtcaatact actcgtcatc aggaaagaca gctggaattc ctgggaagat ctgcatctgc 3720
cgtccaccat ggagtcccat cgtcttctgt tatgctctct ttgaatcagt cccacctaga 3780
ttatctagaa cactcttcct gacttaggaa aaataatggc aggctccact aacacctgtg 3840
ttatgccatg ggagcaacac ctaagctagt gtgtgaatga gtagatgaga ctgtggtcta 3900
gtcaaggtga caggtaaaat tgattgttgc cattatgata ttttatttta tatttggcaa 3960
tatagtcatg ctcctattat aaatattctt tgagacggag tcttgctctg ttgccagcct 4020
ggagagtgca gcagcacgat ctcggctcac tgcaacctct tccacctccc aggttcaagc 4080
agttctcctg cctcagcctc ccgagaaact gggattacag gtgcacgcca ccatgcctgg 4140
ctaatttttg tatttttagt atcgacaggg tttcaccatg tgggccagtg tggtctcgat 4200
ttcctgacct cgtgatctgc acacctcggc ctcccaaagt gctgggatta caggcatgag 4260
ccaccacacc tggcctagtt ttattaatgt ctgcccatac cagcaactac atacctatgg 4320
ggacattaat ttacatctgc agacatatgt gtaaatacac aagcctatac atacatgtgt 4380
agccatttat atttaacatt ataataaaat aattctaaaa tattttctaa agaattaaac 4440
ttaatgatga gctaaatata aattagagta atctataatt gatttcgatc attctctata 4500
gtttgcataa atggatgtct atttctaagc cttaacatag tgtattcgtc attttaaaat 4560
agtcaagaaa aaattacaaa tgttcttgtc acaaaaaaag ataaggattt gatgatttga 4620
ggtaatatat atgtcaatta actcgattca attattccgt attgcattca caaaccataa 4680
catacctttg tgccccataa atatatacaa ccaaaatttc tcaattttca atgaaatttt 4740
aattatatat ttttaaatct gatgcctctc cttggattaa gccacctcct caggcttaca 4800
gggctcttcc attttctcaa catgttgtta taccagatga gcacaaacac attaatttca 4860
ttatgcttag ctttaatttt tcaaaacaac ataaaggtga taattttacc aatagacgta 4920
ttacaaccta ctgtgcatga gaccctttct gtgcttcaag gtttcttctc aggactttat 4980
atgtattgca aattttcatt gactcaacaa caataacatt taggtttaaa ttctggattc 5040
cccaacagaa ccagtgcttc ctgctggagc tggatccatc agcccccagg gaagggactg 5100
gagtgggtca ggtgcacagg tcatgaaggg agcacaaatt ctaacccact cctcaagagt 5160
ccagtcacca cctccagatc tatgtccaaa aacagctctt cgtatggctg agtgacatta 5220
gcaacaagca cacaaccatg tttgtttttt tgttttgttt tggtgtgtgt gtgtgtgttt 5280
ttgatagagt cttgtgtcac ccaggctgga gtgcagtggg gcaatcaatc ttggctcact 5340
acgagctcca cctcctgggt tcaagtgatt ctccagcctc agcctccaag tagcttggac 5400
tgcaggcact caccaccaca cctggctaat ttttgtattt ttagtagaga caaggtttca 5460
ctatgttggc caggctggtc ttgaactcct gaccttctga tccacccgac tcggcctcca 5520
aaagtgctgg gattacaagt ataagccact gtacccagcc acaaacaagt atttttaagc 5580
aaaagacaca gtgaagagac ttcagtatga gcccacacac aaaccttcct gtgggagttt 5640
acagaacagc agtgggtgct gaggacagaa gccagcaccc aggaaccagc agggaaaccc 5700
agggggcatt tggcaccgcc tgaaggctca ggaccgttgt ggggctcagt ggtcaggcag 5760
gctcaaggtt cagcctcaga gcaggtgtag caggcgggga aaggctctga agacggagtt 5820
tagtgtcacc ttctcatttc caccactaaa caccctccag cacatctagc aggcggggaa 5880
aggctctgaa gacggagttt agtgtcacct tctcatttcc accactaaac accctccagc 5940
acatctaatt ctaatctaat gtatgggtgt tcatgtgttt agagaatatt atttatgtta 6000
tgaatctata gccatctgtg ggtgcatcaa gttaacctct tcaacctatg tggaccctgt 6060
tcattaggaa taagtccttg tatttgagga cctcacaaat taataattat gtagaatcac 6120
tttctttttc agtctccttt ccttcctctt tctttctttc tcactcacac actgacaaac 6180
acacggggtg ccataacatt aattacctga tgtattaaag aatattgatt catttgcaac 6240
ttgaccagtt tagctgttgt tcatgttgtt gtaagatcag gaacatgttt ctcagctgtg 6300
tactcctcta agctgagcag cagctttatt tgaaatacac agaaactgaa aacaatccaa 6360
atattaatca gcaccttcat aagtaaacaa attgtggaaa agtcatttat tgggatagta 6420
cccactacta caatcagtaa atgttggata cgatcaacag catggttcaa ttcacaagta 6480
catatgatga gtatagtgag ccaaagcaca tacatatgat tccgttttat aaactgtaca 6540
aagtgaatac tcattggaaa ttacataaca aagatcacta actgacttct ccatagtaag 6600
agaagcgaag gtataggagg agaaattgcg agagacaaag agaaaattga gaggcgaatt 6660
catttgtttt ctctgtgtat ggtcattagg tcaatgtttg tcaaatggtg aacatattgt 6720
gtgaagatta tatgtcagta ttacctcatt aaagttatta taaataaaag tctaatgtgg 6780
tagaaaaaga tgaagagaga aataaaaata atacaagaaa agtcatgaac tcctgaatga 6840
attaaccctt agtttttctc tattacttac aaaaacacca agatacagcc aaataatatc 6900
acgatatcat tataagaaga gtgttttgta aacctcactg ggaatttcta gctctttcct 6960
agagttaatt ttgggaacag ttggatccaa ttgtgagaaa cgcaggccgg acactgagac 7020
tggctcttat gagatgtgag ctcttgtcta tgtcacatgg tccctccata cttgggggtt 7080
tacattcaca tctgtaaatg aaggaaacat tgactctcaa agaacatatt tcatgtgcat 7140
gtaaaagtat gaatgctaat gagaattaat tacttatgaa gtataatcac ccacatccac 7200
tcttggacac agcccactct gaggcatccg ttacagaact cattatatag taggagacat 7260
gcaaataggg tcctccctct gctgatgaaa accagcccag ccctgaccct gcagctctgg 7320
gaaaggagcc ccagccctga gattcccagg tgtttccatt cggtgatcag cactgaacac 7380
agaactcacc atggagtttg gactgagctg ggttttcctt gttgctattt taaaaggtga 7440
ttcatggata aatagagatg ttgagtgtga gtgaacatga gtgagagaaa cagtggatat 7500
gtgtggcagt gtctgaccag ggtgtctctg tgtttgcagg tgtccagtgt gaagtgcagc 7560
tggtggagtc tgggggagtc gtggtacagc ctggggggtc cctgagactc tcctgtgcag 7620
cctctggatt cacctttgat gattatgcca tgcactgggt ccgtcaagct ccggggaagg 7680
gtctggagtg ggtctctctt attagttggg atggtggtag cacctactat gcagactctg 7740
tgaagggtcg attcaccatc tccagagaca acagcaaaaa ctccctgtat ctgcaaatga 7800
acagtctgag agctgaggac accgccttgt attactgtgc aaaagataca cagtgaggag 7860
aagtcagtga gagcccagac aaaaacctcc ctgcaggaag acaggagggg cctgggctgc 7920
agaggccgct caagacacac tgagcatagg gttaactctg ggacaagttg ctcaggaagg 7980
ttaagagctg gtttcctttc agagtcttca caatttctcc atctaacagt ttcctcagga 8040
accctgtcta gatctgtgat ctggatctgc tgaaactgcc tgtgtcacct tcctcacctg 8100
tgattttggg ggagctgatt gtggacactc cagtgtgtgg gatttcttga tgacagcaat 8160
tgtgtcttct gtctaggcat gtctagggct ggccatcagg aagggcaggc tggaattttt 8220
ggaaagaggc gcacctgcca tccaccagga aattttgttg tcttttgttc tgctagaatt 8280
aaatcagaca caccaaggtt aactagcact atcttcctag ctcgagaaac ttgatagcaa 8340
gcttgaataa cacttgtatg aaaccatcag agcaacacct agaatagtgt ctgatttaat 8400
aattgagact atggtctagc caaggagaca cataaaacgt gatttccaag gttggttcat 8460
attttatatt tatcaataga atctggctgg tattataaac agctttgcca aaatatgtgt 8520
gttagtgttg gtcttcggag aagcaaactc tgtgactgga ttagcttcgt gtacagtttc 8580
cttcgtgtac agtttccctg tccactcttc tgagaggaaa tgcagaggtg gtggaagggg 8640
tctggggtag ctgaatgcat gtgagggaag tcggtccctg agtgaaggag aaagggagga 8700
ggactgggtg gaactttcct aaacttctgt gttgttctat gaaggtccag caaggtcact 8760
gaatcagagt cgtgtcacag tttcccatcg gggacccagt gactcccagc agtggctctg 8820
ctcagatcag cgcagagctt gttcatctcc taagagtgga gcacaggacg tggctccagc 8880
accagccatg gcatggaaga tagagagcag ccctgggtgc ctgggtcagg tgcattgtgc 8940
tccctgcagg tggagggagg gaagtgctga ctcagggccc agagactgtg ggttctatac 9000
agaacataca cttttacttc atttctgtgg atgacataga aacaaacatg cagtctgtaa 9060
acaatggtga ttcctacatt tgccccagtt gctttattcc ttgattctgc agaatgtcct 9120
ggcacagaat tgcctttctc atgtgaaatt gtgatgtgtg ttcaggatgt gtgatcccta 9180
ctttcattgt tttcctccat gtacagagct cctaaataga gtacagctga ccctccttct 9240
acactgttcc tctcctcccc acaggaagag cagataatta cctgaggctg aatctgaggt 9300
gggatctgtc ctctgaacct cagagcctgc agagaccccc agctgcagat tcatggagtc 9360
aggtgtttgt acatgtggga accttgagct gttcttttgt cagtgaacac tccttagaac 9420
taattgtggg ttcagaatta ggacacccat tgatctatca cacctcagct cattctgcca 9480
ctcagaatct ccagaaattc aggaaatggt tgaatgtaca tttttgtgac aaatttttct 9540
catttcactt agttgtgagt tttgttagca gaaagtgcta ccatttatgg gtcccaaact 9600
gatgaagctc atttgcttca acagaagtta ggaggctcct aaaacttctc atcatcccct 9660
ctctctgcac ttcatgtgaa attcagtttt acctggaatt ctggtgtgtt ggtttagcca 9720
gatttcccca tcctccatgt ctgattttcc tgggtatctg atcagtttcc ttgcccttca 9780
ccatacctcg gtgacgtcta atcaccctgg cccaacatca agaattctgg caggttggtt 9840
taagcaggat ttgctgtgcc cctgatgttt cctctcagta attttccatc ttccggaccc 9900
caccctgctg cttggcttta aatccccaca tttcctttct gcatttggag ttcagctgaa 9960
tctctctcct cactgcaaaa caccattgcc ctgatcccaa cacctaccat gaggaccctg 10020
gataaagtct tccttactgt gctggaacaa gtgtctttac ttgatatttt ttataatgct 10080
gcctgaaagt ctctcgaaca taacttttta ttatgtcgag atgtgttctt tctataccca 10140
ttttttacat tttttttcct gaaaggatgt tgagtttcat caaatgattt ttcagcatca 10200
attgaaaaaa actatatgtc gattaaagat ctaaatgtaa aatctaaaac tataaaaact 10260
ctggataata acctaggaaa tacaatttag gacattagaa ctggcaaaga cattatgaag 10320
aaaatgcaat tttaacaaaa tcaaaaaatg gcaaatgagc cctaagtaaa ctaaagagct 10380
tctgcatagc aaaagatact atcaacagag caaacaggca acctaaagaa tggaaaataa 10440
aatatttgca atctatacat ctgacaaggt ctaatattta tagcatacat gcaattaaac 10500
aaacatacaa gaaaaaaagt gcccaaagga catgaaaaga cacttataaa aagacctaca 10560
tgtggccaac aagcatagga aaaaatgctg actatcacta tcattagaga aatacatatt 10620
aaaaccacaa tgaggtacca tctcacatca gtcagaatgg ctaatcttaa aaaaataata 10680
acagatgctg ctaaggttgc aggaaaaagg gaacatttat acacttctag tgcaagtata 10740
aattagttca accattgtgg aaagcagtgc catgatccct ccaataacct aaaacagaag 10800
tttcatttga cccaacaatc ctacaactgg acatatacct aaaggagtat aaacgtatag 10860
gttcattcca gcactattca caatagcata gacatggaat ttacctaaat gcctgtcact 10920
ggcataatgg atagagaaaa atgtgataca tataaccatg gaatactatg cagctaagaa 10980
aagaatgaaa tcatgttctt tgtaggaata tgatggaact ggtagtcatt actcttagaa 11040
aactaattca ggaacagaaa accagatact atatgttctc acttattttt tggagataaa 11100
taatgagaaa tattctctca gggcctgagt cttccatatt caaaaaaaaa ttctaaaata 11160
agtgttcaac aagttgccga catgccttta aatatcctgt ttcatctgga ccccttatat 11220
cactcaataa agccatgaga actatattta aaagtgggat tctatgataa taaataactt 11280
aacagtaaga ataggaagga tatggatggt tttactaatt taatgggtac ataactgtgg 11340
tatacactat attcctatga ataagatatt ctgatttcag agctaagtat tgtttcctac 11400
gttgtgtgtg tgacttggtt gtgttggatt tgagagtgta gcacgtgcac agaccatgtg 11460
gatcagaaat ccacatgtaa agatgacaat ctatggatgt gaactacaag tatgtaagta 11520
cttcaagtaa ttctatttaa tggagtttga aataaaactc aaacttattc aaaacactaa 11580
ttacttggta tttattttga gatctatgag tttatcaaga aattcaaatt cctatttcta 11640
tttaaactcc cgattcctac tctcagtggg agggaaaact cacagccaat cacacatcac 11700
aggacaaatc tgtaaacgaa gagtcattcc tctgaaggtc ctgggtgttc aggactctca 11760
ggcaggtgct gaggaccctg tcttgggagt gcccagcaga tctcagaacc ctacatgggg 11820
cctgctggac actcatgtgg gataactagt cgccacttat tcagagttac cagtgagctt 11880
tgactgttcc gaatgggacc agcatggagt caaggtgcct gctcaatgtc agagacagcg 11940
atggtctcag aaacaatcca ggtaatctct aggccaataa aatgtggatt cacagtgaga 12000
agtacatcct ggaggtggag cttgttcttc agtgggaaga gtgctgtgca cagaaagctt 12060
agaaatgggg aagggggtgc gtttcctcag gcaggattag ggcttcgtcc ctcagcgtcc 12120
cactcttgta tggctgatgt ggcatctgtg ttttctttct catactagat aaggctttga 12180
gctgtgaaat accctgcctc atgaatatgc aaataacctg agctcttctg aggtaaatat 12240
aggtatattg gtgccctgag agcatcactc aacaaccaca tctgtcctct agagaaaacc 12300
ctgtgagcac agctcctcac catggactgg acctggagga tcctcttctt ggtggcagca 12360
gctacaagta aggggcttcc tagtctcaaa gctgaggaac ggatcctggt tcagtcaaag 12420
aggattttat tctctcctgt gttctctcca caggtgccca ctcccaggtg cagctggtgc 12480
agtctggggc tgaggtgaag aagcctgggg cctcagtgaa ggtctcctgc aaggcttctg 12540
gatacacctt caccagttat gatatcaact gggtgcgaca ggccactgga caagggcttg 12600
agtggatggg atggatgaac cctaacagtg gtaacacagg ctatgcacag aagttccagg 12660
gcagagtcac catgaccagg aacacctcca taagcacagc ctacatggag ctgagcagcc 12720
tgagatctga ggacacggcc gtgtattact gtgcgagagg cacagtgtga aaaaccacat 12780
cctcagagag tcagaaaccc taggggagaa ggcagctgtg ctgggctgag gagatgacag 12840
gggttatcag gtttaaggct tttttgaaaa tgggttatat atttgagaaa aaaataacaa 12900
tagaaacaag tacacactct aattttaaga gatatattca attcaagaat tgtagaagcc 12960
gaattcacag tgggaaaggc cacactcaat aaagttgata aaaacattcc aggaaggtgc 13020
tactgtgaac aagttttcaa attagatgaa taaataattt ggagcaatgt tatttgatca 13080
tgaggtgaga cagtatgatt cttaaaaagt ggaaaaaatt ctttcaaatg tttctgccac 13140
tctttatcat aaaattaatt ttagagcagt tttaggatta caatgaaatt gcacagaagg 13200
catgagaatt cccatgactc cctgccctac acaggcagag cctcctccac tatgaccatc 13260
cagcaccaca gtcacaaatc agttacaatg gaggaatctc caaggacatt tgattctttc 13320
ttttttggtg atgccctaat ataacaagcc taatttatct cggaatacca caagttttgt 13380
tcagtggatt tctaggaatc atattagtta gagggaaagt tggtgaggct cttactattt 13440
gaactcattc ttccaaaatc cacaaaacat atattaattt aaagcttatc ttacttctgg 13500
tttacaaaga tccttcccag agagtaagat ttttaagatt ttagaggcag gttcatgtct 13560
ctaaaaggcc agcaagttct ggggaatccc ataatgaaca tcctttcatg ggaattggag 13620
accctggcaa tgagagactc catgtataat gccctagagt tggattagat gcactgtgag 13680
ctcttgggtg gtggttctga aaaaggtggt ctgtgcagta aatgcaaaca catgcacggg 13740
acacaggaag gagcaaaagc ttccctttac aaagtgtgtg ggcacctgga agaaaccctc 13800
acaagggcac gtctcaccct taatgactgc acattagcca ggggcatgat catgattggt 13860
cttacaagga aagagcacca ctgaggtcat aggttatgaa aatgtcatcc tccagctgag 13920
aaattccatc tgctggcatg tgcgtgtcaa ccccgtggag ggtgcactct gggagttgac 13980
gagatgcata gaaacctcct gtcatgaatt atcgtctacc acacagtcag aatcaacctg 14040
tgctccagaa agagatacat gcctgtagaa ataaacagag cttaaggatt ttcttacctg 14100
ctgaatatag tgcccaaaac cacactttca ggaagattag ctcagaaatg tttacaaagt 14160
gactgaaggg cagtggcggg tgaggtgatg ggacagcctc agggctgcac ttgaggaggg 14220
ctccctctcc catgcaggct tttcctccag gagctgcatc aggaactcac agaggataag 14280
ggagaattct gagaacatgc tactgtggta ctgcctagag aggaaggata aatgatgaca 14340
aatacatctc tgagtaacat atgggcattt gttcatgaaa actgtttttt ctgaaagctt 14400
atgaaggtct tgaaatgccc ctgattagct gaggacaaac atttaaaccc tccttccaca 14460
gggagttcaa gcaggctgga tgtgtcctgc tatggatgat cttccccagc cccttcctct 14520
tcccagctca tccctggctc tctgtgtaaa aagttttcat cagtggaagg tggttgatga 14580
agtgaggtct tcaatttcct catcttctct gtggtcatgt tattttcctc atctgaggtt 14640
taaaaactca cctgcatgca gcacatgaca ggctaaaatc tcttgtggac aaaacagtaa 14700
caaaggcacc caccatggtt gagcacccgt gttgctgaca gtgaccacca agggtcaacg 14760
tcctcttcac aatcctgtgt cagagcagca cttgagtgat ttcactagca acttcccagg 14820
agaatcggct taaaactact tgtcgcattt tccatacaga tataacacat ctattttcct 14880
gaagaaatag aaagagctga atgctgaata cactgtatgt ctgctggttc tgcaagtttg 14940
tgactatcac tttctaattt ctgatctgtg tggaccaccc tcactggtgt gaccagccgg 15000
gtgtgcccaa agcttactca atgtttatac tgcattgctt ttgatttctt tatatatttg 15060
tcccctccaa atctcatgtt g 15081
<210> 11
<211> 14191
<212> DNA
<213> artificial sequence
<400> 11
tcactggtgt gaccagccgg gtgtgcccaa agcttactca atgtttatac tgcattgctt 60
ttgatttctt tatatatttg tcccctccaa atctcatgtt gtaatatgat tcccagtgtt 120
ggaagtgggg ccactaggag gtgattagat caagggggca gatccctcat gaatgattga 180
gcatcttccc cttggtgatg agtgaattct cccccagtca gctcacacac agtctagttg 240
tttaagtctg ggccccccct cagcctcttg ctcccattct tgccatgtgg catacctgct 300
ctcccttcac cttctgctat gattgtaagt ttcctgaggt cctcgccaga agcagatgct 360
gatgccatgc ttcctataaa gcctgcagaa ctgtgagcta attcaatctc tttcctttat 420
aagttaccca gcctcatgta tttctttaca gtaatgcaaa gggacaaaca caggtggctc 480
ctgacacacc cctcatcttc tcctttgtca tcatctggct acttccatgt gtgtctgtgt 540
cctctcctct catgaggaca ccagtctttg gatttgggac ccatcgaaaa tttagtatga 600
tatcatctca agtccttaaa ctaatttgta aagaccctat ttctaaatat ggtcacattc 660
tgaggttcca ggtgggcata catttggcca gggagatgcc atcaacccag cacaccagcc 720
tacgcaggta catttgcata tagcttaggg ttgacctttc ccatggaagt attacatccc 780
cctctggact ccagtttcac atgttttaga atgcttcgcc ttgtccagca ggatgagtct 840
cttttcattt ttagtatttt tctctctctt ctttgcatta gattaaaaat aattgacata 900
gtctattgat gcatctttaa gttttcagtt ccttctcatg tctgctccat tctgttatga 960
aacttgagta atgtttttct atttcttttc ttttttaata aaatagagat gggatttctc 1020
catgttgccc aggctggtct caaactcctg ggatcaagtg atccttcctt cttggcctcc 1080
caaagtgcta ggataacagg catgggccac tgtgcctggc caaatgatgt ttttcatttc 1140
agaatcatac ttttagttct agaattttca tttggttctt gttaatattt tccctttctt 1200
acagagattc cccatctagt cactcattat aaccatattg tcctttaagt ctttgaacat 1260
atttaacatg acttctttaa agtccttgtg tgaaaactgg gtctgcatct aggtcatctt 1320
ggagttgatc tccattgatc cctttttctt ctgactatgg atcacatttt catgtttctt 1380
tgcatatatg gtaattttgg atgtgcacct gatgttgttg ataacatgtg gtacaggcta 1440
tgggttttgc gttcttcctt ggcagggcat tgattgtttt tttcaatgtt aggaaacagc 1500
ttgagttgac tcaaactccc aagtctgtct gcctggcagt tggcagtagc tggaatctca 1560
gttctctcga ccttacaggt gctgctttct gctgggccct ttggagtttt ccctacccat 1620
gcacacctaa gggatcggcc agaggtttca gtggagttta cttgcagatt gtggggtttc 1680
cctttggtga ccttctcctt tatggacatc ttctcttcat ttccagctgc tctgaaaatg 1740
cagccctgca tcccactcct caccaggagg gctgcagttt cctgcttgaa ctctagctgc 1800
acccattaca tgccctgggg tgtgacttca gaccaatatt tccgggaata atccttacta 1860
gtgtttgcct acttctggtc atgttccagt gcctgcaatt ggtgtgtgtg ggtgttgtgt 1920
gtgcttacag cttttccagt gtttataatt gccatctgcc aaagggctag tctgatatta 1980
gctgctccag cattactgga atcagaacta ctttctctca tgtggttttt cattttcatt 2040
tccctgttga ctagtgtggt tcaacacctt ttcatatgtt tagcggctat ttggatatct 2100
tctgtaaaac atctgttcaa ttcttttgcc tattccttgt tgaattattt gatttttttt 2160
ctcgttggtt tacaggggtc ttctttatat tatggatctg tttgtgtcag tcagttatat 2220
atgtttatag gaaacattga gaaaaacaaa agattagtag gtgccttcca tggaaagcaa 2280
ggcatcgtct tgtaccctct tccaggttat ttcattctat tgagctctgc atcttttatt 2340
tcctttgacc tattttacac ctctataagc caggggttcc cttcatcaca ccatatcaca 2400
ctatgtcaca ccagggatgt cacaccacat cacactatgt aactcctcat cacactgggg 2460
atgtcgtacc aggtcacacc ccatcatgcc acatcacact gtgtcacacc gcatcacatc 2520
acaccaagta tgtgacaccc tgacacagca catcagacat cacatcatgt cacaccacac 2580
ttcatcacac cccaccccac cagagttgtc acaccccatc acaccacatc acattacgtc 2640
acgccatgtc aaactacatc ataccccatt accccaggga tgttatgcca catcacatta 2700
tgtcacacca caatacacca catcaaattg tgtcacacca catcacaccc catcacacca 2760
gggatgttat gccacatcac cttatgtcat gccacatcac accacatcac accccatcac 2820
accgtggatg tcacccgtca tcacaccact tcacataatg ccacacaaca tcacacccca 2880
tcacaccagg ggcgtcacgc cccatcagaa cacgtcacat tatgccaaac cacatcacat 2940
catgtcacac cacaccacac cacactatgt cacagcctgt tacacttcat cacaccacat 3000
cacactaggg atgtcacacc ttgtcccatt acatcacatc atgtcacacc attccacatc 3060
aaaccatatt atgccccatc ataccaggga tgtcatacca tataatacta tgtcacactg 3120
catcacacca gggatgtaac accacatcac accatgtcca accttgtagc ccaacaccac 3180
atcacagaag ggacattaca ccatgtcata ccacatcaca ccatgcaact cctcatcaca 3240
tgagggaggt cgaaccctgt cacaccccat catgccacat cacaccatgt cacactgcat 3300
tacatcacac caacgatgtg acactctgtc acaccacatc acacatgtca caccagagat 3360
gtcacactcc atcacatcac atcacatcac acccgttaaa tgatatcata ccccatcaca 3420
ccagggatgt cacaccgcat cacatcaggt catatcacat cacactacgt cacaccacat 3480
cacaccaggg gtgtcacact ctgtcacagg atgtcccacc atgtcacacc ataccacatc 3540
acaccatatc atatcctatc atactaggga tgtcatgccc catcccacta tgtcacacca 3600
catcacacca tgtccaacca tgtcacatca catcacacca catcacacca tgtcacacca 3660
gggatgtcac actctgtcat accacatcac accatgtcct gtcacatcac acattacacc 3720
ctggcacacc atggatatca caccacatca caccatgtca tgttacatca cagcatggac 3780
tgctaggggg tgcgtggagg cagccttgct ggagagttga gggagggtcc tggggctggg 3840
cgtggtgttc ctgcaggagg actggccctc tggaggatgc tcggttccag gtggaaaggg 3900
ggaggtgggg cctgggtgct tcagggaggg gccatccttg ggcttgggat agagccggtg 3960
atgcagctgc tgcccgccct gcaccccagg tgctctctcc ctcacccccc gcggggctgc 4020
agcagcgtgt cctgagagtt aaagggctgg gcttcagcac ccagttcagg ccaggcaccc 4080
tgaagcccac cctccagcgg cgagccctcc cacggcacag cagggtccgg gctctgggga 4140
tttcaccccc aactctggtg cagctccagc tgctcgatgc cacacaaacg aatccaacca 4200
ctcctccttc ctgggtgaga tggtctctct cctgccacag gcaactccga cggcattttc 4260
cagccaccgc agccaccgca gccactgcag taacaagacc ctgtccttga ctgagttcca 4320
gccaggctcc tcggagcctc tccactcggc ctcaaccttg gcttgtaaag acttgagcag 4380
acactaacag tttctaacag cttctggccg tacccctagg ccgacccctg ccccgtcaac 4440
acctgcctga gaaagctccg tgcaccagaa ctcaccgttt ggaccaaccc cgacctccct 4500
ttctcagggt atctgctgag agggccgcaa ccacacgacc ttctatccgt tcccgatgtc 4560
tgtgcatttc ctgtgaccca ggagggtctt tctcaagact tgagagccgc tccctgaagt 4620
gtccccattg ggaaggatgg ggcctgtgtc tccaggctct gggaggacag aatcctgacc 4680
tcaacagtgg ccggcacgga cacagcgggt cccatcccgg ggacgctgac cagcgctggg 4740
caacttttcc cttccccgaa gactgagccc cgagcaccct ccctgctccc cctaccacct 4800
ccctttacaa ggctgtggcc tctgcacaga tgaaggtgag tccaggtcat gccggactct 4860
ttcttctgtt gcaatagtta tttctgttga aaatctgtcc ttgctacatg acctagtgcc 4920
cagggggatg ctgagacagg atgaatgtat tctgcatgtg agaagaacat gaattttggg 4980
gcccagagtc tggactgtga tgggttaaat tgtggcccct acaaattcat atattcaagt 5040
cttaatccct ggcctcacaa tgtgactatt tggagatggg gtctttacag aggtcattaa 5100
gttcataggg ggtcactaat ctaatccgat gtgtgttcta agaagagaag cttaggacac 5160
gggcacacag agggatggcc acgtgaggac cagggaggag acggtgtcta caagccaagg 5220
agagagggct tgagagaaac cagccctgcc tgcatcctga tctcagattc ctggtctcta 5280
ggcctgggag gatccatgtc tgccgtggga gccgccccgc tgtggtcctg agctgactca 5340
cacagatctg acacccacct ctcgcttcgg accatggttg gttctggaag gccctccctg 5400
tggctctgcc tggccagcct gagccagctc ccagcctcga cccagctttc cctggaggcc 5460
ctgtcccccg cagagtgacc agggcaggca gcaccgtgcc cagcaggagg agaaactgca 5520
tccatgtaga aaagaggaga agccccgggg gtccatgtag cgacaggggc cagggagggt 5580
cgctcgggca atgcgtgtgg ctgcaggagg cggggggcgt gtgcagggag cccccgaggt 5640
gcagctggac cagcctcctc ctgaccgtgt ttcccaccgg gggcaggagg cacgtggaca 5700
caggaaggcg gctcccatca cgaagtacaa gacttaaaaa ggatatttta ttgtcatcac 5760
aaaagaaaca tcaaagacaa ttaatgaact ttagaaaatt taaaagaaga aaagctacca 5820
aagctgaaat ggtggcacct ccttcgagtg agcccgggag tcctccctga cggctgaggc 5880
aggcgctggc cgcactcccg ctcgagtctc ccttcctgtc tgcggattct gcgtgacagt 5940
cacggaacgg cgtgatgggg gcagcagagc gtgggggcct ctgtccagca ctcgtggcca 6000
gcagccctgc tttcgcaaga acacgggcac cctctttggc gtcttgcctc tccacctggt 6060
gcccccagag tggctgcttg ttcctgctgc acgtgacccg ggactggacg ccagcctctg 6120
tgatgagttc tggctgtgtc cacgctcctg gctctcccgg tgtccctcca cctctctccc 6180
cgatgctcct gggcctcctc tgtcctcagg ccccaccaag gctgagtctt gcccgcctgg 6240
gacctggtca ccagccttct ctgggaggcc tgtctgggca gatgcccagc ccttccttgg 6300
gctatcctca cccttgcact gtggggctcc tgcagcggcc acatggccca ggctcttctc 6360
tgagtgatct cggtggactg gagtgggtgg gaggtggcag tgtcctgggc ctggcccctt 6420
ctctccccag tgcggactct ggggctggct gtccctgcgg gtcgagttcc acccgagaat 6480
ccagcagtgt gggcaggcag ccaaggggtg gtgctggcac cgagactgtt ctcaggagcc 6540
agagagcagc gttctttgct tgaaatcaga acaacctcat tcctcatgtc aggagttcac 6600
gggagtgccc ggaatggagg ctggctggct gcgggctggg aggaaggccg tctgagtgag 6660
ccttcgcagc tcttggaagc ctccccaaca gggcctgatg gtgctgtggc ttccctacct 6720
tggcggctga ttttcccact caccaactgg aaaccacgcc tgtgttcagg aggctggtgt 6780
ggacggggtt ggctccaggg cgaggtcctg cctgggtggg ggcctgggat gccggtcact 6840
gactcctttt gtgtgagcac ctggtggtct ggagggcagg gacgtcctgc tgaggggaca 6900
cctggccccc agcgccctgc atgcatcaag cagcggaggt ctggggtaga cctgctatgc 6960
acagggtctg gaaggggggc gtgtcagggg tcagaaggtg acttcgaggc cagagagcca 7020
tggggttcag ggcggtgagg tcgggggcag gtgtggcctg ggtggtggct gagcatggcc 7080
cacggctcgt gtgtggggtc tgggcggccc tggacacccc gcagagggtg gccctaggcc 7140
ccctgcccga tcatgttcct gtagtcgggg acgatggtct gcttcaggtc caccaccgag 7200
gagaagatcc acttcacctg taggcaaggc acagcacagg ggtgagcgag gccacagccc 7260
tgtccccgag ccccacccac ccctcagggc actgagggcc acatctcttc ccccaaggtc 7320
cacccacccc tcatgccacc caggccacag ccctgcccct gaggcccatc cgcccttcag 7380
tccacccagg tgccagggcc tcaccactgc ctgctctgag gcctggacat ggagagcaga 7440
gccagggcac caacagcatg tgggcagtac agaagacagc gtcagggaca ggtggagaca 7500
gtgtggggga tagtgttggg gacaggtggg gacagagtgg gggacagtgt cagggacagg 7560
aggagacaga gtggtggaca gtgttgggga caggaggaaa cagtgtgggg gacatttttg 7620
gggacaggag gggacagtgt gggggacagt gttggggaca ggtggggaaa gcatggtgta 7680
cagtgttggg gactggtggg gacagcgtgg gggaaagtgt tggggacagg aggggacagc 7740
gtgggggaca gtgttaggga gaggtgggaa cagtgtaggg gacagtgtca gggggaggtg 7800
gggacagcgt gggggacagt gtcagggaaa ggtggggaca gtgtggggga cagtgtcagg 7860
gagaggggac agtgtggggg acagcgtcag ggagaggtgg ggacagcatg ggggacagtg 7920
tcagggagag gtgcggacag catggacagt gtcgggacag gtggggacag tgtgaggaca 7980
ttgttgggac aggtggggac agtgtggggg acagtgtcgg ggacaggtgg gaacagcgtg 8040
ggggacagtg tcagggacag gtggggacag cattggggag agtgtcaggg acagatgggg 8100
acagtgtggg ggacagcatc agggacaggt ggggacagca tgggggacag tgtcaaggac 8160
aggtggggac agcatggggt acagtgtcgg agatgggtgg ggacagcatg agagacagtg 8220
tcggggacag tgtcagggac aggtggggac agcatgtggg acagtgggac aggtggggac 8280
agcatggggg acagtgtcag ggacaggagg agacagcatg ggggacagtg ttgcatacag 8340
gaggggaaag catggggaca gtgtcaggga ctgtagggga cagagtgggg gacagtgtca 8400
gagacaggag gagacagcat ggggacagtg ttggggacag gaggggacag cgtgggggac 8460
agtgtcgggg acaggtgggg acagtatggg ggacagtgtc ggggacaggt ggtgacagcg 8520
tgggggacag tgtcagggac aggaggagac aggagaagac agcatggggg acagtgtgag 8580
gtacaggagg aaactgtggg ggacattgtt ggggacagga ggggacagcg tgggggacag 8640
tgtcagggat aggaggagac gagaagacag tgtaggggac agtgtcggga caggagggga 8700
caggaggaaa cagcatgggg gacattcagg gacaggaggg gactgtgggg gacagtgttg 8760
gggacaggtg gggatagcat ggggtacagt gttggggact ggtggggaca gtgtggggaa 8820
tagtgtccgg gacaggtggg gatagtgtgg gggacagcgt cagggacagg tggggatagt 8880
gtgggggaca gcgtcaggga caggtgggga tagtgtgggg gacactgtca gtgacagttt 8940
gtgacagcac aggggacagt gtcagggaca gggaacgtgt gggggacagt gtcagggaca 9000
gtttgtgaca gtgtggggga cagtgtcagg gacaggtggg tgcagcattg gggatagtgt 9060
caggcacatg tggagacagt ctgggggaca ctgttggaca ggtgggtaca gcgttgggga 9120
gtgtcaggga caggtggcga cagcgttggg gatagtgtca gggacatgtg gagacagtct 9180
gggggacact gttggacagg tgggtacaga gtgagggaca gtttgtgaga gcgtggggga 9240
cagcgtcagg gacaggtggg gacagcctgg ggacagtgtc agggacagtg tgtgacagca 9300
tgggggcaat gtcaaggaca gctggggaca acgtgcggcc gaccttgaag aaggtgacgg 9360
tggcactgta gcacacgctt agcaggaaga gtgtgatgaa gatggtgatg gtcgtccaca 9420
gcccgtccag ctccccgtcc tgcgcctccg cacagctctc ctccagttgc agctctggac 9480
aggaaggggg tggtcagtgc tgtgtccccc tgggcttggg cctctggggg tgattccctc 9540
tgtggcgggg cctaggatgt agggcccggc ctcgatggcc caacagtgtc ctgaggtcag 9600
ctcccggaag ctgtccatcc tgggcaccgg ctttggccct ggggctcagc cagacacccg 9660
gccctaaata gcgacctggc cctcagcagg acccgctccc cgtctcccgt gtccctccct 9720
gagccccaga gggcaggaga tatgaagccc acccctcatg tgaccccagg agcagggaag 9780
ggctgtattg ggaagtgggc cagggccagg gacgtgacgt ggtgtgtgat cccctgtgtg 9840
tgtgtggcgg ctgcaggggc actttgtgag aggaggactg ggtttgtctg agctggtcag 9900
caagtggaga agctgccgag aggctcgtgg gccttgaggt gccgcatggg gcttgtaggg 9960
gcctgtgtcc gaggagtgtt cacgtgtgcg aggaccttgc tctggtctgg gtgctgtgca 10020
gttcgcccgg gtgaggctcc gtgtgtgagg cgtgcacgtg tgtgtgtggt ggccgtgtgg 10080
ccggccaacc tcagtgcggg gtttgttgaa cgggtctggg ctgagtgtgt gtgtgggcat 10140
ctggaccagt ccctccatag ggcccgagag tgcatgtccc cggagtcggt tgtgtcccca 10200
tgcgggtgcg aggctgggca gggctgccag gggttagtgc cgtgggggta gatgggtgag 10260
ggagggcctg tccctacgca catggactag gcatgccccc gagtgggcat ggggggtcgg 10320
aggacagggc gctcacagaa caggacagtc tcctacagag gcaggggctg tgtgtctgtc 10380
cccaggggct cctagggctt ctcgtggctc agcccagggc aggtgctgct ggagggaggg 10440
ccacgctggc aaatccccca ccctgccgag ggcagcccct ggctgagccc caccctaggc 10500
ggcccaggca cacctgcaca gcctgggcca gtgtggggac agtgggaccc gctctgcctc 10560
cctcatgcca ctcaggcctc agactcggcc tgacccgtgg aaagaaccat cacagtctcg 10620
caggggccca gggcagtggt gggtgcttta tttccatgct gggtgcctgg gaagtatgta 10680
gacggggtac gtgccaagca tcctcgtgcg accgcgagag cccggggagc gggggcttgc 10740
cggccgtcgc actcatttac cccggggaca gggagaggct cttctgcgtg tagtggttct 10800
gcagaccctc atgcatcacg gagcatgaga agacgttccc ctgctgccac ctgctcttgt 10860
ccacggtgag cttgctatag aggaagaagg agccgttgga gtccagcatg ggaggcgtgg 10920
tcttgtagtt gttctccggc tgcccattgc tctcccactc cacggcgatg tcgctggggt 10980
agaagccttt gaccaggcag gtcagggtga cctggttctt ggtcatcttc tgggatgggg 11040
gcagggtgta cacctgtggt tctcggggct gccctgtagg gacagaggtt ggtacagcgg 11100
tcactcccag ggcagagggt gggccaagcc ggcctctgtc cacgtggcct tcgcgctccg 11160
tgggtcccac ctttggtttt ggagatggtt ttctcgatgg gggctgggag gcctttgttg 11220
gagaccttgc acttgtactc cttgccgttc agccagttct ggtgcacgac ggtgaggacg 11280
ctgaccacat ggtacgtgct gttgtactgc tcctcccacg gctttgtctt ggcattatgc 11340
acctccacgc cgtccacgta ccagttgaac ttgacctcag ggtcttcgtg gctcacgtcc 11400
accaccacgc acgtgacctc aggggtccgg gagatcatga gggtatcctt gggttttggg 11460
gggaagagga agactgacgg tccccccagg ggttcagttg ctgaggaaga gatggaggcg 11520
gacgtgtcag cacccggttg gggcctgtcc ctggacgcag gctactctag ggcacctgtc 11580
ccgccttgag ctggagggcg aggcctgggc tggcttactt gcacatggtg ggcatgtgtg 11640
agttgtgtca caacatgggg ttttgggctc tgcagagaga agattgggag ttactcagat 11700
ctgggaggag aggtgtctga gctgagggag tggagatctt ggcctttggg gtgggcttag 11760
gtcaggggca gggtcttccc ggatatggct cttggccagt ctaagtgcag cacctgcccc 11820
tttgtgcgca gggcctgggg taggggcttc cagcctgtgg ctgcctggag cctggtggaa 11880
aaagccagaa gaccctctcc ctgagcatga gtggggcggg cagaggcctc cgggtgagga 11940
gacagatggg gcctgccttg ctgccctgga ctggggctgc acagccgggg tacgtccagg 12000
caagagggct gagcctggct tccagcagac accctccctc cctgtgctgg cctctcacca 12060
actgtcttgt ccaccttggt gttgctgggc ttgtgatcta cgttgcaggt gtaggtctgg 12120
gtgcccaagc tgctggaggg cacggtcacc acgctgctga gggagtagag tcctgaggac 12180
tgtaggacag ccgggaaggt gtgcacgctt ctggtcaggg cccctgagtt ccacgacacc 12240
gtcaccggtt cggggaagta gtccttgacc aggcagccca gggccgctgt gccctcagag 12300
acgctcctgg aggagggcac cagggggaag accgatgggc ccttggtgga ggctgcaaga 12360
gaggtggtgc catgtgaccg cggtgtggga cagagctggg cccagggtgc agaggcccct 12420
cggttcttgt ctatccgcga gggtccaggc agggtccagt gtctgggctc acgggcattg 12480
agtgtgcacc tggctggtgc cacctgcctc accttagccc cctccctgcc ccaaagccaa 12540
agtcaggccc ggcctgcccc agaaagcttg caggactggt ggccctgtgg tgcccttctg 12600
caggcacccc tgcagcctag ggggcggggc tcggcagcca ggtcagtgct ttgtctcaaa 12660
aaaaacaaaa acaaaaacaa aaaacaaaac aaaacaaaaa acaaacaaat aaaaagttgt 12720
aaaaggattg tggaaaaggg atcttatgtg gtcaaaggcg gctgtgattg gatttattta 12780
tttatttttt tgagacagag tttcactctt gttgcctaag ctggagtgca atggtgtgat 12840
ctcggttcac tgcaacctct gcctcttggg ttcaagcgat tctcctgcct cagcctccag 12900
agtagctggg attacaggtg cccaccacca cgcccagcta atttttatag ttttagtaga 12960
gacaggggtt tcaccacgtt ggccaggctg gtctcgaact cctgacctca tgatccaccc 13020
gcctaggcct cccaaagtgt tgggattaca ggcatgagcc actgcacctg gccggattta 13080
tttatttatt tatttttgag acagggtccc actgtgttgc ctaggcagca gtgcagtggc 13140
actcagcact gaggctgagg gaggttgagg tgggagggct caagcaatcc tcccacctca 13200
gcctcccaag tagctgggac tacgggcacg tgccaccatg cctggccaat tttttttttg 13260
tattttttgt agagacaggg cttccccatg ttgcccaggc tgatctcaaa ctcctgggct 13320
taagtgatcc accctcctgg aatgctggga ctacaggtgt aagccacctt gcccagcctg 13380
gatggaatta tttataaggt ttaattaaaa ttagctttaa tattaacagt tcatgtgaaa 13440
ctagaatttg gtcttctctg ttaaagtgac agttttctgg aatattggtc tgctcttcat 13500
tatggcaggt ttttcttttt tttttttcac cttgaaaaat atatatttac aggaacaatt 13560
ccccattctc tgggactctt tagaaaaaaa aaaggtccat tttggggaag caaaacagtg 13620
gagacgagtg tagcaccgtc ccccaaatca ccaaccccca ggtcccaagg cctgggctgg 13680
gccagggctg acagggaagc ccaggagtct tttgaaccca ctcttcctgc ctagaataga 13740
gacaggacag gctttatgtc ccccattcct ccctcccacc tccagggaca ttgaaagtgt 13800
cctttgtacc tacctgtagg aaattggggg gctgggaggg agggaactga aaatacacat 13860
ttgttatcaa aaataaacat ctggggggga ggcggcagga agattccctc ccaaatccct 13920
ttcttcacac ccaccccacc aaatatagga agagatgact ccctctcccc tattgaaaag 13980
ccccatttaa aaatagatta tactatcaaa atggcagcac gggagagaca gggagacctg 14040
gagtactggc tggaggggcc cccccagaca ggaaccaccc ccacaaaacc cctccatggg 14100
aggaaacagg caggacccca gggagtttgg cagacaaagg aatggcttct cagggggaag 14160
aagaacaaag ggtaatcatg gtcatagctg t 14191

Claims (2)

1. A method for improving the conversion efficiency of saccharomyces cerevisiae DNA fragments is characterized by comprising the following steps:
(1) Synthesizing target DNA fragment: synthesizing the DNA fragment of 5-20 kb;
(2) Performing methylation modification on the DNA fragment in the step (1);
(3) The DNA fragment subjected to methylation modification and the linearized plasmid are transformed into saccharomyces cerevisiae by using a PEG-LiAc method;
the methylation modification is obtained by methyltransferase treatment, and the methyltransferase is formed by combining CpG methyltransferase, gpC methyltransferase, taq I methyltransferase and DpnM methyltransferase according to the enzyme activity ratio of 4:3:1:2;
the 50. Mu.L reaction system for methylation modification is as follows: adding 2-5 mug DNA fragment and 1-10U methyltransferase, adding S-adenosylmethionine with final concentration of 80-160 mu M, 5 mu L10 Xreaction buffer, ddH 2 O is added to 50 mu L, and the mixture is reacted in a water bath at 37-65 ℃ for 1-5h;
the 10 x reaction buffer composition is: 200mM Tris-Ac, pH7.9, 500mM KAC,100mM MgAc2,100mM DTT, 1000. Mu.g/ml albumin, the balance being water;
the DNA fragment is designed with a homology arm for connecting plasmids;
the plasmid is pRS415.
2. A method of improving the conversion efficiency of a saccharomyces cerevisiae DNA fragment according to claim 1 wherein said DNA fragment is a linearized DNA, or a circularised plasmid DNA.
CN202210322772.1A 2022-03-30 2022-03-30 Method for improving conversion efficiency of saccharomyces cerevisiae DNA fragments Active CN114807209B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210322772.1A CN114807209B (en) 2022-03-30 2022-03-30 Method for improving conversion efficiency of saccharomyces cerevisiae DNA fragments

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210322772.1A CN114807209B (en) 2022-03-30 2022-03-30 Method for improving conversion efficiency of saccharomyces cerevisiae DNA fragments

Publications (2)

Publication Number Publication Date
CN114807209A CN114807209A (en) 2022-07-29
CN114807209B true CN114807209B (en) 2023-11-17

Family

ID=82533079

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210322772.1A Active CN114807209B (en) 2022-03-30 2022-03-30 Method for improving conversion efficiency of saccharomyces cerevisiae DNA fragments

Country Status (1)

Country Link
CN (1) CN114807209B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107674877A (en) * 2017-08-11 2018-02-09 中国农业科学院生物技术研究所 The methylated transferase gene of glycosyl compound
CN110964738A (en) * 2019-12-27 2020-04-07 苏州泓迅生物科技股份有限公司 Multi-fragment DNA assembly kit, multi-fragment DNA assembly method and application thereof

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9267132B2 (en) * 2007-10-08 2016-02-23 Synthetic Genomics, Inc. Methods for cloning and manipulating genomes

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107674877A (en) * 2017-08-11 2018-02-09 中国农业科学院生物技术研究所 The methylated transferase gene of glycosyl compound
CN110964738A (en) * 2019-12-27 2020-04-07 苏州泓迅生物科技股份有限公司 Multi-fragment DNA assembly kit, multi-fragment DNA assembly method and application thereof

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
稻瘟病菌精氨酸甲基转移酶基因MoHMT1的功能分析;张文泽 等;生物技术通报;第35卷(第12期);第38-44页 *
组蛋白甲基化和去甲基化研究进展;刘飞 等;中国农学通报;第23卷(第2期);第56-59页 *

Also Published As

Publication number Publication date
CN114807209A (en) 2022-07-29

Similar Documents

Publication Publication Date Title
AU2017267184B2 (en) Method for assessing a prognosis and predicting the response of patients with malignant diseases to immunotherapy
KR101840618B1 (en) Treatment of tumor suppressor gene related diseases by inhibition of natural antisense transcript to the gene
KR101708544B1 (en) Methods and nucleic acids for analyses of cellular proliferative disorders
CN107941681B (en) Method for identifying quantitative cellular composition in biological sample
KR102046668B1 (en) Methods and nucleic acids for determining the prognosis of a cancer subject
KR101999410B1 (en) Chromosomal landing pads and related uses
AU2016376191A1 (en) Materials and methods for treatment of amyotrophic lateral sclerosis and/or frontal temporal lobular degeneration
AU2016364667A1 (en) Materials and methods for treatment of Alpha-1 antitrypsin deficiency
CN107250373A (en) The gene editing realized is delivered by microfluid
CN107223159A (en) The detection of DNA from particular cell types and correlation technique
CA2941594A1 (en) Genetic polymorphisms of the protein receptor c (procr) associated with myocardial infarction, methods of detection and uses thereof
KR20180093902A (en) Detection of fetal chromosomal anomalies using differentially methylated diene regions between fetuses and pregnant women
CN113853437A (en) Use of adeno-associated viral vectors for correcting gene defects/expressing proteins in hair cells and supporting cells in the inner ear
KR20170086027A (en) Compositions and methods comprising bacteria for improving behavior in neurodevelopmental disorders
KR20210138587A (en) Combination Gene Targets for Improved Immunotherapy
CN107267482A (en) It is used as the phosphodiesterase 4 D7 of prostate cancer marker
GB2424886A (en) Polynucleotide primers against epidermal growth factor receptor and method of detecting gene mutations
AU2023202878A1 (en) Methods for targeted insertion of DNA in genes
KR20230034198A (en) Methods for activating and expanding tumor-infiltrating lymphocytes
KR20040065524A (en) Method for assessing and treating leukemia
CN108026587A (en) Novel biomarker and method for treating cancer
KR20130123357A (en) Methods and kits for diagnosing conditions related to hypoxia
KR20220160053A (en) Immunotherapy targets in multiple myeloma and methods for their identification
KR20200054430A (en) Method for producing probes for detecting mutations derived from cell in lung cancer tissue and method for detecting using thereof
AU2018360287B2 (en) Method for determining the response of a malignant disease to an immunotherapy

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
CB02 Change of applicant information

Address after: 100176 Room 401, building 5, yard 156, Jinghai 4th Road, Daxing Economic and Technological Development Zone, Beijing

Applicant after: Beijing Qingke Biotechnology Co.,Ltd.

Applicant after: Biological disaster prevention and control center of State Forestry and grassland administration

Address before: 100176 Room 401, building 5, yard 156, Jinghai 4th Road, Daxing Economic and Technological Development Zone, Beijing

Applicant before: Beijing Qingke Biotechnology Co.,Ltd.

Applicant before: Biological disaster prevention and control center of State Forestry and grassland administration

GR01 Patent grant
GR01 Patent grant