CN110734900A - cytosine base editing tool and application thereof - Google Patents


Publication number
CN110734900A
Authority
CN
China
Prior art keywords
fragment
apobec3g
nucleotide sequence
fusion protein
seq
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201911075141.9A
Other languages
Chinese (zh)
Other versions
CN110734900B (en)
Inventor
李佳楠
于文霞
黄行许
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Shanghai for Science and Technology
Original Assignee
University of Shanghai for Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Shanghai for Science and Technology
Priority to CN201911075141.9A
Publication of CN110734900A
Application granted
Publication of CN110734900B
Legal status: Active
Anticipated expiration

Classifications

    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00 Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14 Hydrolases (3)
    • C12N9/16 Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/22 Ribonucleases RNAses, DNAses
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00 Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09 Recombinant DNA-technology
    • C12N15/11 DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/113 Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex-forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00 Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14 Hydrolases (3)
    • C12N9/78 Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5)
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Y ENZYMES
    • C12Y305/00 Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5)
    • C12Y305/04 Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5) in cyclic amidines (3.5.4)
    • C12Y305/04001 Cytosine deaminase (3.5.4.1)
    • C CHEMISTRY; METALLURGY
    • C07 ORGANIC CHEMISTRY
    • C07K PEPTIDES
    • C07K2319/00 Fusion polypeptide
    • C07K2319/01 Fusion polypeptide containing a localisation/targetting motif
    • C07K2319/09 Fusion polypeptide containing a localisation/targetting motif containing a nuclear localisation signal
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2310/00 Structure or type of the nucleic acid
    • C12N2310/10 Type of nucleic acid
    • C12N2310/20 Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00 Nucleic acids vectors
    • C12N2800/22 Vectors comprising a coding region that has been codon optimised for expression in a respective host

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Wood Science & Technology (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Medicinal Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

The invention relates to the field of biotechnology, and in particular to a cytosine base editing tool and its application. The invention provides a fusion protein comprising an APOBEC3G fragment and an SpCas9-D10A nickase fragment. In the series of APOBEC3G cytosine base editing tools provided by the invention, the APOBEC3G fragment carries the amino acid mutations R24A, W94L, Y124A, W127L and P200K relative to the wild type. The four mutations R24A, W94L, Y124A and W127L limit the binding of APOBEC3G to RNA, whereas D128K, P199A, P200K and Q322K improve the binding of APOBEC3G to DNA. These mutations improve editing efficiency when C at positions 4-7 from the 5' end of the sgRNA is changed to T, and greatly reduce or even eliminate the RNA off-target editing effect of the cytosine base editing tool.

Description

cytosine base editing tool and application thereof
Technical Field
The invention relates to the technical field of biology, and in particular to a cytosine base editing tool and its application.
Background
CRISPR/Cas9 is currently the most efficient and convenient genome editing technology. Guided by a single guide RNA (sgRNA), the Cas9 nuclease reaches a specific target in the genome and cleaves it, generating a DNA double-strand break (DSB); editing is then achieved through endogenous DNA repair mechanisms. These mechanisms include non-homologous end joining (NHEJ) and homology-directed repair (HDR). NHEJ, which dominates genome repair, produces random insertions and deletions that lead to gene inactivation. HDR, by contrast, can repair the break accurately using a template and thereby correct a gene mutation.
In practice, however, the probability of accurate HDR-mediated repair is very low, usually less than 5%. This greatly limits the translation of CRISPR/Cas9 from scientific research to application and is a major problem in the field of gene editing.
Recently, a newly developed Base Editor (BE) has successfully solved the above problems, and the efficiency of correcting gene mutation has been greatly improved. There are two types of conventional Base editors, a Cytosine Base Editor (CBE) and an Adenine Base Editor (ABE).
Both CBE and ABE fuse a Cas9 D10A nickase (nCas9), in which the RuvC domain is inactivated, to a cytosine deaminase or an adenine deaminase, respectively. Guided by the sgRNA, the fusion protein reaches the target site, where the sgRNA binds its complementary DNA strand. The cytosine deaminase then deaminates cytosine (C) within a limited window to uracil (U), which pairs with adenine (A); after DNA replication, U is eventually replaced by thymine (T), the base complementary to A. Similarly, the adenine deaminase deaminates adenine (A) within a limited window to hypoxanthine (inosine, I), which pairs with cytosine (C); after DNA replication, I is eventually replaced by guanine (G), the base complementary to C. In this way C-to-T or A-to-G conversion is achieved.
The deaminase rAPOBEC1 in BE3 is a rat cytosine deaminase. In its endogenous state, rAPOBEC1 can edit RNA as well as single-stranded DNA, changing C to U. Recent studies have found that the BE3 base editor produces serious RNA off-target effects, which greatly limits the application of base editors.
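The deamination-then-replication logic described above can be illustrated with a minimal Python sketch. The function names and the toy three-base strands are hypothetical, and the model is deliberately simplified: strands are treated position by position, with no strand orientation and no DNA repair.

```python
# Position-wise base-pairing partners, including the deamination products
# uracil (U, which pairs like T) and inosine (I, which pairs like G).
PARTNER = {"A": "T", "T": "A", "G": "C", "C": "G", "U": "A", "I": "C"}


def copy_strand(template: str) -> str:
    """Return the base-paired partner of each template base, in the same order."""
    return "".join(PARTNER[base] for base in template)


def resolve(deaminated: str) -> str:
    """Copy twice: the edited strand templates its partner, and that partner
    then templates the final strand, which contains only canonical bases."""
    return copy_strand(copy_strand(deaminated))


cbe_outcome = resolve("GUA")  # middle C was deaminated to U
abe_outcome = resolve("GIT")  # middle A was deaminated to I
print(cbe_outcome, abe_outcome)
```

In this toy model the U ends up as T and the I ends up as G, which mirrors the C-to-T and A-to-G outcomes described in the text.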
Disclosure of Invention
The purpose of the present invention is to provide an editing tool for cytosine bases and the use thereof.
In order to achieve this purpose, the invention provides a fusion protein comprising, in order from the N-terminus to the C-terminus, an APOBEC3G (A3G) fragment and an SpCas9-D10A nickase fragment, wherein the APOBEC3G fragment has cytosine deaminase activity, and wherein either the APOBEC3G fragment carries at least one amino acid mutation selected from R24A, W94L, Y124A, W127L, D128K, P199A, P199W, P200A, P200K and Q322K, or the APOBEC3G fragment is a truncated fragment from which the region from the start codon of APOBEC3G to position 190 or position 197 has been deleted.
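The mutation names above follow the usual notation of wild-type residue, 1-based position, and replacement residue. A minimal Python sketch of how such names and the N-terminal truncation could be applied to a protein string is given below. The helper names and the five-residue toy sequence are hypothetical; the real APOBEC3G sequence is not reproduced here.

```python
import re

# Wild-type residue, 1-based position, replacement residue (e.g. "R24A").
MUTATION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def apply_mutations(protein: str, mutations: list[str]) -> str:
    """Apply point mutations, checking that each wild-type residue matches."""
    residues = list(protein)
    for name in mutations:
        match = MUTATION.match(name)
        if match is None:
            raise ValueError(f"unrecognised mutation name: {name}")
        wild_type, position, replacement = match.group(1), int(match.group(2)), match.group(3)
        if position < 1 or position > len(residues) or residues[position - 1] != wild_type:
            raise ValueError(f"{name} does not match the sequence")
        residues[position - 1] = replacement
    return "".join(residues)


def truncate_n_terminus(protein: str, last_deleted: int) -> str:
    """Delete residues 1..last_deleted, so the fragment starts at last_deleted + 1."""
    return protein[last_deleted:]


toy = "MKRWY"  # hypothetical toy sequence, not APOBEC3G
mutant = apply_mutations(toy, ["R3A", "W4L"])
truncated = truncate_n_terminus(toy, 2)
print(mutant, truncated)
```

Under this convention, deleting up to position 190 leaves a fragment that begins at residue 191, which appears to be the origin of the names 191-BE3 and 198-BE3 used in the figures.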
Preferably, the APOBEC3G fragment is derived from human (Homo sapiens).
Preferably, the nucleotide sequence of the APOBEC3G fragment comprises:
a) a nucleotide sequence shown in any one of SEQ ID NO. 27-36; or,
b) a nucleotide sequence having more than 80% sequence similarity to any one of SEQ ID NO. 27-36 and having the function of the nucleotide sequence defined in a).
More preferably, the nucleotide sequence in b) may have more than 80%, 85%, 90%, 93%, 95%, 97%, or 99% similarity to SEQ ID NO. 27-36 in a).
More preferably, the nucleotide sequence in b) specifically comprises a nucleotide sequence obtained from one shown in SEQ ID NO. 27-36 by substituting, deleting or adding one or more (specifically 1-50, 1-30, 1-20, 1-10, 1-5, 1-3, 1, 2 or 3) amino acid codons, or by adding one or more (specifically 1-50, 1-30, 1-20, 1-10, 1-5, 1-3, 1, 2 or 3) amino acid codons at the N-terminal and/or C-terminal end.
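The patent does not state how sequence similarity is computed. As one possible reading, the sketch below measures percent identity over two sequences that are assumed to be already aligned and of equal length, and tests it against the "more than 80%" threshold. The function names and the ten-base toy sequences are hypothetical; sequences that differ by insertions or deletions would first need a proper alignment step.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of positions with identical bases in two pre-aligned sequences."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


def exceeds_threshold(seq_a: str, seq_b: str, threshold: float = 80.0) -> bool:
    """True only when identity is strictly greater than the threshold."""
    return percent_identity(seq_a, seq_b) > threshold


identity = percent_identity("ATGGAGATTC", "ATGGAGATAC")  # one mismatch in ten
print(identity, exceeds_threshold("ATGGAGATTC", "ATGGAGATAC"))
```

Because the wording is "more than 80%", a sequence at exactly 80% identity does not qualify under this reading.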
Preferably, the nucleotide sequence of the SpCas9-D10A nickase fragment comprises:
c) a nucleotide sequence shown in any one of SEQ ID NO. 37-38; or,
d) a nucleotide sequence having more than 80% sequence similarity to any one of SEQ ID NO. 37-38 and having the function of the nucleotide sequence defined in c).
More preferably, the nucleotide sequence in d) may have more than 80%, 85%, 90%, 93%, 95%, 97%, or 99% similarity to SEQ ID NO. 37-38.
More preferably, the nucleotide sequence in d) specifically comprises a nucleotide sequence obtained from one shown in SEQ ID NO. 37-38 by substituting, deleting or adding one or more (specifically 1-50, 1-30, 1-20, 1-10, 1-5, 1-3, 1, 2, or 3) amino acid codons, or by adding one or more (specifically 1-50, 1-30, 1-20, 1-10, 1-5, 1-3, 1, 2, or 3) amino acid codons at the N-terminal and/or C-terminal end.
Preferably, the fusion protein further comprises a nuclear localization signal fragment located at the N-terminus of the APOBEC3G fragment or the C-terminus of the SpCas9-D10A nickase fragment.
More preferably, the nucleotide sequence of the nuclear localization signal fragment is shown as SEQ ID NO. 39.
Preferably, the fusion protein further comprises a flexible linker peptide fragment located at the N-terminus of the APOBEC3G fragment, between the APOBEC3G fragment and the SpCas9-D10A nickase fragment, or at the C-terminus of the SpCas9-D10A nickase fragment.
More preferably, the nucleotide sequence of the flexible linker peptide fragment is as set forth in SEQ ID NO. 40-41.
The present invention also provides an isolated polynucleotide encoding the above fusion protein.
The invention also provides a construct obtained by inserting the isolated polynucleotide into an expression vector; the polynucleotide sequence of the construct is shown in SEQ ID NO. 1-14.
Preferably, the expression vector includes, but is not limited to, a pCMV expression vector, a pSV2 expression vector, a pGL3 expression vector, and the like.
The invention also provides an expression system, which is a host cell that contains the construct or has the isolated polynucleotide integrated into its genome. The host cell can express the fusion protein, and the fusion protein can cooperate with the sgRNA so that it is directed to a target region and carries out base editing there.
Preferably, the host cell is selected from a eukaryotic cell or a prokaryotic cell.
More preferably, the host cell is selected from a mouse cell or a human cell.
Further, the host cell is selected from mouse neuroblastoma cells, human embryonic kidney cells, human cervical cancer cells, human colon cancer cells and human osteosarcoma cells.
Still further, the host cell of the expression system is selected from the group consisting of N2a cells, HEK293FT cells, HeLa cells, HCT116 cells, and U2OS cells.
The invention also provides a base editing tool comprising the fusion protein and an sgRNA.
The invention also provides the application of the base editing tool in gene editing of eukaryotes.
Preferably, the gene editing is base editing of C-to-T at positions 4-7 of the 5' end of the sgRNA in the target region.
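The editing window described above can be pictured with a small Python sketch that converts every C at positions 4-7 of a protospacer to T, counting from the PAM-distal 5' end. The function name and the 20-nt example sequence are illustrative assumptions and are not taken from the sequence listing. The model is also idealised: real editing is partial and varies from position to position.

```python
def edit_window(protospacer: str, start: int = 4, end: int = 7) -> str:
    """Convert C to T at 1-based positions start..end, counted from the 5' end."""
    bases = list(protospacer.upper())
    for index in range(start - 1, min(end, len(bases))):
        if bases[index] == "C":
            bases[index] = "T"
    return "".join(bases)


example = "GGCCCAGACTGAGCACGTGA"  # illustrative 20-nt protospacer
edited = edit_window(example)
print(edited)
```

In the example, the Cs at positions 4 and 5 fall inside the window and are converted, while the C at position 3 and all Cs beyond position 7 are left unchanged.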
APOBEC3G is a member of the human APOBEC family. It can bind single-stranded DNA or RNA and catalyze deamination, converting C to U, and it plays an important role in antiviral processes. Deamination by APOBEC3G tends to occur within CC sequences. APOBEC3G has two functional domains; earlier studies suggested that the amino-terminal domain is mainly responsible for RNA binding, while the carboxy-terminal domain is mainly responsible for DNA binding and deamination, a feature common to all two-domain APOBECs. Notably, earlier studies did not consider APOBEC3G to have an RNA editing function. More recent studies indicate that the DNA-binding and RNA-binding domains of APOBEC3G compete with each other and, in contrast to the earlier work, that overexpressed APOBEC3G does have RNA deaminase activity.
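The stated preference for CC sequences can be combined with the 4-7 editing window in a short Python sketch that lists each C in the window and flags whether it is immediately preceded by another C. Treating the second C of a CC pair as the favoured target is an assumption made for this illustration; the text only says that deamination tends to occur in the CC sequence. The function name and example sequences are hypothetical.

```python
def cc_context_targets(protospacer: str, start: int = 4, end: int = 7) -> list[tuple[int, bool]]:
    """List (position, preceded_by_C) for each C at 1-based positions start..end."""
    sequence = protospacer.upper()
    hits = []
    for position in range(start, end + 1):
        index = position - 1
        if index < len(sequence) and sequence[index] == "C":
            preceded_by_c = index > 0 and sequence[index - 1] == "C"
            hits.append((position, preceded_by_c))
    return hits


targets = cc_context_targets("GGCCCAGACTGAGCACGTGA")
print(targets)
```

A list like this could help rank candidate sgRNAs, although actual editing outcomes would still have to be confirmed experimentally.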
Compared with the prior art, the invention has the beneficial effects that:
(1) The invention provides a new-generation cytosine base editing tool in which APOBEC3G is linked to an SpCas9-D10A nickase fragment and the functional domain of APOBEC3G responsible for RNA binding is then mutated, so that the RNA deamination function is impaired while the DNA deamination activity is improved.
(2) The base editing system provided by the invention widens the targetable range of the genome. It can use an NGG sequence as the PAM and achieves C-to-T base conversion at positions 4-7 from the 5' end of the sgRNA target region, with high mutation precision, and it can greatly reduce or even eliminate RNA off-target effects.
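As an illustration of NGG-dependent targeting, the following Python sketch scans one strand of a DNA sequence for 20-nt protospacers that are immediately followed by an NGG PAM. The function name, the 20-nt spacer length default and the example sequence are assumptions for illustration. Only the given strand is searched; a fuller tool would also scan the reverse complement.

```python
import re


def find_ngg_targets(sequence: str, spacer_length: int = 20) -> list[tuple[int, str, str]]:
    """Return (0-based start, protospacer, PAM) for each site followed by NGG."""
    sequence = sequence.upper()
    # A lookahead is used so that overlapping candidate sites are all reported.
    pattern = re.compile(r"(?=([ACGT]{%d})([ACGT]GG))" % spacer_length)
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(sequence)]


example = "TT" + "GGCCCAGACTGAGCACGTGA" + "TGG" + "AA"  # illustrative sequence
sites = find_ngg_targets(example)
print(sites)
```

Each hit could then be passed to a window check to see whether a C lies at positions 4-7 of the protospacer.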
(3) Compared with the wild type, the APOBEC3G fragment of the invention has amino acid mutations of R24A, W94L, Y124A, W127L and the like, can limit the combination of APOBEC3G and RNA, and can greatly reduce or even eliminate the RNA off-target effect of a cytosine base editing tool in the base editing of mutating C at 4-7 sites of the 5' end of sgRNA into T; in addition, the cytosine base editing tool provided by the invention has higher editing efficiency on DNA than a classical cytosine base editing tool (BE3), greatly improves the mutation accuracy on the premise of ensuring the editing efficiency, and has good industrialization prospect.
Drawings
FIG. 1 shows a schematic structure diagram of APOBEC3G-BE3, APOBEC3G-BE4 series plasmids used in the examples of the present invention;
FIG. 2 is a statistical chart showing the editing capacity of A3G-BE3, 191-BE3, 198-BE3 and BE3 at endogenous gene loci in HEK293T cells, wherein a is a statistical chart of the C-to-T editing efficiency of A3G-BE3, 191-BE3, 198-BE3 and BE3 at HEK293Site3; b is a statistical chart of the RNA off-target efficiency of A3G-BE3, 191-BE3, 198-BE3 and BE3; C4 and C5 represent the positions of C at the target site, counted from the PAM-distal base; and 191-BE3 and 198-BE3 are truncated APOBEC3G-BE3 variants in which APOBEC3G is deleted from position 1 to position 190 and from position 1 to position 197, respectively;
FIG. 3 is a statistical chart showing the editing capacity of 4M-BE3, A3G-BE3 and BE3 at endogenous gene loci in HEK293T cells, wherein a is a statistical chart of the C-to-T editing efficiency of 4M-BE3, A3G-BE3 and BE3 at the three sites HEK293Site3, HEK293Site2 and EMX1; b is a statistical chart of the RNA off-target efficiency of 4M-BE3, A3G-BE3 and BE3; and C3, C4, C5, C6 and C8 represent the positions of C at the target site, counted from the PAM-distal base;
FIG. 4 is a statistical chart showing the editing capacity, at endogenous gene loci in HEK293T cells, of a series of mutants based on 4M-BE3, wherein a is a schematic diagram of the mutation sites introduced on the basis of 4M-BE3; b is a statistical chart of the C-to-T editing efficiency of the 7 mutant plasmids, 4M-BE3 and BE3 at the three sites HEK293Site2, HEK293Site3 and EMX1; c is a statistical chart of the C-to-T editing efficiency of the two combined mutant plasmids built on the basis of b, 4M-BE3 and BE3 at the three sites HEK293Site2, HEK293Site3 and EMX1; and d is a statistical chart for 4M-BE3 and the two combined mutant plasmids (the remainder of this panel description, including the plasmid names and the list of C positions, is garbled in the source text);
FIG. 5 is a statistical chart showing the editing capacity of the optimized plasmids 4M + P199A + P200K-BE4 and 4M + D128K + P199A + P200K-BE4 at endogenous gene loci in HEK293T cells, wherein
a is a structural schematic diagram of 4M + P199A + P200K-BE4 and 4M + D128K + P199A + P200K-BE4;
b is a statistical chart of the C-to-T editing efficiency of 4M + P199A + P200K-BE4 and 4M + D128K + P199A + P200K-BE4 at HEK293Site3;
c is a statistical chart of the RNA off-target efficiency of 4M + P199A + P200K-BE4, 4M + D128K + P199A + P200K-BE4 and BE3; C4 and C5 represent the positions of C at the target site, counted from the PAM-distal base.
Detailed Description
Before the specific embodiments of the present invention are further described, it is to be understood that the scope of the invention is not limited to the specific embodiments described below. It is also to be understood that the terminology used in the examples is for the purpose of describing particular embodiments and is not intended to limit the scope of the invention. As used in the specification and the claims, the singular forms "a", "an" and "the" include the plural forms unless the context clearly dictates otherwise.
When numerical ranges are given in the examples, it is understood that, unless otherwise indicated herein, both endpoints of each range and any value between them may be selected.
Unless otherwise indicated, the experimental methods, detection methods, and preparation methods disclosed herein all employ techniques conventional in the art of molecular biology, biochemistry, chromatin structure and analysis, analytical chemistry, cell culture, recombinant DNA technology, and related arts. These techniques are well described in the literature; see in particular Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, Second edition, Cold Spring Harbor Laboratory Press, 1989 and Third edition, 2001; Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, New York, 1987 and periodic updates; the series METHODS IN ENZYMOLOGY, Academic Press, San Diego; Wolffe, CHROMATIN STRUCTURE AND FUNCTION, Third edition, Academic Press, San Diego, 1998; METHODS IN ENZYMOLOGY, Vol. 304, Chromatin (P.M. Wassarman and A.P. Wolffe, eds.), Academic Press, San Diego, 1999; and METHODS IN MOLECULAR BIOLOGY, Vol. 119, Chromatin Protocols (P.B. Becker, ed.), Humana Press, Totowa, 1999.
Example 1
In this example, the RNA-binding sites and DNA-binding sites in the APOBEC3G portion were subjected to point mutation (including R24A, W94L, Y124A, W127L, D128K, P199A, P199W, P200A, P200K and Q322K); or the amino terminus of APOBEC3G was truncated by deleting from the start codon of APOBEC3G to position 190 or position 197, and the truncated APOBEC3G fragment was used to construct C-APOBEC3G-BE3; or the D10A nickase Cas9 portion of BE3 was replaced by the more efficiently expressed D10A nickase Cas9 of BE4.
The related plasmids are shown in FIG. 1, wherein 4M denotes the four point mutations R24A, W94L, Y124A and W127L, and A3G (OP) denotes codon-optimized A3G.
The mutant plasmids used in the examples were constructed as follows: amino acid mutations were introduced into the APOBEC3G portion of the A3G-BE3 plasmid or the A3G-BE4 plasmid with the Mut Express II Fast Mutagenesis Kit V2 (Vazyme, C214-02) (FIG. 1).
The constructed plasmids were C-191-BE3, C-198-BE3, 4M-BE3, 4M + D128K-BE3, 4M + P199A-BE3, 4M + P199W-BE3, 4M + P200A-BE3, 4M + P200K-BE3, 4M + Q322K-BE3, 4M + D128K + P199A + P200A-BE3, A3G-BE4max, 4M + P199A + P200K-BE4max, 4M + D128K + P199A + P200K-BE4max and A3G (OP) + P199A + P200K-BE4max; their sequences are shown in SEQ ID NO. 1-14.
Example 2
In this example, the APOBEC3G series of tools were used to edit the endogenous gene locus in HEK293T cells, and the editing efficiency and RNA off-target efficiency were examined.
2.1 construction of sgRNA plasmid
Three human endogenous gene loci were selected and sgRNAs were designed for them. The positions of the 3 sgRNAs in the genome are NC_000015.10:107422339-107422361; NC_000013.11:87944780-87944802; and NC_000014.9:72917055-72917077.
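The genomic coordinates above can be sanity-checked with a few lines of Python. The sketch parses each locus and reports its inclusive length; the helper name is hypothetical. All three loci span 23 bases, which is consistent with a 20-nt protospacer followed by a 3-nt PAM, although the patent does not state this breakdown explicitly.

```python
def parse_locus(text: str) -> tuple[str, int, int, int]:
    """Split 'accession:start-end' and return (accession, start, end, length)."""
    accession, span = text.replace(" ", "").split(":")
    start, end = (int(value) for value in span.split("-"))
    return accession, start, end, end - start + 1  # coordinates are inclusive


loci = [
    "NC_000015.10:107422339-107422361",
    "NC_000013.11:87944780-87944802",
    "NC_000014.9:72917055-72917077",
]
lengths = [parse_locus(locus)[3] for locus in loci]
print(lengths)
```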
The upstream and downstream oligonucleotide sequences of each sgRNA were annealed using the following program (95 ℃ for 5 min; 95 ℃ to 85 ℃ at -2 ℃/s; 85 ℃ to 25 ℃ at -0.1 ℃/s; hold at 4 ℃) and ligated into the pGL3-U6-sgRNA vector (Addgene #51133) linearized with BsaI (NEB: R0539L). The polynucleotide sequences used are shown in SEQ ID NO. 15-20. The linearization system was as follows: pGL3-U6-sgRNA, 2 μg; buffer (NEB: R0539L), 6 μL; BsaI, 2 μL; ddH2O to 60 μL. Digestion was carried out overnight at 37 ℃. The ligation system was as follows: T4 ligation buffer (NEB: M0202L), 1 μL; linearized vector, 20 ng; annealed oligo fragment (10 μM), 5 μL; T4 ligase (NEB: M0202L), 0.5 μL; ddH2O to 10 μL. Ligation was carried out overnight at 16 ℃. The ligation product was transformed, and clones were picked and identified. Positive clones were cultured with shaking, the plasmid was extracted (Axygen: AP-MN-P-250G), and its concentration was determined.
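The duration of the annealing program can be estimated from its ramp rates, as in the Python sketch below. Exact fractions are used to avoid floating-point rounding. The function name is hypothetical, and the final 4 ℃ hold is excluded because its duration is not specified.

```python
from fractions import Fraction


def ramp_seconds(start_c: int, end_c: int, rate_c_per_s: Fraction) -> Fraction:
    """Time needed to move from start_c to end_c at the given rate magnitude."""
    return abs(Fraction(end_c - start_c)) / abs(rate_c_per_s)


hold_95 = 5 * 60                                   # 95 C for 5 min
fast_ramp = ramp_seconds(95, 85, Fraction(2))      # 95 C to 85 C at 2 C/s
slow_ramp = ramp_seconds(85, 25, Fraction(1, 10))  # 85 C to 25 C at 0.1 C/s
total_seconds = hold_95 + fast_ramp + slow_ramp
print(fast_ramp, slow_ramp, total_seconds)
```

Under these assumptions the slow ramp dominates the run time, and the program takes a little over 15 minutes before the 4 ℃ hold begins.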
2.2 Cell culture, transfection and collection
HEK293T cells (purchased from ATCC) were seeded in high-glucose DMEM (HyClone, SH30022.01B) supplemented with 10% FBS and 1% (v/v) penicillin-streptomycin (Gibco). When the cells reached 80% confluence, the medium was replaced with DMEM containing 10% serum and the cells were cultured for a further 2 hours to restore them to an optimal state. The amount of plasmid transfected per well was 4 μg of an APOBEC3G-series editing tool plasmid prepared in Example 1 (see FIG. 1) and 2 μg of sgRNA plasmid. The plasmids were mixed in 250 μl of Opti-MEM medium (Gibco, 11058021). Separately, Lipofectamine 2000 transfection reagent (Thermo, 11668019; the volume is missing from the source text) was mixed thoroughly into 250 μl of Opti-MEM medium and left to stand for 5 minutes. The plasmid-containing Opti-MEM was then added to the Lipofectamine 2000-containing Opti-MEM, mixed gently by pipetting, and left to stand for 20 minutes. The resulting mixture was added to each 6 cm plate (cells at 80% confluence at the time of transfection). Six hours after transfection, the medium was replaced with DMEM containing 10% FBS. Forty-eight hours after transfection, the 5% of cells with the highest positive signal were sorted out; 5,000 cells were used to detect DNA editing efficiency, and the remaining 500,000 cells were used for RNA extraction to detect RNA off-target efficiency.
2.3 DNA editing efficiency detection
The cells were first lysed to obtain genomic DNA; the lysis buffer consisted of 50 mM KCl, 1.5 mM MgCl2, 10 mM Tris pH 8.0, 0.5% Nonidet P-40, 0.5% Tween 20 and 100 μg/ml proteinase K. The sequence surrounding the target was amplified by PCR, and the amplification product was purified and identified by Sanger sequencing. The amplification system was as follows: 2x buffer (Vazyme, P505), 25 μL; dNTP, 1 μL; forward primer F (10 pmol/μL), 1 μL; reverse primer R (10 pmol/μL), 1 μL; template, 1 μL; DNA polymerase (Vazyme, P505), 0.5 μL; ddH2O to 50 μL. The amplified PCR product was purified as follows: three volumes of PCR-A (Axygen: AP-PCR-250G) were added and the mixture was loaded onto the column and centrifuged at 12,000 r/min for 1 minute; 700 μL of W2 was added and the column was centrifuged for 1 minute; the flow-through was discarded, another 700 μL of W2 was added, and the column was centrifuged for 1 minute; the flow-through was discarded and the empty column was centrifuged for 1 minute; the product was eluted with 20 μL of water. The PCR amplification primers used are shown in SEQ ID NO. 21-26. The PCR products were subjected to Sanger sequencing with one of the PCR amplification primers, and the sequencing results were then compared to determine editing efficiency. The results are shown in FIGS. 2a, 3a and 4.
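The water volume needed to bring the PCR amplification system to 50 μL follows directly from the listed component volumes, as the Python sketch below shows. The dictionary keys and function name are hypothetical labels. The final primer concentration is derived from the stated 10 pmol/μL stock, which is equivalent to 10 μM.

```python
# Component volumes in microlitres, as listed for one 50 uL reaction.
PCR_MIX_UL = {
    "2x buffer": 25.0,
    "dNTP": 1.0,
    "forward primer": 1.0,
    "reverse primer": 1.0,
    "template": 1.0,
    "DNA polymerase": 0.5,
}


def water_volume(components: dict[str, float], total_ul: float) -> float:
    """Volume of ddH2O needed to reach the total reaction volume."""
    used = sum(components.values())
    if used > total_ul:
        raise ValueError("components exceed the total reaction volume")
    return total_ul - used


water_ul = water_volume(PCR_MIX_UL, 50.0)
# 1 uL of a 10 pmol/uL (10 uM) primer stock diluted into 50 uL.
primer_final_um = PCR_MIX_UL["forward primer"] * 10.0 / 50.0
print(water_ul, primer_final_um)
```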
2.4 detection of RNA off-target Effect efficiency
For RNA detection, total RNA was extracted with Trizol (Vazyme, R401-01) as follows: 1 ml of Trizol was added to each well, mixed and collected; 200 μl of chloroform was added and mixed thoroughly by inversion, followed by centrifugation at 12,000 r/min and 4 ℃ for 15 minutes; 400 μl of the supernatant was transferred, an equal volume of isopropanol was added and mixed by inversion, followed by centrifugation at 12,000 r/min and 4 ℃ for 10 minutes; the supernatant was discarded, 1 mL of 75% ethanol was added and mixed by inversion, followed by centrifugation at 12,000 r/min and 4 ℃ for 10 minutes; the supernatant was discarded, and the pellet was air-dried and dissolved in water. 2 μg of RNA was used for RNA-seq, and the off-target effect was analyzed to obtain the number of off-target events (i.e., the number of mutations). The results are shown in FIGS. 2b, 3b and 5.
In terms of both DNA editing efficiency and RNA off-target efficiency, the APOBEC3G-series editing tools show high editing efficiency together with extremely low, or even eliminated, RNA off-target editing.
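The patent does not describe how the RNA-seq data were processed. As a rough illustration only, the Python sketch below counts candidate C-to-U events in a list of variant calls given as hypothetical (reference base, alternate base) pairs. Counting both C>T and G>A without using transcript strand information is a simplifying assumption, since a C-to-U edit on a minus-strand transcript appears as G>A against the reference.

```python
# Reference/alternate pairs compatible with a C-to-U RNA edit when the
# transcript strand is unknown (assumption: both are counted).
C_TO_U_SIGNATURES = {("C", "T"), ("G", "A")}


def count_c_to_u(variants: list[tuple[str, str]]) -> int:
    """Count variant calls whose base change matches a C-to-U signature."""
    return sum((ref.upper(), alt.upper()) in C_TO_U_SIGNATURES for ref, alt in variants)


calls = [("C", "T"), ("G", "A"), ("A", "G"), ("C", "G")]  # hypothetical calls
off_target_count = count_c_to_u(calls)
print(off_target_count)
```

A real analysis would also need read-depth and quality filters and a comparison against untransfected controls, none of which are specified in the text.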
In conclusion, the present invention effectively overcomes various disadvantages of the prior art and has high industrial utilization value.
The above embodiments merely illustrate the principles and effects of the invention and are not intended to limit it. Those skilled in the art can modify or vary the disclosed embodiments without departing from the spirit and scope of the invention. Accordingly, all equivalent modifications and variations made by those skilled in the art without departing from the spirit and technical ideas disclosed herein shall be covered by the appended claims.
SEQUENCE LISTING
<110> Shanghai science and technology university
<120> cytosine base editing tools and application thereof
<160>41
<170>PatentIn version 3.5
<210>1
<211>8430
<212>DNA
<213>Artificial Sequence
<220>
<223>C-191-BE3
<400>1
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat ggagattctc 420
agacactcga tggatccaaa gacattcact ttcaacttta acaatgaacc ttgggtcaga 480
ggacggcatg agacttacct gtgttatgag gtggagcgca tgcacaatga cacctgggtc 540
ctgctgaacc agcgcagggg ctttctatgc aaccaggctc cacataaaca cggtttcctt 600
gaaggccgcc atgcagagct gtgcttcctg gacgtgattc ccttttggaa gctggacctg 660
gaccaggact acagggttac ctgcttcacc tcctggagcc cctgcttcag ctgtgcccag 720
gaaatggcta aattcatttc aaaaaacaaa cacgtgagcc tgtgcatctt cactgcccgc 780
atctatgatg atcaaggaag atgtcaggag gggctgcgca ccctggccga ggctggggcc 840
aaaatttcaa taatgacata cagtgaattt aagcactgct gggacacctt tgtggaccac 900
cagggatgtc ccttccagcc ctgggatgga ctagatgagc acagccaaga cctgagtggg 960
aggctgcggg ccattctcca gaatcaggaa aacagcggca gcgagactcc cgggacctca 1020
gagtccgcca cacccgaaag tgataaaaag tattctattg gtttagccat cggcactaat 1080
tccgttggat gggctgtcat aaccgatgaa tacaaagtac cttcaaagaa atttaaggtg 1140
ttggggaaca cagaccgtca ttcgattaaa aagaatctta tcggtgccct cctattcgat 1200
agtggcgaaa cggcagaggc gactcgcctg aaacgaaccg ctcggagaag gtatacacgt 1260
cgcaagaacc gaatatgtta cttacaagaa atttttagca atgagatggc caaagttgac 1320
gattctttct ttcaccgttt ggaagagtcc ttccttgtcg aagaggacaa gaaacatgaa 1380
cggcacccca tctttggaaa catagtagat gaggtggcat atcatgaaaa gtacccaacg 1440
atttatcacc tcagaaaaaa gctagttgac tcaactgata aagcggacct gaggttaatc 1500
tacttggctc ttgcccatat gataaagttc cgtgggcact ttctcattga gggtgatcta 1560
aatccggaca actcggatgt cgacaaactg ttcatccagt tagtacaaac ctataatcag 1620
ttgtttgaag agaaccctat aaatgcaagt ggcgtggatg cgaaggctat tcttagcgcc 1680
cgcctctcta aatcccgacg gctagaaaac ctgatcgcac aattacccgg agagaagaaa 1740
aatgggttgt tcggtaacct tatagcgctc tcactaggcc tgacaccaaa ttttaagtcg 1800
aacttcgact tagctgaaga tgccaaattg cagcttagta aggacacgta cgatgacgat 1860
ctcgacaatc tactggcaca aattggagat cagtatgcgg acttattttt ggctgccaaa 1920
aaccttagcg atgcaatcct cctatctgac atactgagag ttaatactga gattaccaag 1980
gcgccgttat ccgcttcaat gatcaaaagg tacgatgaac atcaccaaga cttgacactt 2040
ctcaaggccc tagtccgtca gcaactgcct gagaaatata aggaaatatt ctttgatcag 2100
tcgaaaaacg ggtacgcagg ttatattgac ggcggagcga gtcaagagga attctacaag 2160
tttatcaaac ccatattaga gaagatggat gggacggaag agttgcttgt aaaactcaat 2220
cgcgaagatc tactgcgaaa gcagcggact ttcgacaacg gtagcattcc acatcaaatc 2280
cacttaggcg aattgcatgc tatacttaga aggcaggagg atttttatcc gttcctcaaa 2340
gacaatcgtg aaaagattga gaaaatccta acctttcgca taccttacta tgtgggaccc 2400
ctggcccgag ggaactctcg gttcgcatgg atgacaagaa agtccgaaga aacgattact 2460
ccatggaatt ttgaggaagt tgtcgataaa ggtgcgtcag ctcaatcgtt catcgagagg 2520
atgaccaact ttgacaagaa tttaccgaac gaaaaagtat tgcctaagca cagtttactt 2580
tacgagtatt tcacagtgta caatgaactc acgaaagtta agtatgtcac tgagggcatg 2640
cgtaaacccg cctttctaag cggagaacag aagaaagcaa tagtagatct gttattcaag 2700
accaaccgca aagtgacagt taagcaattg aaagaggact actttaagaa aattgaatgc 2760
ttcgattctg tcgagatctc cggggtagaa gatcgattta atgcgtcact tggtacgtat 2820
catgacctcc taaagataat taaagataag gacttcctgg ataacgaaga gaatgaagat 2880
atcttagaag atatagtgtt gactcttacc ctctttgaag atcgggaaat gattgaggaa 2940
agactaaaaa catacgctca cctgttcgac gataaggtta tgaaacagtt aaagaggcgt 3000
cgctatacgg gctggggacg attgtcgcgg aaacttatca acgggataag agacaagcaa 3060
agtggtaaaa ctattctcga ttttctaaag agcgacggct tcgccaatag gaactttatg 3120
cagctgatcc atgatgactc tttaaccttc aaagaggata tacaaaaggc acaggtttcc 3180
ggacaagggg actcattgca cgaacatatt gcgaatcttg ctggttcgcc agccatcaaa 3240
aagggcatac tccagacagt caaagtagtg gatgagctag ttaaggtcat gggacgtcac 3300
aaaccggaaa acattgtaat cgagatggca cgcgaaaatc aaacgactca gaaggggcaa 3360
aaaaacagtc gagagcggat gaagagaata gaagagggta ttaaagaact gggcagccag 3420
atcttaaagg agcatcctgt ggaaaatacc caattgcaga acgagaaact ttacctctat 3480
tacctacaaa atggaaggga catgtatgtt gatcaggaac tggacataaa ccgtttatct 3540
gattacgacg tcgatcacat tgtaccccaa tcctttttga aggacgattc aatcgacaat 3600
aaagtgctta cacgctcgga taagaaccga gggaaaagtg acaatgttcc aagcgaggaa 3660
gtcgtaaaga aaatgaagaa ctattggcgg cagctcctaa atgcgaaact gataacgcaa 3720
agaaagttcg ataacttaac taaagctgag aggggtggct tgtctgaact tgacaaggcc 3780
ggatttatta aacgtcagct cgtggaaacc cgccaaatca caaagcatgt tgcacagata 3840
ctagattccc gaatgaatac gaaatacgac gagaacgata agctgattcg ggaagtcaaa 3900
gtaatcactt taaagtcaaa attggtgtcg gacttcagaa aggattttca attctataaa 3960
gttagggaga taaataacta ccaccatgcg cacgacgctt atcttaatgc cgtcgtaggg 4020
accgcactca ttaagaaata cccgaagcta gaaagtgagt ttgtgtatgg tgattacaaa 4080
gtttatgacg tccgtaagat gatcgcgaaa agcgaacagg agataggcaa ggctacagcc 4140
aaatacttct tttattctaa cattatgaat ttctttaaga cggaaatcac tctggcaaac 4200
ggagagatac gcaaacgacc tttaattgaa accaatgggg agacaggtga aatcgtatgg 4260
gataagggcc gggacttcgc gacggtgaga aaagttttgt ccatgcccca agtcaacata 4320
gtaaagaaaa ctgaggtgca gaccggaggg ttttcaaagg aatcgattct tccaaaaagg 4380
aatagtgata agctcatcgc tcgtaaaaag gactgggacc cgaaaaagta cggtggcttc 4440
gatagcccta cagttgccta ttctgtccta gtagtggcaa aagttgagaa gggaaaatcc 4500
aagaaactga agtcagtcaa agaattattg gggataacga ttatggagcg ctcgtctttt 4560
gaaaagaacc ccatcgactt ccttgaggcg aaaggttaca aggaagtaaa aaaggatctc 4620
ataattaaac taccaaagta tagtctgttt gagttagaaa atggccgaaa acggatgttg 4680
gctagcgccg gagagcttca aaaggggaac gaactcgcac taccgtctaa atacgtgaat 4740
ttcctgtatt tagcgtccca ttacgagaag ttgaaaggtt cacctgaaga taacgaacag 4800
aagcaacttt ttgttgagca gcacaaacat tatctcgacg aaatcataga gcaaatttcg 4860
gaattcagta agagagtcat cctagctgat gccaatctgg acaaagtatt aagcgcatac 4920
aacaagcaca gggataaacc catacgtgag caggcggaaa atattatcca tttgtttact 4980
cttaccaacc tcggcgctcc agccgcattc aagtattttg acacaacgat agatcgcaaa 5040
cgatacactt ctaccaagga ggtgctagac gcgacactga ttcaccaatc catcacggga 5100
ttatatgaaa ctcggataga tttgtcacag cttgggggtg actctggtgg ttctactaat 5160
ctgtcagata ttattgaaaa ggagaccggt aagcaactgg ttatccagga atccatcctc 5220
atgctcccag aggaggtgga agaagtcatt gggaacaagc cggaaagcga tatactcgtg 5280
cacaccgcct acgacgagag caccgacgag aatgtcatgc ttctgactag cgacgcccct 5340
gaatacaagc cttgggctct ggtcatacag gatagcaacg gtgagaacaa gattaagatg 5400
ctctctggtg gttctcccaa gaagaagagg aaagtctaac cggtcatcat caccatcacc 5460
attgagttta aacccgctga tcagcctcga ctgtgccttc tagttgccag ccatctgttg 5520
tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc cactcccact gtcctttcct 5580
aataaaatga ggaaattgca tcgcattgtc tgagtaggtg tcattctatt ctggggggtg 5640
gggtggggca ggacagcaag ggggaggatt gggaagacaa tagcaggcat gctggggatg 5700
cggtgggctc tatggcttct gaggcggaaa gaaccagctg gggctcgata ccgtcgacct 5760
ctagctagag cttggcgtaa tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc 5820
tcacaattcc acacaacata cgagccggaa gcataaagtg taaagcctag ggtgcctaat 5880
gagtgagcta actcacatta attgcgttgc gctcactgcc cgctttccag tcgggaaacc 5940
tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg 6000
ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag 6060
cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag 6120
gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc 6180
tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc 6240
agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc 6300
tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt 6360
cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg 6420
ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat 6480
ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag 6540
ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt 6600
ggtggcctaa ctacggctac actagaagaa cagtatttgg tatctgcgct ctgctgaagc 6660
cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta 6720
gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag 6780
atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga 6840
ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa 6900
gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa 6960
tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc 7020
ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga 7080
taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa 7140
gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt 7200
gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg 7260
ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc 7320
aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg 7380
gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag 7440
cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt 7500
actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt 7560
caatacggga taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac 7620
gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac 7680
ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag 7740
caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa 7800
tactcatact cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga 7860
gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc 7920
cccgaaaagt gccacctgac gtcgacggat cgggagatcg atctcccgat cccctagggt 7980
cgactctcag tacaatctgc tctgatgccg catagttaag ccagtatctg ctccctgctt 8040
gtgtgttgga ggtcgctgag tagtgcgcga gcaaaattta agctacaaca aggcaaggct 8100
tgaccgacaa ttgcatgaag aatctgctta gggttaggcg ttttgcgctg cttcgcgatg 8160
tacgggccag atatacgcgt tgacattgat tattgactag ttattaatag taatcaatta 8220
cggggtcatt agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg 8280
gcccgcctgg ctgaccgccc aacgaccccc gcccattgac gtcaataatg acgtatgttc 8340
ccatagtaac gccaataggg actttccatt gacgtcaatg ggtggagtat ttacggtaaa 8400
ctgcccactt ggcagtacat caagtgtatc 8430
<210>2
<211>8409
<212>DNA
<213>Artificial Sequence
<220>
<223>C-198-BE3
<400>2
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat ggatccaaag 420
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 480
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 540
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 600
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 660
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 720
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 780
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 840
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 900
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 960
aatcaggaaa acagcggcag cgagactccc gggacctcag agtccgccac acccgaaagt 1020
gataaaaagt attctattgg tttagccatc ggcactaatt ccgttggatg ggctgtcata 1080
accgatgaat acaaagtacc ttcaaagaaa tttaaggtgt tggggaacac agaccgtcat 1140
tcgattaaaa agaatcttat cggtgccctc ctattcgata gtggcgaaac ggcagaggcg 1200
actcgcctga aacgaaccgc tcggagaagg tatacacgtc gcaagaaccg aatatgttac 1260
ttacaagaaa tttttagcaa tgagatggcc aaagttgacg attctttctt tcaccgtttg 1320
gaagagtcct tccttgtcga agaggacaag aaacatgaac ggcaccccat ctttggaaac 1380
atagtagatg aggtggcata tcatgaaaag tacccaacga tttatcacct cagaaaaaag 1440
ctagttgact caactgataa agcggacctg aggttaatct acttggctct tgcccatatg 1500
ataaagttcc gtgggcactt tctcattgag ggtgatctaa atccggacaa ctcggatgtc 1560
gacaaactgt tcatccagtt agtacaaacc tataatcagt tgtttgaaga gaaccctata 1620
aatgcaagtg gcgtggatgc gaaggctatt cttagcgccc gcctctctaa atcccgacgg 1680
ctagaaaacc tgatcgcaca attacccgga gagaagaaaa atgggttgtt cggtaacctt 1740
atagcgctct cactaggcct gacaccaaat tttaagtcga acttcgactt agctgaagat 1800
gccaaattgc agcttagtaa ggacacgtac gatgacgatc tcgacaatct actggcacaa 1860
attggagatc agtatgcgga cttatttttg gctgccaaaa accttagcga tgcaatcctc 1920
ctatctgaca tactgagagt taatactgag attaccaagg cgccgttatc cgcttcaatg 1980
atcaaaaggt acgatgaaca tcaccaagac ttgacacttc tcaaggccct agtccgtcag 2040
caactgcctg agaaatataa ggaaatattc tttgatcagt cgaaaaacgg gtacgcaggt 2100
tatattgacg gcggagcgag tcaagaggaa ttctacaagt ttatcaaacc catattagag 2160
aagatggatg ggacggaaga gttgcttgta aaactcaatc gcgaagatct actgcgaaag 2220
cagcggactt tcgacaacgg tagcattcca catcaaatcc acttaggcga attgcatgct 2280
atacttagaa ggcaggagga tttttatccg ttcctcaaag acaatcgtga aaagattgag 2340
aaaatcctaa cctttcgcat accttactat gtgggacccc tggcccgagg gaactctcgg 2400
ttcgcatgga tgacaagaaa gtccgaagaa acgattactc catggaattt tgaggaagtt 2460
gtcgataaag gtgcgtcagc tcaatcgttc atcgagagga tgaccaactt tgacaagaat 2520
ttaccgaacg aaaaagtatt gcctaagcac agtttacttt acgagtattt cacagtgtac 2580
aatgaactca cgaaagttaa gtatgtcact gagggcatgc gtaaacccgc ctttctaagc 2640
ggagaacaga agaaagcaat agtagatctg ttattcaaga ccaaccgcaa agtgacagtt 2700
aagcaattga aagaggacta ctttaagaaa attgaatgct tcgattctgt cgagatctcc 2760
ggggtagaag atcgatttaa tgcgtcactt ggtacgtatc atgacctcct aaagataatt 2820
aaagataagg acttcctgga taacgaagag aatgaagata tcttagaaga tatagtgttg 2880
actcttaccc tctttgaaga tcgggaaatg attgaggaaa gactaaaaac atacgctcac 2940
ctgttcgacg ataaggttat gaaacagtta aagaggcgtc gctatacggg ctggggacga 3000
ttgtcgcgga aacttatcaa cgggataaga gacaagcaaa gtggtaaaac tattctcgat 3060
tttctaaaga gcgacggctt cgccaatagg aactttatgc agctgatcca tgatgactct 3120
ttaaccttca aagaggatat acaaaaggca caggtttccg gacaagggga ctcattgcac 3180
gaacatattg cgaatcttgc tggttcgcca gccatcaaaa agggcatact ccagacagtc 3240
aaagtagtgg atgagctagt taaggtcatg ggacgtcaca aaccggaaaa cattgtaatc 3300
gagatggcac gcgaaaatca aacgactcag aaggggcaaa aaaacagtcg agagcggatg 3360
aagagaatag aagagggtat taaagaactg ggcagccaga tcttaaagga gcatcctgtg 3420
gaaaataccc aattgcagaa cgagaaactt tacctctatt acctacaaaa tggaagggac 3480
atgtatgttg atcaggaact ggacataaac cgtttatctg attacgacgt cgatcacatt 3540
gtaccccaat cctttttgaa ggacgattca atcgacaata aagtgcttac acgctcggat 3600
aagaaccgag ggaaaagtga caatgttcca agcgaggaag tcgtaaagaa aatgaagaac 3660
tattggcggc agctcctaaa tgcgaaactg ataacgcaaa gaaagttcga taacttaact 3720
aaagctgaga ggggtggctt gtctgaactt gacaaggccg gatttattaa acgtcagctc 3780
gtggaaaccc gccaaatcac aaagcatgtt gcacagatac tagattcccg aatgaatacg 3840
aaatacgacg agaacgataa gctgattcgg gaagtcaaag taatcacttt aaagtcaaaa 3900
ttggtgtcgg acttcagaaa ggattttcaa ttctataaag ttagggagat aaataactac 3960
caccatgcgc acgacgctta tcttaatgcc gtcgtaggga ccgcactcat taagaaatac 4020
ccgaagctag aaagtgagtt tgtgtatggt gattacaaag tttatgacgt ccgtaagatg 4080
atcgcgaaaa gcgaacagga gataggcaag gctacagcca aatacttctt ttattctaac 4140
attatgaatt tctttaagac ggaaatcact ctggcaaacg gagagatacg caaacgacct 4200
ttaattgaaa ccaatgggga gacaggtgaa atcgtatggg ataagggccg ggacttcgcg 4260
acggtgagaa aagttttgtc catgccccaa gtcaacatag taaagaaaac tgaggtgcag 4320
accggagggt tttcaaagga atcgattctt ccaaaaagga atagtgataa gctcatcgct 4380
cgtaaaaagg actgggaccc gaaaaagtac ggtggcttcg atagccctac agttgcctat 4440
tctgtcctag tagtggcaaa agttgagaag ggaaaatcca agaaactgaa gtcagtcaaa 4500
gaattattgg ggataacgat tatggagcgc tcgtcttttg aaaagaaccc catcgacttc 4560
cttgaggcga aaggttacaa ggaagtaaaa aaggatctca taattaaact accaaagtat 4620
agtctgtttg agttagaaaa tggccgaaaa cggatgttgg ctagcgccgg agagcttcaa 4680
aaggggaacg aactcgcact accgtctaaa tacgtgaatt tcctgtattt agcgtcccat 4740
tacgagaagt tgaaaggttc acctgaagat aacgaacaga agcaactttt tgttgagcag 4800
cacaaacatt atctcgacga aatcatagag caaatttcgg aattcagtaa gagagtcatc 4860
ctagctgatg ccaatctgga caaagtatta agcgcataca acaagcacag ggataaaccc 4920
atacgtgagc aggcggaaaa tattatccat ttgtttactc ttaccaacct cggcgctcca 4980
gccgcattca agtattttga cacaacgata gatcgcaaac gatacacttc taccaaggag 5040
gtgctagacg cgacactgat tcaccaatcc atcacgggat tatatgaaac tcggatagat 5100
ttgtcacagc ttgggggtga ctctggtggt tctactaatc tgtcagatat tattgaaaag 5160
gagaccggta agcaactggt tatccaggaa tccatcctca tgctcccaga ggaggtggaa 5220
gaagtcattg ggaacaagcc ggaaagcgat atactcgtgc acaccgccta cgacgagagc 5280
accgacgaga atgtcatgct tctgactagc gacgcccctg aatacaagcc ttgggctctg 5340
gtcatacagg atagcaacgg tgagaacaag attaagatgc tctctggtgg ttctcccaag 5400
aagaagagga aagtctaacc ggtcatcatc accatcacca ttgagtttaa acccgctgat 5460
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 5520
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 5580
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 5640
gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atggcttctg 5700
aggcggaaag aaccagctgg ggctcgatac cgtcgacctc tagctagagc ttggcgtaat 5760
catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatac 5820
gagccggaag cataaagtgt aaagcctagg gtgcctaatg agtgagctaa ctcacattaa 5880
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 5940
gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc 6000
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 6060
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 6120
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 6180
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 6240
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 6300
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 6360
atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 6420
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 6480
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 6540
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 6600
ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 6660
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 6720
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 6780
ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 6840
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 6900
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 6960
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 7020
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 7080
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 7140
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 7200
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 7260
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 7320
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 7380
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 7440
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 7500
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 7560
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 7620
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 7680
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 7740
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 7800
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 7860
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 7920
tcgacggatc gggagatcga tctcccgatc ccctagggtc gactctcagt acaatctgct 7980
ctgatgccgc atagttaagc cagtatctgc tccctgcttg tgtgttggag gtcgctgagt 8040
agtgcgcgag caaaatttaa gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga 8100
atctgcttag ggttaggcgt tttgcgctgc ttcgcgatgt acgggccaga tatacgcgtt 8160
gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 8220
catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 8280
acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 8340
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 8400
aagtgtatc 8409
<210>3
<211>8997
<212>DNA
<213>Artificial Sequence
<220>
<223>4M-BE3
<400>3
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttccttg acccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atccacccac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210>4
<211>8997
<212>DNA
<213>Artificial Sequence
<220>
<223>4M+D128K-BE3
<400>4
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttcctta agccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atccacccac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210>5
<211>8997
<212>DNA
<213>Artificial Sequence
<220>
<223>4M+P199A-BE3
<400>5
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttccttg acccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atgcccccac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210>6
<211>8997
<212>DNA
<213>Artificial Sequence
<220>
<223>4M+P199W-BE3
<400>6
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttccttg acccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg attggcccac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210>7
<211>8997
<212>DNA
<213>Artificial Sequence
<220>
<223>4M+P200A-BE3
<400>7
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttccttg acccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atccagccac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210>8
<211>8997
<212>DNA
<213>Artificial Sequence
<220>
<223>4M+P200K-BE3
<400>8
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttccttg acccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atccaaagac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210>9
<211>8997
<212>DNA
<213>Artificial Sequence
<220>
<223>4M+Q322K-BE3
<400>9
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttccttg acccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atccacccac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg taaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210>10
<211>8997
<212>DNA
<213>Artificial Sequence
<220>
<223>4M+D128K+P199A+P200A-BE3
<400>10
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttcctta agccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atgccaagac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210>11
<211>9429
<212>DNA
<213>Artificial Sequence
<220>
<223>A3G-BE4max
<400>11
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaaacggaca 420
gccgacggaa gcgagttcga gtcaccaaag aagaagcgga aagtcatgaa gcctcacttc 480
agaaacacag tggagcgaat gtatcgagac acattctcct acaactttta taatgcaccc 540
atcctttctc gtcggaatac cgtctggctg tgctacgaag tgaaaacaaa gggtccctca 600
aggccccctt tggacgcaaa gatctttcga ggccaggtgt attccgaact taagtaccac 660
ccagagatga gattcttcca ctggttcagc aagtggagga agctgcatcg tgaccaggag 720
tatgaggtca cctggtacat atcctggagc ccctgcacaa agtgtacaag ggatatggcc 780
acgttcctgg ccgaggaccc gaaggttacc ctgaccatct ttgttgcccg cctctactac 840
ttctgggacc cagattacca ggaggcgctt cgcagcctgt gtcagaaaag agacggtccg 900
cgtgccacca tgaagatcat gaattatgac gaatttcagc actgttggag caagttcgtg 960
tacagccaaa gagagctatt tgagccttgg aataatctgc ctaaatatta tatattactg 1020
cacatcatgc tgggggagat tctcagacac tcgatggatc cacccacatt cactttcaac 1080
tttaacaatg aaccttgggt cagaggacgg catgagactt acctgtgtta tgaggtggag 1140
cgcatgcaca atgacacctg ggtcctgctg aaccagcgca ggggctttct atgcaaccag 1200
gctccacata aacacggttt ccttgaaggc cgccatgcag agctgtgctt cctggacgtg 1260
attccctttt ggaagctgga cctggaccag gactacaggg ttacctgctt cacctcctgg 1320
agcccctgct tcagctgtgc ccaggaaatg gctaaattca tttcaaaaaa caaacacgtg 1380
agcctgtgca tcttcactgc ccgcatctat gatgatcaag gaagatgtca ggaggggctg 1440
cgcaccctgg ccgaggctgg ggccaaaatt tcaataatga catacagtga atttaagcac 1500
tgctgggaca cctttgtgga ccaccaggga tgtcccttcc agccctggga tggactagat 1560
gagcacagcc aagacctgag tgggaggctg cgggccattc tccagaatca ggaaaactct 1620
ggaggatcta gcggaggatc ctctggcagc gagacaccag gaacaagcga gtcagcaaca 1680
ccagagagca gtggcggcag cagcggcggc agcgacaaga agtacagcat cggcctggcc 1740
atcggcacca actctgtggg ctgggccgtg atcaccgacg agtacaaggt gcccagcaag 1800
aaattcaagg tgctgggcaa caccgaccgg cacagcatca agaagaacct gatcggagcc 1860
ctgctgttcg acagcggcga aacagccgag gccacccggc tgaagagaac cgccagaaga 1920
agatacacca gacggaagaa ccggatctgc tatctgcaag agatcttcag caacgagatg 1980
gccaaggtgg acgacagctt cttccacaga ctggaagagt ccttcctggt ggaagaggat 2040
aagaagcacg agcggcaccc catcttcggc aacatcgtgg acgaggtggc ctaccacgag 2100
aagtacccca ccatctacca cctgagaaag aaactggtgg acagcaccga caaggccgac 2160
ctgcggctga tctatctggc cctggcccac atgatcaagt tccggggcca cttcctgatc 2220
gagggcgacc tgaaccccga caacagcgac gtggacaagc tgttcatcca gctggtgcag 2280
acctacaacc agctgttcga ggaaaacccc atcaacgcca gcggcgtgga cgccaaggcc 2340
atcctgtctg ccagactgag caagagcaga cggctggaaa atctgatcgc ccagctgccc 2400
ggcgagaaga agaatggcct gttcggaaac ctgattgccc tgagcctggg cctgaccccc 2460
aacttcaaga gcaacttcga cctggccgag gatgccaaac tgcagctgag caaggacacc 2520
tacgacgacg acctggacaa cctgctggcc cagatcggcg accagtacgc cgacctgttt 2580
ctggccgcca agaacctgtc cgacgccatc ctgctgagcg acatcctgag agtgaacacc 2640
gagatcacca aggcccccct gagcgcctct atgatcaaga gatacgacga gcaccaccag 2700
gacctgaccc tgctgaaagc tctcgtgcgg cagcagctgc ctgagaagta caaagagatt 2760
ttcttcgacc agagcaagaa cggctacgcc ggctacattg acggcggagc cagccaggaa 2820
gagttctaca agttcatcaa gcccatcctg gaaaagatgg acggcaccga ggaactgctc 2880
gtgaagctga acagagagga cctgctgcgg aagcagcgga ccttcgacaa cggcagcatc 2940
ccccaccaga tccacctggg agagctgcac gccattctgc ggcggcagga agatttttac 3000
ccattcctga aggacaaccg ggaaaagatc gagaagatcc tgaccttccg catcccctac 3060
tacgtgggcc ctctggccag gggaaacagc agattcgcct ggatgaccag aaagagcgag 3120
gaaaccatca ccccctggaa cttcgaggaa gtggtggaca agggcgcttc cgcccagagc 3180
ttcatcgagc ggatgaccaa cttcgataag aacctgccca acgagaaggt gctgcccaag 3240
cacagcctgc tgtacgagta cttcaccgtg tataacgagc tgaccaaagt gaaatacgtg 3300
accgagggaa tgagaaagcc cgccttcctg agcggcgagc agaaaaaggc catcgtggac 3360
ctgctgttca agaccaaccg gaaagtgacc gtgaagcagc tgaaagagga ctacttcaag 3420
aaaatcgagt gcttcgactc cgtggaaatc tccggcgtgg aagatcggtt caacgcctcc 3480
ctgggcacat accacgatct gctgaaaatt atcaaggaca aggacttcct ggacaatgag 3540
gaaaacgagg acattctgga agatatcgtg ctgaccctga cactgtttga ggacagagag 3600
atgatcgagg aacggctgaa aacctatgcc cacctgttcg acgacaaagt gatgaagcag 3660
ctgaagcggc ggagatacac cggctggggc aggctgagcc ggaagctgat caacggcatc 3720
cgggacaagc agtccggcaa gacaatcctg gatttcctga agtccgacgg cttcgccaac 3780
agaaacttca tgcagctgat ccacgacgac agcctgacct ttaaagagga catccagaaa 3840
gcccaggtgt ccggccaggg cgatagcctg cacgagcaca ttgccaatct ggccggcagc 3900
cccgccatta agaagggcat cctgcagaca gtgaaggtgg tggacgagct cgtgaaagtg 3960
atgggccggc acaagcccga gaacatcgtg atcgaaatgg ccagagagaa ccagaccacc 4020
cagaagggac agaagaacag ccgcgagaga atgaagcgga tcgaagaggg catcaaagag 4080
ctgggcagcc agatcctgaa agaacacccc gtggaaaaca cccagctgca gaacgagaag 4140
ctgtacctgt actacctgca gaatgggcgg gatatgtacg tggaccagga actggacatc 4200
aaccggctgt ccgactacga tgtggaccat atcgtgcctc agagctttct gaaggacgac 4260
tccatcgaca acaaggtgct gaccagaagc gacaagaacc ggggcaagag cgacaacgtg 4320
ccctccgaag aggtcgtgaa gaagatgaag aactactggc ggcagctgct gaacgccaag 4380
ctgattaccc agagaaagtt cgacaatctg accaaggccg agagaggcgg cctgagcgaa 4440
ctggataagg ccggcttcat caagagacag ctggtggaaa cccggcagat tacaaagcac 4500
gtggcacaga tcctggactc ccggatgaac actaagtacg acgagaatga caagctgatc 4560
cgggaagtga aagtgatcac cctgaagtcc aagctggtgt ccgatttccg gaaggatttc 4620
cagttttaca aagtgcgcga gatcaacaac taccaccacg cccacgacgc ctacctaaac 4680
gccgtcgtgg gaaccgcact gatcaaaaag taccctaagc tggaaagcga gttcgtgtac 4740
ggcgactaca aggtgtacga cgtgcggaag atgatcgcca agagcgagca ggaaatcggc 4800
aaggctaccg ccaagtactt cttctacagc aacatcatga actttttcaa gaccgagatt 4860
accctggcca acggcgagat ccggaagcgg cctctgatcg agacaaacgg cgaaaccggg 4920
gagatcgtgt gggataaggg ccgggatttt gccaccgtgc ggaaagtgct gagcatgccc 4980
caagtgaata tcgtgaaaaa gaccgaggtg cagacaggcg gcttcagcaa agagtctatc 5040
agacccaaga ggaacagcga taagctgatc gccagaaaga aggactggga ccctaagaag 5100
tacggcggct tcgtgagccc caccgtggcc tattctgtgc tggtggtggc caaagtggaa 5160
aagggcaagt ccaagaaact gaagagtgtg aaagagctgc tggggatcac catcatggaa 5220
agaagcagct tcgagaagaa tcccatcgac tttctggaag ccaagggcta caaagaagtg 5280
aaaaaggacc tgatcatcaa gctgcctaag tactccctgt tcgagctgga aaacggccgg 5340
aagagaatgc tggcctctgc cagattcctg cagaagggaa acgaactggc cctgccctcc 5400
aaatatgtga acttcctgta cctggccagc cactatgaga agctgaaggg ctcccccgag 5460
gataatgagc agaaacagct gtttgtggaa cagcacaagc actacctgga cgagatcatc 5520
gagcagatca gcgagttctc caagagagtg atcctggccg acgctaatct ggacaaagtg 5580
ctgtccgcct acaacaagca ccgggataag cccatcagag agcaggccga gaatatcatc 5640
cacctgttta ccctgaccaa tctgggagcc cctagagcct tcaagtactt tgacaccacc 5700
atcgaccgga aggtgtacag aagcaccaaa gaggtgctgg acgccaccct gatccaccag 5760
agcatcaccg gcctgtacga gacacggatc gacctgtctc agctgggagg tgacagcggc 5820
gggagcggcg ggagcggggg gagcactaat ctgagcgaca tcattgagaa ggagactggg 5880
aaacagctgg tcattcagga gtccatcctg atgctgcctg aggaggtgga ggaagtgatc 5940
ggcaacaagc cagagtctga catcctggtg cacaccgcct acgacgagtc cacagatgag 6000
aatgtgatgc tgctgacctc tgacgccccc gagtataagc cttgggccct ggtcatccag 6060
gattctaacg gcgagaataa gatcaagatg ctgagcggag gatccggagg atctggaggc 6120
agcaccaacc tgtctgacat catcgagaag gagacaggca agcagctggt catccaggag 6180
agcatcctga tgctgcccga agaagtcgaa gaagtgatcg gaaacaagcc tgagagcgat 6240
atcctggtcc ataccgccta cgacgagagt accgacgaaa atgtgatgct gctgacatcc 6300
gacgccccag agtataagcc ctgggctctg gtcatccagg attccaacgg agagaacaaa 6360
atcaaaatgc tgtctggcgg ctcaaaaaga accgccgacg gcagcgaatt cgagcccaag 6420
aagaagagga aagtctaacc ggtcatcatc accatcacca ttgagtttaa acccgctgat 6480
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 6540
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 6600
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 6660
gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atggcttctg 6720
aggcggaaag aaccagctgg ggctcgatac cgtcgacctc tagctagagc ttggcgtaat 6780
catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatac 6840
gagccggaag cataaagtgt aaagcctagg atgcctaatg agtgagctaa ctcacattaa 6900
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 6960
gaatcggcca acgcgcggga agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc 7020
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 7080
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 7140
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 7200
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 7260
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 7320
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 7380
atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 7440
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 7500
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 7560
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 7620
ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 7680
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 7740
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 7800
ggtctgacac tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 7860
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 7920
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 7980
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 8040
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 8100
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 8160
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 8220
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 8280
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 8340
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 8400
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 8460
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 8520
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 8580
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 8640
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 8700
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 8760
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 8820
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 8880
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 8940
tcgacggatc gggagatcga tctcccgatc ccctagggtc gactctcagt acaatctgct 9000
ctgatgccgc atagttaagc cagtatctgc tccctgcttg tgtgttggag gtcgctgagt 9060
agtgcgcgag caaaatttaa gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga 9120
atctgcttag ggttaggcgt tttgcgctgc ttcgcgatgt acgggccaga tatacgcgtt 9180
gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 9240
catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 9300
acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 9360
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 9420
aagtgtatc 9429
<210>12
<211>9429
<212>DNA
<213>Artificial Sequence
<220>
<223>4M+P199A+P200K-BE4max
<400>12
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaaacggaca 420
gccgacggaa gcgagttcga gtcaccaaag aagaagcgga aagtcatgaa gcctcacttc 480
agaaacacag tggagcgaat gtatcgagac acattctcct acaactttta taatgcaccc 540
atcctttctc gtcggaatac cgtctggctg tgctacgaag tgaaaacaaa gggtccctca 600
aggccccctt tggacgcaaa gatctttcga ggccaggtgt attccgaact taagtaccac 660
ccagagatga gattcttcca ctggttcagc aagtggagga agctgcatcg tgaccaggag 720
tatgaggtca cctggtacat atccttgagc ccctgcacaa agtgtacaag ggatatggcc 780
acgttcctgg ccgaggaccc gaaggttacc ctgaccatct ttgttgcccg cctcgcctac 840
ttccttgacc cagattacca ggaggcgctt cgcagcctgt gtcagaaaag agacggtccg 900
cgtgccacca tgaagatcat gaattatgac gaatttcagc actgttggag caagttcgtg 960
tacagccaaa gagagctatt tgagccttgg aataatctgc ctaaatatta tatattactg 1020
cacatcatgc tgggggagat tctcagacac tcgatggatg ccaagacatt cactttcaac 1080
tttaacaatg aaccttgggt cagaggacgg catgagactt acctgtgtta tgaggtggag 1140
cgcatgcaca atgacacctg ggtcctgctg aaccagcgca ggggctttct atgcaaccag 1200
gctccacata aacacggttt ccttgaaggc cgccatgcag agctgtgctt cctggacgtg 1260
attccctttt ggaagctgga cctggaccag gactacaggg ttacctgctt cacctcctgg 1320
agcccctgct tcagctgtgc ccaggaaatg gctaaattca tttcaaaaaa caaacacgtg 1380
agcctgtgca tcttcactgc ccgcatctat gatgatcaag gaagatgtca ggaggggctg 1440
cgcaccctgg ccgaggctgg ggccaaaatt tcaataatga catacagtga atttaagcac 1500
tgctgggaca cctttgtgga ccaccaggga tgtcccttcc agccctggga tggactagat 1560
gagcacagcc aagacctgag tgggaggctg cgggccattc tccagaatca ggaaaactct 1620
ggaggatcta gcggaggatc ctctggcagc gagacaccag gaacaagcga gtcagcaaca 1680
ccagagagca gtggcggcag cagcggcggc agcgacaaga agtacagcat cggcctggcc 1740
atcggcacca actctgtggg ctgggccgtg atcaccgacg agtacaaggt gcccagcaag 1800
aaattcaagg tgctgggcaa caccgaccgg cacagcatca agaagaacct gatcggagcc 1860
ctgctgttcg acagcggcga aacagccgag gccacccggc tgaagagaac cgccagaaga 1920
agatacacca gacggaagaa ccggatctgc tatctgcaag agatcttcag caacgagatg 1980
gccaaggtgg acgacagctt cttccacaga ctggaagagt ccttcctggt ggaagaggat 2040
aagaagcacg agcggcaccc catcttcggc aacatcgtgg acgaggtggc ctaccacgag 2100
aagtacccca ccatctacca cctgagaaag aaactggtgg acagcaccga caaggccgac 2160
ctgcggctga tctatctggc cctggcccac atgatcaagt tccggggcca cttcctgatc 2220
gagggcgacc tgaaccccga caacagcgac gtggacaagc tgttcatcca gctggtgcag 2280
acctacaacc agctgttcga ggaaaacccc atcaacgcca gcggcgtgga cgccaaggcc 2340
atcctgtctg ccagactgag caagagcaga cggctggaaa atctgatcgc ccagctgccc 2400
ggcgagaaga agaatggcct gttcggaaac ctgattgccc tgagcctggg cctgaccccc 2460
aacttcaaga gcaacttcga cctggccgag gatgccaaac tgcagctgag caaggacacc 2520
tacgacgacg acctggacaa cctgctggcc cagatcggcg accagtacgc cgacctgttt 2580
ctggccgcca agaacctgtc cgacgccatc ctgctgagcg acatcctgag agtgaacacc 2640
gagatcacca aggcccccct gagcgcctct atgatcaaga gatacgacga gcaccaccag 2700
gacctgaccc tgctgaaagc tctcgtgcgg cagcagctgc ctgagaagta caaagagatt 2760
ttcttcgacc agagcaagaa cggctacgcc ggctacattg acggcggagc cagccaggaa 2820
gagttctaca agttcatcaa gcccatcctg gaaaagatgg acggcaccga ggaactgctc 2880
gtgaagctga acagagagga cctgctgcgg aagcagcgga ccttcgacaa cggcagcatc 2940
ccccaccaga tccacctggg agagctgcac gccattctgc ggcggcagga agatttttac 3000
ccattcctga aggacaaccg ggaaaagatc gagaagatcc tgaccttccg catcccctac 3060
tacgtgggcc ctctggccag gggaaacagc agattcgcct ggatgaccag aaagagcgag 3120
gaaaccatca ccccctggaa cttcgaggaa gtggtggaca agggcgcttc cgcccagagc 3180
ttcatcgagc ggatgaccaa cttcgataag aacctgccca acgagaaggt gctgcccaag 3240
cacagcctgc tgtacgagta cttcaccgtg tataacgagc tgaccaaagt gaaatacgtg 3300
accgagggaa tgagaaagcc cgccttcctg agcggcgagc agaaaaaggc catcgtggac 3360
ctgctgttca agaccaaccg gaaagtgacc gtgaagcagc tgaaagagga ctacttcaag 3420
aaaatcgagt gcttcgactc cgtggaaatc tccggcgtgg aagatcggtt caacgcctcc 3480
ctgggcacat accacgatct gctgaaaatt atcaaggaca aggacttcct ggacaatgag 3540
gaaaacgagg acattctgga agatatcgtg ctgaccctga cactgtttga ggacagagag 3600
atgatcgagg aacggctgaa aacctatgcc cacctgttcg acgacaaagt gatgaagcag 3660
ctgaagcggc ggagatacac cggctggggc aggctgagcc ggaagctgat caacggcatc 3720
cgggacaagc agtccggcaa gacaatcctg gatttcctga agtccgacgg cttcgccaac 3780
agaaacttca tgcagctgat ccacgacgac agcctgacct ttaaagagga catccagaaa 3840
gcccaggtgt ccggccaggg cgatagcctg cacgagcaca ttgccaatct ggccggcagc 3900
cccgccatta agaagggcat cctgcagaca gtgaaggtgg tggacgagct cgtgaaagtg 3960
atgggccggc acaagcccga gaacatcgtg atcgaaatgg ccagagagaa ccagaccacc 4020
cagaagggac agaagaacag ccgcgagaga atgaagcgga tcgaagaggg catcaaagag 4080
ctgggcagcc agatcctgaa agaacacccc gtggaaaaca cccagctgca gaacgagaag 4140
ctgtacctgt actacctgca gaatgggcgg gatatgtacg tggaccagga actggacatc 4200
aaccggctgt ccgactacga tgtggaccat atcgtgcctc agagctttct gaaggacgac 4260
tccatcgaca acaaggtgct gaccagaagc gacaagaacc ggggcaagag cgacaacgtg 4320
ccctccgaag aggtcgtgaa gaagatgaag aactactggc ggcagctgct gaacgccaag 4380
ctgattaccc agagaaagtt cgacaatctg accaaggccg agagaggcgg cctgagcgaa 4440
ctggataagg ccggcttcat caagagacag ctggtggaaa cccggcagat tacaaagcac 4500
gtggcacaga tcctggactc ccggatgaac actaagtacg acgagaatga caagctgatc 4560
cgggaagtga aagtgatcac cctgaagtcc aagctggtgt ccgatttccg gaaggatttc 4620
cagttttaca aagtgcgcga gatcaacaac taccaccacg cccacgacgc ctacctaaac 4680
gccgtcgtgg gaaccgcact gatcaaaaag taccctaagc tggaaagcga gttcgtgtac 4740
ggcgactaca aggtgtacga cgtgcggaag atgatcgcca agagcgagca ggaaatcggc 4800
aaggctaccg ccaagtactt cttctacagc aacatcatga actttttcaa gaccgagatt 4860
accctggcca acggcgagat ccggaagcgg cctctgatcg agacaaacgg cgaaaccggg 4920
gagatcgtgt gggataaggg ccgggatttt gccaccgtgc ggaaagtgct gagcatgccc 4980
caagtgaata tcgtgaaaaa gaccgaggtg cagacaggcg gcttcagcaa agagtctatc 5040
agacccaaga ggaacagcga taagctgatc gccagaaaga aggactggga ccctaagaag 5100
tacggcggct tcgtgagccc caccgtggcc tattctgtgc tggtggtggc caaagtggaa 5160
aagggcaagt ccaagaaact gaagagtgtg aaagagctgc tggggatcac catcatggaa 5220
agaagcagct tcgagaagaa tcccatcgac tttctggaag ccaagggcta caaagaagtg 5280
aaaaaggacc tgatcatcaa gctgcctaag tactccctgt tcgagctgga aaacggccgg 5340
aagagaatgc tggcctctgc cagattcctg cagaagggaa acgaactggc cctgccctcc 5400
aaatatgtga acttcctgta cctggccagc cactatgaga agctgaaggg ctcccccgag 5460
gataatgagc agaaacagct gtttgtggaa cagcacaagc actacctgga cgagatcatc 5520
gagcagatca gcgagttctc caagagagtg atcctggccg acgctaatct ggacaaagtg 5580
ctgtccgcct acaacaagca ccgggataag cccatcagag agcaggccga gaatatcatc 5640
cacctgttta ccctgaccaa tctgggagcc cctagagcct tcaagtactt tgacaccacc 5700
atcgaccgga aggtgtacag aagcaccaaa gaggtgctgg acgccaccct gatccaccag 5760
agcatcaccg gcctgtacga gacacggatc gacctgtctc agctgggagg tgacagcggc 5820
gggagcggcg ggagcggggg gagcactaat ctgagcgaca tcattgagaa ggagactggg 5880
aaacagctgg tcattcagga gtccatcctg atgctgcctg aggaggtgga ggaagtgatc 5940
ggcaacaagc cagagtctga catcctggtg cacaccgcct acgacgagtc cacagatgag 6000
aatgtgatgc tgctgacctc tgacgccccc gagtataagc cttgggccct ggtcatccag 6060
gattctaacg gcgagaataa gatcaagatg ctgagcggag gatccggagg atctggaggc 6120
agcaccaacc tgtctgacat catcgagaag gagacaggca agcagctggt catccaggag 6180
agcatcctga tgctgcccga agaagtcgaa gaagtgatcg gaaacaagcc tgagagcgat 6240
atcctggtcc ataccgccta cgacgagagt accgacgaaa atgtgatgct gctgacatcc 6300
gacgccccag agtataagcc ctgggctctg gtcatccagg attccaacgg agagaacaaa 6360
atcaaaatgc tgtctggcgg ctcaaaaaga accgccgacg gcagcgaatt cgagcccaag 6420
aagaagagga aagtctaacc ggtcatcatc accatcacca ttgagtttaa acccgctgat 6480
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 6540
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 6600
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 6660
gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atggcttctg 6720
aggcggaaag aaccagctgg ggctcgatac cgtcgacctc tagctagagc ttggcgtaat 6780
catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatac 6840
gagccggaag cataaagtgt aaagcctagg atgcctaatg agtgagctaa ctcacattaa 6900
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 6960
gaatcggcca acgcgcggga agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc 7020
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 7080
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 7140
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 7200
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 7260
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 7320
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 7380
atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 7440
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 7500
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 7560
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 7620
ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 7680
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 7740
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 7800
ggtctgacac tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 7860
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 7920
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 7980
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 8040
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 8100
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 8160
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 8220
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 8280
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 8340
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 8400
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 8460
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 8520
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 8580
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 8640
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 8700
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 8760
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 8820
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 8880
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 8940
tcgacggatc gggagatcga tctcccgatc ccctagggtc gactctcagt acaatctgct 9000
ctgatgccgc atagttaagc cagtatctgc tccctgcttg tgtgttggag gtcgctgagt 9060
agtgcgcgag caaaatttaa gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga 9120
atctgcttag ggttaggcgt tttgcgctgc ttcgcgatgt acgggccaga tatacgcgtt 9180
gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 9240
catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 9300
acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 9360
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 9420
aagtgtatc 9429
<210>13
<211>9429
<212>DNA
<213>Artificial Sequence
<220>
<223>4M+D128K+P199A+P200K-BE4max
<400>13
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaaacggaca 420
gccgacggaa gcgagttcga gtcaccaaag aagaagcgga aagtcatgaa gcctcacttc 480
agaaacacag tggagcgaat gtatcgagac acattctcct acaactttta taatgcaccc 540
atcctttctc gtcggaatac cgtctggctg tgctacgaag tgaaaacaaa gggtccctca 600
aggccccctt tggacgcaaa gatctttcga ggccaggtgt attccgaact taagtaccac 660
ccagagatga gattcttcca ctggttcagc aagtggagga agctgcatcg tgaccaggag 720
tatgaggtca cctggtacat atccttgagc ccctgcacaa agtgtacaag ggatatggcc 780
acgttcctgg ccgaggaccc gaaggttacc ctgaccatct ttgttgcccg cctcgcctac 840
ttccttaagc cagattacca ggaggcgctt cgcagcctgt gtcagaaaag agacggtccg 900
cgtgccacca tgaagatcat gaattatgac gaatttcagc actgttggag caagttcgtg 960
tacagccaaa gagagctatt tgagccttgg aataatctgc ctaaatatta tatattactg 1020
cacatcatgc tgggggagat tctcagacac tcgatggatg ccaagacatt cactttcaac 1080
tttaacaatg aaccttgggt cagaggacgg catgagactt acctgtgtta tgaggtggag 1140
cgcatgcaca atgacacctg ggtcctgctg aaccagcgca ggggctttct atgcaaccag 1200
gctccacata aacacggttt ccttgaaggc cgccatgcag agctgtgctt cctggacgtg 1260
attccctttt ggaagctgga cctggaccag gactacaggg ttacctgctt cacctcctgg 1320
agcccctgct tcagctgtgc ccaggaaatg gctaaattca tttcaaaaaa caaacacgtg 1380
agcctgtgca tcttcactgc ccgcatctat gatgatcaag gaagatgtca ggaggggctg 1440
cgcaccctgg ccgaggctgg ggccaaaatt tcaataatga catacagtga atttaagcac 1500
tgctgggaca cctttgtgga ccaccaggga tgtcccttcc agccctggga tggactagat 1560
gagcacagcc aagacctgag tgggaggctg cgggccattc tccagaatca ggaaaactct 1620
ggaggatcta gcggaggatc ctctggcagc gagacaccag gaacaagcga gtcagcaaca 1680
ccagagagca gtggcggcag cagcggcggc agcgacaaga agtacagcat cggcctggcc 1740
atcggcacca actctgtggg ctgggccgtg atcaccgacg agtacaaggt gcccagcaag 1800
aaattcaagg tgctgggcaa caccgaccgg cacagcatca agaagaacct gatcggagcc 1860
ctgctgttcg acagcggcga aacagccgag gccacccggc tgaagagaac cgccagaaga 1920
agatacacca gacggaagaa ccggatctgc tatctgcaag agatcttcag caacgagatg 1980
gccaaggtgg acgacagctt cttccacaga ctggaagagt ccttcctggt ggaagaggat 2040
aagaagcacg agcggcaccc catcttcggc aacatcgtgg acgaggtggc ctaccacgag 2100
aagtacccca ccatctacca cctgagaaag aaactggtgg acagcaccga caaggccgac 2160
ctgcggctga tctatctggc cctggcccac atgatcaagt tccggggcca cttcctgatc 2220
gagggcgacc tgaaccccga caacagcgac gtggacaagc tgttcatcca gctggtgcag 2280
acctacaacc agctgttcga ggaaaacccc atcaacgcca gcggcgtgga cgccaaggcc 2340
atcctgtctg ccagactgag caagagcaga cggctggaaa atctgatcgc ccagctgccc 2400
ggcgagaaga agaatggcct gttcggaaac ctgattgccc tgagcctggg cctgaccccc 2460
aacttcaaga gcaacttcga cctggccgag gatgccaaac tgcagctgag caaggacacc 2520
tacgacgacg acctggacaa cctgctggcc cagatcggcg accagtacgc cgacctgttt 2580
ctggccgcca agaacctgtc cgacgccatc ctgctgagcg acatcctgag agtgaacacc 2640
gagatcacca aggcccccct gagcgcctct atgatcaaga gatacgacga gcaccaccag 2700
gacctgaccc tgctgaaagc tctcgtgcgg cagcagctgc ctgagaagta caaagagatt 2760
ttcttcgacc agagcaagaa cggctacgcc ggctacattg acggcggagc cagccaggaa 2820
gagttctaca agttcatcaa gcccatcctg gaaaagatgg acggcaccga ggaactgctc 2880
gtgaagctga acagagagga cctgctgcgg aagcagcgga ccttcgacaa cggcagcatc 2940
ccccaccaga tccacctggg agagctgcac gccattctgc ggcggcagga agatttttac 3000
ccattcctga aggacaaccg ggaaaagatc gagaagatcc tgaccttccg catcccctac 3060
tacgtgggcc ctctggccag gggaaacagc agattcgcct ggatgaccag aaagagcgag 3120
gaaaccatca ccccctggaa cttcgaggaa gtggtggaca agggcgcttc cgcccagagc 3180
ttcatcgagc ggatgaccaa cttcgataag aacctgccca acgagaaggt gctgcccaag 3240
cacagcctgc tgtacgagta cttcaccgtg tataacgagc tgaccaaagt gaaatacgtg 3300
accgagggaa tgagaaagcc cgccttcctg agcggcgagc agaaaaaggc catcgtggac 3360
ctgctgttca agaccaaccg gaaagtgacc gtgaagcagc tgaaagagga ctacttcaag 3420
aaaatcgagt gcttcgactc cgtggaaatc tccggcgtgg aagatcggtt caacgcctcc 3480
ctgggcacat accacgatct gctgaaaatt atcaaggaca aggacttcct ggacaatgag 3540
gaaaacgagg acattctgga agatatcgtg ctgaccctga cactgtttga ggacagagag 3600
atgatcgagg aacggctgaa aacctatgcc cacctgttcg acgacaaagt gatgaagcag 3660
ctgaagcggc ggagatacac cggctggggc aggctgagcc ggaagctgat caacggcatc 3720
cgggacaagc agtccggcaa gacaatcctg gatttcctga agtccgacgg cttcgccaac 3780
agaaacttca tgcagctgat ccacgacgac agcctgacct ttaaagagga catccagaaa 3840
gcccaggtgt ccggccaggg cgatagcctg cacgagcaca ttgccaatct ggccggcagc 3900
cccgccatta agaagggcat cctgcagaca gtgaaggtgg tggacgagct cgtgaaagtg 3960
atgggccggc acaagcccga gaacatcgtg atcgaaatgg ccagagagaa ccagaccacc 4020
cagaagggac agaagaacag ccgcgagaga atgaagcgga tcgaagaggg catcaaagag 4080
ctgggcagcc agatcctgaa agaacacccc gtggaaaaca cccagctgca gaacgagaag 4140
ctgtacctgt actacctgca gaatgggcgg gatatgtacg tggaccagga actggacatc 4200
aaccggctgt ccgactacga tgtggaccat atcgtgcctc agagctttct gaaggacgac 4260
tccatcgaca acaaggtgct gaccagaagc gacaagaacc ggggcaagag cgacaacgtg 4320
ccctccgaag aggtcgtgaa gaagatgaag aactactggc ggcagctgct gaacgccaag 4380
ctgattaccc agagaaagtt cgacaatctg accaaggccg agagaggcgg cctgagcgaa 4440
ctggataagg ccggcttcat caagagacag ctggtggaaa cccggcagat tacaaagcac 4500
gtggcacaga tcctggactc ccggatgaac actaagtacg acgagaatga caagctgatc 4560
cgggaagtga aagtgatcac cctgaagtcc aagctggtgt ccgatttccg gaaggatttc 4620
cagttttaca aagtgcgcga gatcaacaac taccaccacg cccacgacgc ctacctaaac 4680
gccgtcgtgg gaaccgcact gatcaaaaag taccctaagc tggaaagcga gttcgtgtac 4740
ggcgactaca aggtgtacga cgtgcggaag atgatcgcca agagcgagca ggaaatcggc 4800
aaggctaccg ccaagtactt cttctacagc aacatcatga actttttcaa gaccgagatt 4860
accctggcca acggcgagat ccggaagcgg cctctgatcg agacaaacgg cgaaaccggg 4920
gagatcgtgt gggataaggg ccgggatttt gccaccgtgc ggaaagtgct gagcatgccc 4980
caagtgaata tcgtgaaaaa gaccgaggtg cagacaggcg gcttcagcaa agagtctatc 5040
agacccaaga ggaacagcga taagctgatc gccagaaaga aggactggga ccctaagaag 5100
tacggcggct tcgtgagccc caccgtggcc tattctgtgc tggtggtggc caaagtggaa 5160
aagggcaagt ccaagaaact gaagagtgtg aaagagctgc tggggatcac catcatggaa 5220
agaagcagct tcgagaagaa tcccatcgac tttctggaag ccaagggcta caaagaagtg 5280
aaaaaggacc tgatcatcaa gctgcctaag tactccctgt tcgagctgga aaacggccgg 5340
aagagaatgc tggcctctgc cagattcctg cagaagggaa acgaactggc cctgccctcc 5400
aaatatgtga acttcctgta cctggccagc cactatgaga agctgaaggg ctcccccgag 5460
gataatgagc agaaacagct gtttgtggaa cagcacaagc actacctgga cgagatcatc 5520
gagcagatca gcgagttctc caagagagtg atcctggccg acgctaatct ggacaaagtg 5580
ctgtccgcct acaacaagca ccgggataag cccatcagag agcaggccga gaatatcatc 5640
cacctgttta ccctgaccaa tctgggagcc cctagagcct tcaagtactt tgacaccacc 5700
atcgaccgga aggtgtacag aagcaccaaa gaggtgctgg acgccaccct gatccaccag 5760
agcatcaccg gcctgtacga gacacggatc gacctgtctc agctgggagg tgacagcggc 5820
gggagcggcg ggagcggggg gagcactaat ctgagcgaca tcattgagaa ggagactggg 5880
aaacagctgg tcattcagga gtccatcctg atgctgcctg aggaggtgga ggaagtgatc 5940
ggcaacaagc cagagtctga catcctggtg cacaccgcct acgacgagtc cacagatgag 6000
aatgtgatgc tgctgacctc tgacgccccc gagtataagc cttgggccct ggtcatccag 6060
gattctaacg gcgagaataa gatcaagatg ctgagcggag gatccggagg atctggaggc 6120
agcaccaacc tgtctgacat catcgagaag gagacaggca agcagctggt catccaggag 6180
agcatcctga tgctgcccga agaagtcgaa gaagtgatcg gaaacaagcc tgagagcgat 6240
atcctggtcc ataccgccta cgacgagagt accgacgaaa atgtgatgct gctgacatcc 6300
gacgccccag agtataagcc ctgggctctg gtcatccagg attccaacgg agagaacaaa 6360
atcaaaatgc tgtctggcgg ctcaaaaaga accgccgacg gcagcgaatt cgagcccaag 6420
aagaagagga aagtctaacc ggtcatcatc accatcacca ttgagtttaa acccgctgat 6480
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 6540
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 6600
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 6660
gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atggcttctg 6720
aggcggaaag aaccagctgg ggctcgatac cgtcgacctc tagctagagc ttggcgtaat 6780
catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatac 6840
gagccggaag cataaagtgt aaagcctagg atgcctaatg agtgagctaa ctcacattaa 6900
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 6960
gaatcggcca acgcgcggga agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc 7020
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 7080
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 7140
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 7200
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 7260
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 7320
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 7380
atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 7440
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 7500
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 7560
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 7620
ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 7680
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 7740
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 7800
ggtctgacac tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 7860
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 7920
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 7980
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 8040
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 8100
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 8160
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 8220
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 8280
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 8340
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 8400
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 8460
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 8520
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 8580
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 8640
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 8700
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 8760
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 8820
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 8880
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 8940
tcgacggatc gggagatcga tctcccgatc ccctagggtc gactctcagt acaatctgct 9000
ctgatgccgc atagttaagc cagtatctgc tccctgcttg tgtgttggag gtcgctgagt 9060
agtgcgcgag caaaatttaa gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga 9120
atctgcttag ggttaggcgt tttgcgctgc ttcgcgatgt acgggccaga tatacgcgtt 9180
gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 9240
catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 9300
acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 9360
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 9420
aagtgtatc 9429
<210>14
<211>9429
<212>DNA
<213>Artificial Sequence
<220>
<223>A3G(OP)+P199A+P200K-BE4max
<400>14
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaaacggaca 420
gccgacggaa gcgagttcga gtcaccaaag aagaagcgga aagtcatgaa gccccacttt 480
cggaacaccg tggagcggat gtacagagat accttcagct acaacttcta taatagacct 540
atcctgtccc ggagaaatac cgtgtggctg tgctatgagg tgaagacaaa gggcccatct 600
cggccccctc tggatgccaa gatctttaga ggccaggtgt acagcgagct gaagtatcac 660
cctgagatga ggttctttca ctggttctcc aagtggagga agctgcaccg cgaccaggag 720
tacgaggtga cctggtatat cagctggtcc ccctgcacca agtgtacacg cgatatggcc 780
acatttctgg ccgaggaccc taaggtgacc ctgacaatct ttgtggccag gctgtactat 840
ttccgggacc cagattacca ggaggccctg cgctctctgt gccagaagcg ggatggcccc 900
agagccacca tgaagatcat gaactacgac gagtttcagc actgttggag caagttcgtg 960
tattcccagc gggagctgtt cgagccttgg aacaatctgc caaagtacta tatcctgctg 1020
cacatcatgc tgggcgagat cctgagacac agcatggatg ccaagacctt caccttcaac 1080
ttcaacaatg agccatgggt gcggggcaga cacgagacct acctgtgcta tgaggtggag 1140
cggatgcaca acgacacatg ggtgctgctg aatcagaggc gcggctttct gtgcaatcag 1200
gcaccacaca agcacggctt cctggagggc aggcacgcag agctgtgctt cctggatgtg 1260
atccctttct ggaagctgga cctggatcag gactaccgcg tgacctgttt tacatcttgg 1320
agcccatgct tctcctgtgc ccaggagatg gccaagttta tctccaagaa taagcacgtg 1380
tctctgtgca tcttcaccgc caggatctac gacgatcagg gcaggtgtca ggagggactg 1440
cgcacactgg cagaggcagg agccaagatc tctatcatga cctatagcga gtttaagcac 1500
tgctgggata cattcgtgga ccaccagggc tgtccattcc agccctggga tggcctggac 1560
gagcactccc aggacctgtc tggcaggctg agggccatcc tgcagaacca ggagaattct 1620
ggaggatcta gcggaggatc ctctggcagc gagacaccag gaacaagcga gtcagcaaca 1680
ccagagagca gtggcggcag cagcggcggc agcgacaaga agtacagcat cggcctggcc 1740
atcggcacca actctgtggg ctgggccgtg atcaccgacg agtacaaggt gcccagcaag 1800
aaattcaagg tgctgggcaa caccgaccgg cacagcatca agaagaacct gatcggagcc 1860
ctgctgttcg acagcggcga aacagccgag gccacccggc tgaagagaac cgccagaaga 1920
agatacacca gacggaagaa ccggatctgc tatctgcaag agatcttcag caacgagatg 1980
gccaaggtgg acgacagctt cttccacaga ctggaagagt ccttcctggt ggaagaggat 2040
aagaagcacg agcggcaccc catcttcggc aacatcgtgg acgaggtggc ctaccacgag 2100
aagtacccca ccatctacca cctgagaaag aaactggtgg acagcaccga caaggccgac 2160
ctgcggctga tctatctggc cctggcccac atgatcaagt tccggggcca cttcctgatc 2220
gagggcgacc tgaaccccga caacagcgac gtggacaagc tgttcatcca gctggtgcag 2280
acctacaacc agctgttcga ggaaaacccc atcaacgcca gcggcgtgga cgccaaggcc 2340
atcctgtctg ccagactgag caagagcaga cggctggaaa atctgatcgc ccagctgccc 2400
ggcgagaaga agaatggcct gttcggaaac ctgattgccc tgagcctggg cctgaccccc 2460
aacttcaaga gcaacttcga cctggccgag gatgccaaac tgcagctgag caaggacacc 2520
tacgacgacg acctggacaa cctgctggcc cagatcggcg accagtacgc cgacctgttt 2580
ctggccgcca agaacctgtc cgacgccatc ctgctgagcg acatcctgag agtgaacacc 2640
gagatcacca aggcccccct gagcgcctct atgatcaaga gatacgacga gcaccaccag 2700
gacctgaccc tgctgaaagc tctcgtgcgg cagcagctgc ctgagaagta caaagagatt 2760
ttcttcgacc agagcaagaa cggctacgcc ggctacattg acggcggagc cagccaggaa 2820
gagttctaca agttcatcaa gcccatcctg gaaaagatgg acggcaccga ggaactgctc 2880
gtgaagctga acagagagga cctgctgcgg aagcagcgga ccttcgacaa cggcagcatc 2940
ccccaccaga tccacctggg agagctgcac gccattctgc ggcggcagga agatttttac 3000
ccattcctga aggacaaccg ggaaaagatc gagaagatcc tgaccttccg catcccctac 3060
tacgtgggcc ctctggccag gggaaacagc agattcgcct ggatgaccag aaagagcgag 3120
gaaaccatca ccccctggaa cttcgaggaa gtggtggaca agggcgcttc cgcccagagc 3180
ttcatcgagc ggatgaccaa cttcgataag aacctgccca acgagaaggt gctgcccaag 3240
cacagcctgc tgtacgagta cttcaccgtg tataacgagc tgaccaaagt gaaatacgtg 3300
accgagggaa tgagaaagcc cgccttcctg agcggcgagc agaaaaaggc catcgtggac 3360
ctgctgttca agaccaaccg gaaagtgacc gtgaagcagc tgaaagagga ctacttcaag 3420
aaaatcgagt gcttcgactc cgtggaaatc tccggcgtgg aagatcggtt caacgcctcc 3480
ctgggcacat accacgatct gctgaaaatt atcaaggaca aggacttcct ggacaatgag 3540
gaaaacgagg acattctgga agatatcgtg ctgaccctga cactgtttga ggacagagag 3600
atgatcgagg aacggctgaa aacctatgcc cacctgttcg acgacaaagt gatgaagcag 3660
ctgaagcggc ggagatacac cggctggggc aggctgagcc ggaagctgat caacggcatc 3720
cgggacaagc agtccggcaa gacaatcctg gatttcctga agtccgacgg cttcgccaac 3780
agaaacttca tgcagctgat ccacgacgac agcctgacct ttaaagagga catccagaaa 3840
gcccaggtgt ccggccaggg cgatagcctg cacgagcaca ttgccaatct ggccggcagc 3900
cccgccatta agaagggcat cctgcagaca gtgaaggtgg tggacgagct cgtgaaagtg 3960
atgggccggc acaagcccga gaacatcgtg atcgaaatgg ccagagagaa ccagaccacc 4020
cagaagggac agaagaacag ccgcgagaga atgaagcgga tcgaagaggg catcaaagag 4080
ctgggcagcc agatcctgaa agaacacccc gtggaaaaca cccagctgca gaacgagaag 4140
ctgtacctgt actacctgca gaatgggcgg gatatgtacg tggaccagga actggacatc 4200
aaccggctgt ccgactacga tgtggaccat atcgtgcctc agagctttct gaaggacgac 4260
tccatcgaca acaaggtgct gaccagaagc gacaagaacc ggggcaagag cgacaacgtg 4320
ccctccgaag aggtcgtgaa gaagatgaag aactactggc ggcagctgct gaacgccaag 4380
ctgattaccc agagaaagtt cgacaatctg accaaggccg agagaggcgg cctgagcgaa 4440
ctggataagg ccggcttcat caagagacag ctggtggaaa cccggcagat tacaaagcac 4500
gtggcacaga tcctggactc ccggatgaac actaagtacg acgagaatga caagctgatc 4560
cgggaagtga aagtgatcac cctgaagtcc aagctggtgt ccgatttccg gaaggatttc 4620
cagttttaca aagtgcgcga gatcaacaac taccaccacg cccacgacgc ctacctaaac 4680
gccgtcgtgg gaaccgcact gatcaaaaag taccctaagc tggaaagcga gttcgtgtac 4740
ggcgactaca aggtgtacga cgtgcggaag atgatcgcca agagcgagca ggaaatcggc 4800
aaggctaccg ccaagtactt cttctacagc aacatcatga actttttcaa gaccgagatt 4860
accctggcca acggcgagat ccggaagcgg cctctgatcg agacaaacgg cgaaaccggg 4920
gagatcgtgt gggataaggg ccgggatttt gccaccgtgc ggaaagtgct gagcatgccc 4980
caagtgaata tcgtgaaaaa gaccgaggtg cagacaggcg gcttcagcaa agagtctatc 5040
agacccaaga ggaacagcga taagctgatc gccagaaaga aggactggga ccctaagaag 5100
tacggcggct tcgtgagccc caccgtggcc tattctgtgc tggtggtggc caaagtggaa 5160
aagggcaagt ccaagaaact gaagagtgtg aaagagctgc tggggatcac catcatggaa 5220
agaagcagct tcgagaagaa tcccatcgac tttctggaag ccaagggcta caaagaagtg 5280
aaaaaggacc tgatcatcaa gctgcctaag tactccctgt tcgagctgga aaacggccgg 5340
aagagaatgc tggcctctgc cagattcctg cagaagggaa acgaactggc cctgccctcc 5400
aaatatgtga acttcctgta cctggccagc cactatgaga agctgaaggg ctcccccgag 5460
gataatgagc agaaacagct gtttgtggaa cagcacaagc actacctgga cgagatcatc 5520
gagcagatca gcgagttctc caagagagtg atcctggccg acgctaatct ggacaaagtg 5580
ctgtccgcct acaacaagca ccgggataag cccatcagag agcaggccga gaatatcatc 5640
cacctgttta ccctgaccaa tctgggagcc cctagagcct tcaagtactt tgacaccacc 5700
atcgaccgga aggtgtacag aagcaccaaa gaggtgctgg acgccaccct gatccaccag 5760
agcatcaccg gcctgtacga gacacggatc gacctgtctc agctgggagg tgacagcggc 5820
gggagcggcg ggagcggggg gagcactaat ctgagcgaca tcattgagaa ggagactggg 5880
aaacagctgg tcattcagga gtccatcctg atgctgcctg aggaggtgga ggaagtgatc 5940
ggcaacaagc cagagtctga catcctggtg cacaccgcct acgacgagtc cacagatgag 6000
aatgtgatgc tgctgacctc tgacgccccc gagtataagc cttgggccct ggtcatccag 6060
gattctaacg gcgagaataa gatcaagatg ctgagcggag gatccggagg atctggaggc 6120
agcaccaacc tgtctgacat catcgagaag gagacaggca agcagctggt catccaggag 6180
agcatcctga tgctgcccga agaagtcgaa gaagtgatcg gaaacaagcc tgagagcgat 6240
atcctggtcc ataccgccta cgacgagagt accgacgaaa atgtgatgct gctgacatcc 6300
gacgccccag agtataagcc ctgggctctg gtcatccagg attccaacgg agagaacaaa 6360
atcaaaatgc tgtctggcgg ctcaaaaaga accgccgacg gcagcgaatt cgagcccaag 6420
aagaagagga aagtctaacc ggtcatcatc accatcacca ttgagtttaa acccgctgat 6480
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 6540
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 6600
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 6660
gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atggcttctg 6720
aggcggaaag aaccagctgg ggctcgatac cgtcgacctc tagctagagc ttggcgtaat 6780
catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatac 6840
gagccggaag cataaagtgt aaagcctagg atgcctaatg agtgagctaa ctcacattaa 6900
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 6960
gaatcggcca acgcgcggga agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc 7020
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 7080
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 7140
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 7200
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 7260
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 7320
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 7380
atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 7440
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 7500
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 7560
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 7620
ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 7680
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 7740
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 7800
ggtctgacac tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 7860
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 7920
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 7980
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 8040
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 8100
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 8160
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 8220
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 8280
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 8340
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 8400
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 8460
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 8520
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 8580
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 8640
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 8700
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 8760
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 8820
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 8880
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 8940
tcgacggatc gggagatcga tctcccgatc ccctagggtc gactctcagt acaatctgct 9000
ctgatgccgc atagttaagc cagtatctgc tccctgcttg tgtgttggag gtcgctgagt 9060
agtgcgcgag caaaatttaa gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga 9120
atctgcttag ggttaggcgt tttgcgctgc ttcgcgatgt acgggccaga tatacgcgtt 9180
gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 9240
catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 9300
acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 9360
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 9420
aagtgtatc 9429
<210>15
<211>24
<212>DNA
<213>Artificial Sequence
<220>
<223> Polynucleotide sequences
<400>15
accgggccca gactgagcac gtga 24
<210>16
<211>24
<212>DNA
<213>Artificial Sequence
<220>
<223> Polynucleotide sequences
<400>16
aaactcacgt gctcagtctg ggcc 24
<210>17
<211>24
<212>DNA
<213>Artificial Sequence
<220>
<223> Polynucleotide sequences
<400>17
accgtgcccc tccctccctg gccc 24
<210>18
<211>24
<212>DNA
<213>Artificial Sequence
<220>
<223> Polynucleotide sequences
<400>18
aaacgggcca gggagggagg ggca 24
<210>19
<211>24
<212>DNA
<213>Artificial Sequence
<220>
<223> Polynucleotide sequences
<400>19
accggaacac aaagcataga ctgc 24
<210>20
<211>24
<212>DNA
<213>Artificial Sequence
<220>
<223> Polynucleotide sequences
<400>20
aaacgcagtc tatgctttgt gttc 24
<210>21
<211>27
<212>DNA
<213>Artificial Sequence
<220>
<223> primer
<400>21
gcccatgcaa ttagtctatt tctgctg 27
<210>22
<211>22
<212>DNA
<213>Artificial Sequence
<220>
<223> primer
<400>22
gcaggagctg cacatactag cc 22
<210>23
<211>22
<212>DNA
<213>Artificial Sequence
<220>
<223> primer
<400>23
ggggccccta accctatgta gc 22
<210>24
<211>20
<212>DNA
<213>Artificial Sequence
<220>
<223> primer
<400>24
ccattggcct gcttcgtggc 20
<210>25
<211>22
<212>DNA
<213>Artificial Sequence
<220>
<223> primer
<400>25
gttactgcag cccaagcctc ag 22
<210>26
<211>23
<212>DNA
<213>Artificial Sequence
<220>
<223> primer
<400>26
gtccagcccc atctgtcaaa ctg 23
<210>27
<211>583
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>27
ggagattctc agacactcga tggatccaaa gacattcact ttcaacttta acaatgaacc 60
ttgggtcaga ggacggcatg agacttacct gtgttatgag gtggagcgca tgcacaatga 120
cacctgggtc ctgctgaacc agcgcagggg ctttctatgc aaccaggctc cacataaaca 180
cggtttcctt gaaggccgcc atgcagagct gtgcttcctg gacgtgattc ccttttggaa 240
gctggacctg gaccaggact acagggttac ctgcttcacc tcctggagcc cctgcttcag 300
ctgtgcccag gaaatggcta aattcatttc aaaaaacaaa cacgtgagcc tgtgcatctt 360
cactgcccgc atctatgatg atcaaggaag atgtcaggag gggctgcgca ccctggccga 420
ggctggggcc aaaatttcaa taatgacata cagtgaattt aagcactgct gggacacctt 480
tgtggaccac cagggatgtc ccttccagcc ctgggatgga ctagatgagc acagccaaga 540
cctgagtggg aggctgcggg ccattctcca gaatcaggaa aac 583
<210>28
<211>564
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>28
atggatccaa agacattcac tttcaacttt aacaatgaac cttgggtcag aggacggcat 60
gagacttacc tgtgttatga ggtggagcgc atgcacaatg acacctgggt cctgctgaac 120
cagcgcaggg gctttctatg caaccaggct ccacataaac acggtttcct tgaaggccgc 180
catgcagagc tgtgcttcct ggacgtgatt cccttttgga agctggacct ggaccaggac 240
tacagggtta cctgcttcac ctcctggagc ccctgcttca gctgtgccca ggaaatggct 300
aaattcattt caaaaaacaa acacgtgagc ctgtgcatct tcactgcccg catctatgat 360
gatcaaggaa gatgtcagga ggggctgcgc accctggccg aggctggggc caaaatttca 420
ataatgacat acagtgaatt taagcactgc tgggacacct ttgtggacca ccagggatgt 480
cccttccagc cctgggatgg actagatgag cacagccaag acctgagtgg gaggctgcgg 540
gccattctcc agaatcagga aaac 564
<210>29
<211>1152
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>29
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct tgacccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatccaccc 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210>30
<211>1152
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>30
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct taagccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatccaccc 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210>31
<211>1152
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>31
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct tgacccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatgccccc 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210>32
<211>1152
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>32
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct tgacccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggattggccc 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210>33
<211>1152
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>33
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct tgacccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatccagcc 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210>34
<211>1152
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>34
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct tgacccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatccaaag 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210>35
<211>1152
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>35
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct tgacccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatccaccc 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtaaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210>36
<211>1152
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>36
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct taagccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatgccaag 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210>37
<211>4101
<212>DNA
<213>Artificial Sequence
<220>
<223> SpCas9-D10A nickase fragment
<400>37
gataaaaagt attctattgg tttagccatc ggcactaatt ccgttggatg ggctgtcata 60
accgatgaat acaaagtacc ttcaaagaaa tttaaggtgt tggggaacac agaccgtcat 120
tcgattaaaa agaatcttat cggtgccctc ctattcgata gtggcgaaac ggcagaggcg 180
actcgcctga aacgaaccgc tcggagaagg tatacacgtc gcaagaaccg aatatgttac 240
ttacaagaaa tttttagcaa tgagatggcc aaagttgacg attctttctt tcaccgtttg 300
gaagagtcct tccttgtcga agaggacaag aaacatgaac ggcaccccat ctttggaaac 360
atagtagatg aggtggcata tcatgaaaag tacccaacga tttatcacct cagaaaaaag 420
ctagttgact caactgataa agcggacctg aggttaatct acttggctct tgcccatatg 480
ataaagttcc gtgggcactt tctcattgag ggtgatctaa atccggacaa ctcggatgtc 540
gacaaactgt tcatccagtt agtacaaacc tataatcagt tgtttgaaga gaaccctata 600
aatgcaagtg gcgtggatgc gaaggctatt cttagcgccc gcctctctaa atcccgacgg 660
ctagaaaacc tgatcgcaca attacccgga gagaagaaaa atgggttgtt cggtaacctt 720
atagcgctct cactaggcct gacaccaaat tttaagtcga acttcgactt agctgaagat 780
gccaaattgc agcttagtaa ggacacgtac gatgacgatc tcgacaatct actggcacaa 840
attggagatc agtatgcgga cttatttttg gctgccaaaa accttagcga tgcaatcctc 900
ctatctgaca tactgagagt taatactgag attaccaagg cgccgttatc cgcttcaatg 960
atcaaaaggt acgatgaaca tcaccaagac ttgacacttc tcaaggccct agtccgtcag 1020
caactgcctg agaaatataa ggaaatattc tttgatcagt cgaaaaacgg gtacgcaggt 1080
tatattgacg gcggagcgag tcaagaggaa ttctacaagt ttatcaaacc catattagag 1140
aagatggatg ggacggaaga gttgcttgta aaactcaatc gcgaagatct actgcgaaag 1200
cagcggactt tcgacaacgg tagcattcca catcaaatcc acttaggcga attgcatgct 1260
atacttagaa ggcaggagga tttttatccg ttcctcaaag acaatcgtga aaagattgag 1320
aaaatcctaa cctttcgcat accttactat gtgggacccc tggcccgagg gaactctcgg 1380
ttcgcatgga tgacaagaaa gtccgaagaa acgattactc catggaattt tgaggaagtt 1440
gtcgataaag gtgcgtcagc tcaatcgttc atcgagagga tgaccaactt tgacaagaat 1500
ttaccgaacg aaaaagtatt gcctaagcac agtttacttt acgagtattt cacagtgtac 1560
aatgaactca cgaaagttaa gtatgtcact gagggcatgc gtaaacccgc ctttctaagc 1620
ggagaacaga agaaagcaat agtagatctg ttattcaaga ccaaccgcaa agtgacagtt 1680
aagcaattga aagaggacta ctttaagaaa attgaatgct tcgattctgt cgagatctcc 1740
ggggtagaag atcgatttaa tgcgtcactt ggtacgtatc atgacctcct aaagataatt 1800
aaagataagg acttcctgga taacgaagag aatgaagata tcttagaaga tatagtgttg 1860
actcttaccc tctttgaaga tcgggaaatg attgaggaaa gactaaaaac atacgctcac 1920
ctgttcgacg ataaggttat gaaacagtta aagaggcgtc gctatacggg ctggggacga 1980
ttgtcgcgga aacttatcaa cgggataaga gacaagcaaa gtggtaaaac tattctcgat 2040
tttctaaaga gcgacggctt cgccaatagg aactttatgc agctgatcca tgatgactct 2100
ttaaccttca aagaggatat acaaaaggca caggtttccg gacaagggga ctcattgcac 2160
gaacatattg cgaatcttgc tggttcgcca gccatcaaaa agggcatact ccagacagtc 2220
aaagtagtgg atgagctagt taaggtcatg ggacgtcaca aaccggaaaa cattgtaatc 2280
gagatggcac gcgaaaatca aacgactcag aaggggcaaa aaaacagtcg agagcggatg 2340
aagagaatag aagagggtat taaagaactg ggcagccaga tcttaaagga gcatcctgtg 2400
gaaaataccc aattgcagaa cgagaaactt tacctctatt acctacaaaa tggaagggac 2460
atgtatgttg atcaggaact ggacataaac cgtttatctg attacgacgt cgatcacatt 2520
gtaccccaat cctttttgaa ggacgattca atcgacaata aagtgcttac acgctcggat 2580
aagaaccgag ggaaaagtga caatgttcca agcgaggaag tcgtaaagaa aatgaagaac 2640
tattggcggc agctcctaaa tgcgaaactg ataacgcaaa gaaagttcga taacttaact 2700
aaagctgaga ggggtggctt gtctgaactt gacaaggccg gatttattaa acgtcagctc 2760
gtggaaaccc gccaaatcac aaagcatgtt gcacagatac tagattcccg aatgaatacg 2820
aaatacgacg agaacgataa gctgattcgg gaagtcaaag taatcacttt aaagtcaaaa 2880
ttggtgtcgg acttcagaaa ggattttcaa ttctataaag ttagggagat aaataactac 2940
caccatgcgc acgacgctta tcttaatgcc gtcgtaggga ccgcactcat taagaaatac 3000
ccgaagctag aaagtgagtt tgtgtatggt gattacaaag tttatgacgt ccgtaagatg 3060
atcgcgaaaa gcgaacagga gataggcaag gctacagcca aatacttctt ttattctaac 3120
attatgaatt tctttaagac ggaaatcact ctggcaaacg gagagatacg caaacgacct 3180
ttaattgaaa ccaatgggga gacaggtgaa atcgtatggg ataagggccg ggacttcgcg 3240
acggtgagaa aagttttgtc catgccccaa gtcaacatag taaagaaaac tgaggtgcag 3300
accggagggt tttcaaagga atcgattctt ccaaaaagga atagtgataa gctcatcgct 3360
cgtaaaaagg actgggaccc gaaaaagtac ggtggcttcg atagccctac agttgcctat 3420
tctgtcctag tagtggcaaa agttgagaag ggaaaatcca agaaactgaa gtcagtcaaa 3480
gaattattgg ggataacgat tatggagcgc tcgtcttttg aaaagaaccc catcgacttc 3540
cttgaggcga aaggttacaa ggaagtaaaa aaggatctca taattaaact accaaagtat 3600
agtctgtttg agttagaaaa tggccgaaaa cggatgttgg ctagcgccgg agagcttcaa 3660
aaggggaacg aactcgcact accgtctaaa tacgtgaatt tcctgtattt agcgtcccat 3720
tacgagaagt tgaaaggttc acctgaagat aacgaacaga agcaactttt tgttgagcag 3780
cacaaacatt atctcgacga aatcatagag caaatttcgg aattcagtaa gagagtcatc 3840
ctagctgatg ccaatctgga caaagtatta agcgcataca acaagcacag ggataaaccc 3900
atacgtgagc aggcggaaaa tattatccat ttgtttactc ttaccaacct cggcgctcca 3960
gccgcattca agtattttga cacaacgata gatcgcaaac gatacacttc taccaaggag 4020
gtgctagacg cgacactgat tcaccaatcc atcacgggat tatatgaaac tcggatagat 4080
ttgtcacagc ttgggggtga c 4101
<210>38
<211>4101
<212>DNA
<213>Artificial Sequence
<220>
<223> SpCas9-D10A nickase fragment
<400>38
gacaagaagt acagcatcgg cctggccatc ggcaccaact ctgtgggctg ggccgtgatc 60
accgacgagt acaaggtgcc cagcaagaaa ttcaaggtgc tgggcaacac cgaccggcac 120
agcatcaaga agaacctgat cggagccctg ctgttcgaca gcggcgaaac agccgaggcc 180
acccggctga agagaaccgc cagaagaaga tacaccagac ggaagaaccg gatctgctat 240
ctgcaagaga tcttcagcaa cgagatggcc aaggtggacg acagcttctt ccacagactg 300
gaagagtcct tcctggtgga agaggataag aagcacgagc ggcaccccat cttcggcaac 360
atcgtggacg aggtggccta ccacgagaag taccccacca tctaccacct gagaaagaaa 420
ctggtggaca gcaccgacaa ggccgacctg cggctgatct atctggccct ggcccacatg 480
atcaagttcc ggggccactt cctgatcgag ggcgacctga accccgacaa cagcgacgtg 540
gacaagctgt tcatccagct ggtgcagacc tacaaccagc tgttcgagga aaaccccatc 600
aacgccagcg gcgtggacgc caaggccatc ctgtctgcca gactgagcaa gagcagacgg 660
ctggaaaatc tgatcgccca gctgcccggc gagaagaaga atggcctgtt cggaaacctg 720
attgccctga gcctgggcct gacccccaac ttcaagagca acttcgacct ggccgaggat 780
gccaaactgc agctgagcaa ggacacctac gacgacgacc tggacaacct gctggcccag 840
atcggcgacc agtacgccga cctgtttctg gccgccaaga acctgtccga cgccatcctg 900
ctgagcgaca tcctgagagt gaacaccgag atcaccaagg cccccctgag cgcctctatg 960
atcaagagat acgacgagca ccaccaggac ctgaccctgc tgaaagctct cgtgcggcag 1020
cagctgcctg agaagtacaa agagattttc ttcgaccaga gcaagaacgg ctacgccggc 1080
tacattgacg gcggagccag ccaggaagag ttctacaagt tcatcaagcc catcctggaa 1140
aagatggacg gcaccgagga actgctcgtg aagctgaaca gagaggacct gctgcggaag 1200
cagcggacct tcgacaacgg cagcatcccc caccagatcc acctgggaga gctgcacgcc 1260
attctgcggc ggcaggaaga tttttaccca ttcctgaagg acaaccggga aaagatcgag 1320
aagatcctga ccttccgcat cccctactac gtgggccctc tggccagggg aaacagcaga 1380
ttcgcctgga tgaccagaaa gagcgaggaa accatcaccc cctggaactt cgaggaagtg 1440
gtggacaagg gcgcttccgc ccagagcttc atcgagcgga tgaccaactt cgataagaac 1500
ctgcccaacg agaaggtgct gcccaagcac agcctgctgt acgagtactt caccgtgtat 1560
aacgagctga ccaaagtgaa atacgtgacc gagggaatga gaaagcccgc cttcctgagc 1620
ggcgagcaga aaaaggccat cgtggacctg ctgttcaaga ccaaccggaa agtgaccgtg 1680
aagcagctga aagaggacta cttcaagaaa atcgagtgct tcgactccgt ggaaatctcc 1740
ggcgtggaag atcggttcaa cgcctccctg ggcacatacc acgatctgct gaaaattatc 1800
aaggacaagg acttcctgga caatgaggaa aacgaggaca ttctggaaga tatcgtgctg 1860
accctgacac tgtttgagga cagagagatg atcgaggaac ggctgaaaac ctatgcccac 1920
ctgttcgacg acaaagtgat gaagcagctg aagcggcgga gatacaccgg ctggggcagg 1980
ctgagccgga agctgatcaa cggcatccgg gacaagcagt ccggcaagac aatcctggat 2040
ttcctgaagt ccgacggctt cgccaacaga aacttcatgc agctgatcca cgacgacagc 2100
ctgaccttta aagaggacat ccagaaagcc caggtgtccg gccagggcga tagcctgcac 2160
gagcacattg ccaatctggc cggcagcccc gccattaaga agggcatcct gcagacagtg 2220
aaggtggtgg acgagctcgt gaaagtgatg ggccggcaca agcccgagaa catcgtgatc 2280
gaaatggcca gagagaacca gaccacccag aagggacaga agaacagccg cgagagaatg 2340
aagcggatcg aagagggcat caaagagctg ggcagccaga tcctgaaaga acaccccgtg 2400
gaaaacaccc agctgcagaa cgagaagctg tacctgtact acctgcagaa tgggcgggat 2460
atgtacgtgg accaggaact ggacatcaac cggctgtccg actacgatgt ggaccatatc 2520
gtgcctcaga gctttctgaa ggacgactcc atcgacaaca aggtgctgac cagaagcgac 2580
aagaaccggg gcaagagcga caacgtgccc tccgaagagg tcgtgaagaa gatgaagaac 2640
tactggcggc agctgctgaa cgccaagctg attacccaga gaaagttcga caatctgacc 2700
aaggccgaga gaggcggcct gagcgaactg gataaggccg gcttcatcaa gagacagctg 2760
gtggaaaccc ggcagattac aaagcacgtg gcacagatcc tggactcccg gatgaacact 2820
aagtacgacg agaatgacaa gctgatccgg gaagtgaaag tgatcaccct gaagtccaag 2880
ctggtgtccg atttccggaa ggatttccag ttttacaaag tgcgcgagat caacaactac 2940
caccacgccc acgacgccta cctaaacgcc gtcgtgggaa ccgcactgat caaaaagtac 3000
cctaagctgg aaagcgagtt cgtgtacggc gactacaagg tgtacgacgt gcggaagatg 3060
atcgccaaga gcgagcagga aatcggcaag gctaccgcca agtacttctt ctacagcaac 3120
atcatgaact ttttcaagac cgagattacc ctggccaacg gcgagatccg gaagcggcct 3180
ctgatcgaga caaacggcga aaccggggag atcgtgtggg ataagggccg ggattttgcc 3240
accgtgcgga aagtgctgag catgccccaa gtgaatatcg tgaaaaagac cgaggtgcag 3300
acaggcggct tcagcaaaga gtctatcaga cccaagagga acagcgataa gctgatcgcc 3360
agaaagaagg actgggaccc taagaagtac ggcggcttcg tgagccccac cgtggcctat 3420
tctgtgctgg tggtggccaa agtggaaaag ggcaagtcca agaaactgaa gagtgtgaaa 3480
gagctgctgg ggatcaccat catggaaaga agcagcttcg agaagaatcc catcgacttt 3540
ctggaagcca agggctacaa agaagtgaaa aaggacctga tcatcaagct gcctaagtac 3600
tccctgttcg agctggaaaa cggccggaag agaatgctgg cctctgccag attcctgcag 3660
aagggaaacg aactggccct gccctccaaa tatgtgaact tcctgtacct ggccagccac 3720
tatgagaagc tgaagggctc ccccgaggat aatgagcaga aacagctgtt tgtggaacag 3780
cacaagcact acctggacga gatcatcgag cagatcagcg agttctccaa gagagtgatc 3840
ctggccgacg ctaatctgga caaagtgctg tccgcctaca acaagcaccg ggataagccc 3900
atcagagagc aggccgagaa tatcatccac ctgtttaccc tgaccaatct gggagcccct 3960
agagccttca agtactttga caccaccatc gaccggaagg tgtacagaag caccaaagag 4020
gtgctggacg ccaccctgat ccaccagagc atcaccggcc tgtacgagac acggatcgac 4080
ctgtctcagc tgggaggtga c 4101
<210>39
<211>57
<212>DNA
<213>Artificial Sequence
<220>
<223> Nuclear localization Signal fragment
<400>39
atgaaacgga cagccgacgg aagcgagttc gagtcaccaa agaagaagcg gaaagtc 57
<210>40
<211>96
<212>DNA
<213>Artificial Sequence
<220>
<223> Flexible linker peptide fragment
<400>40
tctggaggat ctagcggagg atcctctgga agcgagacac caggcacaag cgagtccgcc 60
acaccagaga gctccggcgg ctcctccgga ggatcc 96
<210>41
<211>96
<212>DNA
<213>Artificial Sequence
<220>
<223> Flexible linker peptide fragment
<400>41
tctggaggat ctagcggagg atcctctgga agcgagacac caggcacaag cgagtccgcc 60
acaccagaga gctccggcgg ctcctccgga ggatcc 96
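
The sequence rows above share one layout: groups of ten bases followed by a running base count. As a minimal sketch (not part of the patent), the Python below strips that layout from SEQ ID NO. 39, checks the result against its declared <211> length of 57, and translates it with the standard genetic code. The helper names `parse_listing` and `translate` are illustrative assumptions.

```python
import re

# SEQ ID NO. 39 (nuclear localization signal fragment) exactly as printed above.
LISTING_ROWS = [
    "atgaaacgga cagccgacgg aagcgagttc gagtcaccaa agaagaagcg gaaagtc 57",
]
DECLARED_LENGTH = 57  # value of the <211> field for SEQ ID NO. 39

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def parse_listing(rows):
    """Join listing rows, dropping spaces and the running count on each row."""
    # Keeping only a/c/g/t also handles rows where the count is fused to the bases.
    return "".join(re.sub(r"[^acgt]", "", row.lower()) for row in rows).upper()


def translate(dna):
    """Translate complete codons from position 1; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


nls_dna = parse_listing(LISTING_ROWS)
nls_protein = translate(nls_dna)
print(len(nls_dna), nls_protein)
```

The same length check can be applied to any other record by pasting its rows and its <211> value; a mismatch would suggest that a row was lost or damaged during extraction.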

Claims (16)

  1. A fusion protein, characterized by comprising, in order from the N-terminus to the C-terminus, an APOBEC3G fragment and an SpCas9-D10A nickase fragment, wherein the APOBEC3G fragment has cytosine deaminase activity, and wherein either the APOBEC3G fragment carries at least one of the amino acid mutations R24A, W94L, Y124A, W127L, D128K, P199A, P199W, P200A, P200K and Q322K, or the APOBEC3G fragment is a truncated APOBEC3G fragment lacking the region from the start codon of APOBEC3G to position 190 or position 197.
  2. The fusion protein of claim 1, wherein the nucleotide sequence of the APOBEC3G fragment comprises:
    a) a nucleotide sequence as shown in any one of SEQ ID NO. 27-36; or,
    b) a nucleotide sequence having more than 80% sequence similarity with any one of SEQ ID NO. 27-36 and having the function of the nucleotide sequence defined in a).
  3. The fusion protein of claim 1, wherein the nucleotide sequence of the SpCas9-D10A nickase fragment comprises:
    c) a nucleotide sequence as shown in SEQ ID NO. 37 or 38; or,
    d) a nucleotide sequence having more than 80% sequence similarity with SEQ ID NO. 37 or 38 and having the function of the nucleotide sequence defined in c).
  4. The fusion protein of claim 1, further comprising a nuclear localization signal fragment located N-terminal to the APOBEC3G fragment or C-terminal to the SpCas9-D10A nickase fragment.
  5. The fusion protein of claim 4, wherein the nucleotide sequence of the nuclear localization signal fragment is set forth in SEQ ID NO. 39.
  6. The fusion protein of claim 1, further comprising a flexible linker peptide fragment at the N-terminus of the APOBEC3G fragment, between the APOBEC3G fragment and the SpCas9-D10A nickase fragment, or at the C-terminus of the SpCas9-D10A nickase fragment.
  7. The fusion protein of claim 6, wherein the nucleotide sequence of the flexible linker peptide fragment is set forth in SEQ ID NO. 40 or 41.
  8. An isolated polynucleotide, wherein the isolated polynucleotide encodes the fusion protein of claim 1.
  9. A construct, characterized in that the construct is obtained by inserting the isolated polynucleotide of claim 8 into an expression vector, and the polynucleotide sequence of the construct is as shown in any one of SEQ ID NO. 1-14.
  10. The construct of claim 9, wherein the expression vector is one selected from the group consisting of a pCMV expression vector, a pSV2 expression vector, and a pGL3 expression vector.
  11. An expression system, wherein the expression system is a host cell that comprises the construct of claim 9 or that has the isolated polynucleotide of claim 8 integrated into its genome.
  12. The expression system of claim 11, wherein the host cell is selected from a mouse cell or a human cell.
  13. The expression system of claim 11, wherein the host cell is selected from the group consisting of mouse brain neuroma cells, human embryonic kidney cells, human cervical cancer cells, human colon cancer cells, and human osteosarcoma cells.
  14. A base editing tool, comprising the fusion protein of claim 1 and an sgRNA.
  15. Use of the base editing tool of claim 14 in gene editing in eukaryotes.
  16. The use of claim 15, wherein the gene editing is C-to-T base editing at positions 4-7, counted from the 5' end of the sgRNA, in the target region.
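
Claim 16 places the C-to-T edit at positions 4-7 counted from the 5' end of the sgRNA target. A minimal sketch of that window arithmetic follows. The 20-nt protospacer is a made-up example rather than a target from the patent, the function names are hypothetical, and the code assumes every cytosine in the window is converted, which real editing would not guarantee.

```python
def editable_cytosines(protospacer, window=(4, 7)):
    """Return 1-based positions of C inside the window, counted from the 5' end."""
    seq = protospacer.upper()
    start, end = window
    return [
        pos for pos in range(start, end + 1)
        if pos <= len(seq) and seq[pos - 1] == "C"
    ]


def simulate_c_to_t(protospacer, window=(4, 7)):
    """Idealized outcome: replace every C in the window with T."""
    seq = list(protospacer.upper())
    for pos in editable_cytosines(protospacer, window):
        seq[pos - 1] = "T"
    return "".join(seq)


# Hypothetical 20-nt protospacer, written 5' to 3' (assumption for illustration).
example = "GACCCATGCCTTGACCGTAA"
positions = editable_cytosines(example)
edited = simulate_c_to_t(example)
print(positions, edited)
```

In this example the cytosine at position 3 lies outside the claimed window and is left unchanged, while those at positions 4 and 5 are converted.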
CN201911075141.9A 2019-11-06 2019-11-06 Cytosine base editing tool and application thereof Active CN110734900B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911075141.9A CN110734900B (en) 2019-11-06 2019-11-06 Cytosine base editing tool and application thereof

Publications (2)

Publication Number Publication Date
CN110734900A true CN110734900A (en) 2020-01-31
CN110734900B CN110734900B (en) 2022-09-30

Family

ID=69272245

Country Status (1)

Country Link
CN (1) CN110734900B (en)

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090269831A1 (en) * 2008-02-07 2009-10-29 Harris Reuben S Modified cytosine deaminases
CN102482639A (en) * 2009-04-03 2012-05-30 医学研究会 Mutants of activation-induced cytidine deaminase (aid) and methods of use
WO2016014837A1 (en) * 2014-07-25 2016-01-28 Sangamo Biosciences, Inc. Gene editing for hiv gene therapy
US20160022737A1 (en) * 2014-07-25 2016-01-28 Sangamo Biosciences, Inc. Gene editing for hiv gene therapy
CN108513575A (en) * 2015-10-23 2018-09-07 哈佛大学的校长及成员们 Nucleobase editing machine and application thereof

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
XIAO WANG ET AL.: "Efficient base editing in methylated regions with a human APOBEC3A-Cas9 fusion", 《NATURE BIOTECHNOLOGY》 *
ZHAO Yawei et al.: "Development of base editors and their application in bacterial genome editing", 《微生物学通报》 (Microbiology China) *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113249362A (en) * 2020-02-07 2021-08-13 辉大(上海)生物科技有限公司 Modified cytosine base editor and application thereof
CN113249362B (en) * 2020-02-07 2023-04-14 辉大(上海)生物科技有限公司 Modified cytosine base editor and application thereof
CN114058607A (en) * 2020-07-31 2022-02-18 上海科技大学 Fusion protein for C-to-U base editing and preparation method and application thereof
CN114058607B (en) * 2020-07-31 2024-02-27 上海科技大学 Fusion protein for editing C to U base, and preparation method and application thereof
CN116555237A (en) * 2022-03-08 2023-08-08 中国科学院遗传与发育生物学研究所 Cytosine deaminase and its use in base editing
CN114561429A (en) * 2022-03-22 2022-05-31 绍兴市妇幼保健院 Treatment method for inhibiting HBV surface antigen based on base editing ATG initiation codon
CN114561392A (en) * 2022-03-22 2022-05-31 绍兴市妇幼保健院 Method for removing HBV e antigen by closing target gene based on base editing technology
CN114606265A (en) * 2022-04-07 2022-06-10 吉林大学 Mini-base editor capable of realizing single AAV (adeno-associated virus) coating
CN114606265B (en) * 2022-04-07 2024-01-30 吉林大学 Mini base editor capable of realizing single AAV virus coating

Also Published As

Publication number Publication date
CN110734900B (en) 2022-09-30

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant