CN110734900A - cytosine base editing tool and application thereof - Google Patents
cytosine base editing tool and application thereof Download PDFInfo
- Publication number
- CN110734900A CN110734900A CN201911075141.9A CN201911075141A CN110734900A CN 110734900 A CN110734900 A CN 110734900A CN 201911075141 A CN201911075141 A CN 201911075141A CN 110734900 A CN110734900 A CN 110734900A
- Authority
- CN
- China
- Prior art keywords
- fragment
- apobec3g
- nucleotide sequence
- fusion protein
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 title abstract description 32
- 229940104302 cytosine Drugs 0.000 title abstract description 16
- 108010004483 APOBEC-3G Deaminase Proteins 0.000 claims abstract description 61
- 102100038076 DNA dC->dU-editing enzyme APOBEC-3G Human genes 0.000 claims abstract description 61
- 239000012634 fragment Substances 0.000 claims abstract description 50
- 102000037865 fusion proteins Human genes 0.000 claims abstract description 18
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 18
- 108091027544 Subgenomic mRNA Proteins 0.000 claims abstract description 16
- 230000035772 mutation Effects 0.000 claims abstract description 12
- 150000001413 amino acids Chemical class 0.000 claims abstract description 10
- 108010008532 Deoxyribonuclease I Proteins 0.000 claims abstract description 9
- 102000007260 Deoxyribonuclease I Human genes 0.000 claims abstract description 9
- 102220595577 Major vault protein_R24A_mutation Human genes 0.000 claims abstract description 7
- 102220534804 Protein quaking_Y124A_mutation Human genes 0.000 claims abstract description 7
- 102220196624 rs1057518977 Human genes 0.000 claims abstract description 7
- 230000000694 effects Effects 0.000 claims abstract description 5
- 210000004027 cell Anatomy 0.000 claims description 40
- 239000002773 nucleotide Substances 0.000 claims description 26
- 125000003729 nucleotide group Chemical group 0.000 claims description 26
- 108091033319 polynucleotide Proteins 0.000 claims description 16
- 102000040430 polynucleotide Human genes 0.000 claims description 16
- 239000002157 polynucleotide Substances 0.000 claims description 16
- 241000282414 Homo sapiens Species 0.000 claims description 12
- 239000013604 expression vector Substances 0.000 claims description 10
- 238000010362 genome editing Methods 0.000 claims description 7
- 108010033276 Peptide Fragments Proteins 0.000 claims description 6
- 102000007079 Peptide Fragments Human genes 0.000 claims description 6
- 108010080611 Cytosine Deaminase Proteins 0.000 claims description 5
- 102000000311 Cytosine Deaminase Human genes 0.000 claims description 5
- 108010077850 Nuclear Localization Signals Proteins 0.000 claims description 5
- 210000004899 c-terminal region Anatomy 0.000 claims description 4
- 108091081024 Start codon Proteins 0.000 claims description 3
- 101100297347 Caenorhabditis elegans pgl-3 gene Proteins 0.000 claims description 2
- 206010008342 Cervix carcinoma Diseases 0.000 claims description 2
- 206010009944 Colon cancer Diseases 0.000 claims description 2
- 241000206602 Eukaryota Species 0.000 claims description 2
- 208000005890 Neuroma Diseases 0.000 claims description 2
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 claims description 2
- 210000004556 brain Anatomy 0.000 claims description 2
- 201000010881 cervical cancer Diseases 0.000 claims description 2
- 208000029742 colonic neoplasm Diseases 0.000 claims description 2
- 210000005260 human cell Anatomy 0.000 claims description 2
- 210000003292 kidney cell Anatomy 0.000 claims description 2
- 201000008968 osteosarcoma Diseases 0.000 claims description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 abstract description 20
- 238000010357 RNA editing Methods 0.000 abstract description 2
- 230000026279 RNA modification Effects 0.000 abstract description 2
- 108020004414 DNA Proteins 0.000 description 46
- 239000013612 plasmid Substances 0.000 description 17
- 108091033409 CRISPR Proteins 0.000 description 7
- 108090000623 proteins and genes Proteins 0.000 description 7
- 108020004705 Codon Proteins 0.000 description 5
- 239000012124 Opti-MEM Substances 0.000 description 5
- 238000000034 method Methods 0.000 description 5
- 230000009437 off-target effect Effects 0.000 description 5
- 230000009615 deamination Effects 0.000 description 4
- 238000006481 deamination reaction Methods 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 238000001890 transfection Methods 0.000 description 4
- 108010077544 Chromatin Proteins 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- 230000004568 DNA-binding Effects 0.000 description 3
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 3
- 239000012097 Lipofectamine 2000 Substances 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- 230000004570 RNA-binding Effects 0.000 description 3
- 210000003483 chromatin Anatomy 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 239000012154 double-distilled water Substances 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 230000006780 non-homologous end joining Effects 0.000 description 3
- 239000000047 product Substances 0.000 description 3
- 230000008439 repair process Effects 0.000 description 3
- 239000013598 vector Substances 0.000 description 3
- 229930024421 Adenine Natural products 0.000 description 2
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 2
- 108010052875 Adenine deaminase Proteins 0.000 description 2
- 238000010354 CRISPR gene editing Methods 0.000 description 2
- 238000010442 DNA editing Methods 0.000 description 2
- 230000008265 DNA repair mechanism Effects 0.000 description 2
- 230000004543 DNA replication Effects 0.000 description 2
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 2
- 206010064571 Gene mutation Diseases 0.000 description 2
- 102100023823 Homeobox protein EMX1 Human genes 0.000 description 2
- 101001048956 Homo sapiens Homeobox protein EMX1 Proteins 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- 108020004682 Single-Stranded DNA Proteins 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 229960000643 adenine Drugs 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 230000005782 double-strand break Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 239000002699 waste material Substances 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 230000006463 DNA deamination Effects 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- 108020005004 Guide RNA Proteins 0.000 description 1
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 101100445099 Mus musculus Emx1 gene Proteins 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- 101710130181 Protochlorophyllide reductase A, chloroplastic Proteins 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 101150063416 add gene Proteins 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000000840 anti-viral effect Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 229960001701 chloroform Drugs 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 230000034431 double-strand break repair via homologous recombination Effects 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000007480 sanger sequencing Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 239000012096 transfection reagent Substances 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/78—Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y305/00—Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5)
- C12Y305/04—Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5) in cyclic amidines (3.5.4)
- C12Y305/04001—Cytosine deaminase (3.5.4.1)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/09—Fusion polypeptide containing a localisation/targetting motif containing a nuclear localisation signal
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/22—Vectors comprising a coding region that has been codon optimised for expression in a respective host
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The invention relates to the field of biotechnology, in particular to cytosine base editing tools and application thereof.A fusion protein is provided by the invention, and comprises an APOBEC3G fragment and a SpCas9-D10A nickase fragment.A series of APOBEC3G cytosine base editing tools provided by the invention have R24A, W94L, Y124A, W127L and P200K amino acid mutations relative to a wild type in an APOBEC3G fragment, wherein the four amino acid mutations of R24A, W94L, Y124A and W127L can limit the combination of APOBEC3G and RNA, D128K, P199A, P200K and Q322K can improve the combination of APOBEC3G and DNA, and can improve the editing efficiency in the base editing of changing C at 4-7 bit of the 5' end of sgRNA into T, and greatly reduce or even eliminate the RNA target RNA editing effect of cytosine base editing tools.
Description
Technical Field
The invention relates to the technical field of biology, in particular to an cytosine base editing tool and application thereof.
Background
CRISPR/Cas9 is currently the most efficient and convenient genome editing technology. Cas9 nuclease, guided by guide RNA (sgRNA), can reach a specific target of the genome, cleave it, thereby generating DNA Double Strand Breaks (DSB), and then achieve editing through endogenous DNA repair mechanisms. DNA Repair mechanisms include Non-Homologous End joining (NHEJ) and Homologous recombination Repair (HDR). Among them, NHEJ repair results in random insertions, deletions, leading to inactivation of genes, which dominates in genome repair. And the HDR can be accurately repaired by utilizing the template, so that the gene mutation is corrected.
But actually, the probability of HDR-mediated accurate repair is very low, usually less than 5%, thus greatly limiting the application of CRISPR/Cas9 in the transformation from scientific research to application, in particular, is a big problem in the field of gene editing, .
Recently, a newly developed Base Editor (BE) has successfully solved the above problems, and the efficiency of correcting gene mutation has been greatly improved. There are two types of conventional Base editors, a Cytosine Base Editor (CBE) and an Adenine Base Editor (ABE).
CBE and ABE are the combination of RuvC domain inactivated Cas9D10Anickase (nCas9) and cytosine deaminase/adenine deaminase integrated at , guided by the sgRNA to the target site and bound to the complementary DNA strand of the sgRNA, cytosine deaminase deaminates a limited range of cytosine C around to uracil U, which can pair complementarily with cytosine A, and upon DNA replication, U will eventually BE replaced by the complementary pairing base T of A. similarly, adenine deaminase deaminates a limited range of adenine A around to hypoxanthine I, which can pair complementarily with cytosine C, and upon DNA replication, I will eventually BE replaced by the complementary pairing base G of C. thus achieving the purpose of C-to-T or A-to-G.
The deaminase rAPOBEC1 in BE3 is a rat cytosine deaminase, wherein the endogenous state of rAPOBEC1 can edit single-stranded DNA in addition to the above representationRNA, changing C to U. Recent studies found that BE3 base editor produces serious RNA off-Target effect (off-Target)5The application of the base editor is greatly limited.
Disclosure of Invention
The purpose of the present invention is to provide an editing tool for cytosine bases and the use thereof.
In order to achieve the purpose, the invention provides fusion proteins, which are characterized by sequentially comprising an APOBEC3G (A3G) fragment and an SpCas9-D10A nickase fragment from N end to C end, wherein the APOBEC3G fragment has cytosine deaminase activity, at least amino acid mutations in R24A, W94L, Y124A, W127L, D128K, P199A, P199W, P200A, P200K and Q322K exist in the APOBEC3G fragment, or the APOBEC3G fragment is an APOBEC3G fragment deleted from the start codon of APOBEC3G to 190 th or 197 th position.
Preferably, the APOBEC3G fragment is derived from human (Homo sapiens).
Preferably, the nucleotide sequence of the APOBEC3G fragment comprises:
a) a nucleotide sequence shown as SEQ ID NO. 27-36; or,
b) a nucleotide sequence having more than 80% sequence similarity with SEQ ID NO.27-36 and having the functions of the nucleotide sequence defined in a).
More preferably, the nucleotide sequence in b) may have more than 80%, 85%, 90%, 93%, 95%, 97%, or 99% similarity to SEQ ID No.27-36 in a).
More preferably, the nucleotide sequence in b) specifically comprises a nucleotide sequence shown as SEQ ID NO.27-36 obtained by replacing, deleting or adding or more (specifically 1-50, 1-30, 1-20, 1-10, 1-5, 1-3, 1, 2 or 3) amino acid codons, or adding or more (specifically 1-50, 1-30, 1-20, 1-10, 1-5, 1-3, 1, 2 or 3) amino acid codons at the N-terminal and/or C-terminal.
Preferably, the nucleotide sequence of the SpCas9-D10A nickase fragment comprises:
c) a nucleotide sequence shown as SEQ ID NO. 37-38; or,
d) a nucleotide sequence having more than 80% sequence similarity with SEQ ID NO.37-38 and having the function of the nucleotide sequence defined in d).
More preferably, the nucleotide sequence in d) may have more than 80%, 85%, 90%, 93%, 95%, 97%, or 99% similarity to SEQ ID NO. 37-38.
More preferably, the nucleotide sequence in d) specifically includes a nucleotide sequence shown as SEQ ID No.37-38 obtained by substituting, deleting or adding or more (specifically, 1-50, 1-30, 1-20, 1-10, 1-5, 1-3, 1, 2, or 3) amino acid codons, or adding or more (specifically, 1-50, 1-30, 1-20, 1-10, 1-5, 1-3, 1, 2, or 3) amino acid codons at the N-terminal and/or C-terminal.
Preferably, the fusion protein further comprises a nuclear localization signal fragment located at the N-terminus of the APOBEC3G fragment or the C-terminus of the SpCas9-D10A nickase fragment.
More preferably, the nucleotide sequence of the nuclear localization signal fragment is shown as SEQ ID NO. 39.
Preferably, the fusion protein further comprises a flexible linker peptide fragment located at the N-terminus of APOBEC3G fragment, between APOBEC3G fragment and SpCas9-D10A nicase, or at the C-terminus of SpCas9-D10A nicase.
More preferably, the nucleotide sequence of the flexibly linked peptide fragment is as set forth in SEQ ID NO. 40-41.
The present invention also provides isolated polynucleotides encoding the above fusion proteins.
The invention also provides constructs, which are characterized in that the constructs are obtained by inserting the separated polynucleotides into an expression vector, and the polynucleotide sequence of the constructs is shown in SEQ ID NO. 1-14.
Preferably, the expression vector includes, but is not limited to, a pCMV expression vector, a pSV2 expression vector, a pGL3 expression vector, and the like.
The invention also provides expression systems, which is characterized in that the expression system is a host cell, the host cell contains the construct or integrates the isolated polynucleotide into the genome, the host cell can express the fusion protein, and the fusion protein can be matched with the sgRNA, so that the fusion protein can be positioned to a target region to realize base editing of the target region.
Preferably, the host cell is selected from a eukaryotic cell or a prokaryotic cell.
More preferably, the host cell is selected from a mouse cell or a human cell.
, the host cell is selected from mouse brain neuroma cell, human embryo kidney cell, human cervical cancer cell, human colon cancer cell, human osteosarcoma cell.
Further , the host cell of the expression system is selected from the group consisting of N2a cells, HEK293FT cells, Hela cells, HCT116 cells, and U2OS cells.
-base editing tool, comprising the fusion protein and sgRNA.
The invention also provides the application of the base editing tool in gene editing of eukaryotes.
Preferably, the gene editing is base editing of C-to-T at positions 4-7 of the 5' end of the sgRNA in the target region.
APOBEC3G is a member of the human APOBEC family, and can bind to single-stranded DNA or RNA, generate deamination, mutate C to U, and play an important role in antiviral processes. Deamination of APOBEC3G tends to occur in the CC sequence. There are two functional domains of APOBEC3G, and earlier studies suggest that the primary role of the amino terminus is RNA binding, and the primary role of the carboxy terminus is DNA binding, as well as deamination, which is also a common feature of all two-domain APOBECs. Of these, it is noteworthy that APOBEC3G was not considered to have RNA editing function by earlier studies. Recent studies have indicated that there is competition between the DNA and RNA binding domains of APOBEC3G and, unlike previous studies, APOBEC3G was overexpressed and found to have RNA deaminase activity.
Compared with the prior art, the invention has the beneficial effects that:
(1) the invention provides a new -generation cytosine base editing tool, wherein APOBEC3G is connected with a spCas9-D10Anickase fragment, and then a functional domain responsible for RNA combination in APOBEC3G is mutated, the RNA deamination function is damaged, and the DNA deamination activity is improved.
(2) The base editing system provided by the invention widens the targeted range of a genome, can use an NGG sequence as PAM, realizes the C-to-T base of 4-7 sites at the 5' end in a sgRNA target region, has high mutation precision, and can greatly reduce or even eliminate RNA off-target effect.
(3) Compared with the wild type, the APOBEC3G fragment of the invention has amino acid mutations of R24A, W94L, Y124A, W127L and the like, can limit the combination of APOBEC3G and RNA, and can greatly reduce or even eliminate the RNA off-target effect of a cytosine base editing tool in the base editing of mutating C at 4-7 sites of the 5' end of sgRNA into T; in addition, the cytosine base editing tool provided by the invention has higher editing efficiency on DNA than a classical cytosine base editing tool (BE3), greatly improves the mutation accuracy on the premise of ensuring the editing efficiency, and has good industrialization prospect.
Drawings
FIG. 1 shows a schematic structure diagram of APOBEC3G-BE3, APOBEC3G-BE4 series plasmids used in the examples of the present invention;
FIG. 2 is a statistical chart showing the editing capacity of A3G-BE3,191-BE3,198-BE3 and BE3 to endogenous gene loci in HEK293T cells, wherein a is a statistical chart of C-to-T editing efficiency of A3G-BE3,191-BE3,198-BE3 and BE3 at HEK293Site3, b is a statistical chart of RNA off-target efficiency of A3G-BE3,191-BE3,198-BE3 and BE3, C4 and C5 represent positions of C at target loci, counted from PAM distal bases, and 191-BE3 and 198-BE3 are truncated APOBEC3G-BE3 deleted from position to position 190 and position 197 of APOBEC3G, respectively;
FIG. 3 is a statistical chart showing the editing ability of the 4M-BE3, BE3, to the endogenous gene locus in HEK293T cells according to the present invention; wherein a is a C-to-T editing efficiency statistical chart of three sites of 4M-BE3, A3G-BE3, BE3 in HEK293Site3, HEK293Site2 and EMX 1; b is a statistical chart of RNA off-target efficiency of 4M-BE3, A3G-BE3 and BE 3; c3, C4, C5, C6, C8 represent the position of C at the target site, counted from the PAM distal base;
FIG. 4 is a statistical chart showing the editing capacity of series mutants in the endogenous gene locus in HEK293T cells based on 4M-BE3, wherein a is a schematic diagram of the mutation locus structure performed based on 4M-BE3, b is a statistical chart of the C-to-T editing efficiency of 7 mutant plasmids and 4M-BE3, BE3 in three loci of HEK293Site2, HEK293Site3 and EMX1, C is a statistical chart of the C-to-T editing efficiency of two combined mutant plasmids and 4M-BE3 on the basis of b, and BE3 in three loci of HEK293Site2, HEK293Site3 and EMX1, D is a statistical chart of the C-to-T editing efficiency of two plasmids 4M-BE3,4M + P A + P199 + P200A-BE 72, 4M + D A + P A + A, the C-BE 72, the C-to-BE 72, and the C-BE 72, C-RNA at the target-off-C-RNA, C A, C A, C A represents the C-to C A, C A, C A, C A, C A, C A represents the;
FIG. 5 is a statistical chart showing the editing capacity of the optimized plasmids 4M + P199A + P200K-BE4,4M + D128K + P199A + P200K-BE4 in HEK293T cells for endogenous gene loci in the present invention; wherein,
a is a structural schematic diagram of 4M + P199A + P200K-BE4,4M + D128K + P199A + P200K-BE 4;
b is a C-to-T editing efficiency statistical chart of 4M + P199A + P200K-BE4,4M + D128K + P199A + P200K-BE4 at HEK293Site 3;
c is a statistical chart of RNA off-target efficiency of 4M + P199A + P200K-BE4,4M + D128K + P199A + P200K-BE4BE 3; c4, C5 represents the position of C at the target site, counted from the PAM distal base.
Detailed Description
Before describing specific embodiments of the present invention at step , it is to be understood that the scope of the present invention is not limited to the specific embodiments described below, and it is to be understood that the terminology used in the examples is for the purpose of describing the specific embodiments and is not intended to be limiting of the scope of the present invention, and that the singular forms "", "" and "the" include the plural forms as used in the specification and the claims unless the context clearly dictates otherwise.
When numerical ranges are given in the examples, it is understood that unless otherwise indicated herein, each numerical range has its two ends and any numbers between the two ends are optional.
Unless otherwise indicated, the experimental methods, detection methods, and preparation methods disclosed herein all employ techniques conventional in the art of molecular biology, biochemistry, chromatin structure and analysis, analytical chemistry, cell culture, recombinant DNA technology, and related arts. These techniques are well described in the literature and may be found in particular in Sambrook et al, Molecular CLONINGG: a LABORATORY MANUAL, Second edition, Cold SpriNGG harbor LABORATORY Press, 1989and Third edition, 2001; ausubel et al, Current PROTOCOLS Inmolecular BIOLOGY, John Wiley & Sons, New York, 1987and pharmaceutical upperes; the seriesMethods IN Enzymogy, Academic Press, San Diego; wolffe, CHROMATIN STRUCTURE ANDFUNCTION, Third edition, Academic Press, San Diego, 1998; (iii) METHODS IN ENZYMOLOGY, Vol.304, Chromatin (P.M.Wassarman and A.P.Wolffe, eds.), Academic Press, SanDiego, 1999; and METHODS IN MOLECULAR BIOLOGY, Vol.119, chromatography Protocols (P.B.Becker, ed.) Humana Press, Totowa, 1999, etc.
Example 1
In this example, the RNA binding site DNA binding site in the APOBEC3G part was subjected to point mutation (including R24A, W94L, Y124A, W127L, D128K, P199A, P199W, P200A, P200K and Q322K), or the APOBEC3G fragment was deleted from the th start codon of APOBEC3G to the 190 th or 197 th position to obtain a truncated APOBEC3G fragment, the amino-terminal of APOBEC3G was truncated to construct C-APOBEC3G-BE3, or the D10Anickase Cas9 part in BE3 was replaced by D10Anickase Cas9(BE4) which expresses more efficiently, respectively.
The related plasmid is shown in figure 1, wherein 4M is R24A, W94L, Y124A and W127 four-point mutation, and A3G (OP) is A3G after optimizing codon.
The construction method of the mutant plasmid used in the examples was as follows: amino acid mutations were introduced into the APOBEC3G portion of the A3G-BE3 plasmid or the A3G-BE4 plasmid by the Mut Express II FastMutagenesis Kit V2(Vazyme, C214-02) (FIG. 1).
The constructed plasmids C-191-BE3, C-198-BE3, 4M-BE3,4M + D128K-BE3, 4M + P199A-BE3, 4M + P199W-BE3, 4M + P200A-BE3, 4M + P200K-BE3,4M + Q322K-BE3, 4M + D128K + P199A + P200A-BE3, A3G-BE4max, 4M + P199A + P200K-BE4max, 4M + D128K + P199A + P200K-BE4max, A3G (OP) + P199A + P200K-BE4max, and the sequence is shown in SEQ ID NO. 1-14.
Example 2
In this example, the APOBEC3G series of tools were used to edit the endogenous gene locus in HEK293T cells, and the editing efficiency and RNA off-target efficiency were examined.
2.1 construction of sgRNA plasmid
Selecting 3 human endogenous gene loci, designing sgRNAs, wherein the positions of 3 sgRNAs in a genome are NC-000015.10: 107422339-107422361; NC _ 000013.11: 87944780-87944802; NC _ 000014.9: 72917055-72917077.
The upstream and downstream sequences of sgRNA were ligated to pGL3-U6-sgRNA (Addgene #51133) vector linearized with BsaI (NEB: R0539L) by programmed (95 ℃,5 min; 95 ℃ -85 ℃ at-2 ℃/s; 85 ℃ -25 ℃ at-0.1 ℃/s; hold at 4 ℃). The polynucleotide sequence used is shown in SEQ ID NO. 15-20. The linearization system is shown below: pGL3-U6-sgRNA 2. mu.g; buffer (NEB: R0539L) 6. mu.L;BsaI 2. mu.L; ddH2O was replenished to 60. mu.L. The cleavage was carried out overnight at 37 ℃. The linking system is as follows: t4 ligation buffer (NEB: M0202L) 1. mu.L, linearized vector 20NGG, annealed oligo fragment (10. mu.M) 5. mu.L, T4 ligase (NEB: M0202L) 0.5. mu.L, ddH2O was replenished to 10. mu.L.16 ℃ and ligated overnight. The connected vector is transformed, selected and identified. The positive clones were shaken to extract the plasmid (Axygene: AP-MN-P-250G) and the concentration was determined.
2.2 culture transfection and recovery of cells
HEK293T cells (purchased from ATCC) were inoculated in DMEM high-sugar medium (HyClone, SH30022.01B) supplemented with 10% FBS, containing 1% Penicillin Streptomycin (v/v) (Gibco). When the cell concentration is 80%, the cell state is recovered to the optimum state by changing the culture medium with 10% serum DMEM and culturing for 2 hours. The amount of plasmid transfected per well was 4. mu.g of APOBEC3G series editing tool plasmid (see FIG. 1) and 2. mu.g of sgRNA plasmid, respectively, prepared in example 1. The plasmids were mixed in 250. mu.l of Opti-MEM (Gibco,11058021) medium, respectively. Mu.l of Lipofectamine 2000 transfection reagent (Thermo,11668019) was mixed into 250. mu.l of Opti-MEM medium and mixed well, and left to stand for 5 minutes. The plasmid-mixed Opti-MEM was added to the plasmid-mixed Opti-MEM mixed with Lipofectamine 2000, gently whipped, mixed well, and allowed to stand for 20 minutes. Opti-MEM mixed with plasmid and Lipofectamine 2000 was added to each 6cm plate (80% concentration of cells in the plate for transfection). 6 hours after transfection, the cells were replaced with 10% FBS in DMEM. After 48 hours of transfection, 5% of the cells with the highest positive rate were sorted out, 5000 cells were used for detecting DNA editing efficiency, and the remaining 50 ten thousand cells were used for extracting RNA for detecting RNA off-target efficiency.
2.3DNA editing efficiency detection
The DNA was first lysed to obtain the genome, the lysate consisted of 50mM KCl, 1.5mM MgCl2, 10mM Tris pH 8.0, 0.5% Nonidet P-40, 0.5% Tween 20, 100g/ml protease K. And carrying out PCR amplification on a sequence near the target, purifying an amplification product, and identifying by using a SaNGGer sequencing method. The amplification system was as follows: 2Xbuffer (Vazyme, P505) 25. mu.L; dNTP 1 u L; f (10 pmol/. mu.L) 1. mu.L; r (10 pmol/. mu.L) 1. mu.L; 1 mu L of template; 0.5. mu.L of DNA polymerase (Vazyme, P505); ddH2O was made up to 50. mu.L. The amplified PCR product was purified by the following steps: adding PCR-A (Axygen: AP-PCR-250G) with three times of volume to pass through the column, centrifuging, and centrifuging at 12000 r/min for 1 min; 700 μ LW2 was added and centrifuged for 1 min; discarding the waste liquid, adding 700 mu LW2, and centrifuging for 1 minute; waste liquid is discarded, and idling is carried out for 1 minute; adding 20 μ L water for elution. The PCR amplification primers used are shown in SEQ ID NO. 21-26. And performing Sanger sequencing on the obtained PCR product by using a PCR amplification one-way primer, and then comparing sequencing results and editing efficiency. The results are shown in FIGS. 2A,3A, and 4.
2.4 detection of RNA off-target Effect efficiency
In RNA detection, Trizol (Vazyme, R401-01) is used for extracting total RNA, and the extraction steps are as follows: adding 1ml of Trizol into each hole, uniformly mixing, collecting, adding 200ul of trichloromethane, fully and uniformly mixing by reversing up and down, centrifuging at the temperature of 4 ℃, and centrifuging at 12000 r/min for 15 minutes; sucking 400ul of supernatant, adding isopropanol with the same volume, reversing the supernatant and uniformly mixing the mixture, centrifuging the mixture at the temperature of 4 ℃, and centrifuging the mixture at 12000 r/min for 10 minutes; discarding the supernatant, adding 1mL of 75% ethanol, reversing the upside down, mixing uniformly, centrifuging at the temperature of 4 ℃, and centrifuging at 12000 r/min for 10 minutes; the supernatant was discarded, air-dried and dissolved in water. 2ug was taken for RNA-seq and the off-target effect was analyzed to obtain the specific off-target (i.e., the number of mutations) and the results are shown in FIGS. 2B,3B, and 5.
From the viewpoint of DNA editing efficiency and RNA off-target efficiency, the APOBEC3G series editing tool has the advantages of high editing efficiency, extremely low off-target efficiency, and even elimination.
In conclusion, the present invention effectively overcomes various disadvantages of the prior art and has high industrial utilization value.
It will be appreciated by those skilled in the art that modifications and variations can be made to the disclosed embodiments without departing from the spirit and scope of the invention, and therefore, is equivalent to modifications and variations that would be apparent to those skilled in the art without departing from the spirit and scope of the invention as disclosed in the appended claims.
SEQUENCE LISTING
<110> Shanghai science and technology university
<120> cytosine base editing tools and application thereof
<160>41
<170>PatentIn version 3.5
<210>1
<211>8430
<212>DNA
<213>Artificial Sequence
<220>
<223>C-191-BE3
<400>1
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat ggagattctc 420
agacactcga tggatccaaa gacattcact ttcaacttta acaatgaacc ttgggtcaga 480
ggacggcatg agacttacct gtgttatgag gtggagcgca tgcacaatga cacctgggtc 540
ctgctgaacc agcgcagggg ctttctatgc aaccaggctc cacataaaca cggtttcctt 600
gaaggccgcc atgcagagct gtgcttcctg gacgtgattc ccttttggaa gctggacctg 660
gaccaggact acagggttac ctgcttcacc tcctggagcc cctgcttcag ctgtgcccag 720
gaaatggcta aattcatttc aaaaaacaaa cacgtgagcc tgtgcatctt cactgcccgc 780
atctatgatg atcaaggaag atgtcaggag gggctgcgca ccctggccga ggctggggcc 840
aaaatttcaa taatgacata cagtgaattt aagcactgct gggacacctt tgtggaccac 900
cagggatgtc ccttccagcc ctgggatgga ctagatgagc acagccaaga cctgagtggg 960
aggctgcggg ccattctcca gaatcaggaa aacagcggca gcgagactcc cgggacctca 1020
gagtccgcca cacccgaaag tgataaaaag tattctattg gtttagccat cggcactaat 1080
tccgttggat gggctgtcat aaccgatgaa tacaaagtac cttcaaagaa atttaaggtg 1140
ttggggaaca cagaccgtca ttcgattaaa aagaatctta tcggtgccct cctattcgat 1200
agtggcgaaa cggcagaggc gactcgcctg aaacgaaccg ctcggagaag gtatacacgt 1260
cgcaagaacc gaatatgtta cttacaagaa atttttagca atgagatggc caaagttgac 1320
gattctttct ttcaccgttt ggaagagtcc ttccttgtcg aagaggacaa gaaacatgaa 1380
cggcacccca tctttggaaa catagtagat gaggtggcat atcatgaaaa gtacccaacg 1440
atttatcacc tcagaaaaaa gctagttgac tcaactgata aagcggacct gaggttaatc 1500
tacttggctc ttgcccatat gataaagttc cgtgggcact ttctcattga gggtgatcta 1560
aatccggaca actcggatgt cgacaaactg ttcatccagt tagtacaaac ctataatcag 1620
ttgtttgaag agaaccctat aaatgcaagt ggcgtggatg cgaaggctat tcttagcgcc 1680
cgcctctcta aatcccgacg gctagaaaac ctgatcgcac aattacccgg agagaagaaa 1740
aatgggttgt tcggtaacct tatagcgctc tcactaggcc tgacaccaaa ttttaagtcg 1800
aacttcgact tagctgaaga tgccaaattg cagcttagta aggacacgta cgatgacgat 1860
ctcgacaatc tactggcaca aattggagat cagtatgcgg acttattttt ggctgccaaa 1920
aaccttagcg atgcaatcct cctatctgac atactgagag ttaatactga gattaccaag 1980
gcgccgttat ccgcttcaat gatcaaaagg tacgatgaac atcaccaaga cttgacactt 2040
ctcaaggccc tagtccgtca gcaactgcct gagaaatata aggaaatatt ctttgatcag 2100
tcgaaaaacg ggtacgcagg ttatattgac ggcggagcga gtcaagagga attctacaag 2160
tttatcaaac ccatattaga gaagatggat gggacggaag agttgcttgt aaaactcaat 2220
cgcgaagatc tactgcgaaa gcagcggact ttcgacaacg gtagcattcc acatcaaatc 2280
cacttaggcg aattgcatgc tatacttaga aggcaggagg atttttatcc gttcctcaaa 2340
gacaatcgtg aaaagattga gaaaatccta acctttcgca taccttacta tgtgggaccc 2400
ctggcccgag ggaactctcg gttcgcatgg atgacaagaa agtccgaaga aacgattact 2460
ccatggaatt ttgaggaagt tgtcgataaa ggtgcgtcag ctcaatcgtt catcgagagg 2520
atgaccaact ttgacaagaa tttaccgaac gaaaaagtat tgcctaagca cagtttactt 2580
tacgagtatt tcacagtgta caatgaactc acgaaagtta agtatgtcac tgagggcatg 2640
cgtaaacccg cctttctaag cggagaacag aagaaagcaa tagtagatct gttattcaag 2700
accaaccgca aagtgacagt taagcaattg aaagaggact actttaagaa aattgaatgc 2760
ttcgattctg tcgagatctc cggggtagaa gatcgattta atgcgtcact tggtacgtat 2820
catgacctcc taaagataat taaagataag gacttcctgg ataacgaaga gaatgaagat 2880
atcttagaag atatagtgtt gactcttacc ctctttgaag atcgggaaat gattgaggaa 2940
agactaaaaa catacgctca cctgttcgac gataaggtta tgaaacagtt aaagaggcgt 3000
cgctatacgg gctggggacg attgtcgcgg aaacttatca acgggataag agacaagcaa 3060
agtggtaaaa ctattctcga ttttctaaag agcgacggct tcgccaatag gaactttatg 3120
cagctgatcc atgatgactc tttaaccttc aaagaggata tacaaaaggc acaggtttcc 3180
ggacaagggg actcattgca cgaacatatt gcgaatcttg ctggttcgcc agccatcaaa 3240
aagggcatac tccagacagt caaagtagtg gatgagctag ttaaggtcat gggacgtcac 3300
aaaccggaaa acattgtaat cgagatggca cgcgaaaatc aaacgactca gaaggggcaa 3360
aaaaacagtc gagagcggat gaagagaata gaagagggta ttaaagaact gggcagccag 3420
atcttaaagg agcatcctgt ggaaaatacc caattgcaga acgagaaact ttacctctat 3480
tacctacaaa atggaaggga catgtatgtt gatcaggaac tggacataaa ccgtttatct 3540
gattacgacg tcgatcacat tgtaccccaa tcctttttga aggacgattc aatcgacaat 3600
aaagtgctta cacgctcgga taagaaccga gggaaaagtg acaatgttcc aagcgaggaa 3660
gtcgtaaaga aaatgaagaa ctattggcgg cagctcctaa atgcgaaact gataacgcaa 3720
agaaagttcg ataacttaac taaagctgag aggggtggct tgtctgaact tgacaaggcc 3780
ggatttatta aacgtcagct cgtggaaacc cgccaaatca caaagcatgt tgcacagata 3840
ctagattccc gaatgaatac gaaatacgac gagaacgata agctgattcg ggaagtcaaa 3900
gtaatcactt taaagtcaaa attggtgtcg gacttcagaa aggattttca attctataaa 3960
gttagggaga taaataacta ccaccatgcg cacgacgctt atcttaatgc cgtcgtaggg 4020
accgcactca ttaagaaata cccgaagcta gaaagtgagt ttgtgtatgg tgattacaaa 4080
gtttatgacg tccgtaagat gatcgcgaaa agcgaacagg agataggcaa ggctacagcc 4140
aaatacttct tttattctaa cattatgaat ttctttaaga cggaaatcac tctggcaaac 4200
ggagagatac gcaaacgacc tttaattgaa accaatgggg agacaggtga aatcgtatgg 4260
gataagggcc gggacttcgc gacggtgaga aaagttttgt ccatgcccca agtcaacata 4320
gtaaagaaaa ctgaggtgca gaccggaggg ttttcaaagg aatcgattct tccaaaaagg 4380
aatagtgata agctcatcgc tcgtaaaaag gactgggacc cgaaaaagta cggtggcttc 4440
gatagcccta cagttgccta ttctgtccta gtagtggcaa aagttgagaa gggaaaatcc 4500
aagaaactga agtcagtcaa agaattattg gggataacga ttatggagcg ctcgtctttt 4560
gaaaagaacc ccatcgactt ccttgaggcg aaaggttaca aggaagtaaa aaaggatctc 4620
ataattaaac taccaaagta tagtctgttt gagttagaaa atggccgaaa acggatgttg 4680
gctagcgccg gagagcttca aaaggggaac gaactcgcac taccgtctaa atacgtgaat 4740
ttcctgtatt tagcgtccca ttacgagaag ttgaaaggtt cacctgaaga taacgaacag 4800
aagcaacttt ttgttgagca gcacaaacat tatctcgacg aaatcataga gcaaatttcg 4860
gaattcagta agagagtcat cctagctgat gccaatctgg acaaagtatt aagcgcatac 4920
aacaagcaca gggataaacc catacgtgag caggcggaaa atattatcca tttgtttact 4980
cttaccaacc tcggcgctcc agccgcattc aagtattttg acacaacgat agatcgcaaa 5040
cgatacactt ctaccaagga ggtgctagac gcgacactga ttcaccaatc catcacggga 5100
ttatatgaaa ctcggataga tttgtcacag cttgggggtg actctggtgg ttctactaat 5160
ctgtcagata ttattgaaaa ggagaccggt aagcaactgg ttatccagga atccatcctc 5220
atgctcccag aggaggtgga agaagtcatt gggaacaagc cggaaagcga tatactcgtg 5280
cacaccgcct acgacgagag caccgacgag aatgtcatgc ttctgactag cgacgcccct 5340
gaatacaagc cttgggctct ggtcatacag gatagcaacg gtgagaacaa gattaagatg 5400
ctctctggtg gttctcccaa gaagaagagg aaagtctaac cggtcatcat caccatcacc 5460
attgagttta aacccgctga tcagcctcga ctgtgccttc tagttgccag ccatctgttg 5520
tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc cactcccact gtcctttcct 5580
aataaaatga ggaaattgca tcgcattgtc tgagtaggtg tcattctatt ctggggggtg 5640
gggtggggca ggacagcaag ggggaggatt gggaagacaa tagcaggcat gctggggatg 5700
cggtgggctc tatggcttct gaggcggaaa gaaccagctg gggctcgata ccgtcgacct 5760
ctagctagag cttggcgtaa tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc 5820
tcacaattcc acacaacata cgagccggaa gcataaagtg taaagcctag ggtgcctaat 5880
gagtgagcta actcacatta attgcgttgc gctcactgcc cgctttccag tcgggaaacc 5940
tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg 6000
ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag 6060
cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag 6120
gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc 6180
tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc 6240
agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc 6300
tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt 6360
cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg 6420
ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat 6480
ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag 6540
ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt 6600
ggtggcctaa ctacggctac actagaagaa cagtatttgg tatctgcgct ctgctgaagc 6660
cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta 6720
gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag 6780
atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga 6840
ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa 6900
gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa 6960
tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc 7020
ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga 7080
taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa 7140
gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt 7200
gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg 7260
ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc 7320
aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg 7380
gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag 7440
cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt 7500
actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt 7560
caatacggga taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac 7620
gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac 7680
ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag 7740
caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa 7800
tactcatact cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga 7860
gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc 7920
cccgaaaagt gccacctgac gtcgacggat cgggagatcg atctcccgat cccctagggt 7980
cgactctcag tacaatctgc tctgatgccg catagttaag ccagtatctg ctccctgctt 8040
gtgtgttgga ggtcgctgag tagtgcgcga gcaaaattta agctacaaca aggcaaggct 8100
tgaccgacaa ttgcatgaag aatctgctta gggttaggcg ttttgcgctg cttcgcgatg 8160
tacgggccag atatacgcgt tgacattgat tattgactag ttattaatag taatcaatta 8220
cggggtcatt agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg 8280
gcccgcctgg ctgaccgccc aacgaccccc gcccattgac gtcaataatg acgtatgttc 8340
ccatagtaac gccaataggg actttccatt gacgtcaatg ggtggagtat ttacggtaaa 8400
ctgcccactt ggcagtacat caagtgtatc 8430
<210>2
<211>8409
<212>DNA
<213>Artificial Sequence
<220>
<223>C-198-BE3
<400>2
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat ggatccaaag 420
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 480
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 540
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 600
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 660
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 720
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 780
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 840
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 900
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 960
aatcaggaaa acagcggcag cgagactccc gggacctcag agtccgccac acccgaaagt 1020
gataaaaagt attctattgg tttagccatc ggcactaatt ccgttggatg ggctgtcata 1080
accgatgaat acaaagtacc ttcaaagaaa tttaaggtgt tggggaacac agaccgtcat 1140
tcgattaaaa agaatcttat cggtgccctc ctattcgata gtggcgaaac ggcagaggcg 1200
actcgcctga aacgaaccgc tcggagaagg tatacacgtc gcaagaaccg aatatgttac 1260
ttacaagaaa tttttagcaa tgagatggcc aaagttgacg attctttctt tcaccgtttg 1320
gaagagtcct tccttgtcga agaggacaag aaacatgaac ggcaccccat ctttggaaac 1380
atagtagatg aggtggcata tcatgaaaag tacccaacga tttatcacct cagaaaaaag 1440
ctagttgact caactgataa agcggacctg aggttaatct acttggctct tgcccatatg 1500
ataaagttcc gtgggcactt tctcattgag ggtgatctaa atccggacaa ctcggatgtc 1560
gacaaactgt tcatccagtt agtacaaacc tataatcagt tgtttgaaga gaaccctata 1620
aatgcaagtg gcgtggatgc gaaggctatt cttagcgccc gcctctctaa atcccgacgg 1680
ctagaaaacc tgatcgcaca attacccgga gagaagaaaa atgggttgtt cggtaacctt 1740
atagcgctct cactaggcct gacaccaaat tttaagtcga acttcgactt agctgaagat 1800
gccaaattgc agcttagtaa ggacacgtac gatgacgatc tcgacaatct actggcacaa 1860
attggagatc agtatgcgga cttatttttg gctgccaaaa accttagcga tgcaatcctc 1920
ctatctgaca tactgagagt taatactgag attaccaagg cgccgttatc cgcttcaatg 1980
atcaaaaggt acgatgaaca tcaccaagac ttgacacttc tcaaggccct agtccgtcag 2040
caactgcctg agaaatataa ggaaatattc tttgatcagt cgaaaaacgg gtacgcaggt 2100
tatattgacg gcggagcgag tcaagaggaa ttctacaagt ttatcaaacc catattagag 2160
aagatggatg ggacggaaga gttgcttgta aaactcaatc gcgaagatct actgcgaaag 2220
cagcggactt tcgacaacgg tagcattcca catcaaatcc acttaggcga attgcatgct 2280
atacttagaa ggcaggagga tttttatccg ttcctcaaag acaatcgtga aaagattgag 2340
aaaatcctaa cctttcgcat accttactat gtgggacccc tggcccgagg gaactctcgg 2400
ttcgcatgga tgacaagaaa gtccgaagaa acgattactc catggaattt tgaggaagtt 2460
gtcgataaag gtgcgtcagc tcaatcgttc atcgagagga tgaccaactt tgacaagaat 2520
ttaccgaacg aaaaagtatt gcctaagcac agtttacttt acgagtattt cacagtgtac 2580
aatgaactca cgaaagttaa gtatgtcact gagggcatgc gtaaacccgc ctttctaagc 2640
ggagaacaga agaaagcaat agtagatctg ttattcaaga ccaaccgcaa agtgacagtt 2700
aagcaattga aagaggacta ctttaagaaa attgaatgct tcgattctgt cgagatctcc 2760
ggggtagaag atcgatttaa tgcgtcactt ggtacgtatc atgacctcct aaagataatt 2820
aaagataagg acttcctgga taacgaagag aatgaagata tcttagaaga tatagtgttg 2880
actcttaccc tctttgaaga tcgggaaatg attgaggaaa gactaaaaac atacgctcac 2940
ctgttcgacg ataaggttat gaaacagtta aagaggcgtc gctatacggg ctggggacga 3000
ttgtcgcgga aacttatcaa cgggataaga gacaagcaaa gtggtaaaac tattctcgat 3060
tttctaaaga gcgacggctt cgccaatagg aactttatgc agctgatcca tgatgactct 3120
ttaaccttca aagaggatat acaaaaggca caggtttccg gacaagggga ctcattgcac 3180
gaacatattg cgaatcttgc tggttcgcca gccatcaaaa agggcatact ccagacagtc 3240
aaagtagtgg atgagctagt taaggtcatg ggacgtcaca aaccggaaaa cattgtaatc 3300
gagatggcac gcgaaaatca aacgactcag aaggggcaaa aaaacagtcg agagcggatg 3360
aagagaatag aagagggtat taaagaactg ggcagccaga tcttaaagga gcatcctgtg 3420
gaaaataccc aattgcagaa cgagaaactt tacctctatt acctacaaaa tggaagggac 3480
atgtatgttg atcaggaact ggacataaac cgtttatctg attacgacgt cgatcacatt 3540
gtaccccaat cctttttgaa ggacgattca atcgacaata aagtgcttac acgctcggat 3600
aagaaccgag ggaaaagtga caatgttcca agcgaggaag tcgtaaagaa aatgaagaac 3660
tattggcggc agctcctaaa tgcgaaactg ataacgcaaa gaaagttcga taacttaact 3720
aaagctgaga ggggtggctt gtctgaactt gacaaggccg gatttattaa acgtcagctc 3780
gtggaaaccc gccaaatcac aaagcatgtt gcacagatac tagattcccg aatgaatacg 3840
aaatacgacg agaacgataa gctgattcgg gaagtcaaag taatcacttt aaagtcaaaa 3900
ttggtgtcgg acttcagaaa ggattttcaa ttctataaag ttagggagat aaataactac 3960
caccatgcgc acgacgctta tcttaatgcc gtcgtaggga ccgcactcat taagaaatac 4020
ccgaagctag aaagtgagtt tgtgtatggt gattacaaag tttatgacgt ccgtaagatg 4080
atcgcgaaaa gcgaacagga gataggcaag gctacagcca aatacttctt ttattctaac 4140
attatgaatt tctttaagac ggaaatcact ctggcaaacg gagagatacg caaacgacct 4200
ttaattgaaa ccaatgggga gacaggtgaa atcgtatggg ataagggccg ggacttcgcg 4260
acggtgagaa aagttttgtc catgccccaa gtcaacatag taaagaaaac tgaggtgcag 4320
accggagggt tttcaaagga atcgattctt ccaaaaagga atagtgataa gctcatcgct 4380
cgtaaaaagg actgggaccc gaaaaagtac ggtggcttcg atagccctac agttgcctat 4440
tctgtcctag tagtggcaaa agttgagaag ggaaaatcca agaaactgaa gtcagtcaaa 4500
gaattattgg ggataacgat tatggagcgc tcgtcttttg aaaagaaccc catcgacttc 4560
cttgaggcga aaggttacaa ggaagtaaaa aaggatctca taattaaact accaaagtat 4620
agtctgtttg agttagaaaa tggccgaaaa cggatgttgg ctagcgccgg agagcttcaa 4680
aaggggaacg aactcgcact accgtctaaa tacgtgaatt tcctgtattt agcgtcccat 4740
tacgagaagt tgaaaggttc acctgaagat aacgaacaga agcaactttt tgttgagcag 4800
cacaaacatt atctcgacga aatcatagag caaatttcgg aattcagtaa gagagtcatc 4860
ctagctgatg ccaatctgga caaagtatta agcgcataca acaagcacag ggataaaccc 4920
atacgtgagc aggcggaaaa tattatccat ttgtttactc ttaccaacct cggcgctcca 4980
gccgcattca agtattttga cacaacgata gatcgcaaac gatacacttc taccaaggag 5040
gtgctagacg cgacactgat tcaccaatcc atcacgggat tatatgaaac tcggatagat 5100
ttgtcacagc ttgggggtga ctctggtggt tctactaatc tgtcagatat tattgaaaag 5160
gagaccggta agcaactggt tatccaggaa tccatcctca tgctcccaga ggaggtggaa 5220
gaagtcattg ggaacaagcc ggaaagcgat atactcgtgc acaccgccta cgacgagagc 5280
accgacgaga atgtcatgcttctgactagc gacgcccctg aatacaagcc ttgggctctg 5340
gtcatacagg atagcaacgg tgagaacaag attaagatgc tctctggtgg ttctcccaag 5400
aagaagagga aagtctaacc ggtcatcatc accatcacca ttgagtttaa acccgctgat 5460
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 5520
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 5580
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 5640
gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atggcttctg 5700
aggcggaaag aaccagctgg ggctcgatac cgtcgacctc tagctagagc ttggcgtaat 5760
catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatac 5820
gagccggaag cataaagtgt aaagcctagg gtgcctaatg agtgagctaa ctcacattaa 5880
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 5940
gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc 6000
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 6060
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 6120
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 6180
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 6240
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 6300
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 6360
atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 6420
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 6480
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 6540
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 6600
ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 6660
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 6720
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 6780
ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 6840
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 6900
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 6960
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 7020
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 7080
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 7140
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 7200
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 7260
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 7320
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 7380
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 7440
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 7500
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 7560
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 7620
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 7680
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 7740
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 7800
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 7860
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 7920
tcgacggatc gggagatcga tctcccgatc ccctagggtc gactctcagt acaatctgct 7980
ctgatgccgc atagttaagc cagtatctgc tccctgcttg tgtgttggag gtcgctgagt 8040
agtgcgcgag caaaatttaa gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga 8100
atctgcttag ggttaggcgt tttgcgctgc ttcgcgatgt acgggccaga tatacgcgtt 8160
gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 8220
catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 8280
acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 8340
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 8400
aagtgtatc 8409
<210>3
<211>8997
<212>DNA
<213>Artificial Sequence
<220>
<223>4M-BE3
<400>3
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttccttg acccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atccacccac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210>4
<211>8997
<212>DNA
<213>Artificial Sequence
<220>
<223>4M+D128K-BE3
<400>4
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttcctta agccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atccacccac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcgaaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210>5
<211>8997
<212>DNA
<213>Artificial Sequence
<220>
<223>4M+P199A-BE3
<400>5
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttccttg acccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atgcccccac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210>6
<211>8997
<212>DNA
<213>Artificial Sequence
<220>
<223>4M+P199W-BE3
<400>6
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttccttg acccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg attggcccac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaattctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210>7
<211>8997
<212>DNA
<213>Artificial Sequence
<220>
<223>4M+P200A-BE3
<400>7
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttccttg acccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atccagccac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctttctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggagaggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210>8
<211>8997
<212>DNA
<213>Artificial Sequence
<220>
<223>4M+P200K-BE3
<400>8
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttccttg acccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atccaaagac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210>9
<211>8997
<212>DNA
<213>Artificial Sequence
<220>
<223>4M+Q322K-BE3
<400>9
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttccttg acccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atccacccac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg taaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210>10
<211>8997
<212>DNA
<213>Artificial Sequence
<220>
<223>4M+D128K+P199A+P200A-BE3
<400>10
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttcctta agccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atgccaagac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210>11
<211>9429
<212>DNA
<213>Artificial Sequence
<220>
<223>A3G-BE4max
<400>11
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaaacggaca 420
gccgacggaa gcgagttcga gtcaccaaag aagaagcgga aagtcatgaa gcctcacttc 480
agaaacacag tggagcgaat gtatcgagac acattctcct acaactttta taatgcaccc 540
atcctttctc gtcggaatac cgtctggctg tgctacgaag tgaaaacaaa gggtccctca 600
aggccccctt tggacgcaaa gatctttcga ggccaggtgt attccgaact taagtaccac 660
ccagagatga gattcttcca ctggttcagc aagtggagga agctgcatcg tgaccaggag 720
tatgaggtca cctggtacat atcctggagc ccctgcacaa agtgtacaag ggatatggcc 780
acgttcctgg ccgaggaccc gaaggttacc ctgaccatct ttgttgcccg cctctactac 840
ttctgggacc cagattacca ggaggcgctt cgcagcctgt gtcagaaaag agacggtccg 900
cgtgccacca tgaagatcat gaattatgac gaatttcagc actgttggag caagttcgtg 960
tacagccaaa gagagctatt tgagccttgg aataatctgc ctaaatatta tatattactg 1020
cacatcatgc tgggggagat tctcagacac tcgatggatc cacccacatt cactttcaac 1080
tttaacaatg aaccttgggt cagaggacgg catgagactt acctgtgtta tgaggtggag 1140
cgcatgcaca atgacacctg ggtcctgctg aaccagcgca ggggctttct atgcaaccag 1200
gctccacata aacacggtttccttgaaggc cgccatgcag agctgtgctt cctggacgtg 1260
attccctttt ggaagctgga cctggaccag gactacaggg ttacctgctt cacctcctgg 1320
agcccctgct tcagctgtgc ccaggaaatg gctaaattca tttcaaaaaa caaacacgtg 1380
agcctgtgca tcttcactgc ccgcatctat gatgatcaag gaagatgtca ggaggggctg 1440
cgcaccctgg ccgaggctgg ggccaaaatt tcaataatga catacagtga atttaagcac 1500
tgctgggaca cctttgtgga ccaccaggga tgtcccttcc agccctggga tggactagat 1560
gagcacagcc aagacctgag tgggaggctg cgggccattc tccagaatca ggaaaactct 1620
ggaggatcta gcggaggatc ctctggcagc gagacaccag gaacaagcga gtcagcaaca 1680
ccagagagca gtggcggcag cagcggcggc agcgacaaga agtacagcat cggcctggcc 1740
atcggcacca actctgtggg ctgggccgtg atcaccgacg agtacaaggt gcccagcaag 1800
aaattcaagg tgctgggcaa caccgaccgg cacagcatca agaagaacct gatcggagcc 1860
ctgctgttcg acagcggcga aacagccgag gccacccggc tgaagagaac cgccagaaga 1920
agatacacca gacggaagaa ccggatctgc tatctgcaag agatcttcag caacgagatg 1980
gccaaggtgg acgacagctt cttccacaga ctggaagagt ccttcctggt ggaagaggat 2040
aagaagcacg agcggcaccc catcttcggc aacatcgtgg acgaggtggc ctaccacgag 2100
aagtacccca ccatctacca cctgagaaag aaactggtgg acagcaccga caaggccgac 2160
ctgcggctga tctatctggc cctggcccac atgatcaagt tccggggcca cttcctgatc 2220
gagggcgacc tgaaccccga caacagcgac gtggacaagc tgttcatcca gctggtgcag 2280
acctacaacc agctgttcga ggaaaacccc atcaacgcca gcggcgtgga cgccaaggcc 2340
atcctgtctg ccagactgag caagagcaga cggctggaaa atctgatcgc ccagctgccc 2400
ggcgagaaga agaatggcct gttcggaaac ctgattgccc tgagcctggg cctgaccccc 2460
aacttcaaga gcaacttcga cctggccgag gatgccaaac tgcagctgag caaggacacc 2520
tacgacgacg acctggacaa cctgctggcc cagatcggcg accagtacgc cgacctgttt 2580
ctggccgcca agaacctgtc cgacgccatc ctgctgagcg acatcctgag agtgaacacc 2640
gagatcacca aggcccccct gagcgcctct atgatcaaga gatacgacga gcaccaccag 2700
gacctgaccc tgctgaaagc tctcgtgcgg cagcagctgc ctgagaagta caaagagatt 2760
ttcttcgacc agagcaagaa cggctacgcc ggctacattg acggcggagc cagccaggaa 2820
gagttctaca agttcatcaa gcccatcctg gaaaagatgg acggcaccga ggaactgctc 2880
gtgaagctga acagagagga cctgctgcgg aagcagcgga ccttcgacaa cggcagcatc 2940
ccccaccaga tccacctggg agagctgcac gccattctgc ggcggcagga agatttttac 3000
ccattcctga aggacaaccg ggaaaagatc gagaagatcc tgaccttccg catcccctac 3060
tacgtgggcc ctctggccag gggaaacagc agattcgcct ggatgaccag aaagagcgag 3120
gaaaccatca ccccctggaa cttcgaggaa gtggtggaca agggcgcttc cgcccagagc 3180
ttcatcgagc ggatgaccaa cttcgataag aacctgccca acgagaaggt gctgcccaag 3240
cacagcctgc tgtacgagta cttcaccgtg tataacgagc tgaccaaagt gaaatacgtg 3300
accgagggaa tgagaaagcc cgccttcctg agcggcgagc agaaaaaggc catcgtggac 3360
ctgctgttca agaccaaccg gaaagtgacc gtgaagcagc tgaaagagga ctacttcaag 3420
aaaatcgagt gcttcgactc cgtggaaatc tccggcgtgg aagatcggtt caacgcctcc 3480
ctgggcacat accacgatct gctgaaaatt atcaaggaca aggacttcct ggacaatgag 3540
gaaaacgagg acattctgga agatatcgtg ctgaccctga cactgtttga ggacagagag 3600
atgatcgagg aacggctgaa aacctatgcc cacctgttcg acgacaaagt gatgaagcag 3660
ctgaagcggc ggagatacac cggctggggc aggctgagcc ggaagctgat caacggcatc 3720
cgggacaagc agtccggcaa gacaatcctg gatttcctga agtccgacgg cttcgccaac 3780
agaaacttca tgcagctgat ccacgacgac agcctgacct ttaaagagga catccagaaa 3840
gcccaggtgt ccggccaggg cgatagcctg cacgagcaca ttgccaatct ggccggcagc 3900
cccgccatta agaagggcat cctgcagaca gtgaaggtgg tggacgagct cgtgaaagtg 3960
atgggccggc acaagcccga gaacatcgtg atcgaaatgg ccagagagaa ccagaccacc 4020
cagaagggac agaagaacag ccgcgagaga atgaagcgga tcgaagaggg catcaaagag 4080
ctgggcagcc agatcctgaa agaacacccc gtggaaaaca cccagctgca gaacgagaag 4140
ctgtacctgt actacctgca gaatgggcgg gatatgtacg tggaccagga actggacatc 4200
aaccggctgt ccgactacga tgtggaccat atcgtgcctc agagctttct gaaggacgac 4260
tccatcgaca acaaggtgct gaccagaagc gacaagaacc ggggcaagag cgacaacgtg 4320
ccctccgaag aggtcgtgaa gaagatgaag aactactggc ggcagctgct gaacgccaag 4380
ctgattaccc agagaaagtt cgacaatctg accaaggccg agagaggcgg cctgagcgaa 4440
ctggataagg ccggcttcat caagagacag ctggtggaaa cccggcagat tacaaagcac 4500
gtggcacaga tcctggactc ccggatgaac actaagtacg acgagaatga caagctgatc 4560
cgggaagtga aagtgatcac cctgaagtcc aagctggtgt ccgatttccg gaaggatttc 4620
cagttttaca aagtgcgcga gatcaacaac taccaccacg cccacgacgc ctacctaaac 4680
gccgtcgtgg gaaccgcact gatcaaaaag taccctaagc tggaaagcga gttcgtgtac 4740
ggcgactaca aggtgtacga cgtgcggaag atgatcgcca agagcgagca ggaaatcggc 4800
aaggctaccg ccaagtactt cttctacagc aacatcatga actttttcaa gaccgagatt 4860
accctggcca acggcgagat ccggaagcgg cctctgatcg agacaaacgg cgaaaccggg 4920
gagatcgtgt gggataaggg ccgggatttt gccaccgtgc ggaaagtgct gagcatgccc 4980
caagtgaata tcgtgaaaaa gaccgaggtg cagacaggcg gcttcagcaa agagtctatc 5040
agacccaaga ggaacagcga taagctgatc gccagaaaga aggactggga ccctaagaag 5100
tacggcggct tcgtgagccc caccgtggcc tattctgtgc tggtggtggc caaagtggaa 5160
aagggcaagt ccaagaaact gaagagtgtg aaagagctgc tggggatcac catcatggaa 5220
agaagcagct tcgagaagaa tcccatcgac tttctggaag ccaagggcta caaagaagtg 5280
aaaaaggacc tgatcatcaa gctgcctaag tactccctgt tcgagctgga aaacggccgg 5340
aagagaatgc tggcctctgc cagattcctg cagaagggaa acgaactggc cctgccctcc 5400
aaatatgtga acttcctgta cctggccagc cactatgaga agctgaaggg ctcccccgag 5460
gataatgagc agaaacagct gtttgtggaa cagcacaagc actacctgga cgagatcatc 5520
gagcagatca gcgagttctc caagagagtg atcctggccg acgctaatct ggacaaagtg 5580
ctgtccgcct acaacaagca ccgggataag cccatcagag agcaggccga gaatatcatc 5640
cacctgttta ccctgaccaa tctgggagcc cctagagcct tcaagtactt tgacaccacc 5700
atcgaccgga aggtgtacag aagcaccaaa gaggtgctgg acgccaccct gatccaccag 5760
agcatcaccg gcctgtacga gacacggatc gacctgtctc agctgggagg tgacagcggc 5820
gggagcggcg ggagcggggg gagcactaat ctgagcgaca tcattgagaa ggagactggg 5880
aaacagctgg tcattcagga gtccatcctg atgctgcctg aggaggtgga ggaagtgatc 5940
ggcaacaagc cagagtctga catcctggtg cacaccgcct acgacgagtc cacagatgag 6000
aatgtgatgc tgctgacctc tgacgccccc gagtataagc cttgggccct ggtcatccag 6060
gattctaacg gcgagaataa gatcaagatg ctgagcggag gatccggagg atctggaggc 6120
agcaccaacc tgtctgacat catcgagaag gagacaggca agcagctggt catccaggag 6180
agcatcctga tgctgcccga agaagtcgaa gaagtgatcg gaaacaagcc tgagagcgat 6240
atcctggtcc ataccgccta cgacgagagt accgacgaaa atgtgatgct gctgacatcc 6300
gacgccccag agtataagcc ctgggctctg gtcatccagg attccaacgg agagaacaaa 6360
atcaaaatgc tgtctggcgg ctcaaaaaga accgccgacg gcagcgaatt cgagcccaag 6420
aagaagagga aagtctaacc ggtcatcatc accatcacca ttgagtttaa acccgctgat 6480
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 6540
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 6600
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 6660
gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atggcttctg 6720
aggcggaaag aaccagctgg ggctcgatac cgtcgacctc tagctagagc ttggcgtaat 6780
catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatac 6840
gagccggaag cataaagtgt aaagcctagg atgcctaatg agtgagctaa ctcacattaa 6900
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 6960
gaatcggcca acgcgcggga agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc 7020
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 7080
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 7140
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 7200
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 7260
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 7320
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 7380
atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 7440
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 7500
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 7560
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 7620
ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 7680
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 7740
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 7800
ggtctgacac tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 7860
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 7920
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 7980
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 8040
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 8100
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 8160
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 8220
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 8280
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 8340
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 8400
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 8460
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 8520
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 8580
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 8640
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 8700
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 8760
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 8820
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 8880
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 8940
tcgacggatc gggagatcga tctcccgatc ccctagggtc gactctcagt acaatctgct 9000
ctgatgccgc atagttaagc cagtatctgc tccctgcttg tgtgttggag gtcgctgagt 9060
agtgcgcgag caaaatttaa gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga 9120
atctgcttag ggttaggcgt tttgcgctgc ttcgcgatgt acgggccaga tatacgcgtt 9180
gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 9240
catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 9300
acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 9360
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 9420
aagtgtatc 9429
<210>12
<211>9429
<212>DNA
<213>Artificial Sequence
<220>
<223>4M+P199A+P200K-BE4max
<400>12
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaaacggaca 420
gccgacggaa gcgagttcga gtcaccaaag aagaagcgga aagtcatgaa gcctcacttc 480
agaaacacag tggagcgaat gtatcgagac acattctcct acaactttta taatgcaccc 540
atcctttctc gtcggaatac cgtctggctg tgctacgaag tgaaaacaaa gggtccctca 600
aggccccctt tggacgcaaa gatctttcga ggccaggtgt attccgaact taagtaccac 660
ccagagatga gattcttcca ctggttcagc aagtggagga agctgcatcg tgaccaggag 720
tatgaggtca cctggtacat atccttgagc ccctgcacaa agtgtacaag ggatatggcc 780
acgttcctgg ccgaggaccc gaaggttacc ctgaccatct ttgttgcccg cctcgcctac 840
ttccttgacc cagattacca ggaggcgctt cgcagcctgt gtcagaaaag agacggtccg 900
cgtgccacca tgaagatcat gaattatgac gaatttcagc actgttggag caagttcgtg 960
tacagccaaa gagagctatt tgagccttgg aataatctgc ctaaatatta tatattactg 1020
cacatcatgc tgggggagat tctcagacac tcgatggatg ccaagacatt cactttcaac 1080
tttaacaatg aaccttgggt cagaggacgg catgagactt acctgtgtta tgaggtggag 1140
cgcatgcaca atgacacctg ggtcctgctg aaccagcgca ggggctttct atgcaaccag 1200
gctccacata aacacggttt ccttgaaggc cgccatgcag agctgtgctt cctggacgtg 1260
attccctttt ggaagctgga cctggaccag gactacaggg ttacctgctt cacctcctgg 1320
agcccctgct tcagctgtgc ccaggaaatg gctaaattca tttcaaaaaa caaacacgtg 1380
agcctgtgca tcttcactgc ccgcatctat gatgatcaag gaagatgtca ggaggggctg 1440
cgcaccctgg ccgaggctgg ggccaaaatt tcaataatga catacagtga atttaagcac 1500
tgctgggaca cctttgtgga ccaccaggga tgtcccttcc agccctggga tggactagat 1560
gagcacagcc aagacctgag tgggaggctg cgggccattc tccagaatca ggaaaactct 1620
ggaggatcta gcggaggatc ctctggcagc gagacaccag gaacaagcga gtcagcaaca 1680
ccagagagca gtggcggcag cagcggcggc agcgacaaga agtacagcat cggcctggcc 1740
atcggcacca actctgtggg ctgggccgtg atcaccgacg agtacaaggt gcccagcaag 1800
aaattcaagg tgctgggcaa caccgaccgg cacagcatca agaagaacct gatcggagcc 1860
ctgctgttcg acagcggcga aacagccgag gccacccggc tgaagagaac cgccagaaga 1920
agatacacca gacggaagaa ccggatctgc tatctgcaag agatcttcag caacgagatg 1980
gccaaggtgg acgacagctt cttccacaga ctggaagagt ccttcctggt ggaagaggat 2040
aagaagcacg agcggcaccc catcttcggc aacatcgtgg acgaggtggc ctaccacgag 2100
aagtacccca ccatctacca cctgagaaag aaactggtgg acagcaccga caaggccgac 2160
ctgcggctga tctatctggc cctggcccac atgatcaagt tccggggcca cttcctgatc 2220
gagggcgacc tgaaccccga caacagcgac gtggacaagc tgttcatcca gctggtgcag 2280
acctacaacc agctgttcga ggaaaacccc atcaacgcca gcggcgtgga cgccaaggcc 2340
atcctgtctg ccagactgag caagagcaga cggctggaaa atctgatcgc ccagctgccc 2400
ggcgagaaga agaatggcctgttcggaaac ctgattgccc tgagcctggg cctgaccccc 2460
aacttcaaga gcaacttcga cctggccgag gatgccaaac tgcagctgag caaggacacc 2520
tacgacgacg acctggacaa cctgctggcc cagatcggcg accagtacgc cgacctgttt 2580
ctggccgcca agaacctgtc cgacgccatc ctgctgagcg acatcctgag agtgaacacc 2640
gagatcacca aggcccccct gagcgcctct atgatcaaga gatacgacga gcaccaccag 2700
gacctgaccc tgctgaaagc tctcgtgcgg cagcagctgc ctgagaagta caaagagatt 2760
ttcttcgacc agagcaagaa cggctacgcc ggctacattg acggcggagc cagccaggaa 2820
gagttctaca agttcatcaa gcccatcctg gaaaagatgg acggcaccga ggaactgctc 2880
gtgaagctga acagagagga cctgctgcgg aagcagcgga ccttcgacaa cggcagcatc 2940
ccccaccaga tccacctggg agagctgcac gccattctgc ggcggcagga agatttttac 3000
ccattcctga aggacaaccg ggaaaagatc gagaagatcc tgaccttccg catcccctac 3060
tacgtgggcc ctctggccag gggaaacagc agattcgcct ggatgaccag aaagagcgag 3120
gaaaccatca ccccctggaa cttcgaggaa gtggtggaca agggcgcttc cgcccagagc 3180
ttcatcgagc ggatgaccaa cttcgataag aacctgccca acgagaaggt gctgcccaag 3240
cacagcctgc tgtacgagta cttcaccgtg tataacgagc tgaccaaagt gaaatacgtg 3300
accgagggaa tgagaaagcc cgccttcctg agcggcgagc agaaaaaggc catcgtggac 3360
ctgctgttca agaccaaccg gaaagtgacc gtgaagcagc tgaaagagga ctacttcaag 3420
aaaatcgagt gcttcgactc cgtggaaatc tccggcgtgg aagatcggtt caacgcctcc 3480
ctgggcacat accacgatct gctgaaaatt atcaaggaca aggacttcct ggacaatgag 3540
gaaaacgagg acattctgga agatatcgtg ctgaccctga cactgtttga ggacagagag 3600
atgatcgagg aacggctgaa aacctatgcc cacctgttcg acgacaaagt gatgaagcag 3660
ctgaagcggc ggagatacac cggctggggc aggctgagcc ggaagctgat caacggcatc 3720
cgggacaagc agtccggcaa gacaatcctg gatttcctga agtccgacgg cttcgccaac 3780
agaaacttca tgcagctgat ccacgacgac agcctgacct ttaaagagga catccagaaa 3840
gcccaggtgt ccggccaggg cgatagcctg cacgagcaca ttgccaatct ggccggcagc 3900
cccgccatta agaagggcat cctgcagaca gtgaaggtgg tggacgagct cgtgaaagtg 3960
atgggccggc acaagcccga gaacatcgtg atcgaaatgg ccagagagaa ccagaccacc 4020
cagaagggac agaagaacag ccgcgagaga atgaagcgga tcgaagaggg catcaaagag 4080
ctgggcagcc agatcctgaa agaacacccc gtggaaaaca cccagctgca gaacgagaag 4140
ctgtacctgt actacctgca gaatgggcgg gatatgtacg tggaccagga actggacatc 4200
aaccggctgt ccgactacga tgtggaccat atcgtgcctc agagctttct gaaggacgac 4260
tccatcgaca acaaggtgct gaccagaagc gacaagaacc ggggcaagag cgacaacgtg 4320
ccctccgaag aggtcgtgaa gaagatgaag aactactggc ggcagctgct gaacgccaag 4380
ctgattaccc agagaaagtt cgacaatctg accaaggccg agagaggcgg cctgagcgaa 4440
ctggataagg ccggcttcat caagagacag ctggtggaaa cccggcagat tacaaagcac 4500
gtggcacaga tcctggactc ccggatgaac actaagtacg acgagaatga caagctgatc 4560
cgggaagtga aagtgatcac cctgaagtcc aagctggtgt ccgatttccg gaaggatttc 4620
cagttttaca aagtgcgcga gatcaacaac taccaccacg cccacgacgc ctacctaaac 4680
gccgtcgtgg gaaccgcact gatcaaaaag taccctaagc tggaaagcga gttcgtgtac 4740
ggcgactaca aggtgtacga cgtgcggaag atgatcgcca agagcgagca ggaaatcggc 4800
aaggctaccg ccaagtactt cttctacagc aacatcatga actttttcaa gaccgagatt 4860
accctggcca acggcgagat ccggaagcgg cctctgatcg agacaaacgg cgaaaccggg 4920
gagatcgtgt gggataaggg ccgggatttt gccaccgtgc ggaaagtgct gagcatgccc 4980
caagtgaata tcgtgaaaaa gaccgaggtg cagacaggcg gcttcagcaa agagtctatc 5040
agacccaaga ggaacagcga taagctgatc gccagaaaga aggactggga ccctaagaag 5100
tacggcggct tcgtgagccc caccgtggcc tattctgtgc tggtggtggc caaagtggaa 5160
aagggcaagt ccaagaaact gaagagtgtg aaagagctgc tggggatcac catcatggaa 5220
agaagcagct tcgagaagaa tcccatcgac tttctggaag ccaagggcta caaagaagtg 5280
aaaaaggacc tgatcatcaa gctgcctaag tactccctgt tcgagctgga aaacggccgg 5340
aagagaatgc tggcctctgc cagattcctg cagaagggaa acgaactggc cctgccctcc 5400
aaatatgtga acttcctgta cctggccagc cactatgaga agctgaaggg ctcccccgag 5460
gataatgagc agaaacagct gtttgtggaa cagcacaagc actacctgga cgagatcatc 5520
gagcagatca gcgagttctc caagagagtg atcctggccg acgctaatct ggacaaagtg 5580
ctgtccgcct acaacaagca ccgggataag cccatcagag agcaggccga gaatatcatc 5640
cacctgttta ccctgaccaa tctgggagcc cctagagcct tcaagtactt tgacaccacc 5700
atcgaccgga aggtgtacag aagcaccaaa gaggtgctgg acgccaccct gatccaccag 5760
agcatcaccg gcctgtacga gacacggatc gacctgtctc agctgggagg tgacagcggc 5820
gggagcggcg ggagcggggg gagcactaat ctgagcgaca tcattgagaa ggagactggg 5880
aaacagctgg tcattcagga gtccatcctg atgctgcctg aggaggtgga ggaagtgatc 5940
ggcaacaagc cagagtctga catcctggtg cacaccgcct acgacgagtc cacagatgag 6000
aatgtgatgc tgctgacctc tgacgccccc gagtataagc cttgggccct ggtcatccag 6060
gattctaacg gcgagaataa gatcaagatg ctgagcggag gatccggagg atctggaggc 6120
agcaccaacc tgtctgacat catcgagaag gagacaggca agcagctggt catccaggag 6180
agcatcctga tgctgcccga agaagtcgaa gaagtgatcg gaaacaagcc tgagagcgat 6240
atcctggtcc ataccgccta cgacgagagt accgacgaaa atgtgatgct gctgacatcc 6300
gacgccccag agtataagcc ctgggctctg gtcatccagg attccaacgg agagaacaaa 6360
atcaaaatgc tgtctggcgg ctcaaaaaga accgccgacg gcagcgaatt cgagcccaag 6420
aagaagagga aagtctaacc ggtcatcatc accatcacca ttgagtttaa acccgctgat 6480
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 6540
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 6600
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 6660
gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atggcttctg 6720
aggcggaaag aaccagctgg ggctcgatac cgtcgacctc tagctagagc ttggcgtaat 6780
catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatac 6840
gagccggaag cataaagtgt aaagcctagg atgcctaatg agtgagctaa ctcacattaa 6900
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 6960
gaatcggcca acgcgcggga agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc 7020
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 7080
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 7140
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 7200
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 7260
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 7320
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 7380
atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 7440
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 7500
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 7560
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 7620
ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 7680
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 7740
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 7800
ggtctgacac tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 7860
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 7920
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 7980
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 8040
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 8100
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 8160
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 8220
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 8280
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 8340
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 8400
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 8460
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 8520
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 8580
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 8640
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 8700
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 8760
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 8820
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 8880
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 8940
tcgacggatc gggagatcga tctcccgatc ccctagggtc gactctcagt acaatctgct 9000
ctgatgccgc atagttaagc cagtatctgc tccctgcttg tgtgttggag gtcgctgagt 9060
agtgcgcgag caaaatttaa gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga 9120
atctgcttag ggttaggcgt tttgcgctgc ttcgcgatgt acgggccaga tatacgcgtt 9180
gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 9240
catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 9300
acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 9360
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 9420
aagtgtatc 9429
<210>13
<211>9429
<212>DNA
<213>Artificial Sequence
<220>
<223>4M+D128K+P199A+P200K-BE4max
<400>13
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaaacggaca 420
gccgacggaa gcgagttcga gtcaccaaag aagaagcgga aagtcatgaa gcctcacttc 480
agaaacacag tggagcgaat gtatcgagac acattctcct acaactttta taatgcaccc 540
atcctttctc gtcggaatac cgtctggctg tgctacgaag tgaaaacaaa gggtccctca 600
aggccccctt tggacgcaaa gatctttcga ggccaggtgt attccgaact taagtaccac 660
ccagagatga gattcttcca ctggttcagc aagtggagga agctgcatcg tgaccaggag 720
tatgaggtca cctggtacat atccttgagc ccctgcacaa agtgtacaag ggatatggcc 780
acgttcctgg ccgaggaccc gaaggttacc ctgaccatct ttgttgcccg cctcgcctac 840
ttccttaagc cagattacca ggaggcgctt cgcagcctgt gtcagaaaag agacggtccg 900
cgtgccacca tgaagatcat gaattatgac gaatttcagc actgttggag caagttcgtg 960
tacagccaaa gagagctatt tgagccttgg aataatctgc ctaaatatta tatattactg 1020
cacatcatgc tgggggagat tctcagacac tcgatggatg ccaagacatt cactttcaac 1080
tttaacaatg aaccttgggt cagaggacgg catgagactt acctgtgtta tgaggtggag 1140
cgcatgcaca atgacacctg ggtcctgctg aaccagcgca ggggctttct atgcaaccag 1200
gctccacata aacacggttt ccttgaaggc cgccatgcag agctgtgctt cctggacgtg 1260
attccctttt ggaagctgga cctggaccag gactacaggg ttacctgctt cacctcctgg 1320
agcccctgct tcagctgtgc ccaggaaatg gctaaattca tttcaaaaaa caaacacgtg 1380
agcctgtgca tcttcactgc ccgcatctat gatgatcaag gaagatgtca ggaggggctg 1440
cgcaccctgg ccgaggctgg ggccaaaatt tcaataatga catacagtga atttaagcac 1500
tgctgggaca cctttgtgga ccaccaggga tgtcccttcc agccctggga tggactagat 1560
gagcacagcc aagacctgag tgggaggctg cgggccattc tccagaatca ggaaaactct 1620
ggaggatcta gcggaggatc ctctggcagc gagacaccag gaacaagcga gtcagcaaca 1680
ccagagagca gtggcggcag cagcggcggc agcgacaaga agtacagcat cggcctggcc 1740
atcggcacca actctgtggg ctgggccgtg atcaccgacg agtacaaggt gcccagcaag 1800
aaattcaagg tgctgggcaa caccgaccgg cacagcatca agaagaacct gatcggagcc 1860
ctgctgttcg acagcggcga aacagccgag gccacccggc tgaagagaac cgccagaaga 1920
agatacacca gacggaagaa ccggatctgc tatctgcaag agatcttcag caacgagatg 1980
gccaaggtgg acgacagctt cttccacaga ctggaagagt ccttcctggt ggaagaggat 2040
aagaagcacg agcggcaccc catcttcggc aacatcgtgg acgaggtggc ctaccacgag 2100
aagtacccca ccatctacca cctgagaaag aaactggtgg acagcaccga caaggccgac 2160
ctgcggctga tctatctggc cctggcccac atgatcaagt tccggggcca cttcctgatc 2220
gagggcgacc tgaaccccga caacagcgac gtggacaagc tgttcatcca gctggtgcag 2280
acctacaacc agctgttcga ggaaaacccc atcaacgcca gcggcgtgga cgccaaggcc 2340
atcctgtctg ccagactgag caagagcaga cggctggaaa atctgatcgc ccagctgccc 2400
ggcgagaaga agaatggcct gttcggaaac ctgattgccc tgagcctggg cctgaccccc 2460
aacttcaaga gcaacttcga cctggccgag gatgccaaac tgcagctgag caaggacacc 2520
tacgacgacg acctggacaa cctgctggcc cagatcggcg accagtacgc cgacctgttt 2580
ctggccgcca agaacctgtc cgacgccatc ctgctgagcg acatcctgag agtgaacacc 2640
gagatcacca aggcccccct gagcgcctct atgatcaaga gatacgacga gcaccaccag 2700
gacctgaccc tgctgaaagc tctcgtgcgg cagcagctgc ctgagaagta caaagagatt 2760
ttcttcgacc agagcaagaa cggctacgcc ggctacattg acggcggagc cagccaggaa 2820
gagttctaca agttcatcaa gcccatcctg gaaaagatgg acggcaccga ggaactgctc 2880
gtgaagctga acagagagga cctgctgcgg aagcagcgga ccttcgacaa cggcagcatc 2940
ccccaccaga tccacctggg agagctgcac gccattctgc ggcggcagga agatttttac 3000
ccattcctga aggacaaccg ggaaaagatc gagaagatcc tgaccttccg catcccctac 3060
tacgtgggcc ctctggccag gggaaacagc agattcgcct ggatgaccag aaagagcgag 3120
gaaaccatca ccccctggaa cttcgaggaa gtggtggaca agggcgcttc cgcccagagc 3180
ttcatcgagc ggatgaccaa cttcgataag aacctgccca acgagaaggt gctgcccaag 3240
cacagcctgc tgtacgagta cttcaccgtg tataacgagc tgaccaaagt gaaatacgtg 3300
accgagggaa tgagaaagcc cgccttcctg agcggcgagc agaaaaaggc catcgtggac 3360
ctgctgttca agaccaaccg gaaagtgacc gtgaagcagc tgaaagagga ctacttcaag 3420
aaaatcgagt gcttcgactc cgtggaaatc tccggcgtgg aagatcggtt caacgcctcc 3480
ctgggcacat accacgatct gctgaaaatt atcaaggaca aggacttcct ggacaatgag 3540
gaaaacgagg acattctgga agatatcgtg ctgaccctga cactgtttga ggacagagag 3600
atgatcgagg aacggctgaa aacctatgcc cacctgttcg acgacaaagt gatgaagcag 3660
ctgaagcggc ggagatacac cggctggggc aggctgagcc ggaagctgat caacggcatc 3720
cgggacaagc agtccggcaa gacaatcctg gatttcctga agtccgacgg cttcgccaac 3780
agaaacttca tgcagctgat ccacgacgac agcctgacct ttaaagagga catccagaaa 3840
gcccaggtgt ccggccaggg cgatagcctg cacgagcaca ttgccaatct ggccggcagc 3900
cccgccatta agaagggcat cctgcagaca gtgaaggtgg tggacgagct cgtgaaagtg 3960
atgggccggc acaagcccga gaacatcgtg atcgaaatgg ccagagagaa ccagaccacc 4020
cagaagggac agaagaacag ccgcgagaga atgaagcgga tcgaagaggg catcaaagag 4080
ctgggcagcc agatcctgaa agaacacccc gtggaaaaca cccagctgca gaacgagaag 4140
ctgtacctgt actacctgca gaatgggcgg gatatgtacg tggaccagga actggacatc 4200
aaccggctgt ccgactacga tgtggaccat atcgtgcctc agagctttct gaaggacgac 4260
tccatcgaca acaaggtgct gaccagaagc gacaagaacc ggggcaagag cgacaacgtg 4320
ccctccgaag aggtcgtgaa gaagatgaag aactactggc ggcagctgct gaacgccaag 4380
ctgattaccc agagaaagtt cgacaatctg accaaggccg agagaggcgg cctgagcgaa 4440
ctggataagg ccggcttcat caagagacag ctggtggaaa cccggcagat tacaaagcac 4500
gtggcacaga tcctggactc ccggatgaac actaagtacg acgagaatga caagctgatc 4560
cgggaagtga aagtgatcac cctgaagtcc aagctggtgt ccgatttccg gaaggatttc 4620
cagttttaca aagtgcgcga gatcaacaac taccaccacg cccacgacgc ctacctaaac 4680
gccgtcgtgg gaaccgcact gatcaaaaag taccctaagc tggaaagcga gttcgtgtac 4740
ggcgactaca aggtgtacga cgtgcggaag atgatcgcca agagcgagca ggaaatcggc 4800
aaggctaccg ccaagtactt cttctacagc aacatcatga actttttcaa gaccgagatt 4860
accctggcca acggcgagat ccggaagcgg cctctgatcg agacaaacgg cgaaaccggg 4920
gagatcgtgt gggataaggg ccgggatttt gccaccgtgc ggaaagtgct gagcatgccc 4980
caagtgaata tcgtgaaaaa gaccgaggtg cagacaggcg gcttcagcaa agagtctatc 5040
agacccaaga ggaacagcga taagctgatc gccagaaaga aggactggga ccctaagaag 5100
tacggcggct tcgtgagccc caccgtggcc tattctgtgc tggtggtggc caaagtggaa 5160
aagggcaagt ccaagaaact gaagagtgtg aaagagctgc tggggatcac catcatggaa 5220
agaagcagct tcgagaagaa tcccatcgac tttctggaag ccaagggcta caaagaagtg 5280
aaaaaggacc tgatcatcaa gctgcctaag tactccctgt tcgagctgga aaacggccgg 5340
aagagaatgc tggcctctgc cagattcctg cagaagggaa acgaactggc cctgccctcc 5400
aaatatgtga acttcctgta cctggccagc cactatgaga agctgaaggg ctcccccgag 5460
gataatgagc agaaacagct gtttgtggaa cagcacaagc actacctgga cgagatcatc 5520
gagcagatca gcgagttctc caagagagtg atcctggccg acgctaatct ggacaaagtg 5580
ctgtccgcct acaacaagca ccgggataag cccatcagag agcaggccga gaatatcatc 5640
cacctgttta ccctgaccaa tctgggagcc cctagagcct tcaagtactt tgacaccacc 5700
atcgaccgga aggtgtacag aagcaccaaa gaggtgctgg acgccaccct gatccaccag 5760
agcatcaccg gcctgtacga gacacggatc gacctgtctcagctgggagg tgacagcggc 5820
gggagcggcg ggagcggggg gagcactaat ctgagcgaca tcattgagaa ggagactggg 5880
aaacagctgg tcattcagga gtccatcctg atgctgcctg aggaggtgga ggaagtgatc 5940
ggcaacaagc cagagtctga catcctggtg cacaccgcct acgacgagtc cacagatgag 6000
aatgtgatgc tgctgacctc tgacgccccc gagtataagc cttgggccct ggtcatccag 6060
gattctaacg gcgagaataa gatcaagatg ctgagcggag gatccggagg atctggaggc 6120
agcaccaacc tgtctgacat catcgagaag gagacaggca agcagctggt catccaggag 6180
agcatcctga tgctgcccga agaagtcgaa gaagtgatcg gaaacaagcc tgagagcgat 6240
atcctggtcc ataccgccta cgacgagagt accgacgaaa atgtgatgct gctgacatcc 6300
gacgccccag agtataagcc ctgggctctg gtcatccagg attccaacgg agagaacaaa 6360
atcaaaatgc tgtctggcgg ctcaaaaaga accgccgacg gcagcgaatt cgagcccaag 6420
aagaagagga aagtctaacc ggtcatcatc accatcacca ttgagtttaa acccgctgat 6480
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 6540
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 6600
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 6660
gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atggcttctg 6720
aggcggaaag aaccagctgg ggctcgatac cgtcgacctc tagctagagc ttggcgtaat 6780
catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatac 6840
gagccggaag cataaagtgt aaagcctagg atgcctaatg agtgagctaa ctcacattaa 6900
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 6960
gaatcggcca acgcgcggga agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc 7020
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 7080
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 7140
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 7200
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 7260
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 7320
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 7380
atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 7440
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 7500
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 7560
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 7620
ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 7680
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 7740
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 7800
ggtctgacac tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 7860
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 7920
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 7980
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 8040
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 8100
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 8160
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 8220
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 8280
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 8340
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 8400
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 8460
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 8520
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 8580
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 8640
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 8700
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 8760
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 8820
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 8880
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 8940
tcgacggatc gggagatcga tctcccgatc ccctagggtc gactctcagt acaatctgct 9000
ctgatgccgc atagttaagc cagtatctgc tccctgcttg tgtgttggag gtcgctgagt 9060
agtgcgcgag caaaatttaa gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga 9120
atctgcttag ggttaggcgt tttgcgctgc ttcgcgatgt acgggccaga tatacgcgtt 9180
gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 9240
catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 9300
acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 9360
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 9420
aagtgtatc 9429
<210>14
<211>9429
<212>DNA
<213>Artificial Sequence
<220>
<223>A3G(OP)+P199A+P200K-BE4max
<400>14
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaaacggaca 420
gccgacggaa gcgagttcga gtcaccaaag aagaagcgga aagtcatgaa gccccacttt 480
cggaacaccg tggagcggat gtacagagat accttcagct acaacttcta taatagacct 540
atcctgtccc ggagaaatac cgtgtggctg tgctatgagg tgaagacaaa gggcccatct 600
cggccccctc tggatgccaa gatctttaga ggccaggtgt acagcgagct gaagtatcac 660
cctgagatga ggttctttca ctggttctcc aagtggagga agctgcaccg cgaccaggag 720
tacgaggtga cctggtatat cagctggtcc ccctgcacca agtgtacacg cgatatggcc 780
acatttctgg ccgaggaccc taaggtgacc ctgacaatct ttgtggccag gctgtactat 840
ttccgggacc cagattacca ggaggccctg cgctctctgt gccagaagcg ggatggcccc 900
agagccacca tgaagatcat gaactacgac gagtttcagc actgttggag caagttcgtg 960
tattcccagc gggagctgtt cgagccttgg aacaatctgc caaagtacta tatcctgctg 1020
cacatcatgc tgggcgagat cctgagacac agcatggatg ccaagacctt caccttcaac 1080
ttcaacaatg agccatgggt gcggggcaga cacgagacct acctgtgcta tgaggtggag 1140
cggatgcaca acgacacatg ggtgctgctg aatcagaggc gcggctttct gtgcaatcag 1200
gcaccacaca agcacggctt cctggagggc aggcacgcag agctgtgctt cctggatgtg 1260
atccctttct ggaagctgga cctggatcag gactaccgcg tgacctgttt tacatcttgg 1320
agcccatgct tctcctgtgc ccaggagatg gccaagttta tctccaagaa taagcacgtg 1380
tctctgtgca tcttcaccgc caggatctac gacgatcagg gcaggtgtca ggagggactg 1440
cgcacactgg cagaggcagg agccaagatc tctatcatga cctatagcga gtttaagcac 1500
tgctgggata cattcgtgga ccaccagggc tgtccattcc agccctggga tggcctggac 1560
gagcactccc aggacctgtc tggcaggctg agggccatcc tgcagaacca ggagaattct 1620
ggaggatcta gcggaggatc ctctggcagc gagacaccag gaacaagcga gtcagcaaca 1680
ccagagagca gtggcggcag cagcggcggc agcgacaaga agtacagcat cggcctggcc 1740
atcggcacca actctgtggg ctgggccgtg atcaccgacg agtacaaggt gcccagcaag 1800
aaattcaagg tgctgggcaa caccgaccgg cacagcatca agaagaacct gatcggagcc 1860
ctgctgttcg acagcggcga aacagccgag gccacccggc tgaagagaac cgccagaaga 1920
agatacacca gacggaagaa ccggatctgc tatctgcaag agatcttcag caacgagatg 1980
gccaaggtgg acgacagctt cttccacaga ctggaagagt ccttcctggt ggaagaggat 2040
aagaagcacg agcggcaccc catcttcggc aacatcgtgg acgaggtggc ctaccacgag 2100
aagtacccca ccatctacca cctgagaaag aaactggtgg acagcaccga caaggccgac 2160
ctgcggctga tctatctggc cctggcccac atgatcaagt tccggggcca cttcctgatc 2220
gagggcgacc tgaaccccga caacagcgac gtggacaagc tgttcatcca gctggtgcag 2280
acctacaacc agctgttcga ggaaaacccc atcaacgcca gcggcgtgga cgccaaggcc 2340
atcctgtctg ccagactgag caagagcaga cggctggaaa atctgatcgc ccagctgccc 2400
ggcgagaaga agaatggcct gttcggaaac ctgattgccc tgagcctggg cctgaccccc 2460
aacttcaaga gcaacttcga cctggccgag gatgccaaac tgcagctgag caaggacacc 2520
tacgacgacg acctggacaa cctgctggcc cagatcggcg accagtacgc cgacctgttt 2580
ctggccgcca agaacctgtc cgacgccatc ctgctgagcg acatcctgag agtgaacacc 2640
gagatcacca aggcccccct gagcgcctct atgatcaaga gatacgacga gcaccaccag 2700
gacctgaccc tgctgaaagc tctcgtgcgg cagcagctgc ctgagaagta caaagagatt 2760
ttcttcgacc agagcaagaa cggctacgcc ggctacattg acggcggagc cagccaggaa 2820
gagttctaca agttcatcaa gcccatcctg gaaaagatgg acggcaccga ggaactgctc 2880
gtgaagctga acagagagga cctgctgcgg aagcagcgga ccttcgacaa cggcagcatc 2940
ccccaccaga tccacctggg agagctgcac gccattctgc ggcggcagga agatttttac 3000
ccattcctga aggacaaccg ggaaaagatc gagaagatcc tgaccttccg catcccctac 3060
tacgtgggcc ctctggccag gggaaacagc agattcgcct ggatgaccag aaagagcgag 3120
gaaaccatca ccccctggaa cttcgaggaa gtggtggaca agggcgcttc cgcccagagc 3180
ttcatcgagc ggatgaccaa cttcgataag aacctgccca acgagaaggt gctgcccaag 3240
cacagcctgc tgtacgagta cttcaccgtg tataacgagc tgaccaaagt gaaatacgtg 3300
accgagggaa tgagaaagcc cgccttcctg agcggcgagc agaaaaaggc catcgtggac 3360
ctgctgttca agaccaaccg gaaagtgacc gtgaagcagc tgaaagagga ctacttcaag 3420
aaaatcgagt gcttcgactc cgtggaaatc tccggcgtgg aagatcggtt caacgcctcc 3480
ctgggcacat accacgatct gctgaaaatt atcaaggaca aggacttcct ggacaatgag 3540
gaaaacgagg acattctgga agatatcgtg ctgaccctga cactgtttga ggacagagag 3600
atgatcgagg aacggctgaa aacctatgcc cacctgttcg acgacaaagt gatgaagcag 3660
ctgaagcggc ggagatacac cggctggggc aggctgagcc ggaagctgat caacggcatc 3720
cgggacaagc agtccggcaa gacaatcctg gatttcctga agtccgacgg cttcgccaac 3780
agaaacttca tgcagctgat ccacgacgac agcctgacct ttaaagagga catccagaaa 3840
gcccaggtgt ccggccaggg cgatagcctg cacgagcaca ttgccaatct ggccggcagc 3900
cccgccatta agaagggcat cctgcagaca gtgaaggtgg tggacgagct cgtgaaagtg 3960
atgggccggc acaagcccga gaacatcgtg atcgaaatgg ccagagagaa ccagaccacc 4020
cagaagggac agaagaacag ccgcgagaga atgaagcgga tcgaagaggg catcaaagag 4080
ctgggcagcc agatcctgaa agaacacccc gtggaaaaca cccagctgca gaacgagaag 4140
ctgtacctgt actacctgca gaatgggcgg gatatgtacg tggaccagga actggacatc 4200
aaccggctgt ccgactacga tgtggaccat atcgtgcctc agagctttct gaaggacgac 4260
tccatcgaca acaaggtgct gaccagaagc gacaagaacc ggggcaagag cgacaacgtg 4320
ccctccgaag aggtcgtgaa gaagatgaag aactactggc ggcagctgct gaacgccaag 4380
ctgattaccc agagaaagtt cgacaatctg accaaggccg agagaggcgg cctgagcgaa 4440
ctggataagg ccggcttcat caagagacag ctggtggaaa cccggcagat tacaaagcac 4500
gtggcacaga tcctggactc ccggatgaac actaagtacg acgagaatga caagctgatc 4560
cgggaagtga aagtgatcac cctgaagtcc aagctggtgt ccgatttccg gaaggatttc 4620
cagttttaca aagtgcgcga gatcaacaac taccaccacg cccacgacgc ctacctaaac 4680
gccgtcgtgg gaaccgcact gatcaaaaag taccctaagc tggaaagcga gttcgtgtac 4740
ggcgactaca aggtgtacga cgtgcggaag atgatcgcca agagcgagca ggaaatcggc 4800
aaggctaccg ccaagtactt cttctacagc aacatcatga actttttcaa gaccgagatt 4860
accctggcca acggcgagat ccggaagcgg cctctgatcg agacaaacgg cgaaaccggg 4920
gagatcgtgt gggataaggg ccgggatttt gccaccgtgc ggaaagtgct gagcatgccc 4980
caagtgaata tcgtgaaaaa gaccgaggtg cagacaggcg gcttcagcaa agagtctatc 5040
agacccaaga ggaacagcga taagctgatc gccagaaaga aggactggga ccctaagaag 5100
tacggcggct tcgtgagccc caccgtggcc tattctgtgc tggtggtggc caaagtggaa 5160
aagggcaagt ccaagaaact gaagagtgtg aaagagctgc tggggatcac catcatggaa 5220
agaagcagct tcgagaagaa tcccatcgac tttctggaag ccaagggcta caaagaagtg 5280
aaaaaggacc tgatcatcaa gctgcctaag tactccctgt tcgagctgga aaacggccgg 5340
aagagaatgc tggcctctgc cagattcctg cagaagggaa acgaactggc cctgccctcc 5400
aaatatgtga acttcctgta cctggccagc cactatgaga agctgaaggg ctcccccgag 5460
gataatgagc agaaacagct gtttgtggaa cagcacaagc actacctgga cgagatcatc 5520
gagcagatca gcgagttctc caagagagtg atcctggccg acgctaatct ggacaaagtg 5580
ctgtccgcct acaacaagca ccgggataag cccatcagag agcaggccga gaatatcatc 5640
cacctgttta ccctgaccaa tctgggagcc cctagagcct tcaagtactt tgacaccacc 5700
atcgaccgga aggtgtacag aagcaccaaa gaggtgctgg acgccaccct gatccaccag 5760
agcatcaccg gcctgtacga gacacggatc gacctgtctc agctgggagg tgacagcggc 5820
gggagcggcg ggagcggggg gagcactaat ctgagcgaca tcattgagaa ggagactggg 5880
aaacagctgg tcattcagga gtccatcctg atgctgcctg aggaggtgga ggaagtgatc 5940
ggcaacaagc cagagtctga catcctggtg cacaccgcct acgacgagtc cacagatgag 6000
aatgtgatgc tgctgacctc tgacgccccc gagtataagc cttgggccct ggtcatccag 6060
gattctaacg gcgagaataa gatcaagatg ctgagcggag gatccggagg atctggaggc 6120
agcaccaacc tgtctgacat catcgagaag gagacaggca agcagctggt catccaggag 6180
agcatcctga tgctgcccga agaagtcgaa gaagtgatcg gaaacaagcc tgagagcgat 6240
atcctggtcc ataccgccta cgacgagagt accgacgaaa atgtgatgct gctgacatcc 6300
gacgccccag agtataagcc ctgggctctg gtcatccagg attccaacgg agagaacaaa 6360
atcaaaatgc tgtctggcgg ctcaaaaaga accgccgacg gcagcgaatt cgagcccaag 6420
aagaagagga aagtctaacc ggtcatcatc accatcacca ttgagtttaa acccgctgat 6480
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 6540
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 6600
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 6660
gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atggcttctg 6720
aggcggaaag aaccagctgg ggctcgatac cgtcgacctc tagctagagc ttggcgtaat 6780
catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatac 6840
gagccggaag cataaagtgt aaagcctagg atgcctaatg agtgagctaa ctcacattaa 6900
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 6960
gaatcggcca acgcgcggga agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc 7020
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 7080
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 7140
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 7200
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 7260
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 7320
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 7380
atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 7440
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 7500
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 7560
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 7620
ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 7680
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 7740
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 7800
ggtctgacac tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 7860
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 7920
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 7980
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 8040
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 8100
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 8160
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 8220
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 8280
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 8340
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 8400
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 8460
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 8520
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 8580
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 8640
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 8700
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 8760
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 8820
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 8880
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 8940
tcgacggatc gggagatcga tctcccgatc ccctagggtc gactctcagt acaatctgct 9000
ctgatgccgc atagttaagc cagtatctgc tccctgcttg tgtgttggag gtcgctgagt 9060
agtgcgcgag caaaatttaa gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga 9120
atctgcttag ggttaggcgt tttgcgctgc ttcgcgatgt acgggccaga tatacgcgtt 9180
gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 9240
catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 9300
acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 9360
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 9420
aagtgtatc 9429
<210>15
<211>24
<212>DNA
<213>Artificial Sequence
<220>
<223> Polynucleotide sequences
<400>15
accgggccca gactgagcac gtga 24
<210>16
<211>24
<212>DNA
<213>Artificial Sequence
<220>
<223> Polynucleotide sequences
<400>16
aaactcacgt gctcagtctg ggcc 24
<210>17
<211>24
<212>DNA
<213>Artificial Sequence
<220>
<223> Polynucleotide sequences
<400>17
accgtgcccc tccctccctg gccc 24
<210>18
<211>24
<212>DNA
<213>Artificial Sequence
<220>
<223> Polynucleotide sequences
<400>18
aaacgggcca gggagggagg ggca 24
<210>19
<211>24
<212>DNA
<213>Artificial Sequence
<220>
<223> Polynucleotide sequences
<400>19
accggaacac aaagcataga ctgc 24
<210>20
<211>24
<212>DNA
<213>Artificial Sequence
<220>
<223> Polynucleotide sequences
<400>20
aaacgcagtc tatgctttgt gttc 24
<210>21
<211>27
<212>DNA
<213>Artificial Sequence
<220>
<223> primer
<400>21
gcccatgcaa ttagtctatt tctgctg 27
<210>22
<211>22
<212>DNA
<213>Artificial Sequence
<220>
<223> primer
<400>22
gcaggagctg cacatactag cc 22
<210>23
<211>22
<212>DNA
<213>Artificial Sequence
<220>
<223> primer
<400>23
ggggccccta accctatgta gc 22
<210>24
<211>20
<212>DNA
<213>Artificial Sequence
<220>
<223> primer
<400>24
<210>25
<211>22
<212>DNA
<213>Artificial Sequence
<220>
<223> primer
<400>25
gttactgcag cccaagcctc ag 22
<210>26
<211>23
<212>DNA
<213>Artificial Sequence
<220>
<223> primer
<400>26
gtccagcccc atctgtcaaa ctg 23
<210>27
<211>583
<212>DNA
<213> APOBEC3G fragment
<400>27
ggagattctc agacactcga tggatccaaa gacattcact ttcaacttta acaatgaacc 60
ttgggtcaga ggacggcatg agacttacct gtgttatgag gtggagcgca tgcacaatga 120
cacctgggtc ctgctgaacc agcgcagggg ctttctatgc aaccaggctc cacataaaca 180
cggtttcctt gaaggccgcc atgcagagct gtgcttcctg gacgtgattc ccttttggaa 240
gctggacctg gaccaggact acagggttac ctgcttcacc tcctggagcc cctgcttcag 300
ctgtgcccag gaaatggcta aattcatttc aaaaaacaaa cacgtgagcc tgtgcatctt 360
cactgcccgc atctatgatg atcaaggaag atgtcaggag gggctgcgca ccctggccga 420
ggctggggcc aaaatttcaa taatgacata cagtgaattt aagcactgct gggacacctt 480
tgtggaccac cagggatgtc ccttccagcc ctgggatgga ctagatgagc acagccaaga 540
cctgagtggg aggctgcggg ccattctcca gaatcaggaa aac 583
<210>28
<211>564
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>28
atggatccaa agacattcac tttcaacttt aacaatgaac cttgggtcag aggacggcat 60
gagacttacc tgtgttatga ggtggagcgc atgcacaatg acacctgggt cctgctgaac 120
cagcgcaggg gctttctatg caaccaggct ccacataaac acggtttcct tgaaggccgc 180
catgcagagc tgtgcttcct ggacgtgatt cccttttgga agctggacct ggaccaggac 240
tacagggtta cctgcttcac ctcctggagc ccctgcttca gctgtgccca ggaaatggct 300
aaattcattt caaaaaacaa acacgtgagc ctgtgcatct tcactgcccg catctatgat 360
gatcaaggaa gatgtcagga ggggctgcgc accctggccg aggctggggc caaaatttca 420
ataatgacat acagtgaatt taagcactgc tgggacacct ttgtggacca ccagggatgt 480
cccttccagc cctgggatgg actagatgag cacagccaag acctgagtgg gaggctgcgg 540
gccattctcc agaatcagga aaac 564
<210>29
<211>1152
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>29
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct tgacccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatccaccc 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210>30
<211>1152
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>30
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct taagccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatccaccc 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210>31
<211>1152
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>31
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct tgacccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatgccccc 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210>32
<211>1152
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>32
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct tgacccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggattggccc 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210>33
<211>1152
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>33
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct tgacccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatccagcc 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210>34
<211>1152
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>34
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct tgacccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatccaaag 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210>35
<211>1152
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>35
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct tgacccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatccaccc 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtaaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210>36
<211>1152
<212>DNA
<213>Artificial Sequence
<220>
<223> APOBEC3G fragment
<400>36
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct taagccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatgccaag 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210>37
<211>4101
<212>DNA
<213>Artificial Sequence
<220>
<223> SpCas9-D10A nickase fragment
<400>37
gataaaaagt attctattgg tttagccatc ggcactaatt ccgttggatg ggctgtcata 60
accgatgaat acaaagtacc ttcaaagaaa tttaaggtgt tggggaacac agaccgtcat 120
tcgattaaaa agaatcttat cggtgccctc ctattcgata gtggcgaaac ggcagaggcg 180
actcgcctga aacgaaccgc tcggagaagg tatacacgtc gcaagaaccg aatatgttac 240
ttacaagaaa tttttagcaa tgagatggcc aaagttgacg attctttctt tcaccgtttg 300
gaagagtcct tccttgtcga agaggacaag aaacatgaac ggcaccccat ctttggaaac 360
atagtagatg aggtggcata tcatgaaaag tacccaacga tttatcacct cagaaaaaag 420
ctagttgact caactgataa agcggacctg aggttaatct acttggctct tgcccatatg 480
ataaagttcc gtgggcactt tctcattgag ggtgatctaa atccggacaa ctcggatgtc 540
gacaaactgt tcatccagtt agtacaaacc tataatcagt tgtttgaaga gaaccctata 600
aatgcaagtg gcgtggatgc gaaggctatt cttagcgccc gcctctctaa atcccgacgg 660
ctagaaaacc tgatcgcaca attacccgga gagaagaaaa atgggttgtt cggtaacctt 720
atagcgctct cactaggcct gacaccaaat tttaagtcga acttcgactt agctgaagat 780
gccaaattgc agcttagtaa ggacacgtac gatgacgatc tcgacaatct actggcacaa 840
attggagatc agtatgcgga cttatttttg gctgccaaaa accttagcga tgcaatcctc 900
ctatctgaca tactgagagt taatactgag attaccaagg cgccgttatc cgcttcaatg 960
atcaaaaggt acgatgaaca tcaccaagac ttgacacttc tcaaggccct agtccgtcag 1020
caactgcctg agaaatataa ggaaatattc tttgatcagt cgaaaaacgg gtacgcaggt1080
tatattgacg gcggagcgag tcaagaggaa ttctacaagt ttatcaaacc catattagag 1140
aagatggatg ggacggaaga gttgcttgta aaactcaatc gcgaagatct actgcgaaag 1200
cagcggactt tcgacaacgg tagcattcca catcaaatcc acttaggcga attgcatgct 1260
atacttagaa ggcaggagga tttttatccg ttcctcaaag acaatcgtga aaagattgag 1320
aaaatcctaa cctttcgcat accttactat gtgggacccc tggcccgagg gaactctcgg 1380
ttcgcatgga tgacaagaaa gtccgaagaa acgattactc catggaattt tgaggaagtt 1440
gtcgataaag gtgcgtcagc tcaatcgttc atcgagagga tgaccaactt tgacaagaat 1500
ttaccgaacg aaaaagtatt gcctaagcac agtttacttt acgagtattt cacagtgtac 1560
aatgaactca cgaaagttaa gtatgtcact gagggcatgc gtaaacccgc ctttctaagc 1620
ggagaacaga agaaagcaat agtagatctg ttattcaaga ccaaccgcaa agtgacagtt 1680
aagcaattga aagaggacta ctttaagaaa attgaatgct tcgattctgt cgagatctcc 1740
ggggtagaag atcgatttaa tgcgtcactt ggtacgtatc atgacctcct aaagataatt 1800
aaagataagg acttcctgga taacgaagag aatgaagata tcttagaaga tatagtgttg 1860
actcttaccc tctttgaaga tcgggaaatg attgaggaaa gactaaaaac atacgctcac 1920
ctgttcgacg ataaggttat gaaacagtta aagaggcgtc gctatacggg ctggggacga 1980
ttgtcgcgga aacttatcaa cgggataaga gacaagcaaa gtggtaaaac tattctcgat 2040
tttctaaaga gcgacggctt cgccaatagg aactttatgc agctgatcca tgatgactct 2100
ttaaccttca aagaggatat acaaaaggca caggtttccg gacaagggga ctcattgcac 2160
gaacatattg cgaatcttgc tggttcgcca gccatcaaaa agggcatact ccagacagtc 2220
aaagtagtgg atgagctagt taaggtcatg ggacgtcaca aaccggaaaa cattgtaatc 2280
gagatggcac gcgaaaatca aacgactcag aaggggcaaa aaaacagtcg agagcggatg 2340
aagagaatag aagagggtat taaagaactg ggcagccaga tcttaaagga gcatcctgtg 2400
gaaaataccc aattgcagaa cgagaaactt tacctctatt acctacaaaa tggaagggac 2460
atgtatgttg atcaggaact ggacataaac cgtttatctg attacgacgt cgatcacatt 2520
gtaccccaat cctttttgaa ggacgattca atcgacaata aagtgcttac acgctcggat 2580
aagaaccgag ggaaaagtga caatgttcca agcgaggaag tcgtaaagaa aatgaagaac 2640
tattggcggc agctcctaaa tgcgaaactg ataacgcaaa gaaagttcga taacttaact 2700
aaagctgaga ggggtggctt gtctgaactt gacaaggccg gatttattaa acgtcagctc 2760
gtggaaaccc gccaaatcac aaagcatgtt gcacagatac tagattcccg aatgaatacg 2820
aaatacgacg agaacgataa gctgattcgg gaagtcaaag taatcacttt aaagtcaaaa 2880
ttggtgtcgg acttcagaaa ggattttcaa ttctataaag ttagggagat aaataactac 2940
caccatgcgc acgacgctta tcttaatgcc gtcgtaggga ccgcactcat taagaaatac 3000
ccgaagctag aaagtgagtt tgtgtatggt gattacaaag tttatgacgt ccgtaagatg 3060
atcgcgaaaa gcgaacagga gataggcaag gctacagcca aatacttctt ttattctaac 3120
attatgaatt tctttaagac ggaaatcact ctggcaaacg gagagatacg caaacgacct 3180
ttaattgaaa ccaatgggga gacaggtgaa atcgtatggg ataagggccg ggacttcgcg 3240
acggtgagaa aagttttgtc catgccccaa gtcaacatag taaagaaaac tgaggtgcag 3300
accggagggt tttcaaagga atcgattctt ccaaaaagga atagtgataa gctcatcgct 3360
cgtaaaaagg actgggaccc gaaaaagtac ggtggcttcg atagccctac agttgcctat 3420
tctgtcctag tagtggcaaa agttgagaag ggaaaatcca agaaactgaa gtcagtcaaa 3480
gaattattgg ggataacgat tatggagcgc tcgtcttttg aaaagaaccc catcgacttc 3540
cttgaggcga aaggttacaa ggaagtaaaa aaggatctca taattaaact accaaagtat 3600
agtctgtttg agttagaaaa tggccgaaaa cggatgttgg ctagcgccgg agagcttcaa 3660
aaggggaacg aactcgcact accgtctaaa tacgtgaatt tcctgtattt agcgtcccat 3720
tacgagaagt tgaaaggttc acctgaagat aacgaacaga agcaactttt tgttgagcag 3780
cacaaacatt atctcgacga aatcatagag caaatttcgg aattcagtaa gagagtcatc 3840
ctagctgatg ccaatctgga caaagtatta agcgcataca acaagcacag ggataaaccc 3900
atacgtgagc aggcggaaaa tattatccat ttgtttactc ttaccaacct cggcgctcca 3960
gccgcattca agtattttga cacaacgata gatcgcaaac gatacacttc taccaaggag 4020
gtgctagacg cgacactgat tcaccaatcc atcacgggat tatatgaaac tcggatagat 4080
ttgtcacagc ttgggggtga c 4101
<210>38
<211>4101
<212>DNA
<213>Artificial Sequence
<220>
<223> SpCas9-D10A nickase fragment
<400>38
gacaagaagt acagcatcgg cctggccatc ggcaccaact ctgtgggctg ggccgtgatc 60
accgacgagt acaaggtgcc cagcaagaaa ttcaaggtgc tgggcaacac cgaccggcac 120
agcatcaaga agaacctgat cggagccctg ctgttcgaca gcggcgaaac agccgaggcc 180
acccggctga agagaaccgc cagaagaaga tacaccagac ggaagaaccg gatctgctat 240
ctgcaagaga tcttcagcaa cgagatggcc aaggtggacg acagcttctt ccacagactg 300
gaagagtcct tcctggtgga agaggataag aagcacgagc ggcaccccat cttcggcaac 360
atcgtggacg aggtggccta ccacgagaag taccccacca tctaccacct gagaaagaaa 420
ctggtggaca gcaccgacaa ggccgacctg cggctgatct atctggccct ggcccacatg 480
atcaagttcc ggggccactt cctgatcgag ggcgacctga accccgacaa cagcgacgtg 540
gacaagctgt tcatccagct ggtgcagacc tacaaccagc tgttcgagga aaaccccatc 600
aacgccagcg gcgtggacgc caaggccatc ctgtctgcca gactgagcaa gagcagacgg 660
ctggaaaatc tgatcgccca gctgcccggc gagaagaaga atggcctgtt cggaaacctg 720
attgccctga gcctgggcct gacccccaac ttcaagagca acttcgacct ggccgaggat 780
gccaaactgc agctgagcaa ggacacctac gacgacgacc tggacaacct gctggcccag 840
atcggcgacc agtacgccga cctgtttctg gccgccaaga acctgtccga cgccatcctg 900
ctgagcgaca tcctgagagt gaacaccgag atcaccaagg cccccctgag cgcctctatg 960
atcaagagat acgacgagca ccaccaggac ctgaccctgc tgaaagctct cgtgcggcag 1020
cagctgcctg agaagtacaa agagattttc ttcgaccaga gcaagaacgg ctacgccggc 1080
tacattgacg gcggagccag ccaggaagag ttctacaagt tcatcaagcc catcctggaa 1140
aagatggacg gcaccgagga actgctcgtg aagctgaaca gagaggacct gctgcggaag 1200
cagcggacct tcgacaacgg cagcatcccc caccagatcc acctgggaga gctgcacgcc 1260
attctgcggc ggcaggaaga tttttaccca ttcctgaagg acaaccggga aaagatcgag 1320
aagatcctga ccttccgcat cccctactac gtgggccctc tggccagggg aaacagcaga 1380
ttcgcctgga tgaccagaaa gagcgaggaa accatcaccc cctggaactt cgaggaagtg 1440
gtggacaagg gcgcttccgc ccagagcttc atcgagcgga tgaccaactt cgataagaac 1500
ctgcccaacg agaaggtgct gcccaagcac agcctgctgt acgagtactt caccgtgtat 1560
aacgagctga ccaaagtgaa atacgtgacc gagggaatga gaaagcccgc cttcctgagc 1620
ggcgagcaga aaaaggccat cgtggacctg ctgttcaaga ccaaccggaa agtgaccgtg 1680
aagcagctga aagaggacta cttcaagaaa atcgagtgct tcgactccgt ggaaatctcc 1740
ggcgtggaag atcggttcaa cgcctccctg ggcacatacc acgatctgct gaaaattatc 1800
aaggacaagg acttcctgga caatgaggaa aacgaggaca ttctggaaga tatcgtgctg 1860
accctgacac tgtttgagga cagagagatg atcgaggaac ggctgaaaac ctatgcccac 1920
ctgttcgacg acaaagtgat gaagcagctg aagcggcgga gatacaccgg ctggggcagg 1980
ctgagccgga agctgatcaa cggcatccgg gacaagcagt ccggcaagac aatcctggat 2040
ttcctgaagt ccgacggctt cgccaacaga aacttcatgc agctgatcca cgacgacagc 2100
ctgaccttta aagaggacat ccagaaagcc caggtgtccg gccagggcga tagcctgcac 2160
gagcacattg ccaatctggc cggcagcccc gccattaaga agggcatcct gcagacagtg 2220
aaggtggtgg acgagctcgt gaaagtgatg ggccggcaca agcccgagaa catcgtgatc 2280
gaaatggcca gagagaacca gaccacccag aagggacaga agaacagccg cgagagaatg 2340
aagcggatcg aagagggcat caaagagctg ggcagccaga tcctgaaaga acaccccgtg 2400
gaaaacaccc agctgcagaa cgagaagctg tacctgtact acctgcagaa tgggcgggat 2460
atgtacgtgg accaggaact ggacatcaac cggctgtccg actacgatgt ggaccatatc 2520
gtgcctcaga gctttctgaa ggacgactcc atcgacaaca aggtgctgac cagaagcgac 2580
aagaaccggg gcaagagcga caacgtgccc tccgaagagg tcgtgaagaa gatgaagaac 2640
tactggcggc agctgctgaa cgccaagctg attacccaga gaaagttcga caatctgacc 2700
aaggccgaga gaggcggcct gagcgaactg gataaggccg gcttcatcaa gagacagctg 2760
gtggaaaccc ggcagattac aaagcacgtg gcacagatcc tggactcccg gatgaacact 2820
aagtacgacg agaatgacaa gctgatccgg gaagtgaaag tgatcaccct gaagtccaag 2880
ctggtgtccg atttccggaa ggatttccag ttttacaaag tgcgcgagat caacaactac 2940
caccacgccc acgacgccta cctaaacgcc gtcgtgggaa ccgcactgat caaaaagtac 3000
cctaagctgg aaagcgagtt cgtgtacggc gactacaagg tgtacgacgt gcggaagatg 3060
atcgccaaga gcgagcagga aatcggcaag gctaccgcca agtacttctt ctacagcaac 3120
atcatgaact ttttcaagac cgagattacc ctggccaacg gcgagatccg gaagcggcct 3180
ctgatcgaga caaacggcga aaccggggag atcgtgtggg ataagggccg ggattttgcc 3240
accgtgcgga aagtgctgag catgccccaa gtgaatatcg tgaaaaagac cgaggtgcag 3300
acaggcggct tcagcaaaga gtctatcaga cccaagagga acagcgataa gctgatcgcc 3360
agaaagaagg actgggaccc taagaagtac ggcggcttcg tgagccccac cgtggcctat 3420
tctgtgctgg tggtggccaa agtggaaaag ggcaagtcca agaaactgaa gagtgtgaaa 3480
gagctgctgg ggatcaccat catggaaaga agcagcttcg agaagaatcc catcgacttt 3540
ctggaagcca agggctacaa agaagtgaaa aaggacctga tcatcaagct gcctaagtac 3600
tccctgttcg agctggaaaa cggccggaag agaatgctgg cctctgccag attcctgcag 3660
aagggaaacg aactggccct gccctccaaa tatgtgaact tcctgtacct ggccagccac 3720
tatgagaagc tgaagggctc ccccgaggat aatgagcaga aacagctgtt tgtggaacag 3780
cacaagcact acctggacga gatcatcgag cagatcagcg agttctccaa gagagtgatc 3840
ctggccgacg ctaatctgga caaagtgctg tccgcctaca acaagcaccg ggataagccc 3900
atcagagagc aggccgagaa tatcatccac ctgtttaccc tgaccaatct gggagcccct 3960
agagccttca agtactttga caccaccatc gaccggaagg tgtacagaag caccaaagag 4020
gtgctggacg ccaccctgat ccaccagagc atcaccggcc tgtacgagac acggatcgac 4080
ctgtctcagc tgggaggtga c 4101
<210>39
<211>57
<212>DNA
<213>Artificial Sequence
<220>
<223> Nuclear localization Signal fragment
<400>39
atgaaacgga cagccgacgg aagcgagttc gagtcaccaa agaagaagcg gaaagtc 57
<210>40
<211>96
<212>DNA
<213>Artificial Sequence
<220>
<223> Flexible linker peptide fragment
<400>40
tctggaggat ctagcggagg atcctctgga agcgagacac caggcacaag cgagtccgcc 60
acaccagaga gctccggcgg ctcctccgga ggatcc 96
<210>41
<211>96
<212>DNA
<213>Artificial Sequence
<220>
<223> Flexible linker peptide fragment
<400>41
tctggaggat ctagcggagg atcctctgga agcgagacac caggcacaag cgagtccgcc 60
acaccagaga gctccggcgg ctcctccgga ggatcc 96
Claims (16)
- The fusion protein is characterized by sequentially comprising an APOBEC3G fragment and an SpCas9-D10A nickase fragment from the N end to the C end, wherein the APOBEC3G fragment has cytosine deaminase activity, at least amino acid mutations in R24A, W94L, Y124A, W127L, D128K, P199A, P199W, P200A, P200K and Q322K exist in the APOBEC3G fragment, or the APOBEC3G fragment is a truncated APOBEC3G fragment deleted from start codon of APOBEC3G to 190 th position or 197 th position.
- 2. The fusion protein of claim 1, wherein the nucleotide sequence of the APOBEC3G fragment comprises:a) a nucleotide sequence shown as SEQ ID NO. 27-36; or,b) a nucleotide sequence having more than 80% sequence similarity with SEQ ID NO.27-36 and having the functions of the nucleotide sequence defined in a).
- 3. The fusion protein of claim 1, wherein the nucleotide sequence of the SpCas9-D10A nickase fragment comprises:c) a nucleotide sequence shown as SEQ ID NO. 37-38; or,d) a nucleotide sequence having more than 80% sequence similarity with SEQ ID NO.37-38 and having the function of the nucleotide sequence defined in d).
- 4. The fusion protein of claim 1, further comprising a nuclear localization signal fragment located N-terminal to the APOBEC3G fragment or C-terminal to the SpCas9-D10A nickase fragment.
- 5. The fusion protein of claim 4, wherein the nucleotide sequence of the nuclear localization signal fragment is set forth in SEQ ID No. 39.
- 6. The fusion protein of claim 1, further comprising a flexible linker peptide fragment at the N-terminus of the APOBEC3G fragment, between the APOBEC3G fragment and SpCas9-D10A nicase, or at the C-terminus of the SpCas9-D10A nicase.
- 7. The fusion protein of claim 6, wherein the nucleotide sequence of the flexibly linked peptide fragment is set forth in SEQ ID No. 40-41.
- An isolated polynucleotide of , wherein the isolated polynucleotide encodes the fusion protein of claim 1.
- kinds of constructs, characterized in that the constructs are constructed by inserting the isolated polynucleotide of claim 8 into expression vector, and the polynucleotide sequence of the constructs is shown in SEQ ID NO. 1-14.
- 10. The construct of claim 9, wherein the expression vector is of the group consisting of a pCMV expression vector, a pSV2 expression vector, and a pGL3 expression vector.
- An expression system, wherein the expression system is a host cell comprising the construct of claim 9 or wherein the isolated polynucleotide of claim 8 is integrated into the genome of the host cell.
- 12. The expression system of claim 11, wherein the host cell is selected from a mouse cell or a human cell.
- 13. The expression system of claim 11, wherein the host cell is selected from the group consisting of mouse brain neuroma cells, human embryonic kidney cells, human cervical cancer cells, human colon cancer cells, human osteosarcoma cells.
- The base editing tool, comprising the fusion protein of claim 1 and a sgRNA.
- 15. Use of the base editing tool of claim 14 in gene editing in eukaryotes.
- 16. The use of the base editing tool of claim 15 in eukaryotic gene editing, wherein the gene editing is base editing of C-to-T at positions 4-7 of the 5' end of the sgRNA in the target region.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911075141.9A CN110734900B (en) | 2019-11-06 | 2019-11-06 | Cytosine base editing tool and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911075141.9A CN110734900B (en) | 2019-11-06 | 2019-11-06 | Cytosine base editing tool and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110734900A true CN110734900A (en) | 2020-01-31 |
CN110734900B CN110734900B (en) | 2022-09-30 |
Family
ID=69272245
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911075141.9A Active CN110734900B (en) | 2019-11-06 | 2019-11-06 | Cytosine base editing tool and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110734900B (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113249362A (en) * | 2020-02-07 | 2021-08-13 | 辉大(上海)生物科技有限公司 | Modified cytosine base editor and application thereof |
CN114058607A (en) * | 2020-07-31 | 2022-02-18 | 上海科技大学 | Fusion protein for C-to-U base editing and preparation method and application thereof |
CN114561429A (en) * | 2022-03-22 | 2022-05-31 | 绍兴市妇幼保健院 | Treatment method for inhibiting HBV surface antigen based on base editing ATG initiation codon |
CN114561392A (en) * | 2022-03-22 | 2022-05-31 | 绍兴市妇幼保健院 | Method for removing HBV e antigen by closing target gene based on base editing technology |
CN114606265A (en) * | 2022-04-07 | 2022-06-10 | 吉林大学 | Mini-base editor capable of realizing single AAV (adeno-associated virus) coating |
CN116555237A (en) * | 2022-03-08 | 2023-08-08 | 中国科学院遗传与发育生物学研究所 | Cytosine deaminase and its use in base editing |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090269831A1 (en) * | 2008-02-07 | 2009-10-29 | Harris Reuben S | Modified cytosine deaminases |
CN102482639A (en) * | 2009-04-03 | 2012-05-30 | 医学研究会 | Mutants of activation-induced cytidine deaminase (aid) and methods of use |
WO2016014837A1 (en) * | 2014-07-25 | 2016-01-28 | Sangamo Biosciences, Inc. | Gene editing for hiv gene therapy |
CN108513575A (en) * | 2015-10-23 | 2018-09-07 | 哈佛大学的校长及成员们 | Nucleobase editing machine and application thereof |
-
2019
- 2019-11-06 CN CN201911075141.9A patent/CN110734900B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090269831A1 (en) * | 2008-02-07 | 2009-10-29 | Harris Reuben S | Modified cytosine deaminases |
CN102482639A (en) * | 2009-04-03 | 2012-05-30 | 医学研究会 | Mutants of activation-induced cytidine deaminase (aid) and methods of use |
WO2016014837A1 (en) * | 2014-07-25 | 2016-01-28 | Sangamo Biosciences, Inc. | Gene editing for hiv gene therapy |
US20160022737A1 (en) * | 2014-07-25 | 2016-01-28 | Sangamo Biosciences, Inc. | Gene editing for hiv gene therapy |
CN108513575A (en) * | 2015-10-23 | 2018-09-07 | 哈佛大学的校长及成员们 | Nucleobase editing machine and application thereof |
Non-Patent Citations (2)
Title |
---|
XIAO WANG ET AL.: "Efficient base editing in methylated regions with a human APOBEC3A-Cas9 fusion", 《NATURE BIOTECHNOLOGY》 * |
赵亚伟等: "碱基编辑器的开发及其在细菌基因组编辑中的应用", 《微生物学通报》 * |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113249362A (en) * | 2020-02-07 | 2021-08-13 | 辉大(上海)生物科技有限公司 | Modified cytosine base editor and application thereof |
CN113249362B (en) * | 2020-02-07 | 2023-04-14 | 辉大(上海)生物科技有限公司 | Modified cytosine base editor and application thereof |
CN114058607A (en) * | 2020-07-31 | 2022-02-18 | 上海科技大学 | Fusion protein for C-to-U base editing and preparation method and application thereof |
CN114058607B (en) * | 2020-07-31 | 2024-02-27 | 上海科技大学 | Fusion protein for editing C to U base, and preparation method and application thereof |
CN116555237A (en) * | 2022-03-08 | 2023-08-08 | 中国科学院遗传与发育生物学研究所 | Cytosine deaminase and its use in base editing |
CN114561429A (en) * | 2022-03-22 | 2022-05-31 | 绍兴市妇幼保健院 | Treatment method for inhibiting HBV surface antigen based on base editing ATG initiation codon |
CN114561392A (en) * | 2022-03-22 | 2022-05-31 | 绍兴市妇幼保健院 | Method for removing HBV e antigen by closing target gene based on base editing technology |
CN114606265A (en) * | 2022-04-07 | 2022-06-10 | 吉林大学 | Mini-base editor capable of realizing single AAV (adeno-associated virus) coating |
CN114606265B (en) * | 2022-04-07 | 2024-01-30 | 吉林大学 | Mini base editor capable of realizing single AAV virus coating |
Also Published As
Publication number | Publication date |
---|---|
CN110734900B (en) | 2022-09-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110734900B (en) | Cytosine base editing tool and application thereof | |
KR102700050B1 (en) | Production of human milk oligosaccharides in microbial hosts with engineered introgression/extrogression | |
KR102147005B1 (en) | Fad2 performance loci and corresponding target site specific binding proteins capable of inducing targeted breaks | |
KR20220141332A (en) | Measles-Vectorized COVID-19 Immunogenic Compositions and Vaccines | |
AU2020264412B2 (en) | Dna-binding protein using ppr motif, and use thereof | |
US20030167538A1 (en) | Use of the maize x112 mutant ahas 2 gene and imidazolinone herbicides for selection of transgenic monocots, maize, rice and wheat plants resistant to the imidazolinone herbicides | |
US20040013648A1 (en) | Vector system | |
KR20190120287A (en) | Genome Editing System and Method | |
KR102494564B1 (en) | Malaria vaccine | |
CN112204147A (en) | Cpf 1-based plant transcription regulatory system | |
KR20210105382A (en) | RNA encoding protein | |
CN107002095A (en) | Adeno-associated virus vector for treating lysosomal storage disease | |
CN101827938A (en) | Plants with altered root architecture, involving the RT1 gene, related constructs and methods | |
CN110305901A (en) | A kind of luciferase reporter gene carrier and its construction method and application based on the gene promoter area people TLR4 | |
KR20210005167A (en) | Use of lentivector-transduced T-RAPA cells to alleviate lysosomal storage disease | |
JP2024037797A (en) | Using infectious nucleic acid to treat cancer | |
CN112626035A (en) | New coronary pneumonia vaccine and vaccine kit | |
CN114836473B (en) | Lentiviral vector for constructing cell strain model for screening pharmaceutical activity and application | |
CN111378626B (en) | CHO cell line, construction method, recombinant protein expression system and application | |
US6730481B2 (en) | Primers-attached vector elongation (PAVE): a 5′-directed cDNA cloning strategy | |
CN113621650B (en) | Establishment and application of efficient silk fibroin heavy chain promoter secretion expression system | |
CN113005092A (en) | Preparation method and application of PD1 knockout LMP1 targeted CAR-T cell | |
JPH1175859A (en) | Apoptosis-related gene expressible virus vector system | |
RU2798786C2 (en) | Production of human dairy oligosaccharides in microbial producers with artificial import/export | |
PL244825B1 (en) | Mutant of Tritirachium album proteinase K and its zymogen, an expression plasmid, a recombinant strain of Pichia pastoris and method for preparing a mature form of proteinase K mutant |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |