CN113480660B - Chimeric terpene synthase and application thereof - Google Patents

Chimeric terpene synthase and application thereof Download PDF

Info

Publication number
CN113480660B
CN113480660B CN202110618456.4A CN202110618456A CN113480660B CN 113480660 B CN113480660 B CN 113480660B CN 202110618456 A CN202110618456 A CN 202110618456A CN 113480660 B CN113480660 B CN 113480660B
Authority
CN
China
Prior art keywords
leu
ala
ser
glu
asp
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110618456.4A
Other languages
Chinese (zh)
Other versions
CN113480660A (en
Inventor
刘天罡
陈蓉
卞光凯
叶紫玲
慕昕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan University WHU
Original Assignee
Wuhan University WHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan University WHU filed Critical Wuhan University WHU
Priority to CN202110618456.4A priority Critical patent/CN113480660B/en
Publication of CN113480660A publication Critical patent/CN113480660A/en
Application granted granted Critical
Publication of CN113480660B publication Critical patent/CN113480660B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1085Transferases (2.) transferring alkyl or aryl groups other than methyl groups (2.5)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/88Lyases (4.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P5/00Preparation of hydrocarbons or halogenated hydrocarbons
    • C12P5/002Preparation of hydrocarbons or halogenated hydrocarbons cyclic
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P7/00Preparation of oxygen-containing organic compounds
    • C12P7/02Preparation of oxygen-containing organic compounds containing a hydroxy group
    • C12P7/04Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y205/00Transferases transferring alkyl or aryl groups, other than methyl groups (2.5)
    • C12Y205/01Transferases transferring alkyl or aryl groups, other than methyl groups (2.5) transferring alkyl or aryl groups, other than methyl groups (2.5.1)
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide

Abstract

The present invention provides a chimeric terpene synthase having terpene synthase and prenyltransferase activity. The chimeric terpene synthase has an amino acid sequence shown in any one of SEQ ID NO 1-11. The chimeric terpene synthase can catalyze a substrate to produce various terpenoids after fermentation culture. The invention provides a preparation method of several terpenoids, which is simple to prepare.

Description

Chimeric terpene synthase and application thereof
Technical Field
The invention relates to the technical field of biology, in particular to a chimeric terpene synthase and application thereof.
Background
Terpenoids refer to compounds having (C) 5 H 8 ) n The general formula and derivatives thereof containing oxygen and having different saturation degrees can be regarded as a natural compound formed by connecting isoprene or isopentane in various ways. To date, about 76000 terpenoids have been found in animals, plants, and microorganisms. Terpenoids are widely used in the perfume industry, health care product industry, agricultural production field and medical industry. The application value of the terpenoid is very wide, and the discovery of more terpenoids has great practical significance.
In general, terpene Synthases (TS) synthesize the corresponding monoterpene, sesquiterpene, diterpene, sesterterpene and triterpenoid products using geranyl pyrophosphate (GPP), farnesyl diphosphate (FPP), geranylgeranyl diphosphate (GGPP) and geranylfarnesyl farnesyl diphosphate (GFPP) as substrates, whereas these different-chain-length isoprene pyrophosphate substrates are synthesized by the corresponding Prenyltransferases (PT) using isopentenyl pyrophosphate (IPP) and dimethylallyl pyrophosphate (DMAPP), i.e. synthesis of the isopentenyl substrates and production of the terpenoids are accomplished by two enzymes independently catalyzing them, but PT also contains several chimeric domains for the same time, PT is found to be useful for the discovery of new chimeric structures for both the synthases.
Therefore, there is a need to develop new chimeric terpene synthases.
Disclosure of Invention
The present invention is directed to solving one of the problems of the background art, and thus, in a first aspect of the present invention, the present invention provides a chimeric terpene synthase having terpene synthase and prenyltransferase activities.
In one or more embodiments of the invention, the chimeric terpene synthase has an amino acid sequence set forth in any one of SEQ ID NOs 1-11.
In a second aspect of the invention, there is provided a nucleic acid molecule encoding the chimeric terpene synthase of the first aspect of the invention, the nucleic acid molecule having the nucleotide sequence set forth in any one of SEQ ID NOs 14 to 24.
In a third aspect, the invention provides a construct comprising a nucleic acid molecule according to the second aspect of the invention.
In a fourth aspect of the invention, the invention provides a recombinant cell comprising a nucleic acid molecule according to the second aspect of the invention.
In a fifth aspect, the invention provides a use of a chimeric terpene synthase according to the first aspect of the invention and/or a nucleic acid molecule according to the second aspect of the invention and/or a construct according to the third aspect of the invention and/or a recombinant cell according to the fourth aspect of the invention in the preparation of a terpenoid.
In one or more embodiments of the present invention, the terpenoid comprises at least one of the following structures,
Figure GDA0003894694150000021
in a sixth aspect of the present invention, the present invention provides a method for preparing a terpenoid, comprising: culturing the recombinant cell to obtain a culture product; and isolating the terpenoid from the culture product, the terpenoid comprising at least one of the following structures,
Figure GDA0003894694150000022
in a seventh aspect of the present invention, the present invention provides a terpenoid, which has a structure represented by structural formula 1 or structural formula 2,
Figure GDA0003894694150000031
compared with the prior art, the invention has the following advantages and beneficial effects:
1. the invention provides a chimeric terpene synthase which can catalyze a substrate to produce various terpene compounds after fermentation culture.
2. The invention provides a preparation method of several terpenoids, which is simple to prepare.
Drawings
FIG. 1 is a schematic diagram of the construction process of plasmid pZY 141;
FIG. 2 shows the GCMS detection spectrum of an organic phase after fermentation culture and treatment of a mutant strain containing PTTS 010;
FIG. 3 shows GCMS detection patterns of organic phase after fermentation culture and treatment of mutant strains containing PTTS009, PTTS037 and PTTS 054;
FIG. 4 shows GCMS detection spectra of organic phase after fermentation culture and treatment of mutant containing PTTS 008;
FIG. 5 shows GCMS detection spectra of organic phase after fermentation culture and treatment of mutant containing PTTS 021;
FIG. 6 shows GCMS detection spectra of organic phase after fermentation culture and treatment of mutant containing PTTS 052;
FIG. 7 shows the GCMS detection spectrum of an organic phase after the mutant strain containing PTTS119 is fermented and cultured;
FIG. 8 shows GCMS detection patterns of organic phase after fermentation culture and treatment of mutant containing PTTS 023;
FIG. 9 shows GCMS detection patterns of organic phase after fermentation culture of PTTS 013-containing mutants;
FIG. 10 shows GCMS detection spectra of organic phase after fermentation culture and treatment of mutant strain containing PTTS 066;
FIG. 11 is a schematic representation of plasmid pZY 141;
in the figure, sesterevilene (1) represents compound 1, sesterorbicule (2) represents compound 2, β -gerylfarnesene (5) represents compound 5, gerylfarnesol (6) represents compound 6, geryllinaloool (23) represents compound 23, and gerylgeraniol (24) represents compound 24.
Detailed Description
The scheme of the invention will be explained with reference to the examples. It will be appreciated by those skilled in the art that the following examples are illustrative of the invention only and should not be taken as limiting the scope of the invention. The examples, where specific techniques or conditions are not indicated, are to be construed according to the techniques or conditions described in the literature in the art or according to the product specifications. The methods used are conventional methods known in the art unless otherwise specified, and the consumables and reagents used are commercially available unless otherwise specified. Unless otherwise defined, technical and scientific terms used herein have the same meaning as is familiar to those skilled in the art. In addition, any methods or materials similar or equivalent to those described herein can also be used in the present invention.
Example 1 construction of vectors for Gene-digging Chassis Yeast strains
(1) Plasmid pZY141
△Gal1,7,10::Trp_TACT1_tHMG1_pGAL10
The schematic construction process of plasmid pZY141 is shown in FIG. 1, and the specific process is as follows: using 30000B as a template, amplifying by using primers 1411-F/R to obtain a fragment 1141 ( Gal 1,7,10 left homology arms), amplifying by using primers 1415-F/R to obtain a fragment 1415 (bidirectional promoter Pgal10-pGAL 1), amplifying by using primers 1416-F/R to obtain a fragment 1416 (terminator tCPS 1), amplifying by using primers 1417-F/R to obtain a fragment 1417 (Gal, 7,10 right homology arms), amplifying by using pRS424 as a template by using primers 1412-F/R to obtain a fragment 1412 (tryptophan screening marker), amplifying by using S288C as a template by using primers 1413-F/R to obtain a fragment 1413 (terminator tACT 1), amplifying by using primers 1414-F/R to obtain a fragment 1414 (tHMG 1), amplifying by using pRS426 as a template by using primers 1418-F/R to obtain a fragment 1418 (vector backbone). These fragments were recombined by yeast assembly to give vector pZY141. Obtaining a fragment with a target gene after enzyme digestion linearization, wherein the nucleotide sequence is shown as SEQ ID NO: shown at 12. The primers used in the construction of plasmid pZY141 are shown in Table 1 below:
TABLE 1 primers used in the construction of plasmid pZY141
Figure GDA0003894694150000041
Figure GDA0003894694150000051
Example 2 construction of Gene-digging Chassis Yeast strains
The linearized fragment of plasmid pZY141 (nucleotide sequence shown in SEQ ID NO: 12) was transferred into the s.cerevisiae strain CEN. PK2-1D and integrated into the chromosome according to homologous recombination at the Gal1,7,10 site and the selection marker tryptophan (Trp 1), so that the background strain YZL141 was successfully constructed.
EXAMPLE 3 construction of a potential chimeric terpene synthase expression plasmid
Searching NCBI and Uniprot databases, limiting the amino acid length to 700-900, downloading all matched nucleotide/protein sequences, analyzing the downloaded sequences, screening sequences with terpene synthase conserved domains DDXXD/E and NSE/DTE, and carrying out PTTS gene synthesis to obtain the following PTTS genes: pUC57-008 (having a nucleotide sequence shown by SEQ ID NO: 14), pUC57-009 (having a nucleotide sequence shown by SEQ ID NO: 15), pUC57-010 (having a nucleotide sequence shown by SEQ ID NO: 16), pUC57-013 (having a nucleotide sequence shown by SEQ ID NO: 17), pUC57-021 (having a nucleotide sequence shown by SEQ ID NO: 18), pUC57-023 (having a nucleotide sequence shown by SEQ ID NO: 19), pUC57-037 (having a nucleotide sequence shown by SEQ ID NO: 20), pUC57-051 (having a nucleotide sequence shown by SEQ ID NO: 21), pUC57-054 (having a nucleotide sequence shown by SEQ ID NO: 22), pUC57-066 (having a nucleotide sequence shown by SEQ ID NO: 23), and pUC57-119 (having a nucleotide sequence shown by SEQ ID NO: 24). The PTTS gene was amplified using a plasmid synthesized by GeneCo as a template, and the plasmid backbone (nucleotide sequence shown in SEQ ID NO: 13) was amplified using pGB315 as a template. Then, the PTTS gene and the plasmid skeleton amplified by pGB315 are assembled by yeast assembly or Gibson respectively. In addition, in the construction process of the plasmid, the connecting peptide GSTGS reported previously is used for connecting GFPP synthetase and PTTSs, so that a fusion protein is formed, the physical and spatial distance between the PTTSs and GFPPS is shortened, and the metabolic flux of GFPP is improved to promote the functional characterization of sesterterpene synthase. In addition, the introduction of GFPPS can help release the utilization capacity of GFPP by the chimeric diterpene synthase, so that the chimeric diterpene synthase can generate sesterterpene compounds with various structures.
(1) Construction of plasmids the following Table
TABLE 2 plasmid Table
Serial number Plasmids Plasmid information
1 pRC317 pGB315 derived,URA,T CYC1 -FDS-P GAL1 -P GAL10 -PTTS008-T ADH1
2 pRC318 pGB315 derived,URA,T CYC1 -FDS-P GAL1 -P GAL10 -PTTS009-T ADH1
3 pRC319 pGB315 derived,URA,T CYC1 -FDS-P GAL1 -P GAL10 -PTTS010-T ADH1
4 pRC320 pGB315 derived,URA,T CYC1 -FDS-P GAL1 -P GAL10 -PTTS013-T ADH1
5 pRC324 pGB315 derived,URA,T CYC1 -FDS-P GAL1 -P GAL10 -PTTS021-T ADH1
6 pRC325 pGB315 derived,URA,T CYC1 -FDS-P GAL1 -P GAL10 -PTTS023-T ADH1
7 pRC333 pGB315 derived,URA,T CYC1 -FDS-P GAL1 -P GAL10 -PTTS037-T ADH1
8 pRC345 pGB315 derived,URA,T CYC1 -FDS-P GAL1 -P GAL10 -PTTS051-T ADH1
9 pRC347 pGB315 derived,URA,T CYC1 -FDS-P GAL1 -P GAL10 -PTTS054-T ADH1
10 pRC354 pGB315 derived,URA,T CYC1 -FDS-P GAL1 -P GAL10 -PTTS066-T ADH1
11 pRC377 pGB315 derived,URA,T CYC1 -FDS-P GAL1 -P GAL10 -PTTS119-T ADH1
(1) Plasmid pRC317 was constructed specifically: and (3) amplifying the gene pUC57-008 by using a primer PTTS008-F/R to obtain PTTS008, and amplifying the Vector skeleton by using a primer PTTS008V-F/R to obtain PTTS008-Vector. The obtained fragments were recombined by means of Gibson or yeast assembly to obtain plasmid pRC317. Plasmids pRC318, pRC319, pRC320, pRC324, pRC325, pRC333, pRC345, pRC347, pRC354 and pRC377 were obtained by the same construction. (2) Plasmid pRC318 specific construction Process: PTTS009 was obtained by amplifying pUC57-009 with PTTS009-F/R as a primer, and PTTS009-Vector was obtained by amplifying Vector backbone with PTTS009V-F/R as a primer. The obtained fragments were recombined by means of Gibson or yeast assembly to obtain plasmid pRC318. (3) The specific construction process of plasmid pRC 319: the gene pUC57-010 is amplified by the primer PTTS010-F/R to obtain PTTS010, and the Vector skeleton is amplified by the primer PTTS010V-F/R to obtain PTTS010-Vector. The obtained fragments were recombined by Gibson or yeast assembly to obtain plasmid pRC319. (4) Plasmid pRC320 was constructed specifically: the gene pUC57-013 was amplified with the primer PTTS013-F/R to obtain PTTS013, and the Vector backbone was amplified with the primer PTTS013V-F/R to obtain PTTS013-Vector. The obtained fragments were recombined by Gibson or yeast assembly to obtain plasmid pRC320. (5) The specific construction process of plasmid pRC 324: and (3) amplifying the gene pUC57-021 by using the primer PTTS021-F/R to obtain PTTS021, and amplifying the Vector skeleton by using the primer PTTS021V-F/R to obtain PTTS021-Vector. The obtained fragments were recombined by means of Gibson or yeast assembly to obtain plasmid pRC324. (6) Plasmid pRC325 specific construction Process: the gene pUC57-023 is amplified by the primer PTTS023-F/R to obtain PTTS023, and the Vector skeleton is amplified by the primer PTTS023V-F/R to obtain PTTS023-Vector. The obtained fragments were recombined by means of Gibson or yeast assembly to obtain plasmid pRC325. (7) The specific construction process of plasmid pRC 333: the gene pUC57-037 is amplified by the primer PTTS037-F/R to obtain PTTS037, and the Vector skeleton is amplified by the primer PTTS037V-F/R to obtain PTTS037-Vector. The obtained fragments are recombined in a Gibson or yeast assembly mode to obtain a plasmid pRC333. (7) Plasmid pRC325 specific construction Process: the gene pUC57-023 is amplified by the primer PTTS023-F/R to obtain PTTS023, and the Vector skeleton is amplified by the primer PTTS023V-F/R to obtain PTTS023-Vector. The obtained fragments were recombined by means of Gibson or yeast assembly to obtain plasmid pRC325. (8) Plasmid pRC345 specific construction Process: and amplifying the gene pUC57-051 by using the primer PTTS051-F/R to obtain PTTS051, and amplifying the Vector framework by using the primer PTTS051V-F/R to obtain PTTS051-Vector. The obtained fragments are recombined in a Gibson or yeast assembly mode to obtain a plasmid pRC345. (9) Plasmid pRC347 specific construction Process: the gene pUC57-054 is amplified by the primer PTTS054-F/R to obtain PTTS054, and the Vector skeleton is amplified by the primer PTTS054V-F/R to obtain PTTS054-Vector. The obtained fragments were recombined by means of Gibson or yeast assembly to obtain plasmid pRC347. Plasmid pRC354 specific construction procedure: the gene pUC57-066 is amplified by the primer PTTS066-F/R to obtain PTTS066, and the Vector skeleton is amplified by the primer PTTS066V-F/R to obtain PTTS066-Vector. The obtained fragments were recombined by Gibson or yeast assembly to obtain plasmid pRC 354. 11. Plasmid pRC377 concrete construction Process: the gene pUC57-119 was amplified with the primer PTTS119-F/R to obtain PTTS119, and the Vector backbone was amplified with the primer PTTS119V-F/R to obtain PTTS119-Vector. The obtained fragments were recombined by means of Gibson or yeast assembly to obtain plasmid pRC377.
TABLE 3 primer sequence Listing
Figure GDA0003894694150000071
Figure GDA0003894694150000081
Example 4 construction of potential chimeric terpene synthase expressing strains
Plasmids pRC317, pRC318, pRC319, pRC320, pRC324, pRC325, pRC333, pRC345, pRC347, pRC354 and pRC377 obtained in example 3 were transferred to the background strain YZL141 obtained in example 2, respectively, and the selection marker was uracil (URA 3), and the selection plate was Sc-URA (synthetic yeast nitrogen source YNB 6.7g/L, glucose 20g/L, uracil-deficient mixed amino acid powder 1.3g/L,2% agar powder), to obtain corresponding mutants.
The amino acid sequence of the chimeric terpene synthase expressed by the mutant strain corresponding to the plasmid pRC317 is shown as SEQ ID NO:1 is shown in the specification; the amino acid sequence of the chimeric terpene synthase expressed by the mutant strain corresponding to the plasmid pRC318 is shown in SEQ ID NO:2 is shown in the specification; the amino acid sequence of the chimeric terpene synthase expressed by the mutant strain corresponding to the plasmid pRC319 is shown as SEQ ID NO:3 is shown in the specification; the amino acid sequence of the chimeric terpene synthase expressed by the mutant strain corresponding to the plasmid pRC320 is shown in SEQ ID NO:4 is shown in the specification; the amino acid sequence of the chimeric terpene synthase expressed by the mutant strain corresponding to the plasmid pRC324 is shown as SEQ ID NO:5 is shown in the specification; the amino acid sequence of the chimeric terpene synthase expressed by the mutant strain corresponding to the plasmid pRC325 is shown as SEQ ID NO:6 is shown in the specification; the amino acid sequence of the chimeric terpene synthase expressed by the mutant strain corresponding to the plasmid pRC333 is shown in SEQ ID NO:7 is shown in the specification; the amino acid sequence of the chimeric terpene synthase expressed by the mutant strain corresponding to the plasmid pRC345 is shown as SEQ ID NO:8 is shown in the specification; the amino acid sequence of the chimeric terpene synthase expressed by the mutant strain corresponding to the plasmid pRC347 is shown as SEQ ID NO:9 is shown in the figure; the amino acid sequence of the chimeric terpene synthase expressed by the mutant strain corresponding to the plasmid pRC354 is shown as SEQ ID NO:10 is shown in the figure; the amino acid sequence of the chimeric terpene synthase expressed by the mutant strain corresponding to the plasmid pRC377 is shown as SEQ ID NO: shown at 11.
Example 5 Shake flask fermentation and product detection of potential chimeric terpene synthase expressing strains
The mutant strains obtained in example 4 were cultured by shake flask fermentation respectively, and the specific method was as follows: single clones were picked from the transformation plates into PA flasks containing 5mL of seed medium, the seed medium formulation was: YNB (6.7 g/L), yeast powder (10 g/L), glucose (20 g/L), uracil-deficient mixed amino acid powder (1.3 g/L). Carrying out transfer after shaking culture of seeds at 30 ℃ overnight, carrying out transfer according to the initial OD =0.1, placing the seeds on a shaking table at 30 ℃ to start shaking flask fermentation, wherein the formula of a fermentation medium is as follows: peptone (20 g/L), yeast powder (10 g/L), glucose (10 g/L) and galactose (10 g/L). And (3) collecting samples after fermentation is finished, and adding normal hexane: ethyl acetate (4. And finally, re-dissolving by using a small amount of normal hexane, and detecting the composition of the product by using organic phase GCMS. Semi-preparative purification and nuclear magnetic identification were performed on compounds 1, 2, 5, 6, 23, 24.
The GC-MS detection results are shown in figures 2-10, wherein figure 2 shows the GCMS detection map of the organic phase after the PTTS 010-containing mutant strain is fermented and cultured and treated; the time to peak of compound 1 was 20.3min; FIG. 3 shows GCMS detection patterns of organic phase after fermentation culture and treatment of mutant strains containing PTTS009, PTTS037 and PTTS 054; the time to peak of compound 2 was 20.2; FIG. 4 shows GCMS detection spectra of organic phase after fermentation culture and treatment of mutant containing PTTS 008; the peak-off time of compound 5 was 22.0min; FIG. 5 shows GCMS detection spectra of treated mutant strains containing PTTS021 after fermentation culture; the time to peak of compound 6 was 21.9min; FIG. 6 shows GCMS detection spectra of organic phase after fermentation culture and treatment of mutant containing PTTS 052; the time to peak of compound 6 was 21.9min; FIG. 7 shows the GCMS detection spectrum of an organic phase after the mutant strain containing PTTS119 is fermented and cultured; the peak-off time of compound 6 was 21.9min; FIG. 8 shows GCMS detection spectra of organic phase after fermentation culture and treatment of mutant containing PTTS 023; the time to peak for compound 24 was 18.1min; FIG. 9 shows GCMS detection patterns of organic phase after fermentation culture of PTTS 013-containing mutants; the time to peak of compound 6 was 15.8min; the time to peak of compound 5 was 21.8min; FIG. 10 shows GCMS detection spectra of organic phase after fermentation culture and treatment of mutant containing PTTS 066; the time to peak of compound 24 was 28.1min and the time to peak of compound 6 was 21.9mi.
From the results of the experiments, it was found that the mutant strain containing PTTS010 can synthesize sestereviscene (Compound 1), which is a novel structural compound. PTTS009/PTTS037/PTTS054 can synthesize a sesterorbicule (Compound 2), a sesterorbicule compound, which is also a new structural compound. PTTS008 enables the synthesis of the beta-geranylfarnesene (compound 5), a sesterterpene compound. PTTS021/PTTS052/PTTS119 can synthesize the diterpene geranylfanesol compound (compound 6). PTTS023 can synthesize the diterpene compound geranylgeraniol (compound 24). PTTS013 can synthesize the diterpene compound geranyllinalol (compound 23) and the sesterterpene compound beta-geranylfarnesene (compound 5). PTTS066 allows the synthesis of the diterpenoid compound geranylgeraniol (compound 24) and the sesterterpene compound geranylfarnesol (compound 6).
Thus, the isopentenyl transferase domain of the chimeric terpene synthase provided by the invention can synthesize geranylgeranyl pyrophosphate (GGPP) or geranylfarnesyl diphosphate (GFPP) by taking isopentenyl pyrophosphate (IPP) and dimethylallyl pyrophosphate (DMAPP) as substrates, and synthesize different types of diterpenes and sesterterpene products catalyzed by the terpene synthase domain.
Hydrogen spectra, carbon spectra, and oxygen spectra of the compounds 1, 2, 5, 6, 23, and 24, 1 H- 1 The H COSY and HMBC spectral data are shown below.
The following Table 4 shows the hydrogen spectrum, carbon spectrum, and, 1 H- 1 H COSY and HMBC spectrogram data
TABLE 4 hydrogen spectrum, carbon spectrum, of Compound 1, 1 H- 1 H COSY and HMBC spectrogram data
Figure GDA0003894694150000101
Figure GDA0003894694150000111
Table 5 shows the hydrogen spectrum, carbon spectrum, and, 1 H- 1 H COSY and HMBC spectrogram data
TABLE 5 hydrogen spectrum, carbon spectrum, of Compound 2, 1 H- 1 H COSY and HMBC spectrogram data
Figure GDA0003894694150000112
Figure GDA0003894694150000121
The following table 6 shows hydrogen spectra of compound 5 and compound 6: ( 1 H NMR,CDCl 3 500 MHz) and carbon spectrum ( 13 C NMR,CDCl 3 125 MHz) data
TABLE 6 Hydrogen spectra of Compound 5 and Compound 6: ( 1 H NMR,CDCl 3 500 MHz) and carbon spectrum ( 13 C NMR,CDCl 3 125 MHz) data
Figure GDA0003894694150000122
The following table 7 is a hydrogen spectrum of compound 23 and compound 24 (see 1 H NMR,CDCl 3 500 MHz) and carbon spectrum ( 13 C NMR,CDCl 3 125 MHz) data
TABLE 7 Hydrogen spectra of Compound 23 and Compound 24: ( 1 H NMR,CDCl 3 500 MHz) and carbon spectrum ( 13 C NMR,CDCl 3 125 MHz) data
Figure GDA0003894694150000131
Although embodiments of the present invention have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present invention, and that variations, modifications, substitutions and alterations can be made to the above embodiments by those of ordinary skill in the art within the scope of the present invention.
Sequence listing
<110> Wuhan university
<120> chimeric terpene synthase and application thereof
<160> 24
<170> SIPOSequenceListing 1.0
<210> 1
<211> 715
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 1
Met Glu Ser Leu Trp Glu His Ser Val Pro Val Asp His Glu Val Ala
1 5 10 15
Leu Ser Thr Gly Cys Phe Thr Ser Leu Pro Ile Arg Ile His Gln Arg
20 25 30
Asn Asp Leu Ala Asp Asp Ala Thr Lys Gln Ser Ile Arg Asp Trp Gly
35 40 45
Ser Tyr Val Gly Asp Gly Trp Glu Gln Arg Ser Gly Ser Ser Trp Ser
50 55 60
Pro Val Gly Asn Trp Gly Ala Phe Ile Phe Pro Glu Ser Leu Pro Glu
65 70 75 80
Arg Leu Gly Val Ile Thr Tyr Leu Ala Asn Met Gly Asn Ile His Asp
85 90 95
Asp Leu Cys Asp Asp Leu Thr Tyr Asp Glu Ala Leu Glu Glu His Asn
100 105 110
Ser Leu Ser Gln Ala Met Glu Ile Ser Thr Ser Gly Pro His Gln Gly
115 120 125
Thr Gln Asn Phe Met Arg Ser Met Lys Met Lys Lys Tyr Ile Ser Lys
130 135 140
Tyr Leu Leu Glu Ala Leu Glu Ile Asp Arg Pro Arg Ala Leu Arg Met
145 150 155 160
Ile Asn Ser Tyr Arg Ser Lys Trp Leu Asp Val Met Glu Arg Asn Asn
165 170 175
Val Asn Asp Ile Glu Ser Leu Asp Glu Tyr Leu Ile Phe Arg Asn Leu
180 185 190
Asn Gly Gly Met Glu Ala Phe Trp Ser Met Val Glu Phe Gly Met Ala
195 200 205
Val Asp Val Ser Glu Ser Asp Lys Lys Arg Ile Arg Pro Ile Phe Ala
210 215 220
Ala Ala Glu Ser Ala Leu Val Leu Thr Asn Asp Tyr Trp Ser Trp Asp
225 230 235 240
Arg Glu Trp Trp Gln Ala Gln Arg Ala Lys Asp Pro Arg Ile Val Asn
245 250 255
Ala Val His Leu Phe Met Arg Thr Glu Gly Leu Ala Ile Tyr Gln Ala
260 265 270
Arg Glu Ala Val Arg Asp Arg Ile Leu Ala Tyr Glu Ala Glu Phe Leu
275 280 285
Arg Leu Lys Ala Glu Phe Tyr Val Gln Asn Pro Asn Ala Ala Val His
290 295 300
Leu Lys Arg Tyr Ile Glu Leu Cys Gly Val Ile Thr Ala Gly Asn His
305 310 315 320
Tyr Trp Cys Ala Asn Cys Pro Arg His His Ser Trp Arg Asn Glu Asp
325 330 335
Ile Pro Pro Ile Glu Arg Ser Phe Ser Phe Ser Ser Glu Asp Thr Ile
340 345 350
Pro Asp Asp Cys Thr Leu Ser Pro Ala Ala Thr Ile Ser Ser Ser Val
355 360 365
Ser Tyr Lys Ser Ser Pro Thr Thr Val Leu Thr Ser Ser Lys Ser Ile
370 375 380
Gly Val Ile Ala Thr Ala Asp Gly Asp Glu Lys Pro Gln Arg Leu Ser
385 390 395 400
Asp Ser Val Leu Gln Gly Pro Cys Gln Tyr Ile Arg Ser Met Pro Ser
405 410 415
Lys Gly Leu Arg His Leu Phe Ala Glu Ala Leu Arg Gln Trp Leu Leu
420 425 430
Val Glu Asp Ala Ser Leu His Gln Ile Lys Asn Ile Ile Asp Leu Leu
435 440 445
His Asn Ser Ser Leu Ile Leu Asp Asp Ile Glu Asp Asp Ser Pro Leu
450 455 460
Arg Arg Gly Leu Pro Ala Thr His Met Val Phe Gly Arg Ala Gln Ser
465 470 475 480
Ile Asn Ser Ala Asn Phe Met Phe Val Gln Ala Val Gln Val Ala Arg
485 490 495
Thr Leu Lys Ser Pro Asp Cys Leu Gly Ile Leu Leu Glu Glu Leu Asp
500 505 510
Cys Leu Phe Ile Gly Gln Ser Trp Asp Leu Tyr Trp Lys Phe His Thr
515 520 525
Gln Ile Pro Thr Glu Gln Glu Tyr Phe Glu Met Val Asp Ser Lys Thr
530 535 540
Gly Ala Met Phe Arg Leu Leu Thr Arg Leu Met Leu His Thr Ser Ser
545 550 555 560
Val Thr Ala Thr Asp Asp Ser Ala Gln Leu Leu Glu Ala Met Cys Arg
565 570 575
Leu Leu Gly Arg Phe Phe Gln Val Arg Asp Asp Phe Met Asn Leu Ser
580 585 590
Ser Asn Glu Tyr Ser Asp Leu Lys Gly Phe Cys Glu Asp Leu Asp Glu
595 600 605
Gly Lys Met Ser Tyr Pro Met Val Met Val Leu Arg Gln Asn Pro Glu
610 615 620
Tyr Gln Asp Gln Ile Met Gly Thr Leu Arg Gln Gln Ala Met Ser Ala
625 630 635 640
Ala Lys Gly Gly Leu Ser Gln Pro Val Arg Leu Ser Arg Glu Thr Lys
645 650 655
Val Tyr Ile Leu Gly Leu Leu Lys Ser Ser Gly Ala Met Ala Thr Thr
660 665 670
Leu Lys Lys Leu Gln Glu Leu Glu Thr Ala Leu Glu Ser Leu Ile Asp
675 680 685
Gly Leu Glu Thr Gly Phe Gly Glu Thr Asn Pro Val Met Arg Ile Leu
690 695 700
Leu Ser Arg Leu Ser Val Arg Asp Ile Thr Leu
705 710 715
<210> 2
<211> 714
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 2
Met Ala Glu Ser Met Trp Lys Tyr Ser Thr Ala Leu Asp Pro Glu Met
1 5 10 15
Val Arg Arg Thr Gly Cys Phe Thr Thr Leu Pro Val Arg Ile Asn Asn
20 25 30
Arg Asp Asp Ile Pro Asn Ala Ala Ser Leu Arg Val Leu Lys Asp Trp
35 40 45
Ala Glu His Thr Gly Asn Lys Asp Ile Ser Gln Asp Arg Leu Ser Leu
50 55 60
Ser Pro Val Gly Ser Phe Cys Ser Leu Ile Tyr Cys Glu Thr Ile Pro
65 70 75 80
Glu Arg Leu Asp Ser Ile Ser Tyr Leu Thr Asp Leu Phe Phe Leu Ile
85 90 95
Asp Asp Ala Thr Glu Glu Val Ala Asn Asp Lys Ala Ala Gln Glu Glu
100 105 110
Trp Ala Gly Phe Ser Gly Ala Met Thr Asp Ser Leu Gly Glu Thr Pro
115 120 125
Gln Arg Asp His Asp Leu Glu Phe Met Lys Lys Lys Lys Leu Thr Ala
130 135 140
Gln Val Met Leu Asp Phe Met Thr Ile Asp Ala Glu Leu Gly Leu Asp
145 150 155 160
Leu Val Lys Ser Cys Lys Ala Gly Trp Thr Pro Leu Ala Ala Gly Val
165 170 175
Glu Trp Pro Thr Ile Glu Asp Tyr Leu Val Phe Arg Arg Asp Ser Ala
180 185 190
Gly Leu Asp Ile Tyr Trp Gly Lys Thr Val Phe Gly Leu Gly Glu Lys
195 200 205
Leu Thr Asp Asp Glu Glu Lys Leu Ile Arg Pro Leu Val Trp Ala Ala
210 215 220
Glu Lys Ala Ala Met Leu Asn Asn Asp Tyr Trp Ser Trp Asp Val Glu
225 230 235 240
Tyr Ser Glu Ala Asn Gln Lys Thr Glu His Leu Thr Asn Ala Val Ala
245 250 255
Val Leu Met Lys Asn Glu Gly Ile Ser Ala Glu Glu Gly Lys Glu Arg
260 265 270
Ile Lys Gly Leu Ile Ile Asp Tyr Glu Ala Glu Tyr Leu Arg Leu Arg
275 280 285
Ala Gln Phe Tyr Glu Asn His Pro Ser Ala Arg Leu Tyr Leu Arg Lys
290 295 300
Arg Val Glu Leu Ala Gly Ser Met Ala Ala Gly Val Ser Phe Trp Ser
305 310 315 320
Ala Asn Ser Pro Arg Tyr His Leu Ser Lys Lys Gln Val Glu Ala Ala
325 330 335
Gly Leu Pro Asp Asp Pro Arg Thr Lys Pro Asn Gln Thr Ser Asp Thr
340 345 350
Gly Ser Pro Met Gly Arg Ser Ser Thr Asp Ser Gln Val Ala Ser Asp
355 360 365
Ile Thr Trp Ser Thr Ser Thr Asp Ile Ser Ser Gln Ser Gln His Thr
370 375 380
Gln Asp Leu Val Ala His Ser Tyr Tyr Gly Val Pro Lys Leu Asp Arg
385 390 395 400
Thr Ala Leu Asn Gly Pro Ile Glu Tyr Val Ser Ala Met Pro Ser Lys
405 410 415
Gly Val Arg Ser Ser Leu Ile Asp Ala Met Asn Gln Trp Cys Gln Val
420 425 430
Pro Ser Ser Gln Leu Ala Val Val Lys Ser Val Ile Asp Gln Leu His
435 440 445
Asn Ser Ser Leu Val Leu Asp Asp Ile Gln Asp Asp Ser Pro Met Arg
450 455 460
Arg Gly Lys Thr Ala Thr His Leu Ile Phe Gly Gln Ala Gln Ala Ile
465 470 475 480
Asn Ser Ala Asn Phe Leu Tyr Val Arg Val Val Gln Glu Val His Ala
485 490 495
Thr Arg Asn Thr Ala Leu Met Asp Ile Leu Leu Glu Glu Leu Glu Asp
500 505 510
Leu His Val Gly Gln Ser Trp Asp Leu Tyr Trp Lys Tyr Asn Leu Arg
515 520 525
Trp Pro Ser Glu Ala Glu Tyr Phe Ser Met Ile Asp Leu Lys Thr Gly
530 535 540
Gly Leu Phe Arg Met Leu Val Arg Met Met Gln Thr Leu Ser Pro Leu
545 550 555 560
Lys Asp Arg Asp Phe Val Cys Asp Thr Leu Val Ser Met Val Ser Arg
565 570 575
Phe Phe Gln Val Arg Asp Asp Tyr Leu Asn Leu Asn Ser Arg Glu Tyr
580 585 590
Ser Thr Gln Lys Gly Trp Cys Glu Asp Leu Asp Glu Gly Lys Phe Ser
595 600 605
Tyr Leu Ile Ile His Cys Leu Gln His Ser Pro Lys Phe Arg Asp Arg
610 615 620
Ile Met Gly Phe Phe Arg Gln Arg Thr Gly Cys Val Gly Pro Met Pro
625 630 635 640
Thr Val Gly Lys Val Gln Ile Ile Glu Tyr Leu Gln Glu Ala Gly Ser
645 650 655
Phe Asn Ala Cys Trp Glu Leu Leu Asn Ser Leu Glu Asp Asp Ile Glu
660 665 670
Lys Glu Ile Lys Arg Leu Glu Glu Ile Thr Gly Glu Lys Asn Pro Leu
675 680 685
Met His Leu Leu Leu Lys Leu Leu Ser Val Lys Thr Glu Lys Pro Asn
690 695 700
Gly Lys Ala Val Val Ala Pro Ala Gly Leu
705 710
<210> 3
<211> 728
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 3
Met Ala Glu Phe Ala Ile Pro Val Pro Asp Asp Val Val Lys Gln Ser
1 5 10 15
Gly Thr Leu Ser Arg Phe Pro Thr Ala Val His Arg Glu His Ala Arg
20 25 30
Cys Leu Ala Ala Ala Asn Lys Ile Arg Asp Asp Phe Ala Ala Gln Val
35 40 45
Asp Trp Asp Leu Asp Ala Lys Thr Thr Gly His Tyr Pro Thr Leu Gly
50 55 60
Ala Val His Val Val Ala Phe Thr Met Pro Glu Cys Leu Pro Glu Arg
65 70 75 80
Leu Ala Leu Met Thr Arg Phe Thr Asp Phe Thr Ile Met Asn Asp Asp
85 90 95
Tyr Tyr Asp Ala Val Asp Arg Asp Gln Ala Thr Ser Phe Asn Ala Glu
100 105 110
Leu Gln Arg Ser Leu Gly Arg Asp Cys His Ser Asn Thr Val Gln Gly
115 120 125
Asn Ala Ser Val Ala Ile Lys Thr Lys Gln Phe Gln Ala Ser Ile Leu
130 135 140
Val Glu Met Met Val Met Asp Arg Asp Leu Ala Met Asp Val Met Asp
145 150 155 160
Thr Tyr Ser Asp Gly Leu Glu Thr Ala Thr Phe Pro Pro Ser Asp Ile
165 170 175
Cys Thr Ile Glu Glu Tyr Leu Pro Val Arg Leu Val Asn Cys Gly Leu
180 185 190
Asp Val Phe Gln Glu Met Ser Cys Phe Gly Leu Gly Val His Leu Thr
195 200 205
Lys Ala Glu Lys Glu Lys Leu Ser Asp Ile Ala Asn Thr Ala Leu Tyr
210 215 220
Thr Ala Ala Leu Ile Asn Asp Cys His Ser Trp Pro Lys Glu Leu Lys
225 230 235 240
His His Leu Glu Thr Pro Gly Ser Asp Val Pro Phe Asn Ala Val Cys
245 250 255
Ile Leu Met Arg Gln Phe Asn Cys Ser Asp Val Lys Ala Ile Glu Arg
260 265 270
Leu Arg Ala Ile Tyr Val Glu Ile Gln Glu Arg His Leu Ser Leu Val
275 280 285
Arg Asn Leu Glu Gln Ser Glu Gly Ser Ile Pro Glu Thr His Arg Lys
290 295 300
Tyr Ile Met Ala Ala Gln Tyr Ala Ala Ser Gly Ser Glu Phe Trp Ser
305 310 315 320
Leu Tyr Ala Pro Arg Tyr Pro Ser Lys Glu Asp Leu Glu Gln Pro Glu
325 330 335
Tyr Val Leu Val Asp Asn Val Leu His Arg Arg Ser Met Ser Asp Lys
340 345 350
Asp Leu Pro Thr Ser Asp Lys Asp Leu Ala Arg Ala Asp Ser Ala Met
355 360 365
His Ile Glu Thr Ile Lys Thr Ala Gly Ser Ser Gly Met Ser His Met
370 375 380
Asn Glu Ala Tyr Ser Ser Thr Pro Ala Thr Glu Met Val Ala Trp Asp
385 390 395 400
Ala Gly Ser Glu Ile Ile His Thr Glu Ile Asp Ser Asn Gly Ser Lys
405 410 415
Glu Leu Ala Pro Asn Gly Ala Gln Thr Arg Val Gln Lys Pro Ser Glu
420 425 430
Asp Ala Val Arg Ala Pro Tyr Asp Tyr Ile Arg Ala Leu Pro Ser Lys
435 440 445
Arg Ile Arg Glu Thr Phe Ile Asp Ala Leu Asp Ser Trp Leu Ala Val
450 455 460
Pro Ala Gly Ser Ser Ala Ser Ile Lys Ser Ile Ile Gly Met Leu His
465 470 475 480
Gln Ser Ser Leu Met Leu Asp Asp Ile Glu Asp Asp Ser Thr Leu Arg
485 490 495
Arg Gly Lys Pro Thr Ala His Thr Leu Phe Gly Ile Ala Gln Thr Ile
500 505 510
Asn Ser Ala Asn Trp Val Phe Ala Cys Ala Phe Glu Glu Leu Arg Ser
515 520 525
Leu Arg Gly Val Asp Ala Ala Thr Ile Phe Val Glu Glu Val Lys Asn
530 535 540
Leu His Cys Gly Gln Ala Leu Asp Leu His Trp Lys His His Thr Tyr
545 550 555 560
Ile Pro Ser Val Asp Glu Tyr Leu Asn Met Val Asp His Lys Thr Gly
565 570 575
Gly Leu Phe Arg Leu Cys Val Arg Leu Met Gln Gly Glu Ser Ser Thr
580 585 590
Ser Cys His His Ile Asp Ala Glu Arg Phe Ile Thr Leu Leu Gly Arg
595 600 605
Tyr Phe Gln Ile Arg Asp Asp Tyr Gln Asn Leu Val Ser Asp Glu Tyr
610 615 620
Thr Asn Gln Lys Gly Phe Cys Glu Asp Leu Asp Glu Gly Lys Ile Ser
625 630 635 640
Leu Pro Leu Ile Tyr Cys Leu Ala Gly Ser Asp Pro Thr Gln Ile Met
645 650 655
Ile Arg Gly Ile Leu Gln His Lys Arg Ala Gly Glu Met Pro Leu Ser
660 665 670
Met Lys Lys Leu Ile Leu Glu Lys Met Arg Ser Gly Gly Ala Leu Asn
675 680 685
Ala Thr Ile Ser Leu Leu Lys Asp Leu Gln Asp Asn Ile Leu Glu Glu
690 695 700
Leu Lys Ser Leu Glu Ser Ala Phe Gly Ser Gly Asn Pro Met Leu Glu
705 710 715 720
Leu Val Leu Arg Arg Leu Trp Ile
725
<210> 4
<211> 684
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 4
Met Ser Pro Val Asp Ile Ser Phe Ile His Ser Glu Leu Val Asp Arg
1 5 10 15
Glu Glu Val Ala Arg Val Cys Ala Thr Thr Leu Pro Val Arg Lys Ser
20 25 30
Lys Tyr Ser Pro Leu Val Glu Lys Ala Val Leu Glu Phe Gln Gln Gln
35 40 45
Trp Gln Glu Glu Val Gly Phe Ala Tyr Cys Gly Ala Thr Ser Pro Gln
50 55 60
Gly Pro Val Thr Val Phe Phe Pro Pro Glu Thr Lys Gln Asp Arg Val
65 70 75 80
Glu Ile Phe Ala Lys Leu Ile Glu Tyr Phe Phe Ala Tyr Asp Asp Val
85 90 95
Leu Thr Ala Pro Gly Ser Ala Lys Thr Gly Glu Glu Gln Ser Ser Asp
100 105 110
Ser Ile Gly Trp Glu Val Arg Lys Gly Thr Thr Ser Arg Val Arg Ile
115 120 125
Ser Ala Met Lys Gln Ile Gln Ser Glu Val Phe Leu Arg Leu Leu Glu
130 135 140
Ile Asp Arg Lys Arg Gly Asn Leu Ile Leu Arg Ala Ile Asn Asp Leu
145 150 155 160
Ser Arg Val His Gly Thr Ile Gly Ser Arg Asp Leu Arg Thr Trp Asp
165 170 175
Asp Leu Val Gln Tyr Arg Ala Asp Asp Tyr Gly Ala Glu Leu Asn Ile
180 185 190
Met Ser Ile Ile Tyr Cys Cys Glu Leu Asp Leu Thr Gln Gly Asp Ile
195 200 205
Gln Ala Leu Lys Glu Ile Trp Ser Pro Ala Thr Ala Ala Ala Ala Leu
210 215 220
Val Asn Asp Leu Tyr Ser Phe Asn Arg Glu Val Ile Leu Glu Pro Asp
225 230 235 240
Thr Asp Thr Asp Thr Thr Ile Thr Thr Pro Asn Ser Val Trp Tyr Leu
245 250 255
Met Lys Thr Leu Asn Leu Ser Val Ser Gln Ala Lys Glu Phe Leu Val
260 265 270
Lys Asp Lys Ile Leu Pro Leu Glu Arg Glu Phe Ile Ala Lys Lys Ala
275 280 285
Glu Tyr Leu Ala Asn Met Asn Pro Leu Thr Pro Lys Ser Gly Asp Ile
290 295 300
Ile Tyr Phe Leu Glu Met Val Gly Leu Gly Leu Ser Gly Asn Trp Tyr
305 310 315 320
Trp His Ala Ile Ala Asp Arg Phe His Arg Trp Ala Glu Leu Leu Gln
325 330 335
Leu Pro Pro Ala Lys Leu Phe Asp Tyr Asp Glu Ala Thr Ala Thr Cys
340 345 350
Ala Thr Phe Leu Asn Thr Gln Ser Leu Arg Gly Arg Lys Asn Arg Ile
355 360 365
Leu Glu Ser Thr Glu Ser Leu Gln Ala Met Thr Asp Asp Pro Tyr Tyr
370 375 380
Lys Val Leu His Gln Pro Ile Asp Tyr Leu Arg Ser Val Pro Ser Lys
385 390 395 400
Asn Ile Arg Gly Thr Ile Ile Gln Ala Leu Asn Leu Trp Leu Asn Ala
405 410 415
Pro Glu Ser Ala Ala Ala Gln Val Glu Asp Leu Ile Gly His Leu His
420 425 430
Glu Ser Ser Leu Leu Leu Asp Asp Ile Gln Asp Ser Ser Glu Leu Arg
435 440 445
Arg Gly Arg Pro Ser Ala Tyr Arg Val Phe Gly Val Pro Gln Thr Ile
450 455 460
Asn Ala Ala Thr His Ala Leu Thr Leu Ala Phe Glu Lys Val Val Pro
465 470 475 480
Leu Met Lys Pro Asp Ser Ser His Val Phe Phe Asp Glu Leu Arg Asn
485 490 495
Leu His Val Gly Gln Ala Met Asp Leu Tyr Trp Thr Arg Ser Gly Tyr
500 505 510
Arg Pro Ser Ile Ala Glu Tyr Leu Glu Met Asn Arg Leu Lys Thr Gly
515 520 525
Ala Leu Phe Cys Leu Ala Ser Asn Leu Leu Ser Asn Gln Gly Ser Phe
530 535 540
Ser Ala Gly Ala Ile Lys Gln Thr Asp Leu Asn Asp Leu Met Ile Ser
545 550 555 560
Leu Gly Gln Tyr Phe Gln Ala Arg Asp Asp Tyr Ile Asn Leu Ala Ser
565 570 575
Thr Lys Tyr Gln Glu Gln Lys Gly Phe Ala Gln Asp Leu Asp Glu Gly
580 585 590
Lys Leu Ser Leu Pro Leu Ile His Leu Leu Thr Gln Ser Pro Asn Ala
595 600 605
Ala Leu Ile Glu Asn Ile Gln Gln Glu Arg Ala Arg Asn Asn Lys Leu
610 615 620
Pro Ala Asp Leu Lys Gln Leu Ile Leu Asp Glu Met Arg Asp Gln Lys
625 630 635 640
Ile Met Gln Leu Thr Glu Glu Thr Leu Asn Gly Leu Glu Ala Lys Val
645 650 655
Tyr Arg Tyr Leu Glu Arg Leu Glu Val Ser Ala Gly Ile Lys Asn Phe
660 665 670
Thr Phe Arg Phe Leu Leu Asp Arg Leu Arg Glu Met
675 680
<210> 5
<211> 699
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 5
Met Asp Leu Pro Thr Ser Glu Leu Asp Ala Thr Trp Thr Asn Ser Lys
1 5 10 15
Ser Val Asp Leu Thr Arg Glu Arg Pro Pro Asp Cys Phe Thr Thr Arg
20 25 30
Ala Val Arg Ile Asn Lys Cys Asp Asp Val Met Asn Leu Tyr Thr Thr
35 40 45
Lys Ala Ile Asn Asn Trp Thr Ala Ala Met Cys Asp Asp Ala Asp Arg
50 55 60
Lys Val Gly His Ala Phe Ser Ser Val Gly Asn Val Ser Ser Phe Ile
65 70 75 80
Phe Pro Glu Cys Ser Pro Asp Lys Ala His Ile Leu Ala Tyr Phe Thr
85 90 95
Gln Val Ser Phe Ile His Asp Asp Ala Leu Ser Ser Lys Asp Ser Lys
100 105 110
Glu Glu Asn Lys His Leu Ser His Ala Leu Asp Pro Lys Asp Asn Asn
115 120 125
Pro Gly Thr Ser Glu Arg Gly Lys Ala Met Lys Lys Phe Leu Ser Gln
130 135 140
Thr Val Leu Glu Leu Ile Asp Met Asp Thr Asn Glu Gly Gln Glu Phe
145 150 155 160
Val Lys Asn Leu Lys Val Trp Ala Asp Asp Glu Ile Gly Leu Lys Asn
165 170 175
Pro Gln Thr Ile Glu Cys Val Asp Glu Tyr Leu His Phe Arg Arg Leu
180 185 190
Asn Gly Gly Ile Lys Ala Tyr Trp His Trp Leu Ala Phe Ser His Glu
195 200 205
Asp Arg Phe Thr Gln Ala Asp Trp Asp Ser Ile Glu Asp Leu Leu Lys
210 215 220
Ser Ala Asn Arg Val Phe Ile Leu Thr Asn Asp Tyr Phe Ser Trp Pro
225 230 235 240
Arg Glu Arg Leu Tyr Gly Gln Gly Arg Ile Ala Asn Val Ile Glu Phe
245 250 255
Tyr Met Arg Thr Glu Gly Leu Ser Glu Glu Glu Ala Lys Gln Arg Thr
260 265 270
Lys Glu Glu Ile Leu Gln Gly Glu His Leu Phe His Asn Met Cys Val
275 280 285
Glu Arg Phe Ala Arg Glu Pro Asn Leu Pro Arg His Val Lys Lys Leu
290 295 300
Leu Gln Val Ala Glu Val Ala Met Gly Gly Tyr Asn Tyr Trp Ala Ser
305 310 315 320
Thr Cys Pro Arg Leu Asn Ser Trp Lys Glu Gln Ala Pro Thr Ala Glu
325 330 335
Thr Asp Phe His Gly Ser Lys Asn Asp Glu Val Ser Asn Pro Thr Gly
340 345 350
Lys Ala Glu Leu Val Lys Pro Val Lys Thr Asn Gln Thr Ser Gly Lys
355 360 365
Ile Gln Ala Ile Thr Lys Val Gln Ser Leu Pro Ser Val Ser His Phe
370 375 380
Thr Cys Thr Ser Asp Leu Asp Asp Ser Ala Leu Leu Ala Pro Ala His
385 390 395 400
Tyr Val Glu Ser Leu Ser Ser Lys Asn Val Leu Ser Lys Leu Val Glu
405 410 415
Ala Phe Asn Val Trp Met Gln Val Pro Pro Lys Pro Leu Ala Ala Ile
420 425 430
Lys His Val Leu Asp Asp Leu His Asn Ser Ser Leu Ile Leu Asp Asp
435 440 445
Ile Gln Asp Asn Ser Pro Leu Arg Arg Gly Arg Thr Ala Thr His Leu
450 455 460
Ile Phe Gly Pro Ala Gln Ser Ile Asn Ser Ala Thr Tyr Met Phe Val
465 470 475 480
Lys Ala Ala Gln Thr Val Asp Ala Leu Gly Thr Pro Gln Met Met Thr
485 490 495
Ala Phe Leu Gln Gly Leu Glu Thr Leu Phe Ile Gly Gln Ser Trp Asp
500 505 510
Ile Ser Trp Arg Gln Ser Phe His Cys Pro Thr Glu Ser Glu Tyr Leu
515 520 525
Ser Ala Ala Asp Lys Lys Thr Gly Ala Leu Leu Thr Met Met Val Glu
530 535 540
Leu Met Gln Cys Asn Ala Lys Thr Leu Pro Phe Ser Tyr Arg Leu Ser
545 550 555 560
Pro Leu Ala Arg Leu Phe Gly Arg Trp Tyr Gln Val Arg Asp Asp Tyr
565 570 575
Met Asn Leu Gln Gly Ala Asp Tyr Ser Lys Gln Lys Gly Phe Cys Glu
580 585 590
Asp Leu Asp Glu Gly Lys Leu Ser Tyr Pro Ile Leu Lys Cys Cys Gln
595 600 605
Lys Ser Glu Thr Asn Lys Gly Ile Ile Leu Gly Ile Phe Arg Gln Met
610 615 620
Arg Met Thr Asn Thr Lys Met Met Arg Glu Ser Lys Leu His Ile Leu
625 630 635 640
Asp Leu Met Ser Ser Ala Arg Ala Leu Glu Asp Thr Phe Asp Tyr Leu
645 650 655
Gln Gln Leu Gln Gln Glu Ile Glu Arg Asp Ile Arg Glu Ile Glu Val
660 665 670
Leu Ala Gly Glu Ser Asn Pro Glu Leu Leu Leu Leu Val Lys Val Leu
675 680 685
Gly Ala Ile Pro Lys Pro Gly Lys Lys Gly His
690 695
<210> 6
<211> 746
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 6
Met Asp Met Leu Asp Gly His Thr Ser Glu Leu Ile Gly Leu Ser Arg
1 5 10 15
Tyr Asp Thr His Gly Phe Leu Ser Pro Gly Gly Tyr Thr Leu Arg Arg
20 25 30
His Lys Ala Glu Thr Leu Ala Asn Ala Gly Cys Tyr Glu Ala Arg Gln
35 40 45
Asp Trp Ile Gln Tyr Ile Gly Ala Ala Ala Glu Glu Phe Gly Asn Cys
50 55 60
Asn Pro Val Asn Gly Asn Phe Thr Ala Val Val Leu Pro Leu Ala Lys
65 70 75 80
Pro Glu Arg Leu Arg Leu Val Ala Tyr Ile Leu Glu Tyr Ala Phe Leu
85 90 95
His Asp Asn Ile Val Glu Val Ala Lys Asp Asn Ala Ser Ser Ser Lys
100 105 110
Asp Asn Asp Ala Phe Ser Leu Gly Asp Met Glu Thr Ser His Gln Asn
115 120 125
Ala Lys Leu Gly Arg Lys Gln Ile Gln Ala Lys Met Ile Leu Gln Leu
130 135 140
Thr Gln Thr Asp Pro Val Cys Ala Lys Arg Val Met Gly Val Trp Gln
145 150 155 160
Gln Met Leu Glu Thr Thr Ile Lys Asp Lys Ser Lys Ser Phe Ala Asn
165 170 175
Leu Glu Glu Tyr Leu Glu Phe Arg Ile Ile Asp Thr Gly Ala Pro Phe
180 185 190
Val Glu Ser Val Met Leu Phe Gly Leu Gly Glu Thr Leu Thr Lys Glu
195 200 205
Glu Asp Ser Leu Leu Ala Pro Ile Ile Arg Pro Cys Tyr Ala Ser Leu
210 215 220
Ala Leu Ala Asn Asp Tyr Phe Ser Phe His Arg Glu Trp Ala Glu Tyr
225 230 235 240
Gln Ser Ala Gln Ser Ser Ser Ser Ser Ser Pro Asp Ala Pro Ser Ala
245 250 255
Ala Pro Ala Ala Pro Ile Asn Leu Val Tyr Leu Tyr Thr Gln Trp Gln
260 265 270
Ser Val Asp Ile Pro Thr Ala Lys Arg Leu Val Arg Glu Ala Ala Asn
275 280 285
Arg Phe Glu Ala Asp Phe Leu Arg Lys Cys Glu Asp Phe Lys Lys Ser
290 295 300
Gly Gln Ser Asn Asp Lys Leu Asp Arg Tyr Leu Arg Gly Leu Gln Tyr
305 310 315 320
Gln Val Ser Gly Asn Val Val Trp Ser Leu Asn Cys Pro Arg Tyr His
325 330 335
Pro Glu Trp Arg Tyr Asp Pro Asn Lys Gly Leu Glu Asp Ala Leu Thr
340 345 350
Ala Glu Arg Arg Asn Pro Pro Phe Val Glu Gln Val Glu Asp Val Asp
355 360 365
Ala Ser Val Ser Asp Lys Glu Ala Ala Thr Lys Arg Met Ser Ile Ala
370 375 380
Ser Gly Gly Ser Arg Glu Asn Asp Ser Asp Val Ser Thr Thr Ser Ser
385 390 395 400
Ser Trp Asp Asp Arg Ala Ala Ser Arg Ser Ser Ser Phe Ser Ala Met
405 410 415
Leu Asp Asp Asp Glu Ser Gly Met Arg Glu Phe Lys Glu Ala Gln Arg
420 425 430
Glu Leu Ser Leu Asp Ser Leu Leu Pro Ser Glu Glu Lys Leu Gly Met
435 440 445
Glu Ile Val Asn Ala Pro Phe Glu Tyr Thr Arg Leu Met Pro Ser Lys
450 455 460
Asn Val Arg Gly Thr Phe Ile Asp Ala Leu Asn Leu Trp Ala Gly Leu
465 470 475 480
Ser Glu Glu Val Leu Gly Gln Ile Lys Glu Val Val Asp Asp Leu His
485 490 495
Thr Ala Ser Leu Met Phe Asp Asp Val Glu Asp Gly Ser Glu Leu Arg
500 505 510
Arg Gly Asn Pro Ala Ala His Ala Val Tyr Gly Ile Pro Gln Thr Ile
515 520 525
Asn Ser Ala Ser Leu Ala Ile Leu Glu Ala Val Gln Lys Ala Lys Asp
530 535 540
Leu Pro Ile Pro Gly Ala Val Asp Ile Ala Leu Glu Gln Leu Arg Asp
545 550 555 560
Leu His Val Gly Gln Ser Tyr Asp Leu Tyr Trp Thr Arg His Met Thr
565 570 575
Cys Pro Ser Glu Ser Glu Tyr Leu Glu Met Val Ala Lys Lys Thr Gly
580 585 590
Gly Leu Phe Leu Leu Leu Ser Arg Leu Met Ser Glu His Met Pro Lys
595 600 605
Glu Val Arg Ser Leu Val Asn Asp Leu Val Thr Gln Val Gly Ile Tyr
610 615 620
Phe Gln Ile Arg Asp Asp Tyr Gln Asn Leu Ser Ser Asp Glu Tyr Thr
625 630 635 640
Ala Gln Lys Gly Phe Cys Glu Asp Leu Asp Glu Gly Lys Leu Ser Phe
645 650 655
Pro Leu Val His Tyr Leu Asn Thr Glu Arg Gly Ser Ser Asn Ser Gln
660 665 670
Gln Val Arg Glu Val Leu Gln Glu Arg Gln Leu Arg Gly Ser Leu Ser
675 680 685
Met Pro Leu Lys Leu Leu Thr Leu Gln Arg Leu Lys Ser Ser Asn Ser
690 695 700
Leu Glu Tyr Thr Arg Asp Ser Leu Met Arg Leu Glu Arg Gly Val Asp
705 710 715 720
Gly Thr Ile Glu Glu Leu Glu Arg Ser Thr Gly Lys Lys Asn Trp Val
725 730 735
Leu Arg Met Cys Met Ala Lys Leu Ser Val
740 745
<210> 7
<211> 757
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 7
Met Ala Ser Glu Met Trp Lys Tyr Ser Ser Pro Ile Asp Pro Glu Val
1 5 10 15
Val Lys Ala Thr Gly Cys Phe Thr Thr Leu Pro Val Arg Ile Asn Asn
20 25 30
Arg Asp Asp Leu Ala Asn Gly Ala Ser Ser Arg Val Leu Lys Asp Trp
35 40 45
Ala Glu His Thr Gly Asn Gln Ser Ile Asp Arg Asn Arg Val Ser Phe
50 55 60
Ser Pro Val Gly Ser Phe Cys Ser Leu Ile Tyr Cys Glu Thr Ile Pro
65 70 75 80
Glu Arg Leu Asp Ser Ile Ser Tyr Leu Thr Asp Leu Phe Phe Leu Ile
85 90 95
Asp Asp Ala Thr Glu Glu Val Glu Asn Asp Thr Leu Ala Gln Glu Gln
100 105 110
Trp Ala Gly Phe Ser Gly Ala Met Thr Asp Ser Leu Ala Glu Ala Ala
115 120 125
Pro Gln Arg Asp His Asp Leu Asp Met Met Lys Lys Lys Lys Leu Val
130 135 140
Ala Arg Val Met Leu Asp Phe Met Arg Leu Asp Pro Val Leu Gly Leu
145 150 155 160
Asp Leu Val Lys Ser Cys Lys Ala Gly Trp Thr Pro Leu Thr Ala Gly
165 170 175
Val Glu Trp Glu Thr Met Glu Asp Tyr Leu Arg Phe Arg Arg Leu Ser
180 185 190
Ala Gly Leu Asp Ile Tyr Trp Thr Lys Thr Val Phe Gly Leu Gly Glu
195 200 205
Lys Leu Thr Asp Asp Glu Glu Lys Leu Ile Arg Pro Leu Thr Trp Ala
210 215 220
Ala Glu Lys Ala Ala Met Leu Asn Asn Asp Tyr Trp Ser Trp Asp Ile
225 230 235 240
Glu Tyr Leu Gln Ala Gly Gly Asn Ile Glu Lys Leu Thr Asn Ala Val
245 250 255
Ala Val Leu Met Arg Lys Glu Gly Leu Thr Ala Glu Glu Gly Lys Gln
260 265 270
Arg Ile Lys Asn Leu Ile Ile Gly Tyr Glu Glu Glu Tyr Ser Arg Leu
275 280 285
Arg Asp Asp Phe Tyr Asn Ala His Pro Ser Ala Arg Leu Tyr Leu Arg
290 295 300
Lys Arg Val Glu Leu Ala Gly Ser Met Ala Ala Gly Val Ser Phe Trp
305 310 315 320
Ser Ala Asn Ser Pro Arg Tyr His Ile Pro Thr Gln Gln Thr Glu Thr
325 330 335
Thr Thr Glu Ser Gly Glu Ala Ala Gln Pro Ala Ala Gly Thr Trp His
340 345 350
Ala Glu Val Arg Asp Thr Ser Arg Pro Gly Ser Asp Ser Ser Val Ser
355 360 365
Ser Thr Ser Glu Ser Asp Asp Ser Ile Asp Ala Ser Ala Ser Ser Gly
370 375 380
Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Ala Leu Ser Ser Ser Gln
385 390 395 400
Thr Ser Leu Ser Ser Ser Leu Ser Glu Ala Glu Glu Glu Glu Glu Ala
405 410 415
His Lys Pro Pro Pro Cys Phe Asp Ala Pro Ala Lys Leu Gly Arg Ala
420 425 430
Ala Ile Asp Ala Pro Ile Asp Tyr Val Ser Gly Met Pro Ser Lys Gly
435 440 445
Val Arg Thr Ser Leu Ile Asp Ala Met Asn Gln Trp Cys Arg Val Pro
450 455 460
Ser Ala Gln Leu Gly Ala Val Lys Arg Val Val Asp Val Leu His Asn
465 470 475 480
Ser Ser Leu Ile Leu Asp Asp Ile Gln Asp Asp Ser Pro Met Arg Arg
485 490 495
Gly Lys Thr Ala Thr His Leu Val Phe Gly Ala Ala Gln Ala Ile Asn
500 505 510
Ser Ala Thr Phe Leu His Val Arg Ala Val Arg Glu Val His Ala Thr
515 520 525
Gly Ser Ala Ala Leu Met Ala Val Leu Leu Glu Glu Leu Glu Asp Leu
530 535 540
His Val Gly Gln Ser Trp Asp Leu Tyr Trp Lys Tyr Asn Leu Arg Trp
545 550 555 560
Pro Thr Glu Asp Glu Tyr Phe Ser Met Ile Asp Leu Lys Thr Gly Gly
565 570 575
Leu Phe Arg Met Leu Val Arg Met Met Arg Val Leu Ala Pro Glu Pro
580 585 590
Thr Gly Gly Glu Thr Lys Gly Gly Glu Phe Ala Cys Asp Ala Leu Val
595 600 605
Ser Met Val Ser Arg Phe Phe Gln Val Arg Asp Asp Tyr Leu Asn Leu
610 615 620
Asn Ser Arg Glu Tyr Gly Ser Gln Lys Gly Trp Cys Glu Asp Leu Asp
625 630 635 640
Glu Gly Lys Phe Ser Tyr Leu Val Ile His Cys Leu Glu Thr Ser Pro
645 650 655
Arg Phe Arg Asp Arg Val Met Gly Leu Phe Arg Gln Arg Ala Gly Thr
660 665 670
Ala Ser Ala Ser Ser Gly Pro Thr Pro Met Pro Ser Val Ala Lys Val
675 680 685
Gln Ile Ile Glu Tyr Leu Tyr Glu Ala Gly Ser Phe Asp Ala Cys Trp
690 695 700
Arg Leu Leu Val Arg Leu Glu Asp Asp Ile Glu Gly Glu Ile Arg Arg
705 710 715 720
Leu Glu Glu Ala Thr Gly Glu Glu Asn Pro Gln Met His Leu Leu Leu
725 730 735
Lys Leu Leu Ser Val Lys Asn Asp Lys Pro Asn Lys Gly Pro Val Val
740 745 750
Val Pro Ala Gly Leu
755
<210> 8
<211> 718
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 8
Met Ala Gly Thr Arg Ser Ser Arg Pro Gly Ala Thr Ser Phe Ile Gln
1 5 10 15
His Ser Ile Pro Leu Pro Arg Ser Ala Tyr Glu Gly Val Glu Tyr Phe
20 25 30
Cys Arg Phe Arg Pro Arg Ile His Arg Asp Ala Ile Leu Ala Asp Ala
35 40 45
Gly Ser Trp Gln Cys Gln Val Asp Phe Phe Gly Ser Ser Ala Thr Ala
50 55 60
Arg Ala Asp Ser Ile Arg Asn Lys Asn His Thr Ser Tyr Ala Val Gly
65 70 75 80
Cys Ile Asn Pro Val Val Gly Asn Phe Thr Ala Leu Cys Ala Cys Glu
85 90 95
Ala Ile Pro Asp Arg Leu Ala Leu Thr Thr Tyr Met Val Glu Tyr Ala
100 105 110
Tyr Ile His Asp Asp Val Ile Glu Tyr Ala Glu Asn Lys Asp Glu Asp
115 120 125
Arg Asp Asn Val Arg Arg Arg Gln Leu Gln Ala Lys Met Ala Val Glu
130 135 140
Leu Met Asp Ile Asp Lys Val Lys Gly Lys Glu Cys Leu Arg Leu Trp
145 150 155 160
Lys Glu Met Ser Asp Val Phe Val Gln Ile Arg Glu Leu Lys Phe Thr
165 170 175
Lys Leu Asp Asp Tyr Leu Thr Phe Arg Val Ile Asp Ala Gly Cys Pro
180 185 190
Trp Thr Met Ser Leu Leu Cys Phe Ser Met Asp Phe Thr Leu Asn Asp
195 200 205
Asp Glu Val Glu Lys Thr Ala Ala Ile Thr Ser Ala Ala Tyr Asp Gly
210 215 220
Trp Val Leu Val Asn Asp Tyr Phe Ser Trp Glu Lys Glu Trp Lys Asn
225 230 235 240
His Gln Ala Asn Gly Gly Ser Gly Val Ile Ala Asn Ala Ile Phe Leu
245 250 255
Phe Met Arg Trp Tyr Ser Val Asp Ala Val Glu Gly Arg Arg Met Leu
260 265 270
Arg Lys Glu Ile Leu Ala Arg Glu Glu Lys Tyr Cys Lys Ala Lys Glu
275 280 285
Glu Phe Leu Val Ser Gly Asn Val Thr Asp Lys Thr Ser Gln Trp Leu
290 295 300
Glu Leu Leu Asp His Val Thr Ala Gly Asn Phe Ala Trp Ser Met Thr
305 310 315 320
Thr Ala Arg Tyr Gln Leu Gly Gly Lys Asp Ala Tyr Pro Ala Leu Arg
325 330 335
Ala Ala Asn Thr Asp Asn Trp Glu Thr Ser Thr Thr Asp Ser Leu Ser
340 345 350
Asn Pro Ile Ser His Asn Ala Asp Lys Ile Ala Arg Lys Ile Asn Leu
355 360 365
Ile Phe Lys Glu Gln Lys Phe Leu Asp Ala Arg Gly Leu Val Asn His
370 375 380
Thr Glu Asp Tyr Pro Pro Ile Val Leu Thr Ala Gln Val Ser Gln Pro
385 390 395 400
Asp Glu Thr Pro Glu Phe Ile His Ser Gln Val Thr Gln Ala Arg Ser
405 410 415
Phe Thr Gln Tyr Glu Lys Met Ile Leu Gln Pro Gln Asn Tyr Leu Glu
420 425 430
Ser Met Pro Ser Lys Gly Val Arg Asn Ser Val Ile Asp Gly Leu Glu
435 440 445
Met Trp Tyr Gln Val Pro Glu Arg Ser Leu Ala Thr Ile Arg Lys Ile
450 455 460
Val Asn Leu Leu His Ser Ser Ser Leu Met Leu Asp Asp Ile Glu Asp
465 470 475 480
Asn Ser Pro Leu Arg Arg Gly Leu Pro Ala Thr His Thr Val Phe Gly
485 490 495
Ile Ser Gln Thr Ile Asn Ser Ala Asn Leu Leu Met Phe Lys Ala Leu
500 505 510
Lys Ala Ala Glu Ser Leu Ser Pro Ala Ala Val Arg Ile Phe Ile Glu
515 520 525
Arg Ile Ile Glu Gly His Ile Gly Gln Gly Met Glu Leu Tyr Trp Thr
530 535 540
Phe His Thr Glu Ile Pro Thr Glu Glu Glu Tyr Phe Val Met Val Asp
545 550 555 560
Gly Lys Thr Gly Gly Leu Phe Ile Leu Leu Ala Glu Leu Met Arg Ser
565 570 575
Glu Ala Thr Arg His Lys Asp Leu Asp Thr Ser Leu Leu Met Lys Leu
580 585 590
Val Gly Arg Phe Phe Gln Ala Arg Asp Asp Tyr Gln Asn Leu Glu Ser
595 600 605
Ala Gln Tyr Thr Gln Gln Lys Gly Phe Ala Glu Asp Ile Gly Glu Gly
610 615 620
Lys Leu Ser Leu Pro Leu Ile His Ala Leu Gly Ser Lys Thr Pro Gln
625 630 635 640
Arg Gly Arg Leu Met Ser Ile Leu Gln Gln Arg Lys Ser Thr Val Asp
645 650 655
Leu Pro Phe His Ile Arg Lys Leu Ala Leu Asp Asp Ile Lys Ala Thr
660 665 670
Gly Gly Leu Lys Tyr Ala Lys Lys Met Ala Met Ser Leu Gln Asp Ser
675 680 685
Val Asn Glu Thr Leu Thr Gln Tyr Glu Asp Lys Val Gly Ala Lys Asn
690 695 700
Trp Ile Leu Arg Leu Val Gln Lys Arg Leu Glu Leu Glu Val
705 710 715
<210> 9
<211> 712
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 9
Met Ala Glu Glu Met Trp Lys Tyr Ser Ile Pro Ile Asp Pro Glu Thr
1 5 10 15
Val Lys Gln Thr Gly Cys Phe Thr Thr Leu Pro Val Arg Ile Asn Asn
20 25 30
Arg Asp Asp Leu Ala Asn Ala Ala Ser Ser Arg Val Leu Lys Asp Trp
35 40 45
Ala Glu His Thr Gly Asn Gln Asn Ile Asp Pro Asn Arg Val Ser Phe
50 55 60
Ser Pro Val Gly Ser Phe Cys Ser Leu Ile Tyr Cys Glu Thr Ile Pro
65 70 75 80
Glu Arg Leu Asp Ser Ile Ser Tyr Leu Thr Asp Leu Phe Phe Leu Ile
85 90 95
Asp Asp Ala Thr Glu Glu Val Glu Asn Asp Lys Val Ala Gln Glu Gln
100 105 110
Trp Ala Gly Phe Ser Gly Ala Met Thr Asp Ser Leu Gly Glu Thr Pro
115 120 125
Gln Arg Asp His Asp Leu Asp Ser Met Lys Lys Lys Lys Leu Val Ala
130 135 140
Arg Val Met Leu Asp Phe Met Arg Leu Asp Ser Ala Leu Gly Leu Asp
145 150 155 160
Leu Val Lys Ser Cys Lys Ala Gly Trp Thr Pro Leu Ala Ala Gly Val
165 170 175
Glu Trp Glu Thr Met Glu Asp Tyr Leu His Phe Arg Arg Leu Ser Ala
180 185 190
Gly Leu Asp Ile Tyr Trp Thr Lys Thr Val Phe Gly Leu Gly Glu Lys
195 200 205
Leu Ser Asp Asp Glu Glu Lys Leu Ile Arg Pro Leu Val Trp Ala Ala
210 215 220
Glu Lys Ala Ala Met Leu Asn Asn Asp Tyr Trp Ser Trp Asp Ile Glu
225 230 235 240
Tyr Phe Gln Ala Asn Gln Lys Ile Asp Met Leu Thr Asn Ala Val Ala
245 250 255
Val Leu Met Arg Lys Glu Gly Ile Thr Ala Glu Glu Gly Lys Asn Arg
260 265 270
Ile Lys Asn Leu Ile Leu Gly Tyr Glu Ala Glu Tyr Ser Gln Leu Arg
275 280 285
Ala Gln Phe Tyr Glu Ser His Pro Ser Ala Arg Leu Tyr Leu Arg Lys
290 295 300
Arg Val Glu Leu Ala Gly Ser Met Ala Ala Gly Val Ser Phe Trp Ser
305 310 315 320
Ala Asn Ser Pro Arg Tyr His Leu Pro Thr Asp Lys Pro Arg Pro Ser
325 330 335
Val Val Pro Val Glu Val Lys Phe Glu Glu Met Thr Leu Lys Ser Ser
340 345 350
Ala Glu Ser Val Ala Ser Thr Glu Ser Glu Asn Ser Met Ala Ala Thr
355 360 365
Ser Gly Ser Gly Thr Glu Leu Thr Ser Gln Ser Ser His Ser Asp Gly
370 375 380
Leu Arg Lys Arg Tyr Phe Asp Val Pro Lys Leu Gly Pro Ala Ala Ile
385 390 395 400
Asn Glu Pro Ile Asp Tyr Val Ser Gly Met Pro Ser Lys Gly Val Arg
405 410 415
Ser Ser Leu Ile Asp Ala Met Asn Gln Trp Cys Gln Val Pro Ser Ser
420 425 430
Gln Leu Ala Val Val Lys Arg Val Ile Asp Leu Leu His Asn Ser Ser
435 440 445
Leu Ile Leu Asp Asp Ile Gln Asp Asp Ser Pro Met Arg Arg Gly Lys
450 455 460
Thr Ala Thr His Leu Ile Phe Gly Glu Ala Gln Ser Ile Asn Ser Ala
465 470 475 480
Thr Tyr Leu Tyr Val Arg Val Val Gln Glu Val His Ala Thr Gly Asn
485 490 495
Ala Ala Leu Met Lys Val Leu Leu Glu Glu Leu Asp Asp Leu His Val
500 505 510
Gly Gln Ser Trp Asp Leu Tyr Trp Lys Tyr Asn Leu Lys Trp Pro Thr
515 520 525
Glu Gly Glu Tyr Phe Ser Met Ile Asp Leu Lys Thr Gly Gly Leu Phe
530 535 540
Arg Met Leu Val Arg Met Met His Ala Leu Cys His Pro Glu Cys Lys
545 550 555 560
Glu Glu Phe Ala Cys Asp Ala Leu Val Ser Met Val Ser Arg Phe Phe
565 570 575
Gln Val Arg Asp Asp Tyr Leu Asn Leu Asn Ser Arg Glu Tyr Ser Asn
580 585 590
Gln Lys Gly Trp Cys Glu Asp Leu Asp Glu Gly Lys Phe Ser Tyr Leu
595 600 605
Ile Ile His Cys Leu Glu Asn Ser Pro Arg Tyr Arg Asp Arg Ile Met
610 615 620
Gly Leu Phe Arg Gln Arg Thr Gly Cys Ser Gly Pro Met Pro Ser Val
625 630 635 640
Gly Lys Val Gln Ile Ile Glu Tyr Leu Gln Glu Ala Gly Ser Phe Asp
645 650 655
Ala Cys Trp Glu Leu Leu Asn Lys Leu Glu Asp Glu Ile Glu Ala Glu
660 665 670
Ile Arg Arg Leu Glu Ala Val Thr Gly Glu Glu Asn Pro Gln Met His
675 680 685
Leu Leu Leu Lys Leu Leu Ser Val Lys Asn Glu Lys Pro Asn Lys Gly
690 695 700
Pro Val Ile Val Pro Leu Gly Val
705 710
<210> 10
<211> 718
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 10
Met Cys Thr Met Asp Asp His Thr Ser Glu Leu Leu Asp Ile Ser Gln
1 5 10 15
Phe Asp Thr Gln Gly Phe Cys Pro Gly Tyr Pro Leu Arg Arg His Arg
20 25 30
Tyr Glu Ser Leu Ala Asn Ser Gly Cys Tyr Glu Ala Arg Gln Asp Trp
35 40 45
Ile His Tyr Val Gly Pro Ala Glu Glu Phe Gly Ser Cys Asp Pro Thr
50 55 60
Asn Gly Asn Phe Thr Ala Leu Val Leu Pro Leu Thr Lys Pro Glu Arg
65 70 75 80
Leu His Leu Val Ala Tyr Ile Ile Glu Cys Glu Phe Ile Arg Val Leu
85 90 95
Tyr Pro Tyr Ala Phe Leu His Asp Asn Val Val Glu Ala Val Met Pro
100 105 110
Asp Ser Ser Thr Asn Asn Gly Asp Asp Phe His Leu Gly Glu Ala Glu
115 120 125
Lys Arg Gln Arg Asn Val Arg Met Gly Arg Lys Gln Ile Gln Ala Lys
130 135 140
Met Met Tyr Gln Leu Ala Arg Thr Asp Lys Arg Cys Ala Glu Arg Val
145 150 155 160
Lys Arg Val Trp Lys Thr Met Leu Ser Thr Thr Leu Lys His Lys Ser
165 170 175
Asp Ser Phe Ala Ser Leu Glu Glu Tyr Leu Asn Tyr Arg Ile Ile Asp
180 185 190
Thr Gly Gly Tyr Trp Val Glu Ser Val Met Leu Phe Gly Met Gly Met
195 200 205
Thr Leu Thr Glu Glu Glu Asp Ala Gln Leu Ala Glu Val Val Arg Pro
210 215 220
Cys Tyr Ala Ser Leu Ala Leu Ala Asn Asp Tyr Phe Ser Phe Asp Arg
225 230 235 240
Glu Trp Glu Glu Ala Gln Asn Gly Gly Pro Pro Pro Thr Asn Ala Val
245 250 255
Trp Leu Tyr Met Arg Trp Asn Gly Ile Pro Met Pro Ala Ala Lys Lys
260 265 270
Leu Val Arg Glu Ala Ser Asn Arg Tyr Glu Ala Arg Phe Leu Glu Leu
275 280 285
Cys Asp Ser Phe Arg Arg Asn Asn Pro Leu Tyr Ser Glu Lys Leu Asp
290 295 300
Lys Tyr Leu Arg Gly Leu Ala Tyr Gln Ile Ser Gly Asn Val Val Trp
305 310 315 320
Ser Leu Thr Cys Pro Arg Tyr His Pro Glu Phe Arg Tyr Asp Pro Asn
325 330 335
Ala Gly Met Glu Asp Leu Leu Thr Ala Gln Ala Arg Gly Asp Gly Gly
340 345 350
Ala Val Ser Gly Gly Glu Ala Asp Tyr Thr Ala Glu His Arg Gln Ser
355 360 365
Ile Ile Ser Leu Glu Ser Gln His Thr Ala Ser Ser Thr Arg Tyr Ser
370 375 380
Ala Ser Asp Trp Gln Ser Ser Arg Ser Ser Ser Phe Ser Glu Ile Ser
385 390 395 400
Val Asp Gly Glu Asp Gly Glu Thr His Ala Val Lys Leu Pro Ala Glu
405 410 415
Gln Gly Leu Asp Thr Lys His Val Ala Ala Pro Phe Asp Tyr Val Ala
420 425 430
Ser Leu Pro Ser Lys Gly Val Arg Thr Thr Leu Ile Asp Ala Leu Asn
435 440 445
Leu Trp Cys Asp Leu Pro Asp Thr Thr Leu Ala Gly Ile Lys Glu Val
450 455 460
Val Asp Lys Leu His Thr Ala Ser Leu Met Phe Asp Asp Ile Glu Asp
465 470 475 480
Gly Ser Glu Leu Arg Arg Gly Asn Pro Ala Ala His Ala Val Tyr Gly
485 490 495
Val Pro Gln Thr Ile Asn Ala Ala Ser Phe Ala Ile Val Asp Ala Val
500 505 510
Ser Lys Ala Gln Asp Ile Pro Val Thr Gly Ala Val Asp Ile Val Leu
515 520 525
Glu Gln Leu Arg Asp Leu His Ile Gly Gln Ser Tyr Asp Ile His Trp
530 535 540
Thr Arg His Ser Ser Cys Pro Ser Glu Thr Glu Tyr Leu Glu Met Val
545 550 555 560
Ala Lys Lys Thr Gly Gly Leu Phe Lys Leu Ile Ser Arg Leu Met Thr
565 570 575
Thr Gly Leu Pro Asp Ile Thr Lys Ser Thr Ile Asp Asp Leu Val Thr
580 585 590
Gln Leu Gly Ile Tyr Phe Gln Ile Arg Asp Asp Phe Lys Asn Leu Asn
595 600 605
Ser Asp Glu Tyr Ala Ser Gln Lys Gly Phe Cys Glu Asp Leu Asp Glu
610 615 620
Gly Lys Phe Ser Phe Pro Leu Val His Tyr Leu Thr Lys Gly Gln Glu
625 630 635 640
Ser Pro Pro Phe Met Leu Arg Glu Ile Met Gln Gln Arg Arg Glu Ser
645 650 655
Lys Gly Leu Asn Val Pro Ser Lys Met Leu Ala Leu Gln Gln Leu Lys
660 665 670
Lys Ser Gly Ser Leu Glu Tyr Thr His Arg Thr Leu Lys Arg Leu Glu
675 680 685
Val Gln Ile Asn Gln Ile Ile Gly Arg Leu Glu Arg Ile Thr Gly Lys
690 695 700
Asp Asn Trp Val Leu Arg Leu Cys Met Gly Glu Leu Thr Val
705 710 715
<210> 11
<211> 703
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 11
Met Ile Ser Ile Ala Pro Phe Pro Asp Glu Asp Gln Phe Ser Val Leu
1 5 10 15
Val Pro Arg Asp Thr Pro Asp Leu His Gly Phe Cys His Gly Tyr Pro
20 25 30
Leu Arg Arg His Lys Ala Glu His Lys Ala Asn Glu Gly Ser Leu Gln
35 40 45
Leu Arg Ala Asp Trp Glu Lys Tyr Ile Gly Pro Ile Glu Arg Trp Gly
50 55 60
Ser Cys Asn Pro Trp Glu Gly His Phe Gly Ala Val Val Leu Pro Cys
65 70 75 80
Cys Arg Pro Glu Arg Met Ala Ile Val Asn Tyr Ile Phe Glu Tyr Ala
85 90 95
Phe Met Tyr Asp Asn Val Val Glu Ser Ala Ala Gln Ser Thr Ile Asn
100 105 110
Ala His Ala Asp Asp Ile Ala Leu Asp Glu Thr Glu Tyr Arg Thr Val
115 120 125
Arg Ser Val Thr Gly Thr Lys Gln Val Gln Ser Lys Met Leu Leu Glu
130 135 140
Leu Phe Ala Leu Asp Pro Ala Cys Ala Gln Val Val Leu Asp Ser Trp
145 150 155 160
Lys Thr Met Ile Asp Thr Thr Ala Thr Lys Asp Lys Thr Arg Ala Phe
165 170 175
Gln Asn Trp Glu Glu Tyr Val Asp Tyr Arg Ile Ile Asp Thr Gly Ala
180 185 190
Pro Phe Val Asp Met Leu Met Arg Phe Gly Met Gly Phe Val Leu Thr
195 200 205
Pro Glu Glu Gln Ala Ser Ile Glu Pro Val Val Arg Pro Cys Tyr Ala
210 215 220
Ala Leu Gly Leu Ala Asn Asp Tyr Phe Ser Phe Asp Val Glu Trp Asp
225 230 235 240
Glu Phe Gln His Ala Glu Glu Thr Thr Met Thr Asn Ala Val Trp Leu
245 250 255
Cys Met His Trp His Gly Val Asp Val Pro Ala Ala Lys Glu Ile Val
260 265 270
Arg Arg Val Thr Asn Asn Phe Glu Arg Glu Phe Gln Gln Arg Val Ala
275 280 285
Glu Tyr Val Ala Gly Asp Gly Arg Glu Asn Arg Lys Val Gln Val Tyr
290 295 300
Leu Arg Ala Leu Gly Tyr Gln Ile Pro Gly Asn Val Ala Trp Ser Leu
305 310 315 320
Arg Cys Pro Arg Tyr His Pro His Leu Val Glu Glu Ala Gly Arg Leu
325 330 335
Leu Gln Gln Ser Met Gln Thr Ala Ser Asp Glu Gly Cys Val Ala Pro
340 345 350
Val Gln Val Gln Ala Lys Arg Ala Ser Val Ser Glu Glu Ser Glu Ser
355 360 365
Ser Glu Ser Ser Val Trp Ser Gly Pro Asn Ser Pro Arg Ser Ser Ile
370 375 380
Ser Ser Ala Ser Glu Glu Glu Glu Ala Arg Glu Val Glu Glu Val Lys
385 390 395 400
Leu Gly Ala Glu His Leu Leu Asn Pro Ala Glu Tyr Ile Ala Ser Leu
405 410 415
Pro Ser Lys Gly Val Arg Glu Ala Phe Ile Asp Ala Leu Asn Val Trp
420 425 430
Leu Met Leu Pro Asp Arg Gln Val Lys Leu Leu Lys Ser Ile Ala Lys
435 440 445
Thr Leu His Asn Ala Ser Leu Met Leu Asp Asp Ile Glu Asp Ser Ser
450 455 460
Pro Leu Arg Arg Gly Gln Pro Ala Thr His Thr Val Tyr Gly Val Gly
465 470 475 480
Pro Thr Ile Asn Ser Ala Asn Tyr Leu Leu Leu Gln Ala Leu Asp Glu
485 490 495
Val Arg Gly Leu Asp Asp Ser Asp Gly Gln Cys Leu Asp Ile Tyr Thr
500 505 510
Glu Glu Met Arg Asn Leu Phe Thr Gly Gln Ser Tyr Asp Val Tyr Trp
515 520 525
Thr Arg Glu Ala Val Cys Pro Ser Glu Ala Glu Tyr Met Glu Met Ile
530 535 540
Arg Gln Lys Thr Gly Gly Leu Phe Arg Leu Leu Ala Arg Leu Met Ile
545 550 555 560
Ala Thr Ala Pro Ala Gln Arg Asn Arg Asp Ile Pro Leu Leu Pro Leu
565 570 575
Ile Asp Lys Leu Gly Glu Tyr Phe Gln Ile Arg Asp Asp Tyr Lys Asn
580 585 590
Leu Thr Glu Glu Tyr Thr Asp Gln Lys Gly Phe Cys Glu Asp Leu Asp
595 600 605
Glu Gly Lys Phe Ser Phe Pro Leu Ile His Ala Leu Arg Ala Gln Glu
610 615 620
Arg Glu Lys Asn Gly Arg Leu Trp Glu Met Leu Glu Gln Gly Arg Ala
625 630 635 640
Gly Gly Thr Met Glu Leu Ala Arg Lys Arg Glu Val Leu Asp Ile Leu
645 650 655
Gln Asp Ser Gly Ser Met Glu Tyr Thr Arg Arg Thr Leu Gln Gly Ile
660 665 670
Met Gly Glu Ile His Gly Gly Met Gly Arg Val Glu Arg Glu Thr Gly
675 680 685
Gly Val Asn Trp Val Leu Arg Leu Leu Val Met Arg Leu Glu Val
690 695 700
<210> 12
<211> 7327
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 12
ggccgcttca ccgattctga gcgaatcaca ggtgagaaat ttggattcga aataaaccta 60
aaaaaactat ccaataaggc ttccataggc ttcgtatttc ccgaccattc caattggaaa 120
aattgagcgc tgtcttccat gatcttagat aaagccttaa tacttggctc atttccattt 180
gaggttagct ctagcaactg gtaaagcatt tgaagaccag ttggatcatc tctatgctgc 240
ctatagtaag tggaaaataa agtagtaata acgatacctg tcacgttcga aacgagatta 300
ttgttcgaat tataaaggcc atgtataatg tttgatttga cgtatcctaa gtcgtggcta 360
ttgctcttaa ttaaattgtt tccccctagc attgaatttt tcaacagcat accggcggta 420
gctctattgt tctgaagatc ctgtagggag tagtgttgct tcaatacatc atcagattcg 480
ccttcaatta aaatataaca caaataattg agaaattcag gttgcagctg gaagttctcc 540
atagcttcca ttgcgttatt acgaatctct ggatttggtg acatacagtt ctgtaaaaga 600
gttgctagtt gcaacacata gtcttcggcg ggcttccatg tcgatgccat ctttattcac 660
ttaactactg ctaacaattc tggaaaccaa agactgcgga atattctgat atgtattact 720
actattcgct gctcttctgc ataattaata ctgaaaagtt tttcatactt ttaaacataa 780
ccttttttta agcaaaactc tatgacccgg attagaaaac tacgaaaaga gggtaataac 840
ataggtgcag gatttccatc gataacgacg ccgacaatga gccttgctgc aacatccaat 900
taggactaat aactatcgta ggaatttcta cgtaataaac ttcaacagag cctaaaattt 960
gaaaataaat aatctagagg ggaaacttaa agaaattcta ttcttgtcaa taaagtggaa 1020
atctgtcaga tgtcacagtt tctttatttg tgacacatat tttcaacata aattcaggca 1080
ttagtgctgt aagcacaaaa agttgtggcg atatgaatat tccagatttt acttacaagc 1140
tgcattgtag tcttacaatt ctttttttct tttttttttt ttatggaaag gaccactctt 1200
acataactag aatagcatta agaatcagat ttacagataa agatgacatt attttatata 1260
tatattgtca ctccgttcaa gtcgacaacc aataaaaaat ttaaaaaaag ccaggcagtt 1320
aatagaaaaa atatgatatg aatgaatatt ccactttctt ttcacagctt gtctgtaagc 1380
ggatgccggg agcagacaag cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg 1440
ctggcttaac tatgcggcat cagagcagat tgtactgaga gtgcaccata aacgacatta 1500
ctatatatat aatataggaa gcatttaata gacagcatcg taatatatgt gtactttgca 1560
gttatgacgc cagatggcag tagtggaaga tattctttat tgaaaaatag cttgtcacct 1620
tacgtacaat cttgatccgg agcttttctt tttttgccga ttaagaatta attcggtcga 1680
aaaaagaaaa ggagagggcc aagagggagg gcattggtga ctattgagca cgtgagtata 1740
cgtgattaag cacacaaagg cagcttggag tatgtctgtt attaatttca caggtagttc 1800
tggtccattg gtgaaagttt gcggcttgca gagcacagag gccgcagaat gtgctctaga 1860
ttccgatgct gacttgctgg gtattatatg tgtgcccaat agaaagagaa caattgaccc 1920
ggttattgca aggaaaattt caagtcttgt aaaagcatat aaaaatagtt caggcactcc 1980
gaaatacttg gttggcgtgt ttcgtaatca acctaaggag gatgttttgg ctctggtcaa 2040
tgattacggc attgatatcg tccaactgca tggagatgag tcgtggcaag aataccaaga 2100
gttcctcggt ttgccagtta ttaaaagact cgtatttcca aaagactgca acatactact 2160
cagtgcagct tcacagaaac ctcattcgtt tattcccttg tttgattcag aagcaggtgg 2220
gacaggtgaa cttttggatt ggaactcgat ttctgactgg gttggaaggc aagagagccc 2280
cgaaagctta cattttatgt tagctggtgg actgacgcca gaaaatgttg gtgatgcgct 2340
tagattaaat ggcgttattg gtgttgatgt aagcggaggt gtggagacaa atggtgtaaa 2400
agactctaac aaaatagcaa atttcgtcaa aaatgctaag aaataggtta ttactgagta 2460
gtatttattt aagtattgtt tgtgcacttg cctatgcggt gtgaaatacc gcacagatgc 2520
gtaaggagaa aataccgcat caggaaattg taaacgttaa tattttgtta aaattcgcgt 2580
taaatttttg ttaaatcagc tcatttttta accaataggc cgaaatcggc aaaatccctt 2640
ataaatcaaa agaatagacc gagatagggt tgagtgttgt tccagtttgg aacaagagtc 2700
cactattaaa gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg 2760
gcccactacg tgaaccatca ccctaatcaa gttttttggg gtcgaggtgc cgtaaagcac 2820
taaatcggaa ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg 2880
tggcgagaaa ggaagggaag aaagcgaaag gagcgggcgc tagggcgctg gcaagtgtag 2940
cggtcacgct gcgcgtaacc accacacccg ccgcgcttaa tgcgccgcta cagggcgcgt 3000
cgcgccattc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg cgggcctctt 3060
cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt tgggtaacgc 3120
cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgagcgcgcg taatacgact 3180
cactataggg cgaattgggt actacacggt ccaatggata aacatttttt atcaacacta 3240
tgatatataa atataatagt ttttcgtata tatattcctt tttttggtca atttttgaaa 3300
ttttcgtaga aaagggagag acaaaacaca ttatatcaat gaaaacgtac aaaaagtaga 3360
taaagtcagt gcttaaacac gtcttttcct taaaaatact ttattatttt tattttattg 3420
agagggtggt ttaaaaatag aaatagagag agaggtacat acataaacat acgcgcacaa 3480
aagcagagat taggatttaa tgcaggtgac ggacccatct ttcaaacgat ttatatcagt 3540
ggcgtccaaa ttgttaggtt ttgttggttc agcaggtttc ctgttgtggg tcatatgact 3600
ttgaaccaaa tggccggctg ctagggcagc acataaggat aattcacctg ccaagacggc 3660
acaggcaact attcttgcta attgacgtgc gttggtacca ggagcggtag catgcgggcc 3720
tcttacacct aataagtcca acatggcacc ttgtggttct agaacagtac caccaccgat 3780
ggtacctact tcgatggatg gcatggatac ggaaattctc aaatcaccgt ccacttcttt 3840
catcaatgtt atacagttgg aactttcaac attttgtgca ggatcttgtc ctaatgccaa 3900
gaaaacagct gtcactaaat tagctgcatg tgcgttaaat ccaccaacag acccagccat 3960
tgcagatcca accaaattct tagcaatgtt caactcaacc aatgcggaaa catcactttt 4020
taacactttt ctgacaacat caccaggaat agtagcttct gcgacgacac tcttaccacg 4080
accttcgatc cagttgatgg cagctggttt tttgtcggta cagtagttac cagaaacgga 4140
gacaacctcc atatcttccc agccatactc ttctaccatt tgctttaatg agtattcgac 4200
acctttagaa atcatattca tacccattgc gtcaccagta gttgttctaa atctcatgaa 4260
gagtaaatct cctgctagac aagtttgaat atgttgcaga cgtgcaaatc ttgatgtaga 4320
gttaaaagct tttttaattg cgttttgtcc ctcttctgag tctaaccata tcttacaggc 4380
accagatctt ttcaaagttg ggaaacggac tactgggcct cttgtcatac catccttagt 4440
taaaacagtt gttgcaccac cgccagcatt gattgcctta cagccacgca tggcagaagc 4500
taccaaacaa ccctctgtag ttgccattgg tatatgataa gatgtaccat cgataaccaa 4560
ggggcctata acaccaacgg gcaaaggcat gtaacctata acattttcac aacaagcgcc 4620
aaatacgcgg tcgtagtcat aatttttata tggtaaacga tcagatgcta atacaggagc 4680
ttctgccaaa attgaaagag ccttcctacg taccgcaacc gctctcgtag tatcacctaa 4740
ttttttctcc aaagcgtaca aaggtaactt accgtgaata accaaggcag cgacctcttt 4800
gttcttcaat tgttttgtat ttccactact taataatgct tctaattctt ctaaaggacg 4860
tattttctta tccaagcttt caatatcgcg ggaatcatct tcctcactag atgatgaagg 4920
tcctgatgag ctcgattgcg cagatgataa acttttgact ttcgatccag aaatgactgt 4980
tttattggtt aaaaccattt atattgaatt ttcaaaaatt cttacttttt ttttggatgg 5040
acgcaaagaa gtttaataat catattacat ggcaatacca ccatatacat atccatatct 5100
aatcttactt atatgttgtg gaaatgtaaa gagccccatt atcttagcct aaaaaaacct 5160
tctctttgga actttcagta atacgcttaa ctgctcattg ctatattgaa gtacggatta 5220
gaagccgccg agcgggcgac agccctccga cggaagactc tcctccgtgc gtcctggtct 5280
tcaccggtcg cgttcctgaa acgcagatgt gcctcgcgcc gcactgctcc gaacaataaa 5340
gattctacaa tactagcttt tatggttatg aagaggaaaa attggcagta acctggcccc 5400
acaaaccttc aaatcaacga atcaaattaa caaccatagg ataataatgc gattagtttt 5460
ttagccttat ttctggggta attaatcagc gaagcgatga tttttgatct attaacagat 5520
atataaatgc aaaagctgca taaccacttt aactaatact ttcaacattt tcggtttgta 5580
ttacttctta ttcaaatgtc ataaaagtat caacaaaaaa ttgttaatat acctctatac 5640
tttaacgtca aggagaaaaa actatagcgc aatgattgaa tagtcaaaga tttttttttt 5700
ttaatttttt ttttttcata gaacttttta tttaaataaa tcacgtctat atatgtatca 5760
gtataacgta aaaaaaaaaa caccgtcagt taaacaaaac ataaataaaa aaaaaaagaa 5820
gtgtcaaatc aagtgtcaaa tgtatacttc ttttttttac tttgttcaga acaacttctc 5880
atttttttct actcataact ttagcatcac aaagtacaca ataataacga gtagtaacac 5940
ttttatagtt catacatgct tcaactactt aataaatgat tgtatgataa tgttttcaat 6000
gtaagagatt tcgattatcc acaaactttg aaacacaggg acacaattct tgatatgctt 6060
tcaaccgctg cgttttggat acctattctt gacataatat gactaccatt ttgttattgt 6120
acgtggggca gttgacgtct tatcatatgt caaagtcatt tgcgaagttc ttggcaagtt 6180
gccaactgac gagatgcagt aaaaagagat tgccgtcttg aaactttttg tccttttttt 6240
tttccgggga ctctacgaga accctttgtc ctactgatta attttgtact gaatttggac 6300
aattcagatt ttagtagaca agcgcgagga ggaaaagaaa tgacagaaaa attccgatgg 6360
acaagaagat aggaaaaaaa aaaagctttc accgatttcc tagaccggaa aaaagtcgta 6420
tgacatcaga atgaaaaatt ttcaagttag acaaggacaa aatcaggaca aattgtaaag 6480
atatgataaa tcatttgatt cagcgccaat ttgccctttt ccattttcca ttaaatctct 6540
gttctctctt acttatatga tgattaggta tcatctgtat aaaactcctt tcttaatttc 6600
actctaaagc ataccccata gagaagatct ttcggttcga agacattcct acgcataata 6660
agaataggag ggaataatgc cagacaatct atcattacat ttaagcggct cttcaaaaag 6720
attgaactct cgccaactta tggaatcttc caatgagacc tttgcgccaa ataatgtgga 6780
tttggaaaaa gagtataagt catctcagag taatataact accgaagttt atgaggcatc 6840
gagctttgaa gaaaaagtaa gctcagaaaa acctcaatac agctcattct ggaagaaaat 6900
ctattatgaa tatgtggtcg ttgacaaatc aatcttgggt gtttctattc tggattcatt 6960
tatgtacaac caggacttga agcccgtcga aaaagaaagg cgggtttggt cctggtacaa 7020
ttattgttac ttctggcttg ctgaatgttt caatatcaac acttggcaaa ttgcagctac 7080
aggtctacaa ctgggtctaa attggtggca gtgttggata acaatttgga ttgggtacgg 7140
tttcgttggt gcttttgttg ttttggcctc tagagttgga tctgcttatc atttgtcatt 7200
ccctatatca tctagagcat cattcggtat tttcttctct ttatggcctg ttattaatag 7260
agtcgtcatg gccatcgttt ggtatagtgt ccaagcttat attgcggcaa ctcccgtatc 7320
attaagc 7327
<210> 13
<211> 10833
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 13
ggtggcaagc ttatattgaa ttttcaaaaa ttcttacttt ttttttggat ggacgcaaag 60
aagtttaata atcatattac atggcattac caccatatac atatccatat acatatccat 120
atctaatctt acttatatgt tgtggaaatg taaagagccc cattatctta gcctaaaaaa 180
accttctctt tggaactttc agtaatacgc ttaactgctc attgctatat tgaagtacgg 240
attagaagcc gccgagcggg tgacagccct ccgaaggaag actctcctcc gtgcgtcctc 300
gtcttcaccg gtcgcgttcc tgaaacgcag atgtgcctcg cgccgcactg ctccgaacaa 360
taaagattct acaatactag cttttatggt tatgaagagg aaaaattggc agtaacctgg 420
ccccacaaac cttcaaatga acgaatcaaa ttaacaacca taggatgata atgcgattag 480
ttttttagcc ttatttctgg ggtaattaat cagcgaagcg atgatttttg atctattaac 540
agatatataa atgcaaaaac tgcataacca ctttaactaa tactttcaac attttcggtt 600
tgtattactt cttattcaaa tgtaataaaa gtatcaacaa aaaattgtta atatacctct 660
atactttaac gtcaaggaga aaaaactata gtcgactcat gtaattagtt atgtcacgct 720
tacattcacg ccctcccccc acatccgctc taaccgaaaa ggaaggagtt agacaacctg 780
aagtctaggt ccctatttat ttttttatag ttatgttagt attaagaacg ttatttatat 840
ttcaaatttt tctttttttt ctgtacagac gcgtgtacgc atgtaacatt atactgaaaa 900
ccttgcttga gaaggttttg ggacgctcga aggctttaat ttgcggccgg tacccaattc 960
gccctatagt gagtcgtatt acgcgcgctc actggccgtc gttttacaac gtcgtgactg 1020
ggaaaaccct ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg 1080
gcgtaatagc gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg 1140
cgaatggcgc gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 1200
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 1260
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 1320
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 1380
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 1440
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 1500
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 1560
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcct gatgcggtat 1620
tttctcctta cgcatctgtg cggtatttca caccgcatag ggtaataact gatataatta 1680
aattgaagct ctaatttgtg agtttagtat acatgcattt acttataata cagtttttta 1740
gttttgctgg ccgcatcttc tcaaatatgc ttcccagcct gcttttctgt aacgttcacc 1800
ctctacctta gcatcccttc cctttgcaaa tagtcctctt ccaacaataa taatgtcaga 1860
tcctgtagag accacatcat ccacggttct atactgttga cccaatgcgt ctcccttgtc 1920
atctaaaccc acaccgggtg tcataatcaa ccaatcgtaa ccttcatctc ttccacccat 1980
gtctctttga gcaataaagc cgataacaaa atctttgtcg ctcttcgcaa tgtcaacagt 2040
acccttagta tattctccag tagataggga gcccttgcat gacaattctg ctaacatcaa 2100
aaggcctcta ggttcctttg ttacttcttc tgccgcctgc ttcaaaccgc taacaatacc 2160
tgggcccacc acaccgtgtg cattcgtaat gtctgcccat tctgctattc tgtatacacc 2220
cgcagaatac tgcaatttga ctgtattacc aatgtcagca aattttctgt cttcgaagag 2280
taaaaaattg tacttggcgg ataatgcctt tagcggctta actgtgccct ccatggaaaa 2340
atcagtcaag atatccacat gtgtttttag taaacaaatt ttgggaccta atgcttcaac 2400
taactccagt aattccttgg tggtacgaac atccaatgaa gcacacaagt ttgtttgctt 2460
ttcgtgcatg atattaaata gcttggcagc aacaggacta ggatgagtag cagcacgttc 2520
cttatatgta gctttcgaca tgatttatct tcgtttcctg caggtttttg ttctgtgcag 2580
ttgggttaag aatactgggc aatttcatgt ttcttcaaca ctacatatgc gtatatatac 2640
caatctaagt ctgtgctcct tccttcgttc ttccttctgt tcggagatta ccgaatcaaa 2700
aaaatttcaa agaaaccgaa atcaaaaaaa agaataaaaa aaaaatgatg aattgaattg 2760
aaaagctgtg gtatggtgca ctctcagtac aatctgctct gatgccgcat agttaagcca 2820
gccccgacac ccgccaacac ccgctgacgc gccctgacgg gcttgtctgc tcccggcatc 2880
cgcttacaga caagctgtga ccgtctccgg gagctgcatg tgtcagaggt tttcaccgtc 2940
atcaccgaaa cgcgcgaaag agatggagga acgggaaaaa gttagttgtg gtgataggtg 3000
gcaagtggta ttccgtaaga acaacaagaa aagcatttca tattatggct gaactgagcg 3060
aacaagtgca aaatttaagc atcaacgaca acaacgagaa tggttatgtt cctcctcact 3120
taagaggaaa accaagaagt gccagaaata acagtagcaa ctacaataac aacaacggcg 3180
gctacaacgg tggccgtggc ggtggcagct tctttagcaa caaccgtcgt ggtggttacg 3240
gcaacggtgg tttcttcggt ggaaacaacg gtggcagcag atctaacggc cgttctggtg 3300
gtagatggat cgatggcaaa catgtcccag ctccaagaaa cgaaaaggcc gagatcgcca 3360
tatttggtgt ccccgaggat ccaaatttcc aatcttctgg tattaacttc gataactacg 3420
atgatattcc agtggacgcc tctggtaagg atgttcctga accaatcaca gaatttacct 3480
cacctccatt ggacggattg ttattggaaa acatcaaatt ggcccgtttc accaagccaa 3540
cacctgtgca aaaatactcc gtccctatcg ttgccaacgg cagagatttg atggcctgtg 3600
cgcagaccgg ttctggtaag actggtgggt ttttattccc agtgttgtcc gaatcattta 3660
agactggacc atctcctcaa ccagagtctc aaggctcctt ttaccaaaga aaggcctacc 3720
caactgctgt cattatggct ccaactagag agttggccac ccaaattttc gatgaagcca 3780
agaaatttac ttatagatcc tgggtcaagg cctgcgtcgt ctacggtggt tctccaattg 3840
gtaaccaact aagagaaatt gaacgtggtt gcgatctttt agtcgctact ccaggtcgtt 3900
tgaatgactt gttggaacgt ggtaaaattt ctttggccaa cgtcaagtat ttggttctag 3960
atgaagctga tagaatgttg gatatgggtt tcgaacctca aattagacat attgtcgaag 4020
actgtgatat gactcctgtt ggtgaaagac aaactctgat gttctcagct acttttcccg 4080
ctgatatcca acatttggcc cgtgatttct taagtgacta catctttttg tctgttggta 4140
gagtcggttc tacttcagaa aacattactc aaaaagtctt atacgttgaa aatcaagata 4200
agaagtcagc cttattggat ctattgtctg catccactgc tcgaggcttt tgttcccttt 4260
agtgagggtt aattgcgcgc ttggcgtaat catggtcata gctgtttcct gtgtgaaatt 4320
gttatccgct cacaattcca cacaacatag gagccggaag cataaagtgt aaagcctggg 4380
gtgcctaatg agtgaggtaa ctcacattaa ttgcgttgcg ctcactgccc gctttccagt 4440
cgggaaacct gtcgtgccag ctgcattaat gaatcggcca acgcgcgggg agaggcggtt 4500
tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc 4560
tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca gaatcagggg 4620
ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg 4680
ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac 4740
gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg 4800
gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac ctgtccgcct 4860
ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat ctcagttcgg 4920
tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct 4980
gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac 5040
tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt 5100
tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt atctgcgctc 5160
tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca 5220
ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat 5280
ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac gaaaactcac 5340
gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc cttttaaatt 5400
aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct gacagttacc 5460
aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca tccatagttg 5520
cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct ggccccagtg 5580
ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca ataaaccagc 5640
cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc atccagtcta 5700
ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg cgcaacgttg 5760
ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct tcattcagct 5820
ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta 5880
gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg 5940
ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc ttttctgtga 6000
ctggtgaata ctcaaccaag tcattctgag aatagtgtat gcggcgaccg agttgctctt 6060
gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa gtgctcatca 6120
ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg agatccagtt 6180
cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt 6240
ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga 6300
aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat cagggttatt 6360
gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc 6420
gcacatttcc ccgaaaagtg ccacctgaac gaagcatctg tgcttcattt tgtagaacaa 6480
aaatgcaacg cgagagcgct aatttttcaa acaaagaatc tgagctgcat ttttacagaa 6540
cagaaatgca acgcgaaagc gctattttac caacgaagaa tctgtgcttc atttttgtaa 6600
aacaaaaatg caacgcgaga gcgctaattt ttcaaacaaa gaatctgagc tgcattttta 6660
cagaacagaa atgcaacgcg agagcgctat tttaccaaca aagaatctat acttcttttt 6720
tgttctacaa aaatgcatcc cgagagcgct atttttctaa caaagcatct tagattactt 6780
tttttctcct ttgtgcgctc tataatgcag tctcttgata actttttgca ctgtaggtcc 6840
gttaaggtta gaagaaggct actttggtgt ctattttctc ttccataaaa aaagcctgac 6900
tccacttccc gcgtttactg attactagcg aagctgcggg tgcatttttt caagataaag 6960
gcatccccga ttatattcta taccgatgtg gattgcgcat actttgtgaa cagaaagtga 7020
tagcgttgat gattcttcat tggtcagaaa attatgaacg gtttcttcta ttttgtctct 7080
atatactacg tataggaaat gtttacattt tcgtattgtt ttcgattcac tctatgaata 7140
gttcttacta caattttttt gtctaaagag taatactaga gataaacata aaaaatgtag 7200
aggtcgagtt tagatgcaag ttcaaggagc gaaaggtgga tgggtaggtt atatagggat 7260
atagcacaga gatatatagc aaagagatac ttttgagcaa tgtttgtgga agcggtattc 7320
gcaatatttt agtagctcgt tacagtccgg tgcgtttttg gttttttgaa agtgcgtctt 7380
cagagcgctt ttggttttca aaagcgctct gaagttccta tactttctag agaataggaa 7440
cttcggaata ggaacttcaa agcgtttccg aaaacgagcg cttccgaaaa tgcaacgcga 7500
gctgcgcaca tacagctcac tgttcacgtc gcacctatat ctgcgtgttg cctgtatata 7560
tatatacatg agaagaacgg catagtgcgt gtttatgctt aaatgcgtac ttatatgcgt 7620
ctatttatgt aggatgaaag gtagtctagt acctcctgtg atattatccc attccatgcg 7680
gggtatcgta tgcttccttc agcactaccc tttagctgtt ctatatgctg ccactcctca 7740
attggattag tctcatcctt caatgctatc atttcctttg atattggatc atactaagaa 7800
accattatta tcatgacatt aacctataaa aataggcgta tcacgaggcc ctttcgtctc 7860
gcgcgtttcg gtgatgacgg tgaaaacctc tgacacatgc agctcccgga gacggtcaca 7920
gcttgtctgt aagcggatgc cgggagcaga caagcccgtc agggcgcgtc agcgggtgtt 7980
ggcgggtgtc ggggctggct taactatgcg gcatcagagc agattgtact cgagtagtga 8040
gtcaacaatg ggttctgggg cccgattgcc tttctcaatg ccaccaaagg gaatttcgac 8100
gaagaagtca ctcctcatct tcaaattcgt tcttacgccc tggctttcgt tccccaccac 8160
tagaacaaca ggcagctcgt tacataatcc gttcaaatcg tgcatgctaa tagtttttcc 8220
aacagtgtat ttttctgacg tggcattagc taagtggctt gtaataaacg tccagccacc 8280
catttcttgt gatttagtaa aaaactctaa cggtttatca acgtaaaata tgggcagaag 8340
ttcgagggcc ccactgcttg tcttggacac cacaggcgtc aaaggagagc agtttcttct 8400
cgacatcaca atgaagtcaa cccccaggaa gtaagcgctt ctaataatgg caccgatatt 8460
gtgagggtca gttatttcat ccagatataa cccgagagga aacttcttag cgtctgtttt 8520
cgtaccataa ggcagttcat gaggtatatt ttcgttattg aagcccagct cgtgaatgct 8580
taatgctgct gaactggtgt ccatgtcgcc taggtacgca atctccacag gctgcaaagg 8640
ttttgtctca agagcaatgt tattgtgcac cccgtaattg gtcaacaagt ttaatctgtg 8700
cttgtccacc agctctgtcg taaccttcag ttcatcgact atctgaagaa atttactagg 8760
aatagtgcca tggtacagca accgagaatg gcaatttcta ctcgggttca gcaacgctgc 8820
ataaacgctg ttggtgccgt agacatattc gaagatagga ttatcattca taagtttcag 8880
agcaatgtcc ttattctgga acttggattt atggctcttt tggtttaatt tcgcctgatt 8940
cttgatctcc tttagcttct cgacgtgggc ctttttcttg ccatatggat ccgctgcacg 9000
gtcctgttcc ctagcatgta cgtgagcgta tttcctttta aaccacgacg ctttgtcttc 9060
attcaacgtt tcccattgtt tttttctact attgctttgc tgtgggaaaa acttatcgaa 9120
agatgacgac tttttcttaa ttctcgtttt aagagcttgg tgagcgctag gagtcactgc 9180
caggtatcgt ttgaacacgg cattagtcag ggaagtcata acacagtcct ttcccgcaat 9240
tttctttttc tattactctt ggcctcctct agtacactct atattttttt atgcctcggt 9300
aatgattttc attttttttt ttccacctag cggatgactc tttttttttc ttagcgattg 9360
gcattatcac ataatggagg gagcagtaaa ggggcgatca gagttgaaaa aaaaatgctg 9420
ccgtaaaaaa agcatgcacg tatacacttg agtaaatata tttatgtatt catatctcaa 9480
gataagatca tgatacataa aagcgatata acgtcttgaa gctcaggtaa ggggctagta 9540
gatgcaatga atgacgatga agatagagcc caactgaagg ctaggctgtg gatccgtgtg 9600
gaagaacgat tacaacaggt gttgtcctct gaggacataa aatacacacc gagattcatc 9660
aactcattgc tggagttagc atatctacaa ttgggtgaaa tggggagcga tttgcaggca 9720
tttgctcggc atgccggtag aggtgtggtc aataagagcg acctcatgct atacctgaga 9780
aagcaacctg acctacagga aagagttact caagaataag aattttcgtt ttaaaaccta 9840
agagtcactt taaaatttgt atacacttat tttttttata acttatttaa taataaaaat 9900
cataaatcat aagaaattcg cgaattctta atgatctctt gcagcaacta attcacaaat 9960
gtatgccaat gcagcaccat caacatcagc atttctcaaa tgtctttgtg cagcttcgat 10020
atggaaagct aatttttcct ttgcaccagc caatgataac aatgctggat atgtagcctt 10080
attgtttgat tgatcagaac caactggttt accaatcttt tcttctgcac cttcaatatc 10140
taaaatatca tctctgattt ggaaagctaa acccaaatgt gcagcaaatt catccaattc 10200
tcttgtttgt cttgcatcag caccaccaat taatgcacca gcatgaactg agtattgcaa 10260
cattttacca gtcttatgtc tatgaatata ttctaattca gacaatgtca aagttttacc 10320
ttcaccttcc atatctgcag cttgacctgc agccataccc tctggacctg cagctttagc 10380
caatctttcg atcaatctca atctaacaga tggtggtatt ctttcatcat cgatttcagt 10440
gatcaattgg aatgcataag ctaacaaacc gtcaccagcc aaaattgcca tagcttcacc 10500
gaaaacctta tggtttgttg gtttacctct tctcaaatca tcattatcca ttgatggcaa 10560
atcatcatga atcaaagaag cagtatggat catttcgatt gcacaagcaa ctggtaaacc 10620
aactgctgga tctttaccca aagctctaac tgttgataac aacaacaatg gtctaattct 10680
tttaccacct gcttctaaag agtatgccat agctttcttc aatttagctg gaccttccaa 10740
tctttcgatg tatcttgaca atgcagtttc aacagcttgc ttttgttcgt tcaagaattg 10800
ttcaacagac aattgagctg atccagtact gcc 10833
<210> 14
<211> 2145
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 14
atggaatcat tatgggaaca ttctgttcca gttgatcatg aagttgcttt atcaactggt 60
tgtttcacat ctttgccaat cagaatacat caaagaaacg atttggcaga tgatgctact 120
aagcaatcaa tcagagattg gggttcttat gttggtgacg gttgggaaca aagatcaggt 180
tcttcatggt ctccagttgg taattggggt gcttttattt tcccagaatc tttaccagaa 240
agattgggtg ttattacata tttggcaaac atgggtaaca tccatgatga tttgtgtgat 300
gatttgactt acgatgaagc attggaagaa cataactcat tgtctcaagc aatggaaatc 360
tcaacatctg gtccacatca aggtactcaa aacttcatga gatcaatgaa gatgaagaaa 420
tacatctcta aatacttgtt ggaagcattg gaaattgata gaccaagagc attgagaatg 480
atcaactcat acagatctaa gtggttggat gttatggaaa gaaacaacgt taacgatatc 540
gaatcattag atgaatactt gatttttaga aatttgaacg gtggtatgga agcattttgg 600
tctatggttg aatttggtat ggcagttgat gtttcagaat ctgataagaa aagaattaga 660
ccaatctttg ctgcagctga atcagctttg gttttaacaa acgattactg gtcttgggat 720
agagaatggt ggcaagcaca aagagctaaa gatccaagaa tcgttaacgc tgttcatttg 780
tttatgagaa ctgaaggttt ggcaatctat caagcaagag aagctgttag agatagaatt 840
ttggcttatg aagctgaatt tttgagattg aaggctgaat tttacgttca aaatccaaac 900
gcagctgttc atttgaagag atacatcgaa ttgtgtggtg ttattacagc tggtaatcat 960
tactggtgtg caaattgtcc aagacatcat tcttggagaa atgaagatat tccaccaatc 1020
gaaagatcat tttctttttc ttcagaagat actatcccag atgattgtac attgtcacca 1080
gcagctacta tttcttcatc tgtttcttac aagtcatctc caactacagt tttgacatca 1140
tctaagtcta tcggtgttat tgcaactgct gatggtgacg aaaagccaca aagattgtca 1200
gattctgttt tgcaaggtcc atgtcaatac attagatcaa tgccatctaa aggtttgaga 1260
catttgtttg cagaagcatt gagacaatgg ttgttagttg aagatgcttc attgcatcaa 1320
attaaaaata tcatcgattt gttgcataat tcatctttga tcttggatga tatcgaagat 1380
gattctccat taagaagagg tttgccagca acacacatgg tttttggtag agctcaatca 1440
attaattctg caaacttcat gttcgttcaa gcagttcaag ttgctagaac tttgaagtca 1500
ccagattgtt tgggtatttt gttagaagaa ttggattgtt tgtttattgg tcaatcttgg 1560
gatttgtact ggaagttcca tactcaaatc ccaacagaac aagaatactt tgaaatggtt 1620
gattcaaaga ctggtgctat gttcagattg ttgacaagat tgatgttgca tacttcatct 1680
gttactgcaa cagatgattc tgctcaattg ttagaagcaa tgtgtagatt gttaggtaga 1740
tttttccaag ttagagatga tttcatgaat ttgtcatcta acgaatattc agatttgaag 1800
ggtttctgtg aagatttgga tgaaggtaaa atgtcttacc caatggttat ggttttgaga 1860
caaaacccag aataccaaga tcaaatcatg ggtacattga gacaacaagc tatgtctgca 1920
gctaaaggtg gtttatcaca accagttaga ttgtctagag aaactaaggt ttacatcttg 1980
ggtttgttga aatcatctgg tgcaatggct actacattga agaaattgca agaattggaa 2040
acagctttag aatctttgat tgatggtttg gaaactggtt ttggtgaaac aaatccagtt 2100
atgagaattt tgttatcaag attatctgtt agagatatta ctttg 2145
<210> 15
<211> 2142
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 15
atggctgaat caatgtggaa atattctact gcattagatc cagaaatggt tagaagaaca 60
ggttgtttca ctacattgcc agttagaatt aataacagag atgatatccc aaatgctgca 120
tcattgagag ttttaaaaga ttgggctgaa catacaggta ataaggatat ctctcaagat 180
agattatctt tgtcaccagt tggttctttt tgttcattga tctattgtga aactatccca 240
gaaagattgg attctatctc atacttgact gatttgtttt tcttgatcga tgatgctaca 300
gaagaagttg caaatgataa agctgcacaa gaagaatggg ctggtttttc aggtgcaatg 360
actgattctt taggtgaaac accacaaaga gatcatgatt tggagtttat gaagaaaaag 420
aaattgactg ctcaagttat gttggatttc atgacaatcg atgcagaatt gggtttggat 480
ttggttaagt catgtaaagc tggttggact ccattagctg caggtgttga atggccaaca 540
atcgaagatt atttggtttt tagaagagat tctgctggtt tggatatcta ttggggtaaa 600
actgttttcg gtttgggtga aaaattgaca gatgatgaag aaaaattgat cagaccattg 660
gtttgggctg cagaaaaggc tgcaatgttg aacaacgatt actggtcatg ggatgttgaa 720
tactctgaag ctaatcaaaa gactgaacat ttgacaaatg ctgttgcagt tttgatgaag 780
aacgaaggta tctctgcaga agagggtaaa gaaagaatta aaggtttgat catcgattat 840
gaagctgaat acttgagatt gagagcacaa ttctacgaaa accatccatc agctagattg 900
tacttaagaa agagagttga attggcaggt tctatggctg caggtgtttc tttttggtca 960
gctaattctc caagatacca tttgtctaag aaacaagttg aagctgcagg tttgccagat 1020
gatccaagaa caaaaccaaa tcaaacttca gatacaggtt ctccaatggg tagatcttca 1080
actgattcac aagttgcttc tgatatcaca tggtctactt caacagatat ttcttcacaa 1140
tcacaacata cacaagattt ggttgcacat tcttattacg gtgttccaaa attggataga 1200
actgctttga atggtccaat tgaatacgtt tcagcaatgc catctaaagg tgttagatct 1260
tcattgattg atgctatgaa tcaatggtgt caagttccat cttcacaatt ggcagttgtt 1320
aagtcagtta ttgatcaatt gcataattct tcattggttt tagatgatat ccaagatgat 1380
tctccaatga gaagaggtaa aactgctaca catttgattt ttggtcaagc tcaagcaatt 1440
aattcagcaa atttcttgta cgttagagtt gttcaagaag ttcatgctac tagaaataca 1500
gcattgatgg atatcttgtt agaagaatta gaagatttgc atgttggtca atcttgggat 1560
ttgtactgga agtacaattt gagatggcca tcagaagctg aatacttttc tatgatcgat 1620
ttgaagactg gtggtttgtt tagaatgttg gttagaatga tgcaaacatt gtctccattg 1680
aaggatagag atttcgtttg tgatactttg gtttcaatgg tttctagatt tttccaagtt 1740
agagatgatt atttgaattt gaatagcaga gaatactcaa ctcaaaaagg ttggtgtgaa 1800
gatttggatg agggtaaatt ttcatatttg atcatccatt gtttgcaaca ttctccaaag 1860
tttagagata gaatcatggg tttctttaga caaagaactg gttgtgttgg tccaatgcca 1920
acagttggta aagttcaaat catcgaatac ttacaagaag ctggttcttt taatgcatgt 1980
tgggaattgt taaattcttt ggaagatgat attgaaaagg aaattaaaag attagaagaa 2040
atcacaggtg aaaagaatcc attgatgcat ttgttgttga aattgttgtc tgttaagact 2100
gaaaagccaa acggtaaagc tgttgttgct ccagcaggtt ta 2142
<210> 16
<211> 2184
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 16
atggcagaat ttgctattcc agttccagat gatgttgtta aacaatctgg tacattgtca 60
agatttccaa ctgcagttca tagagaacat gctagatgtt tagctgcagc taataagatt 120
agagatgatt ttgcagctca agttgattgg gatttggatg ctaaaactac aggtcattat 180
ccaacattag gtgcagttca tgttgttgct tttactatgc cagaatgttt gccagaaaga 240
ttggcattaa tgactagatt cactgatttc actatcatga acgatgatta ctacgatgct 300
gttgatagag atcaagcaac atcttttaat gctgaattgc aaagatcttt aggtagagat 360
tgtcattcaa atacagttca gggtaatgca tctgttgcta ttaaaactaa gcaattccaa 420
gcatctatct tggttgaaat gatggttatg gatagagatt tggcaatgga tgttatggat 480
acatactctg atggtttaga aactgctaca tttccaccat cagatatttg tactatcgaa 540
gaatacttgc cagttagatt ggttaactgt ggtttggatg ttttccaaga aatgtcttgt 600
ttcggtttgg gtgttcattt gacaaaggca gaaaaggaaa aattgtcaga tatcgcaaac 660
acagctttgt atactgcagc tttgattaat gattgtcatt cttggccaaa agaattgaaa 720
catcatttgg aaactccagg ttcagatgtt ccttttaatg ctgtttgtat cttgatgaga 780
caattcaatt gttctgatgt taaggcaatc gaaagattga gagctatcta tgttgaaatc 840
caagaaagac atttgtcatt agttagaaat ttggaacaat ctgaaggttc aatcccagaa 900
acacatagaa agtacatcat ggcagctcaa tacgcagctt ctggttcaga attttggtct 960
ttgtatgctc caagataccc atcaaaagaa gatttggaac aaccagaata cgttttggtt 1020
gataacgttt tgcatagaag atctatgtca gataaggatt tgccaacttc tgataaggat 1080
ttggcaagag ctgattcagc aatgcatatc gaaacaatta aaactgctgg ttcttcaggc 1140
atgtctcaca tgaatgaagc atactcttca acaccagcta ctgaaatggt tgcatgggat 1200
gctggttcag aaatcatcca tacagaaatc gattctaatg gttcaaaaga attggcacca 1260
aatggtgctc aaactagagt tcaaaaacca tctgaagatg cagttagagc tccatacgat 1320
tacatcagag cattgccatc taagagaatt agagaaactt ttattgatgc tttggattca 1380
tggttagcag ttccagctgg ttcttcagca tctattaaat caatcatcgg catgttgcat 1440
caatcttcat tgatgttgga tgatatcgaa gatgattcta cattgagaag aggtaaacca 1500
acagctcata ctttgttcgg tatcgcacaa actattaatt ctgctaactg ggttttcgca 1560
tgtgctttcg aagaattgag atcattgaga ggtgttgatg cagctactat cttcgttgaa 1620
gaagttaaaa atttgcattg tggtcaagca ttggatttgc attggaagca tcatacatac 1680
atcccatctg ttgatgaata cttgaacatg gttgatcata agactggtgg tttgtttaga 1740
ttgtgtgtta gattgatgca gggtgaatct tcaacatctt gtcatcatat cgatgctgaa 1800
agattcatta ctttgttggg tagatacttc caaatcagag atgattacca aaatttggtt 1860
tcagatgaat acacaaacca aaagggtttc tgtgaagatt tggatgaagg taaaatttct 1920
ttgccattga tctattgttt agcaggttca gatccaactc aaatcatgat cagaggtatt 1980
ttgcaacata aaagagctgg tgaaatgcca ttgtctatga agaaattgat cttggaaaag 2040
atgagatctg gtggtgcatt gaatgctact atttcattgt tgaaggattt gcaagataac 2100
atcttagaag aattgaaatc tttagaatca gctttcggtt ctggtaatcc aatgttggaa 2160
ttagttttga gaagattatg gatt 2184
<210> 17
<211> 2052
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 17
atgtctccag ttgatatttc ttttattcat tcagaattag ttgatagaga agaagttgca 60
agagtttgtg ctactacatt gccagttaga aagtctaaat attcaccatt agttgaaaag 120
gctgttttgg aatttcaaca acaatggcaa gaagaagttg gttttgcata ctgtggtgct 180
acttctccac aaggtccagt tacagttttc tttccaccag aaactaaaca agatagagtt 240
gaaatcttcg caaaattgat cgaatatttc tttgcttacg atgatgtttt gacagcacca 300
ggttctgcta aaactggtga agaacaatct tcagattcaa ttggttggga agttagaaaa 360
ggtactacat ctagagttag aatctcagct atgaagcaaa tccaatctga agttttcttg 420
agattgttag aaatcgatag aaagcgtggt aatttgatct tgagagctat taatgatttg 480
tctagagttc atggtacaat tggttctaga gatttgagaa cttgggatga tttggttcaa 540
tatagagcag atgattacgg tgctgaattg aacatcatgt caatcatcta ttgttgtgaa 600
ttagatttga cacaaggtga cattcaagca ttgaaagaaa tttggtctcc agctactgct 660
gcagctgcat tggttaacga tttgtactct tttaatagag aagttatttt agaaccagat 720
actgatacag atactacaat cactacacca aattctgttt ggtacttaat gaagacattg 780
aatttgtctg tttcacaagc aaaggaattt ttggttaagg ataagatctt accattggaa 840
agagagttta ttgctaagaa agctgaatac ttagctaaca tgaatccatt gactccaaag 900
tctggtgaca tcatatactt tttggaaatg gttggtttgg gtttgtctgg taattggtac 960
tggcatgcaa ttgctgatag atttcataga tgggcagaat tgttacaatt accaccagct 1020
aaattgtttg attatgatga agcaactgct acatgtgcaa catttttgaa cacacaatct 1080
ttgagaggta gaaagaatag aattttagaa tctacagaat cattgcaagc tatgactgat 1140
gatccatact acaaagtttt acatcaacca atcgattact tgagatccgt tccatctaaa 1200
aatatcagag gtacaatcat ccaagcatta aatttgtggt tgaatgctcc agaatctgct 1260
gcagctcaag ttgaagattt gattggtcat ttgcatgaat cttcattgtt gttggatgat 1320
atccaagatt cttcagaatt gagaagaggt agaccatcag cttatagagt ttttggtgtt 1380
ccacaaacta ttaatgcagc tactcatgca ttgacattgg ctttcgaaaa ggttgttcca 1440
ttgatgaagc cagattcttc acatgttttc tttgatgaat tgagaaattt gcatgttggt 1500
caagcaatgg atttgtattg gacaagatct ggttacagac catcaatcgc tgaatatttg 1560
gaaatgaaca gattgaagac tggtgcatta ttttgtttgg cttctaattt gttgtcaaac 1620
caaggttctt tttcagcagg tgctattaaa caaacagatt tgaacgattt gatgatttct 1680
ttaggtcaat attttcaagc aagagatgat tacatcaatt tggcttcaac taagtaccaa 1740
gaacaaaagg gttttgctca agatttggat gagggtaaat tgtctttacc attgatccat 1800
ttgttgacac aatcaccaaa cgcagctttg atcgaaaaca tccaacaaga aagagcaaga 1860
aacaataagt tgccagctga tttgaagcaa ttgatcttgg atgaaatgag agatcaaaag 1920
attatgcaat tgactgaaga aacattgaac ggtttggaag caaaggttta cagatactta 1980
gaaagattgg aagtttctgc tggtattaaa aattttactt ttagattttt attggataga 2040
ttgagagaaa tg 2052
<210> 18
<211> 2097
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 18
atggatttgc caacttctga attggatgct acttggacaa attctaagtc agttgatttg 60
acaagagaaa gaccaccaga ttgtttcact actagagctg ttagaattaa taagtgtgat 120
gatgttatga atttgtacac tacaaaggca attaataact ggactgctgc aatgtgtgat 180
gatgctgata gaaaagttgg tcatgcattt tcttcagttg gtaacgtttc ttcttttatt 240
ttcccagaat gttcaccaga taaggctcat atcttggcat acttcactca agtttctttt 300
attcatgatg atgcattgtc ttcaaaggat tcaaaggaag aaaataagca tttgtctcat 360
gctttggatc caaaggataa caacccaggt acatcagaac gtggtaaagc aatgaagaaa 420
ttcttgtctc aaactgtttt ggaattgatc gatatggata caaacgaagg tcaagaattt 480
gttaaaaatt tgaaagtttg ggctgatgat gaaatcggtt tgaaaaatcc acaaacaatc 540
gaatgtgttg atgaatattt gcatttcaga agattaaatg gtggtattaa agcatattgg 600
cattggttag cattttctca tgaagataga ttcactcaag ctgattggga ttcaatcgaa 660
gatttgttga agtctgcaaa tagagttttt attttgacaa acgattactt ctcatggcca 720
agagaaagat tgtacggtca aggtagaatc gctaacgtta tcgaatttta catgagaact 780
gaaggtttgt ctgaagaaga agcaaagcaa agaacaaagg aagaaatctt gcaaggtgaa 840
catttgttcc ataacatgtg tgttgaaaga ttcgctagag aaccaaattt gccaagacat 900
gttaagaaat tgttacaagt tgctgaagtt gcaatgggtg gttataatta ctgggcttca 960
acttgtccaa gattaaattc ttggaaagaa caagctccaa ctgcagaaac agatttccat 1020
ggttctaaaa atgatgaagt ttctaaccca actggtaaag ctgaattggt taagccagtt 1080
aagactaacc aaacatcagg taaaatccaa gcaatcacaa aggttcaatc tttaccatct 1140
gtttcacatt tcacttgtac atcagatttg gatgattctg ctttgttagc tccagcacat 1200
tatgttgaat ctttgtcttc taaaaatgtt ttgtctaaat tggttgaagc ctttaatgtt 1260
tggatgcaag ttccaccaaa accattggct gcaattaaac atgttttgga tgatttgcat 1320
aattcttcat tgatcttgga tgatatccaa gataattctc cattaagaag aggtagaact 1380
gctacacatt tgatttttgg tccagctcaa tctattaatt cagcaactta catgttcgtt 1440
aaagctgcac aaacagttga tgctttaggt actccacaaa tgatgacagc atttttgcaa 1500
ggtttagaaa ctttgtttat tggtcaatct tgggatattt catggagaca atcttttcat 1560
tgtccaacag aatcagaata tttgtctgct gcagataaga aaactggtgc tttgttgaca 1620
atgatggttg aattgatgca atgtaacgca aagactttgc cattttcata cagattgtct 1680
ccattggcta gattatttgg tagatggtac caagttagag atgattacat gaatttgcaa 1740
ggtgctgatt actctaagca aaagggtttc tgtgaagatt tggatgaggg taaattatca 1800
tacccaatct tgaagtgttg tcaaaaatct gaaactaata agggtatcat cttgggtatt 1860
tttagacaaa tgagaatgac taacacaaag atgatgagag aatcaaaatt gcatatcttg 1920
gatttgatgt cttcagctag agcattggaa gatacattcg attatttgca acaattgcaa 1980
caagaaatcg aaagagatat cagagaaatt gaagttttag ctggtgaatc taacccagaa 2040
ttgttgttgt tggttaaagt tttgggtgca atcccaaaac caggtaaaaa gggtcat 2097
<210> 19
<211> 2238
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 19
atggatatgt tggatggtca tacttctgaa ttgatcggtt tgtcaagata cgatacacat 60
ggtttcttgt ctccaggtgg ttacactttg agaagacata aggcagaaac attggcaaat 120
gctggttgtt atgaagctag acaagattgg attcaataca ttggtgctgc agctgaagaa 180
tttggtaact gtaacccagt taacggtaac ttcactgcag ttgttttacc attggctaaa 240
ccagaaagat tgagattggt tgcatacatc ttggaatacg catttttgca tgataacatc 300
gttgaagttg caaaggataa cgcttcttca tctaaggata acgatgcttt ttctttgggt 360
gacatggaaa catcacatca aaacgcaaaa ttgggtagaa agcaaatcca agctaagatg 420
atcttacaat tgactcaaac agatccagtt tgtgctaaaa gagttatggg tgtttggcaa 480
caaatgttgg aaactacaat taaagacaag tcaaaatctt ttgcaaattt ggaagaatat 540
ttggagttta gaatcatcga tactggtgct ccattcgttg aatctgttat gttgttcggt 600
ttgggtgaaa ctttgacaaa ggaagaagat tcattgttgg caccaatcat cagaccatgt 660
tatgcttctt tagcattggc taacgattac ttctcattcc atagagaatg ggcagaatac 720
caatctgctc aatcatcttc atcttcatct ccagatgctc catcagcagc tccagcagct 780
ccaattaatt tggtttattt gtacactcaa tggcaatctg ttgatattcc aacagcaaaa 840
agattggtta gagaagcagc taacagattc gaagctgatt tcttgagaaa gtgtgaagat 900
ttcaagaaat ctggtcaatc taacgataaa ttggatagat atttgagagg tttgcaatac 960
caagtttctg gtaacgttgt ttggtcattg aactgtccaa gatatcatcc agaatggaga 1020
tacgatccaa ataagggttt ggaagatgca ttaactgctg aaagaagaaa tccaccattt 1080
gttgaacaag ttgaagatgt tgatgcatca gtttctgata aagaagcagc tacaaaaaga 1140
atgtcaattg cttctggtgg ttcaagagaa aacgattctg atgtttcaac tacatcatct 1200
tcatgggatg atagagcagc ttctagatct tcatcttttt cagcaatgtt ggatgatgat 1260
gaatctggta tgagagagtt taaagaagct caaagagaat tatctttgga ttcattgtta 1320
ccatctgaag aaaaattggg tatggaaatc gttaacgctc cattcgaata cactagattg 1380
atgccatcta aaaatgttag aggtactttt attgatgcat taaatttgtg ggctggtttg 1440
tctgaagaag ttttaggtca aattaaagaa gttgttgatg atttgcatac tgcatcttta 1500
atgtttgatg atgttgaaga tggttcagaa ttgagaagag gtaatccagc agctcatgct 1560
gtttacggta ttccacaaac aattaattca gcatctttag ctatcttgga agcagttcaa 1620
aaagctaaag atttgccaat tccaggtgca gttgatatcg ctttagaaca attgagagat 1680
ttgcatgttg gtcaatctta tgatttgtac tggactagac acatgacatg tccatcagaa 1740
tctgaatact tagaaatggt tgctaagaaa actggtggtt tgtttttgtt gttgtctaga 1800
ttaatgtcag aacacatgcc aaaggaagtt agatctttgg ttaacgattt ggttactcaa 1860
gttggtatct atttccaaat cagagatgat taccaaaatt tgtcatctga tgaatacaca 1920
gctcaaaaag gtttctgtga agatttggat gagggtaaat tgtcttttcc attggttcat 1980
tacttgaaca cagaaagagg ttcatctaat tcacaacaag ttagagaagt tttacaagaa 2040
agacaattga gaggttcatt gtctatgcca ttgaaattgt tgactttgca aagattgaaa 2100
tcatctaatt ctttggaata cacaagagat tcattgatga gattagaaag aggtgttgat 2160
ggtactatcg aagaattaga aagatctaca ggtaaaaaga attgggtttt gagaatgtgt 2220
atggctaaat tgtcagtt 2238
<210> 20
<211> 2271
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 20
atggcatctg aaatgtggaa atattcttca ccaattgatc cagaagttgt taaagctaca 60
ggttgtttca ctacattgcc agttagaatt aataatagag atgatttggc taatggtgca 120
tcttcaagag ttttgaaaga ttgggctgaa catactggta accaatctat cgatagaaac 180
agagtttctt tttcaccagt tggttctttt tgttcattga tctattgtga aacaatccca 240
gaaagattgg attctatctc atacttgact gatttgtttt tcttgatcga tgatgctaca 300
gaagaagttg aaaacgatac tttggctcaa gaacaatggg caggtttttc aggtgctatg 360
acagattctt tggcagaagc tgcaccacaa agagatcatg atttggatat gatgaagaaa 420
aagaaattgg ttgctagagt tatgttggat ttcatgagat tggatccagt tttaggtttg 480
gatttggtta agtcatgtaa agcaggttgg acaccattaa ctgctggtgt tgaatgggaa 540
actatggaag attatttgag attcagaaga ttgtctgctg gtttggatat ctattggaca 600
aagactgttt tcggtttggg tgaaaaattg acagatgatg aagaaaaatt gatcagacca 660
ttgacttggg ctgcagaaaa ggctgcaatg ttgaacaacg attactggtc ttgggatatc 720
gaatacttac aagcaggtgg taacatcgaa aaattgacaa acgctgttgc agttttaatg 780
agaaaagaag gtttgactgc tgaagaaggt aaacaaagaa ttaaaaattt gatcatcggt 840
tacgaagaag aatactcaag attgagagat gatttctaca acgctcatcc atctgcaaga 900
ttgtacttaa gaaagagagt tgaattggca ggttcaatgg ctgcaggtgt ttctttttgg 960
tcagctaatt ctccaagata tcatattcca acacaacaaa ctgaaactac aactgaatct 1020
ggtgaagctg cacaaccagc tgcaggtaca tggcatgctg aagttagaga tacttcaaga 1080
ccaggttctg attcttcagt ttcttcaaca tctgaatcag atgattcaat tgatgcttct 1140
gcatcttcag gtacaacaac tactacaact acaactacaa ctgctttgtc ttcatctcaa 1200
acttcattat catcttcatt gtctgaagca gaagaagaag aagaagctca taaaccacca 1260
ccatgttttg atgctccagc aaaattgggt agagctgcaa ttgatgcacc aattgattac 1320
gtttctggta tgccatctaa aggtgttaga acttcattga ttgatgctat gaatcaatgg 1380
tgtagagttc catctgctca attaggtgca gttaagagag ttgttgatgt tttgcataat 1440
tcttcattga tcttggatga tatccaagat gattctccaa tgagaagagg taaaacagct 1500
actcatttgg tttttggtgc tgcacaagct attaattcag caacattttt gcatgttaga 1560
gcagttagag aagttcatgc tactggttct gctgcattga tggctgtttt gttagaagaa 1620
ttagaagatt tgcatgttgg tcaatcatgg gatttgtact ggaagtacaa tttgagatgg 1680
ccaacagaag atgaatattt ttctatgatc gatttgaaga ctggtggttt gtttagaatg 1740
ttggttagaa tgatgagagt tttggctcca gaaccaacag gtggtgaaac taaaggtggt 1800
gaatttgctt gtgatgcatt ggtttcaatg gtttctagat ttttccaagt tagagatgat 1860
tatttgaatt tgaactctag agaatacggt tctcaaaaag gttggtgtga agatttggat 1920
gagggtaaat tttcatactt agttattcat tgtttggaaa catctccaag attcagagat 1980
agagttatgg gtttgtttag acaaagagca ggtacagctt cagcatcttc aggtccaact 2040
ccaatgccat ctgttgctaa ggttcaaatc atcgaatatt tgtacgaagc aggttctttt 2100
gatgcttgtt ggagattgtt agttagatta gaagatgata tcgaaggtga aatcagaaga 2160
ttggaagaag caactggtga agaaaaccca caaatgcatt tgttgttgaa attgttgtct 2220
gttaaaaatg ataagccaaa taagggtcca gttgttgttc cagctggttt g 2271
<210> 21
<211> 2154
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 21
atggctggta ctagatcttc aagaccaggt gcaacttctt ttattcaaca ttctatccca 60
ttaccaagat ctgcttatga aggtgttgaa tacttttgta gattcagacc aagaatacat 120
agagatgcaa ttttggctga tgcaggttca tggcaatgtc aagttgattt ctttggttct 180
tcagctacag caagagctga ttcaatcaga aataagaacc atacttctta tgctgttggt 240
tgtatcaacc cagttgttgg taacttcaca gcattatgtg cttgtgaagc aattccagat 300
agattggctt tgactacata catggttgaa tacgcataca tccatgatga tgttatcgaa 360
tacgctgaaa ataaggatga agatagagat aacgttagaa gaagacaatt gcaagctaaa 420
atggcagttg aattgatgga tatcgataag gttaagggta aagaatgttt gagattgtgg 480
aaggaaatgt ctgatgtttt cgttcaaatc agagaattga agtttactaa attggatgat 540
tatttgactt ttagagttat tgatgctggt tgtccatgga caatgtcatt gttgtgtttc 600
tctatggatt tcactttgaa cgatgatgaa gttgaaaaga ctgctgcaat tacatcagct 660
gcttatgatg gttgggtttt ggttaacgat tacttctctt gggaaaagga atggaagaac 720
catcaagcta acggtggttc aggtgttatt gcaaacgcta ttttcttgtt tatgagatgg 780
tactctgttg atgcagttga aggtagaaga atgttgagaa aggaaatctt ggctagagaa 840
gaaaaatact gtaaggcaaa ggaagaattt ttagtttcag gcaacgttac tgataagaca 900
tctcaatggt tggaattgtt agatcatgtt actgctggta attttgcatg gtcaatgact 960
acagctagat atcaattagg tggtaaagat gcttacccag cattgagagc tgcaaatact 1020
gataattggg aaacatcaac tacagattca ttgtctaacc caatctctca taacgctgat 1080
aagatcgcaa gaaaaattaa tttgattttt aaggaacaaa agttcttgga tgctagaggt 1140
ttggttaatc atactgaaga ttatccacca attgttttga cagcacaagt ttctcaacca 1200
gatgaaactc cagagtttat tcattcacaa gttactcaag ctagatcttt tacacaatac 1260
gaaaagatga tcttgcaacc acaaaactac ttggaatcaa tgccatctaa gggtgttaga 1320
aattctgtta ttgatggttt agaaatgtgg taccaagttc cagaaagatc attggctaca 1380
atcagaaaga tcgttaattt gttgcattct tcatctttga tgttggatga tatcgaagat 1440
aactctccat taagaagagg tttgccagca actcatacag tttttggtat ttcacaaact 1500
attaattctg ctaatttgtt gatgttcaaa gcattaaagg ctgcagaatc attgtctcca 1560
gctgcagtta gaattttcat tgaaagaatc atcgaaggtc atatcggtca gggtatggaa 1620
ttgtactgga ctttccatac agaaatccct acagaagaag aatactttgt tatggttgat 1680
ggtaaaactg gtggtttgtt tattttgttg gctgaattga tgagatcaga agcaactaga 1740
cataaggatt tggatacatc tttgttgatg aaattggttg gtagattttt ccaagctaga 1800
gatgattacc aaaatttgga atctgctcaa tacacacaac aaaaaggttt tgcagaagat 1860
attggtgagg gtaaattgtc attaccattg atccatgctt tgggttctaa aacaccacaa 1920
agaggtagat taatgtcaat cttgcaacaa agaaaatcta ctgttgattt gccattccat 1980
atcagaaaat tggctttgga tgatatcaaa gcaactggtg gtttgaagta cgctaagaaa 2040
atggcaatgt cattgcaaga ttctgttaac gaaactttga cacaatacga agataaggtt 2100
ggtgctaaaa attggatctt gagattggtt caaaagagat tagaattgga agtt 2154
<210> 22
<211> 2136
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 22
atggctgaag aaatgtggaa atattctatc ccaatcgatc cagaaactgt taagcaaaca 60
ggttgtttca ctacattgcc agttagaatt aataacagag atgatttggc aaatgctgca 120
tcttcaagag ttttgaaaga ttgggctgaa catactggta accaaaacat cgatccaaac 180
agagtttctt tttcaccagt tggttctttt tgttcattga tctattgtga aacaatccca 240
gaaagattgg attctatctc atacttgact gatttgtttt tcttgatcga tgatgcaaca 300
gaagaagttg aaaacgataa ggttgctcaa gaacaatggg caggtttttc aggtgctatg 360
actgattctt taggtgaaac accacaaaga gatcatgatt tggattctat gaagaaaaag 420
aaattggttg ctagagttat gttggatttc atgagattgg attcagcatt gggtttggat 480
ttggttaagt cttgtaaagc tggttggact ccattggctg caggtgttga atgggaaaca 540
atggaagatt atttgcattt cagaagatta tcagctggtt tggatatcta ttggactaag 600
acagttttcg gtttgggtga aaaattgtcc gatgatgaag aaaaattgat cagaccattg 660
gtttgggctg cagaaaaggc tgcaatgttg aacaacgatt actggtcttg ggatatcgaa 720
tactttcaag caaaccaaaa gattgatatg ttaactaatg ctgttgcagt tttgatgaga 780
aaagaaggta ttacagctga agaaggtaaa aatagaatta aaaatttgat cttaggttat 840
gaagcagaat actctcaatt gagagctcaa ttctacgaat ctcatccatc agcaagattg 900
tacttaagaa agagagttga attggctggt tcaatggctg caggtgtttc tttttggtca 960
gctaattctc caagatatca tttgccaact gataaaccaa gaccatctgt tgttccagtt 1020
gaggttaagt tcgaagaaat gacattgaag tcttcagctg aatcagttgc atctactgaa 1080
tcagaaaatt ctatggctgc aacatctggt tcaggtactg aattaacatc acaatcttca 1140
cattctgatg gtttgagaaa gagatacttc gatgttccaa aattgggtcc agctgcaatt 1200
aatgaaccaa ttgattacgt ttctggtatg ccatctaaag gtgttagatc ttcattgatt 1260
gatgcaatga atcaatggtg tcaagttcca tcttcacaat tggctgttgt taagagagtt 1320
attgatttgt tgcataattc ttcattgatc ttggatgata tccaagatga ttctccaatg 1380
agaagaggta aaactgcaac acatttgatc ttcggtgaag cacaatcaat taattctgct 1440
acttatttgt acgttagagt tgttcaagaa gttcatgcta ctggtaatgc tgcattgatg 1500
aaagttttgt tagaagaatt ggatgatttg catgttggtc aatcatggga tttgtactgg 1560
aagtacaatt tgaaatggcc aactgagggt gaatattttt ctatgatcga tttgaagaca 1620
ggtggtttgt ttagaatgtt ggttagaatg atgcatgctt tgtgtcatcc agaatgtaaa 1680
gaagaatttg cttgtgatgc attagtttca atggtttcta gatttttcca agttagagat 1740
gattatttga atttgaatag cagagaatac tcaaaccaaa aaggttggtg tgaagatttg 1800
gatgagggta aattttcata tttgatcatc cattgtttgg aaaattctcc aagatacaga 1860
gatagaatca tgggtttgtt tagacaaaga actggttgtt caggtccaat gccatctgtt 1920
ggtaaagttc aaatcatcga atacttacaa gaagcaggtt catttgatgc ttgttgggaa 1980
ttgttgaata agttggaaga tgaaatcgaa gcagaaatca gaagattaga agctgttaca 2040
ggtgaagaaa acccacaaat gcatttgttg ttgaaattgt tgtctgttaa aaatgaaaag 2100
ccaaataagg gtccagttat tgttccatta ggtgtt 2136
<210> 23
<211> 2154
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 23
atgtgtacta tggatgatca tacatctgaa ttgttggata tctcacaatt cgatactcaa 60
ggtttctgtc caggttatcc attgagaaga catagatacg aatctttagc aaattcaggt 120
tgttatgaag ctagacaaga ttggattcat tacgttggtc cagctgaaga atttggttct 180
tgtgatccaa ctaacggtaa cttcacagca ttggttttac cattgactaa accagaaaga 240
ttgcatttgg ttgcttacat catcgagtgt gagtttatta gagttttgta cccatacgca 300
tttttgcatg ataatgttgt tgaagcagtt atgccagatt cttcaacaaa caacggtgac 360
gatttccatt tgggtgaagc tgaaaagaga caaagaaacg ttagaatggg tagaaagcaa 420
atccaagcaa agatgatgta ccaattggct agaactgata agagatgtgc agaaagagtt 480
aagagagttt ggaagacaat gttgtctact acattgaagc ataagtcaga ttctttcgca 540
tcattggaag aatatttgaa ctacagaatc atcgatactg gtggttactg ggttgaatct 600
gttatgttgt ttggtatggg tatgacttta acagaagaag aagatgctca attggcagaa 660
gttgttagac catgttatgc ttctttagct ttggcaaacg attacttctc attcgataga 720
gaatgggaag aagctcaaaa tggtggtcca ccaccaacaa atgcagtttg gttgtatatg 780
agatggaatg gtattccaat gccagctgct aagaaattgg ttagagaagc atctaacaga 840
tacgaagcta gatttttaga attgtgtgat tcttttagaa gaaacaaccc attgtactca 900
gaaaaattgg ataaatattt gagaggtttg gcttaccaaa tttctggtaa tgttgtttgg 960
tcattgactt gtccaagata tcatccagag tttagatacg atccaaatgc tggtatggaa 1020
gatttgttaa ctgctcaagc tagaggtgac ggtggtgctg tttctggtgg tgaagctgat 1080
tacacagcag aacatagaca atcaatcatt tctttagaat cacaacatac tgcttcttca 1140
acaagatatt cagcatctga ttggcaatct tcaagatctt catctttttc tgaaatctca 1200
gttgatggtg aagatggtga aactcatgca gttaaattgc cagctgaaca aggtttagat 1260
acaaaacatg ttgctgcacc atttgattat gttgcttctt tgccatcaaa gggtgttaga 1320
actacattga tcgatgcatt gaatttgtgg tgtgatttgc cagatactac attggctggt 1380
attaaagaag ttgttgataa attgcataca gcatctttaa tgttcgatga tatcgaagat 1440
ggttcagaat tgagaagagg taatccagct gcacatgcag tttacggtgt tccacaaact 1500
attaatgctg catcttttgc tattgttgat gcagtttcaa aagctcaaga tattccagtt 1560
acaggtgctg ttgatattgt tttagaacaa ttgagagatt tgcatatcgg tcaatcttat 1620
gatatccatt ggactagaca ttcatcttgt ccatcagaaa cagaatactt agaaatggtt 1680
gctaagaaaa ctggtggttt gtttaaatta atttctagat tgatgactac aggtttacca 1740
gatatcacta aatcaacaat cgatgatttg gttacacaat tgggtatcta tttccaaatc 1800
agagatgatt tcaagaattt gaactctgat gaatacgctt cacaaaaagg tttctgtgaa 1860
gatttggatg agggtaaatt ttctttccca ttggttcatt atttgactaa aggtcaagaa 1920
tctccacctt ttatgttgag agaaatcatg caacaaagaa gagaatctaa gggtttgaac 1980
gttccatcaa agatgttggc tttgcaacaa ttgaagaaat ctggttcttt agaatacact 2040
catagaacat tgaagagatt agaagttcaa attaatcaaa tcattggtag attggaaaga 2100
atcactggta aagataactg ggttttgaga ttgtgtatgg gtgaattaac agtt 2154
<210> 24
<211> 2109
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 24
atgatttcta ttgctccatt cccagatgaa gatcaatttt cagttttagt tccaagagat 60
actccagatt tgcatggttt ttgtcatggt tatccattaa gaagacataa agcagaacat 120
aaggctaacg aaggttcttt acaattgaga gcagattggg aaaaatacat tggtccaatt 180
gaaagatggg gttcatgtaa tccatgggaa ggtcattttg gtgctgttgt tttgccatgt 240
tgtagaccag aaagaatggc aatcgttaac tacatcttcg aatacgcttt tatgtacgat 300
aacgttgttg aatctgctgc acaatcaact attaatgctc atgcagatga tatcgcattg 360
gatgaaactg aatacagaac agttagatct gttactggta caaagcaagt tcaatcaaag 420
atgttgttag aattgtttgc tttggatcca gcttgtgcac aagttgtttt ggattcttgg 480
aagacaatga tcgatactac agcaactaag gataagacaa gagctttcca aaactgggaa 540
gaatacgttg attacagaat catcgatact ggtgctccat tcgttgatat gttgatgaga 600
ttcggtatgg gttttgtttt gacaccagaa gaacaagcat ctattgaacc agttgttaga 660
ccatgttatg ctgcattggg tttggctaac gattacttct cattcgatgt tgaatgggat 720
gaatttcaac atgcagaaga aactacaatg actaatgctg tttggttatg tatgcattgg 780
catggtgttg atgttccagc tgcaaaggaa atcgttagaa gagttacaaa caattttgaa 840
agagaatttc aacaaagagt tgcagaatat gttgctggtg acggtagaga aaacagaaag 900
gttcaagttt acttaagagc attgggttat caaattcctg gtaatgttgc ttggtcttta 960
agatgtccaa gataccatcc acatttggtt gaagaagctg gtagattgtt acaacaatct 1020
atgcaaactg catcagatga aggttgtgtt gctccagttc aagttcaagc aaaaagagct 1080
tctgtttcag aagaatctga atcttcagaa tcttcagttt ggtctggtcc aaattcacca 1140
agatcttcaa tttcttcagc atcagaagaa gaagaagcta gagaagttga agaagttaaa 1200
ttgggtgctg aacatttgtt gaacccagca gaatacatcg cttctttacc atcaaaaggt 1260
gttagagaag cctttattga tgctttgaac gtttggttga tgttgccaga tagacaagtt 1320
aaattgttga agtctatcgc aaagactttg cataacgctt cattaatgtt ggatgatatc 1380
gaagattctt caccattgag aagaggtcaa ccagctactc atacagttta tggtgttggt 1440
ccaacaatta attctgcaaa ctacttgttg ttgcaagcat tggatgaagt tagaggtttg 1500
gatgattcag atggtcaatg tttggatatc tatactgaag aaatgagaaa tttgtttaca 1560
ggtcaatctt atgatgttta ctggacaaga gaagcagttt gtccatcaga agctgaatac 1620
atggaaatga ttagacaaaa gactggtggt ttgtttagat tgttggctag attgatgatt 1680
gctacagcac cagctcaaag aaacagagat atcccattgt tgccattgat cgataaattg 1740
ggtgaatact ttcaaatcag agatgattac aagaatttga ctgaagaata cacagatcaa 1800
aaaggtttct gtgaagattt ggatgagggt aaattttctt tcccattgat ccatgcattg 1860
agagctcaag aaagagaaaa gaatggtaga ttatgggaaa tgttggaaca aggtagagca 1920
ggtggtacta tggaattggc tagaaagaga gaagttttag atatcttgca agattctggt 1980
tcaatggaat acactagaag aacattgcaa ggtattatgg gtgaaattca tggtggtatg 2040
ggtagagttg aaagagaaac aggtggtgtt aactgggttt tgagattgtt ggttatgaga 2100
ttggaagtt 2109

Claims (3)

1. Use of a chimeric terpene synthase and/or nucleic acid molecule and/or construct and/or recombinant cell in the preparation of a terpenoid;
when the amino acid sequence of the chimeric terpene synthase is shown as SEQ ID NO. 1, the nucleotide sequence of the nucleic acid molecule is shown as SEQ ID NO. 14; the nucleic acid molecule encodes the chimeric terpene synthase, and the nucleotide sequence of the nucleic acid molecule is shown as SEQ ID NO. 14; the construct contains the nucleic acid molecule; the recombinant cell comprises the nucleic acid molecule; the structure of the terpenoid is as follows,
Figure FDA0003851018990000011
when the amino acid sequence of the chimeric terpene synthase is shown in SEQ ID NO. 2, SEQ ID NO. 7 and SEQ ID NO. 9, the nucleotide sequence of the nucleic acid molecule is shown in SEQ ID NO. 15, SEQ ID NO. 20 and SEQ ID NO. 22; correspondingly, the nucleic acid molecule encodes the chimeric terpene synthase, and the nucleotide sequence of the nucleic acid molecule is shown as SEQ ID NO. 15, SEQ ID NO. 20 and SEQ ID NO. 22; the construct contains the nucleic acid molecule; the recombinant cell comprises the nucleic acid molecule; the structure of the terpenoid is as follows,
Figure FDA0003851018990000012
when the amino acid sequence of the chimeric terpene synthase is shown as SEQ ID NO. 3, the nucleotide sequence of the nucleic acid molecule is shown as SEQ ID NO. 16; the nucleic acid molecule encodes the chimeric terpene synthase, and the nucleotide sequence of the nucleic acid molecule is shown as SEQ ID NO:16; the construct contains the nucleic acid molecule; the recombinant cell comprises the nucleic acid molecule; the structure of the terpenoid is as follows,
Figure FDA0003851018990000013
when the amino acid sequences of the chimeric terpene synthases are shown as SEQ ID NO. 5 and SEQ ID NO. 11, the nucleotide sequences of the nucleic acid molecules are shown as SEQ ID NO. 18 and SEQ ID NO. 24 correspondingly; correspondingly, the nucleic acid molecule encodes the chimeric terpene synthase, and the nucleotide sequence of the nucleic acid molecule is shown as SEQ ID NO. 18 and SEQ ID NO. 24; the construct contains the nucleic acid molecule; the recombinant cell comprises the nucleic acid molecule; the structure of the terpenoid is as follows,
Figure FDA0003851018990000021
when the amino acid sequence of the chimeric terpene synthase is shown as SEQ ID NO. 6, the nucleotide sequence of the nucleic acid molecule is shown as SEQ ID NO. 19; the nucleic acid molecule encodes the chimeric terpene synthase, and the nucleotide sequence of the nucleic acid molecule is shown as SEQ ID NO. 19; the construct contains the nucleic acid molecule; the recombinant cell comprises the nucleic acid molecule; the structure of the terpenoid is as follows,
Figure FDA0003851018990000022
when the amino acid sequence of the chimeric terpene synthase is shown as SEQ ID NO. 4, the nucleotide sequence of the nucleic acid molecule is shown as SEQ ID NO. 17; the nucleic acid molecule encodes the chimeric terpene synthase, and the nucleotide sequence of the nucleic acid molecule is shown as SEQ ID NO:17; the construct contains the nucleic acid molecule; the recombinant cell comprises the nucleic acid molecule; the structure of the terpenoid is as follows,
Figure FDA0003851018990000023
when the amino acid sequence of the chimeric terpene synthase is shown as SEQ ID NO. 10, the nucleotide sequence of the nucleic acid molecule is shown as SEQ ID NO. 23; the nucleic acid molecule encodes the chimeric terpene synthase, and the nucleotide sequence of the nucleic acid molecule is shown as SEQ ID NO. 23; the construct contains the nucleic acid molecule; the recombinant cell comprises the nucleic acid molecule; the structure of the terpenoid is as follows,
Figure FDA0003851018990000031
2. a preparation method of a terpenoid is characterized by comprising the following steps: culturing the recombinant cells to obtain a culture product; and isolating the terpenoid from the culture product, the recombinant cells comprising a nucleic acid molecule, the terpenoid having the structure shown below,
Figure FDA0003851018990000032
when the terpenoid is a compound shown as a formula 5, the nucleotide sequence of the nucleic acid molecule is SEQ ID NO. 14 and/or SEQ ID NO. 17;
when the terpenoid is the compound shown as the formula 2, the nucleotide sequence of the nucleic acid molecule is at least one of SEQ ID NO. 15, SEQ ID NO. 20 and SEQ ID NO. 22;
when the terpenoid is the compound shown as the formula 1, the nucleotide sequence of the nucleic acid molecule is SEQ ID NO 16;
when the terpenoid is the compound shown as the formula 6, the nucleotide sequence of the nucleic acid molecule is at least one of SEQ ID NO. 18, SEQ ID NO. 23 and SEQ ID NO. 24;
when the terpenoid is a compound shown as a formula 24, the nucleotide sequence of the nucleic acid molecule is SEQ ID NO. 19 and/or SEQ ID NO. 23;
when the terpenoid is the compound shown as the formula 23, the nucleotide sequence of the nucleic acid molecule is SEQ ID NO. 17.
3. A terpenoid is characterized in that the structure of the terpenoid is shown as a structural formula 1 or a structural formula 2,
Figure FDA0003851018990000041
CN202110618456.4A 2021-06-03 2021-06-03 Chimeric terpene synthase and application thereof Active CN113480660B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110618456.4A CN113480660B (en) 2021-06-03 2021-06-03 Chimeric terpene synthase and application thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110618456.4A CN113480660B (en) 2021-06-03 2021-06-03 Chimeric terpene synthase and application thereof

Publications (2)

Publication Number Publication Date
CN113480660A CN113480660A (en) 2021-10-08
CN113480660B true CN113480660B (en) 2022-11-18

Family

ID=77934438

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110618456.4A Active CN113480660B (en) 2021-06-03 2021-06-03 Chimeric terpene synthase and application thereof

Country Status (1)

Country Link
CN (1) CN113480660B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114807107A (en) * 2022-04-12 2022-07-29 武汉大学 Chimeric triterpene synthase and triterpene compounds
CN115404229A (en) * 2022-05-19 2022-11-29 华东理工大学 Bifunctional terpene synthase, mutant thereof and catalytic product 5-15 ring system sesterterpene compound

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108239630A (en) * 2016-12-27 2018-07-03 武汉臻智生物科技有限公司 A kind of method that Terpene synthase is transformed
CN108239631A (en) * 2016-12-27 2018-07-03 武汉臻智生物科技有限公司 A kind of Terpene synthase and application thereof
CN110093329A (en) * 2018-01-30 2019-08-06 深圳艾格鑫科技有限公司 Terpene synthase and its application

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105647879A (en) * 2016-03-16 2016-06-08 武汉大学 Diterpene compound variediene synthesizing gene Au13192 and application thereof

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108239630A (en) * 2016-12-27 2018-07-03 武汉臻智生物科技有限公司 A kind of method that Terpene synthase is transformed
CN108239631A (en) * 2016-12-27 2018-07-03 武汉臻智生物科技有限公司 A kind of Terpene synthase and application thereof
CN110093329A (en) * 2018-01-30 2019-08-06 深圳艾格鑫科技有限公司 Terpene synthase and its application

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Hacquard,S.等.KZL82634.1,GeneBank.《NCBI GeneBank》.2016,第1-2页. *
Identification of novel sesterterpene/triterpene synthase from Bacillus clausii;Tsutomu Sato等;《ChemBioChem》;20130403;第14卷(第7期);摘要、第832页右栏、表2 *
KZL82634.1,GeneBank;Hacquard,S.等;《NCBI GeneBank》;20160419;第1-2页 *
萜类化合物生物合成元件的挖掘与产物的高效合成;卞光凯;《中国优秀博士学位论文全文数据库(电子期刊网)》;20200615(第06期);第23页、第71页 *

Also Published As

Publication number Publication date
CN113480660A (en) 2021-10-08

Similar Documents

Publication Publication Date Title
CN113480660B (en) Chimeric terpene synthase and application thereof
KR102628801B1 (en) Protective DNA templates and methods of use for intracellular genetic modification and increased homologous recombination
CN101437953B (en) Diacylglycerol acyltransferases for alteration of polyunsaturated fatty acids and oil content in oleaginous organisms
KR101992494B1 (en) Herbicide tolerant soybean plants and methods for identifying same
CN107988246A (en) A kind of gene knockout carrier and its zebra fish Glioma Model
CN109735479A (en) A kind of recombined bacillus subtilis synthesizing 2&#39;-Fucosyl lactose and its construction method and application
CN107881160A (en) There are recombinant antibodies of unique sugar spectrum and preparation method thereof caused by a kind of CHO host cells edited as genome
CN1926238A (en) High efficiency peptide production in plant cells
KR101992345B1 (en) Promoter polynucleotide, signal polypeptide and use thereof
WO1992013957A1 (en) Stamen-specific promoters from corn
CN111836825A (en) Optimized plant CRISPR/CPF1 system
CN112746083B (en) Method for editing target gene promoter inactivated gene through single base
US6603062B1 (en) Method for producing transgenic plants with modified 5-aminolevulinic acid biosynthesis, method for identifying 5-aminolevulinic acid synthesis effectors
KR102201720B1 (en) Recombinant Corynebacterium Producing 1,3-PDO and Inhibited 3-HP Production and Method for Preparing 1,3-PDO Using Thereof
CN114807107A (en) Chimeric triterpene synthase and triterpene compounds
CN112322512A (en) Method for synthesizing S-adenosylmethionine by modifying saccharomyces cerevisiae through DL-methionine based on CRISPR technology
AU2010221132B2 (en) Stable, fertile, high polyhydroxyalkanoate producing plants and methods of producing them
TAN et al. Molecular analysis of the genome of an attenuated strain of cucumber green mottle mosaic virus
CN108277232A (en) A kind of Se-enriched yeast and preparation method thereof of ease constipation function
CN113621640A (en) Method for constructing glutamic acid-cysteine dipeptide producing bacteria
CN111100886B (en) Biosynthesis method of N-methyl pyrroline
KR102176556B1 (en) Strain with increased squalene production and method for producing squalene using the same
KR102009266B1 (en) Recombinant foot-and-mouth disease virus expressing protective antigen of type SAT2 ZIM
CN111378677A (en) DNA assembling method and application thereof
KR101901990B1 (en) Vector replicable both in E.coli and lactic acid bacteria cell, cell including the vector, and method of using the cell

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant