KR20010041062A - Method for Producing Biotin - Google Patents
Method for Producing Biotin Download PDFInfo
- Publication number
- KR20010041062A KR20010041062A KR1020007009100A KR20007009100A KR20010041062A KR 20010041062 A KR20010041062 A KR 20010041062A KR 1020007009100 A KR1020007009100 A KR 1020007009100A KR 20007009100 A KR20007009100 A KR 20007009100A KR 20010041062 A KR20010041062 A KR 20010041062A
- Authority
- KR
- South Korea
- Prior art keywords
- ala
- leu
- gly
- val
- glu
- Prior art date
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1085—Transferases (2.) transferring alkyl or aryl groups other than methyl groups (2.5)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/13—Transferases (2.) transferring sulfur containing groups (2.8)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/18—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms containing at least two hetero rings condensed among themselves or condensed with a common carbocyclic ring system, e.g. rifamycin
- C12P17/185—Heterocyclic compounds containing sulfur atoms as ring hetero atoms in the condensed system
- C12P17/186—Heterocyclic compounds containing sulfur atoms as ring hetero atoms in the condensed system containing a 2-oxo-thieno[3,4-d]imidazol nucleus, e.g. Biotin
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
본 발명은 서열 1을 갖는 S-아세노실메티오닌 합성효소 유전자, 및 서열 3, 서열 5 또는 서열 7을 갖는 비오틴 생합성 유전자인 bioS1, bioS2 및(또는) bioS3, 및 가능하게는 bioA, bioB, bioF, bioC, bioD, bioH, bioP, bioW, bioX, bioY 또는 bioR 군에서 선택된 하나 이상의 추가의 비오틴 합성 유전자 서열을 함유하는 유전자 재조합 생성물에 관한 것이다. 본 발명은 이 유전자 재조합 생성물을 함유하는 유기체 및 비오틴을 생성하기 위한 그의 용도, 및 비오틴을 생성하는 방법에 관한 것이다.The present invention relates to an S-acenosylmethionine synthetase gene having SEQ ID NO: 1, and a biotin biosynthetic gene having SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7 bioS1, bioS2 and / or bioS3, and possibly bioA, bioB, bioF, A recombinant product containing one or more additional biotin synthetic gene sequences selected from the group bioC, bioD, bioH, bioP, bioW, bioX, bioY or bioR. The present invention relates to an organism containing this recombinant product and its use for producing biotin, and a method for producing biotin.
Description
보효소로서, 비오틴(비타민 H)은 효소-촉매작용 카르복실화 및 탈카르복실화 반응에서 필수적인 역할을 한다. 따라서 비오틴은 살아있는 세포에서 필수적인 인자이다. 거의 모든 동물 및 일부 미생물은 그들 자신이 비오틴을 합성할 수 없기 때문에 외부로부터 비오틴을 섭취하여야 한다. 그러므로, 비오틴은 이들 유기체에 있어서 필수적인 비타민이다. 반대로, 세균, 효모 및 식물은 그들 자신이 전구체로부터 비오틴을 합성할 수 있다(브라운(Brown) 등의 문헌[Biotechnol. Genet. Eng. Rev. 9, 1991: 295-326], 드몰(DeMoll, E.)의 문헌[Escherichia coli and Salmonella, eds. Neidhardt, F. C. et al. ASM Press, Washington DC, USA, 1996: 704-708, ISBN 1-55581-084-5]).As coenzymes, biotin (vitamin H) plays an essential role in enzyme-catalyzed carboxylation and decarboxylation reactions. Biotin is therefore an essential factor in living cells. Almost all animals and some microorganisms must take biotin from the outside because they cannot synthesize biotin themselves. Therefore, biotin is an essential vitamin for these organisms. In contrast, bacteria, yeasts and plants themselves can synthesize biotin from precursors (Brown et al. Biotechnol. Genet. Eng. Rev. 9, 1991: 295-326, DeMoll, E (Escherichia coli and Salmonella, eds. Neidhardt, FC et al. ASM Press, Washington DC, USA, 1996: 704-708, ISBN 1-55581-084-5).
비오틴의 합성은 세균 유기체, 특히 그람-양성 세균인 에세리히아 콜리(Escherichia coli) 및 그람-양성 세균인 바실루스 스파에리쿠스(Bacillus sphaericus)에서 조사되어 왔다(브라운 등의 문헌[Biotechnol. Genet. Eng. Rev. 9, 1991: 295-326]). 지방산 합성으로부터 유도된 피멜릴-CoA(PmCoA)는 이미 이 콜리(E. coli)에서 최초로 공지된 중간체로서 여겨져 왔다(드몰의 문헌[Escherichia coli and Salmonella, eds. Neidhardt, F. C. et al. ASM Press, Washington DC, USA, 1996: 704-708, ISBN 1-55581-084-5 1996]). 현재까지, 이 비오틴 전구체가 이 콜리에서 합성되는 경로는 거의 알려져 있지 않다(레모인(Lemoine) 등의 문헌[Mol. Microbiol. 19, 1996: 645-647]). bioC 및 bioH는 상응하는 단백질이 Pm-CoA의 합성을 담당하는 두 유전자인 것으로 확인되었다. 유전자 생성물, 즉 BioH 및 BioC의 효소 기능은 지금까지 알려져 있지 않다(레모인 등의 문헌[Mol. Microbiol. 19, 1996: 645-647], 드몰의 문헌[Escherichia coli and Salmonella, eds. Neidhardt, F. C. et al. ASM Press, Washington DC, USA, 1996: 704-708, ISBN 1-55581-084-5]). Pm-CoA는 4개의 추가의 효소 단계에서 비오틴으로 전환된다. BioF는 우선 Pm-CoA를 알라닌과 축합하여 7-케토-8-아미노펠라르곤산(KAPA)을 형성한다. 그 다음, KAPA는 BioA에 의해 7,8-디아미노펠라르곤산(DAPA)으로 전환된다. ATP-의존성 카르복실화 반응 후, 다음 단계에 의해 디티오비오틴(DTB)으로 되며 이 단계는 BioD에 의해 촉진된다. DTB는 마지막 단계에서 비오틴으로 전환된다. 이 단계는 BioB에 의해 촉진된다. DTB의 비오틴으로의 전환에 관련된 화학적 및 효소적 메카니즘은 지금까지 불완전하게만 이해되었고 해명되었다.The synthesis of biotin has been investigated in bacterial organisms, especially Gram-positive bacteria Escherichia coli and Gram-positive bacteria Bacillus sphaericus (Brown et al. Biotechnol. Genet. Eng. Rev. 9, 1991: 295-326]. Pimelyl-CoA (PmCoA) derived from fatty acid synthesis has already been considered as the first known intermediate in E. coli (Escherichia coli and Salmonella, eds. Neidhardt, FC et al. ASM Press, Washington DC, USA, 1996: 704-708, ISBN 1-55581-084-5 1996]. To date, little is known about how this biotin precursor is synthesized in this collie (Lemoine et al., Mol. Microbiol. 19, 1996: 645-647). bioC and bioH have been identified that the corresponding proteins are two genes responsible for the synthesis of Pm-CoA. The enzymatic function of the gene product, namely BioH and BioC, is not known so far (Remoin et al., Mol. Microbiol. 19, 1996: 645-647, Dsmol et al., Esccherichia coli and Salmonella, eds. Neidhardt, FC et al. ASM Press, Washington DC, USA, 1996: 704-708, ISBN 1-55581-084-5]. Pm-CoA is converted to biotin in four additional enzymatic steps. BioF first condenses Pm-CoA with alanine to form 7-keto-8-aminofellargonic acid (KAPA). KAPA is then converted to 7,8-diaminofellargonic acid (DAPA) by BioA. After the ATP-dependent carboxylation reaction, dithiobiotin (DTB) is obtained by the next step, which is promoted by BioD. DTB is converted to biotin in the last step. This step is facilitated by BioB. The chemical and enzymatic mechanisms involved in the conversion of DTB to biotin have so far been incompletely understood and elucidated.
DTB의 비오틴으로의 전환은 지금까지 세균 및 식물 세포 추출물에서 특징화되었을 뿐이었다(WO94/8023호, EP-B-0 449 724호, 사냘(Sanyal) 등의 문헌[Arch. Biochem. Biophys., Vol. 326, No. 1, 1996: 48-56] 및 문헌[Biochemistry 33, 1994: 3625-3631], 발데트(Baldet) 등의 문헌[Europ. J. Biochem. 217, 1, 1993: 479-485], 메장(Mejean) 등의 문헌[Biochem. Biophys. Res. Commun., Vol. 217, No. 3, 1995: 1231-1237], 오시로(Ohshiro) 등의 문헌[Biosci. Biotechnol. Biochem., 58, 9, 1994: 1738-1741]).The conversion of DTB to biotin has only been so far characterized in bacterial and plant cell extracts (WO 94/8023, EP-B-0 449 724, Sanal et al., Arch. Biochem. Biophys., Vol. 326, No. 1, 1996: 48-56 and Biochemistry 33, 1994: 3625-3631, Bald et al., Europ. J. Biochem. 217, 1, 1993: 479-485. , Mejean et al., Biochem. Biophys. Res. Commun., Vol. 217, No. 3, 1995: 1231-1237, Ohshiro et al. Biosci. Biotechnol. Biochem., 58, 9, 1994: 1738-1741].
시험관내 연구 결과, NADPH, 시스테인, 티아민, Fe2+, 아스파라긴, 세린, 프럭토즈 1,6-비스포스페이트 및 S-아데노실메티오닌과 같은 저분자량의 인자는 비오틴의 합성을 모의할 수 있음이 밝혀졌다(오시로 등의 문헌[Biosci. Biotechnol. Biochem., 58, 9, 1994: 1738-1741], 버치(Birch) 등의 문헌[J. Biol. Chem. 270, 32, 1995: 19158-19165], 이푸쿠(Ifuku) 등의 문헌[Biosci. Biotechnol. Biochem., 59, 2, 1995; 185-189], 사냘 등의 문헌[Arch. Biochem. Biophys., 326, 1, 1996: 48-56]).In vitro studies have shown that low molecular weight factors such as NADPH, cysteine, thiamine, Fe 2+ , asparagine, serine, fructose 1,6-bisphosphate, and S-adenosylmethionine can simulate the synthesis of biotin (Osiro et al. Biosci. Biotechnol. Biochem., 58, 9, 1994: 1738-1741, Birch et al., J. Biol. Chem. 270, 32, 1995: 19158-19165). , Ifuku et al., Biosci. Biotechnol. Biochem., 59, 2, 1995; 185-189, Sarsham et al. Arch. Biochem. Biophys., 326, 1, 1996: 48-56; ).
상기 저분자량의 인자들 외에, BioB의 존재하에 DTB로부터의 비오틴의 합성을 모의하는 다른 단백질이 확인되었다. 이들 단백질은 플라보독신 및 플라보독신 NADPH 환원효소이다(버치 등의 문헌[J. Biol. Chem. 270, 32, 1995: 19158-19165], 이푸쿠 등의 문헌[Biosci. Biotechnol. Biochem., 59, 2, 1995; 185-189], 사냘 등의 문헌[Arch. Biochem. Biophys., 326, 1, 1996: 48-56]). 비오틴 합성을 모의하는 그밖의 단백질은 유전자 bioS1 및 bioS2이며, 이들은 독일 특허출원 제197.31274.8호(우선권 22.7.97)에 기술되어 있다.In addition to these low molecular weight factors, other proteins have been identified that simulate the synthesis of biotin from DTB in the presence of BioB. These proteins are flavodoxin and flavodoxin NADPH reductase (Burch et al., J. Biol. Chem. 270, 32, 1995: 19158-19165, Ipuku et al. Biosci. Biotechnol. Biochem., 59, 2, 1995; 185-189, Sarsham et al., Arch. Biochem. Biophys., 326, 1, 1996: 48-56). Other proteins that simulate biotin synthesis are the genes bioS1 and bioS2, which are described in German patent application No. 197.31274.8 (Priority 22.7.97).
비오틴 분자내의 황의 유래에 관하여 상이한 결과가 얻어졌다. 완전 세포 추출물에서의 비오틴 합성에 대한 조사 결과,35S-표지된 시스테인의 존재하에 비오틴에 방사능이 도입되었음이 나타났고,35S-표지된 메티오닌 또는 S-아데노실메티오닌이 사용될 때는 비오틴 분자내로의 황의 도입을 증명하기가 불가능하였다(이푸쿠 등의 문헌[Biosci. Biotechnol. Biochem., 59, 2, 1995; 184-189], 버치 등의 문헌[J. Biol. Chem. 270, 32, 1995: 19158-19165]).Different results have been obtained regarding the derivation of sulfur in the biotin molecule. Investigations into biotin synthesis in whole cell extracts showed that radioactivity was introduced into biotin in the presence of 35 S-labeled cysteine and into the biotin molecule when 35 S-labeled methionine or S-adenosylmethionine was used. It was not possible to prove the introduction of sulfur (Ipuku et al., Biosci. Biotechnol. Biochem., 59, 2, 1995; 184-189), Birch et al., J. Biol. Chem. 270, 32, 1995: 19158-19165].
기술된 단백질, 즉 bioF, bioA, bioD 및 bioB를 암호화하는 유전자는 이 콜리의 양방향 오페론에서 암호화된다. 이 오페론은 λ결합 부위와 이 콜리 염색체에서 약 17미니트(minute)에 있는 uvrB 유전자 좌위 사이에 위치한다(벌린(Berlyn) 등의 문헌[1996: 1715-1902]). 추가의 두 유전자(이중 하나, 즉 bioC는 이미 Pm-CoA의 합성에서 기술된 기능을 가짐)가 이 오페론에서 추가로 암호화되지만, 지금까지 bioA의 하류에 위치하는 개방 판독틀에 어떠한 기능도 부여할 수 없었다(WO94/8023호, 오쓰카(Otsuka) 등의 문헌[J. Biol. Chem. 263, 1988: 19577-85]). 이 콜리 단백질 BioF, BioA, BioD 및 BioB에 대해 매우 보존된 상동물은 비 스파에리쿠스(B. sphaericus), 비 서브틸리스(B. subtilis), 시네코시스티스 종(Syneccocystis sp.)(브라운 등의 문헌[Biotechnol. Genet. Eng. Rev. 9, 1991: 295-326], 바우어(Bower) 등의 문헌[J. Bacteriol. 175, 1996: 4122-4130], 가네코(Kaneko) 등의 문헌[DNA Res. 3, 3, 1996: 109-136]), 메타노코쿠스 야나시(Methanococcus janaschi)와 같은 원시세균 및 사카로미세스 세레비지아에(Saccharomyces cerevisiae)와 같은 효모(쟝(Zhang) 등의 문헌[Arch. Boichem. Biophys. 309, 1, 1994: 29-35])에서 또는 아라비돕시스 탈리아나(Arabidopsis thaliana)(발데트 등의 문헌[C. R. Acad. Sci. III, Sci. Vie. 319, 2, 1996: 99-106])에서 발견되었다.The genes encoding the described proteins, namely bioF, bioA, bioD and bioB, are encoded in this collie's bidirectional operon. This operon is located between the λ binding site and the uvrB locus at about 17 minutes on this Coli chromosome (Berlyn et al. (1996: 1715-1902)). Two additional genes (one of which bioC already has the function described in the synthesis of Pm-CoA) are additionally encoded in this operon, but so far will not impart any function to the open reading frame downstream of bioA. (WO 94/8023, Otsuka et al., J. Biol. Chem. 263, 1988: 19577-85). Highly conserved moorings for this collie protein BioF, BioA, BioD and BioB include B. sphaericus, B. subtilis, Syneccocystis sp. (Brown et al. Biotechnol. Genet. Eng. Rev. 9, 1991: 295-326, Bower et al., J. Bacteriol. 175, 1996: 4122-4130, Kaneko et al., DNA Res. 3, 3, 1996: 109-136), primitive bacteria such as Methanococcus janaschi and yeast such as Saccharomyces cerevisiae; Arch. Boichem. Biophys. 309, 1, 1994: 29-35) or Arabidopsis thaliana (Valette et al., CR Acad. Sci. III, Sci. Vie. 319, 2, 1996). : 99-106).
지금까지 조사된 두 그람-양성 미생물에서, Pm-CoA의 합성은 이 콜리에서와는 다른 방식으로 진행되는 듯하다. bioH 및 bioC의 어떠한 상동물도 찾을 수 없었다(브라운 등의 문헌[Biotechnol. Genet. Eng. Rev. 9. 1991: 295-326]).In the two Gram-positive microorganisms examined so far, the synthesis of Pm-CoA seems to proceed in a different way than in this collie. No mocha of bioH and bioC was found (Brown et al. Biotechnol. Genet. Eng. Rev. 9. 1991: 295-326).
비오틴은 3개의 키랄성 중심을 갖는 광학 활성 물질이다. 비오틴은 지금까지 값비싼 다단계 화학 합성에 의해 경제적으로 제조되어 왔을 뿐이었다.Biotin is an optically active substance with three chiral centers. Biotin has only been produced economically by expensive multi-step chemical synthesis.
이러한 화학 합성의 대안으로서, 미생물을 사용하여 비오틴을 제조하기 위한 발효 공정을 구성하려는 많은 시도가 이루어졌다. 비오틴 오페론을 다수카피성(multi-copy) 플라스미드에 클로닝하는 방법이 성공적으로 사용되어 이러한 유전자로 형질전환된 미생물에서 비오틴 합성을 증대시켰다. 비오틴 합성에서의 증가는 또한 birA 돌연변이체를 선택하여 비오틴 유전자 발현을 조절 해제함으로써 이루어졌다(파이(Pai C. H.)의 문헌[J. Bacteriol. 112, 1972: 1280-1287]). 두 시도를 조합하면, 즉 조절-결함 균주(EP-B-0 236 429호)에서 플라스미드-암호화된 생합성을 발현하면, 생산성이 더욱 더 증가되었다. 이와 관련하여, 비오틴 오페론은 그의 천연의 양방향 프로모터의 통제하에 남아 있거나(EP-B-0 235 429호) 또는 그의 유전자는 외부에서 조절할 수 있는 프로모터의 통제하에 있을 수 있다(WO94/08023호).As an alternative to such chemical synthesis, many attempts have been made to construct fermentation processes for producing biotin using microorganisms. Cloning of biotin operons into multi-copy plasmids has been used successfully to enhance biotin synthesis in microorganisms transformed with these genes. The increase in biotin synthesis was also made by deselecting birA mutants to deregulate biotin gene expression (Pai C. H., J. Bacteriol. 112, 1972: 1280-1287). Combining the two trials, ie, expressing plasmid-encoded biosynthesis in a regulatory-defective strain (EP-B-0 236 429), increased productivity even further. In this regard, the biotin operon may remain under the control of its natural bidirectional promoter (EP-B-0 235 429) or its gene may be under the control of an externally regulated promoter (WO94 / 08023).
지금까지 이 콜리에서 비오틴을 발효에 의해 생성하기 위하여 수행되었던 시도는 경제적으로 적당한 생산성을 얻지 못했다.The attempts made so far to produce biotin by fermentation in this collie have not achieved economically adequate productivity.
본 발명은 서열 1을 갖는 S-아데노실메티오닌 합성효소 유전자 및 각각 서열 3, 서열 5 및 서열 7을 갖는 비오틴 생합성 유전자 bioS1, bioS2 및(또는) bioS3, 및 경우에 따라, bioA, bioB, bioF, bioC, bioD, bioH, bioP, bioW, bioX, bioY 및 bioR 군에서 선택된 하나 이상의 추가의 비오틴 합성 유전자 서열을 함유하는 유전자 구조물에 관한 것이다. 본 발명은 또한 이 유전자 구조물을 함유하는 유기체 및 비오틴을 제조하기 위한 유전자 구조물의 용도, 및 비오틴의 제조방법에 관한 것이다.The present invention provides an S-adenosylmethionine synthase gene having SEQ ID NO: 1 and a biotin biosynthetic gene bioS1, bioS2 and / or bioS3 having SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7, and optionally bioA, bioB, bioF, A genetic construct containing one or more additional biotin synthetic gene sequences selected from the group bioC, bioD, bioH, bioP, bioW, bioX, bioY and bioR. The present invention also relates to the use of the gene constructs for producing organisms and biotin containing the gene constructs, and to methods of producing biotin.
도 1은 벡터 pHS1에 SAM 합성효소를 암호화하는 metK 유전자를 클로닝하여 얻은 pHS1 metK 유전자 구조물을 나타낸다.Figure 1 shows the pHS1 metK gene construct obtained by cloning the metK gene encoding SAM synthase in vector pHS1.
도 2는 플라스미드 pHS1 bioS1에 metK 유전자를 클로닝하여 얻은 pHS1 metK bioS1 유전자 구조물을 나타낸다.Figure 2 shows the pHS1 metK bioS1 gene construct obtained by cloning the metK gene in plasmid pHS1 bioS1.
본 발명의 목적은 가능한 많이 비오틴을 합성하는, 비오틴을 제조하기 위한 공업적 발효 공정을 개발하는 것이다.It is an object of the present invention to develop an industrial fermentation process for producing biotin, which synthesizes biotin as much as possible.
본 발명자들은 이 목적이 서열 1을 갖는 S-아데노실메티오닌 합성효소(SAM 합성효소) 유전자 및 서열 3, 서열 5 및 서열 7을 갖는 하나 이상의 추가의 비오틴 생합성 유전자 bioS1, bioS2 또는 bioS3, 및 이들의 기능적 변이체, 유사체 또는 유도체를 비오틴을 합성할 수 있는 원핵 또는 진핵 숙주 유기체에서 발현시키고, 이 유기체를 배양하고, 균체를 분리 제거하거나 또는 비오틴을 정제한 후 합성된 비오틴을 바로 사용하는, 비오틴을 생성하기 위한 본 발명에 따른 방법에 의해 이루어짐을 발견하였다.The inventors have found that for this purpose the S-adenosylmethionine synthase (SAM synthetase) gene having SEQ ID NO: 1 and one or more additional biotin biosynthetic genes bioS1, bioS2 or bioS3 having SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7, and their Functional variants, analogs, or derivatives are expressed in prokaryotic or eukaryotic host organisms capable of synthesizing biotin, culturing the organism, isolating cells or purifying biotin, and immediately using the synthesized biotin to produce biotin. It has been found to be made by the method according to the invention for
본 발명에 따른 방법에 사용된 유전자, 즉 서열 1을 갖는 SAM 합성효소 유전자 및 서열 3, 서열 5 및 서열 7을 갖는 비오틴 생합성 유전자 bioS1, bioS2 및 bioS3은 스위스프로트-데이타베이스(SwissProt-database)에 수탁번호 P04384(metK), U29581(bioS1), P39171(bioS2) 및 D90811(bioS3)로 보관되어 있다. 이 콜리 MetK의 다수의 상동물이 데이타베이스에 기재되어 있다. 이들 상동물은 기타 유박테리아(eubacteia)(예컨대, 에이치 인플루엔자에(H. influenzae) 및 비 서브틸리스) 및 진핵생물(예컨대, 효모: 에스 세레비지아에, 식물: 피 델토이데스(P. deltoides), 절지동물: 디 멜라노가스테르(D. melanogaster) 및 포유류: 알 노르베기쿠스(R. norvegicus))과 같은 유기체를 포함한다.The genes used in the method according to the invention, namely the SAM synthetase gene having SEQ ID NO: 1 and the biotin biosynthetic genes bioS1, bioS2 and bioS3 having SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7, were added to the SwissProt-database. Accession No. P04384 (metK), U29581 (bioS1), P39171 (bioS2) and D90811 (bioS3). Many moieties of this Coli MetK are described in the database. These moters include other eubacteia (e.g. H. influenzae and non-subtilis) and eukaryotes (e.g. yeast: E. cerevisiae, plants: P. deltoides ), Arthropods: D. melanogaster and mammals: R. norvegicus).
비오틴 생합성의 생산성은 서열 1을 갖는 하나 이상의 SAM 합성효소 유전자 및 그의 기능적 변이체, 유사체 또는 유도체를 서열 3, 서열 5 및 서열 7을 갖는 하나 이상의 비오틴 합성 유전자 bioS1, bioS2 또는 bioS3, 및 이들의 기능 변이체, 유사체 또는 유도체와 함께 원핵 또는 진핵 숙주 유기체에서 발현시킴으로써 현저하게 증가시킬 수 있다. 바람직하게는 SAM 합성효소 유전자와 bioS1의 조합이 발현에 사용된다. 비오틴 합성을 더욱 더 증가시키기 위하여, bioA, bioB, bioF, bioC, bioD, bioH, bioP, bioW, bioX, bioY 및 bioR 군에서 선택된 하나 이상의 추가의 비오틴 유전자를 유리하게는 동시에 발현시킨다. 이들 유전자의 발현은 이들 유전자가 없는 대조군에 비하여 2배 이상, 바람직하게는 3배보다 많이 비오틴 합성을 증대시킨다.The productivity of biotin biosynthesis may include one or more SAM synthetase genes having SEQ ID NO: 1 and functional variants, analogs or derivatives thereof, comprising one or more biotin synthesis genes bioS1, bioS2 or bioS3 having SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7, and functional variants thereof. , By expression in prokaryotic or eukaryotic host organisms together with analogs or derivatives. Preferably a combination of SAM synthase gene and bioS1 is used for expression. To further increase biotin synthesis, one or more additional biotin genes selected from the group bioA, bioB, bioF, bioC, bioD, bioH, bioP, bioW, bioX, bioY and bioR are advantageously expressed simultaneously. Expression of these genes enhances biotin synthesis more than two times, preferably more than three times, compared to the control without these genes.
본 발명에 따른 방법에 사용된 유전자, 즉 뉴클레오티드 서열 1을 갖는 SAM 합성효소 유전자, 뉴클레오티드 서열 3을 갖는 bioS1 유전자, 뉴클레오티드 서열 5를 갖는 bioS2 유전자 및 뉴클레오티드 서열 7을 갖는 bioS3 유전자(이들 서열은 각각 서열 2, 서열 4, 서열 6 및 서열 8에 제공된 아미노산 서열을 암호화함), 또는 이들의 대립유전자 변이체를 하기의 단리 및 서열결정 후에 얻을 수 있다. 변이체는 각각 아미노산 수준에서 30 내지 100% 상동성, 바람직하게는 50 내지 100% 상동성, 매우 특히 바람직하게는 80 내지 100% 상동성을 나타내는 서열 1, 서열 3, 서열 5 및 서열 7인 것으로 이해되어야 한다. 대립유전자 변이체는 특히 서열 1, 서열 3, 서열 5 및 서열 7에 기술된 서열로부터 뉴클레오티드의 결실, 삽입 또는 치환에 의해 얻어질 수 있는 기능 변이체(그러나, 효소 활성은 유지됨)를 포함한다.Genes used in the method according to the invention, ie SAM synthase gene having nucleotide sequence 1, bioS1 gene having nucleotide sequence 3, bioS2 gene having nucleotide sequence 5 and bioS3 gene having nucleotide sequence 7 (these sequences are respectively 2, SEQ ID NO: 4, SEQ ID NO: 6 and encoding the amino acid sequences provided in SEQ ID NO: 8), or allelic variants thereof can be obtained after isolation and sequencing below. It is understood that the variants are SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7, which each exhibit 30 to 100% homology, preferably 50 to 100% homology, very particularly preferably 80 to 100% homology at the amino acid level Should be. Allelic variants include, in particular, functional variants obtainable by deletion, insertion or substitution of nucleotides from the sequences set forth in SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7, however, enzymatic activity is maintained.
또한, 변이체는 O-아세틸세린 술포히드롤라제 A, O-아세틸세린 술포히드롤라제 B, β-시스타티오나제(플린트(Flint) 등의 문헌[J. Biol. Chem., Vol. 271, 1996: 16053-16067] 참조)와 같은 유전자의 기능적 등가물 또는 비오틴의 합성에서 bioS1, bioS2 또는 bioS3의 효소 활성을 나타낼 수 있는, 예를 들어 클렙시엘라(Klebsiella), 칸디다(Candida), 효모 또는 카에노르합디티스(Caenorhabditis)로부터의 nifS 및 그의 원핵 및 진핵 상동물인 것으로 이해되어야 한다.In addition, the variants include O-acetylserine sulfohydrolase A, O-acetylserine sulfohydrolase B, β-cystionase (Flint et al., J. Biol. Chem., Vol. 271, 1996: 16053-16067), which may exhibit the enzymatic activity of bioS1, bioS2 or bioS3 in the synthesis of biotin or functional equivalents of biotin, for example Klebsiella, Candida, yeast or carcass It is to be understood that nifS from Caenorhabditis and its prokaryotic and eukaryotic murines.
서열 1, 서열 3, 서열 5 및 서열 7의 기능적 유사체는, 예를 들어 세균, 진균, 식물, 동물 또는 인간 상동물과 같은 이들의 원핵 또는 진핵 상동물인 것으로 이해되어야 한다. 또한, 유사체는 절단된 서열이나 또는 암호화 및 비암호화 DNA 서열로부터의 1본쇄 DNA 또는 RNA인 것으로 이해되어야 한다.Functional analogs of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7 are to be understood to be their prokaryotic or eukaryotic moieties, such as, for example, bacteria, fungi, plants, animals or human moron. It is also to be understood that the analog is single stranded DNA or RNA from a truncated sequence or from encoded and unencoded DNA sequences.
유도체는, 예를 들어 프로모터 변이체인 것으로 이해되어야 한다. 주어진 뉴클레오티드 서열의 상류에 위치하는 프로모터는 프로모터의 작용성 또는 활성이 손상됨이 없이 하나 이상의 뉴클레오티드 치환에 의해 또는 삽입(들) 및(또는) 결실(들)에 의해 변할 수 있다. 또한, 프로모터의 활성은 이들의 서열을 변화시킴으로써 증가시킬 수 있거나, 프로모터는 상이한 종의 유기체의 프로모터를 포함하여 더 활성있는 프로모터에 의해 완전히 대체될 수 있다.Derivatives are to be understood, for example, as promoter variants. A promoter located upstream of a given nucleotide sequence may be altered by one or more nucleotide substitutions or by insertion (s) and / or deletion (s) without impairing the functionality or activity of the promoter. In addition, the activity of promoters can be increased by changing their sequences, or promoters can be completely replaced by more active promoters, including those of different species of organisms.
유도체는 또한 유전자의 발현 및(또는) 단백질의 발현이 증가되도록 그의 뉴클레오티드 서열이 개시 코돈의 -1 내지 -30 상류의 영역에서 변화된 변이체로서 이해되어야 한다.Derivatives are also to be understood as variants whose nucleotide sequence has been changed in the region of -1 to -30 upstream of the initiation codon so that expression of the gene and / or protein is increased.
비오틴을 합성하는 모든 그람-음성 또는 그람-양성 세균은 주로 본 발명에 따른 방법에서 원핵 숙주 유기체로서 사용하기에 적합하다. 예로서 언급될 수 있는 그람-음성 세균은 엔테로박테리아세아에(Enterobacteriaceae)(예: 에세리히아, 아에로박터(Aerobacter), 엔테로박터(Enterobacter), 시트로박터(Citrobacter), 시겔라(Shigella), 클렙시엘라, 세라티아(Serratia), 에르위니아(Erwinia) 또는 살모넬라(Salmonella) 속), 슈도모나다세아에(Pseudomonadaceae)(예: 슈도모나스(Pseudomonas), 크산토모나스(Xanthomonas), 부르크홀데리아(Burkholderia), 글루코노박터(Gluconobacter), 니트로소모나스(Nitrosomonas), 니트로박터(Nitrobacter), 메타노모나스(Methanomonas), 코마모나스(Comamonas), 셀룰로모나스(Cellulomonas) 또는 아세토박터(Acetobacter) 속), 아조토박테라세아에(Azotobacteraceae)(예: 아조토박터(Azotobacter), 아조모나스(Azomonas), 베이예린키아(Beijerinckia) 또는 데륵시아(Derxia) 속), 네이세리아세아에(Neisseriaceae)(예: 모락셀라(Moraxella), 아시네토박터(Acinetobacter), 킨겔라(Kingella), 네이세리아(Neisseria) 또는 브란하멜라(Branhamella) 속), 리조비아세아에(Rhizobiaceae)(예: 리조비움(Rhizobium) 또는 아그로박테리움(Agrobacterium) 속), 또는 그람-음성 속인 지모모나스(Zymomonas), 크로모박테리움(Chromobacterium) 또는 플라보박테리움(Flavobacterium) 속이다. 예로서 언급될 수 있는 그람-양성 세균은 내생포자-형성 그람-양성 통기성 또는 혐기성 세균(예: 바실루스, 스포로락토바실루스(Sporolactobacillus) 또는 클로스트리디움(Clostridium) 속), 코리네형 세균(예: 아르트로박터(Arthrobacter), 셀룰로모나스, 쿠르토박테리움(Curtobacterium), 코리네박테리움(Corynebacterium), 브레비박테리움(Brevibacterium), 미크로박테리움(Microbacterium), 쿠르티아(Kurthia) 속), 악티노미세탈레스(Actinomycetales)(예: 미코박테리움(Mycobacterium), 로도코쿠스(Rhodococcus), 스트렙토미세스(Streptomyces) 또는 노카르디아(Nocardia) 속), 락토바실라세아에(Lactobacillaceae)(예: 락토바실루스(Lactobacillus) 또는 락토코쿠스(Lactococcus) 속), 또는 그람-양성 구균(예: 미크로코쿠스(Micrococcus) 또는 스타필로코쿠스(Staphylococcus) 속)이다.All Gram-negative or Gram-positive bacteria that synthesize biotin are suitable for use primarily as prokaryotic host organisms in the method according to the invention. Gram-negative bacteria that may be mentioned by way of example include Enterobacteriaceae (e.g., Escherichia, Aerobacter, Enterobacter, Citrobacter, Shigella). ), Klebsiella, Serratia, Erwinia or Salmonella, Pseudomonadaceae (e.g. Pseudomonas, Xanthomonas, Burgunderia) (Burkholderia), Gluconobacter, Nitrosomonas, Nitrobacter, Methanomonas, Comamonas, Cellulomonas, or Acetobacter ), Azotobacteraceae (e.g., Azotobacter, Azomonas, Beijerinckia or Derxia), Neisseriaceae (e.g. : Moraxella, Acinetobacter obacter, Kingella, Neisseria or Branhamella genus, Rhizobiaceae (e.g., Rhizobium or Agrobacterium genus), or Gram-negative genus Zymomonas, Chromobacterium or Flavoacterium. Gram-positive bacteria that may be mentioned by way of example are endospores-forming Gram-positive breathable or anaerobic bacteria (e.g. Bacillus, Sporolactobacilli or Clostridium genus), coryneform bacteria (e.g. Arthrobacter, Cellulomonas, Curtobacterium, Corynebacterium, Brevibacterium, Microbacterium, Kurthia genus), Actinomycetales (e.g. Mycobacterium, Rhodococcus, Streptomyces or Nocardia genus), Lactobacillaceae (e.g. Lactobacillus or Lactococcus), or Gram-positive cocci (eg, Micrococcus or Staphylococcus).
본 발명에 따른 방법에 에세리히아, 시트로박터, 세라티아, 클렙시엘라, 살모넬라, 슈도모나스, 코마모나스, 아시네토박터, 아조토박터, 크로모박테리움, 바실루스, 클로스트리디움, 아르트로박터, 코리네박테리움, 브레비박테리움, 락토코쿠스, 락토바실루스, 스트렙토미세스, 리조비움, 아그로박테리움 또는 스타필로코쿠스 속의 세균을 사용하는 것이 바람직하다. 에세리히아 콜리, 시트로박터 프레운디(Citobacter freundii), 세라티아 마르세센스(Serratia marcescens), 살모넬라 타이피무리움(Salmonella typhimurium), 슈도모나스 멘도시나(Pseudomonas mendocina), 슈도모나스 아에루기노사(Pseudomonas aeruginosa), 슈도모나스 무타빌리스(Pseudomonas mutabilis), 슈도모나스 클로로라피스(Pseudomonas chlororaphis), 슈도모나스 플루오레센스(Pseudomonas fluorescens), 코마모나스 아시도보란스(Comamonas acidovorans), 코마모나스 테스토스테로니(Comamonas testosteroni), 아시네토박터 칼코아세티쿠스(Acinetobacter calcoaceticus), 아조토박터 비넬란디(Azotobacter vinelandii), 크로모박테리움 비올라세움(Chromobacterium violaceum), 바실루스 서브틸리스, 바실루스 스파에리쿠스, 바실루스 스테아로써모필루스(Bacillus stearothermophilus), 바실루스 푸밀루스(Bacillus pumilus), 바실루스 리헤니포르미스(Bacillus licheniformis), 바실루스 아밀로리퀘파시엔스(Bacillus amyloliquefaciens), 바실루스 메가테리움(Bacillus megaterium), 바실루스 세레우스(Bacillus cereus), 바실루스 투린기엔시스(Bacillus thuringiensis), 아르트로박터 시트레우스(Arthrobacter citreus), 아르트로박터 파라피네우스(Arthrobacter paraffineus), 코리네박테리움 글루타미쿰(Corynebacterium glutamicum), 코리네박테리움 프리모리옥시단스(Corenebacterium primorioxydans), 코리네박테리움 종, 브레비박테리움 케토글루타미쿰(Brevibacterium ketoglutamicum), 브레비박테리움 리넨스(Brevibacterium linens), 브레비박테리움 종, 스트렙토미세스 리비단스(Streptomyces lividans), 리조비움 레구미노사룸(Rhizobium leguminosarum) 또는 아그로박테리움 투메파시엔스(Agrobacterium tumefaciens)의 종과 속이 특히 바람직하다. 유리하게는, 비오틴의 증가된 천연 생성을 나타내는 세균을 사용한다.Methods according to the invention are Escherichia, Citrobacter, Serratia, Klebsiella, Salmonella, Pseudomonas, Comamonas, Acinetobacter, Azotobacter, Chromobacterium, Bacillus, Clostridium, Artrobacter Preference is given to using bacteria of the genus Corynebacterium, Brevibacterium, Lactococcus, Lactobacillus, Streptomyces, Rizovium, Agrobacterium or Staphylococcus. Esseria coli, Citobacter freundii, Serratia marcescens, Salmonella typhimurium, Pseudomonas mendocina, Pseudomonas aeruginosa aeruginosa, Pseudomonas mutabilis, Pseudomonas chlororaphis, Pseudomonas fluorescens, Comamonas acidovorans, Testamonas acidonines, testosterone, and testosterone. Abactobacter calcoaceticus, Azotobacter vinelandii, Chromobacterium violaceum, Bacillus subtilis, Bacillus spaerycus, Bacillus stearerophilus stemophilus Bacillus ), Bacillus pumilus, Bacillus rihenniformi (Bacillus licheniformis), Bacillus amyloliquefaciens, Bacillus megaterium, Bacillus cereus, Bacillus thuringiensis, Arthrobacter citruus citreus, Arthrobacter paraffineus, Corynebacterium glutamicum, Corynebacterium primorioxydans, Corynebacterium species, Brevibacterium ketoglulu Brevibacterium ketoglutamicum, Brevibacterium linens, Brevibacterium spp., Streptomyces lividans, Rizobium leguminosarum or Agrobacterium tumefacienium Particularly preferred are species and genus of Agrobacterium tumefaciens. Advantageously, bacteria are used that exhibit increased natural production of biotin.
기재된 속의 분류학적 위치는 최근 상당히 변화되었고, 여전히 잘못된 종속명이 고쳐지는 변화 상태에 있다. 과거에 자주 요구되었던 세균 분류학내의 상기 속의 이러한 분류학적 재편성 때문에, 상기 언급된 것이 아닌 과, 속 및 종도 또한 본 발명의 방법에 적합하다.The taxonomic position of the genus described has changed considerably in recent years and is still in a changing state where the wrong dependency name is fixed. Because of this taxonomic reorganization of the genus in bacterial taxonomy which has often been required in the past, families, genus and species not mentioned above are also suitable for the methods of the present invention.
진균, 효모, 식물 또는 식물 세포와 같은 모든 비오틴-합성 유기체는 주로 본 발명에 따른 방법에서 진핵 숙주 유기체로서 사용하기에 적합하다. 바람직하게 언급될 수 있는 효모는 로도토룰라(Rhodotorula), 야로위아(Yarrowia), 스포로볼로미세스(Sporobolomyces), 사카로미세스 또는 시조사카로미세스(Schizosaccharomyces)이다. 로도토룰라 루브라(Rhodotorula rubra), 로도토룰라 글루티니스(Rhodotorula glutinis), 로도토룰라 그라미니스(Rhodotorula graminis), 야로위아 리포리티카(Yarrowia lipolytica), 스포로볼로미세스 살모니콜로르(Sporobolomyces salmonicolor), 스포로볼로미세스 시바타누스(Sporobolomyces shibatanus) 또는 사카로미세스 세레비지아에의 종과 속이 특히 바람직하다.All biotin-synthetic organisms such as fungi, yeasts, plants or plant cells are mainly suitable for use as eukaryotic host organisms in the method according to the invention. Yeasts that may be preferably mentioned are Rhodotorula, Yarrowia, Sporobolomyces, Saccharomyces or Schizosaccharomyces. Rhodotorula rubra, Rhodotorula glutinis, Rhodotorula graminis, Yarrowia lipolytica, Sporobolomises Salmonolocolor ), Sporobolomyces shibatanus or Saccharomyces cerevisiae is particularly preferred.
주로, 모든 식물을 숙주 유기체로서 사용할 수 있으며, 동물 영양 또는 인간 영양에서 한 역할을 하는 식물, 예를 들어 옥수수, 밀, 보리, 호밀, 감자, 완두, 콩, 해바라기, 야자, 기장, 참깨, 코프라 또는 평지가 바람직하다. 아라비도프시스 탈리아나(Arabidopsis thaliana) 또는 라벤둘라 베라(Lavendula vera)와 같은 식물도 적합하다. 식물 세포 배양물, 식물 원형질체 또는 칼루스(callus) 배양물이 특히 바람직하다.Primarily, all plants can be used as host organisms, and plants that play a role in animal or human nutrition, such as corn, wheat, barley, rye, potatoes, peas, beans, sunflowers, palms, millet, sesame seeds, copra Or flat. Plants such as Arabidopsis thaliana or Lavendula vera are also suitable. Particular preference is given to plant cell cultures, plant protoplasts or callus cultures.
본 발명에 따른 방법에 비오틴을 증식 배지로 분비할 수 있고, 경우에 따라 이미 비오틴의 증가된 천연 합성을 또한 나타내는 세균, 진균, 효모 또는 식물 세포와 같은 미생물이 유리하게 사용된다. 유리하게는, 이들 유기체는 또한 이들의 비오틴 생합성의 조절에 관하여 결함이 있을 수 있다. 즉, 이 합성은 조절되지 않거나 또는 매우 감소된 정도로 조절될 뿐이다. 이 조절 결함으로 인해 이들 유기체는 이미 실질적으로 증가된 비오틴 생산성을 갖게 된다. 이러한 조절 결함은, 예를 들어 birA-결손 변이주 형태의 에세리히아 콜리로부터 공지되며, 바람직하게는 외부 영향에 의해 유도될 수 있는 결함, 예를 들어 온도-유도성인 결함으로서 세포내에 존재하여야 한다. 주로, 천연 비오틴 생성을 나타내지 않는 유기체도 비오틴 유전자로 형질전환되면 사용될 수 있다.Microorganisms such as bacteria, fungi, yeast or plant cells are advantageously used in the method according to the invention which can secrete biotin into the growth medium and optionally also exhibit an increased natural synthesis of biotin. Advantageously, these organisms may also be defective with regard to the regulation of their biotin biosynthesis. That is, this synthesis is not controlled or only adjusted to a very reduced degree. These regulatory deficiencies result in these organisms already having substantially increased biotin productivity. Such regulatory defects are known, for example, from Escherichia coli in the form of birA-defective mutant strains and should preferably be present in the cells as defects which can be induced by external influences, for example temperature-induced defects. Primarily, organisms that do not exhibit natural biotin production can be used if they are transformed with a biotin gene.
비오틴 생산성을 전체적으로 더욱 더 증가시키기 위하여, 본 발명에 따른 방법에서 유기체는 유리하게는 bioA, bioB, bioF, bioC, bioD, bioH, bioP, bioW, bioX, bioY 또는 bioR 군에서 선택된 하나 이상의 추가의 비오틴 유전자를 갖는다. 유리하게는, 비오틴 합성을 모의하는 이들 유전자는 또한 세포내에 서열 1, 서열 3, 서열 5 또는 서열 7 및 이들의 조합과 함께 존재할 수 있다. 비오틴 합성을 모의하는 유전자의 예는 플라보레독신 유전자 및 플라보레독신 환원효소 유전자이다. 이 추가의 유전자 또는 이들 추가의 유전자들은 세포내에 서열 1, 서열 3, 서열 5 또는 서열 7 또는 이들의 조합을 갖는 유전자와 같은 하나 이상의 카피로 존재할 수 있다. 이들은 서열 1, 서열 3, 서열 5 및(또는) 서열 7과 같은 벡터 또는 별개의 벡터에 존재하거나, 또는 염색체에 삽입될 수 있다. 서열 1, 서열 3, 서열 5 및(또는) 서열 7은 또한 한 벡터에 또는 별개의 벡터에 함께 있을 수 있거나, 또는 게놈에 삽입될 수 있다.In order to further increase biotin productivity as a whole, the organisms in the method according to the invention advantageously comprise at least one additional biotin selected from the group bioA, bioB, bioF, bioC, bioD, bioH, bioP, bioW, bioX, bioY or bioR. Has a gene. Advantageously, these genes simulating biotin synthesis may also be present in the cell with SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7 and combinations thereof. Examples of genes that simulate biotin synthesis are the flaboredoxin gene and the flaboredoxin reductase gene. This additional gene or these additional genes may be present in the cell in one or more copies, such as a gene having SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7, or a combination thereof. They may be present in vectors or separate vectors, such as SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and / or SEQ ID NO: 7, or inserted into chromosomes. SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and / or SEQ ID NO: 7 may also be together in one vector or in separate vectors, or may be inserted into the genome.
본 발명에 따른 유전자 구조물은 유전자의 발현을 증가시키기 위하여 하나 이상의 조절 신호에 기능적으로 연결된 서열 1의 SAM 합성효소 유전자 및 서열 3, 서열 5 및(또는) 서열 7의 비오틴 합성 유전자, 및 이들의 기능적 변이체, 유사체 또는 유도체의 유전자 서열인 것으로 이해되어야 한다. 이들 신규한 조절 서열 외에, 이들 서열의 천연 조절은 아직도 실제 구조 유전자의 상류에 존재할 수 있으며, 경우에 따라 천연 조절이 끊기고 유전자의 발현이 증가되도록 유전학적으로 변할 수 있다. 그러나, 유전자 구조물은 또한 단순한 방식으로 조립될 수 있다. 즉, 서열 1, 서열 3, 서열 5 및(또는) 서열 7의 상류에 추가의 조절 신호가 삽입되지 않고, 조절을 갖는 천연의 프로모터는 제거되지 않는다. 대신에, 천연의 조절 서열은 비오틴에 의한 조절이 더 이상 일어나지 않고 유전자 발현이 증가되도록 돌연변이된다. 서열 1, 서열 3, 서열 5 및(또는) 서열 7은 하나의 프로모터의 조절하에 있거나 또는 별개의 프로모터의 조절하에 있을 수 있다. 추가의 유리한 조절 요소가 또한 DNA 서열의 3' 말단에 삽입될 수 있다. 서열 1, 서열 3, 서열 5 또는 서열 7을 갖는 유전자는 유전자 구조물에 하나 이상의 카피로 존재할 수 있다.The genetic construct according to the present invention is a SAM synthase gene of SEQ ID NO: 1 and a biotin synthesis gene of SEQ ID NO: 3, SEQ ID NO: 5 and / or SEQ ID NO: 7, and their functionalities, functionally linked to one or more regulatory signals to increase expression of the gene. It is to be understood that this is the gene sequence of the variant, analog or derivative. In addition to these novel regulatory sequences, the natural regulation of these sequences may still exist upstream of the actual structural gene, and in some cases, may be genetically altered so that natural regulation is interrupted and expression of the gene is increased. However, genetic constructs can also be assembled in a simple manner. That is, no further regulatory signal is inserted upstream of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and / or SEQ ID NO: 7, and the native promoter with regulation is not removed. Instead, natural regulatory sequences are mutated such that regulation by biotin no longer occurs and gene expression is increased. SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and / or SEQ ID NO: 7 may be under the control of one promoter or under the control of a separate promoter. Additional advantageous regulatory elements can also be inserted at the 3 'end of the DNA sequence. The gene having SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7 may exist in one or more copies in the gene construct.
본 발명에 따른 방법에 유리한 조절 서열은, 예를 들어 그람-음성 세균에 유리하게 사용되는 cos-, tac-, trp-, tet-, trp-tet-, lpp-, lac-, lpp-lac-, lacIq-, T7-, T5-, T3-, gal-, trc-, ara-, SP6-, λ-PR- 또는 λ-PL-프로모터와 같은 프로모터에 존재한다. 추가의 유리한 조절 서열은, 예를 들어 그람-양성 프로모터인 amy 및 SPO2에, 효모 프로모터인 ADC1, MFα, AC, P-60, CYC1, GAPDH에 또는 식물 프로모터인 CaMV/35S, SSU, OCS, lib4, usp, STLS1, B33 또는 nos에, 또는 유비퀴틴(ubiquitin) 프로모터 또는 파세올린(phaseolin) 프로모터에 존재한다.Regulatory sequences advantageous for the method according to the invention are, for example, cos-, tac-, trp-, tet-, trp-tet-, lpp-, lac-, lpp-lac- which are advantageously used in Gram-negative bacteria. , lacI q- , T7-, T5-, T3-, gal-, trc-, ara-, SP6-, λ-P R -or λ-P L -promoter. Further advantageous regulatory sequences are, for example, in the Gram-positive promoters amy and SPO2, in the yeast promoters ADC1, MFα, AC, P-60, CYC1, GAPDH or in the plant promoters CaMV / 35S, SSU, OCS, lib4 , usp, STLS1, B33 or nos, or in the ubiquitin promoter or in the phaseolin promoter.
주로, 본 발명에 따른 방법에 전술한 프로모터와 같은 모든 천연 프로모터를 이들의 조절 서열과 함께 사용할 수 있다. 또한, 합성 프로모터도 유리하게 사용할 수 있다.Primarily, all natural promoters, such as the promoters described above in the method according to the invention, can be used with their regulatory sequences. Synthetic promoters can also be advantageously used.
자신의 프로모터를 가질 수 있거나, 서열중 하나의 프로모터의 조절하에 있을 수 있거나, 또는 서열 1, 서열 3, 서열 5 또는 서열 7의 모든 서열의 프로모터의 조절하에 있을 수 있는, bioA, bioB, bioF, bioC, bioD, bioH, bioP, bioW, bioX, bioY 또는 bioR 군에서 선택된 다른 비오틴 유전자가 유전자 구조물에 하나 이상의 카피로 존재할 수 있다.BioA, bioB, bioF, which may have its own promoter, may be under the control of a promoter of one of the sequences, or may be under the control of a promoter of all sequences of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7 Other biotin genes selected from the group bioC, bioD, bioH, bioP, bioW, bioX, bioY or bioR may be present in the gene construct in one or more copies.
전술한 숙주 유기체에서의 발현을 위하여, 유전자 구조물은 유리하게는 숙주에서 유전자의 최적의 발현을 달성할 수 있게 하는 숙주-특이성 벡터로 삽입된다. 벡터는 숙련자에게 널리 공지되어 있으며, 예를 들어 클로닝 벡터란 책(Eds. Pouwels P. H. et al. Elsevier, Amsterdam-New York-Oxford, 1985, ISBN 0 444 904018)으로부터 확인할 수 있다. 플라스미드 외에, 백터도 또한 파지, 비루스, 트랜스포손(transposon), IS 요소, 플라스미드, 코스미드, 또는 선형 또는 환형 DNA와 같은 숙련자에게 공지된 모든 다른 벡터인 것으로 이해되어야 한다. 이들 벡터는 숙주 유기체에서 자율적으로 복제되거나 염색체 복제될 수 있다.For expression in the host organisms described above, the gene constructs are advantageously inserted into host-specific vectors that enable to achieve optimal expression of the gene in the host. Vectors are well known to the skilled person and can be found, for example, in the book Cloning Vector (Eds. Pouwels P. H. et al. Elsevier, Amsterdam-New York-Oxford, 1985, ISBN 0 444 904018). In addition to plasmids, vectors should also be understood to be phages, viruses, transposons, IS elements, plasmids, cosmids, or any other vector known to the skilled person, such as linear or circular DNA. These vectors can autonomously replicate or chromosome replicate in the host organism.
발현 시스템은 예로서 전술된 숙주 유기체와 그 유기체에 적당한 벡터(예: 플라스미드, 비루스 또는 파지, 예를 들어 RNA 폴리머라제/프로모터 시스템을 함유하는 플라스미드, 파지 λ, 또는 Mu 또는 다른 용원 파지 또는 트랜스포손 및(또는) 추가의 유리한 조절 서열)의 조합인 것으로 이해되어야 한다.Expression systems include, for example, the above-described host organisms and plasmids, phage λ, or Mu or other source phages or transposons containing a vector suitable for that organism, such as a plasmid, virus or phage, for example an RNA polymerase / promoter system. And / or further advantageous regulatory sequences).
발현 시스템이란 용어는 바람직하게는 에세리히아 콜리와 그의 플라스미드 및 파지 및 관련 프로모터의 조합, 및 바실루스와 그의 플라스미드 및 프로모터의 조합인 것으로 이해되어야 한다.The term expression system is preferably to be understood to be a combination of Escherichia coli and its plasmids and phages and related promoters, and a combination of Bacillus and its plasmids and promoters.
추가의 3' 및(또는) 5'-말단 조절 서열은 또한 본 발명에 따른 서열 1, 서열 3, 서열 5 및(또는) 서열 7을 유리하게 발현하기에 적합하다.Additional 3 'and / or 5'-terminal regulatory sequences are also suitable for advantageously expressing SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and / or SEQ ID NO: 7 according to the present invention.
이들 조절 서열은 비오틴 유전자의 특이 발현 및 단백질의 발현을 이룰 수 있게 하려는 것이다. 이는 숙주 유기체에 따라, 예를 들어 유전자가 유도 후 겨우 발현되거나 과잉발현되거나, 또는 유전자가 즉시 발현되고(발현되거나) 과잉발현됨을 뜻할 수 있다.These regulatory sequences are intended to enable the specific expression of biotin genes and the expression of proteins. This may mean that depending on the host organism, for example, the gene is barely expressed or overexpressed after induction, or the gene is immediately expressed (expressed) and overexpressed.
이와 관련하여, 조절 서열 또는 인자는 바람직하게는 비오틴 유전자 발현에 긍정적으로 영향을 끼침으로써 발현을 증가시킬 수 있다. 예를 들어, 조절 요소는 유리하게는 프로모터 및(또는) 증진자(enhancer)와 같은 강한 전사 신호를 사용하여 전사 수준에서 강화될 수 있다. 그러나, 이 외에도 예를 들어 mRNA의 안정성을 개선시킴으로써 번역을 강화할 수도 있다.In this regard, regulatory sequences or factors may preferably increase expression by positively affecting biotin gene expression. For example, regulatory elements can advantageously be enhanced at the level of transcription using strong transcriptional signals such as promoters and / or enhancers. In addition, however, translation can also be enhanced, for example by improving the stability of mRNA.
증진자는, 예를 들어 RNA 폴리머라제와 DNA 사이의 상호작용을 개선시킴으로써 비오틴 유전자 발현의 증가를 가져오는 DNA 서열인 것으로 이해되어야 한다.Enhancers should be understood to be DNA sequences that result in increased biotin gene expression, for example by improving the interaction between RNA polymerase and DNA.
서열 1, 서열 3, 서열 5 및 서열 7로부터 유도된 단백질(서열 2, 서열 4, 서열 6 및 서열 8 참조)의 증가 및 이들의 효소 활성의 증가는, 출발 효소에 비하여, 예를 들어 UV 조사와 같은 전형적인 돌연변이 발생에 의해 또는 화학적 돌연변이원으로 처리하고(처리하거나) 부위-지향성 돌연변위 발생, 결실(들), 삽입(들) 및(또는) 치환(들)과 같은 특이한 돌연변이 발생에 의해 상응하는 유전자 서열 또는 이들의 상동물의 서열을 변화시킴으로써 이룰 수 있다. 기술된 유전자 증폭과는 별도로, 증가된 효소 활성은 또한 효소 생합성을 억제하는 소거 인자에 의해 또한(또는) 불활성 효소 대신에 활성 효소를 합성함으로써 얻을 수 있다.The increase in proteins derived from SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, and SEQ ID NO: 7 (see SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, and SEQ ID NO: 8) and their enzymatic activity, for example, compared to the starting enzyme, for example UV irradiation By typical mutagenesis such as or by treatment with chemical mutagens and / or by specific mutagenesis such as site-directed mutagenesis, deletion (s), insertion (s) and / or substitution (s). It can be achieved by changing the gene sequence or the sequence of their molar animals. Apart from the described gene amplifications, increased enzyme activity can also be obtained by scavenging factors that inhibit enzyme biosynthesis and / or by synthesizing active enzymes instead of inactive enzymes.
본 발명에 따른 방법은 유리하게는 DTB의 비오틴으로의 전환을 증가시키고, 따라서 서열 1, 서열 3, 서열 5 및 서열 7, 및 서열 1과 서열 5 또는 서열 1과 서열 7을 갖는 유전자의 조합, 바람직하게는 서열 1과 서열 3을 갖는 유전자의 조합을 갖는 비오틴 유전자를 사용함으로써 전체 비오틴 생산성이 증가되는데, 이들 유전자는 벡터에 의해 또한(또는) 염색체 클로닝에 의해 유기체에 도입된다.The method according to the invention advantageously increases the conversion of DTB into biotin, thus combining the genes having SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7 and SEQ ID NO: 1 and SEQ ID NO: 5 or SEQ ID NO: 1 and SEQ ID NO: 7, Preferably the use of biotin genes having a combination of genes having SEQ ID NO: 1 and SEQ ID NO: 3 increases overall biotin productivity, which genes are introduced into the organism by vectors and / or by chromosomal cloning.
본 발명에 따른 방법에서, 서열 1, 서열 3, 서열 5 및(또는) 서열 7을 갖는 미생물은 이 유기체를 증식시킬 수 있는 배지에서 증식된다. 이 배지는 합성 배지 또는 천연 배지일 수 있다. 숙련자에게 공지되어 있으며 유기체에 적당한 배지를 사용한다. 미생물을 증식시키기 위하여, 사용된 배지는 탄소원, 질소원, 무기 염, 및 경우에 따라 소량의 비타민 및 미량 원소를 함유한다.In the method according to the invention, microorganisms having SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and / or SEQ ID NO: 7 are propagated in a medium capable of propagating this organism. This medium may be synthetic or natural medium. A medium known to the skilled person and suitable for the organism is used. To propagate microorganisms, the medium used contains a carbon source, a nitrogen source, an inorganic salt, and optionally small amounts of vitamins and trace elements.
유리한 탄소원의 예는 모노사카라이드, 디사카라이드 또는 폴리사카라이드(예: 글루코즈, 프럭토즈, 만노즈, 크실로즈, 갈락토즈, 리보즈, 소르보즈, 리불로즈, 락토즈, 말토즈, 슈크로즈, 라피노즈, 전분 또는 셀룰로즈)와 같은 당, 복합 당원(예: 당밀), 당 인산염(예: 프럭토즈-1,6-비스포스페이트), 당 알콜(에: 만니톨), 폴리올(예: 글리세롤), 알콜(예: 메탄올 또는 에탄올), 카르복실산(예: 시트르산, 락트산 또는 아세트산), 지방(예: 대두유 또는 평지자유) 또는 아미노산(예: 글루탐산 또는 아스파르트산), 또는 질소원으로서 동시에 사용될 수 있는 아미노 당이다.Examples of advantageous carbon sources are monosaccharides, disaccharides or polysaccharides such as glucose, fructose, mannose, xylose, galactose, ribose, sorbose, ribulose, lactose, maltose, sucrose , Sugars such as raffinose, starch or cellulose, complex sugar sources (e.g. molasses), sugar phosphates (e.g. fructose-1,6-bisphosphate), sugar alcohols (e. Mannitol), polyols (e.g. glycerol) , Alcohols (such as methanol or ethanol), carboxylic acids (such as citric acid, lactic acid or acetic acid), fats (such as soybean oil or rapeseed oil) or amino acids (such as glutamic acid or aspartic acid), or may be used simultaneously as a nitrogen source Amino sugars.
유리한 질소원은 유기 또는 무기 질소 화합물 또는 이들 화합물을 함유하는 물질이다. 그 예는 암모늄 염(예: NH4Cl 또는 (NH4)2SO4), 질산염 또는 우레아, 또는 복합 질소원(예: 옥수수 침지액, 브류어(brewer)의 효모 자가분해물, 대두분, 밀 글루텐, 효모 추출물, 고기 추출물, 카제인 가수분해물 또는 질소원으로서 동시에 자주 사용될 수 있는 효모 또는 감자 단백질)이다.Advantageous nitrogen sources are organic or inorganic nitrogen compounds or materials containing these compounds. Examples include ammonium salts (eg NH 4 Cl or (NH 4 ) 2 SO 4 ), nitrates or ureas, or complex nitrogen sources (eg corn steep liquor, brewer's yeast autolysates, soy flour, wheat gluten) , Yeast extract, meat extract, casein hydrolysate or yeast or potato protein, which can often be used simultaneously at the same time as a nitrogen source.
무기 염의 예는 칼슘, 마그네슘, 나트륨, 망간, 칼륨, 아연, 구리 및 철의 염이다. 특히 언급되는 이들 염의 음이온은 염화물, 황산염 및 인산염 이온이다. 본 발명에 따른 방법에서 생산성을 증가시키는데 중요한 인자는 생산 배지에 Fe2+또는 Fe3+염 및(또는) 칼륨 염을 첨가하는 것이다.Examples of inorganic salts are salts of calcium, magnesium, sodium, manganese, potassium, zinc, copper and iron. Particularly mentioned anions of these salts are chloride, sulfate and phosphate ions. An important factor in increasing productivity in the process according to the invention is the addition of Fe 2+ or Fe 3+ salts and / or potassium salts to the production medium.
경우에 따라, 비타민과 같은 생육인자 또는 리보플라빈, 티아민, 폴산, 니코틴산, 파토텐산염 또는 피리독신과 같은 생육 촉진제, 아미노산(예: 알라닌, 시스테인, 아스파라긴, 아스파르트산, 글루타민, 세린, 메티오닌 또는 리신), 카르복실산(예: 시트르산, 포름산, 피멜산 또는 락트산) 또는 디티오트레이톨과 같은 물질이 영양 배지에 첨가된다.If desired, growth factors such as vitamins or growth promoters such as riboflavin, thiamine, folic acid, nicotinic acid, patotothenate or pyridoxine, amino acids (e.g. alanine, cysteine, asparagine, aspartic acid, glutamine, serine, methionine or lysine) Substances such as carboxylic acid (eg citric acid, formic acid, pimelic acid or lactic acid) or dithiothritol are added to the nutrient medium.
항생물질은, 경우에 따라, 세포에서 비오틴 유전자-함유 벡터를 안정화하기 위하여 배지에 첨가된다.Antibiotics are optionally added to the medium to stabilize the biotin gene-containing vector in the cells.
상기 영양분이 혼합되는 비율은 발효의 성질에 따라 좌우되며, 개별적인 경우에 따라 결정된다. 배지 성분들은 모두, 필요에 따라 별도로 멸균되거나 함께 멸균된 후, 발효를 시작할 때 초기에 도입될 수 있거나, 또는 경우에 따라 발효하는 동안 순차적으로 첨가될 수 있다.The proportion at which the nutrients are mixed depends on the nature of the fermentation and on a case by case basis. The media components can all be introduced separately at the start of fermentation, if desired, after being sterilized separately or together together as needed, or optionally added sequentially during fermentation.
배양 조건은 유기체가 최적으로 생육하고 최고의 가능한 수율이 얻어지도록 조정된다. 바람직한 배양 온도는 15℃ 내지 40℃이다. 25℃ 내지 37℃의 온도가 특히 바람직하다. pH는 바람직하게는 3 내지 9의 범위로 유지된다. 5 내지 8의 pH 값이 특히 유리하다. 일반적으로, 8 내지 240시간, 바람직하게는 8 내지 120시간의 배양 기간이 충분하다. 이 시간내에, 최대량의 비오틴이 배지에 축적되고(축적되거나) 세포를 파괴한 후 얻어질 수 있다.Culture conditions are adjusted so that the organism grows optimally and the best possible yield is obtained. Preferred culture temperatures are from 15 ° C. to 40 ° C. Particular preference is given to temperatures of 25 ° C to 37 ° C. The pH is preferably maintained in the range of 3-9. PH values of 5 to 8 are particularly advantageous. In general, an incubation period of 8 to 240 hours, preferably 8 to 120 hours is sufficient. Within this time, the maximum amount of biotin can be obtained after accumulating (accumulating) in the medium and destroying the cells.
비오틴을 생성하기 위한 본 발명에 따른 방법은 연속적으로 또는 회분식으로 또는 반회분식으로 수행될 수 있다. 비오틴 유전자로 형질전환된 식물 세포로부터 전체 식물이 재생되면, 이들을 본 발명에 따른 방법에 따라 완전하게 정상적으로 생육시키고 증식시킬 수 있다.The process according to the invention for producing biotin can be carried out continuously or batchwise or semibatch. Once the whole plants are regenerated from plant cells transformed with the biotin gene, they can be grown and propagated completely normally according to the method according to the invention.
1. S-아데노실메티오닌 합성효소 유전자의 클로닝(서열 1):1. Cloning of S-adenosylmethionine synthase gene (SEQ ID NO: 1):
게놈 이 콜리 DNA로부터 출발하여, 2가지의 특정 올리고뉴클레오티드를 사용하는 폴리머라제 연쇄 반응에 의해 이 콜리 염색체로부터 SAM 합성효소를 암호화하는 유전자(metK)를 증폭하였다. 이렇게 증폭된 DNA를 정제하고, 제한 효소 Acc65I로 분해하고, 이 콜리 균주에서 이 유전자를 과잉발현시킬 수 있는, 동일한 효소로 절단된 벡터에 삽입하였다. 두 올리고뉴클레오티드중 하나를 사용하여, 최적화된 번역 신호를 갖는 유전자 구조물을 제공하였다.The genome amplifies the gene encoding SAM synthase (metK) from this coli chromosome by polymerase chain reaction using two specific oligonucleotides, starting from coli DNA. The amplified DNA was purified, digested with restriction enzyme Acc65I, and inserted into a vector digested with the same enzyme capable of overexpressing this gene in this collie strain. One of the two oligonucleotides was used to provide a genetic construct with an optimized translation signal.
a) 이 콜리 염색체로부터 metK 유전자를 증폭하기 위한 올리고뉴클레오티드의 생성a) production of oligonucleotides to amplify metK gene from this coli chromosome
metK를 리보솜 결합 부위, 암호화 서열의 개시 코돈 및 두 제한 효소 인식 부위 사이의 정지 코돈으로 이루어진 발현 카세트로서 증폭시켰다. 두 제한 부위를 위하여 Acc65I 인식 서열이 선택되었다. 뉴클레오티드 PmetK1(5'-GCGGTACCAGGTGATATTAAATATGGCAAAAC-3') 및 PmetK2(5'-CGGGTACCGATTACTTCAGACCGGCAGC-3')를 사용하여 metK 유전자를 증폭하고 클로닝하였다.metK was amplified as an expression cassette consisting of a ribosomal binding site, an initiation codon of the coding sequence and a stop codon between two restriction enzyme recognition sites. Acc65I recognition sequences were selected for both restriction sites. MetK genes were amplified and cloned using the nucleotides PmetK1 (5'-GCGGTACCAGGTGATATTAAATATGGCAAAAC-3 ') and PmetK2 (5'-CGGGTACCGATTACTTCAGACCGGCAGC-3').
b) PCR의 수행:b) Performing PCR:
조건:Condition:
이 콜리 W3110으로부터 얻은 염색체 DNA 0.5㎍을 주형으로서 사용하였다. 올리고뉴클레오티드 PmetK1 및 PmetK2를 각각 15p㏖의 농도로 사용하였다. dNTP의 농도는 200μM이었다. 제조자 반응 완충제내의 Pwo DNA 폴리머라제(뵈링거 만하임(Boehringer Mannheim)) 2.5U를 폴리머라제로서 사용하였다. PCR 반응 체적은 100㎕이었다.0.5 µg of chromosomal DNA obtained from this Coli W3110 was used as a template. Oligonucleotides PmetK1 and PmetK2 were each used at a concentration of 15 mmol. The concentration of dNTP was 200 μM. 2.5 U of Pwo DNA polymerase (Boehringer Mannheim) in manufacturer reaction buffer was used as polymerase. PCR reaction volume was 100 μl.
증폭:Amplification:
DNA를 94℃에서 2분동안 변성시켰다. 그 다음, 올리고뉴클레오티드를 55℃에서 30초동안 어닐링(annealing)하였다. 신장은 72℃에서 75초동안 일어났다. PCR 반응은 30주기에 걸쳐 수행하였다.DNA was denatured at 94 ° C. for 2 minutes. The oligonucleotides were then annealed at 55 ° C. for 30 seconds. Elongation occurred at 72 ° C. for 75 seconds. PCR reactions were performed over 30 cycles.
약 1145bp의 크기를 갖는 생성된 DNA 생성물을 정제하고, 적합한 완충제내에서 Acc65I로 분해하였다.The resulting DNA product, having a size of about 1145 bp, was purified and digested with Acc65I in a suitable buffer.
c) 발현 벡터에서의 metK의 클로닝c) cloning of metK in the expression vector
벡터 pHS1(구성은 독일 특허출원 제197.31274.8호, 우선권 22.7.97, 실시예 1, pp. 14-17에 기술되어 있음)을 Acc65I로 분해하고, 새우 알칼리성 포스파타제(SAP)(뵈링거 만하임)를 사용하여 탈포스포릴화하였다. SAP를 변성시킨 후, 제조자의 지시에 따라 고속 DNA 연결 키트(Rapid DNA Ligation kit)를 사용하여 1:3의 몰비로 벡터와 단편을 연결시켰다. 연결 혼합물로 이 콜리 XL-1-blue 균주를 형질전환시켰다. 플라스미드 제조 및 제한 분석에 의해 양의 클론이 밝혀졌다. 제한 분해 및 서열결정에 의해 pHS1내의 metK 단편의 정확한 배향을 결정하였다. 생성된 구조물을 pHS1 metK로 명명하였다(도 1). pHS1 metK의 서열은 서열 9에 제공된다. 서열 10은 metK-암호화 영역으로부터 추론한 아미노산 서열을 나타낸다.Vector pHS1 (constitution is described in German patent application No. 197.31274.8, priority 22.7.97, Example 1, pp. 14-17) was digested with Acc65I and shrimp alkaline phosphatase (SAP) (Schoringer Mannheim) Dephosphorylation was used. After denaturation of SAP, the vector and fragment were linked in a molar ratio of 1: 3 using a Rapid DNA Ligation kit according to the manufacturer's instructions. This Coli XL-1-blue strain was transformed with the ligation mixture. Positive clones were revealed by plasmid preparation and restriction analysis. Restriction digestion and sequencing determined the exact orientation of the metK fragment in pHS1. The resulting construct was named pHS1 metK (FIG. 1). The sequence of pHS1 metK is provided in SEQ ID NO: 9. SEQ ID NO: 10 shows the amino acid sequence deduced from the metK-coding region.
2. 플라스미드 pHBbio14 및 pHS1 bioS1의 구성2. Composition of Plasmids pHBbio14 and pHS1 bioS1
플라스미드 pHBbio14 및 pHS1 bioS1의 구성은 이미 기술되어 있다(독일 특허출원 제197.31274.8호, 우선권 22.7.97, 실시예 1, 2 및 5).The construction of the plasmids pHBbio14 and pHS1 bioS1 has already been described (German patent application no. 197.31274.8, priority 22.7.97, examples 1, 2 and 5).
3. pHS1 metK bioS1의 구성3. Composition of pHS1 metK bioS1
플라스미드 제조 방법(뵈링거)을 사용하여 플라스미드 pHS1 bioS1(서열 11, (독일 특허출원 제197.31274.8호, 우선권 22.7.97), 서열 12는 bioS1-암호화 영역으로부터 추론한 아미노산 서열을 나타냄) 및 pHS1 metK(서열 9)를 정제하였다. pHS1 metK로부터 Acc65I로 분해하여 metK 유전자를 갖는 단편을 단리하였다. pHS1 bioS1을 Acc65I로 분해하고, 새우 알칼리성 포스파타제(SAP)(뵈링거 만하임)로 탈포스포릴화하였다. 제조자의 지시에 따라 SAP를 변성시킨 후, 제조자의 지시에 따라 고속 DNA 연결 키트를 사용하여 1:3의 몰비로 벡터와 metK 단편을 연결시켰다. 연결 혼합물로 이 콜리 XL-1-blue 균주를 형질전환시켰다. 플라스미드 제조 및 제한 분석에 의해 양성 클론이 확인되었다. 제한 분해 및 서열결정에 의해 pHS1 bioS1내의 metK 단편의 정확한 배향을 결정하였다. 생성된 구조물을 pHS1 metK bioS1로 명명하였다(도 2). pHS1 metK bioS1의 서열은 서열 13에 제공된다. 서열 14는 metK-암호화 영역으로부터 추론한 아미노산 서열을 나타내고, 서열 15는 bioS1-암호화 영역으로부터 추론한 아미노산 서열을 나타낸다.Plasmid pHS1 bioS1 (SEQ ID NO: 11 (German Patent Application No. 197.31274.8, Priority 22.7.97), SEQ ID NO: 12 shows the amino acid sequence inferred from the bioS1-coding region) and pHS1 using the plasmid preparation method (Schlinger) metK (SEQ ID NO: 9) was purified. Fragments carrying the metK gene were isolated by digestion with Acc65I from pHS1 metK. pHS1 bioS1 was digested with Acc65I and dephosphorylated with shrimp alkaline phosphatase (SAP) (Möllinger Mannheim). After denaturation of SAP according to the manufacturer's instructions, the vector and metK fragments were linked in a molar ratio of 1: 3 using a high speed DNA ligation kit according to the manufacturer's instructions. This Coli XL-1-blue strain was transformed with the ligation mixture. Positive clones were identified by plasmid preparation and restriction analysis. Restriction digestion and sequencing determined the exact orientation of the metK fragment in pHS1 bioS1. The resulting construct was named pHS1 metK bioS1 (FIG. 2). The sequence of pHS1 metK bioS1 is provided in SEQ ID NO: 13. SEQ ID NO: 14 shows an amino acid sequence inferred from a metK-coding region and SEQ ID NO: 15 shows an amino acid sequence inferred from a bioS1-coding region.
4. metK, bioS1 및 metK와 bioS1를 함께 과잉발현함에 의한 비오틴 생산성의 증가4. Increased biotin productivity by overexpressing metK, bioS1 and metK and bioS1 together
자연적인 리팜피신-내성 콜로니를 리팜피신 플레이트에 도말함으로써 BM4086(케트너(Ketner) 및 캠벨(Campbell)의 문헌[J. Molec. Biology, 1975, 96:13]))으로부터 단리하였다. 이들 내성 균주중 하나로부터 P1 용해질이 생성되었다. 이 P1 용해질을 균주 W3110으로 형질도입하고, 이어서 리팜피신을 사용하여 클론을 선택하였다. 생성된 균주를 CaCl2방법(마니아티스(Maniatis) 등의 문헌[Molecular Cloning, Cols Spring Harbour Laboratory Press, 1989])을 사용하여 플라스미드 pHBbio14로 형질전환시키고, 1㎖당 암피실린 100㎍을 함유하는 LB에서 생육시켰다. 단리된, 형질전환된 균주(LU5560)를 CaCl2방법을 사용하여 각각의 경우에 플라스미드 pHS1, pHS1 metK, pHS1 bioS1 또는 pHS1 metK bioS1로 형질전환시킨 다음, 1㎖당 암피실린 100㎍ 및 카나마이신 25㎍을 함유하는 LB 한천에서 선택하였다.Natural rifampicin-resistant colonies were isolated from BM4086 (Ketner and Campbell, J. Molec. Biology, 1975, 96:13) by plating on rifampicin plates. P1 lysate was produced from one of these resistant strains. This P1 lysate was transduced with strain W3110 and then clones were selected using rifampicin. The resulting strains were transformed with plasmid pHBbio14 using the CaCl 2 method (Maniatis et al. (Molecular Cloning, Cols Spring Harbor Laboratory Press, 1989)) and in LB containing 100 μg of ampicillin per ml. It was grown. The isolated, transformed strain (LU5560) was transformed in each case with plasmid pHS1, pHS1 metK, pHS1 bioS1 or pHS1 metK bioS1 using CaCl 2 method, followed by 100 μg of ampicillin and 25 μg of kanamycin per mL. It was selected from the containing LB agar.
각 형질전환체로부터 얻은 하나의 콜로니를 각각의 경우에 적당한 항생물질을 함유하는 DYT 배양액에 접종하고, 12시간동안 배양하였다. 밤새 배양하여(=ONC) 1ℓ당 글리세롤 30g 및 적당한 항생물질을 함유한 TB 배지(샘브룩(Sambrook, J.), 프리치(Fritsch, E. F.), 마니아티스의 문헌[2nd ed. Cold Spring Harbor Laboratory Press., 1989 ISBN 0-87969-373-8])에 10㎖ 배양액을 접종하였다. 플라스미드 pHS1, pHS1 metK, pHS1 bioS1 및 pHS1 metK bioS1이 존재하는 경우, metK 및 bioS1 유전자, 또는 각각 두 유전자의 조합의 발현을 유도하기 위하여, 1mM IPTG 및 0.5% 아라비노즈를 동시에 첨가하였다. 24시간 후, 세포를 배양 상청액으로부터 원심분리에 의해 제거하고, 스트렙타비딘을 사용하는 경쟁적 ELISA에 의해 상청액내의 비오틴 농도를 결정하였다. 이 결정의 결과를 하기 표 I에 나타낸다.One colony from each transformant was inoculated in DYT culture containing the appropriate antibiotic in each case and incubated for 12 hours. Incubated overnight (= ONC) TB medium containing 30 g of glycerol per liter and appropriate antibiotics (Sambrook, J., Frisch, EF, Maniatis, 2nd ed. Cold Spring Harbor Laboratory Press., 1989 ISBN 0-87969-373-8]) was inoculated with 10 ml culture. When plasmids pHS1, pHS1 metK, pHS1 bioS1 and pHS1 metK bioS1 were present, 1 mM IPTG and 0.5% arabinose were added simultaneously to induce expression of metK and bioS1 genes, or a combination of both genes, respectively. After 24 hours, cells were removed from the culture supernatant by centrifugation and the biotin concentration in the supernatant was determined by competitive ELISA using streptavidin. The results of this determination are shown in Table I below.
<110> BASF Aktiengesellschaft<110> BASF Aktiengesellschaft
<120> Verfahren zur Herstellung von Biotin<120> Verfahren zur Herstellung von Biotin
<130> 0050_48792<130> 0050_48792
<140> 19806872.7<140> 19806872.7
<141> 1998-02-19<141> 1998-02-19
<160> 15<160> 15
<170> PatentIn Vers. 2.0<170> Patent In Vers. 2.0
<210> 1<210> 1
<211> 1155<211> 1155
<212> DNA<212> DNA
<213> Escherichia coli<213> Escherichia coli
<220><220>
<221> CDS<221> CDS
<222> (1)..(1155)<222> (1) .. (1155)
<400> 1<400> 1
atg gca aaa cac ctt ttt acg tcc gag tcc gtc tct gaa ggg cat cct 48atg gca aaa cac ctt ttt acg tcc gag tcc gtc tct gaa ggg cat cct 48
Met Ala Lys His Leu Phe Thr Ser Glu Ser Val Ser Glu Gly His ProMet Ala Lys His Leu Phe Thr Ser Glu Ser Val Ser Glu Gly His Pro
1 5 10 151 5 10 15
gac aaa att gct gac caa att tct gat gcc gtt tta gac gcg atc ctc 96gac aaa att gct gac caa att tct gat gcc gtt tta gac gcg atc ctc 96
Asp Lys Ile Ala Asp Gln Ile Ser Asp Ala Val Leu Asp Ala Ile LeuAsp Lys Ile Ala Asp Gln Ile Ser Asp Ala Val Leu Asp Ala Ile Leu
20 25 3020 25 30
gaa cag gat ccg aaa gca cgc gtt gct tgc gaa acc tac gta aaa acc 144gaa cag gat ccg aaa gca cgc gtt gct tgc gaa acc tac gta aaa acc 144
Glu Gln Asp Pro Lys Ala Arg Val Ala Cys Glu Thr Tyr Val Lys ThrGlu Gln Asp Pro Lys Ala Arg Val Ala Cys Glu Thr Tyr Val Lys Thr
35 40 4535 40 45
ggc atg gtt tta gtt ggc ggc gaa atc acc acc agc gcc tgg gta gac 192ggc atg gtt tta gtt ggc ggc gaa atc acc acc agc gcc tgg gta gac 192
Gly Met Val Leu Val Gly Gly Glu Ile Thr Thr Ser Ala Trp Val AspGly Met Val Leu Val Gly Gly Glu Ile Thr Thr Ser Ala Trp Val Asp
50 55 6050 55 60
atc gaa gag atc acc cgt aac acc gtt cgc gaa att ggc tat gtg cat 240atc gaa gag atc acc cgt aac acc gtt cgc gaa att ggc tat gtg cat 240
Ile Glu Glu Ile Thr Arg Asn Thr Val Arg Glu Ile Gly Tyr Val HisIle Glu Glu Ile Thr Arg Asn Thr Val Arg Glu Ile Gly Tyr Val His
65 70 75 8065 70 75 80
tcc gac atg ggc ttt gac gct aac tcc tgt gcg gtt ctg agc gct atc 288tcc gac atg ggc ttt gac gct aac tcc tgt gcg gtt ctg agc gct atc 288
Ser Asp Met Gly Phe Asp Ala Asn Ser Cys Ala Val Leu Ser Ala IleSer Asp Met Gly Phe Asp Ala Asn Ser Cys Ala Val Leu Ser Ala Ile
85 90 9585 90 95
ggc aaa cag tct cct gac atc aac cag ggc gtt gac cgt gcc gat ccg 336ggc aaa cag tct cct gac atc aac cag ggc gtt gac cgt gcc gat ccg 336
Gly Lys Gln Ser Pro Asp Ile Asn Gln Gly Val Asp Arg Ala Asp ProGly Lys Gln Ser Pro Asp Ile Asn Gln Gly Val Asp Arg Ala Asp Pro
100 105 110100 105 110
ctg gaa cag ggc gcg ggt gac cag ggt ctg atg ttt ggc tac gca act 384ctg gaa cag ggc gcg ggt gac cag ggt ctg atg ttt ggc tac gca act 384
Leu Glu Gln Gly Ala Gly Asp Gln Gly Leu Met Phe Gly Tyr Ala ThrLeu Glu Gln Gly Ala Gly Asp Gln Gly Leu Met Phe Gly Tyr Ala Thr
115 120 125115 120 125
aat gaa acc gac gtg ctg atg cca gca cct atc acc tat gca cac cgt 432aat gaa acc gac gtg ctg atg cca gca cct atc acc tat gca cac cgt 432
Asn Glu Thr Asp Val Leu Met Pro Ala Pro Ile Thr Tyr Ala His ArgAsn Glu Thr Asp Val Leu Met Pro Ala Pro Ile Thr Tyr Ala His Arg
130 135 140130 135 140
ctg gta cag cgt cag gct gaa gtg cgt aaa aac ggc act ctg ccg tgg 480ctg gta cag cgt cag gct gaa gtg cgt aaa aac ggc act ctg ccg tgg 480
Leu Val Gln Arg Gln Ala Glu Val Arg Lys Asn Gly Thr Leu Pro TrpLeu Val Gln Arg Gln Ala Glu Val Arg Lys Asn Gly Thr Leu Pro Trp
145 150 155 160145 150 155 160
ctg cgc ccg gac gcg aaa agc cag gtg act ttt cag tat gac gac ggc 528ctg cgc ccg gac gcg aaa agc cag gtg act ttt cag tat gac gac ggc 528
Leu Arg Pro Asp Ala Lys Ser Gln Val Thr Phe Gln Tyr Asp Asp GlyLeu Arg Pro Asp Ala Lys Ser Gln Val Thr Phe Gln Tyr Asp Asp Gly
165 170 175165 170 175
aaa atc gtt ggt atc gat gct gtc gtg ctt tcc act cag cac tct gaa 576aaa atc gtt ggt atc gat gct gtc gtg ctt tcc act cag cac tct gaa 576
Lys Ile Val Gly Ile Asp Ala Val Val Leu Ser Thr Gln His Ser GluLys Ile Val Gly Ile Asp Ala Val Val Leu Ser Thr Gln His Ser Glu
180 185 190180 185 190
gag atc gac cag aaa tcg ctg caa gaa gcg gta atg gaa gag atc atc 624gag atc gac cag aaa tcg ctg caa gaa gcg gta atg gaa gag atc atc 624
Glu Ile Asp Gln Lys Ser Leu Gln Glu Ala Val Met Glu Glu Ile IleGlu Ile Asp Gln Lys Ser Leu Gln Glu Ala Val Met Glu Glu Ile Ile
195 200 205195 200 205
aag cca att ctg ccc gct gaa tgg ctg act tct gcc acc aaa ttc ttc 672aag cca att ctg ccc gct gaa tgg ctg act tct gcc acc aaa ttc ttc 672
Lys Pro Ile Leu Pro Ala Glu Trp Leu Thr Ser Ala Thr Lys Phe PheLys Pro Ile Leu Pro Ala Glu Trp Leu Thr Ser Ala Thr Lys Phe Phe
210 215 220210 215 220
atc aac ccg acc ggt cgt ttc gtt atc ggt ggc cca atg ggt gac tgc 720atc aac ccg acc ggt cgt ttc gtt atc ggt ggc cca atg ggt gac tgc 720
Ile Asn Pro Thr Gly Arg Phe Val Ile Gly Gly Pro Met Gly Asp CysIle Asn Pro Thr Gly Arg Phe Val Ile Gly Gly Pro Met Gly Asp Cys
225 230 235 240225 230 235 240
ggt ctg act ggt cgt aaa att atc gtt gat acc tac ggc ggc atg gcg 768ggt ctg act ggt cgt aaa att atc gtt gat acc tac ggc ggc atg gcg 768
Gly Leu Thr Gly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly Met AlaGly Leu Thr Gly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly Met Ala
245 250 255245 250 255
cgt cac ggt ggc ggt gca ttc tct ggt aaa gat cca tca aaa gtg gac 816cgt cac ggt ggc ggt gca ttc tct ggt aaa gat cca tca aaa gtg gac 816
Arg His Gly Gly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val AspArg His Gly Gly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val Asp
260 265 270260 265 270
cgt tcc gca gcc tac gca gca cgt tat gtc gcg aaa aac atc gtt gct 864cgt tcc gca gcc tac gca gca cgt tat gtc gcg aaa aac atc gtt gct 864
Arg Ser Ala Ala Tyr Ala Ala Arg Tyr Val Ala Lys Asn Ile Val AlaArg Ser Ala Ala Tyr Ala Ala Arg Tyr Val Ala Lys Asn Ile Val Ala
275 280 285275 280 285
gct ggc ctg gcc gat cgt tgt gaa att cag gtt tcc tac gca atc ggc 912gct ggc ctg gcc gat cgt tgt gaa att cag gtt tcc tac gca atc ggc 912
Ala Gly Leu Ala Asp Arg Cys Glu Ile Gln Val Ser Tyr Ala Ile GlyAla Gly Leu Ala Asp Arg Cys Glu Ile Gln Val Ser Tyr Ala Ile Gly
290 295 300290 295 300
gtg gct gaa ccg acc tcc atc atg gta gaa act ttc ggt act gag aaa 960gtg gct gaa ccg acc tcc atc atg gta gaa act ttc ggt act gag aaa 960
Val Ala Glu Pro Thr Ser Ile Met Val Glu Thr Phe Gly Thr Glu LysVal Ala Glu Pro Thr Ser Ile Met Val Glu Thr Phe Gly Thr Glu Lys
305 310 315 320305 310 315 320
gtg cct tct gaa caa ctg acc ctg ctg gta cgt gag ttc ttc gac ctg 1008gtg cct tct gaa caa ctg acc ctg ctg gta cgt gag ttc ttc gac ctg 1008
Val Pro Ser Glu Gln Leu Thr Leu Leu Val Arg Glu Phe Phe Asp LeuVal Pro Ser Glu Gln Leu Thr Leu Leu Val Arg Glu Phe Phe Asp Leu
325 330 335325 330 335
cgc cca tac ggt ctg att cag atg ctg gat ctg ctg cac ccg atc tac 1056cgc cca tac ggt ctg att cag atg ctg gat ctg ctg cac ccg atc tac 1056
Arg Pro Tyr Gly Leu Ile Gln Met Leu Asp Leu Leu His Pro Ile TyrArg Pro Tyr Gly Leu Ile Gln Met Leu Asp Leu Leu His Pro Ile Tyr
340 345 350340 345 350
aaa gaa acc gca gca tac ggt cac ttt ggt cgt gaa cat ttc ccg tgg 1104aaa gaa acc gca gca tac ggt cac ttt ggt cgt gaa cat ttc ccg tgg 1104
Lys Glu Thr Ala Ala Tyr Gly His Phe Gly Arg Glu His Phe Pro TrpLys Glu Thr Ala Ala Tyr Gly His Phe Gly Arg Glu His Phe Pro Trp
355 360 365355 360 365
gaa aaa acc gac aaa gcg cag ctg ctg cgc gat gct gcc ggt ctg aag 1152gaa aaa acc gac aaa gcg cag ctg ctg cgc gat gct gcc ggt ctg aag 1152
Glu Lys Thr Asp Lys Ala Gln Leu Leu Arg Asp Ala Ala Gly Leu LysGlu Lys Thr Asp Lys Ala Gln Leu Leu Arg Asp Ala Ala Gly Leu Lys
370 375 380370 375 380
taa 1155taa 1155
385385
<210> 2<210> 2
<211> 384<211> 384
<212> PRT<212> PRT
<213> Escherichia coli<213> Escherichia coli
<400> 2<400> 2
Met Ala Lys His Leu Phe Thr Ser Glu Ser Val Ser Glu Gly His ProMet Ala Lys His Leu Phe Thr Ser Glu Ser Val Ser Glu Gly His Pro
1 5 10 151 5 10 15
Asp Lys Ile Ala Asp Gln Ile Ser Asp Ala Val Leu Asp Ala Ile LeuAsp Lys Ile Ala Asp Gln Ile Ser Asp Ala Val Leu Asp Ala Ile Leu
20 25 3020 25 30
Glu Gln Asp Pro Lys Ala Arg Val Ala Cys Glu Thr Tyr Val Lys ThrGlu Gln Asp Pro Lys Ala Arg Val Ala Cys Glu Thr Tyr Val Lys Thr
35 40 4535 40 45
Gly Met Val Leu Val Gly Gly Glu Ile Thr Thr Ser Ala Trp Val AspGly Met Val Leu Val Gly Gly Glu Ile Thr Thr Ser Ala Trp Val Asp
50 55 6050 55 60
Ile Glu Glu Ile Thr Arg Asn Thr Val Arg Glu Ile Gly Tyr Val HisIle Glu Glu Ile Thr Arg Asn Thr Val Arg Glu Ile Gly Tyr Val His
65 70 75 8065 70 75 80
Ser Asp Met Gly Phe Asp Ala Asn Ser Cys Ala Val Leu Ser Ala IleSer Asp Met Gly Phe Asp Ala Asn Ser Cys Ala Val Leu Ser Ala Ile
85 90 9585 90 95
Gly Lys Gln Ser Pro Asp Ile Asn Gln Gly Val Asp Arg Ala Asp ProGly Lys Gln Ser Pro Asp Ile Asn Gln Gly Val Asp Arg Ala Asp Pro
100 105 110100 105 110
Leu Glu Gln Gly Ala Gly Asp Gln Gly Leu Met Phe Gly Tyr Ala ThrLeu Glu Gln Gly Ala Gly Asp Gln Gly Leu Met Phe Gly Tyr Ala Thr
115 120 125115 120 125
Asn Glu Thr Asp Val Leu Met Pro Ala Pro Ile Thr Tyr Ala His ArgAsn Glu Thr Asp Val Leu Met Pro Ala Pro Ile Thr Tyr Ala His Arg
130 135 140130 135 140
Leu Val Gln Arg Gln Ala Glu Val Arg Lys Asn Gly Thr Leu Pro TrpLeu Val Gln Arg Gln Ala Glu Val Arg Lys Asn Gly Thr Leu Pro Trp
145 150 155 160145 150 155 160
Leu Arg Pro Asp Ala Lys Ser Gln Val Thr Phe Gln Tyr Asp Asp GlyLeu Arg Pro Asp Ala Lys Ser Gln Val Thr Phe Gln Tyr Asp Asp Gly
165 170 175165 170 175
Lys Ile Val Gly Ile Asp Ala Val Val Leu Ser Thr Gln His Ser GluLys Ile Val Gly Ile Asp Ala Val Val Leu Ser Thr Gln His Ser Glu
180 185 190180 185 190
Glu Ile Asp Gln Lys Ser Leu Gln Glu Ala Val Met Glu Glu Ile IleGlu Ile Asp Gln Lys Ser Leu Gln Glu Ala Val Met Glu Glu Ile Ile
195 200 205195 200 205
Lys Pro Ile Leu Pro Ala Glu Trp Leu Thr Ser Ala Thr Lys Phe PheLys Pro Ile Leu Pro Ala Glu Trp Leu Thr Ser Ala Thr Lys Phe Phe
210 215 220210 215 220
Ile Asn Pro Thr Gly Arg Phe Val Ile Gly Gly Pro Met Gly Asp CysIle Asn Pro Thr Gly Arg Phe Val Ile Gly Gly Pro Met Gly Asp Cys
225 230 235 240225 230 235 240
Gly Leu Thr Gly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly Met AlaGly Leu Thr Gly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly Met Ala
245 250 255245 250 255
Arg His Gly Gly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val AspArg His Gly Gly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val Asp
260 265 270260 265 270
Arg Ser Ala Ala Tyr Ala Ala Arg Tyr Val Ala Lys Asn Ile Val AlaArg Ser Ala Ala Tyr Ala Ala Arg Tyr Val Ala Lys Asn Ile Val Ala
275 280 285275 280 285
Ala Gly Leu Ala Asp Arg Cys Glu Ile Gln Val Ser Tyr Ala Ile GlyAla Gly Leu Ala Asp Arg Cys Glu Ile Gln Val Ser Tyr Ala Ile Gly
290 295 300290 295 300
Val Ala Glu Pro Thr Ser Ile Met Val Glu Thr Phe Gly Thr Glu LysVal Ala Glu Pro Thr Ser Ile Met Val Glu Thr Phe Gly Thr Glu Lys
305 310 315 320305 310 315 320
Val Pro Ser Glu Gln Leu Thr Leu Leu Val Arg Glu Phe Phe Asp LeuVal Pro Ser Glu Gln Leu Thr Leu Leu Val Arg Glu Phe Phe Asp Leu
325 330 335325 330 335
Arg Pro Tyr Gly Leu Ile Gln Met Leu Asp Leu Leu His Pro Ile TyrArg Pro Tyr Gly Leu Ile Gln Met Leu Asp Leu Leu His Pro Ile Tyr
340 345 350340 345 350
Lys Glu Thr Ala Ala Tyr Gly His Phe Gly Arg Glu His Phe Pro TrpLys Glu Thr Ala Ala Tyr Gly His Phe Gly Arg Glu His Phe Pro Trp
355 360 365355 360 365
Glu Lys Thr Asp Lys Ala Gln Leu Leu Arg Asp Ala Ala Gly Leu LysGlu Lys Thr Asp Lys Ala Gln Leu Leu Arg Asp Ala Ala Gly Leu Lys
370 375 380370 375 380
<210> 3<210> 3
<211> 1206<211> 1206
<212> DNA<212> DNA
<213> Escherichia coli<213> Escherichia coli
<220><220>
<221> CDS<221> CDS
<222> (1)..(1206)<222> (1) .. (1206)
<400> 3<400> 3
atg aac gtt ttt aat ccc gcg cag ttt cgc gcc cag ttt ccc gca cta 48atg aac gtt ttt aat ccc gcg cag ttt cgc gcc cag ttt ccc gca cta 48
Met Asn Val Phe Asn Pro Ala Gln Phe Arg Ala Gln Phe Pro Ala LeuMet Asn Val Phe Asn Pro Ala Gln Phe Arg Ala Gln Phe Pro Ala Leu
1 5 10 151 5 10 15
cag gat gcg ggc gtc tat ctc gac agc gcc gcg acc gcg ctt aaa cct 96cag gat gcg ggc gtc tat ctc gac agc gcc gcg acc gcg ctt aaa cct 96
Gln Asp Ala Gly Val Tyr Leu Asp Ser Ala Ala Thr Ala Leu Lys ProGln Asp Ala Gly Val Tyr Leu Asp Ser Ala Ala Thr Ala Leu Lys Pro
20 25 3020 25 30
gaa gcc gtg gtt gaa gcc acc caa cag ttt tac agt ctg agc gcc gga 144gaa gcc gtg gtt gaa gcc acc caa cag ttt tac agt ctg agc gcc gga 144
Glu Ala Val Val Glu Ala Thr Gln Gln Phe Tyr Ser Leu Ser Ala GlyGlu Ala Val Val Glu Ala Thr Gln Gln Phe Tyr Ser Leu Ser Ala Gly
35 40 4535 40 45
aac gtc cat cgc agc cag ttt gcc gaa gcc caa cgc ctg acc gcg cgt 192aac gtc cat cgc agc cag ttt gcc gaa gcc caa cgc ctg acc gcg cgt 192
Asn Val His Arg Ser Gln Phe Ala Glu Ala Gln Arg Leu Thr Ala ArgAsn Val His Arg Ser Gln Phe Ala Glu Ala Gln Arg Leu Thr Ala Arg
50 55 6050 55 60
tat gaa gct gca cga gag aaa gtg gcg caa tta ctg aat gca ccg gat 240tat gaa gct gca cga gag aaa gtg gcg caa tta ctg aat gca ccg gat 240
Tyr Glu Ala Ala Arg Glu Lys Val Ala Gln Leu Leu Asn Ala Pro AspTyr Glu Ala Ala Arg Glu Lys Val Ala Gln Leu Leu Asn Ala Pro Asp
65 70 75 8065 70 75 80
gat aaa act atc gtc tgg acg cgc ggc acc act gaa tcc atc aac atg 288gat aaa act atc gtc tgg acg cgc ggc acc act gaa tcc atc aac atg 288
Asp Lys Thr Ile Val Trp Thr Arg Gly Thr Thr Glu Ser Ile Asn MetAsp Lys Thr Ile Val Trp Thr Arg Gly Thr Thr Glu Ser Ile Asn Met
85 90 9585 90 95
gtg gca caa tgc tat gcg cgt ccg cgt ctg caa ccg ggc gat gag att 336gtg gca caa tgc tat gcg cgt ccg cgt ctg caa ccg ggc gat gag att 336
Val Ala Gln Cys Tyr Ala Arg Pro Arg Leu Gln Pro Gly Asp Glu IleVal Ala Gln Cys Tyr Ala Arg Pro Arg Leu Gln Pro Gly Asp Glu Ile
100 105 110100 105 110
att gtc agc gtg gca gaa cac cac gcc aac ctc gtc ccc tgg ctg atg 384att gtc agc gtg gca gaa cac cac gcc aac ctc gtc ccc tgg ctg atg 384
Ile Val Ser Val Ala Glu His His Ala Asn Leu Val Pro Trp Leu MetIle Val Ser Val Ala Glu His His Ala Asn Leu Val Pro Trp Leu Met
115 120 125115 120 125
gtc gcc caa caa act gga gcc aaa gtg gtg aaa ttg ccg ctt aat gcg 432gtc gcc caa caa act gga gcc aaa gtg gtg aaa ttg ccg ctt aat gcg 432
Val Ala Gln Gln Thr Gly Ala Lys Val Val Lys Leu Pro Leu Asn AlaVal Ala Gln Gln Thr Gly Ala Lys Val Val Lys Leu Pro Leu Asn Ala
130 135 140130 135 140
cag cga ctg ccg gat gtc gat ttg ttg cca gaa ctg att act ccc cgt 480cag cga ctg ccg gat gtc gat ttg ttg cca gaa ctg att act ccc cgt 480
Gln Arg Leu Pro Asp Val Asp Leu Leu Pro Glu Leu Ile Thr Pro ArgGln Arg Leu Pro Asp Val Asp Leu Leu Pro Glu Leu Ile Thr Pro Arg
145 150 155 160145 150 155 160
agt cgg att ctg gcg ttg ggt cag atg tcg aac gtt act ggc ggt tgc 528agt cgg att ctg gcg ttg ggt cag atg tcg aac gtt act ggc ggt tgc 528
Ser Arg Ile Leu Ala Leu Gly Gln Met Ser Asn Val Thr Gly Gly CysSer Arg Ile Leu Ala Leu Gly Gln Met Ser Asn Val Thr Gly Gly Cys
165 170 175165 170 175
ccg gat ctg gcg cga gcg att acc ttt gct cat tca gcc ggg atg gtg 576ccg gat ctg gcg cga gcg att acc ttt gct cat tca gcc ggg atg gtg 576
Pro Asp Leu Ala Arg Ala Ile Thr Phe Ala His Ser Ala Gly Met ValPro Asp Leu Ala Arg Ala Ile Thr Phe Ala His Ser Ala Gly Met Val
180 185 190180 185 190
gtg atg gtt gat ggt gct cag ggg gca gtg cat ttc ccc gcg gat gtt 624gtg atg gtt gat ggt gct cag ggg gca gtg cat ttc ccc gcg gat gtt 624
Val Met Val Asp Gly Ala Gln Gly Ala Val His Phe Pro Ala Asp ValVal Met Val Asp Gly Ala Gln Gly Ala Val His Phe Pro Ala Asp Val
195 200 205195 200 205
cag caa ctg gat att gat ttc tat gct ttt tca ggt cac aaa ctg tat 672cag caa ctg gat att gat ttc tat gct ttt tca ggt cac aaa ctg tat 672
Gln Gln Leu Asp Ile Asp Phe Tyr Ala Phe Ser Gly His Lys Leu TyrGln Gln Leu Asp Ile Asp Phe Tyr Ala Phe Ser Gly His Lys Leu Tyr
210 215 220210 215 220
ggc ccg aca ggt atc ggc gtg ctg tat ggt aaa tca gaa ctg ctg gag 720ggc ccg aca ggt atc ggc gtg ctg tat ggt aaa tca gaa ctg ctg gag 720
Gly Pro Thr Gly Ile Gly Val Leu Tyr Gly Lys Ser Glu Leu Leu GluGly Pro Thr Gly Ile Gly Val Leu Tyr Gly Lys Ser Glu Leu Leu Glu
225 230 235 240225 230 235 240
gcg atg tcg ccc tgg ctg ggc ggc ggc aaa atg gtt cac gaa gtg agt 768gcg atg tcg ccc tgg ctg ggc ggc ggc aaa atg gtt cac gaa gtg agt 768
Ala Met Ser Pro Trp Leu Gly Gly Gly Lys Met Val His Glu Val SerAla Met Ser Pro Trp Leu Gly Gly Gly Lys Met Val His Glu Val Ser
245 250 255245 250 255
ttt gac ggc ttc acg act caa tct gcg ccg tgg aaa ctg gaa gct gga 816ttt gac ggc ttc acg act caa tct gcg ccg tgg aaa ctg gaa gct gga 816
Phe Asp Gly Phe Thr Thr Gln Ser Ala Pro Trp Lys Leu Glu Ala GlyPhe Asp Gly Phe Thr Thr Gln Ser Ala Pro Trp Lys Leu Glu Ala Gly
260 265 270260 265 270
acg cca aat gtc gct ggt gtc ata gga tta agc gcg gcg ctg gaa tgg 864acg cca aat gtc gct ggt gtc ata gga tta agc gcg gcg ctg gaa tgg 864
Thr Pro Asn Val Ala Gly Val Ile Gly Leu Ser Ala Ala Leu Glu TrpThr Pro Asn Val Ala Gly Val Ile Gly Leu Ser Ala Ala Leu Glu Trp
275 280 285275 280 285
ctg gca gat tac gat atc aac cag gcc gaa agc tgg agc cgt agc tta 912ctg gca gat tac gat atc aac cag gcc gaa agc tgg agc cgt agc tta 912
Leu Ala Asp Tyr Asp Ile Asn Gln Ala Glu Ser Trp Ser Arg Ser LeuLeu Ala Asp Tyr Asp Ile Asn Gln Ala Glu Ser Trp Ser Arg Ser Leu
290 295 300290 295 300
gca acg ctg gcg gaa gat gcg ctg gcg aaa cgt ccc ggc ttt cgt tca 960gca acg ctg gcg gaa gat gcg ctg gcg aaa cgt ccc ggc ttt cgt tca 960
Ala Thr Leu Ala Glu Asp Ala Leu Ala Lys Arg Pro Gly Phe Arg SerAla Thr Leu Ala Glu Asp Ala Leu Ala Lys Arg Pro Gly Phe Arg Ser
305 310 315 320305 310 315 320
ttc cgc tgc cag gat tcc agc ctg ctg gcc ttt gat ttt gct ggc gtt 1008ttc cgc tgc cag gat tcc agc ctg ctg gcc ttt gat ttt gct ggc gtt 1008
Phe Arg Cys Gln Asp Ser Ser Leu Leu Ala Phe Asp Phe Ala Gly ValPhe Arg Cys Gln Asp Ser Ser Leu Leu Ala Phe Asp Phe Ala Gly Val
325 330 335325 330 335
cat cat agc gat atg gtg acg ctg ctg gcg gag tac ggt att gcc ctg 1056cat cat agc gat atg gtg acg ctg ctg gcg gag tac ggt att gcc ctg 1056
His His Ser Asp Met Val Thr Leu Leu Ala Glu Tyr Gly Ile Ala LeuHis His Ser Asp Met Val Thr Leu Leu Ala Glu Tyr Gly Ile Ala Leu
340 345 350340 345 350
cgg gcc ggg cag cat tgc gct cag ccg cta ctg gca gaa tta ggc gta 1104cgg gcc ggg cag cat tgc gct cag ccg cta ctg gca gaa tta ggc gta 1104
Arg Ala Gly Gln His Cys Ala Gln Pro Leu Leu Ala Glu Leu Gly ValArg Ala Gly Gln His Cys Ala Gln Pro Leu Leu Ala Glu Leu Gly Val
355 360 365355 360 365
acc ggc aca ctg cgc gcc tct ttt gcg cca tat aat aca aag agt gat 1152acc ggc aca ctg cgc gcc tct ttt gcg cca tat aat aca aag agt gat 1152
Thr Gly Thr Leu Arg Ala Ser Phe Ala Pro Tyr Asn Thr Lys Ser AspThr Gly Thr Leu Arg Ala Ser Phe Ala Pro Tyr Asn Thr Lys Ser Asp
370 375 380370 375 380
gtg gat gcg ctg gtg aat gcc gtt gac cgc gcg ctg gaa tta ttg gtg 1200gtg gat gcg ctg gtg aat gcc gtt gac cgc gcg ctg gaa tta ttg gtg 1200
Val Asp Ala Leu Val Asn Ala Val Asp Arg Ala Leu Glu Leu Leu ValVal Asp Ala Leu Val Asn Ala Val Asp Arg Ala Leu Glu Leu Leu Val
385 390 395 400385 390 395 400
gat taa 1206gat taa 1206
AspAsp
<210> 4<210> 4
<211> 401<211> 401
<212> PRT<212> PRT
<213> Escherichia coli<213> Escherichia coli
<400> 4<400> 4
Met Asn Val Phe Asn Pro Ala Gln Phe Arg Ala Gln Phe Pro Ala LeuMet Asn Val Phe Asn Pro Ala Gln Phe Arg Ala Gln Phe Pro Ala Leu
1 5 10 151 5 10 15
Gln Asp Ala Gly Val Tyr Leu Asp Ser Ala Ala Thr Ala Leu Lys ProGln Asp Ala Gly Val Tyr Leu Asp Ser Ala Ala Thr Ala Leu Lys Pro
20 25 3020 25 30
Glu Ala Val Val Glu Ala Thr Gln Gln Phe Tyr Ser Leu Ser Ala GlyGlu Ala Val Val Glu Ala Thr Gln Gln Phe Tyr Ser Leu Ser Ala Gly
35 40 4535 40 45
Asn Val His Arg Ser Gln Phe Ala Glu Ala Gln Arg Leu Thr Ala ArgAsn Val His Arg Ser Gln Phe Ala Glu Ala Gln Arg Leu Thr Ala Arg
50 55 6050 55 60
Tyr Glu Ala Ala Arg Glu Lys Val Ala Gln Leu Leu Asn Ala Pro AspTyr Glu Ala Ala Arg Glu Lys Val Ala Gln Leu Leu Asn Ala Pro Asp
65 70 75 8065 70 75 80
Asp Lys Thr Ile Val Trp Thr Arg Gly Thr Thr Glu Ser Ile Asn MetAsp Lys Thr Ile Val Trp Thr Arg Gly Thr Thr Glu Ser Ile Asn Met
85 90 9585 90 95
Val Ala Gln Cys Tyr Ala Arg Pro Arg Leu Gln Pro Gly Asp Glu IleVal Ala Gln Cys Tyr Ala Arg Pro Arg Leu Gln Pro Gly Asp Glu Ile
100 105 110100 105 110
Ile Val Ser Val Ala Glu His His Ala Asn Leu Val Pro Trp Leu MetIle Val Ser Val Ala Glu His His Ala Asn Leu Val Pro Trp Leu Met
115 120 125115 120 125
Val Ala Gln Gln Thr Gly Ala Lys Val Val Lys Leu Pro Leu Asn AlaVal Ala Gln Gln Thr Gly Ala Lys Val Val Lys Leu Pro Leu Asn Ala
130 135 140130 135 140
Gln Arg Leu Pro Asp Val Asp Leu Leu Pro Glu Leu Ile Thr Pro ArgGln Arg Leu Pro Asp Val Asp Leu Leu Pro Glu Leu Ile Thr Pro Arg
145 150 155 160145 150 155 160
Ser Arg Ile Leu Ala Leu Gly Gln Met Ser Asn Val Thr Gly Gly CysSer Arg Ile Leu Ala Leu Gly Gln Met Ser Asn Val Thr Gly Gly Cys
165 170 175165 170 175
Pro Asp Leu Ala Arg Ala Ile Thr Phe Ala His Ser Ala Gly Met ValPro Asp Leu Ala Arg Ala Ile Thr Phe Ala His Ser Ala Gly Met Val
180 185 190180 185 190
Val Met Val Asp Gly Ala Gln Gly Ala Val His Phe Pro Ala Asp ValVal Met Val Asp Gly Ala Gln Gly Ala Val His Phe Pro Ala Asp Val
195 200 205195 200 205
Gln Gln Leu Asp Ile Asp Phe Tyr Ala Phe Ser Gly His Lys Leu TyrGln Gln Leu Asp Ile Asp Phe Tyr Ala Phe Ser Gly His Lys Leu Tyr
210 215 220210 215 220
Gly Pro Thr Gly Ile Gly Val Leu Tyr Gly Lys Ser Glu Leu Leu GluGly Pro Thr Gly Ile Gly Val Leu Tyr Gly Lys Ser Glu Leu Leu Glu
225 230 235 240225 230 235 240
Ala Met Ser Pro Trp Leu Gly Gly Gly Lys Met Val His Glu Val SerAla Met Ser Pro Trp Leu Gly Gly Gly Lys Met Val His Glu Val Ser
245 250 255245 250 255
Phe Asp Gly Phe Thr Thr Gln Ser Ala Pro Trp Lys Leu Glu Ala GlyPhe Asp Gly Phe Thr Thr Gln Ser Ala Pro Trp Lys Leu Glu Ala Gly
260 265 270260 265 270
Thr Pro Asn Val Ala Gly Val Ile Gly Leu Ser Ala Ala Leu Glu TrpThr Pro Asn Val Ala Gly Val Ile Gly Leu Ser Ala Ala Leu Glu Trp
275 280 285275 280 285
Leu Ala Asp Tyr Asp Ile Asn Gln Ala Glu Ser Trp Ser Arg Ser LeuLeu Ala Asp Tyr Asp Ile Asn Gln Ala Glu Ser Trp Ser Arg Ser Leu
290 295 300290 295 300
Ala Thr Leu Ala Glu Asp Ala Leu Ala Lys Arg Pro Gly Phe Arg SerAla Thr Leu Ala Glu Asp Ala Leu Ala Lys Arg Pro Gly Phe Arg Ser
305 310 315 320305 310 315 320
Phe Arg Cys Gln Asp Ser Ser Leu Leu Ala Phe Asp Phe Ala Gly ValPhe Arg Cys Gln Asp Ser Ser Leu Leu Ala Phe Asp Phe Ala Gly Val
325 330 335325 330 335
His His Ser Asp Met Val Thr Leu Leu Ala Glu Tyr Gly Ile Ala LeuHis His Ser Asp Met Val Thr Leu Leu Ala Glu Tyr Gly Ile Ala Leu
340 345 350340 345 350
Arg Ala Gly Gln His Cys Ala Gln Pro Leu Leu Ala Glu Leu Gly ValArg Ala Gly Gln His Cys Ala Gln Pro Leu Leu Ala Glu Leu Gly Val
355 360 365355 360 365
Thr Gly Thr Leu Arg Ala Ser Phe Ala Pro Tyr Asn Thr Lys Ser AspThr Gly Thr Leu Arg Ala Ser Phe Ala Pro Tyr Asn Thr Lys Ser Asp
370 375 380370 375 380
Val Asp Ala Leu Val Asn Ala Val Asp Arg Ala Leu Glu Leu Leu ValVal Asp Ala Leu Val Asn Ala Val Asp Arg Ala Leu Glu Leu Leu Val
385 390 395 400385 390 395 400
AspAsp
<210> 5<210> 5
<211> 1215<211> 1215
<212> DNA<212> DNA
<213> Escherichia coli<213> Escherichia coli
<220><220>
<221> CDS<221> CDS
<222> (1)..(1215)<222> (1) .. (1215)
<400> 5<400> 5
atg aaa tta ccg att tat ctc gac tac tcc gca acc acg ccg gtg gac 48atg aaa tta ccg att tat ctc gac tac tcc gca acc acg ccg gtg gac 48
Met Lys Leu Pro Ile Tyr Leu Asp Tyr Ser Ala Thr Thr Pro Val AspMet Lys Leu Pro Ile Tyr Leu Asp Tyr Ser Ala Thr Thr Pro Val Asp
1 5 10 151 5 10 15
ccg cgt gtt gcc gag aaa atg atg cag ttt atg acg atg gac gga acc 96ccg cgt gtt gcc gag aaa atg atg cag ttt atg acg atg gac gga acc 96
Pro Arg Val Ala Glu Lys Met Met Gln Phe Met Thr Met Asp Gly ThrPro Arg Val Ala Glu Lys Met Met Gln Phe Met Thr Met Asp Gly Thr
20 25 3020 25 30
ttt ggt aac ccg gcc tcc cgt tct cac cgt ttc ggc tgg cag gct gaa 144ttt ggt aac ccg gcc tcc cgt tct cac cgt ttc ggc tgg cag gct gaa 144
Phe Gly Asn Pro Ala Ser Arg Ser His Arg Phe Gly Trp Gln Ala GluPhe Gly Asn Pro Ala Ser Arg Ser His Arg Phe Gly Trp Gln Ala Glu
35 40 4535 40 45
gaa gcg gta gat atc gcc cgt aat cag att gcc gat ctg gtc ggc gct 192gaa gcg gta gat atc gcc cgt aat cag att gcc gat ctg gtc ggc gct 192
Glu Ala Val Asp Ile Ala Arg Asn Gln Ile Ala Asp Leu Val Gly AlaGlu Ala Val Asp Ile Ala Arg Asn Gln Ile Ala Asp Leu Val Gly Ala
50 55 6050 55 60
gat ccg cgt gaa atc gtc ttt acc tct ggt gca acc gaa tct gac aac 240gat ccg cgt gaa atc gtc ttt acc tct ggt gca acc gaa tct gac aac 240
Asp Pro Arg Glu Ile Val Phe Thr Ser Gly Ala Thr Glu Ser Asp AsnAsp Pro Arg Glu Ile Val Phe Thr Ser Gly Ala Thr Glu Ser Asp Asn
65 70 75 8065 70 75 80
ctg gcg atc aaa ggt gca gcc aac ttt tat cag aaa aaa ggc aag cac 288ctg gcg atc aaa ggt gca gcc aac ttt tat cag aaa aaa ggc aag cac 288
Leu Ala Ile Lys Gly Ala Ala Asn Phe Tyr Gln Lys Lys Gly Lys HisLeu Ala Ile Lys Gly Ala Ala Asn Phe Tyr Gln Lys Lys Gly Lys His
85 90 9585 90 95
atc atc acc agc aaa acc gaa cac aaa gcg gta ctg gat acc tgc cgt 336atc atc acc agc aaa acc gaa cac aaa gcg gta ctg gat acc tgc cgt 336
Ile Ile Thr Ser Lys Thr Glu His Lys Ala Val Leu Asp Thr Cys ArgIle Ile Thr Ser Lys Thr Glu His Lys Ala Val Leu Asp Thr Cys Arg
100 105 110100 105 110
cag ctg gag cgc gaa ggt ttt gaa gtc acc tac ctg gca ccg cag cgt 384cag ctg gag cgc gaa ggt ttt gaa gtc acc tac ctg gca ccg cag cgt 384
Gln Leu Glu Arg Glu Gly Phe Glu Val Thr Tyr Leu Ala Pro Gln ArgGln Leu Glu Arg Glu Gly Phe Glu Val Thr Tyr Leu Ala Pro Gln Arg
115 120 125115 120 125
aac ggc att atc gac ctg aaa gaa ctt gaa gca gcg atg cgt gac gac 432aac ggc att atc gac ctg aaa gaa ctt gaa gca gcg atg cgt gac gac 432
Asn Gly Ile Ile Asp Leu Lys Glu Leu Glu Ala Ala Met Arg Asp AspAsn Gly Ile Ile Asp Leu Lys Glu Leu Glu Ala Ala Met Arg Asp Asp
130 135 140130 135 140
acc atc ctc gtg tcc atc atg cac gta aat aac gaa atc ggc gtg gtg 480acc atc ctc gtg tcc atc atg cac gta aat aac gaa atc ggc gtg gtg 480
Thr Ile Leu Val Ser Ile Met His Val Asn Asn Glu Ile Gly Val ValThr Ile Leu Val Ser Ile Met His Val Asn Asn Glu Ile Gly Val Val
145 150 155 160145 150 155 160
cag gat atc gcg gct atc ggc gaa atg tgc cgt gct cgt ggc att atc 528cag gat atc gcg gct atc ggc gaa atg tgc cgt gct cgt ggc att atc 528
Gln Asp Ile Ala Ala Ile Gly Glu Met Cys Arg Ala Arg Gly Ile IleGln Asp Ile Ala Ala Ile Gly Glu Met Cys Arg Ala Arg Gly Ile Ile
165 170 175165 170 175
tat cac gtt gat gca acc cag agc gtg ggt aaa ctg cct atc gac ctg 576tat cac gtt gat gca acc cag agc gtg ggt aaa ctg cct atc gac ctg 576
Tyr His Val Asp Ala Thr Gln Ser Val Gly Lys Leu Pro Ile Asp LeuTyr His Val Asp Ala Thr Gln Ser Val Gly Lys Leu Pro Ile Asp Leu
180 185 190180 185 190
agc cag ttg aaa gtt gac ctg atg tct ttc tcc ggt cac aaa atc tat 624agc cag ttg aaa gtt gac ctg atg tct ttc tcc ggt cac aaa atc tat 624
Ser Gln Leu Lys Val Asp Leu Met Ser Phe Ser Gly His Lys Ile TyrSer Gln Leu Lys Val Asp Leu Met Ser Phe Ser Gly His Lys Ile Tyr
195 200 205195 200 205
ggc ccg aaa ggt atc ggt gcg ctg tat gta cgt cgt aaa ccg cgc gta 672ggc ccg aaa ggt atc ggt gcg ctg tat gta cgt cgt aaa ccg cgc gta 672
Gly Pro Lys Gly Ile Gly Ala Leu Tyr Val Arg Arg Lys Pro Arg ValGly Pro Lys Gly Ile Gly Ala Leu Tyr Val Arg Arg Lys Pro Arg Val
210 215 220210 215 220
cgc atc gaa gcg caa atg cac ggc ggc ggt cac gag cgc ggt atg cgt 720cgc atc gaa gcg caa atg cac ggc ggc ggt cac gag cgc ggt atg cgt 720
Arg Ile Glu Ala Gln Met His Gly Gly Gly His Glu Arg Gly Met ArgArg Ile Glu Ala Gln Met His Gly Gly Gly His Glu Arg Gly Met Arg
225 230 235 240225 230 235 240
tcc ggc act ctg cct gtt cac cag atc gtc gga atg ggc gag gcc tat 768tcc ggc act ctg cct gtt cac cag atc gtc gga atg ggc gag gcc tat 768
Ser Gly Thr Leu Pro Val His Gln Ile Val Gly Met Gly Glu Ala TyrSer Gly Thr Leu Pro Val His Gln Ile Val Gly Met Gly Glu Ala Tyr
245 250 255245 250 255
cgc atc gca aaa gaa gag atg gcg acc gag atg gaa cgt ctg cgc ggc 816cgc atc gca aaa gaa gag atg gcg acc gag atg gaa cgt ctg cgc ggc 816
Arg Ile Ala Lys Glu Glu Met Ala Thr Glu Met Glu Arg Leu Arg GlyArg Ile Ala Lys Glu Glu Met Ala Thr Glu Met Glu Arg Leu Arg Gly
260 265 270260 265 270
ctg cgt aac cgt ctg tgg aac ggc atc aaa gat atc gaa gaa gtt tac 864ctg cgt aac cgt ctg tgg aac ggc atc aaa gat atc gaa gaa gtt tac 864
Leu Arg Asn Arg Leu Trp Asn Gly Ile Lys Asp Ile Glu Glu Val TyrLeu Arg Asn Arg Leu Trp Asn Gly Ile Lys Asp Ile Glu Glu Val Tyr
275 280 285275 280 285
ctg aac ggt gac ctg gaa cac ggt gcg ccg aac att ctc aac gtc agc 912ctg aac ggt gac ctg gaa cac ggt gcg ccg aac att ctc aac gtc agc 912
Leu Asn Gly Asp Leu Glu His Gly Ala Pro Asn Ile Leu Asn Val SerLeu Asn Gly Asp Leu Glu His Gly Ala Pro Asn Ile Leu Asn Val Ser
290 295 300290 295 300
ttc aac tac gtt gaa ggt gag tcg ctg att atg gcg ctg aaa gac ctc 960ttc aac tac gtt gaa ggt gag tcg ctg att atg gcg ctg aaa gac ctc 960
Phe Asn Tyr Val Glu Gly Glu Ser Leu Ile Met Ala Leu Lys Asp LeuPhe Asn Tyr Val Glu Gly Glu Ser Leu Ile Met Ala Leu Lys Asp Leu
305 310 315 320305 310 315 320
gca gtt tct tca ggt tcc gcc tgt acg tca gca agc ctc gaa ccg tcc 1008gca gtt tct tca ggt tcc gcc tgt acg tca gca agc ctc gaa ccg tcc 1008
Ala Val Ser Ser Gly Ser Ala Cys Thr Ser Ala Ser Leu Glu Pro SerAla Val Ser Ser Gly Ser Ala Cys Thr Ser Ala Ser Leu Glu Pro Ser
325 330 335325 330 335
tac gtg ctg cgc gcg ctg ggg ctg aac gac gag ctg gca cat agc tct 1056tac gtg ctg cgc gcg ctg ggg ctg aac gac gag ctg gca cat agc tct 1056
Tyr Val Leu Arg Ala Leu Gly Leu Asn Asp Glu Leu Ala His Ser SerTyr Val Leu Arg Ala Leu Gly Leu Asn Asp Glu Leu Ala His Ser Ser
340 345 350340 345 350
atc cgt ttc tct tta ggt cgt ttt act act gaa gaa gag atc gac tac 1104atc cgt ttc tct tta ggt cgt ttt act act gaa gaa gag atc gac tac 1104
Ile Arg Phe Ser Leu Gly Arg Phe Thr Thr Glu Glu Glu Ile Asp TyrIle Arg Phe Ser Leu Gly Arg Phe Thr Thr Glu Glu Glu Ile Asp Tyr
355 360 365355 360 365
acc atc gag tta gtt cgt aaa tcc atc ggt cgt ctg cgt gac ctt tct 1152acc atc gag tta gtt cgt aaa tcc atc ggt cgt ctg cgt gac ctt tct 1152
Thr Ile Glu Leu Val Arg Lys Ser Ile Gly Arg Leu Arg Asp Leu SerThr Ile Glu Leu Val Arg Lys Ser Ile Gly Arg Leu Arg Asp Leu Ser
370 375 380370 375 380
ccg ctg tgg gaa atg tac aag cag ggc gtg gat ctg aac agc atc gaa 1200ccg ctg tgg gaa atg tac aag cag ggc gtg gat ctg aac agc atc gaa 1200
Pro Leu Trp Glu Met Tyr Lys Gln Gly Val Asp Leu Asn Ser Ile GluPro Leu Trp Glu Met Tyr Lys Gln Gly Val Asp Leu Asn Ser Ile Glu
385 390 395 400385 390 395 400
tgg gct cat cat taa 1215tgg gct cat cat taa 1215
Trp Ala His HisTrp Ala His His
405405
<210> 6<210> 6
<211> 404<211> 404
<212> PRT<212> PRT
<213> Escherichia coli<213> Escherichia coli
<400> 6<400> 6
Met Lys Leu Pro Ile Tyr Leu Asp Tyr Ser Ala Thr Thr Pro Val AspMet Lys Leu Pro Ile Tyr Leu Asp Tyr Ser Ala Thr Thr Pro Val Asp
1 5 10 151 5 10 15
Pro Arg Val Ala Glu Lys Met Met Gln Phe Met Thr Met Asp Gly ThrPro Arg Val Ala Glu Lys Met Met Gln Phe Met Thr Met Asp Gly Thr
20 25 3020 25 30
Phe Gly Asn Pro Ala Ser Arg Ser His Arg Phe Gly Trp Gln Ala GluPhe Gly Asn Pro Ala Ser Arg Ser His Arg Phe Gly Trp Gln Ala Glu
35 40 4535 40 45
Glu Ala Val Asp Ile Ala Arg Asn Gln Ile Ala Asp Leu Val Gly AlaGlu Ala Val Asp Ile Ala Arg Asn Gln Ile Ala Asp Leu Val Gly Ala
50 55 6050 55 60
Asp Pro Arg Glu Ile Val Phe Thr Ser Gly Ala Thr Glu Ser Asp AsnAsp Pro Arg Glu Ile Val Phe Thr Ser Gly Ala Thr Glu Ser Asp Asn
65 70 75 8065 70 75 80
Leu Ala Ile Lys Gly Ala Ala Asn Phe Tyr Gln Lys Lys Gly Lys HisLeu Ala Ile Lys Gly Ala Ala Asn Phe Tyr Gln Lys Lys Gly Lys His
85 90 9585 90 95
Ile Ile Thr Ser Lys Thr Glu His Lys Ala Val Leu Asp Thr Cys ArgIle Ile Thr Ser Lys Thr Glu His Lys Ala Val Leu Asp Thr Cys Arg
100 105 110100 105 110
Gln Leu Glu Arg Glu Gly Phe Glu Val Thr Tyr Leu Ala Pro Gln ArgGln Leu Glu Arg Glu Gly Phe Glu Val Thr Tyr Leu Ala Pro Gln Arg
115 120 125115 120 125
Asn Gly Ile Ile Asp Leu Lys Glu Leu Glu Ala Ala Met Arg Asp AspAsn Gly Ile Ile Asp Leu Lys Glu Leu Glu Ala Ala Met Arg Asp Asp
130 135 140130 135 140
Thr Ile Leu Val Ser Ile Met His Val Asn Asn Glu Ile Gly Val ValThr Ile Leu Val Ser Ile Met His Val Asn Asn Glu Ile Gly Val Val
145 150 155 160145 150 155 160
Gln Asp Ile Ala Ala Ile Gly Glu Met Cys Arg Ala Arg Gly Ile IleGln Asp Ile Ala Ala Ile Gly Glu Met Cys Arg Ala Arg Gly Ile Ile
165 170 175165 170 175
Tyr His Val Asp Ala Thr Gln Ser Val Gly Lys Leu Pro Ile Asp LeuTyr His Val Asp Ala Thr Gln Ser Val Gly Lys Leu Pro Ile Asp Leu
180 185 190180 185 190
Ser Gln Leu Lys Val Asp Leu Met Ser Phe Ser Gly His Lys Ile TyrSer Gln Leu Lys Val Asp Leu Met Ser Phe Ser Gly His Lys Ile Tyr
195 200 205195 200 205
Gly Pro Lys Gly Ile Gly Ala Leu Tyr Val Arg Arg Lys Pro Arg ValGly Pro Lys Gly Ile Gly Ala Leu Tyr Val Arg Arg Lys Pro Arg Val
210 215 220210 215 220
Arg Ile Glu Ala Gln Met His Gly Gly Gly His Glu Arg Gly Met ArgArg Ile Glu Ala Gln Met His Gly Gly Gly His Glu Arg Gly Met Arg
225 230 235 240225 230 235 240
Ser Gly Thr Leu Pro Val His Gln Ile Val Gly Met Gly Glu Ala TyrSer Gly Thr Leu Pro Val His Gln Ile Val Gly Met Gly Glu Ala Tyr
245 250 255245 250 255
Arg Ile Ala Lys Glu Glu Met Ala Thr Glu Met Glu Arg Leu Arg GlyArg Ile Ala Lys Glu Glu Met Ala Thr Glu Met Glu Arg Leu Arg Gly
260 265 270260 265 270
Leu Arg Asn Arg Leu Trp Asn Gly Ile Lys Asp Ile Glu Glu Val TyrLeu Arg Asn Arg Leu Trp Asn Gly Ile Lys Asp Ile Glu Glu Val Tyr
275 280 285275 280 285
Leu Asn Gly Asp Leu Glu His Gly Ala Pro Asn Ile Leu Asn Val SerLeu Asn Gly Asp Leu Glu His Gly Ala Pro Asn Ile Leu Asn Val Ser
290 295 300290 295 300
Phe Asn Tyr Val Glu Gly Glu Ser Leu Ile Met Ala Leu Lys Asp LeuPhe Asn Tyr Val Glu Gly Glu Ser Leu Ile Met Ala Leu Lys Asp Leu
305 310 315 320305 310 315 320
Ala Val Ser Ser Gly Ser Ala Cys Thr Ser Ala Ser Leu Glu Pro SerAla Val Ser Ser Gly Ser Ala Cys Thr Ser Ala Ser Leu Glu Pro Ser
325 330 335325 330 335
Tyr Val Leu Arg Ala Leu Gly Leu Asn Asp Glu Leu Ala His Ser SerTyr Val Leu Arg Ala Leu Gly Leu Asn Asp Glu Leu Ala His Ser Ser
340 345 350340 345 350
Ile Arg Phe Ser Leu Gly Arg Phe Thr Thr Glu Glu Glu Ile Asp TyrIle Arg Phe Ser Leu Gly Arg Phe Thr Thr Glu Glu Glu Ile Asp Tyr
355 360 365355 360 365
Thr Ile Glu Leu Val Arg Lys Ser Ile Gly Arg Leu Arg Asp Leu SerThr Ile Glu Leu Val Arg Lys Ser Ile Gly Arg Leu Arg Asp Leu Ser
370 375 380370 375 380
Pro Leu Trp Glu Met Tyr Lys Gln Gly Val Asp Leu Asn Ser Ile GluPro Leu Trp Glu Met Tyr Lys Gln Gly Val Asp Leu Asn Ser Ile Glu
385 390 395 400385 390 395 400
Trp Ala His HisTrp Ala His His
<210> 7<210> 7
<211> 1221<211> 1221
<212> DNA<212> DNA
<213> Escherichia coli<213> Escherichia coli
<220><220>
<221> CDS<221> CDS
<222> (1)..(1221)<222> (1) .. (1221)
<400> 7<400> 7
atg att ttt tcc gtc gac aaa gtg cgg gcc gac ttt ccg gtg ctt tcg 48atg att ttt tcc gtc gac aaa gtg cgg gcc gac ttt ccg gtg ctt tcg 48
Met Ile Phe Ser Val Asp Lys Val Arg Ala Asp Phe Pro Val Leu SerMet Ile Phe Ser Val Asp Lys Val Arg Ala Asp Phe Pro Val Leu Ser
1 5 10 151 5 10 15
cgt gag gta aac ggt ttg ccg ctg gct tat ctc gac agc gcc gcc agt 96cgt gag gta aac ggt ttg ccg ctg gct tat ctc gac agc gcc gcc agt 96
Arg Glu Val Asn Gly Leu Pro Leu Ala Tyr Leu Asp Ser Ala Ala SerArg Glu Val Asn Gly Leu Pro Leu Ala Tyr Leu Asp Ser Ala Ala Ser
20 25 3020 25 30
gcg cag aaa ccg agc cag gtg att gac gcc gag gcc gag ttt tat cgt 144gcg cag aaa ccg agc cag gtg att gac gcc gag gcc gag ttt tat cgt 144
Ala Gln Lys Pro Ser Gln Val Ile Asp Ala Glu Ala Glu Phe Tyr ArgAla Gln Lys Pro Ser Gln Val Ile Asp Ala Glu Ala Glu Phe Tyr Arg
35 40 4535 40 45
cat ggc tac gcg gcg gtg cat cgt ggt att cat acc tta agc gcc cag 192cat ggc tac gcg gcg gtg cat cgt ggt att cat acc tta agc gcc cag 192
His Gly Tyr Ala Ala Val His Arg Gly Ile His Thr Leu Ser Ala GlnHis Gly Tyr Ala Ala Val His Arg Gly Ile His Thr Leu Ser Ala Gln
50 55 6050 55 60
gcg acc gag aaa atg gag aac gtg cgc aag cgg gca tcg ctg ttt att 240gcg acc gag aaa atg gag aac gtg cgc aag cgg gca tcg ctg ttt att 240
Ala Thr Glu Lys Met Glu Asn Val Arg Lys Arg Ala Ser Leu Phe IleAla Thr Glu Lys Met Glu Asn Val Arg Lys Arg Ala Ser Leu Phe Ile
65 70 75 8065 70 75 80
aat gcc cgt tcg gcg gaa gag ctg gtg ttc gtc cgc ggc acg acg gaa 288aat gcc cgt tcg gcg gaa gag ctg gtg ttc gtc cgc ggc acg acg gaa 288
Asn Ala Arg Ser Ala Glu Glu Leu Val Phe Val Arg Gly Thr Thr GluAsn Ala Arg Ser Ala Glu Glu Leu Val Phe Val Arg Gly Thr Thr Glu
85 90 9585 90 95
ggg atc aat ctg gtc gcc aat agc tgg ggc aac agc aac gtg cgg gcg 336ggg atc aat ctg gtc gcc aat agc tgg ggc aac agc aac gtg cgg gcg 336
Gly Ile Asn Leu Val Ala Asn Ser Trp Gly Asn Ser Asn Val Arg AlaGly Ile Asn Leu Val Ala Asn Ser Trp Gly Asn Ser Asn Val Arg Ala
100 105 110100 105 110
ggc gat aac atc atc atc agt cag atg gag cac cac gct aac att gtt 384ggc gat aac atc atc atc agt cag atg gag cac cac gct aac att gtt 384
Gly Asp Asn Ile Ile Ile Ser Gln Met Glu His His Ala Asn Ile ValGly Asp Asn Ile Ile Ile Ser Gln Met Glu His His Ala Asn Ile Val
115 120 125115 120 125
ccc tgg cag atg ctt tgc gca cgc gtt ggc gca gag ctg cgt gtg atc 432ccc tgg cag atg ctt tgc gca cgc gtt ggc gca gag ctg cgt gtg atc 432
Pro Trp Gln Met Leu Cys Ala Arg Val Gly Ala Glu Leu Arg Val IlePro Trp Gln Met Leu Cys Ala Arg Val Gly Ala Glu Leu Arg Val Ile
130 135 140130 135 140
ccg ctc aat ccc gat ggt acg ttg caa ctg gag acg ctg cct acg ctg 480ccg ctc aat ccc gat ggt acg ttg caa ctg gag acg ctg cct acg ctg 480
Pro Leu Asn Pro Asp Gly Thr Leu Gln Leu Glu Thr Leu Pro Thr LeuPro Leu Asn Pro Asp Gly Thr Leu Gln Leu Glu Thr Leu Pro Thr Leu
145 150 155 160145 150 155 160
ttt gat gag aaa act cgc ctg ctg gca att act cat gtc tcc aac gtg 528ttt gat gag aaa act cgc ctg ctg gca att act cat gtc tcc aac gtg 528
Phe Asp Glu Lys Thr Arg Leu Leu Ala Ile Thr His Val Ser Asn ValPhe Asp Glu Lys Thr Arg Leu Leu Ala Ile Thr His Val Ser Asn Val
165 170 175165 170 175
ctt ggc aca gaa aat cca ctg gcg gaa atg atc acg ctt gcg cac cag 576ctt ggc aca gaa aat cca ctg gcg gaa atg atc acg ctt gcg cac cag 576
Leu Gly Thr Glu Asn Pro Leu Ala Glu Met Ile Thr Leu Ala His GlnLeu Gly Thr Glu Asn Pro Leu Ala Glu Met Ile Thr Leu Ala His Gln
180 185 190180 185 190
cat ggc gca aaa gtg ctg gtg gat ggc gct cag gcg gtg atg cat cat 624cat ggc gca aaa gtg ctg gtg gat ggc gct cag gcg gtg atg cat cat 624
His Gly Ala Lys Val Leu Val Asp Gly Ala Gln Ala Val Met His HisHis Gly Ala Lys Val Leu Val Asp Gly Ala Gln Ala Val Met His His
195 200 205195 200 205
ccg gtg gat gtt cag gcg ctg gat tgc gac ttt tac gtg ttc tcc ggg 672ccg gtg gat gtt cag gcg ctg gat tgc gac ttt tac gtg ttc tcc ggg 672
Pro Val Asp Val Gln Ala Leu Asp Cys Asp Phe Tyr Val Phe Ser GlyPro Val Asp Val Gln Ala Leu Asp Cys Asp Phe Tyr Val Phe Ser Gly
210 215 220210 215 220
cat aaa ctg tat ggc ccc acc gga att ggc att ctt tat gtg aaa gaa 720cat aaa ctg tat ggc ccc acc gga att ggc att ctt tat gtg aaa gaa 720
His Lys Leu Tyr Gly Pro Thr Gly Ile Gly Ile Leu Tyr Val Lys GluHis Lys Leu Tyr Gly Pro Thr Gly Ile Gly Ile Leu Tyr Val Lys Glu
225 230 235 240225 230 235 240
gcc ttg ttg cag gag atg ccg ccg tgg gaa ggg ggc ggt tct atg atc 768gcc ttg ttg cag gag atg ccg ccg tgg gaa ggg ggc ggt tct atg atc 768
Ala Leu Leu Gln Glu Met Pro Pro Trp Glu Gly Gly Gly Ser Met IleAla Leu Leu Gln Glu Met Pro Pro Trp Glu Gly Gly Gly Ser Met Ile
245 250 255245 250 255
gcc acc gtc agc ctg agt gaa ggc act acc tgg acc aaa gca cca tgg 816gcc acc gtc agc ctg agt gaa ggc act acc tgg acc aaa gca cca tgg 816
Ala Thr Val Ser Leu Ser Glu Gly Thr Thr Trp Thr Lys Ala Pro TrpAla Thr Val Ser Leu Ser Glu Gly Thr Thr Trp Thr Lys Ala Pro Trp
260 265 270260 265 270
cgg ttt gaa gcc ggt aca ccc aat acc ggg ggc atc att ggt ctt ggc 864cgg ttt gaa gcc ggt aca ccc aat acc ggg ggc atc att ggt ctt ggc 864
Arg Phe Glu Ala Gly Thr Pro Asn Thr Gly Gly Ile Ile Gly Leu GlyArg Phe Glu Ala Gly Thr Pro Asn Thr Gly Gly Ile Ile Gly Leu Gly
275 280 285275 280 285
gcg gcg ctg gag tat gtt tcg gcg ctg ggg ctt aat aac ata gcc gag 912gcg gcg ctg gag tat gtt tcg gcg ctg ggg ctt aat aac ata gcc gag 912
Ala Ala Leu Glu Tyr Val Ser Ala Leu Gly Leu Asn Asn Ile Ala GluAla Ala Leu Glu Tyr Val Ser Ala Leu Gly Leu Asn Asn Ile Ala Glu
290 295 300290 295 300
tat gaa cag aat ctg atg cat tat gcg cta tca cag ctg gaa tct gta 960tat gaa cag aat ctg atg cat tat gcg cta tca cag ctg gaa tct gta 960
Tyr Glu Gln Asn Leu Met His Tyr Ala Leu Ser Gln Leu Glu Ser ValTyr Glu Gln Asn Leu Met His Tyr Ala Leu Ser Gln Leu Glu Ser Val
305 310 315 320305 310 315 320
ccg gat ctc act ctc tat ggc cca caa aac agg ctt ggc gtt att gct 1008ccg gat ctc act ctc tat ggc cca caa aac agg ctt ggc gtt att gct 1008
Pro Asp Leu Thr Leu Tyr Gly Pro Gln Asn Arg Leu Gly Val Ile AlaPro Asp Leu Thr Leu Tyr Gly Pro Gln Asn Arg Leu Gly Val Ile Ala
325 330 335325 330 335
ttt aat ctc ggt aaa cac cac gcc tat gat gtt ggc agt ttt ctc gat 1056ttt aat ctc ggt aaa cac cac gcc tat gat gtt ggc agt ttt ctc gat 1056
Phe Asn Leu Gly Lys His His Ala Tyr Asp Val Gly Ser Phe Leu AspPhe Asn Leu Gly Lys His His Ala Tyr Asp Val Gly Ser Phe Leu Asp
340 345 350340 345 350
aat tac ggc att gct gtg cgt acc gga cat cac tgc gca atg cca ttg 1104aat tac ggc att gct gtg cgt acc gga cat cac tgc gca atg cca ttg 1104
Asn Tyr Gly Ile Ala Val Arg Thr Gly His His Cys Ala Met Pro LeuAsn Tyr Gly Ile Ala Val Arg Thr Gly His His Cys Ala Met Pro Leu
355 360 365355 360 365
atg gcc tat tac aac gtc cct gcg atg tgt cgg gcg tcg ctg gcc atg 1152atg gcc tat tac aac gtc cct gcg atg tgt cgg gcg tcg ctg gcc atg 1152
Met Ala Tyr Tyr Asn Val Pro Ala Met Cys Arg Ala Ser Leu Ala MetMet Ala Tyr Tyr Asn Val Pro Ala Met Cys Arg Ala Ser Leu Ala Met
370 375 380370 375 380
tat aac acc cat gaa gaa gtg gat cgt ctg gtg acc ggc ctg caa cgt 1200tat aac acc cat gaa gaa gtg gat cgt ctg gtg acc ggc ctg caa cgt 1200
Tyr Asn Thr His Glu Glu Val Asp Arg Leu Val Thr Gly Leu Gln ArgTyr Asn Thr His Glu Glu Val Asp Arg Leu Val Thr Gly Leu Gln Arg
385 390 395 400385 390 395 400
att cac cgt ttg ctg gga taa 1221att cac cgt ttg ctg gga taa 1221
Ile His Arg Leu Leu GlyIle His Arg Leu Leu Gly
405405
<210> 8<210> 8
<211> 406<211> 406
<212> PRT<212> PRT
<213> Escherichia coli<213> Escherichia coli
<400> 8<400> 8
Met Ile Phe Ser Val Asp Lys Val Arg Ala Asp Phe Pro Val Leu SerMet Ile Phe Ser Val Asp Lys Val Arg Ala Asp Phe Pro Val Leu Ser
1 5 10 151 5 10 15
Arg Glu Val Asn Gly Leu Pro Leu Ala Tyr Leu Asp Ser Ala Ala SerArg Glu Val Asn Gly Leu Pro Leu Ala Tyr Leu Asp Ser Ala Ala Ser
20 25 3020 25 30
Ala Gln Lys Pro Ser Gln Val Ile Asp Ala Glu Ala Glu Phe Tyr ArgAla Gln Lys Pro Ser Gln Val Ile Asp Ala Glu Ala Glu Phe Tyr Arg
35 40 4535 40 45
His Gly Tyr Ala Ala Val His Arg Gly Ile His Thr Leu Ser Ala GlnHis Gly Tyr Ala Ala Val His Arg Gly Ile His Thr Leu Ser Ala Gln
50 55 6050 55 60
Ala Thr Glu Lys Met Glu Asn Val Arg Lys Arg Ala Ser Leu Phe IleAla Thr Glu Lys Met Glu Asn Val Arg Lys Arg Ala Ser Leu Phe Ile
65 70 75 8065 70 75 80
Asn Ala Arg Ser Ala Glu Glu Leu Val Phe Val Arg Gly Thr Thr GluAsn Ala Arg Ser Ala Glu Glu Leu Val Phe Val Arg Gly Thr Thr Glu
85 90 9585 90 95
Gly Ile Asn Leu Val Ala Asn Ser Trp Gly Asn Ser Asn Val Arg AlaGly Ile Asn Leu Val Ala Asn Ser Trp Gly Asn Ser Asn Val Arg Ala
100 105 110100 105 110
Gly Asp Asn Ile Ile Ile Ser Gln Met Glu His His Ala Asn Ile ValGly Asp Asn Ile Ile Ile Ser Gln Met Glu His His Ala Asn Ile Val
115 120 125115 120 125
Pro Trp Gln Met Leu Cys Ala Arg Val Gly Ala Glu Leu Arg Val IlePro Trp Gln Met Leu Cys Ala Arg Val Gly Ala Glu Leu Arg Val Ile
130 135 140130 135 140
Pro Leu Asn Pro Asp Gly Thr Leu Gln Leu Glu Thr Leu Pro Thr LeuPro Leu Asn Pro Asp Gly Thr Leu Gln Leu Glu Thr Leu Pro Thr Leu
145 150 155 160145 150 155 160
Phe Asp Glu Lys Thr Arg Leu Leu Ala Ile Thr His Val Ser Asn ValPhe Asp Glu Lys Thr Arg Leu Leu Ala Ile Thr His Val Ser Asn Val
165 170 175165 170 175
Leu Gly Thr Glu Asn Pro Leu Ala Glu Met Ile Thr Leu Ala His GlnLeu Gly Thr Glu Asn Pro Leu Ala Glu Met Ile Thr Leu Ala His Gln
180 185 190180 185 190
His Gly Ala Lys Val Leu Val Asp Gly Ala Gln Ala Val Met His HisHis Gly Ala Lys Val Leu Val Asp Gly Ala Gln Ala Val Met His His
195 200 205195 200 205
Pro Val Asp Val Gln Ala Leu Asp Cys Asp Phe Tyr Val Phe Ser GlyPro Val Asp Val Gln Ala Leu Asp Cys Asp Phe Tyr Val Phe Ser Gly
210 215 220210 215 220
His Lys Leu Tyr Gly Pro Thr Gly Ile Gly Ile Leu Tyr Val Lys GluHis Lys Leu Tyr Gly Pro Thr Gly Ile Gly Ile Leu Tyr Val Lys Glu
225 230 235 240225 230 235 240
Ala Leu Leu Gln Glu Met Pro Pro Trp Glu Gly Gly Gly Ser Met IleAla Leu Leu Gln Glu Met Pro Pro Trp Glu Gly Gly Gly Ser Met Ile
245 250 255245 250 255
Ala Thr Val Ser Leu Ser Glu Gly Thr Thr Trp Thr Lys Ala Pro TrpAla Thr Val Ser Leu Ser Glu Gly Thr Thr Trp Thr Lys Ala Pro Trp
260 265 270260 265 270
Arg Phe Glu Ala Gly Thr Pro Asn Thr Gly Gly Ile Ile Gly Leu GlyArg Phe Glu Ala Gly Thr Pro Asn Thr Gly Gly Ile Ile Gly Leu Gly
275 280 285275 280 285
Ala Ala Leu Glu Tyr Val Ser Ala Leu Gly Leu Asn Asn Ile Ala GluAla Ala Leu Glu Tyr Val Ser Ala Leu Gly Leu Asn Asn Ile Ala Glu
290 295 300290 295 300
Tyr Glu Gln Asn Leu Met His Tyr Ala Leu Ser Gln Leu Glu Ser ValTyr Glu Gln Asn Leu Met His Tyr Ala Leu Ser Gln Leu Glu Ser Val
305 310 315 320305 310 315 320
Pro Asp Leu Thr Leu Tyr Gly Pro Gln Asn Arg Leu Gly Val Ile AlaPro Asp Leu Thr Leu Tyr Gly Pro Gln Asn Arg Leu Gly Val Ile Ala
325 330 335325 330 335
Phe Asn Leu Gly Lys His His Ala Tyr Asp Val Gly Ser Phe Leu AspPhe Asn Leu Gly Lys His His Ala Tyr Asp Val Gly Ser Phe Leu Asp
340 345 350340 345 350
Asn Tyr Gly Ile Ala Val Arg Thr Gly His His Cys Ala Met Pro LeuAsn Tyr Gly Ile Ala Val Arg Thr Gly His His Cys Ala Met Pro Leu
355 360 365355 360 365
Met Ala Tyr Tyr Asn Val Pro Ala Met Cys Arg Ala Ser Leu Ala MetMet Ala Tyr Tyr Asn Val Pro Ala Met Cys Arg Ala Ser Leu Ala Met
370 375 380370 375 380
Tyr Asn Thr His Glu Glu Val Asp Arg Leu Val Thr Gly Leu Gln ArgTyr Asn Thr His Glu Glu Val Asp Arg Leu Val Thr Gly Leu Gln Arg
385 390 395 400385 390 395 400
Ile His Arg Leu Leu GlyIle His Arg Leu Leu Gly
405405
<210> 9<210> 9
<211> 3720<211> 3720
<212> DNA<212> DNA
<213> Escherichia coli<213> Escherichia coli
<220><220>
<221> CDS<221> CDS
<222> (530)..(1684)<222> (530) .. (1684)
<400> 9<400> 9
gacgtctgtg tggaattgtg agcggataac aatttcacac agggccctcg gacaccgagg 60gacgtctgtg tggaattgtg agcggataac aatttcacac agggccctcg gacaccgagg 60
agaatgtcaa gaggcgaaca cacaacgtct tggagcgcca gaggaggaac gagctaaaac 120agaatgtcaa gaggcgaaca cacaacgtct tggagcgcca gaggaggaac gagctaaaac 120
ggagcttttt tgccctgcgt gaccagatcc cggagttgga aaacaatgaa aaggccccca 180ggagcttttt tgccctgcgt gaccagatcc cggagttgga aaacaatgaa aaggccccca 180
aggtagttat ccttaaaaaa gccacagcat acatcctgtc cgtccaagca gaggagcaaa 240aggtagttat ccttaaaaaa gccacagcat acatcctgtc cgtccaagca gaggagcaaa 240
agctcatttc tgaagaggac ttgttgcgga aacgacgaga acagttgaaa cacaaacttg 300agctcatttc tgaagaggac ttgttgcgga aacgacgaga acagttgaaa cacaaacttg 300
aacagctacg gaactcttgt gcgtaaggaa aagtaaggaa aacgattcct tctaacagaa 360aacagctacg gaactcttgt gcgtaaggaa aagtaaggaa aacgattcct tctaacagaa 360
atgtcctgag caatcaccta tgaactgtcg actcgagata gcatttttat ccataagatt 420atgtcctgag caatcaccta tgaactgtcg actcgagata gcatttttat ccataagatt 420
agccgatcct aaggtttaca attgtgagcg ctcacaatta tgatagattc aattgtgagc 480agccgatcct aaggtttaca attgtgagcg ctcacaatta tgatagattc aattgtgagc 480
ggataacaat ttcacacacg ctagcggtac caaagaggag aaattaact atg gca aaa 538ggataacaat ttcacacacg ctagcggtac caaagaggag aaattaact atg gca aaa 538
Met Ala LysMet ala lys
1One
cac ctt ttt acg tcc gag tcc gtc tct gaa ggg cat cct gac aaa att 586cac ctt ttt acg tcc gag tcc gtc tct gaa ggg cat cct gac aaa att 586
His Leu Phe Thr Ser Glu Ser Val Ser Glu Gly His Pro Asp Lys IleHis Leu Phe Thr Ser Glu Ser Val Ser Glu Gly His Pro Asp Lys Ile
5 10 155 10 15
gct gac caa att tct gat gcc gtt tta gac gcg atc ctc gaa cag gat 634gct gac caa att tct gat gcc gtt tta gac gcg atc ctc gaa cag gat 634
Ala Asp Gln Ile Ser Asp Ala Val Leu Asp Ala Ile Leu Glu Gln AspAla Asp Gln Ile Ser Asp Ala Val Leu Asp Ala Ile Leu Glu Gln Asp
20 25 30 3520 25 30 35
ccg aaa gca cgc gtt gct tgc gaa acc tac gta aaa acc ggc atg gtt 682ccg aaa gca cgc gtt gct tgc gaa acc tac gta aaa acc ggc atg gtt 682
Pro Lys Ala Arg Val Ala Cys Glu Thr Tyr Val Lys Thr Gly Met ValPro Lys Ala Arg Val Ala Cys Glu Thr Tyr Val Lys Thr Gly Met Val
40 45 5040 45 50
tta gtt ggc ggc gaa atc acc acc agc gcc tgg gta gac atc gaa gag 730tta gtt ggc ggc gaa atc acc acc agc gcc tgg gta gac atc gaa gag 730
Leu Val Gly Gly Glu Ile Thr Thr Ser Ala Trp Val Asp Ile Glu GluLeu Val Gly Gly Glu Ile Thr Thr Ser Ala Trp Val Asp Ile Glu Glu
55 60 6555 60 65
atc acc cgt aac acc gtt cgc gaa att ggc tat gtg cat tcc gac atg 778atc acc cgt aac acc gtt cgc gaa att ggc tat gtg cat tcc gac atg 778
Ile Thr Arg Asn Thr Val Arg Glu Ile Gly Tyr Val His Ser Asp MetIle Thr Arg Asn Thr Val Arg Glu Ile Gly Tyr Val His Ser Asp Met
70 75 8070 75 80
ggc ttt gac gct aac tcc tgt gcg gtt ctg agc gct atc ggc aaa cag 826ggc ttt gac gct aac tcc tgt gcg gtt ctg agc gct atc ggc aaa cag 826
Gly Phe Asp Ala Asn Ser Cys Ala Val Leu Ser Ala Ile Gly Lys GlnGly Phe Asp Ala Asn Ser Cys Ala Val Leu Ser Ala Ile Gly Lys Gln
85 90 9585 90 95
tct cct gac atc aac cag ggc gtt gac cgt gcc gat ccg ctg gaa cag 874tct cct gac atc aac cag ggc gtt gac cgt gcc gat ccg ctg gaa cag 874
Ser Pro Asp Ile Asn Gln Gly Val Asp Arg Ala Asp Pro Leu Glu GlnSer Pro Asp Ile Asn Gln Gly Val Asp Arg Ala Asp Pro Leu Glu Gln
100 105 110 115100 105 110 115
ggc gcg ggt gac cag ggt ctg atg ttt ggc tac gca act aat gaa acc 922ggc gcg ggt gac cag ggt ctg atg ttt ggc tac gca act aat gaa acc 922
Gly Ala Gly Asp Gln Gly Leu Met Phe Gly Tyr Ala Thr Asn Glu ThrGly Ala Gly Asp Gln Gly Leu Met Phe Gly Tyr Ala Thr Asn Glu Thr
120 125 130120 125 130
gac gtg ctg atg cca gca cct atc acc tat gca cac cgt ctg gta cag 970gac gtg ctg atg cca gca cct atc acc tat gca cac cgt ctg gta cag 970
Asp Val Leu Met Pro Ala Pro Ile Thr Tyr Ala His Arg Leu Val GlnAsp Val Leu Met Pro Ala Pro Ile Thr Tyr Ala His Arg Leu Val Gln
135 140 145135 140 145
cgt cag gct gaa gtg cgt aaa aac ggc act ctg ccg tgg ctg cgc ccg 1018cgt cag gct gaa gtg cgt aaa aac ggc act ctg ccg tgg ctg cgc ccg 1018
Arg Gln Ala Glu Val Arg Lys Asn Gly Thr Leu Pro Trp Leu Arg ProArg Gln Ala Glu Val Arg Lys Asn Gly Thr Leu Pro Trp Leu Arg Pro
150 155 160150 155 160
gac gcg aaa agc cag gtg act ttt cag tat gac gac ggc aaa atc gtt 1066gac gcg aaa agc cag gtg act ttt cag tat gac gac ggc aaa atc gtt 1066
Asp Ala Lys Ser Gln Val Thr Phe Gln Tyr Asp Asp Gly Lys Ile ValAsp Ala Lys Ser Gln Val Thr Phe Gln Tyr Asp Asp Gly Lys Ile Val
165 170 175165 170 175
ggt atc gat gct gtc gtg ctt tcc act cag cac tct gaa gag atc gac 1114ggt atc gat gct gtc gtg ctt tcc act cag cac tct gaa gag atc gac 1114
Gly Ile Asp Ala Val Val Leu Ser Thr Gln His Ser Glu Glu Ile AspGly Ile Asp Ala Val Val Leu Ser Thr Gln His Ser Glu Glu Ile Asp
180 185 190 195180 185 190 195
cag aaa tcg ctg caa gaa gcg gta atg gaa gag atc atc aag cca att 1162cag aaa tcg ctg caa gaa gcg gta atg gaa gag atc atc aag cca att 1162
Gln Lys Ser Leu Gln Glu Ala Val Met Glu Glu Ile Ile Lys Pro IleGln Lys Ser Leu Gln Glu Ala Val Met Glu Glu Ile Ile Lys Pro Ile
200 205 210200 205 210
ctg ccc gct gaa tgg ctg act tct gcc acc aaa ttc ttc atc aac ccg 1210ctg ccc gct gaa tgg ctg act tct gcc acc aaa ttc ttc atc aac ccg 1210
Leu Pro Ala Glu Trp Leu Thr Ser Ala Thr Lys Phe Phe Ile Asn ProLeu Pro Ala Glu Trp Leu Thr Ser Ala Thr Lys Phe Phe Ile Asn Pro
215 220 225215 220 225
acc ggt cgt ttc gtt atc ggt ggc cca atg ggt gac tgc ggt ctg act 1258acc ggt cgt ttc gtt atc ggt ggc cca atg ggt gac tgc ggt ctg act 1258
Thr Gly Arg Phe Val Ile Gly Gly Pro Met Gly Asp Cys Gly Leu ThrThr Gly Arg Phe Val Ile Gly Gly Pro Met Gly Asp Cys Gly Leu Thr
230 235 240230 235 240
ggt cgt aaa att atc gtt gat acc tac ggc ggc atg gcg cgt cac ggt 1306ggt cgt aaa att atc gtt gat acc tac ggc ggc atg gcg cgt cac ggt 1306
Gly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly Met Ala Arg His GlyGly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly Met Ala Arg His Gly
245 250 255245 250 255
ggc ggt gca ttc tct ggt aaa gat cca tca aaa gtg gac cgt tcc gca 1354ggc ggt gca ttc tct ggt aaa gat cca tca aaa gtg gac cgt tcc gca 1354
Gly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val Asp Arg Ser AlaGly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val Asp Arg Ser Ala
260 265 270 275260 265 270 275
gcc tac gca gca cgt tat gtc gcg aaa aac atc gtt gct gct ggc ctg 1402gcc tac gca gca cgt tat gtc gcg aaa aac atc gtt gct gct ggc ctg 1402
Ala Tyr Ala Ala Arg Tyr Val Ala Lys Asn Ile Val Ala Ala Gly LeuAla Tyr Ala Ala Arg Tyr Val Ala Lys Asn Ile Val Ala Ala Gly Leu
280 285 290280 285 290
gcc gat cgt tgt gaa att cag gtt tcc tac gca atc ggc gtg gct gaa 1450gcc gat cgt tgt gaa att cag gtt tcc tac gca atc ggc gtg gct gaa 1450
Ala Asp Arg Cys Glu Ile Gln Val Ser Tyr Ala Ile Gly Val Ala GluAla Asp Arg Cys Glu Ile Gln Val Ser Tyr Ala Ile Gly Val Ala Glu
295 300 305295 300 305
ccg acc tcc atc atg gta gaa act ttc ggt act gag aaa gtg cct tct 1498ccg acc tcc atc atg gta gaa act ttc ggt act gag aaa gtg cct tct 1498
Pro Thr Ser Ile Met Val Glu Thr Phe Gly Thr Glu Lys Val Pro SerPro Thr Ser Ile Met Val Glu Thr Phe Gly Thr Glu Lys Val Pro Ser
310 315 320310 315 320
gaa caa ctg acc ctg ctg gta cgt gag ttc ttc gac ctg cgc cca tac 1546gaa caa ctg acc ctg ctg gta cgt gag ttc ttc gac ctg cgc cca tac 1546
Glu Gln Leu Thr Leu Leu Val Arg Glu Phe Phe Asp Leu Arg Pro TyrGlu Gln Leu Thr Leu Leu Val Arg Glu Phe Phe Asp Leu Arg Pro Tyr
325 330 335325 330 335
ggt ctg att cag atg ctg gat ctg ctg cac ccg atc tac aaa gaa acc 1594ggt ctg att cag atg ctg gat ctg ctg cac ccg atc tac aaa gaa acc 1594
Gly Leu Ile Gln Met Leu Asp Leu Leu His Pro Ile Tyr Lys Glu ThrGly Leu Ile Gln Met Leu Asp Leu Leu His Pro Ile Tyr Lys Glu Thr
340 345 350 355340 345 350 355
gca gca tac ggt cac ttt ggt cgt gaa cat ttc ccg tgg gaa aaa acc 1642gca gca tac ggt cac ttt ggt cgt gaa cat ttc ccg tgg gaa aaa acc 1642
Ala Ala Tyr Gly His Phe Gly Arg Glu His Phe Pro Trp Glu Lys ThrAla Ala Tyr Gly His Phe Gly Arg Glu His Phe Pro Trp Glu Lys Thr
360 365 370360 365 370
gac aaa gcg cag ctg ctg cgc gat gct gcc ggt ctg aag taa 1684gac aaa gcg cag ctg ctg cgc gat gct gcc ggt ctg aag taa 1684
Asp Lys Ala Gln Leu Leu Arg Asp Ala Ala Gly Leu LysAsp Lys Ala Gln Leu Leu Arg Asp Ala Ala Gly Leu Lys
375 380 385375 380 385
tcggtaccgc ttgatatcga attcctgcag cccgggggat cccatggtac gcgtgctaga 1744tcggtaccgc ttgatatcga attcctgcag cccgggggat cccatggtac gcgtgctaga 1744
ggcatcaaat aaaacgaaag gctcagtcga aagactgggc ctttcgtttt atctgttgtt 1804ggcatcaaat aaaacgaaag gctcagtcga aagactgggc ctttcgtttt atctgttgtt 1804
tgtcggtgaa cgctctcctg agtaggacaa atccgccgcc ctagacctag gggatatatt 1864tgtcggtgaa cgctctcctg agtaggacaa atccgccgcc ctagacctag gggatatatt 1864
ccgcttcctc gctcactgac tcgctacgct cggtcgttcg actgcggcga gcggaaatgg 1924ccgcttcctc gctcactgac tcgctacgct cggtcgttcg actgcggcga gcggaaatgg 1924
cttacgaacg gggcggagat ttcctggaag atgccaggaa gatacttaac agggaagtga 1984cttacgaacg gggcggagat ttcctggaag atgccaggaa gatacttaac agggaagtga 1984
gagggccgcg gcaaagccgt ttttccatag gctccgcccc cctgacaagc atcacgaaat 2044gagggccgcg gcaaagccgt ttttccatag gctccgcccc cctgacaagc atcacgaaat 2044
ctgacgctca aatcagtggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc 2104ctgacgctca aatcagtggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc 2104
ccctggcggc tccctcgtgc gctctcctgt tcctgccttt cggtttaccg gtgtcattcc 2164ccctggcggc tccctcgtgc gctctcctgt tcctgccttt cggtttaccg gtgtcattcc 2164
gctgttatgg ccgcgtttgt ctcattccac gcctgacact cagttccggg taggcagttc 2224gctgttatgg ccgcgtttgt ctcattccac gcctgacact cagttccggg taggcagttc 2224
gctccaagct ggactgtatg cacgaacccc ccgttcagtc cgaccgctgc gccttatccg 2284gctccaagct ggactgtatg cacgaacccc ccgttcagtc cgaccgctgc gccttatccg 2284
gtaactatcg tcttgagtcc aacccggaaa gacatgcaaa agcaccactg gcagcagcca 2344gtaactatcg tcttgagtcc aacccggaaa gacatgcaaa agcaccactg gcagcagcca 2344
ctggtaattg atttagagga gttagtcttg aagtcatgcg ccggttaagg ctaaactgaa 2404ctggtaattg atttagagga gttagtcttg aagtcatgcg ccggttaagg ctaaactgaa 2404
aggacaagtt ttggtgactg cgctcctcca agccagttac ctcggttcaa agagttggta 2464aggacaagtt ttggtgactg cgctcctcca agccagttac ctcggttcaa agagttggta 2464
gctcagagaa ccttcgaaaa accgccctgc aaggcggttt tttcgttttc agagcaagag 2524gctcagagaa ccttcgaaaa accgccctgc aaggcggttt tttcgttttc agagcaagag 2524
attacgcgca gaccaaaacg atctcaagaa gatcatctta ttaatcagat aaaatatttc 2584attacgcgca gaccaaaacg atctcaagaa gatcatctta ttaatcagat aaaatatttc 2584
tagatttcag tgcaatttat ctcttcaaat gtagcacctg aagtcagccc catacgatat 2644tagatttcag tgcaatttat ctcttcaaat gtagcacctg aagtcagccc catacgatat 2644
aagttgttac tagtgcttgg attctcacca ataaaaaacg cccggcggca accgagcgtt 2704aagttgttac tagtgcttgg attctcacca ataaaaaacg cccggcggca accgagcgtt 2704
ctgaacaaat ccagatggag ttctgaggtc attactggat ctatcaacag gagtccaagc 2764ctgaacaaat ccagatggag ttctgaggtc attactggat ctatcaacag gagtccaagc 2764
gagctctcga accccagagt cccgctcaga agaactcgtc aagaaggcga tagaaggcga 2824gagctctcga accccagagt cccgctcaga agaactcgtc aagaaggcga tagaaggcga 2824
tgcgctgcga atcgggagcg gcgataccgt aaagcacgag gaagcggtca gcccattcgc 2884tgcgctgcga atcgggagcg gcgataccgt aaagcacgag gaagcggtca gcccattcgc 2884
cgccaagctc ttcagcaata tcacgggtag ccaacgctat gtcctgatag cggtccgcca 2944cgccaagctc ttcagcaata tcacgggtag ccaacgctat gtcctgatag cggtccgcca 2944
cacccagccg gccacagtcg atgaatccag aaaagcggcc attttccacc atgatattcg 3004cacccagccg gccacagtcg atgaatccag aaaagcggcc attttccacc atgatattcg 3004
gcaagcaggc atcgccatgg gtcacgacga gatcctcgcc gtcgggcatg cgcgccttga 3064gcaagcaggc atcgccatgg gtcacgacga gatcctcgcc gtcgggcatg cgcgccttga 3064
gcctggcgaa cagttcggct ggcgcgagcc cctgatgctc ttcgtccaga tcatcctgat 3124gcctggcgaa cagttcggct ggcgcgagcc cctgatgctc ttcgtccaga tcatcctgat 3124
cgacaagacc ggcttccatc cgagtacgtg ctcgctcgat gcgatgtttc gcttggtggt 3184cgacaagacc ggcttccatc cgagtacgtg ctcgctcgat gcgatgtttc gcttggtggt 3184
cgaatgggca ggtagccgga tcaagcgtat gcagccgccg cattgcatca gccatgatgg 3244cgaatgggca ggtagccgga tcaagcgtat gcagccgccg cattgcatca gccatgatgg 3244
atactttctc ggcaggagca aggtgagatg acaggagatc ctgccccggc acttcgccca 3304atactttctc ggcaggagca aggtgagatg acaggagatc ctgccccggc acttcgccca 3304
atagcagcca gtcccttccc gcttcagtga caacgtcgag cacagctgcg caaggaacgc 3364atagcagcca gtcccttccc gcttcagtga caacgtcgag cacagctgcg caaggaacgc 3364
ccgtcgtggc cagccacgat agccgcgctg cctcgtcctg cagttcattc agggcaccgg 3424ccgtcgtggc cagccacgat agccgcgctg cctcgtcctg cagttcattc agggcaccgg 3424
acaggtcggt cttgacaaaa agaaccgggc gcccctgcgc tgacagccgg aacacggcgg 3484acaggtcggt cttgacaaaa agaaccgggc gcccctgcgc tgacagccgg aacacggcgg 3484
catcagagca gccgattgtc tgttgtgccc agtcatagcc gaatagcctc tccacccaag 3544catcagagca gccgattgtc tgttgtgccc agtcatagcc gaatagcctc tccacccaag 3544
cggccggaga acctgcgtgc aatccatctt gttcaatcat gcgaaacgat cctcatcctg 3604cggccggaga acctgcgtgc aatccatctt gttcaatcat gcgaaacgat cctcatcctg 3604
tctcttgatc agatcttgat cccctgcgcc atcagatcct tggcggcaag aaagccatcc 3664tctcttgatc agatcttgat cccctgcgcc atcagatcct tggcggcaag aaagccatcc 3664
agtttacttt gcagggcttc ccaaccttac cagagggcgc cccagctggc aattcc 3720agtttacttt gcagggcttc ccaaccttac cagagggcgc cccagctggc aattcc 3720
<210> 10<210> 10
<211> 384<211> 384
<212> PRT<212> PRT
<213> Escherichia coli<213> Escherichia coli
<400> 10<400> 10
Met Ala Lys His Leu Phe Thr Ser Glu Ser Val Ser Glu Gly His ProMet Ala Lys His Leu Phe Thr Ser Glu Ser Val Ser Glu Gly His Pro
1 5 10 151 5 10 15
Asp Lys Ile Ala Asp Gln Ile Ser Asp Ala Val Leu Asp Ala Ile LeuAsp Lys Ile Ala Asp Gln Ile Ser Asp Ala Val Leu Asp Ala Ile Leu
20 25 3020 25 30
Glu Gln Asp Pro Lys Ala Arg Val Ala Cys Glu Thr Tyr Val Lys ThrGlu Gln Asp Pro Lys Ala Arg Val Ala Cys Glu Thr Tyr Val Lys Thr
35 40 4535 40 45
Gly Met Val Leu Val Gly Gly Glu Ile Thr Thr Ser Ala Trp Val AspGly Met Val Leu Val Gly Gly Glu Ile Thr Thr Ser Ala Trp Val Asp
50 55 6050 55 60
Ile Glu Glu Ile Thr Arg Asn Thr Val Arg Glu Ile Gly Tyr Val HisIle Glu Glu Ile Thr Arg Asn Thr Val Arg Glu Ile Gly Tyr Val His
65 70 75 8065 70 75 80
Ser Asp Met Gly Phe Asp Ala Asn Ser Cys Ala Val Leu Ser Ala IleSer Asp Met Gly Phe Asp Ala Asn Ser Cys Ala Val Leu Ser Ala Ile
85 90 9585 90 95
Gly Lys Gln Ser Pro Asp Ile Asn Gln Gly Val Asp Arg Ala Asp ProGly Lys Gln Ser Pro Asp Ile Asn Gln Gly Val Asp Arg Ala Asp Pro
100 105 110100 105 110
Leu Glu Gln Gly Ala Gly Asp Gln Gly Leu Met Phe Gly Tyr Ala ThrLeu Glu Gln Gly Ala Gly Asp Gln Gly Leu Met Phe Gly Tyr Ala Thr
115 120 125115 120 125
Asn Glu Thr Asp Val Leu Met Pro Ala Pro Ile Thr Tyr Ala His ArgAsn Glu Thr Asp Val Leu Met Pro Ala Pro Ile Thr Tyr Ala His Arg
130 135 140130 135 140
Leu Val Gln Arg Gln Ala Glu Val Arg Lys Asn Gly Thr Leu Pro TrpLeu Val Gln Arg Gln Ala Glu Val Arg Lys Asn Gly Thr Leu Pro Trp
145 150 155 160145 150 155 160
Leu Arg Pro Asp Ala Lys Ser Gln Val Thr Phe Gln Tyr Asp Asp GlyLeu Arg Pro Asp Ala Lys Ser Gln Val Thr Phe Gln Tyr Asp Asp Gly
165 170 175165 170 175
Lys Ile Val Gly Ile Asp Ala Val Val Leu Ser Thr Gln His Ser GluLys Ile Val Gly Ile Asp Ala Val Val Leu Ser Thr Gln His Ser Glu
180 185 190180 185 190
Glu Ile Asp Gln Lys Ser Leu Gln Glu Ala Val Met Glu Glu Ile IleGlu Ile Asp Gln Lys Ser Leu Gln Glu Ala Val Met Glu Glu Ile Ile
195 200 205195 200 205
Lys Pro Ile Leu Pro Ala Glu Trp Leu Thr Ser Ala Thr Lys Phe PheLys Pro Ile Leu Pro Ala Glu Trp Leu Thr Ser Ala Thr Lys Phe Phe
210 215 220210 215 220
Ile Asn Pro Thr Gly Arg Phe Val Ile Gly Gly Pro Met Gly Asp CysIle Asn Pro Thr Gly Arg Phe Val Ile Gly Gly Pro Met Gly Asp Cys
225 230 235 240225 230 235 240
Gly Leu Thr Gly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly Met AlaGly Leu Thr Gly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly Met Ala
245 250 255245 250 255
Arg His Gly Gly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val AspArg His Gly Gly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val Asp
260 265 270260 265 270
Arg Ser Ala Ala Tyr Ala Ala Arg Tyr Val Ala Lys Asn Ile Val AlaArg Ser Ala Ala Tyr Ala Ala Arg Tyr Val Ala Lys Asn Ile Val Ala
275 280 285275 280 285
Ala Gly Leu Ala Asp Arg Cys Glu Ile Gln Val Ser Tyr Ala Ile GlyAla Gly Leu Ala Asp Arg Cys Glu Ile Gln Val Ser Tyr Ala Ile Gly
290 295 300290 295 300
Val Ala Glu Pro Thr Ser Ile Met Val Glu Thr Phe Gly Thr Glu LysVal Ala Glu Pro Thr Ser Ile Met Val Glu Thr Phe Gly Thr Glu Lys
305 310 315 320305 310 315 320
Val Pro Ser Glu Gln Leu Thr Leu Leu Val Arg Glu Phe Phe Asp LeuVal Pro Ser Glu Gln Leu Thr Leu Leu Val Arg Glu Phe Phe Asp Leu
325 330 335325 330 335
Arg Pro Tyr Gly Leu Ile Gln Met Leu Asp Leu Leu His Pro Ile TyrArg Pro Tyr Gly Leu Ile Gln Met Leu Asp Leu Leu His Pro Ile Tyr
340 345 350340 345 350
Lys Glu Thr Ala Ala Tyr Gly His Phe Gly Arg Glu His Phe Pro TrpLys Glu Thr Ala Ala Tyr Gly His Phe Gly Arg Glu His Phe Pro Trp
355 360 365355 360 365
Glu Lys Thr Asp Lys Ala Gln Leu Leu Arg Asp Ala Ala Gly Leu LysGlu Lys Thr Asp Lys Ala Gln Leu Leu Arg Asp Ala Ala Gly Leu Lys
370 375 380370 375 380
<210> 11<210> 11
<211> 3794<211> 3794
<212> DNA<212> DNA
<213> Escherichia coli<213> Escherichia coli
<220><220>
<221> CDS<221> CDS
<222> (601)..(1806)<222> (601) .. (1806)
<400> 11<400> 11
gacgtctgtg tggaattgtg agcggataac aatttcacac agggccctcg gacaccgagg 60gacgtctgtg tggaattgtg agcggataac aatttcacac agggccctcg gacaccgagg 60
agaatgtcaa gaggcgaaca cacaacgtct tggagcgcca gaggaggaac gagctaaaac 120agaatgtcaa gaggcgaaca cacaacgtct tggagcgcca gaggaggaac gagctaaaac 120
ggagcttttt tgccctgcgt gaccagatcc cggagttgga aaacaatgaa aaggccccca 180ggagcttttt tgccctgcgt gaccagatcc cggagttgga aaacaatgaa aaggccccca 180
aggtagttat ccttaaaaaa gccacagcat acatcctgtc cgtccaagca gaggagcaaa 240aggtagttat ccttaaaaaa gccacagcat acatcctgtc cgtccaagca gaggagcaaa 240
agctcatttc tgaagaggac ttgttgcgga aacgacgaga acagttgaaa cacaaacttg 300agctcatttc tgaagaggac ttgttgcgga aacgacgaga acagttgaaa cacaaacttg 300
aacagctacg gaactcttgt gcgtaaggaa aagtaaggaa aacgattcct tctaacagaa 360aacagctacg gaactcttgt gcgtaaggaa aagtaaggaa aacgattcct tctaacagaa 360
atgtcctgag caatcaccta tgaactgtcg actcgagata gcatttttat ccataagatt 420atgtcctgag caatcaccta tgaactgtcg actcgagata gcatttttat ccataagatt 420
agccgatcct aaggtttaca attgtgagcg ctcacaatta tgatagattc aattgtgagc 480agccgatcct aaggtttaca attgtgagcg ctcacaatta tgatagattc aattgtgagc 480
ggataacaat ttcacacacg ctagcggtac cgggcccccc ctcgaggtcg acggtatcga 540ggataacaat ttcacacacg ctagcggtac cgggcccccc ctcgaggtcg acggtatcga 540
taagcttgat atcgaattcc tgcagcccgg gggatcccat ggtacgcgtc gaggagtacc 600taagcttgat atcgaattcc tgcagcccgg gggatcccat ggtacgcgtc gaggagtacc 600
atg aac gtt ttt aat ccc gcg cag ttt cgc gcc cag ttt ccc gca cta 648atg aac gtt ttt aat ccc gcg cag ttt cgc gcc cag ttt ccc gca cta 648
Met Asn Val Phe Asn Pro Ala Gln Phe Arg Ala Gln Phe Pro Ala LeuMet Asn Val Phe Asn Pro Ala Gln Phe Arg Ala Gln Phe Pro Ala Leu
1 5 10 151 5 10 15
cag gat gcg ggc gtc tat ctc gac agc gcc gcg acc gcg ctt aaa cct 696cag gat gcg ggc gtc tat ctc gac agc gcc gcg acc gcg ctt aaa cct 696
Gln Asp Ala Gly Val Tyr Leu Asp Ser Ala Ala Thr Ala Leu Lys ProGln Asp Ala Gly Val Tyr Leu Asp Ser Ala Ala Thr Ala Leu Lys Pro
20 25 3020 25 30
gaa gcc gtg gtt gaa gcc acc caa cag ttt tac agt ctg agc gcc gga 744gaa gcc gtg gtt gaa gcc acc caa cag ttt tac agt ctg agc gcc gga 744
Glu Ala Val Val Glu Ala Thr Gln Gln Phe Tyr Ser Leu Ser Ala GlyGlu Ala Val Val Glu Ala Thr Gln Gln Phe Tyr Ser Leu Ser Ala Gly
35 40 4535 40 45
aac gtc cat cgc agc cag ttt gcc gaa gcc caa cgc ctg acc gcg cgt 792aac gtc cat cgc agc cag ttt gcc gaa gcc caa cgc ctg acc gcg cgt 792
Asn Val His Arg Ser Gln Phe Ala Glu Ala Gln Arg Leu Thr Ala ArgAsn Val His Arg Ser Gln Phe Ala Glu Ala Gln Arg Leu Thr Ala Arg
50 55 6050 55 60
tat gaa gct gca cga gag aaa gtg gcg caa tta ctg aat gca ccg gat 840tat gaa gct gca cga gag aaa gtg gcg caa tta ctg aat gca ccg gat 840
Tyr Glu Ala Ala Arg Glu Lys Val Ala Gln Leu Leu Asn Ala Pro AspTyr Glu Ala Ala Arg Glu Lys Val Ala Gln Leu Leu Asn Ala Pro Asp
65 70 75 8065 70 75 80
gat aaa act atc gtc tgg acg cgc ggc acc act gaa tcc atc aac atg 888gat aaa act atc gtc tgg acg cgc ggc acc act gaa tcc atc aac atg 888
Asp Lys Thr Ile Val Trp Thr Arg Gly Thr Thr Glu Ser Ile Asn MetAsp Lys Thr Ile Val Trp Thr Arg Gly Thr Thr Glu Ser Ile Asn Met
85 90 9585 90 95
gtg gca caa tgc tat gcg cgt ccg cgt ctg caa ccg ggc gat gag att 936gtg gca caa tgc tat gcg cgt ccg cgt ctg caa ccg ggc gat gag att 936
Val Ala Gln Cys Tyr Ala Arg Pro Arg Leu Gln Pro Gly Asp Glu IleVal Ala Gln Cys Tyr Ala Arg Pro Arg Leu Gln Pro Gly Asp Glu Ile
100 105 110100 105 110
att gtc agc gtg gca gaa cac cac gcc aac ctc gtc ccc tgg ctg atg 984att gtc agc gtg gca gaa cac cac gcc aac ctc gtc ccc tgg ctg atg 984
Ile Val Ser Val Ala Glu His His Ala Asn Leu Val Pro Trp Leu MetIle Val Ser Val Ala Glu His His Ala Asn Leu Val Pro Trp Leu Met
115 120 125115 120 125
gtc gcc caa caa act gga gcc aaa gtg gtg aaa ttg ccg ctt aat gcg 1032gtc gcc caa caa act gga gcc aaa gtg gtg aaa ttg ccg ctt aat gcg 1032
Val Ala Gln Gln Thr Gly Ala Lys Val Val Lys Leu Pro Leu Asn AlaVal Ala Gln Gln Thr Gly Ala Lys Val Val Lys Leu Pro Leu Asn Ala
130 135 140130 135 140
cag cga ctg ccg gat gtc gat ttg ttg cca gaa ctg att act ccc cgt 1080cag cga ctg ccg gat gtc gat ttg ttg cca gaa ctg att act ccc cgt 1080
Gln Arg Leu Pro Asp Val Asp Leu Leu Pro Glu Leu Ile Thr Pro ArgGln Arg Leu Pro Asp Val Asp Leu Leu Pro Glu Leu Ile Thr Pro Arg
145 150 155 160145 150 155 160
agt cgg att ctg gcg ttg ggt cag atg tcg aac gtt act ggc ggt tgc 1128agt cgg att ctg gcg ttg ggt cag atg tcg aac gtt act ggc ggt tgc 1128
Ser Arg Ile Leu Ala Leu Gly Gln Met Ser Asn Val Thr Gly Gly CysSer Arg Ile Leu Ala Leu Gly Gln Met Ser Asn Val Thr Gly Gly Cys
165 170 175165 170 175
ccg gat ctg gcg cga gcg att acc ttt gct cat tca gcc ggg atg gtg 1176ccg gat ctg gcg cga gcg att acc ttt gct cat tca gcc ggg atg gtg 1176
Pro Asp Leu Ala Arg Ala Ile Thr Phe Ala His Ser Ala Gly Met ValPro Asp Leu Ala Arg Ala Ile Thr Phe Ala His Ser Ala Gly Met Val
180 185 190180 185 190
gtg atg gtt gat ggt gct cag ggg gca gtg cat ttc ccc gcg gat gtt 1224gtg atg gtt gat ggt gct cag ggg gca gtg cat ttc ccc gcg gat gtt 1224
Val Met Val Asp Gly Ala Gln Gly Ala Val His Phe Pro Ala Asp ValVal Met Val Asp Gly Ala Gln Gly Ala Val His Phe Pro Ala Asp Val
195 200 205195 200 205
cag caa ctg gat att gat ttc tat gct ttt tca ggt cac aaa ctg tat 1272cag caa ctg gat att gat ttc tat gct ttt tca ggt cac aaa ctg tat 1272
Gln Gln Leu Asp Ile Asp Phe Tyr Ala Phe Ser Gly His Lys Leu TyrGln Gln Leu Asp Ile Asp Phe Tyr Ala Phe Ser Gly His Lys Leu Tyr
210 215 220210 215 220
ggc ccg aca ggt atc ggc gtg ctg tat ggt aaa tca gaa ctg ctg gag 1320ggc ccg aca ggt atc ggc gtg ctg tat ggt aaa tca gaa ctg ctg gag 1320
Gly Pro Thr Gly Ile Gly Val Leu Tyr Gly Lys Ser Glu Leu Leu GluGly Pro Thr Gly Ile Gly Val Leu Tyr Gly Lys Ser Glu Leu Leu Glu
225 230 235 240225 230 235 240
gcg atg tcg ccc tgg ctg ggc ggc ggc aaa atg gtt cac gaa gtg agt 1368gcg atg tcg ccc tgg ctg ggc ggc ggc aaa atg gtt cac gaa gtg agt 1368
Ala Met Ser Pro Trp Leu Gly Gly Gly Lys Met Val His Glu Val SerAla Met Ser Pro Trp Leu Gly Gly Gly Lys Met Val His Glu Val Ser
245 250 255245 250 255
ttt gac ggc ttc acg act caa tct gcg ccg tgg aaa ctg gaa gct gga 1416ttt gac ggc ttc acg act caa tct gcg ccg tgg aaa ctg gaa gct gga 1416
Phe Asp Gly Phe Thr Thr Gln Ser Ala Pro Trp Lys Leu Glu Ala GlyPhe Asp Gly Phe Thr Thr Gln Ser Ala Pro Trp Lys Leu Glu Ala Gly
260 265 270260 265 270
acg cca aat gtc gct ggt gtc ata gga tta agc gcg gcg ctg gaa tgg 1464acg cca aat gtc gct ggt gtc ata gga tta agc gcg gcg ctg gaa tgg 1464
Thr Pro Asn Val Ala Gly Val Ile Gly Leu Ser Ala Ala Leu Glu TrpThr Pro Asn Val Ala Gly Val Ile Gly Leu Ser Ala Ala Leu Glu Trp
275 280 285275 280 285
ctg gca gat tac gat atc aac cag gcc gaa agc tgg agc cgt agc tta 1512ctg gca gat tac gat atc aac cag gcc gaa agc tgg agc cgt agc tta 1512
Leu Ala Asp Tyr Asp Ile Asn Gln Ala Glu Ser Trp Ser Arg Ser LeuLeu Ala Asp Tyr Asp Ile Asn Gln Ala Glu Ser Trp Ser Arg Ser Leu
290 295 300290 295 300
gca acg ctg gcg gaa gat gcg ctg gcg aaa cgt ccc ggc ttt cgt tca 1560gca acg ctg gcg gaa gat gcg ctg gcg aaa cgt ccc ggc ttt cgt tca 1560
Ala Thr Leu Ala Glu Asp Ala Leu Ala Lys Arg Pro Gly Phe Arg SerAla Thr Leu Ala Glu Asp Ala Leu Ala Lys Arg Pro Gly Phe Arg Ser
305 310 315 320305 310 315 320
ttc cgc tgc cag gat tcc agc ctg ctg gcc ttt gat ttt gct ggc gtt 1608ttc cgc tgc cag gat tcc agc ctg ctg gcc ttt gat ttt gct ggc gtt 1608
Phe Arg Cys Gln Asp Ser Ser Leu Leu Ala Phe Asp Phe Ala Gly ValPhe Arg Cys Gln Asp Ser Ser Leu Leu Ala Phe Asp Phe Ala Gly Val
325 330 335325 330 335
cat cat agc gat atg gtg acg ctg ctg gcg gag tac ggt att gcc ctg 1656cat cat agc gat atg gtg acg ctg ctg gcg gag tac ggt att gcc ctg 1656
His His Ser Asp Met Val Thr Leu Leu Ala Glu Tyr Gly Ile Ala LeuHis His Ser Asp Met Val Thr Leu Leu Ala Glu Tyr Gly Ile Ala Leu
340 345 350340 345 350
cgg gcc ggg cag cat tgc gct cag ccg cta ctg gca gaa tta ggc gta 1704cgg gcc ggg cag cat tgc gct cag ccg cta ctg gca gaa tta ggc gta 1704
Arg Ala Gly Gln His Cys Ala Gln Pro Leu Leu Ala Glu Leu Gly ValArg Ala Gly Gln His Cys Ala Gln Pro Leu Leu Ala Glu Leu Gly Val
355 360 365355 360 365
acc ggc aca ctg cgc gcc tct ttt gcg cca tat aat aca aag agt gat 1752acc ggc aca ctg cgc gcc tct ttt gcg cca tat aat aca aag agt gat 1752
Thr Gly Thr Leu Arg Ala Ser Phe Ala Pro Tyr Asn Thr Lys Ser AspThr Gly Thr Leu Arg Ala Ser Phe Ala Pro Tyr Asn Thr Lys Ser Asp
370 375 380370 375 380
gtg gat gcg ctg gtg aat gcc gtt gac cgc gcg ctg gaa tta ttg gtg 1800gtg gat gcg ctg gtg aat gcc gtt gac cgc gcg ctg gaa tta ttg gtg 1800
Val Asp Ala Leu Val Asn Ala Val Asp Arg Ala Leu Glu Leu Leu ValVal Asp Ala Leu Val Asn Ala Val Asp Arg Ala Leu Glu Leu Leu Val
385 390 395 400385 390 395 400
gat taa acgcgtgcta gaggcatcaa ataaaacgaa aggctcagtc gaaagactgg 1856gat taa acgcgtgcta gaggcatcaa ataaaacgaa aggctcagtc gaaagactgg 1856
AspAsp
gcctttcgtt ttatctgttg tttgtcggtg aacgctctcc tgagtaggac aaatccgccg 1916gcctttcgtt ttatctgttg tttgtcggtg aacgctctcc tgagtaggac aaatccgccg 1916
ccctagacct aggggatata ttccgcttcc tcgctcactg actcgctacg ctcggtcgtt 1976ccctagacct aggggatata ttccgcttcc tcgctcactg actcgctacg ctcggtcgtt 1976
cgactgcggc gagcggaaat ggcttacgaa cggggcggag atttcctgga agatgccagg 2036cgactgcggc gagcggaaat ggcttacgaa cggggcggag atttcctgga agatgccagg 2036
aagatactta acagggaagt gagagggccg cggcaaagcc gtttttccat aggctccgcc 2096aagatactta acagggaagt gagagggccg cggcaaagcc gtttttccat aggctccgcc 2096
cccctgacaa gcatcacgaa atctgacgct caaatcagtg gtggcgaaac ccgacaggac 2156cccctgacaa gcatcacgaa atctgacgct caaatcagtg gtggcgaaac ccgacaggac 2156
tataaagata ccaggcgttt ccccctggcg gctccctcgt gcgctctcct gttcctgcct 2216tataaagata ccaggcgttt ccccctggcg gctccctcgt gcgctctcct gttcctgcct 2216
ttcggtttac cggtgtcatt ccgctgttat ggccgcgttt gtctcattcc acgcctgaca 2276ttcggtttac cggtgtcatt ccgctgttat ggccgcgttt gtctcattcc acgcctgaca 2276
ctcagttccg ggtaggcagt tcgctccaag ctggactgta tgcacgaacc ccccgttcag 2336ctcagttccg ggtaggcagt tcgctccaag ctggactgta tgcacgaacc ccccgttcag 2336
tccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccgga aagacatgca 2396tccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccgga aagacatgca 2396
aaagcaccac tggcagcagc cactggtaat tgatttagag gagttagtct tgaagtcatg 2456aaagcaccac tggcagcagc cactggtaat tgatttagag gagttagtct tgaagtcatg 2456
cgccggttaa ggctaaactg aaaggacaag ttttggtgac tgcgctcctc caagccagtt 2516cgccggttaa ggctaaactg aaaggacaag ttttggtgac tgcgctcctc caagccagtt 2516
acctcggttc aaagagttgg tagctcagag aaccttcgaa aaaccgccct gcaaggcggt 2576acctcggttc aaagagttgg tagctcagag aaccttcgaa aaaccgccct gcaaggcggt 2576
tttttcgttt tcagagcaag agattacgcg cagaccaaaa cgatctcaag aagatcatct 2636tttttcgttt tcagagcaag agattacgcg cagaccaaaa cgatctcaag aagatcatct 2636
tattaatcag ataaaatatt tctagatttc agtgcaattt atctcttcaa atgtagcacc 2696tattaatcag ataaaatatt tctagatttc agtgcaattt atctcttcaa atgtagcacc 2696
tgaagtcagc cccatacgat ataagttgtt actagtgctt ggattctcac caataaaaaa 2756tgaagtcagc cccatacgat ataagttgtt actagtgctt ggattctcac caataaaaaa 2756
cgcccggcgg caaccgagcg ttctgaacaa atccagatgg agttctgagg tcattactgg 2816cgcccggcgg caaccgagcg ttctgaacaa atccagatgg agttctgagg tcattactgg 2816
atctatcaac aggagtccaa gcgagctctc gaaccccaga gtcccgctca gaagaactcg 2876atctatcaac aggagtccaa gcgagctctc gaaccccaga gtcccgctca gaagaactcg 2876
tcaagaaggc gatagaaggc gatgcgctgc gaatcgggag cggcgatacc gtaaagcacg 2936tcaagaaggc gatagaaggc gatgcgctgc gaatcgggag cggcgatacc gtaaagcacg 2936
aggaagcggt cagcccattc gccgccaagc tcttcagcaa tatcacgggt agccaacgct 2996aggaagcggt cagcccattc gccgccaagc tcttcagcaa tatcacgggt agccaacgct 2996
atgtcctgat agcggtccgc cacacccagc cggccacagt cgatgaatcc agaaaagcgg 3056atgtcctgat agcggtccgc cacacccagc cggccacagt cgatgaatcc agaaaagcgg 3056
ccattttcca ccatgatatt cggcaagcag gcatcgccat gggtcacgac gagatcctcg 3116ccattttcca ccatgatatt cggcaagcag gcatcgccat gggtcacgac gagatcctcg 3116
ccgtcgggca tgcgcgcctt gagcctggcg aacagttcgg ctggcgcgag cccctgatgc 3176ccgtcgggca tgcgcgcctt gagcctggcg aacagttcgg ctggcgcgag cccctgatgc 3176
tcttcgtcca gatcatcctg atcgacaaga ccggcttcca tccgagtacg tgctcgctcg 3236tcttcgtcca gatcatcctg atcgacaaga ccggcttcca tccgagtacg tgctcgctcg 3236
atgcgatgtt tcgcttggtg gtcgaatggg caggtagccg gatcaagcgt atgcagccgc 3296atgcgatgtt tcgcttggtg gtcgaatggg caggtagccg gatcaagcgt atgcagccgc 3296
cgcattgcat cagccatgat ggatactttc tcggcaggag caaggtgaga tgacaggaga 3356cgcattgcat cagccatgat ggatactttc tcggcaggag caaggtgaga tgacaggaga 3356
tcctgccccg gcacttcgcc caatagcagc cagtcccttc ccgcttcagt gacaacgtcg 3416tcctgccccg gcacttcgcc caatagcagc cagtcccttc ccgcttcagt gacaacgtcg 3416
agcacagctg cgcaaggaac gcccgtcgtg gccagccacg atagccgcgc tgcctcgtcc 3476agcacagctg cgcaaggaac gcccgtcgtg gccagccacg atagccgcgc tgcctcgtcc 3476
tgcagttcat tcagggcacc ggacaggtcg gtcttgacaa aaagaaccgg gcgcccctgc 3536tgcagttcat tcagggcacc ggacaggtcg gtcttgacaa aaagaaccgg gcgcccctgc 3536
gctgacagcc ggaacacggc ggcatcagag cagccgattg tctgttgtgc ccagtcatag 3596gctgacagcc ggaacacggc ggcatcagag cagccgattg tctgttgtgc ccagtcatag 3596
ccgaatagcc tctccaccca agcggccgga gaacctgcgt gcaatccatc ttgttcaatc 3656ccgaatagcc tctccaccca agcggccgga gaacctgcgt gcaatccatc ttgttcaatc 3656
atgcgaaacg atcctcatcc tgtctcttga tcagatcttg atcccctgcg ccatcagatc 3716atgcgaaacg atcctcatcc tgtctcttga tcagatcttg atcccctgcg ccatcagatc 3716
cttggcggca agaaagccat ccagtttact ttgcagggct tcccaacctt accagagggc 3776cttggcggca agaaagccat ccagtttact ttgcagggct tcccaacctt accagagggc 3776
gccccagctg gcaattcc 3794gccccagctg gcaattcc 3794
<210> 12<210> 12
<211> 401<211> 401
<212> PRT<212> PRT
<213> Escherichia coli<213> Escherichia coli
<400> 12<400> 12
Met Asn Val Phe Asn Pro Ala Gln Phe Arg Ala Gln Phe Pro Ala LeuMet Asn Val Phe Asn Pro Ala Gln Phe Arg Ala Gln Phe Pro Ala Leu
1 5 10 151 5 10 15
Gln Asp Ala Gly Val Tyr Leu Asp Ser Ala Ala Thr Ala Leu Lys ProGln Asp Ala Gly Val Tyr Leu Asp Ser Ala Ala Thr Ala Leu Lys Pro
20 25 3020 25 30
Glu Ala Val Val Glu Ala Thr Gln Gln Phe Tyr Ser Leu Ser Ala GlyGlu Ala Val Val Glu Ala Thr Gln Gln Phe Tyr Ser Leu Ser Ala Gly
35 40 4535 40 45
Asn Val His Arg Ser Gln Phe Ala Glu Ala Gln Arg Leu Thr Ala ArgAsn Val His Arg Ser Gln Phe Ala Glu Ala Gln Arg Leu Thr Ala Arg
50 55 6050 55 60
Tyr Glu Ala Ala Arg Glu Lys Val Ala Gln Leu Leu Asn Ala Pro AspTyr Glu Ala Ala Arg Glu Lys Val Ala Gln Leu Leu Asn Ala Pro Asp
65 70 75 8065 70 75 80
Asp Lys Thr Ile Val Trp Thr Arg Gly Thr Thr Glu Ser Ile Asn MetAsp Lys Thr Ile Val Trp Thr Arg Gly Thr Thr Glu Ser Ile Asn Met
85 90 9585 90 95
Val Ala Gln Cys Tyr Ala Arg Pro Arg Leu Gln Pro Gly Asp Glu IleVal Ala Gln Cys Tyr Ala Arg Pro Arg Leu Gln Pro Gly Asp Glu Ile
100 105 110100 105 110
Ile Val Ser Val Ala Glu His His Ala Asn Leu Val Pro Trp Leu MetIle Val Ser Val Ala Glu His His Ala Asn Leu Val Pro Trp Leu Met
115 120 125115 120 125
Val Ala Gln Gln Thr Gly Ala Lys Val Val Lys Leu Pro Leu Asn AlaVal Ala Gln Gln Thr Gly Ala Lys Val Val Lys Leu Pro Leu Asn Ala
130 135 140130 135 140
Gln Arg Leu Pro Asp Val Asp Leu Leu Pro Glu Leu Ile Thr Pro ArgGln Arg Leu Pro Asp Val Asp Leu Leu Pro Glu Leu Ile Thr Pro Arg
145 150 155 160145 150 155 160
Ser Arg Ile Leu Ala Leu Gly Gln Met Ser Asn Val Thr Gly Gly CysSer Arg Ile Leu Ala Leu Gly Gln Met Ser Asn Val Thr Gly Gly Cys
165 170 175165 170 175
Pro Asp Leu Ala Arg Ala Ile Thr Phe Ala His Ser Ala Gly Met ValPro Asp Leu Ala Arg Ala Ile Thr Phe Ala His Ser Ala Gly Met Val
180 185 190180 185 190
Val Met Val Asp Gly Ala Gln Gly Ala Val His Phe Pro Ala Asp ValVal Met Val Asp Gly Ala Gln Gly Ala Val His Phe Pro Ala Asp Val
195 200 205195 200 205
Gln Gln Leu Asp Ile Asp Phe Tyr Ala Phe Ser Gly His Lys Leu TyrGln Gln Leu Asp Ile Asp Phe Tyr Ala Phe Ser Gly His Lys Leu Tyr
210 215 220210 215 220
Gly Pro Thr GlyIle Gly Val Leu Tyr Gly Lys Ser Glu Leu Leu GluGly Pro Thr GlyIle Gly Val Leu Tyr Gly Lys Ser Glu Leu Leu Glu
225 230 235 240225 230 235 240
Ala Met Ser Pro Trp Leu Gly Gly Gly Lys Met Val His Glu Val SerAla Met Ser Pro Trp Leu Gly Gly Gly Lys Met Val His Glu Val Ser
245 250 255245 250 255
Phe Asp Gly Phe Thr Thr Gln Ser Ala Pro Trp Lys Leu Glu Ala GlyPhe Asp Gly Phe Thr Thr Gln Ser Ala Pro Trp Lys Leu Glu Ala Gly
260 265 270260 265 270
Thr Pro Asn Val Ala Gly Val Ile Gly Leu Ser Ala Ala Leu Glu TrpThr Pro Asn Val Ala Gly Val Ile Gly Leu Ser Ala Ala Leu Glu Trp
275 280 285275 280 285
Leu Ala Asp Tyr Asp Ile Asn Gln Ala Glu Ser Trp Ser Arg Ser LeuLeu Ala Asp Tyr Asp Ile Asn Gln Ala Glu Ser Trp Ser Arg Ser Leu
290 295 300290 295 300
Ala Thr Leu Ala Glu Asp Ala Leu Ala Lys Arg Pro Gly Phe Arg SerAla Thr Leu Ala Glu Asp Ala Leu Ala Lys Arg Pro Gly Phe Arg Ser
305 310 315 320305 310 315 320
Phe Arg Cys Gln Asp Ser Ser Leu Leu Ala Phe Asp Phe Ala Gly ValPhe Arg Cys Gln Asp Ser Ser Leu Leu Ala Phe Asp Phe Ala Gly Val
325 330 335325 330 335
His His Ser Asp Met Val Thr Leu Leu Ala Glu Tyr Gly Ile Ala LeuHis His Ser Asp Met Val Thr Leu Leu Ala Glu Tyr Gly Ile Ala Leu
340 345 350340 345 350
Arg Ala Gly Gln His Cys Ala Gln Pro Leu Leu Ala Glu Leu Gly ValArg Ala Gly Gln His Cys Ala Gln Pro Leu Leu Ala Glu Leu Gly Val
355 360 365355 360 365
Thr Gly Thr Leu Arg Ala Ser Phe Ala Pro Tyr Asn Thr Lys Ser AspThr Gly Thr Leu Arg Ala Ser Phe Ala Pro Tyr Asn Thr Lys Ser Asp
370 375 380370 375 380
Val Asp Ala Leu Val Asn Ala Val Asp Arg Ala Leu Glu Leu Leu ValVal Asp Ala Leu Val Asn Ala Val Asp Arg Ala Leu Glu Leu Leu Val
385 390 395 400385 390 395 400
AspAsp
<210> 13<210> 13
<211> 4975<211> 4975
<212> DNA<212> DNA
<213> Escherichia coli<213> Escherichia coli
<220><220>
<221> CDS<221> CDS
<222> (530)..(1684)<222> (530) .. (1684)
<220><220>
<221> CDS<221> CDS
<222> (1782)..(2987)<222> (1782) .. (2987)
<400> 13<400> 13
gacgtctgtg tggaattgtg agcggataac aatttcacac agggccctcg gacaccgagg 60gacgtctgtg tggaattgtg agcggataac aatttcacac agggccctcg gacaccgagg 60
agaatgtcaa gaggcgaaca cacaacgtct tggagcgcca gaggaggaac gagctaaaac 120agaatgtcaa gaggcgaaca cacaacgtct tggagcgcca gaggaggaac gagctaaaac 120
ggagcttttt tgccctgcgt gaccagatcc cggagttgga aaacaatgaa aaggccccca 180ggagcttttt tgccctgcgt gaccagatcc cggagttgga aaacaatgaa aaggccccca 180
aggtagttat ccttaaaaaa gccacagcat acatcctgtc cgtccaagca gaggagcaaa 240aggtagttat ccttaaaaaa gccacagcat acatcctgtc cgtccaagca gaggagcaaa 240
agctcatttc tgaagaggac ttgttgcgga aacgacgaga acagttgaaa cacaaacttg 300agctcatttc tgaagaggac ttgttgcgga aacgacgaga acagttgaaa cacaaacttg 300
aacagctacg gaactcttgt gcgtaaggaa aagtaaggaa aacgattcct tctaacagaa 360aacagctacg gaactcttgt gcgtaaggaa aagtaaggaa aacgattcct tctaacagaa 360
atgtcctgag caatcaccta tgaactgtcg actcgagata gcatttttat ccataagatt 420atgtcctgag caatcaccta tgaactgtcg actcgagata gcatttttat ccataagatt 420
agccgatcct aaggtttaca attgtgagcg ctcacaatta tgatagattc aattgtgagc 480agccgatcct aaggtttaca attgtgagcg ctcacaatta tgatagattc aattgtgagc 480
ggataacaat ttcacacacg ctagcggtac caaagaggag aaattaact atg gca aaa 538ggataacaat ttcacacacg ctagcggtac caaagaggag aaattaact atg gca aaa 538
Met Ala LysMet ala lys
1One
cac ctt ttt acg tcc gag tcc gtc tct gaa ggg cat cct gac aaa att 586cac ctt ttt acg tcc gag tcc gtc tct gaa ggg cat cct gac aaa att 586
His Leu Phe Thr Ser Glu Ser Val Ser Glu Gly His Pro Asp Lys IleHis Leu Phe Thr Ser Glu Ser Val Ser Glu Gly His Pro Asp Lys Ile
5 10 155 10 15
gct gac caa att tct gat gcc gtt tta gac gcg atc ctc gaa cag gat 634gct gac caa att tct gat gcc gtt tta gac gcg atc ctc gaa cag gat 634
Ala Asp Gln Ile Ser Asp Ala Val Leu Asp Ala Ile Leu Glu Gln AspAla Asp Gln Ile Ser Asp Ala Val Leu Asp Ala Ile Leu Glu Gln Asp
20 25 30 3520 25 30 35
ccg aaa gca cgc gtt gct tgc gaa acc tac gta aaa acc ggc atg gtt 682ccg aaa gca cgc gtt gct tgc gaa acc tac gta aaa acc ggc atg gtt 682
Pro Lys Ala Arg Val Ala Cys Glu Thr Tyr Val Lys Thr Gly Met ValPro Lys Ala Arg Val Ala Cys Glu Thr Tyr Val Lys Thr Gly Met Val
40 45 5040 45 50
tta gtt ggc ggc gaa atc acc acc agc gcc tgg gta gac atc gaa gag 730tta gtt ggc ggc gaa atc acc acc agc gcc tgg gta gac atc gaa gag 730
Leu Val Gly Gly Glu Ile Thr Thr Ser Ala Trp Val Asp Ile Glu GluLeu Val Gly Gly Glu Ile Thr Thr Ser Ala Trp Val Asp Ile Glu Glu
55 60 6555 60 65
atc acc cgt aac acc gtt cgc gaa att ggc tat gtg cat tcc gac atg 778atc acc cgt aac acc gtt cgc gaa att ggc tat gtg cat tcc gac atg 778
Ile Thr Arg Asn Thr Val Arg Glu Ile Gly Tyr Val His Ser Asp MetIle Thr Arg Asn Thr Val Arg Glu Ile Gly Tyr Val His Ser Asp Met
70 75 8070 75 80
ggc ttt gac gct aac tcc tgt gcg gtt ctg agc gct atc ggc aaa cag 826ggc ttt gac gct aac tcc tgt gcg gtt ctg agc gct atc ggc aaa cag 826
Gly Phe Asp Ala Asn Ser Cys Ala Val Leu Ser Ala Ile Gly Lys GlnGly Phe Asp Ala Asn Ser Cys Ala Val Leu Ser Ala Ile Gly Lys Gln
85 90 9585 90 95
tct cct gac atc aac cag ggc gtt gac cgt gcc gat ccg ctg gaa cag 874tct cct gac atc aac cag ggc gtt gac cgt gcc gat ccg ctg gaa cag 874
Ser Pro Asp Ile Asn Gln Gly Val Asp Arg Ala Asp Pro Leu Glu GlnSer Pro Asp Ile Asn Gln Gly Val Asp Arg Ala Asp Pro Leu Glu Gln
100 105 110 115100 105 110 115
ggc gcg ggt gac cag ggt ctg atg ttt ggc tac gca act aat gaa acc 922ggc gcg ggt gac cag ggt ctg atg ttt ggc tac gca act aat gaa acc 922
Gly Ala Gly Asp Gln Gly Leu Met Phe Gly Tyr Ala Thr Asn Glu ThrGly Ala Gly Asp Gln Gly Leu Met Phe Gly Tyr Ala Thr Asn Glu Thr
120 125 130120 125 130
gac gtg ctg atg cca gca cct atc acc tat gca cac cgt ctg gta cag 970gac gtg ctg atg cca gca cct atc acc tat gca cac cgt ctg gta cag 970
Asp Val Leu Met Pro Ala Pro Ile Thr Tyr Ala His Arg Leu Val GlnAsp Val Leu Met Pro Ala Pro Ile Thr Tyr Ala His Arg Leu Val Gln
135 140 145135 140 145
cgt cag gct gaa gtg cgt aaa aac ggc act ctg ccg tgg ctg cgc ccg 1018cgt cag gct gaa gtg cgt aaa aac ggc act ctg ccg tgg ctg cgc ccg 1018
Arg Gln Ala Glu Val Arg Lys Asn Gly Thr Leu Pro Trp Leu Arg ProArg Gln Ala Glu Val Arg Lys Asn Gly Thr Leu Pro Trp Leu Arg Pro
150 155 160150 155 160
gac gcg aaa agc cag gtg act ttt cag tat gac gac ggc aaa atc gtt 1066gac gcg aaa agc cag gtg act ttt cag tat gac gac ggc aaa atc gtt 1066
Asp Ala Lys Ser Gln Val Thr Phe Gln Tyr Asp Asp Gly Lys Ile ValAsp Ala Lys Ser Gln Val Thr Phe Gln Tyr Asp Asp Gly Lys Ile Val
165 170 175165 170 175
ggt atc gat gct gtc gtg ctt tcc act cag cac tct gaa gag atc gac 1114ggt atc gat gct gtc gtg ctt tcc act cag cac tct gaa gag atc gac 1114
Gly Ile Asp Ala Val Val Leu Ser Thr Gln His Ser Glu Glu Ile AspGly Ile Asp Ala Val Val Leu Ser Thr Gln His Ser Glu Glu Ile Asp
180 185 190 195180 185 190 195
cag aaa tcg ctg caa gaa gcg gta atg gaa gag atc atc aag cca att 1162cag aaa tcg ctg caa gaa gcg gta atg gaa gag atc atc aag cca att 1162
Gln Lys Ser Leu Gln Glu Ala Val Met Glu Glu Ile Ile Lys Pro IleGln Lys Ser Leu Gln Glu Ala Val Met Glu Glu Ile Ile Lys Pro Ile
200 205 210200 205 210
ctg ccc gct gaa tgg ctg act tct gcc acc aaa ttc ttc atc aac ccg 1210ctg ccc gct gaa tgg ctg act tct gcc acc aaa ttc ttc atc aac ccg 1210
Leu Pro Ala Glu Trp Leu Thr Ser Ala Thr Lys Phe Phe Ile Asn ProLeu Pro Ala Glu Trp Leu Thr Ser Ala Thr Lys Phe Phe Ile Asn Pro
215 220 225215 220 225
acc ggt cgt ttc gtt atc ggt ggc cca atg ggt gac tgc ggt ctg act 1258acc ggt cgt ttc gtt atc ggt ggc cca atg ggt gac tgc ggt ctg act 1258
Thr Gly Arg Phe Val Ile Gly Gly Pro Met Gly Asp Cys Gly Leu ThrThr Gly Arg Phe Val Ile Gly Gly Pro Met Gly Asp Cys Gly Leu Thr
230 235 240230 235 240
ggt cgt aaa att atc gtt gat acc tac ggc ggc atg gcg cgt cac ggt 1306ggt cgt aaa att atc gtt gat acc tac ggc ggc atg gcg cgt cac ggt 1306
Gly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly Met Ala Arg His GlyGly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly Met Ala Arg His Gly
245 250 255245 250 255
ggc ggt gca ttc tct ggt aaa gat cca tca aaa gtg gac cgt tcc gca 1354ggc ggt gca ttc tct ggt aaa gat cca tca aaa gtg gac cgt tcc gca 1354
Gly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val Asp Arg Ser AlaGly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val Asp Arg Ser Ala
260 265 270 275260 265 270 275
gcc tac gca gca cgt tat gtc gcg aaa aac atc gtt gct gct ggc ctg 1402gcc tac gca gca cgt tat gtc gcg aaa aac atc gtt gct gct ggc ctg 1402
Ala Tyr Ala Ala Arg Tyr Val Ala Lys Asn Ile Val Ala Ala Gly LeuAla Tyr Ala Ala Arg Tyr Val Ala Lys Asn Ile Val Ala Ala Gly Leu
280 285 290280 285 290
gcc gat cgt tgt gaa att cag gtt tcc tac gca atc ggc gtg gct gaa 1450gcc gat cgt tgt gaa att cag gtt tcc tac gca atc ggc gtg gct gaa 1450
Ala Asp Arg Cys Glu Ile Gln Val Ser Tyr Ala Ile Gly Val Ala GluAla Asp Arg Cys Glu Ile Gln Val Ser Tyr Ala Ile Gly Val Ala Glu
295 300 305295 300 305
ccg acc tcc atc atg gta gaa act ttc ggt act gag aaa gtg cct tct 1498ccg acc tcc atc atg gta gaa act ttc ggt act gag aaa gtg cct tct 1498
Pro Thr Ser Ile Met Val Glu Thr Phe Gly Thr Glu Lys Val Pro SerPro Thr Ser Ile Met Val Glu Thr Phe Gly Thr Glu Lys Val Pro Ser
310 315 320310 315 320
gaa caa ctg acc ctg ctg gta cgt gag ttc ttc gac ctg cgc cca tac 1546gaa caa ctg acc ctg ctg gta cgt gag ttc ttc gac ctg cgc cca tac 1546
Glu Gln Leu Thr Leu Leu Val Arg Glu Phe Phe Asp Leu Arg Pro TyrGlu Gln Leu Thr Leu Leu Val Arg Glu Phe Phe Asp Leu Arg Pro Tyr
325 330 335325 330 335
ggt ctg att cag atg ctg gat ctg ctg cac ccg atc tac aaa gaa acc 1594ggt ctg att cag atg ctg gat ctg ctg cac ccg atc tac aaa gaa acc 1594
Gly Leu Ile Gln Met Leu Asp Leu Leu His Pro Ile Tyr Lys Glu ThrGly Leu Ile Gln Met Leu Asp Leu Leu His Pro Ile Tyr Lys Glu Thr
340 345 350 355340 345 350 355
gca gca tac ggt cac ttt ggt cgt gaa cat ttc ccg tgg gaa aaa acc 1642gca gca tac ggt cac ttt ggt cgt gaa cat ttc ccg tgg gaa aaa acc 1642
Ala Ala Tyr Gly His Phe Gly Arg Glu His Phe Pro Trp Glu Lys ThrAla Ala Tyr Gly His Phe Gly Arg Glu His Phe Pro Trp Glu Lys Thr
360 365 370360 365 370
gac aaa gcg cag ctg ctg cgc gat gct gcc ggt ctg aag taa 1684gac aaa gcg cag ctg ctg cgc gat gct gcc ggt ctg aag taa 1684
Asp Lys Ala Gln Leu Leu Arg Asp Ala Ala Gly Leu LysAsp Lys Ala Gln Leu Leu Arg Asp Ala Ala Gly Leu Lys
375 380 385375 380 385
tcggtaccgg gccccccctc gaggtcgacg gtatcgataa gcttgatatc gaattcctgc 1744tcggtaccgg gccccccctc gaggtcgacg gtatcgataa gcttgatatc gaattcctgc 1744
agcccggggg atcccatggt acgcgtcgag gagtacc atg aac gtt ttt aat ccc 1799agcccggggg atcccatggt acgcgtcgag gagtacc atg aac gtt ttt aat ccc 1799
Met Asn Val Phe Asn ProMet Asn Val Phe Asn Pro
390390
gcg cag ttt cgc gcc cag ttt ccc gca cta cag gat gcg ggc gtc tat 1847gcg cag ttt cgc gcc cag ttt ccc gca cta cag gat gcg ggc gtc tat 1847
Ala Gln Phe Arg Ala Gln Phe Pro Ala Leu Gln Asp Ala Gly Val TyrAla Gln Phe Arg Ala Gln Phe Pro Ala Leu Gln Asp Ala Gly Val Tyr
395 400 405395 400 405
ctc gac agc gcc gcg acc gcg ctt aaa cct gaa gcc gtg gtt gaa gcc 1895ctc gac agc gcc gcg acc gcg ctt aaa cct gaa gcc gtg gtt gaa gcc 1895
Leu Asp Ser Ala Ala Thr Ala Leu Lys Pro Glu Ala Val Val Glu AlaLeu Asp Ser Ala Ala Thr Ala Leu Lys Pro Glu Ala Val Val Glu Ala
410 415 420410 415 420
acc caa cag ttt tac agt ctg agc gcc gga aac gtc cat cgc agc cag 1943acc caa cag ttt tac agt ctg agc gcc gga aac gtc cat cgc agc cag 1943
Thr Gln Gln Phe Tyr Ser Leu Ser Ala Gly Asn Val His Arg Ser GlnThr Gln Gln Phe Tyr Ser Leu Ser Ala Gly Asn Val His Arg Ser Gln
425 430 435425 430 435
ttt gcc gaa gcc caa cgc ctg acc gcg cgt tat gaa gct gca cga gag 1991ttt gcc gaa gcc caa cgc ctg acc gcg cgt tat gaa gct gca cga gag 1991
Phe Ala Glu Ala Gln Arg Leu Thr Ala Arg Tyr Glu Ala Ala Arg GluPhe Ala Glu Ala Gln Arg Leu Thr Ala Arg Tyr Glu Ala Ala Arg Glu
440 445 450 455440 445 450 455
aaa gtg gcg caa tta ctg aat gca ccg gat gat aaa act atc gtc tgg 2039aaa gtg gcg caa tta ctg aat gca ccg gat gat aaa act atc gtc tgg 2039
Lys Val Ala Gln Leu Leu Asn Ala Pro Asp Asp Lys Thr Ile Val TrpLys Val Ala Gln Leu Leu Asn Ala Pro Asp Asp Lys Thr Ile Val Trp
460 465 470460 465 470
acg cgc ggc acc act gaa tcc atc aac atg gtg gca caa tgc tat gcg 2087acg cgc ggc acc act gaa tcc atc aac atg gtg gca caa tgc tat gcg 2087
Thr Arg Gly Thr Thr Glu Ser Ile Asn Met Val Ala Gln Cys Tyr AlaThr Arg Gly Thr Thr Glu Ser Ile Asn Met Val Ala Gln Cys Tyr Ala
475 480 485475 480 485
cgt ccg cgt ctg caa ccg ggc gat gag att att gtc agc gtg gca gaa 2135cgt ccg cgt ctg caa ccg ggc gat gag att att gtc agc gtg gca gaa 2135
Arg Pro Arg Leu Gln Pro Gly Asp Glu Ile Ile Val Ser Val Ala GluArg Pro Arg Leu Gln Pro Gly Asp Glu Ile Ile Val Ser Val Ala Glu
490 495 500490 495 500
cac cac gcc aac ctc gtc ccc tgg ctg atg gtc gcc caa caa act gga 2183cac cac gcc aac ctc gtc ccc tgg ctg atg gtc gcc caa caa act gga 2183
His His Ala Asn Leu Val Pro Trp Leu Met Val Ala Gln Gln Thr GlyHis His Ala Asn Leu Val Pro Trp Leu Met Val Ala Gln Gln Thr Gly
505 510 515505 510 515
gcc aaa gtg gtg aaa ttg ccg ctt aat gcg cag cga ctg ccg gat gtc 2231gcc aaa gtg gtg aaa ttg ccg ctt aat gcg cag cga ctg ccg gat gtc 2231
Ala Lys Val Val Lys Leu Pro Leu Asn Ala Gln Arg Leu Pro Asp ValAla Lys Val Val Lys Leu Pro Leu Asn Ala Gln Arg Leu Pro Asp Val
520 525 530 535520 525 530 535
gat ttg ttg cca gaa ctg att act ccc cgt agt cgg att ctg gcg ttg 2279gat ttg ttg cca gaa ctg att act ccc cgt agt cgg att ctg gcg ttg 2279
Asp Leu Leu Pro Glu Leu Ile Thr Pro Arg Ser Arg Ile Leu Ala LeuAsp Leu Leu Pro Glu Leu Ile Thr Pro Arg Ser Arg Ile Leu Ala Leu
540 545 550540 545 550
ggt cag atg tcg aac gtt act ggc ggt tgc ccg gat ctg gcg cga gcg 2327ggt cag atg tcg aac gtt act ggc ggt tgc ccg gat ctg gcg cga gcg 2327
Gly Gln Met Ser Asn Val Thr Gly Gly Cys Pro Asp Leu Ala Arg AlaGly Gln Met Ser Asn Val Thr Gly Gly Cys Pro Asp Leu Ala Arg Ala
555 560 565555 560 565
att acc ttt gct cat tca gcc ggg atg gtg gtg atg gtt gat ggt gct 2375att acc ttt gct cat tca gcc ggg atg gtg gtg atg gtt gat ggt gct 2375
Ile Thr Phe Ala His Ser Ala Gly Met Val Val Met Val Asp Gly AlaIle Thr Phe Ala His Ser Ala Gly Met Val Val Met Val Asp Gly Ala
570 575 580570 575 580
cag ggg gca gtg cat ttc ccc gcg gat gtt cag caa ctg gat att gat 2423cag ggg gca gtg cat ttc ccc gcg gat gtt cag caa ctg gat att gat 2423
Gln Gly Ala Val His Phe Pro Ala Asp Val Gln Gln Leu Asp Ile AspGln Gly Ala Val His Phe Pro Ala Asp Val Gln Gln Leu Asp Ile Asp
585 590 595585 590 595
ttc tat gct ttt tca ggt cac aaa ctg tat ggc ccg aca ggt atc ggc 2471ttc tat gct ttt tca ggt cac aaa ctg tat ggc ccg aca ggt atc ggc 2471
Phe Tyr Ala Phe Ser Gly His Lys Leu Tyr Gly Pro Thr Gly Ile GlyPhe Tyr Ala Phe Ser Gly His Lys Leu Tyr Gly Pro Thr Gly Ile Gly
600 605 610 615600 605 610 615
gtg ctg tat ggt aaa tca gaa ctg ctg gag gcg atg tcg ccc tgg ctg 2519gtg ctg tat ggt aaa tca gaa ctg ctg gag gcg atg tcg ccc tgg ctg 2519
Val Leu Tyr Gly Lys Ser Glu Leu Leu Glu Ala Met Ser Pro Trp LeuVal Leu Tyr Gly Lys Ser Glu Leu Leu Glu Ala Met Ser Pro Trp Leu
620 625 630620 625 630
ggc ggc ggc aaa atg gtt cac gaa gtg agt ttt gac ggc ttc acg act 2567ggc ggc ggc aaa atg gtt cac gaa gtg agt ttt gac ggc ttc acg act 2567
Gly Gly Gly Lys Met Val His Glu Val Ser Phe Asp Gly Phe Thr ThrGly Gly Gly Lys Met Val His Glu Val Ser Phe Asp Gly Phe Thr Thr
635 640 645635 640 645
caa tct gcg ccg tgg aaa ctg gaa gct gga acg cca aat gtc gct ggt 2615caa tct gcg ccg tgg aaa ctg gaa gct gga acg cca aat gtc gct ggt 2615
Gln Ser Ala Pro Trp Lys Leu Glu Ala Gly Thr Pro Asn Val Ala GlyGln Ser Ala Pro Trp Lys Leu Glu Ala Gly Thr Pro Asn Val Ala Gly
650 655 660650 655 660
gtc ata gga tta agc gcg gcg ctg gaa tgg ctg gca gat tac gat atc 2663gtc ata gga tta agc gcg gcg ctg gaa tgg ctg gca gat tac gat atc 2663
Val Ile Gly Leu Ser Ala Ala Leu Glu Trp Leu Ala Asp Tyr Asp IleVal Ile Gly Leu Ser Ala Ala Leu Glu Trp Leu Ala Asp Tyr Asp Ile
665 670 675665 670 675
aac cag gcc gaa agc tgg agc cgt agc tta gca acg ctg gcg gaa gat 2711aac cag gcc gaa agc tgg agc cgt agc tta gca acg ctg gcg gaa gat 2711
Asn Gln Ala Glu Ser Trp Ser Arg Ser Leu Ala Thr Leu Ala Glu AspAsn Gln Ala Glu Ser Trp Ser Arg Ser Leu Ala Thr Leu Ala Glu Asp
680 685 690 695680 685 690 695
gcg ctg gcg aaa cgt ccc ggc ttt cgt tca ttc cgc tgc cag gat tcc 2759gcg ctg gcg aaa cgt ccc ggc ttt cgt tca ttc cgc tgc cag gat tcc 2759
Ala Leu Ala Lys Arg Pro Gly Phe Arg Ser Phe Arg Cys Gln Asp SerAla Leu Ala Lys Arg Pro Gly Phe Arg Ser Phe Arg Cys Gln Asp Ser
700 705 710700 705 710
agc ctg ctg gcc ttt gat ttt gct ggc gtt cat cat agc gat atg gtg 2807agc ctg ctg gcc ttt gat ttt gct ggc gtt cat cat agc gat atg gtg 2807
Ser Leu Leu Ala Phe Asp Phe Ala Gly Val His His Ser Asp Met ValSer Leu Leu Ala Phe Asp Phe Ala Gly Val His His Ser Asp Met Val
715 720 725715 720 725
acg ctg ctg gcg gag tac ggt att gcc ctg cgg gcc ggg cag cat tgc 2855acg ctg ctg gcg gag tac ggt att gcc ctg cgg gcc ggg cag cat tgc 2855
Thr Leu Leu Ala Glu Tyr Gly Ile Ala Leu Arg Ala Gly Gln His CysThr Leu Leu Ala Glu Tyr Gly Ile Ala Leu Arg Ala Gly Gln His Cys
730 735 740730 735 740
gct cag ccg cta ctg gca gaa tta ggc gta acc ggc aca ctg cgc gcc 2903gct cag ccg cta ctg gca gaa tta ggc gta acc ggc aca ctg cgc gcc 2903
Ala Gln Pro Leu Leu Ala Glu Leu Gly Val Thr Gly Thr Leu Arg AlaAla Gln Pro Leu Leu Ala Glu Leu Gly Val Thr Gly Thr Leu Arg Ala
745 750 755745 750 755
tct ttt gcg cca tat aat aca aag agt gat gtg gat gcg ctg gtg aat 2951tct ttt gcg cca tat aat aca aag agt gat gtg gat gcg ctg gtg aat 2951
Ser Phe Ala Pro Tyr Asn Thr Lys Ser Asp Val Asp Ala Leu Val AsnSer Phe Ala Pro Tyr Asn Thr Lys Ser Asp Val Asp Ala Leu Val Asn
760 765 770 775760 765 770 775
gcc gtt gac cgc gcg ctg gaa tta ttg gtg gat taa acgcgtgcta 2997gcc gtt gac cgc gcg ctg gaa tta ttg gtg gat taa acgcgtgcta 2997
Ala Val Asp Arg Ala Leu Glu Leu Leu Val AspAla Val Asp Arg Ala Leu Glu Leu Leu Val Asp
780 785780 785
gaggcatcaa ataaaacgaa aggctcagtc gaaagactgg gcctttcgtt ttatctgttg 3057gaggcatcaa ataaaacgaa aggctcagtc gaaagactgg gcctttcgtt ttatctgttg 3057
tttgtcggtg aacgctctcc tgagtaggac aaatccgccg ccctagacct aggggatata 3117tttgtcggtg aacgctctcc tgagtaggac aaatccgccg ccctagacct aggggatata 3117
ttccgcttcc tcgctcactg actcgctacg ctcggtcgtt cgactgcggc gagcggaaat 3177ttccgcttcc tcgctcactg actcgctacg ctcggtcgtt cgactgcggc gagcggaaat 3177
ggcttacgaa cggggcggag atttcctgga agatgccagg aagatactta acagggaagt 3237ggcttacgaa cggggcggag atttcctgga agatgccagg aagatactta acagggaagt 3237
gagagggccg cggcaaagcc gtttttccat aggctccgcc cccctgacaa gcatcacgaa 3297gagagggccg cggcaaagcc gtttttccat aggctccgcc cccctgacaa gcatcacgaa 3297
atctgacgct caaatcagtg gtggcgaaac ccgacaggac tataaagata ccaggcgttt 3357atctgacgct caaatcagtg gtggcgaaac ccgacaggac tataaagata ccaggcgttt 3357
ccccctggcg gctccctcgt gcgctctcct gttcctgcct ttcggtttac cggtgtcatt 3417ccccctggcg gctccctcgt gcgctctcct gttcctgcct ttcggtttac cggtgtcatt 3417
ccgctgttat ggccgcgttt gtctcattcc acgcctgaca ctcagttccg ggtaggcagt 3477ccgctgttat ggccgcgttt gtctcattcc acgcctgaca ctcagttccg ggtaggcagt 3477
tcgctccaag ctggactgta tgcacgaacc ccccgttcag tccgaccgct gcgccttatc 3537tcgctccaag ctggactgta tgcacgaacc ccccgttcag tccgaccgct gcgccttatc 3537
cggtaactat cgtcttgagt ccaacccgga aagacatgca aaagcaccac tggcagcagc 3597cggtaactat cgtcttgagt ccaacccgga aagacatgca aaagcaccac tggcagcagc 3597
cactggtaat tgatttagag gagttagtct tgaagtcatg cgccggttaa ggctaaactg 3657cactggtaat tgatttagag gagttagtct tgaagtcatg cgccggttaa ggctaaactg 3657
aaaggacaag ttttggtgac tgcgctcctc caagccagtt acctcggttc aaagagttgg 3717aaaggacaag ttttggtgac tgcgctcctc caagccagtt acctcggttc aaagagttgg 3717
tagctcagag aaccttcgaa aaaccgccct gcaaggcggt tttttcgttt tcagagcaag 3777tagctcagag aaccttcgaa aaaccgccct gcaaggcggt tttttcgttt tcagagcaag 3777
agattacgcg cagaccaaaa cgatctcaag aagatcatct tattaatcag ataaaatatt 3837agattacgcg cagaccaaaa cgatctcaag aagatcatct tattaatcag ataaaatatt 3837
tctagatttc agtgcaattt atctcttcaa atgtagcacc tgaagtcagc cccatacgat 3897tctagatttc agtgcaattt atctcttcaa atgtagcacc tgaagtcagc cccatacgat 3897
ataagttgtt actagtgctt ggattctcac caataaaaaa cgcccggcgg caaccgagcg 3957ataagttgtt actagtgctt ggattctcac caataaaaaa cgcccggcgg caaccgagcg 3957
ttctgaacaa atccagatgg agttctgagg tcattactgg atctatcaac aggagtccaa 4017ttctgaacaa atccagatgg agttctgagg tcattactgg atctatcaac aggagtccaa 4017
gcgagctctc gaaccccaga gtcccgctca gaagaactcg tcaagaaggc gatagaaggc 4077gcgagctctc gaaccccaga gtcccgctca gaagaactcg tcaagaaggc gatagaaggc 4077
gatgcgctgc gaatcgggag cggcgatacc gtaaagcacg aggaagcggt cagcccattc 4137gatgcgctgc gaatcgggag cggcgatacc gtaaagcacg aggaagcggt cagcccattc 4137
gccgccaagc tcttcagcaa tatcacgggt agccaacgct atgtcctgat agcggtccgc 4197gccgccaagc tcttcagcaa tatcacgggt agccaacgct atgtcctgat agcggtccgc 4197
cacacccagc cggccacagt cgatgaatcc agaaaagcgg ccattttcca ccatgatatt 4257cacacccagc cggccacagt cgatgaatcc agaaaagcgg ccattttcca ccatgatatt 4257
cggcaagcag gcatcgccat gggtcacgac gagatcctcg ccgtcgggca tgcgcgcctt 4317cggcaagcag gcatcgccat gggtcacgac gagatcctcg ccgtcgggca tgcgcgcctt 4317
gagcctggcg aacagttcgg ctggcgcgag cccctgatgc tcttcgtcca gatcatcctg 4377gagcctggcg aacagttcgg ctggcgcgag cccctgatgc tcttcgtcca gatcatcctg 4377
atcgacaaga ccggcttcca tccgagtacg tgctcgctcg atgcgatgtt tcgcttggtg 4437atcgacaaga ccggcttcca tccgagtacg tgctcgctcg atgcgatgtt tcgcttggtg 4437
gtcgaatggg caggtagccg gatcaagcgt atgcagccgc cgcattgcat cagccatgat 4497gtcgaatggg caggtagccg gatcaagcgt atgcagccgc cgcattgcat cagccatgat 4497
ggatactttc tcggcaggag caaggtgaga tgacaggaga tcctgccccg gcacttcgcc 4557ggatactttc tcggcaggag caaggtgaga tgacaggaga tcctgccccg gcacttcgcc 4557
caatagcagc cagtcccttc ccgcttcagt gacaacgtcg agcacagctg cgcaaggaac 4617caatagcagc cagtcccttc ccgcttcagt gacaacgtcg agcacagctg cgcaaggaac 4617
gcccgtcgtg gccagccacg atagccgcgc tgcctcgtcc tgcagttcat tcagggcacc 4677gcccgtcgtg gccagccacg atagccgcgc tgcctcgtcc tgcagttcat tcagggcacc 4677
ggacaggtcg gtcttgacaa aaagaaccgg gcgcccctgc gctgacagcc ggaacacggc 4737ggacaggtcg gtcttgacaa aaagaaccgg gcgcccctgc gctgacagcc ggaacacggc 4737
ggcatcagag cagccgattg tctgttgtgc ccagtcatag ccgaatagcc tctccaccca 4797ggcatcagag cagccgattg tctgttgtgc ccagtcatag ccgaatagcc tctccaccca 4797
agcggccgga gaacctgcgt gcaatccatc ttgttcaatc atgcgaaacg atcctcatcc 4857agcggccgga gaacctgcgt gcaatccatc ttgttcaatc atgcgaaacg atcctcatcc 4857
tgtctcttga tcagatcttg atcccctgcg ccatcagatc cttggcggca agaaagccat 4917tgtctcttga tcagatcttg atcccctgcg ccatcagatc cttggcggca agaaagccat 4917
ccagtttact ttgcagggct tcccaacctt accagagggc gccccagctg gcaattcc 4975ccagtttact ttgcagggct tcccaacctt accagagggc gccccagctg gcaattcc 4975
<210> 14<210> 14
<211> 384<211> 384
<212> PRT<212> PRT
<213> Escherichia coli<213> Escherichia coli
<400> 14<400> 14
Met Ala Lys His Leu Phe Thr Ser Glu Ser Val Ser Glu Gly His ProMet Ala Lys His Leu Phe Thr Ser Glu Ser Val Ser Glu Gly His Pro
1 5 10 151 5 10 15
Asp Lys Ile Ala Asp Gln Ile Ser Asp Ala Val Leu Asp Ala Ile LeuAsp Lys Ile Ala Asp Gln Ile Ser Asp Ala Val Leu Asp Ala Ile Leu
20 25 3020 25 30
Glu Gln Asp Pro Lys Ala Arg Val Ala Cys Glu Thr Tyr Val Lys ThrGlu Gln Asp Pro Lys Ala Arg Val Ala Cys Glu Thr Tyr Val Lys Thr
35 40 4535 40 45
Gly Met Val Leu Val Gly Gly Glu Ile Thr Thr Ser Ala Trp Val AspGly Met Val Leu Val Gly Gly Glu Ile Thr Thr Ser Ala Trp Val Asp
50 55 6050 55 60
Ile Glu Glu Ile Thr Arg Asn Thr Val Arg Glu Ile Gly Tyr Val HisIle Glu Glu Ile Thr Arg Asn Thr Val Arg Glu Ile Gly Tyr Val His
65 70 75 8065 70 75 80
Ser Asp Met Gly Phe Asp Ala Asn Ser Cys Ala Val Leu Ser Ala IleSer Asp Met Gly Phe Asp Ala Asn Ser Cys Ala Val Leu Ser Ala Ile
85 90 9585 90 95
Gly Lys Gln Ser Pro Asp Ile Asn Gln Gly Val Asp Arg Ala Asp ProGly Lys Gln Ser Pro Asp Ile Asn Gln Gly Val Asp Arg Ala Asp Pro
100 105 110100 105 110
Leu Glu Gln Gly Ala Gly Asp Gln Gly Leu Met Phe Gly Tyr Ala ThrLeu Glu Gln Gly Ala Gly Asp Gln Gly Leu Met Phe Gly Tyr Ala Thr
115 120 125115 120 125
Asn Glu Thr Asp Val Leu Met Pro Ala Pro Ile Thr Tyr Ala His ArgAsn Glu Thr Asp Val Leu Met Pro Ala Pro Ile Thr Tyr Ala His Arg
130 135 140130 135 140
Leu Val Gln Arg Gln Ala Glu Val Arg Lys Asn Gly Thr Leu Pro TrpLeu Val Gln Arg Gln Ala Glu Val Arg Lys Asn Gly Thr Leu Pro Trp
145 150 155 160145 150 155 160
Leu Arg Pro Asp Ala Lys Ser Gln Val Thr Phe Gln Tyr Asp Asp GlyLeu Arg Pro Asp Ala Lys Ser Gln Val Thr Phe Gln Tyr Asp Asp Gly
165 170 175165 170 175
Lys Ile Val Gly Ile Asp Ala Val Val Leu Ser Thr Gln His Ser GluLys Ile Val Gly Ile Asp Ala Val Val Leu Ser Thr Gln His Ser Glu
180 185 190180 185 190
Glu Ile Asp Gln Lys Ser Leu Gln Glu Ala Val Met Glu Glu Ile IleGlu Ile Asp Gln Lys Ser Leu Gln Glu Ala Val Met Glu Glu Ile Ile
195 200 205195 200 205
Lys Pro Ile Leu Pro Ala Glu Trp Leu Thr Ser Ala Thr Lys Phe PheLys Pro Ile Leu Pro Ala Glu Trp Leu Thr Ser Ala Thr Lys Phe Phe
210 215 220210 215 220
Ile Asn Pro Thr Gly Arg Phe Val Ile Gly Gly Pro Met Gly Asp CysIle Asn Pro Thr Gly Arg Phe Val Ile Gly Gly Pro Met Gly Asp Cys
225 230 235 240225 230 235 240
Gly Leu Thr Gly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly Met AlaGly Leu Thr Gly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly Met Ala
245 250 255245 250 255
Arg His Gly Gly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val AspArg His Gly Gly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val Asp
260 265 270260 265 270
Arg Ser Ala Ala Tyr Ala Ala Arg Tyr Val Ala Lys Asn Ile Val AlaArg Ser Ala Ala Tyr Ala Ala Arg Tyr Val Ala Lys Asn Ile Val Ala
275 280 285275 280 285
Ala Gly Leu Ala Asp Arg Cys Glu Ile Gln Val Ser Tyr Ala Ile GlyAla Gly Leu Ala Asp Arg Cys Glu Ile Gln Val Ser Tyr Ala Ile Gly
290 295 300290 295 300
Val Ala Glu Pro Thr Ser Ile Met Val Glu Thr Phe Gly Thr Glu LysVal Ala Glu Pro Thr Ser Ile Met Val Glu Thr Phe Gly Thr Glu Lys
305 310 315 320305 310 315 320
Val Pro Ser Glu Gln Leu Thr Leu Leu Val Arg Glu Phe Phe Asp LeuVal Pro Ser Glu Gln Leu Thr Leu Leu Val Arg Glu Phe Phe Asp Leu
325 330 335325 330 335
Arg Pro Tyr Gly Leu Ile Gln Met Leu Asp Leu Leu His Pro Ile TyrArg Pro Tyr Gly Leu Ile Gln Met Leu Asp Leu Leu His Pro Ile Tyr
340 345 350340 345 350
Lys Glu Thr Ala Ala Tyr Gly His Phe Gly Arg Glu His Phe Pro TrpLys Glu Thr Ala Ala Tyr Gly His Phe Gly Arg Glu His Phe Pro Trp
355 360 365355 360 365
Glu Lys Thr Asp Lys Ala Gln Leu Leu Arg Asp Ala Ala Gly Leu LysGlu Lys Thr Asp Lys Ala Gln Leu Leu Arg Asp Ala Ala Gly Leu Lys
370 375 380370 375 380
<210> 15<210> 15
<211> 401<211> 401
<212> PRT<212> PRT
<213> Escherichia coli<213> Escherichia coli
<400> 15<400> 15
Met Asn Val Phe Asn Pro Ala Gln Phe Arg Ala Gln Phe Pro Ala LeuMet Asn Val Phe Asn Pro Ala Gln Phe Arg Ala Gln Phe Pro Ala Leu
1 5 10 151 5 10 15
Gln Asp Ala Gly Val Tyr Leu Asp Ser Ala Ala Thr Ala Leu Lys ProGln Asp Ala Gly Val Tyr Leu Asp Ser Ala Ala Thr Ala Leu Lys Pro
20 25 3020 25 30
Glu Ala Val Val Glu Ala Thr Gln Gln Phe Tyr Ser Leu Ser Ala GlyGlu Ala Val Val Glu Ala Thr Gln Gln Phe Tyr Ser Leu Ser Ala Gly
35 40 4535 40 45
Asn Val His Arg Ser Gln Phe Ala Glu Ala Gln Arg Leu Thr Ala ArgAsn Val His Arg Ser Gln Phe Ala Glu Ala Gln Arg Leu Thr Ala Arg
50 55 6050 55 60
Tyr Glu Ala Ala Arg Glu Lys Val Ala Gln Leu Leu Asn Ala Pro AspTyr Glu Ala Ala Arg Glu Lys Val Ala Gln Leu Leu Asn Ala Pro Asp
65 70 75 8065 70 75 80
Asp Lys Thr Ile Val Trp Thr Arg Gly Thr Thr Glu Ser Ile Asn MetAsp Lys Thr Ile Val Trp Thr Arg Gly Thr Thr Glu Ser Ile Asn Met
85 90 9585 90 95
Val Ala Gln Cys Tyr Ala Arg Pro Arg Leu Gln Pro Gly Asp Glu IleVal Ala Gln Cys Tyr Ala Arg Pro Arg Leu Gln Pro Gly Asp Glu Ile
100 105 110100 105 110
Ile Val Ser Val Ala Glu His His Ala Asn Leu Val Pro Trp Leu MetIle Val Ser Val Ala Glu His His Ala Asn Leu Val Pro Trp Leu Met
115 120 125115 120 125
Val Ala Gln Gln Thr Gly Ala Lys Val Val Lys Leu Pro Leu Asn AlaVal Ala Gln Gln Thr Gly Ala Lys Val Val Lys Leu Pro Leu Asn Ala
130 135 140130 135 140
Gln Arg Leu Pro Asp Val Asp Leu Leu Pro Glu Leu Ile Thr Pro ArgGln Arg Leu Pro Asp Val Asp Leu Leu Pro Glu Leu Ile Thr Pro Arg
145 150 155 160145 150 155 160
Ser Arg Ile Leu Ala Leu Gly Gln Met Ser Asn Val Thr Gly Gly CysSer Arg Ile Leu Ala Leu Gly Gln Met Ser Asn Val Thr Gly Gly Cys
165 170 175165 170 175
Pro Asp Leu Ala Arg Ala Ile Thr Phe Ala His Ser Ala Gly Met ValPro Asp Leu Ala Arg Ala Ile Thr Phe Ala His Ser Ala Gly Met Val
180 185 190180 185 190
Val Met Val Asp Gly Ala Gln Gly Ala Val His Phe Pro Ala Asp ValVal Met Val Asp Gly Ala Gln Gly Ala Val His Phe Pro Ala Asp Val
195 200 205195 200 205
Gln Gln Leu Asp Ile Asp Phe Tyr Ala Phe Ser Gly His Lys Leu TyrGln Gln Leu Asp Ile Asp Phe Tyr Ala Phe Ser Gly His Lys Leu Tyr
210 215 220210 215 220
Gly Pro Thr Gly Ile Gly Val Leu Tyr Gly Lys Ser Glu Leu Leu GluGly Pro Thr Gly Ile Gly Val Leu Tyr Gly Lys Ser Glu Leu Leu Glu
225 230 235 240225 230 235 240
Ala Met Ser Pro Trp Leu Gly Gly Gly Lys Met Val His Glu Val SerAla Met Ser Pro Trp Leu Gly Gly Gly Lys Met Val His Glu Val Ser
245 250 255245 250 255
Phe Asp Gly Phe Thr Thr Gln Ser Ala Pro Trp Lys Leu Glu Ala GlyPhe Asp Gly Phe Thr Thr Gln Ser Ala Pro Trp Lys Leu Glu Ala Gly
260 265 270260 265 270
Thr Pro Asn Val Ala Gly Val Ile Gly Leu Ser Ala Ala Leu Glu TrpThr Pro Asn Val Ala Gly Val Ile Gly Leu Ser Ala Ala Leu Glu Trp
275 280 285275 280 285
Leu Ala Asp Tyr Asp Ile Asn Gln Ala Glu Ser Trp Ser Arg Ser LeuLeu Ala Asp Tyr Asp Ile Asn Gln Ala Glu Ser Trp Ser Arg Ser Leu
290 295 300290 295 300
Ala Thr Leu Ala Glu Asp Ala Leu Ala Lys Arg Pro Gly Phe Arg SerAla Thr Leu Ala Glu Asp Ala Leu Ala Lys Arg Pro Gly Phe Arg Ser
305 310 315 320305 310 315 320
Phe Arg Cys Gln Asp Ser Ser Leu Leu Ala Phe Asp Phe Ala Gly ValPhe Arg Cys Gln Asp Ser Ser Leu Leu Ala Phe Asp Phe Ala Gly Val
325 330 335325 330 335
His His Ser Asp Met Val Thr Leu Leu Ala Glu Tyr Gly Ile Ala LeuHis His Ser Asp Met Val Thr Leu Leu Ala Glu Tyr Gly Ile Ala Leu
340 345 350340 345 350
Arg Ala Gly Gln His Cys Ala Gln Pro Leu Leu Ala Glu Leu Gly ValArg Ala Gly Gln His Cys Ala Gln Pro Leu Leu Ala Glu Leu Gly Val
355 360 365355 360 365
Thr Gly Thr Leu Arg Ala Ser Phe Ala Pro Tyr Asn Thr Lys Ser AspThr Gly Thr Leu Arg Ala Ser Phe Ala Pro Tyr Asn Thr Lys Ser Asp
370 375 380370 375 380
Val Asp Ala Leu Val Asn Ala Val Asp Arg Ala Leu Glu Leu Leu ValVal Asp Ala Leu Val Asn Ala Val Asp Arg Ala Leu Glu Leu Leu Val
385 390 395 400385 390 395 400
AspAsp
Claims (14)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE19806872A DE19806872A1 (en) | 1998-02-19 | 1998-02-19 | Production of biotin by expressing S-adenosyl-methionine synthase and second biotin synthesis gene in host cells |
DE19806872.7 | 1998-02-19 | ||
PCT/EP1999/001052 WO1999042591A1 (en) | 1998-02-19 | 1999-02-17 | Method for producing biotin |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20010041062A true KR20010041062A (en) | 2001-05-15 |
Family
ID=7858233
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020007009100A KR20010041062A (en) | 1998-02-19 | 1999-02-17 | Method for Producing Biotin |
Country Status (8)
Country | Link |
---|---|
EP (1) | EP1054977A1 (en) |
JP (1) | JP2002504338A (en) |
KR (1) | KR20010041062A (en) |
CN (1) | CN1210403C (en) |
CA (1) | CA2321264A1 (en) |
DE (1) | DE19806872A1 (en) |
IL (1) | IL137310A0 (en) |
WO (1) | WO1999042591A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101023788B1 (en) * | 2008-04-18 | 2011-03-21 | 주식회사 콧데 | Recombinant plasmid, biotin producing microorganism transformed by the recombinant plasmid and method for manufacturing biotin using the transformed biotin producing microorganism |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2014233711B2 (en) * | 2013-03-15 | 2020-05-28 | Monsanto Technology Llc | Compositions and Methods for the Production and Delivery of RNA |
CN107099497B (en) * | 2017-06-09 | 2020-08-11 | 浙江大学 | Plasmid and cell for promoting biotin synthesis and promoting method thereof |
CN112473700A (en) * | 2020-12-07 | 2021-03-12 | 南昌航空大学 | Preparation method and application of bismuth oxybromide/biochar composite visible-light-driven photocatalyst |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH09224690A (en) * | 1996-02-21 | 1997-09-02 | Shiseido Co Ltd | Production of biotin |
DK0806479T3 (en) * | 1996-05-06 | 2003-10-06 | Hoffmann La Roche | Production of biotin by fermentation |
DE19731274A1 (en) * | 1997-07-22 | 1999-01-28 | Basf Ag | Process for the production of biotin |
-
1998
- 1998-02-19 DE DE19806872A patent/DE19806872A1/en not_active Withdrawn
-
1999
- 1999-02-17 JP JP2000532531A patent/JP2002504338A/en not_active Withdrawn
- 1999-02-17 CN CNB998031534A patent/CN1210403C/en not_active Expired - Fee Related
- 1999-02-17 IL IL13731099A patent/IL137310A0/en unknown
- 1999-02-17 WO PCT/EP1999/001052 patent/WO1999042591A1/en not_active Application Discontinuation
- 1999-02-17 KR KR1020007009100A patent/KR20010041062A/en not_active Application Discontinuation
- 1999-02-17 EP EP99908923A patent/EP1054977A1/en not_active Withdrawn
- 1999-02-17 CA CA002321264A patent/CA2321264A1/en not_active Abandoned
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101023788B1 (en) * | 2008-04-18 | 2011-03-21 | 주식회사 콧데 | Recombinant plasmid, biotin producing microorganism transformed by the recombinant plasmid and method for manufacturing biotin using the transformed biotin producing microorganism |
Also Published As
Publication number | Publication date |
---|---|
CA2321264A1 (en) | 1999-08-26 |
CN1210403C (en) | 2005-07-13 |
CN1291232A (en) | 2001-04-11 |
EP1054977A1 (en) | 2000-11-29 |
JP2002504338A (en) | 2002-02-12 |
DE19806872A1 (en) | 1999-08-26 |
WO1999042591A1 (en) | 1999-08-26 |
IL137310A0 (en) | 2001-07-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4800341B2 (en) | Psod expression unit | |
WO2019147078A1 (en) | Corynebacterium sp. microorganism producing purine nucleotide and method for producing purine nucleotide by using same | |
EP2150559B1 (en) | Increased production of a target product via stabilization of mrna | |
CA3073180C (en) | Microorganism having increased glycine productivity and method for producing fermented composition using the same | |
Möckel et al. | Functional and structural analyses of threonine dehydratase from Corynebacterium glutamicum | |
AU2002245099A1 (en) | Rhodococcus cloning and expression vectors | |
Kusian et al. | Characterization of the duplicate ribulose-1, 5-bisphosphate carboxylase genes and cbb promoters of Alcaligenes eutrophus | |
KR20010041062A (en) | Method for Producing Biotin | |
Jendrossek et al. | Characterization of alcohol dehydrogenase genes of derepressible wild-type Alcaligenes eutrophus H16 and constitutive mutants | |
US6436681B1 (en) | Method for producing biotin | |
US6277609B1 (en) | Method to produce biotin | |
Freter et al. | Identification of a novel gene, aut, involved in autotrophic growth of Alcaligenes eutrophus | |
CN114787369B (en) | Process for producing sulfur-containing amino acid or derivative thereof | |
AU2002241272B2 (en) | Transformant producing PF1022 substance derivatives, process for producing the same and novel biosynthesis gene | |
WO2023106543A1 (en) | Corynebacterium glutamicum variant having improved l-lysine production ability and method for producing l-lysine by using same | |
WO2024019216A1 (en) | Microorganism of genus corynebacterium having improved ability to produce l-arginine or l-citrulline, and method for producing l-arginine or l-citrulline using same | |
JP4267715B2 (en) | Biotin biosynthesis gene II | |
WO2022231049A1 (en) | Corynebacterium glutamicum variant having improved l-lysine production ability and method for producing l-lysine by using same | |
WO2022196919A1 (en) | Corynebacterium glutamicum variant having improved l-citrulline production ability, and method for producing l-citrulline by using same | |
CN111676205A (en) | Microorganism with improved L-lysine productivity and method for producing L-lysine using the same | |
IE920681A1 (en) | A process for the terminal hydroxylation of ethyl groups on aromatic 5- or 6-membered heterocyclic compounds | |
CN115261295A (en) | Corynebacterium glutamicum mutant strain having improved L-lysine productivity and method for producing L-lysine using the same | |
KR20230053351A (en) | Mutant of Corynebacterium glutamicum with enhanced L-lysine productivity and method for preparing L-lysine using the same | |
US20240043886A1 (en) | Microorganism of the genus corynebacterium for producing l-amino acid and method for producing l-amino acid using the same | |
CN114787370A (en) | Process for producing sulfur-containing amino acid or derivative thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WITN | Application deemed withdrawn, e.g. because no request for examination was filed or no examination fee was paid |