KR20010041062A

KR20010041062A - Method for Producing Biotin

Info

Publication number: KR20010041062A
Application number: KR1020007009100A
Authority: KR
Inventors: 하트비크 슈뢰더
Original assignee: 스타르크, 카르크; 바스프 악티엔게젤샤프트
Priority date: 1998-02-19
Filing date: 1999-02-17
Publication date: 2001-05-15
Also published as: CA2321264A1; CN1210403C; CN1291232A; EP1054977A1; JP2002504338A; DE19806872A1; WO1999042591A1; IL137310A0

Abstract

본 발명은 서열 1을 갖는 S-아세노실메티오닌 합성효소 유전자, 및 서열 3, 서열 5 또는 서열 7을 갖는 비오틴 생합성 유전자인 bioS1, bioS2 및(또는) bioS3, 및 가능하게는 bioA, bioB, bioF, bioC, bioD, bioH, bioP, bioW, bioX, bioY 또는 bioR 군에서 선택된 하나 이상의 추가의 비오틴 합성 유전자 서열을 함유하는 유전자 재조합 생성물에 관한 것이다. 본 발명은 이 유전자 재조합 생성물을 함유하는 유기체 및 비오틴을 생성하기 위한 그의 용도, 및 비오틴을 생성하는 방법에 관한 것이다.The present invention relates to an S-acenosylmethionine synthetase gene having SEQ ID NO: 1, and a biotin biosynthetic gene having SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7 bioS1, bioS2 and / or bioS3, and possibly bioA, bioB, bioF, A recombinant product containing one or more additional biotin synthetic gene sequences selected from the group bioC, bioD, bioH, bioP, bioW, bioX, bioY or bioR. The present invention relates to an organism containing this recombinant product and its use for producing biotin, and a method for producing biotin.

Description

Method for Producing Biotin {Method for Producing Biotin}

보효소로서, 비오틴(비타민 H)은 효소-촉매작용 카르복실화 및 탈카르복실화 반응에서 필수적인 역할을 한다. 따라서 비오틴은 살아있는 세포에서 필수적인 인자이다. 거의 모든 동물 및 일부 미생물은 그들 자신이 비오틴을 합성할 수 없기 때문에 외부로부터 비오틴을 섭취하여야 한다. 그러므로, 비오틴은 이들 유기체에 있어서 필수적인 비타민이다. 반대로, 세균, 효모 및 식물은 그들 자신이 전구체로부터 비오틴을 합성할 수 있다(브라운(Brown) 등의 문헌[Biotechnol. Genet. Eng. Rev. 9, 1991: 295-326], 드몰(DeMoll, E.)의 문헌[Escherichia coli and Salmonella, eds. Neidhardt, F. C. et al. ASM Press, Washington DC, USA, 1996: 704-708, ISBN 1-55581-084-5]).As coenzymes, biotin (vitamin H) plays an essential role in enzyme-catalyzed carboxylation and decarboxylation reactions. Biotin is therefore an essential factor in living cells. Almost all animals and some microorganisms must take biotin from the outside because they cannot synthesize biotin themselves. Therefore, biotin is an essential vitamin for these organisms. In contrast, bacteria, yeasts and plants themselves can synthesize biotin from precursors (Brown et al. Biotechnol. Genet. Eng. Rev. 9, 1991: 295-326, DeMoll, E (Escherichia coli and Salmonella, eds. Neidhardt, FC et al. ASM Press, Washington DC, USA, 1996: 704-708, ISBN 1-55581-084-5).

비오틴의 합성은 세균 유기체, 특히 그람-양성 세균인 에세리히아 콜리(Escherichia coli) 및 그람-양성 세균인 바실루스 스파에리쿠스(Bacillus sphaericus)에서 조사되어 왔다(브라운 등의 문헌[Biotechnol. Genet. Eng. Rev. 9, 1991: 295-326]). 지방산 합성으로부터 유도된 피멜릴-CoA(PmCoA)는 이미 이 콜리(E. coli)에서 최초로 공지된 중간체로서 여겨져 왔다(드몰의 문헌[Escherichia coli and Salmonella, eds. Neidhardt, F. C. et al. ASM Press, Washington DC, USA, 1996: 704-708, ISBN 1-55581-084-5 1996]). 현재까지, 이 비오틴 전구체가 이 콜리에서 합성되는 경로는 거의 알려져 있지 않다(레모인(Lemoine) 등의 문헌[Mol. Microbiol. 19, 1996: 645-647]). bioC 및 bioH는 상응하는 단백질이 Pm-CoA의 합성을 담당하는 두 유전자인 것으로 확인되었다. 유전자 생성물, 즉 BioH 및 BioC의 효소 기능은 지금까지 알려져 있지 않다(레모인 등의 문헌[Mol. Microbiol. 19, 1996: 645-647], 드몰의 문헌[Escherichia coli and Salmonella, eds. Neidhardt, F. C. et al. ASM Press, Washington DC, USA, 1996: 704-708, ISBN 1-55581-084-5]). Pm-CoA는 4개의 추가의 효소 단계에서 비오틴으로 전환된다. BioF는 우선 Pm-CoA를 알라닌과 축합하여 7-케토-8-아미노펠라르곤산(KAPA)을 형성한다. 그 다음, KAPA는 BioA에 의해 7,8-디아미노펠라르곤산(DAPA)으로 전환된다. ATP-의존성 카르복실화 반응 후, 다음 단계에 의해 디티오비오틴(DTB)으로 되며 이 단계는 BioD에 의해 촉진된다. DTB는 마지막 단계에서 비오틴으로 전환된다. 이 단계는 BioB에 의해 촉진된다. DTB의 비오틴으로의 전환에 관련된 화학적 및 효소적 메카니즘은 지금까지 불완전하게만 이해되었고 해명되었다.The synthesis of biotin has been investigated in bacterial organisms, especially Gram-positive bacteria Escherichia coli and Gram-positive bacteria Bacillus sphaericus (Brown et al. Biotechnol. Genet. Eng. Rev. 9, 1991: 295-326]. Pimelyl-CoA (PmCoA) derived from fatty acid synthesis has already been considered as the first known intermediate in E. coli (Escherichia coli and Salmonella, eds. Neidhardt, FC et al. ASM Press, Washington DC, USA, 1996: 704-708, ISBN 1-55581-084-5 1996]. To date, little is known about how this biotin precursor is synthesized in this collie (Lemoine et al., Mol. Microbiol. 19, 1996: 645-647). bioC and bioH have been identified that the corresponding proteins are two genes responsible for the synthesis of Pm-CoA. The enzymatic function of the gene product, namely BioH and BioC, is not known so far (Remoin et al., Mol. Microbiol. 19, 1996: 645-647, Dsmol et al., Esccherichia coli and Salmonella, eds. Neidhardt, FC et al. ASM Press, Washington DC, USA, 1996: 704-708, ISBN 1-55581-084-5]. Pm-CoA is converted to biotin in four additional enzymatic steps. BioF first condenses Pm-CoA with alanine to form 7-keto-8-aminofellargonic acid (KAPA). KAPA is then converted to 7,8-diaminofellargonic acid (DAPA) by BioA. After the ATP-dependent carboxylation reaction, dithiobiotin (DTB) is obtained by the next step, which is promoted by BioD. DTB is converted to biotin in the last step. This step is facilitated by BioB. The chemical and enzymatic mechanisms involved in the conversion of DTB to biotin have so far been incompletely understood and elucidated.

DTB의 비오틴으로의 전환은 지금까지 세균 및 식물 세포 추출물에서 특징화되었을 뿐이었다(WO94/8023호, EP-B-0 449 724호, 사냘(Sanyal) 등의 문헌[Arch. Biochem. Biophys., Vol. 326, No. 1, 1996: 48-56] 및 문헌[Biochemistry 33, 1994: 3625-3631], 발데트(Baldet) 등의 문헌[Europ. J. Biochem. 217, 1, 1993: 479-485], 메장(Mejean) 등의 문헌[Biochem. Biophys. Res. Commun., Vol. 217, No. 3, 1995: 1231-1237], 오시로(Ohshiro) 등의 문헌[Biosci. Biotechnol. Biochem., 58, 9, 1994: 1738-1741]).The conversion of DTB to biotin has only been so far characterized in bacterial and plant cell extracts (WO 94/8023, EP-B-0 449 724, Sanal et al., Arch. Biochem. Biophys., Vol. 326, No. 1, 1996: 48-56 and Biochemistry 33, 1994: 3625-3631, Bald et al., Europ. J. Biochem. 217, 1, 1993: 479-485. , Mejean et al., Biochem. Biophys. Res. Commun., Vol. 217, No. 3, 1995: 1231-1237, Ohshiro et al. Biosci. Biotechnol. Biochem., 58, 9, 1994: 1738-1741].

시험관내 연구 결과, NADPH, 시스테인, 티아민, Fe²⁺, 아스파라긴, 세린, 프럭토즈 1,6-비스포스페이트 및 S-아데노실메티오닌과 같은 저분자량의 인자는 비오틴의 합성을 모의할 수 있음이 밝혀졌다(오시로 등의 문헌[Biosci. Biotechnol. Biochem., 58, 9, 1994: 1738-1741], 버치(Birch) 등의 문헌[J. Biol. Chem. 270, 32, 1995: 19158-19165], 이푸쿠(Ifuku) 등의 문헌[Biosci. Biotechnol. Biochem., 59, 2, 1995; 185-189], 사냘 등의 문헌[Arch. Biochem. Biophys., 326, 1, 1996: 48-56]).In vitro studies have shown that low molecular weight factors such as NADPH, cysteine, thiamine, Fe ²⁺ , asparagine, serine, fructose 1,6-bisphosphate, and S-adenosylmethionine can simulate the synthesis of biotin (Osiro et al. Biosci. Biotechnol. Biochem., 58, 9, 1994: 1738-1741, Birch et al., J. Biol. Chem. 270, 32, 1995: 19158-19165). , Ifuku et al., Biosci. Biotechnol. Biochem., 59, 2, 1995; 185-189, Sarsham et al. Arch. Biochem. Biophys., 326, 1, 1996: 48-56; ).

상기 저분자량의 인자들 외에, BioB의 존재하에 DTB로부터의 비오틴의 합성을 모의하는 다른 단백질이 확인되었다. 이들 단백질은 플라보독신 및 플라보독신 NADPH 환원효소이다(버치 등의 문헌[J. Biol. Chem. 270, 32, 1995: 19158-19165], 이푸쿠 등의 문헌[Biosci. Biotechnol. Biochem., 59, 2, 1995; 185-189], 사냘 등의 문헌[Arch. Biochem. Biophys., 326, 1, 1996: 48-56]). 비오틴 합성을 모의하는 그밖의 단백질은 유전자 bioS1 및 bioS2이며, 이들은 독일 특허출원 제197.31274.8호(우선권 22.7.97)에 기술되어 있다.In addition to these low molecular weight factors, other proteins have been identified that simulate the synthesis of biotin from DTB in the presence of BioB. These proteins are flavodoxin and flavodoxin NADPH reductase (Burch et al., J. Biol. Chem. 270, 32, 1995: 19158-19165, Ipuku et al. Biosci. Biotechnol. Biochem., 59, 2, 1995; 185-189, Sarsham et al., Arch. Biochem. Biophys., 326, 1, 1996: 48-56). Other proteins that simulate biotin synthesis are the genes bioS1 and bioS2, which are described in German patent application No. 197.31274.8 (Priority 22.7.97).

비오틴 분자내의 황의 유래에 관하여 상이한 결과가 얻어졌다. 완전 세포 추출물에서의 비오틴 합성에 대한 조사 결과,³⁵S-표지된 시스테인의 존재하에 비오틴에 방사능이 도입되었음이 나타났고,³⁵S-표지된 메티오닌 또는 S-아데노실메티오닌이 사용될 때는 비오틴 분자내로의 황의 도입을 증명하기가 불가능하였다(이푸쿠 등의 문헌[Biosci. Biotechnol. Biochem., 59, 2, 1995; 184-189], 버치 등의 문헌[J. Biol. Chem. 270, 32, 1995: 19158-19165]).Different results have been obtained regarding the derivation of sulfur in the biotin molecule. Investigations into biotin synthesis in whole cell extracts showed that radioactivity was introduced into biotin in the presence of ³⁵ S-labeled cysteine and into the biotin molecule when ³⁵ S-labeled methionine or S-adenosylmethionine was used. It was not possible to prove the introduction of sulfur (Ipuku et al., Biosci. Biotechnol. Biochem., 59, 2, 1995; 184-189), Birch et al., J. Biol. Chem. 270, 32, 1995: 19158-19165].

기술된 단백질, 즉 bioF, bioA, bioD 및 bioB를 암호화하는 유전자는 이 콜리의 양방향 오페론에서 암호화된다. 이 오페론은 λ결합 부위와 이 콜리 염색체에서 약 17미니트(minute)에 있는 uvrB 유전자 좌위 사이에 위치한다(벌린(Berlyn) 등의 문헌[1996: 1715-1902]). 추가의 두 유전자(이중 하나, 즉 bioC는 이미 Pm-CoA의 합성에서 기술된 기능을 가짐)가 이 오페론에서 추가로 암호화되지만, 지금까지 bioA의 하류에 위치하는 개방 판독틀에 어떠한 기능도 부여할 수 없었다(WO94/8023호, 오쓰카(Otsuka) 등의 문헌[J. Biol. Chem. 263, 1988: 19577-85]). 이 콜리 단백질 BioF, BioA, BioD 및 BioB에 대해 매우 보존된 상동물은 비 스파에리쿠스(B. sphaericus), 비 서브틸리스(B. subtilis), 시네코시스티스 종(Syneccocystis sp.)(브라운 등의 문헌[Biotechnol. Genet. Eng. Rev. 9, 1991: 295-326], 바우어(Bower) 등의 문헌[J. Bacteriol. 175, 1996: 4122-4130], 가네코(Kaneko) 등의 문헌[DNA Res. 3, 3, 1996: 109-136]), 메타노코쿠스 야나시(Methanococcus janaschi)와 같은 원시세균 및 사카로미세스 세레비지아에(Saccharomyces cerevisiae)와 같은 효모(쟝(Zhang) 등의 문헌[Arch. Boichem. Biophys. 309, 1, 1994: 29-35])에서 또는 아라비돕시스 탈리아나(Arabidopsis thaliana)(발데트 등의 문헌[C. R. Acad. Sci. III, Sci. Vie. 319, 2, 1996: 99-106])에서 발견되었다.The genes encoding the described proteins, namely bioF, bioA, bioD and bioB, are encoded in this collie's bidirectional operon. This operon is located between the λ binding site and the uvrB locus at about 17 minutes on this Coli chromosome (Berlyn et al. (1996: 1715-1902)). Two additional genes (one of which bioC already has the function described in the synthesis of Pm-CoA) are additionally encoded in this operon, but so far will not impart any function to the open reading frame downstream of bioA. (WO 94/8023, Otsuka et al., J. Biol. Chem. 263, 1988: 19577-85). Highly conserved moorings for this collie protein BioF, BioA, BioD and BioB include B. sphaericus, B. subtilis, Syneccocystis sp. (Brown et al. Biotechnol. Genet. Eng. Rev. 9, 1991: 295-326, Bower et al., J. Bacteriol. 175, 1996: 4122-4130, Kaneko et al., DNA Res. 3, 3, 1996: 109-136), primitive bacteria such as Methanococcus janaschi and yeast such as Saccharomyces cerevisiae; Arch. Boichem. Biophys. 309, 1, 1994: 29-35) or Arabidopsis thaliana (Valette et al., CR Acad. Sci. III, Sci. Vie. 319, 2, 1996). : 99-106).

지금까지 조사된 두 그람-양성 미생물에서, Pm-CoA의 합성은 이 콜리에서와는 다른 방식으로 진행되는 듯하다. bioH 및 bioC의 어떠한 상동물도 찾을 수 없었다(브라운 등의 문헌[Biotechnol. Genet. Eng. Rev. 9. 1991: 295-326]).In the two Gram-positive microorganisms examined so far, the synthesis of Pm-CoA seems to proceed in a different way than in this collie. No mocha of bioH and bioC was found (Brown et al. Biotechnol. Genet. Eng. Rev. 9. 1991: 295-326).

비오틴은 3개의 키랄성 중심을 갖는 광학 활성 물질이다. 비오틴은 지금까지 값비싼 다단계 화학 합성에 의해 경제적으로 제조되어 왔을 뿐이었다.Biotin is an optically active substance with three chiral centers. Biotin has only been produced economically by expensive multi-step chemical synthesis.

이러한 화학 합성의 대안으로서, 미생물을 사용하여 비오틴을 제조하기 위한 발효 공정을 구성하려는 많은 시도가 이루어졌다. 비오틴 오페론을 다수카피성(multi-copy) 플라스미드에 클로닝하는 방법이 성공적으로 사용되어 이러한 유전자로 형질전환된 미생물에서 비오틴 합성을 증대시켰다. 비오틴 합성에서의 증가는 또한 birA 돌연변이체를 선택하여 비오틴 유전자 발현을 조절 해제함으로써 이루어졌다(파이(Pai C. H.)의 문헌[J. Bacteriol. 112, 1972: 1280-1287]). 두 시도를 조합하면, 즉 조절-결함 균주(EP-B-0 236 429호)에서 플라스미드-암호화된 생합성을 발현하면, 생산성이 더욱 더 증가되었다. 이와 관련하여, 비오틴 오페론은 그의 천연의 양방향 프로모터의 통제하에 남아 있거나(EP-B-0 235 429호) 또는 그의 유전자는 외부에서 조절할 수 있는 프로모터의 통제하에 있을 수 있다(WO94/08023호).As an alternative to such chemical synthesis, many attempts have been made to construct fermentation processes for producing biotin using microorganisms. Cloning of biotin operons into multi-copy plasmids has been used successfully to enhance biotin synthesis in microorganisms transformed with these genes. The increase in biotin synthesis was also made by deselecting birA mutants to deregulate biotin gene expression (Pai C. H., J. Bacteriol. 112, 1972: 1280-1287). Combining the two trials, ie, expressing plasmid-encoded biosynthesis in a regulatory-defective strain (EP-B-0 236 429), increased productivity even further. In this regard, the biotin operon may remain under the control of its natural bidirectional promoter (EP-B-0 235 429) or its gene may be under the control of an externally regulated promoter (WO94 / 08023).

지금까지 이 콜리에서 비오틴을 발효에 의해 생성하기 위하여 수행되었던 시도는 경제적으로 적당한 생산성을 얻지 못했다.The attempts made so far to produce biotin by fermentation in this collie have not achieved economically adequate productivity.

본 발명은 서열 1을 갖는 S-아데노실메티오닌 합성효소 유전자 및 각각 서열 3, 서열 5 및 서열 7을 갖는 비오틴 생합성 유전자 bioS1, bioS2 및(또는) bioS3, 및 경우에 따라, bioA, bioB, bioF, bioC, bioD, bioH, bioP, bioW, bioX, bioY 및 bioR 군에서 선택된 하나 이상의 추가의 비오틴 합성 유전자 서열을 함유하는 유전자 구조물에 관한 것이다. 본 발명은 또한 이 유전자 구조물을 함유하는 유기체 및 비오틴을 제조하기 위한 유전자 구조물의 용도, 및 비오틴의 제조방법에 관한 것이다.The present invention provides an S-adenosylmethionine synthase gene having SEQ ID NO: 1 and a biotin biosynthetic gene bioS1, bioS2 and / or bioS3 having SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7, and optionally bioA, bioB, bioF, A genetic construct containing one or more additional biotin synthetic gene sequences selected from the group bioC, bioD, bioH, bioP, bioW, bioX, bioY and bioR. The present invention also relates to the use of the gene constructs for producing organisms and biotin containing the gene constructs, and to methods of producing biotin.

도 1은 벡터 pHS1에 SAM 합성효소를 암호화하는 metK 유전자를 클로닝하여 얻은 pHS1 metK 유전자 구조물을 나타낸다.Figure 1 shows the pHS1 metK gene construct obtained by cloning the metK gene encoding SAM synthase in vector pHS1.

도 2는 플라스미드 pHS1 bioS1에 metK 유전자를 클로닝하여 얻은 pHS1 metK bioS1 유전자 구조물을 나타낸다.Figure 2 shows the pHS1 metK bioS1 gene construct obtained by cloning the metK gene in plasmid pHS1 bioS1.

본 발명의 목적은 가능한 많이 비오틴을 합성하는, 비오틴을 제조하기 위한 공업적 발효 공정을 개발하는 것이다.It is an object of the present invention to develop an industrial fermentation process for producing biotin, which synthesizes biotin as much as possible.

본 발명자들은 이 목적이 서열 1을 갖는 S-아데노실메티오닌 합성효소(SAM 합성효소) 유전자 및 서열 3, 서열 5 및 서열 7을 갖는 하나 이상의 추가의 비오틴 생합성 유전자 bioS1, bioS2 또는 bioS3, 및 이들의 기능적 변이체, 유사체 또는 유도체를 비오틴을 합성할 수 있는 원핵 또는 진핵 숙주 유기체에서 발현시키고, 이 유기체를 배양하고, 균체를 분리 제거하거나 또는 비오틴을 정제한 후 합성된 비오틴을 바로 사용하는, 비오틴을 생성하기 위한 본 발명에 따른 방법에 의해 이루어짐을 발견하였다.The inventors have found that for this purpose the S-adenosylmethionine synthase (SAM synthetase) gene having SEQ ID NO: 1 and one or more additional biotin biosynthetic genes bioS1, bioS2 or bioS3 having SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7, and their Functional variants, analogs, or derivatives are expressed in prokaryotic or eukaryotic host organisms capable of synthesizing biotin, culturing the organism, isolating cells or purifying biotin, and immediately using the synthesized biotin to produce biotin. It has been found to be made by the method according to the invention for

본 발명에 따른 방법에 사용된 유전자, 즉 서열 1을 갖는 SAM 합성효소 유전자 및 서열 3, 서열 5 및 서열 7을 갖는 비오틴 생합성 유전자 bioS1, bioS2 및 bioS3은 스위스프로트-데이타베이스(SwissProt-database)에 수탁번호 P04384(metK), U29581(bioS1), P39171(bioS2) 및 D90811(bioS3)로 보관되어 있다. 이 콜리 MetK의 다수의 상동물이 데이타베이스에 기재되어 있다. 이들 상동물은 기타 유박테리아(eubacteia)(예컨대, 에이치 인플루엔자에(H. influenzae) 및 비 서브틸리스) 및 진핵생물(예컨대, 효모: 에스 세레비지아에, 식물: 피 델토이데스(P. deltoides), 절지동물: 디 멜라노가스테르(D. melanogaster) 및 포유류: 알 노르베기쿠스(R. norvegicus))과 같은 유기체를 포함한다.The genes used in the method according to the invention, namely the SAM synthetase gene having SEQ ID NO: 1 and the biotin biosynthetic genes bioS1, bioS2 and bioS3 having SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7, were added to the SwissProt-database. Accession No. P04384 (metK), U29581 (bioS1), P39171 (bioS2) and D90811 (bioS3). Many moieties of this Coli MetK are described in the database. These moters include other eubacteia (e.g. H. influenzae and non-subtilis) and eukaryotes (e.g. yeast: E. cerevisiae, plants: P. deltoides ), Arthropods: D. melanogaster and mammals: R. norvegicus).

비오틴 생합성의 생산성은 서열 1을 갖는 하나 이상의 SAM 합성효소 유전자 및 그의 기능적 변이체, 유사체 또는 유도체를 서열 3, 서열 5 및 서열 7을 갖는 하나 이상의 비오틴 합성 유전자 bioS1, bioS2 또는 bioS3, 및 이들의 기능 변이체, 유사체 또는 유도체와 함께 원핵 또는 진핵 숙주 유기체에서 발현시킴으로써 현저하게 증가시킬 수 있다. 바람직하게는 SAM 합성효소 유전자와 bioS1의 조합이 발현에 사용된다. 비오틴 합성을 더욱 더 증가시키기 위하여, bioA, bioB, bioF, bioC, bioD, bioH, bioP, bioW, bioX, bioY 및 bioR 군에서 선택된 하나 이상의 추가의 비오틴 유전자를 유리하게는 동시에 발현시킨다. 이들 유전자의 발현은 이들 유전자가 없는 대조군에 비하여 2배 이상, 바람직하게는 3배보다 많이 비오틴 합성을 증대시킨다.The productivity of biotin biosynthesis may include one or more SAM synthetase genes having SEQ ID NO: 1 and functional variants, analogs or derivatives thereof, comprising one or more biotin synthesis genes bioS1, bioS2 or bioS3 having SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7, and functional variants thereof. , By expression in prokaryotic or eukaryotic host organisms together with analogs or derivatives. Preferably a combination of SAM synthase gene and bioS1 is used for expression. To further increase biotin synthesis, one or more additional biotin genes selected from the group bioA, bioB, bioF, bioC, bioD, bioH, bioP, bioW, bioX, bioY and bioR are advantageously expressed simultaneously. Expression of these genes enhances biotin synthesis more than two times, preferably more than three times, compared to the control without these genes.

본 발명에 따른 방법에 사용된 유전자, 즉 뉴클레오티드 서열 1을 갖는 SAM 합성효소 유전자, 뉴클레오티드 서열 3을 갖는 bioS1 유전자, 뉴클레오티드 서열 5를 갖는 bioS2 유전자 및 뉴클레오티드 서열 7을 갖는 bioS3 유전자(이들 서열은 각각 서열 2, 서열 4, 서열 6 및 서열 8에 제공된 아미노산 서열을 암호화함), 또는 이들의 대립유전자 변이체를 하기의 단리 및 서열결정 후에 얻을 수 있다. 변이체는 각각 아미노산 수준에서 30 내지 100% 상동성, 바람직하게는 50 내지 100% 상동성, 매우 특히 바람직하게는 80 내지 100% 상동성을 나타내는 서열 1, 서열 3, 서열 5 및 서열 7인 것으로 이해되어야 한다. 대립유전자 변이체는 특히 서열 1, 서열 3, 서열 5 및 서열 7에 기술된 서열로부터 뉴클레오티드의 결실, 삽입 또는 치환에 의해 얻어질 수 있는 기능 변이체(그러나, 효소 활성은 유지됨)를 포함한다.Genes used in the method according to the invention, ie SAM synthase gene having nucleotide sequence 1, bioS1 gene having nucleotide sequence 3, bioS2 gene having nucleotide sequence 5 and bioS3 gene having nucleotide sequence 7 (these sequences are respectively 2, SEQ ID NO: 4, SEQ ID NO: 6 and encoding the amino acid sequences provided in SEQ ID NO: 8), or allelic variants thereof can be obtained after isolation and sequencing below. It is understood that the variants are SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7, which each exhibit 30 to 100% homology, preferably 50 to 100% homology, very particularly preferably 80 to 100% homology at the amino acid level Should be. Allelic variants include, in particular, functional variants obtainable by deletion, insertion or substitution of nucleotides from the sequences set forth in SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7, however, enzymatic activity is maintained.

또한, 변이체는 O-아세틸세린 술포히드롤라제 A, O-아세틸세린 술포히드롤라제 B, β-시스타티오나제(플린트(Flint) 등의 문헌[J. Biol. Chem., Vol. 271, 1996: 16053-16067] 참조)와 같은 유전자의 기능적 등가물 또는 비오틴의 합성에서 bioS1, bioS2 또는 bioS3의 효소 활성을 나타낼 수 있는, 예를 들어 클렙시엘라(Klebsiella), 칸디다(Candida), 효모 또는 카에노르합디티스(Caenorhabditis)로부터의 nifS 및 그의 원핵 및 진핵 상동물인 것으로 이해되어야 한다.In addition, the variants include O-acetylserine sulfohydrolase A, O-acetylserine sulfohydrolase B, β-cystionase (Flint et al., J. Biol. Chem., Vol. 271, 1996: 16053-16067), which may exhibit the enzymatic activity of bioS1, bioS2 or bioS3 in the synthesis of biotin or functional equivalents of biotin, for example Klebsiella, Candida, yeast or carcass It is to be understood that nifS from Caenorhabditis and its prokaryotic and eukaryotic murines.

서열 1, 서열 3, 서열 5 및 서열 7의 기능적 유사체는, 예를 들어 세균, 진균, 식물, 동물 또는 인간 상동물과 같은 이들의 원핵 또는 진핵 상동물인 것으로 이해되어야 한다. 또한, 유사체는 절단된 서열이나 또는 암호화 및 비암호화 DNA 서열로부터의 1본쇄 DNA 또는 RNA인 것으로 이해되어야 한다.Functional analogs of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7 are to be understood to be their prokaryotic or eukaryotic moieties, such as, for example, bacteria, fungi, plants, animals or human moron. It is also to be understood that the analog is single stranded DNA or RNA from a truncated sequence or from encoded and unencoded DNA sequences.

유도체는, 예를 들어 프로모터 변이체인 것으로 이해되어야 한다. 주어진 뉴클레오티드 서열의 상류에 위치하는 프로모터는 프로모터의 작용성 또는 활성이 손상됨이 없이 하나 이상의 뉴클레오티드 치환에 의해 또는 삽입(들) 및(또는) 결실(들)에 의해 변할 수 있다. 또한, 프로모터의 활성은 이들의 서열을 변화시킴으로써 증가시킬 수 있거나, 프로모터는 상이한 종의 유기체의 프로모터를 포함하여 더 활성있는 프로모터에 의해 완전히 대체될 수 있다.Derivatives are to be understood, for example, as promoter variants. A promoter located upstream of a given nucleotide sequence may be altered by one or more nucleotide substitutions or by insertion (s) and / or deletion (s) without impairing the functionality or activity of the promoter. In addition, the activity of promoters can be increased by changing their sequences, or promoters can be completely replaced by more active promoters, including those of different species of organisms.

유도체는 또한 유전자의 발현 및(또는) 단백질의 발현이 증가되도록 그의 뉴클레오티드 서열이 개시 코돈의 -1 내지 -30 상류의 영역에서 변화된 변이체로서 이해되어야 한다.Derivatives are also to be understood as variants whose nucleotide sequence has been changed in the region of -1 to -30 upstream of the initiation codon so that expression of the gene and / or protein is increased.

비오틴을 합성하는 모든 그람-음성 또는 그람-양성 세균은 주로 본 발명에 따른 방법에서 원핵 숙주 유기체로서 사용하기에 적합하다. 예로서 언급될 수 있는 그람-음성 세균은 엔테로박테리아세아에(Enterobacteriaceae)(예: 에세리히아, 아에로박터(Aerobacter), 엔테로박터(Enterobacter), 시트로박터(Citrobacter), 시겔라(Shigella), 클렙시엘라, 세라티아(Serratia), 에르위니아(Erwinia) 또는 살모넬라(Salmonella) 속), 슈도모나다세아에(Pseudomonadaceae)(예: 슈도모나스(Pseudomonas), 크산토모나스(Xanthomonas), 부르크홀데리아(Burkholderia), 글루코노박터(Gluconobacter), 니트로소모나스(Nitrosomonas), 니트로박터(Nitrobacter), 메타노모나스(Methanomonas), 코마모나스(Comamonas), 셀룰로모나스(Cellulomonas) 또는 아세토박터(Acetobacter) 속), 아조토박테라세아에(Azotobacteraceae)(예: 아조토박터(Azotobacter), 아조모나스(Azomonas), 베이예린키아(Beijerinckia) 또는 데륵시아(Derxia) 속), 네이세리아세아에(Neisseriaceae)(예: 모락셀라(Moraxella), 아시네토박터(Acinetobacter), 킨겔라(Kingella), 네이세리아(Neisseria) 또는 브란하멜라(Branhamella) 속), 리조비아세아에(Rhizobiaceae)(예: 리조비움(Rhizobium) 또는 아그로박테리움(Agrobacterium) 속), 또는 그람-음성 속인 지모모나스(Zymomonas), 크로모박테리움(Chromobacterium) 또는 플라보박테리움(Flavobacterium) 속이다. 예로서 언급될 수 있는 그람-양성 세균은 내생포자-형성 그람-양성 통기성 또는 혐기성 세균(예: 바실루스, 스포로락토바실루스(Sporolactobacillus) 또는 클로스트리디움(Clostridium) 속), 코리네형 세균(예: 아르트로박터(Arthrobacter), 셀룰로모나스, 쿠르토박테리움(Curtobacterium), 코리네박테리움(Corynebacterium), 브레비박테리움(Brevibacterium), 미크로박테리움(Microbacterium), 쿠르티아(Kurthia) 속), 악티노미세탈레스(Actinomycetales)(예: 미코박테리움(Mycobacterium), 로도코쿠스(Rhodococcus), 스트렙토미세스(Streptomyces) 또는 노카르디아(Nocardia) 속), 락토바실라세아에(Lactobacillaceae)(예: 락토바실루스(Lactobacillus) 또는 락토코쿠스(Lactococcus) 속), 또는 그람-양성 구균(예: 미크로코쿠스(Micrococcus) 또는 스타필로코쿠스(Staphylococcus) 속)이다.All Gram-negative or Gram-positive bacteria that synthesize biotin are suitable for use primarily as prokaryotic host organisms in the method according to the invention. Gram-negative bacteria that may be mentioned by way of example include Enterobacteriaceae (e.g., Escherichia, Aerobacter, Enterobacter, Citrobacter, Shigella). ), Klebsiella, Serratia, Erwinia or Salmonella, Pseudomonadaceae (e.g. Pseudomonas, Xanthomonas, Burgunderia) (Burkholderia), Gluconobacter, Nitrosomonas, Nitrobacter, Methanomonas, Comamonas, Cellulomonas, or Acetobacter ), Azotobacteraceae (e.g., Azotobacter, Azomonas, Beijerinckia or Derxia), Neisseriaceae (e.g. : Moraxella, Acinetobacter obacter, Kingella, Neisseria or Branhamella genus, Rhizobiaceae (e.g., Rhizobium or Agrobacterium genus), or Gram-negative genus Zymomonas, Chromobacterium or Flavoacterium. Gram-positive bacteria that may be mentioned by way of example are endospores-forming Gram-positive breathable or anaerobic bacteria (e.g. Bacillus, Sporolactobacilli or Clostridium genus), coryneform bacteria (e.g. Arthrobacter, Cellulomonas, Curtobacterium, Corynebacterium, Brevibacterium, Microbacterium, Kurthia genus), Actinomycetales (e.g. Mycobacterium, Rhodococcus, Streptomyces or Nocardia genus), Lactobacillaceae (e.g. Lactobacillus or Lactococcus), or Gram-positive cocci (eg, Micrococcus or Staphylococcus).

본 발명에 따른 방법에 에세리히아, 시트로박터, 세라티아, 클렙시엘라, 살모넬라, 슈도모나스, 코마모나스, 아시네토박터, 아조토박터, 크로모박테리움, 바실루스, 클로스트리디움, 아르트로박터, 코리네박테리움, 브레비박테리움, 락토코쿠스, 락토바실루스, 스트렙토미세스, 리조비움, 아그로박테리움 또는 스타필로코쿠스 속의 세균을 사용하는 것이 바람직하다. 에세리히아 콜리, 시트로박터 프레운디(Citobacter freundii), 세라티아 마르세센스(Serratia marcescens), 살모넬라 타이피무리움(Salmonella typhimurium), 슈도모나스 멘도시나(Pseudomonas mendocina), 슈도모나스 아에루기노사(Pseudomonas aeruginosa), 슈도모나스 무타빌리스(Pseudomonas mutabilis), 슈도모나스 클로로라피스(Pseudomonas chlororaphis), 슈도모나스 플루오레센스(Pseudomonas fluorescens), 코마모나스 아시도보란스(Comamonas acidovorans), 코마모나스 테스토스테로니(Comamonas testosteroni), 아시네토박터 칼코아세티쿠스(Acinetobacter calcoaceticus), 아조토박터 비넬란디(Azotobacter vinelandii), 크로모박테리움 비올라세움(Chromobacterium violaceum), 바실루스 서브틸리스, 바실루스 스파에리쿠스, 바실루스 스테아로써모필루스(Bacillus stearothermophilus), 바실루스 푸밀루스(Bacillus pumilus), 바실루스 리헤니포르미스(Bacillus licheniformis), 바실루스 아밀로리퀘파시엔스(Bacillus amyloliquefaciens), 바실루스 메가테리움(Bacillus megaterium), 바실루스 세레우스(Bacillus cereus), 바실루스 투린기엔시스(Bacillus thuringiensis), 아르트로박터 시트레우스(Arthrobacter citreus), 아르트로박터 파라피네우스(Arthrobacter paraffineus), 코리네박테리움 글루타미쿰(Corynebacterium glutamicum), 코리네박테리움 프리모리옥시단스(Corenebacterium primorioxydans), 코리네박테리움 종, 브레비박테리움 케토글루타미쿰(Brevibacterium ketoglutamicum), 브레비박테리움 리넨스(Brevibacterium linens), 브레비박테리움 종, 스트렙토미세스 리비단스(Streptomyces lividans), 리조비움 레구미노사룸(Rhizobium leguminosarum) 또는 아그로박테리움 투메파시엔스(Agrobacterium tumefaciens)의 종과 속이 특히 바람직하다. 유리하게는, 비오틴의 증가된 천연 생성을 나타내는 세균을 사용한다.Methods according to the invention are Escherichia, Citrobacter, Serratia, Klebsiella, Salmonella, Pseudomonas, Comamonas, Acinetobacter, Azotobacter, Chromobacterium, Bacillus, Clostridium, Artrobacter Preference is given to using bacteria of the genus Corynebacterium, Brevibacterium, Lactococcus, Lactobacillus, Streptomyces, Rizovium, Agrobacterium or Staphylococcus. Esseria coli, Citobacter freundii, Serratia marcescens, Salmonella typhimurium, Pseudomonas mendocina, Pseudomonas aeruginosa aeruginosa, Pseudomonas mutabilis, Pseudomonas chlororaphis, Pseudomonas fluorescens, Comamonas acidovorans, Testamonas acidonines, testosterone, and testosterone. Abactobacter calcoaceticus, Azotobacter vinelandii, Chromobacterium violaceum, Bacillus subtilis, Bacillus spaerycus, Bacillus stearerophilus stemophilus Bacillus ), Bacillus pumilus, Bacillus rihenniformi (Bacillus licheniformis), Bacillus amyloliquefaciens, Bacillus megaterium, Bacillus cereus, Bacillus thuringiensis, Arthrobacter citruus citreus, Arthrobacter paraffineus, Corynebacterium glutamicum, Corynebacterium primorioxydans, Corynebacterium species, Brevibacterium ketoglulu Brevibacterium ketoglutamicum, Brevibacterium linens, Brevibacterium spp., Streptomyces lividans, Rizobium leguminosarum or Agrobacterium tumefacienium Particularly preferred are species and genus of Agrobacterium tumefaciens. Advantageously, bacteria are used that exhibit increased natural production of biotin.

기재된 속의 분류학적 위치는 최근 상당히 변화되었고, 여전히 잘못된 종속명이 고쳐지는 변화 상태에 있다. 과거에 자주 요구되었던 세균 분류학내의 상기 속의 이러한 분류학적 재편성 때문에, 상기 언급된 것이 아닌 과, 속 및 종도 또한 본 발명의 방법에 적합하다.The taxonomic position of the genus described has changed considerably in recent years and is still in a changing state where the wrong dependency name is fixed. Because of this taxonomic reorganization of the genus in bacterial taxonomy which has often been required in the past, families, genus and species not mentioned above are also suitable for the methods of the present invention.

진균, 효모, 식물 또는 식물 세포와 같은 모든 비오틴-합성 유기체는 주로 본 발명에 따른 방법에서 진핵 숙주 유기체로서 사용하기에 적합하다. 바람직하게 언급될 수 있는 효모는 로도토룰라(Rhodotorula), 야로위아(Yarrowia), 스포로볼로미세스(Sporobolomyces), 사카로미세스 또는 시조사카로미세스(Schizosaccharomyces)이다. 로도토룰라 루브라(Rhodotorula rubra), 로도토룰라 글루티니스(Rhodotorula glutinis), 로도토룰라 그라미니스(Rhodotorula graminis), 야로위아 리포리티카(Yarrowia lipolytica), 스포로볼로미세스 살모니콜로르(Sporobolomyces salmonicolor), 스포로볼로미세스 시바타누스(Sporobolomyces shibatanus) 또는 사카로미세스 세레비지아에의 종과 속이 특히 바람직하다.All biotin-synthetic organisms such as fungi, yeasts, plants or plant cells are mainly suitable for use as eukaryotic host organisms in the method according to the invention. Yeasts that may be preferably mentioned are Rhodotorula, Yarrowia, Sporobolomyces, Saccharomyces or Schizosaccharomyces. Rhodotorula rubra, Rhodotorula glutinis, Rhodotorula graminis, Yarrowia lipolytica, Sporobolomises Salmonolocolor ), Sporobolomyces shibatanus or Saccharomyces cerevisiae is particularly preferred.

주로, 모든 식물을 숙주 유기체로서 사용할 수 있으며, 동물 영양 또는 인간 영양에서 한 역할을 하는 식물, 예를 들어 옥수수, 밀, 보리, 호밀, 감자, 완두, 콩, 해바라기, 야자, 기장, 참깨, 코프라 또는 평지가 바람직하다. 아라비도프시스 탈리아나(Arabidopsis thaliana) 또는 라벤둘라 베라(Lavendula vera)와 같은 식물도 적합하다. 식물 세포 배양물, 식물 원형질체 또는 칼루스(callus) 배양물이 특히 바람직하다.Primarily, all plants can be used as host organisms, and plants that play a role in animal or human nutrition, such as corn, wheat, barley, rye, potatoes, peas, beans, sunflowers, palms, millet, sesame seeds, copra Or flat. Plants such as Arabidopsis thaliana or Lavendula vera are also suitable. Particular preference is given to plant cell cultures, plant protoplasts or callus cultures.

본 발명에 따른 방법에 비오틴을 증식 배지로 분비할 수 있고, 경우에 따라 이미 비오틴의 증가된 천연 합성을 또한 나타내는 세균, 진균, 효모 또는 식물 세포와 같은 미생물이 유리하게 사용된다. 유리하게는, 이들 유기체는 또한 이들의 비오틴 생합성의 조절에 관하여 결함이 있을 수 있다. 즉, 이 합성은 조절되지 않거나 또는 매우 감소된 정도로 조절될 뿐이다. 이 조절 결함으로 인해 이들 유기체는 이미 실질적으로 증가된 비오틴 생산성을 갖게 된다. 이러한 조절 결함은, 예를 들어 birA-결손 변이주 형태의 에세리히아 콜리로부터 공지되며, 바람직하게는 외부 영향에 의해 유도될 수 있는 결함, 예를 들어 온도-유도성인 결함으로서 세포내에 존재하여야 한다. 주로, 천연 비오틴 생성을 나타내지 않는 유기체도 비오틴 유전자로 형질전환되면 사용될 수 있다.Microorganisms such as bacteria, fungi, yeast or plant cells are advantageously used in the method according to the invention which can secrete biotin into the growth medium and optionally also exhibit an increased natural synthesis of biotin. Advantageously, these organisms may also be defective with regard to the regulation of their biotin biosynthesis. That is, this synthesis is not controlled or only adjusted to a very reduced degree. These regulatory deficiencies result in these organisms already having substantially increased biotin productivity. Such regulatory defects are known, for example, from Escherichia coli in the form of birA-defective mutant strains and should preferably be present in the cells as defects which can be induced by external influences, for example temperature-induced defects. Primarily, organisms that do not exhibit natural biotin production can be used if they are transformed with a biotin gene.

비오틴 생산성을 전체적으로 더욱 더 증가시키기 위하여, 본 발명에 따른 방법에서 유기체는 유리하게는 bioA, bioB, bioF, bioC, bioD, bioH, bioP, bioW, bioX, bioY 또는 bioR 군에서 선택된 하나 이상의 추가의 비오틴 유전자를 갖는다. 유리하게는, 비오틴 합성을 모의하는 이들 유전자는 또한 세포내에 서열 1, 서열 3, 서열 5 또는 서열 7 및 이들의 조합과 함께 존재할 수 있다. 비오틴 합성을 모의하는 유전자의 예는 플라보레독신 유전자 및 플라보레독신 환원효소 유전자이다. 이 추가의 유전자 또는 이들 추가의 유전자들은 세포내에 서열 1, 서열 3, 서열 5 또는 서열 7 또는 이들의 조합을 갖는 유전자와 같은 하나 이상의 카피로 존재할 수 있다. 이들은 서열 1, 서열 3, 서열 5 및(또는) 서열 7과 같은 벡터 또는 별개의 벡터에 존재하거나, 또는 염색체에 삽입될 수 있다. 서열 1, 서열 3, 서열 5 및(또는) 서열 7은 또한 한 벡터에 또는 별개의 벡터에 함께 있을 수 있거나, 또는 게놈에 삽입될 수 있다.In order to further increase biotin productivity as a whole, the organisms in the method according to the invention advantageously comprise at least one additional biotin selected from the group bioA, bioB, bioF, bioC, bioD, bioH, bioP, bioW, bioX, bioY or bioR. Has a gene. Advantageously, these genes simulating biotin synthesis may also be present in the cell with SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7 and combinations thereof. Examples of genes that simulate biotin synthesis are the flaboredoxin gene and the flaboredoxin reductase gene. This additional gene or these additional genes may be present in the cell in one or more copies, such as a gene having SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7, or a combination thereof. They may be present in vectors or separate vectors, such as SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and / or SEQ ID NO: 7, or inserted into chromosomes. SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and / or SEQ ID NO: 7 may also be together in one vector or in separate vectors, or may be inserted into the genome.

본 발명에 따른 유전자 구조물은 유전자의 발현을 증가시키기 위하여 하나 이상의 조절 신호에 기능적으로 연결된 서열 1의 SAM 합성효소 유전자 및 서열 3, 서열 5 및(또는) 서열 7의 비오틴 합성 유전자, 및 이들의 기능적 변이체, 유사체 또는 유도체의 유전자 서열인 것으로 이해되어야 한다. 이들 신규한 조절 서열 외에, 이들 서열의 천연 조절은 아직도 실제 구조 유전자의 상류에 존재할 수 있으며, 경우에 따라 천연 조절이 끊기고 유전자의 발현이 증가되도록 유전학적으로 변할 수 있다. 그러나, 유전자 구조물은 또한 단순한 방식으로 조립될 수 있다. 즉, 서열 1, 서열 3, 서열 5 및(또는) 서열 7의 상류에 추가의 조절 신호가 삽입되지 않고, 조절을 갖는 천연의 프로모터는 제거되지 않는다. 대신에, 천연의 조절 서열은 비오틴에 의한 조절이 더 이상 일어나지 않고 유전자 발현이 증가되도록 돌연변이된다. 서열 1, 서열 3, 서열 5 및(또는) 서열 7은 하나의 프로모터의 조절하에 있거나 또는 별개의 프로모터의 조절하에 있을 수 있다. 추가의 유리한 조절 요소가 또한 DNA 서열의 3' 말단에 삽입될 수 있다. 서열 1, 서열 3, 서열 5 또는 서열 7을 갖는 유전자는 유전자 구조물에 하나 이상의 카피로 존재할 수 있다.The genetic construct according to the present invention is a SAM synthase gene of SEQ ID NO: 1 and a biotin synthesis gene of SEQ ID NO: 3, SEQ ID NO: 5 and / or SEQ ID NO: 7, and their functionalities, functionally linked to one or more regulatory signals to increase expression of the gene. It is to be understood that this is the gene sequence of the variant, analog or derivative. In addition to these novel regulatory sequences, the natural regulation of these sequences may still exist upstream of the actual structural gene, and in some cases, may be genetically altered so that natural regulation is interrupted and expression of the gene is increased. However, genetic constructs can also be assembled in a simple manner. That is, no further regulatory signal is inserted upstream of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and / or SEQ ID NO: 7, and the native promoter with regulation is not removed. Instead, natural regulatory sequences are mutated such that regulation by biotin no longer occurs and gene expression is increased. SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and / or SEQ ID NO: 7 may be under the control of one promoter or under the control of a separate promoter. Additional advantageous regulatory elements can also be inserted at the 3 'end of the DNA sequence. The gene having SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7 may exist in one or more copies in the gene construct.

본 발명에 따른 방법에 유리한 조절 서열은, 예를 들어 그람-음성 세균에 유리하게 사용되는 cos-, tac-, trp-, tet-, trp-tet-, lpp-, lac-, lpp-lac-, lacI^q-, T7-, T5-, T3-, gal-, trc-, ara-, SP6-, λ-P_R- 또는 λ-P_L-프로모터와 같은 프로모터에 존재한다. 추가의 유리한 조절 서열은, 예를 들어 그람-양성 프로모터인 amy 및 SPO2에, 효모 프로모터인 ADC1, MFα, AC, P-60, CYC1, GAPDH에 또는 식물 프로모터인 CaMV/35S, SSU, OCS, lib4, usp, STLS1, B33 또는 nos에, 또는 유비퀴틴(ubiquitin) 프로모터 또는 파세올린(phaseolin) 프로모터에 존재한다.Regulatory sequences advantageous for the method according to the invention are, for example, cos-, tac-, trp-, tet-, trp-tet-, lpp-, lac-, lpp-lac- which are advantageously used in Gram-negative bacteria. , lacI ^q- , T7-, T5-, T3-, gal-, trc-, ara-, SP6-, λ-P _R -or λ-P _L -promoter. Further advantageous regulatory sequences are, for example, in the Gram-positive promoters amy and SPO2, in the yeast promoters ADC1, MFα, AC, P-60, CYC1, GAPDH or in the plant promoters CaMV / 35S, SSU, OCS, lib4 , usp, STLS1, B33 or nos, or in the ubiquitin promoter or in the phaseolin promoter.

주로, 본 발명에 따른 방법에 전술한 프로모터와 같은 모든 천연 프로모터를 이들의 조절 서열과 함께 사용할 수 있다. 또한, 합성 프로모터도 유리하게 사용할 수 있다.Primarily, all natural promoters, such as the promoters described above in the method according to the invention, can be used with their regulatory sequences. Synthetic promoters can also be advantageously used.

자신의 프로모터를 가질 수 있거나, 서열중 하나의 프로모터의 조절하에 있을 수 있거나, 또는 서열 1, 서열 3, 서열 5 또는 서열 7의 모든 서열의 프로모터의 조절하에 있을 수 있는, bioA, bioB, bioF, bioC, bioD, bioH, bioP, bioW, bioX, bioY 또는 bioR 군에서 선택된 다른 비오틴 유전자가 유전자 구조물에 하나 이상의 카피로 존재할 수 있다.BioA, bioB, bioF, which may have its own promoter, may be under the control of a promoter of one of the sequences, or may be under the control of a promoter of all sequences of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7 Other biotin genes selected from the group bioC, bioD, bioH, bioP, bioW, bioX, bioY or bioR may be present in the gene construct in one or more copies.

전술한 숙주 유기체에서의 발현을 위하여, 유전자 구조물은 유리하게는 숙주에서 유전자의 최적의 발현을 달성할 수 있게 하는 숙주-특이성 벡터로 삽입된다. 벡터는 숙련자에게 널리 공지되어 있으며, 예를 들어 클로닝 벡터란 책(Eds. Pouwels P. H. et al. Elsevier, Amsterdam-New York-Oxford, 1985, ISBN 0 444 904018)으로부터 확인할 수 있다. 플라스미드 외에, 백터도 또한 파지, 비루스, 트랜스포손(transposon), IS 요소, 플라스미드, 코스미드, 또는 선형 또는 환형 DNA와 같은 숙련자에게 공지된 모든 다른 벡터인 것으로 이해되어야 한다. 이들 벡터는 숙주 유기체에서 자율적으로 복제되거나 염색체 복제될 수 있다.For expression in the host organisms described above, the gene constructs are advantageously inserted into host-specific vectors that enable to achieve optimal expression of the gene in the host. Vectors are well known to the skilled person and can be found, for example, in the book Cloning Vector (Eds. Pouwels P. H. et al. Elsevier, Amsterdam-New York-Oxford, 1985, ISBN 0 444 904018). In addition to plasmids, vectors should also be understood to be phages, viruses, transposons, IS elements, plasmids, cosmids, or any other vector known to the skilled person, such as linear or circular DNA. These vectors can autonomously replicate or chromosome replicate in the host organism.

발현 시스템은 예로서 전술된 숙주 유기체와 그 유기체에 적당한 벡터(예: 플라스미드, 비루스 또는 파지, 예를 들어 RNA 폴리머라제/프로모터 시스템을 함유하는 플라스미드, 파지 λ, 또는 Mu 또는 다른 용원 파지 또는 트랜스포손 및(또는) 추가의 유리한 조절 서열)의 조합인 것으로 이해되어야 한다.Expression systems include, for example, the above-described host organisms and plasmids, phage λ, or Mu or other source phages or transposons containing a vector suitable for that organism, such as a plasmid, virus or phage, for example an RNA polymerase / promoter system. And / or further advantageous regulatory sequences).

발현 시스템이란 용어는 바람직하게는 에세리히아 콜리와 그의 플라스미드 및 파지 및 관련 프로모터의 조합, 및 바실루스와 그의 플라스미드 및 프로모터의 조합인 것으로 이해되어야 한다.The term expression system is preferably to be understood to be a combination of Escherichia coli and its plasmids and phages and related promoters, and a combination of Bacillus and its plasmids and promoters.

추가의 3' 및(또는) 5'-말단 조절 서열은 또한 본 발명에 따른 서열 1, 서열 3, 서열 5 및(또는) 서열 7을 유리하게 발현하기에 적합하다.Additional 3 'and / or 5'-terminal regulatory sequences are also suitable for advantageously expressing SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and / or SEQ ID NO: 7 according to the present invention.

이들 조절 서열은 비오틴 유전자의 특이 발현 및 단백질의 발현을 이룰 수 있게 하려는 것이다. 이는 숙주 유기체에 따라, 예를 들어 유전자가 유도 후 겨우 발현되거나 과잉발현되거나, 또는 유전자가 즉시 발현되고(발현되거나) 과잉발현됨을 뜻할 수 있다.These regulatory sequences are intended to enable the specific expression of biotin genes and the expression of proteins. This may mean that depending on the host organism, for example, the gene is barely expressed or overexpressed after induction, or the gene is immediately expressed (expressed) and overexpressed.

이와 관련하여, 조절 서열 또는 인자는 바람직하게는 비오틴 유전자 발현에 긍정적으로 영향을 끼침으로써 발현을 증가시킬 수 있다. 예를 들어, 조절 요소는 유리하게는 프로모터 및(또는) 증진자(enhancer)와 같은 강한 전사 신호를 사용하여 전사 수준에서 강화될 수 있다. 그러나, 이 외에도 예를 들어 mRNA의 안정성을 개선시킴으로써 번역을 강화할 수도 있다.In this regard, regulatory sequences or factors may preferably increase expression by positively affecting biotin gene expression. For example, regulatory elements can advantageously be enhanced at the level of transcription using strong transcriptional signals such as promoters and / or enhancers. In addition, however, translation can also be enhanced, for example by improving the stability of mRNA.

증진자는, 예를 들어 RNA 폴리머라제와 DNA 사이의 상호작용을 개선시킴으로써 비오틴 유전자 발현의 증가를 가져오는 DNA 서열인 것으로 이해되어야 한다.Enhancers should be understood to be DNA sequences that result in increased biotin gene expression, for example by improving the interaction between RNA polymerase and DNA.

서열 1, 서열 3, 서열 5 및 서열 7로부터 유도된 단백질(서열 2, 서열 4, 서열 6 및 서열 8 참조)의 증가 및 이들의 효소 활성의 증가는, 출발 효소에 비하여, 예를 들어 UV 조사와 같은 전형적인 돌연변이 발생에 의해 또는 화학적 돌연변이원으로 처리하고(처리하거나) 부위-지향성 돌연변위 발생, 결실(들), 삽입(들) 및(또는) 치환(들)과 같은 특이한 돌연변이 발생에 의해 상응하는 유전자 서열 또는 이들의 상동물의 서열을 변화시킴으로써 이룰 수 있다. 기술된 유전자 증폭과는 별도로, 증가된 효소 활성은 또한 효소 생합성을 억제하는 소거 인자에 의해 또한(또는) 불활성 효소 대신에 활성 효소를 합성함으로써 얻을 수 있다.The increase in proteins derived from SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, and SEQ ID NO: 7 (see SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, and SEQ ID NO: 8) and their enzymatic activity, for example, compared to the starting enzyme, for example UV irradiation By typical mutagenesis such as or by treatment with chemical mutagens and / or by specific mutagenesis such as site-directed mutagenesis, deletion (s), insertion (s) and / or substitution (s). It can be achieved by changing the gene sequence or the sequence of their molar animals. Apart from the described gene amplifications, increased enzyme activity can also be obtained by scavenging factors that inhibit enzyme biosynthesis and / or by synthesizing active enzymes instead of inactive enzymes.

본 발명에 따른 방법은 유리하게는 DTB의 비오틴으로의 전환을 증가시키고, 따라서 서열 1, 서열 3, 서열 5 및 서열 7, 및 서열 1과 서열 5 또는 서열 1과 서열 7을 갖는 유전자의 조합, 바람직하게는 서열 1과 서열 3을 갖는 유전자의 조합을 갖는 비오틴 유전자를 사용함으로써 전체 비오틴 생산성이 증가되는데, 이들 유전자는 벡터에 의해 또한(또는) 염색체 클로닝에 의해 유기체에 도입된다.The method according to the invention advantageously increases the conversion of DTB into biotin, thus combining the genes having SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7 and SEQ ID NO: 1 and SEQ ID NO: 5 or SEQ ID NO: 1 and SEQ ID NO: 7, Preferably the use of biotin genes having a combination of genes having SEQ ID NO: 1 and SEQ ID NO: 3 increases overall biotin productivity, which genes are introduced into the organism by vectors and / or by chromosomal cloning.

본 발명에 따른 방법에서, 서열 1, 서열 3, 서열 5 및(또는) 서열 7을 갖는 미생물은 이 유기체를 증식시킬 수 있는 배지에서 증식된다. 이 배지는 합성 배지 또는 천연 배지일 수 있다. 숙련자에게 공지되어 있으며 유기체에 적당한 배지를 사용한다. 미생물을 증식시키기 위하여, 사용된 배지는 탄소원, 질소원, 무기 염, 및 경우에 따라 소량의 비타민 및 미량 원소를 함유한다.In the method according to the invention, microorganisms having SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and / or SEQ ID NO: 7 are propagated in a medium capable of propagating this organism. This medium may be synthetic or natural medium. A medium known to the skilled person and suitable for the organism is used. To propagate microorganisms, the medium used contains a carbon source, a nitrogen source, an inorganic salt, and optionally small amounts of vitamins and trace elements.

유리한 탄소원의 예는 모노사카라이드, 디사카라이드 또는 폴리사카라이드(예: 글루코즈, 프럭토즈, 만노즈, 크실로즈, 갈락토즈, 리보즈, 소르보즈, 리불로즈, 락토즈, 말토즈, 슈크로즈, 라피노즈, 전분 또는 셀룰로즈)와 같은 당, 복합 당원(예: 당밀), 당 인산염(예: 프럭토즈-1,6-비스포스페이트), 당 알콜(에: 만니톨), 폴리올(예: 글리세롤), 알콜(예: 메탄올 또는 에탄올), 카르복실산(예: 시트르산, 락트산 또는 아세트산), 지방(예: 대두유 또는 평지자유) 또는 아미노산(예: 글루탐산 또는 아스파르트산), 또는 질소원으로서 동시에 사용될 수 있는 아미노 당이다.Examples of advantageous carbon sources are monosaccharides, disaccharides or polysaccharides such as glucose, fructose, mannose, xylose, galactose, ribose, sorbose, ribulose, lactose, maltose, sucrose , Sugars such as raffinose, starch or cellulose, complex sugar sources (e.g. molasses), sugar phosphates (e.g. fructose-1,6-bisphosphate), sugar alcohols (e. Mannitol), polyols (e.g. glycerol) , Alcohols (such as methanol or ethanol), carboxylic acids (such as citric acid, lactic acid or acetic acid), fats (such as soybean oil or rapeseed oil) or amino acids (such as glutamic acid or aspartic acid), or may be used simultaneously as a nitrogen source Amino sugars.

유리한 질소원은 유기 또는 무기 질소 화합물 또는 이들 화합물을 함유하는 물질이다. 그 예는 암모늄 염(예: NH₄Cl 또는 (NH₄)₂SO₄), 질산염 또는 우레아, 또는 복합 질소원(예: 옥수수 침지액, 브류어(brewer)의 효모 자가분해물, 대두분, 밀 글루텐, 효모 추출물, 고기 추출물, 카제인 가수분해물 또는 질소원으로서 동시에 자주 사용될 수 있는 효모 또는 감자 단백질)이다.Advantageous nitrogen sources are organic or inorganic nitrogen compounds or materials containing these compounds. Examples include ammonium salts (eg NH ₄ Cl or (NH ₄ ) ₂ SO ₄ ), nitrates or ureas, or complex nitrogen sources (eg corn steep liquor, brewer's yeast autolysates, soy flour, wheat gluten) , Yeast extract, meat extract, casein hydrolysate or yeast or potato protein, which can often be used simultaneously at the same time as a nitrogen source.

무기 염의 예는 칼슘, 마그네슘, 나트륨, 망간, 칼륨, 아연, 구리 및 철의 염이다. 특히 언급되는 이들 염의 음이온은 염화물, 황산염 및 인산염 이온이다. 본 발명에 따른 방법에서 생산성을 증가시키는데 중요한 인자는 생산 배지에 Fe²⁺또는 Fe³⁺염 및(또는) 칼륨 염을 첨가하는 것이다.Examples of inorganic salts are salts of calcium, magnesium, sodium, manganese, potassium, zinc, copper and iron. Particularly mentioned anions of these salts are chloride, sulfate and phosphate ions. An important factor in increasing productivity in the process according to the invention is the addition of Fe ²⁺ or Fe ³⁺ salts and / or potassium salts to the production medium.

경우에 따라, 비타민과 같은 생육인자 또는 리보플라빈, 티아민, 폴산, 니코틴산, 파토텐산염 또는 피리독신과 같은 생육 촉진제, 아미노산(예: 알라닌, 시스테인, 아스파라긴, 아스파르트산, 글루타민, 세린, 메티오닌 또는 리신), 카르복실산(예: 시트르산, 포름산, 피멜산 또는 락트산) 또는 디티오트레이톨과 같은 물질이 영양 배지에 첨가된다.If desired, growth factors such as vitamins or growth promoters such as riboflavin, thiamine, folic acid, nicotinic acid, patotothenate or pyridoxine, amino acids (e.g. alanine, cysteine, asparagine, aspartic acid, glutamine, serine, methionine or lysine) Substances such as carboxylic acid (eg citric acid, formic acid, pimelic acid or lactic acid) or dithiothritol are added to the nutrient medium.

항생물질은, 경우에 따라, 세포에서 비오틴 유전자-함유 벡터를 안정화하기 위하여 배지에 첨가된다.Antibiotics are optionally added to the medium to stabilize the biotin gene-containing vector in the cells.

상기 영양분이 혼합되는 비율은 발효의 성질에 따라 좌우되며, 개별적인 경우에 따라 결정된다. 배지 성분들은 모두, 필요에 따라 별도로 멸균되거나 함께 멸균된 후, 발효를 시작할 때 초기에 도입될 수 있거나, 또는 경우에 따라 발효하는 동안 순차적으로 첨가될 수 있다.The proportion at which the nutrients are mixed depends on the nature of the fermentation and on a case by case basis. The media components can all be introduced separately at the start of fermentation, if desired, after being sterilized separately or together together as needed, or optionally added sequentially during fermentation.

배양 조건은 유기체가 최적으로 생육하고 최고의 가능한 수율이 얻어지도록 조정된다. 바람직한 배양 온도는 15℃ 내지 40℃이다. 25℃ 내지 37℃의 온도가 특히 바람직하다. pH는 바람직하게는 3 내지 9의 범위로 유지된다. 5 내지 8의 pH 값이 특히 유리하다. 일반적으로, 8 내지 240시간, 바람직하게는 8 내지 120시간의 배양 기간이 충분하다. 이 시간내에, 최대량의 비오틴이 배지에 축적되고(축적되거나) 세포를 파괴한 후 얻어질 수 있다.Culture conditions are adjusted so that the organism grows optimally and the best possible yield is obtained. Preferred culture temperatures are from 15 ° C. to 40 ° C. Particular preference is given to temperatures of 25 ° C to 37 ° C. The pH is preferably maintained in the range of 3-9. PH values of 5 to 8 are particularly advantageous. In general, an incubation period of 8 to 240 hours, preferably 8 to 120 hours is sufficient. Within this time, the maximum amount of biotin can be obtained after accumulating (accumulating) in the medium and destroying the cells.

비오틴을 생성하기 위한 본 발명에 따른 방법은 연속적으로 또는 회분식으로 또는 반회분식으로 수행될 수 있다. 비오틴 유전자로 형질전환된 식물 세포로부터 전체 식물이 재생되면, 이들을 본 발명에 따른 방법에 따라 완전하게 정상적으로 생육시키고 증식시킬 수 있다.The process according to the invention for producing biotin can be carried out continuously or batchwise or semibatch. Once the whole plants are regenerated from plant cells transformed with the biotin gene, they can be grown and propagated completely normally according to the method according to the invention.

1. S-아데노실메티오닌 합성효소 유전자의 클로닝(서열 1):1. Cloning of S-adenosylmethionine synthase gene (SEQ ID NO: 1):

게놈 이 콜리 DNA로부터 출발하여, 2가지의 특정 올리고뉴클레오티드를 사용하는 폴리머라제 연쇄 반응에 의해 이 콜리 염색체로부터 SAM 합성효소를 암호화하는 유전자(metK)를 증폭하였다. 이렇게 증폭된 DNA를 정제하고, 제한 효소 Acc65I로 분해하고, 이 콜리 균주에서 이 유전자를 과잉발현시킬 수 있는, 동일한 효소로 절단된 벡터에 삽입하였다. 두 올리고뉴클레오티드중 하나를 사용하여, 최적화된 번역 신호를 갖는 유전자 구조물을 제공하였다.The genome amplifies the gene encoding SAM synthase (metK) from this coli chromosome by polymerase chain reaction using two specific oligonucleotides, starting from coli DNA. The amplified DNA was purified, digested with restriction enzyme Acc65I, and inserted into a vector digested with the same enzyme capable of overexpressing this gene in this collie strain. One of the two oligonucleotides was used to provide a genetic construct with an optimized translation signal.

a) 이 콜리 염색체로부터 metK 유전자를 증폭하기 위한 올리고뉴클레오티드의 생성a) production of oligonucleotides to amplify metK gene from this coli chromosome

metK를 리보솜 결합 부위, 암호화 서열의 개시 코돈 및 두 제한 효소 인식 부위 사이의 정지 코돈으로 이루어진 발현 카세트로서 증폭시켰다. 두 제한 부위를 위하여 Acc65I 인식 서열이 선택되었다. 뉴클레오티드 PmetK1(5'-GCGGTACCAGGTGATATTAAATATGGCAAAAC-3') 및 PmetK2(5'-CGGGTACCGATTACTTCAGACCGGCAGC-3')를 사용하여 metK 유전자를 증폭하고 클로닝하였다.metK was amplified as an expression cassette consisting of a ribosomal binding site, an initiation codon of the coding sequence and a stop codon between two restriction enzyme recognition sites. Acc65I recognition sequences were selected for both restriction sites. MetK genes were amplified and cloned using the nucleotides PmetK1 (5'-GCGGTACCAGGTGATATTAAATATGGCAAAAC-3 ') and PmetK2 (5'-CGGGTACCGATTACTTCAGACCGGCAGC-3').

b) PCR의 수행:b) Performing PCR:

조건:Condition:

이 콜리 W3110으로부터 얻은 염색체 DNA 0.5㎍을 주형으로서 사용하였다. 올리고뉴클레오티드 PmetK1 및 PmetK2를 각각 15p㏖의 농도로 사용하였다. dNTP의 농도는 200μM이었다. 제조자 반응 완충제내의 Pwo DNA 폴리머라제(뵈링거 만하임(Boehringer Mannheim)) 2.5U를 폴리머라제로서 사용하였다. PCR 반응 체적은 100㎕이었다.0.5 µg of chromosomal DNA obtained from this Coli W3110 was used as a template. Oligonucleotides PmetK1 and PmetK2 were each used at a concentration of 15 mmol. The concentration of dNTP was 200 μM. 2.5 U of Pwo DNA polymerase (Boehringer Mannheim) in manufacturer reaction buffer was used as polymerase. PCR reaction volume was 100 μl.

증폭:Amplification:

DNA를 94℃에서 2분동안 변성시켰다. 그 다음, 올리고뉴클레오티드를 55℃에서 30초동안 어닐링(annealing)하였다. 신장은 72℃에서 75초동안 일어났다. PCR 반응은 30주기에 걸쳐 수행하였다.DNA was denatured at 94 ° C. for 2 minutes. The oligonucleotides were then annealed at 55 ° C. for 30 seconds. Elongation occurred at 72 ° C. for 75 seconds. PCR reactions were performed over 30 cycles.

약 1145bp의 크기를 갖는 생성된 DNA 생성물을 정제하고, 적합한 완충제내에서 Acc65I로 분해하였다.The resulting DNA product, having a size of about 1145 bp, was purified and digested with Acc65I in a suitable buffer.

c) 발현 벡터에서의 metK의 클로닝c) cloning of metK in the expression vector

벡터 pHS1(구성은 독일 특허출원 제197.31274.8호, 우선권 22.7.97, 실시예 1, pp. 14-17에 기술되어 있음)을 Acc65I로 분해하고, 새우 알칼리성 포스파타제(SAP)(뵈링거 만하임)를 사용하여 탈포스포릴화하였다. SAP를 변성시킨 후, 제조자의 지시에 따라 고속 DNA 연결 키트(Rapid DNA Ligation kit)를 사용하여 1:3의 몰비로 벡터와 단편을 연결시켰다. 연결 혼합물로 이 콜리 XL-1-blue 균주를 형질전환시켰다. 플라스미드 제조 및 제한 분석에 의해 양의 클론이 밝혀졌다. 제한 분해 및 서열결정에 의해 pHS1내의 metK 단편의 정확한 배향을 결정하였다. 생성된 구조물을 pHS1 metK로 명명하였다(도 1). pHS1 metK의 서열은 서열 9에 제공된다. 서열 10은 metK-암호화 영역으로부터 추론한 아미노산 서열을 나타낸다.Vector pHS1 (constitution is described in German patent application No. 197.31274.8, priority 22.7.97, Example 1, pp. 14-17) was digested with Acc65I and shrimp alkaline phosphatase (SAP) (Schoringer Mannheim) Dephosphorylation was used. After denaturation of SAP, the vector and fragment were linked in a molar ratio of 1: 3 using a Rapid DNA Ligation kit according to the manufacturer's instructions. This Coli XL-1-blue strain was transformed with the ligation mixture. Positive clones were revealed by plasmid preparation and restriction analysis. Restriction digestion and sequencing determined the exact orientation of the metK fragment in pHS1. The resulting construct was named pHS1 metK (FIG. 1). The sequence of pHS1 metK is provided in SEQ ID NO: 9. SEQ ID NO: 10 shows the amino acid sequence deduced from the metK-coding region.

2. 플라스미드 pHBbio14 및 pHS1 bioS1의 구성2. Composition of Plasmids pHBbio14 and pHS1 bioS1

플라스미드 pHBbio14 및 pHS1 bioS1의 구성은 이미 기술되어 있다(독일 특허출원 제197.31274.8호, 우선권 22.7.97, 실시예 1, 2 및 5).The construction of the plasmids pHBbio14 and pHS1 bioS1 has already been described (German patent application no. 197.31274.8, priority 22.7.97, examples 1, 2 and 5).

3. pHS1 metK bioS1의 구성3. Composition of pHS1 metK bioS1

플라스미드 제조 방법(뵈링거)을 사용하여 플라스미드 pHS1 bioS1(서열 11, (독일 특허출원 제197.31274.8호, 우선권 22.7.97), 서열 12는 bioS1-암호화 영역으로부터 추론한 아미노산 서열을 나타냄) 및 pHS1 metK(서열 9)를 정제하였다. pHS1 metK로부터 Acc65I로 분해하여 metK 유전자를 갖는 단편을 단리하였다. pHS1 bioS1을 Acc65I로 분해하고, 새우 알칼리성 포스파타제(SAP)(뵈링거 만하임)로 탈포스포릴화하였다. 제조자의 지시에 따라 SAP를 변성시킨 후, 제조자의 지시에 따라 고속 DNA 연결 키트를 사용하여 1:3의 몰비로 벡터와 metK 단편을 연결시켰다. 연결 혼합물로 이 콜리 XL-1-blue 균주를 형질전환시켰다. 플라스미드 제조 및 제한 분석에 의해 양성 클론이 확인되었다. 제한 분해 및 서열결정에 의해 pHS1 bioS1내의 metK 단편의 정확한 배향을 결정하였다. 생성된 구조물을 pHS1 metK bioS1로 명명하였다(도 2). pHS1 metK bioS1의 서열은 서열 13에 제공된다. 서열 14는 metK-암호화 영역으로부터 추론한 아미노산 서열을 나타내고, 서열 15는 bioS1-암호화 영역으로부터 추론한 아미노산 서열을 나타낸다.Plasmid pHS1 bioS1 (SEQ ID NO: 11 (German Patent Application No. 197.31274.8, Priority 22.7.97), SEQ ID NO: 12 shows the amino acid sequence inferred from the bioS1-coding region) and pHS1 using the plasmid preparation method (Schlinger) metK (SEQ ID NO: 9) was purified. Fragments carrying the metK gene were isolated by digestion with Acc65I from pHS1 metK. pHS1 bioS1 was digested with Acc65I and dephosphorylated with shrimp alkaline phosphatase (SAP) (Möllinger Mannheim). After denaturation of SAP according to the manufacturer's instructions, the vector and metK fragments were linked in a molar ratio of 1: 3 using a high speed DNA ligation kit according to the manufacturer's instructions. This Coli XL-1-blue strain was transformed with the ligation mixture. Positive clones were identified by plasmid preparation and restriction analysis. Restriction digestion and sequencing determined the exact orientation of the metK fragment in pHS1 bioS1. The resulting construct was named pHS1 metK bioS1 (FIG. 2). The sequence of pHS1 metK bioS1 is provided in SEQ ID NO: 13. SEQ ID NO: 14 shows an amino acid sequence inferred from a metK-coding region and SEQ ID NO: 15 shows an amino acid sequence inferred from a bioS1-coding region.

4. metK, bioS1 및 metK와 bioS1를 함께 과잉발현함에 의한 비오틴 생산성의 증가4. Increased biotin productivity by overexpressing metK, bioS1 and metK and bioS1 together

자연적인 리팜피신-내성 콜로니를 리팜피신 플레이트에 도말함으로써 BM4086(케트너(Ketner) 및 캠벨(Campbell)의 문헌[J. Molec. Biology, 1975, 96:13]))으로부터 단리하였다. 이들 내성 균주중 하나로부터 P1 용해질이 생성되었다. 이 P1 용해질을 균주 W3110으로 형질도입하고, 이어서 리팜피신을 사용하여 클론을 선택하였다. 생성된 균주를 CaCl₂방법(마니아티스(Maniatis) 등의 문헌[Molecular Cloning, Cols Spring Harbour Laboratory Press, 1989])을 사용하여 플라스미드 pHBbio14로 형질전환시키고, 1㎖당 암피실린 100㎍을 함유하는 LB에서 생육시켰다. 단리된, 형질전환된 균주(LU5560)를 CaCl₂방법을 사용하여 각각의 경우에 플라스미드 pHS1, pHS1 metK, pHS1 bioS1 또는 pHS1 metK bioS1로 형질전환시킨 다음, 1㎖당 암피실린 100㎍ 및 카나마이신 25㎍을 함유하는 LB 한천에서 선택하였다.Natural rifampicin-resistant colonies were isolated from BM4086 (Ketner and Campbell, J. Molec. Biology, 1975, 96:13) by plating on rifampicin plates. P1 lysate was produced from one of these resistant strains. This P1 lysate was transduced with strain W3110 and then clones were selected using rifampicin. The resulting strains were transformed with plasmid pHBbio14 using the CaCl ₂ method (Maniatis et al. (Molecular Cloning, Cols Spring Harbor Laboratory Press, 1989)) and in LB containing 100 μg of ampicillin per ml. It was grown. The isolated, transformed strain (LU5560) was transformed in each case with plasmid pHS1, pHS1 metK, pHS1 bioS1 or pHS1 metK bioS1 using CaCl ₂ method, followed by 100 μg of ampicillin and 25 μg of kanamycin per mL. It was selected from the containing LB agar.

각 형질전환체로부터 얻은 하나의 콜로니를 각각의 경우에 적당한 항생물질을 함유하는 DYT 배양액에 접종하고, 12시간동안 배양하였다. 밤새 배양하여(=ONC) 1ℓ당 글리세롤 30g 및 적당한 항생물질을 함유한 TB 배지(샘브룩(Sambrook, J.), 프리치(Fritsch, E. F.), 마니아티스의 문헌[2nd ed. Cold Spring Harbor Laboratory Press., 1989 ISBN 0-87969-373-8])에 10㎖ 배양액을 접종하였다. 플라스미드 pHS1, pHS1 metK, pHS1 bioS1 및 pHS1 metK bioS1이 존재하는 경우, metK 및 bioS1 유전자, 또는 각각 두 유전자의 조합의 발현을 유도하기 위하여, 1mM IPTG 및 0.5% 아라비노즈를 동시에 첨가하였다. 24시간 후, 세포를 배양 상청액으로부터 원심분리에 의해 제거하고, 스트렙타비딘을 사용하는 경쟁적 ELISA에 의해 상청액내의 비오틴 농도를 결정하였다. 이 결정의 결과를 하기 표 I에 나타낸다.One colony from each transformant was inoculated in DYT culture containing the appropriate antibiotic in each case and incubated for 12 hours. Incubated overnight (= ONC) TB medium containing 30 g of glycerol per liter and appropriate antibiotics (Sambrook, J., Frisch, EF, Maniatis, 2nd ed. Cold Spring Harbor Laboratory Press., 1989 ISBN 0-87969-373-8]) was inoculated with 10 ml culture. When plasmids pHS1, pHS1 metK, pHS1 bioS1 and pHS1 metK bioS1 were present, 1 mM IPTG and 0.5% arabinose were added simultaneously to induce expression of metK and bioS1 genes, or a combination of both genes, respectively. After 24 hours, cells were removed from the culture supernatant by centrifugation and the biotin concentration in the supernatant was determined by competitive ELISA using streptavidin. The results of this determination are shown in Table I below.

균주Strain 플라스미드 IPlasmid I 플라스미드 IIPlasmid II 비오틴 ㎎/ℓBiotin mg / L LU5580LU5580 pHBbio14pHBbio14 대조군, 플라스미드 없음Control, no plasmid 1111 LU5580LU5580 pHBbio14pHBbio14 pHS1pHS1 2525 LU5580LU5580 pHBbio14pHBbio14 pHS1 bioS1pHS1 bioS1 4545 LU5580LU5580 pHBbio14pHBbio14 pHS1 metKpHS1 metK 3737 LU5580LU5580 pHBbio14pHBbio14 pHS1 metK bioS1pHS1 metK bioS1 5252

＜110＞ BASF Aktiengesellschaft<110> BASF Aktiengesellschaft

＜120＞ Verfahren zur Herstellung von Biotin<120> Verfahren zur Herstellung von Biotin

＜130＞ 0050_48792<130> 0050_48792

＜140＞ 19806872.7<140> 19806872.7

＜141＞ 1998-02-19<141> 1998-02-19

＜160＞ 15<160> 15

＜170＞ PatentIn Vers. 2.0<170> Patent In Vers. 2.0

＜210＞ 1<210> 1

＜211＞ 1155<211> 1155

＜212＞ DNA<212> DNA

＜213＞ Escherichia coli<213> Escherichia coli

＜220＞<220>

＜221＞ CDS<221> CDS

＜222＞ (1)..(1155)<222> (1) .. (1155)

＜400＞ 1<400> 1

atg gca aaa cac ctt ttt acg tcc gag tcc gtc tct gaa ggg cat cct 48atg gca aaa cac ctt ttt acg tcc gag tcc gtc tct gaa ggg cat cct 48

Met Ala Lys His Leu Phe Thr Ser Glu Ser Val Ser Glu Gly His ProMet Ala Lys His Leu Phe Thr Ser Glu Ser Val Ser Glu Gly His Pro

1 5 10 151 5 10 15

gac aaa att gct gac caa att tct gat gcc gtt tta gac gcg atc ctc 96gac aaa att gct gac caa att tct gat gcc gtt tta gac gcg atc ctc 96

Asp Lys Ile Ala Asp Gln Ile Ser Asp Ala Val Leu Asp Ala Ile LeuAsp Lys Ile Ala Asp Gln Ile Ser Asp Ala Val Leu Asp Ala Ile Leu

20 25 3020 25 30

gaa cag gat ccg aaa gca cgc gtt gct tgc gaa acc tac gta aaa acc 144gaa cag gat ccg aaa gca cgc gtt gct tgc gaa acc tac gta aaa acc 144

Glu Gln Asp Pro Lys Ala Arg Val Ala Cys Glu Thr Tyr Val Lys ThrGlu Gln Asp Pro Lys Ala Arg Val Ala Cys Glu Thr Tyr Val Lys Thr

35 40 4535 40 45

ggc atg gtt tta gtt ggc ggc gaa atc acc acc agc gcc tgg gta gac 192ggc atg gtt tta gtt ggc ggc gaa atc acc acc agc gcc tgg gta gac 192

Gly Met Val Leu Val Gly Gly Glu Ile Thr Thr Ser Ala Trp Val AspGly Met Val Leu Val Gly Gly Glu Ile Thr Thr Ser Ala Trp Val Asp

50 55 6050 55 60

atc gaa gag atc acc cgt aac acc gtt cgc gaa att ggc tat gtg cat 240atc gaa gag atc acc cgt aac acc gtt cgc gaa att ggc tat gtg cat 240

Ile Glu Glu Ile Thr Arg Asn Thr Val Arg Glu Ile Gly Tyr Val HisIle Glu Glu Ile Thr Arg Asn Thr Val Arg Glu Ile Gly Tyr Val His

65 70 75 8065 70 75 80

tcc gac atg ggc ttt gac gct aac tcc tgt gcg gtt ctg agc gct atc 288tcc gac atg ggc ttt gac gct aac tcc tgt gcg gtt ctg agc gct atc 288

Ser Asp Met Gly Phe Asp Ala Asn Ser Cys Ala Val Leu Ser Ala IleSer Asp Met Gly Phe Asp Ala Asn Ser Cys Ala Val Leu Ser Ala Ile

85 90 9585 90 95

ggc aaa cag tct cct gac atc aac cag ggc gtt gac cgt gcc gat ccg 336ggc aaa cag tct cct gac atc aac cag ggc gtt gac cgt gcc gat ccg 336

Gly Lys Gln Ser Pro Asp Ile Asn Gln Gly Val Asp Arg Ala Asp ProGly Lys Gln Ser Pro Asp Ile Asn Gln Gly Val Asp Arg Ala Asp Pro

100 105 110100 105 110

ctg gaa cag ggc gcg ggt gac cag ggt ctg atg ttt ggc tac gca act 384ctg gaa cag ggc gcg ggt gac cag ggt ctg atg ttt ggc tac gca act 384

Leu Glu Gln Gly Ala Gly Asp Gln Gly Leu Met Phe Gly Tyr Ala ThrLeu Glu Gln Gly Ala Gly Asp Gln Gly Leu Met Phe Gly Tyr Ala Thr

115 120 125115 120 125

aat gaa acc gac gtg ctg atg cca gca cct atc acc tat gca cac cgt 432aat gaa acc gac gtg ctg atg cca gca cct atc acc tat gca cac cgt 432

Asn Glu Thr Asp Val Leu Met Pro Ala Pro Ile Thr Tyr Ala His ArgAsn Glu Thr Asp Val Leu Met Pro Ala Pro Ile Thr Tyr Ala His Arg

130 135 140130 135 140

ctg gta cag cgt cag gct gaa gtg cgt aaa aac ggc act ctg ccg tgg 480ctg gta cag cgt cag gct gaa gtg cgt aaa aac ggc act ctg ccg tgg 480

Leu Val Gln Arg Gln Ala Glu Val Arg Lys Asn Gly Thr Leu Pro TrpLeu Val Gln Arg Gln Ala Glu Val Arg Lys Asn Gly Thr Leu Pro Trp

145 150 155 160145 150 155 160

ctg cgc ccg gac gcg aaa agc cag gtg act ttt cag tat gac gac ggc 528ctg cgc ccg gac gcg aaa agc cag gtg act ttt cag tat gac gac ggc 528

Leu Arg Pro Asp Ala Lys Ser Gln Val Thr Phe Gln Tyr Asp Asp GlyLeu Arg Pro Asp Ala Lys Ser Gln Val Thr Phe Gln Tyr Asp Asp Gly

165 170 175165 170 175

aaa atc gtt ggt atc gat gct gtc gtg ctt tcc act cag cac tct gaa 576aaa atc gtt ggt atc gat gct gtc gtg ctt tcc act cag cac tct gaa 576

Lys Ile Val Gly Ile Asp Ala Val Val Leu Ser Thr Gln His Ser GluLys Ile Val Gly Ile Asp Ala Val Val Leu Ser Thr Gln His Ser Glu

180 185 190180 185 190

gag atc gac cag aaa tcg ctg caa gaa gcg gta atg gaa gag atc atc 624gag atc gac cag aaa tcg ctg caa gaa gcg gta atg gaa gag atc atc 624

Glu Ile Asp Gln Lys Ser Leu Gln Glu Ala Val Met Glu Glu Ile IleGlu Ile Asp Gln Lys Ser Leu Gln Glu Ala Val Met Glu Glu Ile Ile

195 200 205195 200 205

aag cca att ctg ccc gct gaa tgg ctg act tct gcc acc aaa ttc ttc 672aag cca att ctg ccc gct gaa tgg ctg act tct gcc acc aaa ttc ttc 672

Lys Pro Ile Leu Pro Ala Glu Trp Leu Thr Ser Ala Thr Lys Phe PheLys Pro Ile Leu Pro Ala Glu Trp Leu Thr Ser Ala Thr Lys Phe Phe

210 215 220210 215 220

atc aac ccg acc ggt cgt ttc gtt atc ggt ggc cca atg ggt gac tgc 720atc aac ccg acc ggt cgt ttc gtt atc ggt ggc cca atg ggt gac tgc 720

Ile Asn Pro Thr Gly Arg Phe Val Ile Gly Gly Pro Met Gly Asp CysIle Asn Pro Thr Gly Arg Phe Val Ile Gly Gly Pro Met Gly Asp Cys

225 230 235 240225 230 235 240

ggt ctg act ggt cgt aaa att atc gtt gat acc tac ggc ggc atg gcg 768ggt ctg act ggt cgt aaa att atc gtt gat acc tac ggc ggc atg gcg 768

Gly Leu Thr Gly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly Met AlaGly Leu Thr Gly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly Met Ala

245 250 255245 250 255

cgt cac ggt ggc ggt gca ttc tct ggt aaa gat cca tca aaa gtg gac 816cgt cac ggt ggc ggt gca ttc tct ggt aaa gat cca tca aaa gtg gac 816

Arg His Gly Gly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val AspArg His Gly Gly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val Asp

260 265 270260 265 270

cgt tcc gca gcc tac gca gca cgt tat gtc gcg aaa aac atc gtt gct 864cgt tcc gca gcc tac gca gca cgt tat gtc gcg aaa aac atc gtt gct 864

Arg Ser Ala Ala Tyr Ala Ala Arg Tyr Val Ala Lys Asn Ile Val AlaArg Ser Ala Ala Tyr Ala Ala Arg Tyr Val Ala Lys Asn Ile Val Ala

275 280 285275 280 285

gct ggc ctg gcc gat cgt tgt gaa att cag gtt tcc tac gca atc ggc 912gct ggc ctg gcc gat cgt tgt gaa att cag gtt tcc tac gca atc ggc 912

Ala Gly Leu Ala Asp Arg Cys Glu Ile Gln Val Ser Tyr Ala Ile GlyAla Gly Leu Ala Asp Arg Cys Glu Ile Gln Val Ser Tyr Ala Ile Gly

290 295 300290 295 300

gtg gct gaa ccg acc tcc atc atg gta gaa act ttc ggt act gag aaa 960gtg gct gaa ccg acc tcc atc atg gta gaa act ttc ggt act gag aaa 960

Val Ala Glu Pro Thr Ser Ile Met Val Glu Thr Phe Gly Thr Glu LysVal Ala Glu Pro Thr Ser Ile Met Val Glu Thr Phe Gly Thr Glu Lys

305 310 315 320305 310 315 320

gtg cct tct gaa caa ctg acc ctg ctg gta cgt gag ttc ttc gac ctg 1008gtg cct tct gaa caa ctg acc ctg ctg gta cgt gag ttc ttc gac ctg 1008

Val Pro Ser Glu Gln Leu Thr Leu Leu Val Arg Glu Phe Phe Asp LeuVal Pro Ser Glu Gln Leu Thr Leu Leu Val Arg Glu Phe Phe Asp Leu

325 330 335325 330 335

cgc cca tac ggt ctg att cag atg ctg gat ctg ctg cac ccg atc tac 1056cgc cca tac ggt ctg att cag atg ctg gat ctg ctg cac ccg atc tac 1056

Arg Pro Tyr Gly Leu Ile Gln Met Leu Asp Leu Leu His Pro Ile TyrArg Pro Tyr Gly Leu Ile Gln Met Leu Asp Leu Leu His Pro Ile Tyr

340 345 350340 345 350

aaa gaa acc gca gca tac ggt cac ttt ggt cgt gaa cat ttc ccg tgg 1104aaa gaa acc gca gca tac ggt cac ttt ggt cgt gaa cat ttc ccg tgg 1104

Lys Glu Thr Ala Ala Tyr Gly His Phe Gly Arg Glu His Phe Pro TrpLys Glu Thr Ala Ala Tyr Gly His Phe Gly Arg Glu His Phe Pro Trp

355 360 365355 360 365

gaa aaa acc gac aaa gcg cag ctg ctg cgc gat gct gcc ggt ctg aag 1152gaa aaa acc gac aaa gcg cag ctg ctg cgc gat gct gcc ggt ctg aag 1152

Glu Lys Thr Asp Lys Ala Gln Leu Leu Arg Asp Ala Ala Gly Leu LysGlu Lys Thr Asp Lys Ala Gln Leu Leu Arg Asp Ala Ala Gly Leu Lys

370 375 380370 375 380

taa 1155taa 1155

385385

＜210＞ 2<210> 2

＜211＞ 384<211> 384

＜212＞ PRT<212> PRT

＜213＞ Escherichia coli<213> Escherichia coli

＜400＞ 2<400> 2

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

275 280 285275 280 285

290 295 300290 295 300

305 310 315 320305 310 315 320

325 330 335325 330 335

340 345 350340 345 350

355 360 365355 360 365

370 375 380370 375 380

＜210＞ 3<210> 3

＜211＞ 1206<211> 1206

＜212＞ DNA<212> DNA

＜213＞ Escherichia coli<213> Escherichia coli

＜220＞<220>

＜221＞ CDS<221> CDS

＜222＞ (1)..(1206)<222> (1) .. (1206)

＜400＞ 3<400> 3

atg aac gtt ttt aat ccc gcg cag ttt cgc gcc cag ttt ccc gca cta 48atg aac gtt ttt aat ccc gcg cag ttt cgc gcc cag ttt ccc gca cta 48

Met Asn Val Phe Asn Pro Ala Gln Phe Arg Ala Gln Phe Pro Ala LeuMet Asn Val Phe Asn Pro Ala Gln Phe Arg Ala Gln Phe Pro Ala Leu

1 5 10 151 5 10 15

cag gat gcg ggc gtc tat ctc gac agc gcc gcg acc gcg ctt aaa cct 96cag gat gcg ggc gtc tat ctc gac agc gcc gcg acc gcg ctt aaa cct 96

Gln Asp Ala Gly Val Tyr Leu Asp Ser Ala Ala Thr Ala Leu Lys ProGln Asp Ala Gly Val Tyr Leu Asp Ser Ala Ala Thr Ala Leu Lys Pro

20 25 3020 25 30

gaa gcc gtg gtt gaa gcc acc caa cag ttt tac agt ctg agc gcc gga 144gaa gcc gtg gtt gaa gcc acc caa cag ttt tac agt ctg agc gcc gga 144

Glu Ala Val Val Glu Ala Thr Gln Gln Phe Tyr Ser Leu Ser Ala GlyGlu Ala Val Val Glu Ala Thr Gln Gln Phe Tyr Ser Leu Ser Ala Gly

35 40 4535 40 45

aac gtc cat cgc agc cag ttt gcc gaa gcc caa cgc ctg acc gcg cgt 192aac gtc cat cgc agc cag ttt gcc gaa gcc caa cgc ctg acc gcg cgt 192

Asn Val His Arg Ser Gln Phe Ala Glu Ala Gln Arg Leu Thr Ala ArgAsn Val His Arg Ser Gln Phe Ala Glu Ala Gln Arg Leu Thr Ala Arg

50 55 6050 55 60

tat gaa gct gca cga gag aaa gtg gcg caa tta ctg aat gca ccg gat 240tat gaa gct gca cga gag aaa gtg gcg caa tta ctg aat gca ccg gat 240

Tyr Glu Ala Ala Arg Glu Lys Val Ala Gln Leu Leu Asn Ala Pro AspTyr Glu Ala Ala Arg Glu Lys Val Ala Gln Leu Leu Asn Ala Pro Asp

65 70 75 8065 70 75 80

gat aaa act atc gtc tgg acg cgc ggc acc act gaa tcc atc aac atg 288gat aaa act atc gtc tgg acg cgc ggc acc act gaa tcc atc aac atg 288

Asp Lys Thr Ile Val Trp Thr Arg Gly Thr Thr Glu Ser Ile Asn MetAsp Lys Thr Ile Val Trp Thr Arg Gly Thr Thr Glu Ser Ile Asn Met

85 90 9585 90 95

gtg gca caa tgc tat gcg cgt ccg cgt ctg caa ccg ggc gat gag att 336gtg gca caa tgc tat gcg cgt ccg cgt ctg caa ccg ggc gat gag att 336

Val Ala Gln Cys Tyr Ala Arg Pro Arg Leu Gln Pro Gly Asp Glu IleVal Ala Gln Cys Tyr Ala Arg Pro Arg Leu Gln Pro Gly Asp Glu Ile

100 105 110100 105 110

att gtc agc gtg gca gaa cac cac gcc aac ctc gtc ccc tgg ctg atg 384att gtc agc gtg gca gaa cac cac gcc aac ctc gtc ccc tgg ctg atg 384

Ile Val Ser Val Ala Glu His His Ala Asn Leu Val Pro Trp Leu MetIle Val Ser Val Ala Glu His His Ala Asn Leu Val Pro Trp Leu Met

115 120 125115 120 125

gtc gcc caa caa act gga gcc aaa gtg gtg aaa ttg ccg ctt aat gcg 432gtc gcc caa caa act gga gcc aaa gtg gtg aaa ttg ccg ctt aat gcg 432

Val Ala Gln Gln Thr Gly Ala Lys Val Val Lys Leu Pro Leu Asn AlaVal Ala Gln Gln Thr Gly Ala Lys Val Val Lys Leu Pro Leu Asn Ala

130 135 140130 135 140

cag cga ctg ccg gat gtc gat ttg ttg cca gaa ctg att act ccc cgt 480cag cga ctg ccg gat gtc gat ttg ttg cca gaa ctg att act ccc cgt 480

Gln Arg Leu Pro Asp Val Asp Leu Leu Pro Glu Leu Ile Thr Pro ArgGln Arg Leu Pro Asp Val Asp Leu Leu Pro Glu Leu Ile Thr Pro Arg

145 150 155 160145 150 155 160

agt cgg att ctg gcg ttg ggt cag atg tcg aac gtt act ggc ggt tgc 528agt cgg att ctg gcg ttg ggt cag atg tcg aac gtt act ggc ggt tgc 528

Ser Arg Ile Leu Ala Leu Gly Gln Met Ser Asn Val Thr Gly Gly CysSer Arg Ile Leu Ala Leu Gly Gln Met Ser Asn Val Thr Gly Gly Cys

165 170 175165 170 175

ccg gat ctg gcg cga gcg att acc ttt gct cat tca gcc ggg atg gtg 576ccg gat ctg gcg cga gcg att acc ttt gct cat tca gcc ggg atg gtg 576

Pro Asp Leu Ala Arg Ala Ile Thr Phe Ala His Ser Ala Gly Met ValPro Asp Leu Ala Arg Ala Ile Thr Phe Ala His Ser Ala Gly Met Val

180 185 190180 185 190

gtg atg gtt gat ggt gct cag ggg gca gtg cat ttc ccc gcg gat gtt 624gtg atg gtt gat ggt gct cag ggg gca gtg cat ttc ccc gcg gat gtt 624

Val Met Val Asp Gly Ala Gln Gly Ala Val His Phe Pro Ala Asp ValVal Met Val Asp Gly Ala Gln Gly Ala Val His Phe Pro Ala Asp Val

195 200 205195 200 205

cag caa ctg gat att gat ttc tat gct ttt tca ggt cac aaa ctg tat 672cag caa ctg gat att gat ttc tat gct ttt tca ggt cac aaa ctg tat 672

Gln Gln Leu Asp Ile Asp Phe Tyr Ala Phe Ser Gly His Lys Leu TyrGln Gln Leu Asp Ile Asp Phe Tyr Ala Phe Ser Gly His Lys Leu Tyr

210 215 220210 215 220

ggc ccg aca ggt atc ggc gtg ctg tat ggt aaa tca gaa ctg ctg gag 720ggc ccg aca ggt atc ggc gtg ctg tat ggt aaa tca gaa ctg ctg gag 720

Gly Pro Thr Gly Ile Gly Val Leu Tyr Gly Lys Ser Glu Leu Leu GluGly Pro Thr Gly Ile Gly Val Leu Tyr Gly Lys Ser Glu Leu Leu Glu

225 230 235 240225 230 235 240

gcg atg tcg ccc tgg ctg ggc ggc ggc aaa atg gtt cac gaa gtg agt 768gcg atg tcg ccc tgg ctg ggc ggc ggc aaa atg gtt cac gaa gtg agt 768

Ala Met Ser Pro Trp Leu Gly Gly Gly Lys Met Val His Glu Val SerAla Met Ser Pro Trp Leu Gly Gly Gly Lys Met Val His Glu Val Ser

245 250 255245 250 255

ttt gac ggc ttc acg act caa tct gcg ccg tgg aaa ctg gaa gct gga 816ttt gac ggc ttc acg act caa tct gcg ccg tgg aaa ctg gaa gct gga 816

Phe Asp Gly Phe Thr Thr Gln Ser Ala Pro Trp Lys Leu Glu Ala GlyPhe Asp Gly Phe Thr Thr Gln Ser Ala Pro Trp Lys Leu Glu Ala Gly

260 265 270260 265 270

acg cca aat gtc gct ggt gtc ata gga tta agc gcg gcg ctg gaa tgg 864acg cca aat gtc gct ggt gtc ata gga tta agc gcg gcg ctg gaa tgg 864

Thr Pro Asn Val Ala Gly Val Ile Gly Leu Ser Ala Ala Leu Glu TrpThr Pro Asn Val Ala Gly Val Ile Gly Leu Ser Ala Ala Leu Glu Trp

275 280 285275 280 285

ctg gca gat tac gat atc aac cag gcc gaa agc tgg agc cgt agc tta 912ctg gca gat tac gat atc aac cag gcc gaa agc tgg agc cgt agc tta 912

Leu Ala Asp Tyr Asp Ile Asn Gln Ala Glu Ser Trp Ser Arg Ser LeuLeu Ala Asp Tyr Asp Ile Asn Gln Ala Glu Ser Trp Ser Arg Ser Leu

290 295 300290 295 300

gca acg ctg gcg gaa gat gcg ctg gcg aaa cgt ccc ggc ttt cgt tca 960gca acg ctg gcg gaa gat gcg ctg gcg aaa cgt ccc ggc ttt cgt tca 960

Ala Thr Leu Ala Glu Asp Ala Leu Ala Lys Arg Pro Gly Phe Arg SerAla Thr Leu Ala Glu Asp Ala Leu Ala Lys Arg Pro Gly Phe Arg Ser

305 310 315 320305 310 315 320

ttc cgc tgc cag gat tcc agc ctg ctg gcc ttt gat ttt gct ggc gtt 1008ttc cgc tgc cag gat tcc agc ctg ctg gcc ttt gat ttt gct ggc gtt 1008

Phe Arg Cys Gln Asp Ser Ser Leu Leu Ala Phe Asp Phe Ala Gly ValPhe Arg Cys Gln Asp Ser Ser Leu Leu Ala Phe Asp Phe Ala Gly Val

325 330 335325 330 335

cat cat agc gat atg gtg acg ctg ctg gcg gag tac ggt att gcc ctg 1056cat cat agc gat atg gtg acg ctg ctg gcg gag tac ggt att gcc ctg 1056

His His Ser Asp Met Val Thr Leu Leu Ala Glu Tyr Gly Ile Ala LeuHis His Ser Asp Met Val Thr Leu Leu Ala Glu Tyr Gly Ile Ala Leu

340 345 350340 345 350

cgg gcc ggg cag cat tgc gct cag ccg cta ctg gca gaa tta ggc gta 1104cgg gcc ggg cag cat tgc gct cag ccg cta ctg gca gaa tta ggc gta 1104

Arg Ala Gly Gln His Cys Ala Gln Pro Leu Leu Ala Glu Leu Gly ValArg Ala Gly Gln His Cys Ala Gln Pro Leu Leu Ala Glu Leu Gly Val

355 360 365355 360 365

acc ggc aca ctg cgc gcc tct ttt gcg cca tat aat aca aag agt gat 1152acc ggc aca ctg cgc gcc tct ttt gcg cca tat aat aca aag agt gat 1152

Thr Gly Thr Leu Arg Ala Ser Phe Ala Pro Tyr Asn Thr Lys Ser AspThr Gly Thr Leu Arg Ala Ser Phe Ala Pro Tyr Asn Thr Lys Ser Asp

370 375 380370 375 380

gtg gat gcg ctg gtg aat gcc gtt gac cgc gcg ctg gaa tta ttg gtg 1200gtg gat gcg ctg gtg aat gcc gtt gac cgc gcg ctg gaa tta ttg gtg 1200

Val Asp Ala Leu Val Asn Ala Val Asp Arg Ala Leu Glu Leu Leu ValVal Asp Ala Leu Val Asn Ala Val Asp Arg Ala Leu Glu Leu Leu Val

385 390 395 400385 390 395 400

gat taa 1206gat taa 1206

AspAsp

＜210＞ 4<210> 4

＜211＞ 401<211> 401

＜212＞ PRT<212> PRT

＜213＞ Escherichia coli<213> Escherichia coli

＜400＞ 4<400> 4

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

275 280 285275 280 285

290 295 300290 295 300

305 310 315 320305 310 315 320

325 330 335325 330 335

340 345 350340 345 350

355 360 365355 360 365

370 375 380370 375 380

385 390 395 400385 390 395 400

AspAsp

＜210＞ 5<210> 5

＜211＞ 1215<211> 1215

＜212＞ DNA<212> DNA

＜213＞ Escherichia coli<213> Escherichia coli

＜220＞<220>

＜221＞ CDS<221> CDS

＜222＞ (1)..(1215)<222> (1) .. (1215)

＜400＞ 5<400> 5

atg aaa tta ccg att tat ctc gac tac tcc gca acc acg ccg gtg gac 48atg aaa tta ccg att tat ctc gac tac tcc gca acc acg ccg gtg gac 48

Met Lys Leu Pro Ile Tyr Leu Asp Tyr Ser Ala Thr Thr Pro Val AspMet Lys Leu Pro Ile Tyr Leu Asp Tyr Ser Ala Thr Thr Pro Val Asp

1 5 10 151 5 10 15

ccg cgt gtt gcc gag aaa atg atg cag ttt atg acg atg gac gga acc 96ccg cgt gtt gcc gag aaa atg atg cag ttt atg acg atg gac gga acc 96

Pro Arg Val Ala Glu Lys Met Met Gln Phe Met Thr Met Asp Gly ThrPro Arg Val Ala Glu Lys Met Met Gln Phe Met Thr Met Asp Gly Thr

20 25 3020 25 30

ttt ggt aac ccg gcc tcc cgt tct cac cgt ttc ggc tgg cag gct gaa 144ttt ggt aac ccg gcc tcc cgt tct cac cgt ttc ggc tgg cag gct gaa 144

Phe Gly Asn Pro Ala Ser Arg Ser His Arg Phe Gly Trp Gln Ala GluPhe Gly Asn Pro Ala Ser Arg Ser His Arg Phe Gly Trp Gln Ala Glu

35 40 4535 40 45

gaa gcg gta gat atc gcc cgt aat cag att gcc gat ctg gtc ggc gct 192gaa gcg gta gat atc gcc cgt aat cag att gcc gat ctg gtc ggc gct 192

Glu Ala Val Asp Ile Ala Arg Asn Gln Ile Ala Asp Leu Val Gly AlaGlu Ala Val Asp Ile Ala Arg Asn Gln Ile Ala Asp Leu Val Gly Ala

50 55 6050 55 60

gat ccg cgt gaa atc gtc ttt acc tct ggt gca acc gaa tct gac aac 240gat ccg cgt gaa atc gtc ttt acc tct ggt gca acc gaa tct gac aac 240

Asp Pro Arg Glu Ile Val Phe Thr Ser Gly Ala Thr Glu Ser Asp AsnAsp Pro Arg Glu Ile Val Phe Thr Ser Gly Ala Thr Glu Ser Asp Asn

65 70 75 8065 70 75 80

ctg gcg atc aaa ggt gca gcc aac ttt tat cag aaa aaa ggc aag cac 288ctg gcg atc aaa ggt gca gcc aac ttt tat cag aaa aaa ggc aag cac 288

Leu Ala Ile Lys Gly Ala Ala Asn Phe Tyr Gln Lys Lys Gly Lys HisLeu Ala Ile Lys Gly Ala Ala Asn Phe Tyr Gln Lys Lys Gly Lys His

85 90 9585 90 95

atc atc acc agc aaa acc gaa cac aaa gcg gta ctg gat acc tgc cgt 336atc atc acc agc aaa acc gaa cac aaa gcg gta ctg gat acc tgc cgt 336

Ile Ile Thr Ser Lys Thr Glu His Lys Ala Val Leu Asp Thr Cys ArgIle Ile Thr Ser Lys Thr Glu His Lys Ala Val Leu Asp Thr Cys Arg

100 105 110100 105 110

cag ctg gag cgc gaa ggt ttt gaa gtc acc tac ctg gca ccg cag cgt 384cag ctg gag cgc gaa ggt ttt gaa gtc acc tac ctg gca ccg cag cgt 384

Gln Leu Glu Arg Glu Gly Phe Glu Val Thr Tyr Leu Ala Pro Gln ArgGln Leu Glu Arg Glu Gly Phe Glu Val Thr Tyr Leu Ala Pro Gln Arg

115 120 125115 120 125

aac ggc att atc gac ctg aaa gaa ctt gaa gca gcg atg cgt gac gac 432aac ggc att atc gac ctg aaa gaa ctt gaa gca gcg atg cgt gac gac 432

Asn Gly Ile Ile Asp Leu Lys Glu Leu Glu Ala Ala Met Arg Asp AspAsn Gly Ile Ile Asp Leu Lys Glu Leu Glu Ala Ala Met Arg Asp Asp

130 135 140130 135 140

acc atc ctc gtg tcc atc atg cac gta aat aac gaa atc ggc gtg gtg 480acc atc ctc gtg tcc atc atg cac gta aat aac gaa atc ggc gtg gtg 480

Thr Ile Leu Val Ser Ile Met His Val Asn Asn Glu Ile Gly Val ValThr Ile Leu Val Ser Ile Met His Val Asn Asn Glu Ile Gly Val Val

145 150 155 160145 150 155 160

cag gat atc gcg gct atc ggc gaa atg tgc cgt gct cgt ggc att atc 528cag gat atc gcg gct atc ggc gaa atg tgc cgt gct cgt ggc att atc 528

Gln Asp Ile Ala Ala Ile Gly Glu Met Cys Arg Ala Arg Gly Ile IleGln Asp Ile Ala Ala Ile Gly Glu Met Cys Arg Ala Arg Gly Ile Ile

165 170 175165 170 175

tat cac gtt gat gca acc cag agc gtg ggt aaa ctg cct atc gac ctg 576tat cac gtt gat gca acc cag agc gtg ggt aaa ctg cct atc gac ctg 576

Tyr His Val Asp Ala Thr Gln Ser Val Gly Lys Leu Pro Ile Asp LeuTyr His Val Asp Ala Thr Gln Ser Val Gly Lys Leu Pro Ile Asp Leu

180 185 190180 185 190

agc cag ttg aaa gtt gac ctg atg tct ttc tcc ggt cac aaa atc tat 624agc cag ttg aaa gtt gac ctg atg tct ttc tcc ggt cac aaa atc tat 624

Ser Gln Leu Lys Val Asp Leu Met Ser Phe Ser Gly His Lys Ile TyrSer Gln Leu Lys Val Asp Leu Met Ser Phe Ser Gly His Lys Ile Tyr

195 200 205195 200 205

ggc ccg aaa ggt atc ggt gcg ctg tat gta cgt cgt aaa ccg cgc gta 672ggc ccg aaa ggt atc ggt gcg ctg tat gta cgt cgt aaa ccg cgc gta 672

Gly Pro Lys Gly Ile Gly Ala Leu Tyr Val Arg Arg Lys Pro Arg ValGly Pro Lys Gly Ile Gly Ala Leu Tyr Val Arg Arg Lys Pro Arg Val

210 215 220210 215 220

cgc atc gaa gcg caa atg cac ggc ggc ggt cac gag cgc ggt atg cgt 720cgc atc gaa gcg caa atg cac ggc ggc ggt cac gag cgc ggt atg cgt 720

Arg Ile Glu Ala Gln Met His Gly Gly Gly His Glu Arg Gly Met ArgArg Ile Glu Ala Gln Met His Gly Gly Gly His Glu Arg Gly Met Arg

225 230 235 240225 230 235 240

tcc ggc act ctg cct gtt cac cag atc gtc gga atg ggc gag gcc tat 768tcc ggc act ctg cct gtt cac cag atc gtc gga atg ggc gag gcc tat 768

Ser Gly Thr Leu Pro Val His Gln Ile Val Gly Met Gly Glu Ala TyrSer Gly Thr Leu Pro Val His Gln Ile Val Gly Met Gly Glu Ala Tyr

245 250 255245 250 255

cgc atc gca aaa gaa gag atg gcg acc gag atg gaa cgt ctg cgc ggc 816cgc atc gca aaa gaa gag atg gcg acc gag atg gaa cgt ctg cgc ggc 816

Arg Ile Ala Lys Glu Glu Met Ala Thr Glu Met Glu Arg Leu Arg GlyArg Ile Ala Lys Glu Glu Met Ala Thr Glu Met Glu Arg Leu Arg Gly

260 265 270260 265 270

ctg cgt aac cgt ctg tgg aac ggc atc aaa gat atc gaa gaa gtt tac 864ctg cgt aac cgt ctg tgg aac ggc atc aaa gat atc gaa gaa gtt tac 864

Leu Arg Asn Arg Leu Trp Asn Gly Ile Lys Asp Ile Glu Glu Val TyrLeu Arg Asn Arg Leu Trp Asn Gly Ile Lys Asp Ile Glu Glu Val Tyr

275 280 285275 280 285

ctg aac ggt gac ctg gaa cac ggt gcg ccg aac att ctc aac gtc agc 912ctg aac ggt gac ctg gaa cac ggt gcg ccg aac att ctc aac gtc agc 912

Leu Asn Gly Asp Leu Glu His Gly Ala Pro Asn Ile Leu Asn Val SerLeu Asn Gly Asp Leu Glu His Gly Ala Pro Asn Ile Leu Asn Val Ser

290 295 300290 295 300

ttc aac tac gtt gaa ggt gag tcg ctg att atg gcg ctg aaa gac ctc 960ttc aac tac gtt gaa ggt gag tcg ctg att atg gcg ctg aaa gac ctc 960

Phe Asn Tyr Val Glu Gly Glu Ser Leu Ile Met Ala Leu Lys Asp LeuPhe Asn Tyr Val Glu Gly Glu Ser Leu Ile Met Ala Leu Lys Asp Leu

305 310 315 320305 310 315 320

gca gtt tct tca ggt tcc gcc tgt acg tca gca agc ctc gaa ccg tcc 1008gca gtt tct tca ggt tcc gcc tgt acg tca gca agc ctc gaa ccg tcc 1008

Ala Val Ser Ser Gly Ser Ala Cys Thr Ser Ala Ser Leu Glu Pro SerAla Val Ser Ser Gly Ser Ala Cys Thr Ser Ala Ser Leu Glu Pro Ser

325 330 335325 330 335

tac gtg ctg cgc gcg ctg ggg ctg aac gac gag ctg gca cat agc tct 1056tac gtg ctg cgc gcg ctg ggg ctg aac gac gag ctg gca cat agc tct 1056

Tyr Val Leu Arg Ala Leu Gly Leu Asn Asp Glu Leu Ala His Ser SerTyr Val Leu Arg Ala Leu Gly Leu Asn Asp Glu Leu Ala His Ser Ser

340 345 350340 345 350

atc cgt ttc tct tta ggt cgt ttt act act gaa gaa gag atc gac tac 1104atc cgt ttc tct tta ggt cgt ttt act act gaa gaa gag atc gac tac 1104

Ile Arg Phe Ser Leu Gly Arg Phe Thr Thr Glu Glu Glu Ile Asp TyrIle Arg Phe Ser Leu Gly Arg Phe Thr Thr Glu Glu Glu Ile Asp Tyr

355 360 365355 360 365

acc atc gag tta gtt cgt aaa tcc atc ggt cgt ctg cgt gac ctt tct 1152acc atc gag tta gtt cgt aaa tcc atc ggt cgt ctg cgt gac ctt tct 1152

Thr Ile Glu Leu Val Arg Lys Ser Ile Gly Arg Leu Arg Asp Leu SerThr Ile Glu Leu Val Arg Lys Ser Ile Gly Arg Leu Arg Asp Leu Ser

370 375 380370 375 380

ccg ctg tgg gaa atg tac aag cag ggc gtg gat ctg aac agc atc gaa 1200ccg ctg tgg gaa atg tac aag cag ggc gtg gat ctg aac agc atc gaa 1200

Pro Leu Trp Glu Met Tyr Lys Gln Gly Val Asp Leu Asn Ser Ile GluPro Leu Trp Glu Met Tyr Lys Gln Gly Val Asp Leu Asn Ser Ile Glu

385 390 395 400385 390 395 400

tgg gct cat cat taa 1215tgg gct cat cat taa 1215

Trp Ala His HisTrp Ala His His

405405

＜210＞ 6<210> 6

＜211＞ 404<211> 404

＜212＞ PRT<212> PRT

＜213＞ Escherichia coli<213> Escherichia coli

＜400＞ 6<400> 6

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

275 280 285275 280 285

290 295 300290 295 300

305 310 315 320305 310 315 320

325 330 335325 330 335

340 345 350340 345 350

355 360 365355 360 365

370 375 380370 375 380

385 390 395 400385 390 395 400

Trp Ala His HisTrp Ala His His

＜210＞ 7<210> 7

＜211＞ 1221<211> 1221

＜212＞ DNA<212> DNA

＜213＞ Escherichia coli<213> Escherichia coli

＜220＞<220>

＜221＞ CDS<221> CDS

＜222＞ (1)..(1221)<222> (1) .. (1221)

＜400＞ 7<400> 7

atg att ttt tcc gtc gac aaa gtg cgg gcc gac ttt ccg gtg ctt tcg 48atg att ttt tcc gtc gac aaa gtg cgg gcc gac ttt ccg gtg ctt tcg 48

Met Ile Phe Ser Val Asp Lys Val Arg Ala Asp Phe Pro Val Leu SerMet Ile Phe Ser Val Asp Lys Val Arg Ala Asp Phe Pro Val Leu Ser

1 5 10 151 5 10 15

cgt gag gta aac ggt ttg ccg ctg gct tat ctc gac agc gcc gcc agt 96cgt gag gta aac ggt ttg ccg ctg gct tat ctc gac agc gcc gcc agt 96

Arg Glu Val Asn Gly Leu Pro Leu Ala Tyr Leu Asp Ser Ala Ala SerArg Glu Val Asn Gly Leu Pro Leu Ala Tyr Leu Asp Ser Ala Ala Ser

20 25 3020 25 30

gcg cag aaa ccg agc cag gtg att gac gcc gag gcc gag ttt tat cgt 144gcg cag aaa ccg agc cag gtg att gac gcc gag gcc gag ttt tat cgt 144

Ala Gln Lys Pro Ser Gln Val Ile Asp Ala Glu Ala Glu Phe Tyr ArgAla Gln Lys Pro Ser Gln Val Ile Asp Ala Glu Ala Glu Phe Tyr Arg

35 40 4535 40 45

cat ggc tac gcg gcg gtg cat cgt ggt att cat acc tta agc gcc cag 192cat ggc tac gcg gcg gtg cat cgt ggt att cat acc tta agc gcc cag 192

His Gly Tyr Ala Ala Val His Arg Gly Ile His Thr Leu Ser Ala GlnHis Gly Tyr Ala Ala Val His Arg Gly Ile His Thr Leu Ser Ala Gln

50 55 6050 55 60

gcg acc gag aaa atg gag aac gtg cgc aag cgg gca tcg ctg ttt att 240gcg acc gag aaa atg gag aac gtg cgc aag cgg gca tcg ctg ttt att 240

Ala Thr Glu Lys Met Glu Asn Val Arg Lys Arg Ala Ser Leu Phe IleAla Thr Glu Lys Met Glu Asn Val Arg Lys Arg Ala Ser Leu Phe Ile

65 70 75 8065 70 75 80

aat gcc cgt tcg gcg gaa gag ctg gtg ttc gtc cgc ggc acg acg gaa 288aat gcc cgt tcg gcg gaa gag ctg gtg ttc gtc cgc ggc acg acg gaa 288

Asn Ala Arg Ser Ala Glu Glu Leu Val Phe Val Arg Gly Thr Thr GluAsn Ala Arg Ser Ala Glu Glu Leu Val Phe Val Arg Gly Thr Thr Glu

85 90 9585 90 95

ggg atc aat ctg gtc gcc aat agc tgg ggc aac agc aac gtg cgg gcg 336ggg atc aat ctg gtc gcc aat agc tgg ggc aac agc aac gtg cgg gcg 336

Gly Ile Asn Leu Val Ala Asn Ser Trp Gly Asn Ser Asn Val Arg AlaGly Ile Asn Leu Val Ala Asn Ser Trp Gly Asn Ser Asn Val Arg Ala

100 105 110100 105 110

ggc gat aac atc atc atc agt cag atg gag cac cac gct aac att gtt 384ggc gat aac atc atc atc agt cag atg gag cac cac gct aac att gtt 384

Gly Asp Asn Ile Ile Ile Ser Gln Met Glu His His Ala Asn Ile ValGly Asp Asn Ile Ile Ile Ser Gln Met Glu His His Ala Asn Ile Val

115 120 125115 120 125

ccc tgg cag atg ctt tgc gca cgc gtt ggc gca gag ctg cgt gtg atc 432ccc tgg cag atg ctt tgc gca cgc gtt ggc gca gag ctg cgt gtg atc 432

Pro Trp Gln Met Leu Cys Ala Arg Val Gly Ala Glu Leu Arg Val IlePro Trp Gln Met Leu Cys Ala Arg Val Gly Ala Glu Leu Arg Val Ile

130 135 140130 135 140

ccg ctc aat ccc gat ggt acg ttg caa ctg gag acg ctg cct acg ctg 480ccg ctc aat ccc gat ggt acg ttg caa ctg gag acg ctg cct acg ctg 480

Pro Leu Asn Pro Asp Gly Thr Leu Gln Leu Glu Thr Leu Pro Thr LeuPro Leu Asn Pro Asp Gly Thr Leu Gln Leu Glu Thr Leu Pro Thr Leu

145 150 155 160145 150 155 160

ttt gat gag aaa act cgc ctg ctg gca att act cat gtc tcc aac gtg 528ttt gat gag aaa act cgc ctg ctg gca att act cat gtc tcc aac gtg 528

Phe Asp Glu Lys Thr Arg Leu Leu Ala Ile Thr His Val Ser Asn ValPhe Asp Glu Lys Thr Arg Leu Leu Ala Ile Thr His Val Ser Asn Val

165 170 175165 170 175

ctt ggc aca gaa aat cca ctg gcg gaa atg atc acg ctt gcg cac cag 576ctt ggc aca gaa aat cca ctg gcg gaa atg atc acg ctt gcg cac cag 576

Leu Gly Thr Glu Asn Pro Leu Ala Glu Met Ile Thr Leu Ala His GlnLeu Gly Thr Glu Asn Pro Leu Ala Glu Met Ile Thr Leu Ala His Gln

180 185 190180 185 190

cat ggc gca aaa gtg ctg gtg gat ggc gct cag gcg gtg atg cat cat 624cat ggc gca aaa gtg ctg gtg gat ggc gct cag gcg gtg atg cat cat 624

His Gly Ala Lys Val Leu Val Asp Gly Ala Gln Ala Val Met His HisHis Gly Ala Lys Val Leu Val Asp Gly Ala Gln Ala Val Met His His

195 200 205195 200 205

ccg gtg gat gtt cag gcg ctg gat tgc gac ttt tac gtg ttc tcc ggg 672ccg gtg gat gtt cag gcg ctg gat tgc gac ttt tac gtg ttc tcc ggg 672

Pro Val Asp Val Gln Ala Leu Asp Cys Asp Phe Tyr Val Phe Ser GlyPro Val Asp Val Gln Ala Leu Asp Cys Asp Phe Tyr Val Phe Ser Gly

210 215 220210 215 220

cat aaa ctg tat ggc ccc acc gga att ggc att ctt tat gtg aaa gaa 720cat aaa ctg tat ggc ccc acc gga att ggc att ctt tat gtg aaa gaa 720

His Lys Leu Tyr Gly Pro Thr Gly Ile Gly Ile Leu Tyr Val Lys GluHis Lys Leu Tyr Gly Pro Thr Gly Ile Gly Ile Leu Tyr Val Lys Glu

225 230 235 240225 230 235 240

gcc ttg ttg cag gag atg ccg ccg tgg gaa ggg ggc ggt tct atg atc 768gcc ttg ttg cag gag atg ccg ccg tgg gaa ggg ggc ggt tct atg atc 768

Ala Leu Leu Gln Glu Met Pro Pro Trp Glu Gly Gly Gly Ser Met IleAla Leu Leu Gln Glu Met Pro Pro Trp Glu Gly Gly Gly Ser Met Ile

245 250 255245 250 255

gcc acc gtc agc ctg agt gaa ggc act acc tgg acc aaa gca cca tgg 816gcc acc gtc agc ctg agt gaa ggc act acc tgg acc aaa gca cca tgg 816

Ala Thr Val Ser Leu Ser Glu Gly Thr Thr Trp Thr Lys Ala Pro TrpAla Thr Val Ser Leu Ser Glu Gly Thr Thr Trp Thr Lys Ala Pro Trp

260 265 270260 265 270

cgg ttt gaa gcc ggt aca ccc aat acc ggg ggc atc att ggt ctt ggc 864cgg ttt gaa gcc ggt aca ccc aat acc ggg ggc atc att ggt ctt ggc 864

Arg Phe Glu Ala Gly Thr Pro Asn Thr Gly Gly Ile Ile Gly Leu GlyArg Phe Glu Ala Gly Thr Pro Asn Thr Gly Gly Ile Ile Gly Leu Gly

275 280 285275 280 285

gcg gcg ctg gag tat gtt tcg gcg ctg ggg ctt aat aac ata gcc gag 912gcg gcg ctg gag tat gtt tcg gcg ctg ggg ctt aat aac ata gcc gag 912

Ala Ala Leu Glu Tyr Val Ser Ala Leu Gly Leu Asn Asn Ile Ala GluAla Ala Leu Glu Tyr Val Ser Ala Leu Gly Leu Asn Asn Ile Ala Glu

290 295 300290 295 300

tat gaa cag aat ctg atg cat tat gcg cta tca cag ctg gaa tct gta 960tat gaa cag aat ctg atg cat tat gcg cta tca cag ctg gaa tct gta 960

Tyr Glu Gln Asn Leu Met His Tyr Ala Leu Ser Gln Leu Glu Ser ValTyr Glu Gln Asn Leu Met His Tyr Ala Leu Ser Gln Leu Glu Ser Val

305 310 315 320305 310 315 320

ccg gat ctc act ctc tat ggc cca caa aac agg ctt ggc gtt att gct 1008ccg gat ctc act ctc tat ggc cca caa aac agg ctt ggc gtt att gct 1008

Pro Asp Leu Thr Leu Tyr Gly Pro Gln Asn Arg Leu Gly Val Ile AlaPro Asp Leu Thr Leu Tyr Gly Pro Gln Asn Arg Leu Gly Val Ile Ala

325 330 335325 330 335

ttt aat ctc ggt aaa cac cac gcc tat gat gtt ggc agt ttt ctc gat 1056ttt aat ctc ggt aaa cac cac gcc tat gat gtt ggc agt ttt ctc gat 1056

Phe Asn Leu Gly Lys His His Ala Tyr Asp Val Gly Ser Phe Leu AspPhe Asn Leu Gly Lys His His Ala Tyr Asp Val Gly Ser Phe Leu Asp

340 345 350340 345 350

aat tac ggc att gct gtg cgt acc gga cat cac tgc gca atg cca ttg 1104aat tac ggc att gct gtg cgt acc gga cat cac tgc gca atg cca ttg 1104

Asn Tyr Gly Ile Ala Val Arg Thr Gly His His Cys Ala Met Pro LeuAsn Tyr Gly Ile Ala Val Arg Thr Gly His His Cys Ala Met Pro Leu

355 360 365355 360 365

atg gcc tat tac aac gtc cct gcg atg tgt cgg gcg tcg ctg gcc atg 1152atg gcc tat tac aac gtc cct gcg atg tgt cgg gcg tcg ctg gcc atg 1152

Met Ala Tyr Tyr Asn Val Pro Ala Met Cys Arg Ala Ser Leu Ala MetMet Ala Tyr Tyr Asn Val Pro Ala Met Cys Arg Ala Ser Leu Ala Met

370 375 380370 375 380

tat aac acc cat gaa gaa gtg gat cgt ctg gtg acc ggc ctg caa cgt 1200tat aac acc cat gaa gaa gtg gat cgt ctg gtg acc ggc ctg caa cgt 1200

Tyr Asn Thr His Glu Glu Val Asp Arg Leu Val Thr Gly Leu Gln ArgTyr Asn Thr His Glu Glu Val Asp Arg Leu Val Thr Gly Leu Gln Arg

385 390 395 400385 390 395 400

att cac cgt ttg ctg gga taa 1221att cac cgt ttg ctg gga taa 1221

Ile His Arg Leu Leu GlyIle His Arg Leu Leu Gly

405405

＜210＞ 8<210> 8

＜211＞ 406<211> 406

＜212＞ PRT<212> PRT

＜213＞ Escherichia coli<213> Escherichia coli

＜400＞ 8<400> 8

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

275 280 285275 280 285

290 295 300290 295 300

305 310 315 320305 310 315 320

325 330 335325 330 335

340 345 350340 345 350

355 360 365355 360 365

370 375 380370 375 380

385 390 395 400385 390 395 400

Ile His Arg Leu Leu GlyIle His Arg Leu Leu Gly

405405

＜210＞ 9<210> 9

＜211＞ 3720<211> 3720

＜212＞ DNA<212> DNA

＜213＞ Escherichia coli<213> Escherichia coli

＜220＞<220>

＜221＞ CDS<221> CDS

＜222＞ (530)..(1684)<222> (530) .. (1684)

＜400＞ 9<400> 9

gacgtctgtg tggaattgtg agcggataac aatttcacac agggccctcg gacaccgagg 60gacgtctgtg tggaattgtg agcggataac aatttcacac agggccctcg gacaccgagg 60

agaatgtcaa gaggcgaaca cacaacgtct tggagcgcca gaggaggaac gagctaaaac 120agaatgtcaa gaggcgaaca cacaacgtct tggagcgcca gaggaggaac gagctaaaac 120

ggagcttttt tgccctgcgt gaccagatcc cggagttgga aaacaatgaa aaggccccca 180ggagcttttt tgccctgcgt gaccagatcc cggagttgga aaacaatgaa aaggccccca 180

aggtagttat ccttaaaaaa gccacagcat acatcctgtc cgtccaagca gaggagcaaa 240aggtagttat ccttaaaaaa gccacagcat acatcctgtc cgtccaagca gaggagcaaa 240

agctcatttc tgaagaggac ttgttgcgga aacgacgaga acagttgaaa cacaaacttg 300agctcatttc tgaagaggac ttgttgcgga aacgacgaga acagttgaaa cacaaacttg 300

aacagctacg gaactcttgt gcgtaaggaa aagtaaggaa aacgattcct tctaacagaa 360aacagctacg gaactcttgt gcgtaaggaa aagtaaggaa aacgattcct tctaacagaa 360

atgtcctgag caatcaccta tgaactgtcg actcgagata gcatttttat ccataagatt 420atgtcctgag caatcaccta tgaactgtcg actcgagata gcatttttat ccataagatt 420

agccgatcct aaggtttaca attgtgagcg ctcacaatta tgatagattc aattgtgagc 480agccgatcct aaggtttaca attgtgagcg ctcacaatta tgatagattc aattgtgagc 480

ggataacaat ttcacacacg ctagcggtac caaagaggag aaattaact atg gca aaa 538ggataacaat ttcacacacg ctagcggtac caaagaggag aaattaact atg gca aaa 538

Met Ala LysMet ala lys

1One

cac ctt ttt acg tcc gag tcc gtc tct gaa ggg cat cct gac aaa att 586cac ctt ttt acg tcc gag tcc gtc tct gaa ggg cat cct gac aaa att 586

His Leu Phe Thr Ser Glu Ser Val Ser Glu Gly His Pro Asp Lys IleHis Leu Phe Thr Ser Glu Ser Val Ser Glu Gly His Pro Asp Lys Ile

5 10 155 10 15

gct gac caa att tct gat gcc gtt tta gac gcg atc ctc gaa cag gat 634gct gac caa att tct gat gcc gtt tta gac gcg atc ctc gaa cag gat 634

Ala Asp Gln Ile Ser Asp Ala Val Leu Asp Ala Ile Leu Glu Gln AspAla Asp Gln Ile Ser Asp Ala Val Leu Asp Ala Ile Leu Glu Gln Asp

20 25 30 3520 25 30 35

ccg aaa gca cgc gtt gct tgc gaa acc tac gta aaa acc ggc atg gtt 682ccg aaa gca cgc gtt gct tgc gaa acc tac gta aaa acc ggc atg gtt 682

Pro Lys Ala Arg Val Ala Cys Glu Thr Tyr Val Lys Thr Gly Met ValPro Lys Ala Arg Val Ala Cys Glu Thr Tyr Val Lys Thr Gly Met Val

40 45 5040 45 50

tta gtt ggc ggc gaa atc acc acc agc gcc tgg gta gac atc gaa gag 730tta gtt ggc ggc gaa atc acc acc agc gcc tgg gta gac atc gaa gag 730

Leu Val Gly Gly Glu Ile Thr Thr Ser Ala Trp Val Asp Ile Glu GluLeu Val Gly Gly Glu Ile Thr Thr Ser Ala Trp Val Asp Ile Glu Glu

55 60 6555 60 65

atc acc cgt aac acc gtt cgc gaa att ggc tat gtg cat tcc gac atg 778atc acc cgt aac acc gtt cgc gaa att ggc tat gtg cat tcc gac atg 778

Ile Thr Arg Asn Thr Val Arg Glu Ile Gly Tyr Val His Ser Asp MetIle Thr Arg Asn Thr Val Arg Glu Ile Gly Tyr Val His Ser Asp Met

70 75 8070 75 80

ggc ttt gac gct aac tcc tgt gcg gtt ctg agc gct atc ggc aaa cag 826ggc ttt gac gct aac tcc tgt gcg gtt ctg agc gct atc ggc aaa cag 826

Gly Phe Asp Ala Asn Ser Cys Ala Val Leu Ser Ala Ile Gly Lys GlnGly Phe Asp Ala Asn Ser Cys Ala Val Leu Ser Ala Ile Gly Lys Gln

85 90 9585 90 95

tct cct gac atc aac cag ggc gtt gac cgt gcc gat ccg ctg gaa cag 874tct cct gac atc aac cag ggc gtt gac cgt gcc gat ccg ctg gaa cag 874

Ser Pro Asp Ile Asn Gln Gly Val Asp Arg Ala Asp Pro Leu Glu GlnSer Pro Asp Ile Asn Gln Gly Val Asp Arg Ala Asp Pro Leu Glu Gln

100 105 110 115100 105 110 115

ggc gcg ggt gac cag ggt ctg atg ttt ggc tac gca act aat gaa acc 922ggc gcg ggt gac cag ggt ctg atg ttt ggc tac gca act aat gaa acc 922

Gly Ala Gly Asp Gln Gly Leu Met Phe Gly Tyr Ala Thr Asn Glu ThrGly Ala Gly Asp Gln Gly Leu Met Phe Gly Tyr Ala Thr Asn Glu Thr

120 125 130120 125 130

gac gtg ctg atg cca gca cct atc acc tat gca cac cgt ctg gta cag 970gac gtg ctg atg cca gca cct atc acc tat gca cac cgt ctg gta cag 970

Asp Val Leu Met Pro Ala Pro Ile Thr Tyr Ala His Arg Leu Val GlnAsp Val Leu Met Pro Ala Pro Ile Thr Tyr Ala His Arg Leu Val Gln

135 140 145135 140 145

cgt cag gct gaa gtg cgt aaa aac ggc act ctg ccg tgg ctg cgc ccg 1018cgt cag gct gaa gtg cgt aaa aac ggc act ctg ccg tgg ctg cgc ccg 1018

Arg Gln Ala Glu Val Arg Lys Asn Gly Thr Leu Pro Trp Leu Arg ProArg Gln Ala Glu Val Arg Lys Asn Gly Thr Leu Pro Trp Leu Arg Pro

150 155 160150 155 160

gac gcg aaa agc cag gtg act ttt cag tat gac gac ggc aaa atc gtt 1066gac gcg aaa agc cag gtg act ttt cag tat gac gac ggc aaa atc gtt 1066

Asp Ala Lys Ser Gln Val Thr Phe Gln Tyr Asp Asp Gly Lys Ile ValAsp Ala Lys Ser Gln Val Thr Phe Gln Tyr Asp Asp Gly Lys Ile Val

165 170 175165 170 175

ggt atc gat gct gtc gtg ctt tcc act cag cac tct gaa gag atc gac 1114ggt atc gat gct gtc gtg ctt tcc act cag cac tct gaa gag atc gac 1114

Gly Ile Asp Ala Val Val Leu Ser Thr Gln His Ser Glu Glu Ile AspGly Ile Asp Ala Val Val Leu Ser Thr Gln His Ser Glu Glu Ile Asp

180 185 190 195180 185 190 195

cag aaa tcg ctg caa gaa gcg gta atg gaa gag atc atc aag cca att 1162cag aaa tcg ctg caa gaa gcg gta atg gaa gag atc atc aag cca att 1162

Gln Lys Ser Leu Gln Glu Ala Val Met Glu Glu Ile Ile Lys Pro IleGln Lys Ser Leu Gln Glu Ala Val Met Glu Glu Ile Ile Lys Pro Ile

200 205 210200 205 210

ctg ccc gct gaa tgg ctg act tct gcc acc aaa ttc ttc atc aac ccg 1210ctg ccc gct gaa tgg ctg act tct gcc acc aaa ttc ttc atc aac ccg 1210

Leu Pro Ala Glu Trp Leu Thr Ser Ala Thr Lys Phe Phe Ile Asn ProLeu Pro Ala Glu Trp Leu Thr Ser Ala Thr Lys Phe Phe Ile Asn Pro

215 220 225215 220 225

acc ggt cgt ttc gtt atc ggt ggc cca atg ggt gac tgc ggt ctg act 1258acc ggt cgt ttc gtt atc ggt ggc cca atg ggt gac tgc ggt ctg act 1258

Thr Gly Arg Phe Val Ile Gly Gly Pro Met Gly Asp Cys Gly Leu ThrThr Gly Arg Phe Val Ile Gly Gly Pro Met Gly Asp Cys Gly Leu Thr

230 235 240230 235 240

ggt cgt aaa att atc gtt gat acc tac ggc ggc atg gcg cgt cac ggt 1306ggt cgt aaa att atc gtt gat acc tac ggc ggc atg gcg cgt cac ggt 1306

Gly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly Met Ala Arg His GlyGly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly Met Ala Arg His Gly

245 250 255245 250 255

ggc ggt gca ttc tct ggt aaa gat cca tca aaa gtg gac cgt tcc gca 1354ggc ggt gca ttc tct ggt aaa gat cca tca aaa gtg gac cgt tcc gca 1354

Gly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val Asp Arg Ser AlaGly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val Asp Arg Ser Ala

260 265 270 275260 265 270 275

gcc tac gca gca cgt tat gtc gcg aaa aac atc gtt gct gct ggc ctg 1402gcc tac gca gca cgt tat gtc gcg aaa aac atc gtt gct gct ggc ctg 1402

Ala Tyr Ala Ala Arg Tyr Val Ala Lys Asn Ile Val Ala Ala Gly LeuAla Tyr Ala Ala Arg Tyr Val Ala Lys Asn Ile Val Ala Ala Gly Leu

280 285 290280 285 290

gcc gat cgt tgt gaa att cag gtt tcc tac gca atc ggc gtg gct gaa 1450gcc gat cgt tgt gaa att cag gtt tcc tac gca atc ggc gtg gct gaa 1450

Ala Asp Arg Cys Glu Ile Gln Val Ser Tyr Ala Ile Gly Val Ala GluAla Asp Arg Cys Glu Ile Gln Val Ser Tyr Ala Ile Gly Val Ala Glu

295 300 305295 300 305

ccg acc tcc atc atg gta gaa act ttc ggt act gag aaa gtg cct tct 1498ccg acc tcc atc atg gta gaa act ttc ggt act gag aaa gtg cct tct 1498

Pro Thr Ser Ile Met Val Glu Thr Phe Gly Thr Glu Lys Val Pro SerPro Thr Ser Ile Met Val Glu Thr Phe Gly Thr Glu Lys Val Pro Ser

310 315 320310 315 320

gaa caa ctg acc ctg ctg gta cgt gag ttc ttc gac ctg cgc cca tac 1546gaa caa ctg acc ctg ctg gta cgt gag ttc ttc gac ctg cgc cca tac 1546

Glu Gln Leu Thr Leu Leu Val Arg Glu Phe Phe Asp Leu Arg Pro TyrGlu Gln Leu Thr Leu Leu Val Arg Glu Phe Phe Asp Leu Arg Pro Tyr

325 330 335325 330 335

ggt ctg att cag atg ctg gat ctg ctg cac ccg atc tac aaa gaa acc 1594ggt ctg att cag atg ctg gat ctg ctg cac ccg atc tac aaa gaa acc 1594

Gly Leu Ile Gln Met Leu Asp Leu Leu His Pro Ile Tyr Lys Glu ThrGly Leu Ile Gln Met Leu Asp Leu Leu His Pro Ile Tyr Lys Glu Thr

340 345 350 355340 345 350 355

gca gca tac ggt cac ttt ggt cgt gaa cat ttc ccg tgg gaa aaa acc 1642gca gca tac ggt cac ttt ggt cgt gaa cat ttc ccg tgg gaa aaa acc 1642

Ala Ala Tyr Gly His Phe Gly Arg Glu His Phe Pro Trp Glu Lys ThrAla Ala Tyr Gly His Phe Gly Arg Glu His Phe Pro Trp Glu Lys Thr

360 365 370360 365 370

gac aaa gcg cag ctg ctg cgc gat gct gcc ggt ctg aag taa 1684gac aaa gcg cag ctg ctg cgc gat gct gcc ggt ctg aag taa 1684

Asp Lys Ala Gln Leu Leu Arg Asp Ala Ala Gly Leu LysAsp Lys Ala Gln Leu Leu Arg Asp Ala Ala Gly Leu Lys

375 380 385375 380 385

tcggtaccgc ttgatatcga attcctgcag cccgggggat cccatggtac gcgtgctaga 1744tcggtaccgc ttgatatcga attcctgcag cccgggggat cccatggtac gcgtgctaga 1744

ggcatcaaat aaaacgaaag gctcagtcga aagactgggc ctttcgtttt atctgttgtt 1804ggcatcaaat aaaacgaaag gctcagtcga aagactgggc ctttcgtttt atctgttgtt 1804

tgtcggtgaa cgctctcctg agtaggacaa atccgccgcc ctagacctag gggatatatt 1864tgtcggtgaa cgctctcctg agtaggacaa atccgccgcc ctagacctag gggatatatt 1864

ccgcttcctc gctcactgac tcgctacgct cggtcgttcg actgcggcga gcggaaatgg 1924ccgcttcctc gctcactgac tcgctacgct cggtcgttcg actgcggcga gcggaaatgg 1924

cttacgaacg gggcggagat ttcctggaag atgccaggaa gatacttaac agggaagtga 1984cttacgaacg gggcggagat ttcctggaag atgccaggaa gatacttaac agggaagtga 1984

gagggccgcg gcaaagccgt ttttccatag gctccgcccc cctgacaagc atcacgaaat 2044gagggccgcg gcaaagccgt ttttccatag gctccgcccc cctgacaagc atcacgaaat 2044

ctgacgctca aatcagtggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc 2104ctgacgctca aatcagtggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc 2104

ccctggcggc tccctcgtgc gctctcctgt tcctgccttt cggtttaccg gtgtcattcc 2164ccctggcggc tccctcgtgc gctctcctgt tcctgccttt cggtttaccg gtgtcattcc 2164

gctgttatgg ccgcgtttgt ctcattccac gcctgacact cagttccggg taggcagttc 2224gctgttatgg ccgcgtttgt ctcattccac gcctgacact cagttccggg taggcagttc 2224

gctccaagct ggactgtatg cacgaacccc ccgttcagtc cgaccgctgc gccttatccg 2284gctccaagct ggactgtatg cacgaacccc ccgttcagtc cgaccgctgc gccttatccg 2284

gtaactatcg tcttgagtcc aacccggaaa gacatgcaaa agcaccactg gcagcagcca 2344gtaactatcg tcttgagtcc aacccggaaa gacatgcaaa agcaccactg gcagcagcca 2344

ctggtaattg atttagagga gttagtcttg aagtcatgcg ccggttaagg ctaaactgaa 2404ctggtaattg atttagagga gttagtcttg aagtcatgcg ccggttaagg ctaaactgaa 2404

aggacaagtt ttggtgactg cgctcctcca agccagttac ctcggttcaa agagttggta 2464aggacaagtt ttggtgactg cgctcctcca agccagttac ctcggttcaa agagttggta 2464

gctcagagaa ccttcgaaaa accgccctgc aaggcggttt tttcgttttc agagcaagag 2524gctcagagaa ccttcgaaaa accgccctgc aaggcggttt tttcgttttc agagcaagag 2524

attacgcgca gaccaaaacg atctcaagaa gatcatctta ttaatcagat aaaatatttc 2584attacgcgca gaccaaaacg atctcaagaa gatcatctta ttaatcagat aaaatatttc 2584

tagatttcag tgcaatttat ctcttcaaat gtagcacctg aagtcagccc catacgatat 2644tagatttcag tgcaatttat ctcttcaaat gtagcacctg aagtcagccc catacgatat 2644

aagttgttac tagtgcttgg attctcacca ataaaaaacg cccggcggca accgagcgtt 2704aagttgttac tagtgcttgg attctcacca ataaaaaacg cccggcggca accgagcgtt 2704

ctgaacaaat ccagatggag ttctgaggtc attactggat ctatcaacag gagtccaagc 2764ctgaacaaat ccagatggag ttctgaggtc attactggat ctatcaacag gagtccaagc 2764

gagctctcga accccagagt cccgctcaga agaactcgtc aagaaggcga tagaaggcga 2824gagctctcga accccagagt cccgctcaga agaactcgtc aagaaggcga tagaaggcga 2824

tgcgctgcga atcgggagcg gcgataccgt aaagcacgag gaagcggtca gcccattcgc 2884tgcgctgcga atcgggagcg gcgataccgt aaagcacgag gaagcggtca gcccattcgc 2884

cgccaagctc ttcagcaata tcacgggtag ccaacgctat gtcctgatag cggtccgcca 2944cgccaagctc ttcagcaata tcacgggtag ccaacgctat gtcctgatag cggtccgcca 2944

cacccagccg gccacagtcg atgaatccag aaaagcggcc attttccacc atgatattcg 3004cacccagccg gccacagtcg atgaatccag aaaagcggcc attttccacc atgatattcg 3004

gcaagcaggc atcgccatgg gtcacgacga gatcctcgcc gtcgggcatg cgcgccttga 3064gcaagcaggc atcgccatgg gtcacgacga gatcctcgcc gtcgggcatg cgcgccttga 3064

gcctggcgaa cagttcggct ggcgcgagcc cctgatgctc ttcgtccaga tcatcctgat 3124gcctggcgaa cagttcggct ggcgcgagcc cctgatgctc ttcgtccaga tcatcctgat 3124

cgacaagacc ggcttccatc cgagtacgtg ctcgctcgat gcgatgtttc gcttggtggt 3184cgacaagacc ggcttccatc cgagtacgtg ctcgctcgat gcgatgtttc gcttggtggt 3184

cgaatgggca ggtagccgga tcaagcgtat gcagccgccg cattgcatca gccatgatgg 3244cgaatgggca ggtagccgga tcaagcgtat gcagccgccg cattgcatca gccatgatgg 3244

atactttctc ggcaggagca aggtgagatg acaggagatc ctgccccggc acttcgccca 3304atactttctc ggcaggagca aggtgagatg acaggagatc ctgccccggc acttcgccca 3304

atagcagcca gtcccttccc gcttcagtga caacgtcgag cacagctgcg caaggaacgc 3364atagcagcca gtcccttccc gcttcagtga caacgtcgag cacagctgcg caaggaacgc 3364

ccgtcgtggc cagccacgat agccgcgctg cctcgtcctg cagttcattc agggcaccgg 3424ccgtcgtggc cagccacgat agccgcgctg cctcgtcctg cagttcattc agggcaccgg 3424

acaggtcggt cttgacaaaa agaaccgggc gcccctgcgc tgacagccgg aacacggcgg 3484acaggtcggt cttgacaaaa agaaccgggc gcccctgcgc tgacagccgg aacacggcgg 3484

catcagagca gccgattgtc tgttgtgccc agtcatagcc gaatagcctc tccacccaag 3544catcagagca gccgattgtc tgttgtgccc agtcatagcc gaatagcctc tccacccaag 3544

cggccggaga acctgcgtgc aatccatctt gttcaatcat gcgaaacgat cctcatcctg 3604cggccggaga acctgcgtgc aatccatctt gttcaatcat gcgaaacgat cctcatcctg 3604

tctcttgatc agatcttgat cccctgcgcc atcagatcct tggcggcaag aaagccatcc 3664tctcttgatc agatcttgat cccctgcgcc atcagatcct tggcggcaag aaagccatcc 3664

agtttacttt gcagggcttc ccaaccttac cagagggcgc cccagctggc aattcc 3720agtttacttt gcagggcttc ccaaccttac cagagggcgc cccagctggc aattcc 3720

＜210＞ 10<210> 10

＜211＞ 384<211> 384

＜212＞ PRT<212> PRT

＜213＞ Escherichia coli<213> Escherichia coli

＜400＞ 10<400> 10

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

275 280 285275 280 285

290 295 300290 295 300

305 310 315 320305 310 315 320

325 330 335325 330 335

340 345 350340 345 350

355 360 365355 360 365

370 375 380370 375 380

＜210＞ 11<210> 11

＜211＞ 3794<211> 3794

＜212＞ DNA<212> DNA

＜213＞ Escherichia coli<213> Escherichia coli

＜220＞<220>

＜221＞ CDS<221> CDS

＜222＞ (601)..(1806)<222> (601) .. (1806)

＜400＞ 11<400> 11

ggataacaat ttcacacacg ctagcggtac cgggcccccc ctcgaggtcg acggtatcga 540ggataacaat ttcacacacg ctagcggtac cgggcccccc ctcgaggtcg acggtatcga 540

taagcttgat atcgaattcc tgcagcccgg gggatcccat ggtacgcgtc gaggagtacc 600taagcttgat atcgaattcc tgcagcccgg gggatcccat ggtacgcgtc gaggagtacc 600

atg aac gtt ttt aat ccc gcg cag ttt cgc gcc cag ttt ccc gca cta 648atg aac gtt ttt aat ccc gcg cag ttt cgc gcc cag ttt ccc gca cta 648

1 5 10 151 5 10 15

cag gat gcg ggc gtc tat ctc gac agc gcc gcg acc gcg ctt aaa cct 696cag gat gcg ggc gtc tat ctc gac agc gcc gcg acc gcg ctt aaa cct 696

20 25 3020 25 30

gaa gcc gtg gtt gaa gcc acc caa cag ttt tac agt ctg agc gcc gga 744gaa gcc gtg gtt gaa gcc acc caa cag ttt tac agt ctg agc gcc gga 744

35 40 4535 40 45

aac gtc cat cgc agc cag ttt gcc gaa gcc caa cgc ctg acc gcg cgt 792aac gtc cat cgc agc cag ttt gcc gaa gcc caa cgc ctg acc gcg cgt 792

50 55 6050 55 60

tat gaa gct gca cga gag aaa gtg gcg caa tta ctg aat gca ccg gat 840tat gaa gct gca cga gag aaa gtg gcg caa tta ctg aat gca ccg gat 840

65 70 75 8065 70 75 80

gat aaa act atc gtc tgg acg cgc ggc acc act gaa tcc atc aac atg 888gat aaa act atc gtc tgg acg cgc ggc acc act gaa tcc atc aac atg 888

85 90 9585 90 95

gtg gca caa tgc tat gcg cgt ccg cgt ctg caa ccg ggc gat gag att 936gtg gca caa tgc tat gcg cgt ccg cgt ctg caa ccg ggc gat gag att 936

100 105 110100 105 110

att gtc agc gtg gca gaa cac cac gcc aac ctc gtc ccc tgg ctg atg 984att gtc agc gtg gca gaa cac cac gcc aac ctc gtc ccc tgg ctg atg 984

115 120 125115 120 125

gtc gcc caa caa act gga gcc aaa gtg gtg aaa ttg ccg ctt aat gcg 1032gtc gcc caa caa act gga gcc aaa gtg gtg aaa ttg ccg ctt aat gcg 1032

130 135 140130 135 140

cag cga ctg ccg gat gtc gat ttg ttg cca gaa ctg att act ccc cgt 1080cag cga ctg ccg gat gtc gat ttg ttg cca gaa ctg att act ccc cgt 1080

145 150 155 160145 150 155 160

agt cgg att ctg gcg ttg ggt cag atg tcg aac gtt act ggc ggt tgc 1128agt cgg att ctg gcg ttg ggt cag atg tcg aac gtt act ggc ggt tgc 1128

165 170 175165 170 175

ccg gat ctg gcg cga gcg att acc ttt gct cat tca gcc ggg atg gtg 1176ccg gat ctg gcg cga gcg att acc ttt gct cat tca gcc ggg atg gtg 1176

180 185 190180 185 190

gtg atg gtt gat ggt gct cag ggg gca gtg cat ttc ccc gcg gat gtt 1224gtg atg gtt gat ggt gct cag ggg gca gtg cat ttc ccc gcg gat gtt 1224

195 200 205195 200 205

cag caa ctg gat att gat ttc tat gct ttt tca ggt cac aaa ctg tat 1272cag caa ctg gat att gat ttc tat gct ttt tca ggt cac aaa ctg tat 1272

210 215 220210 215 220

ggc ccg aca ggt atc ggc gtg ctg tat ggt aaa tca gaa ctg ctg gag 1320ggc ccg aca ggt atc ggc gtg ctg tat ggt aaa tca gaa ctg ctg gag 1320

225 230 235 240225 230 235 240

gcg atg tcg ccc tgg ctg ggc ggc ggc aaa atg gtt cac gaa gtg agt 1368gcg atg tcg ccc tgg ctg ggc ggc ggc aaa atg gtt cac gaa gtg agt 1368

245 250 255245 250 255

ttt gac ggc ttc acg act caa tct gcg ccg tgg aaa ctg gaa gct gga 1416ttt gac ggc ttc acg act caa tct gcg ccg tgg aaa ctg gaa gct gga 1416

260 265 270260 265 270

acg cca aat gtc gct ggt gtc ata gga tta agc gcg gcg ctg gaa tgg 1464acg cca aat gtc gct ggt gtc ata gga tta agc gcg gcg ctg gaa tgg 1464

275 280 285275 280 285

ctg gca gat tac gat atc aac cag gcc gaa agc tgg agc cgt agc tta 1512ctg gca gat tac gat atc aac cag gcc gaa agc tgg agc cgt agc tta 1512

290 295 300290 295 300

gca acg ctg gcg gaa gat gcg ctg gcg aaa cgt ccc ggc ttt cgt tca 1560gca acg ctg gcg gaa gat gcg ctg gcg aaa cgt ccc ggc ttt cgt tca 1560

305 310 315 320305 310 315 320

ttc cgc tgc cag gat tcc agc ctg ctg gcc ttt gat ttt gct ggc gtt 1608ttc cgc tgc cag gat tcc agc ctg ctg gcc ttt gat ttt gct ggc gtt 1608

325 330 335325 330 335

cat cat agc gat atg gtg acg ctg ctg gcg gag tac ggt att gcc ctg 1656cat cat agc gat atg gtg acg ctg ctg gcg gag tac ggt att gcc ctg 1656

340 345 350340 345 350

cgg gcc ggg cag cat tgc gct cag ccg cta ctg gca gaa tta ggc gta 1704cgg gcc ggg cag cat tgc gct cag ccg cta ctg gca gaa tta ggc gta 1704

355 360 365355 360 365

acc ggc aca ctg cgc gcc tct ttt gcg cca tat aat aca aag agt gat 1752acc ggc aca ctg cgc gcc tct ttt gcg cca tat aat aca aag agt gat 1752

370 375 380370 375 380

gtg gat gcg ctg gtg aat gcc gtt gac cgc gcg ctg gaa tta ttg gtg 1800gtg gat gcg ctg gtg aat gcc gtt gac cgc gcg ctg gaa tta ttg gtg 1800

385 390 395 400385 390 395 400

gat taa acgcgtgcta gaggcatcaa ataaaacgaa aggctcagtc gaaagactgg 1856gat taa acgcgtgcta gaggcatcaa ataaaacgaa aggctcagtc gaaagactgg 1856

AspAsp

gcctttcgtt ttatctgttg tttgtcggtg aacgctctcc tgagtaggac aaatccgccg 1916gcctttcgtt ttatctgttg tttgtcggtg aacgctctcc tgagtaggac aaatccgccg 1916

ccctagacct aggggatata ttccgcttcc tcgctcactg actcgctacg ctcggtcgtt 1976ccctagacct aggggatata ttccgcttcc tcgctcactg actcgctacg ctcggtcgtt 1976

cgactgcggc gagcggaaat ggcttacgaa cggggcggag atttcctgga agatgccagg 2036cgactgcggc gagcggaaat ggcttacgaa cggggcggag atttcctgga agatgccagg 2036

aagatactta acagggaagt gagagggccg cggcaaagcc gtttttccat aggctccgcc 2096aagatactta acagggaagt gagagggccg cggcaaagcc gtttttccat aggctccgcc 2096

cccctgacaa gcatcacgaa atctgacgct caaatcagtg gtggcgaaac ccgacaggac 2156cccctgacaa gcatcacgaa atctgacgct caaatcagtg gtggcgaaac ccgacaggac 2156

tataaagata ccaggcgttt ccccctggcg gctccctcgt gcgctctcct gttcctgcct 2216tataaagata ccaggcgttt ccccctggcg gctccctcgt gcgctctcct gttcctgcct 2216

ttcggtttac cggtgtcatt ccgctgttat ggccgcgttt gtctcattcc acgcctgaca 2276ttcggtttac cggtgtcatt ccgctgttat ggccgcgttt gtctcattcc acgcctgaca 2276

ctcagttccg ggtaggcagt tcgctccaag ctggactgta tgcacgaacc ccccgttcag 2336ctcagttccg ggtaggcagt tcgctccaag ctggactgta tgcacgaacc ccccgttcag 2336

tccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccgga aagacatgca 2396tccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccgga aagacatgca 2396

aaagcaccac tggcagcagc cactggtaat tgatttagag gagttagtct tgaagtcatg 2456aaagcaccac tggcagcagc cactggtaat tgatttagag gagttagtct tgaagtcatg 2456

cgccggttaa ggctaaactg aaaggacaag ttttggtgac tgcgctcctc caagccagtt 2516cgccggttaa ggctaaactg aaaggacaag ttttggtgac tgcgctcctc caagccagtt 2516

acctcggttc aaagagttgg tagctcagag aaccttcgaa aaaccgccct gcaaggcggt 2576acctcggttc aaagagttgg tagctcagag aaccttcgaa aaaccgccct gcaaggcggt 2576

tttttcgttt tcagagcaag agattacgcg cagaccaaaa cgatctcaag aagatcatct 2636tttttcgttt tcagagcaag agattacgcg cagaccaaaa cgatctcaag aagatcatct 2636

tattaatcag ataaaatatt tctagatttc agtgcaattt atctcttcaa atgtagcacc 2696tattaatcag ataaaatatt tctagatttc agtgcaattt atctcttcaa atgtagcacc 2696

tgaagtcagc cccatacgat ataagttgtt actagtgctt ggattctcac caataaaaaa 2756tgaagtcagc cccatacgat ataagttgtt actagtgctt ggattctcac caataaaaaa 2756

cgcccggcgg caaccgagcg ttctgaacaa atccagatgg agttctgagg tcattactgg 2816cgcccggcgg caaccgagcg ttctgaacaa atccagatgg agttctgagg tcattactgg 2816

atctatcaac aggagtccaa gcgagctctc gaaccccaga gtcccgctca gaagaactcg 2876atctatcaac aggagtccaa gcgagctctc gaaccccaga gtcccgctca gaagaactcg 2876

tcaagaaggc gatagaaggc gatgcgctgc gaatcgggag cggcgatacc gtaaagcacg 2936tcaagaaggc gatagaaggc gatgcgctgc gaatcgggag cggcgatacc gtaaagcacg 2936

aggaagcggt cagcccattc gccgccaagc tcttcagcaa tatcacgggt agccaacgct 2996aggaagcggt cagcccattc gccgccaagc tcttcagcaa tatcacgggt agccaacgct 2996

atgtcctgat agcggtccgc cacacccagc cggccacagt cgatgaatcc agaaaagcgg 3056atgtcctgat agcggtccgc cacacccagc cggccacagt cgatgaatcc agaaaagcgg 3056

ccattttcca ccatgatatt cggcaagcag gcatcgccat gggtcacgac gagatcctcg 3116ccattttcca ccatgatatt cggcaagcag gcatcgccat gggtcacgac gagatcctcg 3116

ccgtcgggca tgcgcgcctt gagcctggcg aacagttcgg ctggcgcgag cccctgatgc 3176ccgtcgggca tgcgcgcctt gagcctggcg aacagttcgg ctggcgcgag cccctgatgc 3176

tcttcgtcca gatcatcctg atcgacaaga ccggcttcca tccgagtacg tgctcgctcg 3236tcttcgtcca gatcatcctg atcgacaaga ccggcttcca tccgagtacg tgctcgctcg 3236

atgcgatgtt tcgcttggtg gtcgaatggg caggtagccg gatcaagcgt atgcagccgc 3296atgcgatgtt tcgcttggtg gtcgaatggg caggtagccg gatcaagcgt atgcagccgc 3296

cgcattgcat cagccatgat ggatactttc tcggcaggag caaggtgaga tgacaggaga 3356cgcattgcat cagccatgat ggatactttc tcggcaggag caaggtgaga tgacaggaga 3356

tcctgccccg gcacttcgcc caatagcagc cagtcccttc ccgcttcagt gacaacgtcg 3416tcctgccccg gcacttcgcc caatagcagc cagtcccttc ccgcttcagt gacaacgtcg 3416

agcacagctg cgcaaggaac gcccgtcgtg gccagccacg atagccgcgc tgcctcgtcc 3476agcacagctg cgcaaggaac gcccgtcgtg gccagccacg atagccgcgc tgcctcgtcc 3476

tgcagttcat tcagggcacc ggacaggtcg gtcttgacaa aaagaaccgg gcgcccctgc 3536tgcagttcat tcagggcacc ggacaggtcg gtcttgacaa aaagaaccgg gcgcccctgc 3536

gctgacagcc ggaacacggc ggcatcagag cagccgattg tctgttgtgc ccagtcatag 3596gctgacagcc ggaacacggc ggcatcagag cagccgattg tctgttgtgc ccagtcatag 3596

ccgaatagcc tctccaccca agcggccgga gaacctgcgt gcaatccatc ttgttcaatc 3656ccgaatagcc tctccaccca agcggccgga gaacctgcgt gcaatccatc ttgttcaatc 3656

atgcgaaacg atcctcatcc tgtctcttga tcagatcttg atcccctgcg ccatcagatc 3716atgcgaaacg atcctcatcc tgtctcttga tcagatcttg atcccctgcg ccatcagatc 3716

cttggcggca agaaagccat ccagtttact ttgcagggct tcccaacctt accagagggc 3776cttggcggca agaaagccat ccagtttact ttgcagggct tcccaacctt accagagggc 3776

gccccagctg gcaattcc 3794gccccagctg gcaattcc 3794

＜210＞ 12<210> 12

＜211＞ 401<211> 401

＜212＞ PRT<212> PRT

＜213＞ Escherichia coli<213> Escherichia coli

＜400＞ 12<400> 12

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

Gly Pro Thr GlyIle Gly Val Leu Tyr Gly Lys Ser Glu Leu Leu GluGly Pro Thr GlyIle Gly Val Leu Tyr Gly Lys Ser Glu Leu Leu Glu

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

275 280 285275 280 285

290 295 300290 295 300

305 310 315 320305 310 315 320

325 330 335325 330 335

340 345 350340 345 350

355 360 365355 360 365

370 375 380370 375 380

385 390 395 400385 390 395 400

AspAsp

＜210＞ 13<210> 13

＜211＞ 4975<211> 4975

＜212＞ DNA<212> DNA

＜213＞ Escherichia coli<213> Escherichia coli

＜220＞<220>

＜221＞ CDS<221> CDS

＜222＞ (530)..(1684)<222> (530) .. (1684)

＜220＞<220>

＜221＞ CDS<221> CDS

＜222＞ (1782)..(2987)<222> (1782) .. (2987)

＜400＞ 13<400> 13

Met Ala LysMet ala lys

1One

5 10 155 10 15

20 25 30 3520 25 30 35

40 45 5040 45 50

55 60 6555 60 65

70 75 8070 75 80

85 90 9585 90 95

100 105 110 115100 105 110 115

120 125 130120 125 130

135 140 145135 140 145

150 155 160150 155 160

165 170 175165 170 175

180 185 190 195180 185 190 195

200 205 210200 205 210

215 220 225215 220 225

230 235 240230 235 240

245 250 255245 250 255

260 265 270 275260 265 270 275

280 285 290280 285 290

295 300 305295 300 305

310 315 320310 315 320

325 330 335325 330 335

340 345 350 355340 345 350 355

360 365 370360 365 370

375 380 385375 380 385

tcggtaccgg gccccccctc gaggtcgacg gtatcgataa gcttgatatc gaattcctgc 1744tcggtaccgg gccccccctc gaggtcgacg gtatcgataa gcttgatatc gaattcctgc 1744

agcccggggg atcccatggt acgcgtcgag gagtacc atg aac gtt ttt aat ccc 1799agcccggggg atcccatggt acgcgtcgag gagtacc atg aac gtt ttt aat ccc 1799

Met Asn Val Phe Asn ProMet Asn Val Phe Asn Pro

390390

gcg cag ttt cgc gcc cag ttt ccc gca cta cag gat gcg ggc gtc tat 1847gcg cag ttt cgc gcc cag ttt ccc gca cta cag gat gcg ggc gtc tat 1847

Ala Gln Phe Arg Ala Gln Phe Pro Ala Leu Gln Asp Ala Gly Val TyrAla Gln Phe Arg Ala Gln Phe Pro Ala Leu Gln Asp Ala Gly Val Tyr

395 400 405395 400 405

ctc gac agc gcc gcg acc gcg ctt aaa cct gaa gcc gtg gtt gaa gcc 1895ctc gac agc gcc gcg acc gcg ctt aaa cct gaa gcc gtg gtt gaa gcc 1895

Leu Asp Ser Ala Ala Thr Ala Leu Lys Pro Glu Ala Val Val Glu AlaLeu Asp Ser Ala Ala Thr Ala Leu Lys Pro Glu Ala Val Val Glu Ala

410 415 420410 415 420

acc caa cag ttt tac agt ctg agc gcc gga aac gtc cat cgc agc cag 1943acc caa cag ttt tac agt ctg agc gcc gga aac gtc cat cgc agc cag 1943

Thr Gln Gln Phe Tyr Ser Leu Ser Ala Gly Asn Val His Arg Ser GlnThr Gln Gln Phe Tyr Ser Leu Ser Ala Gly Asn Val His Arg Ser Gln

425 430 435425 430 435

ttt gcc gaa gcc caa cgc ctg acc gcg cgt tat gaa gct gca cga gag 1991ttt gcc gaa gcc caa cgc ctg acc gcg cgt tat gaa gct gca cga gag 1991

Phe Ala Glu Ala Gln Arg Leu Thr Ala Arg Tyr Glu Ala Ala Arg GluPhe Ala Glu Ala Gln Arg Leu Thr Ala Arg Tyr Glu Ala Ala Arg Glu

440 445 450 455440 445 450 455

aaa gtg gcg caa tta ctg aat gca ccg gat gat aaa act atc gtc tgg 2039aaa gtg gcg caa tta ctg aat gca ccg gat gat aaa act atc gtc tgg 2039

Lys Val Ala Gln Leu Leu Asn Ala Pro Asp Asp Lys Thr Ile Val TrpLys Val Ala Gln Leu Leu Asn Ala Pro Asp Asp Lys Thr Ile Val Trp

460 465 470460 465 470

acg cgc ggc acc act gaa tcc atc aac atg gtg gca caa tgc tat gcg 2087acg cgc ggc acc act gaa tcc atc aac atg gtg gca caa tgc tat gcg 2087

Thr Arg Gly Thr Thr Glu Ser Ile Asn Met Val Ala Gln Cys Tyr AlaThr Arg Gly Thr Thr Glu Ser Ile Asn Met Val Ala Gln Cys Tyr Ala

475 480 485475 480 485

cgt ccg cgt ctg caa ccg ggc gat gag att att gtc agc gtg gca gaa 2135cgt ccg cgt ctg caa ccg ggc gat gag att att gtc agc gtg gca gaa 2135

Arg Pro Arg Leu Gln Pro Gly Asp Glu Ile Ile Val Ser Val Ala GluArg Pro Arg Leu Gln Pro Gly Asp Glu Ile Ile Val Ser Val Ala Glu

490 495 500490 495 500

cac cac gcc aac ctc gtc ccc tgg ctg atg gtc gcc caa caa act gga 2183cac cac gcc aac ctc gtc ccc tgg ctg atg gtc gcc caa caa act gga 2183

His His Ala Asn Leu Val Pro Trp Leu Met Val Ala Gln Gln Thr GlyHis His Ala Asn Leu Val Pro Trp Leu Met Val Ala Gln Gln Thr Gly

505 510 515505 510 515

gcc aaa gtg gtg aaa ttg ccg ctt aat gcg cag cga ctg ccg gat gtc 2231gcc aaa gtg gtg aaa ttg ccg ctt aat gcg cag cga ctg ccg gat gtc 2231

Ala Lys Val Val Lys Leu Pro Leu Asn Ala Gln Arg Leu Pro Asp ValAla Lys Val Val Lys Leu Pro Leu Asn Ala Gln Arg Leu Pro Asp Val

520 525 530 535520 525 530 535

gat ttg ttg cca gaa ctg att act ccc cgt agt cgg att ctg gcg ttg 2279gat ttg ttg cca gaa ctg att act ccc cgt agt cgg att ctg gcg ttg 2279

Asp Leu Leu Pro Glu Leu Ile Thr Pro Arg Ser Arg Ile Leu Ala LeuAsp Leu Leu Pro Glu Leu Ile Thr Pro Arg Ser Arg Ile Leu Ala Leu

540 545 550540 545 550

ggt cag atg tcg aac gtt act ggc ggt tgc ccg gat ctg gcg cga gcg 2327ggt cag atg tcg aac gtt act ggc ggt tgc ccg gat ctg gcg cga gcg 2327

Gly Gln Met Ser Asn Val Thr Gly Gly Cys Pro Asp Leu Ala Arg AlaGly Gln Met Ser Asn Val Thr Gly Gly Cys Pro Asp Leu Ala Arg Ala

555 560 565555 560 565

att acc ttt gct cat tca gcc ggg atg gtg gtg atg gtt gat ggt gct 2375att acc ttt gct cat tca gcc ggg atg gtg gtg atg gtt gat ggt gct 2375

Ile Thr Phe Ala His Ser Ala Gly Met Val Val Met Val Asp Gly AlaIle Thr Phe Ala His Ser Ala Gly Met Val Val Met Val Asp Gly Ala

570 575 580570 575 580

cag ggg gca gtg cat ttc ccc gcg gat gtt cag caa ctg gat att gat 2423cag ggg gca gtg cat ttc ccc gcg gat gtt cag caa ctg gat att gat 2423

Gln Gly Ala Val His Phe Pro Ala Asp Val Gln Gln Leu Asp Ile AspGln Gly Ala Val His Phe Pro Ala Asp Val Gln Gln Leu Asp Ile Asp

585 590 595585 590 595

ttc tat gct ttt tca ggt cac aaa ctg tat ggc ccg aca ggt atc ggc 2471ttc tat gct ttt tca ggt cac aaa ctg tat ggc ccg aca ggt atc ggc 2471

Phe Tyr Ala Phe Ser Gly His Lys Leu Tyr Gly Pro Thr Gly Ile GlyPhe Tyr Ala Phe Ser Gly His Lys Leu Tyr Gly Pro Thr Gly Ile Gly

600 605 610 615600 605 610 615

gtg ctg tat ggt aaa tca gaa ctg ctg gag gcg atg tcg ccc tgg ctg 2519gtg ctg tat ggt aaa tca gaa ctg ctg gag gcg atg tcg ccc tgg ctg 2519

Val Leu Tyr Gly Lys Ser Glu Leu Leu Glu Ala Met Ser Pro Trp LeuVal Leu Tyr Gly Lys Ser Glu Leu Leu Glu Ala Met Ser Pro Trp Leu

620 625 630620 625 630

ggc ggc ggc aaa atg gtt cac gaa gtg agt ttt gac ggc ttc acg act 2567ggc ggc ggc aaa atg gtt cac gaa gtg agt ttt gac ggc ttc acg act 2567

Gly Gly Gly Lys Met Val His Glu Val Ser Phe Asp Gly Phe Thr ThrGly Gly Gly Lys Met Val His Glu Val Ser Phe Asp Gly Phe Thr Thr

635 640 645635 640 645

caa tct gcg ccg tgg aaa ctg gaa gct gga acg cca aat gtc gct ggt 2615caa tct gcg ccg tgg aaa ctg gaa gct gga acg cca aat gtc gct ggt 2615

Gln Ser Ala Pro Trp Lys Leu Glu Ala Gly Thr Pro Asn Val Ala GlyGln Ser Ala Pro Trp Lys Leu Glu Ala Gly Thr Pro Asn Val Ala Gly

650 655 660650 655 660

gtc ata gga tta agc gcg gcg ctg gaa tgg ctg gca gat tac gat atc 2663gtc ata gga tta agc gcg gcg ctg gaa tgg ctg gca gat tac gat atc 2663

Val Ile Gly Leu Ser Ala Ala Leu Glu Trp Leu Ala Asp Tyr Asp IleVal Ile Gly Leu Ser Ala Ala Leu Glu Trp Leu Ala Asp Tyr Asp Ile

665 670 675665 670 675

aac cag gcc gaa agc tgg agc cgt agc tta gca acg ctg gcg gaa gat 2711aac cag gcc gaa agc tgg agc cgt agc tta gca acg ctg gcg gaa gat 2711

Asn Gln Ala Glu Ser Trp Ser Arg Ser Leu Ala Thr Leu Ala Glu AspAsn Gln Ala Glu Ser Trp Ser Arg Ser Leu Ala Thr Leu Ala Glu Asp

680 685 690 695680 685 690 695

gcg ctg gcg aaa cgt ccc ggc ttt cgt tca ttc cgc tgc cag gat tcc 2759gcg ctg gcg aaa cgt ccc ggc ttt cgt tca ttc cgc tgc cag gat tcc 2759

Ala Leu Ala Lys Arg Pro Gly Phe Arg Ser Phe Arg Cys Gln Asp SerAla Leu Ala Lys Arg Pro Gly Phe Arg Ser Phe Arg Cys Gln Asp Ser

700 705 710700 705 710

agc ctg ctg gcc ttt gat ttt gct ggc gtt cat cat agc gat atg gtg 2807agc ctg ctg gcc ttt gat ttt gct ggc gtt cat cat agc gat atg gtg 2807

Ser Leu Leu Ala Phe Asp Phe Ala Gly Val His His Ser Asp Met ValSer Leu Leu Ala Phe Asp Phe Ala Gly Val His His Ser Asp Met Val

715 720 725715 720 725

acg ctg ctg gcg gag tac ggt att gcc ctg cgg gcc ggg cag cat tgc 2855acg ctg ctg gcg gag tac ggt att gcc ctg cgg gcc ggg cag cat tgc 2855

Thr Leu Leu Ala Glu Tyr Gly Ile Ala Leu Arg Ala Gly Gln His CysThr Leu Leu Ala Glu Tyr Gly Ile Ala Leu Arg Ala Gly Gln His Cys

730 735 740730 735 740

gct cag ccg cta ctg gca gaa tta ggc gta acc ggc aca ctg cgc gcc 2903gct cag ccg cta ctg gca gaa tta ggc gta acc ggc aca ctg cgc gcc 2903

Ala Gln Pro Leu Leu Ala Glu Leu Gly Val Thr Gly Thr Leu Arg AlaAla Gln Pro Leu Leu Ala Glu Leu Gly Val Thr Gly Thr Leu Arg Ala

745 750 755745 750 755

tct ttt gcg cca tat aat aca aag agt gat gtg gat gcg ctg gtg aat 2951tct ttt gcg cca tat aat aca aag agt gat gtg gat gcg ctg gtg aat 2951

Ser Phe Ala Pro Tyr Asn Thr Lys Ser Asp Val Asp Ala Leu Val AsnSer Phe Ala Pro Tyr Asn Thr Lys Ser Asp Val Asp Ala Leu Val Asn

760 765 770 775760 765 770 775

gcc gtt gac cgc gcg ctg gaa tta ttg gtg gat taa acgcgtgcta 2997gcc gtt gac cgc gcg ctg gaa tta ttg gtg gat taa acgcgtgcta 2997

Ala Val Asp Arg Ala Leu Glu Leu Leu Val AspAla Val Asp Arg Ala Leu Glu Leu Leu Val Asp

780 785780 785

gaggcatcaa ataaaacgaa aggctcagtc gaaagactgg gcctttcgtt ttatctgttg 3057gaggcatcaa ataaaacgaa aggctcagtc gaaagactgg gcctttcgtt ttatctgttg 3057

tttgtcggtg aacgctctcc tgagtaggac aaatccgccg ccctagacct aggggatata 3117tttgtcggtg aacgctctcc tgagtaggac aaatccgccg ccctagacct aggggatata 3117

ttccgcttcc tcgctcactg actcgctacg ctcggtcgtt cgactgcggc gagcggaaat 3177ttccgcttcc tcgctcactg actcgctacg ctcggtcgtt cgactgcggc gagcggaaat 3177

ggcttacgaa cggggcggag atttcctgga agatgccagg aagatactta acagggaagt 3237ggcttacgaa cggggcggag atttcctgga agatgccagg aagatactta acagggaagt 3237

gagagggccg cggcaaagcc gtttttccat aggctccgcc cccctgacaa gcatcacgaa 3297gagagggccg cggcaaagcc gtttttccat aggctccgcc cccctgacaa gcatcacgaa 3297

atctgacgct caaatcagtg gtggcgaaac ccgacaggac tataaagata ccaggcgttt 3357atctgacgct caaatcagtg gtggcgaaac ccgacaggac tataaagata ccaggcgttt 3357

ccccctggcg gctccctcgt gcgctctcct gttcctgcct ttcggtttac cggtgtcatt 3417ccccctggcg gctccctcgt gcgctctcct gttcctgcct ttcggtttac cggtgtcatt 3417

ccgctgttat ggccgcgttt gtctcattcc acgcctgaca ctcagttccg ggtaggcagt 3477ccgctgttat ggccgcgttt gtctcattcc acgcctgaca ctcagttccg ggtaggcagt 3477

tcgctccaag ctggactgta tgcacgaacc ccccgttcag tccgaccgct gcgccttatc 3537tcgctccaag ctggactgta tgcacgaacc ccccgttcag tccgaccgct gcgccttatc 3537

cggtaactat cgtcttgagt ccaacccgga aagacatgca aaagcaccac tggcagcagc 3597cggtaactat cgtcttgagt ccaacccgga aagacatgca aaagcaccac tggcagcagc 3597

cactggtaat tgatttagag gagttagtct tgaagtcatg cgccggttaa ggctaaactg 3657cactggtaat tgatttagag gagttagtct tgaagtcatg cgccggttaa ggctaaactg 3657

aaaggacaag ttttggtgac tgcgctcctc caagccagtt acctcggttc aaagagttgg 3717aaaggacaag ttttggtgac tgcgctcctc caagccagtt acctcggttc aaagagttgg 3717

tagctcagag aaccttcgaa aaaccgccct gcaaggcggt tttttcgttt tcagagcaag 3777tagctcagag aaccttcgaa aaaccgccct gcaaggcggt tttttcgttt tcagagcaag 3777

agattacgcg cagaccaaaa cgatctcaag aagatcatct tattaatcag ataaaatatt 3837agattacgcg cagaccaaaa cgatctcaag aagatcatct tattaatcag ataaaatatt 3837

tctagatttc agtgcaattt atctcttcaa atgtagcacc tgaagtcagc cccatacgat 3897tctagatttc agtgcaattt atctcttcaa atgtagcacc tgaagtcagc cccatacgat 3897

ataagttgtt actagtgctt ggattctcac caataaaaaa cgcccggcgg caaccgagcg 3957ataagttgtt actagtgctt ggattctcac caataaaaaa cgcccggcgg caaccgagcg 3957

ttctgaacaa atccagatgg agttctgagg tcattactgg atctatcaac aggagtccaa 4017ttctgaacaa atccagatgg agttctgagg tcattactgg atctatcaac aggagtccaa 4017

gcgagctctc gaaccccaga gtcccgctca gaagaactcg tcaagaaggc gatagaaggc 4077gcgagctctc gaaccccaga gtcccgctca gaagaactcg tcaagaaggc gatagaaggc 4077

gatgcgctgc gaatcgggag cggcgatacc gtaaagcacg aggaagcggt cagcccattc 4137gatgcgctgc gaatcgggag cggcgatacc gtaaagcacg aggaagcggt cagcccattc 4137

gccgccaagc tcttcagcaa tatcacgggt agccaacgct atgtcctgat agcggtccgc 4197gccgccaagc tcttcagcaa tatcacgggt agccaacgct atgtcctgat agcggtccgc 4197

cacacccagc cggccacagt cgatgaatcc agaaaagcgg ccattttcca ccatgatatt 4257cacacccagc cggccacagt cgatgaatcc agaaaagcgg ccattttcca ccatgatatt 4257

cggcaagcag gcatcgccat gggtcacgac gagatcctcg ccgtcgggca tgcgcgcctt 4317cggcaagcag gcatcgccat gggtcacgac gagatcctcg ccgtcgggca tgcgcgcctt 4317

gagcctggcg aacagttcgg ctggcgcgag cccctgatgc tcttcgtcca gatcatcctg 4377gagcctggcg aacagttcgg ctggcgcgag cccctgatgc tcttcgtcca gatcatcctg 4377

atcgacaaga ccggcttcca tccgagtacg tgctcgctcg atgcgatgtt tcgcttggtg 4437atcgacaaga ccggcttcca tccgagtacg tgctcgctcg atgcgatgtt tcgcttggtg 4437

gtcgaatggg caggtagccg gatcaagcgt atgcagccgc cgcattgcat cagccatgat 4497gtcgaatggg caggtagccg gatcaagcgt atgcagccgc cgcattgcat cagccatgat 4497

ggatactttc tcggcaggag caaggtgaga tgacaggaga tcctgccccg gcacttcgcc 4557ggatactttc tcggcaggag caaggtgaga tgacaggaga tcctgccccg gcacttcgcc 4557

caatagcagc cagtcccttc ccgcttcagt gacaacgtcg agcacagctg cgcaaggaac 4617caatagcagc cagtcccttc ccgcttcagt gacaacgtcg agcacagctg cgcaaggaac 4617

gcccgtcgtg gccagccacg atagccgcgc tgcctcgtcc tgcagttcat tcagggcacc 4677gcccgtcgtg gccagccacg atagccgcgc tgcctcgtcc tgcagttcat tcagggcacc 4677

ggacaggtcg gtcttgacaa aaagaaccgg gcgcccctgc gctgacagcc ggaacacggc 4737ggacaggtcg gtcttgacaa aaagaaccgg gcgcccctgc gctgacagcc ggaacacggc 4737

ggcatcagag cagccgattg tctgttgtgc ccagtcatag ccgaatagcc tctccaccca 4797ggcatcagag cagccgattg tctgttgtgc ccagtcatag ccgaatagcc tctccaccca 4797

agcggccgga gaacctgcgt gcaatccatc ttgttcaatc atgcgaaacg atcctcatcc 4857agcggccgga gaacctgcgt gcaatccatc ttgttcaatc atgcgaaacg atcctcatcc 4857

tgtctcttga tcagatcttg atcccctgcg ccatcagatc cttggcggca agaaagccat 4917tgtctcttga tcagatcttg atcccctgcg ccatcagatc cttggcggca agaaagccat 4917

ccagtttact ttgcagggct tcccaacctt accagagggc gccccagctg gcaattcc 4975ccagtttact ttgcagggct tcccaacctt accagagggc gccccagctg gcaattcc 4975

＜210＞ 14<210> 14

＜211＞ 384<211> 384

＜212＞ PRT<212> PRT

＜213＞ Escherichia coli<213> Escherichia coli

＜400＞ 14<400> 14

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

275 280 285275 280 285

290 295 300290 295 300

305 310 315 320305 310 315 320

325 330 335325 330 335

340 345 350340 345 350

355 360 365355 360 365

370 375 380370 375 380

＜210＞ 15<210> 15

＜211＞ 401<211> 401

＜212＞ PRT<212> PRT

＜213＞ Escherichia coli<213> Escherichia coli

＜400＞ 15<400> 15

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

275 280 285275 280 285

290 295 300290 295 300

305 310 315 320305 310 315 320

325 330 335325 330 335

340 345 350340 345 350

355 360 365355 360 365

370 375 380370 375 380

385 390 395 400385 390 395 400

AspAsp

Claims

S-adenosylmethionine synthase gene having SEQ ID NO: 1 and at least one additional biotin biosynthetic gene bioS1, bioS2 or bioS3 having SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7, and functional variants, analogs or derivatives thereof, for synthesizing biotin A method for producing biotin, which is expressed in a prokaryotic or eukaryotic host organism, cultured the organism, isolating and removing cells or purifying biotin, and immediately using the synthesized biotin.

The method of claim 1 wherein the variant of the gene having SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7 exhibits 30-100% homology at the amino acid level deduced from the sequences claimed in claim 1, and the synthesis of biotin How to increase

The method according to claim 1 or 2, wherein Escherichia, Citrobacter, Serratia, Klebsiella, Salmonella, Pseudomonas, and comamonas Comamonas, Acinetobacter, Azotobacter, Chromobacterium, Bacillus, Clostridium, Arthrobacter, Corynebacterium ), Brevibacterium, Lactococcus, Lactobacillus, Streptomyces, Streptomyces, Rhizobium, Agrobacterium, Staphylococcus, Staphylococcus, Staphylococcus A method using as an host organism an organism selected from the group of Rhodotorula, Sporobolomyces, Yarrowia, Schizosaccharomyces or Saccharomyces.

4. The method of claim 1, wherein the regulatory-deficient biotin mutant is used as a host organism.

5. The method of claim 1, wherein one or more copies of the genes having SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, and SEQ ID NO: 7 as claimed in claim 1, are alone in a prokaryotic or eukaryotic host organism. Or with one or more copies of one or more additional biotin genes selected from the group bioA, bioB, bioF, bioC, bioD, bioH, bioP, bioW, bioX, bioY or bioR.

The method of any one of claims 1 to 5, wherein one or more copies of the genes having SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, and SEQ ID NO: 7 as claimed in claim 1 are solely expressed in prokaryotic or eukaryotic host organisms. Or in a shared vector or in a separate vector with one or more copies of one or more additional biotin genes selected from the group bioA, bioB, bioF, bioC, bioD, bioH, bioP, bioW, bioX, bioY or bioR.

An S-adenosylmethionine synthetase gene having SEQ ID NO: 1 and one or more additional biotin biosynthetic genes bioS12, bioS2 or bioS3 having SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7, and functional variants, analogs or derivatives thereof, A gene construct that is functionally linked to (or connected to) one or more regulatory signals and whose natural regulation is turned off to increase gene expression and / or protein expression.

The genetic construct of claim 7 which is inserted into a vector suitable for expressing the gene in a prokaryotic or eukaryotic host organism.

The gene construct of claim 7 or 8, wherein the genes having SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, and SEQ ID NO: 7, and functional variants, analogs or derivatives thereof, are present in the gene construct in several copies.

10. The biotin biosynthesis gene of any one of claims 7 to 9, wherein the gene is the S-adenosylmethionine synthase gene as claimed in claim 7, and at least one additional biotin biosynthesis gene having SEQ ID NO: 3 bioS1, bioS2 or bioS3, and functional variants, analogs or derivatives thereof, with one or more copies of one or more additional genes selected from the group bioA, bioB, bioF, bioC, bioD, bioH, bioP, bioW, bioX, bioY or bioR Gene constructs present in a gene construct or vector together.

An organism comprising a construct that is a gene as claimed in claim 7.

Use of the sequence as claimed in claim 1 for producing biotin.

The bioS3 gene having SEQ ID NO: 7 or a functional variant, analog or derivative thereof, or the S-adenosylmethionine synthase gene bioS1, bioS2, bioA, bioB, bioF, bioC, bioD, bioH, bioP, bioW, bioX, bioY or Use for biotin production of a bioS3 gene having SEQ ID NO: 7 or a functional variant, analog or derivative thereof in combination with one or more additional genes selected from the bioR group.

Use of a gene construct as claimed in any one of claims 7 to 10 for producing biotin.