KR20090078113A - A microorganism of escherichia genus having enhanced isoprenoid productivity and method of producing isoprenoid using the same - Google Patents

A microorganism of escherichia genus having enhanced isoprenoid productivity and method of producing isoprenoid using the same

Info

Publication number
KR20090078113A
Authority
KR
South Korea
Prior art keywords
ala
leu
val
gly
ser
Prior art date
Application number
KR1020080003912A
Other languages
Korean (ko)
Other versions
KR100971508B1 (en)
Inventor
김선원
윤상활
이숙희
류희경
장희정
오덕근
Original Assignee
경상대학교산학협력단
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 경상대학교산학협력단
Priority to KR1020080003912A
Publication of KR20090078113A
Application granted
Publication of KR100971508B1

Classifications

    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00 Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09 Recombinant DNA-technology
    • C12N15/63 Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/70 Vectors or expression systems specially adapted for E. coli
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12P FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P5/00 Preparation of hydrocarbons or halogenated hydrocarbons
    • C12P5/007 Preparation of hydrocarbons or halogenated hydrocarbons containing one or more isoprene units, i.e. terpenes
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Y ENZYMES
    • C12Y503/00 Intramolecular oxidoreductases (5.3)
    • C12Y503/03 Intramolecular oxidoreductases (5.3) transposing C=C bonds (5.3.3)
    • C12Y503/03002 Isopentenyl-diphosphate DELTA-isomerase (5.3.3.2)

Landscapes

  • Genetics & Genomics (AREA)
  • Organic Chemistry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Plant Pathology (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

An Escherichia sp. strain with improved isoprenoid productivity is provided, which increases the production efficiency of isoprenoids such as β-carotene and retinal through the introduction of genes encoding enzymes of the mevalonate pathway. An isoprenoid-producing Escherichia sp. microorganism is transformed with genes comprising: a gene encoding acetyl-CoA acetyltransferase/hydroxymethylglutaryl (HMG)-CoA reductase; a gene encoding HMG-CoA synthase; a gene encoding mevalonate kinase; a gene encoding Streptococcus pneumoniae-derived phosphomevalonate kinase; a gene encoding Streptococcus pneumoniae-derived mevalonate diphosphate decarboxylase; a gene encoding isopentenyl diphosphate isomerase; a gene encoding Pantoea agglomerans-derived geranylgeranyl pyrophosphate synthase; and a gene encoding phytoene synthase.

Description

A microorganism of the genus Escherichia having enhanced isoprenoid productivity and a method of producing isoprenoids using the same

The present invention relates to a microorganism of the genus Escherichia having improved isoprenoid productivity and to a method for producing isoprenoids using the same.

Isoprenoid is a generic term for compounds built from isoprene, a five-carbon unit. Isoprenoids include lycopene, β-carotene, astaxanthin, coenzyme Q10, retinol, and retinal.

Two pathways for isoprenoid biosynthesis are known: the mevalonate pathway and the non-mevalonate pathway. Microorganisms of the genus Escherichia, for example E. coli, have only the MEP pathway, which is a non-mevalonate pathway. FIG. 1 schematically illustrates the synthesis of retinal, a type of isoprenoid, from basic building blocks through the MEP pathway and the mevalonate pathway. In FIG. 1, DXP is 1-deoxy-D-xylulose-5-phosphate, MEP is 2-C-methyl-D-erythritol-4-phosphate, IPP is isopentenyl diphosphate, DMAPP is dimethylallyl diphosphate, FPP is farnesyl pyrophosphate, GGPP is geranylgeranyl pyrophosphate, and HMG is hydroxymethylglutaryl.

Microorganisms of the genus Escherichia, for example E. coli, grow rapidly and their metabolism is well studied, but the endogenous MEP pathway has been known to be inefficient for isoprenoid biosynthesis. Research has therefore been reported on developing microorganisms of the genus Escherichia into which genes encoding enzymes of a foreign isoprenoid synthesis pathway are introduced. For example, Korean Patent Laid-Open Publication No. 2007-0103100 discloses a microorganism transformed with a combination of genes for mevalonate pathway enzymes derived from various organisms, and a method for producing isoprenoids through the mevalonate pathway using it. Specifically, it discloses a method for producing isoprenoids that comprises transforming a host microorganism with genes encoding (a) mevalonate kinase from Staphylococcus aureus, and phosphomevalonate kinase and diphosphomevalonate decarboxylase from Streptococcus pneumoniae; and (b-1) acetyl-CoA acetyltransferase/hydroxymethylglutaryl-CoA reductase and hydroxymethylglutaryl-CoA synthase from Enterococcus faecalis, or (b-2) acetyl-CoA acetyltransferase from Ralstonia eutropha, and hydroxymethylglutaryl-CoA synthase and hydroxymethylglutaryl-CoA reductase from Streptococcus pneumoniae, and then culturing the transformed host microorganism to produce isoprenoids.

However, even with the above method, there remains a need for a microorganism of the genus Escherichia that produces isoprenoids such as β-carotene and retinal with high productivity, and for a method of producing isoprenoids using it. Accordingly, the present inventors completed the present invention by introducing into a microorganism of the genus Escherichia genes encoding the enzymes of an optimal foreign mevalonate pathway, together with a gene encoding DXP synthase, the enzyme of the rate-determining step of the endogenous MEP pathway.

An object of the present invention is to provide a microorganism of the genus Escherichia that can produce isoprenoids such as β-carotene and retinal with high efficiency.

Another object of the present invention is to provide a method for producing isoprenoids such as β-carotene and retinal with high efficiency.

The present invention provides a microorganism of the genus Escherichia that produces isoprenoids and is transformed with a gene encoding acetyl-CoA acetyltransferase/hydroxymethylglutaryl (HMG)-CoA reductase from Enterococcus faecalis, a gene encoding HMG-CoA synthase from Enterococcus faecalis, a gene encoding mevalonate kinase from Streptococcus pneumoniae, a gene encoding phosphomevalonate kinase from Streptococcus pneumoniae, a gene encoding mevalonate diphosphate decarboxylase from Streptococcus pneumoniae, a gene encoding isopentenyl diphosphate (IPP) isomerase from E. coli, a gene encoding geranylgeranyl pyrophosphate (GGPP) synthase from Pantoea agglomerans, a gene encoding phytoene synthase from Pantoea agglomerans, a gene encoding phytoene dehydrogenase from Pantoea agglomerans, a gene encoding lycopene-β-cyclase from Pantoea ananatis, and a gene encoding 1-deoxyxylulose-5-phosphate (DXP) synthase from E. coli.

The microorganisms of the genus Escherichia include E. coli, for example strains MG1655, DH5α, XL1-Blue, and BL21, and E. coli DH5α is preferred.

The isoprenoid-producing microorganism of the genus Escherichia of the present invention inherently has the MEP pathway, and is a strain into which genes encoding enzymes of a foreign mevalonate pathway that produce IPP from acetyl-CoA have been introduced. Genes encoding the enzymes that synthesize β-carotene from IPP have additionally been introduced, as has a gene encoding DXP synthase, the enzyme of the rate-determining step of the endogenous MEP pathway. The microorganism can therefore produce β-carotene at high concentration.

The isoprenoid-producing microorganism of the genus Escherichia of the present invention may be additionally transformed with a gene encoding IPP isomerase from Haematococcus pluvialis. Two copies of an IPP isomerase gene are thereby introduced, which promotes the conversion of IPP to DMAPP.

The isoprenoid-producing microorganism of the genus Escherichia of the present invention may be further transformed with one or more genes selected from the group consisting of a gene encoding β-carotene monooxygenase from uncultured marine bacterium 66A03 (Sabehi et al., PLoS Biol. 2005, 3, 1409-1417), a gene encoding β-carotene 15,15'-monooxygenase from mouse (Mus musculus), a gene encoding brp-like protein 2 from Natronomonas pharaonis ATCC35678, and a gene encoding β-carotene monooxygenase from Halobacterium salinarum ATCC700922. These genes encode enzymes that convert β-carotene to retinal. The gene is preferably the one encoding β-carotene monooxygenase from uncultured marine bacterium 66A03, and more preferably that gene has the nucleotide sequence of SEQ ID NO: 32, which is codon-optimized for E. coli.

In the isoprenoid-producing microorganism of the genus Escherichia of the present invention, the acetyl-CoA acetyltransferase/hydroxymethylglutaryl (HMG)-CoA reductase from Enterococcus faecalis may have the amino acid sequence of SEQ ID NO: 1, the HMG-CoA synthase from Enterococcus faecalis that of SEQ ID NO: 2, the mevalonate kinase from Streptococcus pneumoniae that of SEQ ID NO: 3, the phosphomevalonate kinase from Streptococcus pneumoniae that of SEQ ID NO: 4, the mevalonate diphosphate decarboxylase from Streptococcus pneumoniae that of SEQ ID NO: 5, the isopentenyl diphosphate (IPP) isomerase from E. coli that of SEQ ID NO: 6, the geranylgeranyl pyrophosphate (GGPP) synthase from Pantoea agglomerans that of SEQ ID NO: 7, the phytoene synthase from Pantoea agglomerans that of SEQ ID NO: 8, the phytoene dehydrogenase from Pantoea agglomerans that of SEQ ID NO: 9, the lycopene-β-cyclase from Pantoea ananatis that of SEQ ID NO: 10, and the 1-deoxyxylulose-5-phosphate (DXP) synthase from E. coli that of SEQ ID NO: 11.

In addition, the IPP isomerase from Haematococcus pluvialis may have the amino acid sequence of SEQ ID NO: 12.

In addition, the β-carotene monooxygenase from uncultured marine bacterium 66A03 may have the amino acid sequence of SEQ ID NO: 13, the β-carotene 15,15'-monooxygenase from mouse (Mus musculus) that of SEQ ID NO: 14, the brp-like protein 2 from Natronomonas pharaonis ATCC35678 that of SEQ ID NO: 15, and the β-carotene monooxygenase from Halobacterium salinarum ATCC700922 that of SEQ ID NO: 16 or 17.

An example of the isoprenoid-producing microorganism of the genus Escherichia of the present invention is E. coli DH5α/pTDHB/pSNA deposited under accession number KCTC 11254BP, or E. coli DH5α/pTDHBSR/pSNA deposited under accession number KCTC 11255BP. These strains can produce β-carotene or retinal with high productivity from a carbon source in the medium.

The genes encoding enzymes that convert β-carotene to retinal may be one or more genes selected from the group consisting of the gene encoding β-carotene monooxygenase from uncultured marine bacterium 66A03 (Sabehi et al., PLoS Biol. 2005, 3, 1409-1417), the Bcmo1 gene encoding β-carotene 15,15'-monooxygenase from mouse (Mus musculus), the brp2 gene encoding brp-like protein 2 from Natronomonas pharaonis ATCC35678, and the blh and brp genes encoding β-carotene monooxygenase from Halobacterium salinarum ATCC700922.

In the present specification, the isoprenoid may be selected from the group consisting of β-carotene, retinal, retinol, retinyl acetate, and retinoic acid. It is preferably β-carotene or retinal.

The present invention also provides a method for producing isoprenoids from a microorganism of the genus Escherichia, comprising the steps of culturing the microorganism of the present invention as described above, and separating the isoprenoid from the culture.

The method of the present invention for producing isoprenoids from a microorganism of the genus Escherichia comprises culturing the microorganism of the present invention as described above.

In the method of the present invention, the culture may be carried out in a synthetic, semisynthetic, or complex culture medium. A medium composed of a carbon source, a nitrogen source, vitamins, and minerals may be used as the culture medium. For example, MRS (de Man, Rogosa and Sharpe) liquid medium or a liquid medium supplemented with milk may be used. The carbon source may be selected from the group consisting of starch, glucose, sucrose, galactose, fructose, glycerol, and mixtures thereof, and is preferably glycerol. The nitrogen source may be selected from the group consisting of ammonium sulfate, ammonium nitrate, sodium nitrate, glutamic acid, casamino acids, yeast extract, peptone, tryptone, soybean meal, and mixtures thereof. The mineral may be selected from the group consisting of sodium chloride, dipotassium phosphate, magnesium sulfate, and mixtures thereof.

The carbon source, nitrogen source, and mineral in the microbial culture medium may be used at 10 to 100 g, 5 to 40 g, and 0.5 to 4 g per liter, respectively.
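The per-liter ranges above scale linearly with batch volume. The following is a minimal Python sketch of that arithmetic; the function names and the dictionary layout are illustrative assumptions, not part of the specification.

```python
# Ranges in grams per liter quoted in the text for each component class.
RANGES_G_PER_L = {
    "carbon source": (10.0, 100.0),
    "nitrogen source": (5.0, 40.0),
    "mineral": (0.5, 4.0),
}


def mass_range_g(component, volume_l):
    """Return (min, max) grams of a component class for a medium volume in liters."""
    if volume_l <= 0:
        raise ValueError("volume_l must be positive")
    low, high = RANGES_G_PER_L[component]
    return (low * volume_l, high * volume_l)


def within_range(component, grams, volume_l):
    """Check whether a weighed amount falls inside the quoted range."""
    low, high = mass_range_g(component, volume_l)
    return low <= grams <= high
```

For a 0.5 L batch, for instance, `mass_range_g("carbon source", 0.5)` gives the lower and upper bounds in grams for the carbon source.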

The vitamin added to the conventional culture medium may be selected from the group consisting of vitamin A, vitamin B, vitamin C, vitamin D, vitamin E, and mixtures thereof. The vitamin may be added to the culture medium together with the carbon source, nitrogen source, minerals, and other components mentioned above, or it may be added to a medium that has already been prepared and sterilized.

The culture may be carried out under conventional E. coli culture conditions, for example at about 15 to 45 °C. Centrifugation or filtration may be performed to remove the culture medium and recover only the concentrated cells, and this step may be carried out as needed by those skilled in the art. The concentrated cells may be frozen or freeze-dried by conventional methods so that they are preserved without losing activity.

In one example, the culture may be carried out in a medium containing glycerol as the carbon source, preferably a medium containing 0.5 vol% to 2.0 vol% glycerol. The medium may be YT medium supplemented with glycerol and arabinose. YT medium consists of 1.6 wt% tryptone, 1 wt% yeast extract, and 0.5 wt% NaCl.
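As a worked example of the YT composition above, the sketch below computes the amounts needed for a given batch volume. It assumes that the weight percentages can be treated as grams per 100 mL of medium, which is a common approximation for dilute aqueous media but is not stated in the text. The arabinose amount is not given in this passage and is therefore left out. The function name is hypothetical.

```python
# YT medium composition quoted in the text (wt%).
YT_WT_PERCENT = {"tryptone": 1.6, "yeast extract": 1.0, "NaCl": 0.5}


def yt_recipe(volume_ml, glycerol_vol_percent):
    """Return grams of each solid and mL of glycerol for one batch.

    Assumption: wt% is approximated as g per 100 mL of medium.
    """
    if volume_ml <= 0:
        raise ValueError("volume_ml must be positive")
    # The preferred glycerol range quoted in the text is 0.5 to 2.0 vol%.
    if not 0.5 <= glycerol_vol_percent <= 2.0:
        raise ValueError("glycerol outside the preferred 0.5-2.0 vol% range")
    recipe = {
        name + " (g)": pct * volume_ml / 100.0
        for name, pct in YT_WT_PERCENT.items()
    }
    recipe["glycerol (mL)"] = glycerol_vol_percent * volume_ml / 100.0
    return recipe
```

Under that assumption, one liter of medium with 1 vol% glycerol would need about 16 g tryptone, 10 g yeast extract, 5 g NaCl, and 10 mL glycerol.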

The microorganism of the present invention is as described above.

The method of the present invention also comprises separating the isoprenoid from the culture. Methods for separating isoprenoids, for example β-carotene or retinal, are well known in the art. For example, they may be separated by ion exchange chromatography, HPLC, or similar methods. Specifically, after the cells are recovered and extracted with a solvent such as acetone, separation and purification by HPLC or crystallization may be carried out to obtain a high-purity product.

The isoprenoid-producing microorganism of the genus Escherichia of the present invention can produce isoprenoids such as β-carotene and retinal with high efficiency.

The method of the present invention for producing isoprenoids from a microorganism of the genus Escherichia can produce isoprenoids such as β-carotene and retinal with high efficiency.

Hereinafter, the present invention is described in more detail through examples. These examples are intended only to illustrate the present invention, and the scope of the present invention is not limited to them.

Retinoid analysis conditions

In the following examples, retinoids were analyzed by the following method.

A 100 µl sample of the culture was centrifuged at 14,000 rpm for 40 seconds to recover the cells. The cells were resuspended in 1 ml of acetone and extracted at 55 °C for 15 minutes in the dark. The suspension was then centrifuged at 14,000 rpm for 10 minutes, and only the supernatant was taken for quantitative HPLC analysis. The HPLC conditions for retinoid analysis are shown in Table 1 below.
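Because cells from 100 µl of culture are extracted into 1 ml of acetone, a concentration measured in the extract must be scaled back to the culture volume. The sketch below shows that conversion. It assumes complete extraction, a negligible pellet volume, and no solvent loss during the 55 °C step; none of these is stated in the text, and the function name is hypothetical.

```python
def culture_titer(extract_conc, culture_volume_ul=100.0, acetone_volume_ul=1000.0):
    """Convert a concentration measured in the acetone extract into a culture titer.

    The result has the same concentration unit as extract_conc (for example mg/L).
    Assumptions: complete extraction, negligible pellet volume, no evaporation.
    """
    if culture_volume_ul <= 0 or acetone_volume_ul <= 0:
        raise ValueError("volumes must be positive")
    # The analyte from the sampled culture ends up in the acetone volume, so the
    # culture concentration is the extract concentration times the volume ratio.
    return extract_conc * acetone_volume_ul / culture_volume_ul
```

With the volumes given above, the culture titer is ten times the concentration read from the extract.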

Table 1. HPLC conditions for retinoid analysis

Item | Condition
HPLC model | SHIMADZU Class 10Avp series with UV/Vis detector
Column | Symmetry C18 (250 x 4.6, 5 µm) with Symmetry guard C18 (15 x 4.6, 5 µm)
Flow rate | 1.5 mL/min
Injection volume | 20 µL
Mobile phase | acetonitrile : methanol = 5 : 95
Oven temperature | 40 °C
Run time | 15 min

The HPLC conditions for β-carotene were the same as those for retinoids, except that the mobile phase was acetonitrile : methanol = 30 : 70. The UV detector wavelength was 454 nm for β-carotene, 370 nm for retinal, and 340 nm for retinol and retinyl acetate.
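The two methods differ only in mobile phase and detection wavelength, which can be captured in a small configuration object. The sketch below is illustrative: the class and field names are hypothetical, and it assumes the mobile-phase ratio is by volume and held constant over the run, which the text does not state.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HplcMethod:
    acetonitrile_parts: float
    methanol_parts: float
    flow_ml_per_min: float = 1.5   # Table 1
    run_time_min: float = 15.0     # Table 1
    injection_ul: float = 20.0     # Table 1
    oven_c: float = 40.0           # Table 1

    def solvent_per_run_ml(self):
        """Solvent consumed in one run, split by the mobile-phase ratio.

        Assumption: the ratio is volume/volume and constant during the run.
        """
        total = self.flow_ml_per_min * self.run_time_min
        parts = self.acetonitrile_parts + self.methanol_parts
        return {
            "acetonitrile": total * self.acetonitrile_parts / parts,
            "methanol": total * self.methanol_parts / parts,
        }


RETINOID_METHOD = HplcMethod(acetonitrile_parts=5, methanol_parts=95)
BETA_CAROTENE_METHOD = HplcMethod(acetonitrile_parts=30, methanol_parts=70)

# Detection wavelengths (nm) quoted in the text.
DETECTION_NM = {
    "beta-carotene": 454,
    "retinal": 370,
    "retinol": 340,
    "retinyl acetate": 340,
}
```

At 1.5 mL/min for 15 min, each run consumes 22.5 mL of mobile phase in total, which the method splits according to the stated ratio.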

Example 1: Preparation of vectors for producing E. coli with high β-carotene and retinal productivity

In this example, E. coli with high β-carotene productivity was prepared by additionally introducing into E. coli, which has the MEP pathway, a gene encoding DXP synthase, the enzyme of the rate-determining step, and at the same time introducing genes encoding mevalonate pathway enzymes selected from various genetic sources.

(1) Preparation of the pSNA vector containing genes encoding mevalonate pathway enzymes involved in synthesizing IPP from a carbon source

The genes used in this section, which encode the mevalonate pathway enzymes involved in synthesizing IPP from a carbon source, are listed in Table 2 below.

Table 2.

Enzyme | Gene | Gene sequence (GenBank accession number)
Acetyl-CoA acetyltransferase/hydroxymethylglutaryl (HMG)-CoA reductase from Enterococcus faecalis | mvaE | SEQ ID NO: 18 (AF290092)
HMG-CoA synthase from Enterococcus faecalis | mvaS | SEQ ID NO: 19 (AF290092)
Mevalonate kinase from Streptococcus pneumoniae | mvaK1 | SEQ ID NO: 20 (AF290099)
Phosphomevalonate kinase from Streptococcus pneumoniae | mvaK2 | SEQ ID NO: 21 (AF290099)
Mevalonate diphosphate decarboxylase from Streptococcus pneumoniae | mvaD | SEQ ID NO: 22 (AF290099)
Isopentenyl diphosphate (IPP) isomerase from E. coli | idi | SEQ ID NO: 23 (U00096)

Table 3. Primers and restriction enzymes for amplifying the genes of Table 2

Gene | Primer | Primer sequence | Restriction enzyme
mvaE | F | SEQ ID NO: 37 | SacI
mvaE | R | SEQ ID NO: 38 | SmaI
mvaS | F | SEQ ID NO: 39 | SmaI
mvaS | R | SEQ ID NO: 40 | BamHI
mvaK1, mvaK2, mvaD | F | SEQ ID NO: 41 | KpnI
mvaK1, mvaK2, mvaD | R | SEQ ID NO: 42 | XbaI
idi | F | SEQ ID NO: 43 | SmaI
idi | R | SEQ ID NO: 44 | SphI

Table 3 shows the primer sequences and restriction enzymes used to clone the genes of Table 2. Because mvaK1, mvaK2, and mvaD are present on the chromosome as a single operon, the whole operon was PCR-cloned at once rather than each gene separately.

The genes of Table 2 were amplified by PCR using the primers listed in Table 3, with the chromosomal DNA of the strain containing each gene as the template. The amplified products were inserted into the pSTV28 vector (Takara Korea, Korea) (SEQ ID NO: 45) using the restriction enzymes listed in Table 3 to prepare the vector pSNA. The vector pSNA contains all of the genes encoding the mevalonate pathway enzymes needed to produce IPP from acetyl-CoA.
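The cloning plan in Tables 2 and 3 can be written down as a small data structure, which makes it easy to list the genes carried by pSNA and the restriction enzymes the procedure calls for. The sketch below is purely illustrative bookkeeping: the record layout and function names are assumptions, and it does not model the order or orientation of the inserts in the vector.

```python
# One record per PCR amplicon, taken from Tables 2 and 3.
PSNA_AMPLICONS = [
    {"genes": ("mvaE",), "source": "Enterococcus faecalis",
     "primers": (37, 38), "sites": ("SacI", "SmaI")},
    {"genes": ("mvaS",), "source": "Enterococcus faecalis",
     "primers": (39, 40), "sites": ("SmaI", "BamHI")},
    # mvaK1, mvaK2 and mvaD form one operon and were amplified together.
    {"genes": ("mvaK1", "mvaK2", "mvaD"), "source": "Streptococcus pneumoniae",
     "primers": (41, 42), "sites": ("KpnI", "XbaI")},
    {"genes": ("idi",), "source": "Escherichia coli",
     "primers": (43, 44), "sites": ("SmaI", "SphI")},
]


def genes_in_vector(amplicons):
    """Flatten the amplicon records into a list of gene names, in table order."""
    return [gene for amp in amplicons for gene in amp["genes"]]


def enzymes_needed(amplicons):
    """Return the distinct restriction enzymes used, sorted alphabetically."""
    return sorted({site for amp in amplicons for site in amp["sites"]})
```

Four amplicons carry six genes, since the S. pneumoniae operon contributes three genes in a single PCR product.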

FIG. 2 shows a map of the pSNA vector.

(2) Preparation of the vector pTDHB containing genes encoding enzymes involved in synthesizing β-carotene from IPP

The genes used in this section, which encode the enzymes involved in synthesizing β-carotene from IPP, and the gene for DXP synthase, the enzyme of the rate-determining step of the MEP pathway, are listed in Table 4 below.

Table 4.

Enzyme | Gene | Gene sequence (GenBank accession number)
IPP isomerase from Haematococcus pluvialis | ipiHp1 | SEQ ID NO: 24 (AF082325)
1-Deoxyxylulose-5-phosphate (DXP) synthase from E. coli | dxs | SEQ ID NO: 25 (U00096)
Geranylgeranyl pyrophosphate (GGPP) synthase from Pantoea agglomerans | crtE | SEQ ID NO: 26 (M87280)
Phytoene synthase from Pantoea agglomerans | crtB | SEQ ID NO: 27 (M87280)
Phytoene dehydrogenase from Pantoea agglomerans | crtI | SEQ ID NO: 28 (M87280)
Lycopene-β-cyclase from Pantoea ananatis | crtY | SEQ ID NO: 29 (D90087)

Table 5.

Gene | Forward primer (F) | Reverse primer (R) | Restriction enzymes
ipiHp1 | SEQ ID NO: 46 | SEQ ID NO: 47 | SmaI, SphI
dxs | SEQ ID NO: 48 | SEQ ID NO: 49 | EcoRI, SnaBI
crtE | SEQ ID NO: 50 | SEQ ID NO: 51 | BspHI, EcoRI
crtB, crtI | SEQ ID NO: 52 | SEQ ID NO: 53 | EcoRI, SacI
crtY | SEQ ID NO: 54 | SEQ ID NO: 55 | SalI, PstI

The primer sequences and restriction enzymes used to clone the genes of Table 4 are listed in Table 5. Because crtB and crtI lie on the chromosome as a single operon, the whole operon was cloned in one PCR rather than amplifying each gene separately.

The genes of Table 4 were amplified by PCR using the primers listed in Table 5, with chromosomal DNA of the strain harboring each gene as the template. The amplified products were inserted into the pTrc99A vector (GenBank accession number M22744) (SEQ ID NO: 29) using the restriction enzymes listed in Table 5 to construct the vector pTDHB. The vector pTDHB contains all of the genes encoding the enzymes involved in synthesizing β-carotene from IPP, together with the gene for DXP synthase, the enzyme of the rate-determining step of the MEP pathway.
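The restriction-enzyme cloning step described above depends on each amplicon carrying the chosen recognition sites only where intended. The following is a minimal sketch, not the inventors' procedure, of how one might scan a sequence for the sites of the enzymes named in Table 5. The recognition sequences are the standard ones for these enzymes; the amplicon string is a hypothetical toy sequence, because the primer sequences (SEQ ID NOs: 46 to 55) are not reproduced in this section.

```python
# Standard recognition sequences (5'->3') of the enzymes named in Table 5.
# All of them are palindromic, so scanning one strand is sufficient.
RECOGNITION_SITES = {
    "SmaI": "CCCGGG",
    "SphI": "GCATGC",
    "EcoRI": "GAATTC",
    "SnaBI": "TACGTA",
    "BspHI": "TCATGA",
    "SacI": "GAGCTC",
    "SalI": "GTCGAC",
    "PstI": "CTGCAG",
}


def find_sites(sequence, enzymes):
    """Return {enzyme: [0-based start positions]} for every site occurrence."""
    seq = sequence.upper()
    hits = {}
    for name in enzymes:
        site = RECOGNITION_SITES[name]
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start)
            start = seq.find(site, start + 1)  # allow overlapping matches
        hits[name] = positions
    return hits


# Hypothetical toy amplicon: EcoRI site + arbitrary filler + SacI site.
toy_amplicon = "GAATTC" + "ATGAGCCAGCCGCCG" + "GAGCTC"
sites = find_sites(toy_amplicon, ["EcoRI", "SacI"])
print(sites)
```

An amplicon suited to directional cloning would show each of its two enzymes exactly once, at opposite ends; an extra internal hit would mean the insert itself gets cut.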

FIG. 3 shows the vector map of pTDHB.

(3) Preparation of vectors comprising a gene encoding an enzyme involved in synthesizing retinal from β-carotene

The genes used in this section that encode enzymes involved in synthesizing retinal from β-carotene are listed in Table 6 below. For the β-carotene monooxygenase from uncultured marine bacterium 66A03 in Table 6, the SR gene, an E. coli codon-optimized sequence of blh, was used.
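The text does not say how the codon-optimized SR sequence (SEQ ID NO: 32) was derived. As a rough illustration of the general idea only, the sketch below back-translates a protein into DNA using one frequently used E. coli codon per amino acid. The codon choices and the toy peptide are assumptions made for this example; they are not the table or sequence used by the inventors.

```python
# One commonly used E. coli codon per amino acid (illustrative assumption,
# not the codon table used to design the SR gene).
PREFERRED_CODON = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGC", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
}


def back_translate(protein):
    """Return a DNA sequence encoding `protein` (one-letter amino acid codes)."""
    return "".join(PREFERRED_CODON[aa] for aa in protein.upper())


toy_peptide = "MKTV"  # hypothetical peptide, for illustration only
optimized = back_translate(toy_peptide)
print(optimized)
```

Practical codon optimization usually weighs further factors such as mRNA structure and avoidance of unwanted restriction sites, which this one-codon-per-residue sketch ignores.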

Table 6.

Enzyme | Gene | Gene sequence (GenBank accession number)
β-Carotene monooxygenase from uncultured marine bacterium 66A03 | blh | SEQ ID NO: 31 (DQ065755)
β-Carotene monooxygenase from uncultured marine bacterium 66A03 | SR (E. coli codon-optimized sequence of blh) | SEQ ID NO: 32
β-Carotene 15,15'-monooxygenase from mouse (Mus musculus) | bcmo1 | SEQ ID NO: 33 (NM_021486)
brp-like protein 2 from Natronomonas pharaonis ATCC35678 | brp2 | SEQ ID NO: 34 (CR936257)
β-Carotene monooxygenase from Halobacterium salinarum ATCC700922 | blh | SEQ ID NO: 35 (AE004437)
β-Carotene monooxygenase from Halobacterium salinarum ATCC700922 | brp | SEQ ID NO: 36 (AE004437)

Table 7.

Gene | Forward primer (F) | Reverse primer (R) | Restriction enzymes
SR | SEQ ID NO: 56 | SEQ ID NO: 57 | SpeI, SpeI
bcmo1 | SEQ ID NO: 58 | SEQ ID NO: 59 | EcoRI, SpeI
brp2 | SEQ ID NO: 60 | SEQ ID NO: 61 | EcoRI, SpeI
blh | SEQ ID NO: 62 | SEQ ID NO: 63 | EcoRI, SpeI
brp | SEQ ID NO: 64 | SEQ ID NO: 65 | EcoRI, SpeI

The primer sequences and restriction enzymes used to clone the genes of Table 6 are listed in Table 7. The genes of Table 6 were amplified by PCR using the primers listed in Table 7, with chromosomal DNA of the strain harboring each gene, or a mouse cDNA library, as the template. The amplified products were each inserted into the vector pTDHB using the restriction enzymes listed in Table 7 to construct the vectors pTDHBSR, pTDHBBcmo1, pTDHBbrp2, pTDHBblh, and pTDHBbrp. These vectors are pTDHB carrying the SR, Bcmo1, brp2, blh, and brp genes, respectively. Each contains the genes encoding the enzymes involved in synthesizing retinal from IPP via β-carotene, together with the gene for DXP synthase, the enzyme of the rate-determining step of the MEP pathway.

FIG. 4 shows the vector map of pTDHBSR. pTDHBBcmo1, pTDHBbrp2, pTDHBblh, and pTDHBbrp are identical to pTDHBSR except that the SR gene is replaced with the Bcmo1, brp2, blh, and brp genes, respectively.

Example 2: Preparation of E. coli with high β-carotene and retinal productivity

(1) Selection of the optimal strains for retinal and β-carotene production

The pSNA vector prepared in Example 1 was introduced into E. coli DH5α together with pTDHB, pTDHBSR, pTDHBBcmo1, pTDHBbrp2, pTDHBblh, or pTDHBbrp, and each resulting strain was cultured.

Cultures were grown at 29°C for 48 hours in 2YT medium containing 0.5 wt% glycerol and 0.2 wt% arabinose as the carbon source. In a 50 ml test tube, 7 ml of medium was inoculated with 0.2 ml of the strain and incubated with agitation at 250 rpm. Cells were harvested from the 48-hour culture, β-carotene and retinal were extracted with acetone, and the retinal and β-carotene concentrations were measured by HPLC.

The amounts of retinal and β-carotene produced after 48 hours are shown in Table 8 below.

Table 8.

Concentration | pTDHB | pTDHBSR | pTDHBBcmo1 | pTDHBbrp2 | pTDHBblh | pTDHBbrp
Retinal (mg/L) | 0 | 2 | 0 | 0.05 | 0.14 | 0.22
β-Carotene (mg/L) | 120 | 0 | 11 | 35 | 3 | 0

From these results, the recombinant E. coli DH5α strain harboring pTDHBSR was found to be the most suitable retinal-producing strain, and the recombinant E. coli DH5α strain harboring pTDHB the most suitable β-carotene-producing strain.
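The selection above amounts to picking, for each product, the vector with the highest 48-hour titer in Table 8. A minimal sketch of that comparison, with the table values transcribed by hand and the dictionary layout chosen only for illustration:

```python
# Titers after 48 h (mg/L), transcribed from Table 8.
table8 = {
    "pTDHB":      {"retinal": 0,    "beta_carotene": 120},
    "pTDHBSR":    {"retinal": 2,    "beta_carotene": 0},
    "pTDHBBcmo1": {"retinal": 0,    "beta_carotene": 11},
    "pTDHBbrp2":  {"retinal": 0.05, "beta_carotene": 35},
    "pTDHBblh":   {"retinal": 0.14, "beta_carotene": 3},
    "pTDHBbrp":   {"retinal": 0.22, "beta_carotene": 0},
}


def best_vector(product):
    """Return the vector with the highest titer of `product`."""
    return max(table8, key=lambda vector: table8[vector][product])


print(best_vector("retinal"), best_vector("beta_carotene"))
```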

(2) Search for optimal culture conditions for retinal production using the pTDHBSR strain

(2.1) Effect of dissolved oxygen

To examine the effect of dissolved oxygen on retinal production, the recombinant E. coli DH5α strain harboring pSNA and pTDHBSR was cultured in test tubes with the culture volume set to 3, 5, 7, or 10 ml.

Cultures were grown at 29°C for 48 hours in 2YT medium containing 0.5 wt% glycerol and 0.2 wt% arabinose as the carbon source. In a 50 ml test tube, each of the medium volumes given above was inoculated with 0.2 ml of the strain and incubated with shaking at 250 rpm. To quantify retinoid production, cells were harvested from the culture and extracted with acetone, and the concentrations of retinal, retinol, and retinyl acetate were measured by HPLC.
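In this design the culture volume serves as an indirect handle on aeration: in a shaken tube of fixed size, less liquid leaves more headspace per unit of culture. The sketch below works out the fill fraction, headspace-to-liquid ratio, and inoculum fraction for the four volumes. It treats the stated 50 ml as the total tube volume, which is an assumption, and these ratios are only a crude proxy, since dissolved oxygen itself is not reported in this section.

```python
TUBE_VOLUME_ML = 50.0  # nominal tube volume stated in the text (assumed total volume)
INOCULUM_ML = 0.2      # inoculum volume stated in the text


def aeration_summary(medium_ml):
    """Simple geometric ratios for a given medium volume in the tube."""
    return {
        "fill_fraction": medium_ml / TUBE_VOLUME_ML,
        "headspace_to_liquid": (TUBE_VOLUME_ML - medium_ml) / medium_ml,
        "inoculum_fraction": INOCULUM_ML / medium_ml,
    }


summaries = {volume: aeration_summary(volume) for volume in (3, 5, 7, 10)}
for volume, summary in summaries.items():
    print(volume, {name: round(value, 3) for name, value in summary.items()})
```

Because the inoculum is fixed at 0.2 ml, the inoculum fraction also changes with culture volume, so the four conditions differ in more than aeration alone.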

Although the strain used for the culture carries no introduced retinol dehydrogenase, retinal dehydrogenase, or retinol O-fatty-acyltransferase gene, the cultures produced a range of retinoids, including retinal, retinol, and retinyl acetate. Production of all of these retinoids rose until the end of the exponential phase and then fell through degradation on entering the stationary phase. In particular, retinol, the most abundant of the retinoids, was degraded rapidly after reaching its maximum.

The maximum retinoid yields and the times at which they were reached are shown in Tables 9, 10, and 11 below.

Table 9. Retinal production

Culture volume (mL) | 3 | 5 | 7 | 10
Maximum concentration (mg/L) | 10 | 18 | 19 | 13
Time (h) | 18 | 30 | 36 | 36

Retinal production peaked at the end of the exponential phase and then declined slowly through degradation.

Table 10. Retinyl acetate production

Culture volume (mL) | 3 | 5 | 7 | 10
Maximum concentration (mg/L) | 5 | 8.5 | 9 | 10
Time (h) | 18 | 30 | 36 | 42

Retinyl acetate production did not differ greatly with culture volume.

Table 11. Retinol production

Culture volume (mL) | 3 | 5 | 7 | 10
Maximum concentration (mg/L) | 5 | 23 | 25 | 20
Time (h) | 18 | 30 | 36 | 42

From these results, the total amount of retinoids reached a maximum of 55 mg/L at 36 hours in the 7 ml culture. The optimal culture volume for retinoid production in test-tube culture was therefore 7 ml.

(2.2) Selection of the optimal host cell

In this section, the pSNA and pTDHBSR vectors were introduced into five E. coli strains, namely MG1655, DH5α, XL1-Blue, S17-1, and BL21, and the resulting strains were cultured in test tubes containing 7 ml of medium.

All other culture and analysis conditions were the same as described in section (2.1).

As a result, the DH5α strain gave the highest retinoid production, 50 mg/L at 36 hours, whereas the MG1655 strain was lowest in both retinoid production and cell growth. BL21 grew well but produced little retinoid.

Table 12.

Host cell | MG1655 | DH5α | XL1-Blue | BL21 | S17-1
Maximum retinol concentration (mg/L) | 0.06 | 30 | 7.5 | 1.9 | 8
Maximum total retinoid concentration (mg/L) | 2.26 | 50 | 23.4 | 3.7 | 24
Time (h) | 24 | 36 | 36 | 48 | 24

(2.3) Selection of the optimal carbon source

In this section, E. coli DH5α harboring the pSNA and pTDHBSR vectors was cultured in test tubes containing 7 ml of medium.

Cultures were grown at 29°C for 48 hours in 2YT medium containing, as the carbon source, one of 0.5% glycerol, 0.5% glucose, 0.5% xylose, 0.5% maltose, or 0.5% galactose, together with 0.2 wt% arabinose. All other culture and analysis conditions were the same as described in section (2.2).

As a result, with glycerol as the carbon source, retinol production reached 20 mg/L at 36 hours, up to 15-fold higher than with the other carbon sources. With glucose and maltose, retinol reached 9.3 mg/L and 5 mg/L, respectively, at 24 hours, whereas with xylose and galactose retinol production was similar to that obtained without an added carbon source. Because hardly any retinoid was produced without an added carbon source, the carbon source is clearly important.

Table 13.

Carbon source (wt%) | None | 0.5% glycerol | 0.5% glucose | 0.5% xylose | 0.5% maltose | 0.5% galactose
Maximum retinol concentration (mg/L) | 2.6 | 20 | 9.3 | 2.8 | 5 | 1.3
Maximum total retinoid concentration (mg/L) | 4.7 | 36 | 11 | 7 | 13 | 3.6
Time (h) | 24 | 36 | 24 | 24 | 24 | 24
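The "up to 15-fold" figure for glycerol can be reproduced from the maximum retinol concentrations in Table 13 by dividing the glycerol value by each of the others. This is a simple sketch with the values transcribed by hand. The variable names are illustrative.

```python
# Maximum retinol concentration (mg/L) per carbon source, from Table 13.
retinol_max = {
    "none": 2.6,
    "glycerol": 20,
    "glucose": 9.3,
    "xylose": 2.8,
    "maltose": 5,
    "galactose": 1.3,
}

# Fold increase of glycerol over every other condition.
fold = {
    source: retinol_max["glycerol"] / value
    for source, value in retinol_max.items()
    if source != "glycerol"
}
largest = max(fold, key=fold.get)

for source, ratio in fold.items():
    print(f"glycerol vs {source}: {ratio:.1f}-fold")
```

The largest ratio is against galactose, which is where the roughly 15-fold figure comes from. Against glucose, the next-best carbon source, the gain is closer to twofold.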

(2.4) Selection of the optimal glycerol concentration

In this section, the experiment was carried out as described in section (2.3), except that the 2YT medium contained glycerol at 0, 0.5, 1.0, or 2.0 wt% together with 0.2 wt% arabinose, and the cultures were grown for 96 hours.

As a result, 43 mg/L of retinol was produced within 48 hours in the medium containing 1.0% glycerol, and in the medium containing 2.0% glycerol retinol continued to be produced up to 96 hours, reaching 45 mg/L. Production of the minor products retinal and retinyl acetate also increased with glycerol concentration. Total retinoid production was highest in the medium containing 1.0% glycerol, at 80 mg/L at 48 hours.

Table 14.

Glycerol concentration (wt%) | 0% | 0.5% | 1.0% | 2.0%
Maximum retinal concentration (mg/L) | 2.6 | 14.3 | 25 | 13
Maximum retinyl acetate concentration (mg/L) | 1.6 | 8 | 12 | 8.3
Maximum retinol concentration (mg/L) | 3.8 | 20 | 43 | 45
Maximum retinoid concentration (mg/L) | 8 | 42.3 | 80 | 66.3
Time (h) | 24 | 36 | 48 | 96
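In Table 14 the total retinoid row appears to be the sum of the retinal, retinyl acetate, and retinol rows. The sketch below checks that reading against the transcribed values and identifies the glycerol concentration with the highest total. The 0.05 mg/L tolerance is an arbitrary choice to absorb rounding.

```python
import math

# Maximum concentrations (mg/L) per glycerol concentration (wt%), from Table 14.
table14 = {
    0.0: {"retinal": 2.6,  "retinyl_acetate": 1.6, "retinol": 3.8, "total": 8},
    0.5: {"retinal": 14.3, "retinyl_acetate": 8,   "retinol": 20,  "total": 42.3},
    1.0: {"retinal": 25,   "retinyl_acetate": 12,  "retinol": 43,  "total": 80},
    2.0: {"retinal": 13,   "retinyl_acetate": 8.3, "retinol": 45,  "total": 66.3},
}


def component_sum(row):
    """Sum of the three individually reported retinoids."""
    return row["retinal"] + row["retinyl_acetate"] + row["retinol"]


# True where the reported total matches the component sum within tolerance.
consistent = {
    glycerol: math.isclose(component_sum(row), row["total"], abs_tol=0.05)
    for glycerol, row in table14.items()
}
best_glycerol = max(table14, key=lambda glycerol: table14[glycerol]["total"])

print(consistent, best_glycerol)
```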

Example 3: Confirmation of the β-carotene productivity of E. coli DH5α harboring the pSNA and pTDHB vectors

In this example, E. coli DH5α harboring the pSNA and pTDHB vectors was cultured in test tubes containing 7 ml of medium.

Cultures were grown at 29°C for 168 hours in 2YT medium containing 0.5, 1.5, or 2.0 wt% glycerol as the carbon source together with 0.2 wt% arabinose. In a 50 ml test tube, the medium volume given above was inoculated with 0.2 ml of the strain and incubated with shaking at 250 rpm. To quantify β-carotene production, cells were harvested from the culture and extracted with acetone, and the β-carotene concentration was measured by HPLC.

As a result, β-carotene production increased with glycerol concentration, and about 500 mg/L of β-carotene was produced at 144 hours in the medium containing 2.0% glycerol.

Table 15.

Glycerol concentration (wt%) | 0.5% | 1.5% | 2.0%
Maximum β-carotene concentration (mg/L) | 150 | 250 | 500
Time (h) | 48 | 120 | 120
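Titer alone does not show how quickly each condition reaches its maximum. As a hedged illustration, the sketch below divides each maximum β-carotene concentration in Table 15 by the time listed in the same table to give an average volumetric productivity. This derived metric is not reported in the source and is shown only as one way to compare the conditions.

```python
# Glycerol (wt%) -> (maximum beta-carotene in mg/L, time in h), from Table 15.
table15 = {
    0.5: (150, 48),
    1.5: (250, 120),
    2.0: (500, 120),
}

# Average volumetric productivity in mg/L/h (derived, not a reported value).
productivity = {
    glycerol: concentration / hours
    for glycerol, (concentration, hours) in table15.items()
}

for glycerol, rate in productivity.items():
    print(f"{glycerol} wt% glycerol: {rate:.2f} mg/L/h")
```

By this measure the 2.0 wt% condition leads on both titer and average rate, while the 1.5 wt% condition has the lowest average rate of the three.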

The present inventors deposited the strains obtained in this example, DH5α/pTDHB/pSNA (Accession No. KCTC 11254BP) and DH5α/pTDHBSR/pSNA (Accession No. KCTC 11255BP), on January 2, 2008 with the Gene Bank of the Genetic Resources Center, Korea Research Institute of Bioscience and Biotechnology, an international depositary authority under the Budapest Treaty.

FIG. 1 schematically shows the process by which retinal, a kind of isoprenoid, is synthesized from basic building blocks via the MEP pathway and the mevalonate pathway.

FIG. 2 shows a map of the pSNA vector.

FIG. 3 shows the vector map of pTDHB.

FIG. 4 shows the vector map of pTDHBSR.

<110> INDUSTRY-ACADEMIC COOPERATION FOUNDATION GYEONGSANG NATIONAL UNIVERSITY <120> A microorganism of Escherichia genus having enhanced isoprenoid productivity and method of producing isoprenoid using the same <130> PN077141 <160> 65 <170> KopatentIn 1.71 <210> 1 <211> 803 <212> PRT <213> Enterococcus faecalis <400> 1 Met Lys Thr Val Val Ile Ile Asp Ala Leu Arg Thr Pro Ile Gly Lys 1 5 10 15 Tyr Lys Gly Ser Leu Ser Gln Val Ser Ala Val Asp Leu Gly Thr His 20 25 30 Val Thr Thr Gln Leu Leu Lys Arg His Ser Thr Ile Ser Glu Glu Ile 35 40 45 Asp Gln Val Ile Phe Gly Asn Val Leu Gln Ala Gly Asn Gly Gln Asn 50 55 60 Pro Ala Arg Gln Ile Ala Ile Asn Ser Gly Leu Ser His Glu Ile Pro 65 70 75 80 Ala Met Thr Val Asn Glu Val Cys Gly Ser Gly Met Lys Ala Val Ile 85 90 95 Leu Ala Lys Gln Leu Ile Gln Leu Gly Glu Ala Glu Val Leu Ile Ala 100 105 110 Gly Gly Ile Glu Asn Met Ser Gln Ala Pro Lys Leu Gln Arg Phe Asn 115 120 125 Tyr Glu Thr Glu Ser Tyr Asp Ala Pro Phe Ser Ser Met Met Tyr Asp 130 135 140 Gly Leu Thr Asp Ala Phe Ser Gly Gln Ala Met Gly Leu Thr Ala Glu 145 150 155 160 Asn Val Ala Glu Lys Tyr His Val Thr Arg Glu Glu Gln Asp Gln Phe 165 170 175 Ser Val His Ser Gln Leu Lys Ala Ala Gln Ala Gln Ala Glu Gly Ile 180 185 190 Phe Ala Asp Glu Ile Ala Pro Leu Glu Val Ser Gly Thr Leu Val Glu 195 200 205 Lys Asp Glu Gly Ile Arg Pro Asn Ser Ser Val Glu Lys Leu Gly Thr 210 215 220 Leu Lys Thr Val Phe Lys Glu Asp Gly Thr Val Thr Ala Gly Asn Ala 225 230 235 240 Ser Thr Ile Asn Asp Gly Ala Ser Ala Leu Ile Ile Ala Ser Gln Glu 245 250 255 Tyr Ala Glu Ala His Gly Leu Pro Tyr Leu Ala Ile Ile Arg Asp Ser 260 265 270 Val Glu Val Gly Ile Asp Pro Ala Tyr Met Gly Ile Ser Pro Ile Lys 275 280 285 Ala Ile Gln Lys Leu Leu Ala Arg Asn Gln Leu Thr Thr Glu Glu Ile 290 295 300 Asp Leu Tyr Glu Ile Asn Glu Ala Phe Ala Ala Thr Ser Ile Val Val 305 310 315 320 Gln Arg Glu Leu Ala Leu Pro Glu Glu Lys Val Asn Ile Tyr Gly Gly 325 330 335 Gly Ile Ser Leu Gly His Ala Ile Gly Ala Thr Gly Ala Arg Leu Leu 340 345 350 Thr Ser 
Leu Ser Tyr Gln Leu Asn Gln Lys Glu Lys Lys Tyr Gly Val 355 360 365 Ala Ser Leu Cys Ile Gly Gly Gly Leu Gly Leu Ala Met Leu Leu Glu 370 375 380 Arg Pro Gln Gln Lys Lys Asn Ser Arg Phe Tyr Gln Met Ser Pro Glu 385 390 395 400 Glu Arg Leu Ala Ser Leu Leu Asn Glu Gly Gln Ile Ser Ala Asp Thr 405 410 415 Lys Lys Glu Phe Glu Asn Thr Ala Leu Ser Ser Gln Ile Ala Asn His 420 425 430 Met Ile Glu Asn Gln Ile Ser Glu Thr Glu Val Pro Met Gly Val Gly 435 440 445 Leu His Leu Thr Val Asp Glu Thr Asp Tyr Leu Val Pro Met Ala Thr 450 455 460 Glu Glu Pro Ser Val Ile Ala Ala Leu Ser Asn Gly Ala Lys Ile Ala 465 470 475 480 Gln Gly Phe Lys Thr Val Asn Gln Gln Arg Leu Met Arg Gly Gln Ile 485 490 495 Val Phe Tyr Asp Val Ala Asp Ala Glu Ser Leu Ile Asp Glu Leu Gln 500 505 510 Val Arg Glu Thr Glu Ile Phe Gln Gln Ala Glu Leu Ser Tyr Pro Ser 515 520 525 Ile Val Lys Arg Gly Gly Gly Leu Arg Asp Leu Gln Tyr Arg Ala Phe 530 535 540 Asp Glu Ser Phe Val Ser Val Asp Phe Leu Val Asp Val Lys Asp Ala 545 550 555 560 Met Gly Ala Asn Ile Val Asn Ala Met Leu Glu Gly Val Ala Glu Leu 565 570 575 Phe Arg Glu Trp Phe Ala Glu Gln Lys Ile Leu Phe Ser Ile Leu Ser 580 585 590 Asn Tyr Ala Thr Glu Ser Val Val Thr Met Lys Thr Ala Ile Pro Val 595 600 605 Ser Arg Leu Ser Lys Gly Ser Asn Gly Arg Glu Ile Ala Glu Lys Ile 610 615 620 Val Leu Ala Ser Arg Tyr Ala Ser Leu Asp Pro Tyr Arg Ala Val Thr 625 630 635 640 His Asn Lys Gly Ile Met Asn Gly Ile Glu Ala Val Val Leu Ala Thr 645 650 655 Gly Asn Asp Thr Arg Ala Val Ser Ala Ser Cys His Ala Phe Ala Val 660 665 670 Lys Glu Gly Arg Tyr Gln Gly Leu Thr Ser Trp Thr Leu Asp Gly Glu 675 680 685 Gln Leu Ile Gly Glu Ile Ser Val Pro Leu Ala Leu Ala Thr Val Gly 690 695 700 Gly Ala Thr Lys Val Leu Pro Lys Ser Gln Ala Ala Ala Asp Leu Leu 705 710 715 720 Ala Val Thr Asp Ala Lys Glu Leu Ser Arg Val Val Ala Ala Val Gly 725 730 735 Leu Ala Gln Asn Leu Ala Ala Leu Arg Ala Leu Val Ser Glu Gly Ile 740 745 750 Gln Lys Gly His Met Ala Leu Gln Ala Arg Ser Leu Ala Met Thr Val 755 760 765 Gly Ala Thr 
Gly Lys Glu Val Glu Ala Val Ala Gln Gln Leu Lys Arg 770 775 780 Gln Lys Thr Met Asn Gln Asp Arg Ala Leu Ala Ile Leu Asn Asp Leu 785 790 795 800 Arg Lys Gln <210> 2 <211> 383 <212> PRT <213> Enterococcus faecalis <400> 2 Met Thr Ile Gly Ile Asp Lys Ile Ser Phe Phe Val Pro Pro Tyr Tyr 1 5 10 15 Ile Asp Met Thr Ala Leu Ala Glu Ala Arg Asn Val Asp Pro Gly Lys 20 25 30 Phe His Ile Gly Ile Gly Gln Asp Gln Met Ala Val Asn Pro Ile Ser 35 40 45 Gln Asp Ile Val Thr Phe Ala Ala Asn Ala Ala Glu Ala Ile Leu Thr 50 55 60 Lys Glu Asp Lys Glu Ala Ile Asp Met Val Ile Val Gly Thr Glu Ser 65 70 75 80 Ser Ile Asp Glu Ser Lys Ala Ala Ala Val Val Leu His Arg Leu Met 85 90 95 Gly Ile Gln Pro Phe Ala Arg Ser Phe Glu Ile Lys Glu Ala Cys Tyr 100 105 110 Gly Ala Thr Ala Gly Leu Gln Leu Ala Lys Asn His Val Ala Leu His 115 120 125 Pro Asp Lys Lys Val Leu Val Val Ala Ala Asp Ile Ala Lys Tyr Gly 130 135 140 Leu Asn Ser Gly Gly Glu Pro Thr Gln Gly Ala Gly Ala Val Ala Met 145 150 155 160 Leu Val Ala Ser Glu Pro Arg Ile Leu Ala Leu Lys Glu Asp Asn Val 165 170 175 Met Leu Thr Gln Asp Ile Tyr Asp Phe Trp Arg Pro Thr Gly His Pro 180 185 190 Tyr Pro Met Val Asp Gly Pro Leu Ser Asn Glu Thr Tyr Ile Gln Ser 195 200 205 Phe Ala Gln Val Trp Asp Glu His Lys Lys Arg Thr Gly Leu Asp Phe 210 215 220 Ala Asp Tyr Asp Ala Leu Ala Phe His Ile Pro Tyr Thr Lys Met Gly 225 230 235 240 Lys Lys Ala Leu Leu Ala Lys Ile Ser Asp Gln Thr Glu Ala Glu Gln 245 250 255 Glu Arg Ile Leu Ala Arg Tyr Glu Glu Ser Ile Ile Tyr Ser Arg Arg 260 265 270 Val Gly Asn Leu Tyr Thr Gly Ser Leu Tyr Leu Gly Leu Ile Ser Leu 275 280 285 Leu Glu Asn Ala Thr Thr Leu Thr Ala Gly Asn Gln Ile Gly Leu Phe 290 295 300 Ser Tyr Gly Ser Gly Ala Val Ala Glu Phe Phe Thr Gly Glu Leu Val 305 310 315 320 Ala Gly Tyr Gln Asn His Leu Gln Lys Glu Thr His Leu Ala Leu Leu 325 330 335 Asp Asn Arg Thr Glu Leu Ser Ile Ala Glu Tyr Glu Ala Met Phe Ala 340 345 350 Glu Thr Leu Asp Thr Asp Ile Asp Gln Thr Leu Glu Asp Glu Leu Lys 355 360 365 Tyr Ser Ile Ser Ala Ile Asn Asn 
Thr Val Arg Ser Tyr Arg Asn 370 375 380 <210> 3 <211> 292 <212> PRT <213> Streptococcus pneumoniae <400> 3 Met Thr Lys Lys Val Gly Val Gly Gln Ala His Ser Lys Ile Ile Leu 1 5 10 15 Ile Gly Glu His Ala Val Val Tyr Gly Tyr Pro Ala Ile Ser Leu Pro 20 25 30 Leu Leu Glu Val Glu Val Thr Cys Lys Val Val Ser Ala Glu Ser Pro 35 40 45 Trp Arg Leu Tyr Glu Glu Asp Thr Leu Ser Met Ala Val Tyr Ala Ser 50 55 60 Leu Glu Tyr Leu Asp Ile Thr Glu Ala Cys Val Arg Cys Glu Ile Asp 65 70 75 80 Ser Ala Ile Pro Glu Lys Arg Gly Met Gly Ser Ser Ala Ala Ile Ser 85 90 95 Ile Ala Ala Ile Arg Ala Val Phe Asp Tyr Tyr Gln Ala Asp Leu Pro 100 105 110 His Asp Val Leu Glu Ile Leu Val Asn Arg Ala Glu Met Ile Ala His 115 120 125 Met Asn Pro Ser Gly Leu Asp Ala Lys Thr Cys Leu Ser Asp Gln Pro 130 135 140 Ile Arg Phe Ile Lys Asn Val Gly Phe Thr Glu Leu Glu Met Asp Leu 145 150 155 160 Ser Ala Tyr Leu Val Ile Ala Asp Thr Gly Val Tyr Gly His Thr Arg 165 170 175 Glu Ala Ile Gln Val Val Gln Asn Lys Gly Lys Asp Ala Leu Pro Phe 180 185 190 Leu His Ala Leu Gly Glu Leu Thr Gln Gln Ala Glu Val Ala Ile Ser 195 200 205 Gln Lys Tyr Ala Glu Gly Leu Gly Leu Ile Phe Ser Gln Ala His Leu 210 215 220 His Leu Lys Glu Ile Gly Val Ser Ser Pro Glu Ala Asp Phe Leu Val 225 230 235 240 Glu Thr Ala Leu Ser Tyr Gly Ala Leu Gly Ala Lys Met Ser Gly Gly 245 250 255 Gly Leu Gly Gly Cys Ile Ile Ala Leu Val Thr Asn Leu Thr His Ala 260 265 270 Gln Glu Leu Ala Glu Arg Leu Glu Glu Lys Gly Ala Val Gln Thr Trp 275 280 285 Ile Glu Ser Leu 290 <210> 4 <211> 336 <212> PRT <213> Streptococcus pneumoniae <400> 4 Met Ile Ala Val Lys Thr Cys Gly Lys Leu Tyr Trp Ala Gly Glu Tyr 1 5 10 15 Ala Ile Leu Glu Pro Gly Gln Leu Ala Leu Ile Lys Asp Ile Pro Ile 20 25 30 Tyr Met Arg Ala Glu Ile Ala Phe Ser Asp Ser Tyr Arg Ile Tyr Ser 35 40 45 Asp Met Phe Asp Phe Ala Val Asp Leu Arg Pro Asn Pro Asp Tyr Ser 50 55 60 Leu Ile Gln Glu Thr Ile Ala Leu Met Gly Asp Phe Leu Ala Val Arg 65 70 75 80 Gly Gln Asn Leu Arg Pro Phe Ser Leu Lys Ile Cys Gly Lys Met Glu 85 90 95 
Arg Glu Gly Lys Lys Phe Gly Leu Gly Ser Ser Gly Ser Val Val Val 100 105 110 Leu Val Val Lys Ala Leu Leu Ala Leu Tyr Asn Leu Ser Val Asp Gln 115 120 125 Asn Leu Leu Phe Lys Leu Thr Ser Ala Val Leu Leu Lys Arg Gly Asp 130 135 140 Asn Gly Ser Met Gly Asp Leu Ala Cys Ile Val Ala Glu Asp Leu Val 145 150 155 160 Leu Tyr Gln Ser Phe Asp Arg Gln Lys Ala Ala Ala Trp Leu Glu Glu 165 170 175 Glu Asn Leu Ala Thr Val Leu Glu Arg Asp Trp Gly Phe Phe Ile Ser 180 185 190 Gln Val Lys Pro Thr Leu Glu Cys Asp Phe Leu Val Gly Trp Thr Lys 195 200 205 Glu Val Ala Val Ser Ser His Met Val Gln Gln Ile Lys Gln Asn Ile 210 215 220 Asn Gln Asn Phe Leu Ser Ser Ser Lys Glu Thr Val Val Ser Leu Val 225 230 235 240 Glu Ala Leu Glu Gln Gly Lys Ala Glu Lys Val Ile Glu Gln Val Glu 245 250 255 Val Ala Ser Lys Leu Leu Glu Gly Leu Ser Thr Asp Ile Tyr Thr Pro 260 265 270 Leu Leu Arg Gln Leu Lys Glu Ala Ser Gln Asp Leu Gln Ala Val Ala 275 280 285 Lys Ser Ser Gly Ala Gly Gly Gly Asp Cys Gly Ile Ala Leu Ser Phe 290 295 300 Asp Ala Gln Ser Ser Arg Asn Thr Leu Lys Asn Arg Trp Ala Asp Leu 305 310 315 320 Gly Ile Glu Leu Leu Tyr Gln Glu Arg Ile Gly His Asp Asp Lys Ser 325 330 335 <210> 5 <211> 317 <212> PRT <213> Streptococcus pneumoniae <400> 5 Met Asp Arg Glu Pro Val Thr Val Arg Ser Tyr Ala Asn Ile Ala Ile 1 5 10 15 Ile Lys Tyr Trp Gly Lys Lys Lys Glu Lys Glu Met Val Pro Ala Thr 20 25 30 Ser Ser Ile Ser Leu Thr Leu Glu Asn Met Tyr Thr Glu Thr Thr Leu 35 40 45 Ser Pro Leu Pro Ala Asn Val Thr Ala Asp Glu Phe Tyr Ile Asn Gly 50 55 60 Gln Leu Gln Asn Glu Val Glu His Ala Lys Met Ser Lys Ile Ile Asp 65 70 75 80 Arg Tyr Arg Pro Ala Gly Glu Gly Phe Val Arg Ile Asp Thr Gln Asn 85 90 95 Asn Met Pro Thr Ala Ala Gly Leu Ser Ser Ser Ser Ser Gly Leu Ser 100 105 110 Ala Leu Val Lys Ala Cys Asn Ala Tyr Phe Lys Leu Gly Leu Asp Arg 115 120 125 Ser Gln Leu Ala Gln Glu Ala Lys Phe Ala Ser Gly Ser Ser Ser Arg 130 135 140 Ser Phe Tyr Gly Pro Leu Gly Ala Trp Asp Lys Asp Ser Gly Glu Ile 145 150 155 160 Tyr Pro Val Glu Thr Asp Leu 
Lys Leu Ala Met Ile Met Leu Val Leu 165 170 175 Glu Asp Lys Lys Lys Pro Ile Ser Ser Arg Asp Gly Met Lys Leu Cys 180 185 190 Val Glu Thr Ser Thr Thr Phe Asp Asp Trp Val Arg Gln Ser Glu Lys 195 200 205 Asp Tyr Gln Asp Met Leu Ile Tyr Leu Lys Glu Asn Asp Phe Ala Lys 210 215 220 Ile Gly Glu Leu Thr Glu Lys Asn Ala Leu Ala Met His Ala Thr Thr 225 230 235 240 Lys Thr Ala Ser Pro Ala Phe Ser Tyr Leu Thr Asp Ala Ser Tyr Glu 245 250 255 Ala Met Ala Phe Val Arg Gln Leu Arg Glu Lys Gly Glu Ala Cys Tyr 260 265 270 Phe Thr Met Asp Ala Gly Pro Asn Val Lys Val Phe Cys Gln Glu Lys 275 280 285 Asp Leu Glu His Leu Ser Glu Ile Phe Gly Gln Arg Tyr Arg Leu Ile 290 295 300 Val Ser Lys Thr Lys Asp Leu Ser Gln Asp Asp Cys Cys 305 310 315 <210> 6 <211> 182 <212> PRT <213> E. coli K12 MG1655 <400> 6 Met Gln Thr Glu His Val Ile Leu Leu Asn Ala Gln Gly Val Pro Thr 1 5 10 15 Gly Thr Leu Glu Lys Tyr Ala Ala His Thr Ala Asp Thr Arg Leu His 20 25 30 Leu Ala Phe Ser Ser Trp Leu Phe Asn Ala Lys Gly Gln Leu Leu Val 35 40 45 Thr Arg Arg Ala Leu Ser Lys Lys Ala Trp Pro Gly Val Trp Thr Asn 50 55 60 Ser Val Cys Gly His Pro Gln Leu Gly Glu Ser Asn Glu Asp Ala Val 65 70 75 80 Ile Arg Arg Cys Arg Tyr Glu Leu Gly Val Glu Ile Thr Pro Pro Glu 85 90 95 Ser Ile Tyr Pro Asp Phe Arg Tyr Arg Ala Thr Asp Pro Ser Gly Ile 100 105 110 Val Glu Asn Glu Val Cys Pro Val Phe Ala Ala Arg Thr Thr Ser Ala 115 120 125 Leu Gln Ile Asn Asp Asp Glu Val Met Asp Tyr Gln Trp Cys Asp Leu 130 135 140 Ala Asp Val Leu His Gly Ile Asp Ala Thr Pro Trp Ala Phe Ser Pro 145 150 155 160 Trp Met Val Met Gln Ala Thr Asn Arg Glu Ala Arg Lys Arg Leu Ser 165 170 175 Ala Phe Thr Gln Leu Lys 180 <210> 7 <211> 307 <212> PRT <213> Pantoea agglomerans <400> 7 Met Val Ser Gly Ser Lys Ala Gly Val Ser Pro His Arg Glu Ile Glu 1 5 10 15 Val Met Arg Gln Ser Ile Asp Asp His Leu Ala Gly Leu Leu Pro Glu 20 25 30 Thr Asp Ser Gln Asp Ile Val Ser Leu Ala Met Arg Glu Gly Val Met 35 40 45 Ala Pro Gly Lys Arg Ile Arg Pro Leu Leu Met Leu Leu Ala Ala Arg 50 55 60 Asp 
Leu Arg Tyr Gln Gly Ser Met Pro Thr Leu Leu Asp Leu Ala Cys 65 70 75 80 Ala Val Glu Leu Thr His Thr Ala Ser Leu Met Leu Asp Asp Met Pro 85 90 95 Cys Met Asp Asn Ala Glu Leu Arg Arg Gly Gln Pro Thr Thr His Lys 100 105 110 Lys Phe Gly Glu Ser Val Ala Ile Leu Ala Ser Val Gly Leu Leu Ser 115 120 125 Lys Ala Phe Gly Leu Ile Ala Ala Thr Gly Asp Leu Pro Gly Glu Arg 130 135 140 Arg Ala Gln Ala Val Asn Glu Leu Ser Thr Ala Val Gly Val Gln Gly 145 150 155 160 Leu Val Leu Gly Gln Phe Arg Asp Leu Asn Asp Ala Ala Leu Asp Arg 165 170 175 Thr Pro Asp Ala Ile Leu Ser Thr Asn His Leu Lys Thr Gly Ile Leu 180 185 190 Phe Ser Ala Met Leu Gln Ile Val Ala Ile Ala Ser Ala Ser Ser Pro 195 200 205 Ser Thr Arg Glu Thr Leu His Ala Phe Ala Leu Asp Phe Gly Gln Ala 210 215 220 Phe Gln Leu Leu Asp Asp Leu Arg Asp Asp His Pro Glu Thr Gly Lys 225 230 235 240 Asp Arg Asn Lys Asp Ala Gly Lys Ser Thr Leu Val Asn Arg Leu Gly 245 250 255 Ala Asp Ala Ala Arg Gln Lys Leu Arg Glu His Ile Asp Ser Ala Asp 260 265 270 Lys His Leu Thr Phe Ala Cys Pro Gln Gly Gly Ala Ile Arg Gln Phe 275 280 285 Met His Leu Trp Phe Gly His His Leu Ala Asp Trp Ser Pro Val Met 290 295 300 Lys Ile Ala 305 <210> 8 <211> 309 <212> PRT <213> Pantoea agglomerans <400> 8 Met Ser Gln Pro Pro Leu Leu Asp His Ala Thr Gln Thr Met Ala Asn 1 5 10 15 Gly Ser Lys Ser Phe Ala Thr Ala Ala Lys Leu Phe Asp Pro Ala Thr 20 25 30 Arg Arg Ser Val Leu Met Leu Tyr Thr Trp Cys Arg His Cys Asp Asp 35 40 45 Val Ile Asp Asp Gln Thr His Gly Phe Ala Ser Glu Ala Ala Ala Glu 50 55 60 Glu Glu Ala Thr Gln Arg Leu Ala Arg Leu Arg Thr Leu Thr Leu Ala 65 70 75 80 Ala Phe Glu Gly Ala Glu Met Gln Asp Pro Ala Phe Ala Ala Phe Gln 85 90 95 Glu Val Ala Leu Thr His Gly Ile Thr Pro Arg Met Ala Leu Asp His 100 105 110 Leu Asp Gly Phe Ala Met Asp Val Ala Gln Thr Arg Tyr Val Thr Phe 115 120 125 Glu Asp Thr Leu Arg Tyr Cys Tyr His Val Ala Gly Val Val Gly Leu 130 135 140 Met Met Ala Arg Val Met Gly Val Arg Asp Glu Arg Val Leu Asp Arg 145 150 155 160 Ala Cys Asp Leu Gly Leu Ala 
Phe Gln Leu Thr Asn Ile Ala Arg Asp 165 170 175 Ile Ile Asp Asp Ala Ala Ile Asp Arg Cys Tyr Leu Pro Ala Glu Trp 180 185 190 Leu Gln Asp Ala Gly Leu Thr Pro Glu Asn Tyr Ala Ala Arg Glu Asn 195 200 205 Arg Ala Ala Leu Ala Arg Val Ala Glu Arg Leu Ile Asp Ala Ala Glu 210 215 220 Pro Tyr Tyr Ile Ser Ser Gln Ala Gly Leu His Asp Leu Pro Pro Arg 225 230 235 240 Cys Ala Trp Ala Ile Ala Thr Ala Arg Ser Val Tyr Arg Glu Ile Gly 245 250 255 Ile Lys Val Lys Ala Ala Gly Gly Ser Ala Trp Asp Arg Arg Gln His 260 265 270 Thr Ser Lys Gly Glu Lys Ile Ala Met Leu Met Ala Ala Pro Gly Gln 275 280 285 Val Ile Arg Ala Lys Thr Thr Arg Val Thr Pro Arg Pro Ala Gly Leu 290 295 300 Trp Gln Arg Pro Val 305 <210> 9 <211> 486 <212> PRT <213> Pantoea agglomerans <400> 9 Met Lys Lys Thr Val Val Ile Gly Ala Gly Phe Gly Gly Leu Ala Leu 1 5 10 15 Ala Ile Arg Leu Gln Ala Ala Gly Ile Pro Thr Val Leu Leu Glu Gln 20 25 30 Arg Asp Lys Pro Gly Gly Arg Ala Tyr Val Trp His Asp Gln Gly Phe 35 40 45 Thr Phe Asp Ala Gly Pro Thr Val Ile Thr Asp Pro Thr Ala Leu Glu 50 55 60 Ala Leu Phe Thr Leu Ala Gly Arg Arg Met Glu Asp Tyr Val Arg Leu 65 70 75 80 Leu Pro Val Lys Pro Phe Tyr Arg Leu Cys Trp Glu Ser Gly Lys Thr 85 90 95 Leu Asp Tyr Ala Asn Asp Ser Ala Glu Leu Glu Ala Gln Ile Thr Gln 100 105 110 Phe Asn Pro Arg Asp Val Glu Gly Tyr Arg Arg Phe Leu Ala Tyr Ser 115 120 125 Gln Ala Val Phe Gln Glu Gly Tyr Leu Arg Leu Gly Ser Val Pro Phe 130 135 140 Leu Ser Phe Arg Asp Met Leu Arg Ala Gly Pro Gln Leu Leu Lys Leu 145 150 155 160 Gln Ala Trp Gln Ser Val Tyr Gln Ser Val Ser Arg Phe Ile Glu Asp 165 170 175 Glu His Leu Arg Gln Ala Phe Ser Phe His Ser Leu Leu Val Gly Gly 180 185 190 Asn Pro Phe Thr Thr Ser Ser Ile Tyr Thr Leu Ile His Ala Leu Glu 195 200 205 Arg Glu Trp Gly Val Trp Phe Pro Glu Gly Gly Thr Gly Ala Leu Val 210 215 220 Asn Gly Met Val Lys Leu Phe Thr Asp Leu Gly Gly Glu Ile Glu Leu 225 230 235 240 Asn Ala Arg Val Glu Glu Leu Val Val Ala Asp Asn Arg Val Ser Gln 245 250 255 Val Arg Leu Ala Asp Gly Arg Ile Phe Asp Thr 
Asp Ala Val Ala Ser 260 265 270 Asn Ala Asp Val Val Asn Thr Tyr Lys Lys Leu Leu Gly His His Pro 275 280 285 Val Gly Gln Lys Arg Ala Ala Ala Leu Glu Arg Lys Ser Met Ser Asn 290 295 300 Ser Leu Phe Val Leu Tyr Phe Gly Leu Asn Gln Pro His Ser Gln Leu 305 310 315 320 Ala His His Thr Ile Cys Phe Gly Pro Arg Tyr Arg Glu Leu Ile Asp 325 330 335 Glu Ile Phe Thr Gly Ser Ala Leu Ala Asp Asp Phe Ser Leu Tyr Leu 340 345 350 His Ser Pro Cys Val Thr Asp Pro Ser Leu Ala Pro Pro Gly Cys Ala 355 360 365 Ser Phe Tyr Val Leu Ala Pro Val Pro His Leu Gly Asn Ala Pro Leu 370 375 380 Asp Trp Ala Gln Glu Gly Pro Lys Leu Arg Asp Arg Ile Phe Asp Tyr 385 390 395 400 Leu Glu Glu Arg Tyr Met Pro Gly Leu Arg Ser Gln Leu Val Thr Gln 405 410 415 Arg Ile Phe Thr Pro Ala Asp Phe His Asp Thr Leu Asp Ala His Leu 420 425 430 Gly Ser Ala Phe Ser Ile Glu Pro Leu Leu Thr Gln Ser Ala Trp Phe 435 440 445 Arg Pro His Asn Arg Asp Ser Asp Ile Ala Asn Leu Tyr Leu Val Gly 450 455 460 Ala Gly Thr His Pro Gly Ala Gly Ile Pro Gly Val Val Ala Ser Ala 465 470 475 480 Lys Ala Thr Ala Ser Leu 485 <210> 10 <211> 382 <212> PRT <213> Pantoea ananatis <400> 10 Met Gln Pro His Tyr Asp Leu Ile Leu Val Gly Ala Gly Leu Ala Asn 1 5 10 15 Gly Leu Ile Ala Leu Arg Leu Gln Gln Gln Gln Pro Asp Met Arg Ile 20 25 30 Leu Leu Ile Asp Ala Ala Pro Gln Ala Gly Gly Asn His Thr Trp Ser 35 40 45 Phe His His Asp Asp Leu Thr Glu Ser Gln His Arg Trp Ile Ala Pro 50 55 60 Leu Val Val His His Trp Pro Asp Tyr Gln Val Arg Phe Pro Thr Arg 65 70 75 80 Arg Arg Lys Leu Asn Ser Gly Tyr Phe Cys Ile Thr Ser Gln Arg Phe 85 90 95 Ala Glu Val Leu Gln Arg Gln Phe Gly Pro His Leu Trp Met Asp Thr 100 105 110 Ala Val Ala Glu Val Asn Ala Glu Ser Val Arg Leu Lys Lys Gly Gln 115 120 125 Val Ile Gly Ala Arg Ala Val Ile Asp Gly Arg Gly Tyr Ala Ala Asn 130 135 140 Ser Ala Leu Ser Val Gly Phe Gln Ala Phe Ile Gly Gln Glu Trp Arg 145 150 155 160 Leu Ser His Pro His Gly Leu Ser Ser Pro Ile Ile Met Asp Ala Thr 165 170 175 Val Asp Gln Gln Asn Gly Tyr Arg Phe Val Tyr Ser Leu 
Pro Leu Ser 180 185 190 Pro Thr Arg Leu Leu Ile Glu Asp Thr His Tyr Ile Asp Asn Ala Thr 195 200 205 Leu Asp Pro Glu Cys Ala Arg Gln Asn Ile Cys Asp Tyr Ala Ala Gln 210 215 220 Gln Gly Trp Gln Leu Gln Thr Leu Leu Arg Glu Glu Gln Gly Ala Leu 225 230 235 240 Pro Ile Thr Leu Ser Gly Asn Ala Asp Ala Phe Trp Gln Gln Arg Pro 245 250 255 Leu Ala Cys Ser Gly Leu Arg Ala Gly Leu Phe His Pro Thr Thr Gly 260 265 270 Tyr Ser Leu Pro Leu Ala Val Ala Val Ala Asp Arg Leu Ser Ala Leu 275 280 285 Asp Val Phe Thr Ser Ala Ser Ile His His Ala Ile Thr His Phe Ala 290 295 300 Arg Glu Arg Trp Gln Gln Gln Gly Phe Phe Arg Met Leu Asn Arg Met 305 310 315 320 Leu Phe Leu Ala Gly Pro Ala Asp Ser Arg Trp Arg Val Met Gln Arg 325 330 335 Phe Tyr Gly Leu Pro Glu Asp Leu Ile Ala Arg Phe Tyr Ala Gly Lys 340 345 350 Leu Thr Leu Thr Asp Arg Leu Arg Ile Leu Ser Gly Lys Pro Pro Val 355 360 365 Pro Val Leu Ala Ala Leu Gln Ala Ile Met Thr Thr His Arg 370 375 380 <210> 11 <211> 620 <212> PRT <213> E. coli K12 MG1655 <400> 11 Met Ser Phe Asp Ile Ala Lys Tyr Pro Thr Leu Ala Leu Val Asp Ser 1 5 10 15 Thr Gln Glu Leu Arg Leu Leu Pro Lys Glu Ser Leu Pro Lys Leu Cys 20 25 30 Asp Glu Leu Arg Arg Tyr Leu Leu Asp Ser Val Ser Arg Ser Ser Gly 35 40 45 His Phe Ala Ser Gly Leu Gly Thr Val Glu Leu Thr Val Ala Leu His 50 55 60 Tyr Val Tyr Asn Thr Pro Phe Asp Gln Leu Ile Trp Asp Val Gly His 65 70 75 80 Gln Ala Tyr Pro His Lys Ile Leu Thr Gly Arg Arg Asp Lys Ile Gly 85 90 95 Thr Ile Arg Gln Lys Gly Gly Leu His Pro Phe Pro Trp Arg Gly Glu 100 105 110 Ser Glu Tyr Asp Val Leu Ser Val Gly His Ser Ser Thr Ser Ile Ser 115 120 125 Ala Gly Ile Gly Ile Ala Val Ala Ala Glu Lys Glu Gly Lys Asn Arg 130 135 140 Arg Thr Val Cys Val Ile Gly Asp Gly Ala Ile Thr Ala Gly Met Ala 145 150 155 160 Phe Glu Ala Met Asn His Ala Gly Asp Ile Arg Pro Asp Met Leu Val 165 170 175 Ile Leu Asn Asp Asn Glu Met Ser Ile Ser Glu Asn Val Gly Ala Leu 180 185 190 Asn Asn His Leu Ala Gln Leu Leu Ser Gly Lys Leu Tyr Ser Ser Leu 195 200 205 Arg Glu Gly Gly Lys 
Lys Val Phe Ser Gly Val Pro Pro Ile Lys Glu 210 215 220 Leu Leu Lys Arg Thr Glu Glu His Ile Lys Gly Met Val Val Pro Gly 225 230 235 240 Thr Leu Phe Glu Glu Leu Gly Phe Asn Tyr Ile Gly Pro Val Asp Gly 245 250 255 His Asp Val Leu Gly Leu Ile Thr Thr Leu Lys Asn Met Arg Asp Leu 260 265 270 Lys Gly Pro Gln Phe Leu His Ile Met Thr Lys Lys Gly Arg Gly Tyr 275 280 285 Glu Pro Ala Glu Lys Asp Pro Ile Thr Phe His Ala Val Pro Lys Phe 290 295 300 Asp Pro Ser Ser Gly Cys Leu Pro Lys Ser Ser Gly Gly Leu Pro Ser 305 310 315 320 Tyr Ser Lys Ile Phe Gly Asp Trp Leu Cys Glu Thr Ala Ala Lys Asp 325 330 335 Asn Lys Leu Met Ala Ile Thr Pro Ala Met Arg Glu Gly Ser Gly Met 340 345 350 Val Glu Phe Ser Arg Lys Phe Pro Asp Arg Tyr Phe Asp Val Ala Ile 355 360 365 Ala Glu Gln His Ala Val Thr Phe Ala Ala Gly Leu Ala Ile Gly Gly 370 375 380 Tyr Lys Pro Ile Val Ala Ile Tyr Ser Thr Phe Leu Gln Arg Ala Tyr 385 390 395 400 Asp Gln Val Leu His Asp Val Ala Ile Gln Lys Leu Pro Val Leu Phe 405 410 415 Ala Ile Asp Arg Ala Gly Ile Val Gly Ala Asp Gly Gln Thr His Gln 420 425 430 Gly Ala Phe Asp Leu Ser Tyr Leu Arg Cys Ile Pro Glu Met Val Ile 435 440 445 Met Thr Pro Ser Asp Glu Asn Glu Cys Arg Gln Met Leu Tyr Thr Gly 450 455 460 Tyr His Tyr Asn Asp Gly Pro Ser Ala Val Arg Tyr Pro Arg Gly Asn 465 470 475 480 Ala Val Gly Val Glu Leu Thr Pro Leu Glu Lys Leu Pro Ile Gly Lys 485 490 495 Gly Ile Val Lys Arg Arg Gly Glu Lys Leu Ala Ile Leu Asn Phe Gly 500 505 510 Thr Leu Met Pro Glu Ala Ala Lys Val Ala Glu Ser Leu Asn Ala Thr 515 520 525 Leu Val Asp Met Arg Phe Val Lys Pro Leu Asp Glu Ala Leu Ile Leu 530 535 540 Glu Met Ala Ala Ser His Glu Ala Leu Val Thr Val Glu Glu Asn Ala 545 550 555 560 Ile Met Gly Gly Ala Gly Ser Gly Val Asn Glu Val Leu Met Ala His 565 570 575 Arg Lys Pro Val Pro Val Leu Asn Ile Gly Leu Pro Asp Phe Phe Ile 580 585 590 Pro Gln Gly Thr Gln Glu Glu Met Arg Ala Glu Leu Gly Leu Asp Ala 595 600 605 Ala Gly Met Glu Ala Lys Ile Lys Ala Trp Leu Ala 610 615 620 <210> 12 <211> 305 <212> PRT <213> 
Haematococcus pluvialis <400> 12 Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His Ile Pro Arg Val Asn 1 5 10 15 Ser Ala Gln Gln Pro Ser Cys Ala His Ala Arg Leu Gln Phe Lys Leu 20 25 30 Arg Ser Met Gln Met Thr Leu Met Gln Pro Ser Ile Ser Ala Asn Leu 35 40 45 Ser Arg Ala Glu Asp Arg Thr Asp His Met Arg Gly Ala Ser Thr Trp 50 55 60 Ala Gly Gly Gln Ser Gln Asp Glu Leu Met Leu Lys Asp Glu Cys Ile 65 70 75 80 Leu Val Asp Val Glu Asp Asn Ile Thr Gly His Ala Ser Lys Leu Glu 85 90 95 Cys His Lys Phe Leu Pro His Gln Pro Ala Gly Leu Leu His Arg Ala 100 105 110 Phe Ser Val Phe Leu Phe Asp Asp Gln Gly Arg Leu Leu Leu Gln Gln 115 120 125 Arg Ala Arg Ser Lys Ile Thr Phe Pro Ser Val Trp Thr Asn Thr Cys 130 135 140 Cys Ser His Pro Leu His Gly Gln Thr Pro Asp Glu Val Asp Gln Leu 145 150 155 160 Ser Gln Val Ala Asp Gly Thr Val Pro Gly Ala Lys Ala Ala Ala Ile 165 170 175 Arg Lys Leu Glu His Glu Leu Gly Ile Pro Ala His Gln Leu Pro Ala 180 185 190 Ser Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys Ala Ala Asp Val 195 200 205 Gln Pro Ala Ala Thr Gln Ser Ala Leu Trp Gly Glu His Glu Met Asp 210 215 220 Tyr Ile Leu Phe Ile Arg Ala Asn Val Thr Leu Ala Pro Asn Pro Asp 225 230 235 240 Glu Val Asp Glu Val Arg Tyr Val Thr Gln Glu Glu Leu Arg Gln Met 245 250 255 Met Gln Pro Asp Asn Gly Leu Gln Trp Ser Pro Trp Phe Arg Ile Ile 260 265 270 Ala Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu Asp Ala Ala Leu 275 280 285 Asn Thr Asp Lys His Glu Asp Trp Gly Thr Val His His Ile Asn Glu 290 295 300 Ala 305 <210> 13 <211> 275 <212> PRT <213> Uncultured marine bacterium 66A03 <400> 13 Met Gly Leu Met Leu Ile Asp Trp Cys Ala Leu Ala Leu Val Val Phe 1 5 10 15 Ile Gly Leu Pro His Gly Ala Leu Asp Ala Ala Ile Ser Phe Ser Met 20 25 30 Ile Ser Ser Ala Lys Arg Ile Ala Arg Leu Ala Gly Ile Leu Leu Ile 35 40 45 Tyr Leu Leu Leu Ala Thr Ala Phe Phe Leu Ile Trp Tyr Gln Leu Pro 50 55 60 Ala Phe Ser Leu Leu Ile Phe Leu Leu Ile Ser Ile Ile His Phe Gly 65 70 75 80 Met Ala Asp Phe Asn Ala Ser Pro Ser Lys Leu Lys Trp Pro His Ile 85 90 95 
Ile Ala His Gly Gly Val Val Thr Val Trp Leu Pro Leu Ile Gln Lys 100 105 110 Asn Glu Val Thr Lys Leu Phe Ser Ile Leu Thr Asn Gly Pro Thr Pro 115 120 125 Ile Leu Trp Asp Ile Leu Leu Ile Phe Phe Leu Cys Trp Ser Ile Gly 130 135 140 Val Cys Leu His Thr Tyr Glu Thr Leu Arg Ser Lys His Tyr Asn Ile 145 150 155 160 Ala Phe Glu Leu Ile Gly Leu Ile Phe Leu Ala Trp Tyr Ala Pro Pro 165 170 175 Leu Val Thr Phe Ala Thr Tyr Phe Cys Phe Ile His Ser Arg Arg His 180 185 190 Phe Ser Phe Val Trp Lys Gln Leu Gln His Met Ser Ser Lys Lys Met 195 200 205 Met Ile Gly Ser Ala Ile Ile Leu Ser Cys Thr Ser Trp Leu Ile Gly 210 215 220 Gly Gly Ile Tyr Phe Phe Leu Asn Ser Lys Met Ile Ala Ser Glu Ala 225 230 235 240 Ala Leu Gln Thr Val Phe Ile Gly Leu Ala Ala Leu Thr Val Pro His 245 250 255 Met Ile Leu Ile Asp Phe Ile Phe Arg Pro His Ser Ser Arg Ile Lys 260 265 270 Ile Lys Asn 275 <210> 14 <211> 566 <212> PRT <213> Mus musculus <400> 14 Met Glu Ile Ile Phe Gly Gln Asn Lys Lys Glu Gln Leu Glu Pro Val 1 5 10 15 Gln Ala Lys Val Thr Gly Ser Ile Pro Ala Trp Leu Gln Gly Thr Leu 20 25 30 Leu Arg Asn Gly Pro Gly Met His Thr Val Gly Glu Ser Lys Tyr Asn 35 40 45 His Trp Phe Asp Gly Leu Ala Leu Leu His Ser Phe Ser Ile Arg Asp 50 55 60 Gly Glu Val Phe Tyr Arg Ser Lys Tyr Leu Gln Ser Asp Thr Tyr Ile 65 70 75 80 Ala Asn Ile Glu Ala Asn Arg Ile Val Val Ser Glu Phe Gly Thr Met 85 90 95 Ala Tyr Pro Asp Pro Cys Lys Asn Ile Phe Ser Lys Ala Phe Ser Tyr 100 105 110 Leu Ser His Thr Ile Pro Asp Phe Thr Asp Asn Cys Leu Ile Asn Ile 115 120 125 Met Lys Cys Gly Glu Asp Phe Tyr Ala Thr Thr Glu Thr Asn Tyr Ile 130 135 140 Arg Lys Ile Asp Pro Gln Thr Leu Glu Thr Leu Glu Lys Val Asp Tyr 145 150 155 160 Arg Lys Tyr Val Ala Val Asn Leu Ala Thr Ser His Pro His Tyr Asp 165 170 175 Glu Ala Gly Asn Val Leu Asn Met Gly Thr Ser Val Val Asp Lys Gly 180 185 190 Arg Thr Lys Tyr Val Ile Phe Lys Ile Pro Ala Thr Val Pro Asp Ser 195 200 205 Lys Lys Lys Gly Lys Ser Pro Val Lys His Ala Glu Val Phe Cys Ser 210 215 220 Ile Ser Ser Arg Ser Leu Leu 
Ser Pro Ser Tyr Tyr His Ser Phe Gly 225 230 235 240 Val Thr Glu Asn Tyr Val Val Phe Leu Glu Gln Pro Phe Lys Leu Asp 245 250 255 Ile Leu Lys Met Ala Thr Ala Tyr Met Arg Gly Val Ser Trp Ala Ser 260 265 270 Cys Met Ser Phe Asp Arg Glu Asp Lys Thr Tyr Ile His Ile Ile Asp 275 280 285 Gln Arg Thr Arg Lys Pro Val Pro Thr Lys Phe Tyr Thr Asp Pro Met 290 295 300 Val Val Phe His His Val Asn Ala Tyr Glu Glu Asp Gly Cys Val Leu 305 310 315 320 Phe Asp Val Ile Ala Tyr Glu Asp Ser Ser Leu Tyr Gln Leu Phe Tyr 325 330 335 Leu Ala Asn Leu Asn Lys Asp Phe Glu Glu Lys Ser Arg Leu Thr Ser 340 345 350 Val Pro Thr Leu Arg Arg Phe Ala Val Pro Leu His Val Asp Lys Asp 355 360 365 Ala Glu Val Gly Ser Asn Leu Val Lys Val Ser Ser Thr Thr Ala Thr 370 375 380 Ala Leu Lys Glu Lys Asp Gly His Val Tyr Cys Gln Pro Glu Val Leu 385 390 395 400 Tyr Glu Gly Leu Glu Leu Pro Arg Ile Asn Tyr Ala Tyr Asn Gly Lys 405 410 415 Pro Tyr Arg Tyr Ile Phe Ala Ala Glu Val Gln Trp Ser Pro Val Pro 420 425 430 Thr Lys Ile Leu Lys Tyr Asp Ile Leu Thr Lys Ser Ser Leu Lys Trp 435 440 445 Ser Glu Glu Ser Cys Trp Pro Ala Glu Pro Leu Phe Val Pro Thr Pro 450 455 460 Gly Ala Lys Asp Glu Asp Asp Gly Val Ile Leu Ser Ala Ile Val Ser 465 470 475 480 Thr Asp Pro Gln Lys Leu Pro Phe Leu Leu Ile Leu Asp Ala Lys Ser 485 490 495 Phe Thr Glu Leu Ala Arg Ala Ser Val Asp Ala Asp Met His Leu Asp 500 505 510 Leu His Gly Leu Phe Ile Pro Asp Ala Asp Trp Asn Ala Val Lys Gln 515 520 525 Thr Pro Ala Glu Thr Gln Glu Val Glu Asn Ser Asp His Pro Thr Asp 530 535 540 Pro Thr Ala Pro Glu Leu Ser His Ser Glu Asn Asp Phe Thr Ala Gly 545 550 555 560 His Gly Gly Ser Ser Leu 565 <210> 15 <211> 350 <212> PRT <213> Natronomonas pharaonis ATCC35678 <400> 15 Met Ser Asn Ala Ser Leu Arg Pro Ser Gly Thr Ala Ser Ala Thr Leu 1 5 10 15 Phe Arg Leu Ala Phe Leu Pro Gly Trp Ala Val Ile Ala Ala Thr Thr 20 25 30 Gly Ala Phe Leu Val Gly Ala Ser Leu Pro Leu Thr Tyr Gln Leu Ile 35 40 45 Pro Leu Ala Ala Ser Val Val Leu Leu Gly Leu Pro His Gly Ala Val 50 55 60 Asp His Leu 
Ala Leu Pro Arg Thr Arg Asn Glu Arg Val Thr Val Arg 65 70 75 80 Trp Leu Ala Ala Ile Gly Val Leu Tyr Ala Val Val Gly Gly Leu Tyr 85 90 95 Ala Ala Val Trp Phe Leu Ala Pro Val Gly Ala Val Ala Ala Phe Ile 100 105 110 Phe Met Thr Trp Val His Trp Gly Gln Gly Glu Ile Tyr Pro Leu Val 115 120 125 Ala Leu Ala Asp Ala Asp His Leu Asp Gly Arg Leu Glu Arg Gly Leu 130 135 140 Thr Ala Ala Ile Arg Gly Ala Leu Pro Met Leu Val Pro Phe Val Ala 145 150 155 160 Phe Pro Asp Gln Tyr Glu Leu Val Val Thr Thr Leu Val Gly Leu Phe 165 170 175 Asp Ala Asp Ala Ala Ala Thr Ala Ala Ala Ala Phe Thr Pro Thr Ala 180 185 190 Arg Leu Ala Val Ala Val Thr Val Gly Gly Leu Val Ala Val Thr Leu 195 200 205 Gly Ile Gly Ala Val Ala Ala Ser Glu Thr Gly Trp Gly Pro Trp Leu 210 215 220 Leu Asp Ala Gly Glu Thr Gly Leu Leu Ile Leu Phe Phe Ala Ala Val 225 230 235 240 Pro Pro Ile Phe Ala Ile Gly Leu Tyr Phe Cys Phe Trp His Ser Leu 245 250 255 Arg His Ile Val Arg Leu Leu Ala Val Asp Asn Arg Ala Ala Pro Ala 260 265 270 Leu Asp Gly Arg Arg Tyr Gly Ala Ala Leu Ala Arg Phe Ala Arg Asp 275 280 285 Ala Ala Pro Leu Ser Ala Ala Ser Leu Val Leu Leu Gly Leu Leu Tyr 290 295 300 Leu Ala Val Pro Gly Ser Val Asp Ser Pro Leu Ser Leu Val Gly Thr 305 310 315 320 Tyr Leu Val Leu Ile Ala Val Leu Thr Leu Pro His Val Val Val Val 325 330 335 Ala Trp Met Asp His Glu Gln Arg Leu Trp Arg Pro Gly Ala 340 345 350 <210> 16 <211> 284 <212> PRT <213> Halobacterium salinarum ATCC700922 <400> 16 Met Pro His Gly Ala Ile Asp Tyr Leu Ala Leu Pro Arg Ala Val Thr 1 5 10 15 Gly Thr Val Thr Val Arg Trp Leu Ala Val Val Gly Val Leu Tyr Leu 20 25 30 Val Leu Gly Gly Gly Tyr Ala Ala Ala Trp Phe Phe Ala Pro Val Pro 35 40 45 Ala Ala Phe Ala Phe Val Ala Ile Thr Trp Leu His Trp Gly Gln Gly 50 55 60 Asp Leu Tyr Pro Leu Leu Asp Phe Leu Asp Val Asp Tyr Leu Asp Thr 65 70 75 80 Arg Pro Arg Arg Ala Ala Thr Val Leu Ile Arg Gly Gly Leu Pro Met 85 90 95 Leu Val Pro Leu Leu Gly Phe Pro Glu Arg Tyr Arg Ser Val Val Asp 100 105 110 Ala Phe Ala Ala Pro Phe Gly Gly Ser Val Gly 
Asp Leu Ala Val Phe 115 120 125 Asp Pro Arg Val Arg Leu Trp Leu Gly Val Ala Phe Ala Ala Ala Thr 130 135 140 Val Ala Val Leu Ala Ala Gly Arg Arg Arg Thr His Ser Pro Gly Ala 145 150 155 160 Trp Arg Val Asp Ala Ala Glu Thr Leu Leu Leu Trp Val Phe Phe Phe 165 170 175 Val Val Pro Pro Val Phe Ala Val Gly Val Tyr Phe Cys Val Trp His 180 185 190 Ser Val Arg His Val Ala Arg Ala Ile Ala Val Asp Gly Ser Val His 195 200 205 Pro Ser Leu Arg Ala Gly Asp Ile Leu Gly Pro Leu Ala Arg Phe Gly 210 215 220 Val Glu Ala Ala Pro Met Thr Ala Ala Ala Leu Ala Leu Gly Gly Val 225 230 235 240 Leu Trp Trp Ala Val Pro Asn Pro Pro Thr Thr Leu Glu Ser Gly Ala 245 250 255 Ala Leu Tyr Leu Val Leu Ile Ala Val Leu Thr Leu Pro His Val Ala 260 265 270 Val Val Thr Trp Met Asp Arg Val Gln Gly Val Leu 275 280 <210> 17 <211> 284 <212> PRT <213> Halobacterium salinarum ATCC700922 <400> 17 Met Pro His Gly Ala Ile Asp Tyr Leu Ala Leu Pro Arg Ala Val Thr 1 5 10 15 Gly Thr Val Thr Val Arg Trp Leu Ala Val Val Gly Val Leu Tyr Leu 20 25 30 Val Leu Gly Gly Gly Tyr Ala Ala Ala Trp Phe Phe Ala Pro Val Pro 35 40 45 Ala Ala Phe Ala Phe Val Ala Ile Thr Trp Leu His Trp Gly Gln Gly 50 55 60 Asp Leu Tyr Pro Leu Leu Asp Phe Leu Asp Val Asp Tyr Leu Asp Thr 65 70 75 80 Arg Pro Arg Arg Ala Ala Thr Val Leu Ile Arg Gly Gly Leu Pro Met 85 90 95 Leu Val Pro Leu Leu Gly Phe Pro Glu Arg Tyr Arg Ser Val Val Asp 100 105 110 Ala Phe Ala Ala Pro Phe Gly Gly Ser Val Gly Asp Leu Ala Val Phe 115 120 125 Asp Pro Arg Val Arg Leu Trp Leu Gly Val Ala Phe Ala Ala Ala Thr 130 135 140 Val Ala Val Leu Ala Ala Gly Arg Arg Arg Thr His Ser Pro Gly Ala 145 150 155 160 Trp Arg Val Asp Ala Ala Glu Thr Leu Leu Leu Trp Val Phe Phe Phe 165 170 175 Val Val Pro Pro Val Phe Ala Val Gly Val Tyr Phe Cys Val Trp His 180 185 190 Ser Val Arg His Val Ala Arg Ala Ile Ala Val Asp Gly Ser Val His 195 200 205 Pro Ser Leu Arg Ala Gly Asp Ile Leu Gly Pro Leu Ala Arg Phe Gly 210 215 220 Val Glu Ala Ala Pro Met Thr Ala Ala Ala Leu Ala Leu Gly Gly Val 225 230 235 240 Leu 
Trp Trp Ala Val Pro Asn Pro Pro Thr Thr Leu Glu Ser Gly Ala 245 250 255 Ala Leu Tyr Leu Val Leu Ile Ala Val Leu Thr Leu Pro His Val Ala 260 265 270 Val Val Thr Trp Met Asp Arg Val Gln Gly Val Leu 275 280 <210> 18 <211> 2412 <212> DNA <213> Enterococcus faecalis <400> 18 ttgaaaacag tagttattat tgatgcatta cgaacaccaa ttggaaaata taaaggcagc 60 ttaagtcaag taagtgccgt agacttagga acacatgtta caacacaact tttaaaaaga 120 cattccacta tttctgaaga aattgatcaa gtaatctttg gaaatgtttt acaagctgga 180 aatggccaaa atcccgcacg acaaatagca ataaacagcg gtttatctca tgaaattccc 240 gcaatgacag ttaatgaggt ctgcggatca ggaatgaagg ccgttatttt ggcgaaacaa 300 ttgattcaat taggagaagc ggaagtttta attgctggcg ggattgagaa tatgtcccaa 360 gcacctaaat tacaacgatt taattacgaa acagaaagct atgatgcgcc tttttctagt 420 atgatgtacg atgggttaac ggatgccttt agtggtcaag caatgggctt aactgctgaa 480 aatgtggccg aaaagtatca tgtaactaga gaagagcaag atcaattttc tgtacattca 540 caattaaaag cagctcaagc acaagcagaa gggatattcg ctgacgaaat agccccatta 600 gaagtatcag gaacgcttgt ggagaaagat gaagggattc gccctaattc gagcgttgag 660 aagctaggaa cgcttaaaac agtttttaaa gaagacggta ctgtaacagc agggaatgca 720 tcaaccatta atgatggggc ttctgctttg attattgctt cacaagaata tgccgaagca 780 cacggtcttc cttatttagc tattattcga gacagtgtgg aagtcggtat tgatccagcc 840 tatatgggaa tttcgccgat taaagccatt caaaaactgt tagcgcgcaa tcaacttact 900 acggaagaaa ttgatctgta tgaaatcaac gaagcatttg cagcaacttc aatcgtggtc 960 caaagagaac tggctttacc agaggaaaag gtcaacattt atggtggcgg tatttcatta 1020 ggtcatgcga ttggtgccac aggtgctcgt ttattaacga gtttaagtta tcaattaaat 1080 caaaaagaaa agaaatatgg agtggcttct ttatgtatcg gcggtggctt aggactcgct 1140 atgctactag agagacctca gcaaaaaaaa aacagccgat tttatcaaat gagtcctgag 1200 gaacgcctgg cttctcttct taatgaaggc cagatttctg ctgatacaaa aaaagaattt 1260 gaaaatacgg ctttatcttc gcagattgcc aatcatatga ttgaaaatca aatcagtgaa 1320 acagaagtgc cgatgggcgt tggcttacat ttaacagtgg acgaaactga ttatttggta 1380 ccaatggcga cagaagagcc ctcagtgatt gcggctttga gtaatggtgc aaaaatagca 1440 caaggattta aaacagtgaa tcaacaacgt 
ttaatgcgtg gacaaatcgt tttttacgat 1500 gttgcagacg ccgagtcatt gattgatgaa ctacaagtaa gagaaacgga aatttttcaa 1560 caagcagagt taagttatcc atctatcgtt aaacgcggcg gcggcttaag agatttgcaa 1620 tatcgtgctt ttgatgaatc atttgtatct gtcgactttt tagtagatgt taaggatgca 1680 atgggggcaa atatcgttaa cgctatgttg gaaggtgtgg ccgagttgtt ccgtgaatgg 1740 tttgcggagc aaaagatttt attcagtatt ttaagtaatt atgccacgga gtcggttgtt 1800 acgatgaaaa cggctattcc agtttcacgt ttaagtaagg ggagcaatgg ccgggaaatt 1860 gctgaaaaaa ttgttttagc ttcacgctat gcttcattag atccttatcg ggcagtcacg 1920 cataacaaag ggatcatgaa tggcattgaa gctgtcgttt tagctacagg aaatgataca 1980 cgcgctgtta gcgcttcttg tcatgctttt gcggtgaagg aaggtcgcta ccaaggtttg 2040 actagttgga cgctggatgg cgaacaacta attggtgaaa tttcagttcc gcttgcgtta 2100 gccacggttg gcggtgccac aaaagtctta cctaaatctc aagcagctgc tgatttgtta 2160 gcagtgacgg atgcaaaaga actaagtcga gtagtagcgg ctgttggttt ggcacaaaat 2220 ttagcggcgt tacgggcctt agtctctgaa ggaattcaaa aaggacacat ggctctacaa 2280 gcacgttctt tagcgatgac ggtcggagct actggtaaag aagttgaggc agtcgctcaa 2340 caattaaaac gtcaaaaaac gatgaaccaa gaccgagcct tggctatttt aaatgattta 2400 agaaaacaat aa 2412 <210> 19 <211> 1152 <212> DNA <213> Enterococcus faecalis <400> 19 atgacaattg ggattgataa aattagtttt tttgtgcccc cttattatat tgatatgacg 60 gcactggctg aagccagaaa tgtagaccct ggaaaatttc atattggtat tgggcaagac 120 caaatggcgg tgaacccaat cagccaagat attgtgacat ttgcagccaa tgccgcagaa 180 gcgatcttga ccaaagaaga taaagaggcc attgatatgg tgattgtcgg gactgagtcc 240 agtatcgatg agtcaaaagc ggccgcagtt gtcttacatc gtttaatggg gattcaacct 300 ttcgctcgct ctttcgaaat caaggaagct tgttacggag caacagcagg cttacagtta 360 gctaagaatc acgtagcctt acatccagat aaaaaagtct tggttgtagc agcagatatt 420 gcaaaatatg gattaaattc tggcggtgag cctacacaag gagctggggc ggttgcaatg 480 ttagttgcta gtgaaccgcg catcttggct ttaaaagagg ataatgtgat gctgacgcaa 540 gatatctatg acttttggcg tccaacaggc catccgtatc ctatggtcga tggtcctttg 600 tcaaacgaaa cctacatcca atcttttgcc caagtctggg atgaacataa aaaaagaacc 660 ggtcttgatt ttgcagatta tgatgcttta 
gcgttccata ttccttacac aaaaatgggc 720 aaaaaagcct tattagcaaa aatctccgac caaactgaag cagaacagga acgaatttta 780 gcccgttatg aagaaagcat catctatagt cgtcgcgtag gaaacttgta tacgggttca 840 ctttatctgg gactcatttc ccttttagaa aatgcaacga ctttaaccgc aggcaatcaa 900 attgggttat tcagttatgg ttctggtgct gtcgctgaat ttttcactgg tgaattagta 960 gctggttatc aaaatcattt acaaaaagaa actcatttag cactgctaga taatcggaca 1020 gaactttcta tcgctgaata tgaagccatg tttgcagaaa ctttagacac agatattgat 1080 caaacgttag aagatgaatt aaaatatagt atttctgcta ttaataatac cgttcgctct 1140 tatcgaaact aa 1152 <210> 20 <211> 879 <212> DNA <213> Streptococcus pneumoniae <400> 20 atgacaaaaa aagttggtgt cggtcaggca catagtaaga taattttaat aggggaacat 60 gcggtcgttt acggttatcc tgccatttcc ctgcctcttt tggaggtgga ggtgacctgt 120 aaggtagttt ctgcagagag tccttggcgc ctttatgagg aggatacctt gtccatggcg 180 gtttatgcct cactggagta tttggatatc acagaagcct gcgttcgttg tgagattgac 240 tcggctatcc ctgagaaacg ggggatgggt tcgtcagcgg ctatcagcat agcggccatt 300 cgtgcggtat ttgactacta tcaggctgat ctgcctcatg atgtactaga aatcttggtc 360 aatcgagctg agatgattgc ccatatgaat cctagtggtt tggatgctaa gacctgtctc 420 agtgaccaac ctattcgctt tatcaagaac gtaggattta cagaacttga gatggattta 480 tccgcctatt tggtgattgc cgatacgggt gtttatggtc atactcgtga agccatccaa 540 gtggttcaaa ataagggcaa ggatgcccta ccgtttttgc atgccttggg agaattaacc 600 cagcaagcag aagttgcgat ttcacaaaaa tatgctgaag gactgggact aatcttcagt 660 caagctcatt tacatctaaa agaaattgga gtcagtagcc ctgaggcaga ctttttggtt 720 gaaacggctc ttagctatgg tgctctgggt gccaagatga gcggtggtgg gctaggaggt 780 tgtatcatag ccttggtaac caatttgacg cacgcacaag aactagcaga aagattagaa 840 gagaaaggag ctgttcagac atggatagag agcctgtaa 879 <210> 21 <211> 1011 <212> DNA <213> Streptococcus pneumoniae <400> 21 atgattgctg ttaaaacttg cggaaaactc tattgggcag gtgaatatgc tattttagag 60 ccagggcagt tagctttgat aaaggatatt cccatctata tgagggctga gattgctttt 120 tctgacagct accgtatcta ttcagatatg tttgatttcg cagtggactt aaggcccaat 180 cctgactaca gcttgattca agaaacgatt gctttgatgg gagacttcct cgctgttcgc 240 
ggtcagaatt taagaccttt ttccctaaaa atctgtggca aaatggaacg agaagggaaa 300 aagtttggtc taggttctag tggcagcgtc gttgtcttgg ttgtcaaggc tttactggct 360 ctctataatc tttcggttga tcagaatctc ttgttcaagc tgactagcgc tgtcttgctc 420 aagcgaggag acaatggttc catgggcgac cttgcctgta ttgtggcaga ggatttggtt 480 ctttaccagt catttgatcg ccagaaggcg gctgcttggt tagaagaaga aaacttggcg 540 acagttctgg agcgtgattg gggatttttt atctcacaag tgaaaccaac tttagaatgt 600 gatttcttag tgggatggac caaggaagtg gctgtatcga gtcacatggt ccagcaaatc 660 aagcaaaata tcaatcaaaa ttttttaagt tcctcaaaag aaacggtggt ttctttggtc 720 gaagccttgg agcaggggaa agccgaaaaa gttatcgagc aagtagaagt agccagcaag 780 cttttagaag gcttgagtac agatatttac acgcctttgc ttagacagtt gaaagaagcc 840 agtcaagatt tgcaggccgt tgccaagagt agtggtgctg gtggtggtga ctgtggcatc 900 gccctgagtt ttgatgcgca atcttctcga aacactttaa aaaatcgttg ggccgatctg 960 gggattgagc tcttatatca agaaaggata ggacatgacg acaaatcgta a 1011 <210> 22 <211> 954 <212> DNA <213> Streptococcus pneumoniae <400> 22 atggatagag agcctgtaac agtacgttcc tacgcaaata ttgctattat caaatattgg 60 ggaaagaaaa aagaaaaaga gatggtgcct gctactagca gtatttctct aactttggaa 120 aatatgtata cagagacgac cttgtcgcct ttaccagcca atgtaacagc tgacgaattt 180 tacatcaatg gtcagctaca aaatgaggtc gagcatgcca agatgagtaa gattattgac 240 cgttatcgtc cagctggtga gggctttgtc cgtatcgata ctcaaaacaa tatgcctacg 300 gcagcgggcc tgtcctcaag ttctagtggt ttgtccgccc tggtcaaggc ttgtaatgct 360 tatttcaagc ttggattgga tagaagtcag ttggcacagg aagccaaatt tgcctcaggc 420 tcttcttctc ggagttttta tggaccacta ggagcctggg ataaggatag tggagaaatt 480 taccctgtag agacagactt gaaactagct atgattatgt tggtgctaga ggacaagaaa 540 aaaccaatct ctagccgtga cgggatgaaa ctttgtgtgg aaacctcgac gacttttgac 600 gactgggttc gtcagtctga gaaggactat caggatatgc tgatttatct caaggaaaat 660 gattttgcca agattggaga attaacggag aaaaatgctc tggctatgca tgctacgaca 720 aagactgcta gtccagcctt ttcttatctg acggatgcct cttatgaggc tatggccttt 780 gttcgccagc ttcgtgagaa aggagaggcc tgctacttta ccatggatgc tggtcccaat 840 gttaaggtct tctgtcagga gaaagacttg gagcatttgt 
cagaaatttt cggtcagcgt 900 tatcgcttga ttgtgtcaaa aacaaaggat ttgagtcaag atgattgctg ttaa 954 <210> 23 <211> 546 <212> DNA <213> E. coli K12 MG1655 <400> 23 atgcaaacgg aacacgtcat tttattgaat gcacagggag ttcccacggg tacgctggaa 60 aagtatgccg cacacacggc agacacccgc ttacatctcg cgttctccag ttggctgttt 120 aatgccaaag gacaattatt agttacccgc cgcgcactga gcaaaaaagc atggcctggc 180 gtgtggacta actcggtttg tgggcaccca caactgggag aaagcaacga agacgcagtg 240 atccgccgtt gccgttatga gcttggcgtg gaaattacgc ctcctgaatc tatctatcct 300 gactttcgct accgcgccac cgatccgagt ggcattgtgg aaaatgaagt gtgtccggta 360 tttgccgcac gcaccactag tgcgttacag atcaatgatg atgaagtgat ggattatcaa 420 tggtgtgatt tagcagatgt attacacggt attgatgcca cgccgtgggc gttcagtccg 480 tggatggtga tgcaggcgac aaatcgcgaa gccagaaaac gattatctgc atttacccag 540 cttaaa 546 <210> 24 <211> 918 <212> DNA <213> Haematococcus pluvialis <400> 24 atgcttcgtt cgttgctcag aggcctcacg catatccccc gcgtgaactc cgcccagcag 60 cccagctgtg cacacgcgcg actccagttt aagctcagga gcatgcagat gacgctcatg 120 cagcccagca tctcagccaa tctgtcgcgc gccgaggacc gcacagacca catgaggggt 180 gcaagcacct gggcaggcgg gcagtcgcag gatgagctga tgctgaagga cgagtgcatc 240 ttggtggatg ttgaggacaa catcacaggc catgccagca agctggagtg tcacaagttc 300 ctaccacatc agcctgcagg cctgctgcac cgggccttct ctgtgttcct gtttgacgat 360 caggggcgac tgctgctgca acagcgtgca cgctcaaaaa tcaccttccc aagtgtgtgg 420 acgaacacct gctgcagcca ccctttacat gggcagaccc cagatgaggt ggaccaacta 480 agccaggtgg ccgacggaac agtacctggc gcaaaggctg ctgccatccg caagttggag 540 cacgagctgg ggataccagc gcaccagctg ccggcaagcg cgtttcgctt cctcacgcgt 600 ttgcactact gtgccgcgga cgtgcagcca gctgcgacac aatcagcgct ctggggcgag 660 cacgaaatgg actacatctt gttcatccgg gccaacgtca ccttggcgcc caaccctgac 720 gaggtggacg aagtcaggta cgtgacgcaa gaggagctgc ggcagatgat gcagccggac 780 aacgggctgc aatggtcgcc gtggtttcgc atcatcgccg cgcgcttcct tgagcgttgg 840 tgggctgacc tggacgcggc cctaaacact gacaaacacg aggattgggg aacggtgcat 900 cacatcaacg aagcgtga 918 <210> 25 <211> 1884 <212> DNA <213> E. 
coli K12 MG1655 <400> 25 atggaattca ggaggcccct gatgagtttt gatattgcca aatacccgac cctggcactg 60 gtcgactcca cccaggagtt acgactgttg ccgaaagaga gtttaccgaa actctgcgac 120 gaactgcgcc gctatttact cgacagcgtg agccgttcca gcgggcactt cgcctccggg 180 ctgggcacgg tcgaactgac cgtggcgctg cactatgtct acaacacccc gtttgaccaa 240 ttgatttggg atgtggggca tcaggcttat ccgcataaaa ttttgaccgg acgccgcgac 300 aaaatcggca ccatccgtca gaaaggcggt ctgcacccgt tcccgtggcg cggcgaaagc 360 gaatatgacg tattaagcgt cgggcattca tcaacctcca tcagtgccgg aattggtatt 420 gcggttgctg ccgaaaaaga aggcaaaaat cgccgcaccg tctgtgtcat tggcgatggc 480 gcgattaccg caggcatggc gtttgaagcg atgaatcacg cgggcgatat ccgtcctgat 540 atgctggtga ttctcaacga caatgaaatg tcgatttccg aaaatgtcgg cgcgctcaac 600 aaccatctgg cacagctgct ttccggtaag ctttactctt cactgcgcga aggcgggaaa 660 aaagttttct ctggcgtgcc gccaattaaa gagctgctca aacgcaccga agaacatatt 720 aaaggcatgg tagtgcctgg cacgttgttt gaagagctgg gctttaacta catcggcccg 780 gtggacggtc acgatgtgct ggggcttatc accacgctaa agaacatgcg cgacctgaaa 840 ggcccgcagt tcctgcatat catgaccaaa aaaggtcgtg gttatgaacc ggcagaaaaa 900 gacccgatca ctttccacgc cgtgcctaaa tttgatccct ccagcggttg tttgccgaaa 960 agtagcggcg gtttgccgag ctattcaaaa atctttggcg actggttgtg cgaaacggca 1020 gcgaaagaca acaagctgat ggcgattact ccggcgatgc gtgaaggttc cggcatggtc 1080 gagttttcac gtaaattccc ggatcgctac ttcgacgtgg caattgccga gcaacacgcg 1140 gtgacctttg ctgcgggtct ggcgattggt gggtacaaac ccattgtcgc gatttactcc 1200 actttcctgc aacgcgccta tgatcaggtg ctgcatgacg tggcgattca aaagcttccg 1260 gtcctgttcg ccatcgaccg cgcgggcatt gttggtgctg acggtcaaac ccatcagggt 1320 gcttttgatc tctcttacct gcgctgcata ccggaaatgg tcattatgac cccgagcgat 1380 gaaaacgaat gtcgccagat gctctatacc ggctatcact ataacgatgg cccgtcagcg 1440 gtgcgctacc cgcgtggcaa cgcggtcggc gtggaactga cgccgctgga aaaactacca 1500 attggcaaag gcattgtgaa gcgtcgtggc gagaaactgg cgatccttaa ctttggtacg 1560 ctgatgccag aagcggcgaa agtcgccgaa tcgctgaacg ccacgctggt cgatatgcgt 1620 tttgtgaaac cgcttgatga agcgttaatt ctggaaatgg ccgccagcca tgaagcgctg 1680 
gtcaccgtag aagaaaacgc cattatgggc ggcgcaggca gcggcgtgaa cgaagtgctg 1740 atggcccatc gtaaaccagt acccgtgctg aacattggcc tgccggactt ctttattccg 1800 caaggaactc aggaagaaat gcgcgccgaa ctcggcctcg atgccgctgg tatggaagcc 1860 aaaatcaagg cctggctggc ataa 1884 <210> 26 <211> 924 <212> DNA <213> Pantoea agglomerans <400> 26 atggtgagtg gcagtaaagc gggcgtttcg cctcatcgcg aaatagaagt aatgagacaa 60 tccattgacg atcacctggc tggcctgtta cctgaaaccg acagccagga tatcgtcagc 120 cttgcgatgc gtgaaggcgt catggcaccc ggtaaacgga tccgtccgct gctgatgctg 180 ctggccgccc gcgacctccg ctaccagggc agtatgccta cgctgctcga tctcgcctgc 240 gccgttgaac tgacccatac cgcgtcgctg atgctcgacg acatgccctg catggacaac 300 gccgagctgc gccgcggtca gcccactacc cacaaaaaat ttggtgagag cgtggcgatc 360 cttgcctccg ttgggctgct ctctaaagcc tttggtctga tcgccgccac cggcgatctg 420 ccgggggaga ggcgtgccca ggcggtcaac gagctctcta ccgccgtggg cgtgcagggc 480 ctggtactgg ggcagtttcg cgatcttaac gatgccgccc tcgaccgtac ccctgacgct 540 atcctcagca ccaaccacct caagaccggc attctgttca gcgcgatgct gcagatcgtc 600 gccattgctt ccgcctcgtc gccgagcacg cgagagacgc tgcacgcctt cgccctcgac 660 ttcggccagg cgtttcaact gctggacgat ctgcgtgacg atcacccgga aaccggtaaa 720 gatcgcaata aggacgcggg aaaatcgacg ctggtcaacc ggctgggcgc agacgcggcc 780 cggcaaaagc tgcgcgagca tattgattcc gccgacaaac acctcacttt tgcctgtccg 840 cagggcggcg ccatccgaca gtttatgcat ctgtggtttg gccatcacct tgccgactgg 900 tcaccggtca tgaaaatcgc ctga 924 <210> 27 <211> 930 <212> DNA <213> Pantoea agglomerans <400> 27 atgagccaac cgccgctgct tgaccacgcc acgcagacca tggccaacgg ctcgaaaagt 60 tttgccaccg ctgcgaagct gttcgacccg gccacccgcc gtagcgtgct gatgctctac 120 acctggtgcc gccactgcga tgacgtcatt gacgaccaga cccacggctt cgccagcgag 180 gccgcggcgg aggaggaggc cacccagcgc ctggcccggc tgcgcacgct gaccctggcg 240 gcgtttgaag gggccgagat gcaggatccg gccttcgctg cctttcagga ggtggcgctg 300 acccacggta ttacgccccg catggcgctc gatcacctcg acggctttgc gatggacgtg 360 gctcagaccc gctatgtcac ctttgaggat acgctgcgct actgctatca cgtggcgggc 420 gtggtgggtc tgatgatggc cagggtgatg ggcgtgcggg atgagcgggt 
gctggatcgc 480 gcctgcgatc tggggctggc cttccagctg acgaatatcg cccgggatat tattgacgat 540 gcggctattg accgctgcta tctgcccgcc gagtggctgc aggatgccgg gctgaccccg 600 gagaactatg ccgcgcggga gaatcgggcc gcgctggcgc gggtggcgga gcggcttatt 660 gatgccgcag agccgtacta catctcctcc caggccgggc tacacgatct gccgccgcgc 720 tgcgcctggg cgatcgccac cgcccgcagc gtctaccggg agatcggtat taaggtaaaa 780 gcggcgggag gcagcgcctg ggatcgccgc cagcacacca gcaaaggtga aaaaattgcc 840 atgctgatgg cggcaccggg gcaggttatt cgggcgaaga cgacgagggt gacgccgcgt 900 ccggccggtc tttggcagcg tcccgtttag 930 <210> 28 <211> 1459 <212> DNA <213> Pantoea agglomerans <400> 28 atgaaaaaaa ccgttgtgat tggcgcaggc tttggtggcc tggcgctggc gattcgcctg 60 caggcggcag ggatcccaac cgtactgctg gagcagcggg acaagcccgg cggtcgggcc 120 tacgtctggc atgaccaggg ctttaccttt gacgccgggc cgacggtgat caccgatcct 180 accgcgcttg aggcgctgtt caccctggcc ggcaggcgca tggaggatta cgtcaggctg 240 ctgccggtaa aacccttcta ccgactctgc tgggagtccg ggaagaccct cgactatgct 300 aacgacagcg ccgagcttga ggcgcagatt acccagttca acccccgcga cgtcgagggc 360 taccggcgct ttctggctta ctcccaggcg gtattccagg agggatattt gcgcctcggc 420 agcgtgccgt tcctctcttt tcgcgacatg ctgcgcgccg ggccgcagct gcttaagctc 480 caggcgtggc agagcgtcta ccagtcggtt tcgcgcttta ttgaggatga gcatctgcgg 540 caggccttct cgttccactc cctgctggta ggcggcaacc ccttcaccac ctcgtccatc 600 tacaccctga tccacgccct tgagcgggag tggggggtct ggttccctga gggcggcacc 660 ggggcgctgg tgaacggcat ggtgaagctg tttaccgatc tgggcgggga gatcgaactc 720 aacgcccggg tcgaagagct ggtggtggcc gataaccgcg taagccaggt ccggctggcg 780 gatggtcgga tctttgacac cgacgccgta gcctcgaacg ctgacgtggt gaacacctat 840 aaaaagctgc tcggccacca tccggtgggg cagaagcggg cggcagcgct ggagcgcaag 900 agcatgagca actcgctgtt tgtgctctac ttcggcctga accagcctca ttcccagctg 960 gcgcaccata ccatctgttt tggtccccgc taccgggagc tgatcgacga gatctttacc 1020 ggcagcgcgc tggcggatga cttctcgctc tacctgcact cgccctgcgt gaccgatccc 1080 tcgctcgcgc ctcccggctg cgccagcttc tacgtgctgg ccccggtgcc gcatcttggc 1140 aacgcgccgc tggactgggc gcaggagggg ccgaagctgc gcgaccgcat 
ctttgactac 1200 cttgaagagc gctatatgcc cggcctgcgt agccagctgg tgacccagcg gatctttacc 1260 ccggcagact tccacgacac gctggatgcg catctgggat cggccttctc catcgagccg 1320 ctgctgaccc aaagcgcctg gttccgcccg cacaaccgcg acagcgacat tgccaacctc 1380 tacctggtgg gcgcaggtac tcaccctggg gcgggcattc ctggcgtagt ggcctcggcg 1440 aaagccaccg ccagcctga 1459 <210> 29 <211> 1149 <212> DNA <213> Pantoea ananatis <400> 29 atgcaaccgc attatgatct gattctcgtg ggggctggac tcgcgaatgg ccttatcgcc 60 ctgcgtcttc agcagcagca acctgatatg cgtattttgc ttatcgacgc cgcaccccag 120 gcgggcggga atcatacgtg gtcatttcac cacgatgatt tgactgagag ccaacatcgt 180 tggatagctc cgctggtggt tcatcactgg cccgactatc aggtacgctt tcccacacgc 240 cgtcgtaagc tgaacagcgg ctacttttgt attacttctc agcgtttcgc tgaggtttta 300 cagcgacagt ttggcccgca cttgtggatg gataccgcgg tcgcagaggt taatgcggaa 360 tctgttcggt tgaaaaaggg tcaggttatc ggtgcccgcg cggtgattga cgggcggggt 420 tatgcggcaa attcagcact gagcgtgggc ttccaggcgt ttattggcca ggaatggcga 480 ttgagccacc cgcatggttt atcgtctccc attatcatgg atgccacggt cgatcagcaa 540 aatggttatc gcttcgtgta cagcctgccg ctctcgccga ccagattgtt aattgaagac 600 acgcactata ttgataatgc gacattagat cctgaatgcg cgcggcaaaa tatttgcgac 660 tatgccgcgc aacagggttg gcagcttcag acactgctgc gagaagaaca gggcgcctta 720 cccattactc tgtcgggcaa tgccgacgca ttctggcagc agcgccccct ggcctgtagt 780 ggattacgtg ccggtctgtt ccatcctacc accggctatt cactgccgct ggcggttgcc 840 gtggccgacc gcctgagtgc acttgatgtc tttacgtcgg cctcaattca ccatgccatt 900 acgcattttg cccgcgagcg ctggcagcag cagggctttt tccgcatgct gaatcgcatg 960 ctgtttttag ccggacccgc cgattcacgc tggcgggtta tgcagcgttt ttatggttta 1020 cctgaagatt taattgcccg tttttatgcg ggaaaactca cgctgaccga tcggctacgt 1080 attctgagcg gcaagccgcc tgttccggta ttagcagcat tgcaagccat tatgacgact 1140 catcgttaa 1149 <210> 30 <211> 4176 <212> DNA <213> Artificial Sequence <220> <223> pTrc99A vector <400> 30 gtttgacagc ttatcatcga ctgcacggtg caccaatgct tctggcgtca ggcagccatc 60 ggaagctgtg gtatggctgt gcaggtcgta aatcactgca taattcgtgt cgctcaaggc 120 gcactcccgt tctggataat gttttttgcg 
ccgacatcat aacggttctg gcaaatattc 180 tgaaatgagc tgttgacaat taatcatccg gctcgtataa tgtgtggaat tgtgagcgga 240 taacaatttc acacaggaaa cagaccatgg aattcgagct cggtacccgg ggatcctcta 300 gagtcgacct gcaggcatgc aagcttggct gttttggcgg atgagagaag attttcagcc 360 tgatacagat taaatcagaa cgcagaagcg gtctgataaa acagaatttg cctggcggca 420 gtagcgcggt ggtcccacct gaccccatgc cgaactcaga agtgaaacgc cgtagcgccg 480 atggtagtgt ggggtctccc catgcgagag tagggaactg ccaggcatca aataaaacga 540 aaggctcagt cgaaagactg ggcctttcgt tttatctgtt gtttgtcggt gaacgctctc 600 ctgagtagga caaatccgcc gggagcggat ttgaacgttg cgaagcaacg gcccggaggg 660 tggcgggcag gacgcccgcc ataaactgcc aggcatcaaa ttaagcagaa ggccatcctg 720 acggatggcc tttttgcgtt tctacaaact ctttttgttt atttttctaa atacattcaa 780 atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga 840 agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc 900 ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg 960 gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc 1020 gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat 1080 tatcccgtgt tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 1140 acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 1200 aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa 1260 cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 1320 gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 1380 cgatgcctac agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 1440 tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 1500 tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 1560 ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 1620 tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 1680 gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 1740 ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 1800 tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac 
cccgtagaaa 1860 agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 1920 aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 1980 cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 2040 agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 2100 tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 2160 gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 2220 gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 2280 ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 2340 gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 2400 ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 2460 ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 2520 acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt 2580 gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 2640 cggaagagcg cctgatgcgg tattttctcc ttacgcatct gtgcggtatt tcacaccgca 2700 tatggtgcac tctcagtaca atctgctctg atgccgcata gttaagccag tatacactcc 2760 gctatcgcta cgtgactggg tcatggctgc gccccgacac ccgccaacac ccgctgacgc 2820 gccctgacgg gcttgtctgc tcccggcatc cgcttacaga caagctgtga ccgtctccgg 2880 gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa cgcgcgaggc agcagatcaa 2940 ttcgcgcgcg aaggcgaagc ggcatgcatt tacgttgaca ccatcgaatg gtgcaaaacc 3000 tttcgcggta tggcatgata gcgcccggaa gagagtcaat tcagggtggt gaatgtgaaa 3060 ccagtaacgt tatacgatgt cgcagagtat gccggtgtct cttatcagac cgtttcccgc 3120 gtggtgaacc aggccagcca cgtttctgcg aaaacgcggg aaaaagtgga agcggcgatg 3180 gcggagctga attacattcc caaccgcgtg gcacaacaac tggcgggcaa acagtcgttg 3240 ctgattggcg ttgccacctc cagtctggcc ctgcacgcgc cgtcgcaaat tgtcgcggcg 3300 attaaatctc gcgccgatca actgggtgcc agcgtggtgg tgtcgatggt agaacgaagc 3360 ggcgtcgaag cctgtaaagc ggcggtgcac aatcttctcg cgcaacgcgt cagtgggctg 3420 atcattaact atccgctgga tgaccaggat gccattgctg tggaagctgc ctgcactaat 3480 gttccggcgt tatttcttga tgtctctgac cagacaccca tcaacagtat tattttctcc 
3540 catgaagacg gtacgcgact gggcgtggag catctggtcg cattgggtca ccagcaaatc 3600 gcgctgttag cgggcccatt aagttctgtc tcggcgcgtc tgcgtctggc tggctggcat 3660 aaatatctca ctcgcaatca aattcagccg atagcggaac gggaaggcga ctggagtgcc 3720 atgtccggtt ttcaacaaac catgcaaatg ctgaatgagg gcatcgttcc cactgcgatg 3780 ctggttgcca acgatcagat ggcgctgggc gcaatgcgcg ccattaccga gtccgggctg 3840 cgcgttggtg cggatatctc ggtagtggga tacgacgata ccgaagacag ctcatgttat 3900 atcccgccgt caaccaccat caaacaggat tttcgcctgc tggggcaaac cagcgtggac 3960 cgcttgctgc aactctctca gggccaggcg gtgaagggca atcagctgtt gcccgtctca 4020 ctggtgaaaa gaaaaaccac cctggcgccc aatacgcaaa ccgcctctcc ccgcgcgttg 4080 gccgattcat taatgcagct ggcacgacag gtttcccgac tggaaagcgg gcagtgagcg 4140 caacgcaatt aatgtgagtt agcgcgaatt gatctg 4176 <210> 31 <211> 828 <212> DNA <213> Uncultured marine bacterium 66A03 <220> <221> misc_feature <222> (1)..(828) <223> blh gene : DQ065755 <400> 31 atgggcttga tgttaattga ttggtgtgct ttagcattgg ttgtgtttat tggtttgcca 60 catggtgcct tagatgctgc tatttctttt tcaatgattt cttcagcaaa gagaattgct 120 agattagcag gaatactatt aatttacctg ttgttagcaa ccgcattttt tttaatttgg 180 tatcaattac cagcattttc tcttcttatt tttcttttga taagcataat ccattttgga 240 atggctgatt tcaatgcatc cccaagtaaa cttaagtggc ctcatattat tgcacatggc 300 ggcgttgtta ctgtttggtt gccgcttatc caaaaaaatg aagttacgaa gctattttca 360 atattaacaa atggtccaac tcccatttta tgggacatac tattgatatt ttttttatgt 420 tggagcatag gagtatgtct tcatacctat gaaactttac gttctaaaca ttataatatc 480 gcctttgaac ttattggatt aatttttcta gcctggtatg cacccccact cgttactttt 540 gccacatact tctgctttat ccacagcaga cgtcacttta gttttgtttg gaaacagtta 600 cagcatatga gttcaaaaaa aatgatgata ggtagtgcca ttattttatc ttgtacgagc 660 tggttgatag gcggaggaat atattttttc ctcaattcga aaatgattgc cagtgaagct 720 gctttacaaa ctgtctttat tggtcttgca gctttaacag ttcctcacat gatacttatc 780 gactttatat ttagaccaca ctcttccaga attaaaatca aaaattga 828 <210> 32 <211> 828 <212> DNA <213> Artificial Sequence <220> <223> E. 
coli codon optimized blh gene of uncultured marine bacterium 66A03 <400> 32 atgggtctga tgctgattga ttggtgtgca ctggctctgg ttgttttcat tggcctgccg 60 cacggcgcgc tggatgctgc catttctttt tctatgatct cttctgcaaa acgcattgct 120 cgtctggctg gtattctgct gatctatctg ctgctggcga ccgcgttctt cctgatctgg 180 tatcagctgc cagcgtttag cctgctgatc ttcctgctga tctccattat ccactttggt 240 atggcagact tcaacgcgtc cccaagcaaa ctgaaatggc cgcatatcat cgcccacggc 300 ggtgttgtta ctgtttggct gccgctgatc cagaaaaacg aagtaactaa actgtttagc 360 atcctgacta acggtccgac tccgatcctg tgggacatcc tgctgatttt cttcctgtgt 420 tggtctattg gcgtgtgtct gcacacgtac gaaaccctgc gctctaaaca ttacaacatc 480 gcctttgaac tgatcggtct gattttcctg gcgtggtatg cgccgcctct ggttacgttt 540 gccacttact tctgcttcat tcattcccgt cgccacttct cctttgtgtg gaagcagctg 600 caacacatgt cttccaaaaa gatgatgatt ggcagcgcga ttatcctgtc ctgtacctct 660 tggctgatcg gcggtggtat ctatttcttc ctgaactcca aaatgatcgc ctctgaggct 720 gcgctgcaga ctgtgttcat cggtctggcg gcactgaccg tgccgcacat gattctgatc 780 gacttcatct tccgtccgca ctcttcccgt atcaaaatca aaaactaa 828 <210> 33 <211> 1701 <212> DNA <213> Mus musculus <400> 33 atggagataa tatttggcca gaataagaaa gaacagctgg agccagttca ggccaaagtg 60 acaggcagca ttccagcatg gctgcagggg accctgctcc gaaacgggcc cgggatgcac 120 acagtgggag agagcaagta caaccattgg tttgatggcc tggcccttct ccacagtttc 180 tccatcagag atggggaggt cttctacagg agcaaatacc tgcagagtga cacctacatc 240 gccaacattg aggccaacag aatcgtggtg tctgagttcg gaaccatggc ctacccggac 300 ccctgcaaaa acatcttttc caaagctttc tcctacttgt ctcacaccat ccccgacttc 360 acagacaact gtctgatcaa catcatgaaa tgtggagaag acttctatgc aaccacggag 420 accaactaca tcaggaaaat cgacccccag accctagaga ccttggagaa ggttgattac 480 cggaagtatg tggcggtaaa cctggctacc tcgcaccctc attatgacga ggctgggaat 540 gtccttaaca tgggcacatc cgtcgtggac aaagggagga caaaatacgt gatatttaag 600 atccctgcca cagtgccaga cagcaagaag aaagggaaga gtcccgtgaa gcacgcggaa 660 gttttctgct ccatttcctc ccgctcgctg ctctctccca gctactacca cagctttggt 720 gtcacggaga actatgtggt gtttctggag cagcctttta agttggatat 
cctcaagatg 780 gccaccgcat acatgagggg agtgagctgg gcttcctgta tgtcattcga cagggaggac 840 aagacataca ttcatatcat cgaccagagg accaggaagc ctgtgcctac caagttctac 900 acagatccca tggtggtctt ccatcatgtc aatgcctacg aggaggacgg ctgtgtgctg 960 tttgatgtga tcgcctatga ggacagcagc ctctatcagc tcttctacct ggccaacctg 1020 aacaaggact tcgaggagaa gtccaggctg acctcagtgc ctaccctcag gaggtttgct 1080 gtgcccctcc atgtggacaa ggatgcagaa gtgggctcaa atttagtcaa ggtgtcatct 1140 acaactgcaa cagccctgaa ggagaaagac ggccatgtct attgccagcc cgaggtcctc 1200 tacgaaggcc tagagctccc tcggataaat tatgcttaca acgggaagcc atatcgctac 1260 atctttgcag ctgaagtaca gtggagtcca gtcccaacca agatactgaa atatgacatt 1320 ctcacaaagt cctccttaaa gtggtctgag gagagctgct ggccagcaga gcctctgttt 1380 gttcccacgc caggtgcgaa ggatgaagat gatggagtca ttttatcagc catcgtctct 1440 acggatcccc aaaagctgcc ttttttactc attctggatg ccaaaagttt tacggaactg 1500 gctcgcgcct ctgttgatgc ggacatgcac ctggaccttc atggtttatt tatcccagat 1560 gcagactgga atgcagtgaa gcagactcca gctgaaacgc aagaggttga aaactcagat 1620 catcccacag atccgacagc accagaactg agccacagtg aaaatgactt cacagcgggt 1680 catggtggct caagtcttta a 1701 <210> 34 <211> 1053 <212> DNA <213> Natronomonas pharaonis ATCC35678 <400> 34 atgagtaacg cgtcgctccg gccctccggg acggccagtg cgacactgtt ccggctggcg 60 ttcctcccgg gctgggctgt catcgcggcg acgacgggtg cgttcctcgt gggagcctca 120 ctacccctta cctaccaact catcccgctc gccgctagcg tggtcctgct cgggctccca 180 cacggcgctg tcgaccattt ggcgctcccg cggacccgaa acgagcgggt cacggttcga 240 tggcttgcgg ccatcggtgt cctctatgcc gttgtcggcg ggctctatgc ggcagtctgg 300 tttctcgcgc ccgtcggtgc cgtcgccgcg ttcattttta tgacgtgggt tcactggggc 360 caaggagaaa tctatccgct tgttgcgctc gctgacgccg accacctcga tgggcggctc 420 gaacgcggac tgacagccgc catccgcggc gcattgccga tgctcgtccc gtttgtcgcc 480 tttcccgacc agtacgagct cgtcgtgaca accctcgtcg ggctcttcga cgccgatgca 540 gcggcgacgg cggcggccgc gttcacgccg accgcacggc tcgcagtcgc ggtcaccgtc 600 ggggggttgg tggcggtcac cctcggtatt ggagctgtcg ctgccagcga gaccggctgg 660 gggccgtggc tgcttgatgc cggcgaaaca gggcttctga 
ttctgttctt tgcggcggtg 720 ccgccgattt tcgccatcgg cctctacttt tgtttctggc actcgcttcg ccacatcgtc 780 cggttgctcg ctgtcgataa ccgggcagca ccggcgctcg atggtcgccg atacggcgcg 840 gcgcttgcgc gctttgctcg ggatgcagcc ccgctgtcag cggcgtcgct cgtgttgctc 900 gggctgttgt atctagccgt gcccggcagc gtcgactcgc cgctttcgct tgtcgggacc 960 tacctcgtgt tgatagccgt gctcacgctc ccgcacgtgg tcgtcgtggc gtggatggac 1020 cacgaacagc ggctctggcg acccggagca tag 1053 <210> 35 <211> 855 <212> DNA <213> Halobacterium salinarum ATCC700922 <400> 35 atgccacacg gcgcgatcga ctacctcgcg ttgccccgcg cggtcacggg caccgtcacc 60 gtgcggtggc tggcggtcgt cggcgtcctc tacctcgtgc tcggtggtgg ctacgccgcc 120 gcgtggtttt tcgcgcccgt tcccgctgcg ttcgcgttcg tcgcgatcac gtggctgcac 180 tgggggcagg gcgacctcta cccgctgctc gacttcctcg acgtcgacta cctcgatacg 240 cgcccgcggc gcgcggcgac ggtcctgatc cggggtggcc tcccgatgct cgtgccgctg 300 ctcgggttcc cggagcggta ccgcagcgtc gtcgacgcgt tcgccgcgcc gttcggcggc 360 tccgtcggcg acctcgcggt gttcgacccg cgcgtccgcc tgtggctggg cgtcgcgttc 420 gcagccgcga ccgtcgcggt gctcgcggcg ggcagacgcc gcacccactc ccccggcgcg 480 tggcgcgtcg acgccgccga aaccctcctg ttgtgggtgt tcttcttcgt cgtgccgccg 540 gtgttcgccg tcggcgtcta cttctgcgtc tggcactcgg tccggcacgt cgcgcgcgcc 600 atcgccgtcg acggctcggt ccacccgtcg ctgcgcgcgg gcgacatcct cggaccgctg 660 gcccggttcg gcgtggaggc cgcgccgatg acggcggccg cgctcgcgtt gggcggcgtg 720 ctgtggtggg cggtacccaa cccgccgacc acgctcgaat ccggggccgc actctacctc 780 gtgttgatcg ccgtgctcac cctgccacac gtcgccgtgg tcacgtggat ggaccgcgtg 840 cagggcgtcc tctga 855 <210> 36 <211> 1080 <212> DNA <213> Halobacterium salinarum ATCC700922 <400> 36 atgagcaata ggtcgcagtt cgtcccgtcg tggctcgtgc cggaggcagc cggcgacctc 60 ccgttgaccg tgtcgcggct gtcactgctc gcgcttgccg ccgcgttcgc ggtcggatac 120 ggcgcgggct tcgcggtccc actggaggtc caggcgggcg tctacctgtt gggtatggtc 180 gcgatgaacc tcccgcacgg cggctacgag catttcgaga acctgcggcg acgggctgcc 240 tccttccagg gcaagtatat cgtcgcctac ctggtcggga tcgcggcgtt cggcgcgctg 300 tttttcgtcg cgcccgtcgc cggactgggg ctggcagtca cggtggccgt cgccaaaggt 360 
gggttcggtg gcgtgcagtc gatggacgcc ctctacggaa ctgaccattt gcgcacgcgc 420 ccccagcggt ggctcgccgc cgtcgtccgg ggcggcgcgg tgatggtggt tcccatgttg 480 ttctggacgg acgtgttcta cgcgttcagc tcggtcatga tctcgatttt cgaccccagc 540 gccgtgtcgg cgctcggcgg tgacatcgca acccggcggc tcgtgctcgg cggcgggtac 600 ggggcgctcg tggtcgcaca cctggggctc ggctaccggc gggcggccgg caccgggtcg 660 ttcctcgccg acgccgccga gacgctgctg ttgatcgcgt acttcgcgct cgttccggtg 720 gtcatcgccg tcgggctgta cttcccgctg tggtactcgg cccgccaggt ggcccgatcg 780 tcggccgtcg acgacacggc gatgacgcag gcagacgcca ccggcatgct tgacgccctg 840 gacgccgacg acccggcgcg cgccacgctt gcctcgtggg cggtgctcat cgtcggcagc 900 gtcgccacgt tcggcctggc ggccgtgctc tggctgctgt ccccacagcc cctgggtggt 960 ggtgggatcc tcgtgggctt ggtcgcgttc tggagcatct tcgtgagcat catcgcgctc 1020 ccgcacgtcg tcgtcggcgg gtggcttgac cgcactcgcg gcatctggta cgtcccataa 1080 1080 <210> 37 <211> 44 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 37 cgagctcagg agcatttaga tgttgaaaac agtagttatt attg 44 <210> 38 <211> 26 <212> DNA <213> Artificial Sequence <220> <223> Reverse primer <400> 38 gcccggggtg gcctgaaacg gctacc 26 <210> 39 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 39 gcccgggagg agttaaagaa atgacaattg 30 <210> 40 <211> 29 <212> DNA <213> Artificial Sequence <220> <223> Reverse primer <400> 40 cggatccctt agtttcgata agagcgaac 29 <210> 41 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 41 cggtaccaat gacaaaaaaa gttggtgtcg g 31 <210> 42 <211> 28 <212> DNA <213> Artificial Sequence <220> <223> Reverse primer <400> 42 ttctagatta cgatttgtcg tcatgtcc 28 <210> 43 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 43 ccccgggagg agagaaatta tgcaaacgga ac 32 <210> 44 <211> 27 <212> DNA <213> Artificial Sequence <220> <223> Reverse primer <400> 44 tgcatgctta tttaagctgg gtaaatg 27 <210> 45 <211> 2999 <212> DNA <213> Artificial Sequence <220> <223> pSTV28 vector sequence <400> 45 cgtatggcaa tgaaagacgg tgagctggtg 
atatgggata gtgttcaccc ttgttacacc 60 gttttccatg agcaaactga aacgttttca tcgctctgga gtgaatacca cgacgatttc 120 cggcagtttc tacacatata ttcgcaagat gtggcgtgtt acggtgaaaa cctggcctat 180 ttccctaaag ggtttattga gaatatgttt ttcgtctcag ccaatccctg ggtgagtttc 240 accagttttg atttaaacgt ggccaatatg gacaacttct tcgcccccgt tttcaccatg 300 ggcaaatatt atacgcaagg cgacaaggtg ctgatgccgc tggcgattca ggttcatcat 360 gccgtctgtg atggcttcca tgtcggcaga atgcttaatg aattacaaca gtactgcgat 420 gagtggcagg gcggggcgta atttttttaa ggcagttatt ggtgccctta aacgcctggt 480 gctacgcctg aataagtgat aataagcgga tgaatggcag aaattcgaaa gcaaattcga 540 cccggtcgtc ggttcagggc agggtcgtta aatagccgct tatgtctatt gctggtttac 600 cggtttattg actaccggaa gcagtgtgac cgtgtgcttc tcaaatgcct gaggccagtt 660 tgctcaggct ctccccgtgg aggtaataat tgacgatatg atcatttatt ctgcctccca 720 gagcctgata aaaacggtta gcgcttcgtt aatacagatg taggtgttcc acagggtagc 780 cagcagcatc ctgcgatgca gatccggaac ataatggtgc agggcgcttg tttcggcgtg 840 ggtatggtgg caggccccgt ggccggggga ctgttgggcg ctgccggcac ctgtcctacg 900 agttgcatga taaagaagac agtcataagt gcggcgacga tagtcatgcc ccgcgcccac 960 cggaaggagc taccggacag cggtgcggac tgttgtaact cagaataaga aatgaggccg 1020 ctcatggcgt tccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca 1080 gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga 1140 gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt 1200 gtggaattgt gagcggataa caatttcaca caggaaacag ctatgaccat gattacgaat 1260 tcgagctcgg tacccgggga tcctctagag tcgacctgca ggcatgcaag cttggcactg 1320 gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt 1380 gcagcacatc cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct 1440 tcccaacagt tgcgcagcct gaatggcgaa tgagcttatc gatgataagc tgtcaaacat 1500 gagaattaca acttatatcg tatggggctg acttcaggtg ctacatttga agagataaat 1560 tgcactgaaa tctagaaata ttttatctga ttaataagat gatcttcttg agatcgtttt 1620 ggtctgcgcg taatctcttg ctctgaaaac gaaaaaaccg ccttgcaggg cggtttttcg 1680 aaggttctct gagctaccaa ctctttgaac cgaggtaact ggcttggagg 
agcgcagtca 1740 ccaaaacttg tcctttcagt ttagccttaa ccggcgcatg acttcaagac taactcctct 1800 aaatcaatta ccagtggctg ctgccagtgg tgcttttgca tgtctttccg ggttggactc 1860 aagacgatag ttaccggata aggcgcagcg gtcggactga acggggggtt cgtgcataca 1920 gtccagcttg gagcgaactg cctacccgga actgagtgtc aggcgtggaa tgagacaaac 1980 gcggccataa cagcggaatg acaccggtaa accgaaaggc aggaacagga gagcgcacga 2040 gggagccgcc aggggaaacg cctggtatct ttatagtcct gtcgggtttc gccaccactg 2100 atttgagcgt cagatttcgt gatgcttgtc aggggggcgg agcctatgga aaaacggctt 2160 tgccgcggcc ctctcacttc cctgttaagt atcttcctgg catcttccag gaaatctccg 2220 ccccgttcgt aagccatttc cgctcgccgc agtcgaacga ccgagcgtag cgagtcagtg 2280 agcgaggaag cggaatatat cctgtatcac atattctgct gacgcaccgg tgcagccttt 2340 tttctcctgc cacatgaagc acttcactga caccctcatc agtgccaaca tagtaagcca 2400 gtatacactc cgctagcgct gatgtccggc ggtgcttttg ccgttacgca ccaccccgtc 2460 agtagctgaa caggagggac agctgataga aacagaagcc actggagcac ctcaaaaaca 2520 ccatcataca ctaaatcagt aagttggcag catcacccga cgcactttgc gccgaataaa 2580 tacctgtgac ggaagatcac ttcgcagaat aaataaatcc tggtgtccct gttgataccg 2640 ggaagccctg ggccaacttt tggcgaaaat gagacgttga tcggcacgta agaggttcca 2700 actttcacca taatgaaata agatcactac cgggcgtatt ttttgagtta tcgagatttt 2760 caggagctaa ggaagctaaa atggagaaaa aaatcactgg atataccacc gttgatatat 2820 cccaatggca tcgtaaagaa cattttgagg catttcagtc agttgctcaa tgtacctata 2880 accagaccgt tcagctggat attacggcct ttttaaagac cgtaaagaaa aataagcaca 2940 agttttatcc ggcctttatt cacattcttg cccgcctgat gaatgctcat ccggaattt 2999 <210> 46 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 46 ccccgggagg agagaaatta tgcaaacgga ac 32 <210> 47 <211> 27 <212> DNA <213> Artificial Sequence <220> <223> Reverse primer <400> 47 tgcatgctta tttaagctgg gtaaatg 27 <210> 48 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 48 ccggaattca taatgagttt tgatattgcc 30 <210> 49 <211> 37 <212> DNA <213> Artificial Sequence <220> <223> Reverse primer <400> 49 ctcctcgagt acgtatcatt 
atgccagcca ggccttg 37 <210> 50 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 50 ctcatgacgg tctgcgcaaa aaaacacgtt c 31 <210> 51 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> Reverse primer <400> 51 ggaattctta actgacggca gcgagttttt tg 32 <210> 52 <211> 35 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 52 cgaattcagg agcgactaca tgaaaccaac tacgg 35 <210> 53 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> Reverse primer <400> 53 ggagctctta gagcgggcgc tgccagagat g 31 <210> 54 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 54 ggtcgacagg aggatcggga tgaccgcgag ac 32 <210> 55 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> Reverse primer <400> 55 gctgcagtta tgcgcgggtt cctgcaaatg 30 <210> 56 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 56 gactagtgaa ttcaggaggt aataaatatg g 31 <210> 57 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> Reverse primer <400> 57 cactagttag tttttgattt tg 22 <210> 58 <211> 35 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 58 gactagtagg agcggttcca tggagataat atttg 35 <210> 59 <211> 27 <212> DNA <213> Artificial Sequence <220> <223> Reverse primer <400> 59 catggtggct caagtcttta actagtc 27 <210> 60 <211> 33 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 60 gactagtagg aggccgagta tgagtaacgc gtc 33 <210> 61 <211> 27 <212> DNA <213> Artificial Sequence <220> <223> Reverse primer <400> 61 ctctggcgac ccggagcata actagtc 27 <210> 62 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 62 gactagtagg aggtgttcgg catgccacac gg 32 <210> 63 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> Reverse primer <400> 63 gaccgcgtgc agggcgtcct ctaactagtc 30 <210> 64 <211> 34 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 64 gactagtagg aggtattcat atgagcaata ggtc 34 <210> 65 <211> 28 <212> DNA <213> Artificial Sequence <220> <223> 
Reverse primer <400> 65 cggcatctgg tacgtcccat aactagtc 28 <110> INDUSTRY-ACADEMIC COOPERATION FOUNDATION GYEONGSANG NATIONAL UNIVERSITY <120> A microorganism of Escherichia genus having enhanced isoprenoid          productivity and method of producing isoprenoid using the same <130> PN077141 <160> 65 <170> KopatentIn 1.71 <210> 1 <211> 803 <212> PRT <213> Enterococcus faecalis <400> 1 Met Lys Thr Val Val Ile Ile Asp Ala Leu Arg Thr Pro Ile Gly Lys   1 5 10 15 Tyr Lys Gly Ser Leu Ser Gln Val Ser Ala Val Asp Leu Gly Thr His              20 25 30 Val Thr Thr Gln Leu Leu Lys Arg His Ser Thr Ile Ser Glu Glu Ile          35 40 45 Asp Gln Val Ile Phe Gly Asn Val Leu Gln Ala Gly Asn Gly Gln Asn      50 55 60 Pro Ala Arg Gln Ile Ala Ile Asn Ser Gly Leu Ser His Glu Ile Pro  65 70 75 80 Ala Met Thr Val Asn Glu Val Cys Gly Ser Gly Met Lys Ala Val Ile                  85 90 95 Leu Ala Lys Gln Leu Ile Gln Leu Gly Glu Ala Glu Val Leu Ile Ala             100 105 110 Gly Gly Ile Glu Asn Met Ser Gln Ala Pro Lys Leu Gln Arg Phe Asn         115 120 125 Tyr Glu Thr Glu Ser Tyr Asp Ala Pro Phe Ser Ser Met Met Tyr Asp     130 135 140 Gly Leu Thr Asp Ala Phe Ser Gly Gln Ala Met Gly Leu Thr Ala Glu 145 150 155 160 Asn Val Ala Glu Lys Tyr His Val Thr Arg Glu Glu Gln Asp Gln Phe                 165 170 175 Ser Val His Ser Gln Leu Lys Ala Ala Gln Ala Gln Ala Glu Gly Ile             180 185 190 Phe Ala Asp Glu Ile Ala Pro Leu Glu Val Ser Gly Thr Leu Val Glu         195 200 205 Lys Asp Glu Gly Ile Arg Pro Asn Ser Ser Val Glu Lys Leu Gly Thr     210 215 220 Leu Lys Thr Val Phe Lys Glu Asp Gly Thr Val Thr Ala Gly Asn Ala 225 230 235 240 Ser Thr Ile Asn Asp Gly Ala Ser Ala Leu Ile Ile Ala Ser Gln Glu                 245 250 255 Tyr Ala Glu Ala His Gly Leu Pro Tyr Leu Ala Ile Ile Arg Asp Ser             260 265 270 Val Glu Val Gly Ile Asp Pro Ala Tyr Met Gly Ile Ser Pro Ile Lys         275 280 285 Ala Ile Gln Lys Leu Leu Ala Arg Asn Gln Leu Thr Thr Glu Glu Ile     290 295 300 Asp Leu Tyr Glu Ile 
Asn Glu Ala Phe Ala Ala Thr Ser Ile Val Val 305 310 315 320 Gln Arg Glu Leu Ala Leu Pro Glu Glu Lys Val Asn Ile Tyr Gly Gly                 325 330 335 Gly Ile Ser Leu Gly His Ala Ile Gly Ala Thr Gly Ala Arg Leu Leu             340 345 350 Thr Ser Leu Ser Tyr Gln Leu Asn Gln Lys Glu Lys Lys Tyr Gly Val         355 360 365 Ala Ser Leu Cys Ile Gly Gly Gly Leu Gly Leu Ala Met Leu Leu Glu     370 375 380 Arg Pro Gln Gln Lys Lys Asn Ser Arg Phe Tyr Gln Met Ser Pro Glu 385 390 395 400 Glu Arg Leu Ala Ser Leu Leu Asn Glu Gly Gln Ile Ser Ala Asp Thr                 405 410 415 Lys Lys Glu Phe Glu Asn Thr Ala Leu Ser Ser Gln Ile Ala Asn His             420 425 430 Met Ile Glu Asn Gln Ile Ser Glu Thr Glu Val Pro Met Gly Val Gly         435 440 445 Leu His Leu Thr Val Asp Glu Thr Asp Tyr Leu Val Pro Met Ala Thr     450 455 460 Glu Glu Pro Ser Val Ile Ala Ala Leu Ser Asn Gly Ala Lys Ile Ala 465 470 475 480 Gln Gly Phe Lys Thr Val Asn Gln Gln Arg Leu Met Arg Gly Gln Ile                 485 490 495 Val Phe Tyr Asp Val Ala Asp Ala Glu Ser Leu Ile Asp Glu Leu Gln             500 505 510 Val Arg Glu Thr Glu Ile Phe Gln Gln Ala Glu Leu Ser Tyr Pro Ser         515 520 525 Ile Val Lys Arg Gly Gly Gly Leu Arg Asp Leu Gln Tyr Arg Ala Phe     530 535 540 Asp Glu Ser Phe Val Ser Val Asp Phe Leu Val Asp Val Lys Asp Ala 545 550 555 560 Met Gly Ala Asn Ile Val Asn Ala Met Leu Glu Gly Val Ala Glu Leu                 565 570 575 Phe Arg Glu Trp Phe Ala Glu Gln Lys Ile Leu Phe Ser Ile Leu Ser             580 585 590 Asn Tyr Ala Thr Glu Ser Val Val Thr Met Lys Thr Ala Ile Pro Val         595 600 605 Ser Arg Leu Ser Lys Gly Ser Asn Gly Arg Glu Ile Ala Glu Lys Ile     610 615 620 Val Leu Ala Ser Arg Tyr Ala Ser Leu Asp Pro Tyr Arg Ala Val Thr 625 630 635 640 His Asn Lys Gly Ile Met Asn Gly Ile Glu Ala Val Val Leu Ala Thr                 645 650 655 Gly Asn Asp Thr Arg Ala Val Ser Ala Ser Cys His Ala Phe Ala Val             660 665 670 Lys Glu Gly Arg Tyr Gln Gly Leu Thr Ser Trp Thr Leu Asp Gly Glu 
        675 680 685 Gln Leu Ile Gly Glu Ile Ser Val Pro Leu Ala Leu Ala Thr Val Gly     690 695 700 Gly Ala Thr Lys Val Leu Pro Lys Ser Gln Ala Ala Ala Asp Leu Leu 705 710 715 720 Ala Val Thr Asp Ala Lys Glu Leu Ser Arg Val Val Ala Ala Val Gly                 725 730 735 Leu Ala Gln Asn Leu Ala Ala Leu Arg Ala Leu Val Ser Glu Gly Ile             740 745 750 Gln Lys Gly His Met Ala Leu Gln Ala Arg Ser Leu Ala Met Thr Val         755 760 765 Gly Ala Thr Gly Lys Glu Val Glu Ala Val Ala Gln Gln Leu Lys Arg     770 775 780 Gln Lys Thr Met Asn Gln Asp Arg Ala Leu Ala Ile Leu Asn Asp Leu 785 790 795 800 Arg Lys Gln             <210> 2 <211> 383 <212> PRT <213> Enterococcus faecalis <400> 2 Met Thr Ile Gly Ile Asp Lys Ile Ser Phe Phe Val Pro Pro Tyr Tyr   1 5 10 15 Ile Asp Met Thr Ala Leu Ala Glu Ala Arg Asn Val Asp Pro Gly Lys              20 25 30 Phe His Ile Gly Ile Gly Gln Asp Gln Met Ala Val Asn Pro Ile Ser          35 40 45 Gln Asp Ile Val Thr Phe Ala Ala Asn Ala Ala Glu Ala Ile Leu Thr      50 55 60 Lys Glu Asp Lys Glu Ala Ile Asp Met Val Ile Val Gly Thr Glu Ser  65 70 75 80 Ser Ile Asp Glu Ser Lys Ala Ala Ala Val Val Leu His Arg Leu Met                  85 90 95 Gly Ile Gln Pro Phe Ala Arg Ser Phe Glu Ile Lys Glu Ala Cys Tyr             100 105 110 Gly Ala Thr Ala Gly Leu Gln Leu Ala Lys Asn His Val Ala Leu His         115 120 125 Pro Asp Lys Lys Val Leu Val Val Ala Ala Asp Ile Ala Lys Tyr Gly     130 135 140 Leu Asn Ser Gly Gly Glu Pro Thr Gln Gly Ala Gly Ala Val Ala Met 145 150 155 160 Leu Val Ala Ser Glu Pro Arg Ile Leu Ala Leu Lys Glu Asp Asn Val                 165 170 175 Met Leu Thr Gln Asp Ile Tyr Asp Phe Trp Arg Pro Thr Gly His Pro             180 185 190 Tyr Pro Met Val Asp Gly Pro Leu Ser Asn Glu Thr Tyr Ile Gln Ser         195 200 205 Phe Ala Gln Val Trp Asp Glu His Lys Lys Arg Thr Gly Leu Asp Phe     210 215 220 Ala Asp Tyr Asp Ala Leu Ala Phe His Ile Pro Tyr Thr Lys Met Gly 225 230 235 240 Lys Lys Ala Leu Leu Ala Lys Ile Ser Asp Gln Thr Glu Ala Glu 
Gln                 245 250 255 Glu Arg Ile Leu Ala Arg Tyr Glu Glu Ser Ile Ile Tyr Ser Arg Arg             260 265 270 Val Gly Asn Leu Tyr Thr Gly Ser Leu Tyr Leu Gly Leu Ile Ser Leu         275 280 285 Leu Glu Asn Ala Thr Thr Leu Thr Ala Gly Asn Gln Ile Gly Leu Phe     290 295 300 Ser Tyr Gly Ser Gly Ala Val Ala Glu Phe Phe Thr Gly Glu Leu Val 305 310 315 320 Ala Gly Tyr Gln Asn His Leu Gln Lys Glu Thr His Leu Ala Leu Leu                 325 330 335 Asp Asn Arg Thr Glu Leu Ser Ile Ala Glu Tyr Glu Ala Met Phe Ala             340 345 350 Glu Thr Leu Asp Thr Asp Ile Asp Gln Thr Leu Glu Asp Glu Leu Lys         355 360 365 Tyr Ser Ile Ser Ala Ile Asn Asn Thr Val Arg Ser Tyr Arg Asn     370 375 380 <210> 3 <211> 292 <212> PRT <213> Streptococcus pneumoniae <400> 3 Met Thr Lys Lys Val Gly Val Gly Gln Ala His Ser Lys Ile Ile Leu   1 5 10 15 Ile Gly Glu His Ala Val Val Tyr Gly Tyr Pro Ala Ile Ser Leu Pro              20 25 30 Leu Leu Glu Val Glu Val Thr Cys Lys Val Val Ser Ala Glu Ser Pro          35 40 45 Trp Arg Leu Tyr Glu Glu Asp Thr Leu Ser Met Ala Val Tyr Ala Ser      50 55 60 Leu Glu Tyr Leu Asp Ile Thr Glu Ala Cys Val Arg Cys Glu Ile Asp  65 70 75 80 Ser Ala Ile Pro Glu Lys Arg Gly Met Gly Ser Ser Ala Ala Ile Ser                  85 90 95 Ile Ala Ala Ile Arg Ala Val Phe Asp Tyr Tyr Gln Ala Asp Leu Pro             100 105 110 His Asp Val Leu Glu Ile Leu Val Asn Arg Ala Glu Met Ile Ala His         115 120 125 Met Asn Pro Ser Gly Leu Asp Ala Lys Thr Cys Leu Ser Asp Gln Pro     130 135 140 Ile Arg Phe Ile Lys Asn Val Gly Phe Thr Glu Leu Glu Met Asp Leu 145 150 155 160 Ser Ala Tyr Leu Val Ile Ala Asp Thr Gly Val Tyr Gly His Thr Arg                 165 170 175 Glu Ala Ile Gln Val Val Gln Asn Lys Gly Lys Asp Ala Leu Pro Phe             180 185 190 Leu His Ala Leu Gly Glu Leu Thr Gln Gln Ala Glu Val Ala Ile Ser         195 200 205 Gln Lys Tyr Ala Glu Gly Leu Gly Leu Ile Phe Ser Gln Ala His Leu     210 215 220 His Leu Lys Glu Ile Gly Val Ser Ser Pro Glu Ala Asp Phe Leu Val 
225 230 235 240 Glu Thr Ala Leu Ser Tyr Gly Ala Leu Gly Ala Lys Met Ser Gly Gly                 245 250 255 Gly Leu Gly Gly Cys Ile Ile Ala Leu Val Thr Asn Leu Thr His Ala             260 265 270 Gln Glu Leu Ala Glu Arg Leu Glu Glu Lys Gly Ala Val Gln Thr Trp         275 280 285 Ile Glu Ser Leu     290 <210> 4 <211> 336 <212> PRT <213> Streptococcus pneumoniae <400> 4 Met Ile Ala Val Lys Thr Cys Gly Lys Leu Tyr Trp Ala Gly Glu Tyr   1 5 10 15 Ala Ile Leu Glu Pro Gly Gln Leu Ala Leu Ile Lys Asp Ile Pro Ile              20 25 30 Tyr Met Arg Ala Glu Ile Ala Phe Ser Asp Ser Tyr Arg Ile Tyr Ser          35 40 45 Asp Met Phe Asp Phe Ala Val Asp Leu Arg Pro Asn Pro Asp Tyr Ser      50 55 60 Leu Ile Gln Glu Thr Ile Ala Leu Met Gly Asp Phe Leu Ala Val Arg  65 70 75 80 Gly Gln Asn Leu Arg Pro Phe Ser Leu Lys Ile Cys Gly Lys Met Glu                  85 90 95 Arg Glu Gly Lys Lys Phe Gly Leu Gly Ser Ser Gly Ser Val Val Val             100 105 110 Leu Val Val Lys Ala Leu Leu Ala Leu Tyr Asn Leu Ser Val Asp Gln         115 120 125 Asn Leu Leu Phe Lys Leu Thr Ser Ala Val Leu Leu Lys Arg Gly Asp     130 135 140 Asn Gly Ser Met Gly Asp Leu Ala Cys Ile Val Ala Glu Asp Leu Val 145 150 155 160 Leu Tyr Gln Ser Phe Asp Arg Gln Lys Ala Ala Ala Trp Leu Glu Glu                 165 170 175 Glu Asn Leu Ala Thr Val Leu Glu Arg Asp Trp Gly Phe Phe Ile Ser             180 185 190 Gln Val Lys Pro Thr Leu Glu Cys Asp Phe Leu Val Gly Trp Thr Lys         195 200 205 Glu Val Ala Val Ser Ser His Met Val Gln Gln Ile Lys Gln Asn Ile     210 215 220 Asn Gln Asn Phe Leu Ser Ser Ser Lys Glu Thr Val Val Ser Leu Val 225 230 235 240 Glu Ala Leu Glu Gln Gly Lys Ala Glu Lys Val Ile Glu Gln Val Glu                 245 250 255 Val Ala Ser Lys Leu Leu Glu Gly Leu Ser Thr Asp Ile Tyr Thr Pro             260 265 270 Leu Leu Arg Gln Leu Lys Glu Ala Ser Gln Asp Leu Gln Ala Val Ala         275 280 285 Lys Ser Ser Gly Ala Gly Gly Gly Asp Cys Gly Ile Ala Leu Ser Phe     290 295 300 Asp Ala Gln Ser Ser Arg Asn Thr Leu Lys 
Asn Arg Trp Ala Asp Leu 305 310 315 320 Gly Ile Glu Leu Leu Tyr Gln Glu Arg Ile Gly His Asp Asp Lys Ser                 325 330 335 <210> 5 <211> 317 <212> PRT <213> Streptococcus pneumoniae <400> 5 Met Asp Arg Glu Pro Val Thr Val Arg Ser Tyr Ala Asn Ile Ala Ile   1 5 10 15 Ile Lys Tyr Trp Gly Lys Lys Lys Glu Lys Glu Met Val Pro Ala Thr              20 25 30 Ser Ser Ile Ser Leu Thr Leu Glu Asn Met Tyr Thr Glu Thr Thr Leu          35 40 45 Ser Pro Leu Pro Ala Asn Val Thr Ala Asp Glu Phe Tyr Ile Asn Gly      50 55 60 Gln Leu Gln Asn Glu Val Glu His Ala Lys Met Ser Lys Ile Ile Asp  65 70 75 80 Arg Tyr Arg Pro Ala Gly Glu Gly Phe Val Arg Ile Asp Thr Gln Asn                  85 90 95 Asn Met Pro Thr Ala Ala Gly Leu Ser Ser Ser Ser Ser Gly Leu Ser             100 105 110 Ala Leu Val Lys Ala Cys Asn Ala Tyr Phe Lys Leu Gly Leu Asp Arg         115 120 125 Ser Gln Leu Ala Gln Glu Ala Lys Phe Ala Ser Gly Ser Ser Ser Arg     130 135 140 Ser Phe Tyr Gly Pro Leu Gly Ala Trp Asp Lys Asp Ser Gly Glu Ile 145 150 155 160 Tyr Pro Val Glu Thr Asp Leu Lys Leu Ala Met Ile Met Leu Val Leu                 165 170 175 Glu Asp Lys Lys Lys Pro Ile Ser Ser Arg Asp Gly Met Lys Leu Cys             180 185 190 Val Glu Thr Ser Thr Thr Phe Asp Asp Trp Val Arg Gln Ser Glu Lys         195 200 205 Asp Tyr Gln Asp Met Leu Ile Tyr Leu Lys Glu Asn Asp Phe Ala Lys     210 215 220 Ile Gly Glu Leu Thr Glu Lys Asn Ala Leu Ala Met His Ala Thr Thr 225 230 235 240 Lys Thr Ala Ser Pro Ala Phe Ser Tyr Leu Thr Asp Ala Ser Tyr Glu                 245 250 255 Ala Met Ala Phe Val Arg Gln Leu Arg Glu Lys Gly Glu Ala Cys Tyr             260 265 270 Phe Thr Met Asp Ala Gly Pro Asn Val Lys Val Phe Cys Gln Glu Lys         275 280 285 Asp Leu Glu His Leu Ser Glu Ile Phe Gly Gln Arg Tyr Arg Leu Ile     290 295 300 Val Ser Lys Thr Lys Asp Leu Ser Gln Asp Asp Cys Cys 305 310 315 <210> 6 <211> 182 <212> PRT <213> E. 
coli K12 MG1655 <400> 6 Met Gln Thr Glu His Val Ile Leu Leu Asn Ala Gln Gly Val Pro Thr   1 5 10 15 Gly Thr Leu Glu Lys Tyr Ala Ala His Thr Ala Asp Thr Arg Leu His              20 25 30 Leu Ala Phe Ser Ser Trp Leu Phe Asn Ala Lys Gly Gln Leu Leu Val          35 40 45 Thr Arg Arg Ala Leu Ser Lys Lys Ala Trp Pro Gly Val Trp Thr Asn      50 55 60 Ser Val Cys Gly His Pro Gln Leu Gly Glu Ser Asn Glu Asp Ala Val  65 70 75 80 Ile Arg Arg Cys Arg Tyr Glu Leu Gly Val Glu Ile Thr Pro Pro Glu                  85 90 95 Ser Ile Tyr Pro Asp Phe Arg Tyr Arg Ala Thr Asp Pro Ser Gly Ile             100 105 110 Val Glu Asn Glu Val Cys Pro Val Phe Ala Ala Arg Thr Thr Ser Ala         115 120 125 Leu Gln Ile Asn Asp Asp Glu Val Met Asp Tyr Gln Trp Cys Asp Leu     130 135 140 Ala Asp Val Leu His Gly Ile Asp Ala Thr Pro Trp Ala Phe Ser Pro 145 150 155 160 Trp Met Val Met Gln Ala Thr Asn Arg Glu Ala Arg Lys Arg Leu Ser                 165 170 175 Ala Phe Thr Gln Leu Lys             180 <210> 7 <211> 307 <212> PRT <213> Pantoea agglomerans <400> 7 Met Val Ser Gly Ser Lys Ala Gly Val Ser Pro His Arg Glu Ile Glu   1 5 10 15 Val Met Arg Gln Ser Ile Asp Asp His Leu Ala Gly Leu Leu Pro Glu              20 25 30 Thr Asp Ser Gln Asp Ile Val Ser Leu Ala Met Arg Glu Gly Val Met          35 40 45 Ala Pro Gly Lys Arg Ile Arg Pro Leu Leu Met Leu Leu Ala Ala Arg      50 55 60 Asp Leu Arg Tyr Gln Gly Ser Met Pro Thr Leu Leu Asp Leu Ala Cys  65 70 75 80 Ala Val Glu Leu Thr His Thr Ala Ser Leu Met Leu Asp Asp Met Pro                  85 90 95 Cys Met Asp Asn Ala Glu Leu Arg Arg Gly Gln Pro Thr Thr His Lys             100 105 110 Lys Phe Gly Glu Ser Val Ala Ile Leu Ala Ser Val Gly Leu Leu Ser         115 120 125 Lys Ala Phe Gly Leu Ile Ala Ala Thr Gly Asp Leu Pro Gly Glu Arg     130 135 140 Arg Ala Gln Ala Val Asn Glu Leu Ser Thr Ala Val Gly Val Gln Gly 145 150 155 160 Leu Val Leu Gly Gln Phe Arg Asp Leu Asn Asp Ala Ala Leu Asp Arg                 165 170 175 Thr Pro Asp Ala Ile Leu Ser Thr Asn His Leu Lys 
Thr Gly Ile Leu             180 185 190 Phe Ser Ala Met Leu Gln Ile Val Ala Ile Ala Ser Ala Ser Ser Pro         195 200 205 Ser Thr Arg Glu Thr Leu His Ala Phe Ala Leu Asp Phe Gly Gln Ala     210 215 220 Phe Gln Leu Leu Asp Asp Leu Arg Asp Asp His Pro Glu Thr Gly Lys 225 230 235 240 Asp Arg Asn Lys Asp Ala Gly Lys Ser Thr Leu Val Asn Arg Leu Gly                 245 250 255 Ala Asp Ala Ala Arg Gln Lys Leu Arg Glu His Ile Asp Ser Ala Asp             260 265 270 Lys His Leu Thr Phe Ala Cys Pro Gln Gly Gly Ala Ile Arg Gln Phe         275 280 285 Met His Leu Trp Phe Gly His His Leu Ala Asp Trp Ser Pro Val Met     290 295 300 Lys Ile Ala 305 <210> 8 <211> 309 <212> PRT <213> Pantoea agglomerans <400> 8 Met Ser Gln Pro Pro Leu Leu Asp His Ala Thr Gln Thr Met Ala Asn   1 5 10 15 Gly Ser Lys Ser Phe Ala Thr Ala Ala Lys Leu Phe Asp Pro Ala Thr              20 25 30 Arg Arg Ser Val Leu Met Leu Tyr Thr Trp Cys Arg His Cys Asp Asp          35 40 45 Val Ile Asp Asp Gln Thr His Gly Phe Ala Ser Glu Ala Ala Ala Glu      50 55 60 Glu Glu Ala Thr Gln Arg Leu Ala Arg Leu Arg Thr Leu Thr Leu Ala  65 70 75 80 Ala Phe Glu Gly Ala Glu Met Gln Asp Pro Ala Phe Ala Ala Phe Gln                  85 90 95 Glu Val Ala Leu Thr His Gly Ile Thr Pro Arg Met Ala Leu Asp His             100 105 110 Leu Asp Gly Phe Ala Met Asp Val Ala Gln Thr Arg Tyr Val Thr Phe         115 120 125 Glu Asp Thr Leu Arg Tyr Cys Tyr His Val Ala Gly Val Val Gly Leu     130 135 140 Met Met Ala Arg Val Met Gly Val Arg Asp Glu Arg Val Leu Asp Arg 145 150 155 160 Ala Cys Asp Leu Gly Leu Ala Phe Gln Leu Thr Asn Ile Ala Arg Asp                 165 170 175 Ile Ile Asp Asp Ala Ala Ile Asp Arg Cys Tyr Leu Pro Ala Glu Trp             180 185 190 Leu Gln Asp Ala Gly Leu Thr Pro Glu Asn Tyr Ala Ala Arg Glu Asn         195 200 205 Arg Ala Ala Leu Ala Arg Val Ala Glu Arg Leu Ile Asp Ala Ala Glu     210 215 220 Pro Tyr Tyr Ile Ser Ser Gln Ala Gly Leu His Asp Leu Pro Pro Arg 225 230 235 240 Cys Ala Trp Ala Ile Ala Thr Ala Arg Ser Val Tyr 
Arg Glu Ile Gly                 245 250 255 Ile Lys Val Lys Ala Ala Gly Gly Ser Ala Trp Asp Arg Arg Gln His             260 265 270 Thr Ser Lys Gly Glu Lys Ile Ala Met Leu Met Ala Ala Pro Gly Gln         275 280 285 Val Ile Arg Ala Lys Thr Thr Arg Val Thr Pro Arg Pro Ala Gly Leu     290 295 300 Trp Gln Arg Pro Val 305 <210> 9 <211> 486 <212> PRT <213> Pantoea agglomerans <400> 9 Met Lys Lys Thr Val Val Ile Gly Ala Gly Phe Gly Gly Leu Ala Leu   1 5 10 15 Ala Ile Arg Leu Gln Ala Ala Gly Ile Pro Thr Val Leu Leu Glu Gln              20 25 30 Arg Asp Lys Pro Gly Gly Arg Ala Tyr Val Trp His Asp Gln Gly Phe          35 40 45 Thr Phe Asp Ala Gly Pro Thr Val Ile Thr Asp Pro Thr Ala Leu Glu      50 55 60 Ala Leu Phe Thr Leu Ala Gly Arg Arg Met Glu Asp Tyr Val Arg Leu  65 70 75 80 Leu Pro Val Lys Pro Phe Tyr Arg Leu Cys Trp Glu Ser Gly Lys Thr                  85 90 95 Leu Asp Tyr Ala Asn Asp Ser Ala Glu Leu Glu Ala Gln Ile Thr Gln             100 105 110 Phe Asn Pro Arg Asp Val Glu Gly Tyr Arg Arg Phe Leu Ala Tyr Ser         115 120 125 Gln Ala Val Phe Gln Glu Gly Tyr Leu Arg Leu Gly Ser Val Pro Phe     130 135 140 Leu Ser Phe Arg Asp Met Leu Arg Ala Gly Pro Gln Leu Leu Lys Leu 145 150 155 160 Gln Ala Trp Gln Ser Val Tyr Gln Ser Val Ser Arg Phe Ile Glu Asp                 165 170 175 Glu His Leu Arg Gln Ala Phe Ser Phe His Ser Leu Leu Val Gly Gly             180 185 190 Asn Pro Phe Thr Thr Ser Ser Ile Tyr Thr Leu Ile His Ala Leu Glu         195 200 205 Arg Glu Trp Gly Val Trp Phe Pro Glu Gly Gly Thr Gly Ala Leu Val     210 215 220 Asn Gly Met Val Lys Leu Phe Thr Asp Leu Gly Gly Glu Ile Glu Leu 225 230 235 240 Asn Ala Arg Val Glu Glu Leu Val Val Ala Asp Asn Arg Val Ser Gln                 245 250 255 Val Arg Leu Ala Asp Gly Arg Ile Phe Asp Thr Asp Ala Val Ala Ser             260 265 270 Asn Ala Asp Val Val Asn Thr Tyr Lys Lys Leu Leu Gly His His Pro         275 280 285 Val Gly Gln Lys Arg Ala Ala Ala Leu Glu Arg Lys Ser Met Ser Asn     290 295 300 Ser Leu Phe Val Leu Tyr Phe 
Gly Leu Asn Gln Pro His Ser Gln Leu 305 310 315 320 Ala His His Thr Ile Cys Phe Gly Pro Arg Tyr Arg Glu Leu Ile Asp                 325 330 335 Glu Ile Phe Thr Gly Ser Ala Leu Ala Asp Asp Phe Ser Leu Tyr Leu             340 345 350 His Ser Pro Cys Val Thr Asp Pro Ser Leu Ala Pro Pro Gly Cys Ala         355 360 365 Ser Phe Tyr Val Leu Ala Pro Val Pro His Leu Gly Asn Ala Pro Leu     370 375 380 Asp Trp Ala Gln Glu Gly Pro Lys Leu Arg Asp Arg Ile Phe Asp Tyr 385 390 395 400 Leu Glu Glu Arg Tyr Met Pro Gly Leu Arg Ser Gln Leu Val Thr Gln                 405 410 415 Arg Ile Phe Thr Pro Ala Asp Phe His Asp Thr Leu Asp Ala His Leu             420 425 430 Gly Ser Ala Phe Ser Ile Glu Pro Leu Leu Thr Gln Ser Ala Trp Phe         435 440 445 Arg Pro His Asn Arg Asp Ser Asp Ile Ala Asn Leu Tyr Leu Val Gly     450 455 460 Ala Gly Thr His Pro Gly Ala Gly Ile Pro Gly Val Val Ala Ser Ala 465 470 475 480 Lys Ala Thr Ala Ser Leu                 485 <210> 10 <211> 382 <212> PRT <213> Pantoea ananatis <400> 10 Met Gln Pro His Tyr Asp Leu Ile Leu Val Gly Ala Gly Leu Ala Asn   1 5 10 15 Gly Leu Ile Ala Leu Arg Leu Gln Gln Gln Gln Pro Asp Met Arg Ile              20 25 30 Leu Leu Ile Asp Ala Ala Pro Gln Ala Gly Gly Asn His Thr Trp Ser          35 40 45 Phe His His Asp Asp Leu Thr Glu Ser Gln His Arg Trp Ile Ala Pro      50 55 60 Leu Val Val His His Trp Pro Asp Tyr Gln Val Arg Phe Pro Thr Arg  65 70 75 80 Arg Arg Lys Leu Asn Ser Gly Tyr Phe Cys Ile Thr Ser Gln Arg Phe                  85 90 95 Ala Glu Val Leu Gln Arg Gln Phe Gly Pro His Leu Trp Met Asp Thr             100 105 110 Ala Val Ala Glu Val Asn Ala Glu Ser Val Arg Leu Lys Lys Gly Gln         115 120 125 Val Ile Gly Ala Arg Ala Val Ile Asp Gly Arg Gly Tyr Ala Ala Asn     130 135 140 Ser Ala Leu Ser Val Gly Phe Gln Ala Phe Ile Gly Gln Glu Trp Arg 145 150 155 160 Leu Ser His Pro His Gly Leu Ser Ser Pro Ile Ile Met Asp Ala Thr                 165 170 175 Val Asp Gln Gln Asn Gly Tyr Arg Phe Val Tyr Ser Leu Pro Leu Ser             180 
185 190 Pro Thr Arg Leu Leu Ile Glu Asp Thr His Tyr Ile Asp Asn Ala Thr         195 200 205 Leu Asp Pro Glu Cys Ala Arg Gln Asn Ile Cys Asp Tyr Ala Ala Gln     210 215 220 Gln Gly Trp Gln Leu Gln Thr Leu Leu Arg Glu Glu Gln Gly Ala Leu 225 230 235 240 Pro Ile Thr Leu Ser Gly Asn Ala Asp Ala Phe Trp Gln Gln Arg Pro                 245 250 255 Leu Ala Cys Ser Gly Leu Arg Ala Gly Leu Phe His Pro Thr Thr Gly             260 265 270 Tyr Ser Leu Pro Leu Ala Val Ala Val Ala Asp Arg Leu Ser Ala Leu         275 280 285 Asp Val Phe Thr Ser Ala Ser Ile His His Ala Ile Thr His Phe Ala     290 295 300 Arg Glu Arg Trp Gln Gln Gln Gly Phe Phe Arg Met Leu Asn Arg Met 305 310 315 320 Leu Phe Leu Ala Gly Pro Ala Asp Ser Arg Trp Arg Val Met Gln Arg                 325 330 335 Phe Tyr Gly Leu Pro Glu Asp Leu Ile Ala Arg Phe Tyr Ala Gly Lys             340 345 350 Leu Thr Leu Thr Asp Arg Leu Arg Ile Leu Ser Gly Lys Pro Pro Val         355 360 365 Pro Val Leu Ala Ala Leu Gln Ala Ile Met Thr Thr His Arg     370 375 380 <210> 11 <211> 620 <212> PRT <213> E. 
coli K12 MG1655 <400> 11 Met Ser Phe Asp Ile Ala Lys Tyr Pro Thr Leu Ala Leu Val Asp Ser   1 5 10 15 Thr Gln Glu Leu Arg Leu Leu Pro Lys Glu Ser Leu Pro Lys Leu Cys              20 25 30 Asp Glu Leu Arg Arg Tyr Leu Leu Asp Ser Val Ser Arg Ser Ser Gly          35 40 45 His Phe Ala Ser Gly Leu Gly Thr Val Glu Leu Thr Val Ala Leu His      50 55 60 Tyr Val Tyr Asn Thr Pro Phe Asp Gln Leu Ile Trp Asp Val Gly His  65 70 75 80 Gln Ala Tyr Pro His Lys Ile Leu Thr Gly Arg Arg Asp Lys Ile Gly                  85 90 95 Thr Ile Arg Gln Lys Gly Gly Leu His Pro Phe Pro Trp Arg Gly Glu             100 105 110 Ser Glu Tyr Asp Val Leu Ser Val Gly His Ser Ser Thr Ser Ile Ser         115 120 125 Ala Gly Ile Gly Ile Ala Val Ala Ala Glu Lys Glu Gly Lys Asn Arg     130 135 140 Arg Thr Val Cys Val Ile Gly Asp Gly Ala Ile Thr Ala Gly Met Ala 145 150 155 160 Phe Glu Ala Met Asn His Ala Gly Asp Ile Arg Pro Asp Met Leu Val                 165 170 175 Ile Leu Asn Asp Asn Glu Met Ser Ile Ser Glu Asn Val Gly Ala Leu             180 185 190 Asn Asn His Leu Ala Gln Leu Leu Ser Gly Lys Leu Tyr Ser Ser Leu         195 200 205 Arg Glu Gly Gly Lys Lys Val Phe Ser Gly Val Pro Pro Ile Lys Glu     210 215 220 Leu Leu Lys Arg Thr Glu Glu His Ile Lys Gly Met Val Val Pro Gly 225 230 235 240 Thr Leu Phe Glu Glu Leu Gly Phe Asn Tyr Ile Gly Pro Val Asp Gly                 245 250 255 His Asp Val Leu Gly Leu Ile Thr Thr Leu Lys Asn Met Arg Asp Leu             260 265 270 Lys Gly Pro Gln Phe Leu His Ile Met Thr Lys Lys Gly Arg Gly Tyr         275 280 285 Glu Pro Ala Glu Lys Asp Pro Ile Thr Phe His Ala Val Pro Lys Phe     290 295 300 Asp Pro Ser Ser Gly Cys Leu Pro Lys Ser Ser Gly Gly Leu Pro Ser 305 310 315 320 Tyr Ser Lys Ile Phe Gly Asp Trp Leu Cys Glu Thr Ala Ala Lys Asp                 325 330 335 Asn Lys Leu Met Ala Ile Thr Pro Ala Met Arg Glu Gly Ser Gly Met             340 345 350 Val Glu Phe Ser Arg Lys Phe Pro Asp Arg Tyr Phe Asp Val Ala Ile         355 360 365 Ala Glu Gln His Ala Val Thr Phe Ala Ala 
Gly Leu Ala Ile Gly Gly     370 375 380 Tyr Lys Pro Ile Val Ala Ile Tyr Ser Thr Phe Leu Gln Arg Ala Tyr 385 390 395 400 Asp Gln Val Leu His Asp Val Ala Ile Gln Lys Leu Pro Val Leu Phe                 405 410 415 Ala Ile Asp Arg Ala Gly Ile Val Gly Ala Asp Gly Gln Thr His Gln             420 425 430 Gly Ala Phe Asp Leu Ser Tyr Leu Arg Cys Ile Pro Glu Met Val Ile         435 440 445 Met Thr Pro Ser Asp Glu Asn Glu Cys Arg Gln Met Leu Tyr Thr Gly     450 455 460 Tyr His Tyr Asn Asp Gly Pro Ser Ala Val Arg Tyr Pro Arg Gly Asn 465 470 475 480 Ala Val Gly Val Glu Leu Thr Pro Leu Glu Lys Leu Pro Ile Gly Lys                 485 490 495 Gly Ile Val Lys Arg Arg Gly Glu Lys Leu Ala Ile Leu Asn Phe Gly             500 505 510 Thr Leu Met Pro Glu Ala Ala Lys Val Ala Glu Ser Leu Asn Ala Thr         515 520 525 Leu Val Asp Met Arg Phe Val Lys Pro Leu Asp Glu Ala Leu Ile Leu     530 535 540 Glu Met Ala Ala Ser His Glu Ala Leu Val Thr Val Glu Glu Asn Ala 545 550 555 560 Ile Met Gly Gly Ala Gly Ser Gly Val Asn Glu Val Leu Met Ala His                 565 570 575 Arg Lys Pro Val Pro Val Leu Asn Ile Gly Leu Pro Asp Phe Phe Ile             580 585 590 Pro Gln Gly Thr Gln Glu Glu Met Arg Ala Glu Leu Gly Leu Asp Ala         595 600 605 Ala Gly Met Glu Ala Lys Ile Lys Ala Trp Leu Ala     610 615 620 <210> 12 <211> 305 <212> PRT <213> Haematococcus pluvialis <400> 12 Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His Ile Pro Arg Val Asn   1 5 10 15 Ser Ala Gln Gln Pro Ser Cys Ala His Ala Arg Leu Gln Phe Lys Leu              20 25 30 Arg Ser Met Gln Met Thr Leu Met Gln Pro Ser Ile Ser Ala Asn Leu          35 40 45 Ser Arg Ala Glu Asp Arg Thr Asp His Met Arg Gly Ala Ser Thr Trp      50 55 60 Ala Gly Gly Gln Ser Gln Asp Glu Leu Met Leu Lys Asp Glu Cys Ile  65 70 75 80 Leu Val Asp Val Glu Asp Asn Ile Thr Gly His Ala Ser Lys Leu Glu                  85 90 95 Cys His Lys Phe Leu Pro His Gln Pro Ala Gly Leu Leu His Arg Ala             100 105 110 Phe Ser Val Phe Leu Phe Asp Asp Gln Gly Arg Leu Leu Leu 
Gln Gln         115 120 125 Arg Ala Arg Ser Lys Ile Thr Phe Pro Ser Val Trp Thr Asn Thr Cys     130 135 140 Cys Ser His Pro Leu His Gly Gln Thr Pro Asp Glu Val Asp Gln Leu 145 150 155 160 Ser Gln Val Ala Asp Gly Thr Val Pro Gly Ala Lys Ala Ala Ala Ile                 165 170 175 Arg Lys Leu Glu His Glu Leu Gly Ile Pro Ala His Gln Leu Pro Ala             180 185 190 Ser Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys Ala Ala Asp Val         195 200 205 Gln Pro Ala Ala Thr Gln Ser Ala Leu Trp Gly Glu His Glu Met Asp     210 215 220 Tyr Ile Leu Phe Ile Arg Ala Asn Val Thr Leu Ala Pro Asn Pro Asp 225 230 235 240 Glu Val Asp Glu Val Arg Tyr Val Thr Gln Glu Glu Leu Arg Gln Met                 245 250 255 Met Gln Pro Asp Asn Gly Leu Gln Trp Ser Pro Trp Phe Arg Ile Ile             260 265 270 Ala Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu Asp Ala Ala Leu         275 280 285 Asn Thr Asp Lys His Glu Asp Trp Gly Thr Val His His Ile Asn Glu     290 295 300 Ala 305 <210> 13 <211> 275 <212> PRT <213> Uncultured marine bacterium 66A03 <400> 13 Met Gly Leu Met Leu Ile Asp Trp Cys Ala Leu Ala Leu Val Val Phe   1 5 10 15 Ile Gly Leu Pro His Gly Ala Leu Asp Ala Ala Ile Ser Phe Ser Met              20 25 30 Ile Ser Ser Ala Lys Arg Ile Ala Arg Leu Ala Gly Ile Leu Leu Ile          35 40 45 Tyr Leu Leu Leu Ala Thr Ala Phe Phe Leu Ile Trp Tyr Gln Leu Pro      50 55 60 Ala Phe Ser Leu Leu Ile Phe Leu Leu Ile Ser Ile Ile His Phe Gly  65 70 75 80 Met Ala Asp Phe Asn Ala Ser Pro Ser Lys Leu Lys Trp Pro His Ile                  85 90 95 Ile Ala His Gly Gly Val Val Thr Val Trp Leu Pro Leu Ile Gln Lys             100 105 110 Asn Glu Val Thr Lys Leu Phe Ser Ile Leu Thr Asn Gly Pro Thr Pro         115 120 125 Ile Leu Trp Asp Ile Leu Leu Ile Phe Phe Leu Cys Trp Ser Ile Gly     130 135 140 Val Cys Leu His Thr Tyr Glu Thr Leu Arg Ser Lys His Tyr Asn Ile 145 150 155 160 Ala Phe Glu Leu Ile Gly Leu Ile Phe Leu Ala Trp Tyr Ala Pro Pro                 165 170 175 Leu Val Thr Phe Ala Thr Tyr Phe Cys Phe Ile 
His Ser Arg Arg His             180 185 190 Phe Ser Phe Val Trp Lys Gln Leu Gln His Met Ser Ser Lys Lys Met         195 200 205 Met Ile Gly Ser Ala Ile Ile Leu Ser Cys Thr Ser Trp Leu Ile Gly     210 215 220 Gly Gly Ile Tyr Phe Phe Leu Asn Ser Lys Met Ile Ala Ser Glu Ala 225 230 235 240 Ala Leu Gln Thr Val Phe Ile Gly Leu Ala Ala Leu Thr Val Pro His                 245 250 255 Met Ile Leu Ile Asp Phe Ile Phe Arg Pro His Ser Ser Arg Ile Lys             260 265 270 Ile Lys Asn         275 <210> 14 <211> 566 <212> PRT <213> Mus musculus <400> 14 Met Glu Ile Ile Phe Gly Gln Asn Lys Lys Glu Gln Leu Glu Pro Val   1 5 10 15 Gln Ala Lys Val Thr Gly Ser Ile Pro Ala Trp Leu Gln Gly Thr Leu              20 25 30 Leu Arg Asn Gly Pro Gly Met His Thr Val Gly Glu Ser Lys Tyr Asn          35 40 45 His Trp Phe Asp Gly Leu Ala Leu Leu His Ser Phe Ser Ile Arg Asp      50 55 60 Gly Glu Val Phe Tyr Arg Ser Lys Tyr Leu Gln Ser Asp Thr Tyr Ile  65 70 75 80 Ala Asn Ile Glu Ala Asn Arg Ile Val Val Ser Glu Phe Gly Thr Met                  85 90 95 Ala Tyr Pro Asp Pro Cys Lys Asn Ile Phe Ser Lys Ala Phe Ser Tyr             100 105 110 Leu Ser His Thr Ile Pro Asp Phe Thr Asp Asn Cys Leu Ile Asn Ile         115 120 125 Met Lys Cys Gly Glu Asp Phe Tyr Ala Thr Thr Glu Thr Asn Tyr Ile     130 135 140 Arg Lys Ile Asp Pro Gln Thr Leu Glu Thr Leu Glu Lys Val Asp Tyr 145 150 155 160 Arg Lys Tyr Val Ala Val Asn Leu Ala Thr Ser His Pro His Tyr Asp                 165 170 175 Glu Ala Gly Asn Val Leu Asn Met Gly Thr Ser Val Val Asp Lys Gly             180 185 190 Arg Thr Lys Tyr Val Ile Phe Lys Ile Pro Ala Thr Val Pro Asp Ser         195 200 205 Lys Lys Lys Gly Lys Ser Pro Val Lys His Ala Glu Val Phe Cys Ser     210 215 220 Ile Ser Ser Arg Ser Leu Leu Ser Pro Ser Tyr Tyr His Ser Phe Gly 225 230 235 240 Val Thr Glu Asn Tyr Val Val Phe Leu Glu Gln Pro Phe Lys Leu Asp                 245 250 255 Ile Leu Lys Met Ala Thr Ala Tyr Met Arg Gly Val Ser Trp Ala Ser             260 265 270 Cys Met Ser Phe Asp Arg 
Glu Asp Lys Thr Tyr Ile His Ile Ile Asp         275 280 285 Gln Arg Thr Arg Lys Pro Val Pro Thr Lys Phe Tyr Thr Asp Pro Met     290 295 300 Val Val Phe His His Val Asn Ala Tyr Glu Glu Asp Gly Cys Val Leu 305 310 315 320 Phe Asp Val Ile Ala Tyr Glu Asp Ser Ser Leu Tyr Gln Leu Phe Tyr                 325 330 335 Leu Ala Asn Leu Asn Lys Asp Phe Glu Glu Lys Ser Arg Leu Thr Ser             340 345 350 Val Pro Thr Leu Arg Arg Phe Ala Val Pro Leu His Val Asp Lys Asp         355 360 365 Ala Glu Val Gly Ser Asn Leu Val Lys Val Ser Ser Thr Thr Ala Thr     370 375 380 Ala Leu Lys Glu Lys Asp Gly His Val Tyr Cys Gln Pro Glu Val Leu 385 390 395 400 Tyr Glu Gly Leu Glu Leu Pro Arg Ile Asn Tyr Ala Tyr Asn Gly Lys                 405 410 415 Pro Tyr Arg Tyr Ile Phe Ala Ala Glu Val Gln Trp Ser Pro Val Pro             420 425 430 Thr Lys Ile Leu Lys Tyr Asp Ile Leu Thr Lys Ser Ser Leu Lys Trp         435 440 445 Ser Glu Glu Ser Cys Trp Pro Ala Glu Pro Leu Phe Val Pro Thr Pro     450 455 460 Gly Ala Lys Asp Glu Asp Asp Gly Val Ile Leu Ser Ala Ile Val Ser 465 470 475 480 Thr Asp Pro Gln Lys Leu Pro Phe Leu Leu Ile Leu Asp Ala Lys Ser                 485 490 495 Phe Thr Glu Leu Ala Arg Ala Ser Val Asp Ala Asp Met His Leu Asp             500 505 510 Leu His Gly Leu Phe Ile Pro Asp Ala Asp Trp Asn Ala Val Lys Gln         515 520 525 Thr Pro Ala Glu Thr Gln Glu Val Glu Asn Ser Asp His Pro Thr Asp     530 535 540 Pro Thr Ala Pro Glu Leu Ser His Ser Glu Asn Asp Phe Thr Ala Gly 545 550 555 560 His Gly Gly Ser Ser Leu                 565 <210> 15 <211> 350 <212> PRT <213> Natronomonas pharaonis ATCC35678 <400> 15 Met Ser Asn Ala Ser Leu Arg Pro Ser Gly Thr Ala Ser Ala Thr Leu   1 5 10 15 Phe Arg Leu Ala Phe Leu Pro Gly Trp Ala Val Ile Ala Ala Thr Thr              20 25 30 Gly Ala Phe Leu Val Gly Ala Ser Leu Pro Leu Thr Tyr Gln Leu Ile          35 40 45 Pro Leu Ala Ala Ser Val Val Leu Leu Gly Leu Pro His Gly Ala Val      50 55 60 Asp His Leu Ala Leu Pro Arg Thr Arg Asn Glu Arg Val Thr Val Arg  
65 70 75 80 Trp Leu Ala Ala Ile Gly Val Leu Tyr Ala Val Val Gly Gly Leu Tyr                  85 90 95 Ala Ala Val Trp Phe Leu Ala Pro Val Gly Ala Val Ala Ala Phe Ile             100 105 110 Phe Met Thr Trp Val His Trp Gly Gln Gly Glu Ile Tyr Pro Leu Val         115 120 125 Ala Leu Ala Asp Ala Asp His Leu Asp Gly Arg Leu Glu Arg Gly Leu     130 135 140 Thr Ala Ala Ile Arg Gly Ala Leu Pro Met Leu Val Pro Phe Val Ala 145 150 155 160 Phe Pro Asp Gln Tyr Glu Leu Val Val Thr Thr Leu Val Gly Leu Phe                 165 170 175 Asp Ala Asp Ala Ala Ala Thr Ala Ala Ala Ala Phe Thr Pro Thr Ala             180 185 190 Arg Leu Ala Val Ala Val Thr Val Gly Gly Leu Val Ala Val Thr Leu         195 200 205 Gly Ile Gly Ala Val Ala Ala Ser Glu Thr Gly Trp Gly Pro Trp Leu     210 215 220 Leu Asp Ala Gly Glu Thr Gly Leu Leu Ile Leu Phe Phe Ala Ala Val 225 230 235 240 Pro Pro Ile Phe Ala Ile Gly Leu Tyr Phe Cys Phe Trp His Ser Leu                 245 250 255 Arg His Ile Val Arg Leu Leu Ala Val Asp Asn Arg Ala Ala Pro Ala             260 265 270 Leu Asp Gly Arg Arg Tyr Gly Ala Ala Leu Ala Arg Phe Ala Arg Asp         275 280 285 Ala Ala Pro Leu Ser Ala Ala Ser Leu Val Leu Leu Gly Leu Leu Tyr     290 295 300 Leu Ala Val Pro Gly Ser Val Asp Ser Pro Leu Ser Leu Val Gly Thr 305 310 315 320 Tyr Leu Val Leu Ile Ala Val Leu Thr Leu Pro His Val Val Val Val                 325 330 335 Ala Trp Met Asp His Glu Gln Arg Leu Trp Arg Pro Gly Ala             340 345 350 <210> 16 <211> 284 <212> PRT <213> Halobacterium salinarum ATCC700922 <400> 16 Met Pro His Gly Ala Ile Asp Tyr Leu Ala Leu Pro Arg Ala Val Thr   1 5 10 15 Gly Thr Val Thr Val Arg Trp Leu Ala Val Val Gly Val Leu Tyr Leu              20 25 30 Val Leu Gly Gly Gly Tyr Ala Ala Ala Trp Phe Phe Ala Pro Val Pro          35 40 45 Ala Ala Phe Ala Phe Val Ala Ile Thr Trp Leu His Trp Gly Gln Gly      50 55 60 Asp Leu Tyr Pro Leu Leu Asp Phe Leu Asp Val Asp Tyr Leu Asp Thr  65 70 75 80 Arg Pro Arg Arg Ala Ala Thr Val Leu Ile Arg Gly Gly Leu Pro Met     
             85 90 95 Leu Val Pro Leu Leu Gly Phe Pro Glu Arg Tyr Arg Ser Val Val Asp             100 105 110 Ala Phe Ala Ala Pro Phe Gly Gly Ser Val Gly Asp Leu Ala Val Phe         115 120 125 Asp Pro Arg Val Arg Leu Trp Leu Gly Val Ala Phe Ala Ala Ala Thr     130 135 140 Val Ala Val Leu Ala Ala Gly Arg Arg Arg Thr Arg Ser His Gly Ala 145 150 155 160 Trp Arg Val Asp Ala Ala Glu Thr Leu Leu Leu Trp Val Phe Phe Phe                 165 170 175 Val Val Pro Pro Val Phe Ala Val Gly Val Tyr Phe Cys Val Trp His             180 185 190 Ser Val Arg His Val Ala Arg Ala Ile Ala Val Asp Gly Ser Val His         195 200 205 Pro Ser Leu Arg Ala Gly Asp Ile Leu Gly Pro Leu Ala Arg Phe Gly     210 215 220 Val Glu Ala Ala Pro Met Thr Ala Ala Ala Leu Ala Leu Gly Gly Val 225 230 235 240 Leu Trp Trp Ala Val Pro Asn Pro Pro Thr Thr Leu Glu Ser Gly Ala                 245 250 255 Ala Leu Tyr Leu Val Leu Ile Ala Val Leu Thr Leu Pro His Val Ala             260 265 270 Val Val Thr Trp Met Asp Arg Val Gln Gly Val Leu         275 280 <210> 17 <211> 284 <212> PRT <213> Halobacterium salinarum ATCC700922 <400> 17 Met Pro His Gly Ala Ile Asp Tyr Leu Ala Leu Pro Arg Ala Val Thr   1 5 10 15 Gly Thr Val Thr Val Arg Trp Leu Ala Val Val Gly Val Leu Tyr Leu              20 25 30 Val Leu Gly Gly Gly Tyr Ala Ala Ala Trp Phe Phe Ala Pro Val Pro          35 40 45 Ala Ala Phe Ala Phe Val Ala Ile Thr Trp Leu His Trp Gly Gln Gly      50 55 60 Asp Leu Tyr Pro Leu Leu Asp Phe Leu Asp Val Asp Tyr Leu Asp Thr  65 70 75 80 Arg Pro Arg Arg Ala Ala Thr Val Leu Ile Arg Gly Gly Leu Pro Met                  85 90 95 Leu Val Pro Leu Leu Gly Phe Pro Glu Arg Tyr Arg Ser Val Val Asp             100 105 110 Ala Phe Ala Ala Pro Phe Gly Gly Ser Val Gly Asp Leu Ala Val Phe         115 120 125 Asp Pro Arg Val Arg Leu Trp Leu Gly Val Ala Phe Ala Ala Ala Thr     130 135 140 Val Ala Val Leu Ala Ala Gly Arg Arg Arg Thr Arg Ser His Gly Ala 145 150 155 160 Trp Arg Val Asp Ala Ala Glu Thr Leu Leu Leu Trp Val Phe Phe Phe                 165 
170 175 Val Val Pro Pro Val Phe Ala Val Gly Val Tyr Phe Cys Val Trp His             180 185 190 Ser Val Arg His Val Ala Arg Ala Ile Ala Val Asp Gly Ser Val His         195 200 205 Pro Ser Leu Arg Ala Gly Asp Ile Leu Gly Pro Leu Ala Arg Phe Gly     210 215 220 Val Glu Ala Ala Pro Met Thr Ala Ala Ala Leu Ala Leu Gly Gly Val 225 230 235 240 Leu Trp Trp Ala Val Pro Asn Pro Pro Thr Thr Leu Glu Ser Gly Ala                 245 250 255 Ala Leu Tyr Leu Val Leu Ile Ala Val Leu Thr Leu Pro His Val Ala             260 265 270 Val Val Thr Trp Met Asp Arg Val Gln Gly Val Leu         275 280 <210> 18 <211> 2412 <212> DNA <213> Enterococcus faecalis <400> 18 ttgaaaacag tagttattat tgatgcatta cgaacaccaa ttggaaaata taaaggcagc 60 ttaagtcaag taagtgccgt agacttagga acacatgtta caacacaact tttaaaaaga 120 cattccacta tttctgaaga aattgatcaa gtaatctttg gaaatgtttt acaagctgga 180 aatggccaaa atcccgcacg acaaatagca ataaacagcg gtttatctca tgaaattccc 240 gcaatgacag ttaatgaggt ctgcggatca ggaatgaagg ccgttatttt ggcgaaacaa 300 ttgattcaat taggagaagc ggaagtttta attgctggcg ggattgagaa tatgtcccaa 360 gcacctaaat tacaacgatt taattacgaa acagaaagct atgatgcgcc tttttctagt 420 atgatgtacg atgggttaac ggatgccttt agtggtcaag caatgggctt aactgctgaa 480 aatgtggccg aaaagtatca tgtaactaga gaagagcaag atcaattttc tgtacattca 540 caattaaaag cagctcaagc acaagcagaa gggatattcg ctgacgaaat agccccatta 600 gaagtatcag gaacgcttgt ggagaaagat gaagggattc gccctaattc gagcgttgag 660 aagctaggaa cgcttaaaac agtttttaaa gaagacggta ctgtaacagc agggaatgca 720 tcaaccatta atgatggggc ttctgctttg attattgctt cacaagaata tgccgaagca 780 cacggtcttc cttatttagc tattattcga gacagtgtgg aagtcggtat tgatccagcc 840 tatatgggaa tttcgccgat taaagccatt caaaaactgt tagcgcgcaa tcaacttact 900 acggaagaaa ttgatctgta tgaaatcaac gaagcatttg cagcaacttc aatcgtggtc 960 caaagagaac tggctttacc agaggaaaag gtcaacattt atggtggcgg tatttcatta 1020 ggtcatgcga ttggtgccac aggtgctcgt ttattaacga gtttaagtta tcaattaaat 1080 caaaaagaaa agaaatatgg agtggcttct ttatgtatcg gcggtggctt aggactcgct 1140 atgctactag 
agagacctca gcaaaaaaaa aacagccgat tttatcaaat gagtcctgag 1200 gaacgcctgg cttctcttct taatgaaggc cagatttctg ctgatacaaa aaaagaattt 1260 gaaaatacgg ctttatcttc gcagattgcc aatcatatga ttgaaaatca aatcagtgaa 1320 acagaagtgc cgatgggcgt tggcttacat ttaacagtgg acgaaactga ttatttggta 1380 ccaatggcga cagaagagcc ctcagtgatt gcggctttga gtaatggtgc aaaaatagca 1440 caaggattta aaacagtgaa tcaacaacgt ttaatgcgtg gacaaatcgt tttttacgat 1500 gttgcagacg ccgagtcatt gattgatgaa ctacaagtaa gagaaacgga aatttttcaa 1560 caagcagagt taagttatcc atctatcgtt aaacgcggcg gcggcttaag agatttgcaa 1620 tatcgtgctt ttgatgaatc atttgtatct gtcgactttt tagtagatgt taaggatgca 1680 atgggggcaa atatcgttaa cgctatgttg gaaggtgtgg ccgagttgtt ccgtgaatgg 1740 tttgcggagc aaaagatttt attcagtatt ttaagtaatt atgccacgga gtcggttgtt 1800 acgatgaaaa cggctattcc agtttcacgt ttaagtaagg ggagcaatgg ccgggaaatt 1860 gctgaaaaaa ttgttttagc ttcacgctat gcttcattag atccttatcg ggcagtcacg 1920 cataacaaag ggatcatgaa tggcattgaa gctgtcgttt tagctacagg aaatgataca 1980 cgcgctgtta gcgcttcttg tcatgctttt gcggtgaagg aaggtcgcta ccaaggtttg 2040 actagttgga cgctggatgg cgaacaacta attggtgaaa tttcagttcc gcttgcgtta 2100 gccacggttg gcggtgccac aaaagtctta cctaaatctc aagcagctgc tgatttgtta 2160 gcagtgacgg atgcaaaaga actaagtcga gtagtagcgg ctgttggttt ggcacaaaat 2220 ttagcggcgt tacgggcctt agtctctgaa ggaattcaaa aaggacacat ggctctacaa 2280 gcacgttctt tagcgatgac ggtcggagct actggtaaag aagttgaggc agtcgctcaa 2340 caattaaaac gtcaaaaaac gatgaaccaa gaccgagcct tggctatttt aaatgattta 2400 agaaaacaat aa 2412 <210> 19 <211> 1152 <212> DNA <213> Enterococcus faecalis <400> 19 atgacaattg ggattgataa aattagtttt tttgtgcccc cttattatat tgatatgacg 60 gcactggctg aagccagaaa tgtagaccct ggaaaatttc atattggtat tgggcaagac 120 caaatggcgg tgaacccaat cagccaagat attgtgacat ttgcagccaa tgccgcagaa 180 gcgatcttga ccaaagaaga taaagaggcc attgatatgg tgattgtcgg gactgagtcc 240 agtatcgatg agtcaaaagc ggccgcagtt gtcttacatc gtttaatggg gattcaacct 300 ttcgctcgct ctttcgaaat caaggaagct tgttacggag caacagcagg cttacagtta 360 gctaagaatc 
acgtagcctt acatccagat aaaaaagtct tggttgtagc agcagatatt 420 gcaaaatatg gattaaattc tggcggtgag cctacacaag gagctggggc ggttgcaatg 480 ttagttgcta gtgaaccgcg catcttggct ttaaaagagg ataatgtgat gctgacgcaa 540 gatatctatg acttttggcg tccaacaggc catccgtatc ctatggtcga tggtcctttg 600 tcaaacgaaa cctacatcca atcttttgcc caagtctggg atgaacataa aaaaagaacc 660 ggtcttgatt ttgcagatta tgatgcttta gcgttccata ttccttacac aaaaatgggc 720 aaaaaagcct tattagcaaa aatctccgac caaactgaag cagaacagga acgaatttta 780 gcccgttatg aagaaagcat catctatagt cgtcgcgtag gaaacttgta tacgggttca 840 ctttatctgg gactcatttc ccttttagaa aatgcaacga ctttaaccgc aggcaatcaa 900 attgggttat tcagttatgg ttctggtgct gtcgctgaat ttttcactgg tgaattagta 960 gctggttatc aaaatcattt acaaaaagaa actcatttag cactgctaga taatcggaca 1020 gaactttcta tcgctgaata tgaagccatg tttgcagaaa ctttagacac agatattgat 1080 caaacgttag aagatgaatt aaaatatagt atttctgcta ttaataatac cgttcgctct 1140 tatcgaaact aa 1152 <210> 20 <211> 879 <212> DNA <213> Streptococcus pneumoniae <400> 20 atgacaaaaa aagttggtgt cggtcaggca catagtaaga taattttaat aggggaacat 60 gcggtcgttt acggttatcc tgccatttcc ctgcctcttt tggaggtgga ggtgacctgt 120 aaggtagttt ctgcagagag tccttggcgc ctttatgagg aggatacctt gtccatggcg 180 gtttatgcct cactggagta tttggatatc acagaagcct gcgttcgttg tgagattgac 240 tcggctatcc ctgagaaacg ggggatgggt tcgtcagcgg ctatcagcat agcggccatt 300 cgtgcggtat ttgactacta tcaggctgat ctgcctcatg atgtactaga aatcttggtc 360 aatcgagctg agatgattgc ccatatgaat cctagtggtt tggatgctaa gacctgtctc 420 agtgaccaac ctattcgctt tatcaagaac gtaggattta cagaacttga gatggattta 480 tccgcctatt tggtgattgc cgatacgggt gtttatggtc atactcgtga agccatccaa 540 gtggttcaaa ataagggcaa ggatgcccta ccgtttttgc atgccttggg agaattaacc 600 cagcaagcag aagttgcgat ttcacaaaaa tatgctgaag gactgggact aatcttcagt 660 caagctcatt tacatctaaa agaaattgga gtcagtagcc ctgaggcaga ctttttggtt 720 gaaacggctc ttagctatgg tgctctgggt gccaagatga gcggtggtgg gctaggaggt 780 tgtatcatag ccttggtaac caatttgacg cacgcacaag aactagcaga aagattagaa 840 gagaaaggag ctgttcagac 
atggatagag agcctgtaa 879 <210> 21 <211> 1011 <212> DNA <213> Streptococcus pneumoniae <400> 21 atgattgctg ttaaaacttg cggaaaactc tattgggcag gtgaatatgc tattttagag 60 ccagggcagt tagctttgat aaaggatatt cccatctata tgagggctga gattgctttt 120 tctgacagct accgtatcta ttcagatatg tttgatttcg cagtggactt aaggcccaat 180 cctgactaca gcttgattca agaaacgatt gctttgatgg gagacttcct cgctgttcgc 240 ggtcagaatt taagaccttt ttccctaaaa atctgtggca aaatggaacg agaagggaaa 300 aagtttggtc taggttctag tggcagcgtc gttgtcttgg ttgtcaaggc tttactggct 360 ctctataatc tttcggttga tcagaatctc ttgttcaagc tgactagcgc tgtcttgctc 420 aagcgaggag acaatggttc catgggcgac cttgcctgta ttgtggcaga ggatttggtt 480 ctttaccagt catttgatcg ccagaaggcg gctgcttggt tagaagaaga aaacttggcg 540 acagttctgg agcgtgattg gggatttttt atctcacaag tgaaaccaac tttagaatgt 600 gatttcttag tgggatggac caaggaagtg gctgtatcga gtcacatggt ccagcaaatc 660 aagcaaaata tcaatcaaaa ttttttaagt tcctcaaaag aaacggtggt ttctttggtc 720 gaagccttgg agcaggggaa agccgaaaaa gttatcgagc aagtagaagt agccagcaag 780 cttttagaag gcttgagtac agatatttac acgcctttgc ttagacagtt gaaagaagcc 840 agtcaagatt tgcaggccgt tgccaagagt agtggtgctg gtggtggtga ctgtggcatc 900 gccctgagtt ttgatgcgca atcttctcga aacactttaa aaaatcgttg ggccgatctg 960 gggattgagc tcttatatca agaaaggata ggacatgacg acaaatcgta a 1011 <210> 22 <211> 954 <212> DNA <213> Streptococcus pneumoniae <400> 22 atggatagag agcctgtaac agtacgttcc tacgcaaata ttgctattat caaatattgg 60 ggaaagaaaa aagaaaaaga gatggtgcct gctactagca gtatttctct aactttggaa 120 aatatgtata cagagacgac cttgtcgcct ttaccagcca atgtaacagc tgacgaattt 180 tacatcaatg gtcagctaca aaatgaggtc gagcatgcca agatgagtaa gattattgac 240 cgttatcgtc cagctggtga gggctttgtc cgtatcgata ctcaaaacaa tatgcctacg 300 gcagcgggcc tgtcctcaag ttctagtggt ttgtccgccc tggtcaaggc ttgtaatgct 360 tatttcaagc ttggattgga tagaagtcag ttggcacagg aagccaaatt tgcctcaggc 420 tcttcttctc ggagttttta tggaccacta ggagcctggg ataaggatag tggagaaatt 480 taccctgtag agacagactt gaaactagct atgattatgt tggtgctaga ggacaagaaa 540 aaaccaatct ctagccgtga 
cgggatgaaa ctttgtgtgg aaacctcgac gacttttgac 600 gactgggttc gtcagtctga gaaggactat caggatatgc tgatttatct caaggaaaat 660 gattttgcca agattggaga attaacggag aaaaatgctc tggctatgca tgctacgaca 720 aagactgcta gtccagcctt ttcttatctg acggatgcct cttatgaggc tatggccttt 780 gttcgccagc ttcgtgagaa aggagaggcc tgctacttta ccatggatgc tggtcccaat 840 gttaaggtct tctgtcagga gaaagacttg gagcatttgt cagaaatttt cggtcagcgt 900 tatcgcttga ttgtgtcaaa aacaaaggat ttgagtcaag atgattgctg ttaa 954 <210> 23 <211> 546 <212> DNA <213> E. coli K12 MG1655 <400> 23 atgcaaacgg aacacgtcat tttattgaat gcacagggag ttcccacggg tacgctggaa 60 aagtatgccg cacacacggc agacacccgc ttacatctcg cgttctccag ttggctgttt 120 aatgccaaag gacaattatt agttacccgc cgcgcactga gcaaaaaagc atggcctggc 180 gtgtggacta actcggtttg tgggcaccca caactgggag aaagcaacga agacgcagtg 240 atccgccgtt gccgttatga gcttggcgtg gaaattacgc ctcctgaatc tatctatcct 300 gactttcgct accgcgccac cgatccgagt ggcattgtgg aaaatgaagt gtgtccggta 360 tttgccgcac gcaccactag tgcgttacag atcaatgatg atgaagtgat ggattatcaa 420 tggtgtgatt tagcagatgt attacacggt attgatgcca cgccgtgggc gttcagtccg 480 tggatggtga tgcaggcgac aaatcgcgaa gccagaaaac gattatctgc atttacccag 540 cttaaa 546 <210> 24 <211> 918 <212> DNA <213> Haematococcus pluvialis <400> 24 atgcttcgtt cgttgctcag aggcctcacg catatccccc gcgtgaactc cgcccagcag 60 cccagctgtg cacacgcgcg actccagttt aagctcagga gcatgcagat gacgctcatg 120 cagcccagca tctcagccaa tctgtcgcgc gccgaggacc gcacagacca catgaggggt 180 gcaagcacct gggcaggcgg gcagtcgcag gatgagctga tgctgaagga cgagtgcatc 240 ttggtggatg ttgaggacaa catcacaggc catgccagca agctggagtg tcacaagttc 300 ctaccacatc agcctgcagg cctgctgcac cgggccttct ctgtgttcct gtttgacgat 360 caggggcgac tgctgctgca acagcgtgca cgctcaaaaa tcaccttccc aagtgtgtgg 420 acgaacacct gctgcagcca ccctttacat gggcagaccc cagatgaggt ggaccaacta 480 agccaggtgg ccgacggaac agtacctggc gcaaaggctg ctgccatccg caagttggag 540 cacgagctgg ggataccagc gcaccagctg ccggcaagcg cgtttcgctt cctcacgcgt 600 ttgcactact gtgccgcgga cgtgcagcca gctgcgacac aatcagcgct ctggggcgag 
660 cacgaaatgg actacatctt gttcatccgg gccaacgtca ccttggcgcc caaccctgac 720 gaggtggacg aagtcaggta cgtgacgcaa gaggagctgc ggcagatgat gcagccggac 780 aacgggctgc aatggtcgcc gtggtttcgc atcatcgccg cgcgcttcct tgagcgttgg 840 tgggctgacc tggacgcggc cctaaacact gacaaacacg aggattgggg aacggtgcat 900 cacatcaacg aagcgtga 918 <210> 25 <211> 1884 <212> DNA <213> E. coli K12 MG1655 <400> 25 atggaattca ggaggcccct gatgagtttt gatattgcca aatacccgac cctggcactg 60 gtcgactcca cccaggagtt acgactgttg ccgaaagaga gtttaccgaa actctgcgac 120 gaactgcgcc gctatttact cgacagcgtg agccgttcca gcgggcactt cgcctccggg 180 ctgggcacgg tcgaactgac cgtggcgctg cactatgtct acaacacccc gtttgaccaa 240 ttgatttggg atgtggggca tcaggcttat ccgcataaaa ttttgaccgg acgccgcgac 300 aaaatcggca ccatccgtca gaaaggcggt ctgcacccgt tcccgtggcg cggcgaaagc 360 gaatatgacg tattaagcgt cgggcattca tcaacctcca tcagtgccgg aattggtatt 420 gcggttgctg ccgaaaaaga aggcaaaaat cgccgcaccg tctgtgtcat tggcgatggc 480 gcgattaccg caggcatggc gtttgaagcg atgaatcacg cgggcgatat ccgtcctgat 540 atgctggtga ttctcaacga caatgaaatg tcgatttccg aaaatgtcgg cgcgctcaac 600 aaccatctgg cacagctgct ttccggtaag ctttactctt cactgcgcga aggcgggaaa 660 aaagttttct ctggcgtgcc gccaattaaa gagctgctca aacgcaccga agaacatatt 720 aaaggcatgg tagtgcctgg cacgttgttt gaagagctgg gctttaacta catcggcccg 780 gtggacggtc acgatgtgct ggggcttatc accacgctaa agaacatgcg cgacctgaaa 840 ggcccgcagt tcctgcatat catgaccaaa aaaggtcgtg gttatgaacc ggcagaaaaa 900 gacccgatca ctttccacgc cgtgcctaaa tttgatccct ccagcggttg tttgccgaaa 960 agtagcggcg gtttgccgag ctattcaaaa atctttggcg actggttgtg cgaaacggca 1020 gcgaaagaca acaagctgat ggcgattact ccggcgatgc gtgaaggttc cggcatggtc 1080 gagttttcac gtaaattccc ggatcgctac ttcgacgtgg caattgccga gcaacacgcg 1140 gtgacctttg ctgcgggtct ggcgattggt gggtacaaac ccattgtcgc gatttactcc 1200 actttcctgc aacgcgccta tgatcaggtg ctgcatgacg tggcgattca aaagcttccg 1260 gtcctgttcg ccatcgaccg cgcgggcatt gttggtgctg acggtcaaac ccatcagggt 1320 gcttttgatc tctcttacct gcgctgcata ccggaaatgg tcattatgac cccgagcgat 1380 gaaaacgaat 
gtcgccagat gctctatacc ggctatcact ataacgatgg cccgtcagcg 1440 gtgcgctacc cgcgtggcaa cgcggtcggc gtggaactga cgccgctgga aaaactacca 1500 attggcaaag gcattgtgaa gcgtcgtggc gagaaactgg cgatccttaa ctttggtacg 1560 ctgatgccag aagcggcgaa agtcgccgaa tcgctgaacg ccacgctggt cgatatgcgt 1620 tttgtgaaac cgcttgatga agcgttaatt ctggaaatgg ccgccagcca tgaagcgctg 1680 gtcaccgtag aagaaaacgc cattatgggc ggcgcaggca gcggcgtgaa cgaagtgctg 1740 atggcccatc gtaaaccagt acccgtgctg aacattggcc tgccggactt ctttattccg 1800 caaggaactc aggaagaaat gcgcgccgaa ctcggcctcg atgccgctgg tatggaagcc 1860 aaaatcaagg cctggctggc ataa 1884 <210> 26 <211> 924 <212> DNA <213> Pantoea agglomerans <400> 26 atggtgagtg gcagtaaagc gggcgtttcg cctcatcgcg aaatagaagt aatgagacaa 60 tccattgacg atcacctggc tggcctgtta cctgaaaccg acagccagga tatcgtcagc 120 cttgcgatgc gtgaaggcgt catggcaccc ggtaaacgga tccgtccgct gctgatgctg 180 ctggccgccc gcgacctccg ctaccagggc agtatgccta cgctgctcga tctcgcctgc 240 gccgttgaac tgacccatac cgcgtcgctg atgctcgacg acatgccctg catggacaac 300 gccgagctgc gccgcggtca gcccactacc cacaaaaaat ttggtgagag cgtggcgatc 360 cttgcctccg ttgggctgct ctctaaagcc tttggtctga tcgccgccac cggcgatctg 420 ccgggggaga ggcgtgccca ggcggtcaac gagctctcta ccgccgtggg cgtgcagggc 480 ctggtactgg ggcagtttcg cgatcttaac gatgccgccc tcgaccgtac ccctgacgct 540 atcctcagca ccaaccacct caagaccggc attctgttca gcgcgatgct gcagatcgtc 600 gccattgctt ccgcctcgtc gccgagcacg cgagagacgc tgcacgcctt cgccctcgac 660 ttcggccagg cgtttcaact gctggacgat ctgcgtgacg atcacccgga aaccggtaaa 720 gatcgcaata aggacgcggg aaaatcgacg ctggtcaacc ggctgggcgc agacgcggcc 780 cggcaaaagc tgcgcgagca tattgattcc gccgacaaac acctcacttt tgcctgtccg 840 cagggcggcg ccatccgaca gtttatgcat ctgtggtttg gccatcacct tgccgactgg 900 tcaccggtca tgaaaatcgc ctga 924 <210> 27 <211> 930 <212> DNA <213> Pantoea agglomerans <400> 27 atgagccaac cgccgctgct tgaccacgcc acgcagacca tggccaacgg ctcgaaaagt 60 tttgccaccg ctgcgaagct gttcgacccg gccacccgcc gtagcgtgct gatgctctac 120 acctggtgcc gccactgcga tgacgtcatt gacgaccaga cccacggctt 
cgccagcgag 180 gccgcggcgg aggaggaggc cacccagcgc ctggcccggc tgcgcacgct gaccctggcg 240 gcgtttgaag gggccgagat gcaggatccg gccttcgctg cctttcagga ggtggcgctg 300 acccacggta ttacgccccg catggcgctc gatcacctcg acggctttgc gatggacgtg 360 gctcagaccc gctatgtcac ctttgaggat acgctgcgct actgctatca cgtggcgggc 420 gtggtgggtc tgatgatggc cagggtgatg ggcgtgcggg atgagcgggt gctggatcgc 480 gcctgcgatc tggggctggc cttccagctg acgaatatcg cccgggatat tattgacgat 540 gcggctattg accgctgcta tctgcccgcc gagtggctgc aggatgccgg gctgaccccg 600 gagaactatg ccgcgcggga gaatcgggcc gcgctggcgc gggtggcgga gcggcttatt 660 gatgccgcag agccgtacta catctcctcc caggccgggc tacacgatct gccgccgcgc 720 tgcgcctggg cgatcgccac cgcccgcagc gtctaccggg agatcggtat taaggtaaaa 780 gcggcgggag gcagcgcctg ggatcgccgc cagcacacca gcaaaggtga aaaaattgcc 840 atgctgatgg cggcaccggg gcaggttatt cgggcgaaga cgacgagggt gacgccgcgt 900 ccggccggtc tttggcagcg tcccgtttag 930 <210> 28 <211> 1459 <212> DNA <213> Pantoea agglomerans <400> 28 atgaaaaaaa ccgttgtgat tggcgcaggc tttggtggcc tggcgctggc gattcgcctg 60 caggcggcag ggatcccaac cgtactgctg gagcagcggg acaagcccgg cggtcgggcc 120 tacgtctggc atgaccaggg ctttaccttt gacgccgggc cgacggtgat caccgatcct 180 accgcgcttg aggcgctgtt caccctggcc ggcaggcgca tggaggatta cgtcaggctg 240 ctgccggtaa aacccttcta ccgactctgc tgggagtccg ggaagaccct cgactatgct 300 aacgacagcg ccgagcttga ggcgcagatt acccagttca acccccgcga cgtcgagggc 360 taccggcgct ttctggctta ctcccaggcg gtattccagg agggatattt gcgcctcggc 420 agcgtgccgt tcctctcttt tcgcgacatg ctgcgcgccg ggccgcagct gcttaagctc 480 caggcgtggc agagcgtcta ccagtcggtt tcgcgcttta ttgaggatga gcatctgcgg 540 caggccttct cgttccactc cctgctggta ggcggcaacc ccttcaccac ctcgtccatc 600 tacaccctga tccacgccct tgagcgggag tggggggtct ggttccctga gggcggcacc 660 ggggcgctgg tgaacggcat ggtgaagctg tttaccgatc tgggcgggga gatcgaactc 720 aacgcccggg tcgaagagct ggtggtggcc gataaccgcg taagccaggt ccggctggcg 780 gatggtcgga tctttgacac cgacgccgta gcctcgaacg ctgacgtggt gaacacctat 840 aaaaagctgc tcggccacca tccggtgggg cagaagcggg cggcagcgct 
ggagcgcaag 900 agcatgagca actcgctgtt tgtgctctac ttcggcctga accagcctca ttcccagctg 960 gcgcaccata ccatctgttt tggtccccgc taccgggagc tgatcgacga gatctttacc 1020 ggcagcgcgc tggcggatga cttctcgctc tacctgcact cgccctgcgt gaccgatccc 1080 tcgctcgcgc ctcccggctg cgccagcttc tacgtgctgg ccccggtgcc gcatcttggc 1140 aacgcgccgc tggactgggc gcaggagggg ccgaagctgc gcgaccgcat ctttgactac 1200 cttgaagagc gctatatgcc cggcctgcgt agccagctgg tgacccagcg gatctttacc 1260 ccggcagact tccacgacac gctggatgcg catctgggat cggccttctc catcgagccg 1320 ctgctgaccc aaagcgcctg gttccgcccg cacaaccgcg acagcgacat tgccaacctc 1380 tacctggtgg gcgcaggtac tcaccctggg gcgggcattc ctggcgtagt ggcctcggcg 1440 aaagccaccg ccagcctga 1459 <210> 29 <211> 1149 <212> DNA <213> Pantoea ananatis <400> 29 atgcaaccgc attatgatct gattctcgtg ggggctggac tcgcgaatgg ccttatcgcc 60 ctgcgtcttc agcagcagca acctgatatg cgtattttgc ttatcgacgc cgcaccccag 120 gcgggcggga atcatacgtg gtcatttcac cacgatgatt tgactgagag ccaacatcgt 180 tggatagctc cgctggtggt tcatcactgg cccgactatc aggtacgctt tcccacacgc 240 cgtcgtaagc tgaacagcgg ctacttttgt attacttctc agcgtttcgc tgaggtttta 300 cagcgacagt ttggcccgca cttgtggatg gataccgcgg tcgcagaggt taatgcggaa 360 tctgttcggt tgaaaaaggg tcaggttatc ggtgcccgcg cggtgattga cgggcggggt 420 tatgcggcaa attcagcact gagcgtgggc ttccaggcgt ttattggcca ggaatggcga 480 ttgagccacc cgcatggttt atcgtctccc attatcatgg atgccacggt cgatcagcaa 540 aatggttatc gcttcgtgta cagcctgccg ctctcgccga ccagattgtt aattgaagac 600 acgcactata ttgataatgc gacattagat cctgaatgcg cgcggcaaaa tatttgcgac 660 tatgccgcgc aacagggttg gcagcttcag acactgctgc gagaagaaca gggcgcctta 720 cccattactc tgtcgggcaa tgccgacgca ttctggcagc agcgccccct ggcctgtagt 780 ggattacgtg ccggtctgtt ccatcctacc accggctatt cactgccgct ggcggttgcc 840 gtggccgacc gcctgagtgc acttgatgtc tttacgtcgg cctcaattca ccatgccatt 900 acgcattttg cccgcgagcg ctggcagcag cagggctttt tccgcatgct gaatcgcatg 960 ctgtttttag ccggacccgc cgattcacgc tggcgggtta tgcagcgttt ttatggttta 1020 cctgaagatt taattgcccg tttttatgcg ggaaaactca cgctgaccga tcggctacgt 
1080 attctgagcg gcaagccgcc tgttccggta ttagcagcat tgcaagccat tatgacgact 1140 catcgttaa 1149 <210> 30 <211> 4176 <212> DNA <213> Artificial Sequence <220> <223> pTrc99A vector <400> 30 gtttgacagc ttatcatcga ctgcacggtg caccaatgct tctggcgtca ggcagccatc 60 ggaagctgtg gtatggctgt gcaggtcgta aatcactgca taattcgtgt cgctcaaggc 120 gcactcccgt tctggataat gttttttgcg ccgacatcat aacggttctg gcaaatattc 180 tgaaatgagc tgttgacaat taatcatccg gctcgtataa tgtgtggaat tgtgagcgga 240 taacaatttc acacaggaaa cagaccatgg aattcgagct cggtacccgg ggatcctcta 300 gagtcgacct gcaggcatgc aagcttggct gttttggcgg atgagagaag attttcagcc 360 tgatacagat taaatcagaa cgcagaagcg gtctgataaa acagaatttg cctggcggca 420 gtagcgcggt ggtcccacct gaccccatgc cgaactcaga agtgaaacgc cgtagcgccg 480 atggtagtgt ggggtctccc catgcgagag tagggaactg ccaggcatca aataaaacga 540 aaggctcagt cgaaagactg ggcctttcgt tttatctgtt gtttgtcggt gaacgctctc 600 ctgagtagga caaatccgcc gggagcggat ttgaacgttg cgaagcaacg gcccggaggg 660 tggcgggcag gacgcccgcc ataaactgcc aggcatcaaa ttaagcagaa ggccatcctg 720 acggatggcc tttttgcgtt tctacaaact ctttttgttt atttttctaa atacattcaa 780 atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga 840 agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc 900 ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg 960 gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc 1020 gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat 1080 tatcccgtgt tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 1140 acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 1200 aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa 1260 cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 1320 gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 1380 cgatgcctac agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 1440 tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 1500 tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc 
ggtgagcgtg 1560 ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 1620 tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 1680 gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 1740 ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 1800 tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 1860 agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 1920 aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 1980 cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 2040 agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 2100 tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 2160 gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 2220 gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 2280 ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 2340 gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 2400 ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 2460 ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 2520 acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt 2580 gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 2640 cggaagagcg cctgatgcgg tattttctcc ttacgcatct gtgcggtatt tcacaccgca 2700 tatggtgcac tctcagtaca atctgctctg atgccgcata gttaagccag tatacactcc 2760 gctatcgcta cgtgactggg tcatggctgc gccccgacac ccgccaacac ccgctgacgc 2820 gccctgacgg gcttgtctgc tcccggcatc cgcttacaga caagctgtga ccgtctccgg 2880 gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa cgcgcgaggc agcagatcaa 2940 ttcgcgcgcg aaggcgaagc ggcatgcatt tacgttgaca ccatcgaatg gtgcaaaacc 3000 tttcgcggta tggcatgata gcgcccggaa gagagtcaat tcagggtggt gaatgtgaaa 3060 ccagtaacgt tatacgatgt cgcagagtat gccggtgtct cttatcagac cgtttcccgc 3120 gtggtgaacc aggccagcca cgtttctgcg aaaacgcggg aaaaagtgga agcggcgatg 3180 gcggagctga attacattcc caaccgcgtg gcacaacaac tggcgggcaa acagtcgttg 
3240 ctgattggcg ttgccacctc cagtctggcc ctgcacgcgc cgtcgcaaat tgtcgcggcg 3300 attaaatctc gcgccgatca actgggtgcc agcgtggtgg tgtcgatggt agaacgaagc 3360 ggcgtcgaag cctgtaaagc ggcggtgcac aatcttctcg cgcaacgcgt cagtgggctg 3420 atcattaact atccgctgga tgaccaggat gccattgctg tggaagctgc ctgcactaat 3480 gttccggcgt tatttcttga tgtctctgac cagacaccca tcaacagtat tattttctcc 3540 catgaagacg gtacgcgact gggcgtggag catctggtcg cattgggtca ccagcaaatc 3600 gcgctgttag cgggcccatt aagttctgtc tcggcgcgtc tgcgtctggc tggctggcat 3660 aaatatctca ctcgcaatca aattcagccg atagcggaac gggaaggcga ctggagtgcc 3720 atgtccggtt ttcaacaaac catgcaaatg ctgaatgagg gcatcgttcc cactgcgatg 3780 ctggttgcca acgatcagat ggcgctgggc gcaatgcgcg ccattaccga gtccgggctg 3840 cgcgttggtg cggatatctc ggtagtggga tacgacgata ccgaagacag ctcatgttat 3900 atcccgccgt caaccaccat caaacaggat tttcgcctgc tggggcaaac cagcgtggac 3960 cgcttgctgc aactctctca gggccaggcg gtgaagggca atcagctgtt gcccgtctca 4020 ctggtgaaaa gaaaaaccac cctggcgccc aatacgcaaa ccgcctctcc ccgcgcgttg 4080 gccgattcat taatgcagct ggcacgacag gtttcccgac tggaaagcgg gcagtgagcg 4140 caacgcaatt aatgtgagtt agcgcgaatt gatctg 4176 <210> 31 <211> 828 <212> DNA <213> Uncultured marine bacterium 66A03 <220> <221> misc_feature (222) (1) .. 
(828) <223> blh gene: DQ065755 <400> 31 atgggcttga tgttaattga ttggtgtgct ttagcattgg ttgtgtttat tggtttgcca 60 catggtgcct tagatgctgc tatttctttt tcaatgattt cttcagcaaa gagaattgct 120 agattagcag gaatactatt aatttacctg ttgttagcaa ccgcattttt tttaatttgg 180 tatcaattac cagcattttc tcttcttatt tttcttttga taagcataat ccattttgga 240 atggctgatt tcaatgcatc cccaagtaaa cttaagtggc ctcatattat tgcacatggc 300 ggcgttgtta ctgtttggtt gccgcttatc caaaaaaatg aagttacgaa gctattttca 360 atattaacaa atggtccaac tcccatttta tgggacatac tattgatatt ttttttatgt 420 tggagcatag gagtatgtct tcatacctat gaaactttac gttctaaaca ttataatatc 480 gcctttgaac ttattggatt aatttttcta gcctggtatg cacccccact cgttactttt 540 gccacatact tctgctttat ccacagcaga cgtcacttta gttttgtttg gaaacagtta 600 cagcatatga gttcaaaaaa aatgatgata ggtagtgcca ttattttatc ttgtacgagc 660 tggttgatag gcggaggaat atattttttc ctcaattcga aaatgattgc cagtgaagct 720 gctttacaaa ctgtctttat tggtcttgca gctttaacag ttcctcacat gatacttatc 780 gactttatat ttagaccaca ctcttccaga attaaaatca aaaattga 828 <210> 32 <211> 828 <212> DNA <213> Artificial Sequence <220> <223> E. 
coli codon optimized blh gene of uncultured marine bacterium          66A03 <400> 32 atgggtctga tgctgattga ttggtgtgca ctggctctgg ttgttttcat tggcctgccg 60 cacggcgcgc tggatgctgc catttctttt tctatgatct cttctgcaaa acgcattgct 120 cgtctggctg gtattctgct gatctatctg ctgctggcga ccgcgttctt cctgatctgg 180 tatcagctgc cagcgtttag cctgctgatc ttcctgctga tctccattat ccactttggt 240 atggcagact tcaacgcgtc cccaagcaaa ctgaaatggc cgcatatcat cgcccacggc 300 ggtgttgtta ctgtttggct gccgctgatc cagaaaaacg aagtaactaa actgtttagc 360 atcctgacta acggtccgac tccgatcctg tgggacatcc tgctgatttt cttcctgtgt 420 tggtctattg gcgtgtgtct gcacacgtac gaaaccctgc gctctaaaca ttacaacatc 480 gcctttgaac tgatcggtct gattttcctg gcgtggtatg cgccgcctct ggttacgttt 540 gccacttact tctgcttcat tcattcccgt cgccacttct cctttgtgtg gaagcagctg 600 caacacatgt cttccaaaaa gatgatgatt ggcagcgcga ttatcctgtc ctgtacctct 660 tggctgatcg gcggtggtat ctatttcttc ctgaactcca aaatgatcgc ctctgaggct 720 gcgctgcaga ctgtgttcat cggtctggcg gcactgaccg tgccgcacat gattctgatc 780 gacttcatct tccgtccgca ctcttcccgt atcaaaatca aaaactaa 828 <210> 33 <211> 1701 <212> DNA <213> Mus musculus <400> 33 atggagataa tatttggcca gaataagaaa gaacagctgg agccagttca ggccaaagtg 60 acaggcagca ttccagcatg gctgcagggg accctgctcc gaaacgggcc cgggatgcac 120 acagtgggag agagcaagta caaccattgg tttgatggcc tggcccttct ccacagtttc 180 tccatcagag atggggaggt cttctacagg agcaaatacc tgcagagtga cacctacatc 240 gccaacattg aggccaacag aatcgtggtg tctgagttcg gaaccatggc ctacccggac 300 ccctgcaaaa acatcttttc caaagctttc tcctacttgt ctcacaccat ccccgacttc 360 acagacaact gtctgatcaa catcatgaaa tgtggagaag acttctatgc aaccacggag 420 accaactaca tcaggaaaat cgacccccag accctagaga ccttggagaa ggttgattac 480 cggaagtatg tggcggtaaa cctggctacc tcgcaccctc attatgacga ggctgggaat 540 gtccttaaca tgggcacatc cgtcgtggac aaagggagga caaaatacgt gatatttaag 600 atccctgcca cagtgccaga cagcaagaag aaagggaaga gtcccgtgaa gcacgcggaa 660 gttttctgct ccatttcctc ccgctcgctg ctctctccca gctactacca cagctttggt 720 gtcacggaga actatgtggt gtttctggag cagcctttta 
agttggatat cctcaagatg 780 gccaccgcat acatgagggg agtgagctgg gcttcctgta tgtcattcga cagggaggac 840 aagacataca ttcatatcat cgaccagagg accaggaagc ctgtgcctac caagttctac 900 acagatccca tggtggtctt ccatcatgtc aatgcctacg aggaggacgg ctgtgtgctg 960 tttgatgtga tcgcctatga ggacagcagc ctctatcagc tcttctacct ggccaacctg 1020 aacaaggact tcgaggagaa gtccaggctg acctcagtgc ctaccctcag gaggtttgct 1080 gtgcccctcc atgtggacaa ggatgcagaa gtgggctcaa atttagtcaa ggtgtcatct 1140 acaactgcaa cagccctgaa ggagaaagac ggccatgtct attgccagcc cgaggtcctc 1200 tacgaaggcc tagagctccc tcggataaat tatgcttaca acgggaagcc atatcgctac 1260 atctttgcag ctgaagtaca gtggagtcca gtcccaacca agatactgaa atatgacatt 1320 ctcacaaagt cctccttaaa gtggtctgag gagagctgct ggccagcaga gcctctgttt 1380 gttcccacgc caggtgcgaa ggatgaagat gatggagtca ttttatcagc catcgtctct 1440 acggatcccc aaaagctgcc ttttttactc attctggatg ccaaaagttt tacggaactg 1500 gctcgcgcct ctgttgatgc ggacatgcac ctggaccttc atggtttatt tatcccagat 1560 gcagactgga atgcagtgaa gcagactcca gctgaaacgc aagaggttga aaactcagat 1620 catcccacag atccgacagc accagaactg agccacagtg aaaatgactt cacagcgggt 1680 catggtggct caagtcttta a 1701 <210> 34 <211> 1053 <212> DNA <213> Natronomonas pharaonis ATCC35678 <400> 34 atgagtaacg cgtcgctccg gccctccggg acggccagtg cgacactgtt ccggctggcg 60 ttcctcccgg gctgggctgt catcgcggcg acgacgggtg cgttcctcgt gggagcctca 120 ctacccctta cctaccaact catcccgctc gccgctagcg tggtcctgct cgggctccca 180 cacggcgctg tcgaccattt ggcgctcccg cggacccgaa acgagcgggt cacggttcga 240 tggcttgcgg ccatcggtgt cctctatgcc gttgtcggcg ggctctatgc ggcagtctgg 300 tttctcgcgc ccgtcggtgc cgtcgccgcg ttcattttta tgacgtgggt tcactggggc 360 caaggagaaa tctatccgct tgttgcgctc gctgacgccg accacctcga tgggcggctc 420 gaacgcggac tgacagccgc catccgcggc gcattgccga tgctcgtccc gtttgtcgcc 480 tttcccgacc agtacgagct cgtcgtgaca accctcgtcg ggctcttcga cgccgatgca 540 gcggcgacgg cggcggccgc gttcacgccg accgcacggc tcgcagtcgc ggtcaccgtc 600 ggggggttgg tggcggtcac cctcggtatt ggagctgtcg ctgccagcga gaccggctgg 660 gggccgtggc tgcttgatgc cggcgaaaca 
gggcttctga ttctgttctt tgcggcggtg 720 ccgccgattt tcgccatcgg cctctacttt tgtttctggc actcgcttcg ccacatcgtc 780 cggttgctcg ctgtcgataa ccgggcagca ccggcgctcg atggtcgccg atacggcgcg 840 gcgcttgcgc gctttgctcg ggatgcagcc ccgctgtcag cggcgtcgct cgtgttgctc 900 gggctgttgt atctagccgt gcccggcagc gtcgactcgc cgctttcgct tgtcgggacc 960 tacctcgtgt tgatagccgt gctcacgctc ccgcacgtgg tcgtcgtggc gtggatggac 1020 cacgaacagc ggctctggcg acccggagca tag 1053 <210> 35 <211> 855 <212> DNA <213> Halobacterium salinarum ATCC700922 <400> 35 atgccacacg gcgcgatcga ctacctcgcg ttgccccgcg cggtcacggg caccgtcacc 60 gtgcggtggc tggcggtcgt cggcgtcctc tacctcgtgc tcggtggtgg ctacgccgcc 120 gcgtggtttt tcgcgcccgt tcccgctgcg ttcgcgttcg tcgcgatcac gtggctgcac 180 tgggggcagg gcgacctcta cccgctgctc gacttcctcg acgtcgacta cctcgatacg 240 cgcccgcggc gcgcggcgac ggtcctgatc cggggtggcc tcccgatgct cgtgccgctg 300 ctcgggttcc cggagcggta ccgcagcgtc gtcgacgcgt tcgccgcgcc gttcggcggc 360 tccgtcggcg acctcgcggt gttcgacccg cgcgtccgcc tgtggctggg cgtcgcgttc 420 gcagccgcga ccgtcgcggt gctcgcggcg ggcagacgcc gcacccactc ccccggcgcg 480 tggcgcgtcg acgccgccga aaccctcctg ttgtgggtgt tcttcttcgt cgtgccgccg 540 gtgttcgccg tcggcgtcta cttctgcgtc tggcactcgg tccggcacgt cgcgcgcgcc 600 atcgccgtcg acggctcggt ccacccgtcg ctgcgcgcgg gcgacatcct cggaccgctg 660 gcccggttcg gcgtggaggc cgcgccgatg acggcggccg cgctcgcgtt gggcggcgtg 720 ctgtggtggg cggtacccaa cccgccgacc acgctcgaat ccggggccgc actctacctc 780 gtgttgatcg ccgtgctcac cctgccacac gtcgccgtgg tcacgtggat ggaccgcgtg 840 cagggcgtcc tctga 855 <210> 36 <211> 1080 <212> DNA <213> Halobacterium salinarum ATCC700922 <400> 36 atgagcaata ggtcgcagtt cgtcccgtcg tggctcgtgc cggaggcagc cggcgacctc 60 ccgttgaccg tgtcgcggct gtcactgctc gcgcttgccg ccgcgttcgc ggtcggatac 120 ggcgcgggct tcgcggtccc actggaggtc caggcgggcg tctacctgtt gggtatggtc 180 gcgatgaacc tcccgcacgg cggctacgag catttcgaga acctgcggcg acgggctgcc 240 tccttccagg gcaagtatat cgtcgcctac ctggtcggga tcgcggcgtt cggcgcgctg 300 tttttcgtcg cgcccgtcgc cggactgggg ctggcagtca cggtggccgt 
cgccaaaggt 360 gggttcggtg gcgtgcagtc gatggacgcc ctctacggaa ctgaccattt gcgcacgcgc 420 ccccagcggt ggctcgccgc cgtcgtccgg ggcggcgcgg tgatggtggt tcccatgttg 480 ttctggacgg acgtgttcta cgcgttcagc tcggtcatga tctcgatttt cgaccccagc 540 gccgtgtcgg cgctcggcgg tgacatcgca acccggcggc tcgtgctcgg cggcgggtac 600 ggggcgctcg tggtcgcaca cctggggctc ggctaccggc gggcggccgg caccgggtcg 660 ttcctcgccg acgccgccga gacgctgctg ttgatcgcgt acttcgcgct cgttccggtg 720 gtcatcgccg tcgggctgta cttcccgctg tggtactcgg cccgccaggt ggcccgatcg 780 tcggccgtcg acgacacggc gatgacgcag gcagacgcca ccggcatgct tgacgccctg 840 gacgccgacg acccggcgcg cgccacgctt gcctcgtggg cggtgctcat cgtcggcagc 900 gtcgccacgt tcggcctggc ggccgtgctc tggctgctgt ccccacagcc cctgggtggt 960 ggtgggatcc tcgtgggctt ggtcgcgttc tggagcatct tcgtgagcat catcgcgctc 1020 ccgcacgtcg tcgtcggcgg gtggcttgac cgcactcgcg gcatctggta cgtcccataa 1080                                                                         1080 <210> 37 <211> 44 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 37 cgagctcagg agcatttaga tgttgaaaac agtagttatt attg 44 <210> 38 <211> 26 <212> DNA <213> Artificial Sequence <220> <223> reverse primer <400> 38 gcccggggtg gcctgaaacg gctacc 26 <210> 39 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 39 gcccgggagg agttaaagaa atgacaattg 30 <210> 40 <211> 29 <212> DNA <213> Artificial Sequence <220> <223> reverse primer <400> 40 cggatccctt agtttcgata agagcgaac 29 <210> 41 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 41 cggtaccaat gacaaaaaaa gttggtgtcg g 31 <210> 42 <211> 28 <212> DNA <213> Artificial Sequence <220> <223> reverse primer <400> 42 ttctagatta cgatttgtcg tcatgtcc 28 <210> 43 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 43 ccccgggagg agagaaatta tgcaaacgga ac 32 <210> 44 <211> 27 <212> DNA <213> Artificial Sequence <220> <223> reverse primer <400> 44 tgcatgctta tttaagctgg gtaaatg 27 <210> 45 <211> 2999 <212> DNA <213> Artificial 
Sequence <220> <223> pSTV28 vector sequence <400> 45 cgtatggcaa tgaaagacgg tgagctggtg atatgggata gtgttcaccc ttgttacacc 60 gttttccatg agcaaactga aacgttttca tcgctctgga gtgaatacca cgacgatttc 120 cggcagtttc tacacatata ttcgcaagat gtggcgtgtt acggtgaaaa cctggcctat 180 ttccctaaag ggtttattga gaatatgttt ttcgtctcag ccaatccctg ggtgagtttc 240 accagttttg atttaaacgt ggccaatatg gacaacttct tcgcccccgt tttcaccatg 300 ggcaaatatt atacgcaagg cgacaaggtg ctgatgccgc tggcgattca ggttcatcat 360 gccgtctgtg atggcttcca tgtcggcaga atgcttaatg aattacaaca gtactgcgat 420 gagtggcagg gcggggcgta atttttttaa ggcagttatt ggtgccctta aacgcctggt 480 gctacgcctg aataagtgat aataagcgga tgaatggcag aaattcgaaa gcaaattcga 540 cccggtcgtc ggttcagggc agggtcgtta aatagccgct tatgtctatt gctggtttac 600 cggtttattg actaccggaa gcagtgtgac cgtgtgcttc tcaaatgcct gaggccagtt 660 tgctcaggct ctccccgtgg aggtaataat tgacgatatg atcatttatt ctgcctccca 720 gagcctgata aaaacggtta gcgcttcgtt aatacagatg taggtgttcc acagggtagc 780 cagcagcatc ctgcgatgca gatccggaac ataatggtgc agggcgcttg tttcggcgtg 840 ggtatggtgg caggccccgt ggccggggga ctgttgggcg ctgccggcac ctgtcctacg 900 agttgcatga taaagaagac agtcataagt gcggcgacga tagtcatgcc ccgcgcccac 960 cggaaggagc taccggacag cggtgcggac tgttgtaact cagaataaga aatgaggccg 1020 ctcatggcgt tccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca 1080 gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga 1140 gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt 1200 gtggaattgt gagcggataa caatttcaca caggaaacag ctatgaccat gattacgaat 1260 tcgagctcgg tacccgggga tcctctagag tcgacctgca ggcatgcaag cttggcactg 1320 gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt 1380 gcagcacatc cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct 1440 tcccaacagt tgcgcagcct gaatggcgaa tgagcttatc gatgataagc tgtcaaacat 1500 gagaattaca acttatatcg tatggggctg acttcaggtg ctacatttga agagataaat 1560 tgcactgaaa tctagaaata ttttatctga ttaataagat gatcttcttg agatcgtttt 1620 ggtctgcgcg taatctcttg ctctgaaaac gaaaaaaccg 
ccttgcaggg cggtttttcg 1680 aaggttctct gagctaccaa ctctttgaac cgaggtaact ggcttggagg agcgcagtca 1740 ccaaaacttg tcctttcagt ttagccttaa ccggcgcatg acttcaagac taactcctct 1800 aaatcaatta ccagtggctg ctgccagtgg tgcttttgca tgtctttccg ggttggactc 1860 aagacgatag ttaccggata aggcgcagcg gtcggactga acggggggtt cgtgcataca 1920 gtccagcttg gagcgaactg cctacccgga actgagtgtc aggcgtggaa tgagacaaac 1980 gcggccataa cagcggaatg acaccggtaa accgaaaggc aggaacagga gagcgcacga 2040 gggagccgcc aggggaaacg cctggtatct ttatagtcct gtcgggtttc gccaccactg 2100 atttgagcgt cagatttcgt gatgcttgtc aggggggcgg agcctatgga aaaacggctt 2160 tgccgcggcc ctctcacttc cctgttaagt atcttcctgg catcttccag gaaatctccg 2220 ccccgttcgt aagccatttc cgctcgccgc agtcgaacga ccgagcgtag cgagtcagtg 2280 agcgaggaag cggaatatat cctgtatcac atattctgct gacgcaccgg tgcagccttt 2340 tttctcctgc cacatgaagc acttcactga caccctcatc agtgccaaca tagtaagcca 2400 gtatacactc cgctagcgct gatgtccggc ggtgcttttg ccgttacgca ccaccccgtc 2460 agtagctgaa caggagggac agctgataga aacagaagcc actggagcac ctcaaaaaca 2520 ccatcataca ctaaatcagt aagttggcag catcacccga cgcactttgc gccgaataaa 2580 tacctgtgac ggaagatcac ttcgcagaat aaataaatcc tggtgtccct gttgataccg 2640 ggaagccctg ggccaacttt tggcgaaaat gagacgttga tcggcacgta agaggttcca 2700 actttcacca taatgaaata agatcactac cgggcgtatt ttttgagtta tcgagatttt 2760 caggagctaa ggaagctaaa atggagaaaa aaatcactgg atataccacc gttgatatat 2820 cccaatggca tcgtaaagaa cattttgagg catttcagtc agttgctcaa tgtacctata 2880 accagaccgt tcagctggat attacggcct ttttaaagac cgtaaagaaa aataagcaca 2940 agttttatcc ggcctttatt cacattcttg cccgcctgat gaatgctcat ccggaattt 2999 <210> 46 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 46 ccccgggagg agagaaatta tgcaaacgga ac 32 <210> 47 <211> 27 <212> DNA <213> Artificial Sequence <220> <223> reverse primer <400> 47 tgcatgctta tttaagctgg gtaaatg 27 <210> 48 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 48 ccggaattca taatgagttt tgatattgcc 30 <210> 49 <211> 37 <212> DNA <213> 
Artificial Sequence <220> <223> reverse primer <400> 49 ctcctcgagt acgtatcatt atgccagcca ggccttg 37 <210> 50 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 50 ctcatgacgg tctgcgcaaa aaaacacgtt c 31 <210> 51 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> reverse primer <400> 51 ggaattctta actgacggca gcgagttttt tg 32 <210> 52 <211> 35 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 52 cgaattcagg agcgactaca tgaaaccaac tacgg 35 <210> 53 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> reverse primer <400> 53 ggagctctta gagcgggcgc tgccagagat g 31 <210> 54 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 54 ggtcgacagg aggatcggga tgaccgcgag ac 32 <210> 55 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> reverse primer <400> 55 gctgcagtta tgcgcgggtt cctgcaaatg 30 <210> 56 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 56 gactagtgaa ttcaggaggt aataaatatg g 31 <210> 57 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> reverse primer <400> 57 cactagttag tttttgattt tg 22 <210> 58 <211> 35 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 58 gactagtagg agcggttcca tggagataat atttg 35 <210> 59 <211> 27 <212> DNA <213> Artificial Sequence <220> <223> reverse primer <400> 59 catggtggct caagtcttta actagtc 27 <210> 60 <211> 33 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 60 gactagtagg aggccgagta tgagtaacgc gtc 33 <210> 61 <211> 27 <212> DNA <213> Artificial Sequence <220> <223> reverse primer <400> 61 ctctggcgac ccggagcata actagtc 27 <210> 62 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 62 gactagtagg aggtgttcgg catgccacac gg 32 <210> 63 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> reverse primer <400> 63 gaccgcgtgc agggcgtcct ctaactagtc 30 <210> 64 <211> 34 <212> DNA <213> Artificial Sequence <220> <223> Forward primer <400> 64 gactagtagg aggtattcat 
atgagcaata ggtc 34 <210> 65 <211> 28 <212> DNA <213> Artificial Sequence <220> <223> reverse primer <400> 65 cggcatctgg tacgtcccat aactagtc 28  
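Each entry in the listing above is tagged with numeric identifiers: `<210>` for the SEQ ID number, `<211>` for the declared length, `<212>` for the molecule type, `<213>` for the organism, and `<400>` for the sequence data, with running position counts interleaved. The following is a minimal sketch, not part of the patent, of how the DNA entries could be parsed and their declared lengths checked against the residues actually present. The function names `parse_listing` and `length_mismatches` are hypothetical, and the sketch assumes the flattened single-line layout seen here.

```python
import re

# One listing entry: <210> id, <211> length, <212> type, <213> organism,
# optional <220>-series feature block, then <400> id followed by the data.
ENTRY = re.compile(
    r"<210>\s*(?P<seq_id>\d+)\s*"
    r"<211>\s*(?P<length>\d+)\s*"
    r"<212>\s*(?P<mol_type>\S+)\s*"
    r"<213>\s*(?P<organism>.*?)\s*"
    r"(?:<22\d>.*?)?"
    r"<400>\s*(?P=seq_id)\s+(?P<body>[^<]*)",
    re.DOTALL,
)


def parse_listing(text):
    """Return one dict per entry found in a flattened sequence listing."""
    entries = []
    for m in ENTRY.finditer(text):
        # Drop whitespace and the running position counts (60, 120, ...).
        residues = re.sub(r"[^A-Za-z]", "", m.group("body"))
        entries.append({
            "seq_id": int(m.group("seq_id")),
            "declared_length": int(m.group("length")),
            "mol_type": m.group("mol_type"),
            "organism": m.group("organism"),
            "sequence": residues,
        })
    return entries


def length_mismatches(entries):
    """List (seq_id, declared, actual) for DNA entries whose lengths differ.

    Protein entries use three-letter residue codes, so counting letters
    would not give their length; they are skipped here.
    """
    return [
        (e["seq_id"], e["declared_length"], len(e["sequence"]))
        for e in entries
        if e["mol_type"] == "DNA" and e["declared_length"] != len(e["sequence"])
    ]


# Two primer entries copied from the listing above (SEQ ID NOs 38 and 57).
sample = (
    "<210> 38 <211> 26 <212> DNA <213> Artificial Sequence <220> <223> reverse primer "
    "<400> 38 gcccggggtg gcctgaaacg gctacc 26 "
    "<210> 57 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> reverse primer "
    "<400> 57 cactagttag tttttgattt tg 22"
)

entries = parse_listing(sample)
print(length_mismatches(entries))
```

The data of the last entry is read up to the next `<`, so any text following the listing should be cut off before parsing.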

Claims (9)

1. A microorganism of the genus Escherichia that produces an isoprenoid, transformed with a gene encoding acetyl-CoA acetyltransferase/hydroxymethylglutaryl (HMG)-CoA reductase from Enterococcus faecalis, a gene encoding HMG-CoA synthase from Enterococcus faecalis, a gene encoding mevalonate kinase from Streptococcus pneumoniae, a gene encoding phosphomevalonate kinase from Streptococcus pneumoniae, a gene encoding mevalonate diphosphate decarboxylase from Streptococcus pneumoniae, a gene encoding isopentenyl diphosphate (IPP) isomerase from E. coli, a gene encoding geranylgeranyl pyrophosphate (GGPP) synthase from Pantoea agglomerans, a gene encoding phytoene synthase from Pantoea agglomerans, a gene encoding phytoene dehydrogenase from Pantoea agglomerans, a gene encoding lycopene-β-cyclase from Pantoea ananatis, and a gene encoding 1-deoxyxylulose-5-phosphate (DXP) synthase from E. coli.

2. The microorganism according to claim 1, further transformed with a gene encoding IPP isomerase from Haematococcus pluvialis.

3. The microorganism according to claim 1, further transformed with one or more genes selected from the group consisting of a gene encoding β-carotene monooxygenase from uncultured marine bacterium 66A03, a gene encoding β-carotene 15,15'-monooxygenase from mouse (Mus musculus), a gene encoding brp-like protein 2 from Natronomonas pharaonis ATCC35678, and a gene encoding β-carotene monooxygenase from Halobacterium salinarum ATCC700922.

4. The microorganism according to claim 1, wherein the acetyl-CoA acetyltransferase/hydroxymethylglutaryl (HMG)-CoA reductase from Enterococcus faecalis has the amino acid sequence of SEQ ID NO: 1, the HMG-CoA synthase from Enterococcus faecalis has that of SEQ ID NO: 2, the mevalonate kinase from Streptococcus pneumoniae has that of SEQ ID NO: 3, the phosphomevalonate kinase from Streptococcus pneumoniae has that of SEQ ID NO: 4, the mevalonate diphosphate decarboxylase from Streptococcus pneumoniae has that of SEQ ID NO: 5, the isopentenyl diphosphate (IPP) isomerase from E. coli has that of SEQ ID NO: 6, the geranylgeranyl pyrophosphate (GGPP) synthase from Pantoea agglomerans has that of SEQ ID NO: 7, the phytoene synthase from Pantoea agglomerans has that of SEQ ID NO: 8, the phytoene dehydrogenase from Pantoea agglomerans has that of SEQ ID NO: 9, the lycopene-β-cyclase from Pantoea ananatis has that of SEQ ID NO: 10, and the 1-deoxyxylulose-5-phosphate (DXP) synthase from E. coli has that of SEQ ID NO: 11.

5. The microorganism according to claim 2, wherein the IPP isomerase from Haematococcus pluvialis has the amino acid sequence of SEQ ID NO: 12.

6. The microorganism according to claim 3, wherein the β-carotene monooxygenase from uncultured marine bacterium 66A03 has the amino acid sequence of SEQ ID NO: 13, the β-carotene 15,15'-monooxygenase from mouse (Mus musculus) has that of SEQ ID NO: 14, the brp-like protein 2 from Natronomonas pharaonis ATCC35678 has that of SEQ ID NO: 15, and the β-carotene monooxygenase from Halobacterium salinarum ATCC700922 has that of SEQ ID NO: 16 or 17.

7. The microorganism according to claim 1, wherein the microorganism is E. coli DH5α/pTDHB/pSNA deposited under accession number KCTC 11254BP or E. coli DH5α/pTDHBSR/pSNA deposited under accession number KCTC 11255BP.

8. The microorganism according to claim 1, wherein the isoprenoid is selected from the group consisting of β-carotene, retinal, retinol, retinyl acetate, and retinoic acid.

9. A method of producing an isoprenoid from a microorganism of the genus Escherichia, comprising: culturing the microorganism according to any one of claims 1 to 8; and separating the isoprenoid from the culture.
KR1020080003912A 2008-01-14 2008-01-14 A microorganism of Escherichia genus having enhanced isoprenoid productivity and method of producing isoprenoid using the same KR100971508B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020080003912A KR100971508B1 (en) 2008-01-14 2008-01-14 A microorganism of Escherichia genus having enhanced isoprenoid productivity and method of producing isoprenoid using the same

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020080003912A KR100971508B1 (en) 2008-01-14 2008-01-14 A microorganism of Escherichia genus having enhanced isoprenoid productivity and method of producing isoprenoid using the same

Publications (2)

Publication Number Publication Date
KR20090078113A (en) 2009-07-17
KR100971508B1 (en) 2010-07-21

Family

ID=41336301

Family Applications (1)

Application Number Priority Date Filing Date Title
KR1020080003912A KR100971508B1 (en) 2008-01-14 2008-01-14 A microorganism of Escherichia genus having enhanced isoprenoid productivity and method of producing isoprenoid using the same

Country Status (1)

Country Link
KR (1) KR100971508B1 (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013019051A2 (en) * 2011-07-29 2013-02-07 경상대학교산학협력단 Method for producing retinoid from microorganism
WO2013019053A2 (en) * 2011-07-29 2013-02-07 경상대학교산학협력단 Escherichia genus microorganism lacking eute gene or puuc gene and method for producing retinoid using same
WO2013019052A2 (en) * 2011-07-29 2013-02-07 경상대학교산학협력단 Escherichia genus microorganism lacking or having amplified ybbo gene and method for producing retinoid using same
WO2013019051A3 (en) * 2011-07-29 2013-06-20 경상대학교산학협력단 Method for producing retinoid from microorganism
WO2013019052A3 (en) * 2011-07-29 2013-07-04 경상대학교산학협력단 Escherichia genus microorganism lacking or having amplified ybbo gene and method for producing retinoid using same
WO2013019053A3 (en) * 2011-07-29 2013-07-04 경상대학교산학협력단 Escherichia genus microorganism lacking eute gene or puuc gene and method for producing retinoid using same
KR101359484B1 (en) * 2011-07-29 2014-02-13 경상대학교산학협력단 Escherichia genus microorganism having deleted EutE or PuuC gene and Method of producing retinoids using the same
KR101392159B1 (en) * 2011-07-29 2014-05-12 경상대학교산학협력단 Method of producing retinoids from a microorganism
KR101440922B1 (en) * 2011-07-29 2014-09-18 경상대학교산학협력단 Escherichia genus microorganism having deleted or amplified YbbO gene and Method of producing retinoids using the same
US20160362709A1 (en) * 2011-07-29 2016-12-15 Industry-Academic Cooperation Foundation Gyeongsang National University Method for producing retinoid from microorganism
US9644217B2 (en) 2011-07-29 2017-05-09 Industry-Academic Cooperation Foundation Gyeongsang National University Method for producing retinoid from microorganism
US9834794B2 (en) * 2011-07-29 2017-12-05 Industry-Academic Cooperation Foundation Gyeongsang National University Method for producing retinoid from microorganism
WO2018208116A1 (en) * 2017-05-11 2018-11-15 경상대학교산학협력단 Marker composition for selecting living modified organism, living modified organism, and transformation method
US11549117B2 (en) 2017-05-11 2023-01-10 Industry-Academic Cooperation Foundation Gyeongsang National University Marker composition for selecting living modified organism, living modified organism, and transformation method

Also Published As

Publication number Publication date
KR100971508B1 (en) 2010-07-21

Similar Documents

Publication Publication Date Title
KR20130014445A (en) Method of producing retinoids from a microorganism
AU2021204038B2 (en) Mutant microorganisms resistant to lactose killing
AU2012351501B2 (en) Mutant microorganisms to synthesize colanic acid, mannosylated and/or fucosylated oligosaccharides
KR101106253B1 (en) A Echerichia coli comprising a polynucleotide encoding psicose 3-epimerase and method of producing psicose using the same
KR100971508B1 (en) A microorganism of Escherichia genus having enhanced isoprenoid productivity and method of producing isoprenoid using the same
CN111235080B (en) Gene recombination escherichia coli and production method of 5-hydroxytryptamine
CN106867952B (en) One plant of Recombinant organism and the method for producing L-threonine using it
CN111154707B (en) Method for producing genetically engineered escherichia coli and melatonin
KR101443052B1 (en) Modified chondroitin synthase polypeptide and crystal thereof
KR102636404B1 (en) Preparing method for terpene alcohol or its derivatives
KR102636395B1 (en) Preparing method for terpene alcohol or its derivatives
CN104278031A (en) Promoter A regulated by xanthine as well as recombinant expression vector and application of promoter A
CN111394383B (en) Polycoccaceae gene engineering bacteria for biosynthesizing caryophyllene and construction method and application thereof
KR20120088062A (en) A microorganism of Escherichia genus having enhanced farnesol productivity and method of producing farnesol using the same
CN106479928B (en) The indigenous plasmid of one plant of resistance to resistance to high COD salt water meningitidis strains and the source bacterial strain with high salt
KR101226644B1 (en) Transformed Escherichia coli for Over-expression of Fatty Acid Biosynthesis Pathway and Method of Preparing the Same
KR100918121B1 (en) E. coli strain for increasing acetyl-CoA consumption and method of producing vanillin using the strain and adsorbent resin
CN111909914B (en) High PAM compatibility truncated variant txCas9 of endonuclease SpCas9 and application thereof
CN112553237A (en) Novel mariner transposon system, application and construction of bacillus subtilis insertion mutant library
CN110272881B (en) Endonuclease SpCas9 high specificity truncated variant TSpCas9-V1/V2 and application thereof
KR101246910B1 (en) Transformed Escherichia coli for Over-expression of Fatty Acid Biosynthesis Pathway Using Malonyl-CoA and Malonyl-[acp] and Method of Preparing the Same
KR20110107209A (en) Method for preparing transformed e coli for over-expression of fatty acid biosynthesis pathway and transformed e coli prepared by the same
CN111254104B (en) Preparation method of genetically engineered escherichia coli and indole-3-acetic acid
CN111254105B (en) Genetically engineered escherichia coli, preparation method thereof and production method of indole-3-acetic acid
CN113755412B (en) Genetically engineered bacterium for producing MK-7, method and application

Legal Events

Date Code Title Description
A201 Request for examination
E902 Notification of reason for refusal
E701 Decision to grant or registration of patent right
GRNT Written decision to grant
FPAY Annual fee payment (Payment date: 20130620; Year of fee payment: 4)
FPAY Annual fee payment (Payment date: 20140703; Year of fee payment: 5)
FPAY Annual fee payment (Payment date: 20160711; Year of fee payment: 7)
FPAY Annual fee payment (Payment date: 20170703; Year of fee payment: 8)
FPAY Annual fee payment (Payment date: 20180702; Year of fee payment: 9)
FPAY Annual fee payment (Payment date: 20190627; Year of fee payment: 10)