JP2001346585A - Photochemical protein capable of using chlorophyl d photoreceptive coloring matter and gene encoding the same - Google Patents

Photochemical protein capable of using chlorophyl d photoreceptive coloring matter and gene encoding the same

Info

Publication number
JP2001346585A
JP2001346585A JP2000170696A JP2000170696A JP2001346585A JP 2001346585 A JP2001346585 A JP 2001346585A JP 2000170696 A JP2000170696 A JP 2000170696A JP 2000170696 A JP2000170696 A JP 2000170696A JP 2001346585 A JP2001346585 A JP 2001346585A
Authority
JP
Japan
Prior art keywords
gly
leu
ala
phe
ile
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2000170696A
Other languages
Japanese (ja)
Inventor
Hideaki Miyashita
英明 宮下
Takayuki Sasaki
孝行 佐々木
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Marine Biotechnology Institute Co Ltd
Original Assignee
Marine Biotechnology Institute Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Marine Biotechnology Institute Co Ltd filed Critical Marine Biotechnology Institute Co Ltd
Priority to JP2000170696A priority Critical patent/JP2001346585A/en
Publication of JP2001346585A publication Critical patent/JP2001346585A/en
Pending legal-status Critical Current

Links

Landscapes

  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Peptides Or Proteins (AREA)

Abstract

PROBLEM TO BE SOLVED: To obtain a new photochemical protein and to make use of a photochemical function using a wavelength region which cannot be utilized. SOLUTION: This gene encoding a photochemical protein uses chlorophyl (d) as a photoreceptive coloring matter.

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【0001】[0001]

【発明の属する技術分野】本発明は、クロロフィルdを
光受容色素として用いることのできる光化学系タンパク
質及びそれをコードする遺伝子に関する。また、本発明
は、前記遺伝子をアカリオクロリス(Acaryochloris
属の原核藻類を用いて生産する方法に関する。
TECHNICAL FIELD The present invention relates to a photochemical protein capable of using chlorophyll d as a photoreceptor dye, and a gene encoding the same. In addition, the present invention relates to the above-mentioned gene, wherein Acaryochloris is used .
The present invention relates to a method for producing using a genus Prokaryote.

【0002】[0002]

【従来の技術】植物や光合成細菌の生産する光化学系タ
ンパク質は、光エネルギーを電子の流れに変換する太陽
電池の役割をする。このため、これらのタンパク質を用
いて電流を効率よく取り出すことが試みられている。ま
た、これらのタンパク質は、光によるスイッチング素子
など電子デバイスの開発原料としても期待されている。
2. Description of the Related Art Photochemical proteins produced by plants and photosynthetic bacteria play the role of solar cells that convert light energy into electron flow. For this reason, attempts have been made to efficiently extract current using these proteins. These proteins are also expected to be used as raw materials for developing electronic devices such as switching elements using light.

【0003】これらのタンパク質を上述の用途に効率的
に利用していくためには、そのタンパク質をコードする
遺伝子を単離し、塩基配列を決定することが必要であ
る。現在までのところ、例えば、以下のような原核藻類
の光化学系タンパク質に関する報告がある。
In order to efficiently utilize these proteins for the above-mentioned uses, it is necessary to isolate a gene encoding the protein and determine the nucleotide sequence. So far, for example, there are reports on the following photosystem proteins of prokaryotes.

【0004】・Anabaena variabilis ATCC 29413 Nyhus,K.J., Sonoike,K. and Pakrasi,H.B. Nucleotide sequences of the psaA and psaB genes en
coding the reaction center proteins of Photosystem
I in Anabaena variabilis ATCC 29413 Biochim. Biophys. Acta 1185, 247-251 (1994)
Anabaena variabilis ATCC 29413 Nyhus, KJ, Sonoike, K. and Pakrasi, HB Nucleotide sequences of the psaA and psaB genes en
coding the reaction center proteins of Photosystem
I in Anabaena variabilis ATCC 29413 Biochim. Biophys. Acta 1185, 247-251 (1994)

【0005】・Synechocystis sp. PCC6803 Kaneko,T., Sato,S., Kotani,H., Tanaka,A., Asamizu,
E., Nakamura,Y., Miyajima,N., Hirosawa,M., Sugiur
a,M., Sasamoto,S., Kimura,T., Hosouchi,T., Matsun
o,A., Muraki,A., Nakazaki,N., Naruo,K., Okumura,
S., Shimpo,S., Takeuchi,C., Wada,T., Watanabe,A.,
Yamada,M., Yasuda,M. and Tabata,S. Sequence analysis of the genome of the unicellular
cyanobacterium Synechocystis sp. strain PCC6803.
II. Sequence determination of the entire genome an
d assignment of potential protein-coding regions DNA Res. 3 (3), 109-136 (1996)
Synechocystis sp.PCC6803 Kaneko, T., Sato, S., Kotani, H., Tanaka, A., Asamizu,
E., Nakamura, Y., Miyajima, N., Hirosawa, M., Sugiur
a, M., Sasamoto, S., Kimura, T., Hosouchi, T., Matsun
o, A., Muraki, A., Nakazaki, N., Naruo, K., Okumura,
S., Shimpo, S., Takeuchi, C., Wada, T., Watanabe, A.,
Yamada, M., Yasuda, M. and Tabata, S. Sequence analysis of the genome of the unicellular
cyanobacterium Synechocystis sp.strain PCC6803.
II. Sequence determination of the entire genome an
d assignment of potential protein-coding regions DNA Res. 3 (3), 109-136 (1996)

【0006】・Synechococcus sp. PCC 7002 Cantrell,A. and Bryant,D.A. Molecular cloning and nucleotide sequence of the p
saA and psaB genes ofthe cyanobacterium Synechococ
cus sp. PCC 7002 Plant Mol.Biol. 9 (5), 453-468 (1987)
Synechococcus sp.PCC 7002 Cantrell, A. and Bryant, DA Molecular cloning and nucleotide sequence of the p
saA and psaB genes of the cyanobacterium Synechococ
cus sp.PCC 7002 Plant Mol. Biol. 9 (5), 453-468 (1987)

【0007】・Synechococcus vulcanus Shimizu,T., Hiyama,T., Ikeuchi,M. and Inoue,Y. Nucleotide sequences of the psaA and psaB genes en
coding the photosystemI core proteins from the the
rmophilic cyanobacterium Synechococcus vulcanus Plant Mol. Biol. 18 (4), 785-791 (1992)
Synechococcus vulcanus Shimizu, T., Hiyama, T., Ikeuchi, M. and Inoue, Y. Nucleotide sequences of the psaA and psaB genes en
coding the photosystemI core proteins from the the
rmophilic cyanobacterium Synechococcus vulcanus Plant Mol. Biol. 18 (4), 785-791 (1992)

【0008】・Synechococcus elongatus naegeli Muhlenhoff,U., Haehnel,W., Witt,H. and Herrmann,R.
G. Genes encoding eleven subunits of photosystem I fr
om the thermophilic cyanobacterium Synechococcus s
p Gene 127 (1), 71-78 (1993)
Synechococcus elongatus naegeli Muhlenhoff, U., Haehnel, W., Witt, H. and Herrmann, R.
G. Genes encoding eleven subunits of photosystem I fr
om the thermophilic cyanobacterium Synechococcus s
p Gene 127 (1), 71-78 (1993)

【0009】・Prochlorococcus sp. CCMP1378. "Prochlorococcus marinus CCMP1375" Synechococcus sp. WH7803 van der Staay,G.W.M., Moon-van der Staay,S.Y., Par
tensky,F. and Garczarek,L. Rapid evolutionary divergence of photosystem I cor
e subunits PsaA and PsaB in marine prokaryotes Unpublished (Entrez : Search WWW Entrez at NCBI)
Prochlorococcus sp. CCMP1378. "Prochlorococcus marinus CCMP1375" Synechococcus sp. WH7803 van der Staay, GWM, Moon-van der Staay, SY, Par
tensky, F. and Garczarek, L. Rapid evolutionary divergence of photosystem I cor
e subunits PsaA and PsaB in marine prokaryotes Unpublished (Entrez: Search WWW Entrez at NCBI)

【0010】[0010]

【発明が解決しようとする課題】光合成の初期過程は、
光エネルギーを化学エネルギーに変換する過程と考える
ことができる。生物がどのような波長の光を利用して光
合成を行うかは、主としてその生物のもつ受容色素の種
類による。例えば、バクテリオクロロフィルa,b,
c,d,e,f,gとカロテノイド色素を光受容色素と
してもつ非酸素発生型光合成細菌は、可視光や1000nm付
近までの赤外線を利用できる。しかし、最終的に電荷分
離するクロロフィルに集約される光のエネルギー順位が
低いため水の分解をともなう酸素発生型の光合成ができ
ない。このため適当な電子供与体(硫黄、硫化水素、有
機酸など)を必要とする。クロロフィルa,b,cとカ
ロテノイド色素を光受容色素として利用する藻類および
高等植物は、主に可視光の光を利用して光合成を行って
いる。これらの生物の場合、電荷分離するクロロフィル
aに集められる680nm程度の光は、水を分解できる。こ
のため、酸素発生型の光合成にもちいることのできるク
ロロフィルは、クロロフィルaにエネルギー伝達可能な
クロロフィルaより短波長側にQy吸収帯をもつものでな
ければならないと考えられていた。同時に、酸素発生型
の光合成に700nm以上の光エネルギーを用いることので
きる生物は知られていなかった。
The initial stage of photosynthesis is:
It can be considered as a process of converting light energy into chemical energy. The wavelength of light used by an organism to perform photosynthesis mainly depends on the type of accepting dye that the organism has. For example, bacteriochlorophyll a, b,
A non-oxygen generating photosynthetic bacterium having c, d, e, f, g and a carotenoid dye as photoreceptive dyes can use visible light or infrared light up to around 1000 nm. However, oxygen generation type photosynthesis involving decomposition of water cannot be performed due to the low energy order of light concentrated on chlorophyll that eventually separates charges. This requires a suitable electron donor (sulfur, hydrogen sulfide, organic acid, etc.). Algae and higher plants that use chlorophyll a, b, and c and carotenoid pigments as photoreceptive pigments mainly perform photosynthesis using visible light. In the case of these organisms, light of about 680 nm, which is collected by chlorophyll a that separates charges, can decompose water. For this reason, it has been considered that chlorophyll that can be used for oxygen-evolving photosynthesis must have a Qy absorption band on the shorter wavelength side than chlorophyll a that can transfer energy to chlorophyll a. At the same time, no organism has been known that can use light energy of 700 nm or more for oxygen-evolving photosynthesis.

【0011】最近、本発明者らによってアカリオクロリ
ス・マリナ(Acaryochloris marina)という新種の原核藻
類が見つけられた。この藻類は、クロロフィルdを光受
容色素として利用している。他の酸素発生型光合成生物
とはことなり、可視光に加え700-750nm付近の近赤外光
を吸収して、酸素発生型の光合成を行う機能を有してい
る。この微細藻の光化学系タンパク質をコードする遺伝
子を単離できれば、これまで利用することができなかっ
た波長領域を利用した光合成機能活用が可能になる。し
かしながら、現在に至るまでこの遺伝子の単離に成功し
た例は皆無である。本発明は、以上のような有用性の高
いアカリオクロリス・マリナ由来の光化学系タンパク質
をコードする遺伝子を提供することを目的とするもので
ある。
Recently, a new species of prokaryote, Acaryochloris marina, has been discovered by the present inventors. This algae utilizes chlorophyll d as a photoreceptor pigment. Unlike other oxygen-generating photosynthetic organisms, it has the function of absorbing oxygen and generating oxygen-generating photosynthesis by absorbing near-infrared light near 700-750 nm in addition to visible light. If the gene encoding the photochemical protein of the microalga can be isolated, the photosynthetic function utilizing the wavelength region that could not be used until now can be used. However, to date, there have been no examples of successful isolation of this gene. An object of the present invention is to provide a gene encoding a photosystem protein derived from Acariochloris marina having high utility as described above.

【0012】[0012]

【課題を解決するための手段】本発明者らは、クロロフ
ィルdを光受容色素として用いているアカリオクロリス
・マリナに含まれる光化学系タンパク質の遺伝子を得る
ために、これまでに蓄積された光化学系タンパク質のア
ミノ酸配列を比較検討することにより、光化学系タンパ
ク質を増幅できるPCRプライマーを設計し、このプライ
マーを用いてアカリオクロリス・マリナの光化学系タン
パク質の遺伝子配列を決定した。本発明は、これらの知
見に基づき完成されたものである。
Means for Solving the Problems The inventors of the present invention have developed a photochemical protein gene contained in Acaryochloris marina using chlorophyll d as a photoreceptive pigment, and have obtained photochemical proteins accumulated so far. By comparing and examining the amino acid sequences of the system proteins, PCR primers capable of amplifying the photosystem proteins were designed, and the gene sequences of the photosystem proteins of Acaryochloris marina were determined using the primers. The present invention has been completed based on these findings.

【0013】即ち、本発明は、クロロフィルdを光受容
色素として用いる光化学系タンパク質、及びそれをコー
ドする遺伝子に関する。また、本発明は、アカリオクロ
リス属の原核藻類を用いて上記遺伝子を生産する方法に
関する。
That is, the present invention relates to a photochemical protein using chlorophyll d as a photoreceptive dye, and a gene encoding the same. The present invention also relates to a method for producing the above gene using a prokaryotic algae of the genus Acariochloris.

【0014】[0014]

【発明の実施の形態】以下、本発明を詳細に説明する。
本発明のタンパク質及び遺伝子は、以下の第一から第六
のタンパク質及び遺伝子を含む。第一のタンパク質は、
配列番号2記載のアミノ酸配列により表されるタンパク
及び配列番号2記載のアミノ酸配列において1もしくは
複数個のアミノ酸が欠失、置換、若しくは付加されたア
ミノ酸配列により表され、かつ光化学系1反応中心タン
パク質サブユニットPsaAとしての機能を有するタンパク
質を包含するものである。第一の遺伝子は、この第一の
タンパク質をコードするものである。
BEST MODE FOR CARRYING OUT THE INVENTION Hereinafter, the present invention will be described in detail.
The proteins and genes of the present invention include the following first to sixth proteins and genes. The first protein is
A protein represented by the amino acid sequence of SEQ ID NO: 2 and an amino acid sequence of SEQ ID NO: 2 in which one or more amino acids are deleted, substituted, or added; It includes proteins having a function as the subunit PsaA. The first gene encodes the first protein.

【0015】第二のタンパク質は、配列番号4記載のア
ミノ酸配列により表されるタンパク質及び配列番号4記
載のアミノ酸配列において1もしくは複数個のアミノ酸
が欠失、置換、若しくは付加されたアミノ酸配列により
表され、かつ光化学系1反応中心タンパク質サブユニッ
トPsaBとしての機能を有するタンパク質を包含するもの
である。第二の遺伝子は、この第二のタンパク質をコー
ドするものである。
The second protein is represented by the amino acid sequence of SEQ ID NO: 4 and the amino acid sequence of SEQ ID NO: 4 in which one or more amino acids have been deleted, substituted or added. And a protein having a function as photosystem 1 reaction center protein subunit PsaB. The second gene encodes this second protein.

【0016】第三のタンパク質は、配列番号6記載のア
ミノ酸配列により表されるタンパク質及び配列番号6記
載のアミノ酸配列において1もしくは複数個のアミノ酸
が欠失、置換、若しくは付加されたアミノ酸配列により
表され、かつ光化学系2反応中心タンパク質サブユニッ
トPsbAとしての機能を有するタンパク質を包含するもの
である。第三の遺伝子は、この第三のタンパク質をコー
ドするものである。
The third protein is represented by the protein represented by the amino acid sequence of SEQ ID NO: 6 and the amino acid sequence of the amino acid sequence of SEQ ID NO: 6 in which one or more amino acids have been deleted, substituted or added. And a protein having a function as the photosystem 2 reaction center protein subunit PsbA. The third gene encodes this third protein.

【0017】第四のタンパク質は、配列番号8記載のア
ミノ酸配列を含むタンパク質及び配列番号8記載のアミ
ノ酸配列において1もしくは複数個のアミノ酸が欠失、
置換、若しくは付加されたアミノ酸配列を含み、かつ光
化学系2反応中心タンパク質サブユニットPsbDとしての
機能を有するタンパク質を包含するものである。第四の
遺伝子は、この第四のタンパク質をコードするものであ
る。
The fourth protein comprises a protein comprising the amino acid sequence of SEQ ID NO: 8 and one or more amino acids in the amino acid sequence of SEQ ID NO: 8,
It includes a protein containing a substituted or added amino acid sequence and having a function as a photosystem 2 reaction center protein subunit PsbD. The fourth gene encodes the fourth protein.

【0018】第五のタンパク質は、配列番号10記載の
アミノ酸配列を含むタンパク質及び配列番号10記載の
アミノ酸配列において1もしくは複数個のアミノ酸が欠
失、置換、若しくは付加されたアミノ酸配列を含み、か
つ光化学系2コアアンテナタンパク質Psb Bとしての機
能を有するタンパク質を包含するものである。第五の遺
伝子は、この第五のタンパク質をコードするものであ
る。
The fifth protein includes a protein comprising the amino acid sequence of SEQ ID NO: 10 and an amino acid sequence in which one or more amino acids have been deleted, substituted or added in the amino acid sequence of SEQ ID NO: 10, and This includes proteins having a function as the photochemical two-core antenna protein PsbB. The fifth gene encodes the fifth protein.

【0019】第六のタンパク質は、配列番号12記載の
アミノ酸配列を含むタンパク質及び配列番号12記載の
アミノ酸配列において1もしくは複数個のアミノ酸が欠
失、置換、若しくは付加されたアミノ酸配列を含み、か
つ光化学系2コアアンテナタンパク質Psb Cとしての機
能を有するタンパク質を包含するものである。第六の遺
伝子は、この第六のタンパク質をコードするものであ
る。ここで欠失、置換、もしくは付加は、本願の出願時
において常用される技術、例えば、部位特異的変異誘発
法(Nucleic Acid Res. 10 6487-65000 1982)により生じ
させることができる。
The sixth protein comprises a protein comprising the amino acid sequence of SEQ ID NO: 12 and an amino acid sequence wherein one or more amino acids are deleted, substituted or added in the amino acid sequence of SEQ ID NO: 12, and It includes a protein having a function as a photochemical two-core antenna protein PsbC. The sixth gene encodes the sixth protein. Here, the deletion, substitution, or addition can be generated by a technique commonly used at the time of filing the present application, for example, a site-directed mutagenesis method (Nucleic Acid Res. 10 6487-65000 1982).

【0020】第一から第六のタンパク質は、以下の機能
を持つ。第一のタンパク質と第二のタンパク質は会合
し、光化学系1反応中心タンパク質を形成する。光化学
系1反応中心タンパク質は、光エネルギーを酸化還元力
にかえ光化学系2から供給された電子を使ってNADP+
還元する。
The first to sixth proteins have the following functions. The first protein and the second protein associate to form a photosystem 1 reaction center protein. The photosystem 1 reaction center protein converts light energy into redox power and reduces NADP + using the electrons supplied from photosystem 2.

【0021】第三のタンパク質と第四のタンパク質は会
合し、光化学系2反応中心タンパク質を形成する。光化
学系2反応中心タンパク質は、光エネルギーを酸化還元
力にかえ、水を分解し、電子を奪ってそれを光化学系1
に送りこむ働きをする。
The third protein and the fourth protein associate to form a photosystem 2 reaction center protein. Photosystem 2 reaction center protein converts light energy into redox power, decomposes water, deprives electrons, and converts it to photosystem 1
It works to send to.

【0022】第五のタンパク質と第六のタンパク質は会
合し、光化学系2コアアンテナタンパク質を形成する。
光化学系2コアアンテナタンパク質は、水を分解するた
めの光エネルギを捕獲するためのアンテナで、吸収され
たエネルギーは反応中心タンパク質に伝達される。
The fifth protein and the sixth protein associate to form a photochemical two-core antenna protein.
The photochemical two-core antenna protein is an antenna for capturing light energy for decomposing water, and the absorbed energy is transmitted to the reaction center protein.

【0023】本発明の遺伝子は、例えば以下の手順で得
ることができる。まず、アカリオクロリス属に属する藻
類、例えば、アカリオクロリス・マリナMBIC11017株の
全ゲノム遺伝子を、定法(たとえば、ファルマシア社製
の遺伝子抽出キットGenomicPrepTMを用いて)によって
得る。なお、アカリオクロリス・マリナMBIC11017株
は、工業技術院生命工学工業技術研究所に受託番号 FER
M P-17860として寄託されている(寄託日:平成12年5月
18日)。
The gene of the present invention can be obtained, for example, by the following procedure. First, the whole genomic gene of algae belonging to the genus Acariochloris, for example, Acariochloris marina MBIC11017 strain, is obtained by a conventional method (for example, using a gene extraction kit GenomicPrep manufactured by Pharmacia). The Akario Chloris Marina MBIC11017 strain was licensed to the National Institute of Advanced Industrial Science and Technology by the accession number FER.
Deposited as M P-17860 (Deposit date: May 2000
18th).

【0024】次に、これを鋳型として特定のプライマー
を用いてPCRを行うことによって、本発明の遺伝子を得
ることができる。即ち、第一の遺伝子及び第二の遺伝子
を得るためには、配列番号13で表されるプライマーと配
列番号14で表されるプライマーを使用し、第三の遺伝子
を得るためには、配列番号15で表されるプライマーと配
列番号16で表されるプライマーを使用し、第四の遺伝子
及び第六の遺伝子を得るためには、配列番号17で表され
るプライマーと配列番号18で表されるプライマーを使用
し、第四の遺伝子を得るためには、配列番号19で表され
るプライマーと配列番号20で表されるプライマーを使用
する。
Next, the gene of the present invention can be obtained by performing PCR using this as a template and specific primers. That is, to obtain the first gene and the second gene, using the primer represented by SEQ ID NO: 13 and the primer represented by SEQ ID NO: 14, to obtain the third gene, SEQ ID NO: Using the primer represented by SEQ ID NO: 15 and the primer represented by SEQ ID NO: 16, in order to obtain a fourth gene and the sixth gene, the primer represented by SEQ ID NO: 17 and represented by SEQ ID NO: 18 In order to obtain a fourth gene using a primer, a primer represented by SEQ ID NO: 19 and a primer represented by SEQ ID NO: 20 are used.

【0025】[0025]

【実施例】〔実施例1〕アカリオクロリス・マリナMBIC
11017株(以下、単に「MBIC11017株」という)の全ゲノ
ム遺伝子を、ファルマシア社製の遺伝子抽出キットGeno
micPrepTMを用いて抽出した。これを鋳型にして、CCACC
ACHTGGATTTGGAAYCTとGCNACNGGYTTRTCYTTCCAという塩基
配列で表される2種類のプライマーを用いて、PCRを行
った。PCRによる断片の増幅には、宝酒造製のTaKaRa Ta
qTM を用い、PCR条件は、付属の手順書に従った。具体
的な条件は、下記の通りである。 鋳型となる全ゲノム遺伝子量 1 μg ポリメラーゼ 0.5 μl プライマー量 各10 ng 10×PCR buffe 10 μl dNTP mixture 8 μl 全反応液量 100 μl 温度サイクル 94℃ 1分、55℃ 1分、72℃ 2
分。 サイクル数 30 サイクル
[Example] [Example 1] Akariochloris marina MBIC
The whole genome gene of 11017 strain (hereinafter simply referred to as “MBIC11017 strain”) was converted into a gene extraction kit Geno
Extracted using micPrep . Using this as a template, CCACC
PCR was performed using two types of primers represented by base sequences of ACHTGGATTTGGAAYCT and GCNACNGGYTTRTCYTTCCA. For amplification of fragments by PCR, use TaKaRa Ta
q The PCR conditions were in accordance with the attached procedure using TM . Specific conditions are as follows. Amount of whole genome gene as template 1 μg Polymerase 0.5 μl Primer amount 10 ng each 10 × PCR buffe 10 μl dNTP mixture 8 μl Total reaction volume 100 μl Temperature cycle 94 ° C 1 minute, 55 ° C 1 minute, 72 ° C 2
Minutes. Number of cycles 30 cycles

【0026】なお、上記2種のプライマー配列は、Sear
ch WWW Entrez at NCBIによって検索/引用した、シネ
ココックス・ヴルカヌス(Synechococcus vulcanus
(ACCESSION D10986 D01126), シネココックス・エロン
ガツス・ナエゲリ(Synechococcus elongatus naegel
i) (ACCESSION X63768), シネココックス(Synechoco
ccus)sp. PCC7002 (ACCESSION M18165), シネコシス
ティス(Synechocystis)sp. PCC6803 (ACCESSION D90
906 AB001339), アナベナ・ヴァリアビリス(Anabaena
variabilis) ATCC 29413 (ACCESSION L26326)のPsaA
およびPsaBの配列をそれぞれ比較し共通のアミノ酸配列
TTTWIWNおよびWKDKPVAからそれぞれ設計されたものであ
る。PCRの結果、約4.2kbpの遺伝子断片が増幅された。
この断片の配列を決定した。
[0026] The above two primer sequences are used for Sear.
ch WWW Entrez at Synechococcus vulcanus , searched / cited by NCBI
(ACCESSION D10986 D01126), Synechococcus elongatus naegel
i) (ACCESSION X63768), Synechocox
ccus) sp. PCC7002 (ACCESSION M18165 ), Synechocystis (Synechocystis) sp. PCC6803 (ACCESSION D90
906 AB001339), Anabaena-Variabirisu (Anabaena
variabilis ) PsaA of ATCC 29413 (ACCESSION L26326)
Compare the sequence of PsaB and PsaB, and share the common amino acid sequence
Designed from TTTWIWN and WKDKPVA respectively. As a result of the PCR, a gene fragment of about 4.2 kbp was amplified.
The sequence of this fragment was determined.

【0027】アミノ酸配列の両端を決定するためにイン
バースPCRを行った。主たる手順は、小笠原直毅の方法
(真木寿治監修、PCR Tips、1997、秀潤社、p454-48)
に従った。具体的には、先に抽出したMBIC11017株の全
ゲノム遺伝子約6μgずつを制限酵素EcoRI、HincIII、Ps
tIでそれぞれ消化し、精製した。各消化断片1μgをそ
れぞれT4 DNA Ligase (ニッポンジーン社製)を用いて全
反応液量500μl中で13℃、20時間反応させ、精製した。
Inverse PCR was performed to determine both ends of the amino acid sequence. The main procedure is the method of Naoki Ogasawara (supervised by Toshiharu Maki, PCR Tips, 1997, Shujunsha, p454-48)
Followed. Specifically, about 6 μg of the whole genome gene of the previously extracted MBIC11017 strain was replaced with restriction enzymes EcoRI, HincIII, Ps
Each was digested with tI and purified. 1 μg of each digested fragment was reacted with T4 DNA Ligase (Nippon Gene) in a total reaction volume of 500 μl at 13 ° C. for 20 hours, and purified.

【0028】EcoRI、PstI消化物のLigation精製物を鋳
型にして、TGGAACCACTCCAATTTAGGTGCとGGTCGTGGCTATTGG
CAAという塩基配列で表される2種類のプライマーを用
いて、PCRを行った。PCRによる断片の増幅には、宝酒造
製のTaKaRa TaqTM を用い、PCR条件は、付属の手順書に
従った。アニーリング温度は、55℃とした。その結果得
られた断片の配列決定をおこないアミノ酸コード領域を
特定して、一端の配列を決定した。
Using ligation purified products of EcoRI and PstI digests as templates, TGGAACCACTCCAATTTAGGTGC and GGTCGTGGCTATTGG
PCR was performed using two types of primers represented by the base sequence CAA. For amplification of the fragment by PCR, TaKaRa Taq manufactured by Takara Shuzo was used, and the PCR conditions were in accordance with the attached protocol. The annealing temperature was 55 ° C. The resulting fragment was sequenced to identify the amino acid coding region, and the sequence at one end was determined.

【0029】さらに、HincIII消化物のLigation精製物
を鋳型にして、GGTGGGAGATAATCGCCTCとCCTGGTGACTTCTTG
GTTCAという塩基配列で表される2種類のプライマーを
用いて、PCRを行った。PCRによる断片の増幅には、宝酒
造製のTaKaRa TaqTM を用い、PCR条件は、付属の手順書
に従った。アニーリング温度は、55℃とした。その結果
得られた断片の配列決定をおこないアミノ酸コード領域
を特定して、さらに一端の配列を決定した。
Further, GGTGGGAGATAATCGCCTC and CCTGGGTGACTTCTTG were used as a template with the ligation purified product of the HincIII digest.
PCR was performed using two types of primers represented by the base sequence GTTCA. For amplification of the fragment by PCR, TaKaRa Taq manufactured by Takara Shuzo was used, and the PCR conditions were in accordance with the attached protocol. The annealing temperature was 55 ° C. The resulting fragment was sequenced, the amino acid coding region was identified, and the sequence at one end was determined.

【0030】決定された全配列には、22塩基の非翻訳領
域をはさんで、上流に753アミノ酸と下流に736アミノ酸
をそれぞれコードすると思われる2つのORFが含まれ
ていた。2つのORFの塩基配列を配列番号1および配
列番号3にそれぞれ示す。また。これらの配列から推定
されるアミノ酸配列を配列番号2および配列番号4に示
す。
The entire sequence determined contained two ORFs which seem to encode 753 amino acids upstream and 736 amino acids downstream, respectively, with a 22 base untranslated region interposed. The nucleotide sequences of the two ORFs are shown in SEQ ID NO: 1 and SEQ ID NO: 3, respectively. Also. The amino acid sequences deduced from these sequences are shown in SEQ ID NO: 2 and SEQ ID NO: 4.

【0031】推定アミノ酸配列のホモロジーをデータベ
ースNCBI BLAST(Stephen F. Altschulら、1997、Nuclei
c Acids Res. 25:3389-3402)に基づいて検索を行った結
果、配列番号2のアミノ酸配列は、シネココックス・エ
ロンガツス・ナエゲリ、シネココックス・ヴルカヌス、
フィッシェレラ(Fischerella) PCC7605のPsaAとそれ
ぞれ75%の相同性を示し、配列番号4のアミノ酸配列
は、シネココックスsp. PCC7002、シネコシスティスsp.
PCC6803とそれぞれ76%の相同性を示した。このことから
配列番号1はPsaAをコードする遺伝子で、配列番号3は
PsaBをコードする遺伝子であると判明した。
The homology of the deduced amino acid sequence is stored in the database NCBI BLAST (Stephen F. Altschul et al., 1997, Nuclei
c Acids Res. 25: 3389-3402), and as a result, the amino acid sequence of SEQ ID NO: 2 was identified as Synechococcus elongatus naegeri, Synechococcus vulcanus,
It shows 75% homology with each of PsaA of Fischerella PCC7605, and the amino acid sequence of SEQ ID NO: 4 is obtained from Synechocox sp. PCC7002, Synechocystis sp.
Each showed 76% homology with PCC6803. Accordingly, SEQ ID NO: 1 is a gene encoding PsaA, and SEQ ID NO: 3 is a gene encoding
It turned out to be a gene encoding PsaB.

【0032】〔実施例2〕実施例1の方法と同様の方法
で、MBIC11017株の全ゲノム遺伝子を抽出した。これを
鋳型にして、GAYATHGAYGGIATHMGIGARCCIGTとGGGAAGTTGT
GGGCATTRCGYTCGTGという塩基配列で表される2種類のプ
ライマーを用いて、実施例1のと同様の条件でPCRを行
った。
Example 2 A whole genome gene of MBIC11017 strain was extracted in the same manner as in Example 1. Using this as a template, GAYATHGAYGGIATHMGIGARCCIGT and GGGAAGTTGT
PCR was performed under the same conditions as in Example 1 using two types of primers represented by the base sequence GGGCATTRCGYTCGTG.

【0033】なお、上記2種のプライマー配列のうち前
者は、Search WWW Entrez at NCBIによって検索/引用
した、シネココックス・ヴルカヌス(ACCESSION P7922
2), シネココックス・エロガンツス (ACCESSION P35876
P35877), シネココックスsp.PCC7942 (ACCESSION P049
96), シネコシスティスsp. PCC6803 (ACCESSION D9089
9), アナベナsp. PCC7120 (ACCESSION U21331)のPsbAの
共通のアミノ酸配列DIDGIREPVから設計されたものであ
る。後者のプライマーは、Hessら(Plant Mol. Biol. 2
7: 1189-1196, 1995)から引用した。
The former primer sequence of the above two types of primer sequences was obtained from Synechococcus vulcanus (ACCESSION P7922) searched / quoted by Search WWW Entrez at NCBI.
2), Cinecox Eroganthus (ACCESSION P35876
P35877), Cinecox sp.PCC7942 (ACCESSION P049
96), Synechocystis sp.PCC6803 (ACCESSION D9089
9), designed from the common amino acid sequence DIDGIREPV of PsbA of Anabaena sp. PCC7120 (ACCESSION U21331). The latter primer is described in Hess et al. (Plant Mol. Biol. 2
7: 1189-1196, 1995).

【0034】PCRの結果、約1kbpの遺伝子断片が増幅さ
れた。この断片の配列を決定した。アミノ酸配列の両端
を決定するために、実施例1と同様の手順でインバース
PCRを行いタンパク質コード領域の両端の配列を決定し
た。決定された配列には、360アミノ酸をコードすると
思われるORFが含まれていた。この塩基配列を配列番号
5に示す。また。これらの配列から推定されるアミノ酸
配列を配列番号6に示す。
As a result of the PCR, a gene fragment of about 1 kbp was amplified. The sequence of this fragment was determined. In order to determine both ends of the amino acid sequence, the inverse procedure was used in the same manner as in Example 1.
PCR was performed to determine the sequence at both ends of the protein coding region. The determined sequence contained an ORF likely to encode 360 amino acids. This nucleotide sequence is shown in SEQ ID NO: 5. Also. The amino acid sequence deduced from these sequences is shown in SEQ ID NO: 6.

【0035】推定アミノ酸配列のホモロジーをデータベ
ースNCBI BLAST(Stephen F. Altschulら、1997、Nuclei
c Acids Res. 25:3389-3402)に基づいて検索を行った結
果、配列番号6のアミノ酸配列は、シネココックスsp.
PCC6301、シネコシスティスsp. PCC6803、アナベナsp.
のPsbAとそれぞれ85%の相同性を示した。このことから
配列番号6はPsbAをコードする遺伝子であると判明し
た。
The homology of the deduced amino acid sequence is stored in a database NCBI BLAST (Stephen F. Altschul et al., 1997, Nuclei
c Acids Res. 25: 3389-3402), and as a result, the amino acid sequence of SEQ ID NO: 6 was found to be Synechocox sp.
PCC6301, Synechocystis sp.PCC6803, Anabena sp.
And 85% homology with each other. This proved that SEQ ID NO: 6 was a gene encoding PsbA.

【0036】〔実施例3〕実施例1の方法と同様の方法
で、MBIC11017株の全ゲノム遺伝子を抽出した。これを
鋳型にして、TGGTTYGAYGTNCTCGAYGAYTGGとCGRTCRATACCY
TTYTCRAANCCという塩基配列で表される2種類のプライ
マーを用いて、実施例1と同様の条件でPCRを行った。
Example 3 In the same manner as in Example 1, the whole genome gene of MBIC11017 strain was extracted. Using this as a template, TGGTTYGAYGTNCTCGAYGAYTGG and CGRTCRATACCY
PCR was performed under the same conditions as in Example 1 using two types of primers represented by the base sequence TTYTCRAANCC.

【0037】なお、上記2種のプライマー配列のうち前
者は、Search WWW Entrez at NCBIによって検索/引用
した、シネココックスsp. PCC7002 (ACCESSION U3639
0), シネココックスsp. PCC7942 (ACCESSION M20815),
シネコシスティスsp. PCC6803(ACCESSION M21538), プ
ロコロトリクス・ホランディカ(Prochlorothrix holla
ndica)(ACCESSION U40144)のPsbDの共通のアミノ酸配
列DIDGIREPVから設計されたものである。後者のプライ
マーは、Search WWW Entrez at NCBIによって検索/引
用した、シネココックスsp. PCC7002 (ACCESSION U3639
0), シネココックスsp. PCC7942 (ACCESSION P11004),
シネコシスティスsp. PCC6803 (ACCESSION P09193), プ
ロコロトリクス・ホランディカ(ACCESSION P51753)、ア
ナベナsp. PCC7120 (ACCESSION L21857)のPsbCの共通の
アミノ酸配列GFEKGIDRから設計されたものである。PCR
の結果、約2.3kbpの遺伝子断片が増幅された。この断片
の配列を決定した。
The former primer sequence of the above two types of primer sequences was obtained from Cinecox sp. PCC7002 (ACCESSION U3639) searched / quoted by Search WWW Entrez at NCBI.
0), Cinecox sp.PCC7942 (ACCESSION M20815),
Synechocystis sp. PCC6803 (ACCESSION M21538), Prochlorothrix holla
ndica ) (ACCESSION U40144), designed from the common amino acid sequence DIDGIREPV of PsbD. The latter primer was synechocox sp. PCC7002 (ACCESSION U3639), searched / quoted by Search WWW Entrez at NCBI.
0), Cinecox sp.PCC7942 (ACCESSION P11004),
It is designed from the common amino acid sequence GFEKGIDR of PsbC of Synechocystis sp. PCC6803 (ACCESSION P09193), Procorotix Hollandica (ACCESSION P51753) and Anabaena sp. PCC7120 (ACCESSION L21857). PCR
As a result, a gene fragment of about 2.3 kbp was amplified. The sequence of this fragment was determined.

【0038】決定された全配列には、上流に339アミノ
酸配列(部分配列)と下流に446アミノ酸配列(部分配
列)をそれぞれコードすると思われる2つのORFが含ま
れていた。2つのORFの塩基配列(部分配列)を、配列
番号7および配列番号11にそれぞれ示す。また。これ
らの配列から推定されるアミノ酸配列を配列番号8およ
び配列番号12に示す。
The entire sequence determined contained two ORFs which seem to encode the 339 amino acid sequence (partial sequence) upstream and the 446 amino acid sequence (partial sequence) downstream, respectively. The nucleotide sequences (partial sequences) of the two ORFs are shown in SEQ ID NO: 7 and SEQ ID NO: 11, respectively. Also. The amino acid sequences deduced from these sequences are shown in SEQ ID NO: 8 and SEQ ID NO: 12.

【0039】推定アミノ酸配列のホモロジーをデータベ
ースNCBI BLAST(Stephen F. Altschulら、1997、Nuclei
c Acids Res. 25:3389-3402)に基づいて検索を行った結
果、配列番号8のアミノ酸配列は、アナベナsp. PCC712
0、シネココッカスsp. PCC7002のPsbDとそれぞれ88%の
相同性を示し、配列番号12のアミノ酸配列は、クロレラ
・ヴルガリス(Chlorella vulgaris)のPsbCと69%の相
同性を示し、シネコシスティスsp.、ネフロセルミス・
オリバセア(Nephroselmis olivacea) のPsbCと68%の
相同性を示した。このことから配列番号7はPsbDをコー
ドする遺伝子で、配列番号11はPsbCをコードする遺伝子
であると判明した。
The homology of the deduced amino acid sequence is stored in the database NCBI BLAST (Stephen F. Altschul et al., 1997, Nuclei
c Acids Res. 25: 3389-3402), the amino acid sequence of SEQ ID NO: 8 was found to be Anabaena sp.
0, showing 88% homology with PsbD of Synechococcus sp. PCC7002, respectively. The amino acid sequence of SEQ ID NO: 12 shows 69% homology with PsbC of Chlorella vulgaris , Synechocystis sp.
It showed 68% homology with PsbC of Olivesea ( Nephroselmis olivacea ). This proved that SEQ ID NO: 7 was a gene encoding PsbD and SEQ ID NO: 11 was a gene encoding PsbC.

【0040】〔実施例4〕実施例1の方法と同様の方法
で、MBIC11017株の全ゲノム遺伝子を抽出した。これを
鋳型にして、ATGGGACTACCYTGGTAYCGとCCRTGCCAGAKRTGRC
CRAAという塩基配列で表される2種類のプライマーを用
いて、実施例1のと同様の条件でPCRを行った。
Example 4 In the same manner as in Example 1, the whole genome gene of MBIC11017 strain was extracted. Using this as a template, ATGGGACTACCYTGGTAYCG and CCRTGCCAGAKRTGRC
PCR was performed under the same conditions as in Example 1 using two types of primers represented by the nucleotide sequence of CRAA.

【0041】なお、上記2種のプライマー配列は、Sear
ch WWW Entrez at NCBIによって検索/引用した、シネ
ココックスsp. PCC7002 (ACCESSION 227610), シネココ
ックスsp. PCC7942 (ACCESSION 79674), シネコシステ
ィスsp. PCC6803 (ACCESSION225859), プロコロトリク
ス・ホランディカ (ACCESSION 131272)のPsbBの共通の
アミノ酸配列MGLPWYRおよびFGH(I/L)WHGから設計された
ものである。PCRの結果、約1.4kbpの遺伝子断片が増幅
された。この断片の配列を決定した。
The above two primer sequences are used in the Sear
ch Common to PsbB of Synechocox sp. PCC7002 (ACCESSION 227610), Synechocox sp. PCC7942 (ACCESSION 79674), Synechocystis sp. Is designed from the amino acid sequences MGLPWYR and FGH (I / L) WHG. As a result of the PCR, a gene fragment of about 1.4 kbp was amplified. The sequence of this fragment was determined.

【0042】決定された全配列には、456アミノ酸配列
(部分配列)をコードすると思われるORFが含まれてい
た。ORFの塩基配列(部分配列)を配列番号9に示す。
また。この配列から推定されるアミノ酸配列を配列番号
10に示す。
The entire sequence determined contained an ORF likely to encode a 456 amino acid sequence (partial sequence). SEQ ID NO: 9 shows the nucleotide sequence (partial sequence) of ORF.
Also. The amino acid sequence deduced from this sequence is shown in SEQ ID NO: 10.

【0043】推定アミノ酸配列のホモロジーをデータベ
ースNCBI BLAST(Stephen F. Altschulら、1997、Nuclei
c Acids Res. 25:3389-3402)に基づいて検索を行った結
果、配列番号8のアミノ酸配列は、プロコロトリクス・
ホランディカのPsbBと68%の相同性を示し、マスチゴク
ランドス・ラミノスス(Mastigocladus laminosus)のP
sbDとそれぞれ67%の相同性を示し、アナベナsp. PCC712
0のPsbBと66%の相同性を示した。このことから配列番号
9はPsbBをコードする遺伝子であると判明した。
The homology of the deduced amino acid sequence was determined using the database NCBI BLAST (Stephen F. Altschul et al., 1997, Nuclei
c Acids Res. 25: 3389-3402), the amino acid sequence of SEQ ID NO: 8 was
Shows 68% homology with PsbB of Hollandica and P of Mastigocladus laminosus
each showing 67% homology with sbD, Anabaena sp. PCC712
It showed 66% homology with 0 PsbB. From this, the sequence number
9 was found to be a gene encoding PsbB.

【0044】[0044]

【発明の効果】本発明は、クロロフィルdを光受容色素
として用いる光化学系タンパク質、及びそれをコードす
る遺伝子を提供する。この遺伝子等を用いることによ
り、これまで利用できなかった波長領域を用いた光合成
機能の活用が可能になる。
Industrial Applicability The present invention provides a photochemical protein using chlorophyll d as a photoreceptor dye, and a gene encoding the same. By using this gene or the like, it becomes possible to utilize a photosynthetic function using a wavelength region that has not been available until now.

【0045】[0045]

【配列表】 SEQUENCELISTING <110> MARINEBIOTECHNOLOGY INSTITUTECO.,LTD. <120> <130> P99-0652 <160> 20 <170> PatentIn version 2.0 <210> 1 <211> 2262 <212> DNA <213> Acaryuochloris marina <220> <221> CDS <222> (1)...(2259) <400> 1 atg aca act agc cca ggt ggg cca gag aca aaa ggc aga aca gct gaa 48 Met Thr Thr Ser Pro Gly Gly Pro Glu Thr Lys Gly Arg Thr Ala Glu 1 5 10 15 gtt gac atc aac cca gtt agc gct tct tta gaa gtc gcg ggt aag ccg 96 Val Asp Ile Asn Pro Val Ser Ala Ser Leu Glu Val Ala Gly Lys Pro 20 25 30 ggt cac ttt aat aaa agt ctg tcg aaa ggt ccc caa acc acc act tgg 144 Gly His Phe Asn Lys Ser Leu Ser Lys Gly Pro Gln Thr Thr Thr Trp 35 40 45 att tgg aac cta cat gct cta gcc cat gat ttt gat aca caa aca aac 192 Ile Trp Asn Leu His Ala Leu Ala His Asp Phe Asp Thr Gln Thr Asn 50 55 60 gac cta gaa gaa att tcc cgc aaa att ttc agt gcc cat ttt gga cac 240 Asp Leu Glu Glu Ile Ser Arg Lys Ile Phe Ser Ala His Phe Gly His 65 70 75 80 tta tcc atc att ttt gta tgg atc agc ggg atg atc ttc cat gct gct 288 Leu Ser Ile Ile Phe Val Trp Ile Ser Gly Met Ile Phe His Ala Ala 85 90 95 cgt ttt tct aac tac tac gct tgg tta gcc gat ccg ctc ggc aac aaa 336 Arg Phe Ser Asn Tyr Tyr Ala Trp Leu Ala Asp Pro Leu Gly Asn Lys 100 105 110 ccc agt gct cac gta gtt tgg ccc att gtt ggc caa gat att tta aat 384 Pro Ser Ala His Val Val Trp Pro Ile Val Gly Gln Asp Ile Leu Asn 115 120 125 gca gat gtt ggt aat gga ttc cgc gga atc caa att acc tct ggc ctt 432 Ala Asp Val Gly Asn Gly Phe Arg Gly Ile Gln Ile Thr Ser Gly Leu 130 135 140 ttc cat att tta cgc ggg gct gga atg act gac ccc ggt gaa ctt tat 480 Phe His Ile Leu Arg Gly Ala Gly Met Thr Asp Pro Gly Glu Leu Tyr 145 150 155 160 tca gca gcc att ggt gcc ctt gtt gca gcg gtt gta atg atg tac gcc 528 Ser Ala Ala Ile Gly Ala Leu Val Ala Ala Val Val Met Met Tyr Ala 165 170 175 ggg tat tat cac tac cac aag aaa gca cct aaa ttg gag tgg ttc caa 576 Gly Tyr Tyr His Tyr His Lys Lys Ala Pro Lys Leu Glu Trp Phe Gln 180 185 190 aac gcc gaa tca acg atg acc cac cat ctc atc gtt ctt tta ggg ctt 624 Asn Ala Glu Ser Thr Met Thr His His Leu Ile Val Leu Leu Gly Leu 195 200 205 ggg aac ctt gcc tgg aca ggt cac ctt atc cat gtt tct ctg cca gtc 672 Gly Asn Leu Ala Trp Thr Gly His Leu Ile His Val Ser Leu Pro Val 210 215 220 aat aag ctt ctt gat tct ggt gta gcc cca caa gat ata cca atc ccc 720 Asn Lys Leu Leu Asp Ser Gly Val Ala Pro Gln Asp Ile Pro Ile Pro 225 230 235 240 cat gaa ttt ctt ttt gat aat gga ttt atg gcg gat tta tat ccc agc 768 His Glu Phe Leu Phe Asp Asn Gly Phe Met Ala Asp Leu Tyr Pro Ser 245 250 255 ttt gct cag gga tta atg cct tac ttc acc cta aat tgg ggt gct tat 816 Phe Ala Gln Gly Leu Met Pro Tyr Phe Thr Leu Asn Trp Gly Ala Tyr 260 265 270 tct gac ttc ctt acc ttc aaa gga ggg ctt gac cca aca acg ggt ggc 864 Ser Asp Phe Leu Thr Phe Lys Gly Gly Leu Asp Pro Thr Thr Gly Gly 275 280 285 cta tgg atg aca gat ata gcc cat cac cat ttg gca ttg gca gta atg 912 Leu Trp Met Thr Asp Ile Ala His His His Leu Ala Leu Ala Val Met 290 295 300 tac atc att gct ggt cat atg tac cga acc aac tgg ggt att ggg cac 960 Tyr Ile Ile Ala Gly His Met Tyr Arg Thr Asn Trp Gly Ile Gly His 305 310 315 320 agt atg aag gaa atc atg gaa tct cat aaa ggt ccc ttt act ggc gaa 1008 Ser Met Lys Glu Ile Met Glu Ser His Lys Gly Pro Phe Thr Gly Glu 325 330 335 ggc cat aaa ggt cta tat gag gtg ctg aca act tct tgg cat gcc cag 1056 Gly His Lys Gly Leu Tyr Glu Val Leu Thr Thr Ser Trp His Ala Gln 340 345 350 cta gca att aac cta gcc aca tgg ggt tct ttc agc atc att gtt gcc 1104 Leu Ala Ile Asn Leu Ala Thr Trp Gly Ser Phe Ser Ile Ile Val Ala 355 360 365 cac cac atg tac gca atg cct cct tat cct tac ttg gca aca gat tac 1152 His His Met Tyr Ala Met Pro Pro Tyr Pro Tyr Leu Ala Thr Asp Tyr 370 375 380 ggc acg cag ctg aat ctg ttc gtc cat cat atg tgg att gga ggt ttc 1200 Gly Thr Gln Leu Asn Leu Phe Val His His Met Trp Ile Gly Gly Phe 385 390 395 400 ttg att gtt ggt ggt gct gcc cac gca gct att ttc atg gtt cgg gat 1248 Leu Ile Val Gly Gly Ala Ala His Ala Ala Ile Phe Met Val Arg Asp 405 410 415 tac gat cca gct gtg aac caa aac aat gtt ctt gat cgg atg ctt cgt 1296 Tyr Asp Pro Ala Val Asn Gln Asn Asn Val Leu Asp Arg Met Leu Arg 420 425 430 cac cga gat acg atc att tcc cat cta aac tgg gtc tgt att ttc ctt 1344 His Arg Asp Thr Ile Ile Ser His Leu Asn Trp Val Cys Ile Phe Leu 435 440 445 ggg ttc cat tct ttt ggc ttg tat atc cat aac gac aat atg cgt tct 1392 Gly Phe His Ser Phe Gly Leu Tyr Ile His Asn Asp Asn Met Arg Ser 450 455 460 ttg ggt cgg cct caa gat atg ttc tcc gac act gct atc caa ctg caa 1440 Leu Gly Arg Pro Gln Asp Met Phe Ser Asp Thr Ala Ile Gln Leu Gln 465 470 475 480 cct att ttt tct caa tgg gtt cag aac tta caa gca aac gtt gct gga 1488 Pro Ile Phe Ser Gln Trp Val Gln Asn Leu Gln Ala Asn Val Ala Gly 485 490 495 aca att cgg gct ccc ttg gca gaa ggt gca tca agc tta gct tgg ggt 1536 Thr Ile Arg Ala Pro Leu Ala Glu Gly Ala Ser Ser Leu Ala Trp Gly 500 505 510 ggc gat cct ttg ttt gtt ggc gga aaa gtt gca atg caa cat gtt tcc 1584 Gly Asp Pro Leu Phe Val Gly Gly Lys Val Ala Met Gln His Val Ser 515 520 525 tta gga acc gcc gat ttc atg atc cac cac att cac gcc ttc cag att 1632 Leu Gly Thr Ala Asp Phe Met Ile His His Ile His Ala Phe Gln Ile 530 535 540 cac gtt act gtt ctc atc cta atc aag ggt gtt ctc tac gct cgt agc 1680 His Val Thr Val Leu Ile Leu Ile Lys Gly Val Leu Tyr Ala Arg Ser 545 550 555 560 tct cgt cta att cca gac aaa gct aac ttg ggc ttc aga ttc cct tgc 1728 Ser Arg Leu Ile Pro Asp Lys Ala Asn Leu Gly Phe Arg Phe Pro Cys 565 570 575 gac gga cca ggt cgg ggt ggt act tgc caa tct tct ggt tgg gac cat 1776 Asp Gly Pro Gly Arg Gly Gly Thr Cys Gln Ser Ser Gly Trp Asp His 580 585 590 atc ttc ttg ggt ctg ttc tgg atg tac aac tgc atc tca att gtc aat 1824 Ile Phe Leu Gly Leu Phe Trp Met Tyr Asn Cys Ile Ser Ile Val Asn 595 600 605 ttc cac ttc ttc tgg aaa atg cag tcg gat gtt tgg ggt gcc gca aat 1872 Phe His Phe Phe Trp Lys Met Gln Ser Asp Val Trp Gly Ala Ala Asn 610 615 620 gct aat ggc ggc gtt aat tac cta aca gct ggc aac tgg gca cag tct 1920 Ala Asn Gly Gly Val Asn Tyr Leu Thr Ala Gly Asn Trp Ala Gln Ser 625 630 635 640 tca atc act att aat ggt tgg ttg cga gat ttc tta tgg gcc caa tcg 1968 Ser Ile Thr Ile Asn Gly Trp Leu Arg Asp Phe Leu Trp Ala Gln Ser 645 650 655 gtt cag gtg att aac tcc tat ggt tct gcc cta tct gcc tac gga att 2016 Val Gln Val Ile Asn Ser Tyr Gly Ser Ala Leu Ser Ala Tyr Gly Ile 660 665 670 tta ttc cta ggt gcc cac ttc atc tgg gct ttc agc ctg atg ttc ttg 2064 Leu Phe Leu Gly Ala His Phe Ile Trp Ala Phe Ser Leu Met Phe Leu 675 680 685 ttc agt ggt cgt ggc tat tgg caa gag ctg atc gag tct att gtt tgg 2112 Phe Ser Gly Arg Gly Tyr Trp Gln Glu Leu Ile Glu Ser Ile Val Trp 690 695 700 gct cac agc aaa cta aag att gct cca gcc att cag cca cgc gct atg 2160 Ala His Ser Lys Leu Lys Ile Ala Pro Ala Ile Gln Pro Arg Ala Met 705 710 715 720 agt att act caa ggt cgt gca gtt gga ctg ggc cat tac ctc cta ggt 2208 Ser Ile Thr Gln Gly Arg Ala Val Gly Leu Gly His Tyr Leu Leu Gly 725 730 735 gga att gtg acc tct tgg tca ttc tac cta gct cga att ctc gca tta 2256 Gly Ile Val Thr Ser Trp Ser Phe Tyr Leu Ala Arg Ile Leu Ala Leu 740 745 750 gga tag 2262 Gly <210> 2 <211> 753 <212> PRT <213> Acaryuochloris marina <400> 2 Met Thr Thr Ser Pro Gly Gly Pro Glu Thr Lys Gly Arg Thr Ala Glu 1 5 10 15 Val Asp Ile Asn Pro Val Ser Ala Ser Leu Glu Val Ala Gly Lys Pro 20 25 30 Gly His Phe Asn Lys Ser Leu Ser Lys Gly Pro Gln Thr Thr Thr Trp 35 40 45 Ile Trp Asn Leu His Ala Leu Ala His Asp Phe Asp Thr Gln Thr Asn 50 55 60 Asp Leu Glu Glu Ile Ser Arg Lys Ile Phe Ser Ala His Phe Gly His 65 70 75 80 Leu Ser Ile Ile Phe Val Trp Ile Ser Gly Met Ile Phe His Ala Ala 85 90 95 Arg Phe Ser Asn Tyr Tyr Ala Trp Leu Ala Asp Pro Leu Gly Asn Lys 100 105 110 Pro Ser Ala His Val Val Trp Pro Ile Val Gly Gln Asp Ile Leu Asn 115 120 125 Ala Asp Val Gly Asn Gly Phe Arg Gly Ile Gln Ile Thr Ser Gly Leu 130 135 140 Phe His Ile Leu Arg Gly Ala Gly Met Thr Asp Pro Gly Glu Leu Tyr 145 150 155 160 Ser Ala Ala Ile Gly Ala Leu Val Ala Ala Val Val Met Met Tyr Ala 165 170 175 Gly Tyr Tyr His Tyr His Lys Lys Ala Pro Lys Leu Glu Trp Phe Gln 180 185 190 Asn Ala Glu Ser Thr Met Thr His His Leu Ile Val Leu Leu Gly Leu 195 200 205 Gly Asn Leu Ala Trp Thr Gly His Leu Ile His Val Ser Leu Pro Val 210 215 220 Asn Lys Leu Leu Asp Ser Gly Val Ala Pro Gln Asp Ile Pro Ile Pro 225 230 235 240 His Glu Phe Leu Phe Asp Asn Gly Phe Met Ala Asp Leu Tyr Pro Ser 245 250 255 Phe Ala Gln Gly Leu Met Pro Tyr Phe Thr Leu Asn Trp Gly Ala Tyr 260 265 270 Ser Asp Phe Leu Thr Phe Lys Gly Gly Leu Asp Pro Thr Thr Gly Gly 275 280 285 Leu Trp Met Thr Asp Ile Ala His His His Leu Ala Leu Ala Val Met 290 295 300 Tyr Ile Ile Ala Gly His Met Tyr Arg Thr Asn Trp Gly Ile Gly His 305 310 315 320 Ser Met Lys Glu Ile Met Glu Ser His Lys Gly Pro Phe Thr Gly Glu 325 330 335 Gly His Lys Gly Leu Tyr Glu Val Leu Thr Thr Ser Trp His Ala Gln 340 345 350 Leu Ala Ile Asn Leu Ala Thr Trp Gly Ser Phe Ser Ile Ile Val Ala 355 360 365 His His Met Tyr Ala Met Pro Pro Tyr Pro Tyr Leu Ala Thr Asp Tyr 370 375 380 Gly Thr Gln Leu Asn Leu Phe Val His His Met Trp Ile Gly Gly Phe 385 390 395 400 Leu Ile Val Gly Gly Ala Ala His Ala Ala Ile Phe Met Val Arg Asp 405 410 415 Tyr Asp Pro Ala Val Asn Gln Asn Asn Val Leu Asp Arg Met Leu Arg 420 425 430 His Arg Asp Thr Ile Ile Ser His Leu Asn Trp Val Cys Ile Phe Leu 435 440 445 Gly Phe His Ser Phe Gly Leu Tyr Ile His Asn Asp Asn Met Arg Ser 450 455 460 Leu Gly Arg Pro Gln Asp Met Phe Ser Asp Thr Ala Ile Gln Leu Gln 465 470 475 480 Pro Ile Phe Ser Gln Trp Val Gln Asn Leu Gln Ala Asn Val Ala Gly 485 490 495 Thr Ile Arg Ala Pro Leu Ala Glu Gly Ala Ser Ser Leu Ala Trp Gly 500 505 510 Gly Asp Pro Leu Phe Val Gly Gly Lys Val Ala Met Gln His Val Ser 515 520 525 Leu Gly Thr Ala Asp Phe Met Ile His His Ile His Ala Phe Gln Ile 530 535 540 His Val Thr Val Leu Ile Leu Ile Lys Gly Val Leu Tyr Ala Arg Ser 545 550 555 560 Ser Arg Leu Ile Pro Asp Lys Ala Asn Leu Gly Phe Arg Phe Pro Cys 565 570 575 Asp Gly Pro Gly Arg Gly Gly Thr Cys Gln Ser Ser Gly Trp Asp His 580 585 590 Ile Phe Leu Gly Leu Phe Trp Met Tyr Asn Cys Ile Ser Ile Val Asn 595 600 605 Phe His Phe Phe Trp Lys Met Gln Ser Asp Val Trp Gly Ala Ala Asn 610 615 620 Ala Asn Gly Gly Val Asn Tyr Leu Thr Ala Gly Asn Trp Ala Gln Ser 625 630 635 640 Ser Ile Thr Ile Asn Gly Trp Leu Arg Asp Phe Leu Trp Ala Gln Ser 645 650 655 Val Gln Val Ile Asn Ser Tyr Gly Ser Ala Leu Ser Ala Tyr Gly Ile 660 665 670 Leu Phe Leu Gly Ala His Phe Ile Trp Ala Phe Ser Leu Met Phe Leu 675 680 685 Phe Ser Gly Arg Gly Tyr Trp Gln Glu Leu Ile Glu Ser Ile Val Trp 690 695 700 Ala His Ser Lys Leu Lys Ile Ala Pro Ala Ile Gln Pro Arg Ala Met 705 710 715 720 Ser Ile Thr Gln Gly Arg Ala Val Gly Leu Gly His Tyr Leu Leu Gly 725 730 735 Gly Ile Val Thr Ser Trp Ser Phe Tyr Leu Ala Arg Ile Leu Ala Leu 740 745 750 Gly <210> 3 <211> 2211 <212> DNA <213> Acaryuochloris marina <220> <221> CDS <222> (1)...(2208) <400> 3 atg gct act aaa ttt cct agt ttt agc caa gac ctt gcc caa gat cca 48 Met Ala Thr Lys Phe Pro Ser Phe Ser Gln Asp Leu Ala Gln Asp Pro 1 5 10 15 aca aca cgt cgg atc tgg tac gga att gcc aca gtt cat gat ttt gag 96 Thr Thr Arg Arg Ile Trp Tyr Gly Ile Ala Thr Val His Asp Phe Glu 20 25 30 act cat gac gga atg acg gag gaa aat ctt tat caa aag att ttc gcg 144 Thr His Asp Gly Met Thr Glu Glu Asn Leu Tyr Gln Lys Ile Phe Ala 35 40 45 act cac ttc ggt cat ctc tct att atc ttt cta tgg tct gct ggc cat 192 Thr His Phe Gly His Leu Ser Ile Ile Phe Leu Trp Ser Ala Gly His 50 55 60 ctt ttc cat gtc gcc tgg caa ggc aac ttt gaa cag tgg atc caa gat 240 Leu Phe His Val Ala Trp Gln Gly Asn Phe Glu Gln Trp Ile Gln Asp 65 70 75 80 cca cta acc atc cgt ccc atc gcc cat gcg att tgg gac ccc cat ttg 288 Pro Leu Thr Ile Arg Pro Ile Ala His Ala Ile Trp Asp Pro His Leu 85 90 95 ggt gat gct gca act cag gcc ttc acc caa gct ggc gct tct ggt cca 336 Gly Asp Ala Ala Thr Gln Ala Phe Thr Gln Ala Gly Ala Ser Gly Pro 100 105 110 gtt gac ctt tgt tat tct ggc ctc tac caa tgg tgg tac acc att ggt 384 Val Asp Leu Cys Tyr Ser Gly Leu Tyr Gln Trp Trp Tyr Thr Ile Gly 115 120 125 atg cgt acc aat ggt gat tta tac att ggt tct gtt ttc ttg atg att 432 Met Arg Thr Asn Gly Asp Leu Tyr Ile Gly Ser Val Phe Leu Met Ile 130 135 140 gtc gct gca gtc atg ttg ttt gca ggt tgg ctt cat cta caa ccc aaa 480 Val Ala Ala Val Met Leu Phe Ala Gly Trp Leu His Leu Gln Pro Lys 145 150 155 160 ttt cga ccc agc tta gcc tgg ttt aga gat gct gaa tcc caa atg aac 528 Phe Arg Pro Ser Leu Ala Trp Phe Arg Asp Ala Glu Ser Gln Met Asn 165 170 175 cac cac ttg gca gtt cta ttt ggt gct agc tct ttg ggc tgg aca ggc 576 His His Leu Ala Val Leu Phe Gly Ala Ser Ser Leu Gly Trp Thr Gly 180 185 190 cac tta atc cac gtt gct att ccc gaa gct cgg ggt cag cac gta ggt 624 His Leu Ile His Val Ala Ile Pro Glu Ala Arg Gly Gln His Val Gly 195 200 205 tgg gat aac ttt ctg tca acc atg cct cac cct gct ggt tta gcg cct 672 Trp Asp Asn Phe Leu Ser Thr Met Pro His Pro Ala Gly Leu Ala Pro 210 215 220 ttc ttt act ggg cgt tgg gga gtt tat gct caa aac cct gat act gct 720 Phe Phe Thr Gly Arg Trp Gly Val Tyr Ala Gln Asn Pro Asp Thr Ala 225 230 235 240 ggt cat att ttt gga act agc gaa ggt gct gga act gcg att att acc 768 Gly His Ile Phe Gly Thr Ser Glu Gly Ala Gly Thr Ala Ile Ile Thr 245 250 255 ttt att ggc ggt ttc cat ccc caa act gaa gca ttg tgg cta act gat 816 Phe Ile Gly Gly Phe His Pro Gln Thr Glu Ala Leu Trp Leu Thr Asp 260 265 270 att gcc cac cac cat ctg gct att gct gtg atg tac atc att gct ggc 864 Ile Ala His His His Leu Ala Ile Ala Val Met Tyr Ile Ile Ala Gly 275 280 285 cat atg tat cga act cag ttc ggt att ggg cat agt atg aaa gag atc 912 His Met Tyr Arg Thr Gln Phe Gly Ile Gly His Ser Met Lys Glu Ile 290 295 300 cta gaa gca cac acc cct ccc agc ggg atg ttg ggt gat gcg cac aag 960 Leu Glu Ala His Thr Pro Pro Ser Gly Met Leu Gly Asp Ala His Lys 305 310 315 320 ggc ctt tat gac act tac aat gaa tct cta cat ttc cag tta ggt ttc 1008 Gly Leu Tyr Asp Thr Tyr Asn Glu Ser Leu His Phe Gln Leu Gly Phe 325 330 335 cac cta gct gca tta ggt gta atc act tct gtg gtt gcc caa cat atg 1056 His Leu Ala Ala Leu Gly Val Ile Thr Ser Val Val Ala Gln His Met 340 345 350 tat tca ttg ccg tca tac gct ttc atc tct caa gac cat gtc aca caa 1104 Tyr Ser Leu Pro Ser Tyr Ala Phe Ile Ser Gln Asp His Val Thr Gln 355 360 365 gct gcg ctt tac aca cat cac caa tat att gct gga att cta gca att 1152 Ala Ala Leu Tyr Thr His His Gln Tyr Ile Ala Gly Ile Leu Ala Ile 370 375 380 ggt gct ttt gcg cat ggt ggt atc ttc ttt gtc cga gat tac gat cca 1200 Gly Ala Phe Ala His Gly Gly Ile Phe Phe Val Arg Asp Tyr Asp Pro 385 390 395 400 gaa cgt aac aag aac aac gtt ctt gct cgt gct ctt gag cat aaa gag 1248 Glu Arg Asn Lys Asn Asn Val Leu Ala Arg Ala Leu Glu His Lys Glu 405 410 415 gcg att atc tcc cac cta tct tgg gta tcc atg ttc agt ggt ttc cat 1296 Ala Ile Ile Ser His Leu Ser Trp Val Ser Met Phe Ser Gly Phe His 420 425 430 acc ctt ggt gtt tat gtt cat aac gac acc gtg gta gct ttt ggt act 1344 Thr Leu Gly Val Tyr Val His Asn Asp Thr Val Val Ala Phe Gly Thr 435 440 445 cct gag aag caa att ttg gtt gag cca atc ttt gcg caa tgg att cag 1392 Pro Glu Lys Gln Ile Leu Val Glu Pro Ile Phe Ala Gln Trp Ile Gln 450 455 460 gca gct cat ggc aaa ctg ctc tta gga ttt gaa aca ctg ctt tca aat 1440 Ala Ala His Gly Lys Leu Leu Leu Gly Phe Glu Thr Leu Leu Ser Asn 465 470 475 480 cct aat gga ttg gct tat aac cct cct aac att tct cct gat gta ttt 1488 Pro Asn Gly Leu Ala Tyr Asn Pro Pro Asn Ile Ser Pro Asp Val Phe 485 490 495 gtt cct gga tgg gtt gaa gca atg aac aac cct gtt atc ggg ccg ttt 1536 Val Pro Gly Trp Val Glu Ala Met Asn Asn Pro Val Ile Gly Pro Phe 500 505 510 atg tct caa ggg cct ggt gac ttc ttg gtt cat cat ggt att gcc ttc 1584 Met Ser Gln Gly Pro Gly Asp Phe Leu Val His His Gly Ile Ala Phe 515 520 525 agt ttg cat gtc acc gtc tta atc tgt gtc aag ggt tgt ttg gat gcc 1632 Ser Leu His Val Thr Val Leu Ile Cys Val Lys Gly Cys Leu Asp Ala 530 535 540 cgt ggt tct aaa ctg atg cct gac aag aaa gac ttt ggt tat agc ttc 1680 Arg Gly Ser Lys Leu Met Pro Asp Lys Lys Asp Phe Gly Tyr Ser Phe 545 550 555 560 cct tgt gat ggc ccc gga cgt ggc ggt act tgt gat atc tct gct tgg 1728 Pro Cys Asp Gly Pro Gly Arg Gly Gly Thr Cys Asp Ile Ser Ala Trp 565 570 575 gat tcc ttc tac ctt gcc ttc ttc tgg atg ctc aac aca att ggt tgg 1776 Asp Ser Phe Tyr Leu Ala Phe Phe Trp Met Leu Asn Thr Ile Gly Trp 580 585 590 att gtc ttc tac ttc aac tgg aag cat ttg gct atc tgg tct ggt aac 1824 Ile Val Phe Tyr Phe Asn Trp Lys His Leu Ala Ile Trp Ser Gly Asn 595 600 605 gaa gct cag ttc aat acc aac tct act tat cta atg ggt tgg ctg cga 1872 Glu Ala Gln Phe Asn Thr Asn Ser Thr Tyr Leu Met Gly Trp Leu Arg 610 615 620 gac tac ctt tgg gga tac tca gct caa ttg att aac ggt tac aca cca 1920 Asp Tyr Leu Trp Gly Tyr Ser Ala Gln Leu Ile Asn Gly Tyr Thr Pro 625 630 635 640 ttt ggt gta aat agc ctg tca gtt tgg gct tgg att ttc ctc tta ggc 1968 Phe Gly Val Asn Ser Leu Ser Val Trp Ala Trp Ile Phe Leu Leu Gly 645 650 655 cac ctc tgc tgg gcg act ggc ttc ttg ttc ttg atc tcc tgg aga ggt 2016 His Leu Cys Trp Ala Thr Gly Phe Leu Phe Leu Ile Ser Trp Arg Gly 660 665 670 tac tgg caa gag ctg att gag act ctc gtt tgg gct cac cag cgt act 2064 Tyr Trp Gln Glu Leu Ile Glu Thr Leu Val Trp Ala His Gln Arg Thr 675 680 685 ccc ctc gcc aac tta gtg aca tgg aaa gac aag cct gtt gct ctc tct 2112 Pro Leu Ala Asn Leu Val Thr Trp Lys Asp Lys Pro Val Ala Leu Ser 690 695 700 atc gtt caa ggt cgc ttg gtg ggt tta gtc cac ttt gcg gtt ggc tat 2160 Ile Val Gln Gly Arg Leu Val Gly Leu Val His Phe Ala Val Gly Tyr 705 710 715 720 tat gtg acc tac gcg gct ttt gtg att ggt gca aca gct cct ctc ggc 2208 Tyr Val Thr Tyr Ala Ala Phe Val Ile Gly Ala Thr Ala Pro Leu Gly 725 730 735 taa 2211 <210> 4 <211> 736 <212> PRT <213> Acaryuochloris marina <400> 4 Met Ala Thr Lys Phe Pro Ser Phe Ser Gln Asp Leu Ala Gln Asp Pro 1 5 10 15 Thr Thr Arg Arg Ile Trp Tyr Gly Ile Ala Thr Val His Asp Phe Glu 20 25 30 Thr His Asp Gly Met Thr Glu Glu Asn Leu Tyr Gln Lys Ile Phe Ala 35 40 45 Thr His Phe Gly His Leu Ser Ile Ile Phe Leu Trp Ser Ala Gly His 50 55 60 Leu Phe His Val Ala Trp Gln Gly Asn Phe Glu Gln Trp Ile Gln Asp 65 70 75 80 Pro Leu Thr Ile Arg Pro Ile Ala His Ala Ile Trp Asp Pro His Leu 85 90 95 Gly Asp Ala Ala Thr Gln Ala Phe Thr Gln Ala Gly Ala Ser Gly Pro 100 105 110 Val Asp Leu Cys Tyr Ser Gly Leu Tyr Gln Trp Trp Tyr Thr Ile Gly 115 120 125 Met Arg Thr Asn Gly Asp Leu Tyr Ile Gly Ser Val Phe Leu Met Ile 130 135 140 Val Ala Ala Val Met Leu Phe Ala Gly Trp Leu His Leu Gln Pro Lys 145 150 155 160 Phe Arg Pro Ser Leu Ala Trp Phe Arg Asp Ala Glu Ser Gln Met Asn 165 170 175 His His Leu Ala Val Leu Phe Gly Ala Ser Ser Leu Gly Trp Thr Gly 180 185 190 His Leu Ile His Val Ala Ile Pro Glu Ala Arg Gly Gln His Val Gly 195 200 205 Trp Asp Asn Phe Leu Ser Thr Met Pro His Pro Ala Gly Leu Ala Pro 210 215 220 Phe Phe Thr Gly Arg Trp Gly Val Tyr Ala Gln Asn Pro Asp Thr Ala 225 230 235 240 Gly His Ile Phe Gly Thr Ser Glu Gly Ala Gly Thr Ala Ile Ile Thr 245 250 255 Phe Ile Gly Gly Phe His Pro Gln Thr Glu Ala Leu Trp Leu Thr Asp 260 265 270 Ile Ala His His His Leu Ala Ile Ala Val Met Tyr Ile Ile Ala Gly 275 280 285 His Met Tyr Arg Thr Gln Phe Gly Ile Gly His Ser Met Lys Glu Ile 290 295 300 Leu Glu Ala His Thr Pro Pro Ser Gly Met Leu Gly Asp Ala His Lys 305 310 315 320 Gly Leu Tyr Asp Thr Tyr Asn Glu Ser Leu His Phe Gln Leu Gly Phe 325 330 335 His Leu Ala Ala Leu Gly Val Ile Thr Ser Val Val Ala Gln His Met 340 345 350 Tyr Ser Leu Pro Ser Tyr Ala Phe Ile Ser Gln Asp His Val Thr Gln 355 360 365 Ala Ala Leu Tyr Thr His His Gln Tyr Ile Ala Gly Ile Leu Ala Ile 370 375 380 Gly Ala Phe Ala His Gly Gly Ile Phe Phe Val Arg Asp Tyr Asp Pro 385 390 395 400 Glu Arg Asn Lys Asn Asn Val Leu Ala Arg Ala Leu Glu His Lys Glu 405 410 415 Ala Ile Ile Ser His Leu Ser Trp Val Ser Met Phe Ser Gly Phe His 420 425 430 Thr Leu Gly Val Tyr Val His Asn Asp Thr Val Val Ala Phe Gly Thr 435 440 445 Pro Glu Lys Gln Ile Leu Val Glu Pro Ile Phe Ala Gln Trp Ile Gln 450 455 460 Ala Ala His Gly Lys Leu Leu Leu Gly Phe Glu Thr Leu Leu Ser Asn 465 470 475 480 Pro Asn Gly Leu Ala Tyr Asn Pro Pro Asn Ile Ser Pro Asp Val Phe 485 490 495 Val Pro Gly Trp Val Glu Ala Met Asn Asn Pro Val Ile Gly Pro Phe 500 505 510 Met Ser Gln Gly Pro Gly Asp Phe Leu Val His His Gly Ile Ala Phe 515 520 525 Ser Leu His Val Thr Val Leu Ile Cys Val Lys Gly Cys Leu Asp Ala 530 535 540 Arg Gly Ser Lys Leu Met Pro Asp Lys Lys Asp Phe Gly Tyr Ser Phe 545 550 555 560 Pro Cys Asp Gly Pro Gly Arg Gly Gly Thr Cys Asp Ile Ser Ala Trp 565 570 575 Asp Ser Phe Tyr Leu Ala Phe Phe Trp Met Leu Asn Thr Ile Gly Trp 580 585 590 Ile Val Phe Tyr Phe Asn Trp Lys His Leu Ala Ile Trp Ser Gly Asn 595 600 605 Glu Ala Gln Phe Asn Thr Asn Ser Thr Tyr Leu Met Gly Trp Leu Arg 610 615 620 Asp Tyr Leu Trp Gly Tyr Ser Ala Gln Leu Ile Asn Gly Tyr Thr Pro 625 630 635 640 Phe Gly Val Asn Ser Leu Ser Val Trp Ala Trp Ile Phe Leu Leu Gly 645 650 655 His Leu Cys Trp Ala Thr Gly Phe Leu Phe Leu Ile Ser Trp Arg Gly 660 665 670 Tyr Trp Gln Glu Leu Ile Glu Thr Leu Val Trp Ala His Gln Arg Thr 675 680 685 Pro Leu Ala Asn Leu Val Thr Trp Lys Asp Lys Pro Val Ala Leu Ser 690 695 700 Ile Val Gln Gly Arg Leu Val Gly Leu Val His Phe Ala Val Gly Tyr 705 710 715 720 Tyr Val Thr Tyr Ala Ala Phe Val Ile Gly Ala Thr Ala Pro Leu Gly 725 730 735 <210> 5 <211> 1083 <212> DNA <213> Acaryuochloris marina <220> <221> CDS <222> (1)...(1080) <400> 5 atg aca aca gtc ttg caa aga cgc gaa agc gcc agc gca tgg gaa aga 48 Met Thr Thr Val Leu Gln Arg Arg Glu Ser Ala Ser Ala Trp Glu Arg 1 5 10 15 ttc tgt agc ttc atc acc agc acc aac aac cgt tta tac atc ggt tgg 96 Phe Cys Ser Phe Ile Thr Ser Thr Asn Asn Arg Leu Tyr Ile Gly Trp 20 25 30 ttc ggc gta ttg atg att cct aca ctt ctc acc gct gta acc tgt ttc 144 Phe Gly Val Leu Met Ile Pro Thr Leu Leu Thr Ala Val Thr Cys Phe 35 40 45 gta atc gcc ttc atc ggc gcc cct ccc gtc gac atc gat gga atc cgt 192 Val Ile Ala Phe Ile Gly Ala Pro Pro Val Asp Ile Asp Gly Ile Arg 50 55 60 gag ccc gtt gct ggt tca cta ctt tat ggc aac aac atc atc act ggt 240 Glu Pro Val Ala Gly Ser Leu Leu Tyr Gly Asn Asn Ile Ile Thr Gly 65 70 75 80 gcc gtt gtt cct tca tcc aac gcc att ggc ctt cac ctg tat ccc atc 288 Ala Val Val Pro Ser Ser Asn Ala Ile Gly Leu His Leu Tyr Pro Ile 85 90 95 tgg gaa gca gct tct ctt gat gag tgg ttg tac aac ggt ggc cct tac 336 Trp Glu Ala Ala Ser Leu Asp Glu Trp Leu Tyr Asn Gly Gly Pro Tyr 100 105 110 cag cta atc att ttc cat tac atg att ggt tgt att tgc tac ctc ggt 384 Gln Leu Ile Ile Phe His Tyr Met Ile Gly Cys Ile Cys Tyr Leu Gly 115 120 125 cgt cag tgg gag tac agc tac cgt cta ggg atg cgt cct tgg att tgt 432 Arg Gln Trp Glu Tyr Ser Tyr Arg Leu Gly Met Arg Pro Trp Ile Cys 130 135 140 gtt gct tac tct gca cct ttg gcc gct acc tac tct gtc ttc ttg atc 480 Val Ala Tyr Ser Ala Pro Leu Ala Ala Thr Tyr Ser Val Phe Leu Ile 145 150 155 160 tat cct cta ggt cag ggc agc ttc tcc gac gga atg cct cta ggc atc 528 Tyr Pro Leu Gly Gln Gly Ser Phe Ser Asp Gly Met Pro Leu Gly Ile 165 170 175 agc gga acc ttc aac ttc atg ttc gtg ttc caa gct gag cac aac atc 576 Ser Gly Thr Phe Asn Phe Met Phe Val Phe Gln Ala Glu His Asn Ile 180 185 190 ctc atg cac ccc ttc cac atg ttt gga gtt gct ggt gta ctg ggt ggt 624 Leu Met His Pro Phe His Met Phe Gly Val Ala Gly Val Leu Gly Gly 195 200 205 tcc tta ttc gcc gcc atg cac ggt tcc ttg gtt agc tcc act cta gtt 672 Ser Leu Phe Ala Ala Met His Gly Ser Leu Val Ser Ser Thr Leu Val 210 215 220 cgt gag acc acc gaa ggt gag tcc gcc aac tac ggt tac aag ttc ggc 720 Arg Glu Thr Thr Glu Gly Glu Ser Ala Asn Tyr Gly Tyr Lys Phe Gly 225 230 235 240 caa gag gaa gag acc tac aac atc gtt gca gcc cac ggc tac ttc ggt 768 Gln Glu Glu Glu Thr Tyr Asn Ile Val Ala Ala His Gly Tyr Phe Gly 245 250 255 cgt ttg atc ttc caa tat gca tct ttc agc aac agc cgt tcc ttg cac 816 Arg Leu Ile Phe Gln Tyr Ala Ser Phe Ser Asn Ser Arg Ser Leu His 260 265 270 ttc ttc ttg ggt gca tgg ccc gtt gtc tgc atc tgg ttg act gca atg 864 Phe Phe Leu Gly Ala Trp Pro Val Val Cys Ile Trp Leu Thr Ala Met 275 280 285 ggc atc agc acc atg gcc ttc aac ttg aat ggt ttc aac ttc aac cac 912 Gly Ile Ser Thr Met Ala Phe Asn Leu Asn Gly Phe Asn Phe Asn His 290 295 300 tcc atc gtt gat tca caa ggt aac gtt gtg aac aca tgg gct gac gta 960 Ser Ile Val Asp Ser Gln Gly Asn Val Val Asn Thr Trp Ala Asp Val 305 310 315 320 cta aac cgc gcc aac ttg ggt ttc gaa gtt atg cac gag cgt aac gct 1008 Leu Asn Arg Ala Asn Leu Gly Phe Glu Val Met His Glu Arg Asn Ala 325 330 335 cat aac ttc ccc tta gac ttg gct gct ggt gag tct gct cct gtt gct 1056 His Asn Phe Pro Leu Asp Leu Ala Ala Gly Glu Ser Ala Pro Val Ala 340 345 350 ctt act gct cct gtc atc aac ggt taa 1083 Leu Thr Ala Pro Val Ile Asn Gly 355 360 <210> 6 <211> 360 <212> PRT <213> Acaryuochloris marina <400> 6 Met Thr Thr Val Leu Gln Arg Arg Glu Ser Ala Ser Ala Trp Glu Arg 1 5 10 15 Phe Cys Ser Phe Ile Thr Ser Thr Asn Asn Arg Leu Tyr Ile Gly Trp 20 25 30 Phe Gly Val Leu Met Ile Pro Thr Leu Leu Thr Ala Val Thr Cys Phe 35 40 45 Val Ile Ala Phe Ile Gly Ala Pro Pro Val Asp Ile Asp Gly Ile Arg 50 55 60 Glu Pro Val Ala Gly Ser Leu Leu Tyr Gly Asn Asn Ile Ile Thr Gly 65 70 75 80 Ala Val Val Pro Ser Ser Asn Ala Ile Gly Leu His Leu Tyr Pro Ile 85 90 95 Trp Glu Ala Ala Ser Leu Asp Glu Trp Leu Tyr Asn Gly Gly Pro Tyr 100 105 110 Gln Leu Ile Ile Phe His Tyr Met Ile Gly Cys Ile Cys Tyr Leu Gly 115 120 125 Arg Gln Trp Glu Tyr Ser Tyr Arg Leu Gly Met Arg Pro Trp Ile Cys 130 135 140 Val Ala Tyr Ser Ala Pro Leu Ala Ala Thr Tyr Ser Val Phe Leu Ile 145 150 155 160 Tyr Pro Leu Gly Gln Gly Ser Phe Ser Asp Gly Met Pro Leu Gly Ile 165 170 175 Ser Gly Thr Phe Asn Phe Met Phe Val Phe Gln Ala Glu His Asn Ile 180 185 190 Leu Met His Pro Phe His Met Phe Gly Val Ala Gly Val Leu Gly Gly 195 200 205 Ser Leu Phe Ala Ala Met His Gly Ser Leu Val Ser Ser Thr Leu Val 210 215 220 Arg Glu Thr Thr Glu Gly Glu Ser Ala Asn Tyr Gly Tyr Lys Phe Gly 225 230 235 240 Gln Glu Glu Glu Thr Tyr Asn Ile Val Ala Ala His Gly Tyr Phe Gly 245 250 255 Arg Leu Ile Phe Gln Tyr Ala Ser Phe Ser Asn Ser Arg Ser Leu His 260 265 270 Phe Phe Leu Gly Ala Trp Pro Val Val Cys Ile Trp Leu Thr Ala Met 275 280 285 Gly Ile Ser Thr Met Ala Phe Asn Leu Asn Gly Phe Asn Phe Asn His 290 295 300 Ser Ile Val Asp Ser Gln Gly Asn Val Val Asn Thr Trp Ala Asp Val 305 310 315 320 Leu Asn Arg Ala Asn Leu Gly Phe Glu Val Met His Glu Arg Asn Ala 325 330 335 His Asn Phe Pro Leu Asp Leu Ala Ala Gly Glu Ser Ala Pro Val Ala 340 345 350 Leu Thr Ala Pro Val Ile Asn Gly 355 360 <210> 7 <211> 1020 <212> DNA <213> Acaryuochloris marina <220> <221> CDS <222> (1)...(1017) <400> 7 tgg ttt tat gtn ctc gat gan tgg ctt aag cgt gat cgg ttc gtc ttt 48 Trp Phe Tyr Xaa Leu Asp Xaa Trp Leu Lys Arg Asp Arg Phe Val Phe 1 5 10 15 att ggt tgg tca ggt atc cta ctt ttc ccc tgt gcg ttt cta tcc atc 96 Ile Gly Trp Ser Gly Ile Leu Leu Phe Pro Cys Ala Phe Leu Ser Ile 20 25 30 ggg gga tgg ttt acc ggc aca act ttc gta act tcc tgg tac acc cac 144 Gly Gly Trp Phe Thr Gly Thr Thr Phe Val Thr Ser Trp Tyr Thr His 35 40 45 ggt ctt gct agc tcc tac cta gaa ggg gct aac ttc ttg acc gtt gct 192 Gly Leu Ala Ser Ser Tyr Leu Glu Gly Ala Asn Phe Leu Thr Val Ala 50 55 60 gta tcc act ccc gcc gac agc ctc ggc cac tcc cta ctt cta ctt tgg 240 Val Ser Thr Pro Ala Asp Ser Leu Gly His Ser Leu Leu Leu Leu Trp 65 70 75 80 gga ccc gaa gct caa ggt gac ttc acc cgc tgg tgt cag ctg ggt gga 288 Gly Pro Glu Ala Gln Gly Asp Phe Thr Arg Trp Cys Gln Leu Gly Gly 85 90 95 ttg tgg aac ttc acc aca tta cat ggt gtc ttc ggc ttg atc ggc ttc 336 Leu Trp Asn Phe Thr Thr Leu His Gly Val Phe Gly Leu Ile Gly Phe 100 105 110 atg ctg cgt caa ttc gag att gcc cgt cta gtc ggc gtg cgt cct tac 384 Met Leu Arg Gln Phe Glu Ile Ala Arg Leu Val Gly Val Arg Pro Tyr 115 120 125 aac gca gtt gcc ttc agc ggt cct atc gcc gtg tat gtt tcc gtc ttt 432 Asn Ala Val Ala Phe Ser Gly Pro Ile Ala Val Tyr Val Ser Val Phe 130 135 140 ttg atg tat cct ttg ggc caa tcc agc tgg ttc ttt gca cct agc tgg 480 Leu Met Tyr Pro Leu Gly Gln Ser Ser Trp Phe Phe Ala Pro Ser Trp 145 150 155 160 ggt gta aca agc atc ttc cga ttc ttg tta ttt gct caa ggt ttc cac 528 Gly Val Thr Ser Ile Phe Arg Phe Leu Leu Phe Ala Gln Gly Phe His 165 170 175 aac cta acc ctc aac ccc ttc cac atg atg ggt gtt gca ggt att ttg 576 Asn Leu Thr Leu Asn Pro Phe His Met Met Gly Val Ala Gly Ile Leu 180 185 190 ggt ggt gcg ctg ttg tgc gcc att cac gga gcc act gtt gag aac acc 624 Gly Gly Ala Leu Leu Cys Ala Ile His Gly Ala Thr Val Glu Asn Thr 195 200 205 ttg ttt gaa gac ggt caa gac gcc aat aca ttt gct gcg ttc act ccg 672 Leu Phe Glu Asp Gly Gln Asp Ala Asn Thr Phe Ala Ala Phe Thr Pro 210 215 220 acc caa gca gaa gag acc tac tcc atg gtc act gct aac cga ttc tgg 720 Thr Gln Ala Glu Glu Thr Tyr Ser Met Val Thr Ala Asn Arg Phe Trp 225 230 235 240 tct cag att ttc ggg att gcc ttt tcc aac aag cgt tgg ttg cac ttt 768 Ser Gln Ile Phe Gly Ile Ala Phe Ser Asn Lys Arg Trp Leu His Phe 245 250 255 ttc atg ttg ttc gtt cct gtg act ggt cta tgg gct tct gcc att ggc 816 Phe Met Leu Phe Val Pro Val Thr Gly Leu Trp Ala Ser Ala Ile Gly 260 265 270 ctc gtg ggt atc gct ctc aac atg cgt gct tat gac ttc gtt agc cag 864 Leu Val Gly Ile Ala Leu Asn Met Arg Ala Tyr Asp Phe Val Ser Gln 275 280 285 gaa atc cgg gct gct gaa gac cct gag ttc gaa acc ttc tac acc aag 912 Glu Ile Arg Ala Ala Glu Asp Pro Glu Phe Glu Thr Phe Tyr Thr Lys 290 295 300 aac att ctc ttg aat gaa ggt ctg cgc gct tgg atg gca ccc caa gac 960 Asn Ile Leu Leu Asn Glu Gly Leu Arg Ala Trp Met Ala Pro Gln Asp 305 310 315 320 caa atc cat gaa aac ttc atc ttc cct gag gag gtt cta cca cgt gga 1008 Gln Ile His Glu Asn Phe Ile Phe Pro Glu Glu Val Leu Pro Arg Gly 325 330 335 aac gcc ctt taa 1020 Asn Ala Leu <210> 8 <211> 339 <212> PRT <213> Acaryuochloris marina <400> 8 Trp Phe Tyr Xaa Leu Asp Xaa Trp Leu Lys Arg Asp Arg Phe Val Phe 1 5 10 15 Ile Gly Trp Ser Gly Ile Leu Leu Phe Pro Cys Ala Phe Leu Ser Ile 20 25 30 Gly Gly Trp Phe Thr Gly Thr Thr Phe Val Thr Ser Trp Tyr Thr His 35 40 45 Gly Leu Ala Ser Ser Tyr Leu Glu Gly Ala Asn Phe Leu Thr Val Ala 50 55 60 Val Ser Thr Pro Ala Asp Ser Leu Gly His Ser Leu Leu Leu Leu Trp 65 70 75 80 Gly Pro Glu Ala Gln Gly Asp Phe Thr Arg Trp Cys Gln Leu Gly Gly 85 90 95 Leu Trp Asn Phe Thr Thr Leu His Gly Val Phe Gly Leu Ile Gly Phe 100 105 110 Met Leu Arg Gln Phe Glu Ile Ala Arg Leu Val Gly Val Arg Pro Tyr 115 120 125 Asn Ala Val Ala Phe Ser Gly Pro Ile Ala Val Tyr Val Ser Val Phe 130 135 140 Leu Met Tyr Pro Leu Gly Gln Ser Ser Trp Phe Phe Ala Pro Ser Trp 145 150 155 160 Gly Val Thr Ser Ile Phe Arg Phe Leu Leu Phe Ala Gln Gly Phe His 165 170 175 Asn Leu Thr Leu Asn Pro Phe His Met Met Gly Val Ala Gly Ile Leu 180 185 190 Gly Gly Ala Leu Leu Cys Ala Ile His Gly Ala Thr Val Glu Asn Thr 195 200 205 Leu Phe Glu Asp Gly Gln Asp Ala Asn Thr Phe Ala Ala Phe Thr Pro 210 215 220 Thr Gln Ala Glu Glu Thr Tyr Ser Met Val Thr Ala Asn Arg Phe Trp 225 230 235 240 Ser Gln Ile Phe Gly Ile Ala Phe Ser Asn Lys Arg Trp Leu His Phe 245 250 255 Phe Met Leu Phe Val Pro Val Thr Gly Leu Trp Ala Ser Ala Ile Gly 260 265 270 Leu Val Gly Ile Ala Leu Asn Met Arg Ala Tyr Asp Phe Val Ser Gln 275 280 285 Glu Ile Arg Ala Ala Glu Asp Pro Glu Phe Glu Thr Phe Tyr Thr Lys 290 295 300 Asn Ile Leu Leu Asn Glu Gly Leu Arg Ala Trp Met Ala Pro Gln Asp 305 310 315 320 Gln Ile His Glu Asn Phe Ile Phe Pro Glu Glu Val Leu Pro Arg Gly 325 330 335 Asn Ala Leu <210> 9 <211> 1368 <212> DNA <213> Acaryuochloris marina <220> <221> CDS <222> (1)...(1368) <400> 9 gta cac aca gtc gtc ctg aat gat cca gga cga ctg cta tct gtg cat 48 Val His Thr Val Val Leu Asn Asp Pro Gly Arg Leu Leu Ser Val His 1 5 10 15 ttg atg cac act gcc ctg gta agc ggc tgg gca ggc tcc atg gca ctc 96 Leu Met His Thr Ala Leu Val Ser Gly Trp Ala Gly Ser Met Ala Leu 20 25 30 tac gag ttg gcc aag tac gat cca agc gat cca gta tta aac ccc atg 144 Tyr Glu Leu Ala Lys Tyr Asp Pro Ser Asp Pro Val Leu Asn Pro Met 35 40 45 tgg cgt caa ggc aca ttc gtt atg cct gtg atg act cgc att ggt gtc 192 Trp Arg Gln Gly Thr Phe Val Met Pro Val Met Thr Arg Ile Gly Val 50 55 60 act cac tct tgg agt ggc tgg aca gtt act ggt gag cct tgg gtt aca 240 Thr His Ser Trp Ser Gly Trp Thr Val Thr Gly Glu Pro Trp Val Thr 65 70 75 80 cag cca gga att tta ggc gca cac cta aac ttc ttt agt tat gag ggt 288 Gln Pro Gly Ile Leu Gly Ala His Leu Asn Phe Phe Ser Tyr Glu Gly 85 90 95 gtc atc ctc atg cat atc ctg gct gca ggt ttg ttt ttc tta gct gcc 336 Val Ile Leu Met His Ile Leu Ala Ala Gly Leu Phe Phe Leu Ala Ala 100 105 110 gtt tgg cac tgg att aac tgg gat tta gac atc tac tat ccc gat ggt 384 Val Trp His Trp Ile Asn Trp Asp Leu Asp Ile Tyr Tyr Pro Asp Gly 115 120 125 tct tct gag ccc gca agt gat tgg ccc aaa att ttc ggt ctt cac cta 432 Ser Ser Glu Pro Ala Ser Asp Trp Pro Lys Ile Phe Gly Leu His Leu 130 135 140 ctg aca tta gga att gtt tgt ttc ggc ttt gga tct ctc cac tta act 480 Leu Thr Leu Gly Ile Val Cys Phe Gly Phe Gly Ser Leu His Leu Thr 145 150 155 160 gga atc tta ggc cca ggc atg tgg gtt tcg gat cct tac gga ctc aca 528 Gly Ile Leu Gly Pro Gly Met Trp Val Ser Asp Pro Tyr Gly Leu Thr 165 170 175 ggt cat gtg caa ggg gtt agc cca gat tgg aga cca ttt gcc ttt gac 576 Gly His Val Gln Gly Val Ser Pro Asp Trp Arg Pro Phe Ala Phe Asp 180 185 190 ccc tac aat ccc aca ggt ttg gtt act cac cat atc tct gca ggg att 624 Pro Tyr Asn Pro Thr Gly Leu Val Thr His His Ile Ser Ala Gly Ile 195 200 205 gcc ctc atc att ggc ggc att ttc cac act gtt tct cgt cct tct gag 672 Ala Leu Ile Ile Gly Gly Ile Phe His Thr Val Ser Arg Pro Ser Glu 210 215 220 cgc ctt tat aac gct ctc agc atg ggt aac gtt gaa acc gta cta tcc 720 Arg Leu Tyr Asn Ala Leu Ser Met Gly Asn Val Glu Thr Val Leu Ser 225 230 235 240 agt tct gtt gcc ttt gta gct gcg gct gca ttc gta atg gtt gga acc 768 Ser Ser Val Ala Phe Val Ala Ala Ala Ala Phe Val Met Val Gly Thr 245 250 255 atg tgg tat gga agt gca act act cca atc gaa ttg ttt ggt cct act 816 Met Trp Tyr Gly Ser Ala Thr Thr Pro Ile Glu Leu Phe Gly Pro Thr 260 265 270 cgt tat cag tgg gat agc ggt tac ttc caa act gaa atc cag cgc cgt 864 Arg Tyr Gln Trp Asp Ser Gly Tyr Phe Gln Thr Glu Ile Gln Arg Arg 275 280 285 gtg cag tct ggt caa aca tgg gat caa atc cct gag aag ctt gtt ttc 912 Val Gln Ser Gly Gln Thr Trp Asp Gln Ile Pro Glu Lys Leu Val Phe 290 295 300 tac gat tac atc ggt aat agc cct gct aaa ggt ggt tta ttc cgc aca 960 Tyr Asp Tyr Ile Gly Asn Ser Pro Ala Lys Gly Gly Leu Phe Arg Thr 305 310 315 320 ggt gct atg aac agt ggt gac ggt att gct aga gca tgg gaa ggt cat 1008 Gly Ala Met Asn Ser Gly Asp Gly Ile Ala Arg Ala Trp Glu Gly His 325 330 335 cct aca ttt acg gat tct gaa ggt cgt gag ttg ttc gtg cga cgc atg 1056 Pro Thr Phe Thr Asp Ser Glu Gly Arg Glu Leu Phe Val Arg Arg Met 340 345 350 ccc aac ttc ttc gaa act ttc cca gtt gtt cta act gac aaa gat ggt 1104 Pro Asn Phe Phe Glu Thr Phe Pro Val Val Leu Thr Asp Lys Asp Gly 355 360 365 gtt gtc cgc gct gac att cct ttc cga cga gct gaa tct cga tac agc 1152 Val Val Arg Ala Asp Ile Pro Phe Arg Arg Ala Glu Ser Arg Tyr Ser 370 375 380 ttt gag cag aaa ggt gtt tca gtc tcc ttt gaa ggt ggt act cta aac 1200 Phe Glu Gln Lys Gly Val Ser Val Ser Phe Glu Gly Gly Thr Leu Asn 385 390 395 400 ggt caa acc ttc acc gat gct cct tct gtt aag aag tat gct cgt aaa 1248 Gly Gln Thr Phe Thr Asp Ala Pro Ser Val Lys Lys Tyr Ala Arg Lys 405 410 415 gct cag ctt ggt gaa cct ttt gag ttc gat cgt gaa acg ctt ggt tct 1296 Ala Gln Leu Gly Glu Pro Phe Glu Phe Asp Arg Glu Thr Leu Gly Ser 420 425 430 gat ggt gtt ttc cga acc agc act cgt ggt tgg ttt gca ttc agc cac 1344 Asp Gly Val Phe Arg Thr Ser Thr Arg Gly Trp Phe Ala Phe Ser His 435 440 445 tct tgc tat gca cta ctc ttc ttc 1368 Ser Cys Tyr Ala Leu Leu Phe Phe 450 455 <210> 10 <211> 456 <212> PRT <213> Acaryuochloris marina <400> 10 Val His Thr Val Val Leu Asn Asp Pro Gly Arg Leu Leu Ser Val His 1 5 10 15 Leu Met His Thr Ala Leu Val Ser Gly Trp Ala Gly Ser Met Ala Leu 20 25 30 Tyr Glu Leu Ala Lys Tyr Asp Pro Ser Asp Pro Val Leu Asn Pro Met 35 40 45 Trp Arg Gln Gly Thr Phe Val Met Pro Val Met Thr Arg Ile Gly Val 50 55 60 Thr His Ser Trp Ser Gly Trp Thr Val Thr Gly Glu Pro Trp Val Thr 65 70 75 80 Gln Pro Gly Ile Leu Gly Ala His Leu Asn Phe Phe Ser Tyr Glu Gly 85 90 95 Val Ile Leu Met His Ile Leu Ala Ala Gly Leu Phe Phe Leu Ala Ala 100 105 110 Val Trp His Trp Ile Asn Trp Asp Leu Asp Ile Tyr Tyr Pro Asp Gly 115 120 125 Ser Ser Glu Pro Ala Ser Asp Trp Pro Lys Ile Phe Gly Leu His Leu 130 135 140 Leu Thr Leu Gly Ile Val Cys Phe Gly Phe Gly Ser Leu His Leu Thr 145 150 155 160 Gly Ile Leu Gly Pro Gly Met Trp Val Ser Asp Pro Tyr Gly Leu Thr 165 170 175 Gly His Val Gln Gly Val Ser Pro Asp Trp Arg Pro Phe Ala Phe Asp 180 185 190 Pro Tyr Asn Pro Thr Gly Leu Val Thr His His Ile Ser Ala Gly Ile 195 200 205 Ala Leu Ile Ile Gly Gly Ile Phe His Thr Val Ser Arg Pro Ser Glu 210 215 220 Arg Leu Tyr Asn Ala Leu Ser Met Gly Asn Val Glu Thr Val Leu Ser 225 230 235 240 Ser Ser Val Ala Phe Val Ala Ala Ala Ala Phe Val Met Val Gly Thr 245 250 255 Met Trp Tyr Gly Ser Ala Thr Thr Pro Ile Glu Leu Phe Gly Pro Thr 260 265 270 Arg Tyr Gln Trp Asp Ser Gly Tyr Phe Gln Thr Glu Ile Gln Arg Arg 275 280 285 Val Gln Ser Gly Gln Thr Trp Asp Gln Ile Pro Glu Lys Leu Val Phe 290 295 300 Tyr Asp Tyr Ile Gly Asn Ser Pro Ala Lys Gly Gly Leu Phe Arg Thr 305 310 315 320 Gly Ala Met Asn Ser Gly Asp Gly Ile Ala Arg Ala Trp Glu Gly His 325 330 335 Pro Thr Phe Thr Asp Ser Glu Gly Arg Glu Leu Phe Val Arg Arg Met 340 345 350 Pro Asn Phe Phe Glu Thr Phe Pro Val Val Leu Thr Asp Lys Asp Gly 355 360 365 Val Val Arg Ala Asp Ile Pro Phe Arg Arg Ala Glu Ser Arg Tyr Ser 370 375 380 Phe Glu Gln Lys Gly Val Ser Val Ser Phe Glu Gly Gly Thr Leu Asn 385 390 395 400 Gly Gln Thr Phe Thr Asp Ala Pro Ser Val Lys Lys Tyr Ala Arg Lys 405 410 415 Ala Gln Leu Gly Glu Pro Phe Glu Phe Asp Arg Glu Thr Leu Gly Ser 420 425 430 Asp Gly Val Phe Arg Thr Ser Thr Arg Gly Trp Phe Ala Phe Ser His 435 440 445 Ser Cys Tyr Ala Leu Leu Phe Phe 450 455 <210> 11 <211> 1339 <212> DNA <213> Acaryuochloris marina <220> <221> CDS <222> (1)...(1339) <400> 11 atg aaa act tca tct tcc ctg agg agg ttc tac cac gtg gaa acg ccc 48 Met Lys Thr Ser Ser Ser Leu Arg Arg Phe Tyr His Val Glu Thr Pro 1 5 10 15 ttt aat ccg tct gcg gct ggt tat gac cgc gca acc act ggc tat ggc 96 Phe Asn Pro Ser Ala Ala Gly Tyr Asp Arg Ala Thr Thr Gly Tyr Gly 20 25 30 tgg tgg gct gga aat gca cga tta act gat cta tct ggt cag cta act 144 Trp Trp Ala Gly Asn Ala Arg Leu Thr Asp Leu Ser Gly Gln Leu Thr 35 40 45 ggt gcc cac att gcc cat gct gga atg att acc ttc tgg gct ggt gca 192 Gly Ala His Ile Ala His Ala Gly Met Ile Thr Phe Trp Ala Gly Ala 50 55 60 atg act ttg ttt gaa gtc tct cac ttc att cct gaa aag cct atg tac 240 Met Thr Leu Phe Glu Val Ser His Phe Ile Pro Glu Lys Pro Met Tyr 65 70 75 80 gag caa ggc agc atc ctg ctt gct cac cta gcc gct gaa ggt ttt ggt 288 Glu Gln Gly Ser Ile Leu Leu Ala His Leu Ala Ala Glu Gly Phe Gly 85 90 95 gtt gga cct ggt ggt gaa gtt att agc act tat cct tat ttt gtg att 336 Val Gly Pro Gly Gly Glu Val Ile Ser Thr Tyr Pro Tyr Phe Val Ile 100 105 110 ggt gca att cac cta att gct tct gct gtc ctc ggt ttt ggt ggc ctt 384 Gly Ala Ile His Leu Ile Ala Ser Ala Val Leu Gly Phe Gly Gly Leu 115 120 125 tac cac aca ttc aga ggc cct gct aag ttt gag gat tac tct gat tgg 432 Tyr His Thr Phe Arg Gly Pro Ala Lys Phe Glu Asp Tyr Ser Asp Trp 130 135 140 tgg ggg tat gac tgg gaa gac aaa gaa aag atg atg cag atc ctg ggg 480 Trp Gly Tyr Asp Trp Glu Asp Lys Glu Lys Met Met Gln Ile Leu Gly 145 150 155 160 att cac tta atc ttc ctc gga att ggt gct ctt gct ttt gct gca aaa 528 Ile His Leu Ile Phe Leu Gly Ile Gly Ala Leu Ala Phe Ala Ala Lys 165 170 175 gcc atg ttc ttt ggc ggt ctt tat gat ccc tgg gct cct ggt ggt gga 576 Ala Met Phe Phe Gly Gly Leu Tyr Asp Pro Trp Ala Pro Gly Gly Gly 180 185 190 aat gtt cgc ctg att act aac cca act tgg aac tta ggt act ttc ctg 624 Asn Val Arg Leu Ile Thr Asn Pro Thr Trp Asn Leu Gly Thr Phe Leu 195 200 205 ggt tac att acc cga tct ccc tgg gga gaa ggt ggc tgg atc gtt agt 672 Gly Tyr Ile Thr Arg Ser Pro Trp Gly Glu Gly Gly Trp Ile Val Ser 210 215 220 gtt aac aac cta gaa gac gtt gta ggt ggt cac ctt ctc gta ggt gtt 720 Val Asn Asn Leu Glu Asp Val Val Gly Gly His Leu Leu Val Gly Val 225 230 235 240 cac tac atc ttc ggt ggc gtt ttc cac att ctt gtt aag cct tgg ggt 768 His Tyr Ile Phe Gly Gly Val Phe His Ile Leu Val Lys Pro Trp Gly 245 250 255 tgg gtt cgc cga gcc tat gtc tgg tct ggt gaa gcc tat ctc tcc tac 816 Trp Val Arg Arg Ala Tyr Val Trp Ser Gly Glu Ala Tyr Leu Ser Tyr 260 265 270 agc ttg ggt gcc ctt tac atg tgt ggc atg att gct gtg ggt tat gtc 864 Ser Leu Gly Ala Leu Tyr Met Cys Gly Met Ile Ala Val Gly Tyr Val 275 280 285 tgg ttt aac aac act gtt tac ccc agt gaa ttc tac ggt cct act gct 912 Trp Phe Asn Asn Thr Val Tyr Pro Ser Glu Phe Tyr Gly Pro Thr Ala 290 295 300 gct gaa gct tct cag gct cag gca atg acc ttt ttg att cgt gac caa 960 Ala Glu Ala Ser Gln Ala Gln Ala Met Thr Phe Leu Ile Arg Asp Gln 305 310 315 320 agg tta ggg gcg aac atc gct tct gcc caa ggt cct aca ggt ctt ggt 1008 Arg Leu Gly Ala Asn Ile Ala Ser Ala Gln Gly Pro Thr Gly Leu Gly 325 330 335 aag tat ctg atg cgt tct cct tct ggt gag atc atc ttc ggt ggt gag 1056 Lys Tyr Leu Met Arg Ser Pro Ser Gly Glu Ile Ile Phe Gly Gly Glu 340 345 350 acc atg cgt ttc tgg gat ttc cgt gga cct tgg ttg gag ccc ctt cgt 1104 Thr Met Arg Phe Trp Asp Phe Arg Gly Pro Trp Leu Glu Pro Leu Arg 355 360 365 gga ccc aac ggt ttg gac ctc aac aag ctc aga aat gat att cag cct 1152 Gly Pro Asn Gly Leu Asp Leu Asn Lys Leu Arg Asn Asp Ile Gln Pro 370 375 380 tgg caa gct cgt cgt gcg gct gag tac atg act cat gct cct ttg ggt 1200 Trp Gln Ala Arg Arg Ala Ala Glu Tyr Met Thr His Ala Pro Leu Gly 385 390 395 400 gca ttg aac tct gta ggt ggt gtg gca act gag atc aac tcg gtg aac 1248 Ala Leu Asn Ser Val Gly Gly Val Ala Thr Glu Ile Asn Ser Val Asn 405 410 415 tat gtt tct ccc cgt tct tgg tta tcc act tca cat ttc tgc ctt gcg 1296 Tyr Val Ser Pro Arg Ser Trp Leu Ser Thr Ser His Phe Cys Leu Ala 420 425 430 ttc ttc ttc ttt gtt ggc cat att tgg cac tcc ggc cgc gcc c 1339 Phe Phe Phe Phe Val Gly His Ile Trp His Ser Gly Arg Ala 435 440 445 <210> 12 <211> 446 <212> PRT <213> Acaryuochloris marina <400> 12 Met Lys Thr Ser Ser Ser Leu Arg Arg Phe Tyr His Val Glu Thr Pro 1 5 10 15 Phe Asn Pro Ser Ala Ala Gly Tyr Asp Arg Ala Thr Thr Gly Tyr Gly 20 25 30 Trp Trp Ala Gly Asn Ala Arg Leu Thr Asp Leu Ser Gly Gln Leu Thr 35 40 45 Gly Ala His Ile Ala His Ala Gly Met Ile Thr Phe Trp Ala Gly Ala 50 55 60 Met Thr Leu Phe Glu Val Ser His Phe Ile Pro Glu Lys Pro Met Tyr 65 70 75 80 Glu Gln Gly Ser Ile Leu Leu Ala His Leu Ala Ala Glu Gly Phe Gly 85 90 95 Val Gly Pro Gly Gly Glu Val Ile Ser Thr Tyr Pro Tyr Phe Val Ile 100 105 110 Gly Ala Ile His Leu Ile Ala Ser Ala Val Leu Gly Phe Gly Gly Leu 115 120 125 Tyr His Thr Phe Arg Gly Pro Ala Lys Phe Glu Asp Tyr Ser Asp Trp 130 135 140 Trp Gly Tyr Asp Trp Glu Asp Lys Glu Lys Met Met Gln Ile Leu Gly 145 150 155 160 Ile His Leu Ile Phe Leu Gly Ile Gly Ala Leu Ala Phe Ala Ala Lys 165 170 175 Ala Met Phe Phe Gly Gly Leu Tyr Asp Pro Trp Ala Pro Gly Gly Gly 180 185 190 Asn Val Arg Leu Ile Thr Asn Pro Thr Trp Asn Leu Gly Thr Phe Leu 195 200 205 Gly Tyr Ile Thr Arg Ser Pro Trp Gly Glu Gly Gly Trp Ile Val Ser 210 215 220 Val Asn Asn Leu Glu Asp Val Val Gly Gly His Leu Leu Val Gly Val 225 230 235 240 His Tyr Ile Phe Gly Gly Val Phe His Ile Leu Val Lys Pro Trp Gly 245 250 255 Trp Val Arg Arg Ala Tyr Val Trp Ser Gly Glu Ala Tyr Leu Ser Tyr 260 265 270 Ser Leu Gly Ala Leu Tyr Met Cys Gly Met Ile Ala Val Gly Tyr Val 275 280 285 Trp Phe Asn Asn Thr Val Tyr Pro Ser Glu Phe Tyr Gly Pro Thr Ala 290 295 300 Ala Glu Ala Ser Gln Ala Gln Ala Met Thr Phe Leu Ile Arg Asp Gln 305 310 315 320 Arg Leu Gly Ala Asn Ile Ala Ser Ala Gln Gly Pro Thr Gly Leu Gly 325 330 335 Lys Tyr Leu Met Arg Ser Pro Ser Gly Glu Ile Ile Phe Gly Gly Glu 340 345 350 Thr Met Arg Phe Trp Asp Phe Arg Gly Pro Trp Leu Glu Pro Leu Arg 355 360 365 Gly Pro Asn Gly Leu Asp Leu Asn Lys Leu Arg Asn Asp Ile Gln Pro 370 375 380 Trp Gln Ala Arg Arg Ala Ala Glu Tyr Met Thr His Ala Pro Leu Gly 385 390 395 400 Ala Leu Asn Ser Val Gly Gly Val Ala Thr Glu Ile Asn Ser Val Asn 405 410 415 Tyr Val Ser Pro Arg Ser Trp Leu Ser Thr Ser His Phe Cys Leu Ala 420 425 430 Phe Phe Phe Phe Val Gly His Ile Trp His Ser Gly Arg Ala 435 440 445 <210> 13 <211> 22 <212> DNA <213> Artificial Sequence <400> 13 ccaccachtg gatttggaay ct <210> 14 <211> 20 <212> DNA <213> Artificial Sequence <400> 14 gcnacnggyt trtcyttcca <210> 15 <211> 26 <212> DNA <213> Artificial Sequence <400> 15 gayathgayg giathmgiga rccigt <210> 16 <211> 26 <212> DNA <213> Artificial Sequence <400> 16 gggaagttgt gggcattrcg ytcgtg <210> 17 <211> 24 <212> DNA <213> Artificial Sequence <400> 17 tggttygayg tnctcgayga ytgg <210> 18 <211> 20 <212> DNA <213> Artificial Sequence <400> 18 ccrtgccaga krtgrccraa <210> 19 <211> 20 <212> DNA <213> Artificial Sequence <400> 19 atgggactac cytggtaycg <210> 20 <211> 20 <212> DNA <213> Artificial Sequence <400> 20 ccrtgccaga krtgrccraa[Sequence List] SEQUENCELISTING <110> MARINEBIOTECHNOLOGY INSTITUTECO., LTD. <120> <130> P99-0652 <160> 20 <170> PatentIn version 2.0 <210> 1 <211> 2262 <212> DNA <213> Acaryuochloris marina <220> <221> CDS <222> (1) ... (2259) <400> 1 atg aca act agc cca ggt ggg cca gag aca aaa ggc aga aca gct gaa 48 Met Thr Thr Ser Pro Gly Gly Pro Glu Thr Lys Gly Arg Thr Ala Glu 1 5 10 15 gtt gac atc aac cca gtt agc gct tct tta gaa gtc gcg ggt aag ccg 96 Val Asp Ile Asn Pro Val Ser Ala Ser Leu Glu Val Ala Gly Lys Pro 20 25 30 ggt cac ttt aat aaa agt ctg tcg aaa ggt ccc caa acc acc act tgg 144 Gly His Phe Asn Lys Ser Leu Ser Lys Gly Pro Gln Thr Thr Thr Trp 35 40 45 att tgg aac cta cat gct cta gcc cat gat ttt gat aca caa aca aac 192 Ile Trp Asn Leu His Ala Leu Ala His Asp Phe Asp Thr Gln Thr Asn 50 55 60 gac cta gaa gaa att tcc cgc aaa att ttc agt gcc cat ttt gga cac 240 Asp Leu Glu Glu Ile Ser Arg Lys Ile Phe Ser Ala His Phe Gly His 65 70 75 80 tta tcc atc att ttt gta tgg atc agc ggg atg atc ttc cat gct gct 288 Leu Ser Ile Ile Phe Val Trp Ile Ser Gly Met Ile Phe His Ala Ala 85 90 95 cgt ttt tct aac tac tac gct tgg tta gcc gat ccg ctc ggc aac aaa 336 Arg Phe Ser Asn Tyr Tyr Ala Trp Leu Ala Asp Pro Leu Gly Asn Lys 100 105 110 ccc agt gct cac gta gtt tgg ccc att gtt ggc caa gat att tta aat 384 Pro Ser Ala His Val Val Trp Pro Ile Val Gly Gln Asp Ile Leu Asn 115 120 125 gca gat gtt ggt aat gga ttc cgc gga atc caa att acc tct ggc ctt 432 Ala Asp Val Gly Asn Gly Phe Arg Gly Ile Gln Ile Thr Ser Gly Leu 130 135 140 ttc cat att tta cgc ggg gct gga atg act gac ccc ggt gaa ctt tat 480 Phe His Ile Leu Arg Gly Ala Gly Met Thr Asp Pro Gly Glu Leu Tyr 145 150 155 160 tca gca gcc att ggt gcc ctt gtt gca gcg gtt gta atg atg tac gcc 528 Ser Ala Ala Ile Gly Ala Leu Val Ala Ala Val Val Met Met Tyr Ala 165 170 175 ggg tat tat tat cac tac cac aag aaa gca cct aaa ttg gag tgg ttc caa 576 Gly Tyr Tyr His Tyr His Lys Lys Ala Pro Lys Leu Glu Trp Phe Gln 180 185 190 aac gcc gaa tca acg atg acc cac cat ctc atc gtt ctt tta ggg ctt 624 Asn Ala Glu Ser Thr Met Thr His His Leu Ile Val Leu Leu Gly Leu 195 200 205 ggg aac ctt gcc tgg aca ggt cac ctt atc cat gtt tct ctg cca gtc 672 Gly Asn Leu Ala Trp Thr Gly His Leu Ile His Val Ser Leu Pro Val 210 215 220 aat aag ctt ctt gat tct ggt gta gcc cca caa gat ata cca atc ccc 720 Asn Lys Leu Leu Asp Ser Gly Val Ala Pro Gln Asp Ile Pro Ile Pro 225 230 235 240 cat gaa ttt ctt ttt gat aat gga ttt atg gcg gat tta tat ccc agc 768 His Glu Phe Leu Phe Asp Asn Gly Phe Met Ala Asp Leu Tyr Pro Ser 245 250 255 ttt gct cag gga tta atg cct tac ttc acc cta aat tgg ggt gct tat 816 Phe Ala Gln Gly Leu Met Pro Tyr Phe Thr Leu Asn Trp Gly Ala Tyr 260 265 270 tct gac ttc ctt acc ttc aaa gga ggg ctt gac cca aca acg ggt ggc 864 Ser Asp Phe Leu Thr Phe Lys Gly Gly Leu Asp Pro Thr Thr Gly Gly 275 275 280 285 cta tgg atg aca gat ata gcc cat cac cat ttg gca ttg gca gta atg 912 Leu Trp Met Thr Asp Ile Ala His His Leu Ala Leu Ala Val Met 290 295 300 tac atc att gct ggt cat atg tac cga acc aac tgg ggt att ggg cac 960 Tyr Ile Ile Ala Gly His Met Tyr Arg Thr Asn Trp Gly Ile Gly His 305 310 315 320 agt atg aag gaa atc atg gaa tct cat aaa ggt ccc ttt act ggc gaa 1008 Ser Met Lys Glu Ile Met Glu Ser His Lys Gly Pr o Phe Thr Gly Glu 325 330 335 ggc cat aaa ggt cta tat gag gtg ctg aca act tct tgg cat gcc cag 1056 Gly His Lys Gly Leu Tyr Glu Val Leu Thr Thr Ser Trp His Ala Gln 340 345 350 350 cta gca att aac cta gcc aca tgg ggt tct ttc agc atc att gtt gcc 1104 Leu Ala Ile Asn Leu Ala Thr Trp Gly Ser Phe Ser Ile Ile Val Ala 355 360 365 cac cac atg tac gca atg cct cct tat cct tac ttg gca aca gat tac 1152 His His Met Tyr Ala Met Pro Pro Tyr Pro Tyr Leu Ala Thr Asp Tyr 370 375 380 ggc acg cag ctg aat ctg ttc gtc cat cat atg tgg att gga ggt ttc 1200 Gly Thr Gln Leu Asn Leu Phe Val His His Met Trp Ile Gly Gly Phe 385 390 395 400 ttg att gtt ggt ggt gct gcc cac gca gct att ttc atg gtt cgg gat 1248 Leu Ile Val Gly Gly Ala Ala His Ala Ala Ile Phe Met Val Arg Asp 405 410 415 tac gat cca gct gtg aac caa aac aat gtt ctt gat cgg atg ctt cgt 1296 Tyr Asp Pro Ala Val Asn Gln Asn Asn Val Leu Asp Arg Met Leu Arg 420 425 430 cac cga gat acg atc att tcc cat cta aac tgg gtc tgt att ttc ctt 1344 His Arg Asp Thr Ile Ile Ser His Leu Asn Trp Val Cys Ile Phe Leu 435 440 445 ggg ttc cat tct ttt ggc ttg tat atc cat aac gac aat atg cgt tct 1392 Gly Phe His Ser Phe Gly Leu Tyr Ile His Asn Asp Asn Met Arg Ser 450 455 460 ttg ggt cgg cct caa gat atg ttc tcc gac act gct atc caa ctg caa 1440 Leu Gly Arg Pro Gln Asp Met Phe Ser Asp Thr Ala Ile Gln Leu Gln 465 470 475 475 480 cct att ttt tct caa tgg gtt cag aac tta caa gca aac gtt gga 1488 Pro Ile Phe Ser Gln Trp Val Gln Asn Leu Gln Ala Asn Val Ala Gly 485 490 495 aca att cgg gct ccc ttg gca gaa ggt gca tca agc tta gct tgg ggt 1536 Thr Ile Arg Ala Pro Leu Ala Glu Gly Ala Ser Ser Leu Ala Trp Gly 500 505 510 ggc gat cct ttg ttt gtt ggc gga aaa gtt gca atg caa cat gtt tcc 1584 Gly Asp Pro Leu Phe Val Gly Gly Lys Val Ala Met Gln His Val Ser 515 520 525 tta gga acc gcc gat ttc atg atc cac cac att cac gcc ttc cag att 1632 Leu Gly Thr Ala Asp Phe Met Ile His His Ile His Ala Phe Gln Ile 530 535 540 cac gtt act gtt ctc atc cta atc aag ggt gtt ctc tac gct cgt agc 1680 His Val Thr Val Leu Ile Leu Ile Lys Gly Val Leu Tyr Ala Arg Ser 545 550 555 560 tct cgt cta att cca gac aaa gct aac ttg ggc ttc aga ttc cct tgc 1728 Ser Arg Leu Ile Pro Asp Lys Ala Asn Leu Gly Phe Arg Phe Cys 565 570 575 gac gga cca ggt cgg ggt ggt act tgc caa tct tct ggt tgg gac cat 1776 Asp Gly Pro Gly Arg Gly Gly Thr Cys Gln Ser Ser Gly Trp Asp His 580 585 590 atc ttc ttg ggt ctg ttc tgg atg tac ag tgc atc tca att gtc aat 1824 Ile Phe Leu Gly Leu Phe Trp Met Tyr Asn Cys Ile Ser Ile Val Asn 595 600 605 ttc cac ttc ttc tgg aaa atg cag tcg gat gtt tgg ggt gcc gca aat 1872 Phe His Phe Mp The Gln Ser Asp Val Trp Gly Ala Ala Asn 610 615 620 gct aat ggc ggc gtt aat tac cta aca gct ggc aac tgg gca cag tct 1920 Ala Asn Gly Gly Val Asn Tyr Leu Thr Ala Gly Asn Trp Ala Gln Ser 625 630 635 640 tca atc act att aat ggt tgg ttg cga gat ttc tta tgg gcc caa tcg 1968 Ser Ile Thr Ile Asn Gly Trp Leu Arg Asp Phe Leu Trp Ala Gln Ser 645 650 655 gtt cag gtg att aac tcc tat ggt tct gcc cta tct gcc c gga att 2016 Val Gln Val Ile Asn Ser Tyr Gly Ser Ala Leu Ser Ala Tyr Gly Ile 660 665 670 tta ttc cta ggt gcc cac ttc atc tgg gct ttc agc ctg atg ttc ttg 2064 Leu Phe Leu Gly Ala His Phe Ile Trp Phe Ser Leu Met Phe Leu 675 680 685 ttc agt ggt cgt ggc tat tgg caa gag ctg atc gag tct att gtt tgg 2112 Phe Ser Gly Arg Gly Tyr Trp Gln Glu Leu Ile Glu Ser Ile Val Trp 690 695 700 gct cac agc aaa cg aag att gct cca gcc att cag cca cgc gct atg 2160 Ala His Ser Lys Leu Lys Ile Ala Pro Ala Ile Gln Pro Arg Ala Met 705 710 715 720 agt att act caa ggt cgt gca gtt gga ctg ggc cat tac ctc cta ggt 2208 Ser Ile Thr Gln Gly Arg Ala Val Gly Leu Gly His Tyr Leu Leu Gly 725 730 735 gga att gtg acc tct tgg tca ttc tac cta gct cga att ctc gca tta 2256 Gly Ile Val Thr Ser Trp Ser Phe Tyr Leu Ala Arg Ile Leu Ala Leu 740 745 750 gga tag 2262 Gly <210> 2 <211> 753 <212> PRT <213> Acaryuochloris marina <400> 2 Met Thr Ser Pro Gly Gly Pro Glu Thr Lys Gly Arg Thr Ala Glu 1 5 10 15 Val Asp Ile Asn Pro Val Ser Ala Ser Leu Glu Val Ala Gly Lys Pro 20 25 30 Gly His Phe Asn Lys Ser Leu Ser Lys Gly Pro Gln Thr Thr Thr Trp 35 40 45 Ile Trp Asn Leu His Ala Leu Ala His Asp Phe Asp Thr Gln Thr Asn 50 55 60 Asp Leu Glu Glu Ile Ser Arg Lys Ile Phe Ser Ala His Phe Gly His 65 70 75 80 Leu Ser Ile Ile Phe Val Trp Ile Ser Gly Met Ile Phe His Ala Ala 85 90 95 Arg Phe Ser Asn Tyr Tyr Ala Trp Leu Ala Asp Pro Leu Gly Asn Lys 100 105 110 Pro Ser Ala His Val Val Trp Pro Ile Val Gly Gln Asp Ile Leu Asn 115 120 125 Ala Asp Val Gly Asn Gly Phe Arg Gly Ile Gln Ile Thr Ser Gly Leu 130 135 140 Phe His Ile Leu Arg Gly Ala Gly Met Thr Asp Pro Gly Glu Leu Tyr 145 150 155 160 Ser Ala Ala Ile Gly Ala Leu Val Ala Ala Val Val Met Met Tyr Ala 165 170 175 Gly Tyr Tyr His Tyr His Lys Lys Ala Pro Lys Leu Glu Trp Phe Gln 180 185 190 Asn Ala Glu Ser Thr Met Thr His His Leu Ile Val Leu Leu Gly Leu 195 200 205 Gly Asn Leu A la Trp Thr Gly His Leu Ile His Val Ser Leu Pro Val 210 215 220 Asn Lys Leu Leu Asp Ser Gly Val Ala Pro Gln Asp Ile Pro Ile Pro 225 230 235 240 His Glu Phe Leu Phe Asp Asn Gly Phe Met Ala Asp Leu Tyr Pro Ser 245 250 255 Phe Ala Gln Gly Leu Met Pro Tyr Phe Thr Leu Asn Trp Gly Ala Tyr 260 265 270 Ser Asp Phe Leu Thr Phe Lys Gly Gly Leu Asp Pro Thr Thr Gly Gly 275 280 285 Leu Trp Met Thr Asp Ile Ala His His His Leu Ala Leu Ala Val Met 290 295 300 Tyr Ile Ile Ala Gly His Met Tyr Arg Thr Asn Trp Gly Ile Gly His 305 310 315 320 Ser Met Lys Glu Ile Met Glu Ser His Lys Gly Pro Phe Thr Gly Glu 325 330 335 Gly His Lys Gly Leu Tyr Glu Val Leu Thr Thr Ser Trp His Ala Gln 340 345 350 Leu Ala Ile Asn Leu Ala Thr Trp Gly Ser Phe Ser Ile Ile Val Ala 355 360 365 His His Met Tyr Ala Met Pro Pro Tyr Pro Tyr Leu Ala Thr Asp Tyr 370 375 380 Gly Thr Gln Leu Asn Leu Phe Val His His Met Trp Ile Gly Gly Ply 385 390 395 400 Leu Ile Val Gly Gly Ala Ala Ala His Ala Ala Ile Phe Met Val Arg Asp 405 410 415 Tyr Asp Pro A la Val Asn Gln Asn Asn Val Leu Asp Arg Met Leu Arg 420 425 430 His Arg Asp Thr Ile Ile Ser His Leu Asn Trp Val Cys Ile Phe Leu 435 440 445 Gly Phe His Ser Phe Gly Leu Tyr Ile His Asn Asp Asn Met Arg Ser 450 455 460 Leu Gly Arg Pro Gln Asp Met Phe Ser Asp Thr Ala Ile Gln Leu Gln 465 470 475 480 Pro Ile Phe Ser Gln Trp Val Gln Asn Leu Gln Ala Asn Val Ala Gly 485 490 490 495 Thr Ile Arg Ala Pro Leu Ala Glu Gly Ala Ser Ser Leu Ala Trp Gly 500 505 510 510 Gly Asp Pro Leu Phe Val Gly Gly Lys Val Ala Met Gln His Val Ser 515 520 525 Leu Gly Thr Ala Asp Phe Met Ile His His Ile His Ala Phe Gln Ile 530 535 540 His Val Thr Val Leu Ile Leu Ile Lys Gly Val Leu Tyr Ala Arg Ser 545 550 555 560 Ser Arg Leu Ile Pro Asp Lys Ala Asn Leu Gly Phe Arg Phe Pro Cys 565 570 575 Asp Gly Pro Gly Arg Gly Gly Thr Cys Gln Ser Ser Gly Trp Asp His 580 585 590 Ile Phe Leu Gly Leu Phe Trp Met Tyr Asn Cys Ile Ser Ile Val Asn 595 600 605 Phe His Phe Phe Trp Lys Met Gln Ser Asp Val Trp Gly Ala Ala Asn 610 615 620 620 Ala Asn Gly GlyVal Asn Tyr Leu Thr Ala Gly Asn Trp Ala Gln Ser 625 630 635 640 Ser Ile Thr Ile Asn Gly Trp Leu Arg Asp Phe Leu Trp Ala Gln Ser 645 650 655 Val Gln Val Ile Asn Ser Tyr Gly Ser Ala Leu Ser Ala Tyr Gly Ile 660 665 670 Leu Phe Leu Gly Ala His Phe Ile Trp Ala Phe Ser Leu Met Phe Leu 675 680 685 Phe Ser Gly Arg Gly Tyr Trp Gln Glu Leu Ile Glu Ser Ile Val Trp 690 695 700 Ala His Ser Lys Leu Lys Ile Ala Pro Ala Ile Gln Pro Arg Ala Met 705 710 715 715 720 Ser Ile Thr Gln Gly Arg Ala Val Gly Leu Gly His Tyr Leu Leu Gly 725 730 735 Gly Ile Val Thr Ser Trp Ser Phe Tyr Leu Ala Arg Ile Leu Ala Leu 740 745 750 Gly <210> 3 <211> 2211 <212> DNA <213> Acaryuochloris marina <220> <221> CDS <222> (1) ... (2208) <400> 3 atg gct act aaa ttt cct agt ttt agc caa gac ctt gcc caa gat cca 48 Met Ala Thr Lys Phe Pro Ser Phe Ser Gln Asp Leu Ala Gln Asp Pro 1 5 10 15 aca aca cgt cgg atc tgg tac gga att gcc aca gtt cat gat ttt gag 96 Thr Thr Arg Arg Ile Trp Tyr Gly Ile Ala Thr Val His Asp Phe Glu 20 25 30 act cat gac gga atg acg gag gaa aat ctt tat caa aag att ttc gcg 144 Thr His Asp Gly Met Thr Glu Glu Asn Leu Tyr Gln Lys Ile Phe Ala 35 40 45 act cac ttc ggt cat ctc tct att atc ttt cta tgg tct gct ggc cat 192 Thr His Phe Gly His Leu Ser Ile Ile Phe Leu Trp Ser Ala Gly His 50 55 60 ctt ttc cat gtc gcc tgg caa ggc aac ttt gaa cag tgg atc caa gat 240 Leu Phe His Val Ala Trp Gln Gly Asn Phe Glu Gln Trp Ile Gln Asp 65 70 75 80 cca cta acc atc cgt ccc atc gcc cat gcg att tgg gac ccc cat ttg 288 Pro Leu Thr Ile Arg Pro Ile Ala His Ala Ile Trp Asp Pro His Leu 85 90 95 ggt gat gct gca act cag gcc ttc acc caa gct ggc gct tct ggt cca 336 Gly Asp Ala Ala Thr Gln Ala Phe Thr Gln Ala Gly Ala Ser Gly Pro 100 105 110 gtt gac ctt tgt tat tct ggc ctc tac caa tgg tgg tac acc att ggt 384 Val Asp Leu Cys Tyr Ser Gly Leu Tyr Gln Trp Trp Tyr Thr Ile Gly 115 120 125 atg cgt acc aat ggt gat tta tac att ggt tct gtt ttc ttg atg att 432 Met Arg Thr Asn Gly Asp Leu Tyr Ile Gly Ser Val Phe Leu Met Ile 130 135 140 gtc gct gca gtc atg ttg ttt gca ggt tgg ctt cat cta caa ccc aaa 480 Val Ala Ala Val Met Leu Phe Ala Gly Trp Leu His Leu Gln Pro Lys 145 150 155 160 ttt cga ccc agc tta gcc tgg ttt aga gat gct gaa tcc caa atg aac 528 Phe Arg Pro Ser Leu Ala Trp Phe Arg Asp Ala Glu Ser Gln Met Asn 165 170 175 cac cac ttg gca gtt cta ttt ggt gct agc tct ttg ggc tgg aca ggc 576 His His Leu Ala Val Leu Phe Gly Ala Ser Ser Leu Gly Trp Thr Gly 180 185 190 cac tta atc cac gtt gct att ccc gaa gct cgg ggt cag cac gta ggt 624 His Leu Ile His Val Ala Ile Pro Glu Ala Arg Gly Gln His Val Gly 195 200 205 tgg gat aac ttt ctg tca acc atg cct cac cct gct ggt tta gcg cct 672 Trp Asp Asn Phe Leu Ser Thr Met Pro His Pro Ala Gly Leu Ala Pro210 215 220 ttc ttt act ggg cgt tgg gga gtt tat gct caa aac cct gat act gct 720 Phe Phe Thr Gly Arg Trp Gly Val Tyr Ala Gln Asn Pro Asp Thr Ala 225 230 235 240 ggt cat att ttt gga act agc gaa ggt gct gga act gcg att att acc 768 Gly His Ile Phe Gly Thr Ser Glu Gly Ala Gly Thr Ala Ile Ile Thr 245 250 255 ttt att ggc ggt ttc cat ccc caa act gaa gca ttg tgg cta act gat 816 Phe Ile Gly Gly Gly Phe His Pro Gln Thr Glu Ala Leu Trp Leu Thr Asp 260 265 270 att gcc cac cac cat ctg gct att gct gtg atg tac atc att gct ggc 864 Ile Ala His His His Leu Ala Ile Ala Val Met Tyr Ile Ile Ala Gly 275 280 280 cat atg tat cga act cag ttc ggt att ggg cat agt atg aaa gag atc 912 His Met Tyr Arg Thr Gln Phe Gly Ile Gly His Ser Met Lys Glu Ile 290 295 300 cta gaa gca cac acc cct ccc agc ggg atg ttg ggt gat gcg cacag 960 Leu Glu Ala His Thr Pro Pro Ser Gly Met Leu Gly Asp Ala His Lys 305 310 315 320 ggc ctt tat gac act tac aat gaa tct cta cat ttc cag tta ggt ttc 1008 Gly Leu Tyr Asp Thr Tyr Asn Glu Ser Leu His Phe Gln Leu Gly Phe 325 330 335 cac cta gct gca tta ggt gta atc act tct gtg gtt gcc caa cat atg 1056 His Leu Ala Ala Leu Gly Val Ile Thr Ser Val Val Ala Gln His Met 340 345 350 tat tca ttg ccg tca tac gct ttc atc tct caa gac cat gtc aca caa 1104 Tyr Ser Leu Pro Ser Tyr Ala Phe Ile Ser Gln Asp His Val Thr Gln 355 360 365 gct gcg ctt tac aca cat cac caa tat att gct gga att cta gca att 1152 Ala Ala Leu Tyr Thr His His Gln Tyr Ile Ala Gly Ile Leu Ala Ile 370 375 380 ggt gct ttt gcg cat ggt ggt atc ttc ttt gtc cga gat tac gat cca 1200 Gly Ala Phe Ala His Gly Gly Ile Phe Phe Val Arg Asp Tyr Asp Pro 385 390 395 400 gaa cgt aac aag aac aac gtt ctt gct cgt gct ctt gag cat aaa gag 1248 Glu Arg Asn Lys Asn Asn Val Leu Ala Arg Ala Leu Glu His Lys Glu 405 410 415 gcg att atc tcc cac cta tct tgg gta tcc atgtt agt ggt ttc cat 1296 Ala Ile Ile Ser His Leu Ser Trp Val Ser Met Phe Ser Gly Phe His 420 425 430 acc ctt ggt gtt tat gtt cat aac gac acc gtg gta gct ttt ggt act 1344 Thr Leu Gly Val Tyr Val His Asn Asp Thr Val Val Ala Phe Gly Thr 435 440 445 cct gag aag caa att ttg gtt gag cca atc ttt gcg caa tgg att cag 1392 Pro Glu Lys Gln Ile Leu Val Glu Pro Ile Phe Ala Gln Trp Ile Gln 450 455 460 gca gct cat ggc aaa ctg ctc tta gga ttt gaa aca ctg ctt tca aat 1440 Ala Ala His Gly Lys Leu Leu Leu Gly Phe Glu Thr Leu Leu Ser Asn 465 470 470 475 480 cct aat gga ttg gct tat aac cct cct aac att tct c ttt 1488 Pro Asn Gly Leu Ala Tyr Asn Pro Pro Asn Ile Ser Pro Asp Val Phe 485 490 495 gtt cct gga tgg gtt gaa gca atg aac aac cct gtt atc ggg ccg ttt 1536 Val Pro Gly Trp Val Glu Ala Met Asn Asn Pro Val Ile Gly Pro Phe 500 505 510 atg tct caa ggg cct ggt gac ttc ttg gtt cat cat ggt att gcc ttc 1584 Met Ser Gln Gly Pro Gly Asp Phe Leu Val His His Gly Ile Ala Phe 515 520 525 agt ttg cat gtc acc gtc tta atc tgt gtc aag ggt tgt ttg gat gcc 1632 Ser Leu His Val Thr Val Leu Ile Cys Val Lys Gly Cys Leu Asp Ala 530 535 540 cgt ggt tct aaa ctg atg cct gac aag aaa gac ttt ggt tat agc ttc 1680 Arg Gly Ser Lys Leu Met Pro Asp Lys Lys Asp Phe Gly Tyr Ser Phe 545 550 555 560 cct tgt gat ggc ccc gga cgt ggc ggt act tgt gat atc tct gct tgg 1728 Pro Cys Asp Gly Pro Gly Arg Gly Gly Thr Cys Asp Ile Ser Ala Trp 565 570 575 gat tcc ttc tac ctt gcc ttc ttc tgg atg ctc aac aca att ggt tgg 1776 Asp Ser Phe Tyr Leu Ala Phe Phe Trp Met Leu Asn Thr Ile Gly Trp 580 585 590 att gtc ttc tac ttc aac tc ttag ac gct atc tgg tct ggt aac 1824 Ile Val Phe Tyr Phe Asn Trp Lys His Leu Ala Ile Trp Ser Gly Asn 595 600 605 gaa gct cag ttc aat acc aac tct act tat cta atg ggt tgg ctg cga 1872 Glu Ala Gln Phe Asn Thr Asn Ser Thr Tyr Leu Met Gly Trp Leu Arg 610 615 620 gac tac ctt tgg gga tac tca gct caa ttg att aac ggt tac aca cca 1920 Asp Tyr Leu Trp Gly Tyr Ser Ala Gln Leu Ile Asn Gly Tyr Thr Pro 625 630 630 635 640 ttt ggt gta aat agc ctg tca gtt tgg gct tgg att ttc ctc tta ggc 1968 Phe Gly Val Asn Ser Leu Ser Val Trp Ala Trp Ile Phe Leu Leu Gly 645 650 655 cac ctc tgc tgg gcg act ggc ttg tc ttg ttc ttg ttc ttg ttc ttc ttg ttc ttg ttc ttc ttg ttc ttg ttc ttc ttg ttc ttg ttg ttc ttg ttc ttc ttg ttc ttg ttc ttc g aga ggt 2016 His Leu Cys Trp Ala Thr Gly Phe Leu Phe Leu Ile Ser Trp Arg Gly 660 665 670 tac tgg caa gag ctg att gag act ctc gtt tgg gct cac cag cgt act 2064 Tyr Trp Gln Glu Leu Ile Glu Thr Leu Val Trp Ala His Gln Arg Thr 675 680 685 ccc ctc gcc aac tta gtg aca tgg aaa gac aag cct gtt gct ctc tct 2112 Pro Leu Ala Asn Leu Val Thr Trp Lys Asp Lys Pro Val Ala Leu Ser 690 695 700 atc gtt caa ggt cgc ttg gtg ggt tta gtc cac ttt gcg gtt ggc tat 2160 Ile Val Gln Gly Arg Leu Val Gly Leu Val His Phe Ala Val Gly Tyr 705 710 710 715 720 tat gtg acc tac gcg gct ttt gtg att ggt gca aca gct cct ctc gg Val Thr Tyr Ala Ala Phe Val Ile Gly Ala Thr Ala Pro Leu Gly 725 730 735 taa 2211 <210> 4 <211> 736 <212> PRT <213> Acaryuochloris marina <400> 4 Met Ala Thr Lys Phe Pro Ser Phe Ser Gln Asp Leu Ala Gln Asp Pro 1 5 10 15 Thr Thr Arg Arg Ile Trp Tyr Gly Ile Ala Thr Val His Asp Phe Glu 20 25 30 Thr His Asp Gly Met Thr Glu Glu Asn Leu Tyr Gln Lys Ile Phe Ala 35 40 45 Thr His Phe Gly His Leu Ser Ile Ile Phe Leu Trp Ser Ala Gly His 50 55 60 Leu Phe His Val Ala Trp Gln Gly Asn Phe Glu Gln Trp Ile Gln Asp 65 70 75 80 Pro Leu Thr Ile Arg Pro Ile Ala His Ala Ile Trp Asp Pro His Leu 85 90 95 Gly Asp Ala Ala Thr Gln Ala Phe Thr Gln Ala Gly Ala Ser Gly Pro 100 105 110 Val Asp Leu Cys Tyr Ser Gly Leu Tyr Gln Trp Trp Tyr Thr Ile Gly 115 120 125 Met Arg Thr Asn Gly Asp Leu Tyr Ile Gly Ser Val Phe Leu Met Ile 130 135 140 Val Ala Ala Val Met Leu Phe Ala Gly Trp Leu His Leu Gln Pro Lys 145 150 155 160 Phe Arg Pro Ser Leu Ala Trp Phe Arg Asp Ala Glu Ser Gln Met Asn 165 170 175 His His Leu Ala Val Leu Phe Gly Ala Ser Ser Leu Gly Trp Thr Gly 180 185 190 His Leu Ile His Val Ala Ile Pro Glu Ala Arg Gly Gln His Val Gly 195 200 205 Trp Asp Asn P he Leu Ser Thr Met Pro His Pro Ala Gly Leu Ala Pro 210 215 220 Phe Phe Thr Gly Arg Trp Gly Val Tyr Ala Gln Asn Pro Asp Thr Ala 225 230 235 240 Gly His Ile Phe Gly Thr Ser Glu Gly Ala Gly Thr Ala Ile Ile Thr 245 250 255 Phe Ile Gly Gly Phe His Pro Gln Thr Glu Ala Leu Trp Leu Thr Asp 260 265 270 Ile Ala His His His Leu Ala Ile Ala Val Met Tyr Ile Ile Ala Gly 275 280 285 285 His Met Tyr Arg Thr Gln Phe Gly Ile Gly His Ser Met Lys Glu Ile 290 295 300 Leu Glu Ala His Thr Pro Pro Ser Gly Met Leu Gly Asp Ala His Lys 305 310 315 320 Gly Leu Tyr Asp Thr Tyr Asn Glu Ser Leu His Phe Gln Leu Gly Phe 325 330 335 His Leu Ala Ala Leu Gly Val Ile Thr Ser Val Val Ala Gln His Met 340 345 350 Tyr Ser Leu Pro Ser Tyr Ala Phe Ile Ser Gln Asp His Val Thr Gln 355 360 365 Ala Ala Leu Tyr Thr His His Gln Tyr Ile Ala Gly Ile Leu Ala Ile 370 375 380 Gly Ala Phe Ala His Gly Gly Ile Phe Phe Val Arg Asp Tyr Asp Pro 385 390 395 400 Glu Arg Asn Lys Asn Asn Val Leu Ala Arg Ala Leu Glu His Lys Glu 405 410 415 Ala Ile Ile S er His Leu Ser Trp Val Ser Met Phe Ser Gly Phe His 420 425 430 Thr Leu Gly Val Tyr Val His Asn Asp Thr Val Val Ala Phe Gly Thr 435 440 445 Pro Glu Lys Gln Ile Leu Val Glu Pro Ile Phe Ala Gln Trp Ile Gln 450 455 460 Ala Ala His Gly Lys Leu Leu Leu Gly Phe Glu Thr Leu Leu Ser Asn 465 470 475 480 Pro Asn Gly Leu Ala Tyr Asn Pro Pro Asn Ile Ser Pro Asp Val Phe 485 490 495 Val Pro Gly Trp Val Glu Ala Met Asn Asn Pro Val Ile Gly Pro Phe 500 505 510 Met Ser Gln Gly Pro Gly Asp Phe Leu Val His His Gly Ile Ala Phe 515 520 525 Ser Leu His Val Thr Val Val Leu Ile Cys Val Lys Gly Cys Leu Asp Ala 530 535 540 Arg Gly Ser Lys Leu Met Pro Asp Lys Lys Asp Phe Gly Tyr Ser Phe 545 550 555 560 Pro Cys Asp Gly Pro Gly Arg Gly Gly Thr Cys Asp Ile Ser Ala Trp 565 570 575 Asp Ser Phe Tyr Leu Ala Phe Phe Trp Met Leu Asn Thr Ile Gly Trp 580 585 590 590 Ile Val Phe Tyr Phe Asn Trp Lys His Leu Ala Ile Trp Ser Gly Asn 595 600 605 Glu Ala Gln Phe Asn Thr Asn Ser Thr Tyr Leu Met Gly Trp Leu Arg 610 615 620 620 Asp Tyr Leu TrpGly Tyr Ser Ala Gln Leu Ile Asn Gly Tyr Thr Pro 625 630 635 640 Phe Gly Val Asn Ser Leu Ser Val Trp Ala Trp Ile Phe Leu Leu Gly 645 650 655 His Leu Cys Trp Ala Thr Gly Phe Leu Phe Leu Ile Ser Trp Arg Gly 660 665 670 Tyr Trp Gln Glu Leu Ile Glu Thr Leu Val Trp Ala His Gln Arg Thr 675 680 685 Pro Leu Ala Asn Leu Val Thr Trp Lys Asp Lys Pro Val Ala Leu Ser 690 695 700 Ile Val Gln Gly Arg Leu Val Gly Leu Val His Phe Ala Val Gly Tyr 705 710 715 720 720 Tyr Val Thr Tyr Ala Ala Phe Val Ile Gly Ala Thr Ala Pro Leu Gly 725 730 735 <210> 5 <211> 1083 <212> DNA <213> Acaryuochloris marina <220> <221> CDS <222> (1) ... (1080) <400> 5 atg aca aca gtc ttg caa aga cgc gaa agc gcc agc gca tgg gaa aga 48 Met Thr Thr Val Leu Gln Arg Arg Glu Ser Ala Ser Ala Trp Glu Arg 1 5 10 15 ttc tgt agc ttc atc acc agc acc aac aac cgt tta tac atc ggt tgg 96 Phe Cys Ser Phe Ile Thr Ser Thr Asn Asn Arg Leu Tyr Ile Gly Trp 20 25 30 ttc ggc gta ttg atg att cct aca ctt ctc acc gct gta acc tgt ttc 144 Phe Gly Val Leu Met Ile Pro Thr Leu Leu Thr Ala Val Thr Cys Phe 35 40 45 gta atc gcc ttc atc ggc gcc cct ccc gtc gac atc gat gga atc cgt 192 Val Ile Ala Phe Ile Gly Ala Pro Pro Val Asp Ile Asp Gly Ile Arg 50 55 60 gag ccc gtt gct ggt tca cta ctt tat ggc aac aac atc atc act ggt 240 Glu Pro Val Ala Gly Ser Leu Leu Tyr Gly Asn Asn Ile Ile Thr Gly 65 70 75 80 gcc gtt gtt cct tca tcc aac gcc att ggc ctt cac ctg tat ccc atc 288 Ala Val Val Pro Ser Ser Asn Ala Ile Gly Leu His Leu Tyr Pro Ile 85 90 95 tgg gaa gca gct tct ctt gat gag tgg ttg tac aac ggt ggc cct tac 336 Trp Glu Ala Ala Ser Leu Asp Glu Trp Leu Tyr Asn Gly Gly Pro Tyr 100 105 110 cag cta atc att ttc cat tac atg att ggt tgt att tgc tac ctc ggt 384 Gln Leu Ile Ile Phe His Tyr Met Ile Gly Cys Ile Cys Tyr Leu Gly 115 120 125 cgt cag tgg gag tac agc tac cgt cta ggg atg cgt att tgt 432 Arg Gln Trp Glu Tyr Ser Tyr Arg Leu Gly Met Arg Pro Trp Ile Cys 130 135 140 gtt gct tac tct gca cct ttg gcc gct acc tac tct gtc ttc ttg atc 480 Val Ala Tyr Ser Ala Pro Leu Ala Ala Thr Tyr Ser Val Phe Leu Ile 145 150 155 160 tat cct cta ggt cag ggc agc ttc tcc gac gga atg cct cta ggc atc 528 Tyr Pro Leu Gly Gln Gly Ser Phe Ser Asp Gly Met Pro Leu Gly Ile 165 170 175 agc gga acc ttc aac ttc atg ttc gtg ttc caa gct gag cac aac atc 576 Ser Gly Thr Phe Asn Phe Met Phe Val Phe Gln Ala Glu His Asn Ile 180 185 190 ctc atg cac ccc ttc cac atg ttt gga gtt gct ggt gta ctg ggt ggt 624 His Pro Phe His Met Phe Gly Val Ala Gly Val Leu Gly Gly 195 200 205 tcc tta ttc gcc gcc atg cac ggt tcc ttg gtt agc tcc act cta gtt 672 Ser Leu Phe Ala Ala Met His Gly Ser Leu Val Ser Ser Thr Leu Val 210 215 220 cgt gag acc acc gaa ggt gag tcc gcc aac tac ggt tac aag ttc ggc 720 Arg Glu Thr Thr Glu Gly Glu Ser Ala Asn Tyr Gly Tyr Lys Phe Gly 225 230 235 240 caa gag gaa gag acc tac aac atc gtt gca gcc cac ggc tac ttc ggt 768 Gln Glu Glu Glu Thr Tyr Asn Ile Val Ala Ala His Gly Tyr Phe Gly 245 250 255 cgt ttg atc ttc caa tat gca tct ttc agc aac agc cgt tcc ttg cac 816 Arg Leu Ile Phe Ser Phe Ser Asn Ser Arg Ser Leu His 260 265 270 ttc ttc ttg ggt gca tgg ccc gtt gtc tgc atc tgg ttg act gca atg 864 Phe Phe Leu Gly Ala Trp Pro Val Val Cys Ile Trp Leu Thr Ala Met 275 280 285 ggc atc agc acc atg gcc ttc aac ttg aat ggt ttc aac ttc aac cac 912 Gly Ile Ser Thr Met Ala Phe Asn Leu Asn Gly Phe Asn Phe Asn His 290 295 300 tcc atc gtt gat tca caa ggt aac gtt gtg aac aca ggg gct 960 Ser Ile Val Asp Ser Gln Gly Asn Val Val Asn Thr Trp Ala Asp Val 305 310 315 320 cta aac cgc gcc aac ttg ggt ttc gaa gtt atg cac gag cgt aac gct 1008 Leu Asn Arg Ala Asn Leu Gly Phe Glu Val Met H is Glu Arg Asn Ala 325 330 335 cat aac ttc ccc tta gac ttg gct gct ggt gag tct gct cct gtt gct 1056 His Asn Phe Pro Leu Asp Leu Ala Ala Gly Glu Ser Ala Pro Val Ala 340 345 350 ctt act gct cct gtc atc aac ggt taa 1083 Leu Thr Ala Pro Val Ile Asn Gly 355 360 <210> 6 <211> 360 <212> PRT <213> Acaryuochloris marina <400> 6 Met Thr Thr Val Leu Gln Arg Arg Glu Ser Ala Ser Ala Trp Glu Arg 1 5 10 15 Phe Cys Ser Phe Ile Thr Ser Thr Asn Asn Arg Leu Tyr Ile Gly Trp 20 25 30 Phe Gly Val Leu Met Ile Pro Thr Leu Leu Thr Ala Val Thr Cys Phe 35 40 45 Val Ile Ala Phe Ile Gly Ala Pro Pro Val Asp Ile Asp Gly Ile Arg 50 55 60 Glu Pro Val Ala Gly Ser Leu Leu Tyr Gly Asn Asn Ile Ile Thr Gly 65 70 75 80 Ala Val Val Pro Ser Ser Asn Ala Ile Gly Leu His Leu Tyr Pro Ile 85 90 95 Trp Glu Ala Ala Ser Leu Asp Glu Trp Leu Tyr Asn Gly Gly Pro Tyr 100 105 110 Gln Leu Ile Ile Phe His Tyr Met Ile Gly Cys Ile Cys Tyr Leu Gly 115 120 125 Arg Gln Trp Glu Tyr Ser Tyr Arg Leu Gly Met Arg Pro Trp Ile Cys 130 135 140 Val Ala Tyr Ser Ala Pro Leu Ala Ala Thr Tyr Ser Val Phe Leu Ile 145 150 155 160 Tyr Pro Leu Gly Gln Gly Ser Phe Ser Asp Gly Met Pro Leu Gly Ile 165 170 175 Ser Gly Thr Phe Asn Phe Met Phe Val Phe Gln Ala Glu His Asn Ile 180 185 190 Leu Met His Pro Phe His Met Phe Gly Val Ala Gly Val Leu Gly Gly 195 200 205 Ser Leu Phe A la Ala Met His Gly Ser Leu Val Ser Ser Thr Leu Val 210 215 220 Arg Glu Thr Thr Glu Gly Glu Ser Ala Asn Tyr Gly Tyr Lys Phe Gly 225 230 235 240 Gln Glu Glu Glu Glu Thr Tyr Asn Ile Val Ala Ala His Gly Tyr Phe Gly 245 250 255 Arg Leu Ile Phe Gln Tyr Ala Ser Phe Ser Asn Ser Arg Ser Leu His 260 265 270 Phe Phe Leu Gly Ala Trp Pro Val Val Cys Ile Trp Leu Thr Ala Met 275 280 285 Gly Ile Ser Thr Met Ala Phe Asn Leu Asn Gly Phe Asn Phe Asn His 290 295 300 Ser Ile Val Asp Ser Gln Gly Asn Val Val Asn Thr Trp Ala Asp Val 305 310 315 320 Leu Asn Arg Ala Asn Leu Gly Phe Glu Val Met His Glu Arg Asn Ala 325 330 335 His Asn Phe Pro Leu Asp Leu Ala Ala Gly Glu Ser Ala Pro Val Ala 340 345 350 Leu Thr Ala Pro Val Ile Asn Gly 355 360 <210> 7 <211> 1020 <212> DNA <213> Acaryuochloris marina <220> <221> CDS <222> (1) ... (1017) <400> 7 tgg ttt tat gtn ctc gat gan tgg ctt aag cgt gat cgg ttc gtc ttt 48 Trp Phe Tyr Xaa Leu Asp Xaa Trp Leu Lys Arg Asp Arg Phe Val Phe 1 5 10 15 att ggt tgg tca ggt atc cta ctt ttc ccc tgt gcg ttt cta tcc atc 96 Ile Gly Trp Ser Gly Ile Leu Leu Phe Pro Cys Ala Phe Leu Ser Ile 20 25 30 ggg gga tgg ttt acc ggc aca act ttc gta act tcc tgg tac acc cac 144 Gly Gly Trp Phe Thr Gly Thr Thr Phe Val Thr Ser Trp Tyr Thr His 35 40 45 ggt ctt gct agc tcc tac cta gaa ggg gct aac ttc ttg acc gtt gct 192 Gly Leu Ala Ser Ser Tyr Leu Glu Gly Ala Asn Phe Leu Thr Val Ala 50 55 60 gta tcc act ccc gcc gac agc ctc ggc cac tcc cta ctt cta ctt tgg 240 Val Ser Thr Pro Ala Asp Ser Leu Gly His Ser Leu Leu Leu Leu Trp 65 70 75 80 gga ccc gaa gct caa ggt gac ttc acc cgc tgg tgt cag ctg ggt gga 288 Gly Pro Glu Ala Gln Gly Asp Phe Thr Arg Trp Cys Gln Leu Gly Gly 85 90 95 ttg tgg aac ttc acc aca tta cat ggt gtc ttc ggc ttg atc ggc ttc 336 Leu Trp Asn Phe Thr Thr Leu His Gly Val Phe Gly Leu Ile Gly Phe 100 105 110 atg ctg cgt caa ttc gag att gcc cgt cta gtc ggc gtg cgt cct tac 384 Met Leu Arg Gln Phe Glu Ile Ala Arg Leu Val Gly Val Arg Pro Tyr 115 120 125 aac gca gtt gcc ttc agc ggt cct tat gcc gtg tatg gtc ttt 432 Asn Ala Val Ala Phe Ser Gly Pro Ile Ala Val Tyr Val Ser Val Phe 130 135 140 ttg atg tat cct ttg ggc caa tcc agc tgg ttc ttt gca cct agc tgg 480 Leu Met Tyr Pro Leu Gly Gln Ser Ser Trp Phe Phe Ala Pro Ser Trp 145 150 155 160 ggt gta aca agc atc ttc cga ttc ttg tta ttt gct caa ggt ttc cac 528 Gly Val Thr Ser Ile Phe Arg Phe Leu Leu Phe Ala Gln Gly Phe His 165 170 175 aac cta acc ctc aac ccc ttc cac atg atg ggt gtt gca ggt att ttg 576 Asn Leu Thr Leu Asn Pro Phe His Met Met Gly Val Ala Gly Ile Leu 180 185 190 ggt ggt gcg ctg ttg tgc gcc att cac gga gcc act gtt gag aac acc 624 Gly Ala Leu Leu Cys Ala Ile His Gly Ala Thr Val Glu Asn Thr 195 200 205 ttg ttt gaa gac ggt caa gac gcc aat aca ttt gct gcg ttc act ccg 672 Leu Phe Glu Asp Gly Gln Asp Ala Asn Thr Phe Ala Ala Phe Thr Pro210 215 220 acc caa gca gaa gag acc tac tcc atg gtc act gct aac cga ttc tgg 720 Thr Gln Ala Glu Glu Thr Tyr Ser Met Val Thr Ala Asn Arg Phe Trp 225 230 235 240 tct cag att ttc ggg att gcc ttt tcc aac aag cgt tgg ttg cac ttt 768 Ser Gln Ile Phe Gly Ile Ala Phe Ser Asn Lys Arg Trp Leu His Phe 245 250 255 ttc atg ttg ttc gtt cct gtg act ggt cta tgg gct tct gcc att ggc 816 Phe Met Leu Phe Val Pro Val Thr Gly Leu Trp Ala Ser Ala Ile Gly 260 265 270 ctc gtg ggt atc gct ctc aac atg cgt gct tat gac ttc gtt agc cag 864 Leu Val Gly Ile Ala Leu Asn Met Arg Ala Tyr Asp Phe Val Ser Gln 275 280c 285ga cgg gct gct gaa gac cct gag ttc gaa acc ttc tac acc aag 912 Glu Ile Arg Ala Ala Glu Asp Pro Glu Phe Glu Thr Phe Tyr Thr Lys 290 295 300 aac att ctc ttg aat gaa ggt ctg cgc gct tgg atg gcac 960 Asn Ile Leu Leu Asn Glu Gly Leu Arg Ala Trp Met Ala Pro Gln Asp 305 310 315 320 caa atc cat gaa aac ttc atc ttc cct gag gag gtt cta cca cgt gga 1008 Gln Ile His Glu Asn Phe Ile Phe Pro Glu Glu Val Leu Pro Arg Gly 325 330 335 aac gcc ctt taa 1020 Asn Ala Leu <210> 8 <211> 339 <212> PRT <213> Acaryuochloris marina <400> 8 Trp Phe Tyr Xaa Leu Asp Xaa Trp Leu Lys Arg Asp Arg Phe Val Phe 1 5 10 15 Ile Gly Trp Ser Gly Ile Leu Leu Phe Pro Cys Ala Phe Leu Ser Ile 20 25 30 Gly Gly Trp Phe Thr Gly Thr Thr Phe Val Thr Ser Trp Tyr Thr His 35 40 45 Gly Leu Ala Ser Ser Tyr Leu Glu Gly Ala Asn Phe Leu Thr Val Ala 50 55 60 Val Ser Thr Pro Ala Asp Ser Leu Gly His Ser Leu Leu Leu Leu Trp 65 70 75 80 Gly Pro Glu Ala Gln Gly Asp Phe Thr Arg Trp Cys Gln Leu Gly Gly 85 90 95 Leu Trp Asn Phe Thr Thr Leu His Gly Val Phe Gly Leu Ile Gly Phe 100 105 110 Met Leu Arg Gln Phe Glu Ile Ala Arg Leu Val Gly Val Arg Pro Tyr 115 120 125 Asn Ala Val Ala Phe Ser Gly Pro Ile Ala Val Tyr Val Ser Val Phe 130 135 140 Leu Met Tyr Pro Leu Gly Gln Ser Ser Trp Phe Phe Ala Pro Ser Trp 145 150 155 160 Gly Val Thr Ser Ile Phe Arg Phe Leu Leu Phe Ala Gln Gly Phe His 165 170 175 Asn Leu Thr Leu Asn Pro Phe His Met Met Gly Val Ala Gly Ile Leu 180 185 190 Gly Gly Ala Leu Leu Cys Ala Ile His Gly Ala Thr Val Glu Asn Thr 195 200 205 Leu Phe Glu As p Gly Gln Asp Ala Asn Thr Phe Ala Ala Phe Thr Pro 210 215 220 Thr Gln Ala Glu Glu Thr Tyr Ser Met Val Thr Ala Asn Arg Phe Trp 225 230 235 240 Ser Gln Ile Phe Gly Ile Ala Phe Ser Asn Lys Arg Trp Leu His Phe 245 250 255 Phe Met Leu Phe Val Pro Val Thr Gly Leu Trp Ala Ser Ala Ile Gly 260 265 270 Leu Val Gly Ile Ala Leu Asn Met Arg Ala Tyr Asp Phe Val Ser Gln 275 280 285 Glu Ile Arg Ala Ala Glu Asp Pro Glu Phe Glu Thr Phe Tyr Thr Lys 290 295 300 Asn Ile Leu Leu Asn Glu Gly Leu Arg Ala Trp Met Ala Pro Gln Asp 305 310 315 320 Gln Ile His Glu Asn Phe Ile Phe Pro Glu Glu Val Leu Pro Arg Gly 325 330 335 Asn Ala Leu <210> 9 <211> 1368 <212> DNA <213> Acaryuochloris marina <220> <221> CDS <222> (1) ... (1368) <400> 9 gta cac aca gtc gtc ctg aat gat cca gga cga ctg cta tct gtg cat 48 Val His Thr Val Val Leu Asn Asp Pro Gly Arg Leu Leu Ser Val His 1 5 10 15 ttg atg cac act gcc ctg gta agc ggc tgg gca ggc tcc atg gca ctc 96 Leu Met His Thr Ala Leu Val Ser Gly Trp Ala Gly Ser Met Ala Leu 20 25 30 tac gag ttg gcc aag tac gat cca agc gat cca gta tta aac ccc atg 144 Tyr Glu Leu Ala Lys Tyr Asp Pro Ser Asp Pro Val Leu Asn Pro Met 35 40 45 tgg cgt caa ggc aca ttc gtt atg cct gtg atg act cgc att ggt gtc 192 Trp Arg Gln Gly Thr Phe Val Met Pro Val Met Thr Arg Ile Gly Val 50 55 60 act cac tct tgg agt ggc tgg aca gtt act ggt gag cct tgg gtt aca 240 Thr His Ser Trp Ser Gly Trp Thr Val Thr Gly Glu Pro Trp Val Thr 65 70 75 80 cag cca gca att tta ggc gca cac cta aac ttc ttt agt tat gag ggt 288 Gln Pro Gly Ile Leu Gly Ala His Leu Asn Phe Phe Ser Tyr Glu Gly 85 90 95 gtc atc ctc atg cat atc ctg gct gca ggt ttg ttt ttc tta gct gcc 336 Val Ile Leu Met His Ile Leu Ala Ala Gly Leu Phe Phe Leu Ala Ala 100 105 110 gtt tgg cac tgg att aac tgg gat tta gac atc tac tat ccc gat ggt 384 Val Trp His Trp Ile Asn Trp Asp Leu Asp Ile Tyr Tyr Pro Asp Gly 115 120 125 tct tct gag ccc gca agt gat tgg ccc aaa att ttc ggt ctt cac cta 432 Ser Ser Glu Pro Ala Ser Asp Trp Pro Lys Ile Phe Gly Leu His Leu 130 135 140 ctg aca tta gga att gtt tgt ttc ggc ttt gga tct ctc cac tta act 480 Leu Thr Leu Gly Ile Val Cys Phe Gly Phe Gly Ser Leu His Leu Thr 145 150 155 160 gga atc tta ggc cca ggc atg tgg gtt tcg gat cct tac gga ctc aca 528 Gly Ile Leu Gly Pro Gly Met Trp Val Ser Asp Pro Tyr Gly Leu Thr 165 170 175 ggt cat gtg caa ggg gtt agc cca gat tgg aga cca ttt gcc ttt gac 576 Gly His Val Gln Gly Val Ser Pro Asp Trp Arg Pro Phe Ala Phe Asp 180 185 190 ccc tac aat ccc aca ggt ttg gtt act cac cat atc tct gca ggg att 624 Pro Tyr Asn Pro Thr Gly Leu Val Thr His His Ile Ser Ala Gly Ile 195 200 205 gcc ctc atc att ggc ggc att ttc cac act gtt tct cgt cct tct gag 672 Ala Leu Ile Ile Gly Gly Ile Phe His Thr Val Ser Arg Pro Ser Glu210 215 220 cgc ctt tat aac gct ctc agc atg ggt aac gtt gaa acc gta cta tcc 720 Arg Leu Tyr Asn Ala Leu Ser Met Gly Asn Val Glu Thr Val Leu Ser 225 230 235 240 agt tct gtt gcc ttt gta gct gcg gct gca ttc gta atg gtt gga acc 768 Ser Ser Val Ala Phe Val Ala Ala Ala Ala Phe Val Met Val Gly Thr 245 250 255 atg tgg tat gga agt gca act act cca atc gaa ttg ttt ggt cct act 816 Met Trp Tyr Gly Ser Ala Thr Thr Pro Ile Glu Leu Phe Gly Pro Thr 260 265 270 cgt tat cag tgg gat agc ggt tac ttc caa act gaa atc cag cgc cgt 864 Arg Tyr Gln Trp Asp Ser Gly Tyr Phe Gln Thr Glu Ile Gln Arg Arg 275 280 285 gtg cag tct ggt caa aca tgg gat caa atc cct gag aag ctt gtt ttc 912 Val Gln Ser Gly Gln Thr Trp Asp Gln Ile Pro Glu Lys Leu Val Phe 290 295 300 tac gat tac atc ggt aat agc cct gct aaa ggt ggt tta ttc cgca 960 Tyr Asp Tyr Ile Gly Asn Ser Pro Ala Lys Gly Gly Leu Phe Arg Thr 305 310 315 320 ggt gct atg aac agt ggt gac ggt att gct aga gca tgg gaa ggt cat 1008 Gly Ala Met Asn Ser Gly Asp Gly Ile Ala Arg Al a Trp Glu Gly His 325 330 335 cct aca ttt acg gat tct gaa ggt cgt gag ttg ttc gtg cga cgc atg 1056 Pro Thr Phe Thr Asp Ser Glu Gly Arg Glu Leu Phe Val Arg Arg Met 340 345 350 ccc aac ttc ttc gaa act ttc cca gtt gtt cta act gac aaa gat ggt 1104 Pro Asn Phe Phe Glu Thr Phe Pro Val Val Leu Thr Asp Lys Asp Gly 355 360 365 gtt gtc cgc gct gac att cct ttc cga cga gct gaa tct cga tac agc 1152 Val Val Arg Ala Asp Ile Pro Phe Arg Arg Ala Glu Ser Arg Tyr Ser 370 375 380 ttt gag cag aaa ggt gtt tca gtc tcc ttt gaa ggt ggt act cta aac 1200 Phe Glu Gln Lys Gly Val Ser Val Ser Phe Glu Gly Gly Thr Leu Asn 385 390 395 400 ggt caa acc ttc acc gat gct cct tct gtt aag aag tat gct cgt aaa 1248 Gly Gln Thr Phe Thr Asp Ala Pro Ser Val Lys Lys Tyr Ala Arg Lys 405 410 415 gct cag ctt ggt gaa cct ttt gag ttc gat cgt gaa acg ctt ggt tct 1296 Ala Gln Leu Gly Glu Pro Phe Glu Phe Asp Arg Glu Thr Leu Gly Ser 420 425 430 gat ggt gtt ttc cga acc agc act cgt ggt tgg ttt gca ttc agc cac 1344 Asp Gly Val Phe Arg Thr Ser Thr Arg Gly Trp Phe Ala Phe Ser His 435 440 445 tct tgc tat gca cta ctc ttc ttc 1368 Ser Cys Tyr Ala Leu Leu Phe Phe 450 455 <210> 10 <211> 456 <212> PRT <213> Acaryuochloris marina <400> 10 Val His Thr Val Val Leu Asn Asp Pro Gly Arg Leu Leu Ser Val His 1 5 10 15 Leu Met His Thr Ala Leu Val Ser Gly Trp Ala Gly Ser Met Ala Leu 20 25 30 Tyr Glu Leu Ala Lys Tyr Asp Pro Ser Asp Pro Val Leu Asn Pro Met 35 40 45 Trp Arg Gln Gly Thr Phe Val Met Pro Val Met Thr Arg Ile Gly Val 50 55 60 Thr His Ser Trp Ser Gly Trp Thr Val Thr Gly Glu Pro Trp Val Thr 65 70 75 80 Gln Pro Gly Ile Leu Gly Ala His Leu Asn Phe Phe Ser Tyr Glu Gly 85 90 95 Val Ile Leu Met His Ile Leu Ala Ala Gly Leu Phe Phe Leu Ala Ala 100 105 110 Val Trp His Trp Ile Asn Trp Asp Leu Asp Ile Tyr Tyr Pro Asp Gly 115 120 125 Ser Ser Glu Pro Ala Ser Asp Trp Pro Lys Ile Phe Gly Leu His Leu 130 135 140 Leu Thr Leu Gly Ile Val Cys Phe Gly Phe Gly Ser Leu His Leu Thr 145 150 155 160 Gly Ile Leu Gly Pro Gly Met Trp Val Ser Asp Pro Tyr Gly Leu Thr 165 170 175 Gly His Val Gln Gly Val Ser Pro Asp Trp Arg Pro Phe Ala Phe Asp 180 185 190 Pro Tyr Asn Pro Thr Gly Leu Val Thr His His Ile Ser Ala Gly Ile 195 200 205 Ala Leu Ile Ile Gly Gly Ile Phe His Thr Val Ser Arg Pro Ser Glu 210 215 220 Arg Leu Tyr Asn Ala Leu Ser Met Gly Asn Val Glu Thr Val Leu Ser 225 230 235 240 Ser Ser Val Ala Phe Val Ala Ala Ala Ala Phe Val Met Val Gly Thr 245 250 255 Met Trp Tyr Gly Ser Ala Thr Thr Pro Ile Glu Leu Phe Gly Pro Thr 260 265 270 Arg Tyr Gln Trp Asp Ser Gly Tyr Phe Gln Thr Glu Ile Gln Arg Arg 275 280 285 val Gln Ser Gly Gln Thr Trp Asp Gln Ile Pro Glu Lys Leu Val Phe 290 295 300 Tyr Asp Tyr Ile Gly Asn Ser Pro Ala Lys Gly Gly Leu Phe Arg Thr 305 310 315 320 Gly Ala Met Asn Ser Gly Asp Gly Ile Ala Arg Ala Trp Glu Gly His 325 330 335 Pro Thr Phe Thr Asp Ser Glu Gly Arg Glu Leu Phe Val Arg Arg Met 340 345 350 Pro Asn Phe Phe Glu Thr Phe Pro Val Val Leu Thr Asp Lys Asp Gly 355 360 365 Val Val Arg Ala Asp Ile Pro Phe Arg Arg Ala Glu Ser Arg Tyr Ser 370 375 380 Phe Glu Gln Lys Gly Val Ser Val Ser Phe Glu Gly Gly Thr Leu Asn 385 390 395 400 400 Gly Gln Thr Phe Thr Asp Ala Pro Ser Val Lys Lys Tyr Ala Arg Lys 405 410 415 Ala Gln Leu Gly Glu Pro Phe Glu Phe Asp Arg Glu Thr Leu Gly Ser 420 425 430 Asp Gly Val Phe Arg Thr Ser Thr Arg Gly Trp Phe Ala Phe Ser His 435 440 445 Ser Cys Tyr Ala Leu Leu Phe Phe 450 455 <210> 11 <211> 1339 <212> DNA <213> Acaryuochloris marina <220> <221> CDS <222> (1) ... (1339) <400> 11 atg aaa act tca tct tcc ctg agg agg ttc tac cac gtg gaa acg ccc 48 Met Lys Thr Ser Ser Ser Leu Arg Arg Phe Tyr His Val Glu Thr Pro 1 5 10 15 ttt aat ccg tct gcg gct ggt tat gac cgc gca acc act ggc tat ggc 96 Phe Asn Pro Ser Ala Ala Gly Tyr Asp Arg Ala Thr Thr Gly Tyr Gly 20 25 30 tgg tgg gct gga aat gca cga tta act gat cta tct ggt cag cta act 144 Trp Trp Ala Gly Asn Ala Arg Leu Thr Asp Leu Ser Gly Gln Leu Thr 35 40 45 ggt gcc cac att gcc cat gct gga atg att acc ttc tgg gct ggt gca 192 Gly Ala His Ile Ala His Ala Gly Met Ile Thr Phe Trp Ala Gly Ala 50 55 60 atg act ttg ttt gaa gtc tct cac ttc att cct gaa aag cct atg tac 240 Met Thr Leu Phe Glu Val Ser His Phe Ile Pro Glu Lys Pro Met Tyr 65 70 75 80 gag caa ggc agc atc ctg ctt gct cac cta gcc gct gaga ggt ttt ggt 288 Glu Gln Gly Ser Ile Leu Leu Ala His Leu Ala Ala Glu Gly Phe Gly 85 90 95 gtt gga cct ggt ggt gaa gtt att agc act tat cct tat ttt gtg att 336 Val Gly Pro Gly Gly Glu Glu Val Ile Ser Thr Tyr Pro Tyr Phe Val Ile 100 105 110 ggt gca att cac cta att gct tct gct gtc ctc ggt ttt ggt ggc ctt 384 Gly Ala Ile His Leu Ile Ala Ser Ala Val Leu Gly Phe Gly Gly Leu 115 120 125 tac cac aca ttc aga ggc cct gct aag ttt gag gat tac gat tgg 432 Tyr His Thr Phe Arg Gly Pro Ala Lys Phe Glu Asp Tyr Ser Asp Trp 130 135 140 tgg ggg tat gac tgg gaa gac aaa gaa aag atg atg cag atc ctg ggg 480 Trp Gly Tyr Asp Trp Glu Asp Lys Glu Lys Met Met Gln Ile Leu Gly 145 150 155 160 att cac tta atc ttc ctc gga att ggt gct ctt gct ttt gct gca aaa 528 Ile His Leu Ile Phe Leu Gly Ile Gly Ala Leu Ala Phe Ala Ala Lys 165 170 175 gcc atg ttc ttt ggt ctt tat gat ccc tgg gct cct ggt ggt gga 576 Ala Met Phe Phe Gly Gly Leu Tyr Asp Pro Trp Ala Pro Gly Gly Gly 180 185 190 aat gtt cgc ctg att act aac cca act tgg aac tta ggt act ttc ctg 624 Asn Val Arg Leu Ile Thr Asn Pro Thr Trp Asn Leu Gly Thr Phe Leu 195 200 205 ggt tac att acc cga tct ccc tgg gga gaa ggt ggc tgg atc gtt agt 672 Gly Tyr Ile Thr Arg Ser Pro Trp Gly Glu Gly Gly Trp Ile Val Se r 210 215 220 gtt aac aac cta gaa gac gtt gta ggt ggt cac ctt ctc gta ggt gtt 720 Val Asn Asn Leu Glu Asp Val Val Gly Gly His Leu Leu Val Gly Val 225 230 235 240 cac tac atc ttc ggt ggc gtt ttc cac att ctt gtt aag cct tgg ggt 768 His Tyr Ile Phe Gly Gly Val Phe His Ile Leu Val Lys Pro Trp Gly 245 250 255 tgg gtt cgc cga gcc tat gt gtc tgg tct ggt gaa gcc tat ctc tcc tac 816 Trp Val Arg Arg Ala Tyr Val Trp Ser Gly Glu Ala Tyr Leu Ser Tyr 260 265 270 agc ttg ggt gcc ctt tac atg tgt ggc atg att gct gtg ggt tat gtc 864 Ser Leu Gly Ala Leu Tyr Met Cys Gly Met Ile Ala Val Gly Tyr Val 275 280 280 t ttt aac aac act gtt tac ccc agt gaa ttc tac ggt cct act gct 912 Trp Phe Asn Asn Thr Val Tyr Pro Ser Glu Phe Tyr Gly Pro Thr Ala 290 295 300 gct gaa gct tct cag gct cag gca atg acc ttt ttg att cgt gac caa 960 Ala Glu Ala Ser Gln Ala Gln Ala Met Thr Phe Leu Ile Arg Asp Gln 305 310 315 320 agg tta ggg gcg aac atc gct tct gcc caa ggt cct aca ggt ctt ggt 1008 Arg Leu Gly Ala Asn Ile Ala Ser Ala Gln Gly P ro Thr Gly Leu Gly 325 330 335 aag tat ctg atg cgt tct cct tct ggt gag atc atc ttc ggt ggt gag 1056 Lys Tyr Leu Met Arg Ser Pro Ser Gly Glu Ile Ile Phe Gly Gly Glu 340 345 350 acc atg cgt ttc tgg gat ttc cgt gga cct tgg ttg gag ccc ctt cgt 1104 Thr Met Arg Phe Trp Asp Phe Arg Gly Pro Trp Leu Glu Pro Leu Arg 355 360 365 gga ccc aac ggt ttg gac ctc aac aag ctc aga aat gat att cag cct 1152 Gly Pro Gly Leu Asp Leu Asn Lys Leu Arg Asn Asp Ile Gln Pro 370 375 380 tgg caa gct cgt cgt gcg gct gag tac atg act cat gct cct ttg ggt 1200 Trp Gln Ala Arg Arg Ala Ala Glu Tyr Met Thr His Ala Pro Leu Gly 385 390 395 400 gca ttg aac tct gta ggt ggt gtg gca act gag atc aac tcg gtg aac 1248 Ala Leu Asn Ser Val Gly Gly Val Ala Thr Glu Ile Asn Ser Val Asn 405 410 415 tat gtt tct ccc cgt tct tgg tta tcc act tca cat ttc tgc ctt gcg 1296 Tyr Val Ser Pro Arg Ser Trp Leu Ser Thr Ser His Phe Cys Leu Ala 420 425 430 ttc ttc ttc ttt gtt ggc cat att tgg cac tcc ggc cgc gcc c 1339 Phe Phe Phe Phe Phe Val Gly His Ile Trp His Ser Gly Arg Ala 435 440 445 <210> 12 <211> 446 <212> PRT <213> Acaryuochloris marina <400> 12 Met Lys Thr Ser Ser Ser Leu Arg Arg Phe Tyr His Val Glu Thr Pro 1 5 10 15 Phe Asn Pro Ser Ala Ala Gly Tyr Asp Arg Ala Thr Thr Gly Tyr Gly 20 25 30 Trp Trp Ala Gly Asn Ala Arg Leu Thr Asp Leu Ser Gly Gln Leu Thr 35 40 45 Gly Ala His Ile Ala His Ala Gly Met Ile Thr Phe Trp Ala Gly Ala 50 55 60 Met Thr Leu Phe Glu Val Ser His Phe Ile Pro Glu Lys Pro Met Tyr 65 70 75 80 Glu Gln Gly Ser Ile Leu Leu Ala His Leu Ala Ala Glu Gly Phe Gly 85 90 95 Val Gly Pro Gly Gly Glu Val Ile Ser Thr Tyr Pro Tyr Phe Val Ile 100 105 110 Gly Ala Ile His Leu Ile Ala Ser Ala Val Leu Gly Phe Gly Gly Leu 115 120 125 Tyr His Thr Phe Arg Gly Pro Ala Lys Phe Glu Asp Tyr Ser Asp Trp 130 135 140 Trp Gly Tyr Asp Trp Glu Asp Lys Glu Lys Met Met Gln Ile Leu Gly 145 150 155 160 Ile His Leu Ile Phe Leu Gly Ile Gly Ala Leu Ala Phe Ala Ala Lys 165 170 175 Ala Met Phe Phe Gly Gly Leu Tyr Asp Pro Trp Ala Pro Gly Gly Gly 180 185 190 Asn Val Arg Leu Ile Thr Asn Pro Thr Trp Asn Leu Gly Thr Phe Leu 195 200 205 Gly Tyr Ile T hr Arg Ser Pro Trp Gly Glu Gly Gly Trp Ile Val Ser 210 215 220 Val Asn Asn Leu Glu Asp Val Val Gly Gly His Leu Leu Val Gly Val 225 230 235 240 His Tyr Ile Phe Gly Gly Val Phe His Ile Leu Val Lys Pro Trp Gly 245 250 255 Trp Val Arg Arg Ala Tyr Val Trp Ser Gly Glu Ala Tyr Leu Ser Tyr 260 265 270 Ser Leu Gly Ala Leu Tyr Met Cys Gly Met Ile Ala Val Gly Tyr Val 275 280 280 285 Trp Phe Asn Asn Thr Val Tyr Pro Ser Glu Phe Tyr Gly Pro Thr Ala 290 295 300 Ala Glu Ala Ser Gln Ala Gln Ala Met Thr Phe Leu Ile Arg Asp Gln 305 310 315 320 Arg Leu Gly Ala Asn Ile Ala Ser Ala Gln Gly Pro Thr Gly Leu Gly 325 330 335 Lys Tyr Leu Met Arg Ser Pro Ser Gly Glu Ile Ile Phe Gly Gly Glu 340 345 350 Thr Met Arg Phe Trp Asp Phe Arg Gly Pro Trp Leu Glu Pro Leu Arg 355 360 365 365 Gly Pro Asn Gly Leu Asp Leu Asn Lys Leu Arg Asn Asp Ile Gln Pro 370 375 380 Trp Gln Ala Arg Arg Ala Ala Glu Tyr Met Thr His Ala Pro Leu Gly 385 390 395 400 Ala Leu Asn Ser Val Gly Gly Val Ala Thr Glu Ile Asn Ser Val Asn 405 410 415 Tyr Val SerPro Arg Ser Trp Leu Ser Thr Ser His Phe Cys Leu Ala 420 425 430 Phe Phe Phe Phe Val Gly His Ile Trp His Ser Gly Arg Ala 435 440 445 <210> 13 <211> 22 <212> DNA <213> Artificial Sequence <400> 13 ccaccachtg gatttggaay ct <210> 14 <211> 20 <212> DNA <213> Artificial Sequence <400> 14 gcnacnggyt trtcyttcca <210> 15 <211> 26 <212> DNA <213> Artificial Sequence <400> 15 gayathgayg giathmgiga rccigt <210> 16 <211> 26 <212> DNA <213> Artificial Sequence <400> 16 gggaagttgt gggcattrcg ytcgtg <210> 17 <211> 24 <212> DNA <213> Artificial Sequence <400> 17 tggttygayg tnctcgayga ytgg <210> 18 <211> 20 <212> DNA <213> Artificial Sequence <400> 18 ccrtgccaga krtgrccraa <210> 19 <211> 20 <212> DNA <213> Artificial Sequence <400> 19 atgggactac cytggtaycg <210> 20 <211> 20 <212> DNA <213> Artificial Sequence <400> 20 ccrtgccaga krtgrccraa

───────────────────────────────────────────────────── フロントページの続き Fターム(参考) 4B024 AA11 AA17 BA80 CA03 DA05 EA04 GA11 4B065 AA83X AC14 CA24 CA46 4H045 AA10 BA10 CA11 EA05 FA72 FA74  ──────────────────────────────────────────────────続 き Continued on the front page F term (reference) 4B024 AA11 AA17 BA80 CA03 DA05 EA04 GA11 4B065 AA83X AC14 CA24 CA46 4H045 AA10 BA10 CA11 EA05 FA72 FA74

Claims (8)

【特許請求の範囲】[Claims] 【請求項1】 以下の(a)又は(b)のタンパク質をコード
する遺伝子。 (a) 配列番号2記載のアミノ酸配列により表されるタン
パク質 (b) 配列番号2記載のアミノ酸配列において1もしくは
複数個のアミノ酸が欠失、置換、若しくは付加されたア
ミノ酸配列により表され、かつ光化学系1反応中心タン
パク質サブユニットPsaAとしての機能を有するタンパク
1. A gene encoding the following protein (a) or (b): (a) a protein represented by the amino acid sequence of SEQ ID NO: 2; (b) a protein represented by the amino acid sequence of SEQ ID NO: 2 in which one or more amino acids are deleted, substituted, or added; System 1 A protein that functions as a reaction center protein subunit PsaA
【請求項2】 以下の(a)又は(b)のタンパク質をコード
する遺伝子。 (a) 配列番号4記載のアミノ酸配列により表されるタン
パク質 (b) 配列番号4記載のアミノ酸配列において1もしくは
複数個のアミノ酸が欠失、置換、若しくは付加されたア
ミノ酸配列により表され、かつ光化学系1反応中心タン
パク質サブユニットPsaBとしての機能を有するタンパク
2. A gene encoding the following protein (a) or (b): (a) a protein represented by the amino acid sequence of SEQ ID NO: 4; (b) a protein represented by the amino acid sequence of SEQ ID NO: 4 in which one or more amino acids have been deleted, substituted, or added; System 1 A protein that functions as a reaction center protein subunit PsaB
【請求項3】 以下の(a)又は(b)のタンパク質をコード
する遺伝子。 (a) 配列番号6記載のアミノ酸配列により表されるタン
パク質 (b) 配列番号6記載のアミノ酸配列において1もしくは
複数個のアミノ酸が欠失、置換、若しくは付加されたア
ミノ酸配列により表され、かつ光化学系2反応中心タン
パク質サブユニットPsbAとしての機能を有するタンパク
3. A gene encoding the following protein (a) or (b): (a) a protein represented by the amino acid sequence of SEQ ID NO: 6; (b) a protein represented by the amino acid sequence of SEQ ID NO: 6 in which one or more amino acids are deleted, substituted, or added; System 2 A protein that functions as a reaction center protein subunit PsbA
【請求項4】 以下の(a)又は(b)のタンパク質をコード
する遺伝子。 (a) 配列番号8記載のアミノ酸配列を含むタンパク質 (b) 配列番号8記載のアミノ酸配列において1もしくは
複数個のアミノ酸が欠失、置換、若しくは付加されたア
ミノ酸配列を含み、かつ光化学系2反応中心タンパク質
サブユニットPsbDとしての機能を有するタンパク質
4. A gene encoding the following protein (a) or (b): (a) a protein containing the amino acid sequence of SEQ ID NO: 8; (b) a protein containing the amino acid sequence of SEQ ID NO: 8 in which one or more amino acids have been deleted, substituted, or added, and a photochemical system 2 reaction A protein that functions as the central protein subunit PsbD
【請求項5】 以下の(a)又は(b)のタンパク質をコード
する遺伝子。 (a) 配列番号10記載のアミノ酸配列を含むタンパク質 (b) 配列番号10記載のアミノ酸配列において1もしく
は複数個のアミノ酸が欠失、置換、若しくは付加された
アミノ酸配列を含み、かつ光化学系2コアアンテナタン
パク質Psb Bとしての機能を有するタンパク質
5. A gene encoding the following protein (a) or (b): (a) a protein comprising the amino acid sequence of SEQ ID NO: 10; (b) a protein comprising the amino acid sequence of SEQ ID NO: 10 in which one or more amino acids have been deleted, substituted, or added; A protein that functions as an antenna protein Psb B
【請求項6】 以下の(a)又は(b)のタンパク質をコード
する遺伝子。 (a) 配列番号12記載のアミノ酸配列を含むタンパク質 (b) 配列番号12記載のアミノ酸配列において1もしく
は複数個のアミノ酸が欠失、置換、若しくは付加された
アミノ酸配列を含み、かつ光化学系2コアアンテナタン
パク質Psb Cとしての機能を有するタンパク質
6. A gene encoding the following protein (a) or (b): (a) a protein comprising the amino acid sequence of SEQ ID NO: 12; (b) an amino acid sequence of SEQ ID NO: 12 comprising an amino acid sequence in which one or more amino acids have been deleted, substituted, or added; A protein that functions as an antenna protein Psb C
【請求項7】 請求項1〜6記載の遺伝子がコードする
タンパク質。
7. A protein encoded by the gene according to claim 1.
【請求項8】 アカリオクロリス属に属する原核藻類を
用いて請求項1〜6記載の遺伝子を生産する方法。
8. A method for producing the gene according to claim 1, wherein a prokaryotic algae belonging to the genus Acariochloris is used.
JP2000170696A 2000-06-07 2000-06-07 Photochemical protein capable of using chlorophyl d photoreceptive coloring matter and gene encoding the same Pending JP2001346585A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2000170696A JP2001346585A (en) 2000-06-07 2000-06-07 Photochemical protein capable of using chlorophyl d photoreceptive coloring matter and gene encoding the same

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2000170696A JP2001346585A (en) 2000-06-07 2000-06-07 Photochemical protein capable of using chlorophyl d photoreceptive coloring matter and gene encoding the same

Publications (1)

Publication Number Publication Date
JP2001346585A true JP2001346585A (en) 2001-12-18

Family

ID=18673373

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2000170696A Pending JP2001346585A (en) 2000-06-07 2000-06-07 Photochemical protein capable of using chlorophyl d photoreceptive coloring matter and gene encoding the same

Country Status (1)

Country Link
JP (1) JP2001346585A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2327769A1 (en) 2007-11-10 2011-06-01 Joule Unlimited, Inc. Hyperphotosynthetic organisms
CN102103084B (en) * 2009-12-18 2012-12-26 中国科学院烟台海岸带研究所 Instrument and method for classifying and discriminating algae based on chlorophyll analysis

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2327769A1 (en) 2007-11-10 2011-06-01 Joule Unlimited, Inc. Hyperphotosynthetic organisms
EP2615164A1 (en) 2007-11-10 2013-07-17 Joule Unlimited Technologies, Inc. Hyperphotosynthetic organisms
CN102103084B (en) * 2009-12-18 2012-12-26 中国科学院烟台海岸带研究所 Instrument and method for classifying and discriminating algae based on chlorophyll analysis

Similar Documents

Publication Publication Date Title
Tamagnini et al. Diversity of cyanobacterial hydrogenases, a molecular approach
Collins et al. Comparative sequence analyses of the 16S rRNA genes of Lactobacillus minutus, Lactobacillus rimae and Streptococcus parvulus: proposal for the creation of a new genus Atopobium
Beckenbach et al. Relationships in the Drosophila obscura species group, inferred from mitochondrial cytochrome oxidase II sequences.
Urbach et al. Rapid diversification of marine picophytoplankton with dissimilar light-harvesting structures inferred from sequences of Prochlorococcus and Synechococcus (Cyanobacteria)
Keeling et al. A non‐canonical genetic code in an early diverging eukaryotic lineage.
Kallas et al. Characterization of two operons encoding the cytochrome b6-f complex of the cyanobacterium Nostoc PCC 7906. Highly conserved sequences but different gene organization than in chloroplasts.
JPWO2005118812A1 (en) Production of astaxanthin or its metabolites using carotenoid ketolase and carotenoid hydroxylase genes
CN113817763B (en) Directed evolution method, mutant and application of beta-galactosidase family genes
Cardy et al. The methane monooxygenase gene cluster of Methylosinus trichosporium: cloning and sequencing of the mmo C gene
US20070037190A1 (en) DNA ligase mutants
CN107667853A (en) The method for creating of the common line with genic sterile of rice and application
CN113913400B (en) L-sorbosone dehydrogenase mutant with improved catalytic activity
CN108865962A (en) It is a kind of can solution expression with high efficiency 4- alpha-glycosyl transferase colibacillus engineering
CN106318924A (en) DNA (deoxyribonucleic acid) polymerase with improved catalytic DNA synthesis extension capability
CN109971728A (en) A kind of the aspergillus niger 6-4 light repair enzyme and its construction method of rite-directed mutagenesis
JP2001346585A (en) Photochemical protein capable of using chlorophyl d photoreceptive coloring matter and gene encoding the same
Goszczynski et al. Molecular divergence of Grapevine virus A (GVA) variants associated with Shiraz disease in South Africa
Lehmann et al. Molecular Cloning of the Isoquinoline 1-Oxidoreductase Genes from Pseudomonas diminuta 7, Structural Analysis of IorA and IorB, and Sequence Comparisons with Other Molybdenum-containing Hydroxylases∗
CN105112379A (en) Superoxide dismutase SOD on basis of extreme condition tolerance, method for preparing superoxide dismutase SOD and application thereof
Gelhaye et al. Identification and characterization of a third thioredoxin h in poplar
Mishra et al. Isolation and molecular characterization of hydrogenase gene from a high rate of hydrogen-producing bacterial strain Enterobacter cloacae IIT-BT 08
Koumura et al. The origin of photoactivated adenylyl cyclase (PAC), the Euglena blue-light receptor: phylogenetic analysis of orthologues of PAC subunits from several euglenoids and trypanosome-type adenylyl cyclases from Euglena gracilis
CN101812434B (en) Invertase and application of encoding gene thereof
Bailly et al. Globin gene family evolution and functional diversification in annelids
JPH10108679A (en) Pyruvate orthophosphate dikinase gene, recombinant dna and production of pyruvate orthophosphate dikinase