JP2004229650A

JP2004229650A - New protein and DNA encoding the same

Info

Publication number: JP2004229650A
Application number: JP2003127090A
Authority: JP
Inventors: Yoshihide Hayashizaki; 良英林崎; Mamoru Kamiya; 守神谷; Hideo Kubodera; 英夫久保寺
Original assignee: Mitsubishi Chemical Corp; Dnaform KK; RIKEN Institute of Physical and Chemical Research
Current assignee: Mitsubishi Chemical Corp; Dnaform KK; RIKEN Institute of Physical and Chemical Research
Priority date: 2002-05-02
Filing date: 2003-05-02
Publication date: 2004-08-19

Abstract

PROBLEM TO BE SOLVED: To analyze the base sequences of cDNA clones contained in a catalogued full-length cDNA library, to specify, for clones with novel sequences, the physiological activity of the encoded protein, and to provide the protein and a method for utilizing the DNA encoding it on the basis of that physiological activity.

SOLUTION: The protein is (a) a protein comprising a specific amino acid sequence, or (b) a protein comprising an amino acid sequence in which one or several amino acids are deleted, substituted and/or added in the specific amino acid sequence, and having a transferase activity or a transporter activity.

COPYRIGHT: (C)2004, JPO & NCIPI

Description

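【０００１】
【Technical Field of the Invention】
The present invention relates to a novel protein, DNA encoding the protein, a full-length cDNA encoding the protein, a recombinant vector carrying the DNA, an oligonucleotide consisting of a partial sequence of the DNA, a transgenic cell into which the DNA has been introduced, an antibody that specifically binds to the protein, and the like.
【０００２】
【Prior Art】
Obtaining cDNAs and analyzing their base sequences is indispensable for analyzing the physiological activity of proteins expressed in vivo and for developing uses of those proteins based on that activity. Furthermore, constructing a catalogued library of full-length cDNAs corresponding to all gene species is one of the important goals of the Human Genome Project. A catalogued library is one in which the cDNAs it contains are non-redundant, that is, a library containing exactly one copy of each cDNA species.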
【０００３】
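Full-length cDNA cloning methods are described in JP-A-9-248187 and JP-A-10-127291. The method comprises a step of binding a tag molecule to the diol structure present at the 5' cap site of mRNA, and a step of preparing RNA-DNA hybrids by reverse transcription using the tagged mRNA as a template and oligo dT as a primer and then using the function of the tag molecule to separate, from among these hybrids, those carrying DNA that corresponds to the full length of the mRNA.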
【０００４】
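As an efficient reverse transcription method, a method for carrying out the reaction at a temperature high enough that the template does not form higher-order structure has also been developed (JP-A-10-84961). In addition, a cloning vector has been developed that can clone the DNA fragments contained in a synthesized full-length cDNA library uniformly, regardless of their chain length (JP-A-11-9273).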
【０００５】
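A full-length cDNA library produced by these techniques does not contain all of its distinct members in equal proportions: some clones are highly abundant, while others are present only in trace amounts. Because clones present only in trace amounts are likely to be novel, subtraction and normalization methods for enriching such clones have also been developed (JP-A-2000-325080; Carninci, P. et al., Genomics, 37, 327-336 (1996)).
When each clone of the catalogued full-length cDNA library thus obtained is sequenced by known methods, its base sequence is identified, but the physiological activity of the protein encoded by the cDNA remains unknown.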
【０００６】
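【Problem to Be Solved by the Invention】
An object of the present invention is to analyze the base sequences of the cDNA clones contained in a catalogued full-length cDNA library, to specify, for those clones whose sequences are novel, the physiological activity of the encoded protein, and to propose methods of using the protein and the DNA encoding it on the basis of that physiological activity.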
【０００７】
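【Means for Solving the Problem】
The present inventors analyzed the base sequences of cDNA clones in a mouse full-length cDNA library and searched databases on the basis of sequence homology. In these sequences they found sequences characteristic of proteins having transferase activity or transporter activity, and they identified the proteins encoded by these cDNAs as having transferase activity or transporter activity. They also (i) analyzed the expression level of these cDNAs in each tissue and (ii) expressed the proteins encoded by the cDNAs and analyzed their interactions with other proteins, and from the results of (i) and/or (ii) they comprehensively analyzed the functions of the encoded proteins. The present invention has been accomplished on the basis of these findings.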
【０００８】
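That is, the present invention provides the inventions described in (1) to (25) below.
(1) A protein of (a) or (b) below.
(a) A protein consisting of the amino acid sequence set forth in SEQ ID NO: 4 or 5.
(b) A protein consisting of an amino acid sequence in which one or several amino acids are deleted, substituted and/or added in the amino acid sequence set forth in SEQ ID NO: 4 or 5, and having transferase activity.
【０００９】
(2) DNA encoding the protein according to (1).
(3) Full-length cDNA encoding the protein according to (1).
(4) DNA of any one of (a), (b) or (c) below.
(a) DNA having the base sequence set forth in SEQ ID NO: 1 or 2.
(b) DNA having a base sequence in which one or several bases are deleted, substituted and/or added in the base sequence set forth in SEQ ID NO: 1 or 2, and encoding a protein having transferase activity.
(c) DNA having a base sequence capable of hybridizing under stringent conditions with DNA having the base sequence set forth in SEQ ID NO: 1 or 2 or its complementary sequence, and encoding a protein having transferase activity.
【００１０】
(5) A protein of (a) or (b) below.
(a) A protein consisting of the amino acid sequence set forth in SEQ ID NO: 6.
(b) A protein consisting of an amino acid sequence in which one or several amino acids are deleted, substituted and/or added in the amino acid sequence set forth in SEQ ID NO: 6, and having transporter activity.
【００１１】
(6) DNA encoding the protein according to (5).
(7) Full-length cDNA encoding the protein according to (5).
(8) DNA of any one of (a), (b) or (c) below.
(a) DNA having the base sequence set forth in SEQ ID NO: 3.
(b) DNA having a base sequence in which one or several bases are deleted, substituted and/or added in the base sequence set forth in SEQ ID NO: 3, and encoding a protein having transporter activity.
(c) DNA having a base sequence capable of hybridizing under stringent conditions with DNA having the base sequence set forth in SEQ ID NO: 3 or its complementary sequence, and encoding a protein having transporter activity.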
【００１２】
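(9) A recombinant vector comprising the DNA according to any one of (2) to (4).
(10) A transgenic cell into which the DNA according to any one of (2) to (4) or the recombinant vector according to (9) has been introduced, or an individual consisting of such cells.
(11) The protein according to (1), produced by the cell according to (10).
【００１３】
(12) A recombinant vector comprising the DNA according to any one of (6) to (8).
(13) A transgenic cell into which the DNA according to any one of (6) to (8) or the recombinant vector according to (12) has been introduced, or an individual consisting of such cells.
(14) The protein according to (5), produced by the cell according to (13).
【００１４】
(15) An oligonucleotide selected from the group consisting of a sense oligonucleotide having the same sequence as 5 to 100 consecutive bases in the base sequence of the DNA according to any one of (2) to (4) or (6) to (8), an antisense oligonucleotide having a sequence complementary to the sense oligonucleotide, and oligonucleotide derivatives of the sense or antisense oligonucleotide.
【００１５】
(16) An antibody, or a partial fragment thereof, that specifically binds to the protein according to (1) or (11).
(17) The antibody according to (16), wherein the antibody is a monoclonal antibody.
(18) The antibody according to (17), wherein the monoclonal antibody neutralizes the transferase activity of the protein according to (1) or (11).
【００１６】
(19) An antibody, or a partial fragment thereof, that specifically binds to the protein according to (5) or (14).
(20) The antibody according to (19), wherein the antibody is a monoclonal antibody.
(21) The antibody according to (20), wherein the monoclonal antibody neutralizes the transporter activity of the protein according to (5) or (14).
【００１７】
(22) A method of screening for a substance that modulates the activity of the protein according to any one of (1), (5), (11) or (14), comprising contacting the protein with a test substance and measuring the change in the activity of the protein caused by the test substance.
(23) A method of screening for a substance that regulates expression of DNA, comprising contacting the transgenic cell according to (10) or (13) with a test substance and detecting a change in the expression level of the DNA introduced into the cell.
(24) A computer-readable recording medium storing at least one piece of amino acid sequence information selected from the amino acid sequences of the protein according to (1) or (5), and/or at least one piece of base sequence information selected from the base sequences of the DNA according to any one of (2) to (4) or (6) to (8).
(25) A carrier to which the protein according to (1) or (5), and/or the DNA according to any one of (2) to (4) or (6) to (8), is bound.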
【００１８】
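【Embodiments of the Invention】
The present invention is described in more detail below.
(1) Obtaining full-length cDNA and analyzing its base sequence
The DNA of the present invention may be any DNA capable of encoding a protein consisting of the amino acid sequence set forth in SEQ ID NOs: 4 to 6, or a protein that consists of an amino acid sequence containing substitution, deletion, insertion, addition or inversion of one or several amino acid residues in that sequence and that has transferase activity or transporter activity. The number meant by "several" is not particularly limited, but is, for example, 20 or fewer, preferably 15 or fewer, more preferably 10 or fewer, and still more preferably 5 or fewer. Specifically, the DNA may consist only of the translated region encoding the amino acid sequence, or may contain the full length of the cDNA. Here, transporter activity means the activity of specifically transporting a substance into or out of a cell. The transported substance is not particularly limited, and a protein having transporter activity may hereinafter be referred to as a "carrier", "carrier protein" or "transporter".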
【００１９】
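Specifically, examples of DNA containing the full length of the cDNA include DNA consisting of the base sequence set forth in SEQ ID NOs: 1 to 3. Examples of the translated region include those having the sequence shown at bases 78 to 2198 of SEQ ID NO: 1, bases 235 to 2631 of SEQ ID NO: 2, and bases 26 to 940 of SEQ ID NO: 3. The DNA of the present invention also includes DNA that, although not the full length of the above cDNA, contains the translated region together with the adjacent 3' and/or 5' portions minimally required for expression of the translated region.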
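The base-number ranges quoted above can be handled programmatically. The following is a minimal sketch, assuming the ranges are 1-based and inclusive (the usual sequence-listing convention, not stated explicitly in the text); the function names are hypothetical and the sequences themselves are not reproduced here.

```python
# Translated-region coordinates quoted in the text.
# Assumption: base numbers are 1-based and inclusive.
CODING_REGIONS = {
    "SEQ ID NO: 1": (78, 2198),
    "SEQ ID NO: 2": (235, 2631),
    "SEQ ID NO: 3": (26, 940),
}


def extract_region(sequence: str, start: int, end: int) -> str:
    """Return bases start..end (1-based, inclusive) of a sequence."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("region lies outside the sequence")
    return sequence[start - 1:end]


def region_summary(start: int, end: int) -> dict:
    """Length of a region and whether it is a whole number of codons."""
    length = end - start + 1
    return {"length_nt": length, "in_frame": length % 3 == 0, "codons": length // 3}


for name, (start, end) in CODING_REGIONS.items():
    print(name, region_summary(start, end))
```

All three ranges come out as whole multiples of three bases, which is consistent with their being translated regions. Whether the final codon in each range is a stop codon cannot be determined from the coordinates alone.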
【００２０】
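The DNA of the present invention may be obtained by any method capable of yielding it; specifically, it can be obtained, for example, by the method described below. First, mRNA is prepared from tissue or the like of a suitable animal, preferably a mammal, by a commonly used method known per se. Next, cDNA is synthesized using this mRNA as a template. To synthesize full-length cDNA, it is preferable to use the method (JP-A-9-248187, JP-A-10-127291) in which a tag molecule is chemically bound to the diol structure specific to the 5' cap (7MeGpppN) site, the mRNA is reverse transcribed with oligo dT as a primer, and only full-length cDNA is then separated using the function of the tag molecule. During reverse transcription, to prevent the template from forming higher-order structure that lowers the efficiency of reverse transcription, it is preferable to use the method (JP-A-10-84961) in which reverse transcription is carried out at high temperature with a thermostable reverse transcriptase in the presence of trehalose or the like. Here, high temperature means 40 to 80 °C.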
【００２１】
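The cDNA thus obtained is cloned by inserting it into a suitable cloning vector. The vector preferably used here is a linear vector that can uniformly clone DNA of various chain lengths, has recombinase recognition sequences at both ends of the cloning site, and is introduced into the host by a method other than infection (JP-A-11-9273). In the cDNA library thus obtained, not all clones are present in uniform amounts (a library in which they are is hereinafter sometimes called "catalogued"), and it is precisely the clones present only in trace amounts that are most likely to be novel. It is therefore preferable to use a subtraction or normalization method for enriching such clones (JP-A-2000-325080; Carninci, P. et al., Genomics, 37, 327-336 (1996)).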
【００２２】
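The catalogued cDNA library is sequenced by a commonly used method known per se. For the DNA of the present invention, in the case of full-length cDNA, the base sequences obtained for the terminal 100 bases were searched with BLAST (http://www.ncbi.nlm.nih.gov/BLAST/; National Center for Biotechnology Information) against databases such as NCBI GenBank, EMBL and DDBJ, and clones for which even the most homologous sequence showed an identity of 30% or less were regarded as novel and subjected to the analyses below.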
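The novelty criterion above reduces to a threshold on the identity of the best database hit. The sketch below is illustrative only: the clone names and identity values are made up, and treating a clone with no hit at all as novel is an assumption that the text does not address.

```python
from typing import Optional

NOVELTY_MAX_IDENTITY = 30.0  # percent; the threshold stated in the text


def is_novel(best_hit_identity: Optional[float]) -> bool:
    """True when the best database hit is at or below 30% identity.

    None stands for "no hit found"; counting that as novel is an assumption.
    """
    if best_hit_identity is None:
        return True
    return best_hit_identity <= NOVELTY_MAX_IDENTITY


# Hypothetical clones with the identity (%) of their best database hit.
clones = {"clone_A": 87.0, "clone_B": 30.0, "clone_C": None}
novel = [name for name, identity in clones.items() if is_novel(identity)]
print(novel)
```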
【００２３】
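Examples of DNA having the base sequence of such a full-length cDNA include DNA consisting of the base sequence set forth in SEQ ID NOs: 1 to 3. Examples of the translated region include those having the sequence shown at bases 78 to 2198 of SEQ ID NO: 1, bases 235 to 2631 of SEQ ID NO: 2, and bases 26 to 940 of SEQ ID NO: 3.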
【００２４】
The function of the protein encoded by a novel base sequence obtained in this way can be inferred by, for example, a homology search with BLAST (Basic Local Alignment Search Tool; Altschul, S. F., et al., J. Mol. Biol., 215, 403-410 (1990)) or a protein profile search with HMMPFAM (http://pfam.wustl.edu), one of the programs of HMMER (a sequence analysis method based on hidden Markov models; Eddy, S. R., Bioinformatics 14, 755-763 (1998)).
【００２５】
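In a BLAST homology search, the function of the clone under analysis can be inferred from the annotation attached to hit sequences whose homology is sufficiently significant. Here, a sufficiently significant hit is one in which the match between the functional domain portion of the registered amino acid sequence and the corresponding portion of the amino acid sequence encoded by the DNA of the present invention has an e-value of 10^-4 or less, or an identity of 30% or more.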
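The significance criterion can be written as a simple predicate. This is a sketch rather than part of the described method: the function name is hypothetical, and the example values are the e-values and identities that the description itself reports for the hits to SEQ ID NO: 3.

```python
E_VALUE_MAX = 1e-4      # "e-value of 10^-4 or less"
IDENTITY_MIN = 30.0     # "identity of 30% or more"


def is_significant_hit(e_value: float, identity_percent: float) -> bool:
    """A hit counts as significant if either stated criterion is met."""
    return e_value <= E_VALUE_MAX or identity_percent >= IDENTITY_MIN


# (accession, e-value, identity %) as reported in the description for SEQ ID NO: 3.
hits = [
    ("AC004832", 5e-109, 85.0),
    ("AF309558", 2e-86, 64.0),
    ("AF487977", 2e-84, 62.0),
]
significant = [acc for acc, e_value, identity in hits if is_significant_hit(e_value, identity)]
print(significant)
```

Note that the two criteria are joined by "or" in the text, so a hit with a weak e-value but at least 30% identity over the domain still qualifies.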
【００２６】
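For example, if most of the top-ranking functional domain hits are sequences whose function as a transferase or transporter has been confirmed, it can be predicted that the clone under analysis, being similar in sequence, has the same function, namely transferase activity or transporter activity.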
【００２７】
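HMMPFAM analyzes whether the amino acid sequence under analysis has the sequence features of entries in Pfam, a database of protein profiles. Because each profile is extracted from a group of proteins sharing the same features, a function that cannot be made clear by a one-to-one comparison of two sequences over their full length can still be found, and thus predicted, if the characteristic region is present in the sequence. Specific examples of protein function prediction carried out in this way are described below.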
【００２８】
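In a BLAST search, the amino acid sequence encoded by the base sequence set forth in SEQ ID NO: 1 hits database accession AL031678, "Human DNA sequence from clone RP4-816K17 on chromosome 20p12.2-13. Contains the TGM3 gene for transglutaminase 3", with an e-value of 0.0 and 87% identity over 442 amino acid residues; accession Q08188, human "Protein-glutamine glutamyltransferase E3 precursor", with an e-value of 0.0 and 49% identity over 705 amino acid residues; and accession Q08189, mouse "Protein-glutamine glutamyltransferase E3 precursor", with an e-value of 0.0 and 48% identity over 705 amino acid residues.
In addition, an HMMPFAM protein profile search of the amino acid sequence encoded by the base sequence of SEQ ID NO: 1 finds a sequence showing the features of the Transglutaminase-like superfamily (entered in Pfam as "Transglutamin_N"), a sequence showing the features of the Transglutaminase-like superfamily (entered in Pfam as "Transglut_core"), and a sequence showing the features of "Transglutaminase family, C-terminal ig like domain" (entered in Pfam as "Transglutamin_C"). From these results it can be inferred that the protein encoded by the base sequence of SEQ ID NO: 1 is a transferase and, in particular, has transglutaminase activity.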
【００２９】
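In a BLAST search, the amino acid sequence encoded by the base sequence set forth in SEQ ID NO: 2 hits AF357970, "Homo sapiens carnitine palmitoyltransferase IC", with an e-value of 0.0 and 83% identity over 802 amino acid residues, and database accession U88294, "Rattus norvegicus carnitine palmitoyltransferase I (CPTI)", with an e-value of 0.0 and 53% identity over 765 amino acid residues. An HMMPFAM protein profile search finds a sequence showing the features of Choline/Carnitine o-acyltransferase (entered in Pfam as "Carn_acyltransf"). From these results it can be inferred that the protein encoded by the base sequence of SEQ ID NO: 2 is a transferase and, in particular, has palmitoyltransferase activity.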
【００３０】
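In a BLAST search, the amino acid sequence encoded by the base sequence set forth in SEQ ID NO: 3 hits database accession AC004832, "Homo sapiens PAC clone RP4-539M6 from 22" (human SPF), with an e-value of 5×10^-109 and 85% identity over 221 amino acid residues; accession AF309558, "Rattus norvegicus supernatant protein factor" (rat SPF), with an e-value of 2×10^-86 and 64% identity over 221 amino acid residues; and AF487977, "Bos taurus tocopherol-associated protein", with an e-value of 2×10^-84 and 62% identity over 221 amino acid residues. An HMMPFAM protein profile search found a sequence showing the features of the CRAL/TRIO domain (entered in Pfam as "CRAL_TRIO"); proteins having this domain are thought to be carrier proteins, such as retinal-binding protein or carriers of phosphatidylcholine or alpha-tocopherol. A sequence showing the features of the CRAL/TRIO N-terminal region (entered in Pfam as CRAL/TRIO_N) is also found. The literature (PNAS 2000, 98, 2244-2249) shows that the human SPF mentioned above has a squalene transport function: human SPF binds squalene and carries it to microsomal squalene epoxidase, thereby promoting its conversion to squalene 2,3-oxide and initiating the late steps of sterol biosynthesis. The literature (BBRC 2001, 285, 295-299) also shows that human SPF has alpha-tocopherol-dependent transcription-activating activity. From these results it can be inferred that the protein encoded by the base sequence of SEQ ID NO: 3 is a carrier protein for squalene, retinaldehyde, phosphatidylcholine, alpha-tocopherol or the like, or has transcription-activating activity.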
【００３１】
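The DNA of the present invention may be obtained with a base deletion or insertion in its translated sequence. When the homology search or protein profile search described above suggests a deletion or insertion in the base sequence of the DNA, full-length cDNA free of the deletion or insertion can be obtained by methods commonly used by those skilled in the art, such as library screening or PCR cloning. The protein of the present invention can then be expressed from the full-length cDNA thus obtained and used for functional analysis.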
【００３２】
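The DNA of the present invention, obtained, sequenced and functionally annotated as described above, includes not only DNA having the base sequence set forth in SEQ ID NOs: 1 to 3 or the base sequences shown above as their translated regions, but also DNA that has a base sequence in which one or several bases are deleted, substituted and/or added in these base sequences and that encodes a protein having transferase activity or transporter activity, as well as DNA that hybridizes with these under stringent conditions and encodes a protein having transferase activity or transporter activity. The number meant by "several" here is not particularly limited, but is, for example, 60 or fewer, preferably 30 or fewer, more preferably 20 or fewer, still more preferably 10 or fewer, and particularly preferably 5 or fewer. As noted above, these DNAs also include DNA encoding a protein that consists of an amino acid sequence in which one or several amino acids are deleted, substituted and/or added in the amino acid sequence of the protein set forth in SEQ ID NOs: 4 to 6 and that has transferase activity or transporter activity.
Here, examples of DNA that hybridizes under stringent conditions include DNA containing a base sequence having 80% or more, preferably 90% or more, and more preferably 95% or more homology with the base sequence set forth in SEQ ID NOs: 1 to 3 in a BLAST analysis. Hybridization under stringent conditions can be carried out by reacting in an ordinary hybridization buffer at a temperature of 40 to 70 °C, preferably 60 to 65 °C, and then washing in a wash solution with a salt concentration of 15 mM to 300 mM, preferably 15 mM to 60 mM.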
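As a small illustration of the numeric ranges just given, the sketch below classifies a hybridization temperature and wash salt concentration against the stated and preferred windows. The function name and return labels are hypothetical; the text defines only the ranges.

```python
def classify_conditions(temp_c: float, wash_salt_mM: float) -> str:
    """Compare conditions with the ranges given in the text.

    Stated range:    40-70 degrees C, wash salt 15-300 mM.
    Preferred range: 60-65 degrees C, wash salt 15-60 mM.
    """
    preferred = 60 <= temp_c <= 65 and 15 <= wash_salt_mM <= 60
    in_range = 40 <= temp_c <= 70 and 15 <= wash_salt_mM <= 300
    if preferred:
        return "preferred"
    if in_range:
        return "within stated range"
    return "outside stated range"


print(classify_conditions(65, 15))
print(classify_conditions(42, 300))
print(classify_conditions(37, 300))
```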
【００３３】
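Furthermore, the DNA of the present invention may be DNA obtained by the method described above or DNA that has been synthesized. Substitutions in the base sequence of the DNA can readily be made with commercially available kits such as the Site-Directed Mutagenesis Kit (Takara Shuzo) or the QuikChange Site-Directed Mutagenesis Kit (Stratagene).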
【００３４】
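The base sequences set forth in SEQ ID NOs: 1 to 3 are derived from mouse. However, DNA encoding the human homolog of the protein encoded by the base sequence of SEQ ID NOs: 1 to 3 (hereinafter sometimes called "human homolog DNA") can also be obtained by preparing a human cDNA library according to the cDNA library construction method described above and hybridizing against that library with a DNA fragment having the base sequence of SEQ ID NOs: 1 to 3 as a probe. DNA that hybridizes under stringent conditions with DNA having the base sequence set forth in SEQ ID NOs: 1 to 3 or its complementary sequence includes such human homolog DNA as well as the human ortholog DNA described below.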
【００３５】
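Informatics can also be used to predict the base sequence of the human homolog DNA, and the human homolog DNA can then be obtained from the human cDNA library or the like on the basis of that sequence.
In general, methods for using informatics to predict a base sequence encoding a homolog of a protein of interest include, for example: (i) running a homology search with BLAST or the like against a cDNA database for human or another species (including cDNA databases predicted by informatics), using the base sequence of the cDNA of interest as a query; (ii) running a homology search with BLAST or the like against an EST database for human or another species, using the base sequence of the cDNA of interest as a query, and joining the sequences of the ESTs that are hit with reference to the base sequence of the cDNA of interest; and (iii) running a homology search with BLAST or the like against a genome database for human or another species, using the base sequence of the cDNA of interest as a query, to locate the position on the genome where the gene for the cDNA of interest lies, and then predicting the base sequence of the gene portion in that genomic region with Genscan (http://genes.mit.edu/GENSCAN.html), Sim4 (Genome Res., 8: 967-974 (1998)) or the like.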
【００３６】
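Any of the above methods can be used to predict the base sequence of the human homolog DNA of a mouse-derived cDNA. However, because the cDNAs having the base sequences set forth in SEQ ID NOs: 1 to 3 are all novel, method (i) is not expected to yield the base sequence of the human homolog DNA, and methods (ii) or (iii) are therefore preferably used.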
【００３７】
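On the basis of the base sequence of the human homolog DNA predicted in this way, the human homolog DNA corresponding to the DNA having the base sequence of SEQ ID NOs: 1 to 3 can also be obtained from the human cDNA library described above. Specific methods include, for example, performing PCR with the human cDNA library as a template and primers having base sequences complementary to the 5'-end and 3'-end base sequences of the predicted human homolog DNA, or hybridizing against the human cDNA library with part of the predicted human homolog DNA sequence as a probe.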
【００３８】
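In general, a similar gene whose base sequence is highly homologous to that of a gene of interest is called a "homolog", and the methods above likewise aim at obtaining human homolog DNA. In functional analysis of genes, however, it is important to confirm not only that the base sequences are similar but also that the gene obtained as a homolog is a family member of the gene of interest. A gene obtained as a "homolog" between two species may be an "ortholog", that is, the same gene evolved from a common ancestral gene, or a "paralog", that is, a different gene that arose by duplication of a common ancestral gene.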
【００３９】
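In other words, before the human-derived DNA obtained above as a homolog is regarded as having the same function as the protein of the present invention, or before the function of the protein encoded by the human-derived DNA is inferred and verified as being the same as the function of the protein of the present invention in mouse, it is preferable to confirm that the human homolog is an ortholog, in a closely related species, of the mouse gene of the present invention.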
【００４０】
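Orthology can be confirmed, for example, as follows. (i) First, the homology between the base sequence of the cDNA of the present invention and the base sequence of the human homolog DNA obtained is analyzed. Next, using the base sequence of the cDNA of the present invention as a query, a homology search is run against the human base sequences contained in international nucleotide sequence databases such as DDBJ, EMBL and GenBank and in patent databases, and it is confirmed that the identity between the human homolog DNA obtained and the query is higher than the identity between the base sequences retrieved from the databases and the query. (ii) In addition, the homology between the base sequence of the human homolog DNA obtained and the base sequence of the corresponding cDNA of the present invention is analyzed. Next, using the base sequence of the human homolog DNA obtained as a query, a homology search is run against the mouse base sequences contained in international nucleotide sequence databases such as DDBJ, EMBL and GenBank and in patent databases, and it is confirmed that the identity between the cDNA of the present invention and the query is higher than the identity between the base sequences retrieved from the databases and the query. By confirming both (i) and (ii), the human homolog DNA obtained can be identified as the human ortholog DNA corresponding to the cDNA of the present invention. The homology analyses in (i) and (ii) may instead be based on comparison of amino acid sequences, and a molecular phylogenetic tree may also be drawn for the examination. The identity obtained from the homology analyses in (i) and (ii) is preferably evaluated as identity over the full length of the query.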
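The two-way check in (i) and (ii) resembles what is often called a reciprocal-best-hit test. The following sketch assumes the identity values (in percent, over the full query length) have already been computed by a homology search; the function names and the numbers in the usage example are hypothetical.

```python
from typing import Iterable


def is_best_match(pair_identity: float, database_identities: Iterable[float]) -> bool:
    """True if the pair's identity exceeds that of every other database hit."""
    return all(pair_identity > other for other in database_identities)


def supports_orthology(
    mouse_query_vs_human_homolog: float,
    mouse_query_vs_other_human: Iterable[float],
    human_query_vs_mouse_cdna: float,
    human_query_vs_other_mouse: Iterable[float],
) -> bool:
    """Check (i) and (ii): each sequence must be the other's best match."""
    check_i = is_best_match(mouse_query_vs_human_homolog, mouse_query_vs_other_human)
    check_ii = is_best_match(human_query_vs_mouse_cdna, human_query_vs_other_mouse)
    return check_i and check_ii


# Hypothetical identities: the candidate pair scores 85%, other hits score lower.
print(supports_orthology(85.0, [62.0, 48.0], 85.0, [64.0]))
# Here another human sequence matches the mouse query better, so check (i) fails.
print(supports_orthology(85.0, [90.0], 85.0, [64.0]))
```

A strict "greater than" comparison is used because the text requires the pair's identity to be higher than that of the database sequences; ties are therefore treated as failing the check.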
【００４１】
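The function of the protein encoded by the base sequence of the human homolog DNA or ortholog DNA obtained in this way can be inferred by running a BLAST homology search, an HMMPFAM protein profile search or the like on that base sequence.
Furthermore, the protein of the present invention can be expressed from the full-length cDNA of the human homolog DNA or human ortholog DNA obtained, and used for confirmation of activity, functional analysis and the like.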
【００４２】
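(2) Proteins encoded by the novel cDNAs
The translated region of the protein encoded by the DNA of the present invention can be determined, for example, by converting the base sequence of the DNA into amino acids in each of the three reading frames and taking the range that encodes the longest polypeptide as the translated region, which fixes the amino acid sequence. Examples of such amino acid sequences include those set forth in SEQ ID NOs: 4 to 6. The protein of the present invention is not limited to the above amino acid sequences and also includes proteins that consist of an amino acid sequence in which one or several amino acids are substituted, deleted and/or added in those sequences and that have transferase activity or transporter activity.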
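The three-frame procedure described above can be sketched as follows. This is a minimal illustration, not the inventors' implementation: it assumes the standard genetic code, requires each candidate region to begin with ATG and end at a stop codon, and ignores open reading frames that run off the end of the sequence. The function name and the short test sequence are hypothetical.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def longest_orf(dna: str):
    """Return (start, end, peptide) of the longest ATG..stop region.

    Coordinates are 1-based and inclusive, and the end includes the stop codon.
    Only the three forward reading frames are examined, as in the text.
    Returns (0, 0, "") when no complete region is found.
    """
    dna = dna.upper()
    best = (0, 0, "")
    for frame in range(3):
        start = None
        peptide = []
        for i in range(frame, len(dna) - 2, 3):
            aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an ambiguous codon
            if start is None:
                if aa == "M":
                    start, peptide = i, ["M"]
            elif aa == "*":
                if len(peptide) > len(best[2]):
                    best = (start + 1, i + 3, "".join(peptide))
                start, peptide = None, []
            else:
                peptide.append(aa)
    return best


# Hypothetical toy sequence: ATG AAA TTT TAG embedded at bases 3 to 14.
print(longest_orf("CCATGAAATTTTAGGG"))
```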
【００４３】
本発明のタンパク質の取得方法としては、上記（１）に記載の本発明のＤＮＡを適当な方法により転写／翻訳する方法が好ましく用いられる。具体的には、適当な発現用ベクター若しくは適当なベクターに適当なプロモーターとともに挿入した組換えベクターを作製し、この組換えベクターで適当な宿主微生物を形質転換したり、適当な培養細胞に導入することにより発現させ、これを精製することにより取得することができる。
【００４４】
かくして得られるタンパク質が遊離体で得られた場合には、公知の方法あるいはそれに準じる方法によって塩に変換することができ、逆に塩で得られた場合には遊離体、又は他の塩に変換することができる。この様な本発明のタンパク質の塩も本発明のタンパク質に含まれる。また、上記形質転換体が産生するタンパク質を、精製前、又は後に適当なタンパク質修飾酵素を作用させることにより、任意に修飾を加えたり、ポリペプチドを部分的に除去することにより修飾タンパク質とすることができる。これらの修飾タンパク質も上記したトランスフェラーゼ活性または運搬体活性を有するものであれば本発明の範囲に含まれる。
【００４５】
本発明のタンパク質の産生を行う際、本発明のＤＮＡを含む組換えベクターの作製に用いるベクターとしては、形質転換体内で該ＤＮＡが発現されるものであれば特に制限はなく、プラスミドベクター、ファージベクターのいずれでもよい。これらのうち通常は、該ＤＮＡが導入される宿主に適したプロモーター等の発現制御領域ＤＮＡが既に挿入されている市販のタンパク質発現用ベクターを用いる。このようなタンパク質発現用ベクターとして、具体的には例えば、宿主が大腸菌の場合では、ｐＥＴ３、ｐＥＴ１１（ストラタジーン社製）ｐＧＥＸ（アマシャムファルマシアバイオテク社製）等が挙げられ、酵母の場合ではｐＥＳＰ−Ｉエクスプレッションベクター（ストラタジーン社製）等が挙げられ、さらに昆虫細胞の場合ではＢａｃＰＡＫ６（クロンテック社製）等が用いられる。また宿主が動物細胞の場合では、ＺＡＰＥｘｐｒｅｓｓ（ストラタジーン社製）、ｐＳＶＫ３（アマシャムファルマシアバイオテク社製）等が挙げられる。
【００４６】
発現制御領域が挿入されていないベクターを用いる場合には、発現制御領域として少なくともプロモーターを挿入する必要がある。ここでプロモーターとしては、宿主微生物、または培養細胞が保有するプロモーターを用いることができるが、これに限られるものではなく、具体的には例えば、宿主が大腸菌の場合にはＴ３、Ｔ７、ｔａｃ、ｌａｃプロモーター等を用いることができ、酵母の場合にはｎｍｔ１プロモーター、Ｇａｌ１プロモーター等を用いることができる。また宿主が動物細胞の場合にはＳＶ４０プロモーター、ＣＭＶプロモーター等が好ましく用いられる。
【００４７】
また哺乳動物由来のプロモーターが機能可能な宿主を用いる場合には、本発明の遺伝子に固有のプロモーターを用いることもできる。これらのベクターへの本発明のＤＮＡの挿入は、該ＤＮＡまたはこれを含むＤＮＡ断片をベクター中のプロモーターの下流に該遺伝子ＤＮＡがコードするタンパク質のアミノ酸配列を連結して行えばよい。
【００４８】
このようにして作製した組換えベクターは、それ自体既知の方法により後述する宿主を形質転換して、ＤＮＡ導入体を作製することができる。宿主への該ベクターの導入方法として、具体的には、ヒートショック法（Ｊ．Ｍｏｌ．Ｂｉｏｌ．，５３，１５４，（１９７０））、リン酸カルシウム法（Ｓｃｉｅｎｃｅ，２２１，５５１，（１９８３））、ＤＥＡＥデキストラン法（Ｓｃｉｅｎｃｅ，２１５，１６６，（１９８２））、インビトロパッケージング法（Ｐｒｏｃ．Ｎａｔｌ．Ａｃａｄ．Ｓｃｉ．ＵＳＡ，７２，５８１，（１９７５））、ウィルスベクター法（Ｃｅｌｌ，３７，１０５３，（１９８４））、および電気パルス法（Ｃｈｕ．ｅｔａｌ．，Ｎｕｃ．ＡｃｉｄｓＲｅｓ．，１５，１３３１（１９８７））等が挙げられる。
【００４９】
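The host used to produce the DNA transfectant is not particularly limited as long as the DNA of the present invention is expressed in it; examples include E. coli, yeast, baculovirus (arthropod polyhedrosis virus)-insect cell systems, and animal cells. Specific examples include BL21 and XL-2Blue (Stratagene) for E. coli, SP-Q01 (Stratagene) for yeast, and AcNPV (J. Biol. Chem., 263, 7406 (1988)) and its host Sf-9 cells (J. Biol. Chem., 263, 7406 (1988)) for baculovirus. Examples of animal cells include mouse fibroblast C127 cells (J. Virol., 26, 291 (1978)) and Chinese hamster ovary CHO cells (Proc. Natl. Acad. Sci. USA, 77, 4216 (1980)); in view of expression level and ease of screening, African green monkey kidney-derived COS-7 cells (ATCC CRL1651; American Type Culture Collection), human embryonic kidney-derived HEK293 cells (ATCC CRL1573) or human cervical cancer HeLa cells (ATCC CCL-2) are preferably used.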
【００５０】
上記したようなタンパク質発現用ベクターを用いる発現方法の他に、プロモーターを連結した本発明のＤＮＡ断片を宿主微生物の染色体中に直接挿入する相同組換え技術（Ａ．Ａ．Ｖｅｒｔｅｓｅｔａｌ．，Ｂｉｏｓｃｉ．Ｂｉｏｔｅｃｈｎｏｌ．Ｂｉｏｃｈｅｍ．，５７，２０３６，（１９９３））、あるいはトランスポゾンや挿入配列（Ａ．Ａ．Ｖｅｒｔｅｓｅｔａｌ．，ＭｏｌｅｃｕｌａｒＭｉｃｒｏｂｉｏｌ．，１１，７３９，（１９９４））等を用いてＤＮＡ導入体を作製することもできる。
【００５１】
得られた培養物は細胞、あるいは菌体を遠心分離等の方法により収集し、これを適当な緩衝液に懸濁し、超音波、リゾチーム、および／または凍結融解等のそれ自体既知の適当な方法により破壊した後、遠心分離や濾過等によりタンパク質粗精製液を得、さらに適当な精製方法を組み合わせることにより精製することができる。かくして、本発明のタンパク質が取得される。上記したタンパク質発現組換えベクターを用いる発現方法の他に、上記（１）で取得された本発明のＤＮＡを無細胞転写翻訳系に供することによりタンパク質発現を誘導し、本発明のタンパク質を取得することができる。本発明で用いられる無細胞転写翻訳系とは、ＤＮＡからｍＲＮＡへの転写、およびｍＲＮＡからタンパク質への翻訳に必要な全ての要素を含む系であり、そこにＤＮＡを加えることによってそのＤＮＡがコードしているタンパク質が合成されるようなあらゆる系を指す。無細胞転写翻訳系の具体例としては、真核細胞、およびバクテリア細胞、又はそれらの一部からの抽出液に基づいて調製された転写翻訳系が挙げられ、特に好ましい具体例としては、ウサギ網状赤血球、小麦胚芽、大腸菌からの抽出液（大腸菌Ｓ３０抽出液）に基づいて調製された転写翻訳系が挙げられる。
【００５２】
得られた無細胞転写翻訳系の転写翻訳産物からの、本発明のタンパク質の分離、および精製は、それ自体既知の通常用いられる方法で行うことができる。具体的には、例えばエピトープペプチド、ポリヒスチジンペプチド、グルタチオン−Ｓ−トランスフェラーゼ（ＧＳＴ）、マルトース結合タンパク質等をコードするＤＮＡ領域を、前記した転写翻訳されるべきＤＮＡに導入し、前記の通り発現させ、該タンパク質と親和性を有する物質とのアフィニティーを利用して精製することができる。
【００５３】
目的とするタンパク質の発現は、ＳＤＳ−ポリアクリルアミドゲル電気泳動等で分離し、クマシーブリリアントブルー（シグマ社製）等で染色するか、または後述する本発明のタンパク質に特異的に結合する抗体により検出する方法等によって確認できる。また一般的に、発現されたタンパク質は生体内に存在するタンパク質分解酵素により切断されること（プロセッシング）が知られている。本発明のタンパク質も当然のことながら切断されたアミノ酸配列の部分断片であっても、トランスフェラーゼ活性または運搬体活性を有するものであれば、本発明のタンパク質に含まれる。
かくして得られたタンパク質は、他のタンパク質、ＤＮＡとの相互作用等を解析することにより、生体内における多面的な機能を知ることができる。上記相互作用の解析法としては、それ自体既知の常法を用いることができるが、具体的には、例えば、酵母ツーハイブリッド法、蛍光偏光解消法、表面プラズモン法、ファージディスプレイ法、リボソーマルディスプレイ法等が挙げられる。
【００５４】
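(3) Preparation of oligonucleotides and functional analysis using them
Using the DNA of the present invention obtained by the method described in (1) above, or a fragment of it, oligonucleotides such as antisense oligonucleotides and sense oligonucleotides having a partial sequence of the DNA of the present invention can be prepared by conventional methods, for example with a DNA synthesizer.
【００５５】
Examples of such oligonucleotides include DNA having the same sequence as 5 to 100 consecutive bases in the base sequence of the above DNA, and DNA having a sequence complementary to that DNA. Specific examples include DNA having the same sequence as 5 to 100 consecutive bases in the base sequence represented by any of SEQ ID NOs: 1 to 3, and DNA having a sequence complementary to that DNA. When the oligonucleotides are used as a sense primer and an antisense primer, oligonucleotides as described above whose melting temperatures (Tm) and numbers of bases do not differ greatly from each other are preferred. The length of the sequence is generally 5 to 100 bases, preferably 10 to 60 bases, and more preferably 15 to 50 bases.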
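The relationship between a sense oligonucleotide, its antisense counterpart and a rough melting temperature can be illustrated with a short sketch. The helper names are hypothetical, and the Tm estimate uses the simple Wallace rule, 2 × (A+T) + 4 × (G+C), which is an assumption introduced here: the text does not say how Tm is to be calculated, and this rule is only a rough guide for short oligonucleotides.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def sense_oligo(dna: str, start: int, length: int) -> str:
    """Return `length` consecutive bases beginning at 1-based position `start`."""
    if not 5 <= length <= 100:
        raise ValueError("length must be 5 to 100 bases, as stated in the text")
    if start < 1 or start - 1 + length > len(dna):
        raise ValueError("oligonucleotide runs outside the sequence")
    return dna[start - 1:start - 1 + length].upper()


def reverse_complement(seq: str) -> str:
    """Antisense strand of a DNA sequence, written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def wallace_tm(seq: str) -> int:
    """Rough Tm in degrees C by the Wallace rule (an assumption, see lead-in)."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


# Hypothetical template, used only to exercise the helpers.
sense = sense_oligo("ACGTACGTAC", 2, 5)
antisense = reverse_complement(sense)
print(sense, antisense, wallace_tm(sense), wallace_tm(antisense))
```

Comparing the two Tm values is one simple way to check the stated preference that the sense and antisense primers should not differ greatly.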
【００５６】
また、これらオリゴヌクレオチドの誘導体も本発明のオリゴヌクレオチドとして利用することができる。該オリゴヌクレオチド誘導体としては、オリゴヌクレオチド中のリン酸ジエステル結合がホスホロチオエート結合に変換されたオリゴヌクレオチド誘導体、オリゴヌクレオチド中のリン酸ジエステル結合がＮ３’−Ｐ５’ホスフォアミデート結合に変換されたオリゴヌクレオチド誘導体、オリゴヌクレオチド中のリボースとリン酸ジエステル結合がペプチド核酸結合に変換されたオリゴヌクレオチド誘導体、オリゴヌクレオチド中のウラシルがＣ−５プロピニルウラシルで置換されたオリゴヌクレオチド誘導体、オリゴヌクレオチド中のウラシルがＣ−５チアゾールウラシルで置換されたオリゴヌクレオチド誘導体、オリゴヌクレオチド中のシトシンがＣ−５プロピニルシトシンで置換されたオリゴヌクレオチド誘導体、オリゴヌクレオチド中のシトシンがフェノキサジン修飾シトシン（ｐｈｅｎｏｘａｚｉｎｅ−ｍｏｄｉｆｉｅｄｃｙｔｏｓｉｎｅ）で置換されたオリゴヌクレオチド誘導体、オリゴヌクレオチド中のリボースが２’−Ｏ−プロピルリボースで置換されたオリゴヌクレオチド誘導体、あるいはオリゴヌクレオチド中のリボースが２’−メトキシエトキシリボースで置換されたオリゴヌクレオチド誘導体等をあげることができる。
【００５７】
また、本発明のオリゴヌクレオチドは、これを２本鎖ＲＮＡとして調製し、被導入体へ導入し、標的遺伝子の発現を阻害するＲＮＡインターフェアレンス法（以下、これを「ＲＮＡｉ法」と称することがある）に用いることができる。ＲＮＡインターフェアレンス法については、例えば、（Ｅｌｂａｓｈｉｒ，Ｓ．，ｅｔａｌ．，Ｎａｔｕｒｅ，４１１，４９４−４９８（２００１））に記載の方法等を用いることができる。また、上記２本鎖ＲＮＡは必ずしも全てがＲＮＡである必要はなく、例えば、ＷＯ０２／１０３７４号公報に記載のもの等も用いることができる。
【００５８】
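Here, the target gene may be any DNA of the present invention. A double-stranded RNA consisting of a sequence substantially identical to at least part of the base sequence of such DNA (hereinafter sometimes called a "double-stranded polynucleotide") consists of a sequence substantially identical to a stretch of 15 bp or more taken from any part of the base sequence of the target gene. "Substantially identical" here means having 80% or more homology with the sequence of the target gene. The chain length may be anything from 15 bp up to the full length of the open reading frame (ORF) of the target gene, but a length of about 15 to 500 bp is preferably used. In mammalian cells, however, a signal transduction system is known that is activated in response to long double-stranded RNA of 30 bp or more. This is called the interferon response (Marcus, P. I., et al., Interferon, 5, 115-180 (1983)). When such double-stranded RNA enters a cell, translation initiation of many genes is non-specifically inhibited via PKR (dsRNA-responsive protein kinase: Bass, B. L., Nature, 411, 428-429 (2001)), and at the same time RNase L is activated via 2',5'-oligoadenylate synthetase (Bass, B. L., Nature, 411, 428-429 (2001)), triggering non-specific degradation of intracellular RNA. These non-specific reactions mask the specific response of the target gene. Therefore, when a mammal, or cells or tissue derived from a mammal, is used as the recipient, it is preferable to use a double-stranded polynucleotide of 15 to 30 bp, preferably 19 to 24 bp, and more preferably 21 bp. The double-stranded polynucleotide need not be double-stranded over its entire length and includes polynucleotides with a partially protruding 5' or 3' end, although one with a partially protruding 3' end is preferred. A double-stranded polynucleotide means a duplex of two complementary polynucleotide strands, but it may also be a self-complementary single-stranded polynucleotide that has annealed to itself. Examples of self-complementary single-stranded polynucleotides include those having an inverted repeat sequence.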
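To make the length recommendations concrete, the sketch below enumerates candidate sense/antisense RNA strand pairs of a chosen length along a target sequence. It is only an illustration under stated assumptions: the function name is hypothetical, strands are fully complementary (no 3' overhang is modeled), and no selection rules beyond the 15 to 30 bp window for mammalian recipients are applied.

```python
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def candidate_duplexes(target_dna: str, length: int = 21):
    """Yield (start, sense_rna, antisense_rna) for every window of `length` bases.

    `start` is 1-based. The default of 21 follows the "more preferably 21 bp"
    recommendation for mammalian recipients; 15 to 30 is the allowed window.
    """
    if not 15 <= length <= 30:
        raise ValueError("length should be 15 to 30 bp for mammalian recipients")
    rna = target_dna.upper().replace("T", "U")
    for i in range(len(rna) - length + 1):
        sense = rna[i:i + length]
        antisense = sense.translate(RNA_COMPLEMENT)[::-1]  # written 5' to 3'
        yield i + 1, sense, antisense


# Hypothetical 23-base target, used only to exercise the generator.
target = "ATGC" * 5 + "ATG"
for start, sense, antisense in candidate_duplexes(target):
    print(start, sense, antisense)
```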
【００５９】
２本鎖ポリヌクレオチドの調製方法としては、特に制限はないが、それ自体既知の化学合成方法を用いることが好ましい。化学合成は、相補性を有する１本鎖ポリヌクレオチドを別個に合成し、これを適当な方法で会合させることにより２本鎖とすることができる。会合の方法として具体的には、例えば、合成した１本鎖ポリヌクレオチドを混合し、２本鎖が解離する温度にまで加熱し、その後徐々に冷却する方法等が挙げられる。会合した２本鎖ポリヌクレオチドは、アガロースゲル等を用いて確認し、残存する１本鎖ポリヌクレオチドを適当な酵素により分解する等して除去する。
【００６０】
このようにして調製した２本鎖ポリヌクレオチドを導入する被導入体としては、標的遺伝子がその細胞内でＲＮＡに転写、またはタンパク質に翻訳を受け得るものであれば如何なるものであってもよいが、具体的には、植物、動物、原生動物、ウィルス、バクテリア、または真菌種に属するものが挙げられる。植物は単子葉植物、双子葉植物または裸子植物であってよく、動物は、脊椎動物または無脊椎動物であってよい。好ましい微生物は、農業または工業で使用されるものであり、そして植物または動物に対して病原性のものである。真菌には、カビ及び酵母形態両方での生物体が含まれる。脊椎動物の例には、魚類、ウシ、ヤギ、ブタ、ヒツジ、ハムスター、マウス、ラット及びヒトを含む哺乳動物が含まれ、無脊椎動物には、線虫類及び他の虫類、キイロショウジョウバエ（Ｄｒｏｓｏｐｈｉｌａ）、及び他の昆虫が含まれる。好ましくは、細胞は脊椎動物細胞である。
【００６１】
被導入体は、細胞、組織、あるいは個体を意味する。ここで細胞とは、生殖系列または体性、分化全能、または多分化能、分割または非分割、実質組織または上皮、不滅化したものまたは形質転換したもの等からであってよい。細胞は、配偶子または胚であってよく、胚の場合、単一細胞胚または構成性細胞、または多重細胞胚からの細胞であり、胎児組織を含む。さらには、幹細胞のような未分化細胞、または胎児組織を含む器官または組織の細胞からのような分化細胞、または生物内に存在する任意の他の細胞であってよい。分化している細胞型には、脂肪細胞、繊維芽細胞、筋細胞、心筋細胞、内皮細胞、神経細胞、グリア、血液細胞、巨核球、リンパ球、マクロファージ、好中球、好酸球、好塩基球、マスト細胞、白血球、顆粒球、ケラチン生成細胞、軟骨細胞、骨芽細胞、破骨細胞、肝細胞及び内分泌腺または外分泌腺の細胞が含まれる。
【００６２】
被導入体への２本鎖ポリヌクレオチドの導入法としては、被導入体が細胞、あるいは組織の場合は、カルシウムフォスフェート法、エレクトロポレーション法、リポフェクション法、ウィルス感染、２本鎖ポリヌクレオチド溶液への浸漬、あるいは形質転換法等が用いられる。また、胚に導入する方法としては、マイクロインジェクション、エレクトロポレーション法、あるいはウィルス感染等が挙げられる。被導入体が植物の場合には、植物体の体腔または間質細胞等への注入または灌流、あるいは噴霧による方法が用いられる。また、動物個体の場合には、経口、局所、非経口（皮下、筋肉内及び静脈内投与を含む）、経膣、経直腸、経鼻、経眼、腹膜内投与等によって全身的に導入する方法、あるいはエレクトロポレーション法やウィルス感染等が用いられる。経口導入のための方法には、２本鎖ポリヌクレオチドを生物の食物と直接混合することができる。さらに、個体に導入する場合には、例えば埋め込み長期放出製剤等として投与することや、２本鎖ポリヌクレオチドを導入した導入体を摂取させることにより行うこともできる。
【００６３】
導入する２本鎖ポリヌクレオチドの量は、導入体や、標的遺伝子によって適宜選択することができるが、細胞あたり少なくとも１コピー導入されるに充分量を導入することが好ましい。具体的には、例えば、被導入体がヒト培養細胞で、カルシウムフォスフェート法により２本鎖ポリヌクレオチドを導入する場合、０．１〜１０００ｎＭが好ましい。
ＲＮＡインターフェアレンスによる本発明の遺伝子の導入体内での発現抑制により、本発明の遺伝子がコードするタンパク質の機能の確認、あるいは新たな機能の解析等を行うことができる。
【００６４】
（４）本発明のタンパク質に特異的に結合する抗体
本発明のタンパク質と特異的に結合する抗体の調製方法としては、通常用いられる公知の方法を用いることができ、抗原として用いられるポリペプチドについても、公知の方法に従って抗原性が高くエピトープ（抗原決定基）として適した配列を選択して用いることができる。エピトープの選択方法としては、例えばＥｐｉｔｏｐｅＡｄｖｉｓｅｒ（富士通九州システムエンジニアリング社製）等の市販のソフトウェアを用いることができる。
【００６５】
上記の抗原として用いるポリペプチドは、公知の方法に従って合成した合成ペプチドでも、また本発明のタンパク質そのものを用いることもできる。抗原となるポリペプチドは、公知の方法に従って適当な溶液等に調製して、哺乳動物、例えばウサギ、マウス、ラット等に免疫を行えばよいが、安定的な免疫を行ったり抗体価を高めるために抗原ペプチドを適当なキャリアタンパク質とのコンジュゲートにして用いたり、アジュバント等を加えて免疫を行うのが好ましい。
【００６６】
免疫に際しての抗原の投与経路は特に限定されず、例えば皮下、腹腔内、静脈内、あるいは筋肉内等のいずれの経路を用いてもよい。具体的には、例えばＢＡＬＢ／ｃマウスに抗原ポリペプチドを数日〜数週間おきに数回接種する方法等が用いられる。また、抗原の摂取量としては、抗原がポリペプチドの場合０．３〜０．５ｍｇ／１回程度が好ましいが、ポリペプチドの種類、また免疫する動物種によっては適宜調節される。
【００６７】
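After immunization, test bleeds are taken as appropriate, the rise in antibody titer is confirmed by a method such as enzyme-linked immunosorbent assay (hereinafter sometimes called "ELISA") or Western blotting, and blood is collected from animals whose antibody titer has risen sufficiently. Polyclonal antibodies can be obtained by subjecting this blood to the appropriate processing used for antibody preparation. Specifically, for example, purified antibody can be obtained by purifying the antibody component from serum according to known methods. The antibody component can be purified by methods such as dialysis, ion-exchange chromatography and affinity chromatography.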
【００６８】
また、該動物の脾臓細胞とミエローマ細胞とを用いて公知の方法に従って融合させたハイブリドーマを用いる（Ｍｉｌｓｔｅｉｎ，ｅｔａｌ．，Ｎａｔｕｒｅ，２５６，４９５（１９７５））ことによりモノクローナル抗体を作製することもできる。モノクローナル抗体は、例えば以下の方法により取得することができる。
【００６９】
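First, antibody-producing cells are obtained from an animal whose antibody titer has risen following immunization with the antigen described above. The antibody-producing cells are plasma cells and their precursors, lymphocytes. They may be obtained from any part of the individual, but are preferably obtained from the spleen, lymph nodes, peripheral blood or the like. As the myeloma to be fused with these cells, established cell lines obtained from mouse are generally used; for example, the 8-azaguanine-resistant mouse (BALB/c-derived, etc.) myeloma cell lines P3X63-Ag8.653 (ATCC: CRL-1580) and P3-NS1/1Ag4.1 (RIKEN Cell Bank: RCB0095) are preferably used. Cell fusion can be carried out by mixing the antibody-producing cells and the myeloma cells at an appropriate ratio and using a suitable cell fusion medium, for example RPMI1640, Iscove's modified Dulbecco's medium (IMDM) or Dulbecco's modified Eagle's medium (DMEM), in which 50% polyethylene glycol (PEG) has been dissolved. Fusion can also be carried out by electrofusion (U. Zimmermann et al., Naturwissenschaften, 68, 577 (1981)).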
【００７０】
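Hybridomas can be selected, taking advantage of the fact that the myeloma cell line used is 8-azaguanine resistant, by culturing for an appropriate time at 37 °C in 5% CO2 in normal medium containing an appropriate amount of hypoxanthine-aminopterin-thymidine (HAT) solution (HAT medium). The selection method can be chosen as appropriate for the myeloma cell line used. The titer of the antibodies produced by the selected hybridomas is analyzed by the methods described above, hybridomas producing high-titer antibody are isolated by limiting dilution or the like, the isolated fused cells are cultured in a suitable medium, and monoclonal antibody is obtained by purifying the resulting culture supernatant by a suitable method such as ammonium sulfate fractionation or affinity chromatography. A commercially available monoclonal antibody purification kit can also be used for purification. Furthermore, ascites containing a large amount of the monoclonal antibody of the present invention can be obtained by growing the antibody-producing hybridoma obtained above in the peritoneal cavity of an animal of the same strain as the immunized animal, or of a nude mouse or the like.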
【００７１】
また、本発明のタンパク質としてヒト由来のものを取得した場合には、かかるポリペプチド、あるいはその部分ペプチドを抗原として、ヒト末梢血リンパ球を移植したＳｅｖｅｒｅｃｏｍｂｉｎｅｄｉｍｍｕｎｅｄｅｆｉｃｉｅｎｃｙ（ＳＣＩＤ）マウスに上記した方法と同様にして免疫し、該免疫動物の抗体産生細胞とヒトのミエローマ細胞とのハイブリドーマを作製することによってもヒト型抗体を作製することができる（Ｍｏｓｉｅｒ，Ｄ．Ｅ．，ｅｔａｌ．Ｎａｔｕｒｅ，３３５，２５６−２５９（１９８８）；Ｄｕｃｈｏｓａｌ，Ｍ．Ａ．，ｅｔａｌ．，Ｎａｔｕｒｅ，３５５，２５８−２６２（１９９２））。
【００７２】
また、取得したヒト型抗体を産生するハイブリドーマからＲＮＡを抽出し、目的のヒト型抗体をコードする遺伝子をクローニングして、この遺伝子を適当なベクターに挿入し、これを適当な宿主に導入して発現させることにより、さらに大量にヒト型抗体を作製することができる。ここで、抗原との結合性の低い抗体は、それ自体既知の進化工学的手法を用いることによりさらに結合性の高い抗体として取得することもできる。一価性抗体等の部分フラグメントは、例えばパパイン等を用いてＦａｂ部分とＦｃ部分を切断し、アフィニティカラム等を用いてＦａｂ部分を回収することによって作製することができる。
【００７３】
かくして得られる本発明のタンパク質と特異的に結合する抗体は、本発明のタンパク質に特異的に結合することによって該タンパク質が有するトランスフェラーゼ活性または運搬体活性を阻害する中和抗体として用いることもできる。タンパク質が有する活性を阻害するものの選択方法としては特に制限はないが、例えば、上記（２）で作製したＤＮＡ導入体に抗体を接触させ、導入体中の目的タンパク質の機能が阻害されるか否かを解析する方法等が挙げられる。
【００７４】
かかる中和抗体は、臨床へ応用するに際し、上記有効成分を単独で用いることも可能であるが、薬学的に許容され得る担体と配合して医薬品組成物として用いることもできる。この時の有効成分の担体に対する割合は、１〜９０重量％の間で変動され得る。また、かかる薬剤は種々の形態で投与することができ、それらの投与形態としては、錠剤、カプセル剤、顆粒剤、散剤、あるいはシロップ剤等による経口投与、または注射剤、点滴剤、リポソーム剤、坐薬剤等による非経口投与を挙げることができる。また、その投与量は、症状、年齢、体重等によって適宜選択することができる。
【００７５】
（５）本発明のタンパク質が有する活性の確認および機能の解析
本発明のタンパク質は、これを上記（２）に記載のとおり組み換えタンパク質として作製し、これを解析することにより上記（１）で推測した活性を有していることを確認することができる。さらに上記（４）のとおりに作製した抗体等との組み合わせにより解析することもできる。
【００７６】
本発明のタンパク質が、トランスフェラーゼ活性を有することは、それ自体既知の通常用いられる活性測定方法により解析することができる。
【００７７】
具体的には、基質となる物質を該組み換えタンパク質に接触させ、該組み換えタンパク質のトランスフェラーゼ活性により、基質の有する、トランスフェラーゼが目的とする基が他に転移されることを測定する方法等が挙げられる。例えば、β１，４‐ガラクトシルトランスフェラーゼの場合、基質としてＵＤＰ‐ガラクトース、ピリジルアミノ標識ＧｌｃＮＡｃ２Ｍａｎ３ＧｌｃＮＡｃ２を用い、中性（ｐＨ７．４、ｓｏｄｉｕｍｃａｃｏｄｙｌａｔｅ）緩衝液中、１０ｍＭＭｎＣｌ２存在下で該タンパク質と接触させ、ガラクトースのピリジルアミノ標識ＧｌｃＮＡｃ２Ｍａｎ３ＧｌｃＮＡｃ２への転移を、ＨＰＬＣにて同定、定量する。（Ｕｅｊｉｍａ，Ｔ．ｅｔａｌ．，ＣａｎｃｅｒＲｅｓ．，５２，６１５８）
【００７８】
また、本発明のタンパク質が、運搬体タンパク質としての運搬体活性を有することは、それ自体既知の通常用いられる活性測定法により解析することが出来る。運搬体は、輸送される物質により、ヨード運搬体（Ｎａｔｕｒｅ，３７９：４５８−４５９（１９９６））、グルコース運搬体（Ｎａｔｕｒｅ，３３０：３７９−３８１（１９８７））、多ビタミン運搬体（Ｊ．Ｂｉｏｌ．Ｃｈｅｍ．，２７３：１４８７５−１４８８３（１９９８））、コリン運搬体（特開２００１−１３６９７６号公報）、イオン運搬体（中村桂子他監訳，「細胞の分子生物学」，第３版，教育社，１９９５年７月，ｐ．５１２−５２２）などに分類されるが、これらの輸送される物質の量を指標として活性を測定すればよい。例えばＳｑｕａｌｅｎｅをミクロゾームのｓｑｕａｌｅｎｅｅｐｏｘｉｄｅへ運搬する活性は、文献情報（ＰＮＡＳ２０００，９８，２２４４−２２４９）に記載の方法を用いて、Ｓｑｕａｌｅｎｅのｓｑｕａｌｅｎｅ２，３−ｏｘｉｄｅへの変換を指標として測定すれば良い。具体的には、本発明のタンパク質と^１４Ｃ標識したＳｑｕａｌｅｎｅ、ＦＡＤ、ＮＡＤＰＨ、およびミクロゾーム分画とを混合しインキュベート後、生成物を鹸化し、脂質を抽出した後、薄層クロマトグラフィーを行いイメージアナライザーで測定し^１４Ｃ標識した脂質の変換を検出することができる。
【００７９】
本発明のタンパク質であるトランスフェラーゼまたは運搬体が有する活性は、上記の通り確認することができるが、これらの方法に限定されるものではない。また、これらの活性測定系は、後述する本発明のタンパク質の機能賦活物質（アゴニストなど）や機能阻害物質（アンタゴニストなど）のスクリーニングや本発明のタンパク質の発現調節物質のスクリーニングにも用いることができる。
【００８０】
また、本発明のタンパク質の機能解析の方法として一般的には、例えば、（ｉ）各組織、疾患、あるいは発生段階における発現状態を比較解析する方法、（ｉｉ）他のタンパク質、ＤＮＡとの相互作用を解析する方法、（ｉｉｉ）適当な細胞あるいは個体へ導入して表現型の変化を解析する方法、（ｉｖ）適当な細胞あるいは個体において該タンパク質の発現を阻害して表現型の変化を解析する方法などが挙げられる。また、このような方法によれば、対象タンパク質に特異的な活性を多面的に解析することができる。
【００８１】
（ｉ）の方法においては、本発明のタンパク質の発現を、ｍＲＮＡレベルあるいはタンパク質レベルで解析することができる。ｍＲＮＡレベルで発現量を解析する場合は、例えば、ｉｎｓｉｔｕハイブリダイゼーション法（Ｉｎｓｉｔｕｈｙｂｒｉｄｉｚａｔｉｏｎ：ＡｐｐｌｉｃａｔｉｏｎｔｏＤｅｖｅｌｏｐｍｅｎｔａｌＢｉｏｌｏｇｙ＆Ｍｅｄｉｃｉｎｅ．，Ｅｄ．ｂｙＨａｒｒｉｓ，Ｎ．ａｎｄＷｉｌｋｉｎｓｏｎ，Ｄ．Ｇ．，ＣａｍｂｒｉｄｇｅＵｎｉｖｅｒｓｉｔｙＰｒｅｓｓ（１９９０））、ＤＮＡチップを利用したハイブリダイゼーション法、定量ＰＣＲ法等が用いられる。また、タンパク質レベルで解析する場合には、後述する本発明のタンパク質に特異的に結合する抗体を用いた組織染色法、ＥＬＩＳＡ法、ウェスタンブロット法などが挙げられる。ここで、解析の対象タンパク質に公知のバリアントが存在する場合には、解析対象タンパク質をコードするｃＤＮＡにのみ存在し、公知のバリアントをコードするｃＤＮＡとはハイブリダイズしないプローブを用いることが好ましい。定量ＰＣＲ法の場合には、対象ｃＤＮＡと公知バリアント間で異なる長さの増幅断片ができるプライマーを選択して行う方法（Ｗｏｎｇ，Ｙ．，ＮｅｕｒｏｓｃｉｅｎｃｅＬｅｔ．，３２０：１４１−１４５（２００２））等が挙げられる。また、タンパク質レベルで解析する場合にも、対象タンパク質にのみ反応し、公知のバリアントには反応しない抗体を用いることが好ましい。
【００８２】
（ｉｉ）の方法においては、本発明のタンパク質と既知のタンパク質との相互作用の有無を調べて、本発明のタンパク質の機能を解析することができる。相互作用の解析法としては、それ自体既知の常法を用いることができるが、具体的には、例えば、酵母ツーハイブリッド法、蛍光偏光解消法、表面プラズモン法、ファージディスプレイ法、リボソーマルディスプレイ法等が挙げられる。該方法においても、解析対象タンパク質に公知のバリアントが存在する場合には、公知のバリアントも同様にして相互作用する物質を解析し、対象タンパク質特異的に相互作用する物質を同定することが好ましい。
【００８３】
（ｉｉｉ）の方法では、本発明のｃＤＮＡを導入する細胞は特に制限はないが、ヒト培養細胞等が特に好ましく用いられる。ＤＮＡの細胞への導入法としては、上記（２）に記載のものが挙げられる。さらに導入細胞の表現型としては、細胞の生死、細胞の増殖速度、細胞の分化、細胞が神経細胞の場合には神経突起の伸長度、細胞内タンパク質の局在や移行など顕微鏡等で観察可能なものや、細胞内の特定タンパク質の発現変化など生化学的実験により解析可能なものも含む。これらの表現型は、対象タンパク質に公知のバリアントが存在する場合には、公知のものも同様に細胞へ導入し、比較解析することにより、対象タンパク質が関連する表現型を同定することができる。また、本発明のタンパク質はトランスフェラーゼ活性または運搬体活性を有するものであることがわかっているので、これらのトランスフェラーゼまたは運搬体が関連する疾患に見られる表現型等に注目して解析することも好ましい。
【００８４】
（ｉｖ）の方法では、上記（３）に記載したオリゴヌクレオチドを用いた方法や、ＲＮＡインターフェアレンス法により効率的に行うことができる。この方法においても、解析する対象タンパク質に、公知のバリアントが存在する場合には、公知のバリアントやその他のバリアントについても同様の解析を行い、比較解析することにより対象タンパク質特異的な機能を同定することができる。
【００８５】
（６）本発明のタンパク質が有する活性を調節する物質のスクリーニング
本発明のタンパク質に特異的に結合し、かつ本発明のタンパク質の機能（活性）を阻害、拮抗または増強する作用を有する物質をスクリーニングすることにより本発明のタンパク質の機能調節物質（以下、これを「調節物質」と称することがある）を得ることができる。
【００８６】
この調節物質のスクリーニング方法は、本発明のタンパク質に特異的に結合し、且つ該タンパク質の活性を阻害、拮抗または増強する作用を有する物質が得られる方法であれば如何なるものであってもよい。例えば、まず本発明のタンパク質と被検物質とを接触させ、該タンパク質との結合性を指標として選抜した後に、本発明のタンパク質が有する機能、即ちトランスフェラーゼ活性または運搬体活性の変化を指標として被検物質を選抜する方法を用いることができる。
【００８７】
被検物質としては、本発明のタンパク質と相互作用して、該タンパク質が有する活性に影響を及ぼす可能性のある物質であれば如何なるものであってもよいが、具体的には、例えば、ペプチド、タンパク質、非ペプチド性化合物、低分子化合物、合成化合物、発酵生産物、細胞抽出液、動物組織抽出液等が挙げられる。これらの物質は新規な物質であってもよいし、公知の物質であってもよい。被検物質と本発明のタンパク質との相互作用の解析法としては、それ自体既知の常法を用いることができるが、具体的には、例えば、酵母ツーハイブリッド法、蛍光偏光解消法、表面プラズモン法、ファージディスプレイ法、リボソーマルディスプレイ法、あるいは上記（４）に記載した抗体との競合解析法等が挙げられる。このような方法により、本発明のタンパク質に結合する活性を見いだされた物質は、次に該物質の存在下で本発明のタンパク質が有する活性がどのような影響を受けるかを解析することによって、調節物質として用いられるか否かが同定される。
【００８８】
トランスフェラーゼ活性または運搬体活性の変化の解析は、各種のトランスフェラーゼまたは運搬体の性質に基づいて、それ自体既知の通常用いられる方法により行うことができる。
【００８９】
トランスフェラーゼ活性を調節する物質を解析する場合には、上記（２）に記載したＤＮＡ導入体に基質となるタンパク質も同様の方法で導入する。この導入体について選択された物質の存在下／または非存在下で、基質となるタンパク質におけるトランスフェラーゼが目的とする基が他に転移されるかどうかをそれ自体既知の通常用いられる方法により解析する。具体的には、上記（５）に記載の方法等を用いて行うことができる。トランスフェラーゼが目的とする基の転移が、物質の非存在下の場合と比べて増加した場合には、該物質はトランスフェラーゼ活性物質として機能する可能性があり、また低下、または阻害された場合には物質はトランスフェラーゼ阻害物質として機能する可能性があると同定できる。
本発明のタンパク質が有するトランスフェラーゼ活性を有するタンパク質として、例えば糖転移酵素は、癌遺伝子産物を活性化する機能、神経機能、免疫機能、炎症、細胞分化、ウィルス感染等を制御する機能等を有するものがある。そこで、本スクリーニング方法により同定できる物質は、癌、糖尿病、循環器系疾患、アルツハイマー型痴呆、パーキンソン病、舞踏病、虚血性脳疾患、糖尿病性末梢神経障害、不妊症などの治療薬もしくはそれらの診断薬、または再生組織誘導剤として用いることができる。
【００９０】
また、運搬体活性を調節する物質を解析する場合には、上記（５）に記載の方法等を用いて行うことができる。
前記の通り、本発明のタンパク質は、諸種の生理機能に関与する運搬体タンパク質として重要な機能を有しており、生体内における該タンパク質の異常は様々な疾患の原因となる。従って、上記スクリーニング方法により得られた運搬体活性の調節物質は、諸種の疾患の治療剤、例えば、代謝性疾患、動脈硬化、ビタミン欠乏症、夜盲症、過酸化に起因する疾患、老化、痴呆などの治療剤として用いることができる。
【００９１】
ここで、医薬活性成分のスクリーニングを目的とするため、用いる本発明のＤＮＡ、あるいは組み換えタンパク質については、上記したヒトのホモログＤＮＡあるいはヒトホモログタンパク質を用いることが好ましい。さらに上記方法によってスクリーニングされた物質は、これらの生体内でのスクリーニングによって医薬候補としての選択を行ってもよい。
【００９２】
かかるトランスフェラーゼ活性または運搬体活性の調節物質は、臨床へ応用するに際し、上記有効成分を単独で用いることも可能であるが、薬学的に許容され得る担体と配合して医薬品組成物として用いることもできる。この時の有効成分の担体に対する割合は、１〜９０重量％の間で変動され得る。また、かかる薬剤は種々の形態で投与することができ、それらの投与形態としては、錠剤、カプセル剤、顆粒剤、散剤、あるいはシロップ剤等による経口投与、または注射剤、点滴剤、リポソーム剤、坐薬剤等による非経口投与を挙げることができる。また、その投与量は、症状、年齢、体重等によって適宜選択することができる。
【００９３】
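(7) Screening for substances that regulate expression of the DNA of the present invention
Screening methods include, for example, analyzing the expression level of the protein of the present invention, or of the mRNA encoding it, in the presence of a test substance. Specifically, for example, cells expressing the protein of the present invention described in (2) above are cultured in a suitable medium containing the test substance, and either the amount of the protein of the present invention expressed in the cells is analyzed by a conventional method such as ELISA, or the amount of mRNA encoding the protein of the present invention in the cells is analyzed by quantitative reverse transcription PCR, Northern blotting or the like.
【００９４】
The test substances described in (6) above can be used. If this analysis shows that the amount of protein or mRNA is increased compared with the amount expressed in the same cells cultured in the absence of the test substance, the substance may function as a substance that promotes expression of the DNA of the present invention; conversely, if the amount is decreased, the substance can be judged to be usable as a substance that inhibits expression of the DNA of the present invention.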
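The decision rule above compares expression with and without the test substance. A minimal sketch follows; the function name, the labels and the optional `tolerance` band (a margin within which a change is treated as no change) are assumptions added for illustration, since the text speaks only of an increase or a decrease.

```python
def classify_expression_effect(with_substance: float,
                               without_substance: float,
                               tolerance: float = 0.0) -> str:
    """Classify a test substance from protein or mRNA amounts.

    `tolerance` is a hypothetical fractional margin around a ratio of 1.0;
    the text itself does not define one.
    """
    if without_substance <= 0:
        raise ValueError("control expression must be positive")
    ratio = with_substance / without_substance
    if ratio > 1 + tolerance:
        return "candidate expression promoter"
    if ratio < 1 - tolerance:
        return "candidate expression inhibitor"
    return "no change detected"


print(classify_expression_effect(2.0, 1.0))
print(classify_expression_effect(0.5, 1.0))
print(classify_expression_effect(1.1, 1.0, tolerance=0.2))
```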
【００９５】
かかる発現調節物質は、臨床へ応用するに際し、上記有効成分を単独で用いることも可能であるが、薬学的に許容され得る担体と配合して医薬品組成物として用いることもできる。この時の有効成分の担体に対する割合は、１〜９０重量％の間で変動され得る。また、かかる薬剤は種々の形態で投与することができ、それらの投与形態としては、錠剤、カプセル剤、顆粒剤、散剤、あるいはシロップ剤等による経口投与、または注射剤、点滴剤、リポソーム剤、坐薬剤等による非経口投与を挙げることができる。また、その投与量は、症状、年齢、体重等によって適宜選択することができる。
【００９６】
（８）本発明のＤＮＡ導入動物
上記（１）に記載の、本発明のＤＮＡを含む導入ＤＮＡを構築し、ヒト以外の哺乳動物の受精卵に導入して、これを雌個体子宮に移植して発生させることにより、本発明のＤＮＡが導入された非ヒト哺乳動物を作製することができる。より、具体的には、例えば、雌個体をホルモン投与により過剰排卵させた後、雄と交配し、交配後１日目の卵管から受精卵を摘出し、該受精卵に導入ＤＮＡをマイクロインジェクション等の方法により導入する。この後、適当な方法で培養した後、生存している受精卵を、偽妊娠させた雌個体（仮親）の子宮に移植して出産させる。新生仔に目的のＤＮＡが導入されているか否かは、該個体の細胞から抽出したＤＮＡのサザンブロット解析を行うことにより同定することができる。ヒト以外の哺乳動物としては、例えばマウス、ラット、モルモット、ハムスター、ウサギ、ヤギ、ブタ、イヌ、ネコ等が挙げられる。
【００９７】
かくして得られた本発明のＤＮＡ導入動物は、この個体を交配し、導入されたＤＮＡが安定的に保持されていることを確認しながら通常の飼育環境で継代飼育することによりその子孫を得ることができる。また、体外受精を繰り返すことによりその子孫を得て、系統を維持することもできる。
本発明のＤＮＡが導入された非ヒト哺乳動物は、本発明のＤＮＡの生体内における機能の解析や、またこれを調節する物質のスクリーニング系等として用いることができる。
【００９８】
（９）本発明のタンパク質及びそれをコードする塩基配列を含むＤＮＡの他の利用
本発明のタンパク質は、それを基盤上に結合させた担体として利用することができる。また、本発明のタンパク質をコードする塩基配列、例えば、配列番号１〜３のいずれかに記載の塩基配列を有するＤＮＡ及びその部分断片は、それらを基盤上に結合させた担体として用いられ得る。これらを、以下、「プロテインチップ」、「ＤＮＡチップ」または「ＤＮＡアレイ」（ＤＮＡマイクロアレイ及びＤＮＡマクロアレイ）と称することがある。これらのプロテインチップ、又はＤＮＡチップもしくはアレイには、本発明のタンパク質やＤＮＡ以外に、他のタンパク質やＤＮＡが含まれていてもよい。
【００９９】
ここで、タンパク質やＤＮＡを結合させる基盤としては、ナイロン膜、ポリプロピレン膜等の樹脂基板、ニトロセルロース膜、ガラスプレート、シリコンプレート等が用いられるが、ハイブリダイゼーションの検出を非ＲＩ的に、例えば、蛍光物質等を用いて行う場合には、蛍光物質を含まないガラスプレート、シリコンプレート等が好適に用いられる。また該基盤へのタンパク質、あるいはＤＮＡの結合は、それ自体公知の通常用いられる方法により容易に行うことができる。これらのプロテインチップ、ＤＮＡチップ、あるいはＤＮＡアレイも、本発明の範囲に含まれる。
【０１００】
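In addition, the amino acid sequence of the protein of the present invention and the base sequence of the DNA can also be used as sequence information. Here, the base sequence of the DNA includes the corresponding RNA base sequence. That is, by storing the obtained amino acid sequences and base sequences on a suitable recording medium in a predetermined computer-readable format, a database of amino acid sequences and base sequences can be constructed. This database may also contain other kinds of proteins and the base sequences of the DNAs encoding them. In the present invention, the term database also covers a computer system that writes the above sequences to a suitable recording medium and searches them according to a predetermined program. Examples of suitable recording media include magnetic media such as flexible disks, hard disks, and magnetic tape; optical discs such as CD-ROM, MO, CD-R, CD-RW, DVD-R, and DVD-RAM; and semiconductor memory.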
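As a minimal sketch of what storing sequences in a computer-readable format and searching them by program could look like, the following Python example writes records in FASTA format, reads them back, and searches for a motif. The choice of FASTA, the function names, and the toy sequences are assumptions made for illustration; the specification names no particular format or program, and the toy records are not the sequences of SEQ ID NOs: 1 to 6.

```python
from typing import Dict, List


def to_fasta(records: Dict[str, str], width: int = 60) -> str:
    """Serialize {name: sequence} as FASTA text, wrapping lines at `width`."""
    lines: List[str] = []
    for name, seq in records.items():
        lines.append(f">{name}")
        for i in range(0, len(seq), width):
            lines.append(seq[i:i + width])
    return "\n".join(lines) + "\n"


def from_fasta(text: str) -> Dict[str, str]:
    """Parse FASTA text back into {name: sequence}."""
    records: Dict[str, str] = {}
    name = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def search(records: Dict[str, str], motif: str) -> List[str]:
    """Return the names of records whose sequence contains `motif`."""
    motif = motif.upper()
    return [name for name, seq in records.items() if motif in seq.upper()]


# Hypothetical toy records, for illustration only.
toy_db = {"toy_dna_1": "ATGGCCAAGTGA", "toy_dna_2": "ATGTTTCCCTAA"}
stored = to_fasta(toy_db, width=5)
restored = from_fasta(stored)
matches = search(restored, "ccaag")
```

Any medium that can hold the text produced by `to_fasta` would serve as the recording medium in this sketch.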
【０１０１】
【実施例】
以下、実施例を挙げて本発明を詳細に説明するが、本発明の範囲はこれらの実施例により限定されるものではない。
実施例１ｃＤＮＡライブラリーの調製
（１）ｍＲＮＡの調製
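Preparation of mRNA: 0.5 to 1 g of each organ or tissue from mice (C57BL/6) was homogenized in 10 ml of suspension solution, and 1 ml of 2 M sodium acetate (pH 4.0) and an equal volume of a phenol/chloroform mixture (5:1 by volume) were added for extraction. After extraction, an equal volume of isopropanol was added to the aqueous layer, whereupon the RNA separated from the aqueous phase and precipitated. The sample was incubated on ice for 1 hour and then centrifuged in a refrigerated centrifuge at 4,000 rpm for 15 minutes to collect the precipitate. The precipitate was washed with 70% ethanol and dissolved in 8 ml of water; 2 ml of 5 M NaCl and 16 ml of an aqueous solution (pH 7.0) containing 1% CTAB (cetyltrimethylammonium bromide), 4 M urea, and 50 mM Tris were then added to precipitate the RNA and remove polysaccharides (CTAB precipitation).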
【０１０２】
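The sample was then centrifuged at 4,000 rpm for 15 minutes at room temperature, and the RNA was dissolved in 4 ml of 7 M guanidine-Cl. Two volumes of ethanol were added, and the mixture was incubated on ice for 1 hour and centrifuged at 4,000 rpm for 15 minutes. The resulting precipitate was washed with 70% ethanol to recover the RNA, which was redissolved in water. RNA purity was assessed by reading the OD ratios 260/280 (> 1.8) and 230/260 (< 0.45).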
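The two purity criteria stated above (OD 260/280 greater than 1.8 and OD 230/260 less than 0.45) can be expressed as a short check. This is an illustrative sketch only: the function name and the example absorbance readings are hypothetical, and only the two thresholds come from the text.

```python
def rna_purity_ok(od230: float, od260: float, od280: float,
                  min_260_280: float = 1.8,
                  max_230_260: float = 0.45) -> bool:
    """Return True when both OD ratio criteria from the text are met."""
    if od260 <= 0 or od280 <= 0:
        raise ValueError("OD260 and OD280 must be positive")
    ratio_260_280 = od260 / od280   # the text requires > 1.8
    ratio_230_260 = od230 / od260   # the text requires < 0.45
    return ratio_260_280 > min_260_280 and ratio_230_260 < max_230_260


# Hypothetical readings, for illustration only.
clean_prep = rna_purity_ok(od230=0.40, od260=1.00, od280=0.50)
contaminated_prep = rna_purity_ok(od230=0.50, od260=1.00, od280=0.50)
```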
【０１０３】
（２）第１鎖ｃＤＮＡの調製
上記（１）で調製したｍＲＮＡ１５μｇを使って逆転写酵素３，０００ｕｎｉｔにより、最終容量１６５μｌの反応液中で、５−メチル−ｄＣＴＰ、ｄＡＴＰ、ｄＴＴＰ、ｄＧＴＰを各々０．５４ｍＭ、０．６Ｍトレハロース、５０ｍＭＴｒｉｓ−ＨＣｌ（ｐＨ８．３）、７５ｍＭＫＣｌ、３ｍＭＭｇＣｌ２、１０ｍＭＤＴＴ、５２ｎｇ／μｌＢＳＡ、ＲＮａｓｅインヒビター５ｕｎｉｔの条件下で逆転写反応を行った。制限酵素ＸｈｏＩの認識配列を含むオリゴヌクレオチド（配列番号７）（配列中、ＶはＡ，Ｇ又はＣを示し、ＮはＡ，Ｇ，Ｃ又はＴを示す）１２．６μｌをプライマーとして用いた。
【０１０４】
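At the start of this reaction, one quarter of the reaction mixture was removed and 1.5 μl of [α-³²P]-dGTP (3000 Ci/mmol, 10 μCi/μl; Amersham) was added to it in order to measure the efficiency of first-strand cDNA synthesis. 0.5 μl of the RI-labeled reaction mixture was spotted onto DE-81 paper, and the RI activity was measured before and after washing three times with 0.5 M sodium phosphate buffer (pH 7.0) and used for the calculation. The RI-labeled and unlabeled reaction mixtures were then combined, 8 μl of 0.5 M EDTA, 2 μl of 10% SDS, and 20 μg of Proteinase K were added, and the mixture was heated at 45°C for 15 minutes. After phenol/chloroform extraction and ethanol precipitation, the precipitate was dissolved in 47 μl of water treated to be RNase-free (hereinafter, RNase-free water).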
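The text states only that RI activity was measured before and after washing and then "calculated"; it does not give the formula. One common way to turn such paired readings into an efficiency figure, assumed here purely for illustration, is the fraction of counts retained on the paper after washing. The function name and the count values below are hypothetical.

```python
def incorporation_fraction(counts_before_wash: float,
                           counts_after_wash: float) -> float:
    """Fraction of label retained on the filter after washing.

    Assumption: unincorporated nucleotide is washed off, so the retained
    fraction approximates the fraction of label incorporated into cDNA.
    """
    if counts_before_wash <= 0:
        raise ValueError("counts_before_wash must be positive")
    if not 0 <= counts_after_wash <= counts_before_wash:
        raise ValueError("counts_after_wash must lie between 0 and the pre-wash count")
    return counts_after_wash / counts_before_wash


# Hypothetical scintillation counts, for illustration only.
fraction = incorporation_fraction(20000, 1500)
```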
【０１０５】
（３）５’キャップ構造及び３’末端へのビオチン付加
ＲＮＡジオールのビオチン化ＲＮＡのジオール部位（Ｃａｐ構造のある５’末端と、ポリＡ鎖のある３’末端のリボースの双方に存在）にビオチンを結合させるために、２段階の反応を行った。それらは、ジオール基の酸化とそれに続くビオチンヒドラジドと酸化ＲＮＡ体のカップリング反応である。まず、逆転写反応で得られたＲＮＡ−第１鎖ｃＤＮＡ複合体１５μｇを、６．６ｍＭ酢酸ナトリウム緩衝液（ｐＨ４．５）と、酸化剤として過ヨウ素酸ナトリウムを用いて５０μｌの反応液中で処理した。この酸化反応は遮光条件の下、氷上で４５分間行った。
【０１０６】
続いて、５Ｍ塩化ナトリウム１１μｌ、１０％ＳＤＳ０．５μｌ、そして同量のイソプロパノールを加え、６０分間氷上に放置した後、４℃で１５分間１５，０００ｒｐｍ遠心し沈澱を取得した。沈澱物は７０％エタノールで洗い、ＲＮａｓｅフリー水５０μｌに再溶解させた。その試料に１Ｍ酢酸ナトリウム（ｐＨ６．１）５μｌ、１０％ＳＤＳ５μｌ、１０ｍＭビオチンヒドラジド（Ｓｉｇｍａ社製）１５０μｌを加え、室温（２２〜２６℃）で終夜反応させた。最後に、５μｌの５ＭＮａＣｌ、１Ｍ酢酸ナトリウム（ｐＨ６．１）７５μｌ、及び２．５倍量のエタノールを加え、１時間の氷上冷却後、４℃において１５分間遠心し、ビオチン化した。反応後、反応液を１５分間遠心し、再度ＲＮＡ−ＤＮＡ複合体を沈澱させた。沈澱物は７０％エタノールで１回、更に８０％エタノールで１回洗い、ＲＮａｓｅフリー水７０μｌに溶解した。
【０１０７】
（４）ＲＮａｓｅＩによる完全長ｃＤＮＡの選択
上記（３）で取得したビオチン化ＲＮＡ−ＤＮＡ複合体について、１本鎖ＲＮＡを消化するＲＮａｓｅＩで処理することにより、逆転写反応時に完全なｃＤＮＡの伸長が得られなかったｍＲＮＡ、およびｍＲＮＡの３’末端に標識されたビオチン残基を取り除いた。具体的には、上記（３）で得られた試料７０μｌに１０×ＲＮａｓｅＩバッファー（１００ｍＭＴｒｉｓ−ＨＣｌ（ｐＨ７．５）、５０ｍＭＥＤＴＡ、２ＭＮａＯＡｃ）１０μｌ、ＲＮａｓｅＩ（ＲＮａｓｅＯｎｅ^ＴＭ；Ｐｒｏｍｅｇａ社製）２００ｕｎｉｔを加えて、３７℃で１５分間１本鎖ＲＮＡを消化した。
【０１０８】
（５）完全長ｃＤＮＡの採取
ストレプトアビジンコートしたマグネティックビーズにｃＤＮＡが非特異的吸着するのを防止するため、１００μｇの酵母ｔＲＮＡ（ＤＮａｓｅＩ処理したもの）を５ｍｇ（５００μｌ）のマグネティックビーズ（ｍａｇｎｅｔｉｃｐｏｒｏｕｓｇｌａｓｓ（ＭＰＧ）ｐａｒｔｉｃｌｅｓｃｏａｔｅｄｗｉｔｈｓｔｒｅｐｔａｖｉｄｉｎ（ＣＰＧ，ＮＪ））に加え、１時間氷上に放置した後、５０ｍＭＥＤＴＡ、２ＭＮａＣｌの溶液にて洗った。
このビーズを５０ｍＭＥＤＴＡ、２ＭＮａＣｌの溶液５００μｌ中に懸濁し、（４）で取得したＲＮａｓｅＩ処理を施されたｃＤＮＡを加えた。室温にて３０分間撹拌することで、マグネティックビーズと完全長ｃＤＮＡを結合させた。完全長ｃＤＮＡを捕獲したビーズを５０ｍＭＥＤＴＡ、２ＭＮａＣｌの溶液で４回、０．４％ＳＤＳ、５０μｇ／μｌ酵母ｔＲＮＡで１回、１０ｍＭＮａＣｌ、０．２ｍＭＥＤＴＡ、１０ｍＭＴｒｉｓ−ＨＣｌ（ｐＨ７．５）、２０％グリセロールで１回、５０μｇ／μｌ酵母ｔＲＮＡ水溶液で１回、ＲＮａｓｅＨバッファー（２０ｍＭＴｒｉｓ−ＨＣｌ（ｐＨ７．５）、１０ｍＭＭｇＣｌ_２、２０ｍＭＫＣｌ、０．１ｍＭＥＤＴＡ、０．１ｍＭジチオスレイトール（ＤＴＴ）で１回洗浄した後、ＲＮａｓｅＨ用バッファー１００μｌに懸濁し、ＲＮａｓｅＨ３ｕｎｉｔを加え、３７℃下３０分間加温した。その後、１０％ＳＤＳ１μｌ、０．５ＭＥＤＴＡ２μｌを加えて、１０分間、６５℃に曝し、その上清を回収した。
このようにして回収された１本鎖完全長ｃＤＮＡはフェノール／クロロホルムで抽出され、スピードバッグにて液量を１００μｌ以下に減じてからＧ２５／Ｇ１００Ｓｅｐｈａｄｅｘクロマトグラフィーに付した。ＲＩ活性を持った分画はシリコン処理したマイクロチューブに収集するとともに、グリコーゲン２μｇを加え、エタノール沈澱にて得られた沈澱物を３０μｌの超純水に溶解した。
【０１０９】
（６）１本鎖ｃＤＮＡへのオリゴｄＧ付加
上記（５）で回収された１本鎖ｃＤＮＡ３０μｌは、最終容量５０μｌの反応液中で、２００ｍＭカコジル酸ナトリウム（ｐＨ６．９）、１ｍＭＭｇＣｌ_２、１ｍＭＣｏＣｌ_２、１ｍＭ２−メルカプトエタノール、１００μＭｄＧＴＰの条件のもと、ターミナルデオキシヌクレオチジルトランスフェラーゼ（ＴａＫａＲａ社製）３２ｕｎｉｔを用いて３７℃で３０分間のオリゴｄＧ付加反応に付した。反応終了時にＥＤＴＡを５０ｍＭとなるように加え、一連のフェノール／クロロホルムによる抽出、エタノール沈澱を経て、３１μｌの超純水に溶解した。
【０１１０】
（７）第２鎖ｃＤＮＡ合成
第１鎖ｃＤＮＡを鋳型にした第２鎖ｃＤＮＡの合成は以下のように行った。最終容量６０μｌの反応系で、第２鎖低バッファー（２００ｍＭＴｒｉｓ−ＨＣｌ（ｐＨ８．７５）、１００ｍＭＫＣｌ、１００ｍＭ（ＮＨ_４）_２ＳＯ_４、２０ｍＭＭｇＳＯ_４、１％ＴｒｉｔｏｎＸ−１００、１ｍｇ／μｌＢＳＡ）３μｌ、第２鎖高バッファー（２００ｍＭＴｒｉｓ−ＨＣｌ（ｐＨ９．２）、６００ｍＭＫＣｌ、２０ｍＭＭｇＣｌ_２）３μｌ、ｄＣＴＰ、ｄＡＴＰ、ｄＴＴＰ、ｄＧＴＰ各々０．２５ｍＭ、β−ＮＡＤＨ６μｌ、オリゴｄＧ付加された第１鎖ｃＤＮＡ３１μｌ、第２鎖プライマー−アダプター（配列番号８）６００ｎｇを加え、ＥｘＴａｑＤＮＡポリメラーゼ（ＴａＫａＲａＥｘＴａｑ；ＴａＫａＲａ社製）１５ｕｎｉｔ、耐熱性ＤＮＡリガーゼ（Ａｍｐｌｉｇａｓｅ；Ｅｐｉｃｅｎｔｒｅ社製）１５０ｕｎｉｔ、耐熱性ＲＮａｓｅＨ（Ｈｙｂｒｉｄａｓｅ；Ｅｐｉｃｅｎｔｒｅ社製）３ｕｎｉｔによって第２鎖ｃＤＮＡを合成した。
【０１１１】
０．５ＭＥＤＴＡを１μｌ加えることで反応を停止させ、更に蛋白成分を溶解するために、１０％ＳＤＳ１μｌ、プロテイナーゼ（Ｐｒｏｔｅｉｎａｓｅ）Ｋ１０μｇの存在下に４５℃で１５分間加熱し、最終的にフェノール／クロロホルムによる抽出、エタノール沈澱にて精製した２本鎖完全長ｃＤＮＡを得た。
【０１１２】
（８）ライブラリーの調製
以上の方法により得られた二本鎖完全長ｃＤＮＡは、λＺＡＰＩＩＩベクターに挿入し、ライブラリーとして回収した。λＺＡＰＩＩＩベクターはλＺＡＰＩＩ（ＳＴＲＡＴＡＧＥＮＥ社製）ベクターのマルチクローニングサイトの一部の配列（配列番号９）を配列番号１０に改変し、二つのＳｆｉＩサイトを新たに導入したものである。
【０１１３】
さらにλＰＳ（ＲＩＫＥＮ）ベクターを作製し、ｃＤＮＡを挿入した。λＰＳ（ＲＩＫＥＮ）（λ−ＦＬＣ−１と命名（ＦＬＣとはＦＵＬＬ−ＬＥＮＧＴＨｃＤＮＡを意味する））とは、ＭｏＢｉＴｅｃ社（ドイツ）のλＰＳベクターをｃＤＮＡ用に改変したものである。即ち１０ｋｂｐｓｔｕｆｆｅｒの両側に存在するクローニングサイトにｃＤＮＡ挿入に便利なＢａｍＨＩならびにＳａｌＩを各々導入するとともに、０．５ｋｂから１３ｋｂ程度までのｃＤＮＡがクローニングできるようにＸｂａＩサイトに６ｋｂのＤＮＡ断片を挿入したものである（特開２０００−３２５０８０号公報）。このλ−ＦＬＣ−１を用いると、例えば肺臓ｃＤＮＡライブラリーの場合には、インサートの平均鎖長は２．５７ｋｂとなり、実際に０．５ｋｂから１２ｋｂまでのインサートをクローニングすることが出来た。従来法のλＺＡＰの場合には、インサートの平均鎖長は０．９７ｋｂであったことから、λ−ＦＬＣ−１を用いることによって、サイズの大きなｃＤＮＡもλＺＡＰに比べて効率よくクローニングできることがわかる。
【０１１４】
実施例２完全長ｃＤＮＡライブラリーのノーマライゼーション／サブトラクション
（１）ドライバーの調製
実施例１（１）で作製したｍＲＮＡ（以下、これを「（ａ）ＲＮＡドライバー」と称することがある）、及びｉｎｖｉｔｒｏ転写反応で作成したＲＮＡをドライバーとして用いた。後者のＲＮＡはさらに２種類（以下、これを「（ｂ）ＲＮＡドライバー、及び「（ｃ）ＲＮＡドライバー」と称する）に分けられる。１つはノーマライゼーションにより除かれたＲＮＡ−ｃＤＮＡからｃＤＮＡを回収し、ファージベクターにクローニングしたものである。大腸菌に感染後１つの出発材料あたり１０００から２０００プラークを混ぜ合わせて１つのライブラリー（ミニライブラリー）とし、常法によりプラスミドＤＮＡに変換する（ファージをヘルパーファージとともに再度大腸菌に感染させ、ファージミドとし、さらにもう一度感染させてプラスミドＤＮＡを得る）。
【０１１５】
得られたＤＮＡについてｉｎｖｉｔｒｏ転写反応（Ｔ３ＲＮＡポリメラーゼまたはＴ７ＲＮＡポリメラーゼを用いる）を行い、ＤＮａｓｅＩ（ＲＱ１−ＲＮａｓｅｆｒｅｅ；Ｐｒｏｍｅｇａ社製）、ＰｒｏｔｅｉｎａｓｅＫ処理後、フェノール／クロロホルム抽出をしてＲＮＡ（ｂ）ＲＮＡドライバーを得た。この際、通常出発材料としては９種類（すい臓、肝臓、肺、腎臓、脳、脾臓、睾丸、小腸、胃）の組織からそれぞれミニライブラリーを作成して、９種類のミニライブラリーを混合してＲＮＡを得る。もう一つのＲＮＡはすでに重複のないクローンとして保存されているライブラリー（クローン数約２万個）を培養し、得られたＤＮＡについて（ｂ）ＲＮＡドライバーと同様にｉｎｖｉｔｒｏ転写反応を行い（ｃ）ＲＮＡドライバーとした。
【０１１６】
これら３種のＲＮＡは、Ｌａｂｅｌ−ＩＴＢｉｏｔｉｎＬａｂｅｌｉｎｇＫｉｔ（ＭｉｒｕｓＣｏｒｐｏｒａｔｉｏｎ製）を用いてビオチン化標識を行ったあと、１：１：１の割合でテスターｃＤＮＡに添加し、Ｒｏｔ１０での反応（４２℃）を行い、ストレプトアビジンビーズ（ＣＰＧ）処理を行って回収した上清について、第２鎖の合成を行った。
【０１１７】
実施例３完全長ｃＤＮＡクローンの塩基配列決定
（１）クローンのｒｅａｒｒａｙ
各クラスタからひとつの代表クローンを選んだ。代表クローンはＱ−ｂｏｔ（ＧＥＮＥＴＩＸＬＩＭＩＴＥＤ製）で選択し、３８４穴プレートにａｒｒａｙ化した。その際、大腸菌は３０℃で１８〜２４時間、５０μｌのＬＢ培地で培養した。このとき、ｃＤＮＡライブラリーがＰＳベクターに導入され大腸菌ＤＨ１０Ｂを形質転換している場合には１００ｍｇ／ｍｌのアンピシリン及び５０ｍｇ／ｍｌのカナマイシンを添加し、Ｚａｐベクターに導入し、ＳＯＬＲシステムに導入している場合には１００ｍｇ／ｍｌのアンピシリン及び２５ｍｇ／ｍｌのストレプトアビジンを添加して行った。
【０１１８】
（２）プラスミドの抽出とＩｎｓＳｉｚｉｎｇ
上記（１）で培養した各クローンは、さらに１００ｍｇ／ｍｌのアンピシリンを含む１．３ｍｌのＨＴ液中で培養され、遠心分離により菌体を回収した後、ＱＩＡｐｒｅｐ９６Ｔｕｒｂｏ（ＱＩＡＧＥＮ社製）を用いてプラスミドＤＮＡを回収、精製した。取得されたプラスミド中に挿入されているｃＤＮＡの鎖長を調べるために、上記で取得したプラスミドＤＮＡの１／３０を制限酵素ＰｙｕＩＩで消化し、１％のａｇａｒｏｓｅゲル電気泳動を行った。
【０１１９】
（３）配列決定
かくして取得されたプラスミド中に挿入された完全長ｃＤＮＡの全長の塩基配列解析には、３種類のシークエンサを用いた。また、プラスミドは挿入配列の長さが２．５ｋｂより短いものと長いものの２つのカテゴリに分けた。このうち２．５ｋｂより短い挿入配列を有するクローンについては両端から塩基配列を解析した。その際、プラスミドはベクターがＰＳの場合には配列番号１１（センス鎖）、及び１２（アンチセンス鎖）に記載のプライマーを用いて、またベクターがＺａｐの場合には配列番号１３（センス鎖）、及び１４（アンチセンス鎖）に記載のプライマーを用いてＴｈｅｒｍｏｓｅｑｕｅｎａｓｅＰｒｉｍｅｒＣｙｃｌｅＳｅｑｕｅｎｃｉｎｇＫｉｔ（ＡｍｅｒｓｈａｍＰｈａｒｍａｃｉａＢｉｏｔｅｃｈ社製）で反応し、ＬｉｃｏｒＤＮＡ４２００（ｌｏｎｇｒｅａｄｓｅｑｕｅｎｃｅｒ）を用いて解析した。
【０１２０】
上記塩基配列解析により解析ができなかったギャップは、プライマウォーキング法により決定した。その際、ＡＢＩＰｒｉｓｍ３７７及び／またはＡＢＩＰｒｉｓｍ３７００（ＡｐｐｌｉｅｄＢｉｏｓｙｓｔｅｍｓＩｎｃ．製）とＢｉｇＤｙｅｔｅｒｍｉｎａｔｏｒｋｉｔとＣｙｃｌｅＳｅｑｕｅｎｃｉｎｇＦＳｒｅａｄｙＲｅａｃｔｉｏｎＫｉｔ（ＡｐｐｌｉｅｄＢｉｏｓｙｓｔｅｍｓＩｎｃ．製）を用いた。
【０１２１】
また、挿入されているｃＤＮＡが２．５ｋｂより長いクローンの配列決定は、ショットガン法によった。その際、ＳｈｉｍａｄｚｕＲＩＳＡ３８４とＤＹＥｎａｍｉｃＥＴｔｅｒｍｉｎａｔｏｒｃｙｃｌｅｓｅｑｕｅｎｃｉｎｇｋｉｔ（ＡｍｅｒｓｈａｍＰｈａｒｍａｃｉａＢｉｏｔｅｃｈ社製）を用いた。ショットガンライブラリを作製するために、４８の独立な代表クローンからＰＣＲで増殖した４８のＤＮＡフラグメントを用いた。増幅されたＤＮＡ断片の末端をＴ４ＤＮＡポリメラーゼによって平滑化した。
このＤＮＡ断片を、ｐＵＣ１８ベクターへ挿入し、更に該組み換えベクターにより大腸菌ＤＨ１０Ｂを形質転換した。この大腸菌から上記（２）と同様にしてプラスミドを調製した。
【０１２２】
それらの代表クローンについては、両末端からの塩基配列解析によって塩基配列を決定し、該塩基配列をコンピューター上で連結した後、ＤｏｕｂｌｅＳｔｒｏｋｅＳｈｅａｒｉｎｇＤｅｖｉｃｅ（ＦｉｏｒｅＩｎｃ．製）によるｓｈｅａｒｉｎｇを行った。ショットガン法による塩基配列決定は、１２〜１５クローンの重複をもって行った。この塩基配列決定により配列が決定できなかったギャップは、上記と同様にプライマウォーキングによって決定した。
【０１２３】
実施例４各完全長ｃＤＮＡクローンの塩基配列の解析
実施例３で決定した完全長ｃＤＮＡクローンの全塩基配列について、ＢＬＡＳＴによる相同性検索や、ＨＭＭＰＦＡＭによるタンパク質特徴検索を行い、各完全長ｃＤＮＡクローンがコードするタンパク質の機能を推定した。
【０１２４】
（１）ｄｎａｆｏｒｍ２７６９８（配列番号１、４）
ｄｎａｆｏｒｍ２７６９８は、配列番号１に示すように、３３１１塩基から成り、そのうち塩基番号７８から２１９８までがオープンリーディングフレーム（終止コドンを含む）になっていた。オープンリーディングフレームから予測されるアミノ酸配列は、７０６アミノ酸残基から成る（配列番号４）。配列番号１がコードするアミノ酸配列についてＢＬＡＳＴを用いて相同性検索を行ったところ、ＳＰＴＲ蛋白質データベース（ＳＷＩＳＳ−ＰＲＯＴ蛋白質配列データベースとＴｒＥＭＢＬ核酸翻訳データベースを統合したもの）中に、（ｉ）データベース登録記号ＡＬ０３１６７８、ＨｕｍａｎＤＮＡｓｅｑｕｅｎｃｅｆｒｏｍｃｌｏｎｅＲＰ４−８１６Ｋ１７ｏｎｃｈｒｏｍｏｓｏｍｅ２０ｐ１２．２−１３．ＣｏｎｔａｉｎｓｔｈｅＴＧＭ３ｇｅｎｅｆｏｒｔｒａｎｓｇｌｕｔａｍｉｎａｓｅ３が、ｅ−ｖａｌｕｅ：０．０で、また４４２アミノ酸残基に亘り８７％の一致度で、また（ｉｉ）データベース登録記号Ｑ０８１８８、ＨｕｍａｎＰｒｏｔｅｉｎ−ｇｌｕｔａｍｉｎｅｇｌｕｔａｍｙｌｔｒａｎｓｆｅｒａｓｅＥ３ｐｒｅｃｕｒｓｏｒが、ｅ−ｖａｌｕｅ：０．０で、７０５アミノ酸残基に亘り４９％の一致度で、さらに（ｉｉｉ）データベース登録記号Ｑ０８１８９、ｍｏｕｓｅＰｒｏｔｅｉｎ−ｇｌｕｔａｍｉｎｅｇｌｕｔａｍｙｌｔｒａｎｓｆｅｒａｓｅＥ３ｐｒｅｃｕｒｓｏｒが、ｅ−ｖａｌｕｅ：０．０で、７０５アミノ酸残基に亘り４８％の一致度でヒットした。これらの結果より配列番号４に示したアミノ酸配列からなるタンパク質はグルタミントランスフェラーゼであることが推測された。
【０１２５】
In addition, a protein feature search with HMMPFAM on the amino acid sequence shown in SEQ ID NO: 4 found a sequence characteristic of the Transglutaminase-like superfamily (entered in Pfam as "Transglutamin_N") at amino acid numbers 1-121 of SEQ ID NO: 4, a sequence characteristic of the Transglutaminase-like superfamily (entered in Pfam as "Transglut_core") at amino acid numbers 269-358, and a sequence characteristic of the Transglutaminase family, C-terminal ig like domain (entered in Pfam as "Transglutamin_C") at amino acid numbers 480-704.
これらのことから配列番号１に示す塩基配列がコードするタンパク質はグルタミントランスフェラーゼであることが推測された。
【０１２６】
（２）ｄｎａｆｏｒｍ５０４４１（配列番号２、５）
ｄｎａｆｏｒｍ５０４４１は、配列番号２に示すように、２７５７塩基から成り、そのうち塩基番号２３５から２６３１までがオープンリーディングフレーム（終止コドンを含む）になっていた。オープンリーディングフレームから予測されるアミノ酸配列は、７９８アミノ酸残基から成る（配列番号５）。配列番号２に示す塩基配列がコードするアミノ酸配列についてＢＬＡＳＴを用いて相同性検索を行ったところ、ＳＰＴＲ蛋白質データベース（ＳＷＩＳＳ−ＰＲＯＴ蛋白質配列データベースとＴｒＥＭＢＬ核酸翻訳データベースを統合したもの）中に、（ｉ）データベース登録記号ＡＦ３５７９７０、ＨｏｍｏｓａｐｉｅｎｓｃａｒｎｉｔｉｎｅｐａｌｍｉｔｏｙｌｔｒａｎｓｆｅｒａｓｅＩＣが、ｅ−ｖａｌｕｅ：０．０で、また８０２アミノ酸残基に亘り８３％で、また（ｉｉ）データベース登録記号Ｕ８８２９４、ＲａｔｔｕｓｎｏｒｖｅｇｉｃｕｓｃａｒｎｉｔｉｎｅｐａｌｍｉｔｏｙｌｔｒａｎｓｆｅｒａｓｅＩ（ＣＰＴＩ）がｅ−ｖａｌｕｅ：０．０で、７６５アミノ酸残基に亘り５３％の一致度でヒットした。
また、配列番号５に示すアミノ酸配列について、ＨＭＭＰＦＡＭによる蛋白質特徴検索を行ったところ配列番号５のアミノ酸番号１７０−７５８のアミノ酸配列にＣｈｏｌｉｎｅ／Ｃａｒｎｉｔｉｎｅｏ−ａｃｙｌｔｒａｎｓｆｅｒａｓｅの特徴を示す配列（Ｐｆａｍに「Ｃａｒｎ＿ａｃｙｌｔｒａｎｓｆ」としてエントリーされる配列）を見出した。
これらの結果より、配列番号２に示す塩基配列がコードするタンパク質はｐａｌｍｉｔｏｙｌｔｒａｎｓｆｅｒａｓｅであることが推測された。
【０１２７】
（３）ｄｎａｆｏｒｍ２８７００（配列番号３、６）
ｄｎａｆｏｒｍ２８７００は、配列番号３に示すように、３１８４塩基から成り、そのうち塩基番号２６から９４０までがオープンリーディングフレーム（終止コドンを含む）になっていた。オープンリーディングフレームから予測されるアミノ酸配列は、３０４アミノ酸残基から成る（配列番号６）。配列番号３に示す塩基配列がコードするアミノ酸配列についてＢＬＡＳＴを用いて相同性検索を行ったところ、ＳＰＴＲ蛋白質データベース（ＳＷＩＳＳ−ＰＲＯＴ蛋白質配列データベースとＴｒＥＭＢＬ核酸翻訳データベースを統合したもの）中に、（ｉ）データベース登録記号ＡＣ００４８３２、ＨｏｍｏｓａｐｉｅｎｓＰＡＣｃｌｏｎｅＲＰ４−５３９Ｍ６ｆｒｏｍ２２（ヒトＳＰＦ）が、ｅ−ｖａｌｕｅ：５×１０^−１０９で、２２１アミノ酸残基に亘り８５％で、また（ｉｉ）データベース登録記号ＡＦ３０９５５８、Ｒａｔｔｕｓｎｏｒｖｅｇｉｃｕｓｓｕｐｅｒｎａｔａｎｔｐｒｏｔｅｉｎｆａｃｔｏｒ（Ｓｐｆ）が、ｅ−ｖａｌｕｅ：２×１０^−８６で、２２１アミノ酸残基に亘り６４％の一致度で、さらに（ｉｉｉ）データベース登録記号ＡＦ４８７９７７、Ｂｏｓｔａｕｒｕｓｔｏｃｏｐｈｅｒｏｌ−ａｓｓｏｃｉａｔｅｄｐｒｏｔｅｉｎが、ｅ−ｖａｌｕｅ：２×１０^−８４で、２２１アミノ酸残基に亘り６２％の一致度でヒットした。
また、配列番号６に示すアミノ酸配列について、ＨＭＭＰＦＡＭによる蛋白質特徴検索を行ったところ、配列番号６のアミノ酸番号８５−２４７のアミノ酸配列にＣＲＡＬ／ＴＲＩＯｄｏｍａｉｎの特徴を示す配列（Ｐｆａｍに「ＣＲＡＬ＿ＴＲＩＯ」としてエントリーされる配列）を見出したが、本ドメインを持つタンパク質は、ｒｅｔｉｎａｌ−ｂｉｎｄｉｎｇｐｒｏｔｅｉｎ、ｐｈｏｓｐｈａｔｉｄｙｌｃｈｏｌｉｎｅあるいはａｌｐｈａ−ｔｏｃｏｐｈｅｒｏｌなどの運搬体タンパク質と考えられている。また、アミノ酸番号３−７１のアミノ酸配列にＣＲＡＬ／ＴＲＩＯＮｔｅｒｍｉｎａｌの特徴を示す配列（Ｐｆａｍに「ＣＲＡＬ／ＴＲＩＯ＿Ｎ」としてエントリーされる配列）を見出した。上記の（ｉ）のタンパク質であるヒトＳＰＦは、文献情報（ＰＮＡＳ２０００，９８，２２４４−２２４９）から、ｓｑｕａｌｅｎｅの運搬機能を有することが示されている。即ち、ＳＰＦはｓｑｕａｌｅｎｅを結合しミクロゾームのｓｑｕａｌｅｎｅｅｐｏｘｉｄｅへ運搬することにより、ｓｑｕａｌｅｎｅ２，３−ｏｘｉｄｅへの変換を触媒させステロール生合成の後期過程を開始させる。また、ヒトＳＰＦは、文献情報（ＢＢＲＣ２００１，２８５，２９５−２９９）から、ａｌｐｈａ−ｔｏｃｏｐｈｅｒｏｌ依存的に転写促進活性を有することも示されている。
これらの結果より、配列番号３に示す塩基配列がコードするタンパク質はｓｑｕａｌｅｎｅ、ｒｅｔｉｎａｌｄｅｈｙｄｅ、ｐｈｏｓｐｈａｔｉｄｙｌｃｈｏｌｉｎｅあるいはａｌｐｈａ−ｔｏｃｏｐｈｅｒｏｌなどの運搬体タンパク質あるいは転写促進活性を有することが推測され、該タンパク質または該タンパク質の発現制御物質、機能賦活物質、あるいは機能阻害物質は、代謝性疾患、動脈硬化、ビタミン欠乏症、夜盲症、過酸化に起因する疾患、老化、痴呆、などの治療薬となる可能性がある。
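The open reading frame coordinates reported for the three clones in this example can be cross-checked arithmetically: an inclusive base range that contains the stop codon should have a length divisible by 3, and the predicted protein should be one residue shorter than the codon count. The sketch below uses only the coordinates and residue counts quoted in the text; the function name is an illustrative assumption.

```python
def orf_summary(start: int, end: int) -> tuple:
    """Return (ORF length in bases, predicted residues) for an inclusive
    base range that includes the stop codon."""
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError("range is not a whole number of codons")
    codons = length // 3
    return length, codons - 1  # the stop codon encodes no residue


# (first base, last base, residues stated in the text)
CLONES = {
    "dnaform27698": (78, 2198, 706),
    "dnaform50441": (235, 2631, 798),
    "dnaform28700": (26, 940, 304),
}

# True for a clone when the stated residue count matches the coordinates.
consistent = {
    name: orf_summary(start, end)[1] == residues
    for name, (start, end, residues) in CLONES.items()
}
```

All three stated residue counts agree with the stated coordinates under this check.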
【０１２８】
実施例５ＤＮＡマイクロアレイを用いた組織発現解析
ＤＮＡマイクロアレイを用いた組織発現解析は、Ｍｉｋｉ，Ｒ．，ｅｔａｌ．，Ｐｒｏｃ．Ｎａｔｌ．Ａｃａｄ．Ｓｃｉ．ＵＳＡ，９８，２１９９−２２０４（２００１）の記載に従って行った。
（１）ＤＮＡマイクロアレイの作成
The mouse full-length cDNA (dnaform50441) was amplified using M13 forward and reverse primers, and the PCR product was precipitated with isopropanol and dissolved in 15 μl of 3×SSC. This DNA solution was spotted onto poly-L-lysine-coated glass slides using a DNA arrayer fitted with 16 tips (SMP3, TeleChem International, Sunnyvale, CA) to produce the DNA microarray (details of the method are described at http://cmgm.stanford.edu/pbrown/mguide/index.html). cDNAs of mouse β-actin and glyceraldehyde-3-phosphate dehydrogenase were used as positive controls, and Arabidopsis thaliana cDNA was used as a negative control.
【０１２９】
このＤＮＡマイクロアレイの検出感度は、１細胞当たりｍＲＮＡ１ないし３コピーであった。ターゲット配列との一致度がおよそ８０％のクローンのシグナル強度は、完全に配列が一致するクローンの１０分の１であった。ターゲット配列との一致度が８０％未満のクローンのシグナル強度は、バックグランドレベルであった。
【０１３０】
（２）プローブの調製
Ｃ５７ＢＬ／６Ｊマウスの胎児、新生仔、アダルトの２２組織（腎臓、脳、脾臓、肺、肝臓、精巣、膵臓、胃、小腸、結腸、胎盤、心臓、胸腺、小脳、子宮、骨、筋肉、背側腎臓由来脂肪細胞、副精巣由来脂肪細胞、内臓脂肪、１０日齢新生児小脳、１０日齢新生児皮膚）から抽出したｍＲＮＡ１μｇを定法に従いランダムプライム逆転写反応を行い蛍光色素Ｃｙ３（ＡｍｅｒｓｈａｍＰｈａｒｍａｃｉａ社）を取りこませた。他方、１７．５日齢の胎児全身から抽出したｍＲＮＡ１μｇをランダムプライム逆転写反応を行い、蛍光色素Ｃｙ５を取りこませ発現解析の対照とした。ＣｙＤｙｅ標識ｃＤＮＡプローブは、ＣｙＳｃｒｉｂｅＧＦＸＰｕｒｉｆｉｃａｔｉｏｎＫｉｔ（ＡｍｅｒｓｈａｍＰｈａｒｍａｃｉａ社）を用いて精製し、滅菌水１７μｌにてカラムから溶出した。これに３μｌの１０μｇ／μｌｏｌｉｇｏ（ｄＡ），３μｌの酵母ｔＲＮＡ２０μｇ／μｌ，１μｌの２０μｇ／μｌマウスＣｏｔ１ＤＮＡ，５．１μｌの２０ＸＳＳＣ，および０．９μｌの１０％ＳＤＳからなるブロッキング溶液を混和してＣｙＤｙｅ標識ｃＤＮＡプローブを調製した。
【０１３１】
（３）ＤＮＡマイクロアレイのハイブリダイゼイション
発現解析対象組織由来ｃＤＮＡプローブ（Ｃｙ３標識）と対照の１７．５日齢胎児由来ｃＤＮＡプローブ（Ｃｙ５標識）を混和した溶液３０μｌを９５℃にて１分間熱処理を行い室温にて冷却した。ＤＮＡマイクロアレイに上記プローブ溶液を添加しカバースリップを被せ、Ｈｙｂｒｉｃａｓｅｔｔｅ（ＡｒｒａｙＩｔ社）中にて６５℃一晩ハイブリダイズさせた。次に、ＤＮＡマイクロアレイを２ＸＳＳＣ，０．１％ＳＤＳを用いて洗浄し、続いて１ＸＳＳＣにて２分間、０．１ＸＳＳＣにて２分間リンスした。マイクロアレイはＳｃａｎＡｒｒａｙ５０００共焦点レーザースキャナーを用いてスキャンし、画像をＩＭＡＧＥＮＥ（ＢｉｏＤｉｓｃｏｖｅｒｙ社）で解析した。
【０１３２】
（４）データ解析
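The amount of mRNA in each tissue (Cy3-labeled) was expressed as the logarithm (log_2) of its ratio (Cy3/Cy5) to the amount of mRNA from the control 17.5-day whole embryo (Cy5-labeled). That is, for each mouse full-length cDNA analyzed, the value is positive when the corresponding mRNA is more abundant in the tissue than in the control, negative when it is less abundant, and 0 when the two are equal. To improve the accuracy of the data, the experiment was performed twice independently and only reproducible results were adopted. The results are shown in Table 1.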
【０１３３】
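In general, changes of about twofold in expression analyses using DNA arrays are regarded as experimental error. Accordingly, a value of 1 or more was interpreted as a significant increase, meaning that the amount of mRNA in the tissue is at least twice that in the control 17.5-day whole embryo; conversely, a value of -1 or less was interpreted as a significant decrease, meaning that the amount of mRNA in the tissue is one half or less of that in the control. When comparing mRNA expression between tissues, a difference of 1 between the values corresponds to a 2-fold difference in mRNA amount and a difference of 2 to a 4-fold difference; conversely, a difference of -1 corresponds to 1/2-fold and a difference of -2 to 1/4-fold.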
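The interpretation rules above (log_2 of the Cy3/Cy5 ratio, thresholds of 1 and -1, and fold differences between tissues) can be sketched as follows. The function names and the signal intensities are hypothetical and serve only to illustrate the arithmetic; they are not values from Table 1.

```python
import math


def log2_ratio(cy3_signal: float, cy5_signal: float) -> float:
    """log2(Cy3/Cy5): tissue signal relative to the whole-embryo control."""
    if cy3_signal <= 0 or cy5_signal <= 0:
        raise ValueError("signals must be positive")
    return math.log2(cy3_signal / cy5_signal)


def classify(value: float) -> str:
    """Apply the thresholds from the text: >= 1 increased, <= -1 decreased."""
    if value >= 1:
        return "increased"
    if value <= -1:
        return "decreased"
    return "unchanged"


def fold_between(value_a: float, value_b: float) -> float:
    """Fold difference in mRNA amount between two tissues, from their values."""
    return 2.0 ** (value_a - value_b)


# Hypothetical intensities, for illustration only.
brain_like = log2_ratio(400.0, 100.0)
liver_like = log2_ratio(100.0, 200.0)
labels = (classify(brain_like), classify(liver_like), classify(0.3))
```

Working on the log scale makes comparison between tissues a simple subtraction, which is why a difference of 2 corresponds to a 4-fold difference.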
【０１３４】
【表１】

【０１３５】
表１から明らかなように、ｄｎａｆｏｒｍ５０４４１は、それ自身をターゲット配列とした発現解析により、対照に比べて、脳、小脳、精巣で強く発現することが分かる。それ以外の臓器では全般的に発現が弱まる傾向があることが分かる。
【０１３６】
実施例６タンパク質−タンパク質相互作用解析
哺乳動物細胞におけるｔｗｏ−ｈｙｂｒｉｄ法（Ｓｕｚｕｋｉ，Ｈ．，ｅｔａｌ．，ＧｅｎｏｍｅＲｅｓｅａｒｃｈ，１１，１７５８−１７６５（２００１））を用いて、マウス全長ｃＤＮＡの塩基配列（ｄｎａｆｏｒｍ５０４４１）がコードするタンパク質のタンパク質−タンパク質相互作用を網羅的に解析した。
（１）ＰＣＲ法を用いた迅速なサンプル調製
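The mammalian two-hybrid experiments used the CheckMate mammalian two-hybrid system (Promega). Samples for protein-protein interaction analysis were prepared using as templates the plasmid vector pBIND, in which the DNA-binding domain of the Gal4 gene is inserted downstream of the CMV promoter; the plasmid vector pACT, in which the transcriptional activation domain of the VP16 gene is inserted downstream of the CMV promoter; and the plasmid vector pG5luc, in which the luciferase reporter gene is inserted downstream of five Gal4 binding sites and a TATA box. The fusion gene of the Gal4 gene with the protein-coding sequence of the mouse full-length cDNA (dnaform50441), and the fusion genes of the VP16 gene with the protein-coding sequences of the full-length cDNAs carried by each clone of the mouse cDNA library FANTOM (http://fantom.gsc.riken.go.jp/), were constructed essentially according to the Promega protocol, by combining ligation through a common sequence with two-step PCR (see Figure 1 of Suzuki, H., et al., Genome Research, 11, 1758-1765 (2001)). The protein-coding sequence of each mouse cDNA was PCR-amplified using a forward primer carrying the common sequence on its 5' side and a gene-specific sequence on its 3' side together with the M13 universal primer. The amplification product was then mixed with a PCR amplification product of pBIND or pACT (with the common sequence added on the 3' side), and a second round of PCR amplification was performed with nested primers in each case, to construct a vector expressing a fusion protein of Gal4 and the mouse protein (BIND sample) or a vector expressing a fusion protein of VP16 and the mouse protein (ACT sample).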
【０１３７】
（２）ハイスループットな哺乳動物細胞でのｔｗｏ−ｈｙｂｒｉｄ実験
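The BIND and ACT samples prepared by PCR were used directly without further purification. 0.25 μl each of the BIND sample and the ACT sample, 30 ng of pG5luc, and 9.5 μl of Opti-MEM medium (Lifetech) were dispensed into a 384-well plate. 10 μl of LF2000 transfection reagent (Lifetech), diluted 32-fold in Opti-MEM medium, was added to each well and mixed; after a 20-minute incubation, 20 μl of a suspension of CHO-K1 Chinese hamster cells at 1,300 cells/μl in F12 medium was added and mixed thoroughly. After the assay samples were cultured in a CO_2 incubator for 20 hours, luciferase activity was measured with the Steady-Glo Luciferase Assay System (Promega) to confirm interactions.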
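The per-well quantities implied by the volumes above can be worked out with a few lines of arithmetic. The sketch below uses only the figures given in the text; the variable names are illustrative, and the volume contributed by the 30 ng of pG5luc is not stated in the text, so it is left out of the total.

```python
# Per-well volumes from the text, in microlitres.
BIND_UL = 0.25
ACT_UL = 0.25
OPTIMEM_UL = 9.5
DILUTED_LF2000_UL = 10.0
CELL_SUSPENSION_UL = 20.0

CELLS_PER_UL = 1300        # CHO-K1 density in F12 medium
LF2000_DILUTION = 32       # fold dilution in Opti-MEM
WELLS_PER_PLATE = 384

# Cells seeded into each well.
cells_per_well = CELLS_PER_UL * CELL_SUSPENSION_UL

# Undiluted LF2000 reagent contained in the 10 ul of diluted reagent.
neat_lf2000_ul = DILUTED_LF2000_UL / LF2000_DILUTION

# Total liquid per well, excluding the unstated pG5luc volume.
total_volume_ul = (BIND_UL + ACT_UL + OPTIMEM_UL
                   + DILUTED_LF2000_UL + CELL_SUSPENSION_UL)

# Cells needed to fill one whole plate, before any pipetting excess.
cells_per_plate = cells_per_well * WELLS_PER_PLATE
```

Under these figures each well receives 26,000 cells in about 40 μl, which gives a rough basis for scaling the number of cells and the amount of reagent to a whole plate.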
【０１３８】
（３）解析結果
The results of (2) above are shown in Table 2. The protein encoded by the mouse full-length cDNA (dnaform50441) was found to interact, as described below, with proteins encoded by the cDNAs of specific clones of the mouse cDNA library FANTOM (http://fantom.gsc.riken.go.jp/).
【０１３９】
実施例４より、ｄｎａｆｏｒｍ５０４４１のオープンリーディングフレームから予測されるアミノ酸配列（配列番号５）を有するタンパク質（以下、これを「本タンパク質」と称する）はｐａｌｍｉｔｏｙｌｔｒａｎｓｆｅｒａｓｅであると推測されている。ｐａｌｍｉｔｏｙｌｔｒａｎｓｆｅｒａｓｅは脂肪組織からの脂肪酸の流動やミトコンドリアへの脂肪酸輸送に関与することが知られており（ＩｎｔＪＳｐｏｒｔｓＭｅｄ１９９８１９（４）：２３１−４４）、脂質代謝、糖尿病との関連が示唆される。表２から明らかな通り、本タンパク質はｈｙｐｏｔｈｅｔｉｃａｌｏｕｔｅｒａｒｍｄｙｎｅｉｎｌｉｇｈｔｃｈａｉｎ１ｓｔｒｕｃｔｕｒｅｃｏｎｔａｉｎｉｎｇｐｒｏｔｅｉｎと相互作用することが認められた。Ｏｕｔｅｒａｒｍｄｙｎｅｉｎは、モーター蛋白質の１つであり、ＡＡＡ＋ファミリーＡＴＰａｓｅに属し、繊毛や鞭毛の運動を担う「軸糸ダイニン」と細胞内において膜小胞などの輸送や細胞分裂などを担う「細胞質ダイニン」の２種類がある。本タンパク質が、ｈｙｐｏｔｈｅｔｉｃａｌｏｕｔｅｒａｒｍｄｙｎｅｉｎｌｉｇｈｔｃｈａｉｎ１ｓｔｒｕｃｔｕｒｅｃｏｎｔａｉｎｉｎｇｐｒｏｔｅｉｎと相互作用したことから、本タンパク質の有するｐａｌｍｉｔｏｙｌｔｒａｎｓｆｅｒａｓｅとしての脂肪酸代謝酵素活性が、繊毛や鞭毛の運動、または細胞内において膜小胞などの輸送や細胞分裂などに関連することが推測された。また、ｂｅｔａ−ａｄｒｅｎｅｒｇｉｃｒｅｃｅｐｔｏｒのアゴニスト刺激が、気道上皮細胞においてＯｕｔｅｒａｒｍｄｙｎｅｉｎのリン酸化をもたらすことが知られていることから（Ｊ．ＡｌｌｅｒｇｙＣｌｉｎＩｍｍｕｎｏｌ２００２１１０（６Ｓｕｐｐｌ）：Ｓ２７５−８１）、本タンパク質が気管の繊毛運動等を介して気管支喘息等の呼吸器疾患に関連していることが推測された。
また、本タンパク質は、ｚｉｎｃｆｉｎｇｅｒｐｒｏｔｅｉｎ，ｓｕｂｆａｍｉｌｙ１Ａ，３（Ａｉｏｌｏｓ）と相互作用することが認められた。Ａｉｏｌｏｓは、Ｉｋａｒｏｓと相互作用することによってリンパ球細胞の分化を調節するリンパ系に限定した転写因子であり（Ｇｅｎｏｍｉｃｓ，６１：３２６−９（１９９９））、リンパ球の分化異常は免疫不全などの免疫系疾患をもたらす。本タンパク質が、Ａｉｏｌｏｓと相互作用したことから、本タンパク質が免疫不全などの免疫系疾患に関連することが推測された。
【０１４０】
【表２】

【０１４１】
実施例７完全長ｃＤＮＡがコードするタンパク質の総合的機能解析
ｄｎａｆｏｒｍ５０４４１のオープンリーディングフレームから予測されるアミノ酸配列（配列番号５）を有するタンパク質（以下、「本タンパク質」と称する）はｐａｌｍｉｔｏｙｌｔｒａｎｓｆｅｒａｓｅであると推測された。また、本タンパク質は、実施例５より、脳、小脳、精巣で強く発現していることが示された。ｐａｌｍｉｔｏｙｌｔｒａｎｓｆｅｒａｓｅは、脂質代謝、癌、アポトーシス、ＰＰＡＲとの関連などが報告されており、また、アルツハイマー病において、脂質代謝において重要な役割を担っているアポリポ蛋白質Ｅ４との関連を含めた脂質代謝との関連が報告されている（Ｂｉｏｃｌｉｎｉｃａ２００２年８増大月号）。したがって、本タンパク質または本タンパク質の発現制御物質、機能賦活物質、あるいは機能阻害物質は、癌、糖尿病、アルツハイマー型痴呆、パーキンソン病、舞踏病、虚血性脳疾患、糖尿病性末梢神経障害、不妊症などの治療薬として開発できる可能性がある。
【０１４２】
【発明の効果】
本発明のタンパク質およびそれをコードするＤＮＡはトランスフェラーゼ活性または運搬体活性等を有することから、該タンパク質あるいはそれをコードするＤＮＡを用いて該活性を調節する物質をスクリーニングすることができ、該タンパク質が関連する疾患等に作用し得る医薬の開発に有用である。
本出願は、２００２年５月２日付けの日本特許出願（特願２００２−１３０７０２）および２００２年１２月４日付けの日本特許出願（特願２００２−３５２６９４）に基づくものであり、その内容はここに参照として取り込まれる。また、本明細書にて引用した文献の内容もここに参照として取り込まれる。
【０１４３】
【配列表】

[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to a novel protein, a DNA encoding the protein, a full-length cDNA encoding the protein, a recombinant vector having the DNA, an oligonucleotide comprising a partial sequence of the DNA, a transgenic cell into which the DNA has been introduced, an antibody that specifically binds to the protein, and the like.
[0002]
[Prior art]
Obtaining cDNA and analyzing its base sequence are indispensable for analyzing the physiological activity of a protein expressed in a living body and for developing methods of using the protein based on that activity. Furthermore, creating a library in which full-length cDNAs corresponding to all gene species are cataloged is one of the important tasks of the Human Genome Project. A cataloged library means a library with no duplication among the cDNAs it contains, that is, a library containing each cDNA once.
[0003]
The full-length cDNA cloning method is described in JP-A-9-248187 and JP-A-10-127291. This method comprises a step of binding a tag molecule to the diol structure present at the 5' cap site of mRNA, and a step of preparing RNA-DNA complexes by reverse transcription using the tagged mRNA as a template and oligo dT as a primer, then using the function of the tag molecule to separate those complexes whose DNA corresponds to the full length of the mRNA.
[0004]
As an efficient reverse transcription method, a method of carrying out the reaction at a high temperature at which the template does not form higher-order structure has also been developed (JP-A-10-84961). Furthermore, a cloning vector has been developed that can uniformly clone the DNA fragments contained in a synthesized full-length cDNA library regardless of their chain length (JP-A-11-9273).
[0005]
A full-length cDNA library produced by such techniques does not necessarily contain every distinct clone in equal proportion: some clones are highly abundant, while others are present only in trace amounts. Since clones present only in trace amounts are highly likely to be novel, subtraction and normalization methods for enriching such clones have also been developed (JP-A-2000-325080; Carninci, P. et al., Genomics, 37, 327-336 (1996)).
The nucleotide sequence of each clone of the cataloged full-length cDNA library thus obtained can be determined by known methods, but even once the nucleotide sequence has been determined, the physiological activity of the protein encoded by the cDNA remains unknown.
[0006]
[Problems to be solved by the invention]
An object of the present invention is to analyze the nucleotide sequences of cDNA clones contained in a cataloged full-length cDNA library, to identify, for those having a novel sequence, the physiological activity of the protein encoded by the cDNA, and to propose methods of using the protein and the DNA encoding it based on that activity.
[0007]
[Means for Solving the Problems]
The present inventors analyzed the nucleotide sequences of cDNA clones in a mouse full-length cDNA library and searched databases on the basis of sequence homology. As a result, they found within these sequences regions characteristic of proteins having transferase activity or carrier activity, and identified the proteins encoded by these cDNAs as having transferase activity or carrier activity. They further (i) analyzed the expression level of these cDNAs in each tissue and (ii) expressed the proteins encoded by the cDNAs and analyzed their interactions with other proteins, and from the results of (i) and/or (ii) comprehensively analyzed the functions of the proteins encoded by the cDNAs. The present invention has been achieved based on these findings.
[0008]
That is, according to the present invention, the following inventions (1) to (25) are provided.
(1) A protein of the following (a) or (b):
(a) a protein consisting of the amino acid sequence of SEQ ID NO: 4 or 5;
(b) a protein consisting of an amino acid sequence in which one or several amino acids have been deleted, substituted and/or added in the amino acid sequence of SEQ ID NO: 4 or 5, and having transferase activity.
[0009]
(2) A DNA encoding the protein of (1).
(3) A full-length cDNA encoding the protein according to (1).
(4) A DNA of any one of the following (a), (b) or (c):
(a) a DNA having the nucleotide sequence of SEQ ID NO: 1 or 2;
(b) a DNA having a base sequence in which one or several bases are deleted, substituted and/or added in the base sequence of SEQ ID NO: 1 or 2, and encoding a protein having transferase activity;
(c) a DNA having a base sequence capable of hybridizing under stringent conditions with a DNA having the base sequence of SEQ ID NO: 1 or 2 or a sequence complementary thereto, and encoding a protein having transferase activity.
[0010]
(5) A protein of the following (a) or (b):
(a) a protein consisting of the amino acid sequence of SEQ ID NO: 6;
(b) a protein consisting of an amino acid sequence in which one or several amino acids have been deleted, substituted and/or added in the amino acid sequence of SEQ ID NO: 6, and having carrier activity.
[0011]
(6) A DNA encoding the protein of (5).
(7) A full-length cDNA encoding the protein according to (5).
(8) A DNA of any one of the following (a), (b) or (c):
(a) a DNA having the nucleotide sequence of SEQ ID NO: 3;
(b) a DNA having a base sequence in which one or several bases are deleted, substituted and/or added in the base sequence of SEQ ID NO: 3, and encoding a protein having carrier activity;
(c) a DNA having a base sequence capable of hybridizing under stringent conditions with a DNA having the base sequence of SEQ ID NO: 3 or a sequence complementary thereto, and encoding a protein having carrier activity.
[0012]
(9) A recombinant vector containing the DNA according to any one of (2) to (4).
(10) A gene-transfected cell into which the DNA according to any one of (2) to (4) or the recombinant vector according to (9) has been introduced, or an individual comprising the cell.
(11) The protein according to (1), which is produced by the cell according to (10).
[0013]
(12) A recombinant vector containing the DNA according to any one of (6) to (8).
(13) A gene-transfected cell into which the DNA according to any one of (6) to (8) or the recombinant vector according to (12) has been introduced, or an individual comprising the cell.
(14) The protein according to (5), which is produced by the cell according to (13).
[0014]
(15) An oligonucleotide selected from the group consisting of a sense oligonucleotide having the same sequence as 5 to 100 consecutive nucleotides in the base sequence of the DNA according to any of (2) to (4) or (6) to (8), an antisense oligonucleotide having a sequence complementary to the sense oligonucleotide, and an oligonucleotide derivative of the sense or antisense oligonucleotide.
[0015]
(16) An antibody or a partial fragment thereof that specifically binds to the protein according to (1) or (11).
(17) The antibody according to (16), wherein the antibody is a monoclonal antibody.
(18) The antibody according to (17), wherein the monoclonal antibody has an action of neutralizing the transferase activity of the protein according to (1) or (11).
[0016]
(19) An antibody or a partial fragment thereof that specifically binds to the protein according to (5) or (14).
(20) The antibody according to (19), wherein the antibody is a monoclonal antibody.
(21) The antibody according to (20), wherein the monoclonal antibody has an action of neutralizing the carrier activity of the protein according to (5) or (14).
[0017]
(22) A method for screening for a substance that modulates the activity of the protein according to any one of (1), (5), (11) and (14), comprising contacting the protein with a test substance and measuring a change in the activity of the protein caused by the test substance.
(23) A method for screening for a substance that modulates expression of the gene, comprising contacting a test substance with the gene-transfected cell according to (10) or (13) and detecting a change in the expression level of the DNA introduced into the cell.
(24) A computer-readable recording medium storing at least one piece of amino acid sequence information selected from the amino acid sequences of the protein according to (1) or (5), and/or at least one piece of nucleotide sequence information selected from the nucleotide sequences of the DNA according to any of (2) to (4) or (6) to (8).
(25) A carrier to which the protein according to (1) or (5) and / or the DNA according to any of (2) to (4) or (6) to (8) is bound.
[0018]
BEST MODE FOR CARRYING OUT THE INVENTION
Hereinafter, the present invention will be described in more detail.
(1) Acquisition of full-length cDNA and analysis of nucleotide sequence
The DNA of the present invention may be any DNA that can encode a protein consisting of the amino acid sequence of any of SEQ ID NOs: 4 to 6, or a protein that consists of an amino acid sequence in which one or several amino acid residues of that sequence are substituted, deleted, inserted, added, or inverted (the number is not particularly limited; it means, for example, 20 or less, preferably 15 or less, more preferably 10 or less, and still more preferably 5 or less) and that has transferase activity or carrier activity. Specifically, it may consist only of the translation region encoding the amino acid sequence, or it may include the full length of the cDNA. Here, carrier activity means an activity of transporting a specific substance into or out of a cell. The substance transported is not particularly limited. A protein having carrier activity may hereinafter be referred to as a "carrier", "transporter protein", or "transporter".
[0019]
Specifically, examples of the DNA containing the full-length cDNA include a DNA comprising the nucleotide sequence of any of SEQ ID NOs: 1 to 3. Examples of the translation region include those having the sequences represented by base numbers 78 to 2198 of SEQ ID NO: 1, base numbers 235 to 2631 of SEQ ID NO: 2, and base numbers 26 to 940 of SEQ ID NO: 3. Further, the DNA of the present invention includes not only the full length of the above cDNA but also DNA containing the above translation region together with the minimum portions adjacent to its 3' and/or 5' end that are necessary for expression of the translation region.
[0020]
The DNA of the present invention may be obtained by any method, but specifically it can be obtained, for example, by the method described below. First, mRNA is prepared from tissue or the like of a suitable animal, preferably a mammal, by a commonly used method known per se. Next, cDNA is synthesized using this mRNA as a template. For this step it is preferable to use the method in which a tag molecule is chemically bound to the diol structure specific to the 5' cap (7MeGpppN) site, reverse transcription is performed using this mRNA as a template and oligo dT as a primer, and only the full-length cDNA is then separated using the function of the tag molecule (JP-A-9-248187, JP-A-10-127291). In addition, to prevent the template from forming higher-order structure and lowering the efficiency of reverse transcription, it is preferable to carry out the reverse transcription at a high temperature using a thermostable reverse transcriptase in the presence of trehalose or the like (JP-A-10-84961). Here, high temperature means 40 to 80°C.
[0021]
The cDNA thus obtained is cloned by inserting it into an appropriate cloning vector. The vector used here is preferably one that has recombination recognition sequences at both ends of a cloning site capable of uniformly cloning DNAs of various chain lengths, and that is a linear vector introduced into the host by a method other than infection (JP-A-11-9273). In the cDNA library thus obtained, not all clones are present uniformly (uniform presence is hereinafter sometimes referred to as "cataloged"), and clones present only in trace amounts in the library are highly likely to be novel. It is therefore preferable to use a subtraction method or a normalization method for enriching such clones (JP-A-2000-325080; Carninci, P. et al., Genomics, 37, 327-336 (1996)).
[0022]
The nucleotide sequences of the clones in the cataloged cDNA library are analyzed by a commonly used method known per se. For the DNA of the present invention, in the case of full-length cDNA, a base sequence taken from the terminal 100 bases was used to search NCBI databases such as GenBank, EMBL, and DDBJ with BLAST (http://www.ncbi.nlm.nih.gov/BLAST/; National Center of Biotechnology Information), and sequences for which even the most homologous database entry showed a similarity of 30% or less were treated as novel sequences and subjected to the following analysis.
[0023]
Examples of the DNA having the base sequence of such a full-length cDNA include a DNA having the base sequence described in SEQ ID NOs: 1 to 3, and the like. Examples of the translation region include those having sequences represented by base numbers 78 to 2198 of SEQ ID NO: 1, base numbers 235 to 2631 of SEQ ID NO: 2, and base numbers 26 to 940 of SEQ ID NO: 3.
[0024]
The novel nucleotide sequence thus obtained is subjected to a homology search by BLAST (Basic local alignment search tool; Altschul, S.F., et al., J. Mol. Biol., 215, 403-410 (1990)) and to a protein feature search by HMMPFAM (http://pfam.edust.edu.edu), one of the functions of HMMER (a sequence analysis method using a hidden Markov model; Eddy, S.R., Bioinformatics, 14, 755-763 (1998)), whereby the function of the protein encoded by the nucleotide sequence can be estimated.
[0025]
In the homology search by BLAST, the function of the clone under analysis can be estimated from the various annotation information attached to hit sequences with sufficiently significant homology obtained in the search. Here, a sufficiently significant hit sequence means one for which the degree of agreement between the functional domain portion of the registered amino acid sequence and the corresponding portion of the amino acid sequence encoded by the DNA of the present invention is 10^-4 or less as an e-value, or 30% or more.
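The significance criterion above (an e-value of 10^-4 or less, or agreement of 30% or more) can be sketched as a simple filter. The class and function names are illustrative assumptions; the first three hits reuse the accession numbers, e-values, and identities reported in Example 4 for SEQ ID NO: 3, while the last hit is a made-up weak match included only to show a rejection.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    accession: str
    e_value: float
    identity_pct: float


def is_significant(hit: Hit, max_e_value: float = 1e-4,
                   min_identity_pct: float = 30.0) -> bool:
    """Apply the criterion from the text: e-value <= 1e-4 or identity >= 30%."""
    return hit.e_value <= max_e_value or hit.identity_pct >= min_identity_pct


hits = [
    Hit("AC004832", 5e-109, 85.0),   # reported in Example 4
    Hit("AF309558", 2e-86, 64.0),    # reported in Example 4
    Hit("AF487977", 2e-84, 62.0),    # reported in Example 4
    Hit("HYPOTHETICAL_WEAK", 0.5, 22.0),  # invented for illustration
]

significant = [h.accession for h in hits if is_significant(h)]
```

Because the text joins the two conditions with "or", a hit passes when either condition holds; a stricter reading requiring both would replace `or` with `and`.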
[0026]
For example, if many of the functional domain sequences among the top hits are confirmed to function as transferases or carriers, it can be predicted that the clone under analysis, being similar to them in sequence, has the same function, that is, transferase activity or carrier activity.
[0027]
In HMMPFAM, the analysis checks whether the amino acid sequence under analysis has the characteristics of an entry in Pfam, a database of accumulated protein profiles. Each profile is extracted from a series of proteins sharing the same characteristics, so even when the function cannot be clarified by a one-to-one comparison of full-length sequences, a characteristic region present in the sequence can be found and its function predicted. Specific examples of protein function prediction performed in this way are described below.
[0028]
In a BLAST search, the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1 hit database accession number AL031678 (Human DNA sequence from clone RP4-816K17 on chromosome 20p12.2-13; contains the TGM3 gene for transglutaminase 3) with an e-value of 0.0 and 87% identity over 442 amino acid residues; database accession number Q08188 (human Protein-glutamine glutamyltransferase E3 precursor) with an e-value of 0.0 and 49% identity over 705 amino acid residues; and database accession number Q08189 (mouse Protein-glutamine glutamyltransferase E3 precursor) with an e-value of 0.0 and 48% identity over 705 amino acid residues.
In addition, when a protein feature search by HMMPFAM was performed on the amino acid sequence encoded by the base sequence represented by SEQ ID NO: 1, the following were found: a sequence showing the characteristics of transglutaminase-like superfamily (a sequence entered in Pfam as "Transglutamin_N"), a sequence showing the characteristics of transglutaminase lipase-like lipase (a sequence entered in Pfam as "Transglut_core"), and a sequence showing the characteristics of Transglutaminase family and C-terminal ultimate like domain (a sequence entered in Pfam as "Transglutamine_C"). From these facts, it can be inferred that the protein encoded by the nucleotide sequence shown in SEQ ID NO: 1 has, among transferase activities, transglutaminase activity.
[0029]
In a BLAST search, the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 2 hit database accession number AF357970 (Homo sapiens carnitine palmitoyltransferase IC) with an e-value of 0.0 and 83% identity over 802 amino acid residues, and database accession number U88294 (Rattus norvegicus carnitine palmitoyltransferase I (CPTI)) with an e-value of 0.0 and 53% identity over 765 amino acid residues. In the protein feature search by HMMPFAM, a sequence showing the characteristics of Choline/Carnitine o-acyltransferase (a sequence entered in Pfam as "Carn_acyltransf") was found. From these facts, it is presumed that the protein encoded by the nucleotide sequence shown in SEQ ID NO: 2 has, among transferase activities, palmitoyltransferase activity in particular.
[0030]
In a BLAST search, the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 3 hit database accession number AC004832 (Homo sapiens PAC clone RP4-539M6 from 22; human SPF) with an e-value of 5 × 10^-109 and 85% identity over 221 amino acid residues; database accession number AF309558 (Rattus norvegicus supernatant protein factor; rat SPF) with an e-value of 2 × 10^-86 and 64% identity over 221 amino acid residues; and AF487977 (Bos taurus tocopherol-associated protein) with an e-value of 2 × 10^-84 and 62% identity over 221 amino acid residues. In the protein feature search by HMMPFAM, a sequence showing the characteristics of the CRAL/TRIO domain (a sequence entered in Pfam as "CRAL_TRIO") was found; proteins having this domain are considered to be carrier proteins for retinaldehyde, phosphatidylcholine, alpha-tocopherol, or the like. A sequence showing the characteristics of the CRAL/TRIO N-terminus (a sequence entered in Pfam as CRAL/TRIO_N) was also found. Literature information (PNAS 2000, 98, 2244-2249) indicates that human SPF has a squalene transport function: human SPF binds squalene and transports it to microsomal squalene epoxidase, thereby promoting the conversion to squalene 2,3-oxide and initiating the late stage of sterol biosynthesis. Further literature information (BBRC 2001, 285, 295-299) shows that human SPF has an alpha-tocopherol-dependent transcription-promoting activity. From these facts, it can be inferred that the protein encoded by the nucleotide sequence shown in SEQ ID NO: 3 has carrier protein activity for squalene, retinaldehyde, phosphatidylcholine, alpha-tocopherol or the like, or a transcription-promoting activity.
[0031]
The DNA of the present invention may be obtained in a state in which bases are deleted from or inserted into the translated region. If the homology search or protein feature search described above, performed on the determined base sequence of the DNA, suggests such a deletion or insertion, a full-length cDNA free of base deletions and insertions can be obtained by a method commonly used in the art, such as library screening or PCR cloning. The protein of the present invention can be expressed using the full-length cDNA thus obtained and used for functional analysis.
[0032]
The DNA of the present invention, obtained in this way with its base sequence determined and its function presumed, is not limited to DNA having the base sequence described in SEQ ID NOs: 1 to 3 or DNA having that base sequence as its translated region. It also includes DNA that has a base sequence in which one or several bases (the number is not particularly limited, but is, for example, 60 or less, preferably 30 or less, more preferably 20 or less, and still more preferably 10 or less) are deleted, substituted and/or added in these base sequences and that encodes a protein having transferase activity or carrier activity, as well as DNA that hybridizes with these DNAs under stringent conditions and encodes a protein having transferase activity or carrier activity. As described above, these DNAs also include DNA encoding a protein that comprises an amino acid sequence in which one or several amino acids have been deleted, substituted and/or added in the amino acid sequence described in SEQ ID NOs: 4 to 6 and that has transferase activity or carrier activity.
Here, examples of DNA that hybridizes under stringent conditions include DNA containing a base sequence having a homology of 80% or more, preferably 90% or more, and more preferably 95% or more with the base sequence described in SEQ ID NOs: 1 to 3 by BLAST analysis. Hybridization under stringent conditions can be performed, for example, by carrying out the reaction in an ordinary hybridization buffer at a temperature of 40 to 70 °C, preferably 60 to 65 °C, and then washing in a washing solution having a salt concentration of 15 mM to 300 mM, preferably 15 mM to 60 mM.
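The numerical ranges in this paragraph can be written down as two small checks. This is only a sketch: the function names and return labels are hypothetical, and reading the salt concentration as that of the washing solution is an assumption about the intended meaning of the text.

```python
def identity_tier(percent_identity):
    """Map a percent homology (by BLAST analysis) to the tiers in the text."""
    if percent_identity >= 95:
        return "more preferable"
    if percent_identity >= 90:
        return "preferable"
    if percent_identity >= 80:
        return "included"
    return None  # below 80%: outside the exemplified range


def within_stringent_ranges(hybridization_temp_c, wash_salt_mm):
    """Check the broad ranges: 40 to 70 degrees C for the hybridization
    reaction and 15 to 300 mM salt for the washing solution (assumed)."""
    temp_ok = 40 <= hybridization_temp_c <= 70
    salt_ok = 15 <= wash_salt_mm <= 300
    return temp_ok and salt_ok
```

For example, `identity_tier(92.5)` falls in the "preferable" tier, and a 65 °C reaction with a 30 mM wash lies inside both ranges.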
[0033]
Further, the DNA of the present invention may be obtained by the above-described method or may be synthesized. Bases in the DNA sequence can easily be substituted using a commercially available kit such as a site-directed mutagenesis kit (Takara Shuzo) or the QuikChange Site-Directed Mutagenesis Kit (Stratagene).
[0034]
The nucleotide sequences described in SEQ ID NOs: 1 to 3 are derived from mouse. By preparing a human cDNA library according to the above-described method for preparing a cDNA library and hybridizing the library with a DNA fragment having the nucleotide sequence of SEQ ID NOs: 1 to 3 as a probe, a DNA encoding a human homolog of the protein encoded by the nucleotide sequence of SEQ ID NOs: 1 to 3 (hereinafter sometimes referred to as "human homolog DNA") can be obtained. The DNA that hybridizes under stringent conditions with a DNA having the base sequence of SEQ ID NOs: 1 to 3 or a sequence complementary thereto includes such human homolog DNA and the human ortholog DNA described below.
[0035]
Further, it is also possible to predict the base sequence of the human homolog DNA by using informatics, and to obtain the human homolog DNA from the above-mentioned human cDNA library or the like based on that base sequence.
In general, methods for predicting, by informatics, a base sequence encoding a homolog of a target protein include, for example: (i) a method in which a homology search using BLAST or the like is performed against a cDNA database of the species of interest (including cDNA databases predicted by informatics), with the base sequence of the target cDNA as a query; (ii) a method in which a homology search using BLAST or the like is performed against an EST database of the species of interest, with the base sequence of the target cDNA as a query, and the sequences of the hit ESTs are linked with reference to the base sequence of the target cDNA; and (iii) a method in which a homology search using BLAST or the like is performed against the genome database of the species of interest, with the base sequence of the target cDNA as a query, the position on the genome of the gene corresponding to the target cDNA is identified, and the nucleotide sequence of the gene portion in the genome is predicted using Genscan (http://genes.mit.edu/GENSCAN.html), Sim4 (Genome Res., 8: 977-74 (1998)), or the like.
[0036]
When predicting the nucleotide sequence of the human homolog DNA of a mouse-derived cDNA, any of the above methods can be used. However, because every cDNA having the nucleotide sequence of SEQ ID NOs: 1 to 3 of the present invention is novel, method (i) is not expected to yield the nucleotide sequence of the human homolog DNA, so the method described in (ii) or (iii) is preferably used.
[0037]
Based on the nucleotide sequence of the human homolog DNA thus predicted, a human homolog DNA corresponding to the DNA having the nucleotide sequence of SEQ ID NOs: 1 to 3 can be obtained from the above human cDNA library. Specific methods for obtaining it include, for example, a method in which PCR is performed with the above human cDNA library as a template, using primers having nucleotide sequences complementary to the 5' end and 3' end of the predicted human homolog DNA, and a method in which the above human cDNA library is hybridized with a partial sequence of the predicted human homolog DNA as a probe.
[0038]
In general, a similar gene whose nucleotide sequence is highly homologous to that of a target gene is referred to as a "homolog", and the methods described above likewise aim to obtain human homolog DNA. It is important, however, to confirm not only that the sequences are similar but also that the gene obtained as a homolog is a family member of the target gene. Genes obtained as "homologs" between two species may be "orthologs", that is, the same gene evolved from a common ancestral gene, or may be "paralogs", that is, different genes that arose by duplication of a common ancestral gene.
[0039]
That is, in order to estimate and verify that the protein encoded by the human-derived DNA obtained as a homolog has the same function as the protein of the present invention in mice, it is preferable to confirm that the human homolog is an ortholog, in a closely related species, of the mouse gene of the present invention.
[0040]
For example, the following method is used to confirm orthology. (i) First, the homology between the nucleotide sequence of the cDNA of the present invention and the nucleotide sequence of the obtained human homolog DNA is analyzed. Next, using the base sequence of the cDNA of the present invention as a query, a homology search is performed against the human base sequences contained in international base sequence databases such as DDBJ, EMBL, and GenBank and in patent databases, and it is confirmed that the degree of matching between the cDNA of the present invention and the human homolog DNA is higher than the degree of matching between the query and any base sequence obtained from the databases. (ii) Conversely, the homology between the nucleotide sequence of the obtained human homolog DNA and the corresponding nucleotide sequence of the cDNA of the present invention is analyzed. Next, using the base sequence of the obtained human homolog DNA as a query, a homology search is performed against the mouse base sequences contained in international base sequence databases such as DDBJ, EMBL, and GenBank and in patent databases, and it is confirmed that the degree of matching between the human homolog DNA and the cDNA of the present invention is higher than the degree of matching between the query and any base sequence obtained from the databases. By confirming both (i) and (ii), the obtained human homolog DNA can be identified as the human ortholog DNA corresponding to the cDNA of the present invention. The homology analysis described in (i) and (ii) may also be performed by comparing amino acid sequences, or by drawing and examining a molecular evolutionary phylogenetic tree. In addition, the degree of matching in the homology analysis described in (i) and (ii) is preferably evaluated over the entire length of the query.
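Checks (i) and (ii) together amount to what is commonly called a reciprocal best hit test. A minimal sketch follows, assuming the search results have already been reduced to a mapping from database sequence identifier to degree of matching; all identifiers and scores are hypothetical.

```python
def best_hit(scores):
    """Return the identifier with the highest degree of matching, or None."""
    return max(scores, key=scores.get) if scores else None


def is_ortholog_pair(mouse_id, human_id, mouse_query_vs_human, human_query_vs_mouse):
    """True if the human candidate is the best human match for the mouse
    query (check i) and the mouse cDNA is the best mouse match for the
    human candidate (check ii)."""
    return (best_hit(mouse_query_vs_human) == human_id
            and best_hit(human_query_vs_mouse) == mouse_id)


# Hypothetical degrees of matching (percent, over the full query length).
mouse_query_vs_human = {"human_candidate": 87.0, "human_other": 49.0}
human_query_vs_mouse = {"mouse_cdna": 87.0, "mouse_paralog": 48.0}

confirmed = is_ortholog_pair("mouse_cdna", "human_candidate",
                             mouse_query_vs_human, human_query_vs_mouse)
```

If a mouse paralog scored higher than the cDNA of the present invention in the reverse search, the pair would fail check (ii) and the candidate would not be treated as an ortholog.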
[0041]
By performing a homology search by BLAST or a protein feature search by HMMPFAM on the nucleotide sequence of the thus obtained human homologous DNA or orthologous DNA, the function of the protein encoded by the nucleotide sequence can be estimated.
Furthermore, the protein of the present invention is expressed using the obtained full-length cDNA of human homolog DNA or human ortholog DNA, and can be used for confirmation of activity, functional analysis, and the like.
[0042]
(2) Protein encoded by the novel cDNA
The translated region of the DNA of the present invention can be determined, for example, by converting the base sequence of the DNA into amino acids in each of the three reading frames and taking the range that encodes the longest polypeptide as the translated region; the amino acid sequence of the protein of the present invention is thereby determined. Examples of such an amino acid sequence include those described in SEQ ID NOs: 4 to 6. The protein of the present invention is not limited to the above-mentioned amino acid sequences, and also includes proteins that comprise an amino acid sequence in which one or several amino acids have been substituted, deleted, and/or added in said amino acid sequence and that have transferase activity or carrier activity.
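The three-frame translation described above can be sketched as follows. The sketch assumes the standard genetic code and, as a further assumption not stated in the text, counts only polypeptides that start at a methionine codon and end at a stop codon; the function name is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def longest_orf(dna):
    """Translate the three forward reading frames and return a tuple
    (longest polypeptide, frame index); ("", -1) if none is found."""
    dna = dna.upper()
    best = ("", -1)
    for frame in range(3):
        codons = [dna[i:i + 3] for i in range(frame, len(dna) - 2, 3)]
        protein = "".join(CODON_TABLE.get(c, "X") for c in codons)
        # Drop the last piece: it is not terminated by a stop codon.
        for segment in protein.split("*")[:-1]:
            start = segment.find("M")
            if start != -1 and len(segment) - start > len(best[0]):
                best = (segment[start:], frame)
    return best
```

For the toy sequence `CCATGGCTTGGTAAG`, only the third frame contains a complete ATG-to-stop stretch, so `longest_orf` returns `("MAW", 2)`.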
[0043]
As a method for obtaining the protein of the present invention, transcription and translation of the DNA of the present invention described in (1) above by an appropriate method is preferably used. Specifically, the DNA is inserted into a suitable expression vector, or into a suitable vector together with a suitable promoter, to prepare a recombinant vector; a suitable host microorganism is transformed with this recombinant vector, or the vector is introduced into suitable cultured cells, so that the protein is expressed; and the protein is then obtained by purification.
[0044]
When the protein thus obtained is in a free form, it can be converted to a salt by a known method or a method analogous thereto; conversely, when it is obtained in a salt form, it can be converted to the free form or to another salt. Such salts of the protein of the present invention are also included in the protein of the present invention. Further, a protein produced by the transformant can be modified as desired, or have part of the polypeptide removed, by allowing an appropriate protein-modifying enzyme to act on it before or after purification. These modified proteins are also included in the scope of the present invention as long as they have the above-described transferase activity or carrier activity.
[0045]
When producing the protein of the present invention, the vector used to prepare the recombinant vector containing the DNA of the present invention is not particularly limited as long as the DNA is expressed in the transformant, and any vector may be used. Usually, a commercially available protein expression vector is used into which expression control region DNA, such as a promoter suitable for the host into which the DNA is introduced, has already been inserted. Specific examples of such protein expression vectors include pET3 and pET11 (manufactured by Stratagene) and pGEX (manufactured by Amersham Pharmacia Biotech) when the host is Escherichia coli; the pESP-I expression vector (Stratagene) when the host is yeast; BacPAK6 (Clontech) when the host is an insect cell; and ZAP Express (manufactured by Stratagene) and pSVK3 (manufactured by Amersham Pharmacia Biotech) when the host is an animal cell.
[0046]
When using a vector into which no expression control region has been inserted, at least a promoter must be inserted as the expression control region. A promoter native to the host microorganism or cultured cell can be used here, but the promoter is not limited thereto. For example, when the host is Escherichia coli, the T3, T7, tac, or lac promoter or the like can be used; in the case of yeast, the nmt1 promoter, the Gal1 promoter, or the like can be used. When the host is an animal cell, the SV40 promoter, the CMV promoter and the like are preferably used.
[0047]
When a host in which a mammalian-derived promoter can function is used, the native promoter of the gene of the present invention can also be used. The DNA of the present invention may be inserted into these vectors by linking the DNA, or a DNA fragment containing the DNA, downstream of the promoter in the vector so that the amino acid sequence of the protein encoded by the gene DNA is expressed.
[0048]
The recombinant vector thus prepared can be used to transform a host described below by a method known per se to prepare a DNA transfectant. Specific examples of methods for introducing the vector into a host include the heat shock method (J. Mol. Biol., 53, 154 (1970)), the calcium phosphate method (Science, 221, 551 (1983)), the DEAE-dextran method (Science, 215, 166 (1982)), the in vitro packaging method (Proc. Natl. Acad. Sci. USA, 72, 581 (1975)), the virus vector method (Cell, 37, 1053 (1984)), and the electric pulse method (Chu et al., Nuc. Acids Res., 15, 1331 (1987)).
[0049]
The host for preparing the DNA transfectant is not particularly limited as long as the DNA of the present invention is expressed in it; examples include Escherichia coli, yeast, baculovirus (arthropod polyhedrosis virus)-insect cells, and animal cells. Specific examples include BL21 and XL-2Blue (manufactured by Stratagene) for Escherichia coli; SP-Q01 (manufactured by Stratagene) for yeast; and, for baculovirus, AcNPV (J. Biol. Chem., 263, 7406 (1988)) and its host, Sf-9 cells (J. Biol. Chem., 263, 7406 (1988)). Examples of animal cells include mouse fibroblast C127 cells (J. Virol., 26, 291 (1978)) and Chinese hamster ovary (CHO) cells (Proc. Natl. Acad. Sci. USA, 77, 4216 (1980)). From the viewpoint of expression level and ease of screening, COS-7 cells derived from African green monkey kidney (ATCC CRL1651; American Type Culture Collection), HEK293 cells derived from human fetal kidney (ATCC CRL1573), or HeLa cells derived from human cervix (ATCC CCL-2) are preferably used.
[0050]
In addition to the above-described expression method using a protein expression vector, a DNA transfectant can also be prepared by a homologous recombination technique in which a DNA fragment of the present invention linked to a promoter is inserted directly into the chromosome of a host microorganism (A.A. Vertes et al., Biosci. Biotechnol. Biochem., 57, 2036 (1993)), or by using a transposon or an insertion sequence (A.A. Vertes et al., Molecular Microbiol., 11, 739 (1994)).
[0051]
From the obtained culture, the bacterial cells or cultured cells are collected by a method such as centrifugation, suspended in a suitable buffer, and disrupted by sonication, lysozyme treatment, and/or freeze-thawing; a crude protein solution is then obtained by centrifugation, filtration, or the like and further purified by an appropriate combination of purification methods to give the protein of the present invention. In addition to the above-described expression method using a recombinant protein expression vector, the protein of the present invention can also be obtained by subjecting the DNA of the present invention obtained in (1) to a cell-free transcription/translation system to induce protein expression. The cell-free transcription/translation system used in the present invention is a system containing all the elements necessary for transcription from DNA to mRNA and translation from mRNA to protein, and refers to any system in which the protein encoded by a DNA is synthesized when that DNA is added. Specific examples of the cell-free transcription/translation system include systems prepared from extracts of eukaryotic cells or bacterial cells, or parts thereof, such as systems prepared from extracts of reticulocytes, wheat germ, and Escherichia coli (Escherichia coli S30 extract).
[0052]
Separation and purification of the protein of the present invention from the transcription/translation product of the cell-free transcription/translation system can be carried out by a commonly used method known per se. Specifically, for example, a DNA region encoding an epitope peptide, a polyhistidine peptide, glutathione-S-transferase (GST), a maltose binding protein, or the like is introduced into the DNA to be transcribed and translated, the DNA is expressed as described above, and the protein is purified by utilizing its affinity for a substance that binds the introduced peptide or protein.
[0053]
Expression of the target protein can be confirmed by separating the product by SDS-polyacrylamide gel electrophoresis or the like and then staining with Coomassie Brilliant Blue (manufactured by Sigma) or detecting it with an antibody, described later, that specifically binds to the protein of the present invention. In general, it is known that an expressed protein is cleaved (processed) by proteolytic enzymes present in a living body. A partial fragment of the amino acid sequence produced by such cleavage is naturally included in the protein of the present invention as long as it has transferase activity or carrier activity.
By analyzing the interactions of the protein thus obtained with other proteins and with DNA, its multifaceted functions in the living body can be elucidated. A conventional method known per se can be used to analyze the interactions; specific examples include the yeast two-hybrid method, the fluorescence depolarization method, the surface plasmon resonance method, the phage display method, and the ribosome display method.
[0054]
(3) Preparation of oligonucleotide and functional analysis using the oligonucleotide
Using the DNA of the present invention or a fragment thereof obtained by the method described in (1) above, oligonucleotides having a partial sequence of the DNA of the present invention, such as antisense oligonucleotides and sense oligonucleotides, can be prepared.
[0055]
Examples of the oligonucleotide include a DNA having the same sequence as 5 to 100 consecutive bases in the base sequence of the DNA, or a DNA having a sequence complementary to that DNA. Specific examples include a DNA having the same sequence as 5 to 100 consecutive bases in the base sequence represented by any of SEQ ID NOs: 1 to 3, or a DNA having a sequence complementary to that DNA. When the oligonucleotides are used as a sense primer and an antisense primer, it is preferable that the two do not differ greatly in melting temperature (Tm) or in number of bases. The length of the sequence is generally 5 to 100 bases, preferably 10 to 60 bases, and more preferably 15 to 50 bases.
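The primer-pair criterion can be illustrated with a rough calculation. The text does not say how Tm is to be estimated; the sketch below uses the Wallace rule (2 °C per A or T, 4 °C per G or C), a rule of thumb intended for short oligonucleotides, purely as an assumed stand-in. The tolerances of 5 °C and 3 bases and the function names are likewise illustrative assumptions.

```python
def wallace_tm(oligo):
    """Rough Tm estimate in degrees C by the Wallace rule (an assumption;
    the specification does not name a Tm formula)."""
    s = oligo.upper()
    return 2 * (s.count("A") + s.count("T")) + 4 * (s.count("G") + s.count("C"))


def is_usable_primer_pair(sense, antisense, max_tm_diff=5, max_len_diff=3):
    """Check the 5 to 100 base length range from the text, plus assumed
    tolerances on the Tm difference and the length difference."""
    lengths_ok = all(5 <= len(p) <= 100 for p in (sense, antisense))
    tm_ok = abs(wallace_tm(sense) - wallace_tm(antisense)) <= max_tm_diff
    len_ok = abs(len(sense) - len(antisense)) <= max_len_diff
    return lengths_ok and tm_ok and len_ok
```

Under these assumptions, two 18-base primers with estimated Tm values of 52 °C and 56 °C would be accepted as a pair, whereas pairing the first with an all-A/T 18-mer (36 °C) would not.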
[0056]
In addition, derivatives of these oligonucleotides can also be used as the oligonucleotide of the present invention. Examples of the oligonucleotide derivative include an oligonucleotide derivative in which a phosphodiester bond in the oligonucleotide is converted to a phosphorothioate bond; an oligonucleotide derivative in which a phosphodiester bond in the oligonucleotide is converted to an N3'-P5' phosphoramidate bond; an oligonucleotide derivative in which the ribose and phosphodiester bond in the oligonucleotide are converted to a peptide nucleic acid bond; an oligonucleotide derivative in which uracil in the oligonucleotide is substituted with C-5 propynyluracil; an oligonucleotide derivative in which uracil in the oligonucleotide is substituted with C-5 thiazole uracil; an oligonucleotide derivative in which cytosine in the oligonucleotide is substituted with C-5 propynylcytosine; an oligonucleotide derivative in which cytosine in the oligonucleotide is substituted with phenoxazine-modified cytosine; an oligonucleotide derivative in which ribose in the oligonucleotide is substituted with 2'-O-propylribose; and an oligonucleotide derivative in which ribose in the oligonucleotide is substituted with 2'-methoxyethoxyribose.
[0057]
In addition, the oligonucleotide of the present invention can be prepared as a double-stranded RNA and introduced into a recipient to inhibit the expression of a target gene by the RNA interference method (hereinafter sometimes referred to as the "RNAi method"). As the RNA interference method, for example, the method described in Elbashir, S., et al., Nature, 411, 494-498 (2001) can be used. The double-stranded RNA does not necessarily have to consist entirely of RNA; for example, those described in WO 02/10374 can be used.
[0058]
Here, the target gene may be any DNA of the present invention. The double-stranded RNA consisting of a sequence substantially identical to at least a part of the base sequence of such DNA (hereinafter sometimes referred to as the "double-stranded polynucleotide") has a sequence substantially identical to a sequence of 15 bp or more within the base sequence of the target gene, and that sequence may be taken from any part of the gene. Here, "substantially identical" means having 80% or more homology with the sequence of the target gene. The length of the polynucleotide may be anywhere from 15 bp to the full length of the open reading frame (ORF) of the target gene, but a length of about 15 to 500 bp is preferably used. However, mammalian cells are known to have a signal transduction system that is activated in response to long double-stranded RNA of 30 bp or more. This is called the interferon response (Marcus, P.I., et al., Interferon, 5, 115-180 (1983)): when such double-stranded RNA enters a cell, the initiation of translation of many genes is inhibited non-specifically via PKR (dsRNA-responsive protein kinase; Bass, B.L., Nature, 411, 428-429 (2001)), and at the same time RNase L is activated via 2',5'-oligoadenylate synthetase (Bass, B.L., Nature, 411, 428-429 (2001)), inducing non-specific degradation of intracellular RNA. These non-specific reactions mask the specific response of the target gene. Therefore, when a mammal, or a cell or tissue derived from a mammal, is used as the recipient, a double-stranded polynucleotide of 15 to 30 bp, preferably 19 to 24 bp, more preferably 21 bp is used. The double-stranded polynucleotide does not need to be double-stranded over its entire length and includes those in which the 5' or 3' end partially protrudes; those in which the 3' end partially protrudes are preferably used.
The double-stranded polynucleotide means a double-stranded polynucleotide having complementarity, but may be a self-annealed single-stranded polynucleotide having self-complementarity. The single-stranded polynucleotide having self-complementarity includes, for example, one having an inverted repeat sequence.
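As an illustration of a double-stranded polynucleotide with protruding 3' ends, the sketch below derives a sense and an antisense RNA strand from a window of a target DNA sequence. The layout chosen here (a 19 bp paired region with 2-base 3' overhangs, giving 21-base strands) is an assumption made for the example and is only one of the designs covered by the description above; the function name and arguments are hypothetical.

```python
# Complement table for RNA bases.
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def sirna_duplex(target_dna, start, duplex_len=19, overhang=2):
    """Return (sense, antisense) RNA strands, both written 5' to 3'.

    The sense strand copies the target from `start`; the antisense strand
    is the reverse complement of the window shifted upstream by `overhang`,
    so that each strand protrudes by `overhang` bases at its 3' end."""
    rna = target_dna.upper().replace("T", "U")
    if start < overhang or start + duplex_len + overhang > len(rna):
        raise ValueError("window does not fit inside the target sequence")
    sense = rna[start:start + duplex_len + overhang]
    antisense = rna[start - overhang:start + duplex_len].translate(COMPLEMENT)[::-1]
    return sense, antisense
```

With a toy target `ACGTTGCA`, `sirna_duplex("ACGTTGCA", 1, duplex_len=4, overhang=1)` gives `("CGUUG", "AACGU")`: the first four bases of each strand pair with each other and the last base of each strand overhangs.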
[0059]
The method for preparing the double-stranded polynucleotide is not particularly limited, but a known chemical synthesis method is preferably used. In chemical synthesis, complementary single-stranded polynucleotides can be synthesized separately and then annealed by an appropriate method to form a double strand. A specific example of the annealing method is to mix the synthesized single-stranded polynucleotides, heat the mixture to a temperature at which double strands dissociate, and then cool it gradually. The annealed double-stranded polynucleotide is confirmed using an agarose gel or the like, and any remaining single-stranded polynucleotide is removed, for example, by digestion with an appropriate enzyme.
[0060]
The recipient into which the double-stranded polynucleotide thus prepared is introduced may be any organism or cell in which the target gene can be transcribed into RNA or translated into protein. Specific examples include those belonging to plants, animals, protozoa, viruses, bacteria, or fungi. The plant may be a monocotyledon, a dicotyledon, or a gymnosperm, and the animal may be a vertebrate or an invertebrate. Preferred microorganisms are those used in agriculture or industry, and those that are pathogenic to plants or animals. Fungi include organisms in both mold and yeast forms. Examples of vertebrates include fish and mammals, including cows, goats, pigs, sheep, hamsters, mice, rats, and humans; examples of invertebrates include nematodes and other worms, Drosophila (Drosophila melanogaster), and other insects. Preferably, the cells are vertebrate cells.
[0061]
The recipient means a cell, a tissue, or an individual. Here, the cell may be of germline or somatic origin, totipotent or pluripotent, dividing or non-dividing, from parenchymal tissue or epithelium, immortalized or transformed, and so on. The cell may be a gamete or an embryo; in the case of an embryo, it may be a single-cell embryo, a constituent cell, or a cell from a multicellular embryo, including fetal tissue. Furthermore, the cell may be an undifferentiated cell such as a stem cell, or a differentiated cell such as a cell from an organ or tissue, including fetal tissue, or any other cell present in an organism. Differentiated cell types include adipocytes, fibroblasts, muscle cells, cardiomyocytes, endothelial cells, nerve cells, glial cells, blood cells, megakaryocytes, lymphocytes, macrophages, neutrophils, eosinophils, basophils, mast cells, leukocytes, granulocytes, keratinocytes, chondrocytes, osteoblasts, osteoclasts, hepatocytes, and cells of the endocrine or exocrine glands.
[0062]
Methods for introducing the double-stranded polynucleotide into the recipient include, when the recipient is a cell or tissue, the calcium phosphate method, electroporation, lipofection, viral infection, immersion in a solution of the double-stranded polynucleotide, and transformation. Methods for introduction into an embryo include microinjection, electroporation, and viral infection. When the recipient is a plant, injection or perfusion into a body cavity or interstitial cells of the plant, or spraying, is used. In the case of an individual animal, systemic introduction by oral, topical, parenteral (including subcutaneous, intramuscular and intravenous), vaginal, rectal, nasal, ocular or intraperitoneal administration, electroporation, viral infection, or the like is used. For oral introduction, the double-stranded polynucleotide can be mixed directly with the food of the organism. Furthermore, for introduction into an individual, the double-stranded polynucleotide can be administered, for example, as an implanted sustained-release preparation, or by having the individual ingest a recipient into which the double-stranded polynucleotide has been introduced.
[0063]
The amount of the double-stranded polynucleotide to be introduced can be selected as appropriate depending on the recipient and the target gene, but it is preferable to use an amount sufficient to introduce at least one copy per cell. Specifically, for example, when the recipient is a cultured human cell and the double-stranded polynucleotide is introduced by the calcium phosphate method, a concentration of 0.1 to 1000 nM is preferable.
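To relate a concentration to the "at least one copy per cell" guideline, the number of molecules present in the medium per cell can be estimated as concentration × volume × Avogadro's number ÷ number of cells. The sketch below is a worked arithmetic example only; the volume and cell count are hypothetical, and the result is an upper bound on what is available to each cell, not a measure of actual uptake.

```python
AVOGADRO = 6.02214076e23  # molecules per mole


def copies_per_cell(conc_nM, volume_mL, n_cells):
    """Molecules of double-stranded polynucleotide in the medium per cell."""
    moles = conc_nM * 1e-9 * volume_mL * 1e-3  # nM -> mol/L, mL -> L
    return moles * AVOGADRO / n_cells


# Hypothetical example: 0.1 nM in 1 mL of medium over one million cells.
example = copies_per_cell(0.1, 1.0, 1e6)
```

Under these assumed values, even the lower end of the stated range corresponds to roughly 6 × 10^4 molecules in the medium per cell.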
By suppressing the expression of the gene of the present invention in the transfectant by RNA interference, the function of the protein encoded by the gene of the present invention can be confirmed, or a new function can be analyzed.
[0064]
(4) Antibodies that specifically bind to the protein of the present invention
An antibody that specifically binds to the protein of the present invention can be prepared by a commonly used known method. For the polypeptide used as an antigen, a sequence suitable as an epitope (antigenic determinant) can be selected and used. Commercially available software such as Epitope Adviser (manufactured by Fujitsu Kyushu System Engineering Co., Ltd.) can be used to select the epitope.
[0065]
As the polypeptide used as the above antigen, a synthetic peptide synthesized according to a known method or the protein of the present invention itself can be used. The polypeptide serving as an antigen may be prepared in an appropriate solution or the like according to a known method and used to immunize a mammal, for example, a rabbit, a mouse, or a rat. For immunization, it is preferable to use the antigen peptide as a conjugate with a suitable carrier protein or to add an adjuvant or the like.
[0066]
The route of administration of the antigen upon immunization is not particularly limited, and any route such as subcutaneous, intraperitoneal, intravenous, or intramuscular may be used. Specifically, for example, a method of inoculating BALB/c mice with the antigen polypeptide several times at intervals of several days to several weeks is used. The amount of antigen administered is preferably about 0.3 to 0.5 mg per dose when the antigen is a polypeptide, but is appropriately adjusted depending on the type of the polypeptide and the animal species to be immunized.
[0067]
After immunization, test bleeds are taken as appropriate, and an increase in the antibody titer is confirmed by a method such as enzyme-linked immunosorbent assay (hereinafter sometimes referred to as "ELISA") or Western blotting. Blood is then collected from animals whose titers have increased. A polyclonal antibody can be obtained by subjecting this blood to an appropriate treatment used for antibody preparation. Specifically, for example, a purified antibody can be obtained by purifying the antibody component from serum according to a known method. For purification of the antibody component, methods such as centrifugation, ion exchange chromatography, and affinity chromatography can be used.
[0068]
In addition, a monoclonal antibody can be prepared according to a known method by using a hybridoma obtained by fusing spleen cells of the animal with myeloma cells (Milstein, et al., Nature, 256, 495 (1975)). The monoclonal antibody can be obtained, for example, by the following method.
[0069]
First, antibody-producing cells are obtained from an animal whose antibody titer has increased due to immunization with the above-described antigen. The antibody-producing cells are plasma cells and the lymphocytes that are their precursor cells; they may be obtained from any part of the individual, but are preferably obtained from the spleen, lymph nodes, peripheral blood, and the like. As the myeloma to be fused with these cells, a cell line obtained from a mouse is generally used; for example, 8-azaguanine-resistant mouse (BALB/c-derived) myeloma cell lines such as P3X63-Ag8.653 (ATCC: CRL-1580) and P3-NS1/1Ag4.1 (RIKEN cell bank: RCB0095) are preferably used. Cell fusion can be carried out by mixing antibody-producing cells and myeloma cells at an appropriate ratio and using a solution in which 50% polyethylene glycol (PEG) is dissolved in an appropriate cell fusion medium such as RPMI1640, Iscove's modified Dulbecco's medium (IMDM), or Dulbecco's modified Eagle's medium (DMEM). It can also be carried out by the electrofusion method (U. Zimmermann et al., Naturwissenschaften, 68, 577 (1981)).
[0070]
When an 8-azaguanine-resistant myeloma cell line is used, hybridomas can be selected by culturing the fused cells in a normal medium containing an appropriate amount of hypoxanthine/aminopterin/thymidine (HAT) solution (HAT medium) under 5% CO₂ at 37 °C for an appropriate time. This selection method can be chosen as appropriate for the myeloma cell line used. The titer of the antibody produced by the selected hybridomas is analyzed by the above-described method, a hybridoma producing an antibody with a high titer is isolated by a limiting dilution method or the like, and the isolated fused cells are cultured in an appropriate medium. The monoclonal antibody can be obtained by purifying the resulting culture supernatant by an appropriate method such as ammonium sulfate fractionation or affinity chromatography. For purification, a commercially available monoclonal antibody purification kit can also be used. Furthermore, ascites containing a large amount of the monoclonal antibody of the present invention can be obtained by growing the antibody-producing hybridoma obtained above in the abdominal cavity of an animal of the same strain as the immunized animal or of a nude mouse.
[0071]
When a human-derived protein is obtained as the protein of the present invention, a humanized antibody can also be prepared by immunizing, in the same manner as described above, a severe combined immunodeficiency (SCID) mouse transplanted with human peripheral blood lymphocytes, using the polypeptide or a partial peptide thereof as an antigen, and preparing a hybridoma from the antibody-producing cells of the immunized animal and human myeloma cells (Mosier, DE, et al. Nature, 335, 256-259 (1988); Duchosal, MA, et al., Nature, 355, 258-262 (1992)).
[0072]
Further, human antibodies can be produced in larger quantities by extracting RNA from the obtained hybridoma producing the human antibody, cloning the gene encoding the target human antibody, inserting this gene into an appropriate vector, introducing the vector into an appropriate host, and expressing the gene. Here, an antibody with low binding to the antigen can be converted into an antibody with higher binding by using an evolutionary engineering technique known per se. A partial fragment such as a monovalent antibody can be prepared by cleaving the antibody between the Fab and Fc portions using, for example, papain, and collecting the Fab portion using an affinity column or the like.
[0073]
The antibody that specifically binds to the protein of the present invention thus obtained can also be used as a neutralizing antibody that specifically binds to the protein of the present invention and thereby inhibits the transferase activity or the carrier activity of the protein. There is no particular limitation on the method of selecting an antibody that inhibits the activity of the protein. For example, a method of bringing the antibody into contact with the DNA transfectant prepared in (2) above and analyzing whether the function of the target protein in the transfectant is inhibited can be used.
[0074]
Such a neutralizing antibody may be used alone in clinical applications, or may be used as a pharmaceutical composition by mixing with a pharmaceutically acceptable carrier. In this case, the ratio of the active ingredient to the carrier can be varied between 1 and 90% by weight. Such drugs can be administered in various forms, including oral administration as tablets, capsules, granules, powders, syrups or the like, and parenteral administration as injections, drips, liposomes, suppositories or the like. The dose can be appropriately selected depending on symptoms, age, body weight, and the like.
[0075]
(5) Confirmation of activity and analysis of function of the protein of the present invention
The protein of the present invention is prepared as a recombinant protein as described in the above (2), and by analyzing this, it can be confirmed that it has the activity estimated in the above (1). Furthermore, analysis can also be performed by combining the antibody and the like prepared as described in (4) above.
[0076]
The fact that the protein of the present invention has transferase activity can be analyzed by a commonly used activity measurement method known per se.
[0077]
Specifically, examples include a method of bringing a substrate into contact with the recombinant protein and measuring whether the group targeted by the transferase is transferred from the substrate to another molecule by the transferase activity of the recombinant protein. For example, in the case of β1,4-galactosyltransferase, UDP-galactose and pyridylamino-labeled GlcNAc2Man3GlcNAc2 are used as substrates and brought into contact with the protein in a neutral buffer (sodium cacodylate, pH 7.4) in the presence of 10 mM MnCl2, and the transfer of galactose to the pyridylamino-labeled GlcNAc2Man3GlcNAc2 is identified and quantified by HPLC (Uejima, T. et al., Cancer Res., 52, 6158).
[0078]
Further, the fact that the protein of the present invention has a carrier activity as a carrier protein can be analyzed by a commonly used activity measurement method known per se. Depending on the substance to be transported, the methods described for an iodine transporter (Nature, 379: 458-449 (1996)), a glucose transporter (Nature, 330: 379-381 (1987)), a multivitamin transporter (J. Biol. Chem., 273: 14875-14883 (1998)), a choline carrier (Japanese Patent Application Laid-Open No. 2001-136976), or an ion carrier (translated by Keiko Nakamura et al., "Molecular Biology of Cells", 3rd edition, Educational Company, July 1995, pp. 512-522) can be used, and the activity may be measured using the amount of the transported substance as an index. For example, the activity of transporting squalene to microsomal squalene epoxide can be measured by the method described in the literature (PNAS 2000, 98, 2244-2249), using the conversion of squalene to squalene 2,3-oxide as an index. Specifically, the protein of the present invention is mixed and incubated with ¹⁴C-labeled squalene, FAD, NADPH, and a microsomal fraction; the product is then saponified, the lipids are extracted, thin layer chromatography is performed, and the conversion of the ¹⁴C-labeled lipid is detected by measurement with an image analyzer.
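As a rough illustration of how a thin layer chromatography readout of this kind could be reduced to a single index, the following Python sketch expresses conversion as the product signal divided by the sum of the substrate and product signals. This normalization, the function name `percent_conversion`, and the example values are assumptions made for illustration; they are not taken from this description or from the cited literature.

```python
def percent_conversion(substrate_signal, product_signal):
    """Percent of labeled substrate converted to product.

    Both arguments are background-corrected band intensities (arbitrary units)
    read from the same lane. The normalization is an illustrative assumption.
    """
    if substrate_signal < 0 or product_signal < 0:
        raise ValueError("signals must be non-negative")
    total = substrate_signal + product_signal
    if total == 0:
        raise ValueError("no signal detected in the lane")
    return 100.0 * product_signal / total


# Hypothetical readings: squalene band 750, squalene 2,3-oxide band 250.
conversion = percent_conversion(750.0, 250.0)
```

Comparing this index between reactions with and without the protein of the present invention would indicate whether the protein stimulates the conversion.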
[0079]
The activity of the transferase or the carrier, which is the protein of the present invention, can be confirmed as described above, but confirmation is not limited to these methods. In addition, these activity measuring systems can also be used for screening for a function activator (such as an agonist) or a function inhibitor (such as an antagonist) of the protein of the present invention and for screening for an expression regulator of the protein of the present invention, both of which are described later.
[0080]
In general, methods for analyzing the function of the protein of the present invention include, for example, (i) a method of comparing and analyzing the expression state in each tissue, disease or developmental stage, (ii) a method of analyzing the interaction with other proteins and DNA, (iii) a method of analyzing phenotypic changes caused by introduction into appropriate cells or individuals, and (iv) a method of analyzing phenotypic changes caused by inhibiting the expression of the protein in appropriate cells or individuals. With such methods, the activity specific to the target protein can be analyzed from many aspects.
[0081]
In the method (i), the expression of the protein of the present invention can be analyzed at the mRNA level or the protein level. Methods for analysis at the mRNA level include, for example, an in situ hybridization method (In situ hybridization: Application to Developmental Biology & Medicine, Ed. by Harry, N. (1990)), a hybridization method using a DNA chip, and a quantitative PCR method. Methods for analysis at the protein level include a tissue staining method using the antibody that specifically binds to the protein of the present invention described in (4) above, an ELISA method, a Western blot method, and the like. Here, when a known variant of the protein to be analyzed exists, it is preferable to use a probe whose sequence is present only in the cDNA encoding the protein to be analyzed and which does not hybridize with the cDNA encoding the known variant. In the case of the quantitative PCR method, examples include a method of selecting primers that produce amplified fragments of different lengths from the target cDNA and the known variant (Wong, Y., Neuroscience Let., 320: 141-145 (2002)). Also, when analyzing at the protein level, it is preferable to use an antibody that reacts only with the target protein and does not react with the known variant.
[0082]
In the method (ii), the function of the protein of the present invention can be analyzed by examining the presence or absence of interaction between the protein of the present invention and a known protein. As a method for analyzing the interaction, a conventional method known per se can be used; specific examples include the yeast two-hybrid method, the fluorescence depolarization method, the surface plasmon resonance method, the phage display method, and the ribosome display method. Also in this method, when a known variant of the protein to be analyzed exists, it is preferable to analyze the substances interacting with the known variant in the same manner, in order to identify a substance that specifically interacts with the target protein.
[0083]
In the method (iii), the cells into which the cDNA of the present invention is introduced are not particularly limited, but human cultured cells and the like are particularly preferably used. Methods for introducing DNA into cells include those described in (2) above. Phenotypes of the transfected cells include those that can be observed with a microscope, such as cell viability, cell growth rate, cell differentiation, neurite outgrowth when the cells are neurons, and the localization and migration of intracellular proteins, and those that can be analyzed by biochemical experiments, such as changes in the expression of specific proteins in the cells. When a known variant of the target protein is present, the variant is similarly introduced into cells, and the phenotype specific to the target protein can be identified by comparative analysis. Further, since the protein of the present invention is known to have a transferase activity or a carrier activity, it is also preferable to focus the analysis on phenotypes found in diseases associated with these transferases or carriers.
[0084]
The method (iv) can be efficiently performed by the method using the oligonucleotide described in (3) above or by the RNA interference method. In this method, when a known variant of the target protein to be analyzed is present, a similar analysis is performed for the known variant and other variants, and a function specific to the target protein can be identified by comparative analysis.
[0085]
(6) Screening for a substance that regulates the activity of the protein of the present invention
By screening for a substance that specifically binds to the protein of the present invention and has an action of inhibiting, antagonizing, or enhancing the function (activity) of the protein of the present invention, a function modulator of the protein of the present invention (hereinafter sometimes referred to as a "modulator") can be obtained.
[0086]
This method of screening for a modulator may be any method as long as it can yield a substance that specifically binds to the protein of the present invention and has an activity of inhibiting, antagonizing or enhancing the activity of the protein. For example, a method can be used in which the protein of the present invention is first brought into contact with test substances, substances are selected using binding to the protein as an index, and the selected substances are then further selected using a change in the function of the protein of the present invention, that is, in transferase activity or carrier activity, as an index.
[0087]
The test substance may be any substance as long as it interacts with the protein of the present invention and may affect the activity of the protein. Examples include proteins, non-peptidic compounds, low molecular weight compounds, synthetic compounds, fermentation products, cell extracts, animal tissue extracts and the like. These substances may be novel substances or known substances. As a method for analyzing the interaction between the test substance and the protein of the present invention, a conventional method known per se can be used; specific examples include the yeast two-hybrid method, the fluorescence depolarization method, the surface plasmon resonance method, the phage display method, the ribosome display method, and a competition analysis method using the antibody described in (4) above. A substance found to bind to the protein of the present invention by such a method is then examined for how it affects the activity of the protein of the present invention, in order to identify whether it can be used as a modulator.
[0088]
Analysis of the change in transferase activity or carrier activity can be performed by a commonly used method known per se, based on the properties of various transferases or carriers.
[0089]
When analyzing a substance that regulates transferase activity, a protein serving as a substrate is introduced, in the same manner, into the DNA transfectant described in (2). For this transfectant, whether or not the group targeted by the transferase is transferred from the substrate protein to another molecule is analyzed in the presence and absence of the selected substance by a commonly used method known per se. Specifically, the method described in (5) above can be used. If the transfer of the group targeted by the transferase is increased compared with that in the absence of the substance, the substance can be identified as potentially functioning as a transferase activator; if the transfer is reduced or inhibited, the substance can be identified as potentially functioning as a transferase inhibitor.
Among proteins having transferase activity such as the protein of the present invention, glycosyltransferases, for example, are involved in the activation of oncogene products, nervous function, immune function, the control of inflammation, cell differentiation, viral infection and the like. Therefore, substances identified by the present screening method can be used as therapeutic or diagnostic agents for cancer, diabetes, circulatory disease, Alzheimer's dementia, Parkinson's disease, chorea, ischemic brain disease, diabetic peripheral neuropathy, infertility and the like, or as regenerative tissue inducers.
[0090]
In addition, when analyzing a substance that regulates the activity of a carrier, the method can be performed using the method described in (5) above.
As described above, the protein of the present invention has an important function as a carrier protein involved in various physiological functions, and abnormality of the protein in a living body causes various diseases. Therefore, the modulator of the carrier activity obtained by the above screening method can be used as a therapeutic agent for various diseases, for example, metabolic disease, arteriosclerosis, vitamin deficiency, night blindness, diseases caused by peroxidation, aging, dementia, and the like.
[0091]
Here, for the purpose of screening a pharmaceutically active ingredient, it is preferable to use the above-mentioned human homolog DNA or human homolog protein for the DNA of the present invention or the recombinant protein to be used. Furthermore, the substances screened by the above method may be selected as drug candidates by screening in vivo.
[0092]
Such a modulator of transferase activity or carrier activity can be used alone as the active ingredient in clinical applications, but it can also be used as a pharmaceutical composition in combination with a pharmaceutically acceptable carrier. In this case, the ratio of the active ingredient to the carrier can be varied between 1 and 90% by weight. Such drugs can be administered in various forms, including oral administration as tablets, capsules, granules, powders, syrups or the like, and parenteral administration as injections, drips, liposomes, suppositories or the like. The dose can be appropriately selected depending on symptoms, age, body weight, and the like.
[0093]
(7) Screening of the DNA expression regulator of the present invention
Examples of the screening method include a method of analyzing the expression level of the protein of the present invention or of the mRNA encoding it in the presence of a test substance. Specifically, for example, cells expressing the protein of the present invention described in (2) above are cultured in an appropriate medium containing a test substance, and the amount of the protein of the present invention expressed in the cells is determined by ELISA or the like, or the amount of mRNA encoding the protein of the present invention in the cells is analyzed by quantitative reverse transcription PCR, Northern blotting, or the like.
[0094]
As the test substance, those described in (6) above can be used. In this analysis, if the amount of the protein or mRNA expressed in cells cultured in the presence of the test substance is increased compared with that in cells cultured in its absence, it can be determined that the substance may function as a substance promoting the expression of the DNA of the present invention; conversely, if the amount is decreased, it can be determined that the substance may be used as a substance inhibiting the expression of the DNA of the present invention.
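The comparison described in this paragraph (expression in the presence versus the absence of the test substance) can be sketched in Python as follows. This is only an illustrative sketch: the function name `classify_expression_regulator`, the fold-change cutoff of 1.5, and the example values are assumptions and are not specified in this description.

```python
def classify_expression_regulator(with_substance, without_substance, threshold=1.5):
    """Classify a test substance from protein or mRNA amounts measured in cells
    cultured with and without it.

    `threshold` is a hypothetical fold-change cutoff, not a value from the text.
    """
    if without_substance <= 0:
        raise ValueError("the reference (no substance) amount must be positive")
    if with_substance < 0:
        raise ValueError("the measured amount must be non-negative")
    fold = with_substance / without_substance
    if fold >= threshold:
        return "expression promoter candidate"
    if fold <= 1.0 / threshold:
        return "expression inhibitor candidate"
    return "no clear effect"


# Hypothetical measurements in arbitrary units (e.g. ELISA signal or relative mRNA level).
result = classify_expression_regulator(with_substance=3.0, without_substance=1.0)
```

In practice the cutoff would be chosen from the variability of replicate measurements rather than fixed in advance.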
[0095]
The above-mentioned active ingredient can be used alone in clinical applications, but can also be used as a pharmaceutical composition by blending it with a pharmaceutically acceptable carrier. In this case, the ratio of the active ingredient to the carrier can be varied between 1 and 90% by weight. Such drugs can be administered in various forms, including oral administration as tablets, capsules, granules, powders, syrups or the like, and parenteral administration as injections, drips, liposomes, suppositories or the like. The dose can be appropriately selected depending on symptoms, age, body weight, and the like.
[0096]
(8) The DNA-introduced animal of the present invention
A DNA for introduction containing the DNA of the present invention described in (1) above is constructed and introduced into a fertilized egg of a mammal other than a human, and the egg is transplanted into the uterus of a female individual and allowed to develop, whereby a non-human mammal into which the DNA has been introduced can be produced. More specifically, for example, a female individual is superovulated by hormone administration and then mated with a male, fertilized eggs are collected from the oviduct on the first day after mating, and the DNA is introduced into the fertilized eggs by microinjection or the like. Thereafter, after culturing by an appropriate method, the surviving fertilized eggs are transplanted into the uterus of a pseudopregnant female individual (foster parent), which then gives birth. Whether or not the target DNA has been introduced into the newborn can be determined by performing Southern blot analysis on DNA extracted from cells of the individual. Examples of mammals other than humans include mice, rats, guinea pigs, hamsters, rabbits, goats, pigs, dogs, cats, and the like.
[0097]
Offspring of the thus-obtained DNA-introduced animal of the present invention can be obtained by crossing the individual and breeding it over successive generations in a normal breeding environment while confirming that the introduced DNA is stably retained. Offspring can also be obtained by repeated in vitro fertilization, and the strain can thereby be maintained.
The non-human mammal into which the DNA of the present invention has been introduced can be used for analyzing the function of the DNA of the present invention in a living body, or as a screening system for substances regulating that function.
[0098]
(9) Other uses of the protein of the present invention and DNA containing a nucleotide sequence encoding the same
The protein of the present invention can be used in the form of a support in which it is bound to a substrate. Likewise, a DNA having a base sequence encoding the protein of the present invention, for example a DNA having the base sequence set forth in any one of SEQ ID NOs: 1 to 3, or a partial fragment thereof, can be used in the form of a support in which it is bound to a substrate. These may be hereinafter referred to as "protein chips", "DNA chips" or "DNA arrays" (DNA microarrays and DNA macroarrays). These protein chips, DNA chips or arrays may contain other proteins and DNAs in addition to the proteins and DNAs of the present invention.
[0099]
Here, a resin substrate such as a nylon film or a polypropylene film, a nitrocellulose film, a glass plate, a silicon plate, or the like is used as a substrate for binding proteins and DNA. When using a fluorescent substance or the like, a glass plate or a silicon plate containing no fluorescent substance is preferably used. The binding of the protein or DNA to the base can be easily carried out by a commonly used method known per se. These protein chips, DNA chips, or DNA arrays are also included in the scope of the present invention.
[0100]
In addition, the amino acid sequence of the protein of the present invention and the nucleotide sequence of the DNA can also be used as sequence information. The base sequence of this DNA includes the base sequence of the corresponding RNA. That is, a database of amino acid sequences and nucleotide sequences can be constructed by storing the obtained amino acid sequences and nucleotide sequences on an appropriate recording medium in a predetermined computer-readable format. This database may also contain the sequences of other proteins and of the DNAs encoding them. Further, in the present invention, the database also means a computer system that writes the above-mentioned sequences on an appropriate recording medium and performs searches according to a predetermined program. Suitable recording media include, for example, magnetic media such as flexible disks, hard disks, and magnetic tapes; optical disks such as CD-ROM, MO, CD-R, CD-RW, DVD-R, and DVD-RAM; and semiconductor memories.
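A minimal sketch of such a searchable sequence database is given below in Python, using the standard `sqlite3` module with an in-memory database. The table layout, the function names `build_sequence_db` and `search_by_motif`, and the short sequences are hypothetical placeholders chosen for illustration; they are not the sequences of the SEQ ID NOs and do not represent a format prescribed by this description.

```python
import sqlite3


def build_sequence_db(records):
    """Store (name, kind, sequence) records in an in-memory SQLite database."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE sequences (name TEXT PRIMARY KEY, kind TEXT, seq TEXT)"
    )
    conn.executemany("INSERT INTO sequences VALUES (?, ?, ?)", records)
    conn.commit()
    return conn


def search_by_motif(conn, kind, motif):
    """Return the names of stored sequences of the given kind that contain `motif`."""
    rows = conn.execute(
        "SELECT name FROM sequences "
        "WHERE kind = ? AND instr(seq, ?) > 0 ORDER BY name",
        (kind, motif),
    )
    return [name for (name,) in rows]


# Placeholder sequences for illustration only.
records = [
    ("example_dna_1", "dna", "ATGGCCAAGTTGACC"),
    ("example_dna_2", "dna", "ATGCCCGGGTTTAAA"),
    ("example_protein_1", "protein", "MAKLT"),
]
conn = build_sequence_db(records)
hits = search_by_motif(conn, "dna", "GGGTTT")
```

An exact substring search is the simplest possible "predetermined program"; a practical system would more likely use a similarity search, which is outside the scope of this sketch.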
[0101]
【Example】
Hereinafter, the present invention will be described in detail with reference to examples, but the scope of the present invention is not limited to these examples.
Example 1  Preparation of cDNA library
(1) Preparation of mRNA
For mRNA preparation, 0.5 to 1 g of each organ or tissue of a mouse (C57BL/6) was homogenized in 10 ml of a suspension, and 1 ml of 2 M sodium acetate (pH 4.0) and an equal volume of a phenol/chloroform mixture (5:1 by volume) were added for extraction. When an equal volume of isopropanol was added to the aqueous layer after the extraction, RNA separated and precipitated from the aqueous phase. After incubating the sample on ice for 1 hour, the precipitate was collected by centrifugation in a refrigerated centrifuge at 4,000 rpm for 15 minutes. The sample was washed with 70% ethanol and dissolved in 8 ml of water, and the RNA was precipitated by adding 2 ml of 5 M NaCl and 16 ml of a pH 7.0 aqueous solution containing 1% CTAB (cetyltrimethylammonium bromide), 4 M urea and 50 mM Tris, in order to remove polysaccharides (CTAB precipitation).
[0102]
Subsequently, after centrifugation at 4,000 rpm for 15 minutes at room temperature, the RNA was dissolved in 4 ml of 7 M guanidine-Cl. After adding twice the volume of ethanol, the mixture was incubated on ice for 1 hour and centrifuged at 4,000 rpm for 15 minutes, and the resulting precipitate was washed with 70% ethanol to collect the RNA, which was dissolved again in water. RNA purity was assessed by reading the OD ratios 260/280 (> 1.8) and 230/260 (< 0.45).
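The two OD ratio criteria quoted above can be checked with a few lines of Python. The function name `rna_purity_ok` and the absorbance readings in the example are hypothetical; only the cutoffs (260/280 > 1.8 and 230/260 < 0.45) come from the text.

```python
def rna_purity_ok(a230, a260, a280, min_260_280=1.8, max_230_260=0.45):
    """Apply the OD ratio criteria quoted in the text.

    Returns (passed, ratio_260_280, ratio_230_260).
    """
    if a260 <= 0 or a280 <= 0:
        raise ValueError("absorbance readings at 260 and 280 nm must be positive")
    ratio_260_280 = a260 / a280
    ratio_230_260 = a230 / a260
    passed = ratio_260_280 > min_260_280 and ratio_230_260 < max_230_260
    return passed, ratio_260_280, ratio_230_260


# Hypothetical spectrophotometer readings for one RNA preparation.
passed, r1, r2 = rna_purity_ok(a230=0.40, a260=1.00, a280=0.50)
```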
[0103]
(2) Preparation of first strand cDNA
Using 15 μg of the mRNA prepared in (1) above and 3,000 units of reverse transcriptase, the reverse transcription reaction was performed in a final volume of 165 μl under the following conditions: 0.54 mM each of 5-methyl-dCTP, dATP, dTTP and dGTP, 0.6 M trehalose, 50 mM Tris-HCl (pH 8.3), 75 mM KCl, 3 mM MgCl₂, 10 mM DTT, 52 ng/μl BSA, and 5 units of RNase inhibitor. As a primer, 12.6 μl of an oligonucleotide (SEQ ID NO: 7) containing the recognition sequence of the restriction enzyme XhoI (wherein V represents A, G or C and N represents A, G, C or T) was used.
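To make the reaction composition concrete, the following Python sketch converts the final concentrations stated above into absolute amounts in the 165 μl reaction, and shows the usual dilution arithmetic for a stock solution. The stock concentration used in the example (1 M Tris-HCl) is a hypothetical value; stock concentrations are not given in this description.

```python
FINAL_VOLUME_UL = 165.0

# Final concentrations from the text, in mM (0.6 M trehalose = 600 mM).
FINAL_CONC_MM = {
    "each dNTP": 0.54,
    "trehalose": 600.0,
    "Tris-HCl (pH 8.3)": 50.0,
    "KCl": 75.0,
    "MgCl2": 3.0,
    "DTT": 10.0,
}


def amount_nmol(conc_mm, volume_ul):
    """Amount of solute in nmol: mM x microliters = nmol."""
    return conc_mm * volume_ul


def stock_volume_ul(final_mm, stock_mm, final_volume_ul):
    """Volume of stock needed so that the final concentration is `final_mm`."""
    if stock_mm <= final_mm:
        raise ValueError("stock must be more concentrated than the final solution")
    return final_mm / stock_mm * final_volume_ul


amounts = {name: amount_nmol(c, FINAL_VOLUME_UL) for name, c in FINAL_CONC_MM.items()}
bsa_ug = 52.0 * FINAL_VOLUME_UL / 1000.0  # 52 ng/ul over 165 ul, in micrograms
# Hypothetical 1 M (1000 mM) Tris-HCl stock, for illustration only.
tris_ul = stock_volume_ul(50.0, 1000.0, FINAL_VOLUME_UL)
```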
[0104]
At the start of the reaction, 1/4 of the reaction solution was removed and 1.5 μl of [α-³²P]-dGTP (3000 Ci/mmol, 10 μCi/μl; manufactured by Amersham) was added to it in order to measure the synthesis efficiency of the first strand cDNA. 0.5 μl of the RI-labeled reaction solution was spotted on DE-81 paper, and the efficiency was calculated from the RI activity measured before and after washing three times with 0.5 M sodium phosphate buffer (pH 7.0). Thereafter, the RI-labeled reaction solution and the non-labeled reaction solution were mixed, 8 μl of 0.5 M EDTA, 2 μl of 10% SDS and 20 μg of proteinase K were added, and the mixture was heated at 45 °C for 15 minutes. After extraction with phenol/chloroform and ethanol precipitation, the precipitate was dissolved in 47 μl of water treated to remove RNase (hereinafter referred to as RNase-free water).
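The DE-81 measurement described above amounts to comparing the radioactivity retained on the paper after washing (incorporated label) with the radioactivity spotted before washing (total label). A minimal Python sketch of this ratio is shown below; the function name, the optional background correction, and the example counts are assumptions for illustration and are not part of this description.

```python
def incorporation_fraction(counts_before_wash, counts_after_wash, background=0.0):
    """Fraction of labeled dGTP incorporated into first strand cDNA.

    Counts are taken from the same DE-81 spot before and after washing;
    `background` is an optional blank reading subtracted from both.
    """
    before = counts_before_wash - background
    after = counts_after_wash - background
    if before <= 0:
        raise ValueError("pre-wash counts must exceed background")
    return max(after, 0.0) / before


# Hypothetical counts (cpm) for one spot.
fraction = incorporation_fraction(200000.0, 10000.0)
```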
[0105]
(3) 5 'cap structure and addition of biotin to 3' end
Biotinylation of RNA diols: in order to bind biotin to the diol sites of the RNA (present both at the 5' end carrying the cap structure and at the ribose of the 3' end carrying the poly A chain), a two-step reaction was performed, consisting of oxidation of the diol groups followed by a coupling reaction of biotin hydrazide with the oxidized RNA. First, 15 μg of the RNA-first strand cDNA complex obtained by the reverse transcription reaction was treated in a 50 μl reaction mixture containing 6.6 mM sodium acetate buffer (pH 4.5) and sodium periodate as an oxidizing agent. This oxidation reaction was performed on ice under light-shielded conditions for 45 minutes.
[0106]
Subsequently, 11 μl of 5 M sodium chloride, 0.5 μl of 10% SDS, and an equal volume of isopropanol were added, and the mixture was left on ice for 60 minutes and centrifuged at 15,000 rpm at 4 °C for 15 minutes to obtain a precipitate. The precipitate was washed with 70% ethanol and redissolved in 50 μl of RNase-free water. To the sample, 5 μl of 1 M sodium acetate (pH 6.1), 5 μl of 10% SDS, and 150 μl of 10 mM biotin hydrazide (manufactured by Sigma) were added, and the mixture was reacted overnight at room temperature (22 to 26 °C). Finally, 5 μl of 5 M NaCl, 75 μl of 1 M sodium acetate (pH 6.1) and 2.5 volumes of ethanol were added to the reaction solution after biotinylation, and the mixture was cooled on ice for 1 hour and centrifuged at 4 °C for 15 minutes to precipitate the RNA-DNA complex again. The precipitate was washed once with 70% ethanol and once with 80% ethanol, and dissolved in 70 μl of RNase-free water.
[0107]
(4) Selection of full-length cDNA by RNase I
By treating the biotinylated RNA-DNA complex obtained in (3) above with RNase I, which digests single-stranded RNA, the mRNA paired with cDNA that was not completely elongated during the reverse transcription reaction, and the biotin residue labeled at the 3' end of the mRNA, were removed. Specifically, 10 μl of 10× RNase I buffer (100 mM Tris-HCl (pH 7.5), 50 mM EDTA, 2 M NaOAc) and 200 units of RNase I (RNase One™; Promega) were added to 70 μl of the sample obtained in (3), and the single-stranded RNA was digested at 37 °C for 15 minutes.
[0108]
(5) Collection of full-length cDNA
In order to prevent non-specific adsorption of cDNA to the streptavidin-coated magnetic beads, 100 μg of yeast tRNA (treated with DNase I) was added to 5 μg (500 μl) of streptavidin-coated magnetic beads (MPG particles; CPG, NJ), and the mixture was left on ice for 1 hour and then washed with a solution of 50 mM EDTA and 2 M NaCl.
These beads were suspended in 500 μl of a solution of 50 mM EDTA and 2 M NaCl, and the RNase I-treated cDNA obtained in (4) was added. The magnetic beads and the full-length cDNA were bound by stirring at room temperature for 30 minutes. The beads capturing the full-length cDNA were washed four times with a solution of 50 mM EDTA and 2 M NaCl; once with 0.4% SDS, 50 μg/μl yeast tRNA; once with 10 mM NaCl, 0.2 mM EDTA, 10 mM Tris-HCl (pH 7.5), 20% glycerol; once with 50 μg/μl aqueous yeast tRNA; and once with RNase H buffer (20 mM Tris-HCl (pH 7.5), 10 mM MgCl₂, 20 mM KCl, 0.1 mM EDTA, and 0.1 mM dithiothreitol (DTT)). The beads were then suspended in 100 μl of RNase H buffer, 3 units of RNase H were added, and the mixture was heated at 37 °C for 30 minutes. Thereafter, 1 μl of 10% SDS and 2 μl of 0.5 M EDTA were added, the mixture was kept at 65 °C for 10 minutes, and the supernatant was collected.
The single-stranded full-length cDNA thus recovered was extracted with phenol/chloroform, the volume of the solution was reduced to 100 μl or less using a SpeedVac, and the sample was subjected to G25/G100 Sephadex chromatography. The fractions having RI activity were collected in a siliconized microtube, and 2 μg of glycogen was added. The precipitate obtained by ethanol precipitation was dissolved in 30 μl of ultrapure water.
[0109]
(6) Addition of oligo dG to single-stranded cDNA
30 μl of the single-stranded cDNA recovered in (5) above was subjected to an oligo dG addition reaction at 37 °C for 30 minutes in a reaction solution with a final volume of 50 μl, under the conditions of 200 mM sodium cacodylate (pH 6.9), 1 mM MgCl₂, 1 mM CoCl₂, 1 mM 2-mercaptoethanol and 100 μM dGTP, using 32 units of terminal deoxynucleotidyl transferase (TaKaRa). At the end of the reaction, EDTA was added to 50 mM, and after phenol/chloroform extraction and ethanol precipitation the product was dissolved in 31 μl of ultrapure water.
[0110]
(7) Second strand cDNA synthesis
The synthesis of the second-strand cDNA using the first-strand cDNA as a template was performed as follows. In a reaction system having a final volume of 60 μl, second-strand cDNA was synthesized using 3 μl of second strand low buffer (200 mM Tris-HCl (pH 8.75), 100 mM KCl, 100 mM (NH₄)₂SO₄, 20 mM MgSO₄, 1% Triton X-100, 1 mg/μl BSA), 3 μl of second strand high buffer (200 mM Tris-HCl (pH 9.2), 600 mM KCl, 20 mM MgCl₂), 0.25 mM each of dCTP, dATP, dTTP, and dGTP, 6 μl of β-NADH, 31 μl of the first-strand cDNA with oligo dG added, 600 ng of second-strand primer-adapter (SEQ ID NO: 8), 15 units of Ex Taq DNA polymerase (TaKaRa Ex Taq; TaKaRa), 150 units of thermostable DNA ligase (Ampligase; Epicentre), and 3 units of thermostable RNase H (Hybridase; Epicentre).
[0111]
The reaction was stopped by adding 1 μl of 0.5 M EDTA, and the protein components were digested by further heating at 45 °C for 15 minutes in the presence of 1 μl of 10% SDS and 10 μg of proteinase K. The double-stranded full-length cDNA was then purified by phenol/chloroform extraction and ethanol precipitation.
[0112]
(8) Preparation of library
The double-stranded full-length cDNA obtained by the above method was inserted into a λZAPIII vector and recovered as a library. The λZAPIII vector is a λZAPII (manufactured by STRATAGENE) vector obtained by modifying a partial sequence of a multiple cloning site (SEQ ID NO: 9) to SEQ ID NO: 10 and newly introducing two SfiI sites.
[0113]
Further, a λPS (RIKEN) vector was prepared, and cDNA was inserted into it. λPS (RIKEN), named λ-FLC-1 (FLC stands for full-length cDNA), is the λPS vector of MoBiTec (Germany) modified for cDNA cloning. Specifically, BamHI and SalI sites convenient for cDNA insertion were introduced into the cloning sites on either side of the 10 kbp stuffer, and a 6 kb DNA fragment was inserted into the XbaI site so that cDNAs of about 0.5 kb to about 13 kb can be cloned (JP-A-2000-325080). With λ-FLC-1, for example in the case of a lung cDNA library, the average insert length was 2.57 kb, and inserts of 0.5 kb to 12 kb were actually cloned. With the conventional λZAP vector, the average insert length was 0.97 kb, indicating that λ-FLC-1 allows large cDNAs to be cloned more efficiently than λZAP.
[0114]
Example 2  Normalization / subtraction of full-length cDNA library
(1) Preparation of driver
The mRNA prepared in Example 1 (1) (hereinafter sometimes referred to as “(a) RNA driver”) and RNA prepared by in vitro transcription were used as drivers. The latter RNA is further divided into two types (hereinafter referred to as “(b) RNA driver” and “(c) RNA driver”). For the first type, cDNA was recovered from the RNA-cDNA removed by normalization and cloned into a phage vector. After infection of Escherichia coli, 1000 to 2000 plaques per starting material were pooled to form one library (mini-library), which was converted into plasmid DNA by a conventional method (the phage were used to infect Escherichia coli again together with helper phage to form phagemids, followed by a further infection to obtain plasmid DNA).
[0115]
The obtained DNA was subjected to an in vitro transcription reaction (using T3 RNA polymerase or T7 RNA polymerase), treated with DNase I (RQ1 RNase-free; manufactured by Promega) and proteinase K, and then extracted with phenol/chloroform to obtain the (b) RNA driver. As the starting material, a mini-library was prepared from each of nine tissues (pancreas, liver, lung, kidney, brain, spleen, testis, small intestine, stomach), and the nine mini-libraries were mixed to obtain the RNA. For the other type, a library (about 20,000 clones) already stored as non-overlapping clones was cultured, and the obtained DNA was subjected to an in vitro transcription reaction in the same manner as for the (b) RNA driver to obtain the (c) RNA driver.
[0116]
These three kinds of RNA were labeled with biotin using the Label-IT Biotin Labeling Kit (manufactured by Mirus Corporation), added to the tester cDNA at a ratio of 1:1:1, and hybridized to Rot 10 (42 °C). After treatment with streptavidin beads (CPG), the supernatant was collected and used for second-strand synthesis.
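As a rough illustration of what "Rot 10" means, the sketch below uses the usual definition of Rot as the molar concentration of RNA nucleotides multiplied by the incubation time in seconds. The driver concentration and the average residue mass of 340 g/mol are assumptions chosen for illustration only; neither value is given in this specification.

```python
# Sketch: incubation time needed to reach a target Rot value.
# Rot = (mol of RNA nucleotides per litre) x (time in seconds).
# ASSUMPTIONS (not from the specification): the driver concentration
# used in the example and the rule-of-thumb residue mass below.

AVG_RIBONUCLEOTIDE_MASS = 340.0  # g/mol per nucleotide residue (assumption)


def nucleotide_molarity(rna_g_per_l: float) -> float:
    """Convert an RNA mass concentration (g/L) to mol nucleotides per litre."""
    return rna_g_per_l / AVG_RIBONUCLEOTIDE_MASS


def seconds_to_reach_rot(target_rot: float, rna_g_per_l: float) -> float:
    """Seconds of hybridization needed for the driver to reach target_rot."""
    return target_rot / nucleotide_molarity(rna_g_per_l)


if __name__ == "__main__":
    driver_conc = 1.0  # g/L, i.e. 1 ug/ul (hypothetical value)
    t = seconds_to_reach_rot(10.0, driver_conc)
    print(f"about {t / 60:.0f} min to reach Rot 10 at {driver_conc} g/L")
```

Because the time scales inversely with driver concentration, halving the concentration in this sketch doubles the incubation time needed for the same Rot.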
[0117]
Example 3  Nucleotide sequencing of full-length cDNA clones
(1) Clone rearray
One representative clone was selected from each cluster. Representative clones were picked with Q-bot (GENETIX LIMITED) and arrayed on 384-well plates. E. coli was then cultured in 50 μl of LB medium at 30 °C for 18 to 24 hours. When the cDNA library had been introduced into the PS vector and used to transform Escherichia coli DH10B, 100 mg/ml ampicillin and 50 mg/ml kanamycin were added; when it had been introduced into the Zap vector and into the SOLR system, 100 mg/ml ampicillin and 25 mg/ml streptomycin were added.
[0118]
(2) Extraction of plasmid and insert sizing
Each of the clones cultured in (1) above was further cultured in 1.3 ml of HT solution containing 100 mg/ml ampicillin, the cells were collected by centrifugation, and the plasmid DNA was recovered and purified using QIAprep 96 Turbo (manufactured by QIAGEN). To examine the length of the cDNA inserted in each plasmid, 1/30 of the plasmid DNA obtained was digested with the restriction enzyme PvuII and subjected to 1% agarose gel electrophoresis.
[0119]
(3) Sequencing
Three types of sequencers were used to analyze the full-length nucleotide sequence of the full-length cDNA inserted into the plasmids thus obtained. The plasmids were divided into two categories: those having inserts shorter than 2.5 kb and those having longer inserts. The nucleotide sequences of clones having inserts shorter than 2.5 kb were analyzed from both ends. The plasmids were reacted with the Thermo Sequenase Primer Cycle Sequencing Kit (manufactured by Amersham Pharmacia Biotech) using the primers of SEQ ID NOS: 11 (sense strand) and 12 (antisense strand) when the vector was PS, or SEQ ID NOS: 13 (sense strand) and 14 (antisense strand) when the vector was Zap, and analyzed using a Licor DNA4200 (long read sequencer).
[0120]
Gaps that could not be resolved by the above nucleotide sequence analysis were determined by the primer walking method. For this, ABI Prism 377 and/or ABI Prism 3700 (manufactured by Applied Biosystems Inc.), the BigDye terminator kit and the Cycle Sequencing FS Ready Reaction Kit (Applied Biosystems Inc.) were used.
[0121]
In addition, the sequences of clones in which the inserted cDNA was longer than 2.5 kb were determined by the shotgun method, using Shimadzu RISA 384 and the DYEnamic ET terminator cycle sequencing kit (manufactured by Amersham Pharmacia Biotech). To generate a shotgun library, 48 DNA fragments amplified by PCR from 48 independent representative clones were used. The ends of the amplified DNA fragments were blunt-ended with T4 DNA polymerase.
This DNA fragment was inserted into a pUC18 vector, and Escherichia coli DH10B was transformed with the recombinant vector. A plasmid was prepared from this E. coli in the same manner as in the above (2).
[0122]
For these representative clones, the nucleotide sequences were determined by sequence analysis from both ends and joined on a computer, and the DNA was sheared using a Double Stroke Shearing Device (manufactured by Fire Inc.). Nucleotide sequence determination by the shotgun method was performed with a redundancy of 12 to 15 clones. Gaps whose sequence could not be determined in this way were determined by primer walking in the same manner as described above.
[0123]
Example 4  Analysis of nucleotide sequence of each full-length cDNA clone
For the entire base sequence of the full-length cDNA clone determined in Example 3, homology search by BLAST and protein characteristic search by HMMPFAM were performed to estimate the function of the protein encoded by each full-length cDNA clone.
[0124]
(1) dnaform 27698 (SEQ ID NOs: 1 and 4)
As shown in SEQ ID NO: 1, dnaform 27698 consisted of 3311 bases, of which base numbers 78 to 2198 formed the open reading frame (including the stop codon). The amino acid sequence predicted from the open reading frame consists of 706 amino acid residues (SEQ ID NO: 4). A homology search using BLAST was performed on the amino acid sequence encoded by the nucleotide sequence shown in SEQ ID NO: 1. In the SPTR protein database (which integrates the SWISS-PROT protein sequence database and the TrEMBL nucleic acid translation database), the following hits were found: (i) database registration symbol AL031678, Human DNA sequence from clone RP4-816K17 on chromosome 20p12.2-13, contains the TGM3 gene for transglutaminase 3, with e-value: 0.0 and 87% identity over 442 amino acid residues; (ii) a hit with e-value: 0.0 and 49% identity over 705 amino acid residues; and (iii) database registration symbol Q08189, mouse Protein-glutamine glutamyltransferase E3 precursor, with e-value: 0.0 and 48% identity over 705 amino acid residues. From these results, it was inferred that the protein consisting of the amino acid sequence shown in SEQ ID NO: 4 is a glutamine transferase.
[0125]
A protein characteristic search was performed on the amino acid sequence shown in SEQ ID NO: 4 by HMMPFAM. As a result, a sequence showing the characteristics of the Transglutaminase-like superfamily (entered as “Transglutamin_N” in Pfam) was found at amino acids 1-121 of SEQ ID NO: 4, a sequence showing the characteristics of the Transglutaminase-like superfamily (entered as “Transglut_core” in Pfam) was found at amino acids 269-358, and a sequence showing the characteristics of Transglutaminase family, C-terminal Ig-like domain (entered as “Transglutamin_C” in Pfam) was found at amino acids 480-704.
From these facts, it was inferred that the protein encoded by the nucleotide sequence shown in SEQ ID NO: 1 was glutamine transferase.
[0126]
(2) dnaform 50441 (SEQ ID NOs: 2 and 5)
As shown in SEQ ID NO: 2, dnaform 50441 consisted of 2775 bases, of which base numbers 235 to 2631 formed the open reading frame (including the stop codon). The amino acid sequence predicted from the open reading frame consists of 798 amino acid residues (SEQ ID NO: 5). When a homology search was performed using BLAST on the amino acid sequence encoded by the nucleotide sequence shown in SEQ ID NO: 2, the following hits were found in the SPTR protein database (which integrates the SWISS-PROT protein sequence database and the TrEMBL nucleic acid translation database): (i) database registration number AF357970, Homo sapiens carnitine palmitoyltransferase IC, with e-value: 0.0 and 83% identity over 802 amino acid residues; and a further hit with e-value: 0.0 and 53% identity over 765 amino acid residues.
The amino acid sequence of SEQ ID NO: 5 was subjected to a protein characteristic search using HMMPFAM. As a result, a sequence exhibiting the characteristics of Choline/Carnitine o-acyltransferase (entered as “Carn_acyltransf” in Pfam) was found at amino acid numbers 170 to 758 of SEQ ID NO: 5.
From these results, it was inferred that the protein encoded by the nucleotide sequence shown in SEQ ID NO: 2 was palmitoyltransferase.
[0127]
(3) dnaform 28700 (SEQ ID NOs: 3 and 6)
As shown in SEQ ID NO: 3, dnaform 28700 consisted of 3184 bases, of which base numbers 26 to 940 formed the open reading frame (including the stop codon). The amino acid sequence predicted from the open reading frame consists of 304 amino acid residues (SEQ ID NO: 6). When a homology search was performed using BLAST on the amino acid sequence encoded by the nucleotide sequence shown in SEQ ID NO: 3, the following hits were found in the SPTR protein database (which integrates the SWISS-PROT protein sequence database and the TrEMBL nucleic acid translation database): (i) database registration symbol AC004832, Homo sapiens PAC clone RP4-539M6 from 22 (human SPF), with e-value: 5 × 10⁻¹⁰⁹ and 85% identity over 221 amino acid residues; (ii) database registration number AF309558, Rattus norvegicus supernatant protein factor (Spf), with e-value: 2 × 10⁻⁸⁶; and (iii) database registration symbol AF487977, Bos taurus tocopherol-associated protein, with e-value: 2 × 10⁻⁸⁴ and 62% identity over 221 amino acid residues.
In addition, when a protein characteristic search was performed on the amino acid sequence shown in SEQ ID NO: 6 by HMMPFAM, a sequence exhibiting the characteristics of the CRAL/TRIO domain (entered as “CRAL_TRIO” in Pfam) was found at amino acid numbers 85 to 247 of SEQ ID NO: 6. Proteins having this domain are considered to be carrier proteins for retinaldehyde, phosphatidylcholine, alpha-tocopherol and the like. A sequence exhibiting the characteristics of CRAL/TRIO, N-terminus (entered as “CRAL_TRIO_N” in Pfam) was also found at amino acids 3-71. The literature (PNAS 2000, 98, 2244-2249) shows that human SPF, the protein in (i) above, has a squalene transport function. That is, SPF binds squalene and transports it to microsomal squalene epoxidase, thereby promoting the conversion to squalene 2,3-oxide and initiating the late stage of sterol biosynthesis. The literature (BBRC 2001, 285, 295-299) also shows that human SPF has an alpha-tocopherol-dependent transcription promoting activity.
From these results, it is inferred that the protein encoded by the nucleotide sequence shown in SEQ ID NO: 3 is a carrier protein for squalene, retinaldehyde, phosphatidylcholine, alpha-tocopherol or the like, or has a transcription promoting activity. The protein, or a substance that regulates its expression, activates its function, or inhibits its function, may therefore be a therapeutic drug for metabolic diseases, arteriosclerosis, vitamin deficiency, night blindness, diseases caused by peroxidation, aging, dementia, and the like.
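The open reading frame coordinates and residue counts reported for the three clones in this Example can be cross-checked with simple arithmetic: an ORF given by inclusive 1-based base numbers and including the stop codon should have a length divisible by 3 and encode one residue fewer than its number of codons. The following is a minimal sketch of that check; the function name and the dictionary layout are illustrative and not part of the specification.

```python
# Sketch: cross-check ORF coordinates against the stated protein lengths.
# Coordinates are 1-based and inclusive, and include the stop codon,
# as stated in the text. Names below are illustrative only.


def orf_residue_count(start: int, end: int, includes_stop: bool = True) -> int:
    """Number of amino acid residues encoded by an ORF spanning start..end."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"ORF length {length} is not a multiple of 3")
    codons = length // 3
    return codons - 1 if includes_stop else codons


# clone name: (ORF start, ORF end, residues stated in the text)
CLONES = {
    "dnaform 27698": (78, 2198, 706),
    "dnaform 50441": (235, 2631, 798),
    "dnaform 28700": (26, 940, 304),
}

if __name__ == "__main__":
    for name, (start, end, stated) in CLONES.items():
        computed = orf_residue_count(start, end)
        status = "consistent" if computed == stated else "MISMATCH"
        print(f"{name}: {computed} residues computed, {stated} stated ({status})")
```

All three clones pass this check, so the stated base numbers and residue counts are mutually consistent.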
[0128]
Example 5  Tissue expression analysis using DNA microarray
Tissue expression analysis using a DNA microarray was performed as described in Miki, R., et al., Proc. Natl. Acad. Sci. USA, 98, 2199-2204 (2001).
(1) Preparation of DNA microarray
After the nucleotide sequence of the mouse full-length cDNA (dnaform 50441) was amplified using M13 forward and reverse primers, the PCR product was precipitated with isopropanol and dissolved in 15 μl of 3 × SSC solution. This DNA solution was spotted on poly-L-lysine-coated glass slides using a DNA arrayer with 16 tips (SMP3, TeleChem International, Sunnyvale, Calif.) to prepare a DNA microarray (for details of the method, see http://cmgm.stanford.edu/pbrown/mguide/index.html). Mouse β-actin and glyceraldehyde-3-phosphate dehydrogenase cDNAs were used as positive controls, and Arabidopsis thaliana cDNA was used as a negative control.
[0129]
The detection sensitivity of this DNA microarray was 1 to 3 copies of mRNA per cell. The signal intensity of clones having approximately 80% identity with the target sequence was one-tenth that of clones with perfect sequence identity. The signal intensity of clones with less than 80% match with the target sequence was at the background level.
[0130]
(2) Preparation of probe
1 μg of mRNA extracted from each of 22 tissues of fetal, neonatal and adult C57BL/6J mice (kidney, brain, spleen, lung, liver, testis, pancreas, stomach, small intestine, colon, placenta, heart, thymus, cerebellum, uterus, bone, muscle, spine-side kidney-derived adipocytes, epididymal-derived adipocytes, visceral fat, 10-day-old neonatal cerebellum, 10-day-old neonatal skin) was subjected to a random-primed reverse transcription reaction according to a standard method to incorporate the fluorescent dye Cy3 (Amersham Pharmacia). Separately, 1 μg of mRNA extracted from the whole body of a 17.5-day-old fetus was subjected to a random-primed reverse transcription reaction to incorporate the fluorescent dye Cy5, as a control for expression analysis. The CyDye-labeled cDNA probe was purified using the CyScribe GFX Purification Kit (Amersham Pharmacia) and eluted from the column with 17 μl of sterile water. This was mixed with a blocking solution consisting of 3 μl of 10 μg/μl oligo (dA), 3 μl of 20 μg/μl yeast tRNA, 1 μl of 20 μg/μl mouse Cot1 DNA, 5.1 μl of 20 × SSC, and 0.9 μl of 10% SDS to prepare the CyDye-labeled cDNA probe solution.
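The volumes listed above can be tallied to obtain the total probe volume and the final concentration of each blocking component. The sketch below assumes that the whole 17 μl eluate is combined with the whole blocking solution and that nothing else is added; this assumption is not stated in the specification, and the variable names are illustrative.

```python
# Sketch: total volume and final concentrations of the probe mixture.
# ASSUMPTION: the 17 ul eluate and all blocking components are combined
# in full, with no other additions.

COMPONENTS = [
    # (name, volume in ul, stock concentration, unit); None = no stock value
    ("labeled cDNA eluate", 17.0, None, None),
    ("oligo(dA)", 3.0, 10.0, "ug/ul"),
    ("yeast tRNA", 3.0, 20.0, "ug/ul"),
    ("mouse Cot1 DNA", 1.0, 20.0, "ug/ul"),
    ("SSC", 5.1, 20.0, "x"),
    ("SDS", 0.9, 10.0, "%"),
]


def total_volume(components) -> float:
    """Sum of all component volumes in microlitres."""
    return sum(volume for _, volume, _, _ in components)


def final_concentrations(components) -> dict:
    """Final concentration of each component that has a stock value."""
    total = total_volume(components)
    return {
        name: stock * volume / total
        for name, volume, stock, _ in components
        if stock is not None
    }


if __name__ == "__main__":
    print(f"total volume: {total_volume(COMPONENTS):.1f} ul")
    units = {name: unit for name, _, _, unit in COMPONENTS}
    for name, conc in final_concentrations(COMPONENTS).items():
        print(f"{name}: {conc:.2f} {units[name]}")
```

Under this assumption the mixture totals 30 μl, which matches the probe volume used in the hybridization step, with SSC at about 3.4 × and SDS at about 0.3%.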
[0131]
(3) Hybridization of DNA microarray
30 μl of a solution in which a cDNA probe (Cy3 label) derived from the tissue to be analyzed for expression and the control 17.5-day-old fetal cDNA probe (Cy5 label) were mixed was heat-treated at 95 °C for 1 minute and cooled to room temperature. The probe solution was added to the DNA microarray, covered with a cover slip, and hybridized at 65 °C overnight in a hybridization cassette (ArrayIt). Next, the DNA microarray was washed with 2 × SSC, 0.1% SDS, and subsequently rinsed with 1 × SSC for 2 minutes and 0.1 × SSC for 2 minutes. The microarray was scanned using a ScanArray 5000 confocal laser scanner, and the images were analyzed with IMAGENE (BioDiscovery).
[0132]
(4) Data analysis
The amount of mRNA in each tissue (Cy3-labeled) is expressed as the base-2 logarithm (log₂) of its ratio (Cy3/Cy5) to the amount of mRNA in the whole body of the 17.5-day-old fetus (Cy5-labeled). That is, when the expression level of the mRNA corresponding to the mouse full-length cDNA analyzed is higher in a tissue than in the control, the value is positive; when it is lower, the value is negative; and when the two are equal, the value is 0. The experiments were performed twice independently to increase the accuracy of the data, and reproducible results were adopted. Table 1 shows the results.
[0133]
Generally, in expression analysis results obtained with a DNA array, an increase or decrease of less than about 2-fold is regarded as experimental error. Therefore, when the value is 1 or more, the amount of mRNA in the tissue was interpreted as being at least twice that in the whole body of the control 17.5-day-old fetus, that is, significantly increased; when the value is -1 or less, the amount of mRNA in the tissue was interpreted as being half or less of that in the whole body of the control 17.5-day-old fetus, that is, significantly reduced. When mRNA expression levels are compared between tissues, a difference of 1 between the values corresponds to a 2-fold difference in the amount of mRNA and a difference of 2 to a 4-fold difference; a difference of -1 corresponds to 1/2 and a difference of -2 to 1/4.
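The interpretation rules described above can be written out as a short sketch: the reported value is the base-2 logarithm of the Cy3/Cy5 ratio, values of 1 or more or of -1 or less are treated as significant, and a difference between two tissue values converts to a fold difference as a power of 2. The signal intensities used in the example are hypothetical, and the function names are illustrative.

```python
# Sketch: log2 expression ratios and their interpretation.
# The intensities in the example run are hypothetical; the threshold
# of 1 (a 2-fold change) follows the text.
import math


def log2_ratio(cy3_signal: float, cy5_signal: float) -> float:
    """Base-2 logarithm of the tissue (Cy3) to control (Cy5) signal ratio."""
    return math.log2(cy3_signal / cy5_signal)


def interpret(value: float, threshold: float = 1.0) -> str:
    """Classify a log2 ratio using a 2-fold threshold."""
    if value >= threshold:
        return "significantly increased"
    if value <= -threshold:
        return "significantly reduced"
    return "within experimental error"


def fold_difference(value_a: float, value_b: float) -> float:
    """Fold difference in mRNA amount between two tissues' log2 values."""
    return 2.0 ** (value_a - value_b)


if __name__ == "__main__":
    value = log2_ratio(400.0, 100.0)  # hypothetical intensities
    print(value, interpret(value))
    print(fold_difference(1.5, -0.5))  # a difference of 2 in log2 units
```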
[0134]
[Table 1]

[0135]
As is clear from Table 1, the expression analysis using dnaform 50441 itself as the target sequence shows that dnaform 50441 is expressed more strongly in the brain, cerebellum and testis than in the control, and that its expression is generally weaker in the other organs.
[0136]
Example 6 Protein-protein interaction analysis
Using the two-hybrid method in mammalian cells (Suzuki, H., et al., Genome Research, 11, 1758-1765 (2001)), a comprehensive analysis of protein-protein interactions was performed for the protein encoded by the mouse full-length cDNA nucleotide sequence (dnaform 50441).
(1) Rapid sample preparation using PCR method
The CheckMate mammalian two-hybrid system (Promega) was used for the two-hybrid experiments in mammalian cells. Samples for protein-protein interaction analysis were prepared using the plasmid vector pBIND, in which the DNA binding region of the Gal4 gene is inserted downstream of a CMV promoter, the plasmid vector pACT, in which the transcription activation region of the VP16 gene is inserted downstream of a CMV promoter, and the plasmid vector pG5luc, in which a luciferase reporter gene is inserted downstream of five Gal4 binding regions and a TATA box. A fusion gene of the Gal4 gene and the protein coding sequence of the mouse full-length cDNA nucleotide sequence (dnaform 50441), and fusion genes of the VP16 gene and the protein coding sequences of the full-length cDNAs held by the individual clones of the mouse cDNA library FANTOM (http://fantom.gsc.liken.go.jp/), were constructed basically according to the protocol of Promega, by a combination of ligation using a common sequence and two-step PCR (see FIG. 1 of Suzuki, H., et al., Genome Research, 11, 1758-1765 (2001)). The protein coding sequence of the mouse cDNA was PCR-amplified using a forward primer having a common sequence on the 5' side and a gene-specific sequence on the 3' side together with an M13 universal primer. This amplification product was then mixed with a PCR amplification product of pBIND or pACT (with the common sequence added to the 3' side), and a second-stage PCR amplification was performed using nested primers to construct a vector expressing a fusion protein of Gal4 and the mouse protein (BIND sample) or a vector expressing a fusion protein of VP16 and the mouse protein (ACT sample).
[0137]
(2) High-throughput two-hybrid experiments in mammalian cells
The BIND and ACT samples prepared by the PCR method were used directly without further purification. 0.25 μl each of the BIND sample and the ACT sample, 30 ng of pG5luc, and 9.5 μl of Opti-MEM medium (Lifetech) were dispensed into each well of a 384-well plate. 10 μl of LF2000 transfection reagent (Lifetech) diluted 32-fold in Opti-MEM medium was added to the wells, mixed, and incubated for 20 minutes, after which 20 μl of a suspension of CHO-K1 Chinese hamster cells in F12 medium at 1,300 cells/μl was added and mixed well. The assay samples were cultured for 20 hours in a CO₂ incubator, and luciferase activity was then measured using the Steady-Glo Luciferase Assay System (Promega) to confirm the interaction.
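For planning purposes, the per-well figures in this step translate into cell numbers and reagent amounts as sketched below. The per-well values are taken from the text; the plate-level totals are an illustrative extension that assumes all 384 wells are used and ignores dead volume, neither of which is stated in the specification.

```python
# Sketch: per-well and per-plate quantities for the two-hybrid assay.
# Per-well values come from the text. The plate totals ASSUME a full
# 384-well plate with no dead volume.

CELL_DENSITY_PER_UL = 1300   # CHO-K1 cells per ul of suspension
CELL_VOLUME_UL = 20.0        # ul of cell suspension added per well
REAGENT_VOLUME_UL = 10.0     # ul of diluted LF2000 added per well
REAGENT_DILUTION = 32        # LF2000 is diluted 32-fold in Opti-MEM
PLATE_WELLS = 384            # assumption: every well is used


def cells_per_well(density_per_ul: float, volume_ul: float) -> float:
    """Number of cells delivered to one well."""
    return density_per_ul * volume_ul


def undiluted_reagent_ul(diluted_volume_ul: float, dilution: float) -> float:
    """Volume of undiluted reagent contained in the diluted volume."""
    return diluted_volume_ul / dilution


if __name__ == "__main__":
    cells = cells_per_well(CELL_DENSITY_PER_UL, CELL_VOLUME_UL)
    reagent = undiluted_reagent_ul(REAGENT_VOLUME_UL, REAGENT_DILUTION)
    print(f"cells per well: {cells:.0f}")
    print(f"undiluted LF2000 per well: {reagent} ul")
    print(f"cells per plate: {cells * PLATE_WELLS:.0f}")
    print(f"undiluted LF2000 per plate: {reagent * PLATE_WELLS} ul")
```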
[0138]
(3) Analysis results
The results of (2) above are shown in Table 2. The protein encoded by the nucleotide sequence of the mouse full-length cDNA (dnaform 50441) showed the following interactions with proteins encoded by the cDNA nucleotide sequences of specific clones obtained from the mouse cDNA library FANTOM (http://fantom.gsc.liken.go.jp/).
[0139]
From Example 4, the protein having the amino acid sequence (SEQ ID NO: 5) predicted from the open reading frame of dnaform 50441 (hereinafter referred to as “the present protein”) is estimated to be a palmitoyltransferase. Palmitoyltransferase is known to be involved in the flow of fatty acids from adipose tissue and the transport of fatty acids into mitochondria (Int J Sports Med 1998 (19) (231): 231-44), which suggests a relationship with lipid metabolism and diabetes. As is evident from Table 2, the present protein was found to interact with hypothetical outer arm dynein light chain 1 structure containing protein. Outer arm dynein is a motor protein belonging to the AAA+ family of ATPases. Dyneins are of two types: “axonemal dynein”, which is responsible for the movement of cilia and flagella, and “cytoplasmic dynein”, which is responsible for intracellular transport of membrane vesicles and for cell division. Since the present protein interacts with hypothetical outer arm dynein light chain 1 structure containing protein, it was speculated that the fatty acid metabolizing enzyme activity of the present protein as a palmitoyltransferase is related to the motility of cilia or flagella, to membrane transport, and to cell division. Further, since agonist stimulation of the beta-adrenergic receptor is known to cause phosphorylation of outer arm dynein in airway epithelial cells (J. Allergy Clin Immunol 2002 110 (6 Suppl): S275-81), it is presumed that the present protein is related to respiratory diseases such as bronchial asthma via ciliary movement in the trachea.
In addition, the present protein was found to interact with zinc finger protein, subfamily 1A, 3 (Aiolos). Aiolos is a transcription factor restricted to the lymphatic system that regulates lymphocyte differentiation by interacting with Ikaros (Genomics, 61: 326-9 (1999)), and it is implicated in immune system disease. Since the present protein interacted with Aiolos, it was speculated that the present protein is related to immune system diseases such as immunodeficiency.
[0140]
[Table 2]

[0141]
Example 7 Comprehensive functional analysis of protein encoded by full-length cDNA
The protein having the amino acid sequence (SEQ ID NO: 5) predicted from the open reading frame of dnaform 50441 (hereinafter referred to as “the present protein”) was estimated to be a palmitoyltransferase. In addition, Example 5 showed that the present protein was strongly expressed in the brain, cerebellum and testis. Palmitoyltransferase has been reported to be associated with lipid metabolism, cancer, apoptosis, PPAR and the like. It has also been reported that lipid metabolism, including the association with apolipoprotein E4, plays an important role in Alzheimer's disease (Bioclinica, August 2002 issue). Therefore, the present protein, or an expression regulator, function activator, or function inhibitor of the present protein, could be developed as a remedy for cancer, diabetes, Alzheimer's dementia, Parkinson's disease, chorea, ischemic brain disease, diabetic peripheral neuropathy, infertility, and the like.
[0142]
[Effect of the Invention]
Since the protein of the present invention and the DNA encoding it have transferase activity or carrier activity, substances that regulate the activity can be screened using the protein or the DNA encoding it, and they are useful for the development of drugs that act on diseases and the like related to the protein.
This application is based on a Japanese patent application filed on May 2, 2002 (Japanese Patent Application No. 2002-130702) and a Japanese patent application filed on December 4, 2002 (Japanese Patent Application No. 2002-352694), the contents of which are incorporated herein by reference. The contents of the documents cited in the present specification are also incorporated herein by reference.
[0143]
[Sequence list]

Claims

The following protein (a) or (b):
(a) a protein consisting of the amino acid sequence of SEQ ID NO: 4 or 5;
(b) a protein comprising an amino acid sequence in which one or several amino acids have been deleted, substituted and/or added in the amino acid sequence of SEQ ID NO: 4 or 5, and having a transferase activity.

A DNA encoding the protein according to claim 1.

A full-length cDNA encoding the protein according to claim 1.

The DNA according to any one of the following (a), (b) or (c):
(a) a DNA having the nucleotide sequence of SEQ ID NO: 1 or 2;
(b) a DNA having a base sequence in which one or several bases are deleted, substituted and/or added in the base sequence of SEQ ID NO: 1 or 2, and encoding a protein having a transferase activity;
(c) a DNA having a base sequence capable of hybridizing under stringent conditions with a DNA having the base sequence of SEQ ID NO: 1 or 2 or a sequence complementary thereto, and encoding a protein having a transferase activity.

The following protein (a) or (b):
(a) a protein consisting of the amino acid sequence of SEQ ID NO: 6;
(b) a protein consisting of an amino acid sequence in which one or several amino acids have been deleted, substituted and/or added in the amino acid sequence of SEQ ID NO: 6, and having a carrier activity.

A DNA encoding the protein according to claim 5.

A full-length cDNA encoding the protein according to claim 5.

DNA of any of the following (a), (b) or (c):
(a) a DNA having the nucleotide sequence of SEQ ID NO: 3;
(b) a DNA having a base sequence in which one or several bases are deleted, substituted and/or added in the base sequence of SEQ ID NO: 3, and encoding a protein having a carrier activity;
(c) a DNA having a nucleotide sequence capable of hybridizing under stringent conditions with a DNA having the nucleotide sequence of SEQ ID NO: 3 or a sequence complementary thereto, and encoding a protein having a carrier activity.

A recombinant vector comprising the DNA according to claim 2.

A transgenic cell into which the DNA according to any one of claims 2 to 4 or the recombinant vector according to claim 9 has been introduced, or an individual comprising the cell.

A protein according to claim 1, which is produced by the cell according to claim 10.

A recombinant vector comprising the DNA according to claim 6.

A transgenic cell into which the DNA according to any one of claims 6 to 8 or the recombinant vector according to claim 12 has been introduced, or an individual comprising the cell.

A protein according to claim 5, which is produced by the cell according to claim 13.

An oligonucleotide selected from the group consisting of a sense oligonucleotide having the same sequence as 5 to 100 consecutive nucleotides in the base sequence of the DNA according to any one of claims 2 to 4 or 6 to 8, an antisense oligonucleotide having a sequence complementary to the sense oligonucleotide, and an oligonucleotide derivative of the sense or antisense oligonucleotide.

An antibody or a partial fragment thereof that specifically binds to the protein according to claim 1.

The antibody according to claim 16, wherein the antibody is a monoclonal antibody.

The antibody according to claim 17, wherein the monoclonal antibody has an action of neutralizing the transferase activity of the protein according to claim 1 or 11.

An antibody or a partial fragment thereof that specifically binds to the protein according to claim 5.

The antibody according to claim 19, wherein the antibody is a monoclonal antibody.

The antibody according to claim 20, wherein the monoclonal antibody has an action of neutralizing the carrier activity of the protein according to claim 5 or 14.

A method for screening a substance that regulates the activity of the protein according to any one of claims 1, 5, 11, and 14, comprising bringing the protein into contact with a test substance and measuring a change in the activity of the protein caused by the test substance.

A method for screening a substance that regulates the expression of DNA, comprising bringing a test substance into contact with the transgenic cell according to claim 10 or 13, and detecting a change in the expression level of the DNA introduced into the cell.

A computer-readable recording medium storing at least one piece of amino acid sequence information selected from the amino acid sequence of the protein according to claim 1 or 5, and/or at least one piece of base sequence information selected from the base sequence of the DNA according to any one of claims 2 to 4 or 6 to 8.

A carrier to which the protein according to claim 1 or 5 and / or the DNA according to any one of claims 2 to 4 or 6 to 8 is bound.