JPH11235187A - Cyp7 promotor binding factor - Google Patents

Cyp7 promotor binding factor

Info

Publication number
JPH11235187A
JPH11235187A JP10348391A JP34839198A JPH11235187A JP H11235187 A JPH11235187 A JP H11235187A JP 10348391 A JP10348391 A JP 10348391A JP 34839198 A JP34839198 A JP 34839198A JP H11235187 A JPH11235187 A JP H11235187A
Authority
JP
Japan
Prior art keywords
leu
ser
gln
lys
glu
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP10348391A
Other languages
Japanese (ja)
Inventor
Bei Shan
ベイ・シャン
Masahiro Nitta
正弘 仁田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sumitomo Pharmaceuticals Co Ltd
Tularik Inc
Original Assignee
Sumitomo Pharmaceuticals Co Ltd
Tularik Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sumitomo Pharmaceuticals Co Ltd, Tularik Inc filed Critical Sumitomo Pharmaceuticals Co Ltd
Publication of JPH11235187A publication Critical patent/JPH11235187A/en
Pending legal-status Critical Current

Links

Landscapes

  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Peptides Or Proteins (AREA)

Abstract

PROBLEM TO BE SOLVED: To obtain an isolated polypeptide CYP7 promotor binding factor which can regulate activation of transcription, and is useful for screening of pharmacological substances, diagnosis, and treatment by making the isolated polypeptide include a specific amino acid sequence. SOLUTION: This factor is obtained by including the sequence shown by formula I, II, or III, or at least ten-residue domain of formula I containing at least one of residues 1-10, 11-15, 16-21, 204-207, and 299-307, ten-residue domain of formula II containing residue 154, or ten-residue domain of formula III containing at least one of residues 3-10, 13-22, and 30-38. It is preferable that the domain binds specifically with CYP7 gene promotor, and that SPF polypeptide is produced by culturing host cells transformed by a vector containing a recombinant nucleic acid coding for the isolated polypeptide.

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【0001】[0001]

【発明の属する技術分野】本発明の分野はCYP7プロ
モーターに結合する転写因子である。
The field of the invention is transcription factors that bind to the CYP7 promoter.

【0002】[0002]

【従来の技術】哺乳動物の細胞において、コレステロー
ルは、膜生成のための、および正常な細胞機能にとって
不可欠のステロール類および非ステロール類の合成のた
めの必須エレメントである。しかし、過剰のコレステロ
ールは、細胞にとって致死的であるばかりでなく、それ
が動脈に沈着するためアテローム性動脈硬化症という大
きな問題を作り出す。コレステロールのホメオスタシス
を維持するため、細胞、特に肝細胞は、コレステロール
レベルを調節するための主要な三つの方法:1)LDL
レセプターを介する食餌コレステロールの取り込み;
2)内因性コレステロールの生合成、および3)コレス
テロールから胆汁酸への代謝的変換、を採用している。
これらのプロセスを調整する重要分子は、フィードバッ
クシグナルとしての役割を有するコレステロール自身で
ある。コレステロール取り込みまたは生合成のいずれか
によって細胞内コレステロールレベルが上昇すると、L
DLレセプターならびにHMG−CoAシンターゼおよ
びHMG−CoAレダクターゼといった重要なコレステ
ロール生合成酵素を含む遺伝子の転写が抑制される。こ
れらのフィードバックプロセスは、ステロール調節エレ
メント結合蛋白(SREBP)と呼ばれる転写因子の新
規なファミリーによって仲介される。SREBPは、N
末端転写因子ドメイン、2個の疎水性貫膜ドメインおよ
びC末端調節ドメインを含んでいる。細胞内コレステロ
ールレベルが低い場合、二段階の蛋白分解カスケードが
起こり、それはSREBPのN末端転写因子ドメインを
小胞体から放出して、SRE含有遺伝子の活性化が起こ
る核へと移動させる。
BACKGROUND OF THE INVENTION In mammalian cells, cholesterol is an essential element for membrane formation and for the synthesis of sterols and non-sterols essential for normal cell function. However, excess cholesterol is not only lethal to cells, but also creates a major problem of atherosclerosis because it deposits in arteries. To maintain cholesterol homeostasis, cells, especially hepatocytes, have three main ways to regulate cholesterol levels: 1) LDL
Uptake of dietary cholesterol via the receptor;
It employs 2) endogenous cholesterol biosynthesis and 3) metabolic conversion of cholesterol to bile acids.
A key molecule that regulates these processes is cholesterol itself, which acts as a feedback signal. When intracellular cholesterol levels are increased, either by cholesterol uptake or biosynthesis, L
Transcription of DL receptors and genes containing important cholesterol biosynthetic enzymes such as HMG-CoA synthase and HMG-CoA reductase is suppressed. These feedback processes are mediated by a novel family of transcription factors called sterol regulatory element binding proteins (SREBPs). SREBP is N
It contains a terminal transcription factor domain, two hydrophobic transmembrane domains and a C-terminal regulatory domain. When intracellular cholesterol levels are low, a two-step proteolytic cascade occurs, which releases the N-terminal transcription factor domain of SREBP from the endoplasmic reticulum to the nucleus where activation of SRE-containing genes occurs.

【0003】SREBP経路は、LDLレセプターおよ
びHMG−CoAシンターゼのようなコレステロールの
取り込み、ならびにコレステロール生合成に関与する遺
伝子の調節を司っているが、コレステロール異化の分子
的基礎は殆どわかっていない。コレステロール除去のた
めの主たる異化経路は、肝臓でのみ起こる胆汁酸の産生
である。コレステロール7α−ヒドロキシラーゼは、こ
の経路で最初のそして律速的な酵素である。CYP7と
しても知られるコレステロール7αヒドロキシラーゼ遺
伝子は、肝代謝に関わる多くのミクロソーム酵素を含む
シトクロムP−450ファミリーに属している。CYP
7遺伝子の発現は厳格に調節されていることが示されて
いる:これは肝臓でのみ発現され、その発現は食餌コレ
ステロールにより誘導され胆汁酸により抑制され得るの
である。コレステロールの異化はコレステロールのホメ
オスタシスにおいて中心的な役割を果たしていることが
示されている。二つの胆汁酸結合樹脂であるコレスチド
またはコレスチラミンで実験動物を処置すると、血清コ
レステロールレベルは低下する。さらに、ハムスターに
おけるCYP7遺伝子の過剰発現は、総およびLDLコ
レステロールレベルを低下させる。したがって、コレス
テロール7αヒドロキシラーゼはコレステロール低下薬
のための強力な治療上の標的であり、CYP7遺伝子の
発現が調節される機構の理解は特に重要性がある。
[0003] Although the SREBP pathway is responsible for the uptake of cholesterol, such as the LDL receptor and HMG-CoA synthase, and the regulation of genes involved in cholesterol biosynthesis, the molecular basis of cholesterol catabolism is poorly understood. The main catabolic pathway for cholesterol removal is the production of bile acids that occurs only in the liver. Cholesterol 7α-hydroxylase is the first and rate-limiting enzyme in this pathway. The cholesterol 7α hydroxylase gene, also known as CYP7, belongs to the cytochrome P-450 family, which contains many microsomal enzymes involved in liver metabolism. CYP
The expression of 7 genes has been shown to be tightly regulated: it is expressed only in the liver, and its expression can be induced by dietary cholesterol and suppressed by bile acids. Cholesterol catabolism has been shown to play a central role in cholesterol homeostasis. Treatment of experimental animals with two bile acid binding resins, cholestide or cholestyramine, reduces serum cholesterol levels. Furthermore, overexpression of the CYP7 gene in hamsters reduces total and LDL cholesterol levels. Therefore, cholesterol 7α-hydroxylase is a potent therapeutic target for cholesterol-lowering drugs, and understanding the mechanisms by which CYP7 gene expression is regulated is of particular importance.

【0004】ヒトCYP7遺伝子の肝特異的発現の分子
機構を研究するため、本発明者等はモデル系としてHe
pG2細胞を使用したが、これは、このセルラインが最
も良く研究されている肝セルラインの一つであり、ま
た、CYP7遺伝子を包含する幾つかの肝特異的遺伝子
の研究を通してこれが適当なセルラインであることが示
されているためである。本発明者等は、ヒトCYP7プ
ロモーターのDNアーゼI高感受性マッピングから研究
を開始し、このプロモーターの肝特異的エレメントを同
定した。その結果本発明者等は、このプロモーター結合
蛋白をコードしている遺伝子をクローニングし、これを
核オーファンレセプターFtz−F1ファミリーのヒト
オーソログであると同定した。
To study the molecular mechanism of liver-specific expression of the human CYP7 gene, the present inventors used He as a model system.
pG2 cells were used, which is one of the best studied hepatic cell lines, and also suitable for cell lines through the study of several liver-specific genes, including the CYP7 gene. This is because the line is indicated. We started with a DNase I hypersensitive mapping of the human CYP7 promoter and identified liver-specific elements of this promoter. As a result, the present inventors cloned a gene encoding this promoter binding protein and identified it as a human ortholog of the nuclear orphan receptor Ftz-F1 family.

【0005】関連技術 ガラーノーおよびビランガー(1997)、未刊行、受
理番号U93553は、ヒトα1−フェトプロテイン転
写因子(hFTF、配列番号7および8)を記載してお
り;J.D.タグウッド、I.イッセマンおよびS.グリーン
(1991)、未刊行、受理番号M81385はマウス
肝レセプターホモローガス蛋白(LRH−1)mRNA
および予想される翻訳物(mLRH、配列番号9および
10)を記載しており;そしてL.ガラーノー等(199
6)、Mol.Cell Biol.、16巻3853−3865頁は
部分的ラット遺伝子を開示しており;これらは皆、ここ
に開示したCPFポリペプチドとの配列類似性を有して
いる。
Related Art Garano and Villanger (1997), unpublished, accession number U93553, describe a human α1-fetoprotein transcription factor (hFTF, SEQ ID NOs: 7 and 8); JD Tagwood, I. Isseman and S. Green (1991), unpublished, accession number M81385 is mouse liver receptor homologous protein (LRH-1) mRNA
And putative translations (mLRH, SEQ ID NOs: 9 and 10); and L. Garrano et al.
6), Mol. Cell Biol., 16: 3853-3865, discloses partial rat genes; they all have sequence similarity to the CPF polypeptides disclosed herein.

【0006】[0006]

【発明が解決しようとする課題】本発明は、分離された
CPFポリペプチド、関連する核酸、CPF特異的構造
および活性を有するそのポリペプチドドメイン、ならび
にCPF機能、とりわけCYP7プロモーター結合のモ
ジュレーター、に関連する方法および組成物を提供する
ものである。CPFポリペプチドはCYP7プロモータ
ーに連結している遺伝子の活性化を調節することがで
き、よって細胞機能の重要な調節物質となり得る。この
ポリペプチドは、当該CPFポリペプチドをコードして
いる核酸により形質転換された宿主細胞から組換え法に
よって、または哺乳動物細胞からの精製により生産され
得る。本発明は、ここに開示したCPF遺伝子と特異的
にハイブリダイズすることのできる、分離されたCPF
ハイブリダイゼーションプローブおよびプライマー、特
異抗体のようなCPF特異的結合物質、ならびに当該組
成物を製造し、診断(例えばCPF転写物の遺伝子ハイ
ブリダイゼーションスクリーニング)、治療(例えば、
CYP7プロモーター依存転写を活性化するためのCP
Fアクチベーター)および生物薬学的産業(例えば、免
疫原、他の転写調節物質を分離するための試薬、ケミカ
ルライブラリーからのリード薬理物質のスクリーニング
を行うための試薬等)で使用する方法を提供する。
SUMMARY OF THE INVENTION The present invention relates to isolated CPF polypeptides, related nucleic acids, their polypeptide domains having CPF-specific structure and activity, and modulators of CPF function, particularly CYP7 promoter binding. Methods and compositions that provide CPF polypeptides can regulate the activation of genes linked to the CYP7 promoter, and thus can be important regulators of cell function. The polypeptide may be produced by recombinant methods from a host cell transformed with the nucleic acid encoding the CPF polypeptide, or by purification from mammalian cells. The present invention relates to an isolated CPF capable of specifically hybridizing with the CPF gene disclosed herein.
Hybridization probes and primers, CPF-specific binding agents such as specific antibodies, and the compositions are prepared and used for diagnosis (eg, gene hybridization screening of CPF transcripts), therapy (eg,
CP for activating CYP7 promoter-dependent transcription
F activator) and methods for use in the biopharmaceutical industry (eg, immunogens, reagents for separating other transcriptional regulators, reagents for screening lead pharmacological substances from chemical libraries, etc.). I do.

【0007】[0007]

【課題を解決するための手段】ヒトCPFポリペプチド
をコードしている天然cDNAのヌクレオチド配列を配
列番号1、3および5に示し、予想される完全な翻訳物
を配列番号2、4および6としてそれぞれ示す。本発明
に係るCPFポリペプチドは、配列番号2、4または6
の機能的ドメインの1またはそれ以上を含んでおり、こ
のドメインは、少なくとも8、好ましくは少なくとも1
6、より好ましくは少なくとも32、最も好ましくは少
なくとも64の隣接する配列番号2、4または6の残基
を含み、そしてヒトCPF特異的アミノ酸配列および活
性を有している。CPFドメイン特異活性とは、CYP
7プロモーター結合またはトランス活性化活性およびC
PF特異的免疫原性および/または抗原性を包含する。
CPF特異的ポリペプチド配列はhFTFおよびmLR
H(配列番号8および10)と区別され、配列比較によ
り容易に同定される。例えば本明細書の配列1、2およ
び3を参照されたい。配列の例は、残基1−10、11
−15、16−21、204−207および299−3
07のうち少なくとも1個を含む、配列番号2の10残
基ドメイン、残基154を含む配列番号4の10残基ド
メイン、および、残基3−10、13−22および30
−38のうち少なくとも1個を含む配列番号6の10残
基ドメインを包含する。
SUMMARY OF THE INVENTION The nucleotide sequences of the native cDNA encoding the human CPF polypeptide are shown in SEQ ID NOs: 1, 3 and 5, with the predicted complete translations as SEQ ID NOs: 2, 4 and 6. Shown respectively. The CPF polypeptide according to the present invention has SEQ ID NO: 2, 4 or 6.
Comprises one or more functional domains of at least 8, preferably at least 1
It comprises 6, more preferably at least 32, most preferably at least 64 contiguous SEQ ID NO: 2, 4 or 6 residues and has human CPF-specific amino acid sequence and activity. CPF domain-specific activity refers to CYP
7 promoter binding or transactivation activity and C
Includes PF-specific immunogenicity and / or antigenicity.
CPF-specific polypeptide sequences include hFTF and mLR
H (SEQ ID NOs: 8 and 10) and is easily identified by sequence comparison. See, for example, Sequences 1, 2 and 3 herein. Examples of sequences include residues 1-10, 11
-15, 16-21, 204-207 and 299-3
07, a 10-residue domain of SEQ ID NO: 4 including residues 154, and residues 3-10, 13-22 and 30
Includes a 10 residue domain of SEQ ID NO: 6 that includes at least one of -38.

【0008】CPF特異的活性または機能は、簡便な、
インビトロでの、細胞に基づく検定、またはインビボ検
定(例えばインビトロ結合検定、細胞培養検定、動物で
(例えば遺伝子治療、トランスジェニック動物等))に
よって測定することができる。結合検定は、CPFポリ
ペプチドと結合標的との分子相互作用が評価される任意
の検定を包含する。結合標的は、天然の細胞内結合標
的、例えばCYP7プロモーター結合部位、CPF調節
蛋白またはCPF活性もしくはその局在化を直接調節す
るその他の調節物質;または、非天然結合標的、例えば
抗体のような特異的免疫蛋白、合成核酸結合部位(下記
のコンセンサス配列を参照されたい)、または下記のよ
うなスクリーニング検定で同定されるようなCPF特異
物質であってよい。CPF結合特異性は、結合平衡定数
(通常、少なくとも約107-1、好ましくは少なくと
も約108-1、より好ましくは少なくとも約10
9-1)によって、CYP7または合成結合部位レポー
ター発現によって、当該ポリペプチドがCPF発現細胞
中で負の変異体として機能する能力はヘテロローガスな
宿主(例えば齧歯類またはウサギ)中でCPF特異抗体
を誘導する能力等によって、検定することができる。例
えば、この方法で、配列番号2の残基33−123によ
り定義されるドメインが機能的DNA結合ドメインであ
ることがわかり、配列番号2の残基242−333およ
び383−405により定義されるドメインが機能的リ
ガンド結合ドメインであることがわかる。
[0008] CPF-specific activity or function is simple,
In vitro, cell-based assays or in vivo assays (eg, in vitro binding assays, cell culture assays, in animals (eg, gene therapy, transgenic animals, etc.)). Binding assays include any assay in which the molecular interaction between a CPF polypeptide and a binding target is evaluated. The binding target may be a naturally occurring intracellular binding target, eg, a CYP7 promoter binding site, a CPF regulatory protein or other modulator that directly regulates CPF activity or its localization; or a non-naturally occurring binding target, eg, an antibody. Or a synthetic nucleic acid binding site (see consensus sequences below), or a CPF-specific substance as identified in a screening assay as described below. The CPF binding specificity is determined by the binding equilibrium constant (typically at least about 10 7 M -1 , preferably at least about 10 8 M -1 , more preferably at least about 10 8 M -1 .
9 M -1 ), the ability of the polypeptide to function as a negative mutant in CPF-expressing cells by CYP7 or synthetic binding site reporter expression is CPF-specific in heterologous hosts (eg, rodents or rabbits). It can be assayed by the ability to induce antibodies. For example, in this way, the domain defined by residues 33-123 of SEQ ID NO: 2 is found to be a functional DNA binding domain, and the domain defined by residues 242-333 and 383-405 of SEQ ID NO: 2 Is a functional ligand binding domain.

【0009】或る特別な態様においては、欠失突然変異
誘発を用いて、CYP7プロモーターエレメントと結合
する機能的CPFドメインが規定される(下記実施例を
参照されたい)。例えば表1を参照されたい。
In one particular embodiment, functional CPF domains that bind to the CYP7 promoter element are defined using deletion mutagenesis (see Examples below). See, for example, Table 1.

【0010】[0010]

【表1】 表1:CPF機能的ドメインを規定するCPF欠失変異体の例 変異体 配列 NA結合 △N1 配列番号2、残基4−495 + △N2 配列番号2、残基12−494 + △N3 配列番号2、残基24−495 + △N4 配列番号2、残基33−495 + △N5 配列番号2、残基33−123 + △C1 配列番号2、残基1−408 + △C2 配列番号2、残基1−335 + △C3 配列番号2、残基1−267 + △C4 配列番号2、残基1−189 + △C5 配列番号2、残基1−124 +TABLE 1 TABLE 1: CPF functional examples of CPF deletion mutant domains defining the mutant sequence D NA binding △ N1 SEQ ID NO: 2, residues 4-495 + △ N2 SEQ ID NO: 2, residues 12-494 + ΔN3 SEQ ID NO: 2, residues 24-495 + ΔN4 SEQ ID NO: 2, residues 33-495 + ΔN5 SEQ ID NO: 2, residues 33-123 + ΔC1 SEQ ID NO: 2, residues 1-408 + Δ C2 SEQ ID NO: 2, residues 1-335 + ΔC3 SEQ ID NO: 2, residues 1-267 + ΔC4 SEQ ID NO: 2, residues 1-189 + ΔC5 SEQ ID NO: 2, residues 1-124 +

【0011】或る特別な態様において、当該ドメインは
CPF特異抗原および/または免疫原を、特に担体蛋白
と結合した時に提供する。例えば、CPF−およびヒト
CPF−特異的ドメインに相当するペプチドをスカシガ
イ抗原(KLH)と共有結合させ、このコンジュゲート
を完全フロイントアジュバント中で乳化する。研究用ウ
サギを常套的プロトコルに従い免疫し、採血する。CP
F特異抗体の存在を、配列番号2、4または6の固定化
したCPFポリペプチドを用いる固相免疫吸着検定によ
って検定する(例えば表2を参照されたい)。
[0011] In one particular embodiment, the domain provides a CPF-specific antigen and / or immunogen, particularly when bound to a carrier protein. For example, peptides corresponding to CPF- and human CPF-specific domains are covalently linked to keyhole limpet antigen (KLH), and the conjugate is emulsified in complete Freund's adjuvant. Study rabbits are immunized and bled according to routine protocols. CP
The presence of F-specific antibodies is assayed by a solid phase immunosorbent assay using the immobilized CPF polypeptide of SEQ ID NO: 2, 4, or 6 (see, eg, Table 2).

【0012】[0012]

【表2】表2:CPF特異的ウサギポリクローナル抗体
を誘導する免疫原性CPFポリペプチド:上記のプロト
コルに従い免疫されたCPFポリペプチド−KLHコン
ジュゲートCPFポリペプチド配列 免疫原性 配列番号2、残基1−10 +++ 配列番号2、残基4−15 +++ 配列番号2、残基8−20 +++ 配列番号2、残基12−25 +++ 配列番号2、残基15−30 +++ 配列番号2、残基19−32 +++ 配列番号2、残基20−29 +++ 配列番号2、残基200−211 +++ 配列番号4、残基150−159 +++
Table 2: Immunogenic CPF polypeptide that induces CPF-specific rabbit polyclonal antibody: CPF polypeptide-KLH conjugated CPF polypeptide sequence immunized according to the above protocol Immunogenic SEQ ID NO: 2, residues 1-10 ++ SEQ ID NO: 2, residues 4-15 ++ SEQ ID NO: 2, residues 8-20 ++++ SEQ ID NO: 2, residues 12-25 ++++ SEQ ID NO: 2, residues 15-30 ++++ SEQ ID NO: 2, residues 19-32 ++ SEQ ID NO: 2, residues 20-29 ++++ SEQ ID NO: 2, residues 200-211 ++ SEQ ID NO: 4, residues 150-159 ++++

【0013】特許請求されているCPFポリペプチド
は、分離されているか、または純粋である:「分離され
た」ポリペプチドは、天然の状態でそれに付随する物質
の少なくとも幾らかを伴っておらず、好ましくは与えら
れた試料中の総ポリペプチドの少なくとも約0.5%、
より好ましくは少なくとも約5%(重量)を構成してお
り、そして、純粋なポリペプチドは、与えられた試料中
の総ポリペプチドの少なくとも約90%、好ましくは少
なくとも約99%(重量)を構成する。CPFポリペプ
チドおよびポリペプチドドメインは、合成することがで
き、組換え技術により生産することもでき、または哺乳
動物、好ましくは人間の細胞から精製することもでき
る。当該組成物の生化学的合成、分子発現および精製の
ための多岐にわたる分子生物学的および生化学的方法が
利用できる。例えば、モレキュラー・クローニング、ア
・ラボラトリーマニュアル(サムブルック等、コールド
・スプリング・ハーバー・ラボラトリー)、カレント・
プロトコール・イン・モレキュラー・バイオロジー(ア
ウスベル等編、グリーン Publ.Assoc.、ウィレイ−イン
ターサイエンス、NY)または当分野で知られるその他
の文献を参照されたい。
[0013] The claimed CPF polypeptides are isolated or pure: an "isolated" polypeptide does not have at least some of its associated substances in its natural state; Preferably at least about 0.5% of the total polypeptide in a given sample,
More preferably, it comprises at least about 5% (by weight), and pure polypeptide comprises at least about 90%, preferably at least about 99%, by weight of the total polypeptide in a given sample. I do. CPF polypeptides and polypeptide domains can be synthesized, produced by recombinant technology, or purified from mammalian, preferably human, cells. A wide variety of molecular biological and biochemical methods are available for biochemical synthesis, molecular expression and purification of the composition. For example, Molecular Cloning, A Laboratory Manual (Sambrook et al., Cold Spring Harbor Laboratory), Current
See Protocol in Molecular Biology (Ed. Ausubel et al., Green Publ. Assoc., Willey-Interscience, NY) or other literature known in the art.

【0014】本発明は、アゴニスト、アンタゴニスト、
天然の細胞内結合標的等を包含する、CPFポリペプチ
ド、好ましくは特許請求されているCPFポリペプチド
に特異的な結合物質、この様な結合物質を同定し作成す
る方法、ならびに診断、治療および医薬品開発における
それらの用途を提供する。例えば、特異的結合物質は、
特に疾病または疾病の予後が、当該蛋白の関与する経
路、例えばCYP7プロモーター依存転写活性化の不適
切な利用と関係している場合、様々な診断および治療へ
の適用に役立つ。新規なCPF特異的結合物質には、C
PF特異的レセプター/CPF特異的結合蛋白、例えば
特異抗体またはT細胞抗原レセプターのような体細胞組
換えによって生じるポリペプチドレセプター(例えばハ
ーロウおよびレイン(1988)、アンティボディー
ズ、ア・ラボラトリー・マニュアル、コールド・スプリ
ング・ハーバー・ラボラトリー)ならびに、1、2およ
び3ハイブリッドスクリーニングのような検定により同
定されるその他の天然の細胞内結合物質、下記のような
ケミカルライブラリーのスクリーニングで同定される非
天然の細胞内結合物質等が包含される。特に興味深い物
質は、CPF機能、例えばCPF依存性転写活性を調節
する。
The present invention provides an agonist, an antagonist,
CPF polypeptides, including naturally occurring intracellular binding targets and the like, preferably binding agents specific for the claimed CPF polypeptides, methods of identifying and making such binding agents, and diagnostic, therapeutic and pharmaceutical products Provide their use in development. For example, a specific binding substance is
It is useful for a variety of diagnostic and therapeutic applications, especially when the disease or prognosis of the disease is associated with inappropriate use of pathways involving the protein, such as CYP7 promoter-dependent transcriptional activation. Novel CPF-specific binding substances include C
PF-specific receptor / CPF-specific binding protein, eg, a polypeptide receptor produced by somatic recombination such as a specific antibody or T cell antigen receptor (eg, Harrow and Rein (1988), Antibodies, A Laboratory Manual, Cold Spring Harbor Laboratory) and other natural intracellular binders identified by assays such as 1, 2 and 3 hybrid screening, non-natural cells identified by screening chemical libraries as described below. Internal binding substances and the like. Of particular interest are those that modulate CPF function, eg, CPF-dependent transcriptional activity.

【0015】したがって、本発明は、CPF活性を調節
する工程を含む、細胞内でのCPFまたはCYP7プロ
モーターの関与するシグナル伝達を調節するための方法
を提供する。その細胞は、培養中のものであっても、イ
ンサイトゥ、即ち天然宿主の内部に存在するものであっ
てもよい。診断用途のためには、CPF結合物質は、結
合物質に直接コンジュゲートさせた、または結合物質に
特異的なプローブにコンジュゲートさせた、例えば蛍
光、放射性、化学ルミネセンス、またはその他の容易に
検出し得る分子で標識されることが多い。インヒビター
の例には、上記のようなCPFのドミナント/ネガティ
ブ変異体型をコードしている核酸等が包含される。
Accordingly, the present invention provides a method for regulating signaling involving a CPF or CYP7 promoter in a cell, comprising the step of regulating CPF activity. The cells may be in culture or in situ, i.e., present inside a natural host. For diagnostic applications, the CPF binding agent may be conjugated directly to the binding agent, or conjugated to a probe specific for the binding agent, eg, fluorescent, radioactive, chemiluminescent, or other readily detectable. Often labeled with a molecule that can. Examples of inhibitors include nucleic acids encoding dominant / negative mutant forms of CPF as described above.

【0016】開示したCPFポリペプチドのアミノ酸配
列を用いて、選択された発現系用に最適化された、CP
Fポリペプチドをコードする核酸に戻し翻訳したり(ホ
ラー等(1993)、Gene、136巻323−328
頁;マーティン等(1995)、Gene、154巻150
−166頁)、または、天然CPFをコードする核酸配
列の分離に使用するためのプローブおよび縮重オリゴヌ
クレオチドプライマーを作成する(「GCG」ソフトウ
ェア、ジェネティクス・コンピューター・グループ、In
c.、マディソン、WI)。CPFをコードする核酸はC
PF発現ベクターに使用され、そして、例えば発現およ
びスクリーニングのために組換え宿主細胞中に組み込ま
れ、例えばCPFにより調節される細胞機能に関連する
疾病用の候補薬物の有効性といったような機能研究のた
めにトランスジェニック動物中に組み込まれる。
Using the amino acid sequences of the disclosed CPF polypeptides, a CP optimized for an expression system of choice.
Translation back to a nucleic acid encoding the F polypeptide (Horror et al. (1993), Gene, 136, 323-328).
Page; Martin et al. (1995), Gene, 154, 150
-Page 166) or to create probes and degenerate oligonucleotide primers for use in isolating nucleic acid sequences encoding native CPF ("GCG" software, Genetics Computer Group, In.
c., Madison, WI). The nucleic acid encoding CPF is C
Used in PF expression vectors and incorporated into recombinant host cells for expression and screening, for example, to study functional studies such as the efficacy of candidate drugs for diseases associated with cell functions regulated by CPF. For transgenic animals.

【0017】さらに本発明は、配列番号1、3または5
の相補鎖を含む第二の核酸と特異的にハイブリダイズ
し、hFTFおよびmLRH cDNA(配列番号7お
よび9)を識別するに充分な、少なくとも12、好まし
くは少なくとも24、より好ましくは少なくとも36、
そして最も好ましくは少なくとも96の隣接する配列番
号1、3または5の鎖の塩基を含むCPF cDNA特
異配列を有する複製/増幅プライマーおよび核酸ハイブ
リダイゼーションプローブを提供する。このようなCP
F特異配列は、配列比較により容易に識別できる。例え
ば本明細書の配列4および5を参照されたい。特異的ハ
イブリダイゼーションを明示するには、一般に、ストリ
ンジェントな条件を必要とし、この条件とは、例えば、
5xSSPE(0.18M NaCl、0.01M Na
PO4、pH7.7、0.001MEDTA)緩衝液中
30%ホルムアミドを含む緩衝液中で42℃の温度でハ
イブリダイズさせ、42℃で0.2xSSPEによる洗
浄に付す時結合が持続し、好ましくは5xSSPE緩衝
液中50%ホルムアミドを含む緩衝液中42℃でハイブ
リダイズさせ、42℃で0.2xSSPE緩衝液による
洗浄に付す時結合が持続するものである。
Furthermore, the present invention relates to SEQ ID NO: 1, 3 or 5
At least 12, preferably at least 24, more preferably at least 36, which hybridizes specifically with a second nucleic acid comprising the complementary strand of hFTF and mLRH cDNA (SEQ ID NOs: 7 and 9).
And most preferably, it provides a replication / amplification primer and a nucleic acid hybridization probe having a CPF cDNA specific sequence comprising at least 96 contiguous bases of SEQ ID NO: 1, 3 or 5. Such a CP
F-specific sequences can be easily identified by sequence comparison. See, for example, sequences 4 and 5 herein. Demonstrating specific hybridization generally requires stringent conditions, such as, for example,
5xSSPE (0.18 M NaCl, 0.01 M Na
PO 4 , pH 7.7, 0.001 MEDTA) Hybridization at a temperature of 42 ° C. in a buffer containing 30% formamide in a buffer and washing at 42 ° C. with 0.2 × SSPE, the binding persists, preferably Hybridization is carried out at 42 ° C. in a buffer containing 50% formamide in 5 × SSPE buffer at 42 ° C. and the binding is sustained when subjected to washing with 0.2 × SSPE buffer at 42 ° C.

【0018】[0018]

【表3】 表3:条件Iおよび/または条件IIの下で配列番号1、3および/または5 の鎖とハイブリダイズするCPF核酸の例 CPF核酸 ハイブリダイゼーション 配列番号1、ヌクレオチド1−26 + 配列番号1、ヌクレオチド52−62 + 配列番号1、ヌクレオチド815−825 + 配列番号1、ヌクレオチド1120−1135 + 配列番号1、ヌクレオチド1630−1650 + 配列番号1、ヌクレオチド1790−1810 + 配列番号1、ヌクレオチド1855−1875 + 配列番号1、ヌクレオチド1910−1925 + 配列番号1、ヌクレオチド2090−2110 + 配列番号1、ヌクレオチド2166−2186 + 配列番号1、ヌクレオチド2266−2286 + 配列番号1、ヌクレオチド2366−2386 + 配列番号1、ヌクレオチド2466−2486 + 配列番号1、ヌクレオチド2566−2586 + 配列番号1、ヌクレオチド2666−2686 + 配列番号1、ヌクレオチド2766−2786 + 配列番号1、ヌクレオチド2866−2886 + 配列番号1、ヌクレオチド2966−2986 + 配列番号1、ヌクレオチド3066−3086 +Table 3: Examples of CPF nucleic acids that hybridize to the strands of SEQ ID NO: 1, 3 and / or 5 under conditions I and / or II CPF nucleic acid hybridization SEQ ID NO: 1, nucleotides 1-26 + sequence SEQ ID NO: 1, nucleotides 52-62 + SEQ ID NO: 1, nucleotides 815-825 + SEQ ID NO: 1, nucleotides 1120-1135 + SEQ ID NO: 1, nucleotides 1630-1650 + SEQ ID NO: 1, nucleotides 1790-1810 + SEQ ID NO: 1, nucleotide 1855 -1875 + SEQ ID NO: 1, nucleotides 1910-1925 + SEQ ID NO: 1, nucleotides 2090-2110 + SEQ ID NO: 1, nucleotides 2166-2186 + SEQ ID NO: 1, nucleotides 2266-2286 + SEQ ID NO: 1, nucleotides 2366-2386 SEQ ID NO: 1, nucleotides 2466-2486 + SEQ ID NO: 1, nucleotides 2566-2586 + SEQ ID NO: 1, nucleotides 2666-2686 + SEQ ID NO: 1, nucleotides 2766-2786 + SEQ ID NO: 1, nucleotides 2866-2886 + SEQ ID NO: 1, nucleotides 2966-2986 + SEQ ID NO: 1, nucleotides 3066-3086 +

【0019】当該核酸は、合成/非天然の配列であり、
そして/または、分離されており、即ち、天然の状態で
付随する物質の少なくとも幾らかを伴わず、好ましくは
与えられた画分中に存在する核酸の総重量の少なくとも
約0.5%、好ましくは少なくとも約5%を構成してお
り、そして通常組換え体である(これは、それらが、非
天然配列を含むか、または、天然染色体上でそれらが結
合しているヌクレオチド以外のヌクレオチドに結合した
天然配列を含むことを意味している)。配列番号1、3
または5のヌクレオチド配列またはその必須フラグメン
トを含む組換え核酸は、この様な配列またはフラグメン
トを、末端に、天然染色体上で結合している配列以外の
配列と直接隣接して(即ち、接続して)含んでいるか、
または、末端にある、もしくは天然染色体上で結合して
いる配列以外の配列と直接隣接している、10kb未
満、好ましくは2kb未満の天然フランキング領域と隣
接している。この核酸は通常RNAまたはDNAである
が、他の塩基またはヌクレオチド類似体を含む核酸を用
いて安定性等を修飾することが有利であることもしばし
ばある。
The nucleic acid is a synthetic / unnatural sequence,
And / or is separated, ie, without at least some of the associated substances in its natural state, and is preferably at least about 0.5%, preferably at least about 0.5% of the total weight of nucleic acids present in a given fraction Constitute at least about 5% and are usually recombinant (either they contain non-natural sequences or bind to nucleotides on the native chromosome other than the nucleotides to which they are attached). Including the native sequence of the present invention). SEQ ID NOs: 1, 3
Alternatively, a recombinant nucleic acid comprising the nucleotide sequence of 5 or an essential fragment thereof may have such a sequence or fragment terminated directly adjacent (ie, connected) to a sequence other than the sequence associated on the native chromosome. ) Contains
Alternatively, it is adjacent to a natural flanking region of less than 10 kb, preferably less than 2 kb, which is directly adjacent to a sequence other than the sequence at the end or on the natural chromosome. This nucleic acid is usually RNA or DNA, but it is often advantageous to modify the stability or the like using a nucleic acid containing other base or nucleotide analogs.

【0020】当該核酸は、翻訳可能な転写物、ハイブリ
ダイゼーションプローブ、PCRプライマー、および診
断用核酸等として、CPF遺伝子および遺伝子転写物の
存在を検出するために、およびさらなるCPFホモログ
および構造類似体をコードしている核酸を検出または増
幅するためなど、幅広く利用することができる。診断に
おいては、CPFハイブリダイゼーションプローブは、
臨床および実験試料中の野生型およびCPFアレル変異
体の同定に用途を見いだせる。アレル変異体を使用し
て、高効率の臨床診断のためのアレル特異的オリゴヌク
レオチド(ASO)プローブを作成することができる。
治療においては、治療用CPF核酸は活性CPFの細胞
発現、細胞内の濃度もしくはその利用の調節に使用され
る。
The nucleic acids may be used to detect the presence of CPF genes and gene transcripts, such as translatable transcripts, hybridization probes, PCR primers, and diagnostic nucleic acids, and to provide additional CPF homologs and structural analogs. It can be widely used, for example, for detecting or amplifying an encoding nucleic acid. In diagnosis, CPF hybridization probes are:
It finds use in identifying wild-type and CPF allelic variants in clinical and experimental samples. Allelic variants can be used to generate allele-specific oligonucleotide (ASO) probes for highly efficient clinical diagnosis.
In therapy, therapeutic CPF nucleic acids are used to modulate cellular expression, intracellular concentration or utilization of active CPF.

【0021】本発明は、CPFで調節され得る細胞機能
のレベルで活性な物質、化合物またはそれらのリード化
合物を同定する有効な方法を提供する。一般に、これら
のスクリーニング法は、天然CPF結合標的とのCPF
の相互作用を調節する化合物を検定することを含んでい
る。標識インビトロ蛋白−蛋白結合検定、イムノアッセ
イ、DNA結合検定、細胞を基礎とする検定等を包含す
る、結合物質のための多岐にわたる検定が提供される。
この方法は、リード化合物を求めるためのケミカルライ
ブラリーの、自動化された、費用効率の良い高効率のス
クリーニングに従う。同定された試薬は、動物および人
間の試験のための薬品工業に用途が見いだせる。例え
ば、この試薬は、医薬品開発のために活性を最適化し毒
性を最小化するため、インビトロおよびインビボ検定で
誘導体化および再スクリーニングすることができる。
The present invention provides an effective method for identifying substances, compounds or their lead compounds that are active at the level of cellular function that can be regulated by CPF. In general, these screening methods involve CPF binding to natural CPF binding targets.
Assaying for compounds that modulate the interaction of. A wide variety of assays for binding agents are provided, including labeled in vitro protein-protein binding assays, immunoassays, DNA binding assays, cell-based assays, and the like.
This method relies on automated, cost-effective and efficient screening of chemical libraries for lead compounds. The identified reagents find use in the pharmaceutical industry for animal and human testing. For example, the reagents can be derivatized and rescreened in in vitro and in vivo assays to optimize activity and minimize toxicity for drug development.

【0022】インビトロ結合検定では、別のペプチドま
たはポリペプチド(例えば検出または結合(anchoring)
等のためのタグ)との融合産物の一部であってよいCP
Fポリペプチドを含んでいる要素の混合物を使用する。
この検定混合物は、天然の細胞内CPF結合標的を含
む。天然の全長結合標的を使用することもできるが、一
部分がこの検定において簡便に測定可能な当該CPFポ
リペプチドに対する結合親和性および結合力を提供する
限り、それらの一部分(例えばオリゴヌクレオチド)を
使用することが好ましいことが多い。検定混合物はさら
に、候補となる薬理物質を含んでいる。候補物質は多数
の化学分類上の物質を包含するが、典型的には有機化合
物であって、好ましくは小さい有機化合物であり、合成
または天然化合物のライブラリーを包含する多様な供給
源から取得する。様々なその他の試薬もまたこの混合物
に含まれ得る。これらは、塩類、緩衝剤、中性蛋白とい
った試薬を包含し、例えばアルブミン、洗浄剤、プロテ
アーゼインヒビター、ヌクレアーゼインヒビター、抗菌
剤等を使用することができる。
In an in vitro binding assay, another peptide or polypeptide (eg, detection or anchoring)
May be part of a fusion product with
A mixture of elements containing an F polypeptide is used.
This assay mixture contains the natural intracellular CPF binding target. Natural full-length binding targets can be used, but portions of them (eg, oligonucleotides) are used as long as they provide the binding affinity and avidity for the CPF polypeptide that can be conveniently measured in this assay. Is often preferred. The assay mixture further contains a candidate pharmacological substance. Candidate substances include numerous chemical classes, but are typically organic compounds, preferably small organic compounds, obtained from a variety of sources including libraries of synthetic or natural compounds. . Various other reagents may also be included in the mixture. These include reagents such as salts, buffers and neutral proteins, and for example, albumin, detergents, protease inhibitors, nuclease inhibitors, antibacterial agents and the like can be used.

【0023】得られた混合物は、その候補薬理物質の存
在が無ければCPFポリペプチドが細胞の結合標的、言
及した結合親和性を有するその一部分または類似体に特
異結合するような条件下でインキュベートする。混合物
の構成成分は、必要な結合をもたらす任意の順序で添加
することができ、インキュベーションは、最適な結合を
促進する任意の温度で実施することができる。インキュ
ベーション時間は同様に、最適な結合がなされるよう
に、しかも迅速な高効率スクリーニングを促進するため
に最小となるよう選択される。
The resulting mixture is incubated under conditions such that, in the absence of the candidate pharmacological agent, the CPF polypeptide specifically binds to a cellular binding target, a portion or analog thereof having the stated binding affinity. . The components of the mixture can be added in any order that provides for the requisite binding, and the incubation can be performed at any temperature that promotes optimal binding. Incubation times are likewise selected for optimal binding and minimized to facilitate rapid high efficiency screening.

【0024】インキュベーションの後、CPFポリペプ
チドおよび1またはそれ以上の結合標的の間の、物質に
よって変化を受けた結合を、任意の簡便な方法により検
出する。該物質の存在下での結合親和性と比較した該物
質非存在下での標的に対するCPFポリペプチドの結合
親和性の差違は、該物質がCPF結合標的に対するCP
Fポリペプチドの結合を調節することを示している。同
様に、以下に記載する、細胞を基礎とする検定において
は、或る物質の存在下および非存在下でのCPF依存転
写活性化の差違が、該物質がCPF機能を調節すること
を示している。本明細書中使用される差違とは、統計学
的に有意であり、且つ好ましくは少なくとも50%、よ
り好ましくは少なくとも90%の差違を表す。
After incubation, the substance-altered binding between the CPF polypeptide and one or more binding targets is detected by any convenient method. The difference in the binding affinity of a CPF polypeptide for a target in the absence of the substance compared to the binding affinity in the presence of the substance indicates that the substance has a binding affinity for the CPF binding target.
It shows that it modulates F polypeptide binding. Similarly, in cell-based assays described below, differences in CPF-dependent transcriptional activation in the presence and absence of a substance indicate that the substance modulates CPF function. I have. As used herein, a difference is statistically significant and preferably represents a difference of at least 50%, more preferably at least 90%.

【0025】以下の実験の部および実施例は、例示のた
めに供されるものであり、限定のためではない。
The following experimental part and examples are provided by way of illustration and not by way of limitation.

【0026】[0026]

【実施例】1.CPFおよびCYP7プロモーターエレ
メントの分 離および特性決定 細胞およびプラスミドHepG2、ヒト肝癌セルライ
ン、293、形質転換された胚腎臓セルライン、および
Caco2、結腸腺癌セルラインをATCCから購入す
る。SV589は形質転換されたヒト線維芽セルライン
である。細胞は、10%牛胎児血清を添加したダルベッ
コ変法イーグル培地−ハムF12(1:1)中、加湿し
たインキュベーター内で37℃、5%CO2で培養し
た。pGL3:CYP7は、ヒトCYP7α遺伝子の−
716/+14領域のDNAフラグメントを含み、これ
をpGL3−ルシフェラーゼレポータープラスミド(プ
ロメガ)中にクローニングした。pGL3:SFMまた
はpGL3:BAMは、各々−130および−129
(GGからTTへ)または−62および−61(AAか
らTCへ)の位置に変異を含む。エクスサイト(ExS
ite)突然変異誘発キット(ストラタジーン)を使用
することにより、pGL3:CYP7中に二塩基対置換
を導入した。プロモーターの−135ないし−118の
野生型の3個のテンドン(tendon)反復、または
−130および−129の位置にGからTへの二塩基対
置換を有する該反復のいずれかを、HSV TK遺伝子
由来のTATA配列を有する修飾されたpGL3中にク
ローニングすることにより、pGL3:3xwtおよび
pGL3:3xmutを組み立てた。pfCPFは、p
CDNA3(インビトロジーン)中にクローニングされ
た遺伝子のN末端に、フラッグ標識された配列を含む。
pfCPF−AF2は、該遺伝子のC末端にAF−2ド
メインの15アミノ酸欠失を有する。pfCPF−VP
は、HSV VP16のトランス活性化ドメイン(aa
412−490)を含み、これはpfCPFのAF−2
ドメインを置き換えたものである。
[Examples] 1. CPF and CYP7 promoter elements
Placement of separation and characterization cells and plasmid HepG2, human hepatocarcinoma cells lines, 293, Buy transformed embryonic kidney cell line, and Caco2, colon adenocarcinoma cell line from ATCC. SV589 is a transformed human fibroblast cell line. Cells were cultured in Dulbecco's modified Eagle's medium-Ham F12 (1: 1) supplemented with 10% fetal calf serum in a humidified incubator at 37 ° C., 5% CO 2 . pGL3: CYP7 is the − of the human CYP7α gene.
A DNA fragment of the 716 / + 14 region was cloned into the pGL3-luciferase reporter plasmid (Promega). pGL3: SFM or pGL3: BAM are -130 and -129, respectively.
Include mutations at positions (GG to TT) or -62 and -61 (AA to TC). Excite (ExS
it) Mutagenesis kit (Stratagene) was used to introduce a two base pair substitution in pGL3: CYP7. Either the -135 to -118 wild-type three tendon repeats of the promoter, or those repeats having a G to T double base pair substitution at positions -130 and -129, were added to the HSV TK gene PGL3: 3xwt and pGL3: 3xmut were assembled by cloning into modified pGL3 with the derived TATA sequence. pfCPF is p
It contains a flag-labeled sequence at the N-terminus of the gene cloned into cDNA3 (Invitrogen).
pfCPF-AF2 has a 15 amino acid deletion of the AF-2 domain at the C-terminus of the gene. pfCPF-VP
Is the transactivation domain of HSV VP16 (aa
412-490), which are AF-2 of pfCPF.
It replaces the domain.

【0027】DNアーゼI高感受性マッピング。細胞
(3x106)を収穫し、50mMトリス−HCl(pH
7.9)、100mM KCl、5mM MgCl2、0.0
5%サポニン、200mM 2−メルカプトエタノール、
50%グリセロールを含有する溶菌緩衝液1.5mL中で
溶解した。核を遠心により集め、100mM NaCl、
50mMトリス−HCl(pH7.9)、3mM MgC
2、1mM DTT、1Xコンプリートプロテアーゼイン
ヒビターカクテル(ベーリンガー・マンハイム)、およ
び連続希釈したDNアーゼI(5、1.7、0.6単位
/mL)を含有する緩衝液に再懸濁した。核の懸濁液を3
7℃で20分間インキュベートした。EDTAを最終濃
度100mMとなるまで加えることにより、反応を停止さ
せた。RNアーゼAおよびプロテアーゼK処理の後、ゲ
ノムDNAを調製し、サザンハイブリダイゼーションに
付した。
[0027] DNase I hypersensitive mapping. Cells (3 × 10 6 ) are harvested and 50 mM Tris-HCl (pH
7.9), 100 mM KCl, 5 mM MgCl 2 , 0.0
5% saponin, 200 mM 2-mercaptoethanol,
Lysis was performed in 1.5 mL of lysis buffer containing 50% glycerol. The nuclei were collected by centrifugation and 100 mM NaCl,
50 mM Tris-HCl (pH 7.9), 3 mM MgC
Resuspended in buffer containing l 2 , 1 mM DTT, 1 × complete protease inhibitor cocktail (Boehringer Mannheim), and serially diluted DNase I (5, 1.7, 0.6 units / mL). 3 nuclear suspensions
Incubated at 7 ° C for 20 minutes. The reaction was stopped by adding EDTA to a final concentration of 100 mM. After treatment with RNase A and protease K, genomic DNA was prepared and subjected to Southern hybridization.

【0028】電気泳動移動度シフト検定。核抽出物を、
NaClの代わりにKClを用いて培養細胞から調製し
た。インビトロ転写および翻訳をTNT系(プロメガ)
で実施した。核抽出物の蛋白1μgまたはインビトロ翻
訳産物0.1−1μLを、10mM Hepes(pH
7.6)、ポリ(dI−dC)1μg、100mM KC
l、7%グリセロール、1mM EDTA、1mM DTT、
5mM MgCl2、および無関係の一本鎖オリゴDNA4
0pmoleを含有する反応緩衝液中で、40000cpm
32P標識されたオリゴヌクレオチドと混合し、室温で
20分間インキュベートした。反応混合物を4%ポリア
クリルアミド−0.5xTBEゲル上で分離した。ゲル
を乾燥し、X線フィルムに暴露した。競合実験では、3
0または60倍モル過剰の競合物質DNAを加えた。抗
体スーパーシフト実験では、抗CPF抗血清または免疫
前血清を、プローブDNAの添加前に反応混合物に加え
た。
Electrophoretic mobility shift assay. Nuclear extract,
It was prepared from cultured cells using KCl instead of NaCl. In vitro transcription and translation using the TNT system (Promega)
It was carried out in. 1 μg of the protein of the nuclear extract or 0.1-1 μL of the in vitro translation product was added to 10 mM Hepes (pH
7.6), 1 μg of poly (dI-dC), 100 mM KC
1, 7% glycerol, 1 mM EDTA, 1 mM DTT,
5 mM MgCl 2 and irrelevant single-stranded oligo DNA 4
40000 cpm in a reaction buffer containing 0 pmole
And mixed with 32 P-labeled oligonucleotide and incubated at room temperature for 20 minutes. The reaction mixture was separated on a 4% polyacrylamide-0.5 × TBE gel. The gel was dried and exposed to X-ray film. In the competition experiment, 3
A 0 or 60-fold molar excess of competitor DNA was added. In antibody supershift experiments, anti-CPF antiserum or pre-immune serum was added to the reaction mixture before the addition of probe DNA.

【0029】トランスフェクションおよびレポーター遺
伝子分析。トランスフェクションの1日前に細胞を6ウ
ェル培養皿にまきこんだ(4x105/ウェル)。一般
に、ルシフェラーゼレポータープラスミド2ugおよび
RSV LTR駆動b−ガラクトシダーゼ発現ベクター
0.1ugを、燐酸カルシウム法により、培養された細
胞中に48時間トランスフェクトした。細胞抽出物を調
製し、ルシフェラーゼ検定系(プロメガ)を用いてルシ
フェラーゼ活性について検定した。ルシフェラーゼ活性
はb−ガラクトシダーゼ活性により標準化した。
Transfection and reporter gene analysis. One day prior to transfection, cells were seeded into 6-well culture dishes (4 × 10 5 / well). Generally, 2 ug of the luciferase reporter plasmid and 0.1 ug of the RSV LTR-driven b-galactosidase expression vector were transfected into the cultured cells by the calcium phosphate method for 48 hours. Cell extracts were prepared and assayed for luciferase activity using a luciferase assay system (Promega). Luciferase activity was normalized by b-galactosidase activity.

【0030】CPFの分子クローニング。Ftz−F1
ボックス配列を含むヒトESTクローン(ジェンバン
ク、受理番号N59515)を用いて、クロンテクより
購入したヒト肝cDNAライブラリーをスクリーニング
した。ファージDNAをpTriplExプラスミドに変換
することにより陽性クローン中のcDNAを回収し、配
列決定した。同じ遺伝子から種々にスプライシングされ
た型であろうと思われる幾つかの陽性クローンの中か
ら、一つのクローン(pTriplEx−113)をさらな
る分析のために選択した。
Molecular cloning of CPF. Ftz-F1
Using a human EST clone containing a box sequence (GenBank, accession number N59515), a human liver cDNA library purchased from Clontech was screened. The cDNA in the positive clone was recovered by converting the phage DNA into the pTriplEx plasmid and sequenced. One of several positive clones (pTriplEx-113) was selected for further analysis out of several positive clones that would have been variously spliced from the same gene.

【0031】CPFの組織特異的発現。ヒト組織由来の
ポリA+RNAのノーザンブロットをクロンテクより購
入した。ノーザンMAXハイブリダイゼーション緩衝液
(アンビオン)を用いてハイブリダイゼーション反応を
実施した。
Tissue-specific expression of CPF. Northern blot of poly A + RNA derived from human tissue was purchased from Clontech. Hybridization reactions were performed using Northern MAX hybridization buffer (Ambion).

【0032】免疫沈降。CPF cDNA配列から誘導
したペプチド(DRMRGGRNFKGPMYKRD
R)を使用して、抗CPFポリクローナル抗体を作製し
た。HepG2または293細胞(1x107)を、1
00μCi/mLの35S−メチオニンを含有する培地で30
分間培養した。細胞を収穫し、50mMトリス−HCl
(pH7.5)、125mM NaCl、5mM EDTA、
0.1%NP−40を含有する緩衝液中での3回の凍結
−融解により溶解した。次いで細胞溶解液を、抗CPF
抗体を用いる免疫沈降のために使用した。沈澱した試料
を10%SDS−PAGEにより分離し、X線フィルム
に暴露した。
Immunoprecipitation. Peptides derived from the CPF cDNA sequence (DRMRGGRNFKGGPMYKRD
R) was used to generate anti-CPF polyclonal antibodies. HepG2 or 293 cells (1 × 10 7 )
In a medium containing 00 μCi / mL 35 S-methionine,
Incubate for minutes. Harvest cells and add 50 mM Tris-HCl
(PH 7.5), 125 mM NaCl, 5 mM EDTA,
It was thawed by three freeze-thaw cycles in a buffer containing 0.1% NP-40. The cell lysate is then washed with anti-CPF
Used for immunoprecipitation with antibodies. The precipitated sample was separated by 10% SDS-PAGE and exposed to X-ray film.

【0033】ヒトCYP7遺伝子のDNアーゼI高感受
性部位マッピング。ヒトCYP7遺伝子の肝特異的発現
の機構を研究するため、まず本発明者等は、該遺伝子の
DNアーゼI高感受性マッピングにより、肝特異的発現
を司ると思われるエレメントを同定しようと試みた。D
NアーゼI高感受性は転写活性に関連していることが知
られている。HepG2、293およびCaco2細胞
から調製された核を、漸増する量のDNアーゼIで処理
した。次にDNAを抽出し、適当な制限酵素で消化し、
−944から−468までのヌクレオチドを含む標識化
されたフラグメントをプローブとして、サザンブロッテ
ィングした。予想された5kbPstIフラグメントに加
え、第二の2.8kbバンドが観察された。増加させた
量のDNアーゼI処理に伴って2.8kbバンドの強度
が増大し、同時に親フラグメントたる5kbバンドの強
度が低下したことは、DNアーゼ高感受性部位の存在を
示すものである。重要なことに、この2.8kbバンド
はHepG2細胞にのみ観察され、調査した他の細胞に
は見られなかった。該フラグメントの大きさは、肝特異
的DNアーゼI高感受性部位が、ヒトCYP7遺伝子の
転写開始部位から−100bpないし−300bpの間
に存在することを示している。この部位の位置を、異な
る領域由来のプローブで異なる制限酵素を用いることに
より、さらに確認した。
[0033] DNase I hypersensitive site mapping of the human CYP7 gene. In order to study the mechanism of liver-specific expression of the human CYP7 gene, the present inventors first attempted to identify elements believed to be responsible for liver-specific expression by DNase I hypersensitive mapping of the gene. D
Nase I hypersensitivity is known to be associated with transcriptional activity. Nuclei prepared from HepG2, 293 and Caco2 cells were treated with increasing amounts of DNaseI. The DNA is then extracted and digested with appropriate restriction enzymes,
Southern blotting was performed using a labeled fragment containing nucleotides from -944 to -468 as a probe. In addition to the expected 5 kb PstI fragment, a second 2.8 kb band was observed. The increase in the intensity of the 2.8 kb band with the increased amount of DNase I treatment and the decrease in the intensity of the 5 kb band as the parent fragment at the same time indicate the presence of a DNase hypersensitive site. Importantly, this 2.8 kb band was observed only in HepG2 cells and not in the other cells examined. The size of the fragment indicates that the liver-specific DNase I hypersensitive site is between -100 bp and -300 bp from the transcription start site of the human CYP7 gene. The location of this site was further confirmed by using different restriction enzymes with probes from different regions.

【0034】肝特異的CYP7プロモーターエレメント
の同定。CYP7遺伝子の肝特異的エレメントをさらに
同定するため、7個の部分的に重複するオリゴヌクレオ
チド(CL5、bp−368−291;CL6、bp−
311−232;CL7、bp−256−177;CL
1、bp−201−122;CL2、bp−140−6
1;CL3、bp−121−42;CL4、bp−60
−+20)を合成し、ゲル移動度シフト実験に使用し
た。標識されたオリゴヌクレオチドCL1およびオリゴ
ヌクレオチドCL2を使用する時、肝細胞特異的DNA
−蛋白複合体の形成があった。標識されていないオリゴ
ヌクレオチドCL1はオリゴヌクレオチドCL2と競合
することから、オリゴヌクレオチドCL1およびCL2
は明らかに同じ複合体を認識した。これらは過剰の非標
識オリゴヌクレオチドCL1およびCL2によって競合
できるが、この領域に隣接するオリゴヌクレオチドCL
3−7によっては競合できないことから、このDNA−
蛋白複合体は配列特異的である。このプロモーター複合
体は、293、Caco2またはSV589核抽出物で
は観察されず、HepG2核抽出物でのみ観察され、こ
の事は、上で同定された肝特異的DNアーゼI高感受性
部位と矛盾しない。これら二つのオリゴヌクレオチドと
重複する配列が、肝特異的DNA−蛋白複合体の形成に
関与しているのは明らかである
Identification of liver-specific CYP7 promoter elements. To further identify the liver-specific element of the CYP7 gene, seven partially overlapping oligonucleotides (CL5, bp-368-291; CL6, bp-
311-232; CL7, bp-256-177; CL
1, bp-201-122; CL2, bp-140-6
1: CL3, bp-121-42; CL4, bp-60
− + 20) was synthesized and used for gel mobility shift experiments. When using labeled oligonucleotides CL1 and CL2, hepatocyte-specific DNA
-There was formation of a protein complex. Because unlabeled oligonucleotide CL1 competes with oligonucleotide CL2, oligonucleotides CL1 and CL2
Clearly recognized the same complex. These can compete with excess unlabeled oligonucleotides CL1 and CL2, but the oligonucleotides CL adjacent to this region
3-7, this DNA-
Protein complexes are sequence-specific. This promoter complex was not observed in 293, Caco2 or SV589 nuclear extracts, but only in HepG2 nuclear extracts, consistent with the liver-specific DNase I hypersensitive sites identified above. It is clear that sequences overlapping with these two oligonucleotides are involved in the formation of a liver-specific DNA-protein complex

【0035】配列分析により、この領域が、核ホルモン
レセプターの結合部位であることが知られている幾つか
の6bp反復エレメントを含むことが明らかとなった。
肝特異的結合に関与している正確な配列を決定するた
め、この反復配列またはその隣接配列の各々に変異を含
んでいる幾つかのオリゴヌクレオチドを合成した。表4
に示される通り、反復AおよびBに変異を含むオリゴヌ
クレオチドは複合体形成で競合するが、反復Cに変異を
含むオリゴヌクレオチドは競合せず、この事は、反復C
が結合に必須であることを示すものである。複合体形成
に必要なヌクレオチドをさらに決定するため、反復Cお
よび隣接配列に詳細な変異を含む幾つかのオリゴヌクレ
オチドを合成し、ゲルシフト実験に使用した。本発明者
等の結果は、複合体形成には9個のヌクレオチドを含む
共通エレメントが必要であることを示した。このエレメ
ントは、Ftz−F1と呼ばれる核ホルモンレセプター
のファミリーのための結合部位であることが知られてい
る。
Sequence analysis revealed that this region contains several 6 bp repeat elements known to be binding sites for nuclear hormone receptors.
To determine the exact sequence involved in liver-specific binding, several oligonucleotides were synthesized containing mutations in this repeat or each of its flanking sequences. Table 4
As shown in Figure 5, oligonucleotides containing mutations in repeats A and B compete for complex formation, whereas oligonucleotides containing mutations in repeat C do not compete, indicating that repeats C
Is essential for binding. To further determine the nucleotides required for complex formation, several oligonucleotides containing detailed mutations in repeat C and adjacent sequences were synthesized and used in gel shift experiments. Our results have shown that complex formation requires a common element containing 9 nucleotides. This element is known to be a binding site for a family of nuclear hormone receptors called Ftz-F1.

【0036】[0036]

【表4】 オリゴヌクレオチド DNA 結合 TCTGATACCTGTGGACTTAGTTCAAGGCCAGTTA + TCTGGAGGATGTGGACTTAGTTCAAGGCCAGTTA + TCTGATACCTGTTATATTAGTTCAAGGCCAGTTA + TCTGGAGGATGTGGACTTCTATCAAGGCCAGTTA + TCTGATACCTGTTATATTCTATCAAGGCCAGTTA + TCTGGAGGATGTGGACTTAGTTCACACAGAGTTA + TCTGATACCTGTGGACTTAGTAGAAGGCCAGTTA - TCTGATACCTGTGGACTTAGTTCTTGGCCAGTTA - TCTGATACCTGTGGACTTAGTTCAATGCCAGTTA - TCTGATACCTGTGGACTTAGTTCAAGTCCAGTTA - TCTGATACCTGTGGACTTAGTTCAAGGAGAGTTA - TCTGATACCTGTGGACTTAGTTCAAGGCCTATTA - TCTGATACCTGTGGACTTAGTTCAAGGCCAATTA + TCTGATACCTGTGGACTTAGTTCAAGGCCAGGTA + TCAAGGCCA CYP7P-結合部位 YCAAGGYCR FTZ-F1 コンセンサス AAAGGTCA NGFI-B コンセンサス TCTGATACCTGTGGACTTAGTCAAAGGCCAGTTA - TCTGATACCTGTGGACTTAGTACCAGGCCAGTTA - TCTGATACCTGTGGACTTAGTAGGAGGCCAGTTA - TCTGATACCTGTGGACTTAGTAAGAGGCCAGTTA - TCTGATACCTGTGGACTTAGTTTCAGGCCAGTTA - TCTGATACCTGTGGACTTAGTCTCAGGCCAGTTA -Table 4 Oligonucleotide DNA binding TCTGATACCTGTGGACTTAGTTCAAGGCCAGTTA + TCTG GAGGA TGTGGACTTAGTTCAAGGCCAGTTA + TCTGATACCTGT TATA TTAGTTCAAGGCCAGTTA + TCTGGAGGATGTGGACTT CTA TCAAGGCCAGTTA + TCTGATACCTGT TATA TT CTA TCAAGGCCAGTTA + TCTGGAGGATGTGGACTTAGTTCA CACAGAG TTA + TCTGATACCTGTGGACTTAGT AG AAGGCCAGTTA - TCTGATACCTGTGGACTTAGTTC TT GGCCAGTTA - TCTGATACCTGTGGACTTAGTTCAA T GCCAGTTA - TCTGATACCTGTGGACTTAGTTCAAG T CCAGTTA - TCTGATACCTGTGGACTTAGTTCAAGG AG AGTTA - TCTGATACCTGTGGACTTAGTTCAAGGCC TA TTA - TCTGATACCTGTGGACTTAGTTCAAGGCCA A TTA + TCTGATACCTGTGGACTTAGTTCAAGGCCAG G TA + TCAAGGCCA CYP7P- binding site YCAAGGYCR FTZ-F1 consensus AAAGGTCA NGFI-B consensus TCTGATACCTGTGGACTTAGT CA AAGGCCAGTTA - TCTGATACCTGTGGACTTAGT A C C AGGCCAGTTA - TCTGATACCTGTGGACTTAGT AGG AGGCCAGTTA - TCTGATACCTGTGGACTTAGT AAG AGGCCAGTTA - TCTGATACCTGTGGACTTAGTT TC AGGCCAGTTA -TCTGATACCTGTGGACTTAGT CTC AGGCCAGTTA-

【0037】Ftz−F1結合部位は、ヒトCYP7遺
伝子の肝特異的発現にとって必須である。ヒトCYP7
遺伝子発現におけるFtz−F1部位の役割を決定する
ため、この部位を2ヌクレオチド置換により変異させ
た。対照として、無関係な領域での変異も作製した。野
生型または変異させたFtz−F1部位のいずれかを含
む+14ないし−716のプロモーター配列、または対
照を、ルシフェラーゼレポータープラスミドpGL3中
にクローニングした。次にこのプラスミドDNAをHe
pG2、293およびCaco2細胞中にトランスフェ
クトし、プロモーター活性をルシフェラーゼ活性により
測定した。Ftz−F1部位での変異はHepG2細胞
におけるプロモーター活性を完全に破壊し、一方293
およびCaco2細胞では殆どまたは全く影響を示さな
かった。対照として、無関係な領域での変異は、調査さ
れた全細胞においてプロモーター活性への影響が観察さ
れなかった。
The Ftz-F1 binding site is essential for liver-specific expression of the human CYP7 gene. Human CYP7
To determine the role of the Ftz-F1 site in gene expression, this site was mutated by a two nucleotide substitution. As a control, mutations in unrelated regions were also made. The promoter sequence from +14 to -716, containing either the wild-type or mutated Ftz-F1 site, or a control, was cloned into the luciferase reporter plasmid pGL3. Next, this plasmid DNA was
The cells were transfected into pG2, 293 and Caco2 cells, and the promoter activity was measured by luciferase activity. Mutation at the Ftz-F1 site completely abolishes promoter activity in HepG2 cells, while 293
And Caco2 cells showed little or no effect. As a control, mutations in unrelated regions had no effect on promoter activity in all cells examined.

【0038】肝特異的CYP7プロモーター結合蛋白の
クローニング。核ホルモンレセプターは、DNA特異的
な、しばしばリガンド依存性の、転写因子である。Ft
z−F1、ショウジョウバエDNA結合蛋白は、核ホル
モンレセプターファミリーのサブグループの原型であ
る。殆どの核ホルモンレセプターと同様、Ftz−F1
は、亜鉛フィンガーDNA結合ドメインおよび推定上の
リガンド結合ドメインを含んでいる。Ftz−F1ファ
ミリー成員のDNA結合ドメインは、2個の亜鉛フィン
ガーモジュールのC末端に、ユニークな26アミノ酸伸
長部(Ftz−F1ボックスと呼ばれる)を含んでい
る。Ftz−F1ボックスの配列は、ショウジョウバエ
から齧歯類まで保存されており、DNAへの配列特異的
結合に深く関与している。ヒトCYP7プロモーター中
にFtz−F1結合部位を同定できたことは、ヒトFt
z−F1様蛋白がヒトCYP7遺伝子中のFtz−F1
エレメントと結合することを示唆している。人間のFt
z−F1をクローニングするため、Ftz−F1ボック
スのDNA配列を用いてESTデータベースを探索し、
ヒトESTクローンを見出した。次いでこのEST配列
をプローブとして使用してヒト肝cDNAライブラリー
をスクリーニングした。幾つかのクローンを分離し、そ
れらのうちの一つ、クローン#113をさらなる分析に
使用した。
Cloning of liver-specific CYP7 promoter binding protein. Nuclear hormone receptors are DNA-specific, often ligand-dependent, transcription factors. Ft
z-F1, a Drosophila DNA binding protein, is the prototype of a subgroup of the nuclear hormone receptor family. Like most nuclear hormone receptors, Ftz-F1
Contains a zinc finger DNA binding domain and a putative ligand binding domain. The DNA binding domain of Ftz-F1 family members contains a unique 26 amino acid extension (termed the Ftz-F1 box) at the C-terminus of the two zinc finger modules. The sequence of the Ftz-F1 box is conserved from Drosophila to rodents and is deeply involved in sequence-specific binding to DNA. The ability to identify the Ftz-F1 binding site in the human CYP7 promoter indicates that human Ft-F1
The z-F1-like protein is Ftz-F1 in the human CYP7 gene.
Suggests binding to the element. Human Ft
To clone z-F1, the EST database was searched using the DNA sequence of the Ftz-F1 box,
A human EST clone was found. The EST sequence was then used as a probe to screen a human liver cDNA library. Several clones were isolated and one of them, clone # 113, was used for further analysis.

【0039】CPFの特性決定。クローン#113は、
最初のATGの30ヌクレオチド上流にフレーム内停止
コドンを含みながら、495アミノ酸の全長ポリペプチ
ドをコードしている。本発明者等はこの蛋白を、CYP
7プロモーター結合因子(YP7 Promoter-binding
Factor)に因んでCPFと命名した。配列分析によ
り、CPFは、Ftz−F1ファミリーの新たな成員で
あることが明らかとなっている。CPFの最も近いホモ
ログは、このファミリーのマウス版、LRH−1(配列
番号7および8)およびヒト変異体、hFTF(配列番
号9および10)である。クローニングされたCPFが
CYP7プロモーター結合活性に関与している因子であ
ることを確認するため、インビトロ翻訳されたCPFを
HepG2核抽出物と並行してゲルシフト実験に使用し
た。本発明者等は、インビトロ翻訳されたCPFは内因
性蛋白が認識するのと同じDNA配列を認識し、そして
これら二者のゲルシフトパターンは同一であることを見
出した。Ftz−F1ボックスを含むペプチドに対して
作製された抗体をゲルシフト実験に使用した。本発明者
等は、HepG2核抽出物またはインビトロ翻訳された
CPFのいずれかを用いて形成されたDNA−蛋白複合
体は、特異抗体により破壊されるが、免疫前血清によっ
ては破壊されないことを見出した。さらに、該抗体は、
インビトロ翻訳されたCPFと一緒に泳動する肝特異的
細胞蛋白を認識した。293細胞には対応する蛋白が無
いため、Ftz−F1に特異的な抗体により認識される
内因性遺伝子産物は、明らかに肝特異的である。
Characterization of CPF. Clone # 113 is
It encodes a 495 amino acid full-length polypeptide, including an in-frame stop codon 30 nucleotides upstream of the first ATG. We call this protein CYP
7 promoter binding factor (C YP7 P romoter-binding
It was named the CPF in honor of the F actor). Sequence analysis has revealed that CPF is a new member of the Ftz-F1 family. The closest homologs of CPF are the murine version of this family, LRH-1 (SEQ ID NOs: 7 and 8) and the human variant, hFTF (SEQ ID NOs: 9 and 10). To confirm that the cloned CPF was a factor involved in CYP7 promoter binding activity, in vitro translated CPF was used in gel shift experiments in parallel with HepG2 nuclear extract. We have found that in vitro translated CPF recognizes the same DNA sequence as the endogenous protein does, and that the gel shift patterns of the two are identical. Antibodies raised against peptides containing the Ftz-F1 box were used in gel shift experiments. We have found that DNA-protein complexes formed using either HepG2 nuclear extract or in vitro translated CPF are destroyed by specific antibodies, but not by pre-immune serum. Was. Further, the antibody
Liver-specific cellular proteins that migrated with in vitro translated CPF were recognized. Since there is no corresponding protein in 293 cells, the endogenous gene product recognized by the antibody specific for Ftz-F1 is clearly liver-specific.

【0040】CPFの転写活性。CPFの転写活性を測
定するため、フラッグ標識された発現プラスミドpfC
PFを、野生型Ftz−F1結合部位を3コピー含むル
シフェラーゼレポータープラスミドと共に293細胞中
にトランスフェクトした。本発明者等は、pfCPFは
限定された転写活性を有することを見出した。この弱い
転写活性が、活性が恐らくはやはりリガンド依存性であ
る、該遺伝子の弱転写ドメインAF2に起因するか否か
を決定するため、CPFのAF2ドメインを強いウイル
ストランス活性化ドメインで置き換えることにより、p
fCPF−VPを組み立てた。fCPF−VPをレポー
タープラスミドと共に293細胞中にトランスフェクト
させると、強い転写活性が観察されるが、これは、CP
Fの転写活性化が、リガンド依存プロセスまたは補助因
子のいずれかの助けを必要とすることを示唆している。
Transcription activity of CPF. To measure the transcriptional activity of CPF, a flag-labeled expression plasmid pfC
PF was transfected into 293 cells with a luciferase reporter plasmid containing three copies of the wild-type Ftz-F1 binding site. The present inventors have found that pfCPF has limited transcription activity. To determine if this weak transcriptional activity is due to the weakly transcriptional domain AF2 of the gene, which activity is also likely ligand dependent, by replacing the AF2 domain of CPF with a strong viral transactivation domain, p
fCPF-VP was assembled. When fCPF-VP was transfected into 293 cells with a reporter plasmid, strong transcriptional activity was observed,
It has been suggested that transcriptional activation of F requires the help of either ligand-dependent processes or cofactors.

【0041】CPFの組織特異的発現。齧歯類ではCY
P7遺伝子は専ら肝で発現されることが報告されてい
る。CPF遺伝子の組織特異的発現を決定するため、一
対のRNA組織ブロットを、標識化CPF cDNAま
たはCYP7 cDNAのいずれかをプローブとして行
った。本発明者等は、CPF遺伝子の発現は膵臓および
肝臓で強いように見受けられ、心臓および肺では発現レ
ベルが低く、その他の組織では殆どまたは全く発現され
ないことを見出した。ヒトCYP7は肝臓でのみ発現さ
れるようである。興味深いことに、低分子量の膵特異的
転写物はヒトCYP7プローブにより認識された。
Tissue-specific expression of CPF. CY in rodents
It has been reported that the P7 gene is exclusively expressed in the liver. To determine tissue-specific expression of the CPF gene, a pair of RNA tissue blots was performed using either the labeled CPF cDNA or CYP7 cDNA as a probe. The present inventors have found that CPF gene expression appears to be strong in pancreas and liver, with low levels of expression in heart and lung and little or no expression in other tissues. Human CYP7 appears to be expressed only in the liver. Interestingly, low molecular weight pancreatic-specific transcripts were recognized by the human CYP7 probe.

【0042】2.ハイスループットインビトロ蛍光偏光
検定 試薬: センサー:ローダミン標識したILRKLLQEペプチ
ド(最終濃度=1−5nM) レセプター:グルタチオン−S−トランスフェラーゼ/
CPFリガンド結合ドメイン(配列番号2、残基1−1
23)融合蛋白(最終濃度=100−200nM) 緩衝液:10mM HEPES、10mM NaCl、6mM
塩化マグネシウム、pH7.6
2. High throughput in vitro fluorescence polarization
Assay reagent: Sensor: rhodamine-labeled ILRKLLQE peptide (final concentration = 1-5 nM) Receptor: glutathione-S-transferase /
CPF ligand binding domain (SEQ ID NO: 2, residues 1-1)
23) Fusion protein (final concentration = 100-200 nM) Buffer: 10 mM HEPES, 10 mM NaCl, 6 mM
Magnesium chloride, pH 7.6

【0043】プロトコル: 1.96ウェル微量定量板の各ウェルに、センサー/レ
セプター混合物90μLを加える。
Protocol: 1. Add 90 μL of the sensor / receptor mixture to each well of a 96-well microtiter plate.

【0044】2.ウェル当たり10μLの被験化合物を
加える。
2. Add 10 μL of test compound per well.

【0045】3.5分間振盪し、フルオロライトFPM
−2蛍光偏光微量定量系(ダイナテク・ラボラトリー
ズ、Inc.)を用いることにより5分以内に蛍光偏光の量
を測定する。
Shake for 3.5 minutes, Fluorite FPM
-2 Measure the amount of fluorescence polarization within 5 minutes by using a fluorescence polarization micro-quantification system (Dynatech Laboratories, Inc.).

【0046】3.細胞を基礎とするレポーター検定のた
めのプロトコル CPFは、293細胞またはHeLa細胞で過剰発現さ
れる時、FTZ−F1レポーターコンストラクトをトラ
ンス活性化することができる。3FTZ−F1結合部位
−ルシフェラーゼレポーター組み立て物をコードしてい
るプラスミドおよびCPFをコードしている様々な量の
発現ベクターによって、293細胞を燐酸カルシウム沈
澱法を用いてトランスフェクトする。36−48時間
後、細胞を処置せずにおくか、または候補リガンド(1
0−50ng/mL)で6時間処置し、その後収穫する。細
胞を溶解し、ルシフェラーゼ活性をルシフェラーゼ検定
キット(プロメガ)を用いて測定する。各トランスフェ
クトにおけるルシフェラーゼ活性は、pRSV−βga
l対照ベクターを同時トランスフェクトすることにより
標準化する。
3. For cell-based reporter assays
Protocol CPF can transactivate FTZ-F1 reporter constructs when overexpressed in 293 or HeLa cells. 293 cells are transfected with a plasmid encoding the 3FTZ-F1 binding site-luciferase reporter assembly and various amounts of expression vector encoding CPF using the calcium phosphate precipitation method. After 36-48 hours, the cells are left untreated, or the candidate ligand (1
(0-50 ng / mL) for 6 hours before harvesting. Cells are lysed and luciferase activity is measured using a luciferase assay kit (Promega). Luciferase activity in each transfection was determined by pRSV-βga
Normalize by co-transfecting the control vector.

【0047】4.配列の並置 当該ポリヌクレオチドおよびポリペプチド配列の様々な
並置を、配列特異的フラグメントを表しつつ第5−8表
に示す。例えば、第7表は、105−、hFTF−およ
びmLRH−特異的ペプチドを表しつつ105、hFT
FおよびmLRHポリペプチド配列の並置を示してい
る。同様にそれら各々のcDNA配列の並置(それぞれ
配列番号5、7および9)は、105−、hFTF−お
よびmLRH−特異的cDNAフラグメントを表してい
る。
4. Sequence alignments Various alignments of the polynucleotide and polypeptide sequences are shown in Tables 5-8, representing sequence-specific fragments. For example, Table 7 shows 105, hFTF- and mLRH-specific peptides while expressing 105, hFT-
1 shows the alignment of the F and mLRH polypeptide sequences. Similarly, the alignment of their respective cDNA sequences (SEQ ID NOs: 5, 7 and 9, respectively) represents 105-, hFTF- and mLRH-specific cDNA fragments.

【0048】この明細書に引用されている全ての刊行物
および特許出願は、一つ一つの刊行物または特許出願
が、引用により本明細書の一部とされる旨、明確且つ個
別的に指示されているかの如く、引用によって本明細書
に組み入れられている。前記発明は、明確な理解の目的
のための例示および実施例によって幾分詳細に記載され
てきたが、付記されている特許請求の範囲の精神または
範囲を逸脱することなく、何らかの変更および修飾を施
し得ることは、本発明の教示に照らし、当業者には極め
て明らかであろう。
All publications and patent applications cited in this specification are expressly and individually indicated to state that each publication or patent application is incorporated by reference. As if incorporated herein by reference. Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, any changes and modifications may be made without departing from the spirit or scope of the appended claims. What can be done will be very apparent to those skilled in the art in light of the teachings of the present invention.

【0049】[0049]

【表5】 113PRO = 配列番号2 hFTFpro = 配列番号8 113PRO MSSNSDTGDL QESLKHG--- -LTP--IVSQ FKMVNYSYDE DLEELCPVCG 44 hFTFpro MLPKVETEAL GLARSHGEQG QMPENMQVSQ FKMVNYSYDE DLEELCPVCG 50 113PRO DKVSGYHYGL LTCESCKGFF KRTVQNNKRY TCIENQNCQI DKTQRKRCPY 94 hFTFpro DKVSGYHYGL LTCESCKGFF KRTVQNNKRY TCIENQNCQI DKTQRKRCPY 100 113PRO CRFQKCLSVG MKLEAVRADR MRGGRNKFGP MYKRDRALKQ QKKALIRANG 144 hFTFpro CRFQKCLSVG MKLEAVRADR MRGGRNKFGP MYKRDRALKQ QKKALIRANG 150 113PRO LKLEAMSQVI QAMPSDLTIS SAIQNIHSAS KGLPLNHAAL PPTDYDRSPF 194 hFTFpro LKLEAMSQVI QAMPSDLTIS SAIQNIHSAS KGLPLNHAAL PPTDYDRSPF 200 113PRO VTSPISMTMP PHGSLQGYQT YGHFPSRAIK SEYPDPYTSS PESIMGYSYM 244 hFTFpro VTSPISMTM- LHGSLQGYQT YGHFPSRAIK SEYPDPYTSS PESIMGYSYM 249 113PRO DSYQTSSPAS IPHLILELLK CEPDEPQVQA KIMAYLQQEQ ANRSKHEKLS 294 hFTFpro DSYQTSSPAS IPHLILELLK CEPDEPQVQA KIMAYLQQEQ ANRSKHEKLS 299 113PRO TFGLMCKMAD QTLFSIVEWA RSSIFFRELK VDDQMKLLQN CWSELLILDH 344 hFTFpro TFGLMCKMAD QTVFSIVEWA RSSIFFRELK VDDQMKLLQN CWSELLILDH 349 113PRO IYRQVVHGKE GSIFLVTGQQ VDYSIIASQA GATLNNLMSH AQELVAKLRS 394 hFTFpro IYRQVVHGKE GSIFLVTGQQ VDYSIIASQA GATLNNLMSH AQELVAKLRS 399 113PRO LQFDQREFVC LKFLVLFSLD VKNLENFQLV EGVQEQVNAA LLDYTMCNYP 444 hFTFpro LQFDQREFVC LKFLVLFSLD VKNLENFQLV EGVQEQVNAA LLDYTMCNYP 449 113PRO QQTEKFGQLL LRLPEIRAIS MQAEEYLYYK HLNGDVPYNN LLIEMLHAKR 494 hFTFpro QQTEKFGQLL LRLPEIRAIS MQAEEYLYYK HLNGDVPYNN LLIEMLHAKR 499 113PRO A 495 hFTFpro A 500TABLE 5 113PRO = SEQ ID NO: 2 hFTFpro = SEQ ID NO: 8 113PRO MSSNSDTGDL QESLKHG --- -LTP - IVSQ FKMVNYSYDE DLEELCPVCG 44 hFTFpro MLPKVETEAL GLARSHGEQG QMPENMQVSQ FKMVNYSYDE DLEELCPVCG 50 113PRO DKVSGYHYGL LTCESCKGFF KRTVQNNKRY TCIENQNCQI DKTQRKRCPY 94 hFTFpro DKVSGYHYGL LTCESCKGFF KRTVQNNKRY TCIENQNCQI DKTQRKRCPY 100 113PRO CRFQKCLSVG MKLEAVRADR MRGGRNKFGP MYKRDRALKQ QKKALIRANG 144 hFTFpro CRFQKCLSVG MKLEAVRADR MRGGRNKFGP MYKRDRALKQ QKKALIRANG 150 113PRO LKLEAMSQVI QAMPSDLTIS SAIQNIHSAS KGLPLNHAAL PPTDYDRSPF 194 hFTFpro LKLEAMSQVI QAMPSDLTIS SAIQNIHSAS KGLPLNHAAL PPTDYDRSPF 200 113PRO VTSPISMTMP PHGSLQGYQT YGHFPSRAIK SEYPDPYTSS PESIMGYSYM 244 hFTFpro VTSPISMTM- LHGSLQGYQT YGHFPSRAIK SEYPDPYTSS PESIMGYSYM 249 113PRO DSYQTSSPAS IPHLILELLK CEPDEPQVQA KIMAYLQQEQ ANRSKHEKLS 294 hFTFpro DSYQTSSPAS IPHLILELLK CEPDEPQVQA KIMAYLQQEQ ANRSKHEKLS 299 113PRO TFGLMCKMAD QTLFSIVEWA RSSIFFRELK VDDQMKLLQN CWSELLILDH 344 hFTFpro TFGLMCKMAD QTVFSIVEWA RSSIFFRELK VDDQMKLLQN CWSELLILDH349 YRQVVHGKE GSIFLVTGQQ VDYSIIASQA GATLNNLMSH AQELVAKLRS 394 hFTFpro IYRQVVHGKE GSIFLVTGQQ VDYSIIASQA GATLNNLMSH AQELVAKLRS 399 113PRO LQFDQREFVC LKFLVLFSLD VKNLENFQLV EGVQEQVNAA LLDYTMCNYP 444 hFTFpro LQFDQREFVC LKFLVLFSLD VKNLENFQLV EGVQEQVNAA LLDYTMCNYP 449 113PRO QQTEKFGQLL LRLPEIRAIS MQAEEYLYYK HLNGDVPYNN LLIEMLHAKR 494 hFTFpro QQTEKFGQLL LRLPEIRAIS MQAEEYLYYK HLNGDVPYNN LLIEMLHAKR 499 113PRO A 495 hFTFpro A 500

【0050】[0050]

【表6】 113PRO = 配列番号1 36PRO = 配列番号4 hFTFpro = 配列番号8 mLRHpro = 配列番号10 113PRO MSSNSDTGDL QESLKHG--- ---------- ---------- ---------- 17 36pro MSSNSDTGDL QESLKHG--- ---------- ---------- ---------- 17 hFTFpro MLPKVETEAL GLARSHG--- ---------- ---------- ---------- 17 mLRHpro MSASLDTGDF QEFLKHGLTA IASAPGSETR HSPKREEQLR EKRAGLPDRH 50 113PRO ---------- ---------- ---------- --LTP--IVS QFKMVNYSYD 33 36pro ---------- ---------- ---------- --LTP--IVS QFKMVNYSYD 33 hFTFpro ---------- ---------- --------EQ GQMPENMQVS QFKMVNYSYD 39 mLRHpro RRPIPARSRL VMLPKVETEA PGLVRSHGEQ GQMPENMQVS QFKMVNYSYD 100 113PRO EDLEELCPVC GDKVSGYHYG LLTCESCKGF FKRTVQNNKR YTCIENQNCQ 83 36pro EDLEELCPVC GDKVSGYHYG LLTCESCKGF FKRTVQNNKR YTCIENQNCQ 83 hFTFpro EDLEELCPVC GDKVSGYHYG LLTCESCKGF FKRTVQNNKR YTCIENQNCQ 89 mLRHpro EDLEELCPVC GDKVSGYHYG LLTCESCKGF FKRTVQNQKR YTCIENQNCQ 150 113PRO IDKTQRKRCP YCRFQKCLSV GMKLEAVRAD RMRGGRNKFG PMYKRDRALK 133 36pro IDKTQRKRCP YCRFQKCLSV GMKLEAVRAD RMRGGRNKFG PMYKRDRALK 133 hFTFpro IDKTQRKRCP YCRFQKCLSV GMKLEAVRAD RMRGGRNKFG PMYKRDRALK 139 mLRHpro IDKTQRKRCP YCRFKKCIDV GMKLEAVRAD RMRGGRNKFG PMYKRDRALK 200 113PRO QQKKALIRAN GLKLEAMSQV IQAMPSDLTI SSAIQNIHSA SKGLPLNHAA 183 36pro QQKKALIRAN GLKLEAMSQV D--------- ---------- ---------- 154 hFTFpro QQKKALIRAN GLKLEAMSQV IQAMPSDLTI SSAIQNIHSA SKGLPLNHAA 189 mLRHpro QQKKALIRAN GLKLEAMSQV IQAMPSDLT- -SAIQNIHSA SKGLPLSHVA 248 113PRO LPPTDYDRSP FVTSPISMTM PPHGSLQGYQ TYGHFPSRAI KSEYPDPYTS 233 36pro ---------- ---------- ---------- ---------- ---------- 154 hFTFpro LPPTDYDRSP FVTSPISMTM -LHGSLQGYQ TYGHFPSRAI KSEYPDPYTS 238 mLRHpro LPPTDYDRSP FVTSPISMTM PPHSSLHGYQ PYGHFPSRAI KSEYPDPYSS 298 113PRO SPESIMGYSY MDSYQTSSPA SIPHLILELL KCEPDEPQVQ AKIMAYLQQE 283 36pro ---------- ---------- ---------- ---------- ---------- 154 hFTFpro SPESIMGYSY MDSYQTSSPA SIPHLILELL KCEPDEPQVQ AKIMAYLQQE 288 mLRHpro SPESMMGYSY MDGYQTNSPA SIPHLILELL KCEPDEPQVQ AKIMAYLQQE 348 113PRO QANRSKHEKL STFGLMCKMA DQTLFSIVEW ARSSIFFREL KVDDQMKLLQ 333 36pro ---------- ---------- ---------- ---------- ---DQMKLLQ 161 hFTFpro QANRSKHEKL STFGLMCKMA DQTVFSIVEW ARSSIFFREL KVDDQMKLLQ 338 mLRHpro QSNRNRQEKL SAFGLLCKMA DQTLFSIVEW ARSSIFFREL KVDDQMKLLQ 398 113PRO NCWSELLILD HIYRQVVHGK EGSIFLVTGQ QVDYSIIASQ AGATLNNLMS 383 36pro NCWSELLILD HIYRQVVHGK EGSIFLVTGQ QVDYSIIASQ AGATLNNLMS 211 hFTFpro NCWSELLILD HIYRQVVHGK EGSIFLVTGQ QVDYSIIASQ AGATLNNLMS 388 mLRHpro NCWSELLILD HIYRQVAHGK EGTIFLVTGE HVDYSTIISH TEVAFNNLLS 448 113PRO HAQELVAKLR SLQFDQREFV CLKFLVLFSL DVKNLENFQL VEGVQEQVNA 433 36pro HAQELVAKLR SLQFDQREFV CLKFLVLFSL DVKNLENFQL VEGVQEQVNA 261 hFTFpro HAQELVAKLR SLQFDQREFV CLKFLVLFSL DVKNLENFQL VEGVQEQVNA 438 mLRHpro LAQELVVRLR SLQFDQREFV CLKFLVLFSS DVKNLENLQL VEGVQEQVNA 498 113PRO ALLDYTMCNY PQQTEKFGQL LLRLPEIRAI SMQAEEYLYY KHLNGDVPYN 483 36pro ALLDYTMCNY PQQTEKFRQL LLRLPEIRAI SMQAEEYLYY KHLNGDVPYN 311 hFTFpro ALLDYTMCNY PQQTEKFGQL LLRLPEIRAI SMQAEEYLYY KHLNGDVPYN 488 mLRHpro ALLDYTVCNY PQQTEKFGQL LLRLPEIRAI SKQAEDYLYY KHVNGDVPYN 548 113PRO NLLIEMLHAK RA 495 36pro NLLIEMLHAK RA 323 hFTFpro NLLIEMLHAK RA 500 mLRHpro NLLIEMLHAK RA 560[Table 6] 113PRO = SEQ ID NO: 1 36PRO = SEQ ID NO: 4 hFTFpro = SEQ ID NO: 8 mLRHpro = SEQ ID NO: 10 ---------- 17 36pro MSSNSDTGDL QESLKHG --- ---------- ---------- ---------- 17 hFTFpro MLPKVETEAL GLARSHG --- ---------- ---------- ---------- 17 mLRHpro MSASLDTGDF QEFLKHGLTA IASAPGSETR HSPKREEQLR EKRAGLPDRH 50 113PRO ------- --- ---------- ---------- --LTP--IVS QFKMVNYSYD 33 36pro ---------- -------- ----------- --LTP--IVS QFKMVNYSYD 33 hFTFpro ---------- ---------- -------- EQ GQMPENMQVS QFKMVNYSYD 39 mLRHpro RRPIPARSRL VMLPKVETEA PGLVRSHGEQ GQMPENMQVS QFKMVNYSYD 100 113PRO EDLEELCPVC GDKVSGYHYG LLTCESCKGF FKRTVQNNKR YTCIENQNCQ 83 36pro EDLEELCPVC GDKVSGYHYG LLTCESCKGF FKRTVQNNKR YTCIENQNCQ 83 hFTFpro EDLEELCPVC GDKVSGYHYG LLTCESCKGF FKRTVQNNKR YTCIENQNCQ 89 mLRHpro EDLEELCPVC GDKVSGYHYG LLTCESCKGF FKRTVQNQKR YTCIENQNCQ 150 113PRO IDKTQRKRCP YCRFQKCLSV GMKLEAVRAD RMRGGRNKFG PMYKRDRALK 133 36pro IDKTQRKRCP YCRFQKCLSV GMKL EAVRAD RMRGGRNKFG PMYKRDRALK 133 hFTFpro IDKTQRKRCP YCRFQKCLSV GMKLEAVRAD RMRGGRNKFG PMYKRDRALK 139 mLRHpro IDKTQRKRCP YCRFKKCIDV GMKLEARNAD RMRGGRNKFG PMYKRDRALK IQ 113LK -------- 154 hFTFpro QQKKALIRAN GLKLEAMSQV IQAMPSDLTI SSAIQNIHSA SKGLPLNHAA 189 mL RHpro QQKKALIRAN GLKLEAMSQV IQAMPSDLT- -SAIQNIHSA SKGLPLSHVA 248 113PRO LPPTDYDRSP FVTSPISMTM PPHGSLQGYTSY PGFP --- ---------- ---------- ---------- 154 hFTFpro LPPTDYDRSP FVTSPISMTM -LHGSLQGYQ TYGHFPSRAI KSEYPDPYTS 238 mLRHpro LPPTDYDRSP FVTSPISMTM PPHSSLHGYQ PYGHFPSRAI KSEYPDPYSS 298 113PRO SPESIMGYSY MDSYQTSSPA SIPHLILELL KCEPDEPQVQ AKIMAYLQQE 283 36pro ---------- ---------- ---------- ---------- --- ------- 154 hFTFpro SPESIMGYSY MDSYQTSSPA SIPHLILELL KCEPDEPQVQ AKIMAYLQQE 288 mLRHpro SPESMMGYSY MDGYQTNSPA SIPHLILELL KCEPDEPQVQ AKIMAYLQQE 348 113PRO QANRSKHEKL STFGLMCKMA DQTLFSIVE W ARSSIFFREL KVDDQMKLLQ 333 36pro ---------- ---------- ---------- ---------- --- DQMKLLQ 161 hFTFpro QANRSKHEKL STFGLMCKMA DQTVFSIVEW ARSSIFFREL KVDDQMKLLQ 338 mLRHpro QSNRNRQEKL SAFGLLCKMA DQTLFSIVEW ARSSIFFREL KVDDQMKLLQ 398 113PRO NCWSELLILD HIYRQVVHGK EGSIFLVTGQ QVDYSIIASQ AGATLNNLMS 383 36pro NCWSELLILD HIYRQVVHGK EGSIFLVTGQ QVDYSIIASQ AGATLNNLMS 211 hFTFpro NCWSELLILD HIYRQVVHGK EGSIFLVTGQ QVDYSIIASQ AGATLNNLMS 388 mLRHpro NCWSELLILD HIYRQVAHGK EGTIFLVTGE HVDYSTIISH TEVAFNNLLS 448 113PRO HAQELVAKLR SLQFDQREFV CLKFLVLFSL DVKNLENFQL VEGVQEQVNA 433 36pro HAQELVAKLR SLQFDQREFV CLKFLVLFSL DVKNLENFQL VEGVQEQVNA 261 hFTFpro HAQELVAKLR SLQFDQREFV CLKFLVLFSL DVKNLENFQL VEGVQEQVNA 438 mLRHpro LAQELVVRLR SLQFDQREFV CLKFLVLFSS DVKNLENLQL VEGVQEQVNA 498 113PRO ALLDYTMCNY PQQTEKFGQL LLRLPEIRAI SMQAEEYLYY KHLNGDVPYN 483 36pro ALLDYTMCNY PQQTEKFRQL LLRLPEIRAI SMQAEEYLYY KHLNGDVPYN 311 hFTFpro ALLDYTMCNY PQQTEKFGQL LLRLPEIRAI SMQAEEYLYY KHLNGDVPYN 488 mLRHpro ALLDYTVCNY PQQTEKFGQL LLRLPEIRAI SKQA EDYLYY KHVNGDVPYN 548 113PRO NLLIEMLHAK RA 495 36pro NLLIEMLHAK RA 323 hFTFpro NLLIEMLHAK RA 500 mLRHpro NLLIEMLHAK RA 560

【0051】[0051]

【表7】 105pro = 配列番号6 hFTFpro = 配列番号3 mLRHpro = 配列番号10 105pro MSSNSDTGDL QESLKHGLTP IG-------- ---------- ---AGLPDRH 29 hFTFpro ---------- ---------- ---------- ---------- ---------- mLRHpro MSASLDTGDF QEFLKHGLTA IASAPGSETR HSPKREEQLR EKRAGLPDRH 50 105pro GSPIPARGRL VMLPKVETEA LGLARSHGEQ GQMPENMQVS QFKMVNYSYD 79 hFTFpro ---------- -MLPKVETEA LGLARSHGEQ GQMPENMQVS QFKMVNYSYD 39 mLRHpro RRPIPARSRL VMLPKVETEA PGLVRSHGEQ GQMPENMQVS QFKMVNYSYD 100 105pro EDLEELCPVC GDKVSGYHYG LLTCESCKGF FKRTVQNNKR YTCIENQNCQ 129 hFTFpro EDLEELCPVC GDKVSGYHYG LLTCESCKGF FKRTVQNNKR YTCIENQNCQ 89 mLRHpro EDLEELCPVC GDKVSGYHYG LLTCESCKGF FKRTVQNQKR YTCIENQNCQ 150 105pro IDKTQRKRCP YCRFQKCLSV GMKLEAVRAD RMRGGRNKFG PMYKRDRALK 179 hFTFpro IDKTQRKRCP YCRFQKCLSV GMKLEAVRAD RMRGGRNKFG PMYKRDRALK 139 mLRHpro IDKTQRKRCP YCRFKKCIDV GMKLEAVRAD RMRGGRNKFG PMYKRDRALK 200 105pro QQKKALIRAN GLKLEAMSQV IQAMPSDLTI SSAIQNIHSA SKGLPLNHAA 229 hFTFpro QQKKALIRAN GLKLEAMSQV IQAMPSDLTI SSAIQNIHSA SKGLPLNHAA 189 mLRHpro QQKKALIRAN GLKLEAMSQV IQAMPSDLT- -SAIQNIHSA SKGLPLSHVA 248 105pro LPPTDYDRSP FVTSPISMTM PPHGSLQGYQ TYGHFPSRAI KSEYPDPYTS 279 hFTFpro LPPTDYDRSP FVTSPISMTM -LHGSLQGYQ TYGHFPSRAI KSEYPDPYTS 238 mLRHpro LPPTDYDRSP FVTSPISMTM PPHSSLHGYQ PYGHFPSRAI KSEYPDPYSS 298 105pro SPESIMGYSY MDSYQTSSPA SIPHLILELL KCEPDEPQVQ AKIMAYLQQE 329 hFTFpro SPESIMGYSY MDSYQTSSPA SIPHLILELL KCEPDEPQVQ AKIMAYLQQE 288 mLRHpro SPESMMGYSY MDGYQTNSPA SIPHLILELL KCEPDEPQVQ AKIMAYLQQE 348 105pro QANRSKHEKL STFGLMCKMA DQTLFSIVEW ARSSIFFREL KVDDQMKLLQ 379 hFTFpro QANRSKHEKL STFGLMCKMA DQTVFSIVEW ARSSIFFREL KVDDQMKLLQ 338 mLRHpro QSNRNRQEKL SAFGLLCKMA DQTLFSIVEW ARSSIFFREL KVDDQMKLLQ 398 105pro NCWSELLILD HIYRQVVHGK EGSIFLVTGQ QVDYSIIASQ AGATLNNLMS 429 hFTFpro NCWSELLILD HIYRQVVHGK EGSIFLVTGQ QVDYSIIASQ AGATLNNLMS 388 mLRHpro NCWSELLILD HIYRQVAHGK EGTIFLVTGE HVDYSTIISH TEVAFNNLLS 448 105pro HAQELVAKLR SLQFDQREFV CLKFLVLFSL DVKNLENFQL VEGVQEQVNA 479 hFTFpro HAQELVAKLR SLQFDQREFV CLKFLVLFSL DVKNLENFQL VEGVQEQVNA 438 mLRHpro LAQELVVRLR SLQFDQREFV CLKFLVLFSS DVKNLENLQL VEGVQEQVNA 498 105pro ALLDYTMCNY PQQTEKFGQL LLRLPEIRAI SMQAEEYLYY KHLNGDVPYN 529 hFTFpro ALLDYTMCNY PQQTEKFGQL LLRLPEIRAI SMQAEEYLYY KHLNGDVPYN 488 mLRHpro ALLDYTVCNY PQQTEKFGQL LLRLPEIRAI SKQAEDYLYY KHVNGDVPYN 548 105pro NLLIEMLHAK RA 541 hFTFpro NLLIEMLHAK RA 500 mLRHpro NLLIEMLHAK RA 560[Table 7] 105pro = SEQ ID NO: 6 hFTFpro = SEQ ID NO: 3 mLRHpro = SEQ ID NO: 10 105pro MSSNSDTGDL QESLKHGLTP IG ------------------ --- AGLPDRH 29 hFTFpro ------- ---------- ---------- ---------- ---------- mLRHpro MSASLDTGDF QEFLKHGLTA IASAPGSETR HSPKREEQLR EKRAGLPDRH 50 105pro GSPIPARGRL VMLPKVETEA LGLARSHGEQ GQMPENMQVS QFKMVNYSYD 79 hFTFpro ---------- -MLPKVETEA LGLARSHGEQ GQMPENMQVS QFKMVNYSYD 39 mLRHpro RRPIPARSRL VMLPKVETEA PGLVRSHGEQ GQMPENMQVS QFKMVNYSYD 100 105pro EDLEELCPVC GDKVSGYHYG LLTCESCKGF FKRTVQNNKR YTCIENQNCQ 129 hFTFpro EDLEELCPVC GDKVSGYHYG LLTCESCKGF FKRTVQNNKR YTCIENQNCQ 89 mLRHpro EDLEELCPVC GDKVSGYHYG LLTCESCKGF FKRTVQNQKR YTCIENQNCQ 150 105pro IDKTQRKRCP YCRFQKCLSV GMKLEAVRAD RMRGGRNKFG PMYKRDRALK 179 hFTFpro IDKTQRKRCP YCRFQKCLSV GMKLEAVRAD RMRGGRNKFG PMYKRDRALK 139 mLRHpro IDKTQRKRCP YCRFKKCIDV GMKLEAVRAD RMRGGRNKFG PMYKRDRALK 200 105pro QQKKALIRAN GLKLEAMSQV IQAMPSDLTI SSAIQNIHSA SKGLPLNHAA 229 hFTFpro QQKKALIRAN GLKLEAMSQV IQAMPS DLTI SSAIQNIHSA SKGLPLNHAA 189 mLRHpro QQKKALIRAN GLKLEAMSQV IQAMPSDLT- -SAIQNIHSA SKGLPLSHVA 248 105pro LPPTDYDRSP FVTSPISMTM PPHGSLQGYQ TYGHFPSRAI KSEYPDPYTS 279 hFTFpro LPPTDYDRSP FVTSPISMTM -LHGSLQGYQ TYGHFPSRAI KSEYPDPYTS 238 mLRHpro LPPTDYDRSP FVTSPISMTM PPHSSLHGYQ PYGHFPSRAI KSEYPDPYSS 298 105pro SPESIMGYSY MDSYQTSSPA SIPHLILELL KCEPDEPQVQ AKIMAYLQQE 329 hFTFpro SPESIMGYSY MDSYQTSSPA SIPHLILELL KCEPDEPQVQ AKIMAYLQQE 288 mLRHpro SPESMMGYSY MDGYQTNSPA SIPHLILELL KCEPDEPQVQ AKIMAYLQQE 348 105pro QANRSKHEKL STFGLMCKMA DQTLFSIVEW ARSSIFFREL KVDDQMKLLQ 379 hFTFpro QANRSKHEKL STFGLMCKMA DQTVFSIVEW ARSSIFFREL KVDDQMKLLQ 338 mLRHpro QSNRNRQEKL SAFGLLCKMA DQTLFSIVEW ARSSIFFREL KVDDQMKLLQ 398 105pro NCWSELLILD HIYRQVVHGK EGSIFLVTGQ QVDYSIIASQ AGATLNNLMS 429 hFTFpro NCWSELLILD HIYRQVVHGK EGSIFLVTGQ QVDYSIIASQ AGATLNNLMS 388 mLRHpro NCWSELLILD HIYRQVAHGK EGTIFLVTGE HVDYSTIISH TEVAFNNLLS 448 105pro HAQELVAKLR SLQFDQREFV CLKFLVLFSL DVKNLENFQL VEGVQEQVNA 479 hFTFpro HAQELVAKLR SLQFDQREFV CLKFLV LFSL DVKNLENFQL VEGVQEQVNA 438 mLRHpro LAQELVVRLR SLQFDQREFV CLKFLVLFSS DVKNLENLQL VEGVQEQVNA 498 105pro ALLDYTMCNY PQQTEKFGQL LLRLPEIRAI SMQAEEYLYY KHLNGDVPYN 529 hFTFpro ALLDYTMCNY PQQTEKFGQL LLRLPEIRAI SMQAEEYLYY KHLNGDVPYN 488 mLRHpro ALLDYTVCNY PQQTEKFGQL LLRLPEIRAI SKQAEDYLYY KHVNGDVPYN 548 105pro NLLIEMLHAK RA 541 hFTFpro NLLIEMLHAK RA 500 mLRHpro NLLIEMLHAK RA 560

【0052】[0052]

【表8】 113 = 配列番号1 hFTF = 配列番号7 113 ---------- ---------- ---------- --------GA AAAAAGTACA 12 hFTF GAAACTGGAT ACATGGTTTA CAGCAGGTCA CTAATGTTGG AAAAAGTACA 50 113 GAGTCCAGGG AAAAGACTTG CTTGTAACTT TATGAATTCT GGATTTTTTT 62 hFTF GAGTCCAGGG AAA-GACTTG CTTGTAACTT TATGAATTCT GGA---TTTT 96 113 TTTTCCTTTG CTTTTTCTTA ACTTTCACTA AGGGTTACTG TAGTCTGATG 112 hFTF TTTTCCTTTG CTTTTTCTTA ACTTTCACTA AGGGTTACTG TAGTCTGATG 146 113 TGTCCTTCCC AAGGCCACGA AATTTGACAA GCTGCACTTT TCTTTTGCTC 162 hFTF TGTCCTTCCC AAGGCCACGA AATTTGACAA GCTGCACTTT TCTTTTGCTC 196 113 AATGATTTCT GCTTTAAGCC AAAGAACTGC CTATAATTTC ACTAAGAATG 212 hFTF AATGATTTCT GCTTTAAGCC AAAGAACTGC CTATAATTTC ACTAAGAATG 246 113 TCTTCTAATT CAGATACTGG GGATTTACAA GAGTCTTTAA AGCACGGACT 262 hFTF TCTTCTAATT CAGATACTGG GGATTTACAA GAGTCTTTAA AGCACGGACT 296 113 TACACCTATT ---------- ---------- ---------- ---------- 272 hFTF TACACCTATT GGTGCTGGGC TTCCGGACCG ACACGGATCC CCCATCCCGC 346 113 ---------- ---------- ---------- ---------- ---------- 272 hFTF CCGCGGTCGC CTTGTCATGC TGCCCAAAGT GGAGACGGAA GCCCTGGGAC 396 113 ---------- ---------- ---------- ---------- -------GTG 275 hFTF TGGCTCGATC GCATGGGGAA CAGGGCCAGA TGCCGGAAAA CATGCAAGTG 446 113 TCTCAATTTA AAATGGTGAA TTACTCCTAT GATGAAGATC TGGAAGAGCT 325 hFTF TCTCAATTTA AAATGGTGAA TTACTCCTAT GATGAAGATC TGGAAGAGCT 496 113 TTGTCCCGTG TGTGGAGATA AAGTGTCTGG GTACCATTAT GGGCTCCTCA 375 hFTF TTGTCCCGTG TGTGGAGATA AAGTGTCTGG GTACCATTAT GGGCTCCTCA 546 113 CCTGTGAAAG CTGCAAGGGA TTTTTTAAGC GAACAGTCCA AAATAATAAA 425 hFTF CCTGTGAAAG CTGCAAGGGA TTTTTTAAGC GAACAGTCCA AAATAATAAA 596 113 AGGTACACAT GTATAGAAAA CCAGAACTGC CAAATTGACA AAACACAGAG 475 hFTF AGGTACACAT GTATAGAAAA CCAGAACTGC CAAATTGACA AAACACAGAG 646 113 AAAGCGTTGT CCTTACTGTC GTTTTCAAAA ATGTCTAAGT GTTGGAATGA 525 hFTF AAAGCGTTGT CCTTACTGTC GTTTTCAAAA ATGTCTAAGT GTTGGAATGA 696 113 AGCTAGAAGC TGTAAGGGCC GACCGAATGC GTGGAGGAAG GAATAAGTTT 575 hFTF AGCTAGAAGC TGTAAGGGCC GACCGAATGC GTGGAGGAAG GAATAAGTTT 746 113 GGGCCAATGT ACAAGAGAGA CAGGGCCCTG AAGCAACAGA AAAAAGCCCT 625 hFTF GGGCCAATGT ACAAGAGAGA CAGGGCCCTG AAGCAACAGA AAAAAGCCCT 796 113 CATCCGAGCC AATGGACTTA AGCTAGAAGC CATGTCTCAG GTGATCCAAG 675 hFTF CATCCGAGCC AATGGACTTA AGCTAGAAGC CATGTCTCAG GTGATCCAAG 846 113 CTATGCCCTC TGACCTGACC ATTTCCTCTG CAATTCAAAA CATCCACTCT 725 hFTF CTATGCCCTC TGACCTGACC ATTTCCTCTG CAATTCAAAA CATCCACTCT 896 113 GCCTCCAAAG GCCTACCTCT GAACCATGCT GCCTTGCCTC CTACAGACTA 775 hFTF GCCTCCAAAG GCCTACCTCT GAACCATGCT GCCTTGCCTC CTACAGACTA 946 113 TGACAGAAGT CCCTTTGTAA CATCCCCCAT TAGCATGACA ATGCCCCCTC 825 hFTF TGACAGAAGT CCCTTTGTAA CATCCCCCAT TAGCATGACA ATGC---TGC 993 113 ACGGCAGCCT GCAAGGTTAC CAAACATATG GCCACTTTCC TAGCCGGGCC 875 hFTF ACGGCAGCCT GCAAGGTTAC CAAACATATG GCCACTTTCC TAGCCGGGCC 1043 113 ATCAAGTCTG AGTACCCAGA CCCCTATACC AGCTCACCCG AGTCCATAAT 925 hFTF ATCAAGTCTG AGTACCCAGA CCCCTATACC AGCTCACCCG AGTCCATAAT 1093 113 GGGCTATTCA TATATGGATA GTTACCAGAC GAGCTCTCCA GCAAGCATCC 975 hFTF GGGCTATTCA TATATGGATA GTTACCAGAC GAGCTCTCCA GCAAGCATCC 1143 113 CACATCTGAT ACTGGAACTT TTGAAGTGTG AGCCAGATGA GCCTCAAGTC 1025 hFTF CACATCTGAT ACTGGAACTT TTGAAGTGTG AGCCAGATGA GCCTCAAGTC 1193 113 CAGGCTAAAA TCATGGCCTA TTTGCAGCAA GAGCAGGCTA ACCGAAGCAA 1075 hFTF CAGGCTAAAA TCATGGCCTA TTTGCAGCAA GAGCAGGCTA ACCGAAGCAA 1243 113 GCACGAAAAG CTGAGCACCT TTGGGCTTAT GTGCAAAATG GCAGATCAAA 1125 hFTF GCACGAAAAG CTGAGCACCT TTGGGCTTAT GTGCAAAATG GCAGATCAAA 1293 113 CTCTCTTCTC CATTGTCGAG TGGGCCAGGA GTAGTATCTT CTTCAGAGAA 1175 hFTF CTGTCTTCTC CATTGTCGAG TGGGCCAGGA GTAGTATCTT CTTCAGAGAA 1343 113 CTTAAGGTTG ATGACCAAAT GAAGCTGCTT CAGAACTGCT GGAGTGAGCT 1225 hFTF CTTAAGGTTG ATGACCAAAT GAAGCTGCTT CAGAACTGCT GGAGTGAGCT 1393 113 CTTAATCCTC GACCACATTT ACCGACAAGT GGTACATGGA AAGGAAGGAT 1275 hFTF CTTAATCCTC GACCACATTT ACCGACAAGT GGTACATGGA AAGGAAGGAT 1443 113 CCATCTTCCT GGTTACTGGG CAACAAGTGG ACTATTCCAT AATAGCATCA 1325 hFTF CCATCTTCCT GGTTACTGGG CAACAAGTGG ACTATTCCAT AATAGCATCA 1493 113 CAAGCCGGAG CCACCCTCAA CAACCTCATG AGTCATGCAC AGGAGTTAGT 1375 hFTF CAAGCCGGAG CCACCCTCAA CAACCTCATG AGTCATGCAC AGGAGTTAGT 1543 113 GGCAAAACTT CGTTCTCTCC AGTTTGATCA ACGAGAGTTC GTATGTCTGA 1425 hFTF GGCAAAACTT CGTTCTCTCC AGTTTGATCA ACGAGAGTTC GTATGTCTGA 1593 113 AATTCTTGGT GCTCTTTAGT TTAGATGTCA AAAACCTTGA AAACTTCCAG 1475 hFTF AATTCTTGGT GCTCTTTAGT TTAGATGTCA AAAACCTTGA AAACTTCCAG 1643 113 CTGGTAGAAG GTGTCCAGGA ACAAGTCAAT GCCGCCCTGC TGGACTACAC 1525 hFTF CTGGTAGAAG GTGTCCAGGA ACAAGTCAAT GCCGCCCTGC TGGACTACAC 1693 113 AATGTGTAAC TACCCGCAGC AGACAGAGAA ATTTGGACAG CTACTTCTTC 1575 hFTF AATGTGTAAC TACCCGCAGC AGACAGAGAA ATTTGGACAG CTACTTCTTC 1743 113 GACTACCCGA AATCCGGGCC ATCAGTATGC AGGCTGAAGA ATACCTCTAC 1625 hFTF GACTACCCGA AATCCGGGCC ATCAGTATGC AGGCTGAAGA ATACCTCTAC 1793 113 TACAAGCACC TGAACGGGGA TGTGCCCTAT AATAACCTTC TCATTGAAAT 1675 hFTF TACAAGCACC TGAATGGGGA TGTGCCCTAT AATAACCTTC TCATTGAAAT 1843 113 GTTGCATGCC AAAAGAGCAT AAGTTACAAC CCCTAGGAGC TCTGCTTTCA 1725 hFTF GTTGCATGCC AAAAGAGCAT AAGTTACAAC CCCTAGGAGC TCTGCTTTCA 1893 113 AAACAAAAAG AGATTGGGGG AGTGGGGAGG GGGAAGAAGA ACAGGAAGAA 1775 hFTF AAACAAAAAG AGATTGGGGG AGTGGGGAGG GGGAAGAAGA ACAGGAAGAA 1943 113 AAAAAGTACT CTGAACTGCT CCAAGCAACG CTAATTAAAA ACTTGCTTTA 1825 hFTF AAAAAGTACT CTGAACTGCT CCAAGTAACG CTAATTAAAA ACTTGCTTTA 1993 113 AAGATATTGA ATTTAAAAAG GCATAATAAT CAAATACTTA ATAGCAAATA 1875 hFTF AAGATATTGA ATTTAAAAAG GCATAATAAT CAAATACT-A ATAGCAAATA 2042 113 AATGATGTAT CAGGGTATTT GTATTGCAAA CTGTGAATCA AAGGCTTCAC 1925 hFTF AATGATGTAT CAGGGTATTT GTATTGCAAA CTGTGAATCA AA-GCTTCAC 2091 113 AGCCCCAGAG GATTCCATAT AAAAGACATT GTAATGGAGT GGATTGAACT 1975 hFTF AGCCCCAGAG GATTCCATAT AAAAGACATT GTAATGGAGT GGATTGAACT 2141 113 CACAGATGGA TACCAACACG GTCAGAAGAA AAACGGACAG AACGGTTCTT 2025 hFTF CACAGATGGA TACCAACACG GTCAGAAGAA AAACGGACAG AACGGTTCTT 2191 113 GTATATTTAA ACTGATCTCC ACTATGAAGA AATTTAGGAA CTAATCTTAT 2075 hFTF GTATATTTAA ACTGATCTCC ACTATGAAGA AATTTAGGAA CTAATCTTAT 2241 113 TAATTAGGCT TATACAGCGG GGGATTTGAG CTTACAGGAT TCCTCCATGG 2125 hFTF TAATTAGGCT TATACAGCGG GG-ATTTGAG CTTACAGGAT TCCTCCATGG 2290 113 TAAAGCTGAA CTGAAACAAT TCTCAAGAAT GCATCAGCTG TACCTACAAT 2175 hFTF TAAAGCTGAA CTGAAACAAT TCTCAAGAAT GCATCAGCTG ---------- 2330 113 AGCCCCTCCC TCTTCCTTTG AAGGCCCGAG CACCTCTGCC CTGTGGTCAC 2225 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 CGAATCTGTA CTAAGGACCT GTGTTCAGCC ACACCCAGTG GTAGCTCCAC 2275 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 CAAATCATGA ACAGCCTAAT TTTGAGTGTC TGTGTCTTAG ACCTGCAAAC 2325 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 AGCTAATAGG AAATTCTATT AATATGTTAG CTTGCCATTT TAAATATGTT 2375 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 CTGAGGGTTG TTTTGTCTCG TGTTCATGAT GTTAAGAAAA TGCAGGCAGT 2425 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 ATCCCTCATC TTATGTAAGT GTGAATTAAT ATTAAGGGAA ATGACTACAA 2475 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 ACTTTCAAAG CAAATGCTCC ATAGCTAAAG CAACTTAGAC CTTATTTCTG 2525 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 CTACTGTTGC TGAAATGTGG CTTTGGCATT GTTGGATTTC ATAAAAAATT 2575 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 TCTGGCAGGA AGTCTTGTTA GTATACATCA GTCTTTTTCA TCATCCAAGT 2625 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 TTGTAGTTCA TTTAAAAATA CAACATTAAA CACATTTTGC TAGGATGTCA 2675 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 AATAGTCACA GTTCTAAGTA GTTGGAAACA AAATTGACGC ATGTTAATCT 2725 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 ATGCAAAGAG AAAGGAAAGG ATGAGGTGAT GTATTGACTC AAGGTTCATT 2775 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 CTTGCTGCAA TTGAACATCC TCAAGAGTTG GGATGGAAAT GGTGATTTTT 2825 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 ACATGTGTCC TGGAAAGATA TTAAAGTAAT TCAAATCTTC CCCAAAGGGG 2875 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 AAAGGAAGAG AGTGATACTG ACCTTTTTAA GTCATAGACC AAAGTCTGCT 2925 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 GTAGAACAAA TATGGGAGGA CAAAGAATCG CAAATTCTTC AAATGACTAT 2975 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 TATCAGTATT ATTAACATGC GATGCCACAG GTATGAAAGT CTTGCCTTAT 3025 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 TTCACAATTT TAAAAGGTAG CTGTGCAGAT GTGGATCAAC ATTTGTTTAA 3075 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 AATAAAGTAT TAATACTTTA AAGTCAAAAA AAAAAAAAAA 3115 hFTF ---------- ---------- ---------- ---------- 2330[Table 8] 113 = SEQ ID NO: 1 hFTF = SEQ ID NO: 7 113 ---------- ---------- ---------- ----- --- GA AAAAAGTACA 12 hFTF GAAACTGGAT ACATGGTTTA CAGCAGGTCA CTAATGTTGG AAAAAGTACA 50 113 GAGTCCAGGG AAAAGACTTG CTTGTAACTT TATGAATTCT GGATTTTTTT 62 hFTF GAGTCCAGGG AAA-GACTTG CTTGTAACTT TATGAATTCT GGA --- TTTT 96 113 TTTTCCTTTG CTTTTTCTTA ACTTTCACTA AGGGTTACTG TAGTCTGATG 112 hFTF TTTTCCTTTG CTTTTTCTTA ACTTTCACTA AGGGTTACTG TAGTCTGATG 146 113 TGTCCTTCCC AAGGCCACGA AATTTGACAA GCTGCACTTT TCTTTTGCTC 162 hFTF TGTCCTTCCC AAGGCCACGA AATTTGACAA GCTGCACTTT TCTTTTGCTC 196 113 AATGATTTCT GCTTTAAGCC AAAGAACTGC CTATAATTTC ACTAAGAATG 212 hFTF AATGATTTCT GCTTTAAGCC AAAGAACTGC CTATAATTTC ACTAAGAATG 246 113 TCTTCTAATT CAGATACTGG GGATTTACAA GAGTCTTTAA AGCACGGACT 262 hFTF TCTTCTAATT CAGATACTGG GGATTTACAA GAGTCTTTAA AGCACGGACT 296 113 TACACCTATT --------- ----------- ---------- ---------- 272 hFTF TACACCTATT GGTGCTGGGC TTCCGGACCG ACACGGATCC CCCATCCCGC 346 113 ---------- ---------- ---------- ---------- --------- -272 hFTF CCGCGGTCGC CTTGTCATGC TGCCCAAAGT GGAGACGGAA GCCCTGGGAC 396 113 ---------- ---------- ---------- ---------- ------- GTG 275 hFTF TGGCTCGATC GCATGGGGAA CAGGGCCAGA TGCCGGAAAA CATGCAAGTG 446 113 TCTCAATTTA AAATGGTGAA TTACTCCTAT GATGAAGATC TGGAAGAGCT 325 hFTF TCTCAATTTA AAATGGTGAA TTACTCCTAT GATGAAGATC TGGAAGAGCT 496 113 TTGTCCCGTG TGTGGAGATA AAGTGTCTGG GTACCATTAT GGGCTCCTCA 375 hFTF TTGTCCCGTG TGTGGAGATA AAGTGTCTGG GTACCATTAT GGGCTCCTCA 546 113 CCTGTGAAAG CTGCAAGGGA TTTTTTAAGC GAACAGTCCA AAATAATAAA 425 hFTF CCTGTGAAAG CTGCAAGGGA TTTTTTAAGC GAACAGTCCA AAATAATAAA 596 113 AGGTACACAT GTATAGAAAA CCAGAACTGC CAAATTGACA AAACACAGAG 475 hFTF AGGTACACAT GTATAGAAAA CCAGAACTGC CAAATTGACA AAACACAGAG 646 113 AAAGCGTTGT CCTTACTGTC GTTTTCAAAA ATGTCTAAGT GTTGGAATGA 525 hFTF AAAGCGTTGT CCTTACTGTC GTTTTCAAAA ATGTCTAAGT GTTGGAATGA 696 113 AGCTAGAAGC TGTAAGGGCC GACCGAATGC GTGGAGGAAG GAATAAGTTT 575 hFTF AGCTAGAAGC TGTAAGGGCC GACCGAATGC GTGGAGGAAG GAATAAGTTT 746 113 GGGCCAATGT ACAAGAGAGA CAGGGCCCTG AAGC AACAGA AAAAAGCCCT 625 hFTF GGGCCAATGT ACAAGAGAGA CAGGGCCCTG AAGCAACAGA AAAAAGCCCT 796 113 CATCCGAGCC AATGGACTTA AGCTAGAAGC CATGTCTCAG GTGATCCAAG 675 hFTF CATCCGAGCC AATGGACTTA AGCTAGAAGC CATGTCTCAG GTGATCCAAG 846 113 CTATGCCCTC TGACCTGACC ATTTCCTCTG CAATTCAAAA CATCCACTCT 725 hFTF CTATGCCCTC TGACCTGACC ATTTCCTCTG CAATTCAAAA CATCCACTCT 896 113 GCCTCCAAAG GCCTACCTCT GAACCATGCT GCCTTGCCTC CTACAGACTA 775 hFTF GCCTCCAAAG GCCTACCTCT GAACCATGCT GCCTTGCCTC CTACAGACTA 946 113 TGACAGAAGT CCCTTTGTAA CATCCCCCAT TAGCATGACA ATGCCCCCTC 825 hFTF TGACAGAAGT CCCTTTGTAA CATCCCCCAT TAGCATGACA ATGC --- TGC 993 113 ACGGCAGCCT GCAAGGTTAC CAAACATATG GCCACTTTCC TAGCCGGGCC 875 hFTF ACGGCAGCCT GCAAGGTTAC CAAACATATG GCCACTTTCC TAGCCGGGCC 1043 113 ATCAAGTCTG AGTACCCAGA CCCCTATACC AGCTCACCCG AGTCCATAAT 925 hFTF ATCAAGTCTG AGTACCCAGA CCCCTATACC AGCTCACCCG AGTCCATAAT 1093 113 GGGCTATTCA TATATGGATA GTTACCAGAC GAGCTCTCCA GCAAGCATCC 975 hFTF GGGCTATTCA TATATGGATA GTTACCAGAC GAGCTCTCCA GCAAGCATCC 1143 113 CACATCTGAT ACTGGAA CTT TTGAAGTGTG AGCCAGATGA GCCTCAAGTC 1025 hFTF CACATCTGAT ACTGGAACTT TTGAAGTGTG AGCCAGATGA GCCTCAAGTC 1193 113 CAGGCTAAAA TCATGGCCTA TTTGCAGCAA GAGCAGGCTA ACCGAAGCAA 1075 hFTF CAGGCTAAAA TCATGGCCTA TTTGCAGCAA GAGCAGGCTA ACCGAAGCAA 1243 113 GCACGAAAAG CTGAGCACCT TTGGGCTTAT GTGCAAAATG GCAGATCAAA 1125 hFTF GCACGAAAAG CTGAGCACCT TTGGGCTTAT GTGCAAAATG GCAGATCAAA 1293 113 CTCTCTTCTC CATTGTCGAG TGGGCCAGGA GTAGTATCTT CTTCAGAGAA 1175 hFTF CTGTCTTCTC CATTGTCGAG TGGGCCAGGA GTAGTATCTT CTTCAGAGAA 1343 113 CTTAAGGTTG ATGACCAAAT GAAGCTGCTT CAGAACTGCT GGAGTGAGCT 1225 hFTF CTTAAGGTTG ATGACCAAAT GAAGCTGCTT CAGAACTGCT GGAGTGAGCT 1393 113 CTTAATCCTC GACCACATTT ACCGACAAGT GGTACATGGA AAGGAAGGAT 1275 hFTF CTTAATCCTC GACCACATTT ACCGACAAGT GGTACATGGA AAGGAAGGAT 1443 113 CCATCTTCCT GGTTACTGGG CAACAAGTGG ACTATTCCAT AATAGCATCA 1325 hFTF CCATCTTCCT GGTTACTGGG CAACAAGTGG ACTATTCCAT AATAGCATCA 1493 113 CAAGCCGGAG CCACCCTCAA CAACCTCATG AGTCATGCAC AGGAGTTAGT 1375 hFTF CAAGCCGGAG CCACCCTCAA CAACCTCATG AGTCATGCAC AGGAGT TAGT 1543 113 GGCAAAACTT CGTTCTCTCC AGTTTGATCA ACGAGAGTTC GTATGTCTGA 1425 hFTF GGCAAAACTT CGTTCTCTCC AGTTTGATCA ACGAGAGTTC GTATGTCTGA 1593 113 AATTCTTGGT GCTCTTTAGT TTAGATGTCA AAAACCTTGA AAACTTCCAG 1475 hFTF AATTCTTGGT GCTCTTTAGT TTAGATGTCA AAAACCTTGA AAACTTCCAG 1643 113 CTGGTAGAAG GTGTCCAGGA ACAAGTCAAT GCCGCCCTGC TGGACTACAC 1525 hFTF CTGGTAGAAG GTGTCCAGGA ACAAGTCAAT GCCGCCCTGC TGGACTACAC 1693 113 AATGTGTAAC TACCCGCAGC AGACAGAGAA ATTTGGACAG CTACTTCTTC 1575 hFTF AATGTGTAAC TACCCGCAGC AGACAGAGAA ATTTGGACAG CTACTTCTTC 1743 113 GACTACCCGA AATCCGGGCC ATCAGTATGC AGGCTGAAGA ATACCTCTAC 1625 hFTF GACTACCCGA AATCCGGGCC ATCAGTATGC AGGCTGAAGA ATACCTCTAC 1793 113 TACAAGCACC TGAACGGGGA TGTGCCCTAT AATAACCTTC TCATTGAAAT 1675 hFTF TACAAGCACC TGAATGGGGA TGTGCCCTAT AATAACCTTC TCATTGAAAT 1843 113 GTTGCATGCC AAAAGAGCAT AAGTTACAAC CCCTAGGAGC TCTGCTTTCA 1725 hFTF GTTGCATGCC AAAAGAGCAT AAGTTACAAC CCCTAGGAGC TCTGCTTTCA 1893 113 AAACAAAAAG AGATTGGGGG AGTGGGGAGG GGGAAGAAGA ACAGGAAGAA 1775 hFTF AAACAAAAAG AGATTGG GGG AGTGGGGAGG GGGAAGAAGA ACAGGAAGAA 1943 113 AAAAAGTACT CTGAACTGCT CCAAGCAACG CTAATTAAAA ACTTGCTTTA 1825 hFTF AAAAAGTACT CTGAACTGCT CCAAGTAACG CTAATTAAAA ACTTGCTTTA 1993 113 AAGATATTGA ATTTAAAAAG GCATAATAAT CAAATACTTA ATAGCAAATA 1875 hFTF AAGATATTGA ATTTAAAAAG GCATAATAAT CAAATACT-A ATAGCAAATA 2042 113 AATGATGTAT CAGGGTATTT GTATTGCAAA CTGTGAATCA AAGGCTTCAC 1925 hFTF AATGATGTAT CAGGGTATTT GTATTGCAAA CTGTGAATCA AA-GCTTCAC 2091 113 AGCCCCAGAG GATTCCATAT AAAAGACATT GTAATGGAGT GGATTGAACT 1975 hFTF AGCCCCAGAG GATTCCATAT AAAAGACATT GTAATGGAGT GGATTGAACT 2141 113 CACAGATGGA TACCAACACG GTCAGAAGAA AAACGGACAG AACGGTTCTT 2025 hFTF CACAGATGGA TACCAACACG GTCAGAAGAA AAACGGACAG AACGGTTCTT 2191 113 GTATATTTAA ACTGATCTCC ACTATGAAGA AATTTAGGAA CTAATCTTAT 2075 hFTF GTATATTTAA ACTGATCTCC ACTATGAAGA AATTTAGGAA CTAATCTTAT 2241 113 TAATTAGGCT TATACAGCGG GGGATTTGAG CTTACAGGAT TCCTCCATGG 2125 hFTF TAATTAGGCT TATACAGCGG GG-ATTTGAG CTTACAGGAT TCCTCCATGG 2290 113 TAAAGCTGAA CTGAAACAAT TCTCAAGAAT GCATCAGCTG TACCTAC AAT 2175 hFTF TAAAGCTGAA CTGAAACAAT TCTCAAGAAT GCATCAGCTG ---------- 2330 113 AGCCCCTCCC TCTTCCTTTG AAGGCCCGAG CACCTCTGCC CTGTGGTCAC 2225 hFTF ---------- ---------- ---- ------ ---------- ---------- 2330 113 CGAATCTGTA CTAAGGACCT GTGTTCAGCC ACACCCAGTG GTAGCTCCAC 2275 hFTF ---------- ----- ----- ---------- ---------- ---------- 2330 113 CAAATCATGA ACAGCCTAAT TTTGAGTGTC TGTGTCTTAG ACCTGCAAAC 2325 hFTF ------ ---- ---------- ---------- ---------- ---------- 2330 113 AGCTAATAGG AAATTCTATT AATATGTTAG CTTGCCATTT TAAATATGTT 2375 hFTF ---------- ---------- ---------- ---------- ------- --- 2330 113 CTGAGGGTTG TTTTGTCTCG TGTTCATGAT GTTAAGAAAA TGCAGGCAGT 2425 hFTF ---------- ---------- ---------- -------- ----------- 2330 113 ATCCCTCATC TTATGTAAGT GTGAATTAAT ATTAAGGGAA ATGACTACAA 2475 hFTF ---------- ---------- --------- ----------- ---------- 2330 113 ACTTTCAAAG CAAATGCTCC ATAGCTAAAG CAACTTAGAC CTTATTTCTG 2525 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 CTACTGTTGC TGAAATGT GG CTTTGGCATT GTTGGATTTC ATAAAAAATT 2575 hFTF ---------- ---------- ---------- ---------- ---- ------ 2330 113 TCTGGCAGGA AGTCTTGTTA GTATACATCA GTCTTTTTCA TCATCCAAGT 2625 hFTF ---------- ---------- ---------- ----- ----- ---------- 2330 113 TTGTAGTTCA TTTAAAAATA CAACATTAAA CACATTTTGC TAGGATGTCA 2675 hFTF ---------- ---------- ------ ---- ---------- ---------- 2330 113 AATAGTCACA GTTCTAAGTA GTTGGAAACA AAATTGACGC ATGTTAATCT 2725 hFTF ---------- ------- --- ---------- ---------- ---------- 2330 113 ATGCAAAGAG AAAGGAAAGG ATGAGGTGAT GTATTGACTC AAGGTTCATT 2775 hFTF -------- ----------- ---------- ---------- ---------- 2330 113 CTTGCTGCAA TTGAACATCC TCAAGAGTTG GGATGGAAAT GGTGATTTTT 2825 hFTF ---------- ---------- ---------- ---------- --------- -2330 113 ACATGTGTCC TGGAAAGATA TTAAAGTAAT TCAAATCTTC CCCAAAGGGG 2875 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 AAAGGAAGAG AGTGATACTG ACCTTTTTAA GTCATAGACC AAAGTCTGCT 2925 hFTF ---------- ---------- ----------- --------- ------- --- 2330 113 GTAGAACAAA TATGGGAGGA CAAAGAATCG CAAATTCTTC AAATGACTAT 2975 hFTF ---------- ---------- ---------- -------- ----------- 2330 113 TATCAGTATT ATTAACATGC GATGCCACAG GTATGAAAGT CTTGCCTTAT 3025 hFTF ---------- ---------- --------- ----------- ---------- 2330 113 TTCACAATTT TAAAAGGTAG CTGTGCAGAT GTGGATCAAC ATTTGTTTAA 3075 hFTF ---------- ---------- ---------- ---------- ---------- 2330 113 AATAAAGTAT TAATACTTTA AAGTCAAAAA AAAAAAAAAA 3115 hFTF ----------- -------- ---------- ---------- 2330

【0053】[0053]

【配列表】 SEQUE
NCE LISTING <110> Sumitomo Pharmaceuticals Compa
ny Limited <110> Tularik Inc. <120> CYP7 Promoter−Binding Factors <130> 162467 <140> <141> <150> 60/067,708 <151> 1997−12−08 <160> 10 <170> PatentIn Ver. 2.0 <210> 1 <211> 3115 <212> DNA <213> Human <220> <221> CDS <222> (210)..(1694) <400> 1 gaaaaaagta cagagtccag ggaaaagact tgc
ttgtaac tttatgaatt ctggattttt 60 ttttttcctt tgctttttct taactttcac taa
gggttac tgtagtctga tgtgtccttc 120 ccaaggccac gaaatttgac aagctgcact ttt
cttttgc tcaatgattt ctgctttaag 180 ccaaagaact gcctataatt tcactaaga atg
tct tct aat tca gat act ggg 233 Met
Ser Ser Asn Ser Asp Thr Gly 1
5 gat tta caa gag tct tta aag cac gga
ctt aca cct att gtg tct caa 281 Asp Leu Gln Glu Ser Leu Lys His Gly
Leu Thr Pro Ile Val Ser Gln 10 15
20 ttt aaa atg gtg aat tac tcc tat gat
gaa gat ctg gaa gag ctt tgt 329 Phe Lys Met Val Asn Tyr Ser Tyr Asp
Glu Asp Leu Glu Glu Leu Cys 25 30
35 40 ccc gtg tgt gga gat aaa gtg tct ggg
tac cat tat ggg ctc ctc acc 377 Pro Val Cys Gly Asp Lys Val Ser Gly
Tyr His Tyr Gly Leu Leu Thr 45
50 55 tgt gaa agc tgc aag gga ttt ttt aag
cga aca gtc caa aat aat aaa 425 Cys Glu Ser Cys Lys Gly Phe Phe Lys
Arg Thr Val Gln Asn Asn Lys 60 65
70 agg tac aca tgt ata gaa aac cag aac
tgc caa att gac aaa aca cag 473 Arg Tyr Thr Cys Ile Glu Asn Gln Asn
Cys Gln Ile Asp Lys Thr Gln 75 80
85 aga aag cgt tgt cct tac tgt cgt ttt
caa aaa tgt cta agt gtt gga 521 Arg Lys Arg Cys Pro Tyr Cys Arg Phe
Gln Lys Cys Leu Ser Val Gly 90 95
100 atg aag cta gaa gct gta agg gcc gac
cga atg cgt gga gga agg aat 569 Met Lys Leu Glu Ala Val Arg Ala Asp
Arg Met Arg Gly Gly Arg Asn 105 110
115 120 aag ttt ggg cca atg tac aag aga gac
agg gcc ctg aag caa cag aaa 617 Lys Phe Gly Pro Met Tyr Lys Arg Asp
Arg Ala Leu Lys Gln Gln Lys 125
130 135 aaa gcc ctc atc cga gcc aat gga ctt
aag cta gaa gcc atg tct cag 665 Lys Ala Leu Ile Arg Ala Asn Gly Leu
Lys Leu Glu Ala Met Ser Gln 140 145
150 gtg atc caa gct atg ccc tct gac ctg
acc att tcc tct gca att caa 713 Val Ile Gln Ala Met Pro Ser Asp Leu
Thr Ile Ser Ser Ala Ile Gln 155 160
165 aac atc cac tct gcc tcc aaa ggc cta
cct ctg aac cat gct gcc ttg 761 Asn Ile His Ser Ala Ser Lys Gly Leu
Pro Leu Asn His Ala Ala Leu 170 175
180 cct cct aca gac tat gac aga agt ccc
ttt gta aca tcc ccc att agc 809 Pro Pro Thr Asp Tyr Asp Arg Ser Pro
Phe Val Thr Ser Pro Ile Ser 185 190
195 200 atg aca atg ccc cct cac ggc agc ctg
caa ggt tac caa aca tat ggc 857 Met Thr Met Pro Pro His Gly Ser Leu
Gln Gly Tyr Gln Thr Tyr Gly 205
210 215 cac ttt cct agc cgg gcc atc aag tct
gag tac cca gac ccc tat acc 905 His Phe Pro Ser Arg Ala Ile Lys Ser
Glu Tyr Pro Asp Pro Tyr Thr 220 225
230 agc tca ccc gag tcc ata atg ggc tat
tca tat atg gat agt tac cag 953 Ser Ser Pro Glu Ser Ile Met Gly Tyr
Ser Tyr Met Asp Ser Tyr Gln 235 240
245 acg agc tct cca gca agc atc cca cat
ctg ata ctg gaa ctt ttg aag 1001 Thr Ser Ser Pro Ala Ser Ile Pro His
Leu Ile Leu Glu Leu Leu Lys 250 255
260 tgt gag cca gat gag cct caa gtc cag
gct aaa atc atg gcc tat ttg 1049 Cys Glu Pro Asp Glu Pro Gln Val Gln
Ala Lys Ile Met Ala Tyr Leu 265 270
275 280 cag caa gag cag gct aac cga agc aag
cac gaa aag ctg agc acc ttt 1097 Gln Gln Glu Gln Ala Asn Arg Ser Lys
His Glu Lys Leu Ser Thr Phe 285
290 295 ggg ctt atg tgc aaa atg gca gat caa
act ctc ttc tcc att gtc gag 1145 Gly Leu Met Cys Lys Met Ala Asp Gln
Thr Leu Phe Ser Ile Val Glu 300 305
310 tgg gcc agg agt agt atc ttc ttc aga
gaa ctt aag gtt gat gac caa 1193 Trp Ala Arg Ser Ser Ile Phe Phe Arg
Glu Leu Lys Val Asp Asp Gln 315 320
325 atg aag ctg ctt cag aac tgc tgg agt
gag ctc tta atc ctc gac cac 1241 Met Lys Leu Leu Gln Asn Cys Trp Ser
Glu Leu Leu Ile Leu Asp His 330 335
340 att tac cga caa gtg gta cat gga aag
gaa gga tcc atc ttc ctg gtt 1289 Ile Tyr Arg Gln Val Val His Gly Lys
Glu Gly Ser Ile Phe Leu Val 345 350
355 360 act ggg caa caa gtg gac tat tcc ata
ata gca tca caa gcc gga gcc 1337 Thr Gly Gln Gln Val Asp Tyr Ser Ile
Ile Ala Ser Gln Ala Gly Ala 365
370 375 acc ctc aac aac ctc atg agt cat gca
cag gag tta gtg gca aaa ctt 1385 Thr Leu Asn Asn Leu Met Ser His Ala
Gln Glu Leu Val Ala Lys Leu 380 385
390 cgt tct ctc cag ttt gat caa cga gag
ttc gta tgt ctg aaa ttc ttg 1433 Arg Ser Leu Gln Phe Asp Gln Arg Glu
Phe Val Cys Leu Lys Phe Leu 395 400
405 gtg ctc ttt agt tta gat gtc aaa aac
ctt gaa aac ttc cag ctg gta 1481 Val Leu Phe Ser Leu Asp Val Lys Asn
Leu Glu Asn Phe Gln Leu Val 410 415
420 gaa ggt gtc cag gaa caa gtc aat gcc
gcc ctg ctg gac tac aca atg 1529 Glu Gly Val Gln Glu Gln Val Asn Ala
Ala Leu Leu Asp Tyr Thr Met 425 430
435 440 tgt aac tac ccg cag cag aca gag aaa
ttt gga cag cta ctt ctt cga 1577 Cys Asn Tyr Pro Gln Gln Thr Glu Lys
Phe Gly Gln Leu Leu Leu Arg 445
450 455 cta ccc gaa atc cgg gcc atc agt atg
cag gct gaa gaa tac ctc tac 1625 Leu Pro Glu Ile Arg Ala Ile Ser Met
Gln Ala Glu Glu Tyr Leu Tyr 460 465
470 tac aag cac ctg aac ggg gat gtg ccc
tat aat aac ctt ctc att gaa 1673 Tyr Lys His Leu Asn Gly Asp Val Pro
Tyr Asn Asn Leu Leu Ile Glu 475 480
485 atg ttg cat gcc aaa aga gca taagttac
aa cccctaggag ctctgctttc 1724 Met Leu His Ala Lys Arg Ala 490 495 aaaacaaaaa gagattgggg gagtggggag ggg
gaagaag aacaggaaga aaaaaagtac 1784 tctgaactgc tccaagcaac gctaattaaa aac
ttgcttt aaagatattg aatttaaaaa 1844 ggcataataa tcaaatactt aatagcaaat aaa
tgatgta tcagggtatt tgtattgcaa 1904 actgtgaatc aaaggcttca cagccccaga gga
ttccata taaaagacat tgtaatggag 1964 tggattgaac tcacagatgg ataccaacac ggt
cagaaga aaaacggaca gaacggttct 2024 tgtatattta aactgatctc cactatgaag aaa
tttagga actaatctta ttaattaggc 2084 ttatacagcg ggggatttga gcttacagga ttc
ctccatg gtaaagctga actgaaacaa 2144 ttctcaagaa tgcatcagct gtacctacaa tag
cccctcc ctcttccttt gaaggcccga 2204 gcacctctgc cctgtggtca ccgaatctgt act
aaggacc tgtgttcagc cacacccagt 2264 ggtagctcca ccaaatcatg aacagcctaa ttt
tgagtgt ctgtgtctta gacctgcaaa 2324 cagctaatag gaaattctat taatatgtta gct
tgccatt ttaaatatgt tctgagggtt 2384 gttttgtctc gtgttcatga tgttaagaaa atg
caggcag tatccctcat cttatgtaag 2444 tgtgaattaa tattaaggga aatgactaca aac
tttcaaa gcaaatgctc catagctaaa 2504 gcaacttaga ccttatttct gctactgttg ctg
aaatgtg gctttggcat tgttggattt 2564 cataaaaaat ttctggcagg aagtcttgtt agt
atacatc agtctttttc atcatccaag 2624 tttgtagttc atttaaaaat acaacattaa aca
cattttg ctaggatgtc aaatagtcac 2684 agttctaagt agttggaaac aaaattgacg cat
gttaatc tatgcaaaga gaaaggaaag 2744 gatgaggtga tgtattgact caaggttcat tct
tgctgca attgaacatc ctcaagagtt 2804 gggatggaaa tggtgatttt tacatgtgtc ctg
gaaagat attaaagtaa ttcaaatctt 2864 ccccaaaggg gaaaggaaga gagtgatact gac
cttttta agtcatagac caaagtctgc 2924 tgtagaacaa atatgggagg acaaagaatc gca
aattctt caaatgacta ttatcagtat 2984 tattaacatg cgatgccaca ggtatgaaag tct
tgcctta tttcacaatt ttaaaaggta 3044 gctgtgcaga tgtggatcaa catttgttta aaa
taaagta ttaatacttt aaagtcaaaa 3104 aaaaaaaaaa a
3115 <210> 2 <211> 495 <212> PRT <213> Human <400> 2 Met Ser Ser Asn Ser Asp Thr Gly Asp
Leu Gln Glu Ser Leu Lys His 1 5
10 15 Gly Leu Thr Pro Ile Val Ser Gln Phe
Lys Met Val Asn Tyr Ser Tyr 20 25
30 Asp Glu Asp Leu Glu Glu Leu Cys Pro
Val Cys Gly Asp Lys Val Ser 35 40
45 Gly Tyr His Tyr Gly Leu Leu Thr Cys
Glu Ser Cys Lys Gly Phe Phe 50 55
60 Lys Arg Thr Val Gln Asn Asn Lys Arg
Tyr Thr Cys Ile Glu Asn Gln 65 70
75 80 Asn Cys Gln Ile Asp Lys Thr Gln Arg
Lys Arg Cys Pro Tyr Cys Arg 85
90 95 Phe Gln Lys Cys Leu Ser Val Gly Met
Lys Leu Glu Ala Val Arg Ala 100 105
110 Asp Arg Met Arg Gly Gly Arg Asn Lys
Phe Gly Pro Met Tyr Lys Arg 115 120
125 Asp Arg Ala Leu Lys Gln Gln Lys Lys
Ala Leu Ile Arg Ala Asn Gly 130 135
140 Leu Lys Leu Glu Ala Met Ser Gln Val
Ile Gln Ala Met Pro Ser Asp 145 150
155 160 Leu Thr Ile Ser Ser Ala Ile Gln Asn
Ile His Ser Ala Ser Lys Gly 165
170 175 Leu Pro Leu Asn His Ala Ala Leu Pro
Pro Thr Asp Tyr Asp Arg Ser 180 185
190 Pro Phe Val Thr Ser Pro Ile Ser Met
Thr Met Pro Pro His Gly Ser 195 200
205 Leu Gln Gly Tyr Gln Thr Tyr Gly His
Phe Pro Ser Arg Ala Ile Lys 210 215
220 Ser Glu Tyr Pro Asp Pro Tyr Thr Ser
Ser Pro Glu Ser Ile Met Gly 225 230
235 240 Tyr Ser Tyr Met Asp Ser Tyr Gln Thr
Ser Ser Pro Ala Ser Ile Pro 245
250 255 His Leu Ile Leu Glu Leu Leu Lys Cys
Glu Pro Asp Glu Pro Gln Val 260 265
270 Gln Ala Lys Ile Met Ala Tyr Leu Gln
Gln Glu Gln Ala Asn Arg Ser 275 280
285 Lys His Glu Lys Leu Ser Thr Phe Gly
Leu Met Cys Lys Met Ala Asp 290 295
300 Gln Thr Leu Phe Ser Ile Val Glu Trp
Ala Arg Ser Ser Ile Phe Phe 305 310
315 320 Arg Glu Leu Lys Val Asp Asp Gln Met
Lys Leu Leu Gln Asn Cys Trp 325
330 335 Ser Glu Leu Leu Ile Leu Asp His Ile
Tyr Arg Gln Val Val His Gly 340 345
350 Lys Glu Gly Ser Ile Phe Leu Val Thr
Gly Gln Gln Val Asp Tyr Ser 355 360
365 Ile Ile Ala Ser Gln Ala Gly Ala Thr
Leu Asn Asn Leu Met Ser His 370 375
380 Ala Gln Glu Leu Val Ala Lys Leu Arg
Ser Leu Gln Phe Asp Gln Arg 385 390
395 400 Glu Phe Val Cys Leu Lys Phe Leu Val
Leu Phe Ser Leu Asp Val Lys 405
410 415 Asn Leu Glu Asn Phe Gln Leu Val Glu
Gly Val Gln Glu Gln Val Asn 420 425
430 Ala Ala Leu Leu Asp Tyr Thr Met Cys
Asn Tyr Pro Gln Gln Thr Glu 435 440
445 Lys Phe Gly Gln Leu Leu Leu Arg Leu
Pro Glu Ile Arg Ala Ile Ser 450 455
460 Met Gln Ala Glu Glu Tyr Leu Tyr Tyr
Lys His Leu Asn Gly Asp Val 465 470
475 480 Pro Tyr Asn Asn Leu Leu Ile Glu Met
Leu His Ala Lys Arg Ala 485
490 495 <210> 3 <211> 1245 <212> DNA <213> Human <220> <221> CDS <222> (202)..(1170) <400> 3 cggccgcgtc gacggaaaga cttgcttgta act
ttatgaa ttctggattt ttttttttcc 60 tttgcttttt cttaactttc actaagggtt act
gtagtct gatgtgtcct tcccaaggcc 120 acgaaatttg acaagctgca cttttctttt gct
caatgat ttctgcttta agccaaagaa 180 ctgcctataa tttcactaag a atg tct tct
aat tca gat act ggg gat tta 231 Met Ser Ser
Asn Ser Asp Thr Gly Asp Leu 1
5 10 caa gag tct tta aag cac gga ctt aca
cct att gtg tct caa ttt aaa 279 Gln Glu Ser Leu Lys His Gly Leu Thr
Pro Ile Val Ser Gln Phe Lys 15
20 25 atg gtg aat tac tcc tat gat gaa gat
ctg gaa gag ctt tgt ccc gtg 327 Met Val Asn Tyr Ser Tyr Asp Glu Asp
Leu Glu Glu Leu Cys Pro Val 30 35
40 tgt gga gat aaa gtg tct ggg tac cat
tat ggg ctc ctc acc tgt gaa 375 Cys Gly Asp Lys Val Ser Gly Tyr His
Tyr Gly Leu Leu Thr Cys Glu 45 50
55 agc tgc aag gga ttt ttt aag cga aca
gtc caa aat aat aaa agg tac 423 Ser Cys Lys Gly Phe Phe Lys Arg Thr
Val Gln Asn Asn Lys Arg Tyr 60 65
70 aca tgt ata gaa aac cag aac tgc caa
att gac aaa aca cag aga aag 471 Thr Cys Ile Glu Asn Gln Asn Cys Gln
Ile Asp Lys Thr Gln Arg Lys 75 80
85 90 cgt tgt cct tac tgt cgt ttt caa aaa
tgt cta agt gtt gga atg aag 519 Arg Cys Pro Tyr Cys Arg Phe Gln Lys
Cys Leu Ser Val Gly Met Lys 95
100 105 cta gaa gct gta agg gcc gac cga atg
cgt gga gga agg aat aag ttt 567 Leu Glu Ala Val Arg Ala Asp Arg Met
Arg Gly Gly Arg Asn Lys Phe 110 115
120 ggg cca atg tac aag aga gac agg gcc
ctg aag caa cag aaa aaa gcc 615 Gly Pro Met Tyr Lys Arg Asp Arg Ala
Leu Lys Gln Gln Lys Lys Ala 125 130
135 ctc atc cga gcc aat gga ctt aag cta
gaa gcc atg tct cag gtt gat 663 Leu Ile Arg Ala Asn Gly Leu Lys Leu
Glu Ala Met Ser Gln Val Asp 140 145
150 gac caa atg aag ctg ctt cag aac tgc
tgg agt gag ctc tta atc ctc 711 Asp Gln Met Lys Leu Leu Gln Asn Cys
Trp Ser Glu Leu Leu Ile Leu 155 160
165 170 gac cac att tac cga caa gtg gta cat
gga aag gaa gga tcc atc ttc 759 Asp His Ile Tyr Arg Gln Val Val His
Gly Lys Glu Gly Ser Ile Phe 175
180 185 ctg gtt act ggg caa caa gtg gac tat
tcc ata ata gca tca caa gcc 807 Leu Val Thr Gly Gln Gln Val Asp Tyr
Ser Ile Ile Ala Ser Gln Ala 190 195
200 gga gcc acc ctc aac aac ctc atg agt
cat gca cag gag tta gtg gca 855 Gly Ala Thr Leu Asn Asn Leu Met Ser
His Ala Gln Glu Leu Val Ala 205 210
215 aaa ctt cgt tct ctc cag ttt gat caa
cga gag ttc gta tgt ctg aaa 903 Lys Leu Arg Ser Leu Gln Phe Asp Gln
Arg Glu Phe Val Cys Leu Lys 220 225
230 ttc ttg gtg ctc ttt agt tta gat gtc
aaa aac ctt gaa aac ttc cag 951 Phe Leu Val Leu Phe Ser Leu Asp Val
Lys Asn Leu Glu Asn Phe Gln 235 240
245 250 ctg gta gaa ggt gtc cag gaa caa gtc
aat gcc gcc ctg ctg gac tac 999 Leu Val Glu Gly Val Gln Glu Gln Val
Asn Ala Ala Leu Leu Asp Tyr 255
260 265 aca atg tgt aac tac ccg cag cag aca
gag aaa ttt cga cag cta ctt 1047 Thr Met Cys Asn Tyr Pro Gln Gln Thr
Glu Lys Phe Arg Gln Leu Leu 270 275
280 ctt cga cta ccc gaa atc cgg gcc atc
agt atg cag gct gaa gaa tac 1095 Leu Arg Leu Pro Glu Ile Arg Ala Ile
Ser Met Gln Ala Glu Glu Tyr 285 290
295 ctc tac tac aag cac ctg aac ggg gat
gtg ccc tat aat aac ctt ctc 1143 Leu Tyr Tyr Lys His Leu Asn Gly Asp
Val Pro Tyr Asn Asn Leu Leu 300 305
310 att gaa atg ttg cat gcc aaa aga gca
taagttacaa cccctaggag 1190 Ile Glu Met Leu His Ala Lys Arg Ala 315 320 ctctgctttc aaaacaaaaa gagattgggg gag
tggggag ggggaagaag aacag 1245 <210> 4 <211> 323 <212> PRT <213> Human <400> 4 Met Ser Ser Asn Ser Asp Thr Gly Asp
Leu Gln Glu Ser Leu Lys His 1 5
10 15 Gly Leu Thr Pro Ile Val Ser Gln Phe
Lys Met Val Asn Tyr Ser Tyr 20 25
30 Asp Glu Asp Leu Glu Glu Leu Cys Pro
Val Cys Gly Asp Lys Val Ser 35 40
45 Gly Tyr His Tyr Gly Leu Leu Thr Cys
Glu Ser Cys Lys Gly Phe Phe 50 55
60 Lys Arg Thr Val Gln Asn Asn Lys Arg
Tyr Thr Cys Ile Glu Asn Gln 65 70
75 80 Asn Cys Gln Ile Asp Lys Thr Gln Arg
Lys Arg Cys Pro Tyr Cys Arg 85
90 95 Phe Gln Lys Cys Leu Ser Val Gly Met
Lys Leu Glu Ala Val Arg Ala 100 105
110 Asp Arg Met Arg Gly Gly Arg Asn Lys
Phe Gly Pro Met Tyr Lys Arg 115 120
125 Asp Arg Ala Leu Lys Gln Gln Lys Lys
Ala Leu Ile Arg Ala Asn Gly 130 135
140 Leu Lys Leu Glu Ala Met Ser Gln Val
Asp Asp Gln Met Lys Leu Leu 145 150
155 160 Gln Asn Cys Trp Ser Glu Leu Leu Ile
Leu Asp His Ile Tyr Arg Gln 165
170 175 Val Val His Gly Lys Glu Gly Ser Ile
Phe Leu Val Thr Gly Gln Gln 180 185
190 Val Asp Tyr Ser Ile Ile Ala Ser Gln
Ala Gly Ala Thr Leu Asn Asn 195 200
205 Leu Met Ser His Ala Gln Glu Leu Val
Ala Lys Leu Arg Ser Leu Gln 210 215
220 Phe Asp Gln Arg Glu Phe Val Cys Leu
Lys Phe Leu Val Leu Phe Ser 225 230
235 240 Leu Asp Val Lys Asn Leu Glu Asn Phe
Gln Leu Val Glu Gly Val Gln 245
250 255 Glu Gln Val Asn Ala Ala Leu Leu Asp
Tyr Thr Met Cys Asn Tyr Pro 260 265
270 Gln Gln Thr Glu Lys Phe Arg Gln Leu
Leu Leu Arg Leu Pro Glu Ile 275 280
285 Arg Ala Ile Ser Met Gln Ala Glu Glu
Tyr Leu Tyr Tyr Lys His Leu 290 295
300 Asn Gly Asp Val Pro Tyr Asn Asn Leu
Leu Ile Glu Met Leu His Ala 305 310
315 320 Lys Arg Ala <210> 5 <211> 3251 <212> DNA <213> Human <220> <221> CDS <222> (208)..(1830) <400> 5 cgcggccgcg tcgaccaggg aaaagacttg ctt
gtaactt tatgaattct ggattttttt 60 ttttcctttg ctttttctta actttcacta agg
gttactg tagtctgatg tgtccttccc 120 aaggccacga aatttgacaa gctgcacttt tct
tttgctc aatgatttct gctttaagcc 180 aaagaactgc ctataatttc actaaga atg tc
t tct aat tca gat act ggg gat 234 Met Se
r Ser Asn Ser Asp Thr Gly Asp 1
5 tta caa gag tct tta aag cac gga ctt
aca cct att ggt gct ggg ctt 282 Leu Gln Glu Ser Leu Lys His Gly Leu
Thr Pro Ile Gly Ala Gly Leu 10 15
20 25 ccg gac cga cac gga tcc ccc atc ccc
gcc cgc ggt cgc ctt gtc atg 330 Pro Asp Arg His Gly Ser Pro Ile Pro
Ala Arg Gly Arg Leu Val Met 30
35 40 ctg ccc aaa gtg gag acg gaa gcc ctg
gga ctg gct cga tcg cat ggg 378 Leu Pro Lys Val Glu Thr Glu Ala Leu
Gly Leu Ala Arg Ser His Gly 45 50
55 gaa cag ggc cag atg ccg gaa aac atg
caa gtg tct caa ttt aaa atg 426 Glu Gln Gly Gln Met Pro Glu Asn Met
Gln Val Ser Gln Phe Lys Met 60 65
70 gtg aat tac tcc tat gat gaa gat ctg
gaa gaa ctt tgt ccc gtg tgt 474 Val Asn Tyr Ser Tyr Asp Glu Asp Leu
Glu Glu Leu Cys Pro Val Cys 75 80
85 gga gat aaa gtg tct ggg tac cat tat
ggg ctc ctc acc tgt gaa agc 522 Gly Asp Lys Val Ser Gly Tyr His Tyr
Gly Leu Leu Thr Cys Glu Ser 90 95
100 105 tgc aag gga ttt ttt aag cga aca gtc
caa aat aat aaa agg tac aca 570 Cys Lys Gly Phe Phe Lys Arg Thr Val
Gln Asn Asn Lys Arg Tyr Thr 110
115 120 tgt ata gaa aac cag aac tgc caa att
gac aaa aca cag aga aag cgt 618 Cys Ile Glu Asn Gln Asn Cys Gln Ile
Asp Lys Thr Gln Arg Lys Arg 125 130
135 tgt cct tac tgt cgt ttt caa aaa tgt
cta agt gtt gga atg aag cta 666 Cys Pro Tyr Cys Arg Phe Gln Lys Cys
Leu Ser Val Gly Met Lys Leu 140 145
150 gaa gct gta agg gcc gac cga atg cgt
gga gga agg aat aag ttt ggg 714 Glu Ala Val Arg Ala Asp Arg Met Arg
Gly Gly Arg Asn Lys Phe Gly 155 160
165 cca atg tac aag aga gac agg gcc ctg
aag caa cag aaa aaa gcc ctc 762 Pro Met Tyr Lys Arg Asp Arg Ala Leu
Lys Gln Gln Lys Lys Ala Leu 170 175
180 185 atc cga gcc aat gga ctt aag cta gaa
gcc atg tct cag gtg atc caa 810 Ile Arg Ala Asn Gly Leu Lys Leu Glu
Ala Met Ser Gln Val Ile Gln 190
195 200 gct atg ccc tct gac ctg acc att tcc
tct gca att caa aac atc cac 858 Ala Met Pro Ser Asp Leu Thr Ile Ser
Ser Ala Ile Gln Asn Ile His 205 210
215 tct gcc tcc aaa ggc cta cct ctg aac
cat gct gcc ttg cct cct aca 906 Ser Ala Ser Lys Gly Leu Pro Leu Asn
His Ala Ala Leu Pro Pro Thr 220 225
230 gac tat gac aga agt ccc ttt gta aca
tcc ccc att agc atg aca atg 954 Asp Tyr Asp Arg Ser Pro Phe Val Thr
Ser Pro Ile Ser Met Thr Met 235 240
245 ccc cct cac ggc agc ctg caa ggt tac
caa aca tat ggc cac ttt cct 1002 Pro Pro His Gly Ser Leu Gln Gly Tyr
Gln Thr Tyr Gly His Phe Pro 250 255
260 265 agc cgg gcc atc aag tct gag tac cca
gac ccc tat acc agc tca ccc 1050 Ser Arg Ala Ile Lys Ser Glu Tyr Pro
Asp Pro Tyr Thr Ser Ser Pro 270
275 280 gag tcc ata atg ggc tat tca tat atg
gat agt tac cag acg agc tct 1098 Glu Ser Ile Met Gly Tyr Ser Tyr Met
Asp Ser Tyr Gln Thr Ser Ser 285 290
295 cca gca agc atc cca cat ctg ata ctg
gaa ctt ttg aag tgt gag cca 1146 Pro Ala Ser Ile Pro His Leu Ile Leu
Glu Leu Leu Lys Cys Glu Pro 300 305
310 gat gag cct caa gtc cag gct aaa atc
atg gcc tat ttg cag caa gag 1194 Asp Glu Pro Gln Val Gln Ala Lys Ile
Met Ala Tyr Leu Gln Gln Glu 315 320
325 cag gct aac cga agc aag cac gaa aag
ctg agc acc ttt ggg ctt atg 1242 Gln Ala Asn Arg Ser Lys His Glu Lys
Leu Ser Thr Phe Gly Leu Met 330 335
340 345 tgc aaa atg gca gat caa act ctc ttc
tcc att gtc gag tgg gcc agg 1290 Cys Lys Met Ala Asp Gln Thr Leu Phe
Ser Ile Val Glu Trp Ala Arg 350
355 360 agt agt atc ttc ttc aga gaa ctt aag
gtt gat gac caa atg aag ctg 1338 Ser Ser Ile Phe Phe Arg Glu Leu Lys
Val Asp Asp Gln Met Lys Leu 365 370
375 ctt cag aac tgc tgg agt gag ctc tta
atc ctc gac cac att tac cga 1386 Leu Gln Asn Cys Trp Ser Glu Leu Leu
Ile Leu Asp His Ile Tyr Arg 380 385
390 caa gtg gta cat gga aag gaa gga tcc
atc ttc ctg gtt act ggg caa 1434 Gln Val Val His Gly Lys Glu Gly Ser
Ile Phe Leu Val Thr Gly Gln 395 400
405 caa gtg gac tat tcc ata ata gca tca
caa gcc gga gcc acc ctc aac 1482 Gln Val Asp Tyr Ser Ile Ile Ala Ser
Gln Ala Gly Ala Thr Leu Asn 410 415
420 425 aac ctc atg agt cat gca cag gag tta
gtg gca aaa ctt cgt tct ctc 1530 Asn Leu Met Ser His Ala Gln Glu Leu
Val Ala Lys Leu Arg Ser Leu 430
435 440 cag ttt gat caa cga gag ttc gta tgt
ctg aaa ttc ttg gtg ctc ttt 1578 Gln Phe Asp Gln Arg Glu Phe Val Cys
Leu Lys Phe Leu Val Leu Phe 445 450
455 agt tta gat gtc aaa aac ctt gaa aac
ttc cag ctg gta gaa ggt gtc 1626 Ser Leu Asp Val Lys Asn Leu Glu Asn
Phe Gln Leu Val Glu Gly Val 460 465
470 cag gaa caa gtc aat gcc gcc ctg ctg
gac tac aca atg tgt aac tac 1674 Gln Glu Gln Val Asn Ala Ala Leu Leu
Asp Tyr Thr Met Cys Asn Tyr 475 480
485 ccg cag cag aca gag aaa ttt gga cag
cta ctt ctt cga cta ccc gaa 1722 Pro Gln Gln Thr Glu Lys Phe Gly Gln
Leu Leu Leu Arg Leu Pro Glu 490 495
500 505 atc cgg gcc atc agt atg cag gct gaa
gaa tac ctc tac tac aag cac 1770 Ile Arg Ala Ile Ser Met Gln Ala Glu
Glu Tyr Leu Tyr Tyr Lys His 510
515 520 ctg aac ggg gat gtg ccc tat aat aac
ctt ctc att gaa atg ttg cat 1818 Leu Asn Gly Asp Val Pro Tyr Asn Asn
Leu Leu Ile Glu Met Leu His 525 530
535 gcc aaa aga gca taagttacaa cccctagga
g ctctgctttc aaaacaaaaa 1870 Ala Lys Arg Ala 540 gagattgggg gagtggggag ggggaagaag aac
aggaaga aaaaaagtac tctgaactgc 1930 tccaagcaac gctaattaaa aacttgcttt aaa
gatattg aatttaaaaa ggcataataa 1990 tcaaatactt aatagcaaat aaatgatgta tca
gggtatt tgtattgcaa actgtgaatc 2050 aaaggcttca cagccccaga ggattccata taa
aagacat tgtaatggag tggattgaac 2110 tcacagatgg ataccaacac ggtcagaaga aaa
acggaca gaacggttct tgtatattta 2170 aactgatctc cactatgaag aaatttagga act
aatctta ttaattaggc ttatacagcg 2230 ggggatttga gcttacagga ttcctccatg gta
aagctga actgaaacaa ttctcaagaa 2290 tgcatcagct gtacctacaa tagcccctcc ctc
ttccttt gaaggcccga gcacctctgc 2350 cctgtggtca ccgaatctgt actaaggacc tgt
gttcagc cacacccagt ggtagctcca 2410 ccaaatcatg aacagcctaa ttttgagtgt ctg
tgtctta gacctgcaaa cagctaatag 2470 gaaattctat taatatgtta gcttgccatt tta
aatatgt tctgagggtt gttttgtctc 2530 gtgttcatga tgttaagaaa atgcaggcag tat
ccctcat cttatgtaag tgtgaattaa 2590 tattaaggga aatgactaca aactttcaaa gca
aatgctc catagctaaa gcaacttaga 2650 ccttatttct gctactgttg ctgaaatgtg gct
ttggcat tgttggattt cataaaaaat 2710 ttctggcagg aagtcttgtt agtatacatc agt
ctttttc atcatccaag tttgtagttc 2770 atttaaaaat acaacattaa acacattttg cta
ggatgtc aaatagtcac agttctaagt 2830 agttggaaac aaaattgacg catgttaatc tat
gcaaaga gaaaggaaag gatgaggtga 2890 tgtattgact caaggttcat tcttgctgca att
gaacatc ctcaagagtt gggatggaaa 2950 tggtgatttt tacatgtgtc ctggaaagat att
aaagtaa ttcaaatctt ccccaaaggg 3010 gaaaggaaga gagtgatact gaccttttta agt
catagac caaagtctgc tgtagaacaa 3070 atatgggagg acaaagaatc gcaaattctt caa
atgacta ttatcagtat tattaacatg 3130 cgatgccaca ggtatgaaag tcttgcctta ttt
cacaatt ttaaaaggta gctgtgcaga 3190 tgtggatcaa catttgttta aaataaagta tta
atacttt aaagtcaaaa aaaaaaaaaa 3250 a
3251 <210> 6 <211> 541 <212> PRT <213> Human <400> 6 Met Ser Ser Asn Ser Asp Thr Gly Asp
Leu Gln Glu Ser Leu Lys His 1 5
10 15 Gly Leu Thr Pro Ile Gly Ala Gly Leu
Pro Asp Arg His Gly Ser Pro 20 25
30 Ile Pro Ala Arg Gly Arg Leu Val Met
Leu Pro Lys Val Glu Thr Glu 35 40
45 Ala Leu Gly Leu Ala Arg Ser His Gly
Glu Gln Gly Gln Met Pro Glu 50 55
60 Asn Met Gln Val Ser Gln Phe Lys Met
Val Asn Tyr Ser Tyr Asp Glu 65 70
75 80 Asp Leu Glu Glu Leu Cys Pro Val Cys
Gly Asp Lys Val Ser Gly Tyr 85
90 95 His Tyr Gly Leu Leu Thr Cys Glu Ser
Cys Lys Gly Phe Phe Lys Arg 100 105
110 Thr Val Gln Asn Asn Lys Arg Tyr Thr
Cys Ile Glu Asn Gln Asn Cys 115 120
125 Gln Ile Asp Lys Thr Gln Arg Lys Arg
Cys Pro Tyr Cys Arg Phe Gln 130 135
140 Lys Cys Leu Ser Val Gly Met Lys Leu
Glu Ala Val Arg Ala Asp Arg 145 150
155 160 Met Arg Gly Gly Arg Asn Lys Phe Gly
Pro Met Tyr Lys Arg Asp Arg 165
170 175 Ala Leu Lys Gln Gln Lys Lys Ala Leu
Ile Arg Ala Asn Gly Leu Lys 180 185
190 Leu Glu Ala Met Ser Gln Val Ile Gln
Ala Met Pro Ser Asp Leu Thr 195 200
205 Ile Ser Ser Ala Ile Gln Asn Ile His
Ser Ala Ser Lys Gly Leu Pro 210 215
220 Leu Asn His Ala Ala Leu Pro Pro Thr
Asp Tyr Asp Arg Ser Pro Phe 225 230
235 240 Val Thr Ser Pro Ile Ser Met Thr Met
Pro Pro His Gly Ser Leu Gln 245
250 255 Gly Tyr Gln Thr Tyr Gly His Phe Pro
Ser Arg Ala Ile Lys Ser Glu 260 265
270 Tyr Pro Asp Pro Tyr Thr Ser Ser Pro
Glu Ser Ile Met Gly Tyr Ser 275 280
285 Tyr Met Asp Ser Tyr Gln Thr Ser Ser
Pro Ala Ser Ile Pro His Leu 290 295
300 Ile Leu Glu Leu Leu Lys Cys Glu Pro
Asp Glu Pro Gln Val Gln Ala 305 310
315 320 Lys Ile Met Ala Tyr Leu Gln Gln Glu
Gln Ala Asn Arg Ser Lys His 325
330 335 Glu Lys Leu Ser Thr Phe Gly Leu Met
Cys Lys Met Ala Asp Gln Thr 340 345
350 Leu Phe Ser Ile Val Glu Trp Ala Arg
Ser Ser Ile Phe Phe Arg Glu 355 360
365 Leu Lys Val Asp Asp Gln Met Lys Leu
Leu Gln Asn Cys Trp Ser Glu 370 375
380 Leu Leu Ile Leu Asp His Ile Tyr Arg
Gln Val Val His Gly Lys Glu 385 390
395 400 Gly Ser Ile Phe Leu Val Thr Gly Gln
Gln Val Asp Tyr Ser Ile Ile 405
410 415 Ala Ser Gln Ala Gly Ala Thr Leu Asn
Asn Leu Met Ser His Ala Gln 420 425
430 Glu Leu Val Ala Lys Leu Arg Ser Leu
Gln Phe Asp Gln Arg Glu Phe 435 440
445 Val Cys Leu Lys Phe Leu Val Leu Phe
Ser Leu Asp Val Lys Asn Leu 450 455
460 Glu Asn Phe Gln Leu Val Glu Gly Val
Gln Glu Gln Val Asn Ala Ala 465 470
475 480 Leu Leu Asp Tyr Thr Met Cys Asn Tyr
Pro Gln Gln Thr Glu Lys Phe 485
490 495 Gly Gln Leu Leu Leu Arg Leu Pro Glu
Ile Arg Ala Ile Ser Met Gln 500 505
510 Ala Glu Glu Tyr Leu Tyr Tyr Lys His
Leu Asn Gly Asp Val Pro Tyr 515 520
525 Asn Asn Leu Leu Ile Glu Met Leu His
Ala Lys Arg Ala 530 535
540 <210> 7 <211> 2330 <212> DNA <213> Human <220> <221> CDS <222> (363)..(1862) <400> 7 gaaactggat acatggttta cagcaggtca cta
atgttgg aaaaagtaca gagtccaggg 60 aaagacttgc ttgtaacttt atgaattctg gat
ttttttt cctttgcttt ttcttaactt 120 tcactaaggg ttactgtagt ctgatgtgtc ctt
cccaagg ccacgaaatt tgacaagctg 180 cacttttctt ttgctcaatg atttctgctt taa
gccaaag aactgcctat aatttcacta 240 agaatgtctt ctaattcaga tactggggat tta
caagagt ctttaaagca cggacttaca 300 cctattggtg ctgggcttcc ggaccgacac gga
tccccca tcccgcccgc ggtcgccttg 360 tc atg ctg ccc aaa gtg gag acg gaa g
cc ctg gga ctg gct cga tcg 407 Met Leu Pro Lys Val Glu Thr Glu A
la Leu Gly Leu Ala Arg Ser 1 5
10 15 cat ggg gaa cag ggc cag atg ccg gaa
aac atg caa gtg tct caa ttt 455 His Gly Glu Gln Gly Gln Met Pro Glu
Asn Met Gln Val Ser Gln Phe 20
25 30 aaa atg gtg aat tac tcc tat gat gaa
gat ctg gaa gag ctt tgt ccc 503 Lys Met Val Asn Tyr Ser Tyr Asp Glu
Asp Leu Glu Glu Leu Cys Pro 35 40
45 gtg tgt gga gat aaa gtg tct ggg tac
cat tat ggg ctc ctc acc tgt 551 Val Cys Gly Asp Lys Val Ser Gly Tyr
His Tyr Gly Leu Leu Thr Cys 50 55
60 gaa agc tgc aag gga ttt ttt aag cga
aca gtc caa aat aat aaa agg 599 Glu Ser Cys Lys Gly Phe Phe Lys Arg
Thr Val Gln Asn Asn Lys Arg 65 70
75 tac aca tgt ata gaa aac cag aac tgc
caa att gac aaa aca cag aga 647 Tyr Thr Cys Ile Glu Asn Gln Asn Cys
Gln Ile Asp Lys Thr Gln Arg 80 85
90 aag cgt tgt cct tac tgt cgt ttt caa
aaa tgt cta agt gtt gga atg 695 Lys Arg Cys Pro Tyr Cys Arg Phe Gln
Lys Cys Leu Ser Val Gly Met 100
105 110 aag cta gaa gct gta agg gcc gac cga
atg cgt gga gga agg aat aag 743 Lys Leu Glu Ala Val Arg Ala Asp Arg
Met Arg Gly Gly Arg Asn Lys 115 120
125 ttt ggg cca atg tac aag aga gac agg
gcc ctg aag caa cag aaa aaa 791 Phe Gly Pro Met Tyr Lys Arg Asp Arg
Ala Leu Lys Gln Gln Lys Lys 130 135
140 gcc ctc atc cga gcc aat gga ctt aag
cta gaa gcc atg tct cag gtg 839 Ala Leu Ile Arg Ala Asn Gly Leu Lys
Leu Glu Ala Met Ser Gln Val 145 150
155 atc caa gct atg ccc tct gac ctg acc
att tcc tct gca att caa aac 887 Ile Gln Ala Met Pro Ser Asp Leu Thr
Ile Ser Ser Ala Ile Gln Asn 160 165
170 175 atc cac tct gcc tcc aaa ggc cta cct
ctg aac cat gct gcc ttg cct 935 Ile His Ser Ala Ser Lys Gly Leu Pro
Leu Asn His Ala Ala Leu Pro 180
185 190 cct aca gac tat gac aga agt ccc ttt
gta aca tcc ccc att agc atg 983 Pro Thr Asp Tyr Asp Arg Ser Pro Phe
Val Thr Ser Pro Ile Ser Met 195 200
205 aca atg ctg cac ggc agc ctg caa ggt
tac caa aca tat ggc cac ttt 1031 Thr Met Leu His Gly Ser Leu Gln Gly
Tyr Gln Thr Tyr Gly His Phe 210 215
220 cct agc cgg gcc atc aag tct gag tac
cca gac ccc tat acc agc tca 1079 Pro Ser Arg Ala Ile Lys Ser Glu Tyr
Pro Asp Pro Tyr Thr Ser Ser 225 230
235 ccc gag tcc ata atg ggc tat tca tat
atg gat agt tac cag acg agc 1127 Pro Glu Ser Ile Met Gly Tyr Ser Tyr
Met Asp Ser Tyr Gln Thr Ser 240 245
250 255 tct cca gca agc atc cca cat ctg ata
ctg gaa ctt ttg aag tgt gag 1175 Ser Pro Ala Ser Ile Pro His Leu Ile
Leu Glu Leu Leu Lys Cys Glu 260
265 270 cca gat gag cct caa gtc cag gct aaa
atc atg gcc tat ttg cag caa 1223 Pro Asp Glu Pro Gln Val Gln Ala Lys
Ile Met Ala Tyr Leu Gln Gln 275 280
285 gag cag gct aac cga agc aag cac gaa
aag ctg agc acc ttt ggg ctt 1271 Glu Gln Ala Asn Arg Ser Lys His Glu
Lys Leu Ser Thr Phe Gly Leu 290 295
300 atg tgc aaa atg gca gat caa act gtc
ttc tcc att gtc gag tgg gcc 1319 Met Cys Lys Met Ala Asp Gln Thr Val
Phe Ser Ile Val Glu Trp Ala 305 310
315 agg agt agt atc ttc ttc aga gaa ctt
aag gtt gat gac caa atg aag 1367 Arg Ser Ser Ile Phe Phe Arg Glu Leu
Lys Val Asp Asp Gln Met Lys 320 325
330 335 ctg ctt cag aac tgc tgg agt gag ctc
tta atc ctc gac cac att tac 1415 Leu Leu Gln Asn Cys Trp Ser Glu Leu
Leu Ile Leu Asp His Ile Tyr 340
345 350 cga caa gtg gta cat gga aag gaa gga
tcc atc ttc ctg gtt act ggg 1463 Arg Gln Val Val His Gly Lys Glu Gly
Ser Ile Phe Leu Val Thr Gly 355 360
365 caa caa gtg gac tat tcc ata ata gca
tca caa gcc gga gcc acc ctc 1511 Gln Gln Val Asp Tyr Ser Ile Ile Ala
Ser Gln Ala Gly Ala Thr Leu 370 375
380 aac aac ctc atg agt cat gca cag gag
tta gtg gca aaa ctt cgt tct 1559 Asn Asn Leu Met Ser His Ala Gln Glu
Leu Val Ala Lys Leu Arg Ser 385 390
395 ctc cag ttt gat caa cga gag ttc gta
tgt ctg aaa ttc ttg gtg ctc 1607 Leu Gln Phe Asp Gln Arg Glu Phe Val
Cys Leu Lys Phe Leu Val Leu 400 405
410 415 ttt agt tta gat gtc aaa aac ctt gaa
aac ttc cag ctg gta gaa ggt 1655 Phe Ser Leu Asp Val Lys Asn Leu Glu
Asn Phe Gln Leu Val Glu Gly 420
425 430 gtc cag gaa caa gtc aat gcc gcc ctg
ctg gac tac aca atg tgt aac 1703 Val Gln Glu Gln Val Asn Ala Ala Leu
Leu Asp Tyr Thr Met Cys Asn 435 440
445 tac ccg cag cag aca gag aaa ttt gga
cag cta ctt ctt cga cta ccc 1751 Tyr Pro Gln Gln Thr Glu Lys Phe Gly
Gln Leu Leu Leu Arg Leu Pro 450 455
460 gaa atc cgg gcc atc agt atg cag gct
gaa gaa tac ctc tac tac aag 1799 Glu Ile Arg Ala Ile Ser Met Gln Ala
Glu Glu Tyr Leu Tyr Tyr Lys 465 470
475 cac ctg aat ggg gat gtg ccc tat aat
aac ctt ctc att gaa atg ttg 1847 His Leu Asn Gly Asp Val Pro Tyr Asn
Asn Leu Leu Ile Glu Met Leu 480 485
490 495 cat gcc aaa aga gca taagttacaa cccct
aggag ctctgctttc aaaacaaaaa 1902 His Ala Lys Arg Ala
500 gagattgggg gagtggggag ggggaagaag aac
aggaaga aaaaaagtac tctgaactgc 1962 tccaagtaac gctaattaaa aacttgcttt aaa
gatattg aatttaaaaa ggcataataa 2022 tcaaatacta atagcaaata aatgatgtat cag
ggtattt gtattgcaaa ctgtgaatca 2082 aagcttcaca gccccagagg attccatata aaa
gacattg taatggagtg gattgaactc 2142 acagatggat accaacacgg tcagaagaaa aac
ggacaga acggttcttg tatatttaaa 2202 ctgatctcca ctatgaagaa atttaggaac taa
tcttatt aattaggctt atacagcggg 2262 gatttgagct tacaggattc ctccatggta aag
ctgaact gaaacaattc tcaagaatgc 2322 atcagctg
2330 <210> 8 <211> 500 <212> PRT <213> Human <400> 8 Met Leu Pro Lys Val Glu Thr Glu Ala
Leu Gly Leu Ala Arg Ser His 1 5
10 15 Gly Glu Gln Gly Gln Met Pro Glu Asn
Met Gln Val Ser Gln Phe Lys 20 25
30 Met Val Asn Tyr Ser Tyr Asp Glu Asp
Leu Glu Glu Leu Cys Pro Val 35 40
45 Cys Gly Asp Lys Val Ser Gly Tyr His
Tyr Gly Leu Leu Thr Cys Glu 50 55
60 Ser Cys Lys Gly Phe Phe Lys Arg Thr
Val Gln Asn Asn Lys Arg Tyr 65 70
75 80 Thr Cys Ile Glu Asn Gln Asn Cys Gln
Ile Asp Lys Thr Gln Arg Lys 85
90 95 Arg Cys Pro Tyr Cys Arg Phe Gln Lys
Cys Leu Ser Val Gly Met Lys 100 105
110 Leu Glu Ala Val Arg Ala Asp Arg Met
Arg Gly Gly Arg Asn Lys Phe 115 120
125 Gly Pro Met Tyr Lys Arg Asp Arg Ala
Leu Lys Gln Gln Lys Lys Ala 130 135
140 Leu Ile Arg Ala Asn Gly Leu Lys Leu
Glu Ala Met Ser Gln Val Ile 145 150
155 160 Gln Ala Met Pro Ser Asp Leu Thr Ile
Ser Ser Ala Ile Gln Asn Ile 165
170 175 His Ser Ala Ser Lys Gly Leu Pro Leu
Asn His Ala Ala Leu Pro Pro 180 185
190 Thr Asp Tyr Asp Arg Ser Pro Phe Val
Thr Ser Pro Ile Ser Met Thr 195 200
205 Met Leu His Gly Ser Leu Gln Gly Tyr
Gln Thr Tyr Gly His Phe Pro 210 215
220 Ser Arg Ala Ile Lys Ser Glu Tyr Pro
Asp Pro Tyr Thr Ser Ser Pro 225 230
235 240 Glu Ser Ile Met Gly Tyr Ser Tyr Met
Asp Ser Tyr Gln Thr Ser Ser 245
250 255 Pro Ala Ser Ile Pro His Leu Ile Leu
Glu Leu Leu Lys Cys Glu Pro 260 265
270 Asp Glu Pro Gln Val Gln Ala Lys Ile
Met Ala Tyr Leu Gln Gln Glu 275 280
285 Gln Ala Asn Arg Ser Lys His Glu Lys
Leu Ser Thr Phe Gly Leu Met 290 295
300 Cys Lys Met Ala Asp Gln Thr Val Phe
Ser Ile Val Glu Trp Ala Arg 305 310
315 320 Ser Ser Ile Phe Phe Arg Glu Leu Lys
Val Asp Asp Gln Met Lys Leu 325
330 335 Leu Gln Asn Cys Trp Ser Glu Leu Leu
Ile Leu Asp His Ile Tyr Arg 340 345
350 Gln Val Val His Gly Lys Glu Gly Ser
Ile Phe Leu Val Thr Gly Gln 355 360
365 Gln Val Asp Tyr Ser Ile Ile Ala Ser
Gln Ala Gly Ala Thr Leu Asn 370 375
380 Asn Leu Met Ser His Ala Gln Glu Leu
Val Ala Lys Leu Arg Ser Leu 385 390
395 400 Gln Phe Asp Gln Arg Glu Phe Val Cys
Leu Lys Phe Leu Val Leu Phe 405
410 415 Ser Leu Asp Val Lys Asn Leu Glu Asn
Phe Gln Leu Val Glu Gly Val 420 425
430 Gln Glu Gln Val Asn Ala Ala Leu Leu
Asp Tyr Thr Met Cys Asn Tyr 435 440
445 Pro Gln Gln Thr Glu Lys Phe Gly Gln
Leu Leu Leu Arg Leu Pro Glu 450 455
460 Ile Arg Ala Ile Ser Met Gln Ala Glu
Glu Tyr Leu Tyr Tyr Lys His 465 470
475 480 Leu Asn Gly Asp Val Pro Tyr Asn Asn
Leu Leu Ile Glu Met Leu His 485
490 495 Ala Lys Arg Ala 500 <210> 9 <211> 3027 <212> DNA <213> Mouse <220> <221> CDS <222> (159)..(1838) <400> 9 tgttttttcc ccctttttct taactttcac taa
ggaaatg agggttactg tagtctgagg 60 tttccttccc aaagtcacaa aatatgacaa gct
gcaatct ttctcacatt caatgatttc 120 tgctgtaagc caaaggactg ccaataattt cgc
taaga atg tct gct agt ttg gat 176
Met Ser Ala Ser Leu Asp
1 5 act gga gat ttt caa gaa ttt ctt aag
cat gga ctt aca gct att gcg 224 Thr Gly Asp Phe Gln Glu Phe Leu Lys
His Gly Leu Thr Ala Ile Ala 10 15
20 tct gca cca ggg tca gag act cgc cac
tcc ccc aaa cgt gag gaa caa 272 Ser Ala Pro Gly Ser Glu Thr Arg His
Ser Pro Lys Arg Glu Glu Gln 25 30
35 ctc cgg gaa aaa cgt gct ggg ctt ccg
gac cga cac cga cgc ccc att 320 Leu Arg Glu Lys Arg Ala Gly Leu Pro
Asp Arg His Arg Arg Pro Ile 40 45
50 ccc gcc cgc agc cgc ctt gtc atg ctg
ccc aaa gtg gag acg gaa gcc 368 Pro Ala Arg Ser Arg Leu Val Met Leu
Pro Lys Val Glu Thr Glu Ala 55 60
65 70 cca gga ctg gtc cga tcg cat ggg gaa
cag ggg cag atg cca gaa aac 416 Pro Gly Leu Val Arg Ser His Gly Glu
Gln Gly Gln Met Pro Glu Asn 75
80 85 atg caa gtg tct caa ttt aaa atg gtg
aat tac tcc tat gat gaa gat 464 Met Gln Val Ser Gln Phe Lys Met Val
Asn Tyr Ser Tyr Asp Glu Asp 90 95
100 ctg gaa gag cta tgt cct gtg tgt ggc
gat aaa gtg tct ggg tac cat 512 Leu Glu Glu Leu Cys Pro Val Cys Gly
Asp Lys Val Ser Gly Tyr His 105 110
115 tac ggt ctc ctc acg tgc gaa agc tgc
aag ggt ttt ttt aag cga act 560 Tyr Gly Leu Leu Thr Cys Glu Ser Cys
Lys Gly Phe Phe Lys Arg Thr 120 125
130 gtc caa aac caa aaa agg tac acg tgc
ata gag aac cag aat tgc caa 608 Val Gln Asn Gln Lys Arg Tyr Thr Cys
Ile Glu Asn Gln Asn Cys Gln 135 140
145 150 att gac aaa acg cag aga aaa cga tgt
ccc tac tgt cga ttc aaa aaa 656 Ile Asp Lys Thr Gln Arg Lys Arg Cys
Pro Tyr Cys Arg Phe Lys Lys 155
160 165 tgt atc gat gtt ggg atg aag ctg gaa
gcc gta aga gcc gac cgc atg 704 Cys Ile Asp Val Gly Met Lys Leu Glu
Ala Val Arg Ala Asp Arg Met 170 175
180 cga ggg ggc aga aat aag ttt ggg cca
atg tac aag aga gac agg gct 752 Arg Gly Gly Arg Asn Lys Phe Gly Pro
Met Tyr Lys Arg Asp Arg Ala 185 190
195 ttg aag cag cag aag aaa gcc ctc att
cga gcc aat gga ctt aag ctg 800 Leu Lys Gln Gln Lys Lys Ala Leu Ile
Arg Ala Asn Gly Leu Lys Leu 200 205
210 gaa gcc atg tct cag gtg atc caa gca
atg ccc tca gac ctg acc tct 848 Glu Ala Met Ser Gln Val Ile Gln Ala
Met Pro Ser Asp Leu Thr Ser 215 220
225 230 gca att cag aac att cat tcc gcc tcc
aaa ggc cta cct ctg agc cat 896 Ala Ile Gln Asn Ile His Ser Ala Ser
Lys Gly Leu Pro Leu Ser His 235
240 245 gta gcc ttg cct ccg aca gac tat gac
aga agt ccc ttt gtc aca tct 944 Val Ala Leu Pro Pro Thr Asp Tyr Asp
Arg Ser Pro Phe Val Thr Ser 250 255
260 ccc att agc atg aca atg cca cct cac
agc agc ctg cat ggt tac caa 992 Pro Ile Ser Met Thr Met Pro Pro His
Ser Ser Leu His Gly Tyr Gln 265 270
275 ccc tat ggt cac ttt cct agt cgg gcc
atc aag tct gag tac cca gac 1040 Pro Tyr Gly His Phe Pro Ser Arg Ala
Ile Lys Ser Glu Tyr Pro Asp 280 285
290 ccc tac tcc agc tca cct gag tca atg
atg ggt tac tcc tac atg gat 1088 Pro Tyr Ser Ser Ser Pro Glu Ser Met
Met Gly Tyr Ser Tyr Met Asp 295 300
305 310 ggt tac cag aca aac tcc ccg gcc agc
atc cca cac ctg ata ctg gaa 1136 Gly Tyr Gln Thr Asn Ser Pro Ala Ser
Ile Pro His Leu Ile Leu Glu 315
320 325 ctt ttg aag tgt gaa cca gat gag cct
caa gtt caa gcg aag atc atg 1184 Leu Leu Lys Cys Glu Pro Asp Glu Pro
Gln Val Gln Ala Lys Ile Met 330 335
340 gct tac ctc cag caa gag cag agt aac
cga aac agg caa gaa aag ctg 1232 Ala Tyr Leu Gln Gln Glu Gln Ser Asn
Arg Asn Arg Gln Glu Lys Leu 345 350
355 agc gca ttt ggg ctt tta tgc aaa atg
gcg gac cag acc ctg ttc tcc 1280 Ser Ala Phe Gly Leu Leu Cys Lys Met
Ala Asp Gln Thr Leu Phe Ser 360 365
370 att gtt gag tgg gcc agg agt agt atc
ttc ttc agg gaa ctg aag gtt 1328 Ile Val Glu Trp Ala Arg Ser Ser Ile
Phe Phe Arg Glu Leu Lys Val 375 380
385 390 gat gac caa atg aag ctg ctt caa aac
tgc tgg agt gag ctc ttg att 1376 Asp Asp Gln Met Lys Leu Leu Gln Asn
Cys Trp Ser Glu Leu Leu Ile 395
400 405 ctc gat cac att tac cga caa gtg gcg
cat ggg aag gaa ggg aca atc 1424 Leu Asp His Ile Tyr Arg Gln Val Ala
His Gly Lys Glu Gly Thr Ile 410 415
420 ttc ctg gtt act gga gaa cac gtg gac
tac tcc acc atc atc tca cac 1472 Phe Leu Val Thr Gly Glu His Val Asp
Tyr Ser Thr Ile Ile Ser His 425 430
435 aca gaa gtc gcg ttc aac aac ctc ctg
agt ctc gca cag gag ctg gtg 1520 Thr Glu Val Ala Phe Asn Asn Leu Leu
Ser Leu Ala Gln Glu Leu Val 440 445
450 gtg agg ctc cgt tcc ctt cag ttc gat
cag cgg gag ttt gta tgt ctc 1568 Val Arg Leu Arg Ser Leu Gln Phe Asp
Gln Arg Glu Phe Val Cys Leu 455 460
465 470 aag ttc ctg gtg ctg ttc agc tca gat
gtg aag aac ctg gag aac ctg 1616 Lys Phe Leu Val Leu Phe Ser Ser Asp
Val Lys Asn Leu Glu Asn Leu 475
480 485 cag ctg gtg gaa ggt gtc caa gag cag
gtg aat gcc gcc ctg ctg gac 1664 Gln Leu Val Glu Gly Val Gln Glu Gln
Val Asn Ala Ala Leu Leu Asp 490 495
500 tac acg gtt tgc aac tac cca caa cag
act gag aaa ttc gga cag cta 1712 Tyr Thr Val Cys Asn Tyr Pro Gln Gln
Thr Glu Lys Phe Gly Gln Leu 505 510
515 ctt ctt cgg cta ccc gag atc cgg gca
atc agc aag cag gca gaa gac 1760 Leu Leu Arg Leu Pro Glu Ile Arg Ala
Ile Ser Lys Gln Ala Glu Asp 520 525
530 tac ctg tac tat aag cac gtg aac ggg
gat gtg ccc tat aat aac ctc 1808 Tyr Leu Tyr Tyr Lys His Val Asn Gly
Asp Val Pro Tyr Asn Asn Leu 535 540
545 550 ctc att gag atg ctg cat gcc aaa aga
gcc taagtcccca cccctggaag 1858 Leu Ile Glu Met Leu His Ala Lys Arg
Ala 555
560 cttgctctag gaacacagac tggaaggaga aga
ggaggac gatgacagaa acacaatact 1918 ctgaactgct ccaagcaatg ctaattataa act
tggttta aagacactga attttaaaag 1978 cataataatt aaatacctaa tagcaaataa atg
atatatc agggtatttg tactgcaaac 2038 tgtgaatcaa aggctgtatg aatcaaagga ttc
atatgaa agacattgta atggggtgga 2098 ttgaacttac agatggagac caataccaca gca
gaataaa aatggacaga acaatccttg 2158 tatatttaaa ctaatctgct attaagaaat tca
gaagttg atctctgtta ttaattggat 2218 ttgtcctgaa ttactccgtg gtgacgctga aca
actcaag aatacatggg ctgtgcttgg 2278 cagcccctcc ccatccctcc caccaccacc acc
cccaccc ccacaaggcc ctataccttc 2338 tgacctgtga gccctgaagc tattttaagg act
tctgttc agccataccc agtagtagct 2398 ccactaaacc atgatttctg gatgtctgtg tct
tagacct gccaacagct aataagaaca 2458 atgtataaat atgtcagctt gcattttaaa tat
gtgctga agtttgtttt gtcgtgtgtt 2518 cgtaattaaa aagaaaacgg gcagtaaccc tct
tctatat aagcattagt taatattaag 2578 ggaaatcaaa caaatctaag ccaatactcc caa
caagcaa gttagatctt acttctgctg 2638 ctgttgctga aatgtggctt tggcatggtt ggg
tttcata aaactttttg gccaagaggc 2698 ttgttagtat acatccatct gtttagtcat caa
ggtttgt agttcactta aaaaaaaata 2758 aaccactaga catcttttgc tgaatgtcaa ata
gtcacag tctaagtagc caaaaagtca 2818 aagcgtgtta aacattgcca aatgaaggaa agg
gtgagct gcaaagggga tggttcgagg 2878 ttcattccag ttgtgacccg agcgtcccca aaa
cctggga tgcaaagaca gtgattctgc 2938 atatggcctg gaaagacagg aaagccagtc tcc
tacaaag gggaatggaa gatcctggcc 2998 tctaagtcat agaccaaagt ctgctgtag
3027 <210> 10 <211> 560 <212> PRT <213> Mouse <400> 10 Met Ser Ala Ser Leu Asp Thr Gly Asp
Phe Gln Glu Phe Leu Lys His 1 5
10 15 Gly Leu Thr Ala Ile Ala Ser Ala Pro
Gly Ser Glu Thr Arg His Ser 20 25
30 Pro Lys Arg Glu Glu Gln Leu Arg Glu
Lys Arg Ala Gly Leu Pro Asp 35 40
45 Arg His Arg Arg Pro Ile Pro Ala Arg
Ser Arg Leu Val Met Leu Pro 50 55
60 Lys Val Glu Thr Glu Ala Pro Gly Leu
Val Arg Ser His Gly Glu Gln 65 70
75 80 Gly Gln Met Pro Glu Asn Met Gln Val
Ser Gln Phe Lys Met Val Asn 85
90 95 Tyr Ser Tyr Asp Glu Asp Leu Glu Glu
Leu Cys Pro Val Cys Gly Asp 100 105
110 Lys Val Ser Gly Tyr His Tyr Gly Leu
Leu Thr Cys Glu Ser Cys Lys 115 120
125 Gly Phe Phe Lys Arg Thr Val Gln Asn
Gln Lys Arg Tyr Thr Cys Ile 130 135
140 Glu Asn Gln Asn Cys Gln Ile Asp Lys
Thr Gln Arg Lys Arg Cys Pro 145 150
155 160 Tyr Cys Arg Phe Lys Lys Cys Ile Asp
Val Gly Met Lys Leu Glu Ala 165
170 175 Val Arg Ala Asp Arg Met Arg Gly Gly
Arg Asn Lys Phe Gly Pro Met 180 185
190 Tyr Lys Arg Asp Arg Ala Leu Lys Gln
Gln Lys Lys Ala Leu Ile Arg 195 200
205 Ala Asn Gly Leu Lys Leu Glu Ala Met
Ser Gln Val Ile Gln Ala Met 210 215
220 Pro Ser Asp Leu Thr Ser Ala Ile Gln
Asn Ile His Ser Ala Ser Lys 225 230
235 240 Gly Leu Pro Leu Ser His Val Ala Leu
Pro Pro Thr Asp Tyr Asp Arg 245
250 255 Ser Pro Phe Val Thr Ser Pro Ile Ser
Met Thr Met Pro Pro His Ser 260 265
270 Ser Leu His Gly Tyr Gln Pro Tyr Gly
His Phe Pro Ser Arg Ala Ile 275 280
285 Lys Ser Glu Tyr Pro Asp Pro Tyr Ser
Ser Ser Pro Glu Ser Met Met 290 295
300 Gly Tyr Ser Tyr Met Asp Gly Tyr Gln
Thr Asn Ser Pro Ala Ser Ile 305 310
315 320 Pro His Leu Ile Leu Glu Leu Leu Lys
Cys Glu Pro Asp Glu Pro Gln 325
330 335 Val Gln Ala Lys Ile Met Ala Tyr Leu
Gln Gln Glu Gln Ser Asn Arg 340 345
350 Asn Arg Gln Glu Lys Leu Ser Ala Phe
Gly Leu Leu Cys Lys Met Ala 355 360
365 Asp Gln Thr Leu Phe Ser Ile Val Glu
Trp Ala Arg Ser Ser Ile Phe 370 375
380 Phe Arg Glu Leu Lys Val Asp Asp Gln
Met Lys Leu Leu Gln Asn Cys 385 390
395 400 Trp Ser Glu Leu Leu Ile Leu Asp His
Ile Tyr Arg Gln Val Ala His 405
410 415 Gly Lys Glu Gly Thr Ile Phe Leu Val
Thr Gly Glu His Val Asp Tyr 420 425
430 Ser Thr Ile Ile Ser His Thr Glu Val
Ala Phe Asn Asn Leu Leu Ser 435 440
445 Leu Ala Gln Glu Leu Val Val Arg Leu
Arg Ser Leu Gln Phe Asp Gln 450 455
460 Arg Glu Phe Val Cys Leu Lys Phe Leu
Val Leu Phe Ser Ser Asp Val 465 470
475 480 Lys Asn Leu Glu Asn Leu Gln Leu Val
Glu Gly Val Gln Glu Gln Val 485
490 495 Asn Ala Ala Leu Leu Asp Tyr Thr Val
Cys Asn Tyr Pro Gln Gln Thr 500 505
510 Glu Lys Phe Gly Gln Leu Leu Leu Arg
Leu Pro Glu Ile Arg Ala Ile 515 520
525 Ser Lys Gln Ala Glu Asp Tyr Leu Tyr
Tyr Lys His Val Asn Gly Asp 530 535
540 Val Pro Tyr Asn Asn Leu Leu Ile Glu
Met Leu His Ala Lys Arg Ala 545 550
555 560
[Sequence list] SEQUUE
NCE LISTING <110> Sumitomo Pharmaceuticals Compa
ny Limited <110> Tularik Inc. <120> CYP7 Promoter-Binding Factors <130> 162467 <140><141><150> 60 / 067,708 <151> 1997-12-08 <160> 10 <170> PatentIn Ver. 2.0 <210> 1 <211> 3115 <212> DNA <213> Human <220><221> CDS <222> (210). . (1694) <400> 1 gaaaaaaattagagttccag ggaaaaagact tgc
ttgtaac tttattgaatt ctggatttttt 60 tttttttcctt tgctttttttct taactttcac taa
gggttact tgtaggtctga tgtgtcccttc 120 ccaaggccac gaattttgac aagctgcact ttt
ctttttgc tcaatgattt ctgcttttag 180 ccaaaagaact gcctatatatt tcactataaga atg
tct tct aat tca ga t act ggg 233 Met
Ser Ser Asn Ser Asp Thr Gly 1
5 tat tta caa gag tct tta aag cac gga
ctt aca cct att gtg tct caa 281 Asp Leu Gln Glu Ser Leu Lys His Gly
Leu Thr Pro Ile Val Ser Gln 10 15
20 ttt aaa atg gtg aat tac tcc tat gat
gaa gat ctg gaa gag ctt tgt 329 Phe Lys Met Val Asn Tyr Ser Tyr Asp
Glu Asp Leu Glu Glu Leu Cys 25 30
35 40 ccc gtg tgt gga gaa aaa gtg tct ggg
tac cat tat ggg ctc ctc acc 377 Pro Val Cys Gly Asp Lys Val Ser Gly
Tyr His Tyr Gly Leu Leu Thr 45
50 55 tgt ga agc tgc aag gga ttt ttt aag
cga aca gtc ca aat aat aaa 425 Cys Glu Ser Cys Lys Gly Phe Phe Lys
Arg Thr Val Gln Asn Asn Lys 60 65
70 agg tac aca tgt at ga aaa aac cag aac
tgc caa att gac aaa cacag 473 Arg Tyr Thr Cys Ile Glu Asn Gln Asn
Cys Gln Ile Asp Lys Thr Gln 75 80
85 aga aag cgt tgt cct tact tgt cgt ttt
aaa aaa tgt cta agt gtt gga 521 Arg Lys Arg Cys Pro Tyr Cys Arg Phe
Gln Lys Cys Leu Ser Val Gly 90 95
100 atg aag cta gaa gct gta agg gcc gac
cga atg cgt gga gga agg aat 569 Met Lys Leu Glu Ala Val Arg Ala Asp
Arg Met Arg Gly Gly Arg Asn 105 110
115 120 aag ttt ggg cca atg tac aag aga gac
agg gcc ctg aag caag aaa 617 Lys Phe Gly Pro Met Tyr Lys Arg Asp
Arg Ala Leu Lys Gln Gln Lys 125
130 135 aaa gcc ctc atc cga gcc aat gga ctt
aag cta gaa gcc atg tct cag 665 Lys Ala Leu Ile Arg Ala Asn Gly Leu
Lys Leu Glu Ala Met Ser Gln 140 145
150 gtg atc caa gct atg ccc tct gac ctg
acc att tcc tct gca attca 713 Val Ile Gln Ala Met Pro Ser Asp Leu
Thr Ile Ser Ser Ala Ile Gln 155 160
165 aac atc cac tct gcc tcc aaa ggc cta
cct ctg aac cat gct gcc ttg 761 Asn Ile His Ser Ala Ser Lys Gly Leu
Pro Leu Asn His Ala Ala Leu 170 175
180 cct cct aca gac tat gac aga agt ccc
ttt gta aca tcc ccc att agc 809 Pro Pro Thr Asp Tyr Asp Arg Ser Pro
Phe Val Thr Ser Ser Pro Ile Ser 185 190
195 200 atg aca atg ccc cct cac ggc agc ctg
caa ggt tac caa aca tat ggc 857 Met Thr Met Pro Pro His Gly Ser Leu
Gln Gly Tyr Gln Thr Tyr Gly 205
210 215 cac ttt cct agc cgg gcc atc aag tct
gag tac cca gac ccc tat acc 905 His His Pro Ser Ser Arg Ala Ile Lys Ser
Glu Tyr Pro Asp Pro Tyr Thr 220 225
230 agc tca ccc gag tcc ata atg ggc tat
tca tat atg gat ag tac cag 953 Ser Ser Pro Glu Ser Ile Met Gly Tyr
Ser Tyr Met Asp Ser Tyr Gln 235 240
245 acg agc tct cca gca agc atc cca cat
ctg ata ctg gaa ctt ttg aag 1001 Thr Ser Ser Pro Ala Ser Ile Pro His
Leu Ile Leu Glu Leu Leu Lys 250 255
260 tgt gag cca gat gag cct caa gtc cag
gct aaa atc atg gcc tat ttg 1049 Cys Glu Pro Asp Glu Pro Gln Val Gln
Ala Lys Ile Met Ala Tyr Leu 265 270
275 280 cag caa gag cag gct aac cga agc aag
cac gaa aag ctg agc acc ttt 1097 Gln Gln Glu Gln Ala Asn Arg Ser Lys
His Glu Lys Leu Ser Thr Phe 285
290 295 ggg ctt atg tgc aaa atg gca gait caa
act ctc ttc tcc att gtc gag 1145 Gly Leu Met Cys Lys Met Ala Asp Gln
Thr Leu Phe Ser Ile Val Glu 300 305
310 tgg gcc agg agt agt atc ttc ttc aga
gaa ctt aag gtt gat gac caa 1193 Trp Ala Arg Ser Ser Ile Phe Phe Arg
Glu Leu Lys Val Asp Asp Gln 315 320
325 atg aag ctg ctt cag aac tgc tgg agt
gag ctc tta atc ctc gac cac 1241 Met Lys Leu Leu Gln Asn Cys Trp Ser
Glu Leu Leu Ile Leu Asp His 330 330 335
340 att tac cga caa gtg gta cat gga aag
gaa gga tcc atc ttc ctg gtt 1289 Ile Tyr Arg Gln Val Val His Gly Lys
Glu Gly Ser Ile Phe Leu Val 345 350
355 360 act ggg caa ca gtg gac tat tcc data
ata gca tca caa gcc gga gcc 1337 Thr Gly Gln Gln Val Asp Tyr Ser Ile
Ile Ala Ser Gln Ala Gly Ala 365
370 375 acc ctc aac aac ctc atg agt cat gca
cag gag tta gtg gca aaa ctt 1385 Thr Leu Asn Asn Leu Met Ser His Ala
Gln Glu Leu Val Ala Lys Leu 380 385
390 cgt tct ctc cag ttt gaa caa cga gag
ttc gta tgt ctg aaa ttc ttg 1433 Arg Ser Leu Gln Phe Asp Gln Arg Glu
Phe Val Cys Leu Lys Phe Leu 395 400
405 gtg ctc ttt agt tta gat gtc aaa aac
ctt ga aac ttc cag ctg gta 1481 Val Leu Phe Ser Leu Asp Val Lys Asn
Leu Glu Asn Phe Gln Leu Val 410 415
420 gaa ggt gtc cag gaa caa gtc aat gcc
gcc ctg ctg gac tac aca atg 1529 Glu Gly Val Gln Glu Gln Val Asn Ala
Ala Leu Leu Asp Tyr Thr Met 425 430
435 440 tgt aac tac ccg cag cag ca gag aaa
ttt gga cag cta ctt ctt cga 1577 Cys Asn Tyr Pro Gln Gln Thr Glu Lys
Phe Gly Gln Leu Leu Leu Arg 445
450 455 cta ccc ga atc cgg gcc atc agt atg
cag gct gaa gaa tac ctc tac 1625 Leu Pro Glu Ile Arg Ala Ile Ser Met
Gln Ala Glu Glu Tyr Leu Tyr 460 465
470 tac aag cac ctg aac ggg gat gtg ccc
tat aat aac ctt ctc att ga 1673 Tyr Lys His Leu Asn Gly Asp Val Pro
Tyr Asn Asn Leu Leu Ile Glu 475 480
485 atg ttg cat gcc aaa aga gca taatttac
aa cccctaggag ctctgctttc 1724 Met Leu His Ala Lys Arg Ala 490 495 aaaacaaaaaa gagttgggg gaggtgggggggggg
gaagaag aacaggaaga aaaaaaagtac 1784 tctgaactgc tccaaagcaac gcttaattaaa aac
ttgcttt aaaattattg aatttaaaaa 1844 gggataataa tcaaattact aatagcaaat aaa
tgagtta tcagggtatt tgtattgcaa 1904 actgtgaatc aaaggcttca cagccccaga gga
ttccata taaaagacat tgtaatggag 1964 tggattgaac tcacagatgg ataccaacac ggt
caagaaga aaaacgggaca gaacggttct 2024 tgtattatta aactgatctc cactatgaag aaa
tttagga actaattctta ttatattaggc 2084 ttatacagcg ggggattttga gcttacagga ttc
ctccatg gtaaagctga actgaaaacaa 2144 ttctcaagaa tgcatcagct gtacctacaa tag
cccctcc ctcttcctttt gaaggcccga 2204 gcaccctctgc cctgtggtca ccgaatctgt act
aaggcc tgtgttcagc caccaccagt 2264 ggtagctcca ccaatcatg aacagcctaa ttt
tgagtgt ctgttgtctta gacctgcaa 2324 cagctaatag gaatttctat taatatgtta gct
tgccatt ttaatatgt tctgagggtt 2384 gttttgtctc gtgttcatga tgttaagaaaa atg
caggcag tatccctcat cttatgtag 2444 tgtgaattta tattaagggga aatgactaca aac
tttcaaaa gcaatgctc catagctaaa 2504 gcaacttaga ccttattttct gctactgttg ctg
aaagttg gctttggcat tgttggattt 2564 cataaaaaaat ttctggcaggg aagtcttgtt agt
atacatc agtctttttttc atcatccaag 2624 tttttagttttc attaaaaaaat acaacattaaa aca
catttttg ctaggatgtc aaatagtcac 2684 agttctaagt agttggaaac aaaattgacg cat
gttaatc tatgcaagaga aaaaggaaag 2744 gatagggtga tgtattgact caaggttcat tct
tgctgca attgaacatc ctcaagagtt 2804 gggatggaaa tggtgattttt tacatggtgtc ctg
gaagat attaaagtaa ttcaaattctt 2864 ccccaaaaagg gaagaggagaga gagtgactact gac
cttttta agtcatagac caagtctgc 2924 tgttagacaa atatgggggg acaaaagaatc gca
aattcttt caatgacta tattagcatt 2984 tattaacatg cgatgccaca ggttatgaaaag tct
tgccttta tttcacaatt ttaaaaggta 3044 gctgtgcaga tgtggattaca catttgtttta aaa
taaagttattaacttt aaagtcaaaa 3104 aaaaaaaaaaaa a
3115 <210> 2 <211> 495 <212> PRT <213> Human <400> 2 Met Ser Ser Asn Ser Asp Thr Gly Asp
Leu Gln Glu Ser Leu Lys His 15
10 15 Gly Leu Thr Pro Ile Val Ser Gln Phe
Lys Met Val Asn Tyr Ser Tyr 20 25
30 Asp Glu Asp Leu Glu Glu Leu Cys Pro
Val Cys Gly Asp Lys Val Ser 35 40
45 Gly Tyr His Tyr Gly Leu Leu Thr Cys
Glu Ser Cys Lys Gly Phe Phe 50 55
60 Lys Arg Thr Val Gln Asn Asn Lys Arg
Tyr Thr Cys Ile Glu Asn Gln 65 70
7580 Asn Cys Gln Ile Asp Lys Thr Gln Arg
Lys Arg Cys Pro Tyr Cys Arg 85
90 95 Phe Gln Lys Cys Leu Ser Val Gly Met
Lys Leu Glu Ala Val Arg Ala 100 105
110 Asp Arg Met Arg Gly Gly Arg Asn Lys
Phe Gly Pro Met Tyr Lys Arg 115 120
125 Asp Arg Ala Leu Lys Gln Gln Lys Lys
Ala Leu Ile Arg Ala Asn Gly 130 135
140 Leu Lys Leu Glu Ala Met Ser Gln Val
Ile Gln Ala Met Pro Ser Asp 145 150
155 160 Leu Thr Ile Ser Ser Ala Ile Gln Asn
Ile His Ser Ala Ser Lys Gly 165
170 175 Leu Pro Leu Asn His Ala Ala Leu Pro
Pro Thr Asp Tyr Asp Arg Ser 180 185
190 Pro Phe Val Thr Thr Ser Pro Ile Ser Met
Thr Met Pro Pro His His Gly Ser 195 200
205 Leu Gln Gly Tyr Gln Thr Tyr Gly His
Phe Pro Ser Arg Ala Ile Lys 210 215
220 Ser Glu Tyr Pro Asp Pro Tyr Thr Ser
Ser Pro Glu Ser Ile Met Gly 225 230
235 240 Tyr Ser Tyr Met Asp Ser Tyr Gln Thr
Ser Ser Pro Ala Ser Ile Pro 245
250 255 His Leu Ile Leu Glu Leu Leu Lys Cys
Glu Pro Asp Glu Pro Gln Val 260 265
270 Gln Ala Lys Ile Met Ala Tyr Leu Gln
Gln Glu Gln Ala Asn Arg Ser 275 280
285 Lys His Glu Lys Leu Ser Thr Phe Gly
Leu Met Cys Lys Met Ala Asp 290 295
300 Gln Thr Leu Phe Ser Ile Val Glu Trp
Ala Arg Ser Ser Ile Phe Phe 305 310
315 320 Arg Glu Leu Lys Val Asp Asp Gln Met
Lys Leu Leu Gln Asn Cys Trp 325
330 335 Ser Glu Leu Leu Ile Leu Asp His Ile
Tyr Arg Gln Val Val His Gly 340 345
350 Lys Glu Gly Ser Ile Phe Leu Val Thr
Gly Gln Gln Val Asp Tyr Ser 355 360
365 Ile Ile Ala Ser Gln Ala Gly Ala Thr
Leu Asn Asn Leu Met Ser His 370 375
380 Ala Gln Glu Leu Val Ala Lys Leu Arg
Ser Leu Gln Phe Asp Gln Arg 385 390
395 400 Glu Phe Val Cys Leu Lys Phe Leu Val
Leu Phe Ser Leu Asp Val Lys 405
410 415 Asn Leu Glu Asn Phe Gln Leu Val Glu
Gly Val Gln Glu Gln Val Asn 420 425
430 Ala Ala Leu Leu Asp Tyr Thr Met Cys
Asn Tyr Pro Gln Gln Thr Glu 435 440
445 Lys Phe Gly Gln Leu Leu Leu Arg Leu
Pro Glu Ile Arg Ala Ile Ser 450 455
460 Met Gln Ala Glu Glu Tyr Leu Tyr Tyr
Lys His Leu Asn Gly Asp Val 465 470
475 480 Pro Tyr Asn Asn Leu Leu Ile Glu Met
Leu His Ala Lys Arg Ala 485
490 495 <210> 3 <211> 1245 <212> DNA <213> Human <220><221> CDS <222> (202). . (1170) <400> 3 cggccgcgtc gacgggaaaa ctttgctttgta act
tttatgaa ttctggattt ttttttttttcc 60 ttttgttttttt cttaacttttc actaagggtt act
gtagtct gatgtgtcct tcccaaggcc 120 acgaaaatttt acaagctgca cttttcttttt gct
caatgat tctctgcttta agccaaaagaa 180 ctgcctata tttcactaag a atg tct tct
aat tca gat act ggg gat tta 231 Met Ser Ser
Asn Ser Asp Thr Gly Asp Leu 1
5 10 caa gag tct tta aag cac gga ctt aca
cct att gtg tct ca ttt aaa 279 Gln Glu Ser Leu Lys His Gly Leu Thr
Pro Ile Val Ser Gln Phe Lys 15
20 25 atg gtg aat tac tcc tat ga t ga ga t
ctg gaa gag ctt tgt ccc gtg 327 Met Val Asn Tyr Ser Tyr Asp Glu Asp
Leu Glu Glu Leu Cys Pro Val 30 35
40 tgt gga gaa aaa gtg tct ggg tac cat
tat ggg ctc ctc acc tgt gaa 375 Cys Gly Asp Lys Val Ser Gly Tyr His
Tyr Gly Leu Leu Thr Cys Glu 45 50
55 agc tgc aag gga ttt ttt aag cga aca
gtc caa aat aat aaa agg tac 423 Ser Cys Lys Gly Phe Phe Lys Arg Thr
Val Gln Asn Asn Lys Arg Tyr 60 65
70 aca tgt ata gaa aac cag aac tgc caa
att gac aaa aca cag aga aag 471 Thr Cys Ile Glu Asn Gln Asn Cys Gln
Ile Asp Lys Thr Gln Arg Lys 75 80
85 90 cgt tgt cct tact tgt cgt ttt aaa
tgt cta agt gtt gga atg aag 519 Arg Cys Pro Tyr Cys Arg Phe Gln Lys
Cys Leu Ser Val Val Gly Met Lys 95
100 105 cta gaa gct gta agg gcc gac cga atg
cgt gga gga agg aat aag ttt 567 Leu Glu Ala Val Arg Ala Asp Arg Met
Arg Gly Gly Arg Asn Lys Phe 110 115
120 ggg cca atg tac aag ag a gac agg gcc
ctg agag caa aaa aaa gcc 615 Gly Pro Met Tyr Lys Arg Asp Arg Ala
Leu Lys Gln Gln Lys Lys Ala 125 130
135 ctc atc cga gcc aat gga ctt aag cta
gaa gcc atg tct cag gtt gat 663 Leu Ile Arg Ala Asn Gly Leu Lys Leu
Glu Ala Met Ser Gln Val Asp 140 145
150 gac caa atg aag ctg ctt cag aac tgc
tgg agt gag ctc tta atc ctc 711 Asp Gln Met Lys Leu Leu Gln Asn Cys
Trp Ser Glu Leu Leu Ile Leu 155 160
165 170 gac cac att tac cga caa gtg gta cat
gga aag gaa gga tcc atc ttc 759 Asp His Ile Tyr Arg Gln Val Val His
Gly Lys Glu Gly Ser Ile Phe 175
180 185 ctg gtt act ggg caa caa gtg gac tat
tcc ata at gca tca ca gcc 807 Leu Val Thr Gly Gln Gln Val Asp Tyr
Ser Ile Ile Ala Ser Gln Ala 190 195
200 gga gcc acc ctc aac aac ctc atg agt
cat gca cag gag tta gtg gca 855 Gly Ala Thr Leu Asn Asn Leu Met Ser
His Ala Gln Glu Leu Val Ala 205 210
215 aaa ctt cgt tct ctc cag ttt gaa caa
cga gag ttc gta tgt ctg aaa 903 Lys Leu Arg Ser Leu Gln Phe Asp Gln
Arg Glu Phe Val Cys Leu Lys 220 225
230 ttc ttg gtg ctc ttt agt tta gat gtc
aaa aac ctt gaa act tc cag 951 Phe Leu Val Leu Phe Ser Leu Asp Val
Lys Asn Leu Glu Asn Phe Gln 235 240
245 250 ctg gta ga ggt gtc cag gaa caa gtc
aat gcc gcc ctg ctg gactac 999 Leu Val Glu Gly Val Gln Glu Gln Val
Asn Ala Ala Leu Leu Asp Tyr 255
260 265 aca atg tgt aac tac ccg cag cag aca
gag aaa ttt cga cag ctactt 1047 Thr Met Cys Asn Tyr Pro Gln Gln Thr
Glu Lys Phe Arg Gln Leu Leu 270 275
280 ctt cga cta ccc ga atc cgg gcc atc
agt atg cag gct gaa gatac 1095 Leu Arg Leu Pro Glu Ile Arg Ala Ile
Ser Met Gln Ala Glu Glu Tyr 285 290
295 ctc tact tac aag cac ctg aac ggg gat
gtg ccc tat aat aac ctt ctc 1143 Leu Tyr Tyr Lys His Leu Asn Gly Asp
Val Pro Tyr Asn Asn Leu Leu 300 305
310 att ga atg ttg cat gcc aaa aga gca
tagttacaaa cccctaggag 1190 Ile Glu Met Leu His Ala Lys Arg Ala 315 320 ctctgctttc aaaacaaaaaa gagattgggggggag
tgggggag ggggaagaag aacag 1245 <210> 4 <211> 323 <212> PRT <213> Human <400> 4 Met Ser Ser Asn Ser Asp Thr Gly Asp
Leu Gln Glu Ser Leu Lys His 15
10 15 Gly Leu Thr Pro Ile Val Ser Gln Phe
Lys Met Val Asn Tyr Ser Tyr 20 25
30 Asp Glu Asp Leu Glu Glu Leu Cys Pro
Val Cys Gly Asp Lys Val Ser 35 40
45 Gly Tyr His Tyr Gly Leu Leu Thr Cys
Glu Ser Cys Lys Gly Phe Phe 50 55
60 Lys Arg Thr Val Gln Asn Asn Lys Arg
Tyr Thr Cys Ile Glu Asn Gln 65 70
7580 Asn Cys Gln Ile Asp Lys Thr Gln Arg
Lys Arg Cys Pro Tyr Cys Arg 85
90 95 Phe Gln Lys Cys Leu Ser Val Gly Met
Lys Leu Glu Ala Val Arg Ala 100 105
110 Asp Arg Met Arg Gly Gly Arg Asn Lys
Phe Gly Pro Met Tyr Lys Arg 115 120
125 Asp Arg Ala Leu Lys Gln Gln Lys Lys
Ala Leu Ile Arg Ala Asn Gly 130 135
140 Leu Lys Leu Glu Ala Met Ser Gln Val
Asp Asp Gln Met Lys Leu Leu 145 150
155 160 Gln Asn Cys Trp Ser Glu Leu Leu Ile
Leu Asp His Ile Tyr Arg Gln 165
170 175 Val Val His His Gly Lys Glu Gly Ser Ile
Phe Leu Val Thr Gly Gln Gln 180 185
190 Val Asp Tyr Ser Ile Ile Ala Ser Gln
Ala Gly Ala Thr Leu Asn Asn 195 200
205 Leu Met Ser His His Ala Gln Glu Leu Val
Ala Lys Leu Arg Ser Leu Gln 210 215
220 Phe Asp Gln Arg Glu Phe Val Cys Leu
Lys Phe Leu Val Val Leu Phe Ser 225 230
235 240 Leu Asp Val Lys Asn Leu Glu Asn Phe
Gln Leu Val Glu Gly Val Gln 245
250 255 Glu Gln Val Asn Ala Ala Leu Leu Asp
Tyr Thr Met Cys Asn Tyr Pro 260 265
270 Gln Gln Thr Glu Lys Phe Arg Gln Leu
Leu Leu Arg Leu Pro Glu Ile 275 280
285 Arg Ala Ile Ser Met Gln Ala Glu Glu
Tyr Leu Tyr Tyr Lys His Leu 290 295
300 Asn Gly Asp Val Pro Tyr Asn Asn Leu
Leu Ile Glu Met Leu His Ala 305 310
315 320 Lys Arg Ala <210> 5 <211> 3251 <212> DNA <213> Human <220><221> CDS <222> (208). . (1830) <400> 5 cgcggccgcg tcgaccaggg aaaagacttg ctt
gtaactt tatgaattct ggatttttttt 60 tttttccttttg cttttttctta actttcacta agg
gttactg tagtctgatg tgtcctttccc 120 aaggccacga aatttgacaa gctgcacttt tct
tttgctc aatgattttct gctttaagcc 180 aaaagaactgc ctataattttc actaaga atg tc
t tct aat tca gat act ggg gat 234 Met Se
r Ser Asn Ser Asp Thr Gly Asp 1
5 tta caa gag tct tta aag cac gga ctt
aca cct att ggt gct ggg ctt 282 Leu Gln Glu Ser Leu Lys His Gly Leu
Thr Pro Ile Gly Ala Gly Leu 10 15
20 25 ccg gac cga cac gga tcc ccc atc ccc
gcc cgc ggt cgc ctt gtc atg 330 Pro Asp Arg His Gly Ser Pro Ile Pro
Ala Arg Gly Arg Leu Val Met 30
35 40 ctg ccc aaa gtg gag acg gaa gcc ctg
gga ctg gct cga tcg cat ggg 378 Leu Pro Lys Val Glu Thr Glu Ala Leu
Gly Leu Ala Arg Ser His His Gly 4550
55 gaa cag ggc cag atg ccg gaaac atg
ca ggtg tct caattt aaa atg 426 Glu Gln Gly Gln Met Pro Glu Asn Met
Gln Val Ser Ser Gln Phe Lys Met 60 65
70 gtg aat tac tcc tat gat ga ga ctg
gaa gaa ctt tgt ccc ggtg tgt 474 Val Asn Tyr Ser Tyr Asp Glu Asp Leu
Glu Glu Leu Cys Pro Val Cys 75 80
85 gga gaa aaa gtg tct ggg tac cat tat
ggg ctc ctc acc tgt ga agc 522 Gly Asp Lys Val Ser Gly Tyr His Tyr
Gly Leu Leu Thr Cys Glu Ser 90 95
100 105 tgc aag gga ttt ttt aag cga aca gtc
caa aat aat aaa agg tac aca 570 Cys Lys Gly Phe Phe Lys Arg Thr Val
Gln Asn Asn Lys Arg Tyr Thr 110
115 120 tgt at ga aaa aac cag aac tgc aaa att
gac aaa aca cag aga aag cgt 618 Cys Ile Glu Asn Gln Asn Cys Gln Ile
Asp Lys Thr Gln Arg Lys Arg 125 130
135 tgt cct tact tgt cgt ttt caaaaa tgt
cta agt gtt gga atg aag cta 666 Cys Pro Tyr Cys Arg Phe Gln Lys Cys
Leu Ser Val Gly Met Lys Leu 140 145
150 ga gct gta agg gcc gac cga atg cgt
gga gga agg aat aag ttt ggg 714 Glu Ala Val Arg Ala Asp Arg Met Arg
Gly Gly Arg Asn Lys Phe Gly 155 160
165 cca atg tac aag agga gac agg gcc ctg
aag caa cag aaa aaa gcc ctc 762 Pro Met Tyr Lys Arg Asp Arg Ala Leu
Lys Gln Gln Lys Lys Ala Leu 170 175
180 185 atc cga gcc aat gga ctt aag cta gaa
gcc atg tct cag gtg atc caa 810 Ile Arg Ala Asn Gly Leu Lys Leu Glu
Ala Met Ser Gln Val Ile Gln 190
195 200 gct atg ccc tct gac ctg acc att tcc
tct gca att caa aac atc cac 858 Ala Met Pro Ser Asp Leu Thr Ile Ser
Ser Ala Ile Gln Asn Ile His 205 205
215 tct gcc tcc aaa ggg cta cct ctg aac
cat gct gcc ttg cct cct aca 906 Ser Ala Ser Lys Gly Leu Pro Leu Asn
His Ala Ala Leu Pro Pro Thr 220 225
230 gac tat gac aga agt ccc ttt gta aca
tcc ccc att agc atg aca atg 954 Asp Tyr Asp Arg Ser Pro Pro He Val Val Thr
Ser Pro Ile Ser Met Thr Met 235 240
245 ccc cct cac ggc agc ctg caa ggt tac
caa aca tat ggg cac ttt cct 1002 Pro Pro His Gly Ser Leu Gln Gly Tyr
Gln Thr Tyr Gly His Phe Pro 250 255
260 265 agc cgg gcc atc aag tct gag tac cca
gac ccc tat acc agc tca ccc 1050 Ser Arg Ala Ile Lys Ser Glu Tyr Pro
Asp Pro Tyr Thr Ser Ser Pro 270
275 280 gag tcc atat ggc tat tca tat atg
gat agt tac cag acg agc tct 1098 Glu Ser Ile Met Gly Tyr Ser Tyr Met
Asp Ser Tyr Gln Thr Ser Ser 285 290
295 cca gca agc atc cca cat ctg ata ctg
gaa ctt ttg aag tgt gag cca 1146 Pro Ala Ser Ile Pro His Leu Ile Leu
Glu Leu Leu Lys Cys Glu Pro 300 305
310 gat gag cct caa gtc cag gct aaa atc
atg gcc tat ttg cag gag 1194 Asp Glu Pro Gln Val Gln Ala Lys Ile
Met Ala Tyr Leu Gln Gln Glu 315 320
325 cag gct aac cga agc aag cac gaa aag
ctg agc acc ttt ggg ctt atg 1242 Gln Ala Asn Arg Ser Lys His Glu Lys
Leu Ser Thr Phe Gly Leu Met 330 335
340 345 tgc aaa atg gca gat caa act ctc ttc
tcc att gtc gag tgg gcc agg 1290 Cys Lys Met Ala Asp Gln Thr Leu Phe
Ser Ile Val Glu Trp Ala Arg 350
355 360 agt agt atc ttc ttc aga gaa ctt aag
gtt gat gac caatg aag ctg 1338 Ser Ser Ile Phe Phe Arg Glu Leu Lys
Val Asp Asp Gln Met Lys Leu 365 370
375 ctt cag aac tgc tgg agt gag ctc tta
atc ctc gac cac att tac cga 1386 Leu Gln Asn Cys Trp Ser Glu Leu Leu
Ile Leu Asp His Ile Tyr Arg 380 385
390 caa gtg gta cat gga aag gaa gga tcc
atc ttc ctg gtt act ggg caa 1434 Gln Val Val His His Gly Lys Glu Gly Ser
Ile Phe Leu Val Thr Gly Gln 395 400
405 caa gtg gac tat tcc ata ata gca tca
caa gcc gga gcc acc ctc aac 1482 Gln Val Asp Tyr Ser Ile Ile Ala Ser
Gln Ala Gly Ala Thr Leu Asn 410 415
420 425 aac ctc atg agt cat gca cag gag tta
gtg gca aaa ctt cgt tct ctc 1530 Asn Leu Met Ser His Ala Gln Glu Leu
Val Ala Lys Leu Arg Ser Leu 430
435 440 cag ttt gat caa cga gag ttc gta tgt
ctg aaa ttc ttg gtg ctc ttt 1578 Gln Phe Asp Gln Arg Glu Phe Val Cys
Leu Lys Phe Leu Val Val Leu Phe 445 450
455 agt tta gat gtc aaa aac ctt gaa aac
ttc cag ctg gta gaa ggt gtc 1626 Ser Leu Asp Val Lys Asn Leu Glu Asn
Phe Gln Leu Val Glu Gly Val 460 465
470 cag gaa caa gtc aat gcc gcc ctg ctg
gac tac aca atg tgt aact tac 1674 Gln Glu Gln Val Asn Ala Ala Leu Leu
Asp Tyr Thr Met Cys Asn Tyr 475 480
485 ccg cag cag ca gag aaa ttt gga cag
cta ctt ctt cga cta ccc gaa 1722 Pro Gln Gln Thr Glu Lys Phe Gly Gln
Leu Leu Leu Arg Leu Pro Glu 490 495
500 505 atc cgg gcc atc agt atg cag gct gaa
gaa tac ctc tac tac aag cac 1770 Ile Arg Ala Ile Ser Met Gln Ala Glu
Glu Tyr Leu Tyr Tyr Lys His 510
515 520 ctg aac ggg gt gtg ccc tat aat aac
ctt ctc att gaa atg ttg cat 1818 Leu Asn Gly Asp Val Pro Tyr Asn Asn Asn
Leu Leu Ile Glu Met Leu His 525 530
535 gcc aaa aga gca taagttacaa cccctagga
g ctctgctttc aaaacaaaaaa 1870 Ala Lys Arg Ala 540 gagattggggg gaggggggg ggggggaagaag aac
agagaaga aaaaaaagtact tctgaactgc 1930 tccagagcaac gctataattaa aaactgcttt aaa
gattattg aatttaaaa gggataataa 1990 tcaaattact aatagcaaat aaatgattgta tca
gggtatt tgttattgcaa actgttgaatc 2050 aaaggcttca cagccccaga ggattccata taa
aagacat tgtaatggag tggattgaac 2110 tcacagatgg ataccacaacac ggtcagaagaaaa
acggaca gaacgggttct tgttatttatta 2170 aactgattctc actatgaag aaaatttagga act
aattctta ttaatttaggc tatataagcg 2230 ggggatttga gcttacagaga ttcctccatg gta
aagctga actgaaaaca ttctcaagaa 2290 tgcatcaggt gacctacaa tagcccctcc ctc
ttcctttt gaaggcccga gcacctctgc 2350 cctgtgtgtca ccgaatctgtt actaaggacc tgt
gttcagc caaccccaggt gtagtagccca 2410 ccaatcatg aacagcctaa tttttgaggtt ctg
tgtctta gacctgcaaaa cagctatatag 2470 gaatttct taatatgtta gctttgccatt tta
aatattgt tctgagggtt gttttgtctcc 2530 gtgtttcatga tgttaagaaaa atgcaggcag tat
cccctcat cttatgtagag tgtgaattaa 2590 tattaaggga aatgactaca aactttcaaaa gca
aatgctc catagctaaa gcaacttaga 2650 ccttattttct gctactgttg ctgaaatgtg gct
ttggcat tgttggattt cataaaaaaat 2710 ttctggcag aagctcttgttt agtatacatc agt
cttttttc atcatccaag tttttagtagtc 2770 attaaaaaaat acaactatta acacatttttg cta
ggattgtc aaatagtacac gtttctaagt 2830 aggttggaac aaaattgacg catgttaatc tat
gcaagaga gaaagggaaag gatgaggtga 2890 tgtattgact caaggttcat tcttgctgca att
gaactc ctcaagagtt gggatgggaaaa 2950 tggtgattttt tacatgtgtgtc ctggaaaagatt
aaagtaa ttcaaattctt ccccaaaaagg 3010 aaaaggaaga gagtgactact gacctttttta agt
catacac caaaagtctgc tgtagagaacaa 3070 atatgggggg acaaaagaatc gcaattttttcaa
atgacta tattagcattat ttattacatg 3130 cgatgccaca ggtatgaaaag tctttgccttta ttt
cacaatt ttaaaaggta gctgtggaga 3190 tgtggtcaa catttgtttta aaaataagta tta
attacht aaagtcaaaa aaaaaaaaaaaa 3250 a
3251 <210> 6 <211> 541 <212> PRT <213> Human <400> 6 Met Ser Ser Asn Ser Asp Thr Gly Asp
Leu Gln Glu Ser Leu Lys His 15
10 15 Gly Leu Thr Pro Ile Gly Ala Gly Leu
Pro Asp Arg His Gly Ser Pro 20 25
30 Ile Pro Ala Arg Gly Arg Leu Val Met
Leu Pro Lys Val Glu Thr Glu 35 40
45 Ala Leu Gly Leu Ala Arg Ser His Gly
Glu Gln Gly Gln Met Pro Glu 50 55
60 Asn Met Gln Val Ser Gln Phe Lys Met
Val Asn Tyr Ser Tyr Asp Glu 65 70
7580 Asp Leu Glu Glu Leu Cys Pro Val Cys
Gly Asp Lys Val Ser Gly Tyr 85
90 95 His Tyr Gly Leu Leu Thr Cys Glu Ser
Cys Lys Gly Phe Phe Lys Arg 100 105
110 Thr Val Gln Asn Asn Lys Arg Tyr Thr
Cys Ile Glu Asn Gln Asn Cys 115 120
125 Gln Ile Asp Lys Thr Gln Arg Lys Arg
Cys Pro Tyr Cys Arg Phe Gln 130 135
140 Lys Cys Leu Ser Val Val Gly Met Lys Leu
Glu Ala Val Arg Ala Asp Arg 145 150
155 160 Met Arg Gly Gly Gly Arg Asn Lys Phe Gly
Pro Met Tyr Lys Arg Asp Arg 165
170 175 Ala Leu Lys Gln Gln Lys Lys Ala Leu
Ile Arg Ala Asn Gly Leu Lys 180 185
190 Leu Glu Ala Met Ser Gln Val Ile Gln
Ala Met Pro Ser Asp Leu Thr 195 200
205 Ile Ser Ser Ala Ile Gln Asn Ile His
Ser Ala Ser Lys Gly Leu Pro 210 215
220 Leu Asn His His Ala Ala Leu Pro Pro Pro Thr
Asp Tyr Asp Arg Ser Pro Pro He 225 230
235 240 Val Thr Ser Pro Ile Ser Met Thr Met
Pro Pro His His Gly Ser Leu Gln 245
250 255 Gly Tyr Gln Thr Tyr Gly His Phe Pro
Ser Arg Ala Ile Lys Ser Glu 260 265
270 Tyr Pro Asp Pro Tyr Thr Ser Ser Pro
Glu Ser Ile Met Gly Tyr Ser 275 280
285 Tyr Met Asp Ser Tyr Gln Thr Ser Ser
Pro Ala Ser Ile Pro His Leu 290 295
300 Ile Leu Glu Leu Leu Lys Cys Glu Pro
Asp Glu Pro Gln Val Gln Ala 305 310
315 320 Lys Ile Met Ala Tyr Leu Gln Gln Glu
Gln Ala Asn Arg Ser Lys His 325
330 335 Glu Lys Leu Ser Thr Phe Gly Leu Met
Cys Lys Met Ala Asp Gln Thr 340 345
350 Leu Phe Ser Ile Val Glu Trp Ala Arg
Ser Ser Ile Phe Phe Arg Glu 355 360
365 Leu Lys Val Asp Asp Gln Met Lys Leu
Leu Gln Asn Cys Trp Ser Glu 370 375
380 Leu Leu Ile Leu Asp His Ile Tyr Arg
Gln Val Val His His Gly Lys Glu 385 390
395 400 Gly Ser Ile Phe Leu Val Thr Gly Gln
Gln Val Asp Tyr Ser Ile Ile 405
410 415 Ala Ser Gln Ala Gly Ala Thr Leu Asn
Asn Leu Met Ser His His Ala Gln 420 425
430 Glu Leu Val Ala Lys Leu Arg Ser Leu
Gln Phe Asp Gln Arg Glu Phe 435 440
445 Val Cys Leu Lys Phe Leu Val Val Leu Phe
Ser Leu Asp Val Lys Asn Leu 450 455
460 Glu Asn Phe Gln Leu Val Glu Gly Val
Gln Glu Gln Val Asn Ala Ala 465 470
475 480 Leu Leu Asp Tyr Thr Met Cys Asn Tyr
Pro Gln Gln Thr Glu Lys Phe 485
490 495 Gly Gln Leu Leu Leu Arg Leu Pro Glu
Ile Arg Ala Ile Ser Met Gln 500 505
510 Ala Glu Glu Tyr Leu Tyr Tyr Lys His
Leu Asn Gly Asp Val Pro Tyr 515 520
525 Asn Asn Leu Leu Ile Glu Met Leu His
Ala Lys Arg Ala 530 535
540 <210> 7 <211> 2330 <212> DNA <213> Human <220><221> CDS <222> (363). . (1862) <400> 7 gaactggat atatggttta cagcaggtca cta
atgttgg aaaaaagtaca gagttccaggg 60 aaagtactgc ttgttaacttt atgaatttctg ga
ttttttt ccttttgctttt tctttaactt 120 tcactaaggg ttactgtagt ctgatgtgtc ctt
cccaagg cccagaaatt tgacaagctg 180 actttttctt ttgctcaatg attctctgcttt taa
gccaaag aactgcctat aatttcacta 240 agaatgtctt ctaattcaga tactggggtat tta
caaggt ctttaaaagca cggacttca 300 cctattgggtg ctgggcttcc ggaccgacac gga
tccccca tcccgcccgc ggtcgcctttg 360 tc atg ctg ccc aaa gtg gag acg gaag
cc ctg gga ctg gct cga tcg 407 Met Leu Pro Lys Val Glu Thr Glu A
la Leu Gly Leu Ala Arg Ser 15
10 15 cat ggg gaa cag ggc cag atg ccg gaa
aac atg caa gtg tct caattt 455 His Gly Glu Gln Gly Gln Met Pro Glu
Asn Met Gln Val Ser Gln Phe 20
25 30 aaa atg gtg aat tact tcc tat ga t gaa
ga tctg gaa gag ctt tgt ccc 503 Lys Met Val Asn Tyr Ser Tyr Asp Glu
Asp Leu Glu Glu Leu Cys Pro 35 40
45 gtg tgt gga gaaa gtg tct ggg tac
cat tat ggg ctc ctc acc tgt 551 ValCys Gly Asp Lys Val Ser Gly Tyr
His Tyr Gly Leu Leu Thr Cys 50 55
60 ga agc tgc aag gga ttt ttt aag cga
aca gtc caa aat aat aaa agg 599 Glu SerCys Lys Gly Phe Phe Lys Arg
Thr Val Gln Asn Asn Lys Arg 65 70
75 tac aca tgt ata gaa aac cag aac tgc
caa att gac aaa aca cag aga 647 Tyr Thr Cys Ile Glu Asn Gln Asn Cys
Gln Ile Asp Lys Thr Gln Arg 80 85
90 aag cgt tgt cct tact tgt cgt ttt caa
aaa tgt cta agt gtt gga atg 695 Lys Arg Cys Pro Tyr Cys Arg Phe Gln
Lys Cys Leu Ser Val Gly Met 100
105 110 aag cta gaa gct gta agg gcc gac cga
atg cgt gga gga agg aat aag 743 Lys Leu Glu Ala Val Arg Ala Asp Arg
Met Arg Gly Gly Arg Asn Lys 115 120
125 ttt ggg cca atg tac aag agaga gac agg
gcc ctg aaa caag aaa aaa 791 Phe Gly Pro Met Tyr Lys Arg Asp Arg
Ala Leu Lys Gln Gln Lys Lys 130 135
140 gcc ctc atc cga gcc aat gga ctt aag
cta gaa gcc atg tct cag gtg 839 Ala Leu Ile Arg Ala Asn Gly Leu Lys
Leu Glu Ala Met Ser Gln Val 145 150
155 atc caa gct atg ccc tct gac ctg acc
att tcc tct gca att caa aac 887 Ile Gln Ala Met Pro Ser Asp Leu Thr
Ile Ser Ser Ala Ile Gln Asn 160 165
170 175 atc cac tct gcc tcc aaa ggg cta cct
ctg aac cat gct gcc ttg cct 935 Ile His Ser Ala Ser Lys Gly Leu Pro
Leu Asn His Ala Ala Leu Pro 180
185 190 cct aca gac tat gac aga agt ccc ttt
gta aca tcc ccc att agc atg 983 Pro Thr Asp Tyr Asp Arg Ser Pro Pro He
Val Thr Ser Pro Ile Ser Met 195 200
205 aca atg ctg cac ggc agc ctg caa ggt
tac caa aca tat gggc cac ttt 1031 Thr Met Leu His Gly Ser Leu Gln Gly
Tyr Gln Thr Tyr Gly His Phe 210 215
220 cct agc cgg gcc atc aag tct gag tac
cca gac ccc tat acc agc tca 1079 Pro Ser Arg Ala Ile Lys Ser Glu Tyr
Pro Asp Pro Tyr Thr Ser Ser 225 230
235 ccc gag tcc ata atg ggc tat tca tat
atg gat agt tac cag acg agc 1127 Pro Glu Ser Ile Met Gly Tyr Ser Tyr
Met Asp Ser Tyr Gln Thr Ser 240 245
250 255 tct cca gca agc atc cca cat ctg data
ctg gaa ctt ttg aag tgt gag 1175 Ser Pro Ala Ser Ile Pro His Leu Ile
Leu Glu Leu Leu Lys Cys Glu 260
265 270 cca gat gag cct caa gtc cag gct aaa
atc atg gcc tat ttg cag ca1223 Pro Asp Glu Pro Gln Val Gln Ala Lys
Ile Met Ala Tyr Leu Gln Gln 275 280
285 gag cag gct aac cga agc aag cac gaa
aag ctg agc acc ttt ggg ctt 1271 Glu Gln Ala Asn Arg Ser Lys His Glu
Lys Leu Ser Thr Phe Gly Leu 290 295
300 atg tgc aaa atg gca ga t caa act gtc
ttc tcc att gtc gag tgg gcc 1319 Met Cys Lys Met Ala Asp Gln Thr Val
Phe Ser Ile Val Glu Trp Ala 305 310
315 agg agt agt atc ttc ttc aga gaa ctt
aag gtt gat gac caat atg aag 1367 Arg Ser Ser Ile Phe Phe Arg Glu Leu
Lys Val Asp Asp Gln Met Lys 320 325
330 335 ctg ctt cag aac tgc tgg agt gag ctc
tta atc ctc gac cac att tac 1415 Leu Leu Gln Asn Cys Trp Ser Glu Leu
Leu Ile Leu Asp His Ile Tyr 340
345 350 cga caa gtg gta cat gga aag gaa gga
tcc atc ttc ctg gtt act ggg 1463 Arg Gln Val Val His Gly Lys Glu Gly Gly
Ser Ile Phe Leu Val Thr Gly 355 360
365 caa caa gtg gac tat tcc ata ata gca
tca caa gcc gga gcc acc ctc 1511 Gln Gln Val Asp Tyr Ser Ile Ile Ala
Ser Gln Ala Gly Ala Thr Leu 370 375
380 aac aac ctc atg agt cat gca cag gag
tta gtg gca aaa ctt cgt tct 1559 Asn Asn Leu Met Ser His Ala Gln Glu
Leu Val Ala Lys Leu Arg Ser 385 390
395 ctc cag ttt gat caa cga gag tttc gta
tgt ctg aaa ttc ttg gtg ctc 1607 Leu Gln Phe Asp Gln Arg Glu Phe Val
Cys Leu Lys Phe Leu Val Leu 400 405
410 415 ttt agt tta gat gtc aaa aac ctt gaa
aac ttc cag ctg gta gaa ggt 1655 Phe Ser Leu Asp Val Lys Asn Leu Glu
Asn Phe Gln Leu Val Glu Gly 420
425 430 gt cag gaa caa gtc aat gcc gcc ctg
ctg gac tac aca atg tgt aac 1703 Val Gln Glu Gln Val Asn Ala Ala Leu
Leu Asp Tyr Thr Met Cys Asn 435 440
445 tac ccg cag cag ca gag aaa ttt gga
cag cta ctt ctt cga cta ccc 1751 Tyr Pro Gln Gln Thr Glu Lys Phe Gly
Gln Leu Leu Leu Arg Leu Pro 450 455
460 gaa atc cgg gcc atc agt atg cag gct
gaa gaa tac ctc tac tac aag 1799 Glu Ile Arg Ala Ile Ser Met Gln Ala
Glu Glu Tyr Leu Tyr Tyr Lys 465 470
475 cac ctg aat ggg gat gtg ccc tat aat
aac ctt ctc att ga atg ttg 1847 His Leu Asn Gly Asp Val Pro Tyr Asn
Asn Leu Leu Ile Glu Met Leu 480 485
490 495 cat gcc aaa aga gca taagttacaaa cccct
aggag ctctgcttttc aaaacaaaaaa 1902 His Ala Lys Arg Ala
500 gagattggggg gagttgggggg gggggagaagaag aac
agagaaga aaaaaaagtac tctgaactgc 1962 tccaagtaac gctataattaaa aactgcttt aaa
gattattg aatttaaaaa ggcataataa 2022 tcaaactacta atgcaaata aatgatgtat cag
ggtatttt gtattgcaa ctgtgaatca 2082 aagcttcaca gccccagaggg attccatata aaa
gactattg taatggagtg gattgaactc 2142 acagagggt accaacacgg tcagaagaaaa aac
ggacaga acgggttctttg tatttattaa 2202 ctgatctcca tattatagaagaa atttaggaac taa
tctttatt aatttaggtt atacagcgggg 2262 gattttgagct tacaggatttc ctccatgggta aag
ctgaact gaaaaaattc tcaagaatgc 2322 atcagctg
2330 <210> 8 <211> 500 <212> PRT <213> Human <400> 8 Met Leu Pro Lys Val Glu Thr Glu Ala
Leu Gly Leu Ala Arg Ser His 15
10 15 Gly Glu Gln Gly Gln Met Pro Glu Asn
Met Gln Val Ser Gln Phe Lys 20 25
30 Met Val Asn Tyr Ser Tyr Asp Glu Asp
Leu Glu Glu Leu Cys Pro Val 35 40
45 Cys Gly Asp Lys Val Ser Gly Tyr His
Tyr Gly Leu Leu Thr Cys Glu 50 55
60 Ser Cys Lys Gly Phe Phe Lys Arg Thr
Val Gln Asn Asn Lys Arg Tyr 65 70
7580 Thr Cys Ile Glu Asn Gln Asn Cys Gln
Ile Asp Lys Thr Gln Arg Lys 85
90 95 Arg Cys Pro Tyr Cys Arg Phe Gln Lys
Cys Leu Ser Val Gly Met Lys 100 105
110 Leu Glu Ala Val Arg Ala Asp Arg Met
Arg Gly Gly Arg Asn Lys Phe 115 120
125 Gly Pro Met Tyr Lys Arg Asp Arg Ala
Leu Lys Gln Gln Lys Lys Ala 130 135
140 Leu Ile Arg Ala Asn Gly Leu Lys Leu
Glu Ala Met Ser Gln Val Ile 145 150
155 160 Gln Ala Met Pro Ser Asp Leu Thr Ile
Ser Ser Ala Ile Gln Asn Ile 165
170 175 His Ser Ala Ser Lys Gly Leu Pro Leu
Asn His Ala Ala Leu Pro Pro 180 185
190 Thr Asp Tyr Asp Arg Ser Pro Pro He Val
Thr Ser Pro Ile Ser Met Thr 195 200
205 Met Leu His Gly Ser Leu Gln Gly Tyr
Gln Thr Tyr Gly His Phe Pro 210 215
220 Ser Arg Ala Ile Lys Ser Glu Tyr Pro
Asp Pro Tyr Thr Ser Ser Pro 225 230
235 240 Glu Ser Ile Met Gly Tyr Ser Tyr Met
Asp Ser Tyr Gln Thr Ser Ser 245
250 255 Pro Ala Ser Ile Pro His Leu Ile Leu
Glu Leu Leu Lys Cys Glu Pro 260 265
270 Asp Glu Pro Gln Val Gln Ala Lys Ile
Met Ala Tyr Leu Gln Gln Glu 275 280
285 Gln Ala Asn Arg Ser Lys His Glu Lys
Leu Ser Thr Phe Gly Leu Met 290 295
300 Cys Lys Met Ala Asp Gln Thr Val Phe
Ser Ile Val Glu Trp Ala Arg 305 310
315 320 Ser Ser Ile Phe Phe Arg Glu Leu Lys
Val Asp Asp Gln Met Lys Leu 325
330 335 Leu Gln Asn Cys Trp Ser Glu Leu Leu
Ile Leu Asp His Ile Tyr Arg 340 345
350 Gln Val Val His His Gly Lys Glu Gly Ser
Ile Phe Leu Val Thr Gly Gln 355 360
365 Gln Val Asp Tyr Ser Ile Ile Ala Ser
Gln Ala Gly Ala Thr Leu Asn 370 375
380 Asn Leu Met Ser His His Ala Gln Glu Leu
Val Ala Lys Leu Arg Ser Leu 385 390
395 400 Gln Phe Asp Gln Arg Glu Phe Val Cys
Leu Lys Phe Leu Val Leu Phe 405
410 415 Ser Leu Asp Val Lys Asn Leu Glu Asn
Phe Gln Leu Val Glu Gly Val 420 425
430 Gln Glu Gln Val Asn Ala Ala Leu Leu
Asp Tyr Thr Met Cys Asn Tyr 435 440
445 Pro Gln Gln Thr Glu Lys Phe Gly Gln
Leu Leu Leu Arg Leu Pro Glu 450 455
460 Ile Arg Ala Ile Ser Met Gln Ala Glu
Glu Tyr Leu Tyr Tyr Lys His 465 470
475 480 Leu Asn Gly Asp Val Pro Tyr Asn Asn
Leu Leu Ile Glu Met Leu His 485
490 495 Ala Lys Arg Ala 500 <210> 9 <211> 3027 <212> DNA <213> Mouse <220><221> CDS <222> (159). . (1838) <400> 9 tgtttttttcc cccttttttct taactttcac taa
ggaaatg agggttactg tagtctgaggg 60 tttccttccc aaagtcacaa aatatgacaa gct
gcaatct ttctcacatt caatgatttc 120 tgctgtaagc caaaggactg ccaataattt cgc
taaga atg tct gct agt ttg gat 176
Met Ser Ala Ser Leu Asp
15 act gga gatt ttt caa gaa ttt ctt aag
cat gga ctt aca gct att gcg 224 Thr Gly Asp Phe Gln Glu Phe Leu Lys
His Gly Leu Thr Ala Ile Ala 10 15
20 tct gca cca ggg tca gag act cgc cac
tcc ccc aaa cgt gag ga caa 272 Ser Ala Pro Gly Ser Glu Thr Arg His
Ser Pro Lys Arg Glu Glu Gln 25 30
35 ctc cgg aaa aaa cgt gct ggg ctt ccg
gac cga cac cga cgc ccc att 320 Leu Arg Glu Lys Arg Ala Gly Leu Pro
Asp Arg His Arg Arg Pro Ile 40 45
50 ccc gcc cgc agc cgc ctt gtc atg ctg
ccc aaa gtg gag acg gaa gcc 368 Pro Ala Arg Ser Arg Leu Val Met Leu
Pro Lys Val Glu Thr Glu Ala 55 60
65 70 cca gga ctg gtc cga tcg cat ggg gaa
cag ggg cag atg cca ga aac 416 Pro Gly Leu Val Arg Ser His Gly Glu
Gln Gly Gln Met Pro Glu Asn 75
80 85 atg caa gtg tct caattt aaa atg gtg
aat tac tcc tat gat ga gat 464 Met Gln Val Ser Gln Phe Lys Met Val
Asn Tyr Ser Tyr Asp Glu Asp 90 95
100 ctg gaa gag cta tgt cct gtg tgt gggc
gat aaa gtg tct ggg tac cat 512 Leu Glu Glu Leu Cys Pro Val Cys Gly
Asp Lys Val Ser Gly Tyr His 105 110
115 tac ggt ctc ctc acg tgc gaa agc tgc
aag ggt ttt ttt aag cga act 560 Tyr Gly Leu Leu Thr Cys Glu Ser Cys
Lys Gly Phe Phe Lys Arg Thr 120 125
130 gtc caa aac caa aa agg tac acg tgc
ata gag aac cag aat tgc caa 608 Val Gln Asn Gln Lys Arg Tyr Thr Cys
Ile Glu Asn Gln Asn Cys Gln 135 140
145 150 att gac aaa acg cag aga aaa cga tgt
ccc tact tgt cga ttc aaa aaa 656 Ile Asp Lys Thr Gln Arg Lys Arg Cys
Pro Tyr Cys Arg Phe Lys Lys 155
160 165 tgt atc gat gtt ggg atg aag ctg gaa
gcc gta aga gcc gac cgc atg 704 Cys Ile Asp Val Gly Met Lys Leu Glu
Ala Val Arg Ala Asp Arg Met 170 175
180 cga ggg gggc aga aat aag ttt ggg cca
atg tac aag aga gac agg gct 752 Arg Gly Gly Arg Asn Lys Phe Gly Pro
Met Tyr Lys Arg Asp Arg Ala 185 190
195 ttg aag cag cag aag aaa gcc ctc att
cga gcc aat gga ctt aag ctg 800 Leu Lys Gln Gln Lys Lys Ala Leu Ile
Arg Ala Asn Gly Leu Lys Leu 200 205
210 ga gcc atg tct cag gtg atc caa gca
atg ccc tca gac ctg acc tct 848 Glu Ala Met Ser Gln Val Ile Gln Ala
Met Pro Ser Asp Leu Thr Ser 215 220
225 230 gca att cag aac att cat tcc gcc tcc
aaa gc cta cct ctg agc cat 896 Ala Ile Gln Asn Ile His Ser Ala Ser
Lys Gly Leu Pro Leu Ser His His 235
240 245 gta gcc ttg cct ccg aca gac tat gac
aga agt ccc ttt gtc aca tct 944 Val Ala Leu Pro Pro Thr Asp Tyr Asp
Arg Ser Pro Phe Val Thr Ser 250 250
260 ccc att agc atg aca atg cca cct cac
agc agc ctg cat ggt tac caa 992 Pro Ile Ser Met Thr Met Pro Pro His
Ser Ser Leu His Gly Tyr Gln 265 270
275 ccc tat ggt cac ttt cct agt cgg gcc
atc aag tct gag tac cca gac 1040 Pro Tyr Gly His Phe Pro Ser Arg Ala
Ile Lys Ser Glu Tyr Pro Asp 280 285
290 ccc tac tcc agc tca cct gag tca atg
atg ggt tac tcc tac atg gat 1088 Pro Tyr Ser Ser Ser Pro Glu Ser Met
Met Gly Tyr Ser Tyr Met Asp 295 300
305 310 ggt tac cag aca aac tcc ccg gcc agc
atc cca cac ctg ata ctg gaa 1136 Gly Tyr Gln Thr Asn Ser Pro Ala Ser
Ile Pro His Leu Ile Leu Glu 315
320 325 ctt ttg aag tgt gaa cca gat gag cct
ca gtt caa gcg aag atc atg 1184 Leu Leu Lys Cys Glu Pro Asp Glu Pro
Gln Val Gln Ala Lys Ile Met 330 335
340 gct tac ctc cag caa gag cag agt aac
cga aac agg caa aaa aag ctg 1232 Ala Tyr Leu Gln Gln Glu Gln Ser Asn
Arg Asn Arg Gln Glu Lys Leu 345 350
355 agc gca ttt ggg ctt tta tgc aaa atg
gcg gac cag acc ctg ttc tcc 1280 Ser Ala Phe Gly Leu Leu Cys Lys Met
Ala Asp Gln Thr Leu Phe Ser 360 365
370 att gtt gag tgg gcc agg agt agt atc atc
ttc ttc agg gaa ctg aag gtt 1328 Ile Val Glu Trp Ala Arg Ser Ser Ile
Phe Phe Arg Glu Leu Lys Val 375 380
385 390 gat gac caaatg aag ctg ctt caa aac
tgc tgg agt gag ctc ttg att 1376 Asp Asp Gln Met Lys Leu Leu Gln Asn
Cys Trp Ser Glu Leu Leu Ile 395
400 405 ctc gat cac att tac cga caa gtg gcg
cat ggg aag gaaggg aca atc 1424 Leu Asp His Ile Tyr Arg Gln Val Ala
His Gly Lys Glu Gly Thr Ile 410 415
420 ttc ctg gtt act gga gaa cac gtg gac
tac tcc acc atc atc tca cac 1472 Phe Leu Val Thr Gly Glu His Val Asp
Tyr Ser Thr Ile Ile Ser His 425 430
435 aca gaa gtc gcg ttc aac aac ctc ctg
agt ctc gca cag gag ctg gtg 1520 Thr Glu Val Ala Phe Asn Asn Leu Leu
Ser Leu Ala Gln Glu Leu Val 440 445
450 gtg agg ctc cgt tcc ctt cag ttc gat
cag cgg gag ttt gta tgt ctc 1568 Val Arg Leu Arg Ser Leu Gln Phe Asp
Gln Arg Glu Phe Val Cys Leu 455 460
465 470 aag ttc ctg gtg ctg ttc agc tca gat
gtg aag aac ctg gag aac ctg 1616 Lys Phe Leu Val Leu Phe Ser Ser Asp
Val Lys Asn Leu Glu Asn Leu 475
480 485 cag ctg gtg gaa ggt gtc caa gag cag
gtg aat gcc gcc ctg ctg gac 1664 Gln Leu Val Glu Gly Val Gln Glu Gln
Val Asn Ala Ala Leu Leu Asp 490 495
500 tac acg gtt tgc aac tac cca caa cag
act gag aaa tc gga cag cta 1712 Tyr Thr Val Cys Asn Tyr Pro Gln Gln
Thr Glu Lys Phe Gly Gln Leu 505 510
515 ctt ctt cgg cta ccc gag atc cgg gca
atc agc aag cag gca ga gac 1760 Leu Leu Arg Leu Pro Glu Ile Arg Ala
Ile Ser Lys Gln Ala Glu Asp 520 525
530 tac ctg tac tat aag cac gtg aac ggg
gat gtg ccc tat aat aac ctc 1808 Tyr Leu Tyr Tyr Lys His Val Asn Gly
Asp Val Pro Tyr Asn Asn Leu 535 540
545 550 ctc att gag atg ctg cat gcc aaa aga
gcc tagctcccca cccctggagaag 1858 Leu Ile Glu Met Leu His Ala Lys Arg
Ala 555
560 cttgctctag gaacacagac tggaaggaga aga
ggaggac gatgacagaa acacaatact 1918 ctgaactgct ccaagcaatg ctattataa act
tggttta aagacactga attttaaaag 1978 catataattatt aaaattacctaa tagcaaaaata atg
atatatc agggtattttg tactgcaaac 2038 tgtgaatcaa aggctgtattg aatcaaagga ttc
atatgaa agacattgta atgggggtgga 2098 ttgaacttac agatgggagac caataccaca gca
gaataaa aatgggacaga acaatcctttg 2158 tatttattaa ctatctctgct attaagaaat tca
gaagttg attctctgtta ttaattggat 2218 ttgtcctgaa ttactccgtg gtgacgctga aca
actcaag aatacatggg ctgtgctttgg 2278 cagcccctcc ccatccctcc caccaccacc acc
cccaccc cccacaaggcc ctataccttc 2338 tgacctgtga gccctgaagc tattttaagg act
tctgtttc agccataccc aggtagtagct 2398 ccctaaaacc atgatttctgg gattctctgtg tct
tagacct gccacaagct aataagaacaca 2458 atgtataaat atgtcagcttt gcatttttaaa tat
gtgctga agttttgttttt gtcgtgttgtt 2518 cgtaattaaaaaagaaaaacgg gcagtaacccctct
tctatat aagcattagt taatttaag 2578 ggaatcaaaa caatctagag ccaatactcc caa
caagcaa gttagattctt actcttgctg 2638 ctgttgctga aatgtgggctt tggcatggtt ggg
tttcat aaactttttg gccaagaggc 2698 ttgttagtacatccatct gtttagtcat caa
ggtttgt agttcactta aaaaaaaaaata 2758 accactaga catctttttgc tgaatgtcaa data
gtcacag tctaagtagc caaaaaagtca 2818 aagcgtgtta aaattttgcca aatgaaggaa agg
gtgagct gcaaaaggggga tggttcgagg 2878 ttcattccag ttgttgaccccg agcgtcccc aaa
cctggga tgcaaaagaca gtgattctgc 2938 atatggcctg aaaagacagg aaaagccaggtc tcc
tacaaag gggaatggaa gatcctggcc 2998 tctaagtcat agccaaagt ctgctgtag
3027 <210> 10 <211> 560 <212> PRT <213> Mouse <400> 10 Met Ser Ala Ser Leu Asp Thr Gly Asp
Phe Gln Glu Phe Leu Lys His 15
10 15 Gly Leu Thr Ala Ile Ala Ser Ala Pro
Gly Ser Glu Thr Arg His Ser 20 25
30 Pro Lys Arg Glu Glu Gln Leu Arg Glu
Lys Arg Ala Gly Leu Pro Asp 35 40
45 Arg His Arg Arg Pro Ile Pro Ala Arg
Ser Arg Leu Val Met Leu Pro 50 55
60 Lys Val Glu Thr Glu Ala Pro Gly Leu
Val Arg Ser His His Gly Glu Gln 65 70
75 80 Gly Gln Met Pro Glu Asn Met Gln Val
Ser Gln Phe Lys Met Val Asn 85
90 95 Tyr Ser Tyr Asp Glu Asp Leu Glu Glu
Leu Cys Pro Val Cys Gly Asp 100 105
110 Lys Val Ser Gly Tyr His Tyr Gly Leu
Leu Thr Cys Glu Ser Cys Lys 115 120
125 Gly Phe Phe Lys Arg Thr Val Gln Asn
Gln Lys Arg Tyr Thr Cys Ile 130 135
140 Glu Asn Gln Asn Cys Gln Ile Asp Lys
Thr Gln Arg Lys Arg Cys Pro 145 150
155 160 Tyr Cys Arg Phe Lys Lys Cys Ile Asp
Val Gly Met Lys Leu Glu Ala 165
170 175 Val Arg Ala Asp Arg Met Arg Gly Gly
Arg Asn Lys Phe Gly Pro Met 180 185
190 Tyr Lys Arg Asp Arg Ala Leu Lys Gln
Gln Lys Lys Ala Leu Ile Arg 195 200
205 Ala Asn Gly Leu Lys Leu Glu Ala Met
Ser Gln Val Ile Gln Ala Met 210 215
220 Pro Ser Asp Leu Thr Ser Ala Ile Gln
Asn Ile His Ser Ala Ser Lys 225 230
235 240 Gly Leu Pro Leu Ser His His Val Val Ala Leu
Pro Pro Thr Asp Tyr Asp Arg 245
250 255 Ser Pro Pro Val Val Thr Ser Pro Ile Ser
Met Thr Met Pro Pro His Ser Ser 260 265
270 Ser Leu His Gly Tyr Gln Pro Tyr Gly
His Phe Pro Ser Arg Ala Ile 275 280
285 Lys Ser Glu Tyr Pro Asp Pro Tyr Ser
Ser Ser Pro Glu Ser Met Met 290 295
300 Gly Tyr Ser Tyr Met Asp Gly Tyr Gln
Thr Asn Ser Pro Ala Ser Ile 305 310
315 320 Pro His Leu Ile Leu Glu Leu Leu Lys
Cys Glu Pro Asp Glu Pro Gln 325
330 335 Val Gln Ala Lys Ile Met Ala Tyr Leu
Gln Gln Glu Gln Ser Asn Arg 340 345
350 Asn Arg Gln Glu Lys Leu Ser Ala Phe
Gly Leu Leu Cys Lys Met Ala 355 360
365 Asp Gln Thr Leu Phe Ser Ile Val Glu
Trp Ala Arg Ser Ser Ile Phe 370 375
380 Phe Arg Glu Leu Lys Val Asp Asp Gln
Met Lys Leu Leu Gln Asn Cys 385 390
395 400 Trp Ser Glu Leu Leu Ile Leu Asp His
Ile Tyr Arg Gln Val Ala His 405
410 415 Gly Lys Glu Gly Thr Ile Phe Leu Val
Thr Gly Glu His Val Asp Tyr 420 425
430 Ser Thr Ile Ile Ser His Thr Glu Val
Ala Phe Asn Asn Leu Leu Ser 435 440
445 Leu Ala Gln Glu Leu Val Val Arg Leu
Arg Ser Leu Gln Phe Asp Gln 450 455
460 Arg Glu Phe Val Cys Leu Lys Phe Leu
Val Leu Phe Ser Ser Asp Val 465 470
475 480 Lys Asn Leu Glu Asn Leu Gln Leu Val
Glu Gly Val Gln Glu Gln Val 485
490 495 Asn Ala Ala Leu Leu Asp Tyr Thr Val
Cys Asn Tyr Pro Gln Gln Thr 500 505
510 Glu Lys Phe Gly Gln Leu Leu Leu Arg
Leu Pro Glu Ile Arg Ala Ile 515 520
525 Ser Lys Gln Ala Glu Asp Tyr Leu Tyr
Tyr Lys His Val Val Asn Gly Asp 530 535
540 Val Pro Tyr Asn Asn Leu Leu Ile Glu
Met Leu His Ala Lys Arg Ala 545 550
555 560

フロントページの続き (72)発明者 ベイ・シャン アメリカ合衆国94080カリフォルニア州サ ウス・サン・フランシスコ、トゥー・コー ポレイト・ドライブ、テュラリク インコ ーポレイテッド内 (72)発明者 仁田 正弘 大阪府大阪市中央区道修町2丁目2番8号 住友製薬株式会社内Continuing on the front page (72) Inventor Bay Shan U.S.A. 94080 In San Francisco, California, To Corporate Drive, Tularik Inc. (72) Inventor Masahiro Nita 2 Doshomachi, Chuo-ku, Osaka, Osaka Chome 2-8 Sumitomo Pharmaceutical Co., Ltd.

Claims (9)

【特許請求の範囲】[Claims] 【請求項1】 配列番号2、4もしくは6、または、残
基1−10、11−15、16−21、204−207
および299−307のうち少なくとも1個を含む配列
番号2の少なくとも10残基ドメイン、残基154を含
む配列番号4の10残基ドメイン、または残基3−1
0、13−22および30−38のうち少なくとも1個
を含む配列番号6の10残基ドメインを含む、分離され
たポリペプチド。
1. SEQ ID NO: 2, 4 or 6, or residues 1-10, 11-15, 16-21, 204-207
And at least one of residues 299-307, SEQ ID NO: 4 including residues 154, or residue 3-1.
An isolated polypeptide comprising a 10 residue domain of SEQ ID NO: 6 comprising at least one of 0, 13-22 and 30-38.
【請求項2】 該ドメインがCYP7遺伝子プロモータ
ーと特異的に結合する、請求項1に記載の分離されたポ
リペプチド。
2. The isolated polypeptide according to claim 1, wherein said domain specifically binds to a CYP7 gene promoter.
【請求項3】 配列番号1、3もしくは5の鎖を含む分
離された、もしくは組換えの第一の核酸、または各々配
列番号7および9を含む第3および第4の核酸の存在下
で、対応する配列番号1、3もしくは5の相補鎖を含む
第二の核酸と特異的にハイブリダイズするに充分な、対
応する配列番号1、3もしくは5の少なくとも24個の
連続した塩基を有する、配列番号1、3もしくは5の鎖
の一部。
3. In the presence of an isolated or recombinant first nucleic acid comprising the strand of SEQ ID NO: 1, 3 or 5, or a third and fourth nucleic acid comprising SEQ ID NO: 7 and 9, respectively. A sequence having at least 24 contiguous bases of the corresponding SEQ ID No. 1, 3 or 5 sufficient to specifically hybridize with a second nucleic acid comprising the complementary strand of the corresponding SEQ ID No. 1, 3 or 5 Part of the chain numbered 1, 3 or 5.
【請求項4】 請求項1に記載のポリペプチドをコード
している組換え核酸。
4. A recombinant nucleic acid encoding the polypeptide according to claim 1.
【請求項5】 請求項4に記載の核酸を含む細胞。5. A cell containing the nucleic acid according to claim 4. 【請求項6】 CPFポリペプチドを製造する方法であ
って、該方法が、請求項4に記載の核酸を宿主細胞また
は細胞抽出物中に導入し、該宿主細胞または抽出物を、
該核酸が転写物として発現され且つ該転写物が該ポリペ
プチドを含む翻訳産物として発現されるような条件下で
インキュベートし、そして該翻訳産物を分離する、とい
う工程を含む方法。
6. A method for producing a CPF polypeptide, which comprises introducing a nucleic acid according to claim 4 into a host cell or cell extract, wherein said host cell or extract comprises:
Incubating under conditions such that the nucleic acid is expressed as a transcript and the transcript is expressed as a translation product comprising the polypeptide, and separating the translation product.
【請求項7】 結合標的に対するCPFポリペプチドの
相互作用を調節する物質をスクリーニングする方法であ
って、該方法が、配列番号2、4もしくは6、または、
残基1−10、11−15、16−21、204−20
7および299−307のうち少なくとも1個を含む配
列番号2の少なくとも10残基ドメイン、残基154を
含む配列番号4の10残基ドメイン、または残基3−1
0、13−22および30−38のうち少なくとも1個
を含む配列番号6の10残基ドメインを含む、分離され
たポリペプチド、該ポリペプチドの結合標的、および、
候補物質、を含む混合物を、該物質の存在無しでは該ポ
リペプチドが該結合標的に対照の親和性で特異結合する
ような条件の下で、インビトロまたは培養内でインキュ
ベートし、該結合標的に対する該ポリペプチドの結合親
和性を検出して、物質によって偏った親和性(ここで、
物質によって偏った親和性と対照の親和性との相違は、
該物質が結合標的に対する該ポリペプチドの結合を調節
することを示す)を決定する、という工程を含む方法。
7. A method for screening a substance that modulates the interaction of a CPF polypeptide with a binding target, the method comprising the steps of: SEQ ID NO: 2, 4 or 6, or
Residues 1-10, 11-15, 16-21, 204-20
7 and 299-307, at least a 10-residue domain of SEQ ID NO: 2 comprising residues 154, a 10-residue domain of SEQ ID NO: 4 comprising residues 154, or residues 3-1
An isolated polypeptide comprising a 10 residue domain of SEQ ID NO: 6 comprising at least one of 0, 13-22 and 30-38; a binding target of said polypeptide; and
A mixture comprising a candidate substance, in vitro or in culture, under conditions such that the polypeptide specifically binds to the binding target with a control affinity in the absence of the substance. By detecting the binding affinity of the polypeptide, the affinity biased by the substance (here,
The difference between the affinity biased by the substance and the affinity of the control is
Indicating that the substance modulates binding of the polypeptide to a binding target).
【請求項8】 結合標的が、CPFポリペプチドに特異
的に結合するに充分なCYP7プロモーター配列を含む
核酸である、請求項7に記載の方法。
8. The method of claim 7, wherein the binding target is a nucleic acid comprising sufficient CYP7 promoter sequence to specifically bind to a CPF polypeptide.
【請求項9】 CPFポリペプチドと特異的に結合する
物質をスクリーニングする方法であって、この方法が、
配列番号2、4もしくは6、または、残基1−10、1
1−15、16−21、204−207および299−
307のうち少なくとも1個を含む配列番号2の少なく
とも10残基ドメイン、残基154を含む配列番号4の
10残基ドメイン、または残基3−10、13−22お
よび30−38のうち少なくとも1個を含む配列番号6
の10残基ドメインを含む、分離されたポリペプチド、
および候補物質、を含む混合物を、該物質が該ポリペプ
チドと特異的に結合するような条件下で、インビトロま
たは培養内でインキュベートし、そしてこの特異結合し
た物質を検出する、という工程を含む方法。
9. A method for screening for a substance that specifically binds to a CPF polypeptide, the method comprising:
SEQ ID NO: 2, 4 or 6, or residues 1-10, 1
1-15, 16-21, 204-207 and 299-
At least one of SEQ ID NO: 2 comprising at least one of 307, 10 residues of SEQ ID NO: 4 comprising residue 154, or at least one of residues 3-10, 13-22 and 30-38 SEQ ID NO: 6
An isolated polypeptide comprising a 10 residue domain of
And incubating in vitro or in culture under conditions such that the substance specifically binds to the polypeptide, and detecting the specifically bound substance. .
JP10348391A 1997-12-08 1998-12-08 Cyp7 promotor binding factor Pending JPH11235187A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US6770897P 1997-12-08 1997-12-08
US60/067708 1997-12-08

Publications (1)

Publication Number Publication Date
JPH11235187A true JPH11235187A (en) 1999-08-31

Family

ID=22077842

Family Applications (1)

Application Number Title Priority Date Filing Date
JP10348391A Pending JPH11235187A (en) 1997-12-08 1998-12-08 Cyp7 promotor binding factor

Country Status (1)

Country Link
JP (1) JPH11235187A (en)

Similar Documents

Publication Publication Date Title
CA2343939A1 (en) Genes and proteins predictive and therapeutic for stroke, hypertension, diabetes and obesity
JPH10507343A (en) Retinoblastoma protein interacting zinc finger protein
US7777000B2 (en) Anti-viral activity of cathelicidin peptides
US20030232419A1 (en) Molecules interacting with CASL (MICAL) polynucleotides, polypeptides, and methods of using the same
AU2001263952A1 (en) Tumour suppressor and uses thereof
WO2001094581A1 (en) Tumour suppressor and uses thereof
WO1998053061A1 (en) Three novel genes encoding a zinc finger protein, a guanine, nucleotide exchange factor and a heat shock protein or heat shock binding protein
JP2000511765A (en) Purified SR-p70 protein
US6025480A (en) Isolated nucleic acid molecules encoding P57KIP2
JPH09509319A (en) RNA modifying enzyme and method of using the same
US6297019B1 (en) Recombinant polynucleotides encoding CYP7 promoter-binding factors
US6180763B1 (en) Cyclin D binding factor, and uses thereof
AU714793B2 (en) Novel stress proteins
US20070128645A1 (en) Use of KIAA0172 gene in treatment and diagnosis of diseases as well as in pharmaceutical development
JPH11235187A (en) Cyp7 promotor binding factor
US20030054446A1 (en) Novel retina-specific human proteins C7orf9, C12orf7, MPP4 and F379
EP1180525B1 (en) Transcriptional activation inhibitory protein
US6825034B2 (en) Human RRN3 and compositions and methods relating thereto
TWI291991B (en) Isolated dna encoding a gpbp polypeptide, vectors comprising such dna, isolated gpbp polypeptides, and isolated antibodies that bind to gpbp polypeptides
EP1279733A1 (en) Nucleic acids and polypeptides involved in the predisposition to infection to human papilloma virus, to epidermodysplasia verruciformis and/or to psoriasis
US20030028002A1 (en) Cyclin D binding factor, and uses thereof
JPH10127296A (en) Ext2 gene
US8609606B2 (en) Polypeptide acting to enhance ischemic diseases
WO1997043415A1 (en) Cyclin d binding factor, and uses thereof
WO1997043415A9 (en) Cyclin d binding factor, and uses thereof

Legal Events

Date Code Title Description
A711 Notification of change in applicant

Free format text: JAPANESE INTERMEDIATE CODE: A711

Effective date: 20040803

A521 Written amendment

Free format text: JAPANESE INTERMEDIATE CODE: A821

Effective date: 20040803