JP2003180359A - New gene and protein encoded with the same - Google Patents

New gene and protein encoded with the same

Info

Publication number
JP2003180359A
Authority
JP
Japan
Prior art keywords
leu
ser
ala
pro
gly
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2001379607A
Other languages
Japanese (ja)
Inventor
Takahiro Nagase
隆弘 長瀬
Daisuke Nakajima
大輔 中島
Osamu Obara
收 小原
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Kazusa DNA Research Institute Foundation
Original Assignee
Kazusa DNA Research Institute Foundation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kazusa DNA Research Institute Foundation
Priority to JP2001379607A
Publication of JP2003180359A
Legal status: Pending

Landscapes

  • Peptides Or Proteins (AREA)

Abstract

PROBLEM TO BE SOLVED: To directly clone novel DNAs containing protein-coding regions from cDNA libraries derived from human adult whole brain, human tonsil, human adult hippocampus and human fetal whole brain, to determine their nucleotide sequences, and further to identify their functions.

SOLUTION: The DNA comprises a nucleotide sequence encoding the following polypeptide (a) or (b): (a) a polypeptide consisting of an amino acid sequence identical or substantially identical to any one of a plurality of specified amino acid sequences predicted to be novel human proteins; or (b) a polypeptide consisting of an amino acid sequence in which some amino acids are deleted, substituted or added in any one of those amino acid sequences, and having a biological activity substantially equivalent to the function of the polypeptide (a). A recombinant polypeptide encoded by the DNA and a protein comprising the polypeptide are also provided.

COPYRIGHT: (C)2003, JPO

Description

Detailed Description of the Invention

[0001] Technical Field: The present invention relates to a DNA, a gene containing the DNA, a recombinant polypeptide encoded by the DNA, and a novel recombinant protein containing the polypeptide.

[0002] Description of the Related Art: Large-scale sequencing in the Human Genome Project led to publication of the draft human genome sequence in February 2001. The ultimate goal of the Human Genome Project is presumably not merely to determine the entire nucleotide sequence of the genome, but to decipher the various phenomena of human life from its structural information, that is, from the nucleotide sequence information of DNA. Protein-coding regions make up only a small fraction of the human genome sequence, and at present they are predicted using information-science methods such as neural networks and hidden Markov models. However, the accuracy of these predictions is not yet sufficient.

[0003] In order to find novel genes, the present inventors have now succeeded in directly cloning novel DNAs containing protein-coding regions from cDNA libraries derived from human adult whole brain, human tonsil, human adult hippocampus and human fetal whole brain, and have determined their nucleotide sequences, thereby completing the present invention.

[0004] Means for Solving the Problems: In a first aspect, the present invention relates to a DNA comprising a nucleotide sequence encoding the following polypeptide (a) or (b):
(a) a polypeptide consisting of an amino acid sequence identical or substantially identical to the amino acid sequence represented by any one of SEQ ID NOs: 1 to 21 (excluding 11);
(b) a polypeptide consisting of an amino acid sequence in which some amino acids are deleted, substituted or added in the amino acid sequence represented by any one of SEQ ID NOs: 1 to 21 (excluding 11), and having a biological activity substantially equivalent to the function of the polypeptide of (a).
In a second aspect, the present invention relates to the following DNA (a) or (b):
(a) a DNA comprising, within the nucleotide sequence represented by any one of SEQ ID NOs: 1 to 21 (excluding 11), the nucleotide sequence encoding the amino acid sequence shown in the respective sequence;
(b) a DNA that hybridizes with the DNA of (a) under stringent conditions and encodes a protein having a biological activity substantially equivalent to the function of the polypeptide of (a).
The DNAs of the first and second aspects are hereinafter collectively referred to as "the DNA of the present invention". The present invention also relates to genes containing these DNAs. Furthermore, the present invention relates to a recombinant polypeptide encoded by the above DNA or gene (hereinafter also referred to as "the polypeptide of the present invention") and to a recombinant protein containing the polypeptide.

[0005] The names of the clones carrying the DNA of the present invention, the lengths of the polypeptides or proteins of the present invention, and their functions are shown in Tables 1 to 3.

[0006] The DNA of the present invention was isolated as cDNA fragments from cDNA libraries that the present inventors prepared using commercially available mRNA (Clontech) from human adult whole brain, human tonsil, human adult hippocampus and human fetal whole brain as starting material, and was then identified by determining its nucleotide sequence. Specifically, clones are randomly isolated from cDNA libraries derived from human adult whole brain, human tonsil, human adult hippocampus and human fetal whole brain, prepared according to the method of Ohara et al. (DNA Research Vol. 4, 53-59 (1997)). Next, redundant clones (clones that appear repeatedly) are removed by hybridization, in vitro transcription and translation are carried out, and the nucleotide sequences of both ends are determined for clones that yield a product of 50 kDa or more. The terminal nucleotide sequences thus obtained are then used as queries for a homology search against a database of known genes, and the entire nucleotide sequence is determined for clones found to be novel. In addition to this screening method, the 5' and 3' terminal sequences of a cDNA are mapped onto the human genome sequence, and when an unknown long gene is found in the region they flank, the full length of that cDNA is analyzed. In this way, unknown genes that could not be obtained by conventional cloning methods, which depend on known genes, can be cloned systematically. The entire region of a human-derived gene containing the DNA of the present invention can also be prepared by a PCR method such as RACE, provided sufficient care is taken to avoid short fragments and artificial errors in the resulting sequence.
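The screening steps described above can be pictured as a chain of filters. The following is only a minimal sketch under stated assumptions: the `Clone` record, its field names and the example clones are hypothetical and do not come from the patent; only the 50 kDa cutoff and the order of the filters follow the text.

```python
from dataclasses import dataclass

MIN_PRODUCT_KDA = 50.0  # size cutoff stated in the text


@dataclass(frozen=True)
class Clone:
    """Hypothetical record for one randomly isolated cDNA clone."""
    name: str
    duplicate: bool           # flagged as redundant by hybridization
    product_kda: float        # largest in vitro transcription/translation product
    matches_known_gene: bool  # outcome of the end-sequence homology search


def select_for_full_sequencing(clones):
    """Apply the three filters in the order the text describes them."""
    selected = []
    for clone in clones:
        if clone.duplicate:
            continue  # step 1: drop clones that appear repeatedly
        if clone.product_kda < MIN_PRODUCT_KDA:
            continue  # step 2: keep only clones giving a product of 50 kDa or more
        if clone.matches_known_gene:
            continue  # step 3: keep only clones that look novel
        selected.append(clone)
    return selected


# Illustrative data only; these clones are invented for the example.
candidates = [
    Clone("clone_A", duplicate=False, product_kda=72.0, matches_known_gene=False),
    Clone("clone_B", duplicate=True, product_kda=80.0, matches_known_gene=False),
    Clone("clone_C", duplicate=False, product_kda=35.0, matches_known_gene=False),
    Clone("clone_D", duplicate=False, product_kda=95.0, matches_known_gene=True),
]
to_sequence = [c.name for c in select_for_full_sequencing(candidates)]
```

Only clones that survive all three filters would go on to full-length sequencing; the genome-mapping route mentioned in the text is a separate path and is not modeled here.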

[0007] Furthermore, the present invention provides: a recombinant vector containing the DNA of the present invention or a gene containing the DNA of the present invention; a transformant carrying the recombinant vector; a method for producing the polypeptide of the present invention, a recombinant protein containing the polypeptide, or a salt thereof, characterized by culturing the transformant so that the polypeptide of the present invention or a recombinant protein containing the polypeptide is produced and accumulated, and then collecting it; and the polypeptide of the present invention, the recombinant protein containing the polypeptide, or the salt thereof thus obtained.

[0008] The present invention also relates to: a medicament comprising the DNA or gene of the present invention; a polynucleotide (DNA) comprising a nucleotide sequence encoding the polypeptide of the present invention, a partial polypeptide thereof, or a recombinant protein containing the polypeptide; an antisense nucleotide having a nucleotide sequence substantially complementary to such a nucleotide sequence; a medicament comprising the polynucleotide or the antisense nucleotide; and a medicament comprising the polypeptide of the present invention, a partial polypeptide thereof, or a recombinant protein containing them. The present invention further relates to: an antibody against the polypeptide of the present invention, a partial polypeptide thereof, a recombinant protein containing the polypeptide, or a salt thereof; a method and a kit for screening for substances that specifically interact with these substances, characterized by using the polypeptide of the present invention, a partial polypeptide thereof, a recombinant protein containing the polypeptide, a salt thereof, or an antibody against any of them; and the substances (compounds) themselves identified by the screening method.

[0009] Embodiments of the Invention: The DNA of the present invention may be any DNA as long as it consists of a nucleotide sequence encoding the polypeptide of the present invention described above. It may be a cDNA identified and isolated from a cDNA library or the like derived from cells or tissues of the human brain or of other tissues such as heart, lung, liver, spleen, kidney and testis, or it may be a synthetic DNA. The vector used to construct the library may be a bacteriophage, plasmid, cosmid, phagemid or the like. The DNA can also be amplified directly by reverse transcription coupled polymerase chain reaction (hereinafter abbreviated as "RT-PCR") using a total RNA fraction or mRNA fraction prepared from the cells or tissues mentioned above.

[0010] An amino acid sequence substantially identical to the amino acid sequence represented by any one of SEQ ID NOs: 1 to 21 (excluding 11) means an amino acid sequence whose degree of homology with the entire amino acid sequence represented by any one of SEQ ID NOs: 1 to 21 (excluding 11) is, as an overall average, about 70% or more, preferably about 80% or more, more preferably about 90% or more, and particularly preferably about 95% or more. Accordingly, examples of a polypeptide consisting of an amino acid sequence substantially identical to the amino acid sequence represented by any one of SEQ ID NOs: 1 to 21 (excluding 11) include polypeptides that have the above homology to the amino acid sequence represented by each SEQ ID NO and that have a biological activity (function) substantially equivalent to the function of the polypeptide consisting of that amino acid sequence. Here, "substantially equivalent" means that the activities (functions) are qualitatively the same. The polypeptide of the present invention also includes polypeptides consisting of an amino acid sequence in which some amino acids (preferably about 1 to 20, more preferably about 1 to 10, still more preferably a few) in the amino acid sequence represented by any one of SEQ ID NOs: 1 to 21 (excluding 11) are deleted, substituted or added, or of an amino acid sequence combining such changes, and having a biological activity (function) substantially equivalent to the function of the polypeptide consisting of the amino acid sequence represented by any one of SEQ ID NOs: 1 to 21 (excluding 11).
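As an illustration of how the percentage tiers above might be applied, the sketch below computes percent identity over two sequences that are assumed to be already aligned (gaps written as `-`) and maps the result onto the four thresholds. The patent does not say how "homology" is calculated, so using simple identity over alignment columns is an assumption, as are the function names and the tier labels.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same residue in both sequences.

    Assumes the inputs are pre-aligned and of equal length, with '-' for gaps.
    Columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no alignment columns to compare")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


# Thresholds taken from the text ("about 70%", "about 80%", ...); labels are ours.
TIERS = [
    (95.0, "particularly preferred"),
    (90.0, "more preferred"),
    (80.0, "preferred"),
    (70.0, "substantially identical"),
]


def homology_tier(pct: float) -> str:
    """Return the label of the highest tier whose threshold is met."""
    for threshold, label in TIERS:
        if pct >= threshold:
            return label
    return "below threshold"


example_pct = percent_identity("MLSAPGLLSA", "MLSAPGLLSG")
example_tier = homology_tier(example_pct)
```

Because the text qualifies every threshold with "about", the hard cutoffs here are a simplification; a real comparison would also need a defined alignment method, and would still have to show equivalent biological activity.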

[0011] A polypeptide consisting of an amino acid sequence substantially identical to the amino acid sequence represented by any one of SEQ ID NOs: 1 to 21 (excluding 11), or a polypeptide consisting of an amino acid sequence in which some of its amino acids are deleted, substituted or added, can be prepared easily by appropriately combining methods well known to those skilled in the art, such as site-directed mutagenesis, homologous gene recombination, primer extension and PCR. For the polypeptide to retain substantially equivalent biological activity, one possibility is substitution between amino acids of the same class (polar or nonpolar amino acids, hydrophobic or hydrophilic amino acids, positively or negatively charged amino acids, aromatic amino acids, and so on). To maintain substantially equivalent biological activity, it is also desirable that the amino acids within the functional domains contained in each polypeptide of the present invention be retained.
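The idea of substitution within one amino acid class can be sketched as a lookup. The text names the classes but does not assign residues to them, so the grouping below is a common textbook convention chosen for illustration, not the patent's definition; the function name is likewise hypothetical.

```python
# Assumed grouping (one common textbook convention); the patent lists the
# class names but not their members.
AMINO_ACID_CLASSES = {
    "nonpolar": set("AVLIMPG"),
    "aromatic": set("FWY"),
    "polar_uncharged": set("STNQC"),
    "positive": set("KRH"),
    "negative": set("DE"),
}


def residue_class(residue: str) -> str:
    """Return the class name for a one-letter amino acid code."""
    for name, members in AMINO_ACID_CLASSES.items():
        if residue.upper() in members:
            return name
    raise ValueError(f"unknown amino acid code: {residue!r}")


def is_same_class_substitution(original: str, replacement: str) -> bool:
    """True when both residues fall in the same class of the assumed grouping."""
    return residue_class(original) == residue_class(replacement)
```

Under this grouping, leucine to isoleucine counts as a same-class change while lysine to glutamate does not. Whether a same-class change actually preserves activity is an experimental question that this check cannot answer.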

[0012] Furthermore, the DNA of the present invention includes a DNA comprising, within the nucleotide sequence represented by any one of SEQ ID NOs: 1 to 21 (excluding 11), the nucleotide sequence encoding the amino acid sequence shown in the respective sequence, and a DNA that hybridizes with that DNA under stringent conditions and encodes a polypeptide (protein) having a biological activity (function) equivalent to the function of the polypeptide consisting of the amino acid sequence shown in each sequence. Examples of DNA capable of hybridizing under such conditions with a DNA comprising the nucleotide sequence that encodes the amino acid sequence shown in any one of SEQ ID NOs: 1 to 21 (excluding 11) include DNA containing a nucleotide sequence whose degree of homology with the entire nucleotide sequence of that DNA is, as an overall average, about 80% or more, preferably about 90% or more, and more preferably about 95% or more. Hybridization can be carried out according to methods known in the art or methods based on them, such as those described in Current Protocols in Molecular Biology (edited by Frederick M. Ausubel et al., 1987). When a commercially available library is used, hybridization can be carried out according to the method described in the accompanying instructions. Here, "stringent conditions" means, for example, conditions under which a DNA hybridizes to a DNA probe of the present invention in Southern blot hybridization performed by hybridizing at 65°C in an aqueous solution of 1 mM sodium EDTA, 0.5 M sodium hydrogen phosphate (pH 7.2) and 7% SDS, and washing the membrane at 65°C in an aqueous solution of 1 mM sodium EDTA, 40 mM sodium hydrogen phosphate (pH 7.2) and 1% SDS.
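The two buffer compositions given for the stringent conditions can be turned into pipetting volumes with the dilution relation C1·V1 = C2·V2. In the sketch below the final concentrations come from the text, while the stock concentrations (0.5 M EDTA, 1 M sodium phosphate pH 7.2, 20% SDS) and all names are assumptions made for illustration.

```python
# Final concentrations from the text. EDTA and phosphate are in mM, SDS in % (w/v).
HYBRIDIZATION_BUFFER = {"EDTA": 1.0, "sodium phosphate": 500.0, "SDS": 7.0}
WASH_BUFFER = {"EDTA": 1.0, "sodium phosphate": 40.0, "SDS": 1.0}

# Assumed stock solutions, in the same units as above (not given in the patent).
STOCKS = {"EDTA": 500.0, "sodium phosphate": 1000.0, "SDS": 20.0}


def stock_volume_ml(final_conc: float, stock_conc: float, final_volume_ml: float) -> float:
    """Solve C1*V1 = C2*V2 for the stock volume V1."""
    if final_conc > stock_conc:
        raise ValueError("stock is too dilute for the requested concentration")
    return final_conc * final_volume_ml / stock_conc


def recipe(final_concs: dict, final_volume_ml: float) -> dict:
    """Volumes (mL) of each stock plus water needed to reach the final volume."""
    volumes = {
        name: stock_volume_ml(conc, STOCKS[name], final_volume_ml)
        for name, conc in final_concs.items()
    }
    water = final_volume_ml - sum(volumes.values())
    if water < 0:
        raise ValueError("stock volumes exceed the final volume")
    volumes["water"] = water
    return volumes


hyb_100ml = recipe(HYBRIDIZATION_BUFFER, 100.0)
wash_100ml = recipe(WASH_BUFFER, 100.0)
```

With these assumed stocks, the phosphate stock alone makes up half of the hybridization buffer, which is why a concentrated phosphate stock is needed; different stocks would change the volumes but not the final composition stated in the text.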

[0013] The DNA of the present invention can be cloned either by PCR amplification using synthetic DNA primers having an appropriate nucleotide sequence, such as one corresponding to part of the polypeptide of the present invention, or by selecting DNA inserted into an appropriate vector through hybridization with a labeled DNA fragment or synthetic DNA encoding part or all of the polypeptide of the present invention. Hybridization can be carried out, for example, according to the methods described in Current Protocols in Molecular Biology (edited by Frederick M. Ausubel et al., 1987) cited above. When a commercially available library is used, hybridization can be carried out according to the method described in the accompanying instructions. The cloned DNA encoding the polypeptide can be used as it is or, if desired, after digestion with a restriction enzyme or addition of a linker, depending on the purpose. The DNA may have ATG as a translation initiation codon at its 5' end and TAA, TGA or TAG as a translation termination codon at its 3' end. These translation initiation and termination codons can also be added using an appropriate synthetic DNA adaptor.
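The start and stop codon requirement described above is easy to check on a sequence string. The following is a minimal sketch that assumes the input is an in-frame coding sequence given 5' to 3' on the sense strand; the function names are hypothetical, and appending codons in a string only mimics what a synthetic adaptor would do at the bench.

```python
START_CODON = "ATG"
STOP_CODONS = ("TAA", "TGA", "TAG")  # the three termination codons named in the text


def has_start_and_stop(cds: str) -> bool:
    """True if an in-frame coding sequence begins with ATG and ends with a stop codon."""
    cds = cds.upper()
    return (
        len(cds) >= 6
        and len(cds) % 3 == 0
        and cds.startswith(START_CODON)
        and cds[-3:] in STOP_CODONS
    )


def add_missing_codons(cds: str, stop: str = "TAA") -> str:
    """Add ATG at the 5' end and a stop codon at the 3' end when they are absent."""
    cds = cds.upper()
    if stop not in STOP_CODONS:
        raise ValueError("stop must be TAA, TGA or TAG")
    if len(cds) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three (in frame)")
    if not cds.startswith(START_CODON):
        cds = START_CODON + cds
    if cds[-3:] not in STOP_CODONS:
        cds = cds + stop
    return cds


fragment = "CTGTCTGCT"  # illustrative fragment encoding Leu-Ser-Ala
completed = add_missing_codons(fragment)
```

The sketch does not look for internal stop codons or verify the reading frame beyond the length check, both of which would matter for a real expression construct.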

[0014] An expression vector for the protein of the present invention can be constructed according to methods known in the art. For example, it can be produced by (1) excising a DNA fragment containing the DNA of the present invention or a gene containing the DNA of the present invention, and (2) ligating the DNA fragment downstream of a promoter in an appropriate expression vector. Vectors that can be used include plasmids derived from Escherichia coli (e.g., pBR322, pBR325, pUC18, pUC118), plasmids derived from Bacillus subtilis (e.g., pUB110, pTP5, pC194), plasmids derived from yeast (e.g., pSH19, pSH15), bacteriophages such as λ phage, and animal viruses such as retrovirus, vaccinia virus and baculovirus. The promoter used in the present invention may be any promoter appropriate for the host used for gene expression. For example, when the host is Escherichia coli, the trp promoter, lac promoter, recA promoter, λPL promoter, lpp promoter and the like are preferred; when the host is Bacillus subtilis, the SPO1 promoter, SPO2 promoter, penP promoter and the like; and when the host is yeast, the PHO5 promoter, PGK promoter, GAP promoter, ADH promoter and the like. When animal cells are used as the host, examples include the SRα promoter, SV40 promoter, LTR promoter, CMV promoter and HSV-TK promoter.
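The host-to-promoter pairings listed above lend themselves to a simple lookup table. In this sketch the promoter names are taken from the text, while the dictionary keys, the ASCII spellings ("lambda PL", "SRalpha") and the helper functions are naming choices made for illustration.

```python
# Promoters named in the text for each host type.
PROMOTERS_BY_HOST = {
    "Escherichia coli": ["trp", "lac", "recA", "lambda PL", "lpp"],
    "Bacillus subtilis": ["SPO1", "SPO2", "penP"],
    "yeast": ["PHO5", "PGK", "GAP", "ADH"],
    "animal cell": ["SRalpha", "SV40", "LTR", "CMV", "HSV-TK"],
}


def promoters_for(host: str) -> list:
    """Return the promoters the text lists for a host type."""
    try:
        return list(PROMOTERS_BY_HOST[host])
    except KeyError:
        raise ValueError(f"no promoters listed for host {host!r}") from None


def is_listed_pair(host: str, promoter: str) -> bool:
    """True if the text names this promoter for this host type."""
    return promoter in PROMOTERS_BY_HOST.get(host, [])
```

The text states that any promoter suited to the host may be used, so a pair missing from this table is not thereby excluded; the table only records the examples given.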

[0015] In addition to the above, an enhancer, a splicing signal, a poly(A) addition signal, a selection marker, an SV40 replication origin and the like known in the art can be added to the expression vector if desired. If necessary, the protein encoded by the DNA of the present invention can also be expressed as a fusion protein with another protein (for example, glutathione S-transferase or protein A). Such a fusion protein can be cleaved with an appropriate protease and separated into the individual proteins.

[0016] Host cells that can be used include, for example, bacteria of the genus Escherichia, bacteria of the genus Bacillus, yeast, insect cells, insects and animal cells. Specific examples of Escherichia bacteria include Escherichia coli K12 DH1 (Proc. Natl. Acad. Sci. USA, vol. 60, 160 (1968)), JM103 (Nucleic Acids Research, vol. 9, 309 (1981)), JA221 (Journal of Molecular Biology, vol. 120, 517 (1978)) and HB101 (Journal of Molecular Biology, vol. 41, 459 (1969)). Examples of Bacillus bacteria include Bacillus subtilis MI114 (Gene, vol. 24, 255 (1983)) and 207-21 (Journal of Biochemistry, vol. 95, 87 (1984)). Examples of yeast include Saccharomyces cerevisiae AH22, AH22R-, NA87-11A, DKD-5D and 20B-12, Schizosaccharomyces pombe NCYC1913 and NCYC2036, and Pichia pastoris. Examples of animal cells include monkey COS-7 cells, Vero cells, Chinese hamster CHO cells (hereinafter abbreviated as CHO cells), dhfr gene-deficient CHO cells, mouse L cells, mouse AtT-20 cells, mouse myeloma cells, rat GH3 cells and human FL cells.

[0017] These host cells can be transformed according to methods known in the art. For example, the following references can be consulted: Proc. Natl. Acad. Sci. USA, vol. 69, 2110 (1972); Gene, vol. 17, 107 (1982); Molecular & General Genetics, vol. 168, 111 (1979); Methods in Enzymology, vol. 194, 182-187 (1991); Proc. Natl. Acad. Sci. USA, vol. 75, 1929 (1978); Cell Engineering Supplement 8, New Cell Engineering Experimental Protocols, 263-267 (1995) (published by Shujunsha); and Virology, vol. 52, 456 (1973).

[0018] The transformant thus obtained, transformed with an expression vector containing the DNA of the present invention or a gene containing the DNA of the present invention, can be cultured according to methods known in the art. For example, when the host is an Escherichia bacterium, culture is usually carried out at about 15 to 43°C for about 3 to 24 hours, with aeration and stirring if necessary. When the host is a Bacillus bacterium, culture is usually carried out at about 30 to 40°C for about 6 to 24 hours, with aeration and stirring if necessary. When a transformant whose host is yeast is cultured, culture is usually carried out at about 20 to 35°C for about 24 to 72 hours in a medium adjusted to a pH of about 5 to 8, with aeration and stirring as needed. When a transformant whose host is an animal cell is cultured, culture is usually carried out at about 30 to 40°C for about 15 to 60 hours in a medium adjusted to a pH of about 6 to 8, with aeration and stirring as needed.
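The culture conditions above amount to a small table of ranges per host type. The sketch below encodes those ranges and checks a planned condition against them; the numbers come from the text, while the type, the dictionary keys and the function name are hypothetical, and the text's "about" is simplified to hard bounds.

```python
from typing import NamedTuple, Optional, Tuple


class CultureRange(NamedTuple):
    temp_c: Tuple[float, float]          # temperature range in degrees Celsius
    hours: Tuple[float, float]           # culture time range in hours
    ph: Optional[Tuple[float, float]]    # medium pH range, if the text gives one


# Ranges as stated in the text for each host type.
CULTURE_RANGES = {
    "Escherichia": CultureRange((15, 43), (3, 24), None),
    "Bacillus": CultureRange((30, 40), (6, 24), None),
    "yeast": CultureRange((20, 35), (24, 72), (5, 8)),
    "animal cell": CultureRange((30, 40), (15, 60), (6, 8)),
}


def within_typical_range(host: str, temp_c: float, hours: float,
                         ph: Optional[float] = None) -> bool:
    """Check a planned culture condition against the ranges listed for the host."""
    r = CULTURE_RANGES[host]
    ok = r.temp_c[0] <= temp_c <= r.temp_c[1] and r.hours[0] <= hours <= r.hours[1]
    # pH is only checked when the text gives a range and a value is supplied.
    if r.ph is not None and ph is not None:
        ok = ok and r.ph[0] <= ph <= r.ph[1]
    return ok
```

For example, an overnight Escherichia culture at 37°C falls inside the listed range, whereas a yeast culture in medium at pH 4 does not. The ranges describe what is usual, so a condition outside them is unusual rather than impossible.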

[0019] To isolate and purify the polypeptide or protein of the present invention from the above culture, for example, the bacterial cells or cells are collected by a known method after culturing, suspended in an appropriate buffer, and disrupted by sonication, lysozyme and/or freeze-thawing, after which a crude protein extract is obtained by centrifugation or filtration. The buffer may contain a protein denaturant such as urea or guanidine hydrochloride, or a surfactant such as Triton X-100 (TM). When the protein is secreted into the culture medium, the bacterial cells or cells are separated from the supernatant by a known method after culturing, and the supernatant is collected. The protein contained in the culture supernatant or extract thus obtained can be purified by an appropriate combination of known separation and purification methods. The polypeptide (protein) of the present invention thus obtained can be converted into a salt by a known method or a method based on it; conversely, when it is obtained as a salt, it can be converted into the free form or another salt by a known method or a method based on it. Furthermore, the protein produced by the recombinant can be modified as desired, or have part of the polypeptide removed, by treating it with an appropriate protein-modifying enzyme such as trypsin or chymotrypsin before or after purification. The presence of the polypeptide (protein) of the present invention or a salt thereof can be determined by various binding assays, enzyme immunoassays using specific antibodies, and the like.

[0020] The C-terminus of the polypeptide (protein) of the present invention is usually a carboxyl group (-COOH) or a carboxylate (-COO⁻), but it may also be an amide (-CONH₂) or an ester (-COOR). Examples of R in the ester include C1-6 alkyl groups such as methyl, ethyl, n-propyl, isopropyl and n-butyl; C3-8 cycloalkyl groups such as cyclopentyl and cyclohexyl; C6-12 aryl groups such as phenyl and α-naphthyl; C7-14 aralkyl groups such as phenyl-C1-2 alkyl groups (e.g., benzyl and phenethyl) and α-naphthyl-C1-2 alkyl groups (e.g., α-naphthylmethyl); and the pivaloyloxymethyl ester commonly used as an oral ester.

[0021] When the polypeptide (protein) of the present invention has a carboxyl group (or carboxylate) at a position other than the C-terminus, those in which that carboxyl group is amidated or esterified are also included in the protein of the present invention. As the ester in this case, for example, the C-terminal esters described above are used. The protein of the present invention further includes those in which the amino group of the N-terminal methionine residue is protected with a protecting group (for example, a C1-6 acyl group such as a formyl group or an acetyl group); those in which an N-terminal glutamic acid residue generated by cleavage in vivo has been converted to pyroglutamate; those in which a group on the side chain of an amino acid in the molecule, such as OH, COOH, NH₂ or SH, is protected with a suitable protecting group (for example, a C1-6 acyl group such as a formyl group or an acetyl group); and conjugated proteins such as so-called glycoproteins, to which sugar chains are bound.

【0022】本発明の蛋白質の部分ポリペプチドとして
は、前記した本発明ポリペプチド(蛋白質)の部分ペプ
チドであって、実質的に同質の活性を有するものであれ
ばいずれのものでもよい。例えば、本発明ポリペプチド
(蛋白質)の構成アミノ酸配列のうち少なくとも10個
以上、好ましくは50個以上、さらに好ましくは70個
以上、より好ましくは100個以上、最も好ましくは2
00個以上のアミノ酸配列を有し、例えば、本発明のポ
リペプチドの機能と実質的に同質の生物学的活性を有す
るするペプチドなどが用いられる。本発明の部分ポリペ
プチドとしては、例えば、各機能ドメインを含むものが
好ましい。又、本発明の部分ペプチドはC末端が通常カ
ルボキシル基(−COOH)またはカルボキシレート
(−COO-)であるが、前記した本発明の蛋白質のご
とく、C末端がアミド(−CONH2)またはエステル
(−COOR)であってもよい。さらに、本発明の部分
ペプチドには、前記した本発明の蛋白質と同様に、N末
端のメチオニン残基のアミノ基が保護基で保護されてい
るもの、N端側が生体内で切断され生成したグルタミル
基がピログルタミン酸化したもの、分子内のアミノ酸の
側鎖上の置換基が適当な保護基で保護されているもの、
あるいは糖鎖が結合したいわゆる糖ペプチドなどの複合
ペプチドなども含まれる。本発明の部分ペプチドは、例
えば、試薬、実験の際の標準物質、又は免疫原若しくは
その一部として使用することが出来る。
The partial polypeptide of the protein of the present invention may be any partial peptide of the above-mentioned polypeptide (protein) of the present invention as long as it has substantially the same activity. For example, a peptide is used that has an amino acid sequence of at least 10, preferably 50 or more, more preferably 70 or more, still more preferably 100 or more, and most preferably 200 or more amino acids out of the constituent amino acid sequence of the polypeptide (protein) of the present invention, and that has, for example, a biological activity substantially equivalent to the function of the polypeptide of the present invention. As the partial polypeptide of the present invention, for example, those containing each functional domain are preferable. The C-terminus of the partial peptide of the present invention is usually a carboxyl group (-COOH) or carboxylate (-COO⁻), but, as with the protein of the present invention described above, the C-terminus may be an amide (-CONH2) or an ester (-COOR). Further, like the protein of the present invention described above, the partial peptide of the present invention also includes those in which the amino group of the N-terminal methionine residue is protected by a protecting group; those in which a glutamyl group produced by cleavage of the N-terminal side in vivo has been converted to pyroglutamic acid; those in which a substituent on the side chain of an amino acid in the molecule is protected by a suitable protecting group; and complex peptides such as so-called glycopeptides to which sugar chains are bound. The partial peptide of the present invention can be used, for example, as a reagent, a standard substance in experiments, or an immunogen or a part thereof.

【0023】本発明ポリペプチド(蛋白質)又はその部
分ペプチドの塩としては、とりわけ生理学的に許容され
る酸付加塩が好ましい。この様な塩としては、例えば、
無機酸(例えば、塩酸、リン酸、臭化水素酸、硫酸)と
の塩、あるいは有機酸(例えば、酢酸、ギ酸、プロピオ
ン酸、フマル酸、マレイン酸、コハク酸、酒石酸、クエ
ン酸、リンゴ酸、蓚酸、安息香酸、メタンスルホン酸、
ベンゼンスルホン酸)との塩などが用いられる。
As the salt of the polypeptide (protein) of the present invention or its partial peptide, a physiologically acceptable acid addition salt is particularly preferable. Examples of such salts include salts with inorganic acids (e.g., hydrochloric acid, phosphoric acid, hydrobromic acid, sulfuric acid) and salts with organic acids (e.g., acetic acid, formic acid, propionic acid, fumaric acid, maleic acid, succinic acid, tartaric acid, citric acid, malic acid, oxalic acid, benzoic acid, methanesulfonic acid, benzenesulfonic acid).

【0024】本発明ポリペプチド(蛋白質)、その部分
ペプチドもしくはそれらの塩またはそれらのアミド体
は、当該技術分野で公知の化学合成方法を用いて調製す
ることも出来る。例えば、通常市販されている蛋白質合
成用樹脂を用い、α−アミノ基と側鎖官能基を適当に保
護したアミノ酸を、目的とする蛋白質の配列通りに、当
業界において自体公知の各種縮合方法に従い、樹脂上で
縮合させる。反応の最後に樹脂から蛋白質を切り出すと
同時に各種保護基を除去し、さらに高希釈溶液中で分子
内ジスルフィド結合形成反応を実施し、目的の蛋白質、
その部分ペプチドまたはそれらのアミド体を取得する。
上記した保護アミノ酸の縮合に関しては、例えば、DC
C、N,N'-ジイソプロピルカルボジイミド、及びN-エチル
-N'-(3-ジメチルアミノプロリル)カルボジイミドのよ
うなカルボジイミド類に代表される蛋白質合成に使用で
きる各種活性化試薬を用いることができる。これらによ
る活性化にはラセミ化抑制添加剤(例えば、HOBt, HOOB
t)とともに保護アミノ酸を直接樹脂に添加するかまた
は、対称とする酸無水物またはHOBtエステルあるいはHO
OBtエステルとしてあらかじめ保護アミノ酸の活性化を
行なった後に樹脂に添加することができる。
The polypeptide (protein) of the present invention, a partial peptide thereof, a salt thereof or an amide thereof can also be prepared by a chemical synthesis method known in the art. For example, using a commercially available resin for protein synthesis, amino acids whose α-amino group and side-chain functional groups are appropriately protected are condensed on the resin in the order of the sequence of the target protein, according to various condensation methods known per se in the art. At the end of the reaction, the protein is cleaved from the resin and, at the same time, the various protecting groups are removed; an intramolecular disulfide bond-forming reaction is then carried out in a highly diluted solution to obtain the target protein, its partial peptide, or an amide thereof. For the condensation of the protected amino acids described above, various activating reagents usable for protein synthesis can be used, typified by carbodiimides such as DCC, N,N'-diisopropylcarbodiimide and N-ethyl-N'-(3-dimethylaminopropyl)carbodiimide. For activation with these reagents, the protected amino acid can be added directly to the resin together with a racemization-suppressing additive (e.g., HOBt, HOOBt), or the protected amino acid can be activated in advance as a symmetric acid anhydride, an HOBt ester or an HOOBt ester and then added to the resin.

【0025】保護アミノ酸の活性化や樹脂との縮合に用
いられる溶媒としては、酸アミド類、ハロゲン化炭化水
素類、アルコール類、スルオキシド類、及びエーテル類
等、当業界において蛋白質縮合反応に使用しうることが
知られている溶媒から適宜選択されうる。反応温度は蛋
白質結合形成反応に使用され得ることが知られている範
囲から適宜選択される。活性化されたアミノ酸誘導体は
通常1.5〜4倍過剰で用いられる。ニンヒドリン反応
を用いたテストの結果、縮合が不十分な場合には保護基
の脱離を行うことなく縮合反応を繰り返すことにより十
分な縮合を行なうことができる。反応を繰り返しても十
分な縮合が得られないときには、無水酢酸またはアセチ
ルイミダゾールを用いて未反応アミノ酸をアセチル化し
て、後の反応に影響を及ぼさないようにすることができ
る。原料の各アミノ基、カルボキシル基、及びセリン水
酸基等の保護基としても、当該技術分野において、通常
使用される基を使用することができる。原料の反応に関
与すべきでない官能基の保護ならびに保護基、およびそ
の保護基の脱離、反応に関与する官能基の活性化などは
公知の基または公知の手段から適宜選択しうる。
The solvent used for activation of the protected amino acids and for condensation with the resin can be appropriately selected from solvents known in the art to be usable for protein condensation reactions, such as acid amides, halogenated hydrocarbons, alcohols, sulfoxides and ethers. The reaction temperature is appropriately selected from the range known to be usable for protein bond-forming reactions. The activated amino acid derivative is usually used in a 1.5- to 4-fold excess. When a test using the ninhydrin reaction shows that condensation is insufficient, sufficient condensation can be achieved by repeating the condensation reaction without removing the protecting group. When sufficient condensation is not obtained even after repeating the reaction, the unreacted amino acid can be acetylated with acetic anhydride or acetylimidazole so that it does not affect subsequent reactions. As protecting groups for the amino groups, carboxyl groups, serine hydroxyl groups and the like of the raw materials, groups commonly used in the art can be used. The protection of functional groups that should not participate in the reaction of the raw materials, the protecting groups themselves, the removal of those protecting groups, and the activation of the functional groups that do participate in the reaction can be appropriately selected from known groups or known means.

【0026】本発明の部分ペプチドまたはそれらの塩
は、当該技術分野において自体公知のペプチドの合成法
に従って、あるいは本発明の蛋白質を適当なペプチダー
ゼで切断することによって製造することができる。ペプ
チドの合成法としては、例えば、固相合成法、液相合成
法のいずれによっても良い。公知の縮合方法や保護基の
脱離としては、例えば、以下の(1)〜(3)に記載さ
れた方法が挙げられる。 (1)泉屋信夫他、ペプチド合成の基礎と実験、 丸善
(株) (1975年) (2)矢島治明 および榊原俊平、生化学実験講座 1、
蛋白質の化学IV、 205、(1977年) (3)矢島治明監修、続医薬品の開発 第14巻 ペプチド
合成 広川書店 反応後の精製も自体公知の方法、例えば、溶媒抽出・蒸
留・カラムクロマトグラフィー・液体クロマトグラフィ
ー・再結晶などを組み合わせて本発明の部分ペプチドを
精製単離することができる。上記方法で得られる部分ペ
プチドが遊離体である場合は、公知の方法によって適当
な塩に変換することができるし、逆に塩で得られた場合
は、公知の方法によって遊離体に変換することができ
る。
The partial peptide of the present invention or a salt thereof can be produced according to a peptide synthesis method known per se in the art, or by cleaving the protein of the present invention with an appropriate peptidase. The peptide synthesis method may be, for example, either a solid phase synthesis method or a liquid phase synthesis method. Examples of known condensation methods and methods for removing protecting groups include those described in (1) to (3) below.
(1) Nobuo Izumiya et al., Fundamentals and Experiments of Peptide Synthesis, Maruzen Co., Ltd. (1975)
(2) Haruaki Yajima and Shunpei Sakakibara, Biochemistry Experiment Course 1, Protein Chemistry IV, 205 (1977)
(3) Haruaki Yajima (supervising editor), Development of Pharmaceuticals, Continued, Vol. 14, Peptide Synthesis, Hirokawa Shoten
After the reaction, the partial peptide of the present invention can be purified and isolated by a combination of methods known per se, for example, solvent extraction, distillation, column chromatography, liquid chromatography and recrystallization. When the partial peptide obtained by the above method is a free form, it can be converted into an appropriate salt by a known method; conversely, when it is obtained as a salt, it can be converted into a free form by a known method.

【0027】本発明ポリペプチド(蛋白質)、その部分
ペプチドまたはそれらの塩に対する抗体は、それらを認
識し得るものであれば、ポリクローナル抗体、モノクロ
ーナル抗体の何れであってもよい。本発明ポリペプチド
(蛋白質)、その部分ペプチドまたはそれらの塩に対す
る抗体は、本発明ポリペプチド(蛋白質)又はその部分
ペプチドを抗原として用い、公知の抗体または抗血清の
製造法に従って製造することができる。本発明の抗体
は、体液や組織などの被検体中に存在する本発明ポリペ
プチド(蛋白質)等を検出するために使用することがで
きる。また、これらを精製するために使用する抗体カラ
ムの作製、精製時の各分画中の本発明ポリペプチド(蛋
白質)の検出、被検細胞内における本発明ポリペプチド
(蛋白質)の挙動の分析などのために使用することがで
きる。
The antibody against the polypeptide (protein) of the present invention, its partial peptide or a salt thereof may be either a polyclonal antibody or a monoclonal antibody as long as it can recognize them. Such an antibody can be produced according to a known method for producing an antibody or antiserum, using the polypeptide (protein) of the present invention or a partial peptide thereof as an antigen. The antibody of the present invention can be used to detect the polypeptide (protein) of the present invention or the like present in a specimen such as a body fluid or tissue. It can also be used to prepare an antibody column for purifying them, to detect the polypeptide (protein) of the present invention in each fraction during purification, and to analyze the behavior of the polypeptide (protein) of the present invention in test cells.

【0028】更に、本発明の抗体は、公知の方法による
被検液中の本発明ポリペプチド(蛋白質)等の定量、特
に、モノクローナル抗体を使用したサンドイッチ免疫測
定法による定量、及び組織染色等による検出などに使用
することができる。それによって、例えば、本発明ポリ
ペプチド(蛋白質)等が関与する疾病の診断を行なうこ
とができる。これらの目的には、抗体分子そのものを用
いてもよく、また、抗体分子のF(ab')2 、Fab'、
あるいはFab画分を用いてもよい。本発明の抗体を用
いる本発明の蛋白質等の定量法は、特に制限されるべき
ものではなく、被測定液中の抗原量(例えば、蛋白質
量)に対応した抗体、抗原もしくは抗体−抗原複合体の
量を化学的または物理的手段により検出し、これを既知
量の抗原を含む標準液を用いて作製した標準曲線より算
出する測定法であれば、いずれの測定法を用いてもよ
い。例えば、ネフロメトリー、競合法、イムノメトリッ
ク法およびサンドイッチ法が好適に用いられるが、感
度、特異性の点で、後述するサンドイッチ法を用いるの
が好ましい。標識物質を用いる測定法に用いられる標識
剤としては、当該技術分野で公知の、例えば、放射性同
位元素、酵素、蛍光物質、発光物質などを用いることが
出来る。
Further, the antibody of the present invention can be used for quantifying the polypeptide (protein) of the present invention or the like in a test liquid by a known method, in particular by a sandwich immunoassay using a monoclonal antibody, and for detection by tissue staining and the like. This makes it possible, for example, to diagnose a disease in which the polypeptide (protein) of the present invention or the like is involved. For these purposes, the antibody molecule itself may be used, or the F(ab')2, Fab' or Fab fraction of the antibody molecule may be used. The method for quantifying the protein of the present invention or the like using the antibody of the present invention is not particularly limited; any measuring method may be used in which the amount of antibody, antigen or antibody-antigen complex corresponding to the amount of antigen (for example, the amount of protein) in the liquid to be measured is detected by chemical or physical means and is then calculated from a standard curve prepared using standard solutions containing known amounts of the antigen. For example, nephelometry, the competitive method, the immunometric method and the sandwich method are suitably used, but in terms of sensitivity and specificity the sandwich method described later is preferable. As the labeling agent used in measurement methods that use a labeling substance, agents known in the art, for example radioisotopes, enzymes, fluorescent substances and luminescent substances, can be used.
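The standard-curve calculation described above can be sketched as follows. This is only an illustration under stated assumptions: the function name `concentration_from_signal`, the piecewise-linear interpolation and the numeric standards are hypothetical and are not taken from the patent, which does not specify how the curve is fitted.

```python
from typing import List, Tuple


def concentration_from_signal(
    standards: List[Tuple[float, float]], signal: float
) -> float:
    """Read an antigen concentration off a standard curve.

    standards: (known concentration, measured signal) pairs.
    Assumption: the signal rises monotonically with concentration and
    straight-line interpolation between neighbouring standards is adequate.
    """
    if len(standards) < 2:
        raise ValueError("at least two standards are required")
    points = sorted(standards, key=lambda p: p[1])  # order by signal
    if not points[0][1] <= signal <= points[-1][1]:
        raise ValueError("signal lies outside the range of the standard curve")
    for (c0, s0), (c1, s1) in zip(points, points[1:]):
        if s0 <= signal <= s1:
            if s1 == s0:
                return c0
            return c0 + (c1 - c0) * (signal - s0) / (s1 - s0)
    raise ValueError("signal could not be bracketed by the standards")


# Hypothetical standards: (concentration in ng/mL, absorbance).
example_standards = [(0.0, 0.05), (10.0, 0.25), (50.0, 1.05)]
estimated = concentration_from_signal(example_standards, 0.65)
```

A sample outside the range of the standards is rejected rather than extrapolated, since the text only describes calculation from the prepared curve.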

【0029】これらの測定・検出方法に関する一般的な
技術手段の詳細については、総説、成書などを参照する
ことができる。例えば、入江 寛編「続ラジオイムノア
ッセイ〕(講談社、昭和54年発行)、石川栄治ら編
「酵素免疫測定法」(第3版)(医学書院、昭和62年
発行)、「Methods in ENZYMOLOGY」Vol. 70(Immunoche
mical Techniques(Part A))、 同書 Vol. 73(Immunoche
mical Techniques(PartB))、 同書 Vol. 74(Immunochem
ical Techniques(Part C))、 同書 Vol. 84(Immunochem
ical Techniques(Part D:Selected Immunoassays))、
同書 Vol. 92(Immunochemical Techniques(Part E:Mono
clonal Antibodies and General Immunoassay Method
s))、 同書 Vol. 121(Immunochemical Techniques(Part
I:HybridomaTechnology and Monoclonal Antibodies))
(以上、アカデミックプレス社発行)などを参照すること
ができる。
For details of the general technical means relating to these measurement and detection methods, reference can be made to reviews, textbooks and the like. For example, reference can be made to Hiroshi Irie (ed.), "Radioimmunoassay, Second Series" (Kodansha, 1979); Eiji Ishikawa et al. (eds.), "Enzyme Immunoassay" (3rd edition) (Igaku Shoin, 1987); "Methods in ENZYMOLOGY" Vol. 70 (Immunochemical Techniques (Part A)); ibid. Vol. 73 (Immunochemical Techniques (Part B)); ibid. Vol. 74 (Immunochemical Techniques (Part C)); ibid. Vol. 84 (Immunochemical Techniques (Part D: Selected Immunoassays)); ibid. Vol. 92 (Immunochemical Techniques (Part E: Monoclonal Antibodies and General Immunoassay Methods)); and ibid. Vol. 121 (Immunochemical Techniques (Part I: Hybridoma Technology and Monoclonal Antibodies)) (all of the above published by Academic Press).

【0030】本発明ポリペプチド(蛋白質)又はその部
分ポリペプチドをコードするDNAに実質的に相補的な
塩基配列を有するアンチセンスオリゴヌクレオチド(D
NA)としては、当該DNAの塩基配列に実質的に相補
的な塩基配列を有し、該DNAの発現を抑制し得る作用
を有するものであれば、いずれのアンチセンスDNAで
あってもよい。実質的に相補的な塩基配列とは、例え
ば、本発明DNAに相補的な塩基配列の全塩基配列また
は部分塩基配列と好ましくは約90%以上、より好まし
くは約95%以上、最も好ましくは100%の相同性を
有する塩基配列などが挙げられる。又、これらアンチセ
ンスDNAと同様の作用を有する核酸配列(RNAまた
はDNAの修飾体)も本発明でいうアンチセンスDNA
に含まれる。これらのアンチセンスDNAは、公知のD
NA合成装置などを用いて製造することができる。
The antisense oligonucleotide (DNA) having a nucleotide sequence substantially complementary to the DNA encoding the polypeptide (protein) of the present invention or a partial polypeptide thereof may be any antisense DNA as long as it has a base sequence substantially complementary to the base sequence of that DNA and is capable of suppressing the expression of that DNA. A substantially complementary base sequence is, for example, a base sequence having a homology of preferably about 90% or more, more preferably about 95% or more, and most preferably 100% with the whole or a part of the base sequence complementary to the DNA of the present invention. Nucleic acid sequences (RNA or modified DNA) having the same action as these antisense DNAs are also included in the antisense DNA referred to in the present invention. These antisense DNAs can be produced using a known DNA synthesizer or the like.
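As an illustration of the complementarity thresholds named above (about 90%, about 95%, 100%), the following minimal sketch compares a candidate antisense sequence with the exact reverse complement of its target. The function names and the example target sequence are hypothetical, and the ungapped, position-by-position comparison of equal-length sequences is a simplifying assumption; the patent does not state how homology is to be computed.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def percent_complementarity(target: str, antisense: str) -> float:
    """Percentage of positions at which `antisense` matches the exact
    reverse complement of `target` (ungapped, equal lengths assumed)."""
    perfect = reverse_complement(target)
    candidate = antisense.upper()
    if len(perfect) != len(candidate):
        raise ValueError("sequences must have the same length")
    matches = sum(a == b for a, b in zip(perfect, candidate))
    return 100.0 * matches / len(perfect)


def is_substantially_complementary(
    target: str, antisense: str, threshold: float = 90.0
) -> bool:
    """Apply the 'about 90% or more' criterion from the text."""
    return percent_complementarity(target, antisense) >= threshold


# Hypothetical 20-base target region, for illustration only.
target_region = "ATGGCCAAGCTTGGATCCAA"
perfect_antisense = reverse_complement(target_region)
```

Raising `threshold` to 95.0 or 100.0 applies the stricter preferences given in the same paragraph.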

【0031】更に、本発明ポリペプチド(蛋白質)等
は、これら物質と特異的に相互作用する化合物をスクリ
ーニングする為の試薬として有用である。すなわち、本
発明は、本発明ポリペプチド(蛋白質)、その部分ペプ
チド若しくはそれらの塩、又はそれらに対する抗体を用
いることを特徴とする、該物質又はそれらの塩と特異的
に相互作用する化合物のスクリーニング方法、及びその
為のスクリーニング用キットを提供する。本発明のスク
リーニング方法またはスクリーニング用キットを用いて
同定される化合物またはその塩は、上記した試験化合物
から選ばれた化合物であり、本発明ポリペプチド(蛋白
質)等と相互作用し、その生物学的活性を調節、阻害、
促進、又は拮抗等する化合物である。該化合物またはそ
の塩は、本発明の蛋白質等の活性に直接作用するもので
あってもよいし、本発明ポリペプチド(蛋白質)等の発
現に作用することによって間接的に本発明ポリペプチド
(蛋白質)等の活性に作用するものであってもよい。該
化合物の塩としては、例えば、薬学的に許容可能な塩な
どが用いられる。例えば、無機塩基との塩、有機塩基と
の塩、無機酸との塩、有機酸との塩、塩基性または酸性
アミノ酸との塩などがあげられる。本発明ポリペプチド
(蛋白質)等の生物学的活性を阻害する化合物も上記各
種疾病に対する治療・予防剤などの医薬として使用でき
る可能性がある。
Further, the polypeptide (protein) of the present invention and the like are useful as reagents for screening compounds that specifically interact with these substances. That is, the present invention provides a method for screening a compound that specifically interacts with the polypeptide (protein) of the present invention, a partial peptide thereof or a salt thereof, characterized by using said polypeptide (protein), partial peptide or salt, or an antibody against them, and a screening kit therefor. The compound or salt thereof identified using the screening method or screening kit of the present invention is a compound selected from the test compounds described above that interacts with the polypeptide (protein) of the present invention or the like and regulates, inhibits, promotes or antagonizes its biological activity. The compound or its salt may act directly on the activity of the protein of the present invention or the like, or may act on that activity indirectly by acting on the expression of the polypeptide (protein) of the present invention or the like. As the salt of the compound, for example, a pharmaceutically acceptable salt is used. Examples include salts with inorganic bases, salts with organic bases, salts with inorganic acids, salts with organic acids, and salts with basic or acidic amino acids. A compound that inhibits the biological activity of the polypeptide (protein) of the present invention or the like may also be usable as a medicine, such as a therapeutic or prophylactic agent for the various diseases mentioned above.

【0032】本発明DNA及び該DNAを含む遺伝子を
プローブとして使用することにより、本発明ポリペプチ
ド又はその部分ペプチドをコードするDNAまたはmR
NAの異常(遺伝子異常)を検出することができるの
で、例えば、該DNAまたはmRNAの損傷、突然変異
あるいは発現低下や、該DNAまたはmRNAの増加あ
るいは発現過多などの遺伝子診断剤として有用である。
本発明のDNAを用いる上記の遺伝子診断は、例えば、
公知のノーザンハイブリダイゼーションやPCR−SS
CP法(Genomics,第5巻,874〜879頁(198
9年)、Proceedings of the National Academy of Sci
ences of the United States of America,第86巻,
2766〜2770頁(1989年))などにより実施
することができる。更に、本発明DNA又は遺伝子に異
常があったり、欠損している場合あるいは発現量が減少
している場合、生体内において正常な機能を発揮できな
い患者に対しては、公知手段に従って(1)レトロウイ
ルスベクター、アデノウイルスベクター、アデノウイル
スアソシエーテッドウイルスベクターなどの適当なベク
ターをベヒクルとして使用する遺伝子治療によって、本
発明DNA又は遺伝子を該患者体内に導入し、発現させ
るか、又は(2)本発明の蛋白質等を該患者に注入する
こと等によって、該患者において本発明の蛋白質等の機
能を発揮させることができるものと考えられる。本発明
DNA又は遺伝子を、該DNAを単独、又は、摂取促進
のための補助剤とともに、遺伝子銃やハイドロゲルカテ
ーテルのようなカテーテルによって投与することも可能
である。
By using the DNA of the present invention, or a gene containing the DNA, as a probe, an abnormality (gene abnormality) in the DNA or mRNA encoding the polypeptide of the present invention or a partial peptide thereof can be detected; it is therefore useful as a gene diagnostic agent for, for example, damage to, mutation of or decreased expression of the DNA or mRNA, or an increase in or overexpression of the DNA or mRNA. The above gene diagnosis using the DNA of the present invention can be carried out by, for example, known Northern hybridization or the PCR-SSCP method (Genomics, Vol. 5, pp. 874-879 (1989); Proceedings of the National Academy of Sciences of the United States of America, Vol. 86, pp. 2766-2770 (1989)). Further, for patients in whom the DNA or gene of the present invention is abnormal or deficient, or its expression level is reduced, so that normal function cannot be exerted in vivo, it is considered that the function of the protein of the present invention or the like can be exerted in the patient by, according to known means, (1) introducing the DNA or gene of the present invention into the patient's body and expressing it by gene therapy using an appropriate vector such as a retrovirus vector, adenovirus vector or adeno-associated virus vector as a vehicle, or (2) injecting the protein of the present invention or the like into the patient. The DNA or gene of the present invention can also be administered, alone or together with an auxiliary agent for promoting uptake, by a gene gun or by a catheter such as a hydrogel catheter.

【0033】本明細書および表面において、塩基やアミ
ノ酸などを略号で表示する場合、IUPAC−IUB C
ommision on Biochemical Nomenclature による略号あ
るいは当該分野における慣用略号に基づくものであり、
またアミノ酸に関し光学異性体があり得る場合は、特に
明示しなければL体を示すものとする。
In the present specification and drawings, when bases, amino acids and the like are indicated by abbreviations, these are based on the abbreviations of the IUPAC-IUB Commission on Biochemical Nomenclature or on abbreviations commonly used in this field. When an amino acid can have optical isomers, the L form is meant unless otherwise specified.

【0034】[0034]

【実施例】以下に、実施例により本発明をさらに具体的
に説明するが、本発明はそれに限定されるものではな
い。なお、実施例における各種遺伝子操作は、上記のCu
rrent protocols in molecular biology(edited by Fr
ederick M. Ausubel et al.,1987)に記載されている方
法に従った。
[Examples] The present invention will be described in more detail below with reference to examples, but the present invention is not limited thereto. The various genetic manipulations in the examples followed the methods described in Current Protocols in Molecular Biology (edited by Frederick M. Ausubel et al., 1987) mentioned above.

【0035】(1)ヒト成人全脳、ヒト扁桃、ヒト成人
海馬及びヒト胎児全脳由来cDNAライブラリーの構築 NotI部位を有するオリゴヌクレオチド(GACTA
GTTCTAGATCGCGAGCGGCCGCCC
(T)15)(インビトロジェン)をプライマーとし
て、ヒト成人全脳、ヒト扁桃、ヒト成人海馬及びヒト胎
児全脳由来mRNA(クローンテック社製)を鋳型にSu
perScriptII逆転写酵素キット(インビトロジェン社
製)で2本鎖cDNAを合成した。SalI部位を有す
るアダプター(インビトロジェン社製)をcDNAとラ
イゲーションした。その後、NotI消化し、1%濃度
の低融解アガロース電気泳動により、3kb以上のDN
A断片を精製した。精製cDNA断片を、SalI−N
otI制限酵素処理したpBluescript IISK+ プラスミ
ドとライゲーションした。大腸菌 ElectroMax DH10B 株
(インビトロジェン)にエレクトロポレーション法によ
りこの組換えプラスミドを導入した。
(1) Construction of cDNA libraries derived from human adult whole brain, human tonsil, human adult hippocampus and human fetal whole brain
Using an oligonucleotide having a NotI site (GACTAGTTCTAGATCGCGAGCGGCCGCCC(T)15) (Invitrogen) as a primer and mRNA derived from human adult whole brain, human tonsil, human adult hippocampus and human fetal whole brain (manufactured by Clontech) as templates, double-stranded cDNA was synthesized with a SuperScript II reverse transcriptase kit (manufactured by Invitrogen). An adapter having a SalI site (manufactured by Invitrogen) was ligated to the cDNA. The cDNA was then digested with NotI, and DNA fragments of 3 kb or more were purified by electrophoresis on 1% low-melting agarose. The purified cDNA fragments were ligated with pBluescript II SK+ plasmid that had been treated with the restriction enzymes SalI and NotI. The recombinant plasmids were introduced into Escherichia coli ElectroMax DH10B strain (Invitrogen) by electroporation.
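As a quick consistency check on the primer used for first-strand synthesis, the following minimal sketch assembles the oligonucleotide as written in the Japanese text of paragraph [0035] and locates the NotI recognition sequence (GCGGCCGC) in it. The helper names `build_primer` and `find_site` are hypothetical and serve only this illustration.

```python
from typing import List

NOTI_SITE = "GCGGCCGC"  # NotI recognition sequence


def build_primer(adapter: str, poly_t_length: int) -> str:
    """Join the adapter part of the primer with its oligo(dT) tail."""
    return adapter.upper() + "T" * poly_t_length


def find_site(sequence: str, site: str) -> List[int]:
    """Return every 0-based start position of `site` in `sequence`."""
    positions = []
    start = sequence.find(site)
    while start != -1:
        positions.append(start)
        start = sequence.find(site, start + 1)
    return positions


# Adapter sequence and (T)15 tail as given in paragraph [0035].
primer = build_primer("GACTAGTTCTAGATCGCGAGCGGCCGCCC", 15)
noti_positions = find_site(primer, NOTI_SITE)
```

Under these assumptions the primer is 44 bases long and contains a single NotI site immediately upstream of the oligo(dT) tail, which is what the later NotI digestion relies on.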

【0036】(2)スクリーニング(その1) 次いで、こうして構築したcDNAライブラリーからラ
ンダムにクローンをピックアップし、メンブランにスポ
ッティングした。次に、これまでに本発明者等によって
既に全長の解析が行われている約1,300個のクローンの
塩基配列に基づき作成したオリゴDNA(各21塩基)
の混合物の各3’末端をターミナルトランスフェラーゼ
でDIGラベルし、これらをプローブとして使用してド
ットハイブリダイゼーション(Current protocols in m
olecular biology(edited by Frederick M. Ausubel e
t al., 1987))により、重複クローン(繰り返し出て
くるクローン)を除いた。次に、インビトロでの転写翻
訳(プロメガ社TNT T7 Quick Coupled Transcription/T
ranslation System cat.no.L1107)を行い、50kDa
以上の産物が認められるクローンを選択した。次に、選
択したクローンの末端塩基配列を決定し、得られた配列
をクエリーとして相同検索プログラムBLASTN 2.2.1 (Al
tschul, Stephen F., Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Mille
r, andDavid J. Lipman (1997), "Gapped BLAST and PS
I-BLAST: a new generation ofprotein database searc
h programs", Nucleic Acids Res. 25:3389-3402)を用
いて、nr(All GenBank+EMBL+DDBJ+PDB sequences (but
no EST, STS,GSS, orphase 0,1 or 2 HTGS sequences))
データベースに対して相同検索を行った。その結果、相
同遺伝子が存在しなかったもの、即ち、新規遺伝子であ
るものについて全塩基配列を決定した。 スクリーニング(その2) cDNAの5'および3'の末端配列を、相同検索プログラムBL
ASTN2.2.1を用いて、ヒトのゲノム配列(ftp://ncbi.nl
m.nih.gov/genomes/H_sapiens/)に対応させた。次に、
それらが挟むゲノム領域から、Genscanプログラム(Burg
e, C. and Karlin, S. 1997, Prediction of complete
gene structures in human genomic DNA,J Mol. Biol.,
268, 78-94 、ゲノムから遺伝子を予測するコンピュー
ターソフト)を用いて、コードされる遺伝子を抜き出し
た。これをクエリーとして、相同検索プログラムBLASTN
2.1.3を用いて、mergedb(かずさDNA研究所で決定した
ヒトのcDNAの配列とGenBankのhomo sapiensデータベー
スからESTとゲノムを除いたものを重複なく混ぜ合わせ
た、かずさDNA研究所で独自に作成したDNA配列データベ
ース)に対応させ、新規の長鎖(Genscan予想cdsが1200
bp以上)遺伝子が確認された場合には、5'および3'の末
端配列決定をおこなったcDNAの全長解析をおこなった。
配列決定には、PEアプライドバイオシステム社製のD
NAシークエンサー(ABI PRISM377)と同社製反応キッ
トを使用した。大部分の配列はショットガンクローンを
ダイターミネーター法を用いて決定した。一部の塩基配
列については、決定した塩基配列を元にしてオリゴヌク
レオチドを合成し、プライマーウォーキング法で決定し
た。
(2) Screening (1)
Next, clones were picked at random from the cDNA library thus constructed and spotted on a membrane. Then, the 3' end of each oligo DNA (21 bases each) in a mixture prepared based on the base sequences of about 1,300 clones whose full-length analysis had already been completed by the present inventors was DIG-labeled with terminal transferase, and these were used as probes in dot hybridization (Current Protocols in Molecular Biology (edited by Frederick M. Ausubel et al., 1987)) to remove duplicate clones (clones that appear repeatedly). Next, in vitro transcription/translation (Promega TNT T7 Quick Coupled Transcription/Translation System, cat. no. L1107) was carried out, and clones for which a product of 50 kDa or more was observed were selected. Next, the terminal nucleotide sequences of the selected clones were determined, and with the obtained sequences as queries, a homology search was performed against the nr database (All GenBank+EMBL+DDBJ+PDB sequences (but no EST, STS, GSS, or phase 0, 1 or 2 HTGS sequences)) using the homology search program BLASTN 2.2.1 (Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402). The entire nucleotide sequence was then determined for those clones for which no homologous gene existed, that is, those that were novel genes.
Screening (2)
The 5' and 3' terminal sequences of the cDNAs were mapped to the human genome sequence (ftp://ncbi.nlm.nih.gov/genomes/H_sapiens/) using the homology search program BLASTN 2.2.1. Next, from the genomic region flanked by them, the encoded gene was extracted using the Genscan program (Burge, C. and Karlin, S. 1997, Prediction of complete gene structures in human genomic DNA, J. Mol. Biol., 268, 78-94; computer software that predicts genes from the genome). Using this as a query, it was compared, with the homology search program BLASTN 2.1.3, against mergedb (a DNA sequence database created independently at the Kazusa DNA Research Institute by merging, without duplication, the human cDNA sequences determined at the Kazusa DNA Research Institute and the GenBank Homo sapiens database excluding ESTs and genomic sequences). When a novel long gene (Genscan-predicted CDS of 1200 bp or more) was confirmed, full-length analysis was carried out on the cDNA whose 5' and 3' terminal sequences had been determined.
For sequencing, a DNA sequencer (ABI PRISM 377) manufactured by PE Applied Biosystems and a reaction kit from the same company were used. Most of the sequences were determined from shotgun clones by the dye terminator method. Some of the base sequences were determined by the primer walking method, using oligonucleotides synthesized based on the already determined base sequences.
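The selection rule of Screening (2), a Genscan-predicted CDS of 1200 bp or more with no match among known sequences, can be sketched as below. The function names are hypothetical, and the mass estimate uses a rule-of-thumb average of 110 Da per amino acid residue, which is an assumption added here for orientation and is not stated in the patent.

```python
MIN_CDS_BP = 1200        # threshold from the text (Screening 2)
AVG_RESIDUE_DA = 110.0   # rule-of-thumb assumption, not from the patent


def approx_mass_kda(cds_bp: int) -> float:
    """Rough protein mass for a CDS of `cds_bp` bases including the stop codon."""
    residues = max(cds_bp // 3 - 1, 0)  # drop the stop codon
    return residues * AVG_RESIDUE_DA / 1000.0


def is_long_novel_candidate(cds_bp: int, has_database_hit: bool) -> bool:
    """True when the predicted CDS is long enough and nothing homologous was found."""
    return cds_bp >= MIN_CDS_BP and not has_database_hit
```

With this rough estimate a 1200 bp CDS corresponds to a protein of about 44 kDa, so the length threshold of Screening (2) is of the same order as the 50 kDa product threshold used in Screening (1).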

【0037】このようにして新規DNA又は遺伝子のス
クリーニングを行なった。その結果、配列表の配列番号
1乃至21(但し、11は除く)のいずれか一つに示さ
れた新規DNA又は遺伝子が検出された。これらの新規
DNA又は遺伝子について、上記の配列決定方法により
その塩基配列を決定した。本発明DNA又は遺伝子を有
するクローンの名称は表1及び表2に示されている。
Novel DNAs or genes were screened in this way. As a result, the novel DNAs or genes shown in any one of SEQ ID NOs: 1 to 21 (excluding 11) in the sequence listing were detected. The nucleotide sequences of these novel DNAs or genes were determined by the sequencing method described above. The names of the clones having the DNA or gene of the present invention are shown in Tables 1 and 2.

【0038】(3)本発明DNAの相同性検索 次に、こうして得られた全塩基配列に基づき、クローン
のアミノ酸配列を既知配列ライブラリーnrに対して解析
プログラムBLASTP 2.2.1 (Altschul, StephenF., Thoma
s L. Madden, Alejandro A. Schaffer, Jinghui Zhang,
Zheng Zhang,Webb Miller, and David J. Lipman (199
7), "Gapped BLAST and PSI-BLAST: anew generation o
f protein database search programs", Nucleic Acid
s Res. 25:3389-3402)を用いて検索したところ表3及び
表4に示した各相同遺伝子と相同性を示すことが明らか
になった。尚、表3及び表4には、これら相同遺伝子に
関する情報、即ち、その名称、データベースID、生物
種、蛋白質長、及び記載文献が挙げられている。又、こ
れら各表中の「生物種」の略号の意味は表5で説明され
ている。
(3) Homology search of the DNA of the present invention
Next, based on the entire nucleotide sequences thus obtained, the amino acid sequences of the clones were searched against the known sequence library nr using the analysis program BLASTP 2.2.1 (Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402), which revealed homology with each of the homologous genes shown in Tables 3 and 4. Tables 3 and 4 list information on these homologous genes, namely their names, database IDs, species, protein lengths, and references. The meanings of the abbreviations under "species" in these tables are explained in Table 5.

【0039】更に、各クローンに含まれる本発明DNA
又は遺伝子と表3及び表4表に示した各相同遺伝子との
相同性に関する各種データを表6にまとめた。これら表
中の各項目の意味は以下の通りである。 「相同領域 クローン」クローンの相同領域の起点及び
終点 「相同領域 相同遺伝子」相同遺伝子の相同領域の起点
及び終点 「Score」この値が高いほど信頼度が高い 「E-value」この値が0に近いほど信頼度が高い 「相同性」相同領域のアミノ酸残基の一致の割合 「相同範囲率」相同遺伝子中の相同領域の割合
Further, various data on the homology between the DNA or gene of the present invention contained in each clone and each of the homologous genes shown in Tables 3 and 4 are summarized in Table 6. The meaning of each item in these tables is as follows.
"Homologous region, clone": start and end points of the homologous region in the clone
"Homologous region, homologous gene": start and end points of the homologous region in the homologous gene
"Score": the higher this value, the higher the reliability
"E-value": the closer this value is to 0, the higher the reliability
"Homology": proportion of identical amino acid residues in the homologous region
"Homologous range rate": proportion of the homologous gene that is covered by the homologous region
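The two ratio columns of Table 6 can be illustrated with the following sketch. It assumes a BLAST-style definition (identical residues divided by alignment length for "homology", and aligned span divided by full protein length for "homologous range rate"); the patent gives only the verbal definitions, so the exact formulas, the function names and the example values are assumptions.

```python
def homology_percent(aligned_clone: str, aligned_homolog: str) -> float:
    """Percentage of alignment columns with identical residues.

    Both arguments are rows of one pairwise alignment, with '-' for gaps.
    """
    if len(aligned_clone) != len(aligned_homolog):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_clone:
        raise ValueError("alignment is empty")
    identical = sum(
        1 for a, b in zip(aligned_clone, aligned_homolog) if a == b and a != "-"
    )
    return 100.0 * identical / len(aligned_clone)


def homologous_range_rate(start: int, end: int, protein_length: int) -> float:
    """Percentage of the homologous gene product covered by the homologous
    region, using 1-based inclusive start and end positions."""
    if not 1 <= start <= end <= protein_length:
        raise ValueError("region must lie within the protein")
    return 100.0 * (end - start + 1) / protein_length


# Hypothetical alignment and coordinates, for illustration only.
example_homology = homology_percent("MKT-LLV", "MKTALIV")
example_range = homologous_range_rate(11, 110, 400)
```

A high homology over a small homologous range rate indicates a shared local region rather than similarity along the whole protein, which is why the two values are reported side by side.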

【0040】(4)各種ドメインの検索 次に、クローンに含まれるDNAがコードするアミノ酸
配列中から、Pfam 6.6に含まれる検索ツールPfam HMM v
er 2.1 Search (HMMPFAM) (Sonnhammer ELL, Eddy SR,
Birney E, Bateman A, Durbin R (1998) Pfam: multipl
e sequence alignments and HMM-profiles of protein
domains, Nucleic Acids Research 26:320-322)を用い
て機能ドメインを検索した。更に、膜蛋白予測プログラ
ムであるSOSUI system (ver. 1.0 / 10, Mar., 1996)
(Takatsugu Hirokawa, Seah Boon-Chieng and Shigeki
Mitaku, SOSUI: Classification and Secondary Struct
ure Prediction System for Membrane Proteins, Bioin
formatics (formerly CABIOS) 1998 May;14(4):378-37
9.) を用いて膜貫通ドメインを検索した。これらの検出
された機能ドメイン及び膜貫通ドメインを表7〜表13
にそれぞれのクローンについて示した。これら表中の各
項目の意味は以下の通りである。 「機能ドメイン」Pfam SOSUIにより検出されたドメイン 「クローン from」クローン機能ドメインの起点 「クローン to」クローン機能ドメインの終点 「相同遺伝子 from」相同遺伝子機能ドメインの起点 「相同遺伝子 to」相同遺伝子機能ドメインの終点 「Score(Pfamのみ)」この値が高いほど信頼度が高い 「Exp(Pfamのみ)」この値が0に近いほど信頼度が高い 又、各機能ドメインの完全標記を表14に示した。
(4) Search for various domains
Next, functional domains were searched for in the amino acid sequences encoded by the DNA contained in the clones, using the search tool Pfam HMM ver 2.1 Search (HMMPFAM) included in Pfam 6.6 (Sonnhammer ELL, Eddy SR, Birney E, Bateman A, Durbin R (1998) Pfam: multiple sequence alignments and HMM-profiles of protein domains, Nucleic Acids Research 26:320-322). In addition, transmembrane domains were searched for using the membrane protein prediction program SOSUI system (ver. 1.0 / 10, Mar., 1996) (Takatsugu Hirokawa, Seah Boon-Chieng and Shigeki Mitaku, SOSUI: Classification and Secondary Structure Prediction System for Membrane Proteins, Bioinformatics (formerly CABIOS) 1998 May;14(4):378-379). The functional domains and transmembrane domains detected are shown for each clone in Tables 7 to 13. The meaning of each item in these tables is as follows.
"Functional domain": domain detected by Pfam or SOSUI
"Clone from": start point of the functional domain in the clone
"Clone to": end point of the functional domain in the clone
"Homologous gene from": start point of the functional domain in the homologous gene
"Homologous gene to": end point of the functional domain in the homologous gene
"Score (Pfam only)": the higher this value, the higher the reliability
"Exp (Pfam only)": the closer this value is to 0, the higher the reliability
The full names of the functional domains are shown in Table 14.

【0041】(5)発現部位 RT-PCR Coupled ELISAを用いて、組織と脳の部位での発
現を、それぞれで一番強い発現を示したものを表15に
示した。尚、組織及び脳の部位の完全標記を表16に示
した。 (6)染色体位置 クローンのDNA配列を、相同検索プログラムBLASTN 2.2.
1を用いてヒトゲノムをコードするクローンのライブラ
リー(ftp://ncbi.nlm.nih.gov/genomes/H_sapiens/)に
対応させた。対応したクローンの説明(Definition)の中
からこのクローンが由来した染色体の番号を抽出し、こ
れを表15に示した。
(5) Expression site Expression in tissues and in brain regions was examined by RT-PCR Coupled ELISA, and the tissue and the brain region showing the strongest expression for each clone are shown in Table 15. The full names of the tissues and brain regions are shown in Table 16. (6) Chromosomal location The DNA sequence of each clone was matched, using the homology search program BLASTN 2.2.1, against the library of clones encoding the human genome (ftp://ncbi.nlm.nih.gov/genomes/H_sapiens/). The number of the chromosome from which the matched clone was derived was extracted from the description (Definition) of that clone and is shown in Table 15.
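The extraction of the chromosome number from a clone's Definition text can be sketched as a simple pattern match. This is an illustration only: it assumes the Definition contains the word "chromosome" followed by a number or X/Y, and the example Definition strings and the function name `chromosome_from_definition` are hypothetical.

```python
import re

# Assumed pattern: "chromosome" followed by 1-2 digits or X/Y.
_CHROMOSOME = re.compile(r"\bchromosome\s+([0-9]{1,2}|[XY])\b", re.IGNORECASE)


def chromosome_from_definition(definition):
    """Return the chromosome label found in a Definition line, or None."""
    match = _CHROMOSOME.search(definition)
    return match.group(1).upper() if match else None


# Hypothetical Definition lines, not records from the actual library.
examples = [
    "Homo sapiens chromosome 7 clone EXAMPLE-1, complete sequence",
    "Homo sapiens chromosome X clone EXAMPLE-2",
    "Homo sapiens clone EXAMPLE-3, working draft",
]
results = [chromosome_from_definition(d) for d in examples]
```

A Definition without a chromosome assignment yields `None`, so such clones can be left blank rather than assigned a guessed value.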

【0042】以上の、相同性、相同性遺伝子に関する情
報、各種ドメイン、発現部位、及び染色体位置、等に基
づき、当業者であれば、本発明のDNA又は遺伝子が表
1及び表2に示した各機能を有するものと予測すること
が出来る。
Based on the above information on homology, homologous genes, the various domains, expression sites, chromosomal locations, and the like, a person skilled in the art can predict that the DNA or gene of the present invention has the respective functions shown in Tables 1 and 2.

【0043】本発明で得られた新規なDNA又は遺伝子
を所謂DNAチップ等に集積させ、これに、例えば、精
神病等の脳が関与する疾患の患者と対照としての正常人
の血液又は組織等から作成したプローブをハイブリダイ
ゼーションさせることによって、これら疾患の診断、治
療等に役立てることが出来る。又、本発明のDNA若し
くは遺伝子又はそれらの一部の塩基配列に基づき作成し
た合成DNAプライマーを使用し、ヒトの血液又は組織
から抽出した染色体DNAを用いてPCRを行い、その
産物の塩基配列を決定することにより、本発明のDNA
又は遺伝子中にある個体によって異なる一塩基の変異、
即ち、cSNPsを見出すことが出来る。これにより、
個体の体質等が予測され、各自に適した医薬の開発等が
可能となる。又、クロスハイブリダイゼーションによ
り、マウス等のモデル生物における本発明のDNA又は
遺伝子に対するオルソログ(ホモログ、カウンターパー
ト)遺伝子を単離し、例えば、これら遺伝子をノックア
ウトすることによってヒトの疾患モデル動物を作成し、
ヒトの病因となる遺伝子を探索・同定することも可能で
ある。更に、本発明ポリペプチド、その部分ポリペプチ
ド若しくは該ポリペプチドを含む組換え蛋白質、又は、
本発明のDNA又は遺伝子に対する抗体を網羅的に作成
し、それらを集積させて所謂PROTEINチップを作
成し、患者と正常人との蛋白質発現量の差異を検出する
等のプロテオーム解析から、病気の診断・治療等に役立
てることが出来る。
The novel DNAs or genes obtained by the present invention can be arrayed on a so-called DNA chip or the like, and probes prepared from the blood, tissue, or the like of, for example, patients with diseases involving the brain, such as psychosis, and of normal individuals as controls can be hybridized to it; this can be useful for the diagnosis, treatment, and so on of these diseases. Further, by performing PCR on chromosomal DNA extracted from human blood or tissue using synthetic DNA primers prepared on the basis of the nucleotide sequence of the DNA or gene of the present invention, or of a part thereof, and determining the nucleotide sequence of the product, single-nucleotide variations in the DNA or gene of the present invention that differ between individuals, that is, cSNPs, can be found. This makes it possible to predict the constitution and the like of an individual and to develop medicines suited to each individual. Further, by cross-hybridization, orthologous (homolog, counterpart) genes of the DNA or gene of the present invention can be isolated from model organisms such as the mouse; by, for example, knocking out these genes, animal models of human disease can be created, and genes that cause human disease can be searched for and identified. Furthermore, antibodies against the polypeptide of the present invention, a partial polypeptide thereof, or a recombinant protein containing the polypeptide, or against the DNA or gene of the present invention, can be produced comprehensively and arrayed to produce a so-called PROTEIN chip, and proteome analysis, such as detecting differences in protein expression levels between patients and normal individuals, can be useful for the diagnosis, treatment, and so on of diseases.
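Once the PCR product has been sequenced, finding cSNPs amounts to comparing the individual's coding sequence with the reference base by base. The sketch below assumes the two sequences are already aligned, of equal length, and in frame; the PCR and sequencing steps themselves are not modelled. The function name `find_csnps` and the sample sequence are hypothetical; the reference is the first five codons of the CDS of SEQ ID NO: 1 (gga aga aga ttt atc).

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def find_csnps(reference_cds, sample_cds):
    """List single-base differences between two aligned, in-frame CDSs.

    Each entry is (1-based position, reference base, sample base,
    reference amino acid, sample amino acid).
    """
    ref, alt = reference_cds.lower(), sample_cds.lower()
    if len(ref) != len(alt) or len(ref) % 3:
        raise ValueError("sequences must be equal length and a multiple of 3")
    variants = []
    for i, (r, s) in enumerate(zip(ref, alt)):
        if r != s:
            start = i - i % 3  # first base of the codon containing position i
            variants.append((i + 1, r, s,
                             CODON_TABLE[ref[start:start + 3]],
                             CODON_TABLE[alt[start:start + 3]]))
    return variants


reference = "ggaagaagatttatc"  # Gly Arg Arg Phe Ile
sample = "ggaaggagacttatc"     # hypothetical individual sequence
variants = find_csnps(reference, sample)
```

Reporting the amino acid on both sides distinguishes silent changes from those that alter the encoded protein, which is the distinction relevant to predicting differences between individuals.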

【0044】[0044]

【表1】 [Table 1]

【0045】[0045]

【表2】 [Table 2]

【0046】[0046]

【表3】 [Table 3]

【0047】[0047]

【表4】 [Table 4]

【0048】[0048]

【表5】 [Table 5]

【0049】[0049]

【表6】 [Table 6]

【0050】[0050]

【表7】 [Table 7]

【0051】[0051]

【表8】 [Table 8]

【0052】[0052]

【表9】 [Table 9]

【0053】[0053]

【表10】 [Table 10]

【0054】[0054]

【表11】 [Table 11]

【0055】[0055]

【表12】 [Table 12]

【0056】[0056]

【表13】 [Table 13]

【0057】[0057]

【表14】 [Table 14]

【0058】[0058]

【表15】 [Table 15]

【0059】[0059]

【表16】 [Table 16]

【0060】[0060]

【配列表】 SEQUENCE LISTING <110> KAZUSA DNA Research Institute <120> Novel Genes and Proteins Encoded by the Genes <130> AB01037 <160> 21 <210> 1 <211> 5782 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (580)..(3825) <400> 1 cagctcccaa ggctgagatt actctgcttc atctggatcg cccatctctg gggtctcatg 60 gctgagtttc agttccccaa tcctacctgc tcctcagggg gccagcactg gggctgcagg 120 taggccacct gttgagacct ggtgaaagat caggtataat aatgttctgc agtgaaaaga 180 aattgcgtga agtggaacgg atagtgaaag ccaatgaccg tgaatataat gaaaagttcc 240 agtatgcgga taatcgtatc cacacatcga aatataatat tctcaccttc ttgccaatta 300 atttatttga acagttccaa agagtggcaa atgcctattt tctttgcctt ctgattttac 360 agctaattcc agaaatttcc tccttgacct ggtttaccac cattgtgcct ttggtcctgg 420 tgataactat gacagctgtc aaagatgcca cagatgacta ttttcgccac aagagtgata 480 atcaagtgaa taatcggcag tctgaagtgc tcatcaacag caagtgctga aggtacaaag 540 ctgagggtct aatgacagga attactctgg taattgtaa gga aga aga ttt atc 594 Gly Arg Arg Phe Ile 1 5 ctg gtc ctg aga aaa ata ctg cag aat gaa aaa tgg atg aat gtc aaa 642 Leu Val Leu Arg Lys Ile Leu Gln Asn Glu Lys Trp Met Asn Val Lys 10 15 20 gtg gga gac atc att aaa tta gaa aat aac caa ttt gtt gct gct gat 690 Val Gly Asp Ile Ile Lys Leu Glu Asn Asn Gln Phe Val Ala Ala Asp 25 30 35 tta ctt ctc cta tca agt agt gag cca cat ggt ctc tgt tat gtt gaa 738 Leu Leu Leu Leu Ser Ser Ser Glu Pro His Gly Leu Cys Tyr Val Glu 40 45 50 act gct gag ctt gat ggg gaa acg aac cta aaa gtc cgc cat gca cta 786 Thr Ala Glu Leu Asp Gly Glu Thr Asn Leu Lys Val Arg His Ala Leu 55 60 65 tca gtt act tca gaa ctt gga gca gat atc agc aga ctt gca ggg ttt 834 Ser Val Thr Ser Glu Leu Gly Ala Asp Ile Ser Arg Leu Ala Gly Phe 70 75 80 85 gat ggg att gtt gtc tgt gag gtg cct aac aac aag tta gat aaa ttc 882 Asp Gly Ile Val Val Cys Glu Val Pro Asn Asn Lys Leu Asp Lys Phe 90 95 100 atg gga atc ctt tct tgg aaa gac agc aag cat tcc ctc aac aat gag 930 Met Gly Ile Leu Ser Trp Lys Asp Ser Lys His Ser Leu Asn Asn Glu 105 110 115 aag ata atc ctg aga ggc tgc atc ctg aga aat acc agc tgg tgt 
ttt 978 Lys Ile Ile Leu Arg Gly Cys Ile Leu Arg Asn Thr Ser Trp Cys Phe 120 125 130 gga atg gtt att ttt gca ggt cct gac act aaa cta atg cag aat agt 1026 Gly Met Val Ile Phe Ala Gly Pro Asp Thr Lys Leu Met Gln Asn Ser 135 140 145 ggt aag aca aag ttt aaa agg aca agc att gat aga ttg atg aat act 1074 Gly Lys Thr Lys Phe Lys Arg Thr Ser Ile Asp Arg Leu Met Asn Thr 150 155 160 165 cta gta cta tgg att ttt ggg ttt ctg ata tgc ttg gga att att ctt 1122 Leu Val Leu Trp Ile Phe Gly Phe Leu Ile Cys Leu Gly Ile Ile Leu 170 175 180 gca ata gga aat tca atc tgg gag agt caa act ggg gac caa ttc aga 1170 Ala Ile Gly Asn Ser Ile Trp Glu Ser Gln Thr Gly Asp Gln Phe Arg 185 190 195 act ttc ctc ttt tgg aat gaa gga gag aag agc tct gtg ttc tcc gga 1218 Thr Phe Leu Phe Trp Asn Glu Gly Glu Lys Ser Ser Val Phe Ser Gly 200 205 210 ttc tta aca ttc tgg tca tat att att att ctc aat aca gtt gta ccc 1266 Phe Leu Thr Phe Trp Ser Tyr Ile Ile Ile Leu Asn Thr Val Val Pro 215 220 225 att tcc tta tat gtg agt gtg gaa gta att cgt cta gga cac agt tat 1314 Ile Ser Leu Tyr Val Ser Val Glu Val Ile Arg Leu Gly His Ser Tyr 230 235 240 245 ttt ata aac tgg gac cgg aag atg tat tat tct cga aaa gca ata cct 1362 Phe Ile Asn Trp Asp Arg Lys Met Tyr Tyr Ser Arg Lys Ala Ile Pro 250 255 260 gca gtg gct cga acg acc acg ctc aat gag gaa ctg ggg cag att gag 1410 Ala Val Ala Arg Thr Thr Thr Leu Asn Glu Glu Leu Gly Gln Ile Glu 265 270 275 tac att ttc tcc gac aaa acg ggt acc ctc act caa aac atc atg acc 1458 Tyr Ile Phe Ser Asp Lys Thr Gly Thr Leu Thr Gln Asn Ile Met Thr 280 285 290 ttt aaa aga tgt tcc att aat ggg aga atc tat ggt gaa gta cat gat 1506 Phe Lys Arg Cys Ser Ile Asn Gly Arg Ile Tyr Gly Glu Val His Asp 295 300 305 gac ctg gat cag aag aca gaa ata act cag gaa aaa gag cct gtg gat 1554 Asp Leu Asp Gln Lys Thr Glu Ile Thr Gln Glu Lys Glu Pro Val Asp 310 315 320 325 ttc tca gtc aaa tct caa gcg gat aga gaa ttt cag ttc ttt gac cac 1602 Phe Ser Val Lys Ser Gln Ala Asp Arg Glu Phe Gln Phe Phe Asp His 330 335 340 cat ctg atg gaa 
tcc att aaa atg ggt gat ccc aaa gtt cat gaa ttc 1650 His Leu Met Glu Ser Ile Lys Met Gly Asp Pro Lys Val His Glu Phe 345 350 355 ctt agg tta ctt gct ctc tgc cac act gta atg tca gaa gag aat agc 1698 Leu Arg Leu Leu Ala Leu Cys His Thr Val Met Ser Glu Glu Asn Ser 360 365 370 gca gga gag ctg att tac caa gtt cag tca cct gat gaa ggg gct cta 1746 Ala Gly Glu Leu Ile Tyr Gln Val Gln Ser Pro Asp Glu Gly Ala Leu 375 380 385 gtg act gcc gct aga aat ttt ggg ttc att ttt aaa tcc cgg acc cca 1794 Val Thr Ala Ala Arg Asn Phe Gly Phe Ile Phe Lys Ser Arg Thr Pro 390 395 400 405 gag acc ata aca ata gaa gaa ttg gga aca cta gtt act tat caa tta 1842 Glu Thr Ile Thr Ile Glu Glu Leu Gly Thr Leu Val Thr Tyr Gln Leu 410 415 420 ctt gcc ttt ttg gat ttc aac aac acc aga aaa agg atg tct gtc ata 1890 Leu Ala Phe Leu Asp Phe Asn Asn Thr Arg Lys Arg Met Ser Val Ile 425 430 435 gtt cga aac cca gaa gga cag ata aag ctt tat tcc aaa gga gca gat 1938 Val Arg Asn Pro Glu Gly Gln Ile Lys Leu Tyr Ser Lys Gly Ala Asp 440 445 450 act att ctg ttt gaa aaa ctt cat cct tcc aat gaa gtc ctt ttg tct 1986 Thr Ile Leu Phe Glu Lys Leu His Pro Ser Asn Glu Val Leu Leu Ser 455 460 465 ttg acg tca gac cac ctc agt gaa ttt gca ggg gaa ggc ctt cgg acc 2034 Leu Thr Ser Asp His Leu Ser Glu Phe Ala Gly Glu Gly Leu Arg Thr 470 475 480 485 ttg gcc atc gca tac aga gac ctg gat gac aag tac ttt aaa gag tgg 2082 Leu Ala Ile Ala Tyr Arg Asp Leu Asp Asp Lys Tyr Phe Lys Glu Trp 490 495 500 cat aag atg ctt gaa gat gcg aat gct gcc aca gaa gag agg gat gaa 2130 His Lys Met Leu Glu Asp Ala Asn Ala Ala Thr Glu Glu Arg Asp Glu 505 510 515 cga ata gct gag cta tat gaa gaa att gaa aga gat ttg atg cta cta 2178 Arg Ile Ala Glu Leu Tyr Glu Glu Ile Glu Arg Asp Leu Met Leu Leu 520 525 530 ggt gcc act gct gta gaa gat aag tta cag gag ggt gtt att gaa aca 2226 Gly Ala Thr Ala Val Glu Asp Lys Leu Gln Glu Gly Val Ile Glu Thr 535 540 545 gtt aca agt tta tca cta gcc aat att aag atc tgg gtc cta aca gga 2274 Val Thr Ser Leu Ser Leu Ala Asn Ile Lys Ile Trp Val 
Leu Thr Gly 550 555 560 565 gac aaa caa gaa act gcc atc aac atc ggt tat gcc tgc aac atg ctg 2322 Asp Lys Gln Glu Thr Ala Ile Asn Ile Gly Tyr Ala Cys Asn Met Leu 570 575 580 act gac gac atg aat gat gtg ttt gtg ata gca ggg aat aat gct gtg 2370 Thr Asp Asp Met Asn Asp Val Phe Val Ile Ala Gly Asn Asn Ala Val 585 590 595 gaa gtg aga gaa gaa ctc agg aaa gca aaa caa aat ttg ttt gga caa 2418 Glu Val Arg Glu Glu Leu Arg Lys Ala Lys Gln Asn Leu Phe Gly Gln 600 605 610 aac aga aat ttt tcc aat ggc cat gta gtt tgt gaa aaa aag cag cag 2466 Asn Arg Asn Phe Ser Asn Gly His Val Val Cys Glu Lys Lys Gln Gln 615 620 625 ctg gag ttg gat tct att gta gaa gaa acc ata aca gga gat tat gcc 2514 Leu Glu Leu Asp Ser Ile Val Glu Glu Thr Ile Thr Gly Asp Tyr Ala 630 635 640 645 tta atc ata aat ggc cac agt ttg gct cat gcc cta gaa agt gat gtc 2562 Leu Ile Ile Asn Gly His Ser Leu Ala His Ala Leu Glu Ser Asp Val 650 655 660 aag aat gat ctc cta gaa ctt gct tgc atg tgt aag act gta att tgc 2610 Lys Asn Asp Leu Leu Glu Leu Ala Cys Met Cys Lys Thr Val Ile Cys 665 670 675 tgc agg gtc act cca ctc cag aaa gcc caa gtg gta gag ctg gtg aag 2658 Cys Arg Val Thr Pro Leu Gln Lys Ala Gln Val Val Glu Leu Val Lys 680 685 690 aag tac aga aat gct gtt act ttg gcc att ggt gat gga gcc aat gat 2706 Lys Tyr Arg Asn Ala Val Thr Leu Ala Ile Gly Asp Gly Ala Asn Asp 695 700 705 gtc agc atg att aaa agt gct cac att ggt gtt ggc atc agc ggc cag 2754 Val Ser Met Ile Lys Ser Ala His Ile Gly Val Gly Ile Ser Gly Gln 710 715 720 725 gaa gga ttg caa gca gtc tta gcc agc gac tat tca ttt gca cag ttt 2802 Glu Gly Leu Gln Ala Val Leu Ala Ser Asp Tyr Ser Phe Ala Gln Phe 730 735 740 aga tat ctc caa agg ctt ctc ctt gtt cat gga agg tgg tct tat ttc 2850 Arg Tyr Leu Gln Arg Leu Leu Leu Val His Gly Arg Trp Ser Tyr Phe 745 750 755 cga atg tgc aaa ttc tta tgc tat ttc ttc tat aag aat ttt gca ttt 2898 Arg Met Cys Lys Phe Leu Cys Tyr Phe Phe Tyr Lys Asn Phe Ala Phe 760 765 770 aca ctt gtg cat ttc tgg ttt ggt ttc ttc tgt ggt ttc tca gcc cag 2946 Thr Leu 
Val His Phe Trp Phe Gly Phe Phe Cys Gly Phe Ser Ala Gln 775 780 785 act gtt tat gac cag tgg ttc atc acc ctt ttt aac att gtt tac aca 2994 Thr Val Tyr Asp Gln Trp Phe Ile Thr Leu Phe Asn Ile Val Tyr Thr 790 795 800 805 tca ctg cct gtt tta gcc atg ggg att ttt gac cag gat gtg agt gac 3042 Ser Leu Pro Val Leu Ala Met Gly Ile Phe Asp Gln Asp Val Ser Asp 810 815 820 cag aac agc gtg gac tgt ccc cag ctc tac aaa cca gga cag ctg aat 3090 Gln Asn Ser Val Asp Cys Pro Gln Leu Tyr Lys Pro Gly Gln Leu Asn 825 830 835 ctg ctt ttt aac aag cgt aaa ttt ttc att tgc gtg ttg cat gga atc 3138 Leu Leu Phe Asn Lys Arg Lys Phe Phe Ile Cys Val Leu His Gly Ile 840 845 850 tac acc tca tta gtc ctt ttc ttc atc ccc tat ggg gcc ttt tac aac 3186 Tyr Thr Ser Leu Val Leu Phe Phe Ile Pro Tyr Gly Ala Phe Tyr Asn 855 860 865 gtg gct gga gaa gat ggg caa cat att gct gac tac cag tcc ttt gca 3234 Val Ala Gly Glu Asp Gly Gln His Ile Ala Asp Tyr Gln Ser Phe Ala 870 875 880 885 gtt acc atg gcc aca tct ttg gtc att gtg gtc agt gtg cag ata gcc 3282 Val Thr Met Ala Thr Ser Leu Val Ile Val Val Ser Val Gln Ile Ala 890 895 900 ttg gat acc agt tac tgg act ttc att aat cac gtc ttc atc tgg ggg 3330 Leu Asp Thr Ser Tyr Trp Thr Phe Ile Asn His Val Phe Ile Trp Gly 905 910 915 agc att gcc att tat ttc tcc att tta ttt aca atg cac agt aat ggc 3378 Ser Ile Ala Ile Tyr Phe Ser Ile Leu Phe Thr Met His Ser Asn Gly 920 925 930 atc ttt ggc atc ttc cca aac cag ttt cca ttt gtt ggt aat gca cga 3426 Ile Phe Gly Ile Phe Pro Asn Gln Phe Pro Phe Val Gly Asn Ala Arg 935 940 945 cat tcc ctg acc cag aag tgc atc tgg ctt gta att ctc tta aca aca 3474 His Ser Leu Thr Gln Lys Cys Ile Trp Leu Val Ile Leu Leu Thr Thr 950 955 960 965 gtg gct tca gtt atg cca gtg gtg gca ttc aga ttt ttg aag gtg gat 3522 Val Ala Ser Val Met Pro Val Val Ala Phe Arg Phe Leu Lys Val Asp 970 975 980 tta tac cca acc ctg agt gat cag atc cgc cgg tgg cag aag gct caa 3570 Leu Tyr Pro Thr Leu Ser Asp Gln Ile Arg Arg Trp Gln Lys Ala Gln 985 990 995 aag aag gca agg cct cca agt agc 
cga agg cct cgg acc cgc agg tca 3618 Lys Lys Ala Arg Pro Pro Ser Ser Arg Arg Pro Arg Thr Arg Arg Ser 1000 1005 1010 agc tca aga agg tct gga tat gct ttt gct cac caa gaa ggc tat gga 3666 Ser Ser Arg Arg Ser Gly Tyr Ala Phe Ala His Gln Glu Gly Tyr Gly 1015 1020 1025 gag ctt atc aca tct gga aaa aat atg cga gct aaa aat cca ccc cca 3714 Glu Leu Ile Thr Ser Gly Lys Asn Met Arg Ala Lys Asn Pro Pro Pro 1030 1035 1040 1045 aca tca ggg ctg gaa aag aca cat tat aat agc act agc tgg att gaa 3762 Thr Ser Gly Leu Glu Lys Thr His Tyr Asn Ser Thr Ser Trp Ile Glu 1050 1055 1060 aat tta tgt aag aaa acc aca gac acc gtg agc agc ttt agc cag gat 3810 Asn Leu Cys Lys Lys Thr Thr Asp Thr Val Ser Ser Phe Ser Gln Asp 1065 1070 1075 aaa aca gtg aaa ctg tgagtcaata tgaatttaaa ccacgtagtt atcttttcac 3865 Lys Thr Val Lys Leu 1080 ttcaggtgga gctgaaattc tgctggctcc agagtttgag atttgaggca agaggtgggg 3925 caggcagatt gcctcactta acttaaatct gcggcagaca actgccagtg cccatcaaac 3985 aggagtgtgc gctatggaaa accaggccag agggtcactg tctggtttgt gatttggtgg 4045 acaaaacact cgctgttaca agtacagatt tttttttttt taaatcaacc tagataccaa 4105 ttgacctgaa ctttagaatc ttatttatgg agaaaaactt gtaaagctgc atattcactg 4165 aatggatcct caggcggata aaagggtgca ttttaaaggt atatatccaa gctgaaaagc 4225 atgcctattg acagataaac atgtatctgt aagatcagcc tttcccaagg tatactttta 4285 aaatttaaag cgtgtactgt gttgctttca gactgagttg catgtcactc tttagtcttg 4345 atatctacct gtctgttcag ccaggacaac aaatggcttc caagcctgaa gaatacaaaa 4405 gtgtgcttgt gtttctcatt tttataccag tctagggaca aaggagactg aacatctttg 4465 cagcaggata ggctggtaat ttgatcaaat ttattcaaaa agctctcagt ctgtgtcatg 4525 taaggacatg cttatgaaat gtgagagagg ctcgccacta agtattctaa atacttttca 4585 atggcttttc taacaacctc agtagtaatt tgctgagcat catccagacc attaatagaa 4645 tcagcaaagc actggaattt cacactttaa tgataatatt ccacatagtc tatgggcaaa 4705 tattttcaac atttccaatt tttaaagctt cagaattgaa gccaaacaaa ttaataaata 4765 attgttttaa ttactattta aaaactcagg tttagattgt ttaaaattag ttgcttttga 4825 tactcagctg tcatgtttat aattcaaaca tgtagtaaac atatgtaggt 
aaggttgttt 4885 ttttggagat gttgcagctc aaatttcagt ccacatatga atcatcagtg tattttccat 4945 aaagtgattc gggcatattt gtgtgaaaac ctcagttctg tcacttctta cctctataaa 5005 cttggacgat aatgtgcctt ctctgagact cagtttcttc ctctgtaaaa tgaggacata 5065 ctacctacct catgtggttg gttgatgatt gtctgtcaaa gcacaaactc tgaaattatt 5125 aaaaacataa ttatttcata aacagatgag ttaagttcca gttaactcaa catcagtata 5185 acagagcaat tggaagagaa tatgaaaaaa ctggaatcta aatagtcagt gaggaaggct 5245 ttgataaaat gaaattgcca gaaagatata aaactggtta gggtcctaca gggaaataaa 5305 attataaccg tggaggtaca tttctctacc agaaagcaaa aataaagcat catgtcttaa 5365 tggttttcta caaatcaact tctaattcta cagagtcctt aatctggtcc ctattaaatt 5425 cttggtcaga caaagttaca tttcccaaga gagtcaggtg acacttgagt gagtttgatg 5485 gataatgagc taatgtgata tctataggtc acaatttttt aaaaccaaaa ttttcaagtc 5545 tgggataatc tttcctaaat gggatcaaat gaaataatat gtgtaaaaga gtcaaatgca 5605 gtcctttacc atagtaactg cctatggacg ttgtctttcc cttacatgcc tgcctacact 5665 taaccagatg ttggttttca atgtctaatt tgtcattagt ttcaccacat ttgctcactt 5725 tttgtaacat ttttgcaaga tttgaaaact ttcagtaaat gttttggcac tattggt 5782 <210> 2 <211> 5181 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (3292)..(4065) <400> 2 gagagcgccc tgggcgccta ctcgccagtg gactacatga gcatcaccag cttcccgcgg 60 ctgcccgagg acgagccggc gcccgcggcc ccgctgaggg gccgcaagga cgaggacgcc 120 tttctgggag accccgacac cgacccggac tccttcctga agtctgcacg gctgcagcgg 180 ctgccatcgt cgtcgtcgga gatgggcagc caagacgggt cgccgctacg cgagacgcgc 240 aaagacccgt tctccgccgc agcggccgag tgctcctgcc gccaggatgg gctcacggtc 300 atcgtcacgg cctgtctcac cttcgctacc ggtgtcaccg tggcgctggt catgcagatc 360 tacttcgggg acccccagat cttccagcag ggtgccgtgg tgaccgatgc tgcccgctgc 420 acttcactgg gcatcgaggt gctcagtaaa cagggatctt ctgtggacgc agcggtggca 480 gcagccttgt gtttgggtat cgtggctcca cacagttctg gcctgggcgg tgggggcgtg 540 atgctggtac atgacatccg acgaaatgag agccacctaa ttgatttccg ggagtccgca 600 ccaggggccc tcagggaaga gaccctgcaa agatcctggg agaccaagtg acctggtctc 660 ctctctccct cgcctacccg ccttgcccag cctgggctct tggtgggggt 
tcccggaatg 720 gtgaaggggc tacatgaagc tcaccagctc tatggcaggc tgccatggtc ccaagtcctg 780 gcctttgcag cagctgtggc ccaagatggc ttcaacgtga ctcatgatct aggtcagtgg 840 ggcctgggga tttgggagag acatgaggtt gatggagaag ggtagaatct ttgagatttg 900 gagcccaagc cagagaggcc ctctcttcca gtttgctcct cagaccccct ccccacctta 960 tcccaccctc acctggaggc ggcctcaaac gagggggtct gggaagggcc cttagacatc 1020 cctgcctcgc acttaacagg ctcctggggg tcagggcact tagggtgggc cactcagccc 1080 ccatctaccc catctattgc gcgtctcatc cggtccagtc ctggcgtccc gccctgccca 1140 gcctcccgtc cctgccccta gcccgtgccc tggctgaaca gctgccaccc aacatgtccg 1200 agcgcttccg ggagacgttc ctgccatcgg gccgcccgcc actacctggc tcgttgctgc 1260 atcggcccga cctggctgag gtgctggatg tacttggcac ctccggcccg gctgccttct 1320 acgcaggtgg caacctcaca ctggagatgg tggccgaggc tcagcacgca gggggtgtca 1380 taaccgaaga ggacttcagc aattacagcg cccttgtgga gaagcctgtg tgtggcgtgt 1440 acagaggtga cctctccccc ggctcccagg gtcccccctc aggagaagcc tcccagtcca 1500 tggccacatc gttttggcct agagactcct ctccatttca caggagagaa actaaggcag 1560 tgagctacct gggcagtcaa ctgttgaagc aaaccagagt cagcccacct tctgaaaagt 1620 agctgtgggg tcagttaccc caagtattta ggatattggg gggtggacct ggtcaaaggg 1680 tccaacctgg aagttggggt tctggccata ggttttcttg atttgggctc tgtgttgcct 1740 ctgccctttc cccagcagcg ctgtcactgc cctgtacttc ctaagaattt ttaagacaaa 1800 gtccatccaa gcttcacagt agaatgaacc tttcaagaca gtcatagacc cagctcctca 1860 tagtgccaaa aagaaattga ggcccgagga gggaatataa caactggcca aactcaagaa 1920 aaccaacagg gaacccagaa aaccaagctt atgacatggg tggggtccat gtttactgaa 1980 cccaaggtgg taggtgctgg atttctcaga agattctcag gttctttccc ttctaggaga 2040 gccagatcac atcaagcccc aggaaagggt ttctctgagt tcaactttcc aggcctcaat 2100 ctccaacctg attcctctgc cagaggcagt ggaatatgaa gcaatggaaa gagcctatca 2160 gctgggtgca gtggctcagg cctataaccc cagcacattg ggaggctgag gcaggaggat 2220 cacttgagcc caggagttca tgaccagcct gggcaacata gtgagacacc gtctctataa 2280 aaaatttaaa aattacccag gcgtggtggt gtacctatag tcctagctac tcaggaggtt 2340 gagatgggag gattgcttga gcctgggagg tcaaggctgc agtaagcatg attgtgccac 2400 
tgcactacag cctgggtgac agagtgagac cattatcaca cacacacaca cacacacaca 2460 cagaaaagaa aaaagaacaa gcgtagcgtt ggagctagga agacctgggt ttgagtacca 2520 actgtgccat ttaccttgaa caagctactt cccactctga gccttggttc cctccttaaa 2580 atggaaataa taatagtgcc tacatcttag gatgcccata aggatttgat gagatgatgt 2640 aggaagttgc cctgatcaga gcatgcactg aatatagatt ttccatttcc ccttcccagc 2700 ctgggttgtc tcaacaatgc tagtaccttg agtgctctgg attccccttg tctctctgta 2760 ccacagtttc tggacctatc aaatatgtgt gataacagta gccacagtga ggactaaatg 2820 agttaatgtt agtgttatgc atcagctatt attcctttga tgtccaaaca tctagtaaca 2880 gaataggaag agaaggttgc tgctgccaag aagtttgggg ctggatgaga ccatcaggag 2940 tcatggataa tatcctggcc tatgggagag ctaaaaggag acacatgaca ggccaggaat 3000 atgaacatct ggtcacctgt ttctgctgtg ctctgctgat agcctgcagt gtgactctgg 3060 gcaagccaca gccctcttga gccttagttt tctcaaatgg cagaggaaac tgaagaccag 3120 tgttctgagt tcaagtcctg gccctgccct tgacttgcca tgtgaacttg aatgagtctc 3180 tttgcctctc aatgtttcag tttccccatc tggtgaggga gttcagcgca gtggtctcag 3240 tgtgtgatgg tggaatgtgg agatgggatg tgggatgtga aggtgccttg a ccc cta 3297 Pro Leu 1 ccc tgc cat cag tca ctt cct gtc ctt tca ggc cac ctg gtt ctt agt 3345 Pro Cys His Gln Ser Leu Pro Val Leu Ser Gly His Leu Val Leu Ser 5 10 15 ccc cca cct ccg cac acg ggc cct gcc ctc atc agt gct ctc aac atc 3393 Pro Pro Pro Pro His Thr Gly Pro Ala Leu Ile Ser Ala Leu Asn Ile 20 25 30 ctg gag ggc ttc aat ctc acc agc ctg gta tcc cga gaa cag gct ctt 3441 Leu Glu Gly Phe Asn Leu Thr Ser Leu Val Ser Arg Glu Gln Ala Leu 35 40 45 50 cac tgg gtg gca gag acc ctg aag att gca tta gcc ctg gcc agc aga 3489 His Trp Val Ala Glu Thr Leu Lys Ile Ala Leu Ala Leu Ala Ser Arg 55 60 65 ctg gga gat ccc gtc tat gat tct acc atc act gag agc atg gat gac 3537 Leu Gly Asp Pro Val Tyr Asp Ser Thr Ile Thr Glu Ser Met Asp Asp 70 75 80 atg ctc agc aag gtg gag gcc gcc tac ctc cgg ggc cat atc aat gac 3585 Met Leu Ser Lys Val Glu Ala Ala Tyr Leu Arg Gly His Ile Asn Asp 85 90 95 tcc cag gca gcc cct gcc cca ctc ctg cct gtc tat gaa cta gac gga 3633 
Ser Gln Ala Ala Pro Ala Pro Leu Leu Pro Val Tyr Glu Leu Asp Gly 100 105 110 gct ccc acg gct gcc cag gtg ctg atc atg gga cct gat gac ttc att 3681 Ala Pro Thr Ala Ala Gln Val Leu Ile Met Gly Pro Asp Asp Phe Ile 115 120 125 130 gtg gcc atg gtt agc tcc ctg aac cag ccc ttt ggc agc ggc ctt atc 3729 Val Ala Met Val Ser Ser Leu Asn Gln Pro Phe Gly Ser Gly Leu Ile 135 140 145 acc ccc tcg ggg atc ctg ctc aac agc cag atg ctg gac ttc tcc tgg 3777 Thr Pro Ser Gly Ile Leu Leu Asn Ser Gln Met Leu Asp Phe Ser Trp 150 155 160 ccc aac cgg aca gct aac cac tct gca ccc agc ctg gag aat tca gtg 3825 Pro Asn Arg Thr Ala Asn His Ser Ala Pro Ser Leu Glu Asn Ser Val 165 170 175 cag cca ggg aag cgg cca ctc tct ttc ctg ctg ccc aca gtg gtc cga 3873 Gln Pro Gly Lys Arg Pro Leu Ser Phe Leu Leu Pro Thr Val Val Arg 180 185 190 ccc gcg gag ggg ctc tgt gga acc tac ctc gct ctg ggg gcc aat gga 3921 Pro Ala Glu Gly Leu Cys Gly Thr Tyr Leu Ala Leu Gly Ala Asn Gly 195 200 205 210 gct gcg cgg ggc ctc agc ggc ctg aca cag gtg agg ttt acg cca tgg 3969 Ala Ala Arg Gly Leu Ser Gly Leu Thr Gln Val Arg Phe Thr Pro Trp 215 220 225 ttg gcc ttt ttc agc cgg gag cca agc tgt ggc cta gac tgc aga tgc 4017 Leu Ala Phe Phe Ser Arg Glu Pro Ser Cys Gly Leu Asp Cys Arg Cys 230 235 240 ctg tca tat ctg tgg ctt gtt tcc att cct cat gcg gca aac atg ggc 4065 Leu Ser Tyr Leu Trp Leu Val Ser Ile Pro His Ala Ala Asn Met Gly 245 250 255 tgaggctgtt ttgtgccagg cgcatcctgg gcaatgtgga catgggctgc gctgagtttc 4125 ttagtgaaaa tggagccctg atgctccctg tctttcccgg ttgcagggga aatggcaggg 4185 tcagtagatt gaaaagtggg gttagcatct gggtggcctg ggttctgtta acctcaagtc 4245 ccattatacc tctacccagc atctgaatag acaaccagcc ctctttaggg tgggatttga 4305 aaggaagggc ctgcttttct ccagaacacc tcagaatggc tccatgaatg tcttgagaac 4365 ctgatatata tgccaggcac ctattttggc aggtggtgag ttaacgatct taagttctgg 4425 aatcagacac tcctgggttc cggcccaagt gctaccaatt tgtaactgtg tgatgctgga 4485 gcattcactt aacagctctg aacctcagtc tcttcatctt taaaatggaa ataataacaa 4545 tagctaacat ttaaagagtt cgtactaagt 
gctgggcact gtgttatgtg tgttaaccat 4605 tttaaccttc ccagttctgg agagaatcct tatttctatt ttatagatga ggaaaccgaa 4665 gcccagagag gttaagactt taggtcaagt accttcagct agaaagagca gggaggcagt 4725 ttggttccag agcctacgca cactcctaac cactaagtta aaagcaccca cattgtgccc 4785 tagaccactc ctagctatta accttgccct atccagagag aaaagaaggg agccccgggc 4845 tagattaaag agacttggtt ctagtggtgt atcttccctg gcttcctcta aactctggag 4905 taagtcattt tccttgctgg acttcacttt ccctctctgt gaaatggaaa cagtcatcac 4965 aggcctgcct gtcttccagg gctgttagga ggctcagatg acacccggtg taagaattct 5025 caagactaag gagcctggac caagggtgag agctgaggaa tcctggacca tcacccctgg 5085 gacagccctg atttttctgt accttggcta ccacaggttc tgctgaatgt cctgaccttg 5145 aaccggaacc tgagtgacag cctggcccgc ggccgc 5181 <210> 3 <211> 4786 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (1276)..(1707) <400> 3 tgcagccgac agcacgcctt gcggccctgg ggatcttcct gtgccccggc gagacccttt 60 ccgcctcact gcattggaac cccattcccg atcacccgct gggatccatc gtcggacccc 120 aagaggagtc cgcgcagccc agccggcacc ccgaagctcc tccttcagcg ggaacggaag 180 cagaagagcg atcaaggagg tcctcaccac aggactccta tgggtccgac cctgggtctc 240 ccgcaggccc ctctggcagt cctcttccca cccgtcgcct cgggctgcgc cgccgccgcc 300 gccgccgcaa cctccagcac cgccgcccca ggccccgcag ccgccgcgtc gccgccattt 360 tttaaagggt ccgcagcctg actctgcgga gtaagggggg gtggagcggg gggagtcggc 420 ctcgccagcg cgcaggcgcg aggcccgagc cgccgcttgg gtcacagtga aagccaccgt 480 tgcccgggga tgggtccctg acacttgggg aagtaggagc cctgtgtgat cgtgcgtctg 540 agtctgggct gagaccagtc ctggccaggg cagttaccag gacggtctcc ggaggccggg 600 attcgcggag ggtccagcag caggaagaaa ccccaggagg aagaaacctc agacagatcg 660 ccggcgaggc agcgcgggat cccagcctct ggcgtgcgcg gacggtgtgc gggtgagtct 720 ccccaaaagt ggagcccttg tgatgacgag cacaggtccg cctgtgtgcc cgtgggctgc 780 tctctcaccg gtggctcgta gtcgcgtaaa gcagaagtag cagtttcagc ggctgacact 840 aagaacaatg catactggca aagtgaggtg tctcacatct ttaatcccag cagtttggga 900 ggctgaggcg gatggatcac ttgaggtcgg gggatcacac cagcctaacc aacatggtga 960 aactccgcgt ctactaaaaa aaagtaccag acagttagcg cggtgtggtg gcatatgcct 
1020 gtaaatccca gctatttggg aggctgaggc aggagagtcg gttgagccag tgtggccgag 1080 gttgcagtga gtgtagattg caccactgca ctctagcctg ggtaacaagc acaaaactct 1140 gtctcaaaaa aaaaaaaaaa aattatcagt gcagagttag gccagattca ttttcagagt 1200 aggggggaat ttcaaattat gggtgaaatg tcccaactgc cggaaggtga aagtgttctg 1260 tgatgtaaaa tttag gtc aca ctc tat aaa gct tct cct tat tct atg tgt 1311 Val Thr Leu Tyr Lys Ala Ser Pro Tyr Ser Met Cys 1 5 10 gaa cag cta gtc tgt atc ttc acc tat gca tat tta aaa atc aac ctt 1359 Glu Gln Leu Val Cys Ile Phe Thr Tyr Ala Tyr Leu Lys Ile Asn Leu 15 20 25 acg aag gct ctt ggc cac cat aat cct tcc tac cct gta gca aga gcc 1407 Thr Lys Ala Leu Gly His His Asn Pro Ser Tyr Pro Val Ala Arg Ala 30 35 40 cag cct cag act atg ctt cct tct gcc tcc gag tgc aat gcc ttc ccc 1455 Gln Pro Gln Thr Met Leu Pro Ser Ala Ser Glu Cys Asn Ala Phe Pro 45 50 55 60 cga gtc tcc aag gta cca aac cgg tca ggg gca gct gag aca gct cta 1503 Arg Val Ser Lys Val Pro Asn Arg Ser Gly Ala Ala Glu Thr Ala Leu 65 70 75 ctt gta cga gtc ctc ccc aag ccc ttg aaa tca cag gca tct ctg ctg 1551 Leu Val Arg Val Leu Pro Lys Pro Leu Lys Ser Gln Ala Ser Leu Leu 80 85 90 tcc ctg ccc ttt atc cac atc act gaa caa tta agc aca cac atc tgt 1599 Ser Leu Pro Phe Ile His Ile Thr Glu Gln Leu Ser Thr His Ile Cys 95 100 105 gta gtt tta ttt aaa ctg ttc cat gtg tgt ttg ttt aca att aac aac 1647 Val Val Leu Phe Lys Leu Phe His Val Cys Leu Phe Thr Ile Asn Asn 110 115 120 atg aat gac tgc agt atg gat gca gca gaa agg aat gct gag gga gga 1695 Met Asn Asp Cys Ser Met Asp Ala Ala Glu Arg Asn Ala Glu Gly Gly 125 130 135 140 ctc cag ttg gaa tgagccactt ttccaggact aggaatgaac tttccccagt 1747 Leu Gln Leu Glu accaacaagg cagcctccag ggcaagaaat ggtaacagaa tccatcctgt tgacagtcaa 1807 gatggaatta cagagaattc tcacctcttt tttgactctc aggtggaaac ccccttccca 1867 tggggtgaag cgcagtctgc tccagagacc cactgttagg gtcggggatg tcacagaaat 1927 taacagggca gccaattccc atccagaggg gcccaggatg accccgaacc ctctagctca 1987 gccccaagct cgggccaggg agtgaagcct ttgtccccaa gcaacagagt gagggtctgt 
2047 ctctaacccg gtgggctcca gtgaaaggaa aatggcctgg aggtgcccac aggagggcgc 2107 cctagttctg agacaggtct acggctccca gactgcgtcc ccatgagtaa cagagggaca 2167 cgcagagggg catactgagt gcctataggg gactcacaga ctgcagatgc acgctgcgga 2227 tccagaagcc ggtcaggtaa tgaggtgcag aggtgtgggg aagacctgga tgcctccctg 2287 ggtgccaacc agcctaacca gccggggcca ccaaacagaa gtgcttctga cattaacgcg 2347 acacacagaa ggtgtgtagg gtgggtagtg gaagtgaccg cagaggcctc tggggattgt 2407 agttttggag ctagagaggc gggcaagcag ttggcttcgc ccacacggtg tgcctctcac 2467 atctgcacag cgcacacaac caagcttagg agccgactgt taagatgccc cagtgacacg 2527 gacaagtttc tcctctgcaa cccaatgaga gctgctgacc acgagagcat cgcgggaccg 2587 ccaaaggatt ctggaaaatg tggtcccggg tggccagtgt gcgctgacct tcacctgaag 2647 gtgagctgag ctcaccatgc aagtctccac cctggggtgc ccttccccaa gcagatgcca 2707 ggttggggca gaagcagttc agggagaagc gatgcgctcc tgggactgcc ggtttggggg 2767 caacatcggc cctgtgtccc ctagagtagg agggtgtacc aggatgggga accccaaggc 2827 cgttttcagg gagaacaaga ttggcttgag gcaaactccc ggaatggaga tggggagagt 2887 ctggtcgcat ttccccagga gtttagggga tgaaggctgg acttgggtga acagcaaaag 2947 atcccctggt gatagtggtg gtgaggatca gaggcttgag ccagaaattc ccgaggtgtt 3007 tgtttgtttt tttttttttt gagacagagt cttgatctgt ctcccaggat gtagtgcagt 3067 ggcatgctct tggctcattg caacctccgc ctcccgggtt caggcgattc tcctgcctca 3127 gcctcccaag tagctgggat tatgggcgcc tgccactgtg cccacctaaa ttttttttgt 3187 ttttttgtat ttttagtaga gacgatgttt caccacattg cccagcctgg tctcgagctc 3247 ctgacctcaa gtgatccacc tgcctcggcc tcccaaagtg ctgggattgc aggcatgagc 3307 ccccacaccc agctcatttt tatcttttaa cttaaaggtt tttatttttc tctattttta 3367 ctcttacagg tacggcttga aggagacttc tggagggtgg ggtggtgttg aagagaaggc 3427 acaattgtca ccattttgtt caacatgggc gttttctttg tgcattgatt tgcattttta 3487 aaggtattcc attaaaaata tgtttatcta ttttaatgtt ttttttggtg cccctttaaa 3547 ttctctatcc acagtcctga gccttaatac gaagtgaggg cctggcctct gggcatcaag 3607 cagccagaaa gtcccttagg agctttctct gaattgttga attgtgactg atcttttcct 3667 ttcctgactt ccttaatatc cacaatttcc tcgtaaccat ccccccaaag ccacacccac 3727 
aaggcacctc caagacctat catgaagcag atattcaaag attatttgta aatcttcaca 3787 ccccacccct tccagaatgt cttccaacat cccttcgcct ggagaaataa aacacaaggt 3847 ttgcatgttg agtatcagcc tctcccattg ggtgggtgtt taggaaaaag aatcagccaa 3907 gagccctatt ccagcccagg taaatgactt tgccaaagat ttaatatcca caaatgtaca 3967 atgctcactg ggaaccaaag tcaggcatgg ggctgggctt taaggagcac aaacaaaaag 4027 gagggactag aaaacttcag aaaggtattg gtgggggatg ttgcgggggg acaggggaca 4087 gcgaggatgt gggatcccga gatcatccaa atccctatgt gtagacatat gtgtataaag 4147 gcctttaaga gactcaggct gatggggtat ctgtaataaa tcaaacataa tataacagca 4207 cgtcaagtga taaggggact ctggaaaaac aagcagcaaa aggagcagta tcaaactcca 4267 cagaaattca caaacatcaa gacaccaaga aagctgcatt catttaaatc aaggtgacag 4327 gctgggctct gtagctccag cctgtaatcc tagcactttg ggaggccaag gtgggcagat 4387 catttgaggt tcaagaccag cctggccatt atggtgaaag ctcctctcta ctaaaaatac 4447 aaaaattagc tgggcatgtt ggtgggtgcc tgtaatccca gctaaagagg ctgaggcagg 4507 agaatcactc tgaacctggg aggcagaggt tgtagtgagc agagatcacg ccattgcact 4567 ccggcctcgg ggacagaacc aaactccatc tcaagtaaat caataaatca aaggtgacag 4627 ttactgagtg atgcgggcca ggtacaatga tataccttca tctggattac cctcgtgaat 4687 ctgaatcact tgtgtgatat ctgtgcttta acgagtccta tttcccatac tcttagctgg 4747 cttgagatgt gaaaatccac ctaaattata aacagagtg 4786 <210> 4 <211> 6646 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (5605)..(5844) <400> 4 attctgtttg tcatatttag tagctgcact ctcctaagga aaggaacctc accttcctgt 60 tcacttttct gtctccattg cctgtacttc cagagccagc ccgtgatgat gcctgaaaaa 120 attttttttg atgtatgaat agatgaacaa acaagcacag aagaatacag tgtgaataca 180 gtgaagccag aggtgtactc tgcgaataca tgagacaccc ttgagtggaa tggtttcagt 240 agatggattg tatgaggtgt gaattgtatc tcaatcaagc ggggcctcag ggagtgccca 300 ccactaaggg tctagctctg tgtggcgttg gtggggctgg ggaaggttta gaaaaggttc 360 cgttgctaag accttggtta gggtagagtt tgcagcagag atttggatag gagggcgttc 420 caggctgaag agccagtgtg agcagagtct ttgtgggaag aacattttat attcctggac 480 ctaatgcttg tttaacaaga tgcttgcttg tggctgggcg cagtggctca agcctgtaat 540 cccagcactt tgggaggctg 
aggcaggagg atcgcttgag ctcaggaggt caaggttgca 600 gtgagccatg attgcaccac tgcatgtcag cctgcgtgat agaacaagac cctgtctcaa 660 aaaataaaaa atgacaataa agattgcttg ttaaataaaa tgtgatttat ttactttgca 720 taaatactat gccttctgag aatgctgtag gtgcgggcat attttgtttt tatcctaacg 780 cagcccaaat tccaagtaca cactggattt tgagcattta aaggacttcc ggtgctatta 840 aggattccca tttgtttgtt caagtttgct aaacgaattc agtcatgtca cttacttagt 900 tgttgtaagt attatattac agtgtgaagg gggtaaaaag aactgctatg agtatctttc 960 catgaagaag atttagcttg gaaatgtggg cagatttgtc tgcctttgtg tttaggaagg 1020 cacaattata attacaggtg gaggagctag acactggcca gctggaatca gaatttcaca 1080 ttgtagaagt gtctggtctg gaaggcatcc atgggaaacc tacacccttg acccagcatt 1140 ggccacagag ctgcaggcgg tctcctgcct ctgcctctgc ccctgcgtgc ctctcctttg 1200 tagcagccgt ggcatatgca gaggggactg tagatctctt ttctttcttg ctgactttat 1260 tttctaccac actcccccaa cccaaataaa gaaataatga catgtgcatt tgagatagtg 1320 tgtgtgtttt ttttttttct ctttctgaga tggagtctta ctctgttgcc caggctgaag 1380 tgcagtggta caatcttagc tcactgcaac ctctgcttcc caggttcaag accagcctgg 1440 ctaatgtggc aagacctcgt ctctactaaa aatacaaaaa aaaaaaaaaa aaaaaaatta 1500 gccaggtgtg gtacacgcct gtaatcccag ctacttgaga ggctgaggca cgggaattgc 1560 ttgaacctgg caggcggagg ctgcagtgag ctgagatcgc gccactgcat tccagtctcg 1620 gtgacagagc gagactctgt ctcaatttaa aaaaaaagtt atcgtattta tatgaaaaca 1680 tctgaatacc caaaggatat ttaaaaaatc actgccagtt taacagtcct cagttggagt 1740 tttttttttt tttttttgag acggagtttc actcttgttg cccaggctgg agtgcagtgg 1800 cgcgatcttg gctcactgca gcctctgcct cccaggttca agtgattctt ctgcctcagc 1860 ctccgagtag ctgggattac aggcatgtgc caccatgccc ggctaatttt gtatttttct 1920 tagagaaggg gtctctccat gttggtcggg ctggtcttga actcctgacc tcaggtgatc 1980 tgcctgcctc ggcctcccag ggtgctggga ttgcaggtgt gagccaccgc actcggcctc 2040 agttggagtt ttatttggcc tcttttgatg cctggtggct tatgtgtgtc ctatacaaaa 2100 caaattgcct tttttcccca gaccccagaa atgtggacac gtgagttgtc tttattatag 2160 tccacgcttt ccttgtgttc ttctcctcta ggactttcta atatatagga caagtgattt 2220 tgacttgact taaaccctac ggaaataaga 
ttttgtttgt agatgcattt gtatggtgaa 2280 actggagcaa aagtcttcag cttagcggta tgtatcagca gggagctttg aaggcagcat 2340 ccatgcctgg cccctcctga agagctcaac aggtaggggt tggtcgggac attgagattt 2400 gtaacagcag cctccctact ccccagtaaa cggggatcgt aaggcacagc cgaggctgag 2460 actgctgtgg ggtagtctag gccgcctgct ggaatgtcgg tcggtgctgc attgagcgct 2520 gtctttcaag tggttgcgtg ggagacctgc acttgctgtg tgctaccttg actgaggact 2580 tttgaggaaa ttctgtttga caattacttt aaggtccagc gtgacaccct ccctcagact 2640 tagtctctag tgactttctg atgggagcgc ctctgcttca tcaaggattg gaggcactta 2700 gctctcaaca aacttaggtc gctttttttc ctccacagtt ttgttacctc ccttatttgc 2760 caggcttggc tttgagtgaa tcttggcttt tttggggttg tggggggctc tctgattcat 2820 tcagcagatc ttcactgagt ggcagatgtg tgcaaagtgc ttgggatgcc aaagacaagt 2880 gtgtcgtctg tatttagtgt tgggacagac agtgattgag acaacagatt taatcttcac 2940 tgatgacagt gatgatgcca cgaaagacat gggcgctgac agagggtggc tgctggtgta 3000 ggcaggatgg ttagggacct cctttctcag aactgaccgg gaagggatgg tggggaccag 3060 gccatttggg gcctcacaga tcagggctga tgagtttggc tccatttatg cttgactcag 3120 tttatttgag ggtgggagca ctcctgtttc tcagaattcc ctgtgcctaa ccccaggcgt 3180 ggcagataat actcgctcag taaatgttta ttgaacaaat gattgattga gagtgatgga 3240 acacacagaa ctgtgggtct gggagcagga gcgcgaggcc tgtctctgaa tgcagcctgg 3300 acaatgtgcc cttctcctca tccgccaggt gagaccagca ggagaaagcc atctcagcag 3360 tggttggtgg ttgtatttac agttaaattt tattttgaga tgctcatagt ttcacatgca 3420 gttgaaagaa tacagagaga tcctgtgtcc ccttctccca gctgccccct gggtaacatc 3480 tggtgaaatt caacgcactc tcagcaccag gatcggggca ctgatactgc caggacacag 3540 aacatcagga cggctccgtc cccgtgcaaa tccgtcatgt cggcctttta tagtcccaca 3600 cacaccgcta atctgttctc tatttttgtg attttgtatt tcaagggtgt tgtgtgcatg 3660 gggtcatgga gtgagtggcc ctttggggtt ggctgttttc cttcagtgtg attccctaga 3720 gcgttctcag tcccgccagt caagtttctt ccttttgctg ctgagtggta ttttgcggcc 3780 tggctggcct gcagcttgtt taaccattca cctgttaaaa gacacctggg ctgcgtccag 3840 attttagttg ttacaaataa atgttgctct caacagtcct gagcaggttt ttgttgaata 3900 taagttttca tttctctgat ataaatgccc aagagtacaa 
cttctgggtc atagggtagt 3960 tacgtattca gctttattaa aaagtgcctg ttttccagag tggccataca gtggtttttg 4020 tttgttcttt taaatgactt aacaatggaa ggaaagacgc ttctgcatgg cctggcatag 4080 aaagcccatg gagagccgcc ctgtgggtgg ggaagcgggt gacgggagcc ccactgctta 4140 ccagaaaccc gagtgaaact gcgtctttcc agtcccaaga ttctcctttt ttgctaaagg 4200 catcagaaga gatgaaagca tttctaccac ccacatttcg gggacactcc tagcttttac 4260 tttagagttg ggagagaggt ttggttttca ttgtcattta aatcctgttg ggtttcagca 4320 gagcctgatc tttagggccc tgggtgcagc tgttgtctgg agatgcagat cactgtctgc 4380 tgaaaagagc cctcctgctg gggttaggga tctcctgatt gaggcatgga tccaagggct 4440 tctttctttg ttctctgatt ccctgaggct cttccttgtg tgtgtggtgc gtgtgcactc 4500 gtgtgagcgc accaggaact atgacagcaa tcaacgggta tgactggggg tggggagcag 4560 aggcagcatg gccaggaatc tatactcgag ctactaagag gaaggaagtc gcagcaaaaa 4620 tagcctgggt ggggtggtag aaaatattaa ggggaaaatg agatgactgg ggtggcgttg 4680 aagaagagct gcttaaagag aaacagggac acacgctggg taaggaatcg catgctgttg 4740 aaactgacgg tgtggaagga aagtgaggtg gcggcggctg ggctgtggta gaggaagccg 4800 atcaccagtg agtgaaaagg gaactagggt acccacgcgt ttgtgttttt tatttaaact 4860 ttggctaagt tttgctagga gcgttttgcc tttttttctc agaccttcta ctcctgtcgt 4920 tctcccctaa gttctgggca tgtggcttct tctctaccta ccagggtagc ctgtcattgc 4980 tttaaaggac aaaatgaaat ccctgatggg caaaatatgt cccccctctt cccccagctt 5040 ttggtcggct ttgctcaccc accaacagct aggagctgga aggctcagaa gagctggggt 5100 gggcatggga gagacagagg aaccccggtg gctggaaaca cagtcccttc ccagggtctc 5160 ttcagcttgc tggttggacc tcttagcctg cttatgaagc ctgaattgac atcatgccaa 5220 ctggtctcct gctgataagt gattccttaa cttcctcggg agcagcagaa acttgttttg 5280 aaaaaaatca gattctaaat ctttacagct gtttacttct gcaagcacaa tggccgtagt 5340 aaggaattca agtcacagcg cagccgtggc agggaaggct ctgcatggag tcacaggaca 5400 aagcctgtcc tggaaacgtg ctggaaggaa gtcggcctca ttctttacga cgggcagcag 5460 ctccaaatga gaaaccaagg ggcttggctc tgggtcagta gcatctgcct ggcatgcggt 5520 gccagctgtg gagaccagtg aggtggcagc agctgcactg ggctcctggg gtgtgcgtgg 5580 ggacctcagg ggtttaggag ctga tat act ttt ggt tct cat cat 
ctt gtg 5631 Tyr Thr Phe Gly Ser His His Leu Val 1 5 tgt tcc ttg cag aca cag aat gtt cag ctt aac aaa gaa atg aca ctt 5679 Cys Ser Leu Gln Thr Gln Asn Val Gln Leu Asn Lys Glu Met Thr Leu 10 15 20 25 gcc agc aac cgg agc ctg gca gaa gga aac ctt ttg tac cag ccc cag 5727 Ala Ser Asn Arg Ser Leu Ala Glu Gly Asn Leu Leu Tyr Gln Pro Gln 30 35 40 ctg gac acg ttg aaa gca cgc ttg acc cag aaa tac cag gaa ctc cag 5775 Leu Asp Thr Leu Lys Ala Arg Leu Thr Gln Lys Tyr Gln Glu Leu Gln 45 50 55 gtt ctc ttt gaa gcc tat cag ata aag aag acc aaa tta ggt aac ttt 5823 Val Leu Phe Glu Ala Tyr Gln Ile Lys Lys Thr Lys Leu Gly Asn Phe 60 65 70 tta agg gtg atc att cga gaa taaagagttg agagaaggac acattttcaa 5874 Leu Arg Val Ile Ile Arg Glu 75 80 gctattgccc taccaggaaa gaaatattac ggaaaaattt caggttaaga tagcgactct 5934 actgcaatga acaaatagct tatgtatatt atttatttat ttttagagac ggggtcttgc 5994 tctgtcggac tggagtgcag tggcgcaacc atggctcagc acgacctcct gggctcaggt 6054 gatcctcctg tctcagcctc ctgagtggct gggaccacag gtgtgcacta ccccaagcag 6114 gctagttttt catttttttg cggagaggtt gtcttgctat gttgcccagg ctggtcttga 6174 actcaagcca tcctcctgcc tttgcctcac aaagtgctgc tattacagac atgagccact 6234 gtattggact agtttatgta tggaatatgg agggccccca gacctgcggc ctctctgcac 6294 ttgcggttcg agagggcagg gtgtatagca ggactgcccc ctcctcccag tggccctgac 6354 tggtggcggt gactgctgag gtcagcccag gtccacacgc acctccaggg tctgcagtgt 6414 gaagcaggtt gcagaaacgt gggtacctcg gggcagttac cagaaaaggg gctggagatt 6474 aggaggaagc cagccagttt agcagtcacc agtttagaag aggagactta accacttttt 6534 aaattcaaga ccccaccttc tctaaacagc atagaaaaag ctttctgaat tatgatatta 6594 aacagatgtc ttatgacgtt tgcccgtttg tcttatgctg tttgttaatg gt 6646 <210> 5 <211> 5564 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (3913)..(4422) <400> 5 gtattagcaa aaaaaaaaaa aaaaaatggt aaccaccgaa atgtaggcag ttaccatatt 60 tttaggatac agaaggagga gaaaggatat gaaacatttg tacaatggca tactctaaaa 120 aagaacgagc tagctgtgta ctaatatgga acaagtgcca agatatacac ttagataaaa 180 taatcgggta tgtaagagta tgcttcaaag tcttcaaaag aaaaggaaaa 
acaaaagata 240 cagataggct tgtaagaatc tcagaccatt tccagagcac ccaataaact ttaacagagg 300 ttactgctgg gcaatggaaa gaggggattg ggaggtggtt tgaggagtct aggctgacac 360 ctgaatcggc agagaggagg cggtgacgtg ccacttaccc acaacatgta ctggctgaac 420 cgccgcagga agtagaccat ccagtttcca agggacttta aggtgtcagg agcaagcctt 480 ctccagatat tagggatatg cccgataaaa agagacctgg ccacatcatc taactcattg 540 ctcattccaa cttctccagc caaggcctat ttagaatcaa gagcaatcgg gcaggactct 600 atcagaaccc caggtgccac gtatctaggc agtaatctgc acatgagaga caggctcacc 660 ctttgaagtt cagccagaga cttcgtcatc cggaccacaa gcttgttgaa gcgttccagt 720 tcctgcagga gcaccaccga agtgggggag agtcctgttc cgaggcgctt cctcacctgg 780 tccaagtcaa agactttggg catcttgttt tctatttctt tggccacttg gccaatataa 840 tcatcgcggc tgataccact gctggattcc cctaaaacca ggacactttt cagagaaagc 900 aaggcatcca cgtgcagaac cggagcctcc tgagccttcc agttctggga aaggcatcag 960 caacgtggga gggctacaag aagcaatcgt aaaaccccaa tcaaactaat gaagtcctcc 1020 tgagtagagt ttacctgtct gaggctgcag ctccagcagg tgagcccaca tgtctcgagc 1080 cgcctgcgtg taatagccaa tctcagcgtt ggggtggaga ccaaacactt ctggcgtgtt 1140 ggcaagcggg agggcctcga tggcttctgc aatcacagag aacaacggag gtggcactgt 1200 ccagccacct gtgctttctc agggcccaca gcctcaaagt ctccaagaca tgcaagacta 1260 atttccactg tttagagggt gacttttttt tttttttgag acagggtctt agtcacccag 1320 gctgaagtgc actgacatgg tcattgatca ctacagcctc gacctcctgg gctcaagctc 1380 ttctcccatc tcagcctccc aagtagctgg tactacaggt attcaccacc acactcagct 1440 aatatttttg ttttttgttt ttttgttttg attttttgtt tattttttgt agagatggga 1500 tcttgcttgg ttgcccaggc tggtcttgaa cttctgggct caagcgatct gcccttcttg 1560 gccttccaaa gaagggatta caggtgcgag ccaccatgac cggccaatga catgttttta 1620 atgtaaaaat tctacactac cacacccagt ccagccgtat gttcatcttt tacgaaaatg 1680 gctcagtgag cttcactgtg gccattctgc ccttccaggt ggaacctgat ttatttacca 1740 tcattccagg atgggaaatc tgaactctat cagtcagctc tgcagccagg aagcgtctgc 1800 cgagccacct aacaggctta acagaccaca cctgaccagc cgaggagccc ccaccaggag 1860 aggagccggt tgaaatgtat ggtctccctt agaaaaagcc ttgaaaacat cttttcatgt 1920 
ctgagaaatc tcaccaacaa atttctcctt ttcatcacca acagggattt tgtagtccac 1980 ttccttgttc cggaagaagt ggaatggctg gaaagtatca aaaatgaagt cccccaggta 2040 ctcatccatg tagatggtca ggatgcggcg atcaaagctg tcgatggccc gtcctccata 2100 catgacctga aattagaggg cctgagagcc agccaggagc tcagaaccta ctgtgtgcac 2160 cctgggggcc cggggccaga ggctcaccca gggtcaagag actccagcag cctggctgct 2220 gcaaggcttg aacctgttga cttgcataac ctttcgggcc actgttcaca caattgatct 2280 gtggctttgc tggcttccca agggataggg cccaaagcag taagaagccc tcaggaagtg 2340 ttgggagctc ctgtgcagca ggcaaagtcc cccaaacaca tcaaacaaac agctgaagaa 2400 gccagcagct gcttctgtct ttggaattga tgtcactgcc tgagccgtgt gcagaccacc 2460 ggggatgctt cctgtgagct ctgtatacag caacagctga ggcaaggggc tagaggatgc 2520 cttgcctgga tccccagcca tggatccagc tcagggaacc tccatgtgaa cagagctcac 2580 ttaggacaaa gaacaaccag gtgagagcag agagaattgt cacagaaacc caaggggaac 2640 acaggggcag aagaggctgg ttctgagggc tcagaggaaa ataggggcac gaggccacag 2700 ctctgtagcc ccagaggcac ctgcggaagg atccccggtc acccctacct ctccaattag 2760 gtacttgagg ctgccccacg ggatccttgg gtcccgttgc tggaaggctt tcgttaagta 2820 cgtgttcaga atttccatgc agaccttgaa agaagagaac acgctcagtt atactgaaac 2880 ccaccaagct gttcacattc tgatactttt ctgaaaaggc tgctgggaac ggtgcaaccc 2940 tgctgtggtt gaacccagag gaaagtcggg aggaagtggg ccgcctaagg aagccatttc 3000 caaggagaag cagccactgt cacctggaag tcagactcat tgaagtcata gtacacgttc 3060 cagccaatct tcccaaactt ccttctctcc tgcaccacag catgaaagaa cgccagcacg 3120 tagaccagcg gcttgaaggc agggtgcggg cactggtcca gcatttcgtg agagatcttg 3180 aagtaagttg ccctcatgtt gagtttcagc ccattgggtg gctcggtgac aaccttcaga 3240 agagaagaaa acacacagtg atggagaccg caggcccaca agcttggaag agtgtgcaca 3300 atgcaggcac tgcgctttcc ttcctctcta cggagccatc tgtctgagac ctccaatagc 3360 taaagtctct tgtccagggc ccaggctgag aagttaaagg actagatgtt ctggctctca 3420 gggtcaaggg tcaagagtct ttgaccttcc tctgaaccat gagatccttt actatatata 3480 aaatgaaagt aaagtcttcc ctccacctcc ccagagagag agagagagag acagagaatg 3540 acaaaacctt tccaaaagaa tttcctggag aataaattaa atggggcatg taaaccacgc 3600 gaaactcctg 
cagtcagcac tagcgaactg caggctgtgt tggggggggg gccgtgaagt 3660 ttatgaaaag tagcgtagat attgttaggg acacatgcag aattggtaga gataggcctg 3720 ggagtcacaa acacccaacc caggacagcg gctccctcta aagagagagg tgggattgag 3780 aagggtttca accgcaaggt ttcacctgca gcctaaggtt tatttctaca cagacaaagg 3840 cacaccccgg ccccagagca gcatgcaggt gttctgagaa gctgcgtgct ggcatgttga 3900 tgtccatcct ga agc cag acc ttt agg gac ttc tgc aga atc cca atg ggg 3951 Ser Gln Thr Phe Arg Asp Phe Cys Arg Ile Pro Met Gly 1 5 10 aag ccc ttg gtg ggg tcc gtg gtg agc cac agg cgg aag tct ggg tgg 3999 Lys Pro Leu Val Gly Ser Val Val Ser His Arg Arg Lys Ser Gly Trp 15 20 25 ggc ttg gtg atc ctc tcc agg gac ttc tcc aga tct ttc agc cac ttg 4047 Gly Leu Val Ile Leu Ser Arg Asp Phe Ser Arg Ser Phe Ser His Leu 30 35 40 45 acc agg agg tgg cag ttc tgc agc atc agc cac tgc ccc cga gcc acc 4095 Thr Arg Arg Trp Gln Phe Cys Ser Ile Ser His Cys Pro Arg Ala Thr 50 55 60 gcc gtc tcc agc agc tgc agg gcc acc tgc gcc ggg gga gag gga aga 4143 Ala Val Ser Ser Ser Cys Arg Ala Thr Cys Ala Gly Gly Glu Gly Arg 65 70 75 gga gag ggg ttg gca tcc aaa ccc aac ccc gtg ctc tcc aga gac cca 4191 Gly Glu Gly Leu Ala Ser Lys Pro Asn Pro Val Leu Ser Arg Asp Pro 80 85 90 ctc cac acc cac aga agg gcg atg gca gct cca gca gaa gcc aaa agc 4239 Leu His Thr His Arg Arg Ala Met Ala Ala Pro Ala Glu Ala Lys Ser 95 100 105 aag ccg agg atc cag gag cta cag gtg gag cct tca aat aaa cca tct 4287 Lys Pro Arg Ile Gln Glu Leu Gln Val Glu Pro Ser Asn Lys Pro Ser 110 115 120 125 ttg aga aac ccc atg ggc ccc gtg ggt gca ctg agc tgg tca cac cca 4335 Leu Arg Asn Pro Met Gly Pro Val Gly Ala Leu Ser Trp Ser His Pro 130 135 140 gcc tgg aag atg ggt gag cat ttt gca ctc atc tgc ctc cct ggc cct 4383 Ala Trp Lys Met Gly Glu His Phe Ala Leu Ile Cys Leu Pro Gly Pro 145 150 155 gga agt ccc agc tcc tcc cac aca gtc gct cct gtc ccc tgaggactcc 4432 Gly Ser Pro Ser Ser Ser His Thr Val Ala Pro Val Pro 160 165 170 tgagcagggc tgcaagagac cagctaaagc ccaccaggaa gtcgggactg tcctggccac 4492 tgtaagatct tcagggccct 
aggcctggaa aacaatcacc atgtcccctc ccaaatgtaa 4552 ttcaaagcac aaacaagcta aacacccaaa tcaagccttg ctttctgaaa tggcttaaaa 4612 cacatgagtg ctggcgttac aatagcttat ttcagtgttg tggttgagag agctggcccg 4672 ccttgctggt gcccagggct tgcctcctgg gccatccagc atgggtattg ccatcctcat 4732 ctcacccagg ggctttcagc agtagagggg cttacccaaa gcctcacagt tcatagcaaa 4792 gccaggactg gcagtcagct ccctctgacc ccaacacgca tgcctgcaac catcacactt 4852 actgcagcct caaactcttg ggctcaagca atcctcccac ctcagcttcc ttagtagctt 4912 gactacaggt gtgcaccacc acgcctggct aattcttttt tattttttat tttttagagg 4972 cagtatctca ctatgttgcc caggctgggt ctcgaactcc tggcctcaag cgatcctccc 5032 acatcagctt ccagaagtgc tgggattaca ggtgtgagcc accgcacctg acccagatgt 5092 gcgttttaaa attcattgca aacacaaaga tattgggata tttcaggaaa aaatctgaag 5152 tcctagcttc tttcagagtc agaggatcca gccactgtgg gttcccaatg gcacctggta 5212 tcagtcagca ggagctaagg gcagctgccc agggaccagg accccaccca gcctcatcac 5272 tcactgccat ggcctgctgg ctacaagggc acatctgagc tgcagcccca cacgctccgc 5332 ctggcttcgt accccatgct agagctacat tgtccagtgc gatggccgtg aggcacagca 5392 cttaaaatgt ggctagcccc aattgagatg tgctgtaagt gtgaattcac acaggatttt 5452 gtagacttaa taccagaaaa aagcatgtaa aatgtctcaa tcatgttcat attgattata 5512 cattcaaatg ataattttaa tatattcatt taaataaaat gttattacaa tt 5564 <210> 6 <211> 5271 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (3)..(365) <400> 6 tg aga gaa gag ggg aca gaa gct gaa gcc tgc atg ccc acc tcc cct 47 Arg Glu Glu Gly Thr Glu Ala Glu Ala Cys Met Pro Thr Ser Pro 1 5 10 15 ctg gtg ctc gct aaa gac tcc agg tcc cct ggg cag gac aga gag agg 95 Leu Val Leu Ala Lys Asp Ser Arg Ser Pro Gly Gln Asp Arg Glu Arg 20 25 30 gtg gag aga agc cag gag gga cat aag cca gag gac agg aag ttt ctg 143 Val Glu Arg Ser Gln Glu Gly His Lys Pro Glu Asp Arg Lys Phe Leu 35 40 45 gtc cag gtc ccc agg tcc caa gcc cca gat ttt aag cct gaa ctc aaa 191 Val Gln Val Pro Arg Ser Gln Ala Pro Asp Phe Lys Pro Glu Leu Lys 50 55 60 gcc gca cct ctg tgt ctc tta cct gac aga ctg ggg agt ggg tgg gac 239 Ala Ala Pro Leu Cys Leu Leu Pro Asp 
Arg Leu Gly Ser Gly Trp Asp 65 70 75 gag ctc cag cct cct ccc tca ctg ccc caa gca gct gcc cct ttg gct 287 Glu Leu Gln Pro Pro Pro Ser Leu Pro Gln Ala Ala Ala Pro Leu Ala 80 85 90 95 tct gag ggt ttt gtt gcg agt ggc aag tgg gag cca cca gat gtt cca 335 Ser Glu Gly Phe Val Ala Ser Gly Lys Trp Glu Pro Pro Asp Val Pro 100 105 110 ccg cta aag agt gac aga ggt tcc agg ttc tgaccctcca actcctcatc 385 Pro Leu Lys Ser Asp Arg Gly Ser Arg Phe 115 120 ccaactctga agcttcattc aacttgacgg aagctcatct gggcccacgc ctcctctttc 445 cagacccact ttggtctctt cctctcttga gcatccccga ggcctgagaa gtcaggtcag 505 gtgaaacctc aggctcccca actcccacct cctgctcccc agccacctcc acccaagcca 565 agtcctgcac gatgtgggga gggaaatgga gagggggtgt ctcagcatgg attcccccac 625 gccttgcctc tctttaaaat ttttcattat attctattta atatatttta aagacctcca 685 tcaatgaatt tgaaagtttt aaaatataat acttttaaat ggcactctta gacccatcat 745 ctccccaaag tacttgaaac taatgagccc tgctgaggcc acctgcaggt gcttcccgtc 805 cactcctccg cccccaggta accgccttcc tgaattctgt ctttctaaaa gctgctttga 865 tctaattaca cagcagttct gtcctgaatg ttgtctttct catttctttg cttgtttttt 925 taaagttttg ctgtgtgttt ataaatctct agataattat ttaatttttt aaaaaatttt 985 tagaaagatc tcataggtag gctcttgctt cttgctttct tcatttatca ttatggtgct 1045 aaaatacatc cctgctgttg ttgttgctgc tgtttttgag actgagtttc gctgttgttg 1105 cccaggctgg agtgcaatag tgcggtcttg gctcaccgca acctccgctt cccaggttca 1165 agtgattctc ctgcctcagc ctccccgagt agctgggatt acaggcaggc gccaccatgc 1225 cctgctaatt ttgtattttg agtagagacg gggtttctcc atgtgggtca ggctggtctc 1285 gaactcccga cctcaggtga tctgcctgcc tcaacctccc aaagtgctgg gattttaggc 1345 gtgagccact gtgcctggcc catacctgtt gttttatgtc agtgtagttc attcattccc 1405 atagctgtat aatttttttt tttttttttt tgagacgggg tcttgctctg tcacccaggc 1465 tggagtgcaa tggcgcaatc atagctcact gcagcctcaa cctcctaggc tcaagagatc 1525 ctcccacctc agcctcctga gtagctggca gataggcatg tgccaccacc cctggataat 1585 tattattatt attattatta tttttagaga caagcctgtt gccctggctg gtcttgaact 1645 cctggattcg agcaatccct ctacctcagc ctctcaaagt gctgggatta caggcgtgag 1705 
ccatggcacc tggctcatat tcttttgtgt gaatataaaa cggtttattt aaccattcta 1765 cctctgatgg accatggagc tatatttggg gtggtggagg gtaatcataa acaatgctgc 1825 tgtcttagtc cctttgtgct aaatggactg agactggatg atttataaag aagaacagga 1885 acttatttct gacagttctg gaggctggga agtccaagat caaggcactg gaaggttctg 1945 ttgtctggtg agagctgctg ccttcttcca agatggcgcc ttggggctgt accctccgga 2005 ggggaccaat gctgcaccct cacgtggagg aagggatgga agggcaaaag agctgtggct 2065 gtgtgaagtc tcttttctaa agaccttaat cccattcaca agggaggagc cctcatgacc 2125 taatcacttc ctgaaggccc taccgctgta tactatcaca ttgataagtt tcaacatatg 2185 aaatttaaga tcatagcagc acgtctgaat gtacttatcc atgacttctg gaattgctct 2245 tggatttatg gctaggagtg aaattgctgg attgtggggc atgagatgaa ctggaaagtc 2305 caaacctcag ctgggaagaa agagcccccc acctggtcag ttacccagaa tcttccctct 2365 tgctaacttt atctccttct gttcagcttc acaaaaccat gctcaattag gttccaaggt 2425 gtgcctcggc tcccctcttg atcccattct ttggttgtaa ggttagcttc ctggttcttt 2485 ttgtttgtga ggggaagggg agacagagtc tcattctgtc acccagcctg gagtgcattg 2545 gtgggatctc agctcactgc aaacctctgc ctcccaggtt caagtgattc tcatgtctca 2605 gcctcctgag tacctggaat tacaggcgct tgccaccaca cctggctaat ttttgtattt 2665 gtagtagaga caaggtttca ctctattggc ccggctgatc ttgaactcct gacctcaaat 2725 gatccgccca ctttggcctc cccagatgct ggaattgcag gtgtgagcca ccgcacctgg 2785 ccctgattct taaacctact ccaggtttgc actttatcca cctggctttg aagaaaagag 2845 agaagggcaa gcagcccatg cccccataat caactcctgc caggggcatc ataagaattc 2905 caggctttag cacagtgcat ggagggaaga ctttcaggct gcagaggtgc ttcctaaacc 2965 tgccctgcct gtcctgccac ttcggagcac taacacttag tgtttccttg gtattatccc 3025 tcagatgctt ggaagaaatg ttcccatgag aaggctgtgt aggatagtga ttaagactgc 3085 agtctccgga gtgttctgtc tgggttcaaa tcctagcccc attcctttgc taggtgagag 3145 actctagaca agtcccccta ccttccctgg cctcaatttc catgagctgg gtgtgaggaa 3205 gttcactgag tcactgtgag gattaagtta gggtaataca agtcaaacac ctagaacagt 3265 acctggcata aacattcaat aaatgctagg gatcattttt aagtagcata ataaatgaag 3325 cgcacccaga aaaatgtgac caaagcaagc cctaaatgaa catgagctat tggtatgatt 3385 gttatcgtta 
ccagaaaaac ccctattcat ccttcaagat ccaaccaaga tatccttccc 3445 tgtctctcta ttctgacggc atttggtgca gagctcataa tctcatctat caggttgtgt 3505 tttaatctac ctgtttgtgt gtttgttatt gttgttgctc tttcttcttc ctcagctaga 3565 ttgtatcttt aagaacgggg actatttttc tctgactaat cctcaccctg acgtccttga 3625 tggccagcac agcgtagatg ctccattgat gaaatgaaag acattggggc ttgtatgtct 3685 tagaggcacc tcaaggctct ggaggtgcct ctaagtttgc agagcaggca ataaagggaa 3745 accagcctgg ctgagtttct tcttttctct gggttgcaat aattctgggg gaggaagagc 3805 caaatggtct ctgtcctgcc ttctgtcctc tccagtacct tgcctgtccc tctaggagct 3865 ccagccagat gccaagatag gtgccaaccc acccccaccc cgctattttt tctgaccctc 3925 cctaccacct ggcactgcag aaaactgact tccttcacat tgtgttgttc tcatgggctg 3985 cttcagacca cgaatccctt ttcattctta atagagtcca ggagaggtct tgcacggtga 4045 ttgccgattc aagtacgctc ctctgcagag gtcgattatt gcttttattt aaataatagc 4105 cattgaaatg attagattgc tgggcgctgg aaagtctggg gtgggggatc taatcacaca 4165 cagaataact ggcaaatcat ctcatttcaa ccttttctcc ctaatcacga gctgctcctc 4225 cctttttgag catttgggtg cccctgtcca cctctccaga atctatgtct gtgggaatac 4285 aggtgcaaga aaggagagac atttcgcaag cacattaaat atctctcaaa catgaaccca 4345 agggtgaagg ctctcctgat gctcagagaa gattatgctg cacactgcct gctgaaatca 4405 ctctagcctt cgtggcaggt gatagaggcg ctttgggcgt gatcttctga actgtgctaa 4465 gtaagggctg ccatttggcg ataagctgcc ctcagaagtc caacactcat ccaggcagaa 4525 agagcgtggt catttagcct ggatgctgcc tcttgcaaac tcatctcttg tccacgatgc 4585 acctcaatag tccttggtgg agacccttca aatctatgca gggggtcata tatgtaagtg 4645 ttcatactgt aaacagaagg gtaatttatt catatctatt gaatgcatgc ctaccatgtg 4705 ccagatacta ttaggaatgt gaagtgaata tagcaatgcc aggatgccag ataccaaagg 4765 ccataatcac tttcatatag atagttgatg atgaagaaag agagagagat aagtggaaag 4825 atgataggta gatagggaaa taatttaatt ctgatagtgt aagcatgcac acatctgggg 4885 aagaacattc taagcatcaa caacaagtgt aaaggcttca ggtgcaatac atttggcaca 4945 tttgtggcac catatgtact tttgcaggca attctgttag aaatggagcc ataagcggcc 5005 gggtgtggtg gctcacgctt gtaatcccag cactttggga ggccaaggcg ggtggatcat 5065 gaggtcagga gatcgagaac 
acgttgaaac cccgtctcta ctaaaaatgc aaaaagttag 5125 ccaggcgtgg tggtgggtgc ctatggtccc agctactcgg agaggctgag gcgggagaat 5185 ggtgtgagcc cgggaggcgg agcttgcagt gagccgagat cgccccactg cactccagcc 5245 tgggcgacag agcaagactc cgtctc 5271 <210> 7 <211> 5354 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (3396)..(3626) <400> 7 gttccgggct ccgggctctg ggtggcggcg gctgtgagcg gcggcactgc ggcgcaggcc 60 agcgggcgcc gtcggcggct ggccctgtcg gccgcgggat gaggaagcgg accgagcccg 120 tcgccttgga gcatgagcgc tgcgccgccg cgggctcgtc ctcctccggc tcggccgccg 180 cggcgctgga cgccgactgc cgcctgaagc agaacctacg cctgacgggc ccggcggcgg 240 ctgagccgcg ctgcgcagcc gacgcgggaa tgaagcgggc gctgggcagg cgaaagggcg 300 tgtggttgcg cctgaggaag atacttttct gtgttttggg gttgtacatt gccattccat 360 ttctcatcaa actatgtcct ggaatacagg ccaaactgat tttcttgaat ttcgtaagag 420 ttccctattt cattgatttg aaaaaaccac aggatcaagg tttgaatcac acgtgtaact 480 actacctgca gccagaggaa gacgtgacca ttggagtctg gtgagtgcca gtgagctctg 540 cagcccctcc agcagcagga aaggacccta catgactgag ctggtgctgt ctttgacatt 600 ggcctctgaa atgttgttca tttttatctt cacctcttcc tttcctaaag gaaagatctc 660 agtacatttg atccatccta atgttgaggg attcaatggc tttaaactct attcccagcc 720 tttcaaaaca gcaaaggctt tgtgaacaga ttattatttt tagagagact tgtttctagg 780 aaaaacgaaa agcaaacaaa gaagctagaa acaggatata agtggccggc acagtggctc 840 acacctgtaa tcccagcact ttgggaggct gaggcaagtg gatcacttgg gtccaggagt 900 ttgagaccag ccttggcgac aaggtgaagc cccatctcta ctgaaaatac aaaaattggc 960 caggcatggt ggtgcacacc tgtaatcccg gctacttggg aggctgaggc aggaggatca 1020 cttgactcca ggaggcagaa gttgcagtga gctgagttca cgccactgca ctccagcctc 1080 caggagtttg agaccagcct tggcgacaag gtgaagcccc atctctactg aaaatacaaa 1140 aattggccag gcatggtggt gcacacctgt aatcccggct acttgggagg ctgaggcagg 1200 aggatcactt gactccagga ggcagaagtt gcagtgagct gagttcacgc cactgcactc 1260 cagcctggat gacagagcgg gactctgtgt ccaaaaaaaa gagaatataa gggaagaatc 1320 acaaacactg gtttgaatat atcttccatc caggcgcggt ggctcacgcc tgtaatccca 1380 gcactttggg aggccgagac gggtggatca cttgagatca ggagttcgag accaccctaa 
1440 acaacatggt gaaaccccgt ctctactgaa aaaaaaatac aaaaattagc tgggcatggt 1500 ggcacgcgcc tgtagtccca gctacccggg agtttgaggc aagagaattg cttgaacgca 1560 ggaggcagag gctgcagtga gctgagatta tgccactgca ctccagcctg ggcaacagag 1620 cgagactctt gtctcaaaaa aaaaaaaaaa tatatatata tatatatata tcttttccaa 1680 gaggtgaaca tctagatact gaaaacagct tgatttcttt tcacatatag cttcgcacca 1740 gctgatgatg atggtgttgg ctgtcttctg agtatttact tttttaagcc ataagtttac 1800 actggacata agccttgtga cttctctgtg tctccttgga atacatttta atgttttctt 1860 ctgtacacaa gtggtgcttc tgaacagctg aattaaaatt gaccaacctc actgtagcat 1920 ttccgttttt ctctgagaag tgagcctgaa tcacagccct tttatctagc tgaatttatt 1980 tttcagtagc cataacgtcg ctttccactg tctcagaaaa ccactgtgta tgtgagaagc 2040 ttctaaatga aatgctgtga gtagcttccc agtgtggatg agagaacatt cagacatttc 2100 taacagtggt ccttggtaaa gaacataaaa caagaaggga gatttttgtt cttagtagtt 2160 ggtgctggga ttaaagtctt ttgtcacttc ctgatttgat ttaatctagc atttactgac 2220 caatgtgtga ctgtgggact cctcaggttt ctgtctgaag catgacttgt tttgcacaga 2280 gctgctgagt gggacttggt cagccctctt cttggttaac atgagtctgt cctattgcag 2340 gagggtgttt atggtggtga gacttaggct ggaagtacta acaactcctg catacattat 2400 cagctaccag aaaaggtttt gtttagtgat gtcagtggca agtgtcaagg cagtgacagc 2460 cacgctgtgt gttgctgtgt gttggtgggt gcgtgttggt gagtgcggct gctgcccact 2520 ctcatgccct aggcagtcac aatgacattg ggccctcagg cagcagcagg cccagctcac 2580 caactgcccc caccatcgct tcatcagacc ctcccaccag cttcctgaaa ggagggcctc 2640 aaggcctaga gtggagcctg ggggccattc ctggccatgc ctggccatcg cctgtctgct 2700 gagaggagag gcccaggaat gggtttctct gctggctagg agttccctgg ctctttgtca 2760 ggagcccagg tttggggcgg ctggctactg gaagcacatc tgctcccagc accttcaaaa 2820 tgtgacgtcc tatccctcaa atttataact gtatgggagt gatgatcata gtgactagaa 2880 ccatcccaaa agtgatcctt ttgaacggat cagagagaga cgttagaaat gagatgctga 2940 agcttaagat ctagaaaata agaagttggc cattaaattt ttttaaagat tcttgtcact 3000 ggtatccaga atgaatttga ataagcttta cagaaaagtg attaatttgg accctcccca 3060 gtgattaaca gtattcgatg tggcctgagt taaaattaac ctcagtatct ctctgaagga 3120 
gggtttggct agatagacca gcagcttcag agtgcccagc caggtgtgga ccatgagcag 3180 actgggaagg tggtgcatac ttgacattag aaatggcgtc agcgatggcc cggtttctag 3240 tctggctcac aaacattggt gttgctgaat tagcaaatgt gacagcatgt actgtttaaa 3300 atccattgtg agttaaactg tggacaattt aagcatgggg ttaactggga tgtggggcac 3360 gtatgaggaa aatgctatta ctcaagatgg actaa gtg ctg ctt ctc tgc cag 3413 Val Leu Leu Leu Cys Gln 1 5 gcc cca gtg gta act ggc aca ttg caa gcc ttt gtg gcc cag agg agg 3461 Ala Pro Val Val Thr Gly Thr Leu Gln Ala Phe Val Ala Gln Arg Arg 10 15 20 ctg gcc tgc cag tca ggg aac gct cct gct gaa gct gtc agt gcc atc 3509 Leu Ala Cys Gln Ser Gly Asn Ala Pro Ala Glu Ala Val Ser Ala Ile 25 30 35 cac aca gcc tgc agt tcc gtg gtt att gat tct agt gat ggg gat agt 3557 His Thr Ala Cys Ser Ser Val Val Ile Asp Ser Ser Asp Gly Asp Ser 40 45 50 ttt cta cca ccc acc ccc atc tgt tcc cct gta gtt aga tgg ggt ctg 3605 Phe Leu Pro Pro Thr Pro Ile Cys Ser Pro Val Val Arg Trp Gly Leu 55 60 65 70 cag ctc cct gct gtc tgg act tgagctgatt aagtaacacc acgtgcagtc 3656 Gln Leu Pro Ala Val Trp Thr 75 gccaggatga ggacagaagg gacatctgat cctgcgctcc cctctgcttc tgtctgcagg 3716 cacaccgtcc ctgcagtctg gtggaagaac gcccaaggca aagaccagat gtggtatgag 3776 gatgccttgg cttccagcca ccctatcatt ctgtacctgc atgggaacgc aggtaccagg 3836 tgagggagag gagcccatca tctttgctgt ggagcatggc tcctttttgg ggtgggaatg 3896 tcctggcccg gtctgccgga aaccagcagg agcaacctgc ggtcccacaa aattgggtgt 3956 actgacccac tgcaatgagg gagagcctcc caagaacgaa gggaaaggca gagttctgaa 4016 ggggttgagg gaaggcggag tttaggtgac gtttacatga agcagggttc ccacaggctg 4076 aggccagcac ggctgtgttc cgtggggacc aagtgtggga agctgtgtgc agaccgtccc 4136 ctatctggag ctgcagcctg gattgggatg aaggaggctg agtctcagtc aggagctgag 4196 ctccccagcc agaggggcat gtttttttct ccttgttgta atctcaaagg tcacagcatc 4256 tgctggtatg tgatttgggg agcagggttt cttggagagt gagggctttg gggtccctcc 4316 gaggaggggt catagtggca cttggctgtg ggtgcctcgc gctcttggga atgggcggct 4376 tcagctcact ggcagcttgt tttctctata tctgtcatgc tgcctgaagt ggcaggcaga 4436 tctttgttac ctgtaagata 
gctccaggtg gcagacaggt gaagccggag ccagggcagc 4496 cctggaggag gtggcacact tcctggaaaa aagtggcagg tgctgagaag gggcaggagg 4556 gagaggctga aagcaccagg tgctggggtc tttgcaggcc accctacagt gtgagctctc 4616 tgctcaccct aacgggctgt tgctgaagtt catgcactct ttggttaatc agccctgccc 4676 agttttatct cagatgcttt tcctgagtgg gtctaatggc ttgtcctttc cactggctgc 4736 cctggagttg gcctccctgc cttaaaaggg tgctccttcc acaccatgtg attgctgttc 4796 agaagagcct ctgtgggaga acccagggtg ggtgcggagg ggatacattt tctgaacaaa 4856 gcattgtttg aaaaccacca aggtcttcgt gtgtgagaga ggcagccagg agccagcaca 4916 gaggcagacc cagcacggag acagggctgc tgcggggcca gggccgaggt cccgtgtggc 4976 accagggtat gtcagggctg ggggcaaaca ttttataggc agagccagca aggagagctg 5036 tgctgggccc ttctggaagg gaaagcaggc ctaggtgggg tctgcatggt ggaaccctgc 5096 agagagcatt gctgaaagag cctggcggca tggcccgtga agcggtttct gtgtcaggga 5156 tggcgtgtct agctacagaa tcatgcatgt gctgcacagt ttcctttctt ctttgtctgg 5216 gacaagctgc agagggtttc tcttctgctt ttcaaagagc aagtggctac aaacagctct 5276 tgcagcatct ttttataaat gacagaactg tggttggctg tgaataaggt tcagttctgc 5336 agtgcccatc acatggtt 5354 <210> 8 <211> 5175 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (3)..(2039) <400> 8 cc cct ctg cct tgg aaa acc aga aga atg cca ata tat tca tca gtt 47 Pro Leu Pro Trp Lys Thr Arg Arg Met Pro Ile Tyr Ser Ser Val 1 5 10 15 aca ctt tca ctg ttc ccg caa agc caa gca aat ata tgg cta ttt gaa 95 Thr Leu Ser Leu Phe Pro Gln Ser Gln Ala Asn Ile Trp Leu Phe Glu 20 25 30 gac act gtt tta att act gga aaa tta gct gat gcc aag tct caa cca 143 Asp Thr Val Leu Ile Thr Gly Lys Leu Ala Asp Ala Lys Ser Gln Pro 35 40 45 agt gtt cag ttt tca aaa gcc tta att aaa ctt cct gac aac cat cat 191 Ser Val Gln Phe Ser Lys Ala Leu Ile Lys Leu Pro Asp Asn His His 50 55 60 att agc aac gtt act ggc tat ctt aca gtt cta caa cag ttt ttg aaa 239 Ile Ser Asn Val Thr Gly Tyr Leu Thr Val Leu Gln Gln Phe Leu Lys 65 70 75 gtg gac aat ttt ctg cat aca act gga att act ctc aat aaa cca ggt 287 Val Asp Asn Phe Leu His Thr Thr Gly Ile Thr Leu Asn Lys Pro Gly 80 85 90 
95 ttt gaa aac att gaa ttg act cct ctt gct gca ata tgt gtg aaa ata 335 Phe Glu Asn Ile Glu Leu Thr Pro Leu Ala Ala Ile Cys Val Lys Ile 100 105 110 tat tct gga gga aaa gaa cta aag gtc aat ggc tct att caa gtt tct 383 Tyr Ser Gly Gly Lys Glu Leu Lys Val Asn Gly Ser Ile Gln Val Ser 115 120 125 ctt cct ctt cta cgt ctg aat gat ata agt gca ggg gat cgc ata cct 431 Leu Pro Leu Leu Arg Leu Asn Asp Ile Ser Ala Gly Asp Arg Ile Pro 130 135 140 gct tgg aca ttt gat atg aac aca ggt gct tgg gta aat cat ggt cgg 479 Ala Trp Thr Phe Asp Met Asn Thr Gly Ala Trp Val Asn His Gly Arg 145 150 155 gga atg gtc aag gaa cat aac aat cat tta atc tgg aca tat gat gca 527 Gly Met Val Lys Glu His Asn Asn His Leu Ile Trp Thr Tyr Asp Ala 160 165 170 175 cca cat ttg ggg tac tgg ata gca gct cca ctt cca gga act aga ggt 575 Pro His Leu Gly Tyr Trp Ile Ala Ala Pro Leu Pro Gly Thr Arg Gly 180 185 190 tca ggt ata aat gaa gat tcc aag gac ata act gcc tac cac aca gtg 623 Ser Gly Ile Asn Glu Asp Ser Lys Asp Ile Thr Ala Tyr His Thr Val 195 200 205 ttt ctt aca gcc ata tta gga gga aca ata gtc att gtc att gga ttt 671 Phe Leu Thr Ala Ile Leu Gly Gly Thr Ile Val Ile Val Ile Gly Phe 210 215 220 ttt gct gta cta ctt tgt tat tgc agg gac aag tgt ggt act cca cag 719 Phe Ala Val Leu Leu Cys Tyr Cys Arg Asp Lys Cys Gly Thr Pro Gln 225 230 235 aaa aga gaa aga aat atc act aaa ctt gag gtc ctc aag aga gac cag 767 Lys Arg Glu Arg Asn Ile Thr Lys Leu Glu Val Leu Lys Arg Asp Gln 240 245 250 255 aca act tca aca aca cac ata aat cat atc agt aca gtt aaa gtt gca 815 Thr Thr Ser Thr Thr His Ile Asn His Ile Ser Thr Val Lys Val Ala 260 265 270 tta aaa gct gag gac aag tcg cag tta ttc aat gcc aaa aac tcc tca 863 Leu Lys Ala Glu Asp Lys Ser Gln Leu Phe Asn Ala Lys Asn Ser Ser 275 280 285 tat agt cct cag aaa aag gaa cca tca aag gca gaa aca gaa gaa aga 911 Tyr Ser Pro Gln Lys Lys Glu Pro Ser Lys Ala Glu Thr Glu Glu Arg 290 295 300 gtt tcc atg gta aaa act cgg gac gat ttt aaa atc tac aat gaa gat 959 Val Ser Met Val Lys Thr Arg Asp Asp Phe Lys Ile 
Tyr Asn Glu Asp 305 310 315 gtt tca ttt cta tca gtc aat caa aat aat tac tca aga aac cca aca 1007 Val Ser Phe Leu Ser Val Asn Gln Asn Asn Tyr Ser Arg Asn Pro Thr 320 325 330 335 cag tct ttg gag ccc aat gta ggg tcc aaa caa cct aaa cat att aac 1055 Gln Ser Leu Glu Pro Asn Val Gly Ser Lys Gln Pro Lys His Ile Asn 340 345 350 aac aat cta tct tca tct cta ggt gat gct caa gat gaa aag agg tat 1103 Asn Asn Leu Ser Ser Ser Leu Gly Asp Ala Gln Asp Glu Lys Arg Tyr 355 360 365 ctc aca ggt aat gag gag gcg tat ggg cgt tcc cat att cct gaa cag 1151 Leu Thr Gly Asn Glu Glu Ala Tyr Gly Arg Ser His Ile Pro Glu Gln 370 375 380 ctt atg cat att tac agc caa ccc att gcc atc ctt caa aca tct gac 1199 Leu Met His Ile Tyr Ser Gln Pro Ile Ala Ile Leu Gln Thr Ser Asp 385 390 395 ctt ttc tcc aca ccg gaa caa tta cat act gct aag tca gct act ttg 1247 Leu Phe Ser Thr Pro Glu Gln Leu His Thr Ala Lys Ser Ala Thr Leu 400 405 410 415 cca aga aag gga cag tta gtc tat ggc caa ttg atg gaa cca gta aat 1295 Pro Arg Lys Gly Gln Leu Val Tyr Gly Gln Leu Met Glu Pro Val Asn 420 425 430 cga gag aac ttt acg cag acc ttg ccc aaa atg cca att cat tct cat 1343 Arg Glu Asn Phe Thr Gln Thr Leu Pro Lys Met Pro Ile His Ser His 435 440 445 gca cag ccc cca gat gcc agg gaa gag gat atc ata ctt gaa ggt caa 1391 Ala Gln Pro Pro Asp Ala Arg Glu Glu Asp Ile Ile Leu Glu Gly Gln 450 455 460 cag agc ctg cca tcc cag gct tca gat tgg agc cga tac tca agc agc 1439 Gln Ser Leu Pro Ser Gln Ala Ser Asp Trp Ser Arg Tyr Ser Ser Ser 465 470 475 tta ctg gaa tcc gtc tct gtt cct gga aca cta aat gag gct gtt gta 1487 Leu Leu Glu Ser Val Ser Val Pro Gly Thr Leu Asn Glu Ala Val Val 480 485 490 495 atg act cca ttt tca tcg gaa ctt caa gga att tca gaa cag acc ctc 1535 Met Thr Pro Phe Ser Ser Glu Leu Gln Gly Ile Ser Glu Gln Thr Leu 500 505 510 ctg gag ctg tcc aaa gga aag ccc tcc ccg cat ccc aga gcc tgg ttt 1583 Leu Glu Leu Ser Lys Gly Lys Pro Ser Pro His Pro Arg Ala Trp Phe 515 520 525 gtg tct ctt gat gga aag cca gtt gca caa gtg agg cac tcc ttt ata 1631 Val 
Ser Leu Asp Gly Lys Pro Val Ala Gln Val Arg His Ser Phe Ile 530 535 540 gac ctg aaa aag ggc aag aga acc cag agc aat gac acc agt ctg gac 1679 Asp Leu Lys Lys Gly Lys Arg Thr Gln Ser Asn Asp Thr Ser Leu Asp 545 550 555 tct ggg gtg gac atg aat gag ctt cac tca agt aga aag ctc gag agg 1727 Ser Gly Val Asp Met Asn Glu Leu His Ser Ser Arg Lys Leu Glu Arg 560 565 570 575 gag aaa aca ttc atc aaa agc atg cat cag ccc aag atc ctt tac tta 1775 Glu Lys Thr Phe Ile Lys Ser Met His Gln Pro Lys Ile Leu Tyr Leu 580 585 590 gaa gat tta gac cta agc agc agt gag agt gga acc acc gtc tgt tcc 1823 Glu Asp Leu Asp Leu Ser Ser Ser Glu Ser Gly Thr Thr Val Cys Ser 595 600 605 cct gag gac cca gct tta agg cac atc cta gat gga ggg agt gga gtg 1871 Pro Glu Asp Pro Ala Leu Arg His Ile Leu Asp Gly Gly Ser Gly Val 610 615 620 atc atg gag cac cct gga gaa gag tcg cca gga agg aaa agc act gtt 1919 Ile Met Glu His Pro Gly Glu Glu Ser Pro Gly Arg Lys Ser Thr Val 625 630 635 gaa gat ttt gaa gct aat aca tcc ccc act aaa aga agg ggc aga cca 1967 Glu Asp Phe Glu Ala Asn Thr Ser Pro Thr Lys Arg Arg Gly Arg Pro 640 645 650 655 cca cta gcc aaa aga gat agc aag act aac atc tgg aag aag cga gag 2015 Pro Leu Ala Lys Arg Asp Ser Lys Thr Asn Ile Trp Lys Lys Arg Glu 660 665 670 gaa cgc cca ctg att ccc ata aat taactccaat ggggattgtg tgtctgctgt 2069 Glu Arg Pro Leu Ile Pro Ile Asn 675 ctcgtgctgt ttattcttgc ttcttgttgt aaattgcagt acgaacttaa gaaaatgaga 2129 ctgagcaatc tcatggttct tggacatgtc tcaagcagag taaatggtaa ttcagtaatc 2189 agagagaaag ataccaagga atgctttttc tggcctattc atttattttt gggtgatgaa 2249 tttacagtat ctaagttttc aaaatgtaaa atagcttcaa gatgttagtt atctgaaaat 2309 gttgctcagc cagccagttt ggccttgact ctcttaagaa taacagtgaa atatatactc 2369 ctcaagttgc ctccaaaaat gttgcctcta ccatggtgac taccccatgg aacatttaga 2429 aacaaaactg acttcaggca tcatattatt ttaaatgtta ctattacgtc ttcttctgcc 2489 tatacttaaa aataacttga taaatgactt ggactgatgt tactctggag ttatcacaaa 2549 gaaaatgttg tttggtcttt aaagagcatg tgtattgtat catcccaaac gtaaatccta 2609 catttatata 
agatgggcaa gaagctactt ggtcattaga gagggagaca ccagctcttt 2669 ggttgttttt ggatataact ttacaaaata agtaagatgt taatttagaa atttgagaaa 2729 ttaatgctct aatactgagt ttttatttaa aaattatttt ttcttcccct caacaatgaa 2789 gcaagcttag ctgtcaaggg aaacttttta caaatctgaa aaaaacaatc tatgactttg 2849 gtttaaggct cactgatact tttaggctaa attggtttta atatatttct tctattctaa 2909 aaacctgaac tcagtcactt aaaggctatg aaatttaaaa aaaaagtcga tgtgaaagtt 2969 tcttttgaac actaaaataa aatatgtgca gataaaatat acattgattt gtttttctta 3029 aatgttgatg agaagaaaaa gagatgccat tttcctgagg ctcaaaaata ccttcaggat 3089 agttgtatat ccagttattg attttcttaa aagatgtgta aggaaaacag tttcaatttc 3149 aggggaaaag taaaagtttt tccctaagtc acttaaagcc tttgcaactt cttttttcag 3209 ttttgtaagt aatatatcta tgttcttttc attatagcaa gcattcaatg tgaacaactt 3269 tttaattaac tctgaattac cattcataca tcctaaaaat aaaagctcgt tattcattaa 3329 aatcaactga tcccattttt cttaaaattt ccctgaaggc aaatgtctga agcacctttc 3389 ccttgtgggg gtaaaaatcc taaattgctt tatttttcat tccctcctat tcaacatggg 3449 agcagcatag agacccaaac catgtaaaca agttcagtga accaaaacag ccacattagc 3509 ttcagtaaaa ttatagctag atgtgcaatt ttttcctcca acttctaacg tgtcaaataa 3569 ccttcctact gttctgtgtt aactgaaaga acataaagac cctaggcaaa tatttgctat 3629 atattacccc aatccataga agaaataatg ttttgggtaa tacctaggct tccttttttt 3689 tttttttttt ttttttttag tgataaggct cataacaatt agagaaggct tcttattggt 3749 cttacacaga aagatacatc aaaagcagca tgactcaaaa tgatttggaa aaggttaaag 3809 ttagtgctct gctgaagtgc ctttgatata gacttgcatt attagaagga tataacatct 3869 tttttaagtg tgcattttct ttcagttaac caaattaaac agatgtgcag ttttattaaa 3929 aatatagacc tagtgtttca tgttggaaca ataaatattg catgtgagta gtatttcttg 3989 ttttttgaat acagtatata ttgataaatt gtttatgttg gaatgaagtt agaaactata 4049 tagcaaaaca ttatatttta agtgtttatt tttcccacct ttaaataaaa atgtttcatc 4109 tcagcttggt aatgaaatac acatattggt ataagggtat accattcagg tatgccactt 4169 attttattca tttttgtgta agggaaatga gatgatgtat cccaagggct tttctagaac 4229 tacttgtttg ctttcagaat aaaaccttat tattttttac actgcacatg ctgttctcaa 4289 ttggtaatta taggcaattt 
atcttttcta atgatcaaaa gagtgtgact tctcatttgt 4349 gagtagttca caaatttcct gttaaaaagc tgaaaccatc tactttttct taacccaagt 4409 gataataaac aatattcaca actttcttaa atttttaaat tgaaaaccaa ggttttttca 4469 aatataaacc tagatgattt tggtcacaaa ttgttaacat ttgtcgatcc tttgtatata 4529 ctttggatat atattaaagg caaaactatc tcttgactaa ctgatggatt catttactaa 4589 agcacagctg tatgtatttt tgaatacata ttatgatctt gagactttat aaatcaattt 4649 ttatgacttt atgcagttgt atagggatta tgccctttca gttctatagg gattatgccc 4709 ttttataata cataatatac cacagagatt acaaatgttg aggaatgaaa gcacttcttt 4769 gctttggcaa tcattttcag accactatgt gtttgaatcc tctggtatca atacgtatta 4829 tagggtttta gagatctgtg ggtcaaatga tgtccctcaa aacttcctaa aaaggtgaag 4889 ctcaaagtca cacattcatt tataaggcgc atgagtttct cattttccca tgtacgagca 4949 ttgtaaagga attcagctgt attaatttct atttcagatc tagaattgac attttgcctt 5009 cttgtttcca ggtgtttcta ttttttgtat tctttcagag aaatctcata tttcggtgta 5069 tttattgctg ttactactat atttactgct gaaaactgta acaacctgaa gatttgtaaa 5129 atgttaaaca tagttcatta aaaataataa aataaatcta aaatgt 5175 <210> 9 <211> 5700 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (1)..(1824) <400> 9 gat gag gag gca cct tcc aag caa tgt gtt tct gta gga gtg tca cag 48 Asp Glu Glu Ala Pro Ser Lys Gln Cys Val Ser Val Gly Val Ser Gln 1 5 10 15 gtc aca act tta aag cca gct ttg tcc acc cag aag gcc cag ccc tgt 96 Val Thr Thr Leu Lys Pro Ala Leu Ser Thr Gln Lys Ala Gln Pro Cys 20 25 30 gag aca tgt agc tca ctt ctg aag gac att cta cac ctg gct gag cat 144 Glu Thr Cys Ser Ser Leu Leu Lys Asp Ile Leu His Leu Ala Glu His 35 40 45 gac gga aca cac ccc aag cgt aca gcc aag ctt tac ctg cac caa aag 192 Asp Gly Thr His Pro Lys Arg Thr Ala Lys Leu Tyr Leu His Gln Lys 50 55 60 gag cat ctt aga gag aag ctc acc aga agt gat gaa ggg agg cct tcg 240 Glu His Leu Arg Glu Lys Leu Thr Arg Ser Asp Glu Gly Arg Pro Ser 65 70 75 80 ttt gtg aat gac agt gtt cac ctg gca aag agg aac ctc aca tgc atg 288 Phe Val Asn Asp Ser Val His Leu Ala Lys Arg Asn Leu Thr Cys Met 85 90 95 cag ggt ggc aag gat ttt act ggt 
gat tca gat ctt caa caa cag gct 336 Gln Gly Gly Lys Asp Phe Thr Gly Asp Ser Asp Leu Gln Gln Gln Ala 100 105 110 ctt cac agt ggg tgg aag cca cac agg gac act cat ggt gtg gag gcc 384 Leu His Ser Gly Trp Lys Pro His Arg Asp Thr His Gly Val Glu Ala 115 120 125 ttt caa agt gga cag aat aat tac agc tgc acc caa tgt ggg aaa gac 432 Phe Gln Ser Gly Gln Asn Asn Tyr Ser Cys Thr Gln Cys Gly Lys Asp 130 135 140 ttt tgc cac caa cat aca ctg ttt gag cac cag aaa atc cac aca gag 480 Phe Cys His Gln His Thr Leu Phe Glu His Gln Lys Ile His Thr Glu 145 150 155 160 gaa agg cct tat gag tgc agt gaa tgt ggc aaa ttg ttt agg tac aac 528 Glu Arg Pro Tyr Glu Cys Ser Glu Cys Gly Lys Leu Phe Arg Tyr Asn 165 170 175 tct gac ctt att aaa cat cag cga aat cat act gga gaa agg cct tat 576 Ser Asp Leu Ile Lys His Gln Arg Asn His Thr Gly Glu Arg Pro Tyr 180 185 190 aag tgt agt gaa tgt gga aaa gcc ttc agc ctc aaa tac aat gtt gtt 624 Lys Cys Ser Glu Cys Gly Lys Ala Phe Ser Leu Lys Tyr Asn Val Val 195 200 205 caa cac cag aaa att cac act gga gaa agg cct tat gag tgc agt gaa 672 Gln His Gln Lys Ile His Thr Gly Glu Arg Pro Tyr Glu Cys Ser Glu 210 215 220 tgt ggg aaa gct ttt ctt aga aag tct cac cta ctt cag cac cag agg 720 Cys Gly Lys Ala Phe Leu Arg Lys Ser His Leu Leu Gln His Gln Arg 225 230 235 240 att cac acc agg cca agg cct tat gtg tgt agt gaa tgt ggg aag gcc 768 Ile His Thr Arg Pro Arg Pro Tyr Val Cys Ser Glu Cys Gly Lys Ala 245 250 255 ttc ctt aca cag gct cac ctt gtt ggt cac cag aaa att cat act gga 816 Phe Leu Thr Gln Ala His Leu Val Gly His Gln Lys Ile His Thr Gly 260 265 270 gaa cgg cct tat gga tgc aat gaa tgt ggg aaa tac ttt atg tac agt 864 Glu Arg Pro Tyr Gly Cys Asn Glu Cys Gly Lys Tyr Phe Met Tyr Ser 275 280 285 tca gca ctc att aga cat cag aaa gtt cac act gga gaa agg cct ttt 912 Ser Ala Leu Ile Arg His Gln Lys Val His Thr Gly Glu Arg Pro Phe 290 295 300 tat tgc tgt gaa tgt ggg aaa ttc ttt atg gac agc tgc aca ctc att 960 Tyr Cys Cys Glu Cys Gly Lys Phe Phe Met Asp Ser Cys Thr Leu Ile 305 310 315 320 att 
cac cag aga gtt cat act gga gaa aaa cct tat gaa tgc aac gaa 1008 Ile His Gln Arg Val His Thr Gly Glu Lys Pro Tyr Glu Cys Asn Glu 325 330 335 tgt ggg aaa ttc ttt aga tac cgt tcc aca ctc att aga cat cag aaa 1056 Cys Gly Lys Phe Phe Arg Tyr Arg Ser Thr Leu Ile Arg His Gln Lys 340 345 350 gtt cac act gga gaa aag cct tat gag tgt agt gaa tgt ggg aag ttc 1104 Val His Thr Gly Glu Lys Pro Tyr Glu Cys Ser Glu Cys Gly Lys Phe 355 360 365 ttt atg gac act tcc aca ctc att att cat cag aga gtt cat act gga 1152 Phe Met Asp Thr Ser Thr Leu Ile Ile His Gln Arg Val His Thr Gly 370 375 380 gaa aag cct tat gaa tgc aac aaa tgt ggg aaa ttc ttt agg tat tgc 1200 Glu Lys Pro Tyr Glu Cys Asn Lys Cys Gly Lys Phe Phe Arg Tyr Cys 385 390 395 400 ttc aca ctg aat aga cat cag aga gtt cac tct gga gag agg cct tat 1248 Phe Thr Leu Asn Arg His Gln Arg Val His Ser Gly Glu Arg Pro Tyr 405 410 415 gaa tgc agt gaa tgt ggc aaa ttc ttt gtg gac agc tgt aca ctg aag 1296 Glu Cys Ser Glu Cys Gly Lys Phe Phe Val Asp Ser Cys Thr Leu Lys 420 425 430 agt cat cag aga gtt cac act gga gaa aga cct ttt gaa tgc agc att 1344 Ser His Gln Arg Val His Thr Gly Glu Arg Pro Phe Glu Cys Ser Ile 435 440 445 tgt ggg aaa tcc ttt aga tgt cgc tcc aca ctt gat aca cat cag aga 1392 Cys Gly Lys Ser Phe Arg Cys Arg Ser Thr Leu Asp Thr His Gln Arg 450 455 460 att cac act ggt gaa agg cct tat gag tgt agt gaa tgt ggg aaa ttc 1440 Ile His Thr Gly Glu Arg Pro Tyr Glu Cys Ser Glu Cys Gly Lys Phe 465 470 475 480 ttt agg cac aac tca aat cat att aga cat cgg aga aat cac ttt gga 1488 Phe Arg His Asn Ser Asn His Ile Arg His Arg Arg Asn His Phe Gly 485 490 495 gaa agg tct ttt gag tgc act gag tgt ggg aga gtt ttt agc caa aat 1536 Glu Arg Ser Phe Glu Cys Thr Glu Cys Gly Arg Val Phe Ser Gln Asn 500 505 510 tcc cac ctc att cgg cac caa aaa gtt cac act agg gaa aga act tac 1584 Ser His Leu Ile Arg His Gln Lys Val His Thr Arg Glu Arg Thr Tyr 515 520 525 aaa tgc agc aaa tgt ggg aaa ttt ttt atg gac agc tcc aca ctc att 1632 Lys Cys Ser Lys Cys Gly Lys Phe Phe Met 
Asp Ser Ser Thr Leu Ile 530 535 540 agt cat gag aga gtt cat act gga gaa aag cct tat gag tgc agt gaa 1680 Ser His Glu Arg Val His Thr Gly Glu Lys Pro Tyr Glu Cys Ser Glu 545 550 555 560 tgt ggg aaa gtc ttt aga tac aac tcc agc ctc att aaa cat cgg aga 1728 Cys Gly Lys Val Phe Arg Tyr Asn Ser Ser Leu Ile Lys His Arg Arg 565 570 575 att cac act gga gag aga cct tat cag tgc agt gaa tgt gga aga gtc 1776 Ile His Thr Gly Glu Arg Pro Tyr Gln Cys Ser Glu Cys Gly Arg Val 580 585 590 ttt aac caa aat tct cat ctc att cag cac cag aaa gtt cac acc aga 1824 Phe Asn Gln Asn Ser His Leu Ile Gln His Gln Lys Val His Thr Arg 595 600 605 taaagaatgt atatataaag cagatgggga aagacttcac acagaaatct actctgattt 1884 agcactggga cctacgtttt aaaaaaagta ttcttgtaga atacagataa cataaaatct 1944 aacatcttaa ccatgttaaa gtgtatagtt cagtaccgtt aagtcattca cattgtgcaa 2004 tgaatatcta gaagtctttt caacttatga aactaagtct atacctttta aaaccttatt 2064 cctcactcca tccagcctct tgacaagcac cgctctgtat gaattttact agtccgggta 2124 cctcatataa gaaaacttaa gttttggtct tcttgtggtt tattttgtgg cttattttgc 2184 ttaacgttat atttttaagg tttcatgttc taatccatta gaatttccat cctttttaaa 2244 ggctgaataa aattctgtta gtcatgtgtt gcttaacagt ggggaagtgt cctgagaaaa 2304 gtgttattag gtgattttct ttcttttttt ggtggtgggg ggattgcgtg aatgcctagg 2364 ctgtatggta tatcctatag caccttgcta caaacttgta tagcatatta ctgtactgaa 2424 tactgtaggc tgttggaaca catggtaagt aattgttttt aagtatatct aaacagaaaa 2484 ggtacagtaa aaatacagta taaaagaaaa aatgatagac tcacagagaa cttaccatga 2544 atggagctta cagtactgca agttgctcta ggtgagtcag tgagtggtaa gtgaatgtga 2604 aggcctaggt tgttactgtg ctgtagactt tatagacatt gtgtacttag acgacaatac 2664 atttttattt ttattattat ttttgagaca gaatcttgct ctgttgccca gactagagtg 2724 cagtggtgca atcttggctt cctgcaacct cctccacctc ctggttcaag cagttctgcc 2784 tcagcttccc aagtgtctgg gattacaggc atgcaccacc atgccccgct aatttttgta 2844 tttttagtag agaacggggt cttaccatgt tggccaggct ggtctcaaac tcccgacctc 2904 aagtgagcca ctcgctttgg cctcccaaag tgctgggatt acaggcatga gccaccgcac 2964 ccggccggtt ttaccatttt 
tagagccaag ctttactata tatttgatat ttgttctttc 3024 ttcaacctta gctgaattca cattcctctg atagaaggtg ttttcaaact gatgccgttc 3084 ttagtgcctc aaactagatc ctgttcatac ttgttagaac aagttattac aaattcactt 3144 tggtgtaaaa aattgaaatc catacataat tttttttttt tttttgacag agtctcacta 3204 acgctaggtt ggagtgcagt ggcatggtct cggctcattg caacctccgc ctcctgggtt 3264 caagcagttc tcctgcctca gcctcttgag tagctgggat tacaggtgcc cacaatcacg 3324 cccagctaat ttttgtattt ttagtggaga tgggttttca ctctgttggc caggctgctc 3384 tcgaactcct gacctcaggt gatccacctg cctgggcctc acaaagtgct gtgattacag 3444 gcttaagcca ccacccctgg ccaatttttt cataatatac atttttttct catttttcat 3504 gaaacttttg aagacccctc atattctaga tattccttct cagatatgtg gttttcaaat 3564 actttctccc attgagtctt tttccttttc actctgtcca ttatgtcctt ttttacacag 3624 gaattttgaa tttaaatgga gtctaatata tctgttttat aagtctttga tgcatttgag 3684 ttcattttag caacttaatt cttttgcgtg tggatatcca gttttttttt tttaacatca 3744 aaagaataat gtttttgcct agcattaagg cccttggtag aggcttgtca gttacaattt 3804 tggagcagca gattaagtcc acactcccaa ccattttcct tatcaggctc tcaaactctg 3864 ggccacaata tgtaagaccc aatcacccca ggatcaggaa tcagatatct agggacagct 3924 tctgtgccca ggagcttgta aaattattcc attggtcaat gcacaggggt ccctgaaaac 3984 ctagctaacc ccaatttaca tggcacacac aagctgcccc ctaagctcca gcttgctgtt 4044 atcttgggtt ccctcataac tcttgcagcc ctgcctatgt ccttaggttt caagctgtaa 4104 gtagcaaagt ggtctacatt ttatgattat cattgtgaca tgtcctgaca tcagaaaaac 4164 acctttgtat gttattacta tacaaccagc agaatattat gagtgcagca aatgttagaa 4224 agtattcagc ctaacttcac tgagcaagag taagttcatc ctggagaaag tccttaggaa 4284 tgcaggcaat atactttttt tcctttgtca acaggtcaaa aacagcaaag ctctatcgag 4344 cttgtcttac tcaccctatt tttttgttgc tctgttttgt tttaggcttt tagcctgaag 4404 ccatggtttt gtttctgtct ctagtggtag gtggacaaga ggaatgagat gagaaaggag 4464 ctttactggc ccagctagaa acaaactaag aacccatgac tgtattcttt cccttggatg 4524 accctgtgtt agcttgttga gggagatctc agcctgaaat tgaatctcac atccaaacat 4584 ccacgcaagg gagatttgtt gtaattgtca gatatatggt aaatttttgt gaatgatgtt 4644 gcactttctg acctgcctgg ggcctttcca 
gagttaagtt gctgaaagtg tgcattacag 4704 aagactcctg ctattagctg tcatggtgcc acaatgtgca tcaccttagt caccttaaat 4764 tacttagaga gtgataaggt ctggacttct ggttaaatgt ttttaaaaaa tggggggtgg 4824 gggggtggtg catagattgc tgtgttctct acctttatct ggaatattca gtcatttgtt 4884 ccctttgggg gcctcattcc cagtcccctg actggtttgg gtgtggacat cacccagctt 4944 tggacagaga acacacgcca acttcagctg gcagcttgta gagatttcct tttttcagag 5004 gtattattag ttgtctgata ctgataatgt tgatgataaa ttttctacct tccaagcttc 5064 ccaacccagt caatttccac ctaagcattg ctgttttctt ctgatgataa aggtcatatt 5124 gtttaagcta catttactct tggggttctc ttcactgtgt gctgcgggtt gagaacaaaa 5184 ttaggctttg ccagaatgaa aaagtgaatg gtttttgggg ccttcaactt tttgtgctct 5244 tgaagaaata agaagacaaa atagctttca atccacatca ggcccaattt gcattgcttc 5304 gggagttcct gggaaagtga cggacttcta tccaaaatcg cgccgtgaat ttgattattg 5364 gtagttctac agtcagcttg agggttgttg gtttgacagt tgtcagagca tgttgcagct 5424 gtatgaggtg ggtatctgta catatggatg tcccatattc tccagcattg cataggaata 5484 gctggtgtct agatcctgcc tcaggagcta tgtgtcctga atttaaaaat caggtatgtt 5544 atatccctgg ggcatgtcag acatacaaca acagtgcatt agtccatttt ccattgctta 5604 caacaaaata ccttaaactg ggtaatttgt aaagaaaata aatttcttac tgttgttttg 5664 ttgtgctttc tttttttcta gatcgcgagc ggccgc 5700 <210> 10 <211> 5718 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (1864)..(3324) <400> 10 aaaaaatagc caggcgtggt ggcacatgcc tgtagtccca gctactcggg tggctgaggc 60 gggagaatcg cttgaaccca ggaggcagag gctgcagtga gctgaggtcg cgccactgca 120 ctccagcctg ggtgacagag tgagactctg tctcaaaaaa caaacaaaca agcaaaaatc 180 aaaaacattg tttcagatct tattttccta ttgacctaag agttatttaa tatagacttt 240 tgaattttca ggtagatttt acttgtgttt cttttaattt tattattaat ttccagtttt 300 agtagaaaat gtctatataa tttcaacctt ttttttttgc taactcctag ttttgtgtct 360 agatttcagc tttttaatag agaccacctt atcataattg aaaacctcat cctgtttctc 420 tctcatacac aactcattca gcttgtagat tttcctctgg ctgttcctgt tgctggcttg 480 ccatcctctg gcctacatgt aacgtgactt ctgttagcaa gtccttcaca ttcagatgac 540 tgcaacagtt gaacatgcac attacttttt tccacatttc tgagttacag 
gcatatcctg 600 tctttttgtg ccatgaattt cataagaact tcagtaagat atagacgtgt tttcctgtcc 660 gaaaaatcac tgatacacat ttcctaaaaa aacaggaatg gccttgacta tgagggaatg 720 gaaaggagaa aacctgagca atgggaggaa agaacaaaac cagttgaaca tttgtcaaat 780 gaatgagtaa tagaataagt tgaagggaag attgagtagg tagaactcct caataccttt 840 gctatggtgt agagacaaag aaaaagttac ccatagctat gtgaaggggc aaatgggcag 900 tgcactgcat aggtaaaggc ataagcagtg gcagatgcat aggaatatac agccacctat 960 ttccattctt gtatttagct gggctgccct ggatgaataa agagaatatt taatcactgt 1020 agtaaggcat gagagactag aggcagagag atacatttag aacaagtatg tagctatggg 1080 aattaaatga gtaaaaccta ttatatgctt agcatttatt acttaatgta gcgttaacta 1140 gggattgttt ctgtccttga ggaggtttat aggtaagaat aggcagtata acaattggag 1200 aagttaataa atatataatt tttaaattga gagctattat aaaggcaagg aatagagaat 1260 agcaacatgg atttaggtac tatcaatagg gtcaacctta aaggaaatga cttttaatct 1320 aagatctcaa gatgaaagga gccagccatg caaagagccc aatcaggaga tttgtgggag 1380 atagaacaga ctgtgaagag tcctaagaga tggaagtaag ttaggacatg tgaggtctgc 1440 aaggaggcct ctccagggag cagtaaaggg gacactggtt tgagatcata tgatgtaggt 1500 aagcagaggc cttcttaacc acagtgagct tgactggatt ctgagaatga agaaagatga 1560 aaaaaattta agtatagatt ctcgacctct catcttcatc agaattatca tgtagagaat 1620 attaaatatc cttccaggac tgcagagtct gatccaggtg gtttgggtgg tgtctaaacc 1680 tcagtgttag taacaagcta tgtggtgatt ctgatccaca cctcaatatg agaaccactg 1740 acttaaaatg tttacttcat caccaatctt gctattttgt atgaactctc atttttcctg 1800 atgttccctt tggctttttt tttctttcca aattgtataa caatatttta gagttttggt 1860 taa gga tta ctt ttg ttg tct ttt att tct aca tta gaa atg ttg ctt 1908 Gly Leu Leu Leu Leu Ser Phe Ile Ser Thr Leu Glu Met Leu Leu 1 5 10 15 ctt tta gta aaa gaa aac agc atg tgt ttg ctt tct tgt ttt att tta 1956 Leu Leu Val Lys Glu Asn Ser Met Cys Leu Leu Ser Cys Phe Ile Leu 20 25 30 gat ttg gag acc aag tat gag acc aag aag tta tct tta gaa aat gac 2004 Asp Leu Glu Thr Lys Tyr Glu Thr Lys Lys Leu Ser Leu Glu Asn Asp 35 40 45 att tat gaa ata aat tta tcc cag tgg aag ata atg gaa aga att gaa 2052 Ile Tyr 
Glu Ile Asn Leu Ser Gln Trp Lys Ile Met Glu Arg Ile Glu 50 55 60 aac cat ggc ctt aag ggt ctc att tta aaa aat gat tgg gaa tcc aca 2100 Asn His Gly Leu Lys Gly Leu Ile Leu Lys Asn Asp Trp Glu Ser Thr 65 70 75 gga aaa att gaa gga cag gag aga cct caa gaa gga tac ttc agt agt 2148 Gly Lys Ile Glu Gly Gln Glu Arg Pro Gln Glu Gly Tyr Phe Ser Ser 80 85 90 95 gtg aaa atg cca tct gaa aag gtg tcc tct tac cag aaa cgc aca tct 2196 Val Lys Met Pro Ser Glu Lys Val Ser Ser Tyr Gln Lys Arg Thr Ser 100 105 110 gtt act cca cat cag aga ctt cat ttt gtt gat aaa ccc tat gaa tgt 2244 Val Thr Pro His Gln Arg Leu His Phe Val Asp Lys Pro Tyr Glu Cys 115 120 125 aag gaa tgt ggg aag gcg ttc aga gtg cgc caa cag ctt act ttt cat 2292 Lys Glu Cys Gly Lys Ala Phe Arg Val Arg Gln Gln Leu Thr Phe His 130 135 140 cac aga att cat act ggt gaa aaa ccg tat gaa tgt aag gaa tgt ggg 2340 His Arg Ile His Thr Gly Glu Lys Pro Tyr Glu Cys Lys Glu Cys Gly 145 150 155 atg gcc ttc aga cag act gca cac ctt act cga cat cag aga ctt cat 2388 Met Ala Phe Arg Gln Thr Ala His Leu Thr Arg His Gln Arg Leu His 160 165 170 175 tct ggt gaa aaa ctc tat gaa tgt aag gaa tgt ggg gaa gct ttc ata 2436 Ser Gly Glu Lys Leu Tyr Glu Cys Lys Glu Cys Gly Glu Ala Phe Ile 180 185 190 tgt ggt gca gat ctt aga gta cat cag aaa atg cat att ggt gag aag 2484 Cys Gly Ala Asp Leu Arg Val His Gln Lys Met His Ile Gly Glu Lys 195 200 205 ccc tat gaa tgt aaa gaa tgt ggg aag gct ttt agg gta cga gga caa 2532 Pro Tyr Glu Cys Lys Glu Cys Gly Lys Ala Phe Arg Val Arg Gly Gln 210 215 220 ctt act ctg cat cag agg att cat act ggt gag aaa ccc tat gtg tgt 2580 Leu Thr Leu His Gln Arg Ile His Thr Gly Glu Lys Pro Tyr Val Cys 225 230 235 aaa gag tgt gga aaa gcc ttt aga cag tac gca cac ctg act cgg cat 2628 Lys Glu Cys Gly Lys Ala Phe Arg Gln Tyr Ala His Leu Thr Arg His 240 245 250 255 cag aag ctt aat agt gct gac agg ctc tat gaa tgc aaa gaa tgt ggg 2676 Gln Lys Leu Asn Ser Ala Asp Arg Leu Tyr Glu Cys Lys Glu Cys Gly 260 265 270 aag gcc ttt ttg tgt ggc tct ggt ctt aga gta 
cat cac aaa ctt cat 2724 Lys Ala Phe Leu Cys Gly Ser Gly Leu Arg Val His His Lys Leu His 275 280 285 act ggt gag aaa ccc tat gaa tgt aag gaa tgt ggg aag gcc ttt aga 2772 Thr Gly Glu Lys Pro Tyr Glu Cys Lys Glu Cys Gly Lys Ala Phe Arg 290 295 300 gtg cga caa caa cta aca ctc cat cag aga att cat act ggt gag aaa 2820 Val Arg Gln Gln Leu Thr Leu His Gln Arg Ile His Thr Gly Glu Lys 305 310 315 ccc tat gaa tgt aag gaa tgt gga aag acc ttt agc cgt ggt tat cat 2868 Pro Tyr Glu Cys Lys Glu Cys Gly Lys Thr Phe Ser Arg Gly Tyr His 320 325 330 335 ctt att ctc cat cac aga att cat act ggt gaa aaa cct tac gaa tgt 2916 Leu Ile Leu His His Arg Ile His Thr Gly Glu Lys Pro Tyr Glu Cys 340 345 350 aag gaa tgc tgg aaa gcc ttt agt cgc tac tca caa ctt att tca cat 2964 Lys Glu Cys Trp Lys Ala Phe Ser Arg Tyr Ser Gln Leu Ile Ser His 355 360 365 cag agt att cat att ggt gtt aag ccc tat gac tgt aag gaa tgc ggg 3012 Gln Ser Ile His Ile Gly Val Lys Pro Tyr Asp Cys Lys Glu Cys Gly 370 375 380 aag gcc ttc aga cta ctt tca caa ctc aca cag cat cag agt att cat 3060 Lys Ala Phe Arg Leu Leu Ser Gln Leu Thr Gln His Gln Ser Ile His 385 390 395 att ggt gag aaa cct tat aaa tgt aag gaa tgt ggc aag gcc ttt aga 3108 Ile Gly Glu Lys Pro Tyr Lys Cys Lys Glu Cys Gly Lys Ala Phe Arg 400 405 410 415 ttg cgc caa aaa ctt act cta cat cag agc att cat act ggc gaa aaa 3156 Leu Arg Gln Lys Leu Thr Leu His Gln Ser Ile His Thr Gly Glu Lys 420 425 430 ccc ttt gag tgt aag gaa tgt agg aag gcc ttt aga ctt aat tca tcc 3204 Pro Phe Glu Cys Lys Glu Cys Arg Lys Ala Phe Arg Leu Asn Ser Ser 435 440 445 ctt att caa cat ctg aga att cat tct ggt gag aaa ccc tat gaa tgt 3252 Leu Ile Gln His Leu Arg Ile His Ser Gly Glu Lys Pro Tyr Glu Cys 450 455 460 aag gaa tgt aag aag gcc ttt agg caa cat tca cac ctt act cat cat 3300 Lys Glu Cys Lys Lys Ala Phe Arg Gln His Ser His Leu Thr His His 465 470 475 ctg aaa att cat aat gta aaa atc taagaaagtc ttttcaactt ctgtgttata 3354 Leu Lys Ile His Asn Val Lys Ile 480 485 gaacattcta tgaatgtagt aattaatcta 
ttttgctcca tacatgcaac tgcgttggca 3414 ttagaggttt tataccatta aaagagtgtg acaatgtatt gtagtccatc atcactcaaa 3474 cctgaaactt cagcatattt gttctaaaaa ctaatcctgt tataagaatg aaaaagacat 3534 ttaccgtcat ccctatccca tcactttgtg tcatactgga caagacgctt aactgctctg 3594 tgctataatt ttttaattta tgacatggtc atataagagt gcagcagttt gacacacttt 3654 caatcaagag gtggaatcta catccctctg ttgtatacag gaatctacat ccctctgtga 3714 ctgcttcgcc cagtagaata tggtagacgt actgctgtgc tggtttcttg actcagactt 3774 tactggtagg gttacttcat ctcatgaaat gacagaactg tcttgttaca aagtctggct 3834 actttgctga agagaccaca tgaagaggcc ctgaaactac gtagagaggg agaggcacct 3894 gctgaaccct gtcttcttga catccaccaa gataccagac atttgagtga aattttggat 3954 ccaccagacc agccaccagc taaatgccag tgagcgactt ggtaacctga actgatgcca 4014 tgtagaatgg aagaattgac cagctgagcc cctggcccaa atttctgacc cacaaaaaca 4074 tgagatataa taaaatggtt gttgttttaa gccactaaac tttcttttat tatcatacta 4134 aatatttatt tttacataat tcatactaaa ggctgtgggc ataatgagta tagaaatgcc 4194 ttccaccaac atttaatctt cattgaactt gagatatctt attctcgtta aaatattgaa 4254 agtaaggaat tcctttccag tttgttcatt aaaaaacaga taccctagac caggtgaggt 4314 ggcccatgcc tgtaaatcca gcactttggg aggccaagta aggacgattg cttgaggcca 4374 gaagttcgag accagcctgg gcaacatggt gagaccctgt ctctaccaaa aaactttaaa 4434 aattagctgg gcataatggt gcatgcctgt agtcctagct actcgaggct gaggtgggag 4494 gatcatttga acccaagagt tgaggcttca gtgagctgtg attgtaccac tgcactccag 4554 cttgagtgac tgcaagaccc tgtctcaaaa aaaaaaaaaa actacaatca gtattctctg 4614 gccccagtca atattaaaaa ttgacaaagg ataacatgaa actttggaaa ttaaaagtca 4674 cctgtattat gaagccccat taatttttta aactacttca ttgtttttta aaaatgctaa 4734 attgtcaatt tagcattttg aggcaatccc ctcaaattgg ctcctgtgtc cttttatcat 4794 gaccactcca acctttgagc tttttttttt cctgaaccaa atgcattgat tagttatgga 4854 gcatactaga gcattattaa caaataaagg aaaagaatgg aatatttacc ttgcctatcc 4914 tttacaaact atatttcaag acaaccaaaa gttggtgaag gaaagttgtt gattagagaa 4974 ttccaactgg ttaaaaggtc aaaggaggcc aggcgcggtg gctcaagcct gtaatcccag 5034 cactttggga ggccgaggca ggtggatcgt gaggtcagga 
gatcaagacc atcctggcta 5094 acacggtgaa accccatctc tactaaaaat acaaaaaatt cgccgggcgt ggtggcaggc 5154 gcctgtagtc ccagctactc aggaggctga ggcaggagaa tggcttgaac ccgggaggcg 5214 gagcttgcag tgagccgaga tcgcgccact gcactccagc ctgggtgaca gaccgagact 5274 ctgtctcaaa caaaaaacaa aaaacaaaac aaaacaaaga tcaaatgaat gatagaattt 5334 gaaaactacg ctctttaatt ttacaaaatc atggattttc gtggtgatag caatggatgc 5394 gaagaccatt aggtgaaaaa tggataggaa gcttataatg catggagcag aatgacagga 5454 cactaatcta tattaacatc tctaaatgag atcagccaga tgaacttgat gtgatgaaat 5514 ggatacacac agtggacacc tgtgaagttt tcttggctcc cccaaaactg agaagtacaa 5574 gttagtctcc aaacctaatt accagtttac aggaaacatg gggaataaaa gaacaaatta 5634 acaacacaaa gaagcaaaca accaaatgca caatttggga aattctgcag aagtaatggc 5694 ctagtttttt aaccaataca tgtc 5718 <210> 11 <211> 4342 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (4038)..(4340) <400> 11 cagctctggc agaggactgt tccactagac acgctgaagg gactgggtac gttttccttc 60 aggaccagag ctgagaggag ctgggatcgc ggcggcaatg gaacgggcct cagaaaggcg 120 cacggccagc gcgctttttg cggggttccg ggccttggga cttttcagca acgacattcc 180 acacgtggtg cggttcagcg cgctcaagcg ccggttctat gtaacaacct gcgtgggcaa 240 gagtttccac acctatgacg tgagtgactt cttttgttag cttcccagga aaaccaccct 300 ccttggcctc taactctgtc ctggagcagt ccggttctcc cttcccattt accacgggct 360 tcccttcttt aacccctccc tgtcctatta atatttcgcc atccggtcac gtccacgtgc 420 tgatttggca cttgcagtcc tcctggggcg tgtttcagag gcagcctggc caaggggaaa 480 ctgctagaga acatctcagt tgtatggcag taatatttgt tcagtgaata tttattgacc 540 ttcccttgaa ggtggaaaca taattaagac aaacctctac ctcttgcatt cttaatgaga 600 gagaaaaaca tgcacgctat tacgttattg gtgacacgag ctgagttaga aattcactct 660 aggtagaata cattgggaga accggagatg gtgagattaa ttaccgttag ggaaaatatc 720 taatagcaat cagttatttt caactaacgc aaagtgtctg gcactgtcac tctaagcctt 780 gcagccgttt ctttgcaaca taggtgtcat tataccgtag cacctttata taatctatta 840 aagtaaattt cagataaatc acttgttagt gatgattttt tctcctataa cagttcatcg 900 atcattgtaa tagtgtctct tgtttacttg tttcttttgt cttctccgaa agaatgtaac 960 ctctgtgaga 
acagaaacct tgatgaggag gatgataata gtcagtgtta attaagtact 1020 ttcttttctg gaaaacaggt agagttgtct ctcttggagg attgtgaata tcaaatggct 1080 tattagcaaa aagagctggg aggtaggaac tagttgagtt agtggaaaag gaaagtgagg 1140 gcagagctga ggtgaagagc aattgggttt ctcagagtct gagttggtaa tggagacagg 1200 gtgggttaga accatattaa tcatgtgaaa gtatttagtg ttaccattct ctgcacttgt 1260 tccttgcttg ctaacttttg cttttgctgg ttaatgtttt cttccttgaa atgtaatttt 1320 tctgagttta ttactgaaat aactcttgaa atcttgtctt tggtacagtg tctttcctga 1380 cactgcatta cacacagaca caggtgtttt tccctatatt ctacttccaa agaaaccctg 1440 tggtgaggag agtaacatat ttgtacattt tataaagtta acaatttact tactgtgtgt 1500 tgacatttag tcataataaa gatggctttt aagttctcct ctagttctaa cactgatgat 1560 acattcgtaa catcatggca tgtaaatgtt ctttacccag actgtacact gctcagatta 1620 accaatgcat tgggctgtga cctctaaaac tgaaaatact tgggagtaaa aagaaaaaaa 1680 ggagctaatg aacctaagtc actaattggg tctacaagtg ctctatccca tgactatggc 1740 tatatatgct gtccccagtc agacttccta cagtcctaca gattagtttc ttctacaggg 1800 gaaactagac aagttgtgaa taggtggtgg tagtgatggt agtagctatc atgtattgag 1860 cagttactat atgactggcc ccattctgtt ttacacaaat ttactcattg aatcctcata 1920 acaatcccat aaagtaaata aatacctgtt ttacagatga agaatttgag gcacagggag 1980 cttatttttc cagggtcata cagctggatc agtagccatg attggggtca gcaaattaca 2040 gtctgtaagc cagtctagcc cactgtttgt ttttgtatga cccaggagtt acgaatgatt 2100 ttgacatgtt taaatgtttg tgggggcagg ctgggggtag ggagaaggat atttcatgac 2160 atgaaaacta tgtgaaaatc aaatttcagt atctatagat aaaagtttaa tatacccatg 2220 cacattattt atctattacg tatggctgtt ttcatgctac ggtggcagaa gtgagtagtt 2280 ggcaacagag accacatgca ttgcctgcaa agcctaaaat atttaatatt tggcccatta 2340 tagaaaacat ttgcctaccc cttccatcac cactcctggt gaaatacagt tgaaaatata 2400 aactttccta ataaagaaaa ttagtgtgag taagtgtctt tcttatgaag gacagcatag 2460 caaatatacg tatgaggtta tatcttaatc ttcaggtgtt aggttattat gattatgttt 2520 gttaatgaaa atttaacctt ttttcttttt taaacaggtt cagaaactta gtctggttgc 2580 agtaagtaag tatggacttt attctgaatt tatgcacatc taaacttttt ttttattttg 2640 acataaatgt gagacttaca 
gagcatataa ggaagccaag gtaacatgta tgtcttataa 2700 acttacatta tcagcaacaa aaaagggtgc caggataaat catgttttgc ttttagttgt 2760 tatgtatctt tattttcttt aatctggaac tgttctttag actttctttg tgcttcatga 2820 cattgacaca ttgacatttt tgaagagtac aggctcattt agaatgtcct tcagtttgag 2880 tttttttgac atttcttcag gattagattt aggttgtacc ccttgggagg aatcacagtg 2940 aattgtatca gaaggcacat gctgttagtt ggttctgttc ctggtagtga ttggttagga 3000 ggctctattt ttaagttagt gttttgtcat ttgtaacgaa taagtatctt ttggtagtat 3060 gcactgggac tatgtaaata tcttagtact cttcaaagtt attacctgct gcttttatat 3120 tttatgatga tacttgcctg aatcagttat tagtatgatg gttgccaaat gcatgttgat 3180 tatccttaat ccaaaaattt gaaacccaaa atgctcctaa gtctgaaact ttttgagagc 3240 caacatgacg ttaacaggga aatattcatt ggaacatttc cgatttcaga ttttcagatt 3300 aggaacattc acctggttat acatatagtc caaaatccaa aaaaatccgg aatctgaaac 3360 acttctggtc tcaaacattt cagataaagc atagtcagcc tatagtgatt ttcttttgcc 3420 atcattcctt ctactgtaag gagaagtttt cccttttctc acattttggt atttctttcc 3480 aatggaaggg ttttttttaa actatctaat agaataaaac atttctcagg gcagagaggt 3540 acttgtttta ccatcaaata tggttcttgt gagagaaaca gaagacctac aaggactgta 3600 atgtgtatgt gtattaaact tatattttaa taatagatta gaaggcagaa gaaaggcatt 3660 ggaacttaag ttgaatacaa aagtgaatga taaacttata aaacagggtt atgcaaacaa 3720 atactaatat ggaatttgca gcatgaaaag tacaacaatc ctgtgtctac taaccctcct 3780 gccccaaaag aggaaatgaa aaggtcagtg tgaccaggat actgagagaa gaatcatatg 3840 tcacgtgaga aatgctaaag attgataggc atatttccct tggagaagac tgtagaaaca 3900 atattgtctt taaatgtttt aggaataggc atatagaata ccatatgtaa tatttttggg 3960 ttggtccaaa gggtagagtt agagatttag attttgagtt acttttaaaa tttttctgga 4020 gttattagtg aacttga tgt tcc aga aga tac tca cat ggg tta ggc atg 4070 Cys Ser Arg Arg Tyr Ser His Gly Leu Gly Met 1 5 10 tgg ctc acg cct gta atc cca gca ctt tgg aag gcc gag gtg ggc aga 4118 Trp Leu Thr Pro Val Ile Pro Ala Leu Trp Lys Ala Glu Val Gly Arg 15 20 25 tca cct gag gtc agg agt tca aga cca gcc tgt cta aca tgg caa aac 4166 Ser Pro Glu Val Arg Ser Ser Arg Pro Ala Cys Leu Thr Trp 
Gln Asn 30 35 40 cct gtc tct tct aaa aat aca aaa att agc cgg gtg tgg tgg tgc act 4214 Pro Val Ser Ser Lys Asn Thr Lys Ile Ser Arg Val Trp Trp Cys Thr 45 50 55 cct ata atc cca gct act tgg gag gct gag gca gga gaa tca gtt aaa 4262 Pro Ile Ile Pro Ala Thr Trp Glu Ala Glu Ala Gly Glu Ser Val Lys 60 65 70 75 cct ggg agg cgg atg ttg cag tgt gcc aag att gca tca ctg cac tcc 4310 Pro Gly Arg Arg Met Leu Gln Cys Ala Lys Ile Ala Ser Leu His Ser 80 85 90 agc ctg agt gac aca gca aga ctc cgt ctc tc 4342 Ser Leu Ser Asp Thr Ala Arg Leu Arg Leu 95 100 <210> 12 <211> 4415 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (251)..(1855) <400> 12 gctgtgacct gcggcagcac agccgcctgc gttgagcgcc cacggtgggc tggactttgc 60 actaggtgct gacaggaccg gcagaggtgg ccactgccct cgtccccagc ctgcactcct 120 gggcgaaggc tgacgctgaa cagggtgctg tgggcccaga agcgcccatg cctgagagcg 180 tccaggacca ggcagagctg ggccttaagg gcccttggac cagtgtctgt ctgcagggag 240 cccccggtag cca ctc gga tca gcc cga ggg aag att ctg gac gag acc 289 Pro Leu Gly Ser Ala Arg Gly Lys Ile Leu Asp Glu Thr 1 5 10 ggg gcc gtc ccc cgg gtg gcc cat gga cag cag cag ggg ctc cca gga 337 Gly Ala Val Pro Arg Val Ala His Gly Gln Gln Gln Gly Leu Pro Gly 15 20 25 gtg gcc agg ccc tgc ccg ccc acc atg ctg cag tgt aga ccc gca cag 385 Val Ala Arg Pro Cys Pro Pro Thr Met Leu Gln Cys Arg Pro Ala Gln 30 35 40 45 gag ttc agc ttc ggg ccc cgg gcc ttg aag gac gct ctg gtc tcc act 433 Glu Phe Ser Phe Gly Pro Arg Ala Leu Lys Asp Ala Leu Val Ser Thr 50 55 60 gac gca gcc ctg cag cag ctg tat gtg tcc gcc ttc tcc cct gcc gag 481 Asp Ala Ala Leu Gln Gln Leu Tyr Val Ser Ala Phe Ser Pro Ala Glu 65 70 75 cgg ctc ttc ctg gcc gag gcc tac aac ccg cag agg acg ctc ttc tgc 529 Arg Leu Phe Leu Ala Glu Ala Tyr Asn Pro Gln Arg Thr Leu Phe Cys 80 85 90 acc ctg ctc atc cgc acg ggc ttc gac tgg ctc ctg agc cga ccc gag 577 Thr Leu Leu Ile Arg Thr Gly Phe Asp Trp Leu Leu Ser Arg Pro Glu 95 100 105 gct ccc gag gac ttc cag acc ttc cac gcc tcc ctg cag cac cgg aag 625 Ala Pro Glu Asp Phe Gln Thr Phe His Ala 
Ser Leu Gln His Arg Lys 110 115 120 125 ccc cgc ctg gct cgg aag cac atc tac cta cag ccg ata gac ctg agc 673 Pro Arg Leu Ala Arg Lys His Ile Tyr Leu Gln Pro Ile Asp Leu Ser 130 135 140 gag gag ccg gtg gga agc tcc ctg ctg cac cag ctg tgc agc tgc aca 721 Glu Glu Pro Val Gly Ser Ser Leu Leu His Gln Leu Cys Ser Cys Thr 145 150 155 gag gcc ttc ttc ctg ggc ctg cgc gtc aag tgc ctg ccg tcg gtg gca 769 Glu Ala Phe Phe Leu Gly Leu Arg Val Lys Cys Leu Pro Ser Val Ala 160 165 170 gcc gcg tcc atc cgc tgc tcc tcg cgg ccc agc cgg gac tct gac agg 817 Ala Ala Ser Ile Arg Cys Ser Ser Arg Pro Ser Arg Asp Ser Asp Arg 175 180 185 ctc cag ctc cac aca gac ggc atc ctg tcc ttc ttg aag aac aac aag 865 Leu Gln Leu His Thr Asp Gly Ile Leu Ser Phe Leu Lys Asn Asn Lys 190 195 200 205 cca ggg gac gcg ctg tgt gtg ctg ggc ctc aca ctg tct gac ctg tac 913 Pro Gly Asp Ala Leu Cys Val Leu Gly Leu Thr Leu Ser Asp Leu Tyr 210 215 220 ccc cat gag gcc tgg agc ttc acc ttc agc aag ttc ctt cca ggg cac 961 Pro His Glu Ala Trp Ser Phe Thr Phe Ser Lys Phe Leu Pro Gly His 225 230 235 gaa gtg ggc gtc tgc agc ttc gcc cgg ttc tca ggg gaa ttc ccg aag 1009 Glu Val Gly Val Cys Ser Phe Ala Arg Phe Ser Gly Glu Phe Pro Lys 240 245 250 tcg ggg ccc agc gcc cct gat ctg gcc ctg gta gag gca gca gca gac 1057 Ser Gly Pro Ser Ala Pro Asp Leu Ala Leu Val Glu Ala Ala Ala Asp 255 260 265 ggc ccc gag gcc ccc ctg cag gac agg ggc tgg gcc ctg tgc ttc agt 1105 Gly Pro Glu Ala Pro Leu Gln Asp Arg Gly Trp Ala Leu Cys Phe Ser 270 275 280 285 gcc ctg ggg atg gtt cag tgc tgc aag gtc acg tgc cac gag ctc tgc 1153 Ala Leu Gly Met Val Gln Cys Cys Lys Val Thr Cys His Glu Leu Cys 290 295 300 cac ctt ctg ggc ctg ggg aac tgc cgc tgg ctc cgc tgc ctc atg cag 1201 His Leu Leu Gly Leu Gly Asn Cys Arg Trp Leu Arg Cys Leu Met Gln 305 310 315 ggt gcg ctc agc ctg gac gag gcc ctg cgg cgg ccc ctg gac ctc tgt 1249 Gly Ala Leu Ser Leu Asp Glu Ala Leu Arg Arg Pro Leu Asp Leu Cys 320 325 330 ccc atc tgc ctg agg aag ctg cag cat gtc ctg ggt ttc agg ctc atc 1297 Pro 
Ile Cys Leu Arg Lys Leu Gln His Val Leu Gly Phe Arg Leu Ile 335 340 345 gag agg tac cag aga ctc tac acc tgg act cag gcg gtg gtg ggg acg 1345 Glu Arg Tyr Gln Arg Leu Tyr Thr Trp Thr Gln Ala Val Val Gly Thr 350 355 360 365 tgg ccc agc cag gag gcg ggg gag ccg tca gtg tgg gag gac acc ccg 1393 Trp Pro Ser Gln Glu Ala Gly Glu Pro Ser Val Trp Glu Asp Thr Pro 370 375 380 cct gcc agc gcc gac tcg ggc atg tgc tgt gag agt gac tcg gag ccc 1441 Pro Ala Ser Ala Asp Ser Gly Met Cys Cys Glu Ser Asp Ser Glu Pro 385 390 395 ggc acc agt gtg tcg gag ccc ctc acc cct gat gcc ggg agt cac acc 1489 Gly Thr Ser Val Ser Glu Pro Leu Thr Pro Asp Ala Gly Ser His Thr 400 405 410 ttc gcc tca ggg cca gag gaa ggg ctg agc tac ctg gca gcc tca gag 1537 Phe Ala Ser Gly Pro Glu Glu Gly Leu Ser Tyr Leu Ala Ala Ser Glu 415 420 425 gct ccg ctg cca cct ggg ggc cct gcg gag gcc atc aag gag cat gaa 1585 Ala Pro Leu Pro Pro Gly Gly Pro Ala Glu Ala Ile Lys Glu His Glu 430 435 440 445 cgg tgg ctg gcc atg tgc atc cag gcc ctg cag cgg gaa gtg gca gag 1633 Arg Trp Leu Ala Met Cys Ile Gln Ala Leu Gln Arg Glu Val Ala Glu 450 455 460 gag gac ctg gtg cag gtg gac aga gcc gtg gac gcc ctc gac cgc tgg 1681 Glu Asp Leu Val Gln Val Asp Arg Ala Val Asp Ala Leu Asp Arg Trp 465 470 475 gag atg ttc acg ggc cag ctc ccg gcc acc agg cag gac cca ccc agc 1729 Glu Met Phe Thr Gly Gln Leu Pro Ala Thr Arg Gln Asp Pro Pro Ser 480 485 490 agc agg gac agc gtg ggg ctg cgc aag gtg ctg ggg gac aag ttc tcc 1777 Ser Arg Asp Ser Val Gly Leu Arg Lys Val Leu Gly Asp Lys Phe Ser 495 500 505 tcc ctg agg agg aag ctg agt gcc cga aaa ctc gcc aga gca gag tcg 1825 Ser Leu Arg Arg Lys Leu Ser Ala Arg Lys Leu Ala Arg Ala Glu Ser 510 515 520 525 gcc ccc cgt ccc tgg gat ggg gaa gag agt tagtacagca ggggctgccc 1875 Ala Pro Arg Pro Trp Asp Gly Glu Glu Ser 530 535 tacgtctcct tccctaagga tgctggccag cactgtccag tagctgaggc cactactgac 1935 ctgccaggga taaagaggaa gggtctgcct gggtggtggc tcaggcctgt catcccatca 1995 ctttgagagg ccaggagttt gagaccagac tgggcaacat ggtgagactc 
tgcctctaca 2055 aaagaaaaat taaaaaatta gctggatgaa gtggttcatg cctgtgttcc cagctattca 2115 ggaggctgag gtgggaggat tgcttgagcc taggaggtcg aggctgcagt gggatgtgat 2175 cataccactg tactgcagtc tgggccacac agaaagactg tctccagaaa aaaaaaagtt 2235 ctttggagaa gccacagacc acctgtcttc aggcgcctcc ttcaactcct gagtcccagc 2295 cagccgctcc caggggcctg cacacatgga gaggcctccc tgatcctggg tgcttctcgt 2355 ggagtacaag ccggactgtg ctgaggttgg gacagagccc cctcccctgc agaggcagaa 2415 ggaagcagcg tgcgtcctgt ctccttccag gctgtgggcc tgcccttcag ttatttatag 2475 ctggagctgg agaggctggc tcagatgagg agtgaccccg ggggcacaca ggctccacac 2535 tgccacccag cttccaaggc tgagtctcct ccctaacggg gaagtgacgg ggttttgtct 2595 ctatcatctc aggcgtcaac cacatgcaca cacacactgt cacgttctgt ggcgctaaca 2655 gcatcctgat cctgacggac ttcaccgggg ctctccaggc atctcttctg acaaacactg 2715 caggaggtga gggtgtctga cgtgcactga gggcagaggc ccccttattc ctgaggcggc 2775 tacagctcac cgtggggaag atcaactgtg gtgatgtttt tgggacagtt tcttggcaaa 2835 ggtggccgcg ctgtcagtac caagtagctg gaggtggtga tcagatgatc tgtctttcct 2895 ttttttttcg gtctagttct gtcagttgct gagagagggg tattattgcc atggctgggc 2955 gtttgatctg tctcccttta gttttgccgg attttgcttc ctgcactttg aagctccttt 3015 acgtccctgc acatttgatt gttgtgtctt cctgaggaat tgaccctttt attgtcacac 3075 gtaccccctc agcctgtggt ggagggtctc tgggagagag acttccggtt ctgacagcgg 3135 aatcccttcg gagtgacgag ggcggggtca cagctcgcgc accctcatct ggagagaggc 3195 aagaacaggg cagcttggac cttttgtggg tgggtggctc acacggttat ggagggcttc 3255 taagaaaaac aaaatacaaa actaaaacta gaagggctat gcaggtgagg tgggctgagg 3315 cttccttgtg aatctgacat tggtggaggc cgactgaagg ctcccggtcc tggtcccact 3375 ccgtgttgac ttcagagaag caaagatgca gctcagaagt agcattagga tcttcgtccc 3435 gttctctttt gcgtagcttc aaaaaggcgt aacagtgacc tgggagggga gatgaagagc 3495 taggcctttc agaaagtgac caatggaagg tgccggcaga gcaccctgca gtaacagaag 3555 gtgaaagccg gagcctggtg gctgttgctg caaacaacca cccaaaactt agtggcttaa 3615 aatcaccaca gtctagctca tgaatctgta acttgggcag agctcggtgg agacggtttg 3675 cccctgtggc tcgactggag tgttcgttca cacggctgcc aagtggaatt tggctgggaa 
3735 tctcaggcct tggttcctgc ccacgtgaga ctctccatgg aggaggggca tcttcttagg 3795 aaggcagctg ccataccgtg aggaaacccg acaaacaggg cagccagagc cagacagcag 3855 gctcctggtc ccagtcccgg aaaatgcaaa gggacaagta tgtgttgatt tcagagaagc 3915 tcagatgtag cattaggacc ttcatccata cccttctctt ttgcatggct tctaaagggc 3975 atgacagtga cctgggaggt aactgtgcca gaataaagag gggagacgaa aaaaagaggg 4035 tcactttcag aaagggacct gggcttcctc acaatatggc ggttgccctc ctaaggtgga 4095 catcgggagg gtcagatgga agctctgtgg cctttcctaa ctcggcctca cgtcacagag 4155 cgtcaccttt cctgcactcc acggcagcag tgacaaggac cccaccccag gttcacgggg 4215 aggggacgtg gcagggaatg tgagacaaca caatattgct gtggccattc atgaacagtc 4275 agtcagcccc actcggcgtt cactgtgggg atttggctgg tgcacctgcg aggtggcctg 4335 acctgttttc cgatttcctt catcttctgc aaaaggttaa ccgctggagt gatgcgagat 4395 taaacagagg tgataaaaat 4415 <210> 13 <211> 3954 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (130)..(2166) <400> 13 agggtctgtg tcaagagcgg caggggctgc aggagataag agaaacgggc tgccgtgggg 60 tgtgtgtagg cttcagagac atgggatcac ggaagactga agcagaaaca gtggattaag 120 acttcctga gcg ata gct ggc act gcc ttc ccc aca atg gca gag gtg gtg 171 Ala Ile Ala Gly Thr Ala Phe Pro Thr Met Ala Glu Val Val 1 5 10 gct gag gtg gcc gag atg cca aca cag atg tca cca ggg gca gtg gag 219 Ala Glu Val Ala Glu Met Pro Thr Gln Met Ser Pro Gly Ala Val Glu 15 20 25 30 atg tca aca cct atg tcg gca gag atg atg gag atg tca aca gaa gtg 267 Met Ser Thr Pro Met Ser Ala Glu Met Met Glu Met Ser Thr Glu Val 35 40 45 act gag atg aca cct ggg gag gcc ctt gcc tca tcc ctc ttc ttc cag 315 Thr Glu Met Thr Pro Gly Glu Ala Leu Ala Ser Ser Leu Phe Phe Gln 50 55 60 cat cac cag ttc atg tgc tct gag tgt ggc agc ctc tat aac aca ctg 363 His His Gln Phe Met Cys Ser Glu Cys Gly Ser Leu Tyr Asn Thr Leu 65 70 75 gag gaa gtc ctc tca cac cag gag cag cac atg ctt gct gtc tca gag 411 Glu Glu Val Leu Ser His Gln Glu Gln His Met Leu Ala Val Ser Glu 80 85 90 gag gag gca ctg acc aca cag aat gtt ggc ctg gag ccg gag ctg gtg 459 Glu Glu Ala Leu Thr Thr Gln Asn Val Gly Leu 
Glu Pro Glu Leu Val 95 100 105 110 ccg ggt gct gag ggg ccc ttc cag tgt ggt gaa tgc agc cag ctc atc 507 Pro Gly Ala Glu Gly Pro Phe Gln Cys Gly Glu Cys Ser Gln Leu Ile 115 120 125 ctc tcc cct ggg gag ctc ctg gcc cac cag gat gcc cac ctc cga gag 555 Leu Ser Pro Gly Glu Leu Leu Ala His Gln Asp Ala His Leu Arg Glu 130 135 140 tct gca aac cag atc caa tac cag tgc tgg gac tgc cag gag ctg ttc 603 Ser Ala Asn Gln Ile Gln Tyr Gln Cys Trp Asp Cys Gln Glu Leu Phe 145 150 155 ccc tcg ccc gag ctg tgg gtg gct cat cga aag gcc cag cac ctt tct 651 Pro Ser Pro Glu Leu Trp Val Ala His Arg Lys Ala Gln His Leu Ser 160 165 170 gct acg gta gct gag cca cca gtg cca cct cct ttg cct ccc cca aca 699 Ala Thr Val Ala Glu Pro Pro Val Pro Pro Pro Leu Pro Pro Pro Thr 175 180 185 190 cca ctg cct cca cct tct ccc cca tcc gaa gtc aag atg gag ccc tat 747 Pro Leu Pro Pro Pro Ser Pro Pro Ser Glu Val Lys Met Glu Pro Tyr 195 200 205 gag tgt cct gag tgc tct acc ctc tgc gcc acc cct gag gag ttc ttg 795 Glu Cys Pro Glu Cys Ser Thr Leu Cys Ala Thr Pro Glu Glu Phe Leu 210 215 220 gag cat cag ggc acc cac ttt gac tcc cta gag aaa gag gag cgc aat 843 Glu His Gln Gly Thr His Phe Asp Ser Leu Glu Lys Glu Glu Arg Asn 225 230 235 ggg ttg gag gag gag gaa gag gac gat gag gag gat gaa gaa gat gat 891 Gly Leu Glu Glu Glu Glu Glu Asp Asp Glu Glu Asp Glu Glu Asp Asp 240 245 250 gaa gag atg gag gat gag gag gcc atg gca gag gtc ggt gat gat gct 939 Glu Glu Met Glu Asp Glu Glu Ala Met Ala Glu Val Gly Asp Asp Ala 255 260 265 270 gtg gga ggt gac gag tcc aca gct ggc tgg gct cag ggc tgc ggg gac 987 Val Gly Gly Asp Glu Ser Thr Ala Gly Trp Ala Gln Gly Cys Gly Asp 275 280 285 tgt ccc cag cac cag ccc tca gca ggg gct cgc cgg caa cac cgg cgg 1035 Cys Pro Gln His Gln Pro Ser Ala Gly Ala Arg Arg Gln His Arg Arg 290 295 300 acg gct cac agc ccg gca tct gcc acc cac ccc ttc cac tgc agc cag 1083 Thr Ala His Ser Pro Ala Ser Ala Thr His Pro Phe His Cys Ser Gln 305 310 315 tgt cag cgc agt ttc agc tcc gcc aac cgg ctg cag gct cat ggg cgg 1131 Cys Gln Arg 
Ser Phe Ser Ser Ala Asn Arg Leu Gln Ala His Gly Arg 320 325 330 gcc cat gtt ggt ggc aca cat gag tgt aca acc tgc tcc aag gtc ttc 1179 Ala His Val Gly Gly Thr His Glu Cys Thr Thr Cys Ser Lys Val Phe 335 340 345 350 aag aaa gca gca tcg ctt gag cag cac ttg cgg ctg cat cgc ggg gaa 1227 Lys Lys Ala Ala Ser Leu Glu Gln His Leu Arg Leu His Arg Gly Glu 355 360 365 gcc cgc tac ctc tgt gta gac tgt ggc cgc ggc ttt ggc aca gaa ctc 1275 Ala Arg Tyr Leu Cys Val Asp Cys Gly Arg Gly Phe Gly Thr Glu Leu 370 375 380 acg ttg gtg gct cac cgg cgg gcc cac act gcc aac cca ttg cat cgc 1323 Thr Leu Val Ala His Arg Arg Ala His Thr Ala Asn Pro Leu His Arg 385 390 395 tgt cgt tgc ggc aag acg ttc agc aac atg acc aag ttc ctc tac cac 1371 Cys Arg Cys Gly Lys Thr Phe Ser Asn Met Thr Lys Phe Leu Tyr His 400 405 410 cgg cgc act cac gcc ggc aaa agc ggg gca cct ccc aca gga gca aca 1419 Arg Arg Thr His Ala Gly Lys Ser Gly Ala Pro Pro Thr Gly Ala Thr 415 420 425 430 gct ccc cca gct cca gcg gag ccc acc cct cca cca cca ccc cct gcc 1467 Ala Pro Pro Ala Pro Ala Glu Pro Thr Pro Pro Pro Pro Pro Pro Ala 435 440 445 cca cct gcc cag ctg ccc tgc cca cag tgc tcc aag tcc ttt gcc tca 1515 Pro Pro Ala Gln Leu Pro Cys Pro Gln Cys Ser Lys Ser Phe Ala Ser 450 455 460 gct tcc cgg ctg tcc cgg cac cgg cgt gca gta cac ggg ccc cct gaa 1563 Ala Ser Arg Leu Ser Arg His Arg Arg Ala Val His Gly Pro Pro Glu 465 470 475 cgg cgt cac cgc tgt ggg gtt tgt ggc aag ggc ttc aag aag ctg atc 1611 Arg Arg His Arg Cys Gly Val Cys Gly Lys Gly Phe Lys Lys Leu Ile 480 485 490 cac gtg cgc aac cac ctg cgg aca cac acg ggt gag agg ccc ttc cag 1659 His Val Arg Asn His Leu Arg Thr His Thr Gly Glu Arg Pro Phe Gln 495 500 505 510 tgc cac tca tgt ggc aag acc ttt gct tct ttg gcc aac ctc agc cgc 1707 Cys His Ser Cys Gly Lys Thr Phe Ala Ser Leu Ala Asn Leu Ser Arg 515 520 525 cac cag ctg acc cat acg ggt gca cgt ccc tac caa tgc ctg gac tgt 1755 His Gln Leu Thr His Thr Gly Ala Arg Pro Tyr Gln Cys Leu Asp Cys 530 535 540 ggc aag cgc ttc aca cag agc tcc aac 
ctg cag cag cac cgg cgg ttg 1803 Gly Lys Arg Phe Thr Gln Ser Ser Asn Leu Gln Gln His Arg Arg Leu 545 550 555 cac ttg cgg cca gtc gcc ttt gcc cgc gcc ccc cgc ctc ccc atc act 1851 His Leu Arg Pro Val Ala Phe Ala Arg Ala Pro Arg Leu Pro Ile Thr 560 565 570 ggt ctc tac aac aag agt ccc tac tac tgc ggg act tgt ggc cgc tgg 1899 Gly Leu Tyr Asn Lys Ser Pro Tyr Tyr Cys Gly Thr Cys Gly Arg Trp 575 580 585 590 ttc cgc gcc atg gcg ggc ttg cga ctg cat cag cgg gtc cat gcc cga 1947 Phe Arg Ala Met Ala Gly Leu Arg Leu His Gln Arg Val His Ala Arg 595 600 605 gct cgg act ttg acg cta cag cct ccc aga tca cca tct cct gcc cca 1995 Ala Arg Thr Leu Thr Leu Gln Pro Pro Arg Ser Pro Ser Pro Ala Pro 610 615 620 ccc cca cct cca gag cct caa cag act atc atg tgc aca gag ctg ggg 2043 Pro Pro Pro Pro Glu Pro Gln Gln Thr Ile Met Cys Thr Glu Leu Gly 625 630 635 gag acc atc gcc atc att gag aca tcc cag cca ctg gcg ctt gag gac 2091 Glu Thr Ile Ala Ile Ile Glu Thr Ser Gln Pro Leu Ala Leu Glu Asp 640 645 650 acc ctg cag ctg tgc cag gct gca ctg ggg gcc agt gaa gca ggc ggg 2139 Thr Leu Gln Leu Cys Gln Ala Ala Leu Gly Ala Ser Glu Ala Gly Gly 655 660 665 670 ctc ttg cag ttg gac acg gcc ttc gtg tgacgcagct gaaaagcaac 2186 Leu Leu Gln Leu Asp Thr Ala Phe Val 675 aacaaaaggg tttggttgca acagccagtg tgggtacctc tggggagaga ggacctcctc 2246 tgacaaactg gtctggtacc caccatgtgc caggatccac cctggcctct ttttacccac 2306 tgactcccca gaacaaccct tccaggcttc tcttgtcatc tttctctgcc tgaggggaaa 2366 ctgaagctct gaaatgcgat gtgatctgta ccaggtcacc cagctatgct gcaaagtggg 2426 ttggccaagg ccctttgcac tgcatcaccc tggtgcccag caacatcagg taaccttcac 2486 tgagcaccaa gcttatgcca ggtctgtgct ggccactctc atatacctct tcagatcctc 2546 tgcttgtacc cccagccctt gccttccctg gattttgggc acccaggact ttgctctgcc 2606 tggtggaggg tacttgattt ctctgggctt ccttcatctc aattctgaca gtgtggaagg 2666 aaatctgtag gtacccaggt cctcagctcc agactgggtg atgctggaga cccaggagca 2726 agtcagcaca ggctctgccc acaggaggca tgcacaatct ggtaggagaa acgcacaagt 2786 acagatagct tgctctctga gtgtgtctca ttgattcatt cagcctgtgt 
gtcctaaagc 2846 ctcccttctc ccatactggg tgatgctggg ggcagagatg agtcaggcct aaggcttctc 2906 tgggctgtga tggggatgtg cagggacagt gggatcccag gaaacagacc tccatggtgg 2966 gagatcagga gttaaggact tcctagagaa ggtagtccct gaaggaaaag taggaatctg 3026 gcagtgaaag aggaaaagaa cattccaggc agacagcact gcaagagggt gagctcttgg 3086 tgtatttgga gaccatccag ttattcatct gactgcctac ctgaggcaag caccacactt 3146 accactgtat gtgtactatg atcccaacag gcttgtgaca tagatttcat attatccctg 3206 ttgtatagat ttttaaaact ggtggctggg cgtggtggct cacgcctgta atcccagcac 3266 tttgggaggc cgaggcgggc agatcacgag gtcgggagat cgagaccatc ctggctaaca 3326 cggtgaaacc ccgtctctac taaaaataca aaaaattggc caggagtggt ggcgggcgcc 3386 tgtggtggcc agctactcag gaggctgagg caggagaatg gcgtgaaccc gggaggcgga 3446 gcttgcagtg agccgagatc gcgccactgc actccagcct gggcgacaga gtgagactcc 3506 ctctcaaaag aaaataaaaa taaaaataaa ctggccaggt gcggtggctc acgcttgcaa 3566 tcccagtgct ttgggagact gaggcaggag gatcacttaa ggccaggagt tcgagaccag 3626 cctgggcaac agcaagacct catctctaca aaacttaaaa gttagccagg cctggtggca 3686 catgcctgta atccgagtgc tttgggaggc tgaggcagga ggatcactcg agctcaagaa 3746 ttcaagacct gggcaagaga gagagacttt gtctctatta aaaataaaaa tgattggccg 3806 agcatggtgg tgcacaccag tattcccagc tactctggag gctgaggcgg aggcaggatt 3866 gcttgatatg agggagattg aggctgcagt gagctgtgtt cacaccactg cactccagtc 3926 tgggcaacag tgcaagaccc tgtcactt 3954 <210> 14 <211> 3846 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (1)..(1404) <400> 14 aat ttg gtt aca caa ctg aac att cct gca aga ata agg cct agt gaa 48 Asn Leu Val Thr Gln Leu Asn Ile Pro Ala Arg Ile Arg Pro Ser Glu 1 5 10 15 tgt gag acc ctt gga agc aat ttg gga cat aat gca gac tta ctt aat 96 Cys Glu Thr Leu Gly Ser Asn Leu Gly His Asn Ala Asp Leu Leu Asn 20 25 30 gag aat aat att ctt gca aaa aag aaa ccc tat aag tgt gat aaa tgt 144 Glu Asn Asn Ile Leu Ala Lys Lys Lys Pro Tyr Lys Cys Asp Lys Cys 35 40 45 aga aaa gcc ttt att cat aga tca tcg ctt act aaa cat gag aaa aca 192 Arg Lys Ala Phe Ile His Arg Ser Ser Leu Thr Lys His Glu Lys Thr 50 55 60 cat aaa gga 
gag gga gct ttc cct aat gga aca gat caa gga att tat 240 His Lys Gly Glu Gly Ala Phe Pro Asn Gly Thr Asp Gln Gly Ile Tyr 65 70 75 80 cct gga aag aaa cac cat gaa tgt acc gac tgt ggg aaa acc ttt ctc 288 Pro Gly Lys Lys His His Glu Cys Thr Asp Cys Gly Lys Thr Phe Leu 85 90 95 tgg aag aca cag ctt act gag cat cag aga att cac act ggg gag aaa 336 Trp Lys Thr Gln Leu Thr Glu His Gln Arg Ile His Thr Gly Glu Lys 100 105 110 ccc ttt gaa tgc aat gta tgt gga aag gcc ttc agg cat agc tca tct 384 Pro Phe Glu Cys Asn Val Cys Gly Lys Ala Phe Arg His Ser Ser Ser 115 120 125 ctt ggt cag cat gag aat gct cat acc gga gag aaa ccc tat cag tgt 432 Leu Gly Gln His Glu Asn Ala His Thr Gly Glu Lys Pro Tyr Gln Cys 130 135 140 agt ctc tgt ggg aaa gcc ttc cag cgc agc tcc tcc ctt gtt caa cac 480 Ser Leu Cys Gly Lys Ala Phe Gln Arg Ser Ser Ser Leu Val Gln His 145 150 155 160 cag cga att cac act gga gag aaa ccc tat cga tgt aat cta tgt ggg 528 Gln Arg Ile His Thr Gly Glu Lys Pro Tyr Arg Cys Asn Leu Cys Gly 165 170 175 agg tcc ttt agg cat ggc aca tcc ctc act caa cac gag gtc aca cac 576 Arg Ser Phe Arg His Gly Thr Ser Leu Thr Gln His Glu Val Thr His 180 185 190 agt gga gag aag ccc ttc cag tgt aag gaa tgt ggg aaa gcc ttt agt 624 Ser Gly Glu Lys Pro Phe Gln Cys Lys Glu Cys Gly Lys Ala Phe Ser 195 200 205 cga tgt tct tcc ctt gtc caa cat gag agg act cat act gga gag aaa 672 Arg Cys Ser Ser Leu Val Gln His Glu Arg Thr His Thr Gly Glu Lys 210 215 220 cct ttt gaa tgt agc ata tgt ggg agg gct ttt ggt cag agc cca tcc 720 Pro Phe Glu Cys Ser Ile Cys Gly Arg Ala Phe Gly Gln Ser Pro Ser 225 230 235 240 ctt tat aaa cat atg agg att cat aag aga ggc aaa cct tac caa agc 768 Leu Tyr Lys His Met Arg Ile His Lys Arg Gly Lys Pro Tyr Gln Ser 245 250 255 agt aac tac agc ata gat ttc aag cac agc aca tct ctc act cag gat 816 Ser Asn Tyr Ser Ile Asp Phe Lys His Ser Thr Ser Leu Thr Gln Asp 260 265 270 gaa agc act ctt acc gaa gtg aaa tcc tac cat tgt aat gac tgt ggg 864 Glu Ser Thr Leu Thr Glu Val Lys Ser Tyr His Cys Asn Asp Cys Gly 
275 280 285 gaa gac ttt agt cac att aca gac ttt act gac cat cag agg atc cat 912 Glu Asp Phe Ser His Ile Thr Asp Phe Thr Asp His Gln Arg Ile His 290 295 300 act gca gag aac ccc tat gat tgt gag cag gct ttt agt cag caa gct 960 Thr Ala Glu Asn Pro Tyr Asp Cys Glu Gln Ala Phe Ser Gln Gln Ala 305 310 315 320 att tct cat cct gga gag aaa ccc tat caa tgt aat gta tgt ggg aaa 1008 Ile Ser His Pro Gly Glu Lys Pro Tyr Gln Cys Asn Val Cys Gly Lys 325 330 335 gct ttc aaa agg agt aca agt ttc ata gag cat cac aga att cat act 1056 Ala Phe Lys Arg Ser Thr Ser Phe Ile Glu His His Arg Ile His Thr 340 345 350 gga gag aaa ccc tat gaa tgt aat gag tgt gga gaa gcc ttt agt cga 1104 Gly Glu Lys Pro Tyr Glu Cys Asn Glu Cys Gly Glu Ala Phe Ser Arg 355 360 365 cgc tca tcg ctt act caa cat gag aga acc cac act gga gag aaa ccc 1152 Arg Ser Ser Leu Thr Gln His Glu Arg Thr His Thr Gly Glu Lys Pro 370 375 380 tat gaa tgt att gac tgt ggg aaa gcc ttt agt caa agt tca tct ctc 1200 Tyr Glu Cys Ile Asp Cys Gly Lys Ala Phe Ser Gln Ser Ser Ser Leu 385 390 395 400 att cag cat gag aga act cat act gga gag aag ccc tat gaa tgt aat 1248 Ile Gln His Glu Arg Thr His Thr Gly Glu Lys Pro Tyr Glu Cys Asn 405 410 415 gaa tgt ggg aga gcc ttc cga aaa aaa acc aac ctg cat gat cat cag 1296 Glu Cys Gly Arg Ala Phe Arg Lys Lys Thr Asn Leu His Asp His Gln 420 425 430 aga att cat act gga gaa aaa ccc tat tct tgt aag gaa tgt ggg aaa 1344 Arg Ile His Thr Gly Glu Lys Pro Tyr Ser Cys Lys Glu Cys Gly Lys 435 440 445 aac ttc agc cga agt tca gct ctt act aaa cac cag aga att cat act 1392 Asn Phe Ser Arg Ser Ser Ala Leu Thr Lys His Gln Arg Ile His Thr 450 455 460 cga aat aaa ctc taggaaccgt gaaattaagg aatttgcaga atgctttagc 1444 Arg Asn Lys Leu 465 taaaatgttc tgattcagga tcagaggatt cttagagagc ttgggaatgt aatgaattac 1504 gtgtgtgttt atacgttgtg tgtggagaaa actgccagta gacagatttt tttttttttt 1564 taacataaag acacattctc agatctgatt acagactagt gtaaaaacag ctacatgtat 1624 gtagctggtt ggggatgata tgcctgtatg ttggactttg cttttgaata tatgtatgca 1684 ggatatcatc 
aagtttcaac atcttgactt gtgaccccca atgtcaacag cttttttaaa 1744 aaacaaattc ctgcagtaat gaccaaaacc cattttaaaa attgcttgac aactgcactc 1804 aactgcagct cttacattaa cttcaccatg gaaaccagtt ccaactccag gaagtcacca 1864 ttcaaagaat tagatcaact agcccaacca cttcattgta cagatgaaga ctgaaagcca 1924 aagatgtgaa gtggtttcca cagtatgata cagcctataa gggtaaagct gggttaaaaa 1984 tgcaggtttc ctggatttgg ggccccatgg ccttgccagt gaaaaggtta tttttggact 2044 cagagggctt taaaataaat tttaagatgt atcagataca caaacattta atgggcacct 2104 atgggttgga cactttgaga attcttaaaa gtataagtgg gagcaaaatg tatgcaaatt 2164 tatcacaaac tatttaaagc aacttcttgg aggcttacaa accacaattt aacagaaact 2224 gtagatggtt gaactactag tgactttttt ccccttttcc cagttacaat tatactttca 2284 gctaacatat gccagtttca cagaactatt aagtcccctt attgtacttt ttatggcatg 2344 cccatgaaaa agcactttct taagcctaca gtatcagatc aatgggaaaa caacagaaaa 2404 ctaagaggag aattttcccg ttaattttct tgcagaaaag tataagtcta attgcccatt 2464 gccataaatt ttgtcttgta ctcagagaag caacatgcac tggctcattt tatgtgcaaa 2524 gaaaagattt caccattaaa aaaattaact tggctaggta tggtgtctca cacctgtaat 2584 cccagcactt tgggtggcta aggcagatag actgcttgaa cccaggagtt caagaccagc 2644 ctggacaaca tggtgaaacc ccatctcttt aaaaaaaaaa aaaaaaatcc aaaaattagc 2704 tgggcatggt ggcatgcagt ggtagtccca gctactcagg aggctgaggt gggaggatca 2764 ctggaacccg ggagcagaga ctgcagtgag ctgagatcac actactgcat tccagcctga 2824 gcaacagagc aagacacaca cacacatcaa tttattttag ttgtataatg cttttctatt 2884 agtaaagcat cagctaagct tcagtggcct gctccatccc ctaatgactc ccatgggcta 2944 tcctaaagga acttccagaa cctttgttgg tgtgttgaca ttgaccatgc agaccaattt 3004 gggcacaact ggacattgat tccttttaca caagagctgc ctcccaaaga tagataaatt 3064 ttcccagccc taaatatgaa tcatggggca agatattggt cgtattgatg gtgaaccttt 3124 cctactggat tctttgcatg ccacatagca ggattcattg cctttctctc atcatggatg 3184 gcatgcagca gcacccaagt attcttcatt ctttgcaggg aaaaaattgt gcatgggggc 3244 tgaaatgtag tatgtgtagc tcaattagtc tctcctctgt gatgcaaaat ggaatattca 3304 atggcagatc tgcccttctg agatgctgac catccaaaac accttgttta tggtgcacca 3364 tgattagctc acacacaatg 
ccaaggctgt gcttctatta tctgatacat agtttgacaa 3424 tgggtaattc tactcagacc ctccctactg attggctagg atgcctgtca ggaactcatt 3484 atgctactgg ttgtttgggg atccccatag tggactactt tcaggaatgg catgaattgt 3544 aaccaactga gtgctgcccc cactgttacg gaagtttata aaaccttagt tccagaagac 3604 ccaaaggaga gtactggttt gtgtttggtg cttggcctag atccagccac cactctgaaa 3664 ctcatcacat cttcattgac agggagggag cccaggacat atgtgtggct cattgaccag 3724 aaggctttct tagtcccaac agccatgaac catgcactta tggataccca gccttttagg 3784 gctacgtgaa atgcatcctt gtaacatcat tgtattcttt caataaatag ccttctgagt 3844 tg 3846 <210> 15 <211> 3002 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (2033)..(2884) <400> 15 ccggcttgag gcatccagca gccagtcctt tggtcttgga ccacaccgag gacggctcaa 60 cattcagtca ggcctggagg acggcgatct atatgatgga gcctggtgtg ctgaggagca 120 ggacgccgat ccatggtttc aggtggacgc tgggcacccc acccgcttct cgggtgttat 180 cacacagggc aggaactctg tctggaggta tgactgggtc acatcataca aggtccagtt 240 cagcaatgac agtcggacct ggtggggaag taggaaccac agcagtggga tggacgcagt 300 gagtggtccc actgtggctg gggcctccat gctgggagtt gggcacccag tccaggctag 360 gctgaggctc ctctgaggac aaggaataga cgccagctta ggcttcccag gggggtgtgg 420 cttgttgtca agagggtggc acacggcagg caccattggg agccagctgc tttgggacat 480 gcccacatcc tccccagata atgccaccac agggtgggtg ctgcttcacg gtacagcttc 540 ctcctggcat gccccttctg gcccggggcc tctggtccac atcacttctt gccttctcgt 600 ggttctgact tccgcatctc atggacctct ttttacagca ggctacaatg tggagtcctg 660 gccagctcta ggattggctt cccccgagtc atgtggccaa actggtctaa tgaactgtgt 720 ccaatccaga gagcaaggct gcctagggct gcccattggc aggggctgtg ggccggggtc 780 tgtgtttgat gcacagtgca agtctctagc tgagcccact agggtgggga gacagtgagc 840 ttggaggcct gagctccttc cctgggtcct gggccaggct tctggggttt gagcagccac 900 aacagagaac ttgctgcccc caggtatttc ctgccaattc agacccagaa actccagtgc 960 tgaacctcct gccggagccc caggtggccc gcttcattcg cctgctgccc cagacctggc 1020 tccagggagg cgcgccttgc ctccgggcag agatcctggc ctgcccagtc tcagacccca 1080 atgacctatt ccttgaggcc cctgcgtcgg gatcctctga ccctctagac tttcagcatc 1140 acaattacaa 
ggccatgagg aaggtcagat ataaccccta tgacctggga aggagggccc 1200 acccatctca ggtccccttc ccaccttccc accggggcac aacctgctgt gactgcgctt 1260 gtatgcccct gctgcctcct gatgtctcag ccttctctcc tgtggacccc taagctccat 1320 cccactttcc cttattatgg cgccccccca gtcctgcccc ttcctcccgg ctctgctgcc 1380 gctcccctcc tgtaccatga tgggatgccc cctctgtgtg ggccatcgct gactttttaa 1440 gtctttccat ggcacatgtg atctgcccct gggtgtaccc ctcccatgcc tcatgccacg 1500 ctacactctg cccaccagct gatgaagcag gtacaagagc aatgccccaa catcacccgc 1560 atctacagca ttgggaagag ctaccagggc ctgaagctgt atgtgatgga aatgtcggac 1620 aagcctgggg agcatgagct gggggagcct gaggtgcgct acgtggctgg catgcatggg 1680 aacgaggccc tggggcggga gttgcttctg ctcctgatgc agttcctgtg ccatgagttc 1740 ctgcgaggga acccacgggt gacccggctg ctctctgaga tgcgcattca cctgctgccc 1800 tccatgaacc ctgatggcta tgagatcgcc taccaccggg gttcagagct ggtgggctgg 1860 gccgagggcc gctggaacaa ccagagcatc gatcttaacc ataattttgc tgacctcaac 1920 acaccactgt gggaagcaca ggacgatggg aaggtgcccc acatcgtccc caaccatcac 1980 ctgccattgc ccacttacta caccctgccc aatgccaccg tgagtatttt ga ggg cgg 2038 Gly Arg 1 cag tgg agg tct gtg ggg ggc gga cct tgt ctc tgt ctc ctg ccc ctc 2086 Gln Trp Arg Ser Val Gly Gly Gly Pro Cys Leu Cys Leu Leu Pro Leu 5 10 15 ctg acc tgc ccc atc cag gtg gct cct gaa acg cgg gca gta atc aag 2134 Leu Thr Cys Pro Ile Gln Val Ala Pro Glu Thr Arg Ala Val Ile Lys 20 25 30 tgg atg aag cgg atc ccc ttt gtg cta agt gcc aac ctc cac ggg ggt 2182 Trp Met Lys Arg Ile Pro Phe Val Leu Ser Ala Asn Leu His Gly Gly 35 40 45 50 gag ctc gtg gtg tcc tac cca ttc gac atg act cgc acc ccg tgg gct 2230 Glu Leu Val Val Ser Tyr Pro Phe Asp Met Thr Arg Thr Pro Trp Ala 55 60 65 gcc cgc gag ctc acg ccc aca cca gat gat gct gtg ttt cgc tgg ctc 2278 Ala Arg Glu Leu Thr Pro Thr Pro Asp Asp Ala Val Phe Arg Trp Leu 70 75 80 agc act gtc tat gct ggc agt aat ctg gcc atg cag gac acc agc cgc 2326 Ser Thr Val Tyr Ala Gly Ser Asn Leu Ala Met Gln Asp Thr Ser Arg 85 90 95 cga ccc tgc cac agc cag gac ttc tcc gtg cac ggc aac atc atc aac 2374 Arg Pro Cys 
His Ser Gln Asp Phe Ser Val His Gly Asn Ile Ile Asn 100 105 110 ggg gct gac tgg cac acg gtc ccc ggg agc atg aat gac ttc agc tac 2422 Gly Ala Asp Trp His Thr Val Pro Gly Ser Met Asn Asp Phe Ser Tyr 115 120 125 130 cta cac acc aac tgc ttt gag gtc act gtg gag ctg tcc tgt gac aag 2470 Leu His Thr Asn Cys Phe Glu Val Thr Val Glu Leu Ser Cys Asp Lys 135 140 145 ttc cct cac gag aat gaa ttg ccc cag gag tgg gag aac aac aaa gac 2518 Phe Pro His Glu Asn Glu Leu Pro Gln Glu Trp Glu Asn Asn Lys Asp 150 155 160 gcc ctc ctc acc tac ctg gag cag gtg cgc atg ggc att gca gga gtg 2566 Ala Leu Leu Thr Tyr Leu Glu Gln Val Arg Met Gly Ile Ala Gly Val 165 170 175 gtg agg gac aag gac acg gag ctt ggg att gct gac gct gtc att gcc 2614 Val Arg Asp Lys Asp Thr Glu Leu Gly Ile Ala Asp Ala Val Ile Ala 180 185 190 gtg gat ggg att aac cat gac gtg acc acg gcg tgg ggc ggg gat tat 2662 Val Asp Gly Ile Asn His Asp Val Thr Thr Ala Trp Gly Gly Asp Tyr 195 200 205 210 tgg cgt ctg ctg acc cca ggg gac tac atg gtg act gcc agt gcc gag 2710 Trp Arg Leu Leu Thr Pro Gly Asp Tyr Met Val Thr Ala Ser Ala Glu 215 220 225 ggc tac cat tca gtg aca cgg aac tgt cgg gtc acc ttt gaa gag ggc 2758 Gly Tyr His Ser Val Thr Arg Asn Cys Arg Val Thr Phe Glu Glu Gly 230 235 240 ccc ttc ccc tgc aat ttc gtg ctc acc aag act ccc aaa cag agg ctg 2806 Pro Phe Pro Cys Asn Phe Val Leu Thr Lys Thr Pro Lys Gln Arg Leu 245 250 255 cgc gag ctg ctg gca gct ggg gcc aag gtg ccc ccg gac ctt cgc agg 2854 Arg Glu Leu Leu Ala Ala Gly Ala Lys Val Pro Pro Asp Leu Arg Arg 260 265 270 cgc ctg gag cgg cta agg gga cag aag gat tgatacctgc ggtttaagag 2904 Arg Leu Glu Arg Leu Arg Gly Gln Lys Asp 275 280 ccctagggca ggctggacct gtcaagacgg gaaggggaag agtagagagg gagggacaaa 2964 gtgaggaaaa ggtgctcatt aaagctaccg ggcacctt 3002 <210> 16 <211> 3521 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (783)..(1502) <400> 16 ctggggctct gcgctcgtcc ggccggcccc ggcctcgccg ccccgcgcag tacccagccc 60 ggccccgccg acccgcctct actgccggct ccgcgccctt ccccgagggc tggatgatgg 120 gctgtttcgc 
cctgcaaacg gtggacaccg agctgaccgc ggactcggtg gagtggtgcc 180 cgctgcaagg ctgcaggcac ctgctggcgt gcgggaccta ccagctgcgg cggccggagg 240 accggcctgc cggcccccag aacaagggtg gaatggaagt taaggagcct caggtccgtt 300 taggccgtct cttcctgtac agtttcaatg acaacaactc tattcaccct ctggtcgagg 360 tccaaagaaa agatacttct gcaatcctgg acatgaaatg atatacaacc acttcctgtt 420 gtgtcacact tgcctgcagt aacatgctgt aaggtttgtg gcataggagc agtgggccat 480 accatataat caaggtgtca catcccggtg gctggacatg ccctcttggg cttggcagat 540 gccagtggat ccatacaact gctccgcctg gtggaatctg agttgtattg cgttctacat 600 ccaactccaa gaggaagtca cctcatgtgt caccagcaga agggctgaag tgacaggatg 660 ttcattgacc tgtcagtgga tctgaaagtt ctctaaggag agcctgggca agcattctta 720 ggttgatgct ggggcccaga gtagcagtga gcatcctgtg tgaagatggc atttctcact 780 ga tta ttg gaa aag cac aag agc cac gtg ctg gag cca ttg tcc agc 827 Leu Leu Glu Lys His Lys Ser His Val Leu Glu Pro Leu Ser Ser 1 5 10 15 ctt gcc ctg gag gag cag tgt ctg gct ttg tcc cta gat tgg tcc act 875 Leu Ala Leu Glu Glu Gln Cys Leu Ala Leu Ser Leu Asp Trp Ser Thr 20 25 30 ggg aaa act gga agg gcc ggg gac cag ccc ttg aag atc atc agc agt 923 Gly Lys Thr Gly Arg Ala Gly Asp Gln Pro Leu Lys Ile Ile Ser Ser 35 40 45 gac tcc aca ggg cag ctc cac ctc ctg atg gtg aat gag acg agg ccc 971 Asp Ser Thr Gly Gln Leu His Leu Leu Met Val Asn Glu Thr Arg Pro 50 55 60 agg ctg cag aaa gtg gcc tca tgg cag gca cat caa ttc gag gcc tgg 1019 Arg Leu Gln Lys Val Ala Ser Trp Gln Ala His Gln Phe Glu Ala Trp 65 70 75 att gct gct ttc aat tac tgg cat cca gaa att gtg tat tca ggg ggc 1067 Ile Ala Ala Phe Asn Tyr Trp His Pro Glu Ile Val Tyr Ser Gly Gly 80 85 90 95 gac gat ggc ctt ctg agg ggc tgg gac acc agg gta ccc ggc aaa ttt 1115 Asp Asp Gly Leu Leu Arg Gly Trp Asp Thr Arg Val Pro Gly Lys Phe 100 105 110 ctc ttc acc agc aaa aga cac acc atg ggt gtg tgc agc atc cag agc 1163 Leu Phe Thr Ser Lys Arg His Thr Met Gly Val Cys Ser Ile Gln Ser 115 120 125 agc cct cat cgg gag cac atc ctg gcc acg gga agc tat gat gaa cac 1211 Ser Pro His Arg Glu His Ile Leu 
Ala Thr Gly Ser Tyr Asp Glu His 130 135 140 atc cta ctg tgg gac aca cga aac atg aag cag ccg ttg gca gat acg 1259 Ile Leu Leu Trp Asp Thr Arg Asn Met Lys Gln Pro Leu Ala Asp Thr 145 150 155 cct gtg cag ggt ggg gta tgg aga atc aag tgg cac cct ttc cac cac 1307 Pro Val Gln Gly Gly Val Trp Arg Ile Lys Trp His Pro Phe His His 160 165 170 175 cac ctg ctc ctg gcc gcc tgc atg cac agt ggc ttt aag atc ctc aac 1355 His Leu Leu Leu Ala Ala Cys Met His Ser Gly Phe Lys Ile Leu Asn 180 185 190 tgc caa aag gca atg ggt ggg agc gca gtg gtg gag tca gag ctc act 1403 Cys Gln Lys Ala Met Gly Gly Ser Ala Val Val Glu Ser Glu Leu Thr 195 200 205 gca gcc cgg acc acc tgg aca caa gca gtc ctc ctg cct caa ctc agt 1451 Ala Ala Arg Thr Thr Trp Thr Gln Ala Val Leu Leu Pro Gln Leu Ser 210 215 220 cct cag tgt cgg gat tac agg ctt gag cca ctg cat cca gcc ttc agt 1499 Pro Gln Cys Arg Asp Tyr Arg Leu Glu Pro Leu His Pro Ala Phe Ser 225 230 235 tct taaagttgaa gaggtgaact aagacaggtc tgagagttca acattccagg 1552 Ser 240 ccagttttgt caggctgtgg ctctctgagg gcttagactc tgccttatta ccttttccta 1612 agctgtagaa gaggtcctgc tggcccgagt ggacgtgttg agcgattgct ggatgaaaga 1672 ctagcggagg tcctgcctgc cctgtgacgc tacctcccaa gggacagtaa cggcctcccc 1732 aggggcttcc tcccgagggt cgccccacac atgctgctcg agggcccctg tggctgtctc 1792 tgaacattca gcatctcact cctcctcgga gccacctctg cccagttgaa gccctgggcc 1852 cagggttggg cctccccggc ccaggcccat acaacctcgg cacgttttgt aatccagtgc 1912 ggtcctgttc tgccctcagc ccttcttcca tcagcactca ttctggttct gattccggtc 1972 actgctggca tcaatccctg tacttcagag gtagcgatgg cagccctttc agagcccaag 2032 tggccaccca gccttgcagc ccctggctca ccttccactt gctctctgca actgcccctg 2092 ccagccctgg ccttgcctcc tggactccag atcgcctcgg gtttggccac gcctcgtcat 2152 tccctctgtt gctgtgtgct ctgggaggca gttgtggcct ctttccagtt gccgtagcaa 2212 cttctataaa gaaactgctc acttaacctt gtcagtggct cctttccaag ccagaatcca 2272 gactctgaaa cagggcgtgc cggaccctgc atgcttggga cgctccagcc cactgtcact 2332 cactgacagt gccatggctg cctgtttgtg gttccccgtg cgagccttgg actttctcac 2392 ctctgctgta 
cacatgctgc ctgttcctca gacatcccct tccttcccag ctggctaaaa 2452 ctcctttgtt ctttggtttt aagtgttaac tgaggtcttt ccccctctct ggcttttgcc 2512 gtattcttca ccatcaagtg aggttctgta gccccgcttg atgcttctct tggaggcaca 2572 caggcgtgtg cactgcaaga tgacggtctg tttcttctcc aacatatgcc caggtcttgc 2632 tggttccttt gcatgacaca acggctcctc aggaaacact tgctgacctg tgaggggctg 2692 ttcttggagt ctccctgtca ggtgtgacag gttggggaga ggctctgggt gatgttgtgt 2752 ttcctgggtt ctcccatctc tcactccaga ggagaggcag gaggcgacgg tcctgacatc 2812 tcacacattg cccgactcgc tggtgtatgg agccgactgg tcctggctgc tcttccgttc 2872 tctgcagcgg gccccctcgt ggtcctttcc tagcaaccta ggaaccaaga cggcagacct 2932 gaagggtgca agcgagttgc caacaccctg tcatgaatgc agagaggata acgatgggga 2992 gggccatgcc agaccccaga gtggaatgaa gccactcaca gagggcatga ggaagaatgg 3052 cacctggctg caggctacag cagccaccac acgtgactgt ggcgtgaacc cagaagaagc 3112 agactcagcc ttcagcctcc tggccacctg ctccttctat gaccatgcgc tccacctctg 3172 ggagtgggag gggaactgag cttgaaatca tgaagcccct tcccacaagg aaaccaggag 3232 ggagactgcg agtgagtgcc cgggaccacc tcatcagaga tgcttactgc agccctgcag 3292 gtgcctgtgc actgatggaa tccacagtgt agtcagaaaa gctgttgact tctcttaaat 3352 cagcttccct gctgggcccc tgaaagtgga ctgggtgatt ctgtctggca gagagtgggg 3412 aaaagacgcg gtttccagct tgcagatttg ttaagtttct caggcagatt ttgactttca 3472 gcctttcata cttgtttaag caactatttg tattaaatga agttttttg 3521 <210> 17 <211> 2976 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (26)..(1468) <400> 17 gcggtttccc gccgcgggtg aatga gtc cga ctg cgc tgc cgc cgc cgc cgc 52 Val Arg Leu Arg Cys Arg Arg Arg Arg 1 5 cgc ccg cat cga agc tcc ggc cgc gtc ccc cgc gcc agc ccc ggg agc 100 Arg Pro His Arg Ser Ser Gly Arg Val Pro Arg Ala Ser Pro Gly Ser 10 15 20 25 ctg cgc gcc gag acc cct gcg tgt ctg cga aca gac ctc cgg cct cgg 148 Leu Arg Ala Glu Thr Pro Ala Cys Leu Arg Thr Asp Leu Arg Pro Arg 30 35 40 gcc atc acc ccg ggc aga cgc tgc agc ccc tgt ccg ggc ctc ttg cct 196 Ala Ile Thr Pro Gly Arg Arg Cys Ser Pro Cys Pro Gly Leu Leu Pro 45 50 55 gcc gcc ccc atg aga aaa acc aac atg tgg ttc ttg 
gag cgg ctt cgg 244 Ala Ala Pro Met Arg Lys Thr Asn Met Trp Phe Leu Glu Arg Leu Arg 60 65 70 ggg tct ggg gaa aac ggt gct gcc cgg ggc gtg ggg agt gag gcg ggg 292 Gly Ser Gly Glu Asn Gly Ala Ala Arg Gly Val Gly Ser Glu Ala Gly 75 80 85 gac aag gcc tcc aag ggg ccc ctg tac agc aat gtg ttg acg ccc gac 340 Asp Lys Ala Ser Lys Gly Pro Leu Tyr Ser Asn Val Leu Thr Pro Asp 90 95 100 105 aag atc ccc gac ttt ttc atc ccc ccc aag ctg ccc tcg ggc ccc gcg 388 Lys Ile Pro Asp Phe Phe Ile Pro Pro Lys Leu Pro Ser Gly Pro Ala 110 115 120 gag ggc gag gga cag gcc gcg ctg ggc ccc tcc acg tcg gaa cag aac 436 Glu Gly Glu Gly Gln Ala Ala Leu Gly Pro Ser Thr Ser Glu Gln Asn 125 130 135 ctg gcc tct gcg gcc ccc cgc cag acc cca cgg agc ccc cgg ctg cct 484 Leu Ala Ser Ala Ala Pro Arg Gln Thr Pro Arg Ser Pro Arg Leu Pro 140 145 150 gcc aag ctg gca gcc gag agc aag agc ctg ctg aag gca gcc acc cgg 532 Ala Lys Leu Ala Ala Glu Ser Lys Ser Leu Leu Lys Ala Ala Thr Arg 155 160 165 cac gtg atc cag atc gag agt gcc gag gac tgg ctg tcc gag gag gcc 580 His Val Ile Gln Ile Glu Ser Ala Glu Asp Trp Leu Ser Glu Glu Ala 170 175 180 185 act gac gcc gac ccc cag gcc cag ggt gcc atg tcc ctg ccc tcg gtg 628 Thr Asp Ala Asp Pro Gln Ala Gln Gly Ala Met Ser Leu Pro Ser Val 190 195 200 ccc aag gcc cag acg tcc tac ggc ttc gcc atg ctg gcc gag agc ccc 676 Pro Lys Ala Gln Thr Ser Tyr Gly Phe Ala Met Leu Ala Glu Ser Pro 205 210 215 cac acg agg cgc aag gag tct ctg ttc cac agt gag cac ggg gct ctg 724 His Thr Arg Arg Lys Glu Ser Leu Phe His Ser Glu His Gly Ala Leu 220 225 230 gcc cag gtg ggc tcc cca ggg gcc ggg cgc cgc cgg gca gct gcc aag 772 Ala Gln Val Gly Ser Pro Gly Ala Gly Arg Arg Arg Ala Ala Ala Lys 235 240 245 gcc aac ggg ggt gat ggg ggc ccc agg gag gct ggc ggg gcc ctc atg 820 Ala Asn Gly Gly Asp Gly Gly Pro Arg Glu Ala Gly Gly Ala Leu Met 250 255 260 265 agc ccc ggc cgc tac ttc agt ggc ggg gag agc gac aca ggg tcc tcg 868 Ser Pro Gly Arg Tyr Phe Ser Gly Gly Glu Ser Asp Thr Gly Ser Ser 270 275 280 gcc gag tcc tcc ccc ttc ggg 
tcc cct ctg ctg tcc cgc tcc gtg tct 916 Ala Glu Ser Ser Pro Phe Gly Ser Pro Leu Leu Ser Arg Ser Val Ser 285 290 295 ctg ctc aaa ggt ttc gcc cag gac agc cag gcc aag gtg agc cag ctc 964 Leu Leu Lys Gly Phe Ala Gln Asp Ser Gln Ala Lys Val Ser Gln Leu 300 305 310 cgg cac tcc gtg ggc cgc cac ggc tcc ctg tct gcg gac gac agc acc 1012 Arg His Ser Val Gly Arg His Gly Ser Leu Ser Ala Asp Asp Ser Thr 315 320 325 ccg gac gcc agc ccc ggg agc cgg cgc cgc ctg acc cgc cgg gca ccc 1060 Pro Asp Ala Ser Pro Gly Ser Arg Arg Arg Leu Thr Arg Arg Ala Pro 330 335 340 345 ccg gaa cct ggc ccc gag tcg ggc cag gcg cgt ggg gag cac acg gtc 1108 Pro Glu Pro Gly Pro Glu Ser Gly Gln Ala Arg Gly Glu His Thr Val 350 355 360 cac gtg ggc cct cgg ggc agc gtg cgg ctg ctg gcc gag tac gag gcc 1156 His Val Gly Pro Arg Gly Ser Val Arg Leu Leu Ala Glu Tyr Glu Ala 365 370 375 ggc cag gcc cgc ctg cgg gtg cac ctg ctg gcc gcc gag ggc ctc tac 1204 Gly Gln Ala Arg Leu Arg Val His Leu Leu Ala Ala Glu Gly Leu Tyr 380 385 390 gac cgc ctg tgc gac gcc cgc agc atc aac tgc tgc gtg ggc ctg tgc 1252 Asp Arg Leu Cys Asp Ala Arg Ser Ile Asn Cys Cys Val Gly Leu Cys 395 400 405 ctg gtg ccg ggc aag ctg cag aag cag cgc agc acc atc gtg aag aac 1300 Leu Val Pro Gly Lys Leu Gln Lys Gln Arg Ser Thr Ile Val Lys Asn 410 415 420 425 agc cgc cgc ccc gtc ttc aac gag gat ttc ttc ttc gac ggc ctg ggg 1348 Ser Arg Arg Pro Val Phe Asn Glu Asp Phe Phe Phe Asp Gly Leu Gly 430 435 440 ccc gcc agc gtc cgg aaa ctg gcc ctc agg atc aag gtg gtg aac aag 1396 Pro Ala Ser Val Arg Lys Leu Ala Leu Arg Ile Lys Val Val Asn Lys 445 450 455 ggc agc agc ctc aag cgg gac acg ctg ctc ggg gag aag gag ctg ccc 1444 Gly Ser Ser Leu Lys Arg Asp Thr Leu Leu Gly Glu Lys Glu Leu Pro 460 465 470 ctg acc tcc ctg ctc ccc ttc ctg tagagcgggg acctgctcgc tgtccgctcg 1498 Leu Thr Ser Leu Leu Pro Phe Leu 475 480 ggtgcagagg gcagaccccg gtctctcctc cacaccgggt gctggcgggc gggtccgctg 1558 gcttgggctg gggctgggcg gggaggggct cacactcagg ctgggctggg cgctgatgct 1618 gtagctgacg caggttttcc 
tgggtcaccc gatgctgact ggagtattat ttttcatgaa 1678 ttaatttctc ctttggagga ggaggaggag gaggagaggg gcctcggggg aagagaaggg 1738 gccacttcgc ggggcagagg gtcccccttt gcccctccac actgcctcgc gcctttcacg 1798 cagagttcta gaaggaggtt ctcaggggtc caggaatccg tgcacgtggc cagggcgggg 1858 ccgtggacag ggaggtgaca agtcttggag cccacagtct ccttcactgc acgtgtgtct 1918 tcctccccca cagcctgccg ccatccagtt cccaccccgc ggagagcagg gcctcgcggc 1978 cacaccaggc ccaggctctg cccgggacgc cccctcccag gggtcaggct tggctctggg 2038 agaccccagc ttcccaggct ccggtgggga gcggggccag gcgagggtgc cctgacgagg 2098 gtgggcgcgg agggagcagc cgctgccaca gtcggagcag aggaggtctt ggcagaggtc 2158 cgcggggagg agacggctct cctggcctgt tttcctccga cactgtcgcc gctggctttt 2218 tgggtacagg cccgtctggc catttctgct gtggcgccgg agggtcttgc tcagagatgc 2278 ccagagagca gccctatggc ccatccgtcc tcgctggggg ctgccacccc gtcgcccttt 2338 tgtccagcgt ctgcagctgg gagcgacacg cagtgattga cggccgtggc tttgacgcca 2398 cgtgtgtaaa taggctgtgg ctacggtggg tgggtcggag cccaggaggc agctgggggg 2458 acctggctat ggcctttcct gctgttaggt ggctgcaagg cttccattgt gtggttatgt 2518 gggtccttga atgattttgg ggaggtttcc agctcagaat gatgcagaaa tgataagact 2578 caaagcaggg gccaggccag gccagtgcct tcgcctctcc cggctgctgg tgggcacggg 2638 ggaaccaggg cacatctgtg gtacccaggg acgtcccttg tcagcccgtt tgccacacat 2698 tgttcctctt gtccagggga gggtggagga gctgcttccc aggactggag gagcagctgg 2758 gcccctgctg cacgtccggt gggacacacc tgtgagcccc gccttctctg atgctcaccc 2818 ctcccaggcc ctgttctccc tggggtcccc cgtttagtag cccccctgca ctctttgata 2878 tcttagtgtc tgaggttgac tgtgggtaaa tctttaagac actccccagc tgtgtttgtt 2938 tataaatgct gtttatagtg caataaaggt gtttcggg 2976 <210> 18 <211> 5385 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (2088)..(2885) <400> 18 cccagcttga ctagttgttt ttcttttctg gtctgttctg ctcttttggt ttaaataatt 60 aaagatcctt gttgacaagt aaaaaaaatt ctttatgaag ctaaaaaaag tggttctaat 120 taaacattta aacattcttc catatacttt taaaaacagt tgtaattata gctctactta 180 atgttaagta ctttaaccaa atgaaaggtt aattatacag actggcaatt ataaaaaata 240 aggtaacata ttcatggcat tatgaataat 
taataagaac tcttcgattt catttagacc 300 agggattctg tgctaatatg ctctcagctg tcttgaaata atggtgaaat tttaaaatta 360 gttttgtact taaacatgtc cccaaagtga agcacagcgg ccaacgtccg tcacggtgcc 420 gagcagtgcc agccgctcta ccatggccga cggcctgatt cccaagggct gcgtccaggg 480 gccgtgctag tgcagtgaat gttcaggaaa cagaccaggt ggcaggaggg agtcttgtcc 540 tcaaggaacc tcataagtct tgctaggggg acaaacaacg agagaatgaa actggcagcc 600 atagaatccg ggccttgatg cccaggcctc gccatgcata cctgctggcc ggcatctccc 660 tgagacccgt ctggcacagg caacggcagg gcccggccct tggagcacag ggatggcaga 720 gcagggtcat cgaagcaggt ggcgtgcaga cagtggcttc tccgcaagag ctcctggctc 780 tcagggagct gagggctcct caatggggaa ttctggctga gctccagggg aaagcgggtt 840 ggaggcctgg ctcctccgcc cccatctcct ctgccccaac cctggaaggg ggtcagatat 900 caatgctaac atttgtgctg actcaattaa ttcaaacgat tcacaaaaac aaacttgctt 960 aaagtattaa tttcctaaaa agaactaaaa tgagttcaaa taattcacat aattaaatta 1020 ttttaattgt taatttaatt aaaacatctc agttttcaaa gcaccacgtg ccttggtact 1080 ttctccttat tacatgtccc atcttccagg gaggtttcct tccacccact gtagcagcgg 1140 attctggtgt ctcgggtgtt cccaggcttc cgtcagcccc tggggggttt acagttctgc 1200 agcagtctcc atcctaatta atctgcataa gaacatccgt ccgcagggat gtcctgtggc 1260 cctgtggcca cgtctgacat gcagctaaac actggacgtg tcacctggcg acagcggatg 1320 atgaaactgt gggcagggta gcacctctgg gcgccaacac tcttaacatg tttttttttt 1380 acaagaccat tctttaatgg gtgaaccctg aaaggaaacg ctaaagattg tttttaaata 1440 tttttatata gagacaaatc tttatcaaac attcactcaa tgctaaatgt cacttctcca 1500 gccaacccct ttaaaaaata cctaaaacta agactggaat gacaccttca aacacatact 1560 ctaataatgc attcccggga gcagagagag cttagcagct taatgttttc atgatataca 1620 agcatgaagt ttggcgaagg cattttcata tgtttacctg aatattttaa taaaatcaaa 1680 agatcccttt cactttaata ctctacttag aatgtaattt tccattaaat aaatagtcaa 1740 tgttctgtaa cccacatgct gataaacttc aaaactttag aaagtattaa gcgtaatcaa 1800 agattacttt taatttttga gctgggaaga gcctgtgcca gggaacagag ttgagattga 1860 agagataaaa gcacgattat tttatgaaag aaaagcgaaa ttcataaaag agaccatgtg 1920 attctataaa gcaacacaga atccgttttc agaagaggca aacgtggacc 
acaaaacgcc 1980 agtgttgcct gtgaagagag aaacagttaa aaggaagaat tcgcggtggc ccagtatagg 2040 tcaaggtgtg ggaggacagg gaggcgatga aatgatcctc cttctaa ctt ccc aga 2096 Leu Pro Arg 1 aag gta caa gcg gcc agc act gta act gga aac cac gcg ggc acc agc 2144 Lys Val Gln Ala Ala Ser Thr Val Thr Gly Asn His Ala Gly Thr Ser 5 10 15 gtg aac tca gac ccg aca gcc agc acc gta act gga gac cat gcg ggc 2192 Val Asn Ser Asp Pro Thr Ala Ser Thr Val Thr Gly Asp His Ala Gly 20 25 30 35 acc ggt gtg aac cca gac tcc acg gcc agc acc gta act gga gac cat 2240 Thr Gly Val Asn Pro Asp Ser Thr Ala Ser Thr Val Thr Gly Asp His 40 45 50 gcg ggc acc ggt gtg aac tca gac ccg acg gcc agc acc gta act gga 2288 Ala Gly Thr Gly Val Asn Ser Asp Pro Thr Ala Ser Thr Val Thr Gly 55 60 65 gac cac gcg ggc acc ggt gtg aac cca gac tcc acg gcc agc acc gta 2336 Asp His Ala Gly Thr Gly Val Asn Pro Asp Ser Thr Ala Ser Thr Val 70 75 80 act gga gac cac gcg ggc acc ggt gtg aac cca gac tcc acg gcc agc 2384 Thr Gly Asp His Ala Gly Thr Gly Val Asn Pro Asp Ser Thr Ala Ser 85 90 95 acc gta act gga gac cac gcg ggc acc ggt gtg aac tca gac ccg acg 2432 Thr Val Thr Gly Asp His Ala Gly Thr Gly Val Asn Ser Asp Pro Thr 100 105 110 115 gcc agc acc gta act gga gac cac gcg ggc acc ggt gtg aac cca gac 2480 Ala Ser Thr Val Thr Gly Asp His Ala Gly Thr Gly Val Asn Pro Asp 120 125 130 tcc acg gcc agc acc gta act gga gac cac gcg ggc acc ggt gtg aac 2528 Ser Thr Ala Ser Thr Val Thr Gly Asp His Ala Gly Thr Gly Val Asn 135 140 145 cca gac tcc acg gcc agc acc gta act gga gac cac gcg ggc acc ggt 2576 Pro Asp Ser Thr Ala Ser Thr Val Thr Gly Asp His Ala Gly Thr Gly 150 155 160 gtg aac cca gac ccg acg gcc agc acc gta act gga gac cac gcg ggc 2624 Val Asn Pro Asp Pro Thr Ala Ser Thr Val Thr Gly Asp His Ala Gly 165 170 175 acc ggt gtg aac cca gac tcc acg gcc agc acc gta act gga gac cat 2672 Thr Gly Val Asn Pro Asp Ser Thr Ala Ser Thr Val Thr Gly Asp His 180 185 190 195 gcg ggc acc ggt gtg aac cca gac tcc acg gcc agc acc gta act gga 2720 Ala Gly Thr Gly Val Asn Pro 
Asp Ser Thr Ala Ser Thr Val Thr Gly 200 205 210 gac cat gcg ggc acc ggt gtg aac cca gac tcc acg gcc agc acc gta 2768 Asp His Ala Gly Thr Gly Val Asn Pro Asp Ser Thr Ala Ser Thr Val 215 220 225 act gga gac cac aca ggc acc ggt gtg aac tca gac gca acg gca ccc 2816 Thr Gly Asp His Thr Gly Thr Gly Val Asn Ser Asp Ala Thr Ala Pro 230 235 240 tcc tgg gct gct cca gcc acc acc aga acc cgt tct gca gct gaa ctc 2864 Ser Trp Ala Ala Pro Ala Thr Thr Arg Thr Arg Ser Ala Ala Glu Leu 245 250 255 gaa gct gat tct gga agg aac tgaggaggat gaggacgaga tccttctacc 2915 Glu Ala Asp Ser Gly Arg Asn 260 265 tcaaccggga tcaatttttt aatgatttct ttcttttaga aaaatgttaa taagatactt 2975 tgttctaaaa tgaccaggat ttcagcagct tctacatttc ttcatgtcaa tcaaagctta 3035 ggcacgtggg ttacttcact gttctctgca ccctttccaa ataaaacccg gggccgcctg 3095 tgtacccaag agcctgggca ctgagcagag agagaaaacg tccagggcaa cagcgttcac 3155 attccggcgg tgccatagag cagacggggg gagtccgagg cggtgccgca gagcggacgc 3215 ggggagtccg aggcggtgct gcagagcgga cgcggggagt ccgaggtggt gccgtagagt 3275 ggatggggag agttcgagcc acaggtggcc acgcctcccc tgcctccggc cggtgcccac 3335 tgctcacgca ttcctctctc ggcccatcct gagccagggg gccctcgcgt cacccaaggc 3395 cactgcagca cctccaggaa caaaccaccc acagatggac tgctctccat gcaaggtgca 3455 aagattgttt ttaaaactgc tcggctctgg aacttatccc agtaacaggc tagattaaga 3515 aaactgacta aaaacagtag gaggaaaaac tccctccacc cattgaagca caagaaatgc 3575 aaacagcctc tgaaataact gcagacgagt tttcctgggt taggtacgcg gtggggtgac 3635 cctaccccat agcggggccc cggcacacag tgggaactga cgaacatctc tgaataaatg 3695 cgggaaaatg cactcaactc ccaaacagag acagaaactg ctttttttcc ttaaaagctt 3755 ctctctttta ttctcaaaaa gactctcttt ggtggcttca ctgctgaagg gacaagcctg 3815 ttgtccccct ccagcaggct ggccgccttc cagaaagcag tcctggacag tctccagatc 3875 ctatggccct cacatccacc ttgcctagtg atcccgcatt atgaagccta atcatcagaa 3935 agagcctccc agtccacagc tcaggagtgg cgttttattc agggaaacac taataaaaac 3995 ctgggggtca gcgtcaacgc tgtcccctag atactgaggc tccacacaga tggcacacag 4055 acaggcctca ccccttcctg gtgccccaag tctctcctaa cattttatta gctgagtttt 
4115 ctggcggttt gttacttttc acctgcccgt ccaaggcagc tacactaaac tacttgtcca 4175 gggcagatcc actcgtaatg accatcagct cagagttgga agtagcgctg gagaatgagg 4235 tggtttcagg ttgctatgct tccctgccct ctcttccagg gatgccaagt aagtttctgc 4295 catggctgat gatgtcagta ctacactgaa attagttcaa cggcaggcca ccaattagta 4355 cttactgaac aacgtcctta agagtgaaac cctgtctcta ctaaaaatac aaaaatttag 4415 ctgggtgtgt tggtgagcac ctataatctc agctactggg gggctgaggc aggagaattg 4475 tttgaaccca ggaggcagag attgcagtga gccaagattg caccattgca ctccagcctg 4535 ggcaaaaaga acgagactcc atctcaaaaa taaaaaaaaa agaaaaagaa aaagaaaaag 4595 aaaaaaatga ttctcacgac actgtcaaca tcttaagttc ttccggaggg tagcgtaaaa 4655 ttatcacaca caacgtaaag ccaattagat caacattctg tcctcaccga tccataccta 4715 aatccttatt tctaattggc ccccaatttt tcagaattta agaagactga gaggaataac 4775 cacgaacaag caagatttac gcagacgtca gtcttcccaa ctctcgctca ggaactcacc 4835 ccaaactcct cctgtggaat cagcctggat aacagtccct ggtggctaca caagaacaga 4895 gagaggattc cacgaaccca tgacacaccc caccctccaa tccaaggcag aggagagagg 4955 cggacagtca caaaaaccag aagagcaggg aactgtgatt cttctgattt aatgtaggtg 5015 attgggccaa aaaacaaggc aaatccacct gtgaatggag tgggctccca gctggctgtg 5075 gatatgaagg aactcacttt cctgcacact gaagtccatc agcgtctcca catcggccgg 5135 cacctgggct gtgatcgccc ctcacagact gggataacag tggcctcatc tggacaggtt 5195 gtcaaggatg gatggataga cgtcctggac agacactgta cagagaccag ggactgggca 5255 gggggaggtg gcacagcctt agggccagca ggggcgaggc cgggaagccg ggtctctggg 5315 gcgttgaggg cccatggggg cctgggcgag ggcctgcgtc tcctcatcca acagaagcag 5375 aagcggccgc 5385 <210> 19 <211> 4017 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (749)..(3361) <400> 19 cggacgcgct ggaggccggt gggcggtcgc cgtagtcgcc ctcccaccac ctcgctcagg 60 tttctccggc ctggggcaag aggcctgtgg cccgcgcggg aacgcactgt ccaccctcag 120 acctgggact gggccggcgc gcggacgcta ccaagaggct gcggctcccg cccccgcggc 180 cggacgtggc gcctcccctg aggccgcggc cggagcctgg aggtggggtc ggagtcagag 240 cccggggctc tgatgtcacc gcgcggctgc gacggcccag gagcgcgtct cggcgggagc 300 ctcagagcct cactttgtcg ccaggctgga gtgcagtggc 
gcgatctcgg ctcactacaa 360 cttccgcctc ccgggttcaa gcgattcttc tgcctcagcc tcccgagtag ctgggactac 420 aggcgtgtgc caccacgccc agctaatttt tgggaaagat cacatggaga agccaacaga 480 acggccagga agcagcctgg gtggaacagg gttggggagg aaaggaacat gggtacattc 540 tcagcgttcc aggaggtctc agggtaaagg ctgcttgtgt gggcaagggg agcctggcag 600 ggaatgagct ggtcccaatt ccttcccctg agctgggccc tcaaccttca gaggtgctcc 660 ctgggactgg tcctacttcc tctgccacct ctgatcccaa ggtgactctg ccaaaatctg 720 ggggatgcca gcgcttgtga cggtgtga cca ggg agg gga agt gac cca cgc 772 Pro Gly Arg Gly Ser Asp Pro Arg 1 5 agc cac cca gcc tca gca ctc atc tgc gca gcc atg gag gcc ctg gga 820 Ser His Pro Ala Ser Ala Leu Ile Cys Ala Ala Met Glu Ala Leu Gly 10 15 20 cct ggg ggc gac cgc gcc tcc ccg gcc tcg tcc act agc agc ctg gac 868 Pro Gly Gly Asp Arg Ala Ser Pro Ala Ser Ser Thr Ser Ser Leu Asp 25 30 35 40 ctg tgg cat ctg tcc atg cgc gcg gac tcg gcc tac agc tct ttc tcc 916 Leu Trp His Leu Ser Met Arg Ala Asp Ser Ala Tyr Ser Ser Phe Ser 45 50 55 gca gcc tcc ggc ggc ccc gag ccg cgc acg cag tcg ccg ggg aca gac 964 Ala Ala Ser Gly Gly Pro Glu Pro Arg Thr Gln Ser Pro Gly Thr Asp 60 65 70 ctc ctt cct tac cta gac tgg gac tac gtg cgt gtg gtt tgg ggc ggc 1012 Leu Leu Pro Tyr Leu Asp Trp Asp Tyr Val Arg Val Val Trp Gly Gly 75 80 85 ccg ggc ccc gcc ccg ccc gac gct gcc ctt tgc aca tcc ccg cgg ccc 1060 Pro Gly Pro Ala Pro Pro Asp Ala Ala Leu Cys Thr Ser Pro Arg Pro 90 95 100 cgg ccc gcg gtt gca gcc cgc agt ggg ccg cag cca aca gag gtc ccg 1108 Arg Pro Ala Val Ala Ala Arg Ser Gly Pro Gln Pro Thr Glu Val Pro 105 110 115 120 ggg acc ccg gga cca ctg aac agg cag gcc acc ccg ctg ctg tac gcg 1156 Gly Thr Pro Gly Pro Leu Asn Arg Gln Ala Thr Pro Leu Leu Tyr Ala 125 130 135 ctg gcg gcc gag gcg gag gcc gcg gcg cag gct gcc gag ccg ccc agc 1204 Leu Ala Ala Glu Ala Glu Ala Ala Ala Gln Ala Ala Glu Pro Pro Ser 140 145 150 ccg ccg gcc tcg agg gcc gcc tac cgc cag cgg ctt cag ggc gcg cag 1252 Pro Pro Ala Ser Arg Ala Ala Tyr Arg Gln Arg Leu Gln Gly Ala Gln 155 160 165 cgg cga gtg 
ctc cgg gag acg tcg ttc cag cgc aag gag ctc cgc atg 1300 Arg Arg Val Leu Arg Glu Thr Ser Phe Gln Arg Lys Glu Leu Arg Met 170 175 180 agc ctg ccc gcc cgt ctg cgg ccc act gtc cca gcg cgg ccc ccg gcg 1348 Ser Leu Pro Ala Arg Leu Arg Pro Thr Val Pro Ala Arg Pro Pro Ala 185 190 195 200 act cac ccg cgc tcc gcc tcg ctc agc cac ccg ggc ggg gag ggg gag 1396 Thr His Pro Arg Ser Ala Ser Leu Ser His Pro Gly Gly Glu Gly Glu 205 210 215 ccg gcg cgc tcc cgg gct ccc gcg cca gga act gcc ggc cgg ggt ccc 1444 Pro Ala Arg Ser Arg Ala Pro Ala Pro Gly Thr Ala Gly Arg Gly Pro 220 225 230 ctc gcc aac cag cag cgg aag tgg tgc ttc tca gag cca gga aag ctg 1492 Leu Ala Asn Gln Gln Arg Lys Trp Cys Phe Ser Glu Pro Gly Lys Leu 235 240 245 gat cgt gtg ggt cgg ggc ggt ggg ccg gcg cgg gaa tgc ctg ggt gag 1540 Asp Arg Val Gly Arg Gly Gly Gly Pro Ala Arg Glu Cys Leu Gly Glu 250 255 260 gcc tgc tcc agc tct ggc ctc cct ggg ccc gag ccc ttg gag ttc cag 1588 Ala Cys Ser Ser Ser Gly Leu Pro Gly Pro Glu Pro Leu Glu Phe Gln 265 270 275 280 cat ccg gcg ctg gct aag ttt gaa gat cac gag gtc gga tgg ctg ccc 1636 His Pro Ala Leu Ala Lys Phe Glu Asp His Glu Val Gly Trp Leu Pro 285 290 295 gag acg caa ccc caa ggc tcc atg aac ctg gac tcc ggg tcc ttg aag 1684 Glu Thr Gln Pro Gln Gly Ser Met Asn Leu Asp Ser Gly Ser Leu Lys 300 305 310 ctc ggt gat gcc ttc agg ccc gcc agt cgg agt cgg agc gct tca ggc 1732 Leu Gly Asp Ala Phe Arg Pro Ala Ser Arg Ser Arg Ser Ala Ser Gly 315 320 325 gaa gtc ttg ggt tcc tgg gga gga tca gga ggg acc ata ccc att gtc 1780 Glu Val Leu Gly Ser Trp Gly Gly Ser Gly Gly Thr Ile Pro Ile Val 330 335 340 cag gct gtt ccc caa gga gca gaa acc ccc aga cca ttg ttt cag acc 1828 Gln Ala Val Pro Gln Gly Ala Glu Thr Pro Arg Pro Leu Phe Gln Thr 345 350 355 360 aaa ctt tcc agg ttc ttg cct cag aaa gag gct gcg gtg atg tat cct 1876 Lys Leu Ser Arg Phe Leu Pro Gln Lys Glu Ala Ala Val Met Tyr Pro 365 370 375 gca gag tta ccc cag agc agc cct gct gac agt gaa cag agg gtc tca 1924 Ala Glu Leu Pro Gln Ser Ser Pro Ala Asp Ser 
Glu Gln Arg Val Ser 380 385 390 gag acc tgc att gtg cct gcc tgg ctc ccc tcc ctt cct gat gaa gtg 1972 Glu Thr Cys Ile Val Pro Ala Trp Leu Pro Ser Leu Pro Asp Glu Val 395 400 405 ttc cta gaa gag gcc cca ctg gtc aga atg aga tca cca cca gac ccc 2020 Phe Leu Glu Glu Ala Pro Leu Val Arg Met Arg Ser Pro Pro Asp Pro 410 415 420 cat gcc tcc cag ggg ccc cca gcc agt gtc cat gcc tct gac cag ccg 2068 His Ala Ser Gln Gly Pro Pro Ala Ser Val His Ala Ser Asp Gln Pro 425 430 435 440 tat gga act ggc tta ggc caa aga act ggc cag gtt aca gtc ccc aca 2116 Tyr Gly Thr Gly Leu Gly Gln Arg Thr Gly Gln Val Thr Val Pro Thr 445 450 455 gag tac ccg ctc cat gag tgt cca gga act gca ggg gca gat gac tgc 2164 Glu Tyr Pro Leu His Glu Cys Pro Gly Thr Ala Gly Ala Asp Asp Cys 460 465 470 tgg cag ggg gtg aat ggt tct gta ggt att tcc agg ccc aca agc cac 2212 Trp Gln Gly Val Asn Gly Ser Val Gly Ile Ser Arg Pro Thr Ser His 475 480 485 acc ccc act ggg act gca aat gat aac atc cca act att gac ccc act 2260 Thr Pro Thr Gly Thr Ala Asn Asp Asn Ile Pro Thr Ile Asp Pro Thr 490 495 500 gga ctg acc acc aat ccc ccc aca gct gca gag agt gac ctc ctc aaa 2308 Gly Leu Thr Thr Asn Pro Pro Thr Ala Ala Glu Ser Asp Leu Leu Lys 505 510 515 520 cct gtc cca gct gat gcc ttg gga ctt tca ggc aat gat act cca ggt 2356 Pro Val Pro Ala Asp Ala Leu Gly Leu Ser Gly Asn Asp Thr Pro Gly 525 530 535 ccc tct cac aat act gcc cta gcc agg ggc act ggc cag cct ggt tcc 2404 Pro Ser His Asn Thr Ala Leu Ala Arg Gly Thr Gly Gln Pro Gly Ser 540 545 550 agg ccc aca tgg cct agt cag tgc ctc gag gag ctg gtt cag gag ctg 2452 Arg Pro Thr Trp Pro Ser Gln Cys Leu Glu Glu Leu Val Gln Glu Leu 555 560 565 gcc aga tta gat ccc tct cta tgt gac cct ctt gct tcc cag ccc agc 2500 Ala Arg Leu Asp Pro Ser Leu Cys Asp Pro Leu Ala Ser Gln Pro Ser 570 575 580 cca gag cca ccc ctg ggc ctg ctg gat gga ctg att cct tta gca gag 2548 Pro Glu Pro Pro Leu Gly Leu Leu Asp Gly Leu Ile Pro Leu Ala Glu 585 590 595 600 gtc cgg gct gca atg cgg cct gcc tgt ggg gag gct gga gag gag gct 2596 
Val Arg Ala Ala Met Arg Pro Ala Cys Gly Glu Ala Gly Glu Glu Ala 605 610 615 gcc agt act ttt gag cca ggg tcc tat cag ttc agc ttc acc cag ctc 2644 Ala Ser Thr Phe Glu Pro Gly Ser Tyr Gln Phe Ser Phe Thr Gln Leu 620 625 630 ctg ccg gct cct cgg gag gag aca agg ctt gaa aac cct gcc acc cac 2692 Leu Pro Ala Pro Arg Glu Glu Thr Arg Leu Glu Asn Pro Ala Thr His 635 640 645 cct gtg ctt gac cag cca tgt ggg cag ggg ctc cct gca cca aac aac 2740 Pro Val Leu Asp Gln Pro Cys Gly Gln Gly Leu Pro Ala Pro Asn Asn 650 655 660 agc atc cag ggc aag aaa gtg gag ctg gcc gcc cgc ctc caa aag atg 2788 Ser Ile Gln Gly Lys Lys Val Glu Leu Ala Ala Arg Leu Gln Lys Met 665 670 675 680 ctt cag gac ctt cac acg gag cag gag cgg ctg cag ggg gag gca caa 2836 Leu Gln Asp Leu His Thr Glu Gln Glu Arg Leu Gln Gly Glu Ala Gln 685 690 695 gcg tgg gcc agg cgc caa gcg gct ctg gag gct gca gtg cgc cag gcc 2884 Ala Trp Ala Arg Arg Gln Ala Ala Leu Glu Ala Ala Val Arg Gln Ala 700 705 710 tgt gcc cct cag gag ctg gag cgg ttc agc cgg ttc atg gcc gac cta 2932 Cys Ala Pro Gln Glu Leu Glu Arg Phe Ser Arg Phe Met Ala Asp Leu 715 720 725 gag cgc gtg ctt ggc ctt ctg ctg ctg ctg ggc agt cgc ctg gcg cgc 2980 Glu Arg Val Leu Gly Leu Leu Leu Leu Leu Gly Ser Arg Leu Ala Arg 730 735 740 gtg cgc cgc gcc ctg gcc cgg gcg gcc tca gac agc gac cct gat gag 3028 Val Arg Arg Ala Leu Ala Arg Ala Ala Ser Asp Ser Asp Pro Asp Glu 745 750 755 760 cag gcc tcc ctg ctg cag cga ctc cgg ctc ctg cag cgg cag gag gag 3076 Gln Ala Ser Leu Leu Gln Arg Leu Arg Leu Leu Gln Arg Gln Glu Glu 765 770 775 gac gcc aag gag ctg aag gag cac gta gcg cgg cgc gag cgg gcc gtg 3124 Asp Ala Lys Glu Leu Lys Glu His Val Ala Arg Arg Glu Arg Ala Val 780 785 790 cgg gag gtg ctg gtg cga gca cta ccg gtg gag gag ctg cgc gtc tat 3172 Arg Glu Val Leu Val Arg Ala Leu Pro Val Glu Glu Leu Arg Val Tyr 795 800 805 tgc gcc ctg ctg gcg ggc aag gcc gcc gtc ctg gcc cag cag cgc aac 3220 Cys Ala Leu Leu Ala Gly Lys Ala Ala Val Leu Ala Gln Gln Arg Asn 810 815 820 ctg gac gag cgc atc cgc ctc 
ctt cag gac caa ctg gac gcc atc agg 3268 Leu Asp Glu Arg Ile Arg Leu Leu Gln Asp Gln Leu Asp Ala Ile Arg 825 830 835 840 gac gac ctt ggc cat cat gcc ccg tct ccc agc ccg gcg cgg ccc cca 3316 Asp Asp Leu Gly His His Ala Pro Ser Pro Ser Pro Ala Arg Pro Pro 845 850 855 ggg acc tgt cct cca gtt cag ccg ccc ttc cct ctt ctc ctt aca 3361 Gly Thr Cys Pro Pro Val Gln Pro Pro Phe Pro Leu Leu Leu Thr 860 865 870 tagttatagg tggagagggt ggggtgcacc gcccctatct ctcacccacg taagtggggt 3421 gatgctgatt tagtctctgc ttttcccttg agggattggg gaggacccag tccaggcctt 3481 tctaagatac tcctccagcc gggcgcggtg gctcacgcct gtaatcccag cactttggga 3541 ggccgaggcg ggcggatcac aaggtcagga gatcgagacc atcctggcta acacggtgaa 3601 accccgtctc tactaaaaat ataaaaaatt agccgggcgt ggtggcgggt gcctgtattg 3661 ccagctactc ggaaggctga ggcaggagaa tggcgtgaac ctgggaggtg gagcttgcag 3721 tgagcggaca tcgcgccact gcactccagc ctgggcgaca gagcaagact ccgtctcaaa 3781 aaaaaaaaaa aaaaaaaaaa aaaaaaagat acttctccaa agtgttctca tgtggcctca 3841 cccaggtctt gtgtattatt tggtaattaa tttatggatc ttaaaaactg cagtattccc 3901 ccattttgtg atgagagtgt ggggctggca ggggttggtt ggagggagga gagaagacag 3961 aggagcactt aaggtgcaaa gcagcctatt ttttcttcaa taaaaattgt taagag 4017 <210> 20 <211> 5949 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (3111)..(4112) <400> 20 gcggccgccc acatgatgcg caagacgagc tgcctgcagt atctggacgc ccgcaacacg 60 cccctgctgg accactcggc gcccttcgtg gcccgtgccc tgcgcatccg cagcagcctg 120 gcagtgctgc acttggagaa cgccagcctg tcggggcggc ccctcatgct gctcggtgag 180 ccccaagccc gggagggtga gcaggatgtg cagcctgttg gggggcaggg gcctgcccag 240 tcaccgtgcc ccctccccgt ccccagccac ggccctgaag atgaacatga acctgcggga 300 gctgtacctg gcggacaaca agctcaacgg cctgcaggac tcggcccagc tgggtaacct 360 gctcaagttc aactgctccc tgcagatcct ggacctccgg aacaaccacg tgctagactc 420 gggtgggtgc agtggcccac cccacccaca cccgtcaccc agcacccact ctgcccggcc 480 ctgcgctagg tggtgctggg gacacagaca tggccaagac caccccaacg ctgtcctgct 540 ggggctccca ggaggaagac agacccgccc tagacagtga caacctagag tgggcaggac 600 tgggctggtg gagcccagag 
gcggggtccg atcagtctgg gggccaggga gggcttccag 660 gaggaggaga cagctcagct gacagccgca ggatgaggcg aagtgaataa ttacaatctg 720 aaaagaaagc agtgttgctg ccataacagg gtcagctaga ggctgctttc agatgggaga 780 ggagggaaga ccaccgtggg gcggggcctc tgggcttcca tccggaggct gggagcagaa 840 ccagccttgt gagaagcctg ggaaagggtc cagaccgcgg gagcagtgag tgcagtggga 900 ggagctctgg tgtttgatgc tgggggttga gcagagaatg gggagccgtc aaaggaggcg 960 ttcctgttgc atcctggagg ggtggggtgg gtcactgtgt ccccaggtgg caggcccaga 1020 ccccactcta ctgagcacct gccaagggcc atggctgtcg gcccctgggc tcagctcgga 1080 gggagacaca gggctctgcc cttgggagtt tgtactccag tctagagaga caacgataag 1140 atgagtgaaa tctccagcct gtcagatggt gggatctgca ttggagggag acagtggtga 1200 ctgtgggtgg gagtgagctg gctgtgaggt gaggtgggcc agaggtggtc acatttgagc 1260 aaagatcaga agcatctggt ggaggctgtc atccaggcac ccaggcccag gggtcacaga 1320 tgtgtgccca ggggtgccag gccgaggcag ggctgcaggg caaggcctgg gactgcgggg 1380 cctccatgga ggtcctaagc agagggggtt gctcctaccc tgcagtcccc tccccaggaa 1440 gcacccggac cccagacagg gctctgactg ccggcctcct ccaggtctgg cctacatctg 1500 cgagggcctc aaggagcaga ggaaggggct ggtgaccctg gtgctgtgga acaaccagct 1560 cacgcacaca ggcatggcct tcctgggcat gacactggtg agtcaggctg gcagggaagg 1620 gaggcacctc ggtccccgct gccacctccc actccagctc tcagcacagt tgcctctagc 1680 tgacggctcc tgtcctgccc atcctgcagg ttcaagacga tttaaactca gccccacctg 1740 cttcctcatt cagcttctgt gcctgttttc cctgaggagg agcccttctt tcattatctc 1800 cttttttttt ttgagaccga gtctcatcct gtcacccagg ctggagtgca gtggcgccat 1860 ctaggctcac tgcagccact ccagcccccg cctcctgggt tcaagccatt cttctgcctc 1920 agcctcccaa gtagctggga ctacaggtgc ccgccacccc gcctagccaa tttttttatt 1980 ttttattttt attttttttg agacggagcc tcaatcgccc aggctggagt gcagtggctc 2040 gatctccgct cactgcaagc cccacctcct gggttcacac cattgtcctg cctcagcctc 2100 ccgagtagtt ggaactacag gcacccacca ccacacccag ctaatttttt tgtatttttt 2160 tagtagagag gggatttcac catgttagcc aggatggtcg caatctgacc ttgtgatcca 2220 ccctcctcgg cctcccaaag tgctgggatt acacgcgtga gccaccgcgc ccggcctaat 2280 ttttgtattt ttagtacaga cggggtttca 
ccatgttggc caggccggtc ttgaactcct 2340 gacctcgagt gatccccccc acctcggcct ccgaaagtat tgggattatg agtgagtcac 2400 cgtgcctggt tttcagtttt tatgtggtgc ttaaagcaag ccactcaggt acagagactc 2460 tcagggcgag cactcgtgtg ctcaaacccc gggataagag ttgccctcct tgggggcttc 2520 agggcgctca aggtccaggc cctggccagg ggcccaggct gtggggggta gggactgact 2580 gggtgcctgg ggagctggga gtggacgggg gacctggcat ttggggagac ttcaggcaca 2640 ggagggactt caggccaggc ctgcggcagg gcaggacggc gccggtgttc acgcaggcgc 2700 gcgaaagaaa aggagcgccc gggcctggct ctcttcccgg cttctttcct gacttggcct 2760 cctgggtctc ctccgccatc acggcctcct cctcaccctc acaccccctc cctccagccg 2820 cacactcaga gcctggagac gctgaacctg ggccacaacc ccatcgggaa cgagggtgtg 2880 cggcacctca agaacgggct catcagcaac cgcagcgtgc tgcgcttcgg gctggcctcc 2940 accaagctca cgtgcgaggg cgcggtggcg gtggcggagt tcatcgctga gagcccccgc 3000 ctcctgagac tggaccttcg ggagaacgag atcaagacag gcgggctcat ggcactgtcg 3060 ttggccctca aggtgaacca ctcactgctg cgcctggacc tcgaccgtga acc caa 3116 Thr Gln 1 gaa aga ggc ggt gag cag ggg acg gtc ctg cag ccc tgg ggc ggg cgg 3164 Glu Arg Gly Gly Glu Gln Gly Thr Val Leu Gln Pro Trp Gly Gly Arg 5 10 15 aag gcc ggg tgg tgg ggc cgg cct gag agc cct agc cag gcg ctc ccg 3212 Lys Ala Gly Trp Trp Gly Arg Pro Glu Ser Pro Ser Gln Ala Leu Pro 20 25 30 cca cag gtg aag agc ttc atc gag acg cag aag gcg ctg ctg gcc gag 3260 Pro Gln Val Lys Ser Phe Ile Glu Thr Gln Lys Ala Leu Leu Ala Glu 35 40 45 50 atc cag aac ggc tgc aag cgc aac ttg gtg ctg gcg cgg gag agg gag 3308 Ile Gln Asn Gly Cys Lys Arg Asn Leu Val Leu Ala Arg Glu Arg Glu 55 60 65 gag aag gag cag ccg cca cag ctg tcg gcc tcc atg cct gag acc acc 3356 Glu Lys Glu Gln Pro Pro Gln Leu Ser Ala Ser Met Pro Glu Thr Thr 70 75 80 gcc acc gag ccc cag ccc gac gac gag ccc gcc gct ggg gtg cag aac 3404 Ala Thr Glu Pro Gln Pro Asp Asp Glu Pro Ala Ala Gly Val Gln Asn 85 90 95 ggg gcc ccc agc ccc gca ccc agc ccg gac tca gac tca gac tcg gac 3452 Gly Ala Pro Ser Pro Ala Pro Ser Pro Asp Ser Asp Ser Asp Ser Asp 100 105 110 tcg gat ggg gag gaa gag gag gaa 
gag gaa ggg gag agg gac gag acc 3500 Ser Asp Gly Glu Glu Glu Glu Glu Glu Glu Gly Glu Arg Asp Glu Thr 115 120 125 130 ccc tgt cct gcc ctg gtg ccc ccc acg gac tcc ctg ggc cct ggg gac 3548 Pro Cys Pro Ala Leu Val Pro Pro Thr Asp Ser Leu Gly Pro Gly Asp 135 140 145 agg agt ccc cca ggc agc ccc tcc aca ccc acc gag cag cgg att tcc 3596 Arg Ser Pro Pro Gly Ser Pro Ser Thr Pro Thr Glu Gln Arg Ile Ser 150 155 160 gtg tcc agc ccg ggc cgg ggc cac aag gtg ttt gtg gtg acc cgg gtg 3644 Val Ser Ser Pro Gly Arg Gly His Lys Val Phe Val Val Thr Arg Val 165 170 175 gag agc ccg ccc gag agg gca gag ccc cct gcg tcc ccc acc cct ccc 3692 Glu Ser Pro Pro Glu Arg Ala Glu Pro Pro Ala Ser Pro Thr Pro Pro 180 185 190 tct ccc cca ccc cct ccc tcc cca ccc gcc tca cct tcc cta cca cca 3740 Ser Pro Pro Pro Pro Pro Ser Pro Pro Ala Ser Pro Ser Leu Pro Pro 195 200 205 210 gcc ggg gcc att gac acc cgg gac aca ggg tcc tct gag cct cag cca 3788 Ala Gly Ala Ile Asp Thr Arg Asp Thr Gly Ser Ser Glu Pro Gln Pro 215 220 225 cca ccg gag ccg cct cgg tca ggg cca cca ctg ccc aac ggc ctg aag 3836 Pro Pro Glu Pro Pro Arg Ser Gly Pro Pro Leu Pro Asn Gly Leu Lys 230 235 240 ccc gag ttc gcc ctg gca ctg ccc cct gag ccg ccc ccg ggg cct gag 3884 Pro Glu Phe Ala Leu Ala Leu Pro Pro Glu Pro Pro Pro Gly Pro Glu 245 250 255 gtc aag ggg ggc agc tgc ggc ctg gag cac ggt gag agg ggc cct agg 3932 Val Lys Gly Gly Ser Cys Gly Leu Glu His Gly Glu Arg Gly Pro Arg 260 265 270 gca ggt gtt gag ggg ccc tgg gtg ctg tat gtg acc cca ggc aag ccc 3980 Ala Gly Val Glu Gly Pro Trp Val Leu Tyr Val Thr Pro Gly Lys Pro 275 280 285 290 ctg cct gtt gtg ggc ctc agt ttc ccc ctc ttt aaa atg gtt ctc gca 4028 Leu Pro Val Val Gly Leu Ser Phe Pro Leu Phe Lys Met Val Leu Ala 295 300 305 ggg gcc agc aaa gtg ggg ctt agt tct cat ctc cac cct gct tcc ctt 4076 Gly Ala Ser Lys Val Gly Leu Ser Ser His Leu His Pro Ala Ser Leu 310 315 320 ggg tcc tgg gag ctc ttc ctg ggg tgg ggg ggc atg taagtaaggt 4122 Gly Ser Trp Glu Leu Phe Leu Gly Trp Gly Gly Met 325 330 gtgctgcctt 
gagcctgacc atctcagcgg tctctgggca ctctgcaggc cctgagggtc 4182 tggctggggc cgggctgggg acaggttgta tttgccccac ccgcctgacg ggggtgctgg 4242 cccgtcctcc cacacagaac tgagctgctc caagaacgag aaggagctcg aggagctgct 4302 tctggaagcc agtcaggaat ccgggcagga gacactgtga cactttaggt gaggccaggc 4362 ccggggccca cagcactcgg gaggagctga gagagcctct ggctctgaca gtctctcccc 4422 caatctctcc tccccaagtt ccctttttcc ggtcggtctg cgatgagctg aggccagagc 4482 catgagaatc tgctcacctt ccccccagcc ttcctgaggc ccaggatgcc aggggtgggg 4542 gccattctgg ggcccccctc cccccacagc aacactacaa ggggtgcagg agctacaggg 4602 agtggccctc cgcgcgtgac tcaagcactt ctatttatga gcccagcact ggaagactct 4662 gggggtgaat gggaggaggg ggagcaggag gaggaggagg tctccaagga catcaggcgc 4722 ctgttctgga ggggccaggc ttgccctgcg gagggcaggc gtcctgggtg gtggtgggat 4782 ggtcccctgt ggccccgggc acagggccgg gcaggcagcc tggtgccgga gaggcggtgc 4842 gtgctggtgg tggttgagat gcgcagaaca gccccagaca gcgcaggccg ggcagggtgg 4902 ggggatggga gcagaggatc agagctttct ttttctcaag tgcaataaat ctatcaggga 4962 gctggggcgg gagcagccgg cactccggga ccctgctgtc caggccactg gaggctgcgc 5022 cctgagaggc actacagccc tttgggggcg agtggcatgg gtgtgggtga gggtgggcag 5082 agggctgggg ctactcctgt cggtgcaact ctgttcacac cttttctaat aaactggggc 5142 tgggttcact ttgcccaacg tctgcttcgt gcggatcctg gaggtggttg ctgggggtgg 5202 gctgggtgga ggccatcaca gctcctggtt acagagcatc tgccagggca ctggtcaggc 5262 attactgcgt cttcccccac agtcccgtct attctgcgtc catgctctgg tgtctcgtac 5322 ggtcacctcc cagccaggcg agtttacacc catcctttct gcagggagag acaggcccca 5382 gatacctcct ggcttcaggt ctagggtctc aggatgctgg atatggagtg gctctgccca 5442 gtcgcccatc gctcaacagg aacagatccc agcctgtccc tggcagcccc aggtgttgtg 5502 caggtgggag tatgccccgg gcccagccag cactgcagga gcctggtggg cggcttgttc 5562 ccctttgcca gcaggggcct ggggggccat gggggtgctg ttgcggattt ctgagaggtc 5622 tgtttttccc tttaggaaag gggggagcag gccgggcatg gtggcccaca cctgtaatcc 5682 cagcactttg ggaggctgag gtgggaggat tgcttgaacc caggaatttg acaccagcct 5742 gggcaacatg gtgaaacccc atctccatag aaaatacaaa aattagctgg gtgtggtggc 5802 gtgtgcctgt agtcccagct 
actgaggagg ctgagttggg aggatcgctt gaacccagga 5862 ggttgaggct gcagtgaggt gtgattgcac cactgcactt cagcctgggc aacagagtga 5922 gatcctgtct caaaaaaaaa aaaaaag 5949 <210> 21 <211> 4299 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (2)..(1696) <400> 21 a tgg cag agc ccg gct gag cct tgt gtt cta cgc cag tgc cag gag ggt 49 Trp Gln Ser Pro Ala Glu Pro Cys Val Leu Arg Gln Cys Gln Glu Gly 1 5 10 15 gtt gtc aca gag tct ggg gtg cgc tgt gtt gtt cat tgt aaa aac cct 97 Val Val Thr Glu Ser Gly Val Arg Cys Val Val His Cys Lys Asn Pro 20 25 30 ttg gag cat ctg gga atg tgc tgc ccc aca tgt cca ggc tgt gtg ttt 145 Leu Glu His Leu Gly Met Cys Cys Pro Thr Cys Pro Gly Cys Val Phe 35 40 45 gag ggt gtg cag tat caa gaa ggg gag gaa ttt cag cca gaa gga agc 193 Glu Gly Val Gln Tyr Gln Glu Gly Glu Glu Phe Gln Pro Glu Gly Ser 50 55 60 aaa tgt acc aag tgt tcc tgc act gga ggc agg aca caa tgt gtg aga 241 Lys Cys Thr Lys Cys Ser Cys Thr Gly Gly Arg Thr Gln Cys Val Arg 65 70 75 80 gaa gtc tgt ccc att ctc tcc tgt ccc cag cac ctt agt cac ata ccc 289 Glu Val Cys Pro Ile Leu Ser Cys Pro Gln His Leu Ser His Ile Pro 85 90 95 cca gga cag tgc tgc ccc aaa tgt ttg ggt cag agg aaa gtg ttt gac 337 Pro Gly Gln Cys Cys Pro Lys Cys Leu Gly Gln Arg Lys Val Phe Asp 100 105 110 ctc cct ttt ggg agc tgt ctc ttt cga agt gat gtt tat gac aat gga 385 Leu Pro Phe Gly Ser Cys Leu Phe Arg Ser Asp Val Tyr Asp Asn Gly 115 120 125 tcc tca ttt ctg tac gat aac tgc aca gct tgt acc tgc agg gac tct 433 Ser Ser Phe Leu Tyr Asp Asn Cys Thr Ala Cys Thr Cys Arg Asp Ser 130 135 140 act gtg gtt tgc aag agg aag tgc tcc cac cct ggt ggc tgt gac caa 481 Thr Val Val Cys Lys Arg Lys Cys Ser His Pro Gly Gly Cys Asp Gln 145 150 155 160 ggc cag gag ggc tgt tgt gaa gag tgc ctc cta cga gtg ccc cca gaa 529 Gly Gln Glu Gly Cys Cys Glu Glu Cys Leu Leu Arg Val Pro Pro Glu 165 170 175 gac atc aaa gta tgc aaa ttt ggc aac aag att ttc cag gat gga gag 577 Asp Ile Lys Val Cys Lys Phe Gly Asn Lys Ile Phe Gln Asp Gly Glu 180 185 190 atg tgg tcc tct atc aat tgt acc 
atc tgt gct tgt gtg aaa ggc agg 625 Met Trp Ser Ser Ile Asn Cys Thr Ile Cys Ala Cys Val Lys Gly Arg 195 200 205 acg gag tgt cgc aat aag cag tgc att ccc atc agt agc tgc cca cag 673 Thr Glu Cys Arg Asn Lys Gln Cys Ile Pro Ile Ser Ser Cys Pro Gln 210 215 220 ggc aaa att ctc aac aga aaa gga tgc tgt cct att tgc act gaa aag 721 Gly Lys Ile Leu Asn Arg Lys Gly Cys Cys Pro Ile Cys Thr Glu Lys 225 230 235 240 ccc ggc gtt tgc acg gtg ttt gga gat ccc cac tac aac act ttt gac 769 Pro Gly Val Cys Thr Val Phe Gly Asp Pro His Tyr Asn Thr Phe Asp 245 250 255 ggt cgg aca ttt aac ttt cag ggg acg tgt cag tac gtt ttg aca aaa 817 Gly Arg Thr Phe Asn Phe Gln Gly Thr Cys Gln Tyr Val Leu Thr Lys 260 265 270 gac tgc tcc tcc cct gcc tcg ccc ttc cag gtg ctg gtg aag aac gac 865 Asp Cys Ser Ser Pro Ala Ser Pro Phe Gln Val Leu Val Lys Asn Asp 275 280 285 gcc cgc cgg aca cgc tcc ttc tcg tgg acc aag tcg gtg gag ctg gtg 913 Ala Arg Arg Thr Arg Ser Phe Ser Trp Thr Lys Ser Val Glu Leu Val 290 295 300 ctg ggc gag agc agg gtc agc ctg cag cag cac ctc acc gtg cgc tgg 961 Leu Gly Glu Ser Arg Val Ser Leu Gln Gln His Leu Thr Val Arg Trp 305 310 315 320 aac ggc tcg cgc atc gcg ctc ccc tgc cgc gcg cca cac ttc cac atc 1009 Asn Gly Ser Arg Ile Ala Leu Pro Cys Arg Ala Pro His Phe His Ile 325 330 335 gac ctg gat ggc tac ctc ttg aaa gtg acc acc aaa gca ggt ttg gaa 1057 Asp Leu Asp Gly Tyr Leu Leu Lys Val Thr Thr Lys Ala Gly Leu Glu 340 345 350 ata tct tgg gat gga gac agt ttt gta gaa gtc atg gct gcg ccg cat 1105 Ile Ser Trp Asp Gly Asp Ser Phe Val Glu Val Met Ala Ala Pro His 355 360 365 ctc aag ggc aag ctc tgt ggt ctt tgt ggc aac tac aat gga cat aaa 1153 Leu Lys Gly Lys Leu Cys Gly Leu Cys Gly Asn Tyr Asn Gly His Lys 370 375 380 cgt gat gac tta att ggt gga gat gga aac ttc aag ttt gat gtg gat 1201 Arg Asp Asp Leu Ile Gly Gly Asp Gly Asn Phe Lys Phe Asp Val Asp 385 390 395 400 gac ttt gct gaa tct tgg agg gtg gag tcc aat gag ttc tgc aac aga 1249 Asp Phe Ala Glu Ser Trp Arg Val Glu Ser Asn Glu Phe Cys Asn Arg 405 410 
415 cct cag aga aag cca gtg cct gaa ctg tgt caa ggg aca gtc aag gta 1297 Pro Gln Arg Lys Pro Val Pro Glu Leu Cys Gln Gly Thr Val Lys Val 420 425 430 aag ctc cgg gcc cat cga gaa tgc caa aag ctc aaa tcc tgg gag ttt 1345 Lys Leu Arg Ala His Arg Glu Cys Gln Lys Leu Lys Ser Trp Glu Phe 435 440 445 cag acc tgc cac tcg act gtg gac tac gcc act ttc tac cgg tcc tgt 1393 Gln Thr Cys His Ser Thr Val Asp Tyr Ala Thr Phe Tyr Arg Ser Cys 450 455 460 gtg aca gac atg tgt gaa tgt cca gtc cat aaa aac tgt tat tgc gag 1441 Val Thr Asp Met Cys Glu Cys Pro Val His Lys Asn Cys Tyr Cys Glu 465 470 475 480 tca ttt ttg gca tat acc cgg gcc tgc cag aga gag ggc atc aaa gtc 1489 Ser Phe Leu Ala Tyr Thr Arg Ala Cys Gln Arg Glu Gly Ile Lys Val 485 490 495 cac tgg gag cct cag cag aat tgt gca gcc acc cag tgt aag cat ggt 1537 His Trp Glu Pro Gln Gln Asn Cys Ala Ala Thr Gln Cys Lys His Gly 500 505 510 gct gtg tac gat acc tgt ggt ccg gga tgt atc aag acc tgt gac aac 1585 Ala Val Tyr Asp Thr Cys Gly Pro Gly Cys Ile Lys Thr Cys Asp Asn 515 520 525 tgg aat gaa att ggt cca tgc aac aag ccg tgc gtt gct ggg tgc cac 1633 Trp Asn Glu Ile Gly Pro Cys Asn Lys Pro Cys Val Ala Gly Cys His 530 535 540 tgt cca gca aac ttg gtc ctt cac aag gga agg tgc atc aag cca gtc 1681 Cys Pro Ala Asn Leu Val Leu His Lys Gly Arg Cys Ile Lys Pro Val 545 550 555 560 ctt tgt ccc cag cgg tgacctttgt ttcgatcctt aagactctga aatctggtga 1736 Leu Cys Pro Gln Arg 565 ctttgacact gaagcggaag agccaatgaa ggactgcagt atttgtgtgc ccgattctgt 1796 aaacacacac acacagagta tatatgtgta tatatatata gatatattca aaaacattgc 1856 atcatttata tgaactatag ggggattatt atatgtatat tttttgctat aagacatgta 1916 ttgtttctag gatcctaacc tgtaagccat tgaacatgtt gtataaatac accaggtgtt 1976 tttaatttaa taaggtggca tgcagataca ttggatagtg ttaacatcac atacatttgt 2036 catttttaag gaagttttct aagagccctc aattgcctgc ctgtattaat tttagttttg 2096 agtcaggatt tgtaatggag tgggaaatgt gtttctctgg agaagggcac atttatctag 2156 gggcatttca ggtttccaaa gaaaggaatg tatgcctggg aaagacagca ggagattggt 2216 gaccggccct aatggtgcat 
gaaaagcgag tgataggctg ttagaatgta ttaggtcatg 2276 ggctaacatt atttccaaaa ttgattggct ggttgccaag aaatatatat ttgtcactag 2336 ggcataaccg aagaatcaag tgatttcttg ccatccttgc gtagtcctca agtcttctaa 2396 tcatctatgt gtgtatgatg gatgtgacca tttatatgtc atgtgttgaa attaaagggt 2456 gtctggatgc catgttaatt ttattgagca tgagaaagta tttgaagaaa ctgttgcaca 2516 acatatgatg aaattttgat ttctctgaat ctcagagtaa tttaagatgg gcaaatccaa 2576 tgagttgatg taaccaatct acattttgtg gctatttcat gtataaaatg aaaggcttta 2636 gttatttcta gaggagtccc atccaacact atttggcgac tgttaatatg ttataattat 2696 tagacagtgg aaaagccttt attgactaat tgatttaacc tcagtccaaa gaacagtctc 2756 tatacttgcc ctcatttact gataggctgc tactagttac catttgattg tcttgacttt 2816 cgtcttcagt tgaattggtg aaaacatcaa caaaaacaaa acccactgtg tcttaaaata 2876 aaatgttctg cattgtagtc aataaagggc ttaagatgta aaacaatgca ttttctctct 2936 caagatgtcc tgtgttatac tttacaatta gttgtatcaa cttcagattc attaaatttt 2996 aacagctgta tttagttata atttttgtat cttacaacag tatatataaa gactattatt 3056 aatagacaaa agcaaaccca cgtttttgac aagccagcat attgccaatg ctcgtcaaag 3116 acatttatgt ttcttttaca ttcatgacag ctaaagtatt aacttggaat tttttttttt 3176 tttttttgac tggaaaatcc ctagttgttt gaattaactg aataaagtgc agggcgtaat 3236 agtgactggc tgggggcact gggcctttgt gttttttttc attcctgtgt ttggatacta 3296 atcaccctcg tatctgcagc ctgaggtagt caggtcattg tttcaatgct gcccttggaa 3356 agaagacttt gacttaggtg ctttgggatc taagtgtggt ttgtgactat ccccattatt 3416 taagcatttg ggggtctcac ttttgcttat ctgtaacaca aagaggccat gtagtatagt 3476 gagcagggtt gagaatcggg aaaactagac ccctgccact aacaggctag gagactttgg 3536 gcagtaatgg gaggcactga gggtttgcaa atactttcct acttgtaatc tcattgattc 3596 cctgctctgc cctgacagga ggggtaggga aaatcagttg acgttgcctg gtggttccta 3656 ccttctgtaa aacgaacaga ttccttaaga ttagctttaa atcccttttc agttcactgt 3716 tggttctcat ctttgaaaga gggcagcata agggaaatga tggagaaggg agaattgtgg 3776 atggttattt atagctcatg attcatccaa ggactgagtt attctcccca tgaggctaac 3836 cggcctcttt cccaacactt aatcatggca gccaggatgg gtgaatgatg aaagctcttc 3896 tgcatgagcc ggtttcatca tcttgggtgg 
gttgattaca agtttgtgtg gcattccttt 3956 agctggggat tagcttaatc tctgtacatt tttctcattt ctaattgaag aagaaaggga 4016 gggcagagag gacagaaaat tacttgggtt tactaattca aagagccatc tctgatggga 4076 gagggtgagg aaatcagctg aatggttggc ctgcatattc ttttttaggc cacaggctac 4136 agctgagcaa tggcttagtc taatgattgg gcacaatgag gatttaccaa gtgcttcaat 4196 gagagcagtc ctgtttcatt cacatgggat gttttcaatt ggttcttcaa ctaagctctt 4256 gcagagtgag atgacttgaa ataaaaatgt tctatgaata ttg 4299[Sequence list] SEQUENCE LISTING <110> KAZUSA DNA Research Institute <120> Novel Genes and Proteins Encoded by the Genes <130> AB01037 <160> 21 <210> 1 <211> 5782 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (580) .. (3825) <400> 1 cagctcccaa ggctgagatt actctgcttc atctggatcg cccatctctg gggtctcatg 60 gctgagtttc agttccccaa tcctacctgc tcctcagggg gccagcactg gggctgcagg 120 taggccacct gttgagacct ggtgaaagat caggtataat aatgttctgc agtgaaaaga 180 aattgcgtga agtggaacgg atagtgaaag ccaatgaccg tgaatataat gaaaagttcc 240 agtatgcgga taatcgtatc cacacatcga aatataatat tctcaccttc ttgccaatta 300 atttatttga acagttccaa agagtggcaa atgcctattt tctttgcctt ctgattttac 360 agctaattcc agaaatttcc tccttgacct ggtttaccac cattgtgcct ttggtcctgg 420 tgataactat gacagctgtc aaagatgcca cagatgacta ttttcgccac aagagtgata 480 atcaagtgaa taatcggcag tctgaagtgc tcatcaacag caagtgctga aggtacaaag 540 ctgagggtct aatgacagga attactctgg taattgtaa gga aga aga ttt atc 594                                            Gly Arg Arg Phe Ile                                              1 5 ctg gtc ctg aga aaa ata ctg cag aat gaa aaa tgg atg aat gtc aaa 642 Leu Val Leu Arg Lys Ile Leu Gln Asn Glu Lys Trp Met Asn Val Lys                  10 15 20 gtg gga gac atc att aaa tta gaa aat aac caa ttt gtt gct gct gat 690 Val Gly Asp Ile Ile Lys Leu Glu Asn Asn Gln Phe Val Ala Ala Asp              25 30 35 tta ctt ctc cta tca agt agt gag cca cat ggt ctc tgt tat gtt gaa 738 Leu Leu Leu Leu Ser Ser Ser Glu Pro His Gly Leu Cys Tyr Val Glu          40 45 50 act gct gag ctt gat ggg gaa acg aac cta aaa 
gtc cgc cat gca cta 786 Thr Ala Glu Leu Asp Gly Glu Thr Asn Leu Lys Val Arg His Ala Leu      55 60 65 tca gtt act tca gaa ctt gga gca gat atc agc aga ctt gca ggg ttt 834 Ser Val Thr Ser Glu Leu Gly Ala Asp Ile Ser Arg Leu Ala Gly Phe  70 75 80 85 gat ggg att gtt gtc tgt gag gtg cct aac aac aag tta gat aaa ttc 882 Asp Gly Ile Val Val Cys Glu Val Pro Asn Asn Lys Leu Asp Lys Phe                  90 95 100 atg gga atc ctt tct tgg aaa gac agc aag cat tcc ctc aac aat gag 930 Met Gly Ile Leu Ser Trp Lys Asp Ser Lys His Ser Leu Asn Asn Glu             105 110 115 aag ata atc ctg aga ggc tgc atc ctg aga aat acc agc tgg tgt ttt 978 Lys Ile Ile Leu Arg Gly Cys Ile Leu Arg Asn Thr Ser Trp Cys Phe         120 125 130 gga atg gtt att ttt gca ggt cct gac act aaa cta atg cag aat agt 1026 Gly Met Val Ile Phe Ala Gly Pro Asp Thr Lys Leu Met Gln Asn Ser     135 140 145 ggt aag aca aag ttt aaa agg aca agc att gat aga ttg atg aat act 1074 Gly Lys Thr Lys Phe Lys Arg Thr Ser Ile Asp Arg Leu Met Asn Thr 150 155 160 165 cta gta cta tgg att ttt ggg ttt ctg ata tgc ttg gga att att ctt 1122 Leu Val Leu Trp Ile Phe Gly Phe Leu Ile Cys Leu Gly Ile Ile Leu                 170 175 180 gca ata gga aat tca atc tgg gag agt caa act ggg gac caa ttc aga 1170 Ala Ile Gly Asn Ser Ile Trp Glu Ser Gln Thr Gly Asp Gln Phe Arg             185 190 195 act ttc ctc ttt tgg aat gaa gga gag aag agc tct gtg ttc tcc gga 1218 Thr Phe Leu Phe Trp Asn Glu Gly Glu Lys Ser Ser Val Phe Ser Gly         200 205 210 ttc tta aca ttc tgg tca tat att att att ctc aat aca gtt gta ccc 1266 Phe Leu Thr Phe Trp Ser Tyr Ile Ile Ile Leu Asn Thr Val Val Pro     215 220 225 att tcc tta tat gtg agt gtg gaa gta att cgt cta gga cac agt tat 1314 Ile Ser Leu Tyr Val Ser Val Glu Val Ile Arg Leu Gly His Ser Tyr 230 235 240 245 ttt ata aac tgg gac cgg aag atg tat tat tct cga aaa gca ata cct 1362 Phe Ile Asn Trp Asp Arg Lys Met Tyr Tyr Ser Arg Lys Ala Ile Pro                 250 255 260 gca gtg gct cga acg acc acg ctc aat gag gaa ctg ggg cag 
att gag 1410 Ala Val Ala Arg Thr Thr Thr Leu Asn Glu Glu Leu Gly Gln Ile Glu             265 270 275 tac att ttc tcc gac aaa acg ggt acc ctc act caa aac atc atg acc 1458 Tyr Ile Phe Ser Asp Lys Thr Gly Thr Leu Thr Gln Asn Ile Met Thr         280 285 290 ttt aaa aga tgt tcc att aat ggg aga atc tat ggt gaa gta cat gat 1506 Phe Lys Arg Cys Ser Ile Asn Gly Arg Ile Tyr Gly Glu Val His Asp     295 300 305 gac ctg gat cag aag aca gaa ata act cag gaa aaa gag cct gtg gat 1554 Asp Leu Asp Gln Lys Thr Glu Ile Thr Gln Glu Lys Glu Pro Val Asp 310 315 320 325 ttc tca gtc aaa tct caa gcg gat aga gaa ttt cag ttc ttt gac cac 1602 Phe Ser Val Lys Ser Gln Ala Asp Arg Glu Phe Gln Phe Phe Asp His                 330 335 340 cat ctg atg gaa tcc att aaa atg ggt gat ccc aaa gtt cat gaa ttc 1650 His Leu Met Glu Ser Ile Lys Met Gly Asp Pro Lys Val His Glu Phe             345 350 355 ctt agg tta ctt gct ctc tgc cac act gta atg tca gaa gag aat agc 1698 Leu Arg Leu Leu Ala Leu Cys His Thr Val Met Ser Glu Glu Asn Ser         360 365 370 gca gga gag ctg att tac caa gtt cag tca cct gat gaa ggg gct cta 1746 Ala Gly Glu Leu Ile Tyr Gln Val Gln Ser Pro Asp Glu Gly Ala Leu     375 380 385 gtg act gcc gct aga aat ttt ggg ttc att ttt aaa tcc cgg acc cca 1794 Val Thr Ala Ala Arg Asn Phe Gly Phe Ile Phe Lys Ser Arg Thr Pro 390 395 400 405 gag acc ata aca ata gaa gaa ttg gga aca cta gtt act tat caa tta 1842 Glu Thr Ile Thr Ile Glu Glu Leu Gly Thr Leu Val Thr Tyr Gln Leu                 410 415 420 ctt gcc ttt ttg gat ttc aac aac acc aga aaa agg atg tct gtc ata 1890 Leu Ala Phe Leu Asp Phe Asn Asn Thr Arg Lys Arg Met Ser Val Ile             425 430 435 gtt cga aac cca gaa gga cag ata aag ctt tat tcc aaa gga gca gat 1938 Val Arg Asn Pro Glu Gly Gln Ile Lys Leu Tyr Ser Lys Gly Ala Asp         440 445 450 act att ctg ttt gaa aaa ctt cat cct tcc aat gaa gtc ctt ttg tct 1986 Thr Ile Leu Phe Glu Lys Leu His Pro Ser Asn Glu Val Leu Leu Ser     455 460 465 ttg acg tca gac cac ctc agt gaa ttt gca ggg gaa ggc ctt 
cgg acc 2034 Leu Thr Ser Asp His Leu Ser Glu Phe Ala Gly Glu Gly Leu Arg Thr 470 475 480 485 ttg gcc atc gca tac aga gac ctg gat gac aag tac ttt aaa gag tgg 2082 Leu Ala Ile Ala Tyr Arg Asp Leu Asp Asp Lys Tyr Phe Lys Glu Trp                 490 495 500 cat aag atg ctt gaa gat gcg aat gct gcc aca gaa gag agg gat gaa 2130 His Lys Met Leu Glu Asp Ala Asn Ala Ala Thr Glu Glu Arg Asp Glu             505 510 515 cga ata gct gag cta tat gaa gaa att gaa aga gat ttg atg cta cta 2178 Arg Ile Ala Glu Leu Tyr Glu Glu Ile Glu Arg Asp Leu Met Leu Leu         520 525 530 ggt gcc act gct gta gaa gat aag tta cag gag ggt gtt att gaa aca 2226 Gly Ala Thr Ala Val Glu Asp Lys Leu Gln Glu Gly Val Ile Glu Thr     535 540 545 gtt aca agt tta tca cta gcc aat att aag atc tgg gtc cta aca gga 2274 Val Thr Ser Leu Ser Leu Ala Asn Ile Lys Ile Trp Val Leu Thr Gly 550 555 560 565 gac aaa caa gaa act gcc atc aac atc ggt tat gcc tgc aac atg ctg 2322 Asp Lys Gln Glu Thr Ala Ile Asn Ile Gly Tyr Ala Cys Asn Met Leu                 570 575 580 act gac gac atg aat gat gtg ttt gtg ata gca ggg aat aat gct gtg 2370 Thr Asp Asp Met Asn Asp Val Phe Val Ile Ala Gly Asn Asn Ala Val             585 590 595 gaa gtg aga gaa gaa ctc agg aaa gca aaa caa aat ttg ttt gga caa 2418 Glu Val Arg Glu Glu Leu Arg Lys Ala Lys Gln Asn Leu Phe Gly Gln         600 605 610 aac aga aat ttt tcc aat ggc cat gta gtt tgt gaa aaa aag cag cag 2466 Asn Arg Asn Phe Ser Asn Gly His Val Val Cys Glu Lys Lys Gln Gln     615 620 625 ctg gag ttg gat tct att gta gaa gaa acc ata aca gga gat tat gcc 2514 Leu Glu Leu Asp Ser Ile Val Glu Glu Thr Ile Thr Gly Asp Tyr Ala 630 635 640 645 tta atc ata aat ggc cac agt ttg gct cat gcc cta gaa agt gat gtc 2562 Leu Ile Ile Asn Gly His Ser Leu Ala His Ala Leu Glu Ser Asp Val                 650 655 660 aag aat gat ctc cta gaa ctt gct tgc atg tgt aag act gta att tgc 2610 Lys Asn Asp Leu Leu Glu Leu Ala Cys Met Cys Lys Thr Val Ile Cys             665 670 675 tgc agg gtc act cca ctc cag aaa gcc caa gtg gta 
gag ctg gtg aag 2658 Cys Arg Val Thr Pro Leu Gln Lys Ala Gln Val Val Glu Leu Val Lys         680 685 690 aag tac aga aat gct gtt act ttg gcc att ggt gat gga gcc aat gat 2706 Lys Tyr Arg Asn Ala Val Thr Leu Ala Ile Gly Asp Gly Ala Asn Asp     695 700 705 gtc agc atg att aaa agt gct cac att ggt gtt ggc atc agc ggc cag 2754 Val Ser Met Ile Lys Ser Ala His Ile Gly Val Gly Ile Ser Gly Gln 710 715 720 725 gaa gga ttg caa gca gtc tta gcc agc gac tat tca ttt gca cag ttt 2802 Glu Gly Leu Gln Ala Val Leu Ala Ser Asp Tyr Ser Phe Ala Gln Phe                 730 735 740 aga tat ctc caa agg ctt ctc ctt gtt cat gga agg tgg tct tat ttc 2850 Arg Tyr Leu Gln Arg Leu Leu Leu Val His Gly Arg Trp Ser Tyr Phe             745 750 755 cga atg tgc aaa ttc tta tgc tat ttc ttc tat aag aat ttt gca ttt 2898 Arg Met Cys Lys Phe Leu Cys Tyr Phe Phe Tyr Lys Asn Phe Ala Phe         760 765 770 aca ctt gtg cat ttc tgg ttt ggt ttc ttc tgt ggt ttc tca gcc cag 2946 Thr Leu Val His Phe Trp Phe Gly Phe Phe Cys Gly Phe Ser Ala Gln     775 780 785 act gtt tat gac cag tgg ttc atc acc ctt ttt aac att gtt tac aca 2994 Thr Val Tyr Asp Gln Trp Phe Ile Thr Leu Phe Asn Ile Val Tyr Thr 790 795 800 805 tca ctg cct gtt tta gcc atg ggg att ttt gac cag gat gtg agt gac 3042 Ser Leu Pro Val Leu Ala Met Gly Ile Phe Asp Gln Asp Val Ser Asp                 810 815 820 cag aac agc gtg gac tgt ccc cag ctc tac aaa cca gga cag ctg aat 3090 Gln Asn Ser Val Asp Cys Pro Gln Leu Tyr Lys Pro Gly Gln Leu Asn             825 830 835 ctg ctt ttt aac aag cgt aaa ttt ttc att tgc gtg ttg cat gga atc 3138 Leu Leu Phe Asn Lys Arg Lys Phe Phe Ile Cys Val Leu His Gly Ile         840 845 850 tac acc tca tta gtc ctt ttc ttc atc ccc tat ggg gcc ttt tac aac 3186 Tyr Thr Ser Leu Val Leu Phe Phe Ile Pro Tyr Gly Ala Phe Tyr Asn     855 860 865 gtg gct gga gaa gat ggg caa cat att gct gac tac cag tcc ttt gca 3234 Val Ala Gly Glu Asp Gly Gln His Ile Ala Asp Tyr Gln Ser Phe Ala 870 875 880 885 gtt acc atg gcc aca tct ttg gtc att gtg gtc agt gtg cag 
ata gcc 3282 Val Thr Met Ala Thr Ser Leu Val Ile Val Val Ser Val Gln Ile Ala                 890 895 900 ttg gat acc agt tac tgg act ttc att aat cac gtc ttc atc tgg ggg 3330 Leu Asp Thr Ser Tyr Trp Thr Phe Ile Asn His Val Phe Ile Trp Gly             905 910 915 agc att gcc att tat ttc tcc att tta ttt aca atg cac agt aat ggc 3378 Ser Ile Ala Ile Tyr Phe Ser Ile Leu Phe Thr Met His Ser Asn Gly         920 925 930 atc ttt ggc atc ttc cca aac cag ttt cca ttt gtt ggt aat gca cga 3426 Ile Phe Gly Ile Phe Pro Asn Gln Phe Pro Phe Val Gly Asn Ala Arg     935 940 945 cat tcc ctg acc cag aag tgc atc tgg ctt gta att ctc tta aca aca 3474 His Ser Leu Thr Gln Lys Cys Ile Trp Leu Val Ile Leu Leu Thr Thr 950 955 960 965 gtg gct tca gtt atg cca gtg gtg gca ttc aga ttt ttg aag gtg gat 3522 Val Ala Ser Val Met Pro Val Val Ala Phe Arg Phe Leu Lys Val Asp                 970 975 980 tta tac cca acc ctg agt gat cag atc cgc cgg tgg cag aag gct caa 3570 Leu Tyr Pro Thr Leu Ser Asp Gln Ile Arg Arg Trp Gln Lys Ala Gln             985 990 995 aag aag gca agg cct cca agt agc cga agg cct cgg acc cgc agg tca 3618 Lys Lys Ala Arg Pro Pro Ser Ser Arg Arg Pro Arg Thr Arg Arg Ser        1000 1005 1010 agc tca aga agg tct gga tat gct ttt gct cac caa gaa ggc tat gga 3666 Ser Ser Arg Arg Ser Gly Tyr Ala Phe Ala His Gln Glu Gly Tyr Gly    1015 1020 1025 gag ctt atc aca tct gga aaa aat atg cga gct aaa aat cca ccc cca 3714 Glu Leu Ile Thr Ser Gly Lys Asn Met Arg Ala Lys Asn Pro Pro Pro 1030 1035 1040 1045 aca tca ggg ctg gaa aag aca cat tat aat agc act agc tgg att gaa 3762 Thr Ser Gly Leu Glu Lys Thr His Tyr Asn Ser Thr Ser Trp Ile Glu                1050 1055 1060 aat tta tgt aag aaa acc aca gac acc gtg agc agc ttt agc cag gat 3810 Asn Leu Cys Lys Lys Thr Thr Asp Thr Val Ser Ser Phe Ser Gln Asp            1065 1070 1075 aaa aca gtg aaa ctg tgagtcaata tgaatttaaa ccacgtagtt atcttttcac 3865 Lys Thr Val Lys Leu        1080 ttcaggtgga gctgaaattc tgctggctcc agagtttgag atttgaggca agaggtgggg 3925 caggcagatt 
gcctcactta acttaaatct gcggcagaca actgccagtg cccatcaaac 3985 aggagtgtgc gctatggaaa accaggccag agggtcactg tctggtttgt gatttggtgg 4045 acaaaacact cgctgttaca agtacagatt tttttttttt taaatcaacc tagataccaa 4105 ttgacctgaa ctttagaatc ttatttatgg agaaaaactt gtaaagctgc atattcactg 4165 aatggatcct caggcggata aaagggtgca ttttaaaggt atatatccaa gctgaaaagc 4225 atgcctattg acagataaac atgtatctgt aagatcagcc tttcccaagg tatactttta 4285 aaatttaaag cgtgtactgt gttgctttca gactgagttg catgtcactc tttagtcttg 4345 atatctacct gtctgttcag ccaggacaac aaatggcttc caagcctgaa gaatacaaaa 4405 gtgtgcttgt gtttctcatt tttataccag tctagggaca aaggagactg aacatctttg 4465 cagcaggata ggctggtaat ttgatcaaat ttattcaaaa agctctcagt ctgtgtcatg 4525 taaggacatg cttatgaaat gtgagagagg ctcgccacta agtattctaa atacttttca 4585 atggcttttc taacaacctc agtagtaatt tgctgagcat catccagacc attaatagaa 4645 tcagcaaagc actggaattt cacactttaa tgataatatt ccacatagtc tatgggcaaa 4705 tattttcaac atttccaatt tttaaagctt cagaattgaa gccaaacaaa ttaataaata 4765 attgttttaa ttactattta aaaactcagg tttagattgt ttaaaattag ttgcttttga 4825 tactcagctg tcatgtttat aattcaaaca tgtagtaaac atatgtaggt aaggttgttt 4885 ttttggagat gttgcagctc aaatttcagt ccacatatga atcatcagtg tattttccat 4945 aaagtgattc gggcatattt gtgtgaaaac ctcagttctg tcacttctta cctctataaa 5005 cttggacgat aatgtgcctt ctctgagact cagtttcttc ctctgtaaaa tgaggacata 5065 ctacctacct catgtggttg gttgatgatt gtctgtcaaa gcacaaactc tgaaattatt 5125 aaaaacataa ttatttcata aacagatgag ttaagttcca gttaactcaa catcagtata 5185 acagagcaat tggaagagaa tatgaaaaaa ctggaatcta aatagtcagt gaggaaggct 5245 ttgataaaat gaaattgcca gaaagatata aaactggtta gggtcctaca gggaaataaa 5305 attataaccg tggaggtaca tttctctacc agaaagcaaa aataaagcat catgtcttaa 5365 tggttttcta caaatcaact tctaattcta cagagtcctt aatctggtcc ctattaaatt 5425 cttggtcaga caaagttaca tttcccaaga gagtcaggtg acacttgagt gagtttgatg 5485 gataatgagc taatgtgata tctataggtc acaatttttt aaaaccaaaa ttttcaagtc 5545 tgggataatc tttcctaaat gggatcaaat gaaataatat gtgtaaaaga gtcaaatgca 5605 gtcctttacc atagtaactg 
cctatggacg ttgtctttcc cttacatgcc tgcctacact 5665 taaccagatg ttggttttca atgtctaatt tgtcattagt ttcaccacat ttgctcactt 5725 tttgtaacat ttttgcaaga tttgaaaact ttcagtaaat gttttggcac tattggt 5782 <210> 2 <211> 5181 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (3292) .. (4065) <400> 2 gagagcgccc tgggcgccta ctcgccagtg gactacatga gcatcaccag cttcccgcgg 60 ctgcccgagg acgagccggc gcccgcggcc ccgctgaggg gccgcaagga cgaggacgcc 120 tttctgggag accccgacac cgacccggac tccttcctga agtctgcacg gctgcagcgg 180 ctgccatcgt cgtcgtcgga gatgggcagc caagacgggt cgccgctacg cgagacgcgc 240 aaagacccgt tctccgccgc agcggccgag tgctcctgcc gccaggatgg gctcacggtc 300 atcgtcacgg cctgtctcac cttcgctacc ggtgtcaccg tggcgctggt catgcagatc 360 tacttcgggg acccccagat cttccagcag ggtgccgtgg tgaccgatgc tgcccgctgc 420 acttcactgg gcatcgaggt gctcagtaaa cagggatctt ctgtggacgc agcggtggca 480 gcagccttgt gtttgggtat cgtggctcca cacagttctg gcctgggcgg tgggggcgtg 540 atgctggtac atgacatccg acgaaatgag agccacctaa ttgatttccg ggagtccgca 600 ccaggggccc tcagggaaga gaccctgcaa agatcctggg agaccaagtg acctggtctc 660 ctctctccct cgcctacccg ccttgcccag cctgggctct tggtgggggt tcccggaatg 720 gtgaaggggc tacatgaagc tcaccagctc tatggcaggc tgccatggtc ccaagtcctg 780 gcctttgcag cagctgtggc ccaagatggc ttcaacgtga ctcatgatct aggtcagtgg 840 ggcctgggga tttgggagag acatgaggtt gatggagaag ggtagaatct ttgagatttg 900 gagcccaagc cagagaggcc ctctcttcca gtttgctcct cagaccccct ccccacctta 960 tcccaccctc acctggaggc ggcctcaaac gagggggtct gggaagggcc cttagacatc 1020 cctgcctcgc acttaacagg ctcctggggg tcagggcact tagggtgggc cactcagccc 1080 ccatctaccc catctattgc gcgtctcatc cggtccagtc ctggcgtccc gccctgccca 1140 gcctcccgtc cctgccccta gcccgtgccc tggctgaaca gctgccaccc aacatgtccg 1200 agcgcttccg ggagacgttc ctgccatcgg gccgcccgcc actacctggc tcgttgctgc 1260 atcggcccga cctggctgag gtgctggatg tacttggcac ctccggcccg gctgccttct 1320 acgcaggtgg caacctcaca ctggagatgg tggccgaggc tcagcacgca gggggtgtca 1380 taaccgaaga ggacttcagc aattacagcg cccttgtgga gaagcctgtg tgtggcgtgt 1440 acagaggtga cctctccccc 
ggctcccagg gtcccccctc aggagaagcc tcccagtcca 1500 tggccacatc gttttggcct agagactcct ctccatttca caggagagaa actaaggcag 1560 tgagctacct gggcagtcaa ctgttgaagc aaaccagagt cagcccacct tctgaaaagt 1620 agctgtgggg tcagttaccc caagtattta ggatattggg gggtggacct ggtcaaaggg 1680 tccaacctgg aagttggggt tctggccata ggttttcttg atttgggctc tgtgttgcct 1740 ctgccctttc cccagcagcg ctgtcactgc cctgtacttc ctaagaattt ttaagacaaa 1800 gtccatccaa gcttcacagt agaatgaacc tttcaagaca gtcatagacc cagctcctca 1860 tagtgccaaa aagaaattga ggcccgagga gggaatataa caactggcca aactcaagaa 1920 aaccaacagg gaacccagaa aaccaagctt atgacatggg tggggtccat gtttactgaa 1980 cccaaggtgg taggtgctgg atttctcaga agattctcag gttctttccc ttctaggaga 2040 gccagatcac atcaagcccc aggaaagggt ttctctgagt tcaactttcc aggcctcaat 2100 ctccaacctg attcctctgc cagaggcagt ggaatatgaa gcaatggaaa gagcctatca 2160 gctgggtgca gtggctcagg cctataaccc cagcacattg ggaggctgag gcaggaggat 2220 cacttgagcc caggagttca tgaccagcct gggcaacata gtgagacacc gtctctataa 2280 aaaatttaaa aattacccag gcgtggtggt gtacctatag tcctagctac tcaggaggtt 2340 gagatgggag gattgcttga gcctgggagg tcaaggctgc agtaagcatg attgtgccac 2400 tgcactacag cctgggtgac agagtgagac cattatcaca cacacacaca cacacacaca 2460 cagaaaagaa aaaagaacaa gcgtagcgtt ggagctagga agacctgggt ttgagtacca 2520 actgtgccat ttaccttgaa caagctactt cccactctga gccttggttc cctccttaaa 2580 atggaaataa taatagtgcc tacatcttag gatgcccata aggatttgat gagatgatgt 2640 aggaagttgc cctgatcaga gcatgcactg aatatagatt ttccatttcc ccttcccagc 2700 ctgggttgtc tcaacaatgc tagtaccttg agtgctctgg attccccttg tctctctgta 2760 ccacagtttc tggacctatc aaatatgtgt gataacagta gccacagtga ggactaaatg 2820 agttaatgtt agtgttatgc atcagctatt attcctttga tgtccaaaca tctagtaaca 2880 gaataggaag agaaggttgc tgctgccaag aagtttgggg ctggatgaga ccatcaggag 2940 tcatggataa tatcctggcc tatgggagag ctaaaaggag acacatgaca ggccaggaat 3000 atgaacatct ggtcacctgt ttctgctgtg ctctgctgat agcctgcagt gtgactctgg 3060 gcaagccaca gccctcttga gccttagttt tctcaaatgg cagaggaaac tgaagaccag 3120 tgttctgagt tcaagtcctg gccctgccct 
tgacttgcca tgtgaacttg aatgagtctc 3180 tttgcctctc aatgtttcag tttccccatc tggtgaggga gttcagcgca gtggtctcag 3240 tgtgtgatgg tggaatgtgg agatgggatg tgggatgtga aggtgccttg a ccc cta 3297                                                          Pro Leu                                                            1 ccc tgc cat cag tca ctt cct gtc ctt tca ggc cac ctg gtt ctt agt 3345 Pro Cys His Gln Ser Leu Pro Val Leu Ser Gly His Leu Val Leu Ser           5 10 15 ccc cca cct ccg cac acg ggc cct gcc ctc atc agt gct ctc aac atc 3393 Pro Pro Pro Pro His Thr Gly Pro Ala Leu Ile Ser Ala Leu Asn Ile      20 25 30 ctg gag ggc ttc aat ctc acc agc ctg gta tcc cga gaa cag gct ctt 3441 Leu Glu Gly Phe Asn Leu Thr Ser Leu Val Ser Arg Glu Gln Ala Leu  35 40 45 50 cac tgg gtg gca gag acc ctg aag att gca tta gcc ctg gcc agc aga 3489 His Trp Val Ala Glu Thr Leu Lys Ile Ala Leu Ala Leu Ala Ser Arg                  55 60 65 ctg gga gat ccc gtc tat gat tct acc atc act gag agc atg gat gac 3537 Leu Gly Asp Pro Val Tyr Asp Ser Thr Ile Thr Glu Ser Met Asp Asp              70 75 80 atg ctc agc aag gtg gag gcc gcc tac ctc cgg ggc cat atc aat gac 3585 Met Leu Ser Lys Val Glu Ala Ala Tyr Leu Arg Gly His Ile Asn Asp          85 90 95 tcc cag gca gcc cct gcc cca ctc ctg cct gtc tat gaa cta gac gga 3633 Ser Gln Ala Ala Pro Ala Pro Leu Leu Pro Val Tyr Glu Leu Asp Gly     100 105 110 gct ccc acg gct gcc cag gtg ctg atc atg gga cct gat gac ttc att 3681 Ala Pro Thr Ala Ala Gln Val Leu Ile Met Gly Pro Asp Asp Phe Ile 115 120 125 130 gtg gcc atg gtt agc tcc ctg aac cag ccc ttt ggc agc ggc ctt atc 3729 Val Ala Met Val Ser Ser Leu Asn Gln Pro Phe Gly Ser Gly Leu Ile                 135 140 145 acc ccc tcg ggg atc ctg ctc aac agc cag atg ctg gac ttc tcc tgg 3777 Thr Pro Ser Gly Ile Leu Leu Asn Ser Gln Met Leu Asp Phe Ser Trp             150 155 160 ccc aac cgg aca gct aac cac tct gca ccc agc ctg gag aat tca gtg 3825 Pro Asn Arg Thr Ala Asn His Ser Ala Pro Ser Leu Glu Asn Ser Val         165 170 175 cag cca ggg aag 
cgg cca ctc tct ttc ctg ctg ccc aca gtg gtc cga 3873 Gln Pro Gly Lys Arg Pro Leu Ser Phe Leu Leu Pro Thr Val Val Arg     180 185 190 ccc gcg gag ggg ctc tgt gga acc tac ctc gct ctg ggg gcc aat gga 3921 Pro Ala Glu Gly Leu Cys Gly Thr Tyr Leu Ala Leu Gly Ala Asn Gly 195 200 205 210 gct gcg cgg ggc ctc agc ggc ctg aca cag gtg agg ttt acg cca tgg 3969 Ala Ala Arg Gly Leu Ser Gly Leu Thr Gln Val Arg Phe Thr Pro Trp                 215 220 225 ttg gcc ttt ttc agc cgg gag cca agc tgt ggc cta gac tgc aga tgc 4017 Leu Ala Phe Phe Ser Arg Glu Pro Ser Cys Gly Leu Asp Cys Arg Cys             230 235 240 ctg tca tat ctg tgg ctt gtt tcc att cct cat gcg gca aac atg ggc 4065 Leu Ser Tyr Leu Trp Leu Val Ser Ile Pro His Ala Ala Asn Met Gly         245 250 255 tgaggctgtt ttgtgccagg cgcatcctgg gcaatgtgga catgggctgc gctgagtttc 4125 ttagtgaaaa tggagccctg atgctccctg tctttcccgg ttgcagggga aatggcaggg 4185 tcagtagatt gaaaagtggg gttagcatct gggtggcctg ggttctgtta acctcaagtc 4245 ccattatacc tctacccagc atctgaatag acaaccagcc ctctttaggg tgggatttga 4305 aaggaagggc ctgcttttct ccagaacacc tcagaatggc tccatgaatg tcttgagaac 4365 ctgatatata tgccaggcac ctattttggc aggtggtgag ttaacgatct taagttctgg 4425 aatcagacac tcctgggttc cggcccaagt gctaccaatt tgtaactgtg tgatgctgga 4485 gcattcactt aacagctctg aacctcagtc tcttcatctt taaaatggaa ataataacaa 4545 tagctaacat ttaaagagtt cgtactaagt gctgggcact gtgttatgtg tgttaaccat 4605 tttaaccttc ccagttctgg agagaatcct tatttctatt ttatagatga ggaaaccgaa 4665 gcccagagag gttaagactt taggtcaagt accttcagct agaaagagca gggaggcagt 4725 ttggttccag agcctacgca cactcctaac cactaagtta aaagcaccca cattgtgccc 4785 tagaccactc ctagctatta accttgccct atccagagag aaaagaaggg agccccgggc 4845 tagattaaag agacttggtt ctagtggtgt atcttccctg gcttcctcta aactctggag 4905 taagtcattt tccttgctgg acttcacttt ccctctctgt gaaatggaaa cagtcatcac 4965 aggcctgcct gtcttccagg gctgttagga ggctcagatg acacccggtg taagaattct 5025 caagactaag gagcctggac caagggtgag agctgaggaa tcctggacca tcacccctgg 5085 gacagccctg atttttctgt accttggcta 
ccacaggttc tgctgaatgt cctgaccttg 5145 aaccggaacc tgagtgacag cctggcccgc ggccgc 5181 <210> 3 <211> 4786 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (1276) .. (1707) <400> 3 tgcagccgac agcacgcctt gcggccctgg ggatcttcct gtgccccggc gagacccttt 60 ccgcctcact gcattggaac cccattcccg atcacccgct gggatccatc gtcggacccc 120 aagaggagtc cgcgcagccc agccggcacc ccgaagctcc tccttcagcg ggaacggaag 180 cagaagagcg atcaaggagg tcctcaccac aggactccta tgggtccgac cctgggtctc 240 ccgcaggccc ctctggcagt cctcttccca cccgtcgcct cgggctgcgc cgccgccgcc 300 gccgccgcaa cctccagcac cgccgcccca ggccccgcag ccgccgcgtc gccgccattt 360 tttaaagggt ccgcagcctg actctgcgga gtaagggggg gtggagcggg gggagtcggc 420 ctcgccagcg cgcaggcgcg aggcccgagc cgccgcttgg gtcacagtga aagccaccgt 480 tgcccgggga tgggtccctg acacttgggg aagtaggagc cctgtgtgat cgtgcgtctg 540 agtctgggct gagaccagtc ctggccaggg cagttaccag gacggtctcc ggaggccggg 600 attcgcggag ggtccagcag caggaagaaa ccccaggagg aagaaacctc agacagatcg 660 ccggcgaggc agcgcgggat cccagcctct ggcgtgcgcg gacggtgtgc gggtgagtct 720 ccccaaaagt ggagcccttg tgatgacgag cacaggtccg cctgtgtgcc cgtgggctgc 780 tctctcaccg gtggctcgta gtcgcgtaaa gcagaagtag cagtttcagc ggctgacact 840 aagaacaatg catactggca aagtgaggtg tctcacatct ttaatcccag cagtttggga 900 ggctgaggcg gatggatcac ttgaggtcgg gggatcacac cagcctaacc aacatggtga 960 aactccgcgt ctactaaaaa aaagtaccag acagttagcg cggtgtggtg gcatatgcct 1020 gtaaatccca gctatttggg aggctgaggc aggagagtcg gttgagccag tgtggccgag 1080 gttgcagtga gtgtagattg caccactgca ctctagcctg ggtaacaagc acaaaactct 1140 gtctcaaaaa aaaaaaaaaa aattatcagt gcagagttag gccagattca ttttcagagt 1200 aggggggaat ttcaaattat gggtgaaatg tcccaactgc cggaaggtga aagtgttctg 1260 tgatgtaaaa tttag gtc aca ctc tat aaa gct tct cct tat tct atg tgt 1311                  Val Thr Leu Tyr Lys Ala Ser Pro Tyr Ser Met Cys                    1 5 10 gaa cag cta gtc tgt atc ttc acc tat gca tat tta aaa atc aac ctt 1359 Glu Gln Leu Val Cys Ile Phe Thr Tyr Ala Tyr Leu Lys Ile Asn Leu          15 20 25 acg aag gct ctt ggc cac cat aat cct 
tcc tac cct gta gca aga gcc 1407 Thr Lys Ala Leu Gly His His Asn Pro Ser Tyr Pro Val Ala Arg Ala      30 35 40 cag cct cag act atg ctt cct tct gcc tcc gag tgc aat gcc ttc ccc 1455 Gln Pro Gln Thr Met Leu Pro Ser Ala Ser Glu Cys Asn Ala Phe Pro  45 50 55 60 cga gtc tcc aag gta cca aac cgg tca ggg gca gct gag aca gct cta 1503 Arg Val Ser Lys Val Pro Asn Arg Ser Gly Ala Ala Glu Thr Ala Leu                  65 70 75 ctt gta cga gtc ctc ccc aag ccc ttg aaa tca cag gca tct ctg ctg 1551 Leu Val Arg Val Leu Pro Lys Pro Leu Lys Ser Gln Ala Ser Leu Leu              80 85 90 tcc ctg ccc ttt atc cac atc act gaa caa tta agc aca cac atc tgt 1599 Ser Leu Pro Phe Ile His Ile Thr Glu Gln Leu Ser Thr His Ile Cys          95 100 105 gta gtt tta ttt aaa ctg ttc cat gtg tgt ttg ttt aca att aac aac 1647 Val Val Leu Phe Lys Leu Phe His Val Cys Leu Phe Thr Ile Asn Asn     110 115 120 atg aat gac tgc agt atg gat gca gca gaa agg aat gct gag gga gga 1695 Met Asn Asp Cys Ser Met Asp Ala Ala Glu Arg Asn Ala Glu Gly Gly 125 130 135 140 ctc cag ttg gaa tgagccactt ttccaggact aggaatgaac tttccccagt 1747 Leu Gln Leu Glu accaacaagg cagcctccag ggcaagaaat ggtaacagaa tccatcctgt tgacagtcaa 1807 gatggaatta cagagaattc tcacctcttt tttgactctc aggtggaaac ccccttccca 1867 tggggtgaag cgcagtctgc tccagagacc cactgttagg gtcggggatg tcacagaaat 1927 taacagggca gccaattccc atccagaggg gcccaggatg accccgaacc ctctagctca 1987 gccccaagct cgggccaggg agtgaagcct ttgtccccaa gcaacagagt gagggtctgt 2047 ctctaacccg gtgggctcca gtgaaaggaa aatggcctgg aggtgcccac aggagggcgc 2107 cctagttctg agacaggtct acggctccca gactgcgtcc ccatgagtaa cagagggaca 2167 cgcagagggg catactgagt gcctataggg gactcacaga ctgcagatgc acgctgcgga 2227 tccagaagcc ggtcaggtaa tgaggtgcag aggtgtgggg aagacctgga tgcctccctg 2287 ggtgccaacc agcctaacca gccggggcca ccaaacagaa gtgcttctga cattaacgcg 2347 acacacagaa ggtgtgtagg gtgggtagtg gaagtgaccg cagaggcctc tggggattgt 2407 agttttggag ctagagaggc gggcaagcag ttggcttcgc ccacacggtg tgcctctcac 2467 atctgcacag cgcacacaac caagcttagg agccgactgt 
taagatgccc cagtgacacg 2527 gacaagtttc tcctctgcaa cccaatgaga gctgctgacc acgagagcat cgcgggaccg 2587 ccaaaggatt ctggaaaatg tggtcccggg tggccagtgt gcgctgacct tcacctgaag 2647 gtgagctgag ctcaccatgc aagtctccac cctggggtgc ccttccccaa gcagatgcca 2707 ggttggggca gaagcagttc agggagaagc gatgcgctcc tgggactgcc ggtttggggg 2767 caacatcggc cctgtgtccc ctagagtagg agggtgtacc aggatgggga accccaaggc 2827 cgttttcagg gagaacaaga ttggcttgag gcaaactccc ggaatggaga tggggagagt 2887 ctggtcgcat ttccccagga gtttagggga tgaaggctgg acttgggtga acagcaaaag 2947 atcccctggt gatagtggtg gtgaggatca gaggcttgag ccagaaattc ccgaggtgtt 3007 tgtttgtttt tttttttttt gagacagagt cttgatctgt ctcccaggat gtagtgcagt 3067 ggcatgctct tggctcattg caacctccgc ctcccgggtt caggcgattc tcctgcctca 3127 gcctcccaag tagctgggat tatgggcgcc tgccactgtg cccacctaaa ttttttttgt 3187 ttttttgtat ttttagtaga gacgatgttt caccacattg cccagcctgg tctcgagctc 3247 ctgacctcaa gtgatccacc tgcctcggcc tcccaaagtg ctgggattgc aggcatgagc 3307 ccccacaccc agctcatttt tatcttttaa cttaaaggtt tttatttttc tctattttta 3367 ctcttacagg tacggcttga aggagacttc tggagggtgg ggtggtgttg aagagaaggc 3427 acaattgtca ccattttgtt caacatgggc gttttctttg tgcattgatt tgcattttta 3487 aaggtattcc attaaaaata tgtttatcta ttttaatgtt ttttttggtg cccctttaaa 3547 ttctctatcc acagtcctga gccttaatac gaagtgaggg cctggcctct gggcatcaag 3607 cagccagaaa gtcccttagg agctttctct gaattgttga attgtgactg atcttttcct 3667 ttcctgactt ccttaatatc cacaatttcc tcgtaaccat ccccccaaag ccacacccac 3727 aaggcacctc caagacctat catgaagcag atattcaaag attatttgta aatcttcaca 3787 ccccacccct tccagaatgt cttccaacat cccttcgcct ggagaaataa aacacaaggt 3847 ttgcatgttg agtatcagcc tctcccattg ggtgggtgtt taggaaaaag aatcagccaa 3907 gagccctatt ccagcccagg taaatgactt tgccaaagat ttaatatcca caaatgtaca 3967 atgctcactg ggaaccaaag tcaggcatgg ggctgggctt taaggagcac aaacaaaaag 4027 gagggactag aaaacttcag aaaggtattg gtgggggatg ttgcgggggg acaggggaca 4087 gcgaggatgt gggatcccga gatcatccaa atccctatgt gtagacatat gtgtataaag 4147 gcctttaaga gactcaggct gatggggtat ctgtaataaa tcaaacataa 
tataacagca 4207 cgtcaagtga taaggggact ctggaaaaac aagcagcaaa aggagcagta tcaaactcca 4267 cagaaattca caaacatcaa gacaccaaga aagctgcatt catttaaatc aaggtgacag 4327 gctgggctct gtagctccag cctgtaatcc tagcactttg ggaggccaag gtgggcagat 4387 catttgaggt tcaagaccag cctggccatt atggtgaaag ctcctctcta ctaaaaatac 4447 aaaaattagc tgggcatgtt ggtgggtgcc tgtaatccca gctaaagagg ctgaggcagg 4507 agaatcactc tgaacctggg aggcagaggt tgtagtgagc agagatcacg ccattgcact 4567 ccggcctcgg ggacagaacc aaactccatc tcaagtaaat caataaatca aaggtgacag 4627 ttactgagtg atgcgggcca ggtacaatga tataccttca tctggattac cctcgtgaat 4687 ctgaatcact tgtgtgatat ctgtgcttta acgagtccta tttcccatac tcttagctgg 4747 cttgagatgt gaaaatccac ctaaattata aacagagtg 4786 <210> 4 <211> 6646 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (5605) .. (5844) <400> 4 attctgtttg tcatatttag tagctgcact ctcctaagga aaggaacctc accttcctgt 60 tcacttttct gtctccattg cctgtacttc cagagccagc ccgtgatgat gcctgaaaaa 120 attttttttg atgtatgaat agatgaacaa acaagcacag aagaatacag tgtgaataca 180 gtgaagccag aggtgtactc tgcgaataca tgagacaccc ttgagtggaa tggtttcagt 240 agatggattg tatgaggtgt gaattgtatc tcaatcaagc ggggcctcag ggagtgccca 300 ccactaaggg tctagctctg tgtggcgttg gtggggctgg ggaaggttta gaaaaggttc 360 cgttgctaag accttggtta gggtagagtt tgcagcagag atttggatag gagggcgttc 420 caggctgaag agccagtgtg agcagagtct ttgtgggaag aacattttat attcctggac 480 ctaatgcttg tttaacaaga tgcttgcttg tggctgggcg cagtggctca agcctgtaat 540 cccagcactt tgggaggctg aggcaggagg atcgcttgag ctcaggaggt caaggttgca 600 gtgagccatg attgcaccac tgcatgtcag cctgcgtgat agaacaagac cctgtctcaa 660 aaaataaaaa atgacaataa agattgcttg ttaaataaaa tgtgatttat ttactttgca 720 taaatactat gccttctgag aatgctgtag gtgcgggcat attttgtttt tatcctaacg 780 cagcccaaat tccaagtaca cactggattt tgagcattta aaggacttcc ggtgctatta 840 aggattccca tttgtttgtt caagtttgct aaacgaattc agtcatgtca cttacttagt 900 tgttgtaagt attatattac agtgtgaagg gggtaaaaag aactgctatg agtatctttc 960 catgaagaag atttagcttg gaaatgtggg cagatttgtc tgcctttgtg tttaggaagg 1020 cacaattata 
attacaggtg gaggagctag acactggcca gctggaatca gaatttcaca 1080 ttgtagaagt gtctggtctg gaaggcatcc atgggaaacc tacacccttg acccagcatt 1140 ggccacagag ctgcaggcgg tctcctgcct ctgcctctgc ccctgcgtgc ctctcctttg 1200 tagcagccgt ggcatatgca gaggggactg tagatctctt ttctttcttg ctgactttat 1260 tttctaccac actcccccaa cccaaataaa gaaataatga catgtgcatt tgagatagtg 1320 tgtgtgtttt ttttttttct ctttctgaga tggagtctta ctctgttgcc caggctgaag 1380 tgcagtggta caatcttagc tcactgcaac ctctgcttcc caggttcaag accagcctgg 1440 ctaatgtggc aagacctcgt ctctactaaa aatacaaaaa aaaaaaaaaa aaaaaaatta 1500 gccaggtgtg gtacacgcct gtaatcccag ctacttgaga ggctgaggca cgggaattgc 1560 ttgaacctgg caggcggagg ctgcagtgag ctgagatcgc gccactgcat tccagtctcg 1620 gtgacagagc gagactctgt ctcaatttaa aaaaaaagtt atcgtattta tatgaaaaca 1680 tctgaatacc caaaggatat ttaaaaaatc actgccagtt taacagtcct cagttggagt 1740 tttttttttt tttttttgag acggagtttc actcttgttg cccaggctgg agtgcagtgg 1800 cgcgatcttg gctcactgca gcctctgcct cccaggttca agtgattctt ctgcctcagc 1860 ctccgagtag ctgggattac aggcatgtgc caccatgccc ggctaatttt gtatttttct 1920 tagagaaggg gtctctccat gttggtcggg ctggtcttga actcctgacc tcaggtgatc 1980 tgcctgcctc ggcctcccag ggtgctggga ttgcaggtgt gagccaccgc actcggcctc 2040 agttggagtt ttatttggcc tcttttgatg cctggtggct tatgtgtgtc ctatacaaaa 2100 caaattgcct tttttcccca gaccccagaa atgtggacac gtgagttgtc tttattatag 2160 tccacgcttt ccttgtgttc ttctcctcta ggactttcta atatatagga caagtgattt 2220 tgacttgact taaaccctac ggaaataaga ttttgtttgt agatgcattt gtatggtgaa 2280 actggagcaa aagtcttcag cttagcggta tgtatcagca gggagctttg aaggcagcat 2340 ccatgcctgg cccctcctga agagctcaac aggtaggggt tggtcgggac attgagattt 2400 gtaacagcag cctccctact ccccagtaaa cggggatcgt aaggcacagc cgaggctgag 2460 actgctgtgg ggtagtctag gccgcctgct ggaatgtcgg tcggtgctgc attgagcgct 2520 gtctttcaag tggttgcgtg ggagacctgc acttgctgtg tgctaccttg actgaggact 2580 tttgaggaaa ttctgtttga caattacttt aaggtccagc gtgacaccct ccctcagact 2640 tagtctctag tgactttctg atgggagcgc ctctgcttca tcaaggattg gaggcactta 2700 gctctcaaca aacttaggtc 
gctttttttc ctccacagtt ttgttacctc ccttatttgc 2760 caggcttggc tttgagtgaa tcttggcttt tttggggttg tggggggctc tctgattcat 2820 tcagcagatc ttcactgagt ggcagatgtg tgcaaagtgc ttgggatgcc aaagacaagt 2880 gtgtcgtctg tatttagtgt tgggacagac agtgattgag acaacagatt taatcttcac 2940 tgatgacagt gatgatgcca cgaaagacat gggcgctgac agagggtggc tgctggtgta 3000 ggcaggatgg ttagggacct cctttctcag aactgaccgg gaagggatgg tggggaccag 3060 gccatttggg gcctcacaga tcagggctga tgagtttggc tccatttatg cttgactcag 3120 tttatttgag ggtgggagca ctcctgtttc tcagaattcc ctgtgcctaa ccccaggcgt 3180 ggcagataat actcgctcag taaatgttta ttgaacaaat gattgattga gagtgatgga 3240 acacacagaa ctgtgggtct gggagcagga gcgcgaggcc tgtctctgaa tgcagcctgg 3300 acaatgtgcc cttctcctca tccgccaggt gagaccagca ggagaaagcc atctcagcag 3360 tggttggtgg ttgtatttac agttaaattt tattttgaga tgctcatagt ttcacatgca 3420 gttgaaagaa tacagagaga tcctgtgtcc ccttctccca gctgccccct gggtaacatc 3480 tggtgaaatt caacgcactc tcagcaccag gatcggggca ctgatactgc caggacacag 3540 aacatcagga cggctccgtc cccgtgcaaa tccgtcatgt cggcctttta tagtcccaca 3600 cacaccgcta atctgttctc tatttttgtg attttgtatt tcaagggtgt tgtgtgcatg 3660 gggtcatgga gtgagtggcc ctttggggtt ggctgttttc cttcagtgtg attccctaga 3720 gcgttctcag tcccgccagt caagtttctt ccttttgctg ctgagtggta ttttgcggcc 3780 tggctggcct gcagcttgtt taaccattca cctgttaaaa gacacctggg ctgcgtccag 3840 attttagttg ttacaaataa atgttgctct caacagtcct gagcaggttt ttgttgaata 3900 taagttttca tttctctgat ataaatgccc aagagtacaa cttctgggtc atagggtagt 3960 tacgtattca gctttattaa aaagtgcctg ttttccagag tggccataca gtggtttttg 4020 tttgttcttt taaatgactt aacaatggaa ggaaagacgc ttctgcatgg cctggcatag 4080 aaagcccatg gagagccgcc ctgtgggtgg ggaagcgggt gacgggagcc ccactgctta 4140 ccagaaaccc gagtgaaact gcgtctttcc agtcccaaga ttctcctttt ttgctaaagg 4200 catcagaaga gatgaaagca tttctaccac ccacatttcg gggacactcc tagcttttac 4260 tttagagttg ggagagaggt ttggttttca ttgtcattta aatcctgttg ggtttcagca 4320 gagcctgatc tttagggccc tgggtgcagc tgttgtctgg agatgcagat cactgtctgc 4380 tgaaaagagc cctcctgctg gggttaggga 
tctcctgatt gaggcatgga tccaagggct 4440 tctttctttg ttctctgatt ccctgaggct cttccttgtg tgtgtggggtgc gtgtgcactc 4500 gtgtgagcgc accaggaact atgacagcaa tcaacgggta tgactggggg tggggagcag 4560 aggcagcatg gccaggaatc tatactcgag ctactaagag gaaggaagtc gcagcaaaaa 4620 tagcctgggt ggggtggtag aaaatattaa ggggaaaatg agatgactgg ggtggcgttg 4680 aagaagagct gcttaaagag aaacagggac acacgctggg taaggaatcg catgctgttg 4740 aaactgacgg tgtggaagga aagtgaggtg gcggcggctg ggctgtggta gaggaagccg 4800 atcaccagtg agtgaaaagg gaactagggt acccacgcgt ttgtgttttt tatttaaact 4860 ttggctaagt tttgctagga gcgttttgcc tttttttctc agaccttcta ctcctgtcgt 4920 tctcccctaa gttctgggca tgtggcttct tctctaccta ccagggtagc ctgtcattgc 4980 tttaaaggac aaaatgaaat ccctgatggg caaaatatgt cccccctctt cccccagctt 5040 ttggtcggct ttgctcaccc accaacagct aggagctgga aggctcagaa gagctggggt 5100 gggcatggga gagacagagg aaccccggtg gctggaaaca cagtcccttc ccagggtctc 5160 ttcagcttgc tggttggacc tcttagcctg cttatgaagc ctgaattgac atcatgccaa 5220 ctggtctcct gctgataagt gattccttaa cttcctcggg agcagcagaa acttgttttg 5280 aaaaaaatca gattctaaat ctttacagct gtttacttct gcaagcacaa tggccgtagt 5340 aaggaattca agtcacagcg cagccgtggc agggaaggct ctgcatggag tcacaggaca 5400 aagcctgtcc tggaaacgtg ctggaaggaa gtcggcctca ttctttacga cgggcagcag 5460 ctccaaatga gaaaccaagg ggcttggctc tgggtcagta gcatctgcct ggcatgcggt 5520 gccagctgtg gagaccagtg aggtggcagc agctgcactg ggctcctggg gtgtgcgtgg 5580 ggacctcagg ggtttaggag ctga tat act ttt ggt tct cat cat ctt gtg 5631                            Tyr Thr Phe Gly Ser His His Leu Val                              1 5 tgt tcc ttg cag aca cag aat gtt cag ctt aac aaa gaa atg aca ctt 5679 Cys Ser Leu Gln Thr Gln Asn Val Gln Leu Asn Lys Glu Met Thr Leu  10 15 20 25 gcc agc aac cgg agc ctg gca gaa gga aac ctt ttg tac cag ccc cag 5727 Ala Ser Asn Arg Ser Leu Ala Glu Gly Asn Leu Leu Tyr Gln Pro Gln                  30 35 40 ctg gac acg ttg aaa gca cgc ttg acc cag aaa tac cag gaa ctc cag 5775 Leu Asp Thr Leu Lys Ala Arg Leu Thr Gln Lys Tyr Gln Glu Leu Gln          
    45 50 55 gtt ctc ttt gaa gcc tat cag ata aag aag acc aaa tta ggt aac ttt 5823 Val Leu Phe Glu Ala Tyr Gln Ile Lys Lys Thr Lys Leu Gly Asn Phe          60 65 70 tta agg gtg atc att cga gaa taaagagttg agagaaggac acattttcaa 5874 Leu Arg Val Ile Ile Arg Glu      75 80 gctattgccc taccaggaaa gaaatattac ggaaaaattt caggttaaga tagcgactct 5934 actgcaatga acaaatagct tatgtatatt atttatttat ttttagagac ggggtcttgc 5994 tctgtcggac tggagtgcag tggcgcaacc atggctcagc acgacctcct gggctcaggt 6054 gatcctcctg tctcagcctc ctgagtggct gggaccacag gtgtgcacta ccccaagcag 6114 gctagttttt catttttttg cggagaggtt gtcttgctat gttgcccagg ctggtcttga 6174 actcaagcca tcctcctgcc tttgcctcac aaagtgctgc tattacagac atgagccact 6234 gtattggact agtttatgta tggaatatgg agggccccca gacctgcggc ctctctgcac 6294 ttgcggttcg agagggcagg gtgtatagca ggactgcccc ctcctcccag tggccctgac 6354 tggtggcggt gactgctgag gtcagcccag gtccacacgc acctccaggg tctgcagtgt 6414 gaagcaggtt gcagaaacgt gggtacctcg gggcagttac cagaaaaggg gctggagatt 6474 aggaggaagc cagccagttt agcagtcacc agtttagaag aggagactta accacttttt 6534 aaattcaaga ccccaccttc tctaaacagc atagaaaaag ctttctgaat tatgatatta 6594 aacagatgtc ttatgacgtt tgcccgtttg tcttatgctg tttgttaatg gt 6646 <210> 5 <211> 5564 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (3913) .. 
(4422) <400> 5 gtattagcaa aaaaaaaaaa aaaaaatggt aaccaccgaa atgtaggcag ttaccatatt 60 tttaggatac agaaggagga gaaaggatat gaaacatttg tacaatggca tactctaaaa 120 aagaacgagc tagctgtgta ctaatatgga acaagtgcca agatatacac ttagataaaa 180 taatcgggta tgtaagagta tgcttcaaag tcttcaaaag aaaaggaaaa acaaaagata 240 cagataggct tgtaagaatc tcagaccatt tccagagcac ccaataaact ttaacagagg 300 ttactgctgg gcaatggaaa gaggggattg ggaggtggtt tgaggagtct aggctgacac 360 ctgaatcggc agagaggagg cggtgacgtg ccacttaccc acaacatgta ctggctgaac 420 cgccgcagga agtagaccat ccagtttcca agggacttta aggtgtcagg agcaagcctt 480 ctccagatat tagggatatg cccgataaaa agagacctgg ccacatcatc taactcattg 540 ctcattccaa cttctccagc caaggcctat ttagaatcaa gagcaatcgg gcaggactct 600 atcagaaccc caggtgccac gtatctaggc agtaatctgc acatgagaga caggctcacc 660 ctttgaagtt cagccagaga cttcgtcatc cggaccacaa gcttgttgaa gcgttccagt 720 tcctgcagga gcaccaccga agtgggggag agtcctgttc cgaggcgctt cctcacctgg 780 tccaagtcaa agactttggg catcttgttt tctatttctt tggccacttg gccaatataa 840 tcatcgcggc tgataccact gctggattcc cctaaaacca ggacactttt cagagaaagc 900 aaggcatcca cgtgcagaac cggagcctcc tgagccttcc agttctggga aaggcatcag 960 caacgtggga gggctacaag aagcaatcgt aaaaccccaa tcaaactaat gaagtcctcc 1020 tgagtagagt ttacctgtct gaggctgcag ctccagcagg tgagcccaca tgtctcgagc 1080 cgcctgcgtg taatagccaa tctcagcgtt ggggtggaga ccaaacactt ctggcgtgtt 1140 ggcaagcggg agggcctcga tggcttctgc aatcacagag aacaacggag gtggcactgt 1200 ccagccacct gtgctttctc agggcccaca gcctcaaagt ctccaagaca tgcaagacta 1260 atttccactg tttagagggt gacttttttt tttttttgag acagggtctt agtcacccag 1320 gctgaagtgc actgacatgg tcattgatca ctacagcctc gacctcctgg gctcaagctc 1380 ttctcccatc tcagcctccc aagtagctgg tactacaggt attcaccacc acactcagct 1440 aatatttttg ttttttgttt ttttgttttg attttttgtt tattttttgt agagatggga 1500 tcttgcttgg ttgcccaggc tggtcttgaa cttctgggct caagcgatct gcccttcttg 1560 gccttccaaa gaagggatta caggtgcgag ccaccatgac cggccaatga catgttttta 1620 atgtaaaaat tctacactac cacacccagt ccagccgtat gttcatcttt tacgaaaatg 1680 gctcagtgag 
cttcactgtg gccattctgc ccttccaggt ggaacctgat ttatttacca 1740 tcattccagg atgggaaatc tgaactctat cagtcagctc tgcagccagg aagcgtctgc 1800 cgagccacct aacaggctta acagaccaca cctgaccagc cgaggagccc ccaccaggag 1860 aggagccggt tgaaatgtat ggtctccctt agaaaaagcc ttgaaaacat cttttcatgt 1920 ctgagaaatc tcaccaacaa atttctcctt ttcatcacca acagggattt tgtagtccac 1980 ttccttgttc cggaagaagt ggaatggctg gaaagtatca aaaatgaagt cccccaggta 2040 ctcatccatg tagatggtca ggatgcggcg atcaaagctg tcgatggccc gtcctccata 2100 catgacctga aattagaggg cctgagagcc agccaggagc tcagaaccta ctgtgtgcac 2160 cctgggggcc cggggccaga ggctcaccca gggtcaagag actccagcag cctggctgct 2220 gcaaggcttg aacctgttga cttgcataac ctttcgggcc actgttcaca caattgatct 2280 gtggctttgc tggcttccca agggataggg cccaaagcag taagaagccc tcaggaagtg 2340 ttgggagctc ctgtgcagca ggcaaagtcc cccaaacaca tcaaacaaac agctgaagaa 2400 gccagcagct gcttctgtct ttggaattga tgtcactgcc tgagccgtgt gcagaccacc 2460 ggggatgctt cctgtgagct ctgtatacag caacagctga ggcaaggggc tagaggatgc 2520 cttgcctgga tccccagcca tggatccagc tcagggaacc tccatgtgaa cagagctcac 2580 ttaggacaaa gaacaaccag gtgagagcag agagaattgt cacagaaacc caaggggaac 2640 acaggggcag aagaggctgg ttctgagggc tcagaggaaa ataggggcac gaggccacag 2700 ctctgtagcc ccagaggcac ctgcggaagg atccccggtc acccctacct ctccaattag 2760 gtacttgagg ctgccccacg ggatccttgg gtcccgttgc tggaaggctt tcgttaagta 2820 cgtgttcaga atttccatgc agaccttgaa agaagagaac acgctcagtt atactgaaac 2880 ccaccaagct gttcacattc tgatactttt ctgaaaaggc tgctgggaac ggtgcaaccc 2940 tgctgtggtt gaacccagag gaaagtcggg aggaagtggg ccgcctaagg aagccatttc 3000 caaggagaag cagccactgt cacctggaag tcagactcat tgaagtcata gtacacgttc 3060 cagccaatct tcccaaactt ccttctctcc tgcaccacag catgaaagaa cgccagcacg 3120 tagaccagcg gcttgaaggc agggtgcggg cactggtcca gcatttcgtg agagatcttg 3180 aagtaagttg ccctcatgtt gagtttcagc ccattgggtg gctcggtgac aaccttcaga 3240 agagaagaaa acacacagtg atggagaccg caggcccaca agcttggaag agtgtgcaca 3300 atgcaggcac tgcgctttcc ttcctctcta cggagccatc tgtctgagac ctccaatagc 3360 taaagtctct tgtccagggc 
ccaggctgag aagttaaagg actagatgtt ctggctctca 3420 gggtcaaggg tcaagagtct ttgaccttcc tctgaaccat gagatccttt actatatata 3480 aaatgaaagt aaagtcttcc ctccacctcc ccagagagag agagagagag acagagaatg 3540 acaaaacctt tccaaaagaa tttcctggag aataaattaa atggggcatg taaaccacgc 3600 gaaactcctg cagtcagcac tagcgaactg caggctgtgt tggggggggg gccgtgaagt 3660 ttatgaaaag tagcgtagat attgttaggg acacatgcag aattggtaga gataggcctg 3720 ggagtcacaa acacccaacc caggacagcg gctccctcta aagagagagg tgggattgag 3780 aagggtttca accgcaaggt ttcacctgca gcctaaggtt tatttctaca cagacaaagg 3840 cacaccccgg ccccagagca gcatgcaggt gttctgagaa gctgcgtgct ggcatgttga 3900 tgtccatcct ga agc cag acc ttt agg gac ttc tgc aga atc cca atg ggg 3951               Ser Gln Thr Phe Arg Asp Phe Cys Arg Ile Pro Met Gly                 1 5 10 aag ccc ttg gtg ggg tcc gtg gtg agc cac agg cgg aag tct ggg tgg 3999 Lys Pro Leu Val Gly Ser Val Val Ser His Arg Arg Lys Ser Gly Trp      15 20 25 ggc ttg gtg atc ctc tcc agg gac ttc tcc aga tct ttc agc cac ttg 4047 Gly Leu Val Ile Leu Ser Arg Asp Phe Ser Arg Ser Phe Ser His Leu  30 35 40 45 acc agg agg tgg cag ttc tgc agc atc agc cac tgc ccc cga gcc acc 4095 Thr Arg Arg Trp Gln Phe Cys Ser Ile Ser His Cys Pro Arg Ala Thr                  50 55 60 gcc gtc tcc agc agc tgc agg gcc acc tgc gcc ggg gga gag gga aga 4143 Ala Val Ser Ser Ser Cys Arg Ala Thr Cys Ala Gly Gly Glu Gly Arg              65 70 75 gga gag ggg ttg gca tcc aaa ccc aac ccc gtg ctc tcc aga gac cca 4191 Gly Glu Gly Leu Ala Ser Lys Pro Asn Pro Val Leu Ser Arg Asp Pro          80 85 90 ctc cac acc cac aga agg gcg atg gca gct cca gca gaa gcc aaa agc 4239 Leu His Thr His Arg Arg Ala Met Ala Ala Pro Ala Glu Ala Lys Ser      95 100 105 aag ccg agg atc cag gag cta cag gtg gag cct tca aat aaa cca tct 4287 Lys Pro Arg Ile Gln Glu Leu Gln Val Glu Pro Ser Asn Lys Pro Ser 110 115 120 125 ttg aga aac ccc atg ggc ccc gtg ggt gca ctg agc tgg tca cac cca 4335 Leu Arg Asn Pro Met Gly Pro Val Gly Ala Leu Ser Trp Ser His Pro                 130 135 140 gcc 
tgg aag atg ggt gag cat ttt gca ctc atc tgc ctc cct ggc cct 4383 Ala Trp Lys Met Gly Glu His Phe Ala Leu Ile Cys Leu Pro Gly Pro             145 150 155 gga agt ccc agc tcc tcc cac aca gtc gct cct gtc ccc tgaggactcc 4432 Gly Ser Pro Ser Ser Ser His Thr Val Ala Pro Val Pro         160 165 170 tgagcagggc tgcaagagac cagctaaagc ccaccaggaa gtcgggactg tcctggccac 4492 tgtaagatct tcagggccct aggcctggaa aacaatcacc atgtcccctc ccaaatgtaa 4552 ttcaaagcac aaacaagcta aacacccaaa tcaagccttg ctttctgaaa tggcttaaaa 4612 cacatgagtg ctggcgttac aatagcttat ttcagtgttg tggttgagag agctggcccg 4672 ccttgctggt gcccagggct tgcctcctgg gccatccagc atgggtattg ccatcctcat 4732 ctcacccagg ggctttcagc agtagagggg cttacccaaa gcctcacagt tcatagcaaa 4792 gccaggactg gcagtcagct ccctctgacc ccaacacgca tgcctgcaac catcacactt 4852 actgcagcct caaactcttg ggctcaagca atcctcccac ctcagcttcc ttagtagctt 4912 gactacaggt gtgcaccacc acgcctggct aattcttt tattttttat tttttagagg 4972 cagtatctca ctatgttgcc caggctgggt ctcgaactcc tggcctcaag cgatcctccc 5032 acatcagctt ccagaagtgc tgggattaca ggtgtgagcc accgcacctg acccagatgt 5092 gcgttttaaa attcattgca aacacaaaga tattgggata tttcaggaaa aaatctgaag 5152 tcctagcttc tttcagagtc agaggatcca gccactgtgg gttcccaatg gcacctggta 5212 tcagtcagca ggagctaagg gcagctgccc agggaccagg accccaccca gcctcatcac 5272 tcactgccat ggcctgctgg ctacaagggc acatctgagc tgcagcccca cacgctccgc 5332 ctggcttcgt accccatgct agagctacat tgtccagtgc gatggccgtg aggcacagca 5392 cttaaaatgt ggctagcccc aattgagatg tgctgtaagt gtgaattcac acaggatttt 5452 gtagacttaa taccagaaaa aagcatgtaa aatgtctcaa tcatgttcat attgattata 5512 cattcaaatg ataattttaa tatattcatt taaataaaat gttattacaa tt 5564 <210> 6 <211> 5271 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (3) .. 
(365) <400> 6 tg aga gaa gag ggg aca gaa gct gaa gcc tgc atg ccc acc tcc cct 47    Arg Glu Glu Gly Thr Glu Ala Glu Ala Cys Met Pro Thr Ser Pro      1 5 10 15 ctg gtg ctc gct aaa gac tcc agg tcc cct ggg cag gac aga gag agg 95 Leu Val Leu Ala Lys Asp Ser Arg Ser Pro Gly Gln Asp Arg Glu Arg                  20 25 30 gtg gag aga agc cag gag gga cat aag cca gag gac agg aag ttt ctg 143 Val Glu Arg Ser Gln Glu Gly His Lys Pro Glu Asp Arg Lys Phe Leu              35 40 45 gtc cag gtc ccc agg tcc caa gcc cca gat ttt aag cct gaa ctc aaa 191 Val Gln Val Pro Arg Ser Gln Ala Pro Asp Phe Lys Pro Glu Leu Lys          50 55 60 gcc gca cct ctg tgt ctc tta cct gac aga ctg ggg agt ggg tgg gac 239 Ala Ala Pro Leu Cys Leu Leu Pro Asp Arg Leu Gly Ser Gly Trp Asp      65 70 75 gag ctc cag cct cct ccc tca ctg ccc caa gca gct gcc cct ttg gct 287 Glu Leu Gln Pro Pro Pro Ser Leu Pro Gln Ala Ala Ala Pro Leu Ala  80 85 90 95 tct gag ggt ttt gtt gcg agt ggc aag tgg gag cca cca gat gtt cca 335 Ser Glu Gly Phe Val Ala Ser Gly Lys Trp Glu Pro Pro Asp Val Pro                 100 105 110 ccg cta aag agt gac aga ggt tcc agg ttc tgaccctcca actcctcatc 385 Pro Leu Lys Ser Asp Arg Gly Ser Arg Phe             115 120 ccaactctga agcttcattc aacttgacgg aagctcatct gggcccacgc ctcctctttc 445 cagacccact ttggtctctt cctctcttga gcatccccga ggcctgagaa gtcaggtcag 505 gtgaaacctc aggctcccca actcccacct cctgctcccc agccacctcc acccaagcca 565 agtcctgcac gatgtgggga gggaaatgga gagggggtgt ctcagcatgg attcccccac 625 gccttgcctc tctttaaaat ttttcattat attctattta atatatttta aagacctcca 685 tcaatgaatt tgaaagtttt aaaatataat acttttaaat ggcactctta gacccatcat 745 ctccccaaag tacttgaaac taatgagccc tgctgaggcc acctgcaggt gcttcccgtc 805 cactcctccg cccccaggta accgccttcc tgaattctgt ctttctaaaa gctgctttga 865 tctaattaca cagcagttct gtcctgaatg ttgtctttct catttctttg cttgtttttt 925 taaagttttg ctgtgtgttt ataaatctct agataattat ttaatttttt aaaaaatttt 985 tagaaagatc tcataggtag gctcttgctt cttgctttct tcatttatca ttatggtgct 1045 aaaatacatc cctgctgttg ttgttgctgc 
tgtttttgag actgagtttc gctgttgttg 1105 cccaggctgg agtgcaatag tgcggtcttg gctcaccgca acctccgctt cccaggttca 1165 agtgattctc ctgcctcagc ctccccgagt agctgggatt acaggcaggc gccaccatgc 1225 cctgctaatt ttgtattttg agtagagacg gggtttctcc atgtgggtca ggctggtctc 1285 gaactcccga cctcaggtga tctgcctgcc tcaacctccc aaagtgctgg gattttaggc 1345 gtgagccact gtgcctggcc catacctgtt gttttatgtc agtgtagttc attcattccc 1405 atagctgtat aatttttttt tttttttttt tgagacgggg tcttgctctg tcacccaggc 1465 tggagtgcaa tggcgcaatc atagctcact gcagcctcaa cctcctaggc tcaagagatc 1525 ctcccacctc agcctcctga gtagctggca gataggcatg tgccaccacc cctggataat 1585 tattattatt attattatta tttttagaga caagcctgtt gccctggctg gtcttgaact 1645 cctggattcg agcaatccct ctacctcagc ctctcaaagt gctgggatta caggcgtgag 1705 ccatggcacc tggctcatat tcttttgtgt gaatataaaa cggtttattt aaccattcta 1765 cctctgatgg accatggagc tatatttggg gtggtggagg gtaatcataa acaatgctgc 1825 tgtcttagtc cctttgtgct aaatggactg agactggatg atttataaag aagaacagga 1885 acttatttct gacagttctg gaggctggga agtccaagat caaggcactg gaaggttctg 1945 ttgtctggtg agagctgctg ccttcttcca agatggcgcc ttggggctgt accctccgga 2005 ggggaccaat gctgcaccct cacgtggagg aagggatgga agggcaaaag agctgtggct 2065 gtgtgaagtc tcttttctaa agaccttaat cccattcaca agggaggagc cctcatgacc 2125 taatcacttc ctgaaggccc taccgctgta tactatcaca ttgataagtt tcaacatatg 2185 aaatttaaga tcatagcagc acgtctgaat gtacttatcc atgacttctg gaattgctct 2245 tggatttatg gctaggagtg aaattgctgg attgtggggc atgagatgaa ctggaaagtc 2305 caaacctcag ctgggaagaa agagcccccc acctggtcag ttacccagaa tcttccctct 2365 tgctaacttt atctccttct gttcagcttc acaaaaccat gctcaattag gttccaaggt 2425 gtgcctcggc tcccctcttg atcccattct ttggttgtaa ggttagcttc ctggttcttt 2485 ttgtttgtga ggggaagggg agacagagtc tcattctgtc acccagcctg gagtgcattg 2545 gtgggatctc agctcactgc aaacctctgc ctcccaggtt caagtgattc tcatgtctca 2605 gcctcctgag tacctggaat tacaggcgct tgccaccaca cctggctaat ttttgtattt 2665 gtagtagaga caaggtttca ctctattggc ccggctgatc ttgaactcct gacctcaaat 2725 gatccgccca ctttggcctc cccagatgct ggaattgcag 
gtgtgagcca ccgcacctgg 2785 ccctgattct taaacctact ccaggtttgc actttatcca cctggctttg aagaaaagag 2845 agaagggcaa gcagcccatg cccccataat caactcctgc caggggcatc ataagaattc 2905 caggctttag cacagtgcat ggagggaaga ctttcaggct gcagaggtgc ttcctaaacc 2965 tgccctgcct gtcctgccac ttcggagcac taacacttag tgtttccttg gtattatccc 3025 tcagatgctt ggaagaaatg ttcccatgag aaggctgtgt aggatagtga ttaagactgc 3085 agtctccgga gtgttctgtc tgggttcaaa tcctagcccc attcctttgc taggtgagag 3145 actctagaca agtcccccta ccttccctgg cctcaatttc catgagctgg gtgtgaggaa 3205 gttcactgag tcactgtgag gattaagtta gggtaataca agtcaaacac ctagaacagt 3265 acctggcata aacattcaat aaatgctagg gatcattttt aagtagcata ataaatgaag 3325 cgcacccaga aaaatgtgac caaagcaagc cctaaatgaa catgagctat tggtatgatt 3385 gttatcgtta ccagaaaaac ccctattcat ccttcaagat ccaaccaaga tatccttccc 3445 tgtctctcta ttctgacggc atttggtgca gagctcataa tctcatctat caggttgtgt 3505 tttaatctac ctgtttgtgt gtttgttatt gttgttgctc tttcttcttc ctcagctaga 3565 ttgtatcttt aagaacgggg actatttttc tctgactaat cctcaccctg acgtccttga 3625 tggccagcac agcgtagatg ctccattgat gaaatgaaag acattggggc ttgtatgtct 3685 tagaggcacc tcaaggctct ggaggtgcct ctaagtttgc agagcaggca ataaagggaa 3745 accagcctgg ctgagtttct tcttttctct gggttgcaat aattctgggg gaggaagagc 3805 caaatggtct ctgtcctgcc ttctgtcctc tccagtacct tgcctgtccc tctaggagct 3865 ccagccagat gccaagatag gtgccaaccc acccccaccc cgctattttt tctgaccctc 3925 cctaccacct ggcactgcag aaaactgact tccttcacat tgtgttgttc tcatgggctg 3985 cttcagacca cgaatccctt ttcattctta atagagtcca ggagaggtct tgcacggtga 4045 ttgccgattc aagtacgctc ctctgcagag gtcgattatt gcttttattt aaataatagc 4105 cattgaaatg attagattgc tgggcgctgg aaagtctggg gtgggggatc taatcacaca 4165 cagaataact ggcaaatcat ctcatttcaa ccttttctcc ctaatcacga gctgctcctc 4225 cctttttgag catttgggtg cccctgtcca cctctccaga atctatgtct gtgggaatac 4285 aggtgcaaga aaggagagac atttcgcaag cacattaaat atctctcaaa catgaaccca 4345 agggtgaagg ctctcctgat gctcagagaa gattatgctg cacactgcct gctgaaatca 4405 ctctagcctt cgtggcaggt gatagaggcg ctttgggcgt gatcttctga 
actgtgctaa 4465 gtaagggctg ccatttggcg ataagctgcc ctcagaagtc caacactcat ccaggcagaa 4525 agagcgtggt catttagcct ggatgctgcc tcttgcaaac tcatctcttg tccacgatgc 4585 acctcaatag tccttggtgg agacccttca aatctatgca gggggtcata tatgtaagtg 4645 ttcatactgt aaacagaagg gtaatttatt catatctatt gaatgcatgc ctaccatgtg 4705 ccagatacta ttaggaatgt gaagtgaata tagcaatgcc aggatgccag ataccaaagg 4765 ccataatcac tttcatatag atagttgatg atgaagaaag agagagagat aagtggaaag 4825 atgataggta gatagggaaa taatttaatt ctgatagtgt aagcatgcac acatctgggg 4885 aagaacattc taagcatcaa caacaagtgt aaaggcttca ggtgcaatac atttggcaca 4945 tttgtggcac catatgtact tttgcaggca attctgttag aaatggagcc ataagcggcc 5005 gggtgtggtg gctcacgctt gtaatcccag cactttggga ggccaaggcg ggtggatcat 5065 gaggtcagga gatcgagaac acgttgaaac cccgtctcta ctaaaaatgc aaaaagttag 5125 ccaggcgtgg tggtgggtgc ctatggtccc agctactcgg agaggctgag gcgggagaat 5185 ggtgtgagcc cgggaggcgg agcttgcagt gagccgagat cgccccactg cactccagcc 5245 tgggcgacag agcaagactc cgtctc 5271 <210> 7 <211> 5354 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (3396) .. 
(3626) <400> 7 gttccgggct ccgggctctg ggtggcggcg gctgtgagcg gcggcactgc ggcgcaggcc 60 agcgggcgcc gtcggcggct ggccctgtcg gccgcgggat gaggaagcgg accgagcccg 120 tcgccttgga gcatgagcgc tgcgccgccg cgggctcgtc ctcctccggc tcggccgccg 180 cggcgctgga cgccgactgc cgcctgaagc agaacctacg cctgacgggc ccggcggcgg 240 ctgagccgcg ctgcgcagcc gacgcgggaa tgaagcgggc gctgggcagg cgaaagggcg 300 tgtggttgcg cctgaggaag atacttttct gtgttttggg gttgtacatt gccattccat 360 ttctcatcaa actatgtcct ggaatacagg ccaaactgat tttcttgaat ttcgtaagag 420 ttccctattt cattgatttg aaaaaaccac aggatcaagg tttgaatcac acgtgtaact 480 actacctgca gccagaggaa gacgtgacca ttggagtctg gtgagtgcca gtgagctctg 540 cagcccctcc agcagcagga aaggacccta catgactgag ctggtgctgt ctttgacatt 600 ggcctctgaa atgttgttca tttttatctt cacctcttcc tttcctaaag gaaagatctc 660 agtacatttg atccatccta atgttgaggg attcaatggc tttaaactct attcccagcc 720 tttcaaaaca gcaaaggctt tgtgaacaga ttattatttt tagagagact tgtttctagg 780 aaaaacgaaa agcaaacaaa gaagctagaa acaggatata agtggccggc acagtggctc 840 acacctgtaa tcccagcact ttgggaggct gaggcaagtg gatcacttgg gtccaggagt 900 ttgagaccag ccttggcgac aaggtgaagc cccatctcta ctgaaaatac aaaaattggc 960 caggcatggt ggtgcacacc tgtaatcccg gctacttggg aggctgaggc aggaggatca 1020 cttgactcca ggaggcagaa gttgcagtga gctgagttca cgccactgca ctccagcctc 1080 caggagtttg agaccagcct tggcgacaag gtgaagcccc atctctactg aaaatacaaa 1140 aattggccag gcatggtggt gcacacctgt aatcccggct acttgggagg ctgaggcagg 1200 aggatcactt gactccagga ggcagaagtt gcagtgagct gagttcacgc cactgcactc 1260 cagcctggat gacagagcgg gactctgtgt ccaaaaaaaa gagaatataa gggaagaatc 1320 acaaacactg gtttgaatat atcttccatc caggcgcggt ggctcacgcc tgtaatccca 1380 gcactttggg aggccgagac gggtggatca cttgagatca ggagttcgag accaccctaa 1440 acaacatggt gaaaccccgt ctctactgaa aaaaaaatac aaaaattagc tgggcatggt 1500 ggcacgcgcc tgtagtccca gctacccggg agtttgaggc aagagaattg cttgaacgca 1560 ggaggcagag gctgcagtga gctgagatta tgccactgca ctccagcctg ggcaacagag 1620 cgagactctt gtctcaaaaa aaaaaaaaaa tatatatata tatatatata tcttttccaa 1680 gaggtgaaca 
tctagatact gaaaacagct tgatttcttt tcacatatag cttcgcacca 1740 gctgatgatg atggtgttgg ctgtcttctg agtatttact tttttaagcc ataagtttac 1800 actggacata agccttgtga cttctctgtg tctccttgga atacatttta atgttttctt 1860 ctgtacacaa gtggtgcttc tgaacagctg aattaaaatt gaccaacctc actgtagcat 1920 ttccgttttt ctctgagaag tgagcctgaa tcacagccct tttatctagc tgaatttatt 1980 tttcagtagc cataacgtcg ctttccactg tctcagaaaa ccactgtgta tgtgagaagc 2040 ttctaaatga aatgctgtga gtagcttccc agtgtggatg agagaacatt cagacatttc 2100 taacagtggt ccttggtaaa gaacataaaa caagaaggga gatttttgtt cttagtagtt 2160 ggtgctggga ttaaagtctt ttgtcacttc ctgatttgat ttaatctagc atttactgac 2220 caatgtgtga ctgtgggact cctcaggttt ctgtctgaag catgacttgt tttgcacaga 2280 gctgctgagt gggacttggt cagccctctt cttggttaac atgagtctgt cctattgcag 2340 gagggtgttt atggtggtga gacttaggct ggaagtacta acaactcctg catacattat 2400 cagctaccag aaaaggtttt gtttagtgat gtcagtggca agtgtcaagg cagtgacagc 2460 cacgctgtgt gttgctgtgt gttggtgggt gcgtgttggt gagtgcggct gctgcccact 2520 ctcatgccct aggcagtcac aatgacattg ggccctcagg cagcagcagg cccagctcac 2580 caactgcccc caccatcgct tcatcagacc ctcccaccag cttcctgaaa ggagggcctc 2640 aaggcctaga gtggagcctg ggggccattc ctggccatgc ctggccatcg cctgtctgct 2700 gagaggagag gcccaggaat gggtttctct gctggctagg agttccctgg ctctttgtca 2760 ggagcccagg tttggggcgg ctggctactg gaagcacatc tgctcccagc accttcaaaa 2820 tgtgacgtcc tatccctcaa atttataact gtatgggagt gatgatcata gtgactagaa 2880 ccatcccaaa agtgatcctt ttgaacggat cagagagaga cgttagaaat gagatgctga 2940 agcttaagat ctagaaaata agaagttggc cattaaattt ttttaaagat tcttgtcact 3000 ggtatccaga atgaatttga ataagcttta cagaaaagtg attaatttgg accctcccca 3060 gtgattaaca gtattcgatg tggcctgagt taaaattaac ctcagtatct ctctgaagga 3120 gggtttggct agatagacca gcagcttcag agtgcccagc caggtgtgga ccatgagcag 3180 actgggaagg tggtgcatac ttgacattag aaatggcgtc agcgatggcc cggtttctag 3240 tctggctcac aaacattggt gttgctgaat tagcaaatgt gacagcatgt actgtttaaa 3300 atccattgtg agttaaactg tggacaattt aagcatgggg ttaactggga tgtggggcac 3360 gtatgaggaa aatgctatta 
ctcaagatgg actaa gtg ctg ctt ctc tgc cag 3413                                        Val Leu Leu Leu Cys Gln                                          1 5 gcc cca gtg gta act ggc aca ttg caa gcc ttt gtg gcc cag agg agg 3461 Ala Pro Val Val Thr Gly Thr Leu Gln Ala Phe Val Ala Gln Arg Arg              10 15 20 ctg gcc tgc cag tca ggg aac gct cct gct gaa gct gtc agt gcc atc 3509 Leu Ala Cys Gln Ser Gly Asn Ala Pro Ala Glu Ala Val Ser Ala Ile          25 30 35 cac aca gcc tgc agt tcc gtg gtt att gat tct agt gat ggg gat agt 3557 His Thr Ala Cys Ser Ser Val Val Ile Asp Ser Ser Asp Gly Asp Ser      40 45 50 ttt cta cca ccc acc ccc atc tgt tcc cct gta gtt aga tgg ggt ctg 3605 Phe Leu Pro Pro Thr Pro Ile Cys Ser Pro Val Val Arg Trp Gly Leu  55 60 65 70 cag ctc cct gct gtc tgg act tgagctgatt aagtaacacc acgtgcagtc 3656 Gln Leu Pro Ala Val Trp Thr                  75 gccaggatga ggacagaagg gacatctgat cctgcgctcc cctctgcttc tgtctgcagg 3716 cacaccgtcc ctgcagtctg gtggaagaac gcccaaggca aagaccagat gtggtatgag 3776 gatgccttgg cttccagcca ccctatcatt ctgtacctgc atgggaacgc aggtaccagg 3836 tgagggagag gagcccatca tctttgctgt ggagcatggc tcctttttgg ggtgggaatg 3896 tcctggcccg gtctgccgga aaccagcagg agcaacctgc ggtcccacaa aattgggtgt 3956 actgacccac tgcaatgagg gagagcctcc caagaacgaa gggaaaggca gagttctgaa 4016 ggggttgagg gaaggcggag tttaggtgac gtttacatga agcagggttc ccacaggctg 4076 aggccagcac ggctgtgttc cgtggggacc aagtgtggga agctgtgtgc agaccgtccc 4136 ctatctggag ctgcagcctg gattgggatg aaggaggctg agtctcagtc aggagctgag 4196 ctccccagcc agaggggcat gtttttttct ccttgttgta atctcaaagg tcacagcatc 4256 tgctggtatg tgatttgggg agcagggttt cttggagagt gagggctttg gggtccctcc 4316 gaggaggggt catagtggca cttggctgtg ggtgcctcgc gctcttggga atgggcggct 4376 tcagctcact ggcagcttgt tttctctata tctgtcatgc tgcctgaagt ggcaggcaga 4436 tctttgttac ctgtaagata gctccaggtg gcagacaggt gaagccggag ccagggcagc 4496 cctggaggag gtggcacact tcctggaaaa aagtggcagg tgctgagaag gggcaggagg 4556 gagaggctga aagcaccagg tgctggggtc tttgcaggcc accctacagt gtgagctctc 
4616 tgctcaccct aacgggctgt tgctgaagtt catgcactct ttggttaatc agccctgccc 4676 agttttatct cagatgcttt tcctgagtgg gtctaatggc ttgtcctttc cactggctgc 4736 cctggagttg gcctccctgc cttaaaaggg tgctccttcc acaccatgtg attgctgttc 4796 agaagagcct ctgtgggaga acccagggtg ggtgcggagg ggatacattt tctgaacaaa 4856 gcattgtttg aaaaccacca aggtcttcgt gtgtgagaga ggcagccagg agccagcaca 4916 gaggcagacc cagcacggag acagggctgc tgcggggcca gggccgaggt cccgtgtggc 4976 accagggtat gtcagggctg ggggcaaaca ttttataggc agagccagca aggagagctg 5036 tgctgggccc ttctggaagg gaaagcaggc ctaggtgggg tctgcatggt ggaaccctgc 5096 agagagcatt gctgaaagag cctggcggca tggcccgtga agcggtttct gtgtcaggga 5156 tggcgtgtct agctacagaa tcatgcatgt gctgcacagt ttcctttctt ctttgtctgg 5216 gacaagctgc agagggtttc tcttctgctt ttcaaagagc aagtggctac aaacagctct 5276 tgcagcatct ttttataaat gacagaactg tggttggctg tgaataaggt tcagttctgc 5336 agtgcccatc acatggtt 5354 <210> 8 <211> 5175 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (3) .. (2039) <400> 8 cc cct ctg cct tgg aaa acc aga aga atg cca ata tat tca tca gtt 47    Pro Leu Pro Trp Lys Thr Arg Arg Met Pro Ile Tyr Ser Ser Val      1 5 10 15 aca ctt tca ctg ttc ccg caa agc caa gca aat ata tgg cta ttt gaa 95 Thr Leu Ser Leu Phe Pro Gln Ser Gln Ala Asn Ile Trp Leu Phe Glu                  20 25 30 gac act gtt tta att act gga aaa tta gct gat gcc aag tct caa cca 143 Asp Thr Val Leu Ile Thr Gly Lys Leu Ala Asp Ala Lys Ser Gln Pro              35 40 45 agt gtt cag ttt tca aaa gcc tta att aaa ctt cct gac aac cat cat 191 Ser Val Gln Phe Ser Lys Ala Leu Ile Lys Leu Pro Asp Asn His His          50 55 60 att agc aac gtt act ggc tat ctt aca gtt cta caa cag ttt ttg aaa 239 Ile Ser Asn Val Thr Gly Tyr Leu Thr Val Leu Gln Gln Phe Leu Lys      65 70 75 gtg gac aat ttt ctg cat aca act gga att act ctc aat aaa cca ggt 287 Val Asp Asn Phe Leu His Thr Thr Gly Ile Thr Leu Asn Lys Pro Gly  80 85 90 95 ttt gaa aac att gaa ttg act cct ctt gct gca ata tgt gtg aaa ata 335 Phe Glu Asn Ile Glu Leu Thr Pro Leu Ala Ala Ile Cys Val Lys 
Ile                 100 105 110 tat tct gga gga aaa gaa cta aag gtc aat ggc tct att caa gtt tct 383 Tyr Ser Gly Gly Lys Glu Leu Lys Val Asn Gly Ser Ile Gln Val Ser             115 120 125 ctt cct ctt cta cgt ctg aat gat ata agt gca ggg gat cgc ata cct 431 Leu Pro Leu Leu Arg Leu Asn Asp Ile Ser Ala Gly Asp Arg Ile Pro         130 135 140 gct tgg aca ttt gat atg aac aca ggt gct tgg gta aat cat ggt cgg 479 Ala Trp Thr Phe Asp Met Asn Thr Gly Ala Trp Val Asn His Gly Arg     145 150 155 gga atg gtc aag gaa cat aac aat cat tta atc tgg aca tat gat gca 527 Gly Met Val Lys Glu His Asn Asn His Leu Ile Trp Thr Tyr Asp Ala 160 165 170 175 cca cat ttg ggg tac tgg ata gca gct cca ctt cca gga act aga ggt 575 Pro His Leu Gly Tyr Trp Ile Ala Ala Pro Leu Pro Gly Thr Arg Gly                 180 185 190 tca ggt ata aat gaa gat tcc aag gac ata act gcc tac cac aca gtg 623 Ser Gly Ile Asn Glu Asp Ser Lys Asp Ile Thr Ala Tyr His Thr Val             195 200 205 ttt ctt aca gcc ata tta gga gga aca ata gtc att gtc att gga ttt 671 Phe Leu Thr Ala Ile Leu Gly Gly Thr Ile Val Ile Val Ile Gly Phe         210 215 220 ttt gct gta cta ctt tgt tat tgc agg gac aag tgt ggt act cca cag 719 Phe Ala Val Leu Leu Cys Tyr Cys Arg Asp Lys Cys Gly Thr Pro Gln     225 230 235 aaa aga gaa aga aat atc act aaa ctt gag gtc ctc aag aga gac cag 767 Lys Arg Glu Arg Asn Ile Thr Lys Leu Glu Val Leu Lys Arg Asp Gln 240 245 250 255 aca act tca aca aca cac ata aat cat atc agt aca gtt aaa gtt gca 815 Thr Thr Ser Thr Thr His Ile Asn His Ile Ser Thr Val Lys Val Ala                 260 265 270 tta aaa gct gag gac aag tcg cag tta ttc aat gcc aaa aac tcc tca 863 Leu Lys Ala Glu Asp Lys Ser Gln Leu Phe Asn Ala Lys Asn Ser Ser             275 280 285 tat agt cct cag aaa aag gaa cca tca aag gca gaa aca gaa gaa aga 911 Tyr Ser Pro Gln Lys Lys Glu Pro Ser Lys Ala Glu Thr Glu Glu Arg         290 295 300 gtt tcc atg gta aaa act cgg gac gat ttt aaa atc tac aat gaa gat 959 Val Ser Met Val Lys Thr Arg Asp Asp Phe Lys Ile Tyr Asn Glu Asp 
    305 310 315 gtt tca ttt cta tca gtc aat caa aat aat tac tca aga aac cca aca 1007 Val Ser Phe Leu Ser Val Asn Gln Asn Asn Tyr Ser Arg Asn Pro Thr 320 325 330 335 cag tct ttg gag ccc aat gta ggg tcc aaa caa cct aaa cat att aac 1055 Gln Ser Leu Glu Pro Asn Val Gly Ser Lys Gln Pro Lys His Ile Asn                 340 345 350 aac aat cta tct tca tct cta ggt gat gct caa gat gaa aag agg tat 1103 Asn Asn Leu Ser Ser Ser Leu Gly Asp Ala Gln Asp Glu Lys Arg Tyr             355 360 365 ctc aca ggt aat gag gag gcg tat ggg cgt tcc cat att cct gaa cag 1151 Leu Thr Gly Asn Glu Glu Ala Tyr Gly Arg Ser His Ile Pro Glu Gln         370 375 380 ctt atg cat att tac agc caa ccc att gcc atc ctt caa aca tct gac 1199 Leu Met His Ile Tyr Ser Gln Pro Ile Ala Ile Leu Gln Thr Ser Asp     385 390 395 ctt ttc tcc aca ccg gaa caa tta cat act gct aag tca gct act ttg 1247 Leu Phe Ser Thr Pro Glu Gln Leu His Thr Ala Lys Ser Ala Thr Leu 400 405 410 415 cca aga aag gga cag tta gtc tat ggc caa ttg atg gaa cca gta aat 1295 Pro Arg Lys Gly Gln Leu Val Tyr Gly Gln Leu Met Glu Pro Val Asn                 420 425 430 cga gag aac ttt acg cag acc ttg ccc aaa atg cca att cat tct cat 1343 Arg Glu Asn Phe Thr Gln Thr Leu Pro Lys Met Pro Ile His Ser His             435 440 445 gca cag ccc cca gat gcc agg gaa gag gat atc ata ctt gaa ggt caa 1391 Ala Gln Pro Pro Asp Ala Arg Glu Glu Asp Ile Ile Leu Glu Gly Gln         450 455 460 cag agc ctg cca tcc cag gct tca gat tgg agc cga tac tca agc agc 1439 Gln Ser Leu Pro Ser Gln Ala Ser Asp Trp Ser Arg Tyr Ser Ser Ser     465 470 475 tta ctg gaa tcc gtc tct gtt cct gga aca cta aat gag gct gtt gta 1487 Leu Leu Glu Ser Val Ser Val Pro Gly Thr Leu Asn Glu Ala Val Val 480 485 490 495 atg act cca ttt tca tcg gaa ctt caa gga att tca gaa cag acc ctc 1535 Met Thr Pro Phe Ser Ser Glu Leu Gln Gly Ile Ser Glu Gln Thr Leu                 500 505 510 ctg gag ctg tcc aaa gga aag ccc tcc ccg cat ccc aga gcc tgg ttt 1583 Leu Glu Leu Ser Lys Gly Lys Pro Ser Pro His Pro Arg Ala Trp Phe    
         515 520 525 gtg tct ctt gat gga aag cca gtt gca caa gtg agg cac tcc ttt ata 1631 Val Ser Leu Asp Gly Lys Pro Val Ala Gln Val Arg His Ser Phe Ile         530 535 540 gac ctg aaa aag ggc aag aga acc cag agc aat gac acc agt ctg gac 1679 Asp Leu Lys Lys Gly Lys Arg Thr Gln Ser Asn Asp Thr Ser Leu Asp     545 550 555 tct ggg gtg gac atg aat gag ctt cac tca agt aga aag ctc gag agg 1727 Ser Gly Val Asp Met Asn Glu Leu His Ser Ser Arg Lys Leu Glu Arg 560 565 570 575 gag aaa aca ttc atc aaa agc atg cat cag ccc aag atc ctt tac tta 1775 Glu Lys Thr Phe Ile Lys Ser Met His Gln Pro Lys Ile Leu Tyr Leu                 580 585 590 gaa gat tta gac cta agc agc agt gag agt gga acc acc gtc tgt tcc 1823 Glu Asp Leu Asp Leu Ser Ser Ser Glu Ser Gly Thr Thr Val Cys Ser             595 600 605 cct gag gac cca gct tta agg cac atc cta gat gga ggg agt gga gtg 1871 Pro Glu Asp Pro Ala Leu Arg His Ile Leu Asp Gly Gly Ser Gly Val         610 615 620 atc atg gag cac cct gga gaa gag tcg cca gga agg aaa agc act gtt 1919 Ile Met Glu His Pro Gly Glu Glu Ser Pro Gly Arg Lys Ser Thr Val     625 630 635 gaa gat ttt gaa gct aat aca tcc ccc act aaa aga agg ggc aga cca 1967 Glu Asp Phe Glu Ala Asn Thr Ser Pro Thr Lys Arg Arg Gly Arg Pro 640 645 650 655 cca cta gcc aaa aga gat agc aag act aac atc tgg aag aag cga gag 2015 Pro Leu Ala Lys Arg Asp Ser Lys Thr Asn Ile Trp Lys Lys Arg Glu                 660 665 670 gaa cgc cca ctg att ccc ata aat taactccaat ggggattgtg tgtctgctgt 2069 Glu Arg Pro Leu Ile Pro Ile Asn             675 ctcgtgctgt ttattcttgc ttcttgttgt aaattgcagt acgaacttaa gaaaatgaga 2129 ctgagcaatc tcatggttct tggacatgtc tcaagcagag taaatggtaa ttcagtaatc 2189 agagagaaag ataccaagga atgctttttc tggcctattc atttattttt gggtgatgaa 2249 tttacagtat ctaagttttc aaaatgtaaa atagcttcaa gatgttagtt atctgaaaat 2309 gttgctcagc cagccagttt ggccttgact ctcttaagaa taacagtgaa atatatactc 2369 ctcaagttgc ctccaaaaat gttgcctcta ccatggtgac taccccatgg aacatttaga 2429 aacaaaactg acttcaggca tcatattatt ttaaatgtta 
ctattacgtc ttcttctgcc 2489 tatacttaaa aataacttga taaatgactt ggactgatgt tactctggag ttatcacaaa 2549 gaaaatgttg tttggtcttt aaagagcatg tgtattgtat catcccaaac gtaaatccta 2609 catttatata agatgggcaa gaagctactt ggtcattaga gagggagaca ccagctcttt 2669 ggttgttttt ggatataact ttacaaaata agtaagatgt taatttagaa atttgagaaa 2729 ttaatgctct aatactgagt ttttatttaa aaattatttt ttcttcccct caacaatgaa 2789 gcaagcttag ctgtcaaggg aaacttttta caaatctgaa aaaaacaatc tatgactttg 2849 gtttaaggct cactgatact tttaggctaa attggtttta atatatttct tctattctaa 2909 aaacctgaac tcagtcactt aaaggctatg aaatttaaaa aaaaagtcga tgtgaaagtt 2969 tcttttgaac actaaaataa aatatgtgca gataaaatat acattgattt gtttttctta 3029 aatgttgatg agaagaaaaa gagatgccat tttcctgagg ctcaaaaata ccttcaggat 3089 agttgtatat ccagttattg attttcttaa aagatgtgta aggaaaacag tttcaatttc 3149 aggggaaaag taaaagtttt tccctaagtc acttaaagcc tttgcaactt cttttttcag 3209 ttttgtaagt aatatatcta tgttcttttc attatagcaa gcattcaatg tgaacaactt 3269 tttaattaac tctgaattac cattcataca tcctaaaaat aaaagctcgt tattcattaa 3329 aatcaactga tcccattttt cttaaaattt ccctgaaggc aaatgtctga agcacctttc 3389 ccttgtgggg gtaaaaatcc taaattgctt tatttttcat tccctcctat tcaacatggg 3449 agcagcatag agacccaaac catgtaaaca agttcagtga accaaaacag ccacattagc 3509 ttcagtaaaa ttatagctag atgtgcaatt ttttcctcca acttctaacg tgtcaaataa 3569 ccttcctact gttctgtgtt aactgaaaga acataaagac cctaggcaaa tatttgctat 3629 atattacccc aatccataga agaaataatg ttttgggtaa tacctaggct tccttttttt 3689 tttttttttt ttttttttag tgataaggct cataacaatt agagaaggct tcttattggt 3749 cttacacaga aagatacatc aaaagcagca tgactcaaaa tgatttggaa aaggttaaag 3809 ttagtgctct gctgaagtgc ctttgatata gacttgcatt attagaagga tataacatct 3869 tttttaagtg tgcattttct ttcagttaac caaattaaac agatgtgcag ttttattaaa 3929 aatatagacc tagtgtttca tgttggaaca ataaatattg catgtgagta gtatttcttg 3989 ttttttgaat acagtatata ttgataaatt gtttatgttg gaatgaagtt agaaactata 4049 tagcaaaaca ttatatttta agtgtttatt tttcccacct ttaaataaaa atgtttcatc 4109 tcagcttggt aatgaaatac acatattggt ataagggtat accattcagg 
tatgccactt 4169 attttattca tttttgtgta agggaaatga gatgatgtat cccaagggct tttctagaac 4229 tacttgtttg ctttcagaat aaaaccttat tattttttac actgcacatg ctgttctcaa 4289 ttggtaatta taggcaattt atcttttcta atgatcaaaa gagtgtgact tctcatttgt 4349 gagtagttca caaatttcct gttaaaaagc tgaaaccatc tactttttct taacccaagt 4409 gataataaac aatattcaca actttcttaa atttttaaat tgaaaaccaa ggttttttca 4469 aatataaacc tagatgattt tggtcacaaa ttgttaacat ttgtcgatcc tttgtatata 4529 ctttggatat atattaaagg caaaactatc tcttgactaa ctgatggatt catttactaa 4589 agcacagctg tatgtatttt tgaatacata ttatgatctt gagactttat aaatcaattt 4649 ttatgacttt atgcagttgt atagggatta tgccctttca gttctatagg gattatgccc 4709 ttttataata cataatatac cacagagatt acaaatgttg aggaatgaaa gcacttcttt 4769 gctttggcaa tcattttcag accactatgt gtttgaatcc tctggtatca atacgtatta 4829 tagggtttta gagatctgtg ggtcaaatga tgtccctcaa aacttcctaa aaaggtgaag 4889 ctcaaagtca cacattcatt tataaggcgc atgagtttct cattttccca tgtacgagca 4949 ttgtaaagga attcagctgt attaatttct atttcagatc tagaattgac attttgcctt 5009 cttgtttcca ggtgtttcta ttttttgtat tctttcagag aaatctcata tttcggtgta 5069 tttattgctg ttactactat atttactgct gaaaactgta acaacctgaa gatttgtaaa 5129 atgttaaaca tagttcatta aaaataataa aataaatcta aaatgt 5175 <210> 9 <211> 5700 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (1) .. 
(1824) <400> 9 gat gag gag gca cct tcc aag caa tgt gtt tct gta gga gtg tca cag 48 Asp Glu Glu Ala Pro Ser Lys Gln Cys Val Ser Val Gly Val Ser Gln   1 5 10 15 gtc aca act tta aag cca gct ttg tcc acc cag aag gcc cag ccc tgt 96 Val Thr Thr Leu Lys Pro Ala Leu Ser Thr Gln Lys Ala Gln Pro Cys              20 25 30 gag aca tgt agc tca ctt ctg aag gac att cta cac ctg gct gag cat 144 Glu Thr Cys Ser Ser Leu Leu Lys Asp Ile Leu His Leu Ala Glu His          35 40 45 gac gga aca cac ccc aag cgt aca gcc aag ctt tac ctg cac caa aag 192 Asp Gly Thr His Pro Lys Arg Thr Ala Lys Leu Tyr Leu His Gln Lys      50 55 60 gag cat ctt aga gag aag ctc acc aga agt gat gaa ggg agg cct tcg 240 Glu His Leu Arg Glu Lys Leu Thr Arg Ser Asp Glu Gly Arg Pro Ser  65 70 75 80 ttt gtg aat gac agt gtt cac ctg gca aag agg aac ctc aca tgc atg 288 Phe Val Asn Asp Ser Val His Leu Ala Lys Arg Asn Leu Thr Cys Met                  85 90 95 cag ggt ggc aag gat ttt act ggt gat tca gat ctt caa caa cag gct 336 Gln Gly Gly Lys Asp Phe Thr Gly Asp Ser Asp Leu Gln Gln Gln Ala             100 105 110 ctt cac agt ggg tgg aag cca cac agg gac act cat ggt gtg gag gcc 384 Leu His Ser Gly Trp Lys Pro His Arg Asp Thr His Gly Val Glu Ala         115 120 125 ttt caa agt gga cag aat aat tac agc tgc acc caa tgt ggg aaa gac 432 Phe Gln Ser Gly Gln Asn Asn Tyr Ser Cys Thr Gln Cys Gly Lys Asp     130 135 140 ttt tgc cac caa cat aca ctg ttt gag cac cag aaa atc cac aca gag 480 Phe Cys His Gln His Thr Leu Phe Glu His Gln Lys Ile His Thr Glu 145 150 155 160 gaa agg cct tat gag tgc agt gaa tgt ggc aaa ttg ttt agg tac aac 528 Glu Arg Pro Tyr Glu Cys Ser Glu Cys Gly Lys Leu Phe Arg Tyr Asn                 165 170 175 tct gac ctt att aaa cat cag cga aat cat act gga gaa agg cct tat 576 Ser Asp Leu Ile Lys His Gln Arg Asn His Thr Gly Glu Arg Pro Tyr             180 185 190 aag tgt agt gaa tgt gga aaa gcc ttc agc ctc aaa tac aat gtt gtt 624 Lys Cys Ser Glu Cys Gly Lys Ala Phe Ser Leu Lys Tyr Asn Val Val         195 200 205 caa cac cag aaa 
att cac act gga gaa agg cct tat gag tgc agt gaa 672 Gln His Gln Lys Ile His Thr Gly Glu Arg Pro Tyr Glu Cys Ser Glu     210 215 220 tgt ggg aaa gct ttt ctt aga aag tct cac cta ctt cag cac cag agg 720 Cys Gly Lys Ala Phe Leu Arg Lys Ser His Leu Leu Gln His Gln Arg 225 230 235 240 att cac acc agg cca agg cct tat gtg tgt agt gaa tgt ggg aag gcc 768 Ile His Thr Arg Pro Arg Pro Tyr Val Cys Ser Glu Cys Gly Lys Ala                 245 250 255 ttc ctt aca cag gct cac ctt gtt ggt cac cag aaa att cat act gga 816 Phe Leu Thr Gln Ala His Leu Val Gly His Gln Lys Ile His Thr Gly             260 265 270 gaa cgg cct tat gga tgc aat gaa tgt ggg aaa tac ttt atg tac agt 864 Glu Arg Pro Tyr Gly Cys Asn Glu Cys Gly Lys Tyr Phe Met Tyr Ser         275 280 285 tca gca ctc att aga cat cag aaa gtt cac act gga gaa agg cct ttt 912 Ser Ala Leu Ile Arg His Gln Lys Val His Thr Gly Glu Arg Pro Phe     290 295 300 tat tgc tgt gaa tgt ggg aaa ttc ttt atg gac agc tgc aca ctc att 960 Tyr Cys Cys Glu Cys Gly Lys Phe Phe Met Asp Ser Cys Thr Leu Ile 305 310 315 320 att cac cag aga gtt cat act gga gaa aaa cct tat gaa tgc aac gaa 1008 Ile His Gln Arg Val His Thr Gly Glu Lys Pro Tyr Glu Cys Asn Glu                 325 330 335 tgt ggg aaa ttc ttt aga tac cgt tcc aca ctc att aga cat cag aaa 1056 Cys Gly Lys Phe Phe Arg Tyr Arg Ser Thr Leu Ile Arg His Gln Lys             340 345 350 gtt cac act gga gaa aag cct tat gag tgt agt gaa tgt ggg aag ttc 1104 Val His Thr Gly Glu Lys Pro Tyr Glu Cys Ser Glu Cys Gly Lys Phe         355 360 365 ttt atg gac act tcc aca ctc att att cat cag aga gtt cat act gga 1152 Phe Met Asp Thr Ser Thr Leu Ile Ile His Gln Arg Val His Thr Gly     370 375 380 gaa aag cct tat gaa tgc aac aaa tgt ggg aaa ttc ttt agg tat tgc 1200 Glu Lys Pro Tyr Glu Cys Asn Lys Cys Gly Lys Phe Phe Arg Tyr Cys 385 390 395 400 ttc aca ctg aat aga cat cag aga gtt cac tct gga gag agg cct tat 1248 Phe Thr Leu Asn Arg His Gln Arg Val His Ser Gly Glu Arg Pro Tyr                 405 410 415 gaa tgc agt gaa tgt ggc 
aaa ttc ttt gtg gac agc tgt aca ctg aag 1296 Glu Cys Ser Glu Cys Gly Lys Phe Phe Val Asp Ser Cys Thr Leu Lys             420 425 430 agt cat cag aga gtt cac act gga gaa aga cct ttt gaa tgc agc att 1344 Ser His Gln Arg Val His Thr Gly Glu Arg Pro Phe Glu Cys Ser Ile         435 440 445 tgt ggg aaa tcc ttt aga tgt cgc tcc aca ctt gat aca cat cag aga 1392 Cys Gly Lys Ser Phe Arg Cys Arg Ser Thr Leu Asp Thr His Gln Arg     450 455 460 att cac act ggt gaa agg cct tat gag tgt agt gaa tgt ggg aaa ttc 1440 Ile His Thr Gly Glu Arg Pro Tyr Glu Cys Ser Glu Cys Gly Lys Phe 465 470 475 480 ttt agg cac aac tca aat cat att aga cat cgg aga aat cac ttt gga 1488 Phe Arg His Asn Ser Asn His Ile Arg His Arg Arg Asn His Phe Gly                 485 490 495 gaa agg tct ttt gag tgc act gag tgt ggg aga gtt ttt agc caa aat 1536 Glu Arg Ser Phe Glu Cys Thr Glu Cys Gly Arg Val Phe Ser Gln Asn             500 505 510 tcc cac ctc att cgg cac caa aaa gtt cac act agg gaa aga act tac 1584 Ser His Leu Ile Arg His Gln Lys Val His Thr Arg Glu Arg Thr Tyr         515 520 525 aaa tgc agc aaa tgt ggg aaa ttt ttt atg gac agc tcc aca ctc att 1632 Lys Cys Ser Lys Cys Gly Lys Phe Phe Met Asp Ser Ser Thr Leu Ile     530 535 540 agt cat gag aga gtt cat act gga gaa aag cct tat gag tgc agt gaa 1680 Ser His Glu Arg Val His Thr Gly Glu Lys Pro Tyr Glu Cys Ser Glu 545 550 555 560 tgt ggg aaa gtc ttt aga tac aac tcc agc ctc att aaa cat cgg aga 1728 Cys Gly Lys Val Phe Arg Tyr Asn Ser Ser Leu Ile Lys His Arg Arg                 565 570 575 att cac act gga gag aga cct tat cag tgc agt gaa tgt gga aga gtc 1776 Ile His Thr Gly Glu Arg Pro Tyr Gln Cys Ser Glu Cys Gly Arg Val             580 585 590 ttt aac caa aat tct cat ctc att cag cac cag aaa gtt cac acc aga 1824 Phe Asn Gln Asn Ser His Leu Ile Gln His Gln Lys Val His Thr Arg         595 600 605 taaagaatgt atatataaag cagatgggga aagacttcac acagaaatct actctgattt 1884 agcactggga cctacgtttt aaaaaaagta ttcttgtaga atacagataa cataaaatct 1944 aacatcttaa ccatgttaaa gtgtatagtt 
cagtaccgtt aagtcattca cattgtgcaa 2004 tgaatatcta gaagtctttt caacttatga aactaagtct atacctttta aaaccttatt 2064 cctcactcca tccagcctct tgacaagcac cgctctgtat gaattttact agtccgggta 2124 cctcatataa gaaaacttaa gttttggtct tcttgtggtt tattttgtgg cttattttgc 2184 ttaacgttat atttttaagg tttcatgttc taatccatta gaatttccat cctttttaaa 2244 ggctgaataa aattctgtta gtcatgtgtt gcttaacagt ggggaagtgt cctgagaaaa 2304 gtgttattag gtgattttct ttcttttttt ggtggtgggg ggattgcgtg aatgcctagg 2364 ctgtatggta tatcctatag caccttgcta caaacttgta tagcatatta ctgtactgaa 2424 tactgtaggc tgttggaaca catggtaagt aattgttttt aagtatatct aaacagaaaa 2484 ggtacagtaa aaatacagta taaaagaaaa aatgatagac tcacagagaa cttaccatga 2544 atggagctta cagtactgca agttgctcta ggtgagtcag tgagtggtaa gtgaatgtga 2604 aggcctaggt tgttactgtg ctgtagactt tatagacatt gtgtacttag acgacaatac 2664 atttttattt ttattattat ttttgagaca gaatcttgct ctgttgccca gactagagtg 2724 cagtggtgca atcttggctt cctgcaacct cctccacctc ctggttcaag cagttctgcc 2784 tcagcttccc aagtgtctgg gattacaggc atgcaccacc atgccccgct aatttttgta 2844 tttttagtag agaacggggt cttaccatgt tggccaggct ggtctcaaac tcccgacctc 2904 aagtgagcca ctcgctttgg cctcccaaag tgctgggatt acaggcatga gccaccgcac 2964 ccggccggtt ttaccatttt tagagccaag ctttactata tatttgatat ttgttctttc 3024 ttcaacctta gctgaattca cattcctctg atagaaggtg ttttcaaact gatgccgttc 3084 ttagtgcctc aaactagatc ctgttcatac ttgttagaac aagttattac aaattcactt 3144 tggtgtaaaa aattgaaatc catacataat tttttttttt tttttgacag agtctcacta 3204 acgctaggtt ggagtgcagt ggcatggtct cggctcattg caacctccgc ctcctgggtt 3264 caagcagttc tcctgcctca gcctcttgag tagctgggat tacaggtgcc cacaatcacg 3324 cccagctaat ttttgtattt ttagtggaga tgggttttca ctctgttggc caggctgctc 3384 tcgaactcct gacctcaggt gatccacctg cctgggcctc acaaagtgct gtgattacag 3444 gcttaagcca ccacccctgg ccaatttttt cataatatac atttttttct catttttcat 3504 gaaacttttg aagacccctc atattctaga tattccttct cagatatgtg gttttcaaat 3564 actttctccc attgagtctt tttccttttc actctgtcca ttatgtcctt ttttacacag 3624 gaattttgaa tttaaatgga gtctaatata tctgttttat 
aagtctttga tgcatttgag 3684 ttcattttag caacttaatt cttttgcgtg tggatatcca gttttttttt tttaacatca 3744 aaagaataat gtttttgcct agcattaagg cccttggtag aggcttgtca gttacaattt 3804 tggagcagca gattaagtcc acactcccaa ccattttcct tatcaggctc tcaaactctg 3864 ggccacaata tgtaagaccc aatcacccca ggatcaggaa tcagatatct agggacagct 3924 tctgtgccca ggagcttgta aaattattcc attggtcaat gcacaggggt ccctgaaaac 3984 ctagctaacc ccaatttaca tggcacacac aagctgcccc ctaagctcca gcttgctgtt 4044 atcttgggtt ccctcataac tcttgcagcc ctgcctatgt ccttaggttt caagctgtaa 4104 gtagcaaagt ggtctacatt ttatgattat cattgtgaca tgtcctgaca tcagaaaaac 4164 acctttgtat gttattacta tacaaccagc agaatattat gagtgcagca aatgttagaa 4224 agtattcagc ctaacttcac tgagcaagag taagttcatc ctggagaaag tccttaggaa 4284 tgcaggcaat atactttttt tcctttgtca acaggtcaaa aacagcaaag ctctatcgag 4344 cttgtcttac tcaccctatt tttttgttgc tctgttttgt tttaggcttt tagcctgaag 4404 ccatggtttt gtttctgtct ctagtggtag gtggacaaga ggaatgagat gagaaaggag 4464 ctttactggc ccagctagaa acaaactaag aacccatgac tgtattcttt cccttggatg 4524 accctgtgtt agcttgttga gggagatctc agcctgaaat tgaatctcac atccaaacat 4584 ccacgcaagg gagatttgtt gtaattgtca gatatatggt aaatttttgt gaatgatgtt 4644 gcactttctg acctgcctgg ggcctttcca gagttaagtt gctgaaagtg tgcattacag 4704 aagactcctg ctattagctg tcatggtgcc acaatgtgca tcaccttagt caccttaaat 4764 tacttagaga gtgataaggt ctggacttct ggttaaatgt ttttaaaaaa tggggggtgg 4824 gggggtggtg catagattgc tgtgttctct acctttatct ggaatattca gtcatttgtt 4884 ccctttgggg gcctcattcc cagtcccctg actggtttgg gtgtggacat cacccagctt 4944 tggacagaga acacacgcca acttcagctg gcagcttgta gagatttcct tttttcagag 5004 gtattattag ttgtctgata ctgataatgt tgatgataaa ttttctacct tccaagcttc 5064 ccaacccagt caatttccac ctaagcattg ctgttttctt ctgatgataa aggtcatatt 5124 gtttaagcta catttactct tggggttctc ttcactgtgt gctgcgggtt gagaacaaaa 5184 ttaggctttg ccagaatgaa aaagtgaatg gtttttgggg ccttcaactt tttgtgctct 5244 tgaagaaata agaagacaaa atagctttca atccacatca ggcccaattt gcattgcttc 5304 gggagttcct gggaaagtga cggacttcta tccaaaatcg cgccgtgaat 
ttgattattg 5364 gtagttctac agtcagcttg agggttgttg gtttgacagt tgtcagagca tgttgcagct 5424 gtatgaggtg ggtatctgta catatggatg tcccatattc tccagcattg cataggaata 5484 gctggtgtct agatcctgcc tcaggagcta tgtgtcctga atttaaaaat caggtatgtt 5544 atatccctgg ggcatgtcag acatacaaca acagtgcatt agtccatttt ccattgctta 5604 caacaaaata ccttaaactg ggtaatttgt aaagaaaata aatttcttac tgttgttttg 5664 ttgtgctttc tttttttcta gatcgcgagc ggccgc 5700 <210> 10 <211> 5718 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (1864) .. (3324) <400> 10 aaaaaatagc caggcgtggt ggcacatgcc tgtagtccca gctactcggg tggctgaggc 60 gggagaatcg cttgaaccca ggaggcagag gctgcagtga gctgaggtcg cgccactgca 120 ctccagcctg ggtgacagag tgagactctg tctcaaaaaa caaacaaaca agcaaaaatc 180 aaaaacattg tttcagatct tattttccta ttgacctaag agttatttaa tatagacttt 240 tgaattttca ggtagatttt acttgtgttt cttttaattt tattattaat ttccagtttt 300 agtagaaaat gtctatataa tttcaacctt ttttttttgc taactcctag ttttgtgtct 360 agatttcagc tttttaatag agaccacctt atcataattg aaaacctcat cctgtttctc 420 tctcatacac aactcattca gcttgtagat tttcctctgg ctgttcctgt tgctggcttg 480 ccatcctctg gcctacatgt aacgtgactt ctgttagcaa gtccttcaca ttcagatgac 540 tgcaacagtt gaacatgcac attacttttt tccacatttc tgagttacag gcatatcctg 600 tctttttgtg ccatgaattt cataagaact tcagtaagat atagacgtgt tttcctgtcc 660 gaaaaatcac tgatacacat ttcctaaaaa aacaggaatg gccttgacta tgagggaatg 720 gaaaggagaa aacctgagca atgggaggaa agaacaaaac cagttgaaca tttgtcaaat 780 gaatgagtaa tagaataagt tgaagggaag attgagtagg tagaactcct caataccttt 840 gctatggtgt agagacaaag aaaaagttac ccatagctat gtgaaggggc aaatgggcag 900 tgcactgcat aggtaaaggc ataagcagtg gcagatgcat aggaatatac agccacctat 960 ttccattctt gtatttagct gggctgccct ggatgaataa agagaatatt taatcactgt 1020 agtaaggcat gagagactag aggcagagag atacatttag aacaagtatg tagctatggg 1080 aattaaatga gtaaaaccta ttatatgctt agcatttatt acttaatgta gcgttaacta 1140 gggattgttt ctgtccttga ggaggtttat aggtaagaat aggcagtata acaattggag 1200 aagttaataa atatataatt tttaaattga gagctattat aaaggcaagg aatagagaat 1260 agcaacatgg 
atttaggtac tatcaatagg gtcaacctta aaggaaatga cttttaatct 1320 aagatctcaa gatgaaagga gccagccatg caaagagccc aatcaggaga tttgtgggag 1380 atagaacaga ctgtgaagag tcctaagaga tggaagtaag ttaggacatg tgaggtctgc 1440 aaggaggcct ctccagggag cagtaaaggg gacactggtt tgagatcata tgatgtaggt 1500 aagcagaggc cttcttaacc acagtgagct tgactggatt ctgagaatga agaaagatga 1560 aaaaaattta agtatagatt ctcgacctct catcttcatc agaattatca tgtagagaat 1620 attaaatatc cttccaggac tgcagagtct gatccaggtg gtttgggtgg tgtctaaacc 1680 tcagtgttag taacaagcta tgtggtgatt ctgatccaca cctcaatatg agaaccactg 1740 acttaaaatg tttacttcat caccaatctt gctattttgt atgaactctc atttttcctg 1800 atgttccctt tggctttttt tttctttcca aattgtataa caatatttta gagttttggt 1860 taa gga tta ctt ttg ttg tct ttt att tct aca tta gaa atg ttg ctt 1908     Gly Leu Leu Leu Leu Ser Phe Ile Ser Thr Leu Glu Met Leu Leu       1 5 10 15 ctt tta gta aaa gaa aac agc atg tgt ttg ctt tct tgt ttt att tta 1956 Leu Leu Val Lys Glu Asn Ser Met Cys Leu Leu Ser Cys Phe Ile Leu                  20 25 30 gat ttg gag acc aag tat gag acc aag aag tta tct tta gaa aat gac 2004 Asp Leu Glu Thr Lys Tyr Glu Thr Lys Lys Leu Ser Leu Glu Asn Asp              35 40 45 att tat gaa ata aat tta tcc cag tgg aag ata atg gaa aga att gaa 2052 Ile Tyr Glu Ile Asn Leu Ser Gln Trp Lys Ile Met Glu Arg Ile Glu          50 55 60 aac cat ggc ctt aag ggt ctc att tta aaa aat gat tgg gaa tcc aca 2100 Asn His Gly Leu Lys Gly Leu Ile Leu Lys Asn Asp Trp Glu Ser Thr      65 70 75 gga aaa att gaa gga cag gag aga cct caa gaa gga tac ttc agt agt 2148 Gly Lys Ile Glu Gly Gln Glu Arg Pro Gln Glu Gly Tyr Phe Ser Ser  80 85 90 95 gtg aaa atg cca tct gaa aag gtg tcc tct tac cag aaa cgc aca tct 2196 Val Lys Met Pro Ser Glu Lys Val Ser Ser Tyr Gln Lys Arg Thr Ser                 100 105 110 gtt act cca cat cag aga ctt cat ttt gtt gat aaa ccc tat gaa tgt 2244 Val Thr Pro His Gln Arg Leu His Phe Val Asp Lys Pro Tyr Glu Cys 115 120 125 aag gaa tgt ggg aag gcg ttc aga gtg cgc caa cag ctt act ttt cat 2292 Lys Glu Cys Gly 
Lys Ala Phe Arg Val Arg Gln Gln Leu Thr Phe His         130 135 140 cac aga att cat act ggt gaa aaa ccg tat gaa tgt aag gaa tgt ggg 2340 His Arg Ile His Thr Gly Glu Lys Pro Tyr Glu Cys Lys Glu Cys Gly     145 150 155 atg gcc ttc aga cag act gca cac ctt act cga cat cag aga ctt cat 2388 Met Ala Phe Arg Gln Thr Ala His Leu Thr Arg His Gln Arg Leu His 160 165 170 175 tct ggt gaa aaa ctc tat gaa tgt aag gaa tgt ggg gaa gct ttc ata 2436 Ser Gly Glu Lys Leu Tyr Glu Cys Lys Glu Cys Gly Glu Ala Phe Ile                 180 185 190 tgt ggt gca gat ctt aga gta cat cag aaa atg cat att ggt gag aag 2484 Cys Gly Ala Asp Leu Arg Val His Gln Lys Met His Ile Gly Glu Lys             195 200 205 ccc tat gaa tgt aaa gaa tgt ggg aag gct ttt agg gta cga gga caa 2532 Pro Tyr Glu Cys Lys Glu Cys Gly Lys Ala Phe Arg Val Arg Gly Gln         210 215 220 ctt act ctg cat cag agg att cat act ggt gag aaa ccc tat gtg tgt 2580 Leu Thr Leu His Gln Arg Ile His Thr Gly Glu Lys Pro Tyr Val Cys     225 230 235 aaa gag tgt gga aaa gcc ttt aga cag tac gca cac ctg act cgg cat 2628 Lys Glu Cys Gly Lys Ala Phe Arg Gln Tyr Ala His Leu Thr Arg His 240 245 250 255 cag aag ctt aat agt gct gac agg ctc tat gaa tgc aaa gaa tgt ggg 2676 Gln Lys Leu Asn Ser Ala Asp Arg Leu Tyr Glu Cys Lys Glu Cys Gly                 260 265 270 aag gcc ttt ttg tgt ggc tct ggt ctt aga gta cat cac aaa ctt cat 2724 Lys Ala Phe Leu Cys Gly Ser Gly Leu Arg Val His His Lys Leu His             275 280 285 act ggt gag aaa ccc tat gaa tgt aag gaa tgt ggg aag gcc ttt aga 2772 Thr Gly Glu Lys Pro Tyr Glu Cys Lys Glu Cys Gly Lys Ala Phe Arg         290 295 300 gtg cga caa caa cta aca ctc cat cag aga att cat act ggt gag aaa 2820 Val Arg Gln Gln Leu Thr Leu His Gln Arg Ile His Thr Gly Glu Lys     305 310 315 ccc tat gaa tgt aag gaa tgt gga aag acc ttt agc cgt ggt tat cat 2868 Pro Tyr Glu Cys Lys Glu Cys Gly Lys Thr Phe Ser Arg Gly Tyr His 320 325 330 335 ctt att ctc cat cac aga att cat act ggt gaa aaa cct tac gaa tgt 2916 Leu Ile Leu His His Arg 
Ile His Thr Gly Glu Lys Pro Tyr Glu Cys                 340 345 350 aag gaa tgc tgg aaa gcc ttt agt cgc tac tca caa ctt att tca cat 2964 Lys Glu Cys Trp Lys Ala Phe Ser Arg Tyr Ser Gln Leu Ile Ser His             355 360 365 cag agt att cat att ggt gtt aag ccc tat gac tgt aag gaa tgc ggg 3012 Gln Ser Ile His Ile Gly Val Lys Pro Tyr Asp Cys Lys Glu Cys Gly         370 375 380 aag gcc ttc aga cta ctt tca caa ctc aca cag cat cag agt att cat 3060 Lys Ala Phe Arg Leu Leu Ser Gln Leu Thr Gln His Gln Ser Ile His     385 390 395 att ggt gag aaa cct tat aaa tgt aag gaa tgt ggc aag gcc ttt aga 3108 Ile Gly Glu Lys Pro Tyr Lys Cys Lys Glu Cys Gly Lys Ala Phe Arg 400 405 410 415 ttg cgc caa aaa ctt act cta cat cag agc att cat act ggc gaa aaa 3156 Leu Arg Gln Lys Leu Thr Leu His Gln Ser Ile His Thr Gly Glu Lys                 420 425 430 ccc ttt gag tgt aag gaa tgt agg aag gcc ttt aga ctt aat tca tcc 3204 Pro Phe Glu Cys Lys Glu Cys Arg Lys Ala Phe Arg Leu Asn Ser Ser             435 440 445 ctt att caa cat ctg aga att cat tct ggt gag aaa ccc tat gaa tgt 3252 Leu Ile Gln His Leu Arg Ile His Ser Gly Glu Lys Pro Tyr Glu Cys         450 455 460 aag gaa tgt aag aag gcc ttt agg caa cat tca cac ctt act cat cat 3300 Lys Glu Cys Lys Lys Ala Phe Arg Gln His Ser His Leu Thr His His     465 470 475 ctg aaa att cat aat gta aaa atc taagaaagtc ttttcaactt ctgtgttata 3354 Leu Lys Ile His Asn Val Lys Ile 480 485 gaacattcta tgaatgtagt aattaatcta ttttgctcca tacatgcaac tgcgttggca 3414 ttagaggttt tataccatta aaagagtgtg acaatgtatt gtagtccatc atcactcaaa 3474 cctgaaactt cagcatattt gttctaaaaa ctaatcctgt tataagaatg aaaaagacat 3534 ttaccgtcat ccctatccca tcactttgtg tcatactgga caagacgctt aactgctctg 3594 tgctataatt ttttaattta tgacatggtc atataagagt gcagcagttt gacacacttt 3654 caatcaagag gtggaatcta catccctctg ttgtatacag gaatctacat ccctctgtga 3714 ctgcttcgcc cagtagaata tggtagacgt actgctgtgc tggtttcttg actcagactt 3774 tactggtagg gttacttcat ctcatgaaat gacagaactg tcttgttaca aagtctggct 3834 actttgctga agagaccaca 
tgaagaggcc ctgaaactac gtagagaggg agaggcacct 3894 gctgaaccct gtcttcttga catccaccaa gataccagac atttgagtga aattttggat 3954 ccaccagacc agccaccagc taaatgccag tgagcgactt ggtaacctga actgatgcca 4014 tgtagaatgg aagaattgac cagctgagcc cctggcccaa atttctgacc cacaaaaaca 4074 tgagatataa taaaatggtt gttgttttaa gccactaaac tttcttttat tatcatacta 4134 aatatttatt tttacataat tcatactaaa ggctgtgggc ataatgagta tagaaatgcc 4194 ttccaccaac atttaatctt cattgaactt gagatatctt attctcgtta aaatattgaa 4254 agtaaggaat tcctttccag tttgttcatt aaaaaacaga taccctagac caggtgaggt 4314 ggcccatgcc tgtaaatcca gcactttggg aggccaagta aggacgattg cttgaggcca 4374 gaagttcgag accagcctgg gcaacatggt gagaccctgt ctctaccaaa aaactttaaa 4434 aattagctgg gcataatggt gcatgcctgt agtcctagct actcgaggct gaggtgggag 4494 gatcatttga acccaagagt tgaggcttca gtgagctgtg attgtaccac tgcactccag 4554 cttgagtgac tgcaagaccc tgtctcaaaa aaaaaaaaaa actacaatca gtattctctg 4614 gccccagtca atattaaaaa ttgacaaagg ataacatgaa actttggaaa ttaaaagtca 4674 cctgtattat gaagccccat taatttttta aactacttca ttgtttttta aaaatgctaa 4734 attgtcaatt tagcattttg aggcaatccc ctcaaattgg ctcctgtgtc cttttatcat 4794 gaccactcca acctttgagc tttttttttt cctgaaccaa atgcattgat tagttatgga 4854 gcatactaga gcattattaa caaataaagg aaaagaatgg aatatttacc ttgcctatcc 4914 tttacaaact atatttcaag acaaccaaaa gttggtgaag gaaagttgtt gattagagaa 4974 ttccaactgg ttaaaaggtc aaaggaggcc aggcgcggtg gctcaagcct gtaatcccag 5034 cactttggga ggccgaggca ggtggatcgt gaggtcagga gatcaagacc atcctggcta 5094 acacggtgaa accccatctc tactaaaaat acaaaaaatt cgccgggcgt ggtggcaggc 5154 gcctgtagtc ccagctactc aggaggctga ggcaggagaa tggcttgaac ccgggaggcg 5214 gagcttgcag tgagccgaga tcgcgccact gcactccagc ctgggtgaca gaccgagact 5274 ctgtctcaaa caaaaaacaa aaaacaaaac aaaacaaaga tcaaatgaat gatagaattt 5334 gaaaactacg ctctttaatt ttacaaaatc atggattttc gtggtgatag caatggatgc 5394 gaagaccatt aggtgaaaaa tggataggaa gcttataatg catggagcag aatgacagga 5454 cactaatcta tattaacatc tctaaatgag atcagccaga tgaacttgat gtgatgaaat 5514 ggatacacac agtggacacc tgtgaagttt 
tcttggctcc cccaaaactg agaagtacaa 5574 gttagtctcc aaacctaatt accagtttac aggaaacatg gggaataaaa gaacaaatta 5634 acaacacaaa gaagcaaaca accaaatgca caatttggga aattctgcag aagtaatggc 5694 ctagtttttt aaccaataca tgtc 5718 <210> 11 <211> 4342 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (4038) .. (4340) <400> 11 cagctctggc agaggactgt tccactagac acgctgaagg gactgggtac gttttccttc 60 aggaccagag ctgagaggag ctgggatcgc ggcggcaatg gaacgggcct cagaaaggcg 120 cacggccagc gcgctttttg cggggttccg ggccttggga cttttcagca acgacattcc 180 acacgtggtg cggttcagcg cgctcaagcg ccggttctat gtaacaacct gcgtgggcaa 240 gagtttccac acctatgacg tgagtgactt cttttgttag cttcccagga aaaccaccct 300 ccttggcctc taactctgtc ctggagcagt ccggttctcc cttcccattt accacgggct 360 tcccttcttt aacccctccc tgtcctatta atatttcgcc atccggtcac gtccacgtgc 420 tgatttggca cttgcagtcc tcctggggcg tgtttcagag gcagcctggc caaggggaaa 480 ctgctagaga acatctcagt tgtatggcag taatatttgt tcagtgaata tttattgacc 540 ttcccttgaa ggtggaaaca taattaagac aaacctctac ctcttgcatt cttaatgaga 600 gagaaaaaca tgcacgctat tacgttattg gtgacacgag ctgagttaga aattcactct 660 aggtagaata cattgggaga accggagatg gtgagattaa ttaccgttag ggaaaatatc 720 taatagcaat cagttatttt caactaacgc aaagtgtctg gcactgtcac tctaagcctt 780 gcagccgttt ctttgcaaca taggtgtcat tataccgtag cacctttata taatctatta 840 aagtaaattt cagataaatc acttgttagt gatgattttt tctcctataa cagttcatcg 900 atcattgtaa tagtgtctct tgtttacttg tttcttttgt cttctccgaa agaatgtaac 960 ctctgtgaga acagaaacct tgatgaggag gatgataata gtcagtgtta attaagtact 1020 ttcttttctg gaaaacaggt agagttgtct ctcttggagg attgtgaata tcaaatggct 1080 tattagcaaa aagagctggg aggtaggaac tagttgagtt agtggaaaag gaaagtgagg 1140 gcagagctga ggtgaagagc aattgggttt ctcagagtct gagttggtaa tggagacagg 1200 gtgggttaga accatattaa tcatgtgaaa gtatttagtg ttaccattct ctgcacttgt 1260 tccttgcttg ctaacttttg cttttgctgg ttaatgtttt cttccttgaa atgtaatttt 1320 tctgagttta ttactgaaat aactcttgaa atcttgtctt tggtacagtg tctttcctga 1380 cactgcatta cacacagaca caggtgtttt tccctatatt ctacttccaa agaaaccctg 1440 
tggtgaggag agtaacatat ttgtacattt tataaagtta acaatttact tactgtgtgt 1500 tgacatttag tcataataaa gatggctttt aagttctcct ctagttctaa cactgatgat 1560 acattcgtaa catcatggca tgtaaatgtt ctttacccag actgtacact gctcagatta 1620 accaatgcat tgggctgtga cctctaaaac tgaaaatact tgggagtaaa aagaaaaaaa 1680 ggagctaatg aacctaagtc actaattggg tctacaagtg ctctatccca tgactatggc 1740 tatatatgct gtccccagtc agacttccta cagtcctaca gattagtttc ttctacaggg 1800 gaaactagac aagttgtgaa taggtggtgg tagtgatggt agtagctatc atgtattgag 1860 cagttactat atgactggcc ccattctgtt ttacacaaat ttactcattg aatcctcata 1920 acaatcccat aaagtaaata aatacctgtt ttacagatga agaatttgag gcacagggag 1980 cttatttttc cagggtcata cagctggatc agtagccatg attggggtca gcaaattaca 2040 gtctgtaagc cagtctagcc cactgtttgt ttttgtatga cccaggagtt acgaatgatt 2100 ttgacatgtt taaatgtttg tgggggcagg ctgggggtag ggagaaggat atttcatgac 2160 atgaaaacta tgtgaaaatc aaatttcagt atctatagat aaaagtttaa tatacccatg 2220 cacattattt atctattacg tatggctgtt ttcatgctac ggtggcagaa gtgagtagtt 2280 ggcaacagag accacatgca ttgcctgcaa agcctaaaat atttaatatt tggcccatta 2340 tagaaaacat ttgcctaccc cttccatcac cactcctggt gaaatacagt tgaaaatata 2400 aactttccta ataaagaaaa ttagtgtgag taagtgtctt tcttatgaag gacagcatag 2460 caaatatacg tatgaggtta tatcttaatc ttcaggtgtt aggttattat gattatgttt 2520 gttaatgaaa atttaacctt ttttcttttt taaacaggtt cagaaactta gtctggttgc 2580 agtaagtaag tatggacttt attctgaatt tatgcacatc taaacttttt ttttattttg 2640 acataaatgt gagacttaca gagcatataa ggaagccaag gtaacatgta tgtcttataa 2700 acttacatta tcagcaacaa aaaagggtgc caggataaat catgttttgc ttttagttgt 2760 tatgtatctt tattttcttt aatctggaac tgttctttag actttctttg tgcttcatga 2820 cattgacaca ttgacatttt tgaagagtac aggctcattt agaatgtcct tcagtttgag 2880 tttttttgac atttcttcag gattagattt aggttgtacc ccttgggagg aatcacagtg 2940 aattgtatca gaaggcacat gctgttagtt ggttctgttc ctggtagtga ttggttagga 3000 ggctctattt ttaagttagt gttttgtcat ttgtaacgaa taagtatctt ttggtagtat 3060 gcactgggac tatgtaaata tcttagtact cttcaaagtt attacctgct gcttttatat 3120 tttatgatga 
tacttgcctg aatcagttat tagtatgatg gttgccaaat gcatgttgat 3180 tatccttaat ccaaaaattt gaaacccaaa atgctcctaa gtctgaaact ttttgagagc 3240 caacatgacg ttaacaggga aatattcatt ggaacatttc cgatttcaga ttttcagatt 3300 aggaacattc acctggttat acatatagtc caaaatccaa aaaaatccgg aatctgaaac 3360 acttctggtc tcaaacattt cagataaagc atagtcagcc tatagtgatt ttcttttgcc 3420 atcattcctt ctactgtaag gagaagtttt cccttttctc acattttggt atttctttcc 3480 aatggaaggg ttttttttaa actatctaat agaataaaac atttctcagg gcagagaggt 3540 acttgtttta ccatcaaata tggttcttgt gagagaaaca gaagacctac aaggactgta 3600 atgtgtatgt gtattaaact tatattttaa taatagatta gaaggcagaa gaaaggcatt 3660 ggaacttaag ttgaatacaa aagtgaatga taaacttata aaacagggtt atgcaaacaa 3720 atactaatat ggaatttgca gcatgaaaag tacaacaatc ctgtgtctac taaccctcct 3780 gccccaaaag aggaaatgaa aaggtcagtg tgaccaggat actgagagaa gaatcatatg 3840 tcacgtgaga aatgctaaag attgataggc atatttccct tggagaagac tgtagaaaca 3900 atattgtctt taaatgtttt aggaataggc atatagaata ccatatgtaa tatttttggg 3960 ttggtccaaa gggtagagtt agagatttag attttgagtt acttttaaaa tttttctgga 4020 gttattagtg aacttga tgt tcc aga aga tac tca cat ggg tta ggc atg 4070                    Cys Ser Arg Arg Tyr Ser His Gly Leu Gly Met                      1 5 10 tgg ctc acg cct gta atc cca gca ctt tgg aag gcc gag gtg ggc aga 4118 Trp Leu Thr Pro Val Ile Pro Ala Leu Trp Lys Ala Glu Val Gly Arg              15 20 25 tca cct gag gtc agg agt tca aga cca gcc tgt cta aca tgg caa aac 4166 Ser Pro Glu Val Arg Ser Ser Arg Pro Ala Cys Leu Thr Trp Gln Asn          30 35 40 cct gtc tct tct aaa aat aca aaa att agc cgg gtg tgg tgg tgc act 4214 Pro Val Ser Ser Lys Asn Thr Lys Ile Ser Arg Val Trp Trp Cys Thr      45 50 55 cct ata atc cca gct act tgg gag gct gag gca gga gaa tca gtt aaa 4262 Pro Ile Ile Pro Ala Thr Trp Glu Ala Glu Ala Gly Glu Ser Val Lys  60 65 70 75 cct ggg agg cgg atg ttg cag tgt gcc aag att gca tca ctg cac tcc 4310 Pro Gly Arg Arg Met Leu Gln Cys Ala Lys Ile Ala Ser Leu His Ser                  80 85 90 agc ctg agt gac aca gca aga 
ctc cgt ctc tc 4342 Ser Leu Ser Asp Thr Ala Arg Leu Arg Leu              95 100 <210> 12 <211> 4415 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (251) .. (1855) <400> 12 gctgtgacct gcggcagcac agccgcctgc gttgagcgcc cacggtgggc tggactttgc 60 actaggtgct gacaggaccg gcagaggtgg ccactgccct cgtccccagc ctgcactcct 120 gggcgaaggc tgacgctgaa cagggtgctg tgggcccaga agcgcccatg cctgagagcg 180 tccaggacca ggcagagctg ggccttaagg gcccttggac cagtgtctgt ctgcagggag 240 cccccggtag cca ctc gga tca gcc cga ggg aag att ctg gac gag acc 289            Pro Leu Gly Ser Ala Arg Gly Lys Ile Leu Asp Glu Thr              1 5 10 ggg gcc gtc ccc cgg gtg gcc cat gga cag cag cag ggg ctc cca gga 337 Gly Ala Val Pro Arg Val Ala His Gly Gln Gln Gln Gly Leu Pro Gly      15 20 25 gtg gcc agg ccc tgc ccg ccc acc atg ctg cag tgt aga ccc gca cag 385 Val Ala Arg Pro Cys Pro Pro Thr Met Leu Gln Cys Arg Pro Ala Gln  30 35 40 45 gag ttc agc ttc ggg ccc cgg gcc ttg aag gac gct ctg gtc tcc act 433 Glu Phe Ser Phe Gly Pro Arg Ala Leu Lys Asp Ala Leu Val Ser Thr                  50 55 60 gac gca gcc ctg cag cag ctg tat gtg tcc gcc ttc tcc cct gcc gag 481 Asp Ala Ala Leu Gln Gln Leu Tyr Val Ser Ala Phe Ser Pro Ala Glu              65 70 75 cgg ctc ttc ctg gcc gag gcc tac aac ccg cag agg acg ctc ttc tgc 529 Arg Leu Phe Leu Ala Glu Ala Tyr Asn Pro Gln Arg Thr Leu Phe Cys          80 85 90 acc ctg ctc atc cgc acg ggc ttc gac tgg ctc ctg agc cga ccc gag 577 Thr Leu Leu Ile Arg Thr Gly Phe Asp Trp Leu Leu Ser Arg Pro Glu      95 100 105 gct ccc gag gac ttc cag acc ttc cac gcc tcc ctg cag cac cgg aag 625 Ala Pro Glu Asp Phe Gln Thr Phe His Ala Ser Leu Gln His Arg Lys 110 115 120 125 ccc cgc ctg gct cgg aag cac atc tac cta cag ccg ata gac ctg agc 673 Pro Arg Leu Ala Arg Lys His Ile Tyr Leu Gln Pro Ile Asp Leu Ser                 130 135 140 gag gag ccg gtg gga agc tcc ctg ctg cac cag ctg tgc agc tgc aca 721 Glu Glu Pro Val Gly Ser Ser Leu Leu His Gln Leu Cys Ser Cys Thr             145 150 155 gag gcc ttc ttc ctg ggc ctg 
cgc gtc aag tgc ctg ccg tcg gtg gca 769 Glu Ala Phe Phe Leu Gly Leu Arg Val Lys Cys Leu Pro Ser Val Ala         160 165 170 gcc gcg tcc atc cgc tgc tcc tcg cgg ccc agc cgg gac tct gac agg 817 Ala Ala Ser Ile Arg Cys Ser Ser Arg Pro Ser Arg Asp Ser Asp Arg     175 180 185 ctc cag ctc cac aca gac ggc atc ctg tcc ttc ttg aag aac aac aag 865 Leu Gln Leu His Thr Asp Gly Ile Leu Ser Phe Leu Lys Asn Asn Lys 190 195 200 205 cca ggg gac gcg ctg tgt gtg ctg ggc ctc aca ctg tct gac ctg tac 913 Pro Gly Asp Ala Leu Cys Val Leu Gly Leu Thr Leu Ser Asp Leu Tyr                 210 215 220 ccc cat gag gcc tgg agc ttc acc ttc agc aag ttc ctt cca ggg cac 961 Pro His Glu Ala Trp Ser Phe Thr Phe Ser Lys Phe Leu Pro Gly His             225 230 235 gaa gtg ggc gtc tgc agc ttc gcc cgg ttc tca ggg gaa ttc ccg aag 1009 Glu Val Gly Val Cys Ser Phe Ala Arg Phe Ser Gly Glu Phe Pro Lys         240 245 250 tcg ggg ccc agc gcc cct gat ctg gcc ctg gta gag gca gca gca gac 1057 Ser Gly Pro Ser Ala Pro Asp Leu Ala Leu Val Glu Ala Ala Ala Asp     255 260 265 ggc ccc gag gcc ccc ctg cag gac agg ggc tgg gcc ctg tgc ttc agt 1105 Gly Pro Glu Ala Pro Leu Gln Asp Arg Gly Trp Ala Leu Cys Phe Ser 270 275 280 285 gcc ctg ggg atg gtt cag tgc tgc aag gtc acg tgc cac gag ctc tgc 1153 Ala Leu Gly Met Val Gln Cys Cys Lys Val Thr Cys His Glu Leu Cys                 290 295 300 cac ctt ctg ggc ctg ggg aac tgc cgc tgg ctc cgc tgc ctc atg cag 1201 His Leu Leu Gly Leu Gly Asn Cys Arg Trp Leu Arg Cys Leu Met Gln             305 310 315 ggt gcg ctc agc ctg gac gag gcc ctg cgg cgg ccc ctg gac ctc tgt 1249 Gly Ala Leu Ser Leu Asp Glu Ala Leu Arg Arg Pro Leu Asp Leu Cys         320 325 330 ccc atc tgc ctg agg aag ctg cag cat gtc ctg ggt ttc agg ctc atc 1297 Pro Ile Cys Leu Arg Lys Leu Gln His Val Leu Gly Phe Arg Leu Ile     335 340 345 gag agg tac cag aga ctc tac acc tgg act cag gcg gtg gtg ggg acg 1345 Glu Arg Tyr Gln Arg Leu Tyr Thr Trp Thr Gln Ala Val Val Gly Thr 350 355 360 365 tgg ccc agc cag gag gcg ggg gag ccg tca gtg 
tgg gag gac acc ccg 1393 Trp Pro Ser Gln Glu Ala Gly Glu Pro Ser Val Trp Glu Asp Thr Pro                 370 375 380 cct gcc agc gcc gac tcg ggc atg tgc tgt gag agt gac tcg gag ccc 1441 Pro Ala Ser Ala Asp Ser Gly Met Cys Cys Glu Ser Asp Ser Glu Pro             385 390 395 ggc acc agt gtg tcg gag ccc ctc acc cct gat gcc ggg agt cac acc 1489 Gly Thr Ser Val Ser Glu Pro Leu Thr Pro Asp Ala Gly Ser His Thr         400 405 410 ttc gcc tca ggg cca gag gaa ggg ctg agc tac ctg gca gcc tca gag 1537 Phe Ala Ser Gly Pro Glu Glu Gly Leu Ser Tyr Leu Ala Ala Ser Glu     415 420 425 gct ccg ctg cca cct ggg ggc cct gcg gag gcc atc aag gag cat gaa 1585 Ala Pro Leu Pro Pro Gly Gly Pro Ala Glu Ala Ile Lys Glu His Glu 430 435 440 445 cgg tgg ctg gcc atg tgc atc cag gcc ctg cag cgg gaa gtg gca gag 1633 Arg Trp Leu Ala Met Cys Ile Gln Ala Leu Gln Arg Glu Val Ala Glu                 450 455 460 gag gac ctg gtg cag gtg gac aga gcc gtg gac gcc ctc gac cgc tgg 1681 Glu Asp Leu Val Gln Val Asp Arg Ala Val Asp Ala Leu Asp Arg Trp             465 470 475 gag atg ttc acg ggc cag ctc ccg gcc acc agg cag gac cca ccc agc 1729 Glu Met Phe Thr Gly Gln Leu Pro Ala Thr Arg Gln Asp Pro Pro Ser         480 485 490 agc agg gac agc gtg ggg ctg cgc aag gtg ctg ggg gac aag ttc tcc 1777 Ser Arg Asp Ser Val Gly Leu Arg Lys Val Leu Gly Asp Lys Phe Ser     495 500 505 tcc ctg agg agg aag ctg agt gcc cga aaa ctc gcc aga gca gag tcg 1825 Ser Leu Arg Arg Lys Leu Ser Ala Arg Lys Leu Ala Arg Ala Glu Ser 510 515 520 525 gcc ccc cgt ccc tgg gat ggg gaa gag agt tagtacagca ggggctgccc 1875 Ala Pro Arg Pro Trp Asp Gly Glu Glu Ser                 530 535 tacgtctcct tccctaagga tgctggccag cactgtccag tagctgaggc cactactgac 1935 ctgccaggga taaagaggaa gggtctgcct gggtggtggc tcaggcctgt catcccatca 1995 ctttgagagg ccaggagttt gagaccagac tgggcaacat ggtgagactc tgcctctaca 2055 aaagaaaaat taaaaaatta gctggatgaa gtggttcatg cctgtgttcc cagctattca 2115 ggaggctgag gtgggaggat tgcttgagcc taggaggtcg aggctgcagt gggatgtgat 2175 cataccactg 
tactgcagtc tgggccacac agaaagactg tctccagaaa aaaaaaagtt 2235 ctttggagaa gccacagacc acctgtcttc aggcgcctcc ttcaactcct gagtcccagc 2295 cagccgctcc caggggcctg cacacatgga gaggcctccc tgatcctggg tgcttctcgt 2355 ggagtacaag ccggactgtg ctgaggttgg gacagagccc cctcccctgc agaggcagaa 2415 ggaagcagcg tgcgtcctgt ctccttccag gctgtgggcc tgcccttcag ttatttatag 2475 ctggagctgg agaggctggc tcagatgagg agtgaccccg ggggcacaca ggctccacac 2535 tgccacccag cttccaaggc tgagtctcct ccctaacggg gaagtgacgg ggttttgtct 2595 ctatcatctc aggcgtcaac cacatgcaca cacacactgt cacgttctgt ggcgctaaca 2655 gcatcctgat cctgacggac ttcaccgggg ctctccaggc atctcttctg acaaacactg 2715 caggaggtga gggtgtctga cgtgcactga gggcagaggc ccccttattc ctgaggcggc 2775 tacagctcac cgtggggaag atcaactgtg gtgatgtttt tgggacagtt tcttggcaaa 2835 ggtggccgcg ctgtcagtac caagtagctg gaggtggtga tcagatgatc tgtctttcct 2895 ttttttttcg gtctagttct gtcagttgct gagagagggg tattattgcc atggctgggc 2955 gtttgatctg tctcccttta gttttgccgg attttgcttc ctgcactttg aagctccttt 3015 acgtccctgc acatttgatt gttgtgtctt cctgaggaat tgacccttt attgtcacac 3075 gtaccccctc agcctgtggt ggagggtctc tgggagagag acttccggtt ctgacagcgg 3135 aatcccttcg gagtgacgag ggcggggtca cagctcgcgc accctcatct ggagagaggc 3195 aagaacaggg cagcttggac cttttgtggg tgggtggctc acacggttat ggagggcttc 3255 taagaaaaac aaaatacaaa actaaaacta gaagggctat gcaggtgagg tgggctgagg 3315 cttccttgtg aatctgacat tggtggaggc cgactgaagg ctcccggtcc tggtcccact 3375 ccgtgttgac ttcagagaag caaagatgca gctcagaagt agcattagga tcttcgtccc 3435 gttctctttt gcgtagcttc aaaaaggcgt aacagtgacc tgggagggga gatgaagagc 3495 taggcctttc agaaagtgac caatggaagg tgccggcaga gcaccctgca gtaacagaag 3555 gtgaaagccg gagcctggtg gctgttgctg caaacaacca cccaaaactt agtggcttaa 3615 aatcaccaca gtctagctca tgaatctgta acttgggcag agctcggtgg agacggtttg 3675 cccctgtggc tcgactggag tgttcgttca cacggctgcc aagtggaatt tggctgggaa 3735 tctcaggcct tggttcctgc ccacgtgaga ctctccatgg aggaggggca tcttcttagg 3795 aaggcagctg ccataccgtg aggaaacccg acaaacaggg cagccagagc cagacagcag 3855 gctcctggtc ccagtcccgg 
aaaatgcaaa gggacaagta tgtgttgtatt tcagagaagc 3915 tcagatgtag cattaggacc ttcatccata cccttctctt ttgcatggct tctaaagggc 3975 atgacagtga cctgggaggt aactgtgcca gaataaagag gggagacgaa aaaaagaggg 4035 tcactttcag aaagggacct gggcttcctc acaatatggc ggttgccctc ctaaggtgga 4095 catcgggagg gtcagatgga agctctgtgg cctttcctaa ctcggcctca cgtcacagag 4155 cgtcaccttt cctgcactcc acggcagcag tgacaaggac cccaccccag gttcacgggg 4215 aggggacgtg gcagggaatg tgagacaaca caatattgct gtggccattc atgaacagtc 4275 agtcagcccc actcggcgtt cactgtgggg atttggctgg tgcacctgcg aggtggcctg 4335 acctgttttc cgatttcctt catcttctgc aaaaggttaa ccgctggagt gatgcgagat 4395 taaacagagg tgataaaaat 4415 <210> 13 <211> 3954 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (130) .. (2166) <400> 13 agggtctgtg tcaagagcgg caggggctgc aggagataag agaaacgggc tgccgtgggg 60 tgtgtgtagg cttcagagac atgggatcac ggaagactga agcagaaaca gtggattaag 120 acttcctga gcg ata gct ggc act gcc ttc ccc aca atg gca gag gtg gtg 171           Ala Ile Ala Gly Thr Ala Phe Pro Thr Met Ala Glu Val Val             1 5 10 gct gag gtg gcc gag atg cca aca cag atg tca cca ggg gca gtg gag 219 Ala Glu Val Ala Glu Met Pro Thr Gln Met Ser Pro Gly Ala Val Glu  15 20 25 30 atg tca aca cct atg tcg gca gag atg atg gag atg tca aca gaa gtg 267 Met Ser Thr Pro Met Ser Ala Glu Met Met Glu Met Ser Thr Glu Val                  35 40 45 act gag atg aca cct ggg gag gcc ctt gcc tca tcc ctc ttc ttc cag 315 Thr Glu Met Thr Pro Gly Glu Ala Leu Ala Ser Ser Leu Phe Phe Gln              50 55 60 cat cac cag ttc atg tgc tct gag tgt ggc agc ctc tat aac aca ctg 363 His His Gln Phe Met Cys Ser Glu Cys Gly Ser Leu Tyr Asn Thr Leu          65 70 75 gag gaa gtc ctc tca cac cag gag cag cac atg ctt gct gtc tca gag 411 Glu Glu Val Leu Ser His Gln Glu Gln His Met Leu Ala Val Ser Glu      80 85 90 gag gag gca ctg acc aca cag aat gtt ggc ctg gag ccg gag ctg gtg 459 Glu Glu Ala Leu Thr Thr Gln Asn Val Gly Leu Glu Pro Glu Leu Val  95 100 105 110 ccg ggt gct gag ggg ccc ttc cag tgt ggt gaa tgc agc cag ctc atc 
507 Pro Gly Ala Glu Gly Pro Phe Gln Cys Gly Glu Cys Ser Gln Leu Ile                 115 120 125 ctc tcc cct ggg gag ctc ctg gcc cac cag gat gcc cac ctc cga gag 555 Leu Ser Pro Gly Glu Leu Leu Ala His Gln Asp Ala His Leu Arg Glu             130 135 140 tct gca aac cag atc caa tac cag tgc tgg gac tgc cag gag ctg ttc 603 Ser Ala Asn Gln Ile Gln Tyr Gln Cys Trp Asp Cys Gln Glu Leu Phe         145 150 155 ccc tcg ccc gag ctg tgg gtg gct cat cga aag gcc cag cac ctt tct 651 Pro Ser Pro Glu Leu Trp Val Ala His Arg Lys Ala Gln His Leu Ser     160 165 170 gct acg gta gct gag cca cca gtg cca cct cct ttg cct ccc cca aca 699 Ala Thr Val Ala Glu Pro Pro Val Pro Pro Pro Leu Pro Pro Pro Thr 175 180 185 190 cca ctg cct cca cct tct ccc cca tcc gaa gtc aag atg gag ccc tat 747 Pro Leu Pro Pro Pro Ser Pro Pro Ser Glu Val Lys Met Glu Pro Tyr                 195 200 205 gag tgt cct gag tgc tct acc ctc tgc gcc acc cct gag gag ttc ttg 795 Glu Cys Pro Glu Cys Ser Thr Leu Cys Ala Thr Pro Glu Glu Phe Leu             210 215 220 gag cat cag ggc acc cac ttt gac tcc cta gag aaa gag gag cgc aat 843 Glu His Gln Gly Thr His Phe Asp Ser Leu Glu Lys Glu Glu Arg Asn         225 230 235 ggg ttg gag gag gag gaa gag gac gat gag gag gat gaa gaa gat gat 891 Gly Leu Glu Glu Glu Glu Glu Asp Asp Glu Glu Asp Glu Glu Asp Asp     240 245 250 gaa gag atg gag gat gag gag gcc atg gca gag gtc ggt gat gat gct 939 Glu Glu Met Glu Asp Glu Glu Ala Met Ala Glu Val Gly Asp Asp Ala 255 260 265 270 gtg gga ggt gac gag tcc aca gct ggc tgg gct cag ggc tgc ggg gac 987 Val Gly Gly Asp Glu Ser Thr Ala Gly Trp Ala Gln Gly Cys Gly Asp                 275 280 285 tgt ccc cag cac cag ccc tca gca ggg gct cgc cgg caa cac cgg cgg 1035 Cys Pro Gln His Gln Pro Ser Ala Gly Ala Arg Arg Gln His Arg Arg             290 295 300 acg gct cac agc ccg gca tct gcc acc cac ccc ttc cac tgc agc cag 1083 Thr Ala His Ser Pro Ala Ser Ala Thr His Pro Phe His Cys Ser Gln         305 310 315 tgt cag cgc agt ttc agc tcc gcc aac cgg ctg cag gct cat ggg cgg 
1131 Cys Gln Arg Ser Phe Ser Ser Ala Asn Arg Leu Gln Ala His Gly Arg     320 325 330 gcc cat gtt ggt ggc aca cat gag tgt aca acc tgc tcc aag gtc ttc 1179 Ala His Val Gly Gly Thr His Glu Cys Thr Thr Cys Ser Lys Val Phe 335 340 345 350 aag aaa gca gca tcg ctt gag cag cac ttg cgg ctg cat cgc ggg gaa 1227 Lys Lys Ala Ala Ser Leu Glu Gln His Leu Arg Leu His Arg Gly Glu                 355 360 365 gcc cgc tac ctc tgt gta gac tgt ggc cgc ggc ttt ggc aca gaa ctc 1275 Ala Arg Tyr Leu Cys Val Asp Cys Gly Arg Gly Phe Gly Thr Glu Leu             370 375 380 acg ttg gtg gct cac cgg cgg gcc cac act gcc aac cca ttg cat cgc 1323 Thr Leu Val Ala His Arg Arg Ala His Thr Ala Asn Pro Leu His Arg         385 390 395 tgt cgt tgc ggc aag acg ttc agc aac atg acc aag ttc ctc tac cac 1371 Cys Arg Cys Gly Lys Thr Phe Ser Asn Met Thr Lys Phe Leu Tyr His     400 405 410 cgg cgc act cac gcc ggc aaa agc ggg gca cct ccc aca gga gca aca 1419 Arg Arg Thr His Ala Gly Lys Ser Gly Ala Pro Pro Thr Gly Ala Thr 415 420 425 430 gct ccc cca gct cca gcg gag ccc acc cct cca cca cca ccc cct gcc 1467 Ala Pro Pro Ala Pro Ala Glu Pro Thr Pro Pro Pro Pro Pro Pro Ala                 435 440 445 cca cct gcc cag ctg ccc tgc cca cag tgc tcc aag tcc ttt gcc tca 1515 Pro Pro Ala Gln Leu Pro Cys Pro Gln Cys Ser Lys Ser Phe Ala Ser             450 455 460 gct tcc cgg ctg tcc cgg cac cgg cgt gca gta cac ggg ccc cct gaa 1563 Ala Ser Arg Leu Ser Arg His Arg Arg Ala Val His Gly Pro Pro Glu         465 470 475 cgg cgt cac cgc tgt ggg gtt tgt ggc aag ggc ttc aag aag ctg atc 1611 Arg Arg His Arg Cys Gly Val Cys Gly Lys Gly Phe Lys Lys Leu Ile     480 485 490 cac gtg cgc aac cac ctg cgg aca cac acg ggt gag agg ccc ttc cag 1659 His Val Arg Asn His Leu Arg Thr His Thr Gly Glu Arg Pro Phe Gln 495 500 505 510 tgc cac tca tgt ggc aag acc ttt gct tct ttg gcc aac ctc agc cgc 1707 Cys His Ser Cys Gly Lys Thr Phe Ala Ser Leu Ala Asn Leu Ser Arg                 515 520 525 cac cag ctg acc cat acg ggt gca cgt ccc tac caa tgc ctg gac tgt 
1755 His Gln Leu Thr His Thr Gly Ala Arg Pro Tyr Gln Cys Leu Asp Cys             530 535 540 ggc aag cgc ttc aca cag agc tcc aac ctg cag cag cac cgg cgg ttg 1803 Gly Lys Arg Phe Thr Gln Ser Ser Asn Leu Gln Gln His Arg Arg Leu         545 550 555 cac ttg cgg cca gtc gcc ttt gcc cgc gcc ccc cgc ctc ccc atc act 1851 His Leu Arg Pro Val Ala Phe Ala Arg Ala Pro Arg Leu Pro Ile Thr     560 565 570 ggt ctc tac aac aag agt ccc tac tac tgc ggg act tgt ggc cgc tgg 1899 Gly Leu Tyr Asn Lys Ser Pro Tyr Tyr Cys Gly Thr Cys Gly Arg Trp 575 580 585 590 ttc cgc gcc atg gcg ggc ttg cga ctg cat cag cgg gtc cat gcc cga 1947 Phe Arg Ala Met Ala Gly Leu Arg Leu His Gln Arg Val His Ala Arg                 595 600 605 gct cgg act ttg acg cta cag cct ccc aga tca cca tct cct gcc cca 1995 Ala Arg Thr Leu Thr Leu Gln Pro Pro Arg Ser Pro Ser Pro Ala Pro             610 615 620 ccc cca cct cca gag cct caa cag act atc atg tgc aca gag ctg ggg 2043 Pro Pro Pro Pro Glu Pro Gln Gln Thr Ile Met Cys Thr Glu Leu Gly         625 630 635 gag acc atc gcc atc att gag aca tcc cag cca ctg gcg ctt gag gac 2091 Glu Thr Ile Ala Ile Ile Glu Thr Ser Gln Pro Leu Ala Leu Glu Asp     640 645 650 acc ctg cag ctg tgc cag gct gca ctg ggg gcc agt gaa gca ggc ggg 2139 Thr Leu Gln Leu Cys Gln Ala Ala Leu Gly Ala Ser Glu Ala Gly Gly 655 660 665 670 ctc ttg cag ttg gac acg gcc ttc gtg tgacgcagct gaaaagcaac 2186 Leu Leu Gln Leu Asp Thr Ala Phe Val                 675 aacaaaaggg tttggttgca acagccagtg tgggtacctc tggggagaga ggacctcctc 2246 tgacaaactg gtctggtacc caccatgtgc caggatccac cctggcctct ttttacccac 2306 tgactcccca gaacaaccct tccaggcttc tcttgtcatc tttctctgcc tgaggggaaa 2366 ctgaagctct gaaatgcgat gtgatctgta ccaggtcacc cagctatgct gcaaagtggg 2426 ttggccaagg ccctttgcac tgcatcaccc tggtgcccag caacatcagg taaccttcac 2486 tgagcaccaa gcttatgcca ggtctgtgct ggccactctc atatacctct tcagatcctc 2546 tgcttgtacc cccagccctt gccttccctg gattttgggc acccaggact ttgctctgcc 2606 tggtggaggg tacttgattt ctctgggctt ccttcatctc aattctgaca gtgtggaagg 2666 
aaatctgtag gtacccaggt cctcagctcc agactgggtg atgctggaga cccaggagca 2726 agtcagcaca ggctctgccc acaggaggca tgcacaatct ggtaggagaa acgcacaagt 2786 acagatagct tgctctctga gtgtgtctca ttgattcatt cagcctgtgt gtcctaaagc 2846 ctcccttctc ccatactggg tgatgctggg ggcagagatg agtcaggcct aaggcttctc 2906 tgggctgtga tggggatgtg cagggacagt gggatcccag gaaacagacc tccatggtgg 2966 gagatcagga gttaaggact tcctagagaa ggtagtccct gaaggaaaag taggaatctg 3026 gcagtgaaag aggaaaagaa cattccaggc agacagcact gcaagagggt gagctcttgg 3086 tgtatttgga gaccatccag ttattcatct gactgcctac ctgaggcaag caccacactt 3146 accactgtat gtgtactatg atcccaacag gcttgtgaca tagatttcat attatccctg 3206 ttgtatagat ttttaaaact ggtggctggg cgtggtggct cacgcctgta atcccagcac 3266 tttgggaggc cgaggcgggc agatcacgag gtcgggagat cgagaccatc ctggctaaca 3326 cggtgaaacc ccgtctctac taaaaataca aaaaattggc caggagtggt ggcgggcgcc 3386 tgtggtggcc agctactcag gaggctgagg caggagaatg gcgtgaaccc gggaggcgga 3446 gcttgcagtg agccgagatc gcgccactgc actccagcct gggcgacaga gtgagactcc 3506 ctctcaaaag aaaataaaaa taaaaataaa ctggccaggt gcggtggctc acgcttgcaa 3566 tcccagtgct ttgggagact gaggcaggag gatcacttaa ggccaggagt tcgagaccag 3626 cctgggcaac agcaagacct catctctaca aaacttaaaa gttagccagg cctggtggca 3686 catgcctgta atccgagtgc tttgggaggc tgaggcagga ggatcactcg agctcaagaa 3746 ttcaagacct gggcaagaga gagagacttt gtctctatta aaaataaaaa tgattggccg 3806 agcatggtgg tgcacaccag tattcccagc tactctggag gctgaggcgg aggcaggatt 3866 gcttgatatg agggagattg aggctgcagt gagctgtgtt cacaccactg cactccagtc 3926 tgggcaacag tgcaagaccc tgtcactt 3954 <210> 14 <211> 3846 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (1) .. 
(1404) <400> 14 aat ttg gtt aca caa ctg aac att cct gca aga ata agg cct agt gaa 48 Asn Leu Val Thr Gln Leu Asn Ile Pro Ala Arg Ile Arg Pro Ser Glu   1 5 10 15 tgt gag acc ctt gga agc aat ttg gga cat aat gca gac tta ctt aat 96 Cys Glu Thr Leu Gly Ser Asn Leu Gly His Asn Ala Asp Leu Leu Asn              20 25 30 gag aat aat att ctt gca aaa aag aaa ccc tat aag tgt gat aaa tgt 144 Glu Asn Asn Ile Leu Ala Lys Lys Lys Pro Tyr Lys Cys Asp Lys Cys          35 40 45 aga aaa gcc ttt att cat aga tca tcg ctt act aaa cat gag aaa aca 192 Arg Lys Ala Phe Ile His Arg Ser Ser Leu Thr Lys His Glu Lys Thr      50 55 60 cat aaa gga gag gga gct ttc cct aat gga aca gat caa gga att tat 240 His Lys Gly Glu Gly Ala Phe Pro Asn Gly Thr Asp Gln Gly Ile Tyr  65 70 75 80 cct gga aag aaa cac cat gaa tgt acc gac tgt ggg aaa acc ttt ctc 288 Pro Gly Lys Lys His His Glu Cys Thr Asp Cys Gly Lys Thr Phe Leu                  85 90 95 tgg aag aca cag ctt act gag cat cag aga att cac act ggg gag aaa 336 Trp Lys Thr Gln Leu Thr Glu His Gln Arg Ile His Thr Gly Glu Lys             100 105 110 ccc ttt gaa tgc aat gta tgt gga aag gcc ttc agg cat agc tca tct 384 Pro Phe Glu Cys Asn Val Cys Gly Lys Ala Phe Arg His Ser Ser Ser         115 120 125 ctt ggt cag cat gag aat gct cat acc gga gag aaa ccc tat cag tgt 432 Leu Gly Gln His Glu Asn Ala His Thr Gly Glu Lys Pro Tyr Gln Cys     130 135 140 agt ctc tgt ggg aaa gcc ttc cag cgc agc tcc tcc ctt gtt caa cac 480 Ser Leu Cys Gly Lys Ala Phe Gln Arg Ser Ser Ser Leu Val Gln His 145 150 155 160 cag cga att cac act gga gag aaa ccc tat cga tgt aat cta tgt ggg 528 Gln Arg Ile His Thr Gly Glu Lys Pro Tyr Arg Cys Asn Leu Cys Gly                 165 170 175 agg tcc ttt agg cat ggc aca tcc ctc act caa cac gag gtc aca cac 576 Arg Ser Phe Arg His Gly Thr Ser Leu Thr Gln His Glu Val Thr His             180 185 190 agt gga gag aag ccc ttc cag tgt aag gaa tgt ggg aaa gcc ttt agt 624 Ser Gly Glu Lys Pro Phe Gln Cys Lys Glu Cys Gly Lys Ala Phe Ser         195 200 205 cga tgt tct tcc 
ctt gtc caa cat gag agg act cat act gga gag aaa 672 Arg Cys Ser Ser Leu Val Gln His Glu Arg Thr His Thr Gly Glu Lys     210 215 220 cct ttt gaa tgt agc ata tgt ggg agg gct ttt ggt cag agc cca tcc 720 Pro Phe Glu Cys Ser Ile Cys Gly Arg Ala Phe Gly Gln Ser Pro Ser 225 230 235 240 ctt tat aaa cat atg agg att cat aag aga ggc aaa cct tac caa agc 768 Leu Tyr Lys His Met Arg Ile His Lys Arg Gly Lys Pro Tyr Gln Ser                 245 250 255 agt aac tac agc ata gat ttc aag cac agc aca tct ctc act cag gat 816 Ser Asn Tyr Ser Ile Asp Phe Lys His Ser Thr Ser Leu Thr Gln Asp             260 265 270 gaa agc act ctt acc gaa gtg aaa tcc tac cat tgt aat gac tgt ggg 864 Glu Ser Thr Leu Thr Glu Val Lys Ser Tyr His Cys Asn Asp Cys Gly         275 280 285 gaa gac ttt agt cac att aca gac ttt act gac cat cag agg atc cat 912 Glu Asp Phe Ser His Ile Thr Asp Phe Thr Asp His Gln Arg Ile His     290 295 300 act gca gag aac ccc tat gat tgt gag cag gct ttt agt cag caa gct 960 Thr Ala Glu Asn Pro Tyr Asp Cys Glu Gln Ala Phe Ser Gln Gln Ala 305 310 315 320 att tct cat cct gga gag aaa ccc tat caa tgt aat gta tgt ggg aaa 1008 Ile Ser His Pro Gly Glu Lys Pro Tyr Gln Cys Asn Val Cys Gly Lys                 325 330 335 gct ttc aaa agg agt aca agt ttc ata gag cat cac aga att cat act 1056 Ala Phe Lys Arg Ser Thr Ser Phe Ile Glu His His Arg Ile His Thr             340 345 350 gga gag aaa ccc tat gaa tgt aat gag tgt gga gaa gcc ttt agt cga 1104 Gly Glu Lys Pro Tyr Glu Cys Asn Glu Cys Gly Glu Ala Phe Ser Arg         355 360 365 cgc tca tcg ctt act caa cat gag aga acc cac act gga gag aaa ccc 1152 Arg Ser Ser Leu Thr Gln His Glu Arg Thr His Thr Gly Glu Lys Pro     370 375 380 tat gaa tgt att gac tgt ggg aaa gcc ttt agt caa agt tca tct ctc 1200 Tyr Glu Cys Ile Asp Cys Gly Lys Ala Phe Ser Gln Ser Ser Ser Leu 385 390 395 400 att cag cat gag aga act cat act gga gag aag ccc tat gaa tgt aat 1248 Ile Gln His Glu Arg Thr His Thr Gly Glu Lys Pro Tyr Glu Cys Asn                 405 410 415 gaa tgt ggg aga gcc ttc 
cga aaa aaa acc aac ctg cat gat cat cag 1296 Glu Cys Gly Arg Ala Phe Arg Lys Lys Thr Asn Leu His Asp His Gln             420 425 430 aga att cat act gga gaa aaa ccc tat tct tgt aag gaa tgt ggg aaa 1344 Arg Ile His Thr Gly Glu Lys Pro Tyr Ser Cys Lys Glu Cys Gly Lys         435 440 445 aac ttc agc cga agt tca gct ctt act aaa cac cag aga att cat act 1392 Asn Phe Ser Arg Ser Ser Ala Leu Thr Lys His Gln Arg Ile His Thr     450 455 460 cga aat aaa ctc taggaaccgt gaaattaagg aatttgcaga atgctttagc 1444 Arg Asn Lys Leu 465 taaaatgttc tgattcagga tcagaggatt cttagagagc ttgggaatgt aatgaattac 1504 gtgtgtgttt atacgttgtg tgtggagaaa actgccagta gacagatttt tttttttttt 1564 taacataaag acacattctc agatctgatt acagactagt gtaaaaacag ctacatgtat 1624 gtagctggtt ggggatgata tgcctgtatg ttggactttg cttttgaata tatgtatgca 1684 ggatatcatc aagtttcaac atcttgactt gtgaccccca atgtcaacag cttttttaaa 1744 aaacaaattc ctgcagtaat gaccaaaacc cattttaaaa attgcttgac aactgcactc 1804 aactgcagct cttacattaa cttcaccatg gaaaccagtt ccaactccag gaagtcacca 1864 ttcaaagaat tagatcaact agcccaacca cttcattgta cagatgaaga ctgaaagcca 1924 aagatgtgaa gtggtttcca cagtatgata cagcctataa gggtaaagct gggttaaaaa 1984 tgcaggtttc ctggatttgg ggccccatgg ccttgccagt gaaaaggtta tttttggact 2044 cagagggctt taaaataaat tttaagatgt atcagataca caaacattta atgggcacct 2104 atgggttgga cactttgaga attcttaaaa gtataagtgg gagcaaaatg tatgcaaatt 2164 tatcacaaac tatttaaagc aacttcttgg aggcttacaa accacaattt aacagaaact 2224 gtagatggtt gaactactag tgactttttt ccccttttcc cagttacaat tatactttca 2284 gctaacatat gccagtttca cagaactatt aagtcccctt attgtacttt ttatggcatg 2344 cccatgaaaa agcactttct taagcctaca gtatcagatc aatgggaaaa caacagaaaa 2404 ctaagaggag aattttcccg ttaattttct tgcagaaaag tataagtcta attgcccatt 2464 gccataaatt ttgtcttgta ctcagagaag caacatgcac tggctcattt tatgtgcaaa 2524 gaaaagattt caccattaaa aaaattaact tggctaggta tggtgtctca cacctgtaat 2584 cccagcactt tgggtggcta aggcagatag actgcttgaa cccaggagtt caagaccagc 2644 ctggacaaca tggtgaaacc ccatctcttt aaaaaaaaaa aaaaaaatcc 
aaaaattagc 2704 tgggcatggt ggcatgcagt ggtagtccca gctactcagg aggctgaggt gggaggatca 2764 ctggaacccg ggagcagaga ctgcagtgag ctgagatcac actactgcat tccagcctga 2824 gcaacagagc aagacacaca cacacatcaa tttattttag ttgtataatg cttttctatt 2884 agtaaagcat cagctaagct tcagtggcct gctccatccc ctaatgactc ccatgggcta 2944 tcctaaagga acttccagaa cctttgttgg tgtgttgaca ttgaccatgc agaccaattt 3004 gggcacaact ggacattgat tccttttaca caagagctgc ctcccaaaga tagataaatt 3064 ttcccagccc taaatatgaa tcatggggca agatattggt cgtattgatg gtgaaccttt 3124 cctactggat tctttgcatg ccacatagca ggattcattg cctttctctc atcatggatg 3184 gcatgcagca gcacccaagt attcttcatt ctttgcaggg aaaaaattgt gcatgggggc 3244 tgaaatgtag tatgtgtagc tcaattagtc tctcctctgt gatgcaaaat ggaatattca 3304 atggcagatc tgcccttctg agatgctgac catccaaaac accttgttta tggtgcacca 3364 tgattagctc acacacaatg ccaaggctgt gcttctatta tctgatacat agtttgacaa 3424 tgggtaattc tactcagacc ctccctactg attggctagg atgcctgtca ggaactcatt 3484 atgctactgg ttgtttgggg atccccatag tggactactt tcaggaatgg catgaattgt 3544 aaccaactga gtgctgcccc cactgttacg gaagtttata aaaccttagt tccagaagac 3604 ccaaaggaga gtactggttt gtgtttggtg cttggcctag atccagccac cactctgaaa 3664 ctcatcacat cttcattgac agggagggag cccaggacat atgtgtggct cattgaccag 3724 aaggctttct tagtcccaac agccatgaac catgcactta tggataccca gccttttagg 3784 gctacgtgaa atgcatcctt gtaacatcat tgtattcttt caataaatag ccttctgagt 3844 tg 3846 <210> 15 <211> 3002 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (2033) .. 
(2884) <400> 15 ccggcttgag gcatccagca gccagtcctt tggtcttgga ccacaccgag gacggctcaa 60 cattcagtca ggcctggagg acggcgatct atatgatgga gcctggtgtg ctgaggagca 120 ggacgccgat ccatggtttc aggtggacgc tgggcacccc acccgcttct cgggtgttat 180 cacacagggc aggaactctg tctggaggta tgactgggtc acatcataca aggtccagtt 240 cagcaatgac agtcggacct ggtggggaag taggaaccac agcagtggga tggacgcagt 300 gagtggtccc actgtggctg gggcctccat gctgggagtt gggcacccag tccaggctag 360 gctgaggctc ctctgaggac aaggaataga cgccagctta ggcttcccag gggggtgtgg 420 cttgttgtca agagggtggc acacggcagg caccattggg agccagctgc tttgggacat 480 gcccacatcc tccccagata atgccaccac agggtgggtg ctgcttcacg gtacagcttc 540 ctcctggcat gccccttctg gcccggggcc tctggtccac atcacttctt gccttctcgt 600 ggttctgact tccgcatctc atggacctct ttttacagca ggctacaatg tggagtcctg 660 gccagctcta ggattggctt cccccgagtc atgtggccaa actggtctaa tgaactgtgt 720 ccaatccaga gagcaaggct gcctagggct gcccattggc aggggctgtg ggccggggtc 780 tgtgtttgat gcacagtgca agtctctagc tgagcccact agggtgggga gacagtgagc 840 ttggaggcct gagctccttc cctgggtcct gggccaggct tctggggttt gagcagccac 900 aacagagaac ttgctgcccc caggtatttc ctgccaattc agacccagaa actccagtgc 960 tgaacctcct gccggagccc caggtggccc gcttcattcg cctgctgccc cagacctggc 1020 tccagggagg cgcgccttgc ctccgggcag agatcctggc ctgcccagtc tcagacccca 1080 atgacctatt ccttgaggcc cctgcgtcgg gatcctctga ccctctagac tttcagcatc 1140 acaattacaa ggccatgagg aaggtcagat ataaccccta tgacctggga aggagggccc 1200 acccatctca ggtccccttc ccaccttccc accggggcac aacctgctgt gactgcgctt 1260 gtatgcccct gctgcctcct gatgtctcag ccttctctcc tgtggacccc taagctccat 1320 cccactttcc cttattatgg cgccccccca gtcctgcccc ttcctcccgg ctctgctgcc 1380 gctcccctcc tgtaccatga tgggatgccc cctctgtgtg ggccatcgct gactttttaa 1440 gtctttccat ggcacatgtg atctgcccct gggtgtaccc ctcccatgcc tcatgccacg 1500 ctacactctg cccaccagct gatgaagcag gtacaagagc aatgccccaa catcacccgc 1560 atctacagca ttgggaagag ctaccagggc ctgaagctgt atgtgatgga aatgtcggac 1620 aagcctgggg agcatgagct gggggagcct gaggtgcgct acgtggctgg catgcatggg 1680 aacgaggccc 
tggggcggga gttgcttctg ctcctgatgc agttcctgtg ccatgagttc 1740 ctgcgaggga acccacgggt gacccggctg ctctctgaga tgcgcattca cctgctgccc 1800 tccatgaacc ctgatggcta tgagatcgcc taccaccggg gttcagagct ggtgggctgg 1860 gccgagggcc gctggaacaa ccagagcatc gatcttaacc ataattttgc tgacctcaac 1920 acaccactgt gggaagcaca ggacgatggg aaggtgcccc acatcgtccc caaccatcac 1980 ctgccattgc ccacttacta caccctgccc aatgccaccg tgagtatttt ga ggg cgg 2038                                                           Gly Arg                                                             1 cag tgg agg tct gtg ggg ggc gga cct tgt ctc tgt ctc ctg ccc ctc 2086 Gln Trp Arg Ser Val Gly Gly Gly Pro Cys Leu Cys Leu Leu Pro Leu           5 10 15 ctg acc tgc ccc atc cag gtg gct cct gaa acg cgg gca gta atc aag 2134 Leu Thr Cys Pro Ile Gln Val Ala Pro Glu Thr Arg Ala Val Ile Lys      20 25 30 tgg atg aag cgg atc ccc ttt gtg cta agt gcc aac ctc cac ggg ggt 2182 Trp Met Lys Arg Ile Pro Phe Val Leu Ser Ala Asn Leu His Gly Gly  35 40 45 50 gag ctc gtg gtg tcc tac cca ttc gac atg act cgc acc ccg tgg gct 2230 Glu Leu Val Val Ser Tyr Pro Phe Asp Met Thr Arg Thr Pro Trp Ala                  55 60 65 gcc cgc gag ctc acg ccc aca cca gat gat gct gtg ttt cgc tgg ctc 2278 Ala Arg Glu Leu Thr Pro Thr Pro Asp Asp Ala Val Phe Arg Trp Leu              70 75 80 agc act gtc tat gct ggc agt aat ctg gcc atg cag gac acc agc cgc 2326 Ser Thr Val Tyr Ala Gly Ser Asn Leu Ala Met Gln Asp Thr Ser Arg          85 90 95 cga ccc tgc cac agc cag gac ttc tcc gtg cac ggc aac atc atc aac 2374 Arg Pro Cys His Ser Gln Asp Phe Ser Val His Gly Asn Ile Ile Asn     100 105 110 ggg gct gac tgg cac acg gtc ccc ggg agc atg aat gac ttc agc tac 2422 Gly Ala Asp Trp His Thr Val Pro Gly Ser Met Asn Asp Phe Ser Tyr 115 120 125 130 cta cac acc aac tgc ttt gag gtc act gtg gag ctg tcc tgt gac aag 2470 Leu His Thr Asn Cys Phe Glu Val Thr Val Glu Leu Ser Cys Asp Lys                 135 140 145 ttc cct cac gag aat gaa ttg ccc cag gag tgg gag aac aac aaa gac 2518 Phe Pro His Glu Asn 
Glu Leu Pro Gln Glu Trp Glu Asn Asn Lys Asp             150 155 160 gcc ctc ctc acc tac ctg gag cag gtg cgc atg ggc att gca gga gtg 2566 Ala Leu Leu Thr Tyr Leu Glu Gln Val Arg Met Gly Ile Ala Gly Val         165 170 175 gtg agg gac aag gac acg gag ctt ggg att gct gac gct gtc att gcc 2614 Val Arg Asp Lys Asp Thr Glu Leu Gly Ile Ala Asp Ala Val Ile Ala     180 185 190 gtg gat ggg att aac cat gac gtg acc acg gcg tgg ggc ggg gat tat 2662 Val Asp Gly Ile Asn His Asp Val Thr Thr Ala Trp Gly Gly Asp Tyr 195 200 205 210 tgg cgt ctg ctg acc cca ggg gac tac atg gtg act gcc agt gcc gag 2710 Trp Arg Leu Leu Thr Pro Gly Asp Tyr Met Val Thr Ala Ser Ala Glu                 215 220 225 ggc tac cat tca gtg aca cgg aac tgt cgg gtc acc ttt gaa gag ggc 2758 Gly Tyr His Ser Val Thr Arg Asn Cys Arg Val Thr Phe Glu Glu Gly             230 235 240 ccc ttc ccc tgc aat ttc gtg ctc acc aag act ccc aaa cag agg ctg 2806 Pro Phe Pro Cys Asn Phe Val Leu Thr Lys Thr Pro Lys Gln Arg Leu         245 250 255 cgc gag ctg ctg gca gct ggg gcc aag gtg ccc ccg gac ctt cgc agg 2854 Arg Glu Leu Leu Ala Ala Gly Ala Lys Val Pro Pro Asp Leu Arg Arg     260 265 270 cgc ctg gag cgg cta agg gga cag aag gat tgatacctgc ggtttaagag 2904 Arg Leu Glu Arg Leu Arg Gly Gln Lys Asp 275 280 ccctagggca ggctggacct gtcaagacgg gaaggggaag agtagagagg gagggacaaa 2964 gtgaggaaaa ggtgctcatt aaagctaccg ggcacctt 3002 <210> 16 <211> 3521 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (783) .. 
(1502) <400> 16 ctggggctct gcgctcgtcc ggccggcccc ggcctcgccg ccccgcgcag tacccagccc 60 ggccccgccg acccgcctct actgccggct ccgcgccctt ccccgagggc tggatgatgg 120 gctgtttcgc cctgcaaacg gtggacaccg agctgaccgc ggactcggtg gagtggtgcc 180 cgctgcaagg ctgcaggcac ctgctggcgt gcgggaccta ccagctgcgg cggccggagg 240 accggcctgc cggcccccag aacaagggtg gaatggaagt taaggagcct caggtccgtt 300 taggccgtct cttcctgtac agtttcaatg acaacaactc tattcaccct ctggtcgagg 360 tccaaagaaa agatacttct gcaatcctgg acatgaaatg atatacaacc acttcctgtt 420 gtgtcacact tgcctgcagt aacatgctgt aaggtttgtg gcataggagc agtgggccat 480 accatataat caaggtgtca catcccggtg gctggacatg ccctcttggg cttggcagat 540 gccagtggat ccatacaact gctccgcctg gtggaatctg agttgtattg cgttctacat 600 ccaactccaa gaggaagtca cctcatgtgt caccagcaga agggctgaag tgacaggatg 660 ttcattgacc tgtcagtgga tctgaaagtt ctctaaggag agcctgggca agcattctta 720 ggttgatgct ggggcccaga gtagcagtga gcatcctgtg tgaagatggc atttctcact 780 ga tta ttg gaa aag cac aag agc cac gtg ctg gag cca ttg tcc agc 827    Leu Leu Glu Lys His Lys Ser His Val Leu Glu Pro Leu Ser Ser      1 5 10 15 ctt gcc ctg gag gag cag tgt ctg gct ttg tcc cta gat tgg tcc act 875 Leu Ala Leu Glu Glu Gln Cys Leu Ala Leu Ser Leu Asp Trp Ser Thr                  20 25 30 ggg aaa act gga agg gcc ggg gac cag ccc ttg aag atc atc agc agt 923 Gly Lys Thr Gly Arg Ala Gly Asp Gln Pro Leu Lys Ile Ile Ser Ser              35 40 45 gac tcc aca ggg cag ctc cac ctc ctg atg gtg aat gag acg agg ccc 971 Asp Ser Thr Gly Gln Leu His Leu Leu Met Val Asn Glu Thr Arg Pro          50 55 60 agg ctg cag aaa gtg gcc tca tgg cag gca cat caa ttc gag gcc tgg 1019 Arg Leu Gln Lys Val Ala Ser Trp Gln Ala His Gln Phe Glu Ala Trp      65 70 75 att gct gct ttc aat tac tgg cat cca gaa att gtg tat tca ggg ggc 1067 Ile Ala Ala Phe Asn Tyr Trp His Pro Glu Ile Val Tyr Ser Gly Gly  80 85 90 95 gac gat ggc ctt ctg agg ggc tgg gac acc agg gta ccc ggc aaa ttt 1115 Asp Asp Gly Leu Leu Arg Gly Trp Asp Thr Arg Val Pro Gly Lys Phe                 100 105 110 ctc ttc acc 
agc aaa aga cac acc atg ggt gtg tgc agc atc cag agc 1163 Leu Phe Thr Ser Lys Arg His Thr Met Gly Val Cys Ser Ile Gln Ser             115 120 125 agc cct cat cgg gag cac atc ctg gcc acg gga agc tat gat gaa cac 1211 Ser Pro His Arg Glu His Ile Leu Ala Thr Gly Ser Tyr Asp Glu His         130 135 140 atc cta ctg tgg gac aca cga aac atg aag cag ccg ttg gca gat acg 1259 Ile Leu Leu Trp Asp Thr Arg Asn Met Lys Gln Pro Leu Ala Asp Thr     145 150 155 cct gtg cag ggt ggg gta tgg aga atc aag tgg cac cct ttc cac cac 1307 Pro Val Gln Gly Gly Val Trp Arg Ile Lys Trp His Pro Phe His His 160 165 170 175 cac ctg ctc ctg gcc gcc tgc atg cac agt ggc ttt aag atc ctc aac 1355 His Leu Leu Leu Ala Ala Cys Met His Ser Gly Phe Lys Ile Leu Asn                 180 185 190 tgc caa aag gca atg ggt ggg agc gca gtg gtg gag tca gag ctc act 1403 Cys Gln Lys Ala Met Gly Gly Ser Ala Val Val Glu Ser Glu Leu Thr             195 200 205 gca gcc cgg acc acc tgg aca caa gca gtc ctc ctg cct caa ctc agt 1451 Ala Ala Arg Thr Thr Trp Thr Gln Ala Val Leu Leu Pro Gln Leu Ser         210 215 220 cct cag tgt cgg gat tac agg ctt gag cca ctg cat cca gcc ttc agt 1499 Pro Gln Cys Arg Asp Tyr Arg Leu Glu Pro Leu His Pro Ala Phe Ser     225 230 235 tct taaagttgaa gaggtgaact aagacaggtc tgagagttca acattccagg 1552 Ser 240 ccagttttgt caggctgtgg ctctctgagg gcttagactc tgccttatta ccttttccta 1612 agctgtagaa gaggtcctgc tggcccgagt ggacgtgttg agcgattgct ggatgaaaga 1672 ctagcggagg tcctgcctgc cctgtgacgc tacctcccaa gggacagtaa cggcctcccc 1732 aggggcttcc tcccgagggt cgccccacac atgctgctcg agggcccctg tggctgtctc 1792 tgaacattca gcatctcact cctcctcgga gccacctctg cccagttgaa gccctgggcc 1852 cagggttggg cctccccggc ccaggcccat acaacctcgg cacgttttgt aatccagtgc 1912 ggtcctgttc tgccctcagc ccttcttcca tcagcactca ttctggttct gattccggtc 1972 actgctggca tcaatccctg tacttcagag gtagcgatgg cagccctttc agagcccaag 2032 tggccaccca gccttgcagc ccctggctca ccttccactt gctctctgca actgcccctg 2092 ccagccctgg ccttgcctcc tggactccag atcgcctcgg gtttggccac gcctcgtcat 2152 
tccctctgtt gctgtgtgct ctgggaggca gttgtggcct ctttccagtt gccgtagcaa 2212 cttctataaa gaaactgctc acttaacctt gtcagtggct cctttccaag ccagaatcca 2272 gactctgaaa cagggcgtgc cggaccctgc atgcttggga cgctccagcc cactgtcact 2332 cactgacagt gccatggctg cctgtttgtg gttccccgtg cgagccttgg actttctcac 2392 ctctgctgta cacatgctgc ctgttcctca gacatcccct tccttcccag ctggctaaaa 2452 ctcctttgtt ctttggtttt aagtgttaac tgaggtcttt ccccctctct ggcttttgcc 2512 gtattcttca ccatcaagtg aggttctgta gccccgcttg atgcttctct tggaggcaca 2572 caggcgtgtg cactgcaaga tgacggtctg tttcttctcc aacatatgcc caggtcttgc 2632 tggttccttt gcatgacaca acggctcctc aggaaacact tgctgacctg tgaggggctg 2692 ttcttggagt ctccctgtca ggtgtgacag gttggggaga ggctctgggt gatgttgtgt 2752 ttcctgggtt ctcccatctc tcactccaga ggagaggcag gaggcgacgg tcctgacatc 2812 tcacacattg cccgactcgc tggtgtatgg agccgactgg tcctggctgc tcttccgttc 2872 tctgcagcgg gccccctcgt ggtcctttcc tagcaaccta ggaaccaaga cggcagacct 2932 gaagggtgca agcgagttgc caacaccctg tcatgaatgc agagaggata acgatgggga 2992 gggccatgcc agaccccaga gtggaatgaa gccactcaca gagggcatga ggaagaatgg 3052 cacctggctg caggctacag cagccaccac acgtgactgt ggcgtgaacc cagaagaagc 3112 agactcagcc ttcagcctcc tggccacctg ctccttctat gaccatgcgc tccacctctg 3172 ggagtgggag gggaactgag cttgaaatca tgaagcccct tcccacaagg aaaccaggag 3232 ggagactgcg agtgagtgcc cgggaccacc tcatcagaga tgcttactgc agccctgcag 3292 gtgcctgtgc actgatggaa tccacagtgt agtcagaaaa gctgttgact tctcttaaat 3352 cagcttccct gctgggcccc tgaaagtgga ctgggtgatt ctgtctggca gagagtgggg 3412 aaaagacgcg gtttccagct tgcagatttg ttaagtttct caggcagatt ttgactttca 3472 gcctttcata cttgtttaag caactatttg tattaaatga agttttttg 3521 <210> 17 <211> 2976 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (26) .. 
(1468) <400> 17 gcggtttccc gccgcgggtg aatga gtc cga ctg cgc tgc cgc cgc cgc cgc 52                             Val Arg Leu Arg Cys Arg Arg Arg Arg                               1 5 cgc ccg cat cga agc tcc ggc cgc gtc ccc cgc gcc agc ccc ggg agc 100 Arg Pro His Arg Ser Ser Gly Arg Val Pro Arg Ala Ser Pro Gly Ser  10 15 20 25 ctg cgc gcc gag acc cct gcg tgt ctg cga aca gac ctc cgg cct cgg 148 Leu Arg Ala Glu Thr Pro Ala Cys Leu Arg Thr Asp Leu Arg Pro Arg                  30 35 40 gcc atc acc ccg ggc aga cgc tgc agc ccc tgt ccg ggc ctc ttg cct 196 Ala Ile Thr Pro Gly Arg Arg Cys Ser Pro Cys Pro Gly Leu Leu Pro              45 50 55 gcc gcc ccc atg aga aaa acc aac atg tgg ttc ttg gag cgg ctt cgg 244 Ala Ala Pro Met Arg Lys Thr Asn Met Trp Phe Leu Glu Arg Leu Arg          60 65 70 ggg tct ggg gaa aac ggt gct gcc cgg ggc gtg ggg agt gag gcg ggg 292 Gly Ser Gly Glu Asn Gly Ala Ala Arg Gly Val Gly Ser Glu Ala Gly      75 80 85 gac aag gcc tcc aag ggg ccc ctg tac agc aat gtg ttg acg ccc gac 340 Asp Lys Ala Ser Lys Gly Pro Leu Tyr Ser Asn Val Leu Thr Pro Asp  90 95 100 105 aag atc ccc gac ttt ttc atc ccc ccc aag ctg ccc tcg ggc ccc gcg 388 Lys Ile Pro Asp Phe Phe Ile Pro Pro Lys Leu Pro Ser Gly Pro Ala                 110 115 120 gag ggc gag gga cag gcc gcg ctg ggc ccc tcc acg tcg gaa cag aac 436 Glu Gly Glu Gly Gln Ala Ala Leu Gly Pro Ser Thr Ser Glu Gln Asn             125 130 135 ctg gcc tct gcg gcc ccc cgc cag acc cca cgg agc ccc cgg ctg cct 484 Leu Ala Ser Ala Ala Pro Arg Gln Thr Pro Arg Ser Pro Arg Leu Pro         140 145 150 gcc aag ctg gca gcc gag agc aag agc ctg ctg aag gca gcc acc cgg 532 Ala Lys Leu Ala Ala Glu Ser Lys Ser Leu Leu Lys Ala Ala Thr Arg     155 160 165 cac gtg atc cag atc gag agt gcc gag gac tgg ctg tcc gag gag gcc 580 His Val Ile Gln Ile Glu Ser Ala Glu Asp Trp Leu Ser Glu Glu Ala 170 175 180 185 act gac gcc gac ccc cag gcc cag ggt gcc atg tcc ctg ccc tcg gtg 628 Thr Asp Ala Asp Pro Gln Ala Gln Gly Ala Met Ser Leu Pro Ser Val                 190 
195 200 ccc aag gcc cag acg tcc tac ggc ttc gcc atg ctg gcc gag agc ccc 676 Pro Lys Ala Gln Thr Ser Tyr Gly Phe Ala Met Leu Ala Glu Ser Pro             205 210 215 cac acg agg cgc aag gag tct ctg ttc cac agt gag cac ggg gct ctg 724 His Thr Arg Arg Lys Glu Ser Leu Phe His Ser Glu His Gly Ala Leu         220 225 230 gcc cag gtg ggc tcc cca ggg gcc ggg cgc cgc cgg gca gct gcc aag 772 Ala Gln Val Gly Ser Pro Gly Ala Gly Arg Arg Arg Ala Ala Ala Lys     235 240 245 gcc aac ggg ggt gat ggg ggc ccc agg gag gct ggc ggg gcc ctc atg 820 Ala Asn Gly Gly Asp Gly Gly Pro Arg Glu Ala Gly Gly Ala Leu Met 250 255 260 265 agc ccc ggc cgc tac ttc agt ggc ggg gag agc gac aca ggg tcc tcg 868 Ser Pro Gly Arg Tyr Phe Ser Gly Gly Glu Ser Asp Thr Gly Ser Ser                 270 275 280 gcc gag tcc tcc ccc ttc ggg tcc cct ctg ctg tcc cgc tcc gtg tct 916 Ala Glu Ser Ser Pro Phe Gly Ser Pro Leu Leu Ser Arg Ser Val Ser             285 290 295 ctg ctc aaa ggt ttc gcc cag gac agc cag gcc aag gtg agc cag ctc 964 Leu Leu Lys Gly Phe Ala Gln Asp Ser Gln Ala Lys Val Ser Gln Leu         300 305 310 cgg cac tcc gtg ggc cgc cac ggc tcc ctg tct gcg gac gac agc acc 1012 Arg His Ser Val Gly Arg His Gly Ser Leu Ser Ala Asp Asp Ser Thr     315 320 325 ccg gac gcc agc ccc ggg agc cgg cgc cgc ctg acc cgc cgg gca ccc 1060 Pro Asp Ala Ser Pro Gly Ser Arg Arg Arg Leu Thr Arg Arg Ala Pro 330 335 340 345 ccg gaa cct ggc ccc gag tcg ggc cag gcg cgt ggg gag cac acg gtc 1108 Pro Glu Pro Gly Pro Glu Ser Gly Gln Ala Arg Gly Glu His Thr Val                 350 355 360 cac gtg ggc cct cgg ggc agc gtg cgg ctg ctg gcc gag tac gag gcc 1156 His Val Gly Pro Arg Gly Ser Val Arg Leu Leu Ala Glu Tyr Glu Ala             365 370 375 ggc cag gcc cgc ctg cgg gtg cac ctg ctg gcc gcc gag ggc ctc tac 1204 Gly Gln Ala Arg Leu Arg Val His Leu Leu Ala Ala Glu Gly Leu Tyr         380 385 390 gac cgc ctg tgc gac gcc cgc agc atc aac tgc tgc gtg ggc ctg tgc 1252 Asp Arg Leu Cys Asp Ala Arg Ser Ile Asn Cys Cys Val Gly Leu Cys     395 400 405 
ctg gtg ccg ggc aag ctg cag aag cag cgc agc acc atc gtg aag aac 1300 Leu Val Pro Gly Lys Leu Gln Lys Gln Arg Ser Thr Ile Val Lys Asn 410 415 420 425 agc cgc cgc ccc gtc ttc aac gag gat ttc ttc ttc gac ggc ctg ggg 1348 Ser Arg Arg Pro Val Phe Asn Glu Asp Phe Phe Phe Asp Gly Leu Gly                 430 435 440 ccc gcc agc gtc cgg aaa ctg gcc ctc agg atc aag gtg gtg aac aag 1396 Pro Ala Ser Val Arg Lys Leu Ala Leu Arg Ile Lys Val Val Asn Lys             445 450 455 ggc agc agc ctc aag cgg gac acg ctg ctc ggg gag aag gag ctg ccc 1444 Gly Ser Ser Leu Lys Arg Asp Thr Leu Leu Gly Glu Lys Glu Leu Pro         460 465 470 ctg acc tcc ctg ctc ccc ttc ctg tagagcgggg acctgctcgc tgtccgctcg 1498 Leu Thr Ser Leu Leu Pro Phe Leu     475 480 ggtgcagagg gcagaccccg gtctctcctc cacaccgggt gctggcgggc gggtccgctg 1558 gcttgggctg gggctgggcg gggaggggct cacactcagg ctgggctggg cgctgatgct 1618 gtagctgacg caggttttcc tgggtcaccc gatgctgact ggagtattat ttttcatgaa 1678 ttaatttctc ctttggagga ggaggaggag gaggagaggg gcctcggggg aagagaaggg 1738 gccacttcgc ggggcagagg gtcccccttt gcccctccac actgcctcgc gcctttcacg 1798 cagagttcta gaaggaggtt ctcaggggtc caggaatccg tgcacgtggc cagggcgggg 1858 ccgtggacag ggaggtgaca agtcttggag cccacagtct ccttcactgc acgtgtgtct 1918 tcctccccca cagcctgccg ccatccagtt cccaccccgc ggagagcagg gcctcgcggc 1978 cacaccaggc ccaggctctg cccgggacgc cccctcccag gggtcaggct tggctctggg 2038 agaccccagc ttcccaggct ccggtgggga gcggggccag gcgagggtgc cctgacgagg 2098 gtgggcgcgg agggagcagc cgctgccaca gtcggagcag aggaggtctt ggcagaggtc 2158 cgcggggagg agacggctct cctggcctgt tttcctccga cactgtcgcc gctggctttt 2218 tgggtacagg cccgtctggc catttctgct gtggcgccgg agggtcttgc tcagagatgc 2278 ccagagagca gccctatggc ccatccgtcc tcgctggggg ctgccacccc gtcgcccttt 2338 tgtccagcgt ctgcagctgg gagcgacacg cagtgattga cggccgtggc tttgacgcca 2398 cgtgtgtaaa taggctgtgg ctacggtggg tgggtcggag cccaggaggc agctgggggg 2458 acctggctat ggcctttcct gctgttaggt ggctgcaagg cttccattgt gtggttatgt 2518 gggtccttga atgattttgg ggaggtttcc agctcagaat gatgcagaaa 
tgataagact 2578 caaagcaggg gccaggccag gccagtgcct tcgcctctcc cggctgctgg tgggcacggg 2638 ggaaccaggg cacatctgtg gtacccaggg acgtcccttg tcagcccgtt tgccacacat 2698 tgttcctctt gtccagggga gggtggagga gctgcttccc aggactggag gagcagctgg 2758 gcccctgctg cacgtccggt gggacacacc tgtgagcccc gccttctctg atgctcaccc 2818 ctcccaggcc ctgttctccc tggggtcccc cgtttagtag cccccctgca ctctttgata 2878 tcttagtgtc tgaggttgac tgtgggtaaa tctttaagac actccccagc tgtgtttgtt 2938 tataaatgct gtttatagtg caataaaggt gtttcggg 2976 <210> 18 <211> 5385 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (2088) .. (2885) <400> 18 cccagcttga ctagttgttt ttcttttctg gtctgttctg ctcttttggt ttaaataatt 60 aaagatcctt gttgacaagt aaaaaaaatt ctttatgaag ctaaaaaaag tggttctaat 120 taaacattta aacattcttc catatacttt taaaaacagt tgtaattata gctctactta 180 atgttaagta ctttaaccaa atgaaaggtt aattatacag actggcaatt ataaaaaata 240 aggtaacata ttcatggcat tatgaataat taataagaac tcttcgattt catttagacc 300 agggattctg tgctaatatg ctctcagctg tcttgaaata atggtgaaat tttaaaatta 360 gttttgtact taaacatgtc cccaaagtga agcacagcgg ccaacgtccg tcacggtgcc 420 gagcagtgcc agccgctcta ccatggccga cggcctgatt cccaagggct gcgtccaggg 480 gccgtgctag tgcagtgaat gttcaggaaa cagaccaggt ggcaggaggg agtcttgtcc 540 tcaaggaacc tcataagtct tgctaggggg acaaacaacg agagaatgaa actggcagcc 600 atagaatccg ggccttgatg cccaggcctc gccatgcata cctgctggcc ggcatctccc 660 tgagacccgt ctggcacagg caacggcagg gcccggccct tggagcacag ggatggcaga 720 gcagggtcat cgaagcaggt ggcgtgcaga cagtggcttc tccgcaagag ctcctggctc 780 tcagggagct gagggctcct caatggggaa ttctggctga gctccagggg aaagcgggtt 840 ggaggcctgg ctcctccgcc cccatctcct ctgccccaac cctggaaggg ggtcagatat 900 caatgctaac atttgtgctg actcaattaa ttcaaacgat tcacaaaaac aaacttgctt 960 aaagtattaa tttcctaaaa agaactaaaa tgagttcaaa taattcacat aattaaatta 1020 ttttaattgt taatttaatt aaaacatctc agttttcaaa gcaccacgtg ccttggtact 1080 ttctccttat tacatgtccc atcttccagg gaggtttcct tccacccact gtagcagcgg 1140 attctggtgt ctcgggtgtt cccaggcttc cgtcagcccc tggggggttt acagttctgc 1200 agcagtctcc 
atcctaatta atctgcataa gaacatccgt ccgcagggat gtcctgtggc 1260 cctgtggcca cgtctgacat gcagctaaac actggacgtg tcacctggcg acagcggatg 1320 atgaaactgt gggcagggta gcacctctgg gcgccaacac tcttaacatg tttttttttt 1380 acaagaccat tctttaatgg gtgaaccctg aaaggaaacg ctaaagattg tttttaaata 1440 tttttatata gagacaaatc tttatcaaac attcactcaa tgctaaatgt cacttctcca 1500 gccaacccct ttaaaaaata cctaaaacta agactggaat gacaccttca aacacatact 1560 ctaataatgc attcccggga gcagagagag cttagcagct taatgttttc atgatataca 1620 agcatgaagt ttggcgaagg cattttcata tgtttacctg aatattttaa taaaatcaaa 1680 agatcccttt cactttaata ctctacttag aatgtaattt tccattaaat aaatagtcaa 1740 tgttctgtaa cccacatgct gataaacttc aaaactttag aaagtattaa gcgtaatcaa 1800 agattacttt taatttttga gctgggaaga gcctgtgcca gggaacagag ttgagattga 1860 agagataaaa gcacgattat tttatgaaag aaaagcgaaa ttcataaaag agaccatgtg 1920 attctataaa gcaacacaga atccgttttc agaagaggca aacgtggacc acaaaacgcc 1980 agtgttgcct gtgaagagag aaacagttaa aaggaagaat tcgcggtggc ccagtatagg 2040 tcaaggtgtg ggaggacagg gaggcgatga aatgatcctc cttctaa ctt ccc aga 2096                                                     Leu Pro Arg                                                       1 aag gta caa gcg gcc agc act gta act gga aac cac gcg ggc acc agc 2144 Lys Val Gln Ala Ala Ser Thr Val Thr Gly Asn His Ala Gly Thr Ser       5 10 15 gtg aac tca gac ccg aca gcc agc acc gta act gga gac cat gcg ggc 2192 Val Asn Ser Asp Pro Thr Ala Ser Thr Val Thr Gly Asp His Ala Gly  20 25 30 35 acc ggt gtg aac cca gac tcc acg gcc agc acc gta act gga gac cat 2240 Thr Gly Val Asn Pro Asp Ser Thr Ala Ser Thr Val Thr Gly Asp His                  40 45 50 gcg ggc acc ggt gtg aac tca gac ccg acg gcc agc acc gta act gga 2288 Ala Gly Thr Gly Val Asn Ser Asp Pro Thr Ala Ser Thr Val Thr Gly              55 60 65 gac cac gcg ggc acc ggt gtg aac cca gac tcc acg gcc agc acc gta 2336 Asp His Ala Gly Thr Gly Val Asn Pro Asp Ser Thr Ala Ser Thr Val          70 75 80 act gga gac cac gcg ggc acc ggt gtg aac cca gac tcc acg gcc agc 2384 
Thr Gly Asp His Ala Gly Thr Gly Val Asn Pro Asp Ser Thr Ala Ser      85 90 95 acc gta act gga gac cac gcg ggc acc ggt gtg aac tca gac ccg acg 2432 Thr Val Thr Gly Asp His Ala Gly Thr Gly Val Asn Ser Asp Pro Thr 100 105 110 115 gcc agc acc gta act gga gac cac gcg ggc acc ggt gtg aac cca gac 2480 Ala Ser Thr Val Thr Gly Asp His Ala Gly Thr Gly Val Asn Pro Asp                 120 125 130 tcc acg gcc agc acc gta act gga gac cac gcg ggc acc ggt gtg aac 2528 Ser Thr Ala Ser Thr Val Thr Gly Asp His Ala Gly Thr Gly Val Asn             135 140 145 cca gac tcc acg gcc agc acc gta act gga gac cac gcg ggc acc ggt 2576 Pro Asp Ser Thr Ala Ser Thr Val Thr Gly Asp His Ala Gly Thr Gly         150 155 160 gtg aac cca gac ccg acg gcc agc acc gta act gga gac cac gcg ggc 2624 Val Asn Pro Asp Pro Thr Ala Ser Thr Val Thr Gly Asp His Ala Gly     165 170 175 acc ggt gtg aac cca gac tcc acg gcc agc acc gta act gga gac cat 2672 Thr Gly Val Asn Pro Asp Ser Thr Ala Ser Thr Val Thr Gly Asp His 180 185 190 195 gcg ggc acc ggt gtg aac cca gac tcc acg gcc agc acc gta act gga 2720 Ala Gly Thr Gly Val Asn Pro Asp Ser Thr Ala Ser Thr Val Thr Gly                 200 205 210 gac cat gcg ggc acc ggt gtg aac cca gac tcc acg gcc agc acc gta 2768 Asp His Ala Gly Thr Gly Val Asn Pro Asp Ser Thr Ala Ser Thr Val             215 220 225 act gga gac cac aca ggc acc ggt gtg aac tca gac gca acg gca ccc 2816 Thr Gly Asp His Thr Gly Thr Gly Val Asn Ser Asp Ala Thr Ala Pro         230 235 240 tcc tgg gct gct cca gcc acc acc aga acc cgt tct gca gct gaa ctc 2864 Ser Trp Ala Ala Pro Ala Thr Thr Arg Thr Arg Ser Ala Ala Glu Leu     245 250 255 gaa gct gat tct gga agg aac tgaggaggat gaggacgaga tccttctacc 2915 Glu Ala Asp Ser Gly Arg Asn 260 265 tcaaccggga tcaatttttt aatgatttct ttcttttaga aaaatgttaa taagatactt 2975 tgttctaaaa tgaccaggat ttcagcagct tctacatttc ttcatgtcaa tcaaagctta 3035 ggcacgtggg ttacttcact gttctctgca ccctttccaa ataaaacccg gggccgcctg 3095 tgtacccaag agcctgggca ctgagcagag agagaaaacg tccagggcaa cagcgttcac 
3155 attccggcgg tgccatagag cagacggggg gagtccgagg cggtgccgca gagcggacgc 3215 ggggagtccg aggcggtgct gcagagcgga cgcggggagt ccgaggtggt gccgtagagt 3275 ggatggggag agttcgagcc acaggtggcc acgcctcccc tgcctccggc cggtgcccac 3335 tgctcacgca ttcctctctc ggcccatcct gagccagggg gccctcgcgt cacccaaggc 3395 cactgcagca cctccaggaa caaaccaccc acagatggac tgctctccat gcaaggtgca 3455 aagattgttt ttaaaactgc tcggctctgg aacttatccc agtaacaggc tagattaaga 3515 aaactgacta aaaacagtag gaggaaaaac tccctccacc cattgaagca caagaaatgc 3575 aaacagcctc tgaaataact gcagacgagt tttcctgggt taggtacgcg gtggggtgac 3635 cctaccccat agcggggccc cggcacacag tgggaactga cgaacatctc tgaataaatg 3695 cgggaaaatg cactcaactc ccaaacagag acagaaactg ctttttttcc ttaaaagctt 3755 ctctctttta ttctcaaaaa gactctcttt ggtggcttca ctgctgaagg gacaagcctg 3815 ttgtccccct ccagcaggct ggccgccttc cagaaagcag tcctggacag tctccagatc 3875 ctatggccct cacatccacc ttgcctagtg atcccgcatt atgaagccta atcatcagaa 3935 agagcctccc agtccacagc tcaggagtgg cgttttattc agggaaacac taataaaaac 3995 ctgggggtca gcgtcaacgc tgtcccctag atactgaggc tccacacaga tggcacacag 4055 acaggcctca ccccttcctg gtgccccaag tctctcctaa cattttatta gctgagtttt 4115 ctggcggttt gttacttttc acctgcccgt ccaaggcagc tacactaaac tacttgtcca 4175 gggcagatcc actcgtaatg accatcagct cagagttgga agtagcgctg gagaatgagg 4235 tggtttcagg ttgctatgct tccctgccct ctcttccagg gatgccaagt aagtttctgc 4295 catggctgat gatgtcagta ctacactgaa attagttcaa cggcaggcca ccaattagta 4355 cttactgaac aacgtcctta agagtgaaac cctgtctcta ctaaaaatac aaaaatttag 4415 ctgggtgtgt tggtgagcac ctataatctc agctactggg gggctgaggc aggagaattg 4475 tttgaaccca ggaggcagag attgcagtga gccaagattg caccattgca ctccagcctg 4535 ggcaaaaaga acgagactcc atctcaaaaa taaaaaaaaa agaaaaagaa aaagaaaaag 4595 aaaaaaatga ttctcacgac actgtcaaca tcttaagttc ttccggaggg tagcgtaaaa 4655 ttatcacaca caacgtaaag ccaattagat caacattctg tcctcaccga tccataccta 4715 aatccttatt tctaattggc ccccaatttt tcagaattta agaagactga gaggaataac 4775 cacgaacaag caagatttac gcagacgtca gtcttcccaa ctctcgctca ggaactcacc 4835 
ccaaactcct cctgtggaat cagcctggat aacagtccct ggtggctaca caagaacaga 4895 gagaggattc cacgaaccca tgacacaccc caccctccaa tccaaggcag aggagagagg 4955 cggacagtca caaaaaccag aagagcaggg aactgtgatt cttctgattt aatgtaggtg 5015 attgggccaa aaaacaaggc aaatccacct gtgaatggag tgggctccca gctggctgtg 5075 gatatgaagg aactcacttt cctgcacact gaagtccatc agcgtctcca catcggccgg 5135 cacctgggct gtgatcgccc ctcacagact gggataacag tggcctcatc tggacaggtt 5195 gtcaaggatg gatggataga cgtcctggac agacactgta cagagaccag ggactgggca 5255 gggggaggtg gcacagcctt agggccagca ggggcgaggc cgggaagccg ggtctctggg 5315 gcgttgaggg cccatggggg cctgggcgag ggcctgcgtc tcctcatcca acagaagcag 5375 aagcggccgc 5385 <210> 19 <211> 4017 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (749) .. (3361) <400> 19 cggacgcgct ggaggccggt gggcggtcgc cgtagtcgcc ctcccaccac ctcgctcagg 60 tttctccggc ctggggcaag aggcctgtgg cccgcgcggg aacgcactgt ccaccctcag 120 acctgggact gggccggcgc gcggacgcta ccaagaggct gcggctcccg cccccgcggc 180 cggacgtggc gcctcccctg aggccgcggc cggagcctgg aggtggggtc ggagtcagag 240 cccggggctc tgatgtcacc gcgcggctgc gacggcccag gagcgcgtct cggcgggagc 300 ctcagagcct cactttgtcg ccaggctgga gtgcagtggc gcgatctcgg ctcactacaa 360 cttccgcctc ccgggttcaa gcgattcttc tgcctcagcc tcccgagtag ctgggactac 420 aggcgtgtgc caccacgccc agctaatttt tgggaaagat cacatggaga agccaacaga 480 acggccagga agcagcctgg gtggaacagg gttggggagg aaaggaacat gggtacattc 540 tcagcgttcc aggaggtctc agggtaaagg ctgcttgtgt gggcaagggg agcctggcag 600 ggaatgagct ggtcccaatt ccttcccctg agctgggccc tcaaccttca gaggtgctcc 660 ctgggactgg tcctacttcc tctgccacct ctgatcccaa ggtgactctg ccaaaatctg 720 ggggatgcca gcgcttgtga cggtgtga cca ggg agg gga agt gac cca cgc 772                                Pro Gly Arg Gly Ser Asp Pro Arg                                  1 5 agc cac cca gcc tca gca ctc atc tgc gca gcc atg gag gcc ctg gga 820 Ser His Pro Ala Ser Ala Leu Ile Cys Ala Ala Met Glu Ala Leu Gly      10 15 20 cct ggg ggc gac cgc gcc tcc ccg gcc tcg tcc act agc agc ctg gac 868 Pro Gly Gly Asp Arg Ala Ser 
Pro Ala Ser Ser Thr Ser Ser Leu Asp  25 30 35 40 ctg tgg cat ctg tcc atg cgc gcg gac tcg gcc tac agc tct ttc tcc 916 Leu Trp His Leu Ser Met Arg Ala Asp Ser Ala Tyr Ser Ser Phe Ser                  45 50 55 gca gcc tcc ggc ggc ccc gag ccg cgc acg cag tcg ccg ggg aca gac 964 Ala Ala Ser Gly Gly Pro Glu Pro Arg Thr Gln Ser Pro Gly Thr Asp              60 65 70 ctc ctt cct tac cta gac tgg gac tac gtg cgt gtg gtt tgg ggc ggc 1012 Leu Leu Pro Tyr Leu Asp Trp Asp Tyr Val Arg Val Val Trp Gly Gly          75 80 85 ccg ggc ccc gcc ccg ccc gac gct gcc ctt tgc aca tcc ccg cgg ccc 1060 Pro Gly Pro Ala Pro Pro Asp Ala Ala Leu Cys Thr Ser Pro Arg Pro      90 95 100 cgg ccc gcg gtt gca gcc cgc agt ggg ccg cag cca aca gag gtc ccg 1108 Arg Pro Ala Val Ala Ala Arg Ser Gly Pro Gln Pro Thr Glu Val Pro 105 110 115 120 ggg acc ccg gga cca ctg aac agg cag gcc acc ccg ctg ctg tac gcg 1156 Gly Thr Pro Gly Pro Leu Asn Arg Gln Ala Thr Pro Leu Leu Tyr Ala                 125 130 135 ctg gcg gcc gag gcg gag gcc gcg gcg cag gct gcc gag ccg ccc agc 1204 Leu Ala Ala Glu Ala Glu Ala Ala Ala Gln Ala Ala Glu Pro Pro Ser             140 145 150 ccg ccg gcc tcg agg gcc gcc tac cgc cag cgg ctt cag ggc gcg cag 1252 Pro Pro Ala Ser Arg Ala Ala Tyr Arg Gln Arg Leu Gln Gly Ala Gln         155 160 165 cgg cga gtg ctc cgg gag acg tcg ttc cag cgc aag gag ctc cgc atg 1300 Arg Arg Val Leu Arg Glu Thr Ser Phe Gln Arg Lys Glu Leu Arg Met     170 175 180 agc ctg ccc gcc cgt ctg cgg ccc act gtc cca gcg cgg ccc ccg gcg 1348 Ser Leu Pro Ala Arg Leu Arg Pro Thr Val Pro Ala Arg Pro Pro Ala 185 190 195 200 act cac ccg cgc tcc gcc tcg ctc agc cac ccg ggc ggg gag ggg gag 1396 Thr His Pro Arg Ser Ala Ser Leu Ser His Pro Gly Gly Glu Gly Glu                 205 210 215 ccg gcg cgc tcc cgg gct ccc gcg cca gga act gcc ggc cgg ggt ccc 1444 Pro Ala Arg Ser Arg Ala Pro Ala Pro Gly Thr Ala Gly Arg Gly Pro             220 225 230 ctc gcc aac cag cag cgg aag tgg tgc ttc tca gag cca gga aag ctg 1492 Leu Ala Asn Gln Gln Arg Lys Trp 
Cys Phe Ser Glu Pro Gly Lys Leu         235 240 245 gat cgt gtg ggt cgg ggc ggt ggg ccg gcg cgg gaa tgc ctg ggt gag 1540 Asp Arg Val Gly Arg Gly Gly Gly Pro Ala Arg Glu Cys Leu Gly Glu     250 255 260 gcc tgc tcc agc tct ggc ctc cct ggg ccc gag ccc ttg gag ttc cag 1588 Ala Cys Ser Ser Ser Gly Leu Pro Gly Pro Glu Pro Leu Glu Phe Gln 265 270 275 280 cat ccg gcg ctg gct aag ttt gaa gat cac gag gtc gga tgg ctg ccc 1636 His Pro Ala Leu Ala Lys Phe Glu Asp His Glu Val Gly Trp Leu Pro                 285 290 295 gag acg caa ccc caa ggc tcc atg aac ctg gac tcc ggg tcc ttg aag 1684 Glu Thr Gln Pro Gln Gly Ser Met Asn Leu Asp Ser Gly Ser Leu Lys             300 305 310 ctc ggt gat gcc ttc agg ccc gcc agt cgg agt cgg agc gct tca ggc 1732 Leu Gly Asp Ala Phe Arg Pro Ala Ser Arg Ser Arg Ser Ala Ser Gly         315 320 325 gaa gtc ttg ggt tcc tgg gga gga tca gga ggg acc ata ccc att gtc 1780 Glu Val Leu Gly Ser Trp Gly Gly Ser Gly Gly Thr Ile Pro Ile Val     330 335 340 cag gct gtt ccc caa gga gca gaa acc ccc aga cca ttg ttt cag acc 1828 Gln Ala Val Pro Gln Gly Ala Glu Thr Pro Arg Pro Leu Phe Gln Thr 345 350 355 360 aaa ctt tcc agg ttc ttg cct cag aaa gag gct gcg gtg atg tat cct 1876 Lys Leu Ser Arg Phe Leu Pro Gln Lys Glu Ala Ala Val Met Tyr Pro                 365 370 375 gca gag tta ccc cag agc agc cct gct gac agt gaa cag agg gtc tca 1924 Ala Glu Leu Pro Gln Ser Ser Pro Ala Asp Ser Glu Gln Arg Val Ser             380 385 390 gag acc tgc att gtg cct gcc tgg ctc ccc tcc ctt cct gat gaa gtg 1972 Glu Thr Cys Ile Val Pro Ala Trp Leu Pro Ser Leu Pro Asp Glu Val         395 400 405 ttc cta gaa gag gcc cca ctg gtc aga atg aga tca cca cca gac ccc 2020 Phe Leu Glu Glu Ala Pro Leu Val Arg Met Arg Ser Pro Pro Asp Pro     410 415 420 cat gcc tcc cag ggg ccc cca gcc agt gtc cat gcc tct gac cag ccg 2068 His Ala Ser Gln Gly Pro Pro Ala Ser Val His Ala Ser Asp Gln Pro 425 430 435 440 tat gga act ggc tta ggc caa aga act ggc cag gtt aca gtc ccc aca 2116 Tyr Gly Thr Gly Leu Gly Gln Arg Thr Gly 
Gln Val Thr Val Pro Thr                 445 450 455 gag tac ccg ctc cat gag tgt cca gga act gca ggg gca gat gac tgc 2164 Glu Tyr Pro Leu His Glu Cys Pro Gly Thr Ala Gly Ala Asp Asp Cys             460 465 470 tgg cag ggg gtg aat ggt tct gta ggt att tcc agg ccc aca agc cac 2212 Trp Gln Gly Val Asn Gly Ser Val Gly Ile Ser Arg Pro Thr Ser His         475 480 485 acc ccc act ggg act gca aat gat aac atc cca act att gac ccc act 2260 Thr Pro Thr Gly Thr Ala Asn Asp Asn Ile Pro Thr Ile Asp Pro Thr     490 495 500 gga ctg acc acc aat ccc ccc aca gct gca gag agt gac ctc ctc aaa 2308 Gly Leu Thr Thr Asn Pro Pro Thr Ala Ala Glu Ser Asp Leu Leu Lys 505 510 515 520 cct gtc cca gct gat gcc ttg gga ctt tca ggc aat gat act cca ggt 2356 Pro Val Pro Ala Asp Ala Leu Gly Leu Ser Gly Asn Asp Thr Pro Gly                 525 530 535 ccc tct cac aat act gcc cta gcc agg ggc act ggc cag cct ggt tcc 2404 Pro Ser His Asn Thr Ala Leu Ala Arg Gly Thr Gly Gln Pro Gly Ser             540 545 550 agg ccc aca tgg cct agt cag tgc ctc gag gag ctg gtt cag gag ctg 2452 Arg Pro Thr Trp Pro Ser Gln Cys Leu Glu Glu Leu Val Gln Glu Leu         555 560 565 gcc aga tta gat ccc tct cta tgt gac cct ctt gct tcc cag ccc agc 2500 Ala Arg Leu Asp Pro Ser Leu Cys Asp Pro Leu Ala Ser Gln Pro Ser     570 575 580 cca gag cca ccc ctg ggc ctg ctg gat gga ctg att cct tta gca gag 2548 Pro Glu Pro Pro Leu Gly Leu Leu Asp Gly Leu Ile Pro Leu Ala Glu 585 590 595 600 gtc cgg gct gca atg cgg cct gcc tgt ggg gag gct gga gag gag gct 2596 Val Arg Ala Ala Met Arg Pro Ala Cys Gly Glu Ala Gly Glu Glu Ala                 605 610 615 gcc agt act ttt gag cca ggg tcc tat cag ttc agc ttc acc cag ctc 2644 Ala Ser Thr Phe Glu Pro Gly Ser Tyr Gln Phe Ser Phe Thr Gln Leu             620 625 630 ctg ccg gct cct cgg gag gag aca agg ctt gaa aac cct gcc acc cac 2692 Leu Pro Ala Pro Arg Glu Glu Thr Arg Leu Glu Asn Pro Ala Thr His         635 640 645 cct gtg ctt gac cag cca tgt ggg cag ggg ctc cct gca cca aac aac 2740 Pro Val Leu Asp Gln Pro Cys 
Gly Gln Gly Leu Pro Ala Pro Asn Asn     650 655 660 agc atc cag ggc aag aaa gtg gag ctg gcc gcc cgc ctc caa aag atg 2788 Ser Ile Gln Gly Lys Lys Val Glu Leu Ala Ala Arg Leu Gln Lys Met 665 670 675 680 ctt cag gac ctt cac acg gag cag gag cgg ctg cag ggg gag gca caa 2836 Leu Gln Asp Leu His Thr Glu Gln Glu Arg Leu Gln Gly Glu Ala Gln                 685 690 695 gcg tgg gcc agg cgc caa gcg gct ctg gag gct gca gtg cgc cag gcc 2884 Ala Trp Ala Arg Arg Gln Ala Ala Leu Glu Ala Ala Val Arg Gln Ala             700 705 710 tgt gcc cct cag gag ctg gag cgg ttc agc cgg ttc atg gcc gac cta 2932 Cys Ala Pro Gln Glu Leu Glu Arg Phe Ser Arg Phe Met Ala Asp Leu         715 720 725 gag cgc gtg ctt ggc ctt ctg ctg ctg ctg ggc agt cgc ctg gcg cgc 2980 Glu Arg Val Leu Gly Leu Leu Leu Leu Leu Gly Ser Arg Leu Ala Arg     730 735 740 gtg cgc cgc gcc ctg gcc cgg gcg gcc tca gac agc gac cct gat gag 3028 Val Arg Arg Ala Leu Ala Arg Ala Ala Ser Asp Ser Asp Pro Asp Glu 745 750 755 760 cag gcc tcc ctg ctg cag cga ctc cgg ctc ctg cag cgg cag gag gag 3076 Gln Ala Ser Leu Leu Gln Arg Leu Arg Leu Leu Gln Arg Gln Glu Glu                 765 770 775 gac gcc aag gag ctg aag gag cac gta gcg cgg cgc gag cgg gcc gtg 3124 Asp Ala Lys Glu Leu Lys Glu His Val Ala Arg Arg Glu Arg Ala Val             780 785 790 cgg gag gtg ctg gtg cga gca cta ccg gtg gag gag ctg cgc gtc tat 3172 Arg Glu Val Leu Val Arg Ala Leu Pro Val Glu Glu Leu Arg Val Tyr         795 800 805 tgc gcc ctg ctg gcg ggc aag gcc gcc gtc ctg gcc cag cag cgc aac 3220 Cys Ala Leu Leu Ala Gly Lys Ala Ala Val Leu Ala Gln Gln Arg Asn     810 815 820 ctg gac gag cgc atc cgc ctc ctt cag gac caa ctg gac gcc atc agg 3268 Leu Asp Glu Arg Ile Arg Leu Leu Gln Asp Gln Leu Asp Ala Ile Arg 825 830 835 840 gac gac ctt ggc cat cat gcc ccg tct ccc agc ccg gcg cgg ccc cca 3316 Asp Asp Leu Gly His His Ala Pro Ser Pro Ser Pro Ala Arg Pro Pro                 845 850 855 ggg acc tgt cct cca gtt cag ccg ccc ttc cct ctt ctc ctt aca 3361 Gly Thr Cys Pro Pro Val Gln Pro 
Pro Phe Pro Leu Leu Leu Thr             860 865 870 tagttatagg tggagagggt ggggtgcacc gcccctatct ctcacccacg taagtggggt 3421 gatgctgatt tagtctctgc ttttcccttg agggattggg gaggacccag tccaggcctt 3481 tctaagatac tcctccagcc gggcgcggtg gctcacgcct gtaatcccag cactttggga 3541 ggccgaggcg ggcggatcac aaggtcagga gatcgagacc atcctggcta acacggtgaa 3601 accccgtctc tactaaaaat ataaaaaatt agccgggcgt ggtggcgggt gcctgtattg 3661 ccagctactc ggaaggctga ggcaggagaa tggcgtgaac ctgggaggtg gagcttgcag 3721 tgagcggaca tcgcgccact gcactccagc ctgggcgaca gagcaagact ccgtctcaaa 3781 aaaaaaaaaa aaaaaaaaaa aaaaaaagat acttctccaa agtgttctca tgtggcctca 3841 cccaggtctt gtgtattatt tggtaattaa tttatggatc ttaaaaactg cagtattccc 3901 ccattttgtg atgagagtgt ggggctggca ggggttggtt ggagggagga gagaagacag 3961 aggagcactt aaggtgcaaa gcagcctatt ttttcttcaa taaaaattgt taagag 4017 <210> 20 <211> 5949 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (3111) .. (4112) <400> 20 gcggccgccc acatgatgcg caagacgagc tgcctgcagt atctggacgc ccgcaacacg 60 cccctgctgg accactcggc gcccttcgtg gcccgtgccc tgcgcatccg cagcagcctg 120 gcagtgctgc acttggagaa cgccagcctg tcggggcggc ccctcatgct gctcggtgag 180 ccccaagccc gggagggtga gcaggatgtg cagcctgttg gggggcaggg gcctgcccag 240 tcaccgtgcc ccctccccgt ccccagccac ggccctgaag atgaacatga acctgcggga 300 gctgtacctg gcggacaaca agctcaacgg cctgcaggac tcggcccagc tgggtaacct 360 gctcaagttc aactgctccc tgcagatcct ggacctccgg aacaaccacg tgctagactc 420 gggtgggtgc agtggcccac cccacccaca cccgtcaccc agcacccact ctgcccggcc 480 ctgcgctagg tggtgctggg gacacagaca tggccaagac caccccaacg ctgtcctgct 540 ggggctccca ggaggaagac agacccgccc tagacagtga caacctagag tgggcaggac 600 tgggctggtg gagcccagag gcggggtccg atcagtctgg gggccaggga gggcttccag 660 gaggaggaga cagctcagct gacagccgca ggatgaggcg aagtgaataa ttacaatctg 720 aaaagaaagc agtgttgctg ccataacagg gtcagctaga ggctgctttc agatgggaga 780 ggagggaaga ccaccgtggg gcggggcctc tgggcttcca tccggaggct gggagcagaa 840 ccagccttgt gagaagcctg ggaaagggtc cagaccgcgg gagcagtgag tgcagtggga 900 ggagctctgg tgtttgatgc 
tgggggttga gcagagaatg gggagccgtc aaaggaggcg 960 ttcctgttgc atcctggagg ggtggggtgg gtcactgtgt ccccaggtgg caggcccaga 1020 ccccactcta ctgagcacct gccaagggcc atggctgtcg gcccctgggc tcagctcgga 1080 gggagacaca gggctctgcc cttgggagtt tgtactccag tctagagaga caacgataag 1140 atgagtgaaa tctccagcct gtcagatggt gggatctgca ttggagggag acagtggtga 1200 ctgtgggtgg gagtgagctg gctgtgaggt gaggtgggcc agaggtggtc acatttgagc 1260 aaagatcaga agcatctggt ggaggctgtc atccaggcac ccaggcccag gggtcacaga 1320 tgtgtgccca ggggtgccag gccgaggcag ggctgcaggg caaggcctgg gactgcgggg 1380 cctccatgga ggtcctaagc agagggggtt gctcctaccc tgcagtcccc tccccaggaa 1440 gcacccggac cccagacagg gctctgactg ccggcctcct ccaggtctgg cctacatctg 1500 cgagggcctc aaggagcaga ggaaggggct ggtgaccctg gtgctgtgga acaaccagct 1560 cacgcacaca ggcatggcct tcctgggcat gacactggtg agtcaggctg gcagggaagg 1620 gaggcacctc ggtccccgct gccacctccc actccagctc tcagcacagt tgcctctagc 1680 tgacggctcc tgtcctgccc atcctgcagg ttcaagacga tttaaactca gccccacctg 1740 cttcctcatt cagcttctgt gcctgttttc cctgaggagg agcccttctt tcattatctc 1800 cttttttttt ttgagaccga gtctcatcct gtcacccagg ctggagtgca gtggcgccat 1860 ctaggctcac tgcagccact ccagcccccg cctcctgggt tcaagccatt cttctgcctc 1920 agcctcccaa gtagctggga ctacaggtgc ccgccacccc gcctagccaa tttttttatt 1980 ttttattttt attttttttg agacggagcc tcaatcgccc aggctggagt gcagtggctc 2040 gatctccgct cactgcaagc cccacctcct gggttcacac cattgtcctg cctcagcctc 2100 ccgagtagtt ggaactacag gcacccacca ccacacccag ctaatttttt tgtatttttt 2160 tagtagagag gggatttcac catgttagcc aggatggtcg caatctgacc ttgtgatcca 2220 ccctcctcgg cctcccaaag tgctgggatt acacgcgtga gccaccgcgc ccggcctaat 2280 ttttgtattt ttagtacaga cggggtttca ccatgttggc caggccggtc ttgaactcct 2340 gacctcgagt gatccccccc acctcggcct ccgaaagtat tgggattatg agtgagtcac 2400 cgtgcctggt tttcagtttt tatgtggtgc ttaaagcaag ccactcaggt acagagactc 2460 tcagggcgag cactcgtgtg ctcaaacccc gggataagag ttgccctcct tgggggcttc 2520 agggcgctca aggtccaggc cctggccagg ggcccaggct gtggggggta gggactgact 2580 gggtgcctgg ggagctggga gtggacgggg 
gacctggcat ttggggagac ttcaggcaca 2640 ggagggactt caggccaggc ctgcggcagg gcaggacggc gccggtgttc acgcaggcgc 2700 gcgaaagaaa aggagcgccc gggcctggct ctcttcccgg cttctttcct gacttggcct 2760 cctgggtctc ctccgccatc acggcctcct cctcaccctc acaccccctc cctccagccg 2820 cacactcaga gcctggagac gctgaacctg ggccacaacc ccatcgggaa cgagggtgtg 2880 cggcacctca agaacgggct catcagcaac cgcagcgtgc tgcgcttcgg gctggcctcc 2940 accaagctca cgtgcgaggg cgcggtggcg gtggcggagt tcatcgctga gagcccccgc 3000 ctcctgagac tggaccttcg ggagaacgag atcaagacag gcgggctcat ggcactgtcg 3060 ttggccctca aggtgaacca ctcactgctg cgcctggacc tcgaccgtga acc caa 3116                                                        Thr Gln                                                          1 gaa aga ggc ggt gag cag ggg acg gtc ctg cag ccc tgg ggc ggg cgg 3164 Glu Arg Gly Gly Glu Gln Gly Thr Val Leu Gln Pro Trp Gly Gly Arg           5 10 15 aag gcc ggg tgg tgg ggc cgg cct gag agc cct agc cag gcg ctc ccg 3212 Lys Ala Gly Trp Trp Gly Arg Pro Glu Ser Pro Ser Gln Ala Leu Pro      20 25 30 cca cag gtg aag agc ttc atc gag acg cag aag gcg ctg ctg gcc gag 3260 Pro Gln Val Lys Ser Phe Ile Glu Thr Gln Lys Ala Leu Leu Ala Glu  35 40 45 50 atc cag aac ggc tgc aag cgc aac ttg gtg ctg gcg cgg gag agg gag 3308 Ile Gln Asn Gly Cys Lys Arg Asn Leu Val Leu Ala Arg Glu Arg Glu                  55 60 65 gag aag gag cag ccg cca cag ctg tcg gcc tcc atg cct gag acc acc 3356 Glu Lys Glu Gln Pro Pro Gln Leu Ser Ala Ser Met Pro Glu Thr Thr              70 75 80 gcc acc gag ccc cag ccc gac gac gag ccc gcc gct ggg gtg cag aac 3404 Ala Thr Glu Pro Gln Pro Asp Asp Glu Pro Ala Ala Gly Val Gln Asn          85 90 95 ggg gcc ccc agc ccc gca ccc agc ccg gac tca gac tca gac tcg gac 3452 Gly Ala Pro Ser Pro Ala Pro Ser Pro Asp Ser Asp Ser Asp Ser Asp     100 105 110 tcg gat ggg gag gaa gag gag gaa gag gaa ggg gag agg gac gag acc 3500 Ser Asp Gly Glu Glu Glu Glu Glu Glu Glu Gly Glu Arg Asp Glu Thr 115 120 125 130 ccc tgt cct gcc ctg gtg ccc ccc acg gac tcc ctg ggc cct ggg gac 
3548 Pro Cys Pro Ala Leu Val Pro Pro Thr Asp Ser Leu Gly Pro Gly Asp                 135 140 145 agg agt ccc cca ggc agc ccc tcc aca ccc acc gag cag cgg att tcc 3596 Arg Ser Pro Pro Gly Ser Pro Ser Thr Pro Thr Glu Gln Arg Ile Ser             150 155 160 gtg tcc agc ccg ggc cgg ggc cac aag gtg ttt gtg gtg acc cgg gtg 3644 Val Ser Ser Pro Gly Arg Gly His Lys Val Phe Val Val Thr Arg Val         165 170 175 gag agc ccg ccc gag agg gca gag ccc cct gcg tcc ccc acc cct ccc 3692 Glu Ser Pro Pro Glu Arg Ala Glu Pro Pro Ala Ser Pro Thr Pro Pro     180 185 190 tct ccc cca ccc cct ccc tcc cca ccc gcc tca cct tcc cta cca cca 3740 Ser Pro Pro Pro Pro Pro Ser Pro Pro Ala Ser Pro Ser Leu Pro Pro 195 200 205 210 gcc ggg gcc att gac acc cgg gac aca ggg tcc tct gag cct cag cca 3788 Ala Gly Ala Ile Asp Thr Arg Asp Thr Gly Ser Ser Glu Pro Gln Pro                 215 220 225 cca ccg gag ccg cct cgg tca ggg cca cca ctg ccc aac ggc ctg aag 3836 Pro Pro Glu Pro Pro Arg Ser Gly Pro Pro Leu Pro Asn Gly Leu Lys             230 235 240 ccc gag ttc gcc ctg gca ctg ccc cct gag ccg ccc ccg ggg cct gag 3884 Pro Glu Phe Ala Leu Ala Leu Pro Pro Glu Pro Pro Pro Gly Pro Glu         245 250 255 gtc aag ggg ggc agc tgc ggc ctg gag cac ggt gag agg ggc cct agg 3932 Val Lys Gly Gly Ser Cys Gly Leu Glu His Gly Glu Arg Gly Pro Arg     260 265 270 gca ggt gtt gag ggg ccc tgg gtg ctg tat gtg acc cca ggc aag ccc 3980 Ala Gly Val Glu Gly Pro Trp Val Leu Tyr Val Thr Pro Gly Lys Pro 275 280 285 290 ctg cct gtt gtg ggc ctc agt ttc ccc ctc ttt aaa atg gtt ctc gca 4028 Leu Pro Val Val Gly Leu Ser Phe Pro Leu Phe Lys Met Val Leu Ala                 295 300 305 ggg gcc agc aaa gtg ggg ctt agt tct cat ctc cac cct gct tcc ctt 4076 Gly Ala Ser Lys Val Gly Leu Ser Ser His Leu His Pro Ala Ser Leu             310 315 320 ggg tcc tgg gag ctc ttc ctg ggg tgg ggg ggc atg taagtaaggt 4122 Gly Ser Trp Glu Leu Phe Leu Gly Trp Gly Gly Met         325 330 gtgctgcctt gagcctgacc atctcagcgg tctctgggca ctctgcaggc cctgagggtc 4182 
tggctggggc cgggctgggg acaggttgta tttgccccac ccgcctgacg ggggtgctgg 4242 cccgtcctcc cacacagaac tgagctgctc caagaacgag aaggagctcg aggagctgct 4302 tctggaagcc agtcaggaat ccgggcagga gacactgtga cactttaggt gaggccaggc 4362 ccggggccca cagcactcgg gaggagctga gagagcctct ggctctgaca gtctctcccc 4422 caatctctcc tccccaagtt ccctttttcc ggtcggtctg cgatgagctg aggccagagc 4482 catgagaatc tgctcacctt ccccccagcc ttcctgaggc ccaggatgcc aggggtgggg 4542 gccattctgg ggcccccctc cccccacagc aacactacaa ggggtgcagg agctacaggg 4602 agtggccctc cgcgcgtgac tcaagcactt ctatttatga gcccagcact ggaagactct 4662 gggggtgaat gggaggaggg ggagcaggag gaggaggagg tctccaagga catcaggcgc 4722 ctgttctgga ggggccaggc ttgccctgcg gagggcaggc gtcctgggtg gtggtgggat 4782 ggtcccctgt ggccccgggc acagggccgg gcaggcagcc tggtgccgga gaggcggtgc 4842 gtgctggtgg tggttgagat gcgcagaaca gccccagaca gcgcaggccg ggcagggtgg 4902 ggggatggga gcagaggatc agagctttct ttttctcaag tgcaataaat ctatcaggga 4962 gctggggcgg gagcagccgg cactccggga ccctgctgtc caggccactg gaggctgcgc 5022 cctgagaggc actacagccc tttgggggcg agtggcatgg gtgtgggggtga gggtgggcag 5082 agggctgggg ctactcctgt cggtgcaact ctgttcacac cttttctaat aaactggggc 5142 tgggttcact ttgcccaacg tctgcttcgt gcggatcctg gaggtggttg ctgggggtgg 5202 gctgggtgga ggccatcaca gctcctggtt acagagcatc tgccagggca ctggtcaggc 5262 attactgcgt cttcccccac agtcccgtct attctgcgtc catgctctgg tgtctcgtac 5322 ggtcacctcc cagccaggcg agtttacacc catcctttct gcagggagag acaggcccca 5382 gatacctcct ggcttcaggt ctagggtctc aggatgctgg atatggagtg gctctgccca 5442 gtcgcccatc gctcaacagg aacagatccc agcctgtccc tggcagcccc aggtgttgtg 5502 caggtgggag tatgccccgg gcccagccag cactgcagga gcctggtggg cggcttgttc 5562 ccctttgcca gcaggggcct ggggggccat gggggtgctg ttgcggattt ctgagaggtc 5622 tgtttttccc tttaggaaag gggggagcag gccgggcatg gtggcccaca cctgtaatcc 5682 cagcactttg ggaggctgag gtgggaggat tgcttgaacc caggaatttg acaccagcct 5742 gggcaacatg gtgaaacccc atctccatag aaaatacaaa aattagctgg gtgtggtggc 5802 gtgtgcctgt agtcccagct actgaggagg ctgagttggg aggatcgctt gaacccagga 5862 
ggttgaggct gcagtgaggt gtgattgcac cactgcactt cagcctgggc aacagagtga 5922 gatcctgtct caaaaaaaaa aaaaaag 5949 <210> 21 <211> 4299 <212> DNA <213> Homo sapiens <220> <221> CDS <222> (2) .. (1696) <400> 21 a tgg cag agc ccg gct gag cct tgt gtt cta cgc cag tgc cag gag ggt 49   Trp Gln Ser Pro Ala Glu Pro Cys Val Leu Arg Gln Cys Gln Glu Gly     1 5 10 15 gtt gtc aca gag tct ggg gtg cgc tgt gtt gtt cat tgt aaa aac cct 97 Val Val Thr Glu Ser Gly Val Arg Cys Val Val His Cys Lys Asn Pro              20 25 30 ttg gag cat ctg gga atg tgc tgc ccc aca tgt cca ggc tgt gtg ttt 145 Leu Glu His Leu Gly Met Cys Cys Pro Thr Cys Pro Gly Cys Val Phe          35 40 45 gag ggt gtg cag tat caa gaa ggg gag gaa ttt cag cca gaa gga agc 193 Glu Gly Val Gln Tyr Gln Glu Gly Glu Glu Phe Gln Pro Glu Gly Ser      50 55 60 aaa tgt acc aag tgt tcc tgc act gga ggc agg aca caa tgt gtg aga 241 Lys Cys Thr Lys Cys Ser Cys Thr Gly Gly Arg Thr Gln Cys Val Arg  65 70 75 80 gaa gtc tgt ccc att ctc tcc tgt ccc cag cac ctt agt cac ata ccc 289 Glu Val Cys Pro Ile Leu Ser Cys Pro Gln His Leu Ser His Ile Pro                  85 90 95 cca gga cag tgc tgc ccc aaa tgt ttg ggt cag agg aaa gtg ttt gac 337 Pro Gly Gln Cys Cys Pro Lys Cys Leu Gly Gln Arg Lys Val Phe Asp             100 105 110 ctc cct ttt ggg agc tgt ctc ttt cga agt gat gtt tat gac aat gga 385 Leu Pro Phe Gly Ser Cys Leu Phe Arg Ser Asp Val Tyr Asp Asn Gly         115 120 125 tcc tca ttt ctg tac gat aac tgc aca gct tgt acc tgc agg gac tct 433 Ser Ser Phe Leu Tyr Asp Asn Cys Thr Ala Cys Thr Cys Arg Asp Ser     130 135 140 act gtg gtt tgc aag agg aag tgc tcc cac cct ggt ggc tgt gac caa 481 Thr Val Val Cys Lys Arg Lys Cys Ser His Pro Gly Gly Cys Asp Gln 145 150 155 160 ggc cag gag ggc tgt tgt gaa gag tgc ctc cta cga gtg ccc cca gaa 529 Gly Gln Glu Gly Cys Cys Glu Glu Cys Leu Leu Arg Val Pro Pro Glu                 165 170 175 gac atc aaa gta tgc aaa ttt ggc aac aag att ttc cag gat gga gag 577 Asp Ile Lys Val Cys Lys Phe Gly Asn Lys Ile Phe Gln Asp Gly Glu    
         180 185 190 atg tgg tcc tct atc aat tgt acc atc tgt gct tgt gtg aaa ggc agg 625 Met Trp Ser Ser Ile Asn Cys Thr Ile Cys Ala Cys Val Lys Gly Arg         195 200 205 acg gag tgt cgc aat aag cag tgc att ccc atc agt agc tgc cca cag 673 Thr Glu Cys Arg Asn Lys Gln Cys Ile Pro Ile Ser Ser Cys Pro Gln     210 215 220 ggc aaa att ctc aac aga aaa gga tgc tgt cct att tgc act gaa aag 721 Gly Lys Ile Leu Asn Arg Lys Gly Cys Cys Pro Ile Cys Thr Glu Lys 225 230 235 240 ccc ggc gtt tgc acg gtg ttt gga gat ccc cac tac aac act ttt gac 769 Pro Gly Val Cys Thr Val Phe Gly Asp Pro His Tyr Asn Thr Phe Asp                 245 250 255 ggt cgg aca ttt aac ttt cag ggg acg tgt cag tac gtt ttg aca aaa 817 Gly Arg Thr Phe Asn Phe Gln Gly Thr Cys Gln Tyr Val Leu Thr Lys             260 265 270 gac tgc tcc tcc cct gcc tcg ccc ttc cag gtg ctg gtg aag aac gac 865 Asp Cys Ser Ser Pro Ala Ser Pro Phe Gln Val Leu Val Lys Asn Asp         275 280 285 gcc cgc cgg aca cgc tcc ttc tcg tgg acc aag tcg gtg gag ctg gtg 913 Ala Arg Arg Thr Arg Ser Phe Ser Trp Thr Lys Ser Val Glu Leu Val     290 295 300 ctg ggc gag agc agg gtc agc ctg cag cag cac ctc acc gtg cgc tgg 961 Leu Gly Glu Ser Arg Val Ser Leu Gln Gln His Leu Thr Val Arg Trp 305 310 315 320 aac ggc tcg cgc atc gcg ctc ccc tgc cgc gcg cca cac ttc cac atc 1009 Asn Gly Ser Arg Ile Ala Leu Pro Cys Arg Ala Pro His Phe His Ile                 325 330 335 gac ctg gat ggc tac ctc ttg aaa gtg acc acc aaa gca ggt ttg gaa 1057 Asp Leu Asp Gly Tyr Leu Leu Lys Val Thr Thr Lys Ala Gly Leu Glu             340 345 350 ata tct tgg gat gga gac agt ttt gta gaa gtc atg gct gcg ccg cat 1105 Ile Ser Trp Asp Gly Asp Ser Phe Val Glu Val Met Ala Ala Pro His         355 360 365 ctc aag ggc aag ctc tgt ggt ctt tgt ggc aac tac aat gga cat aaa 1153 Leu Lys Gly Lys Leu Cys Gly Leu Cys Gly Asn Tyr Asn Gly His Lys     370 375 380 cgt gat gac tta att ggt gga gat gga aac ttc aag ttt gat gtg gat 1201 Arg Asp Asp Leu Ile Gly Gly Asp Gly Asn Phe Lys Phe Asp Val Asp 385 390 395 
400 gac ttt gct gaa tct tgg agg gtg gag tcc aat gag ttc tgc aac aga 1249 Asp Phe Ala Glu Ser Trp Arg Val Glu Ser Asn Glu Phe Cys Asn Arg                 405 410 415 cct cag aga aag cca gtg cct gaa ctg tgt caa ggg aca gtc aag gta 1297 Pro Gln Arg Lys Pro Val Pro Glu Leu Cys Gln Gly Thr Val Lys Val             420 425 430 aag ctc cgg gcc cat cga gaa tgc caa aag ctc aaa tcc tgg gag ttt 1345 Lys Leu Arg Ala His Arg Glu Cys Gln Lys Leu Lys Ser Trp Glu Phe         435 440 445 cag acc tgc cac tcg act gtg gac tac gcc act ttc tac cgg tcc tgt 1393 Gln Thr Cys His Ser Thr Val Asp Tyr Ala Thr Phe Tyr Arg Ser Cys     450 455 460 gtg aca gac atg tgt gaa tgt cca gtc cat aaa aac tgt tat tgc gag 1441 Val Thr Asp Met Cys Glu Cys Pro Val His Lys Asn Cys Tyr Cys Glu 465 470 475 480 tca ttt ttg gca tat acc cgg gcc tgc cag aga gag ggc atc aaa gtc 1489 Ser Phe Leu Ala Tyr Thr Arg Ala Cys Gln Arg Glu Gly Ile Lys Val                 485 490 495 cac tgg gag cct cag cag aat tgt gca gcc acc cag tgt aag cat ggt 1537 His Trp Glu Pro Gln Gln Asn Cys Ala Ala Thr Gln Cys Lys His Gly             500 505 510 gct gtg tac gat acc tgt ggt ccg gga tgt atc aag acc tgt gac aac 1585 Ala Val Tyr Asp Thr Cys Gly Pro Gly Cys Ile Lys Thr Cys Asp Asn         515 520 525 tgg aat gaa att ggt cca tgc aac aag ccg tgc gtt gct ggg tgc cac 1633 Trp Asn Glu Ile Gly Pro Cys Asn Lys Pro Cys Val Ala Gly Cys His     530 535 540 tgt cca gca aac ttg gtc ctt cac aag gga agg tgc atc aag cca gtc 1681 Cys Pro Ala Asn Leu Val Leu His Lys Gly Arg Cys Ile Lys Pro Val 545 550 555 560 ctt tgt ccc cag cgg tgacctttgt ttcgatcctt aagactctga aatctggtga 1736 Leu Cys Pro Gln Arg                 565 ctttgacact gaagcggaag agccaatgaa ggactgcagt atttgtgtgc ccgattctgt 1796 aaacacacac acacagagta tatatgtgta tatatatata gatatattca aaaacattgc 1856 atcatttata tgaactatag ggggattatt atatgtatat tttttgctat aagacatgta 1916 ttgtttctag gatcctaacc tgtaagccat tgaacatgtt gtataaatac accaggtgtt 1976 tttaatttaa taaggtggca tgcagataca ttggatagtg ttaacatcac 
atacatttgt 2036 catttttaag gaagttttct aagagccctc aattgcctgc ctgtattaat tttagttttg 2096 agtcaggatt tgtaatggag tgggaaatgt gtttctctgg agaagggcac atttatctag 2156 gggcatttca ggtttccaaa gaaaggaatg tatgcctggg aaagacagca ggagattggt 2216 gaccggccct aatggtgcat gaaaagcgag tgataggctg ttagaatgta ttaggtcatg 2276 ggctaacatt atttccaaaa ttgattggct ggttgccaag aaatatatat ttgtcactag 2336 ggcataaccg aagaatcaag tgatttcttg ccatccttgc gtagtcctca agtcttctaa 2396 tcatctatgt gtgtatgatg gatgtgacca tttatatgtc atgtgttgaa attaaagggt 2456 gtctggatgc catgttaatt ttattgagca tgagaaagta tttgaagaaa ctgttgcaca 2516 acatatgatg aaattttgat ttctctgaat ctcagagtaa tttaagatgg gcaaatccaa 2576 tgagttgatg taaccaatct acattttgtg gctatttcat gtataaaatg aaaggcttta 2636 gttatttcta gaggagtccc atccaacact atttggcgac tgttaatatg ttataattat 2696 tagacagtgg aaaagccttt attgactaat tgatttaacc tcagtccaaa gaacagtctc 2756 tatacttgcc ctcatttact gataggctgc tactagttac catttgattg tcttgacttt 2816 cgtcttcagt tgaattggtg aaaacatcaa caaaaacaaa acccactgtg tcttaaaata 2876 aaatgttctg cattgtagtc aataaagggc ttaagatgta aaacaatgca ttttctctct 2936 caagatgtcc tgtgttatac tttacaatta gttgtatcaa cttcagattc attaaatttt 2996 aacagctgta tttagttata atttttgtat cttacaacag tatatataaa gactattatt 3056 aatagacaaa agcaaaccca cgtttttgac aagccagcat attgccaatg ctcgtcaaag 3116 acatttatgt ttcttttaca ttcatgacag ctaaagtatt aacttggaat tttttttttt 3176 tttttttgac tggaaaatcc ctagttgttt gaattaactg aataaagtgc agggcgtaat 3236 agtgactggc tgggggcact gggcctttgt gttttttttc attcctgtgt ttggatacta 3296 atcaccctcg tatctgcagc ctgaggtagt caggtcattg tttcaatgct gcccttggaa 3356 agaagacttt gacttaggtg ctttgggatc taagtgtggt ttgtgactat ccccattatt 3416 taagcatttg ggggtctcac ttttgcttat ctgtaacaca aagaggccat gtagtatagt 3476 gagcagggtt gagaatcggg aaaactagac ccctgccact aacaggctag gagactttgg 3536 gcagtaatgg gaggcactga gggtttgcaa atactttcct acttgtaatc tcattgattc 3596 cctgctctgc cctgacagga ggggtaggga aaatcagttg acgttgcctg gtggttccta 3656 ccttctgtaa aacgaacaga ttccttaaga ttagctttaa atcccttttc agttcactgt 
3716 tggttctcat ctttgaaaga gggcagcata agggaaatga tggagaaggg agaattgtgg 3776 atggttattt atagctcatg attcatccaa ggactgagtt attctcccca tgaggctaac 3836 cggcctcttt cccaacactt aatcatggca gccaggatgg gtgaatgatg aaagctcttc 3896 tgcatgagcc ggtttcatca tcttgggtgg gttgattaca agtttgtgtg gcattccttt 3956 agctggggat tagcttaatc tctgtacatt tttctcattt ctaattgaag aagaaaggga 4016 gggcagagag gacagaaaat tacttgggtt tactaattca aagagccatc tctgatggga 4076 gagggtgagg aaatcagctg aatggttggc ctgcatattc ttttttaggc cacaggctac 4136 agctgagcaa tggcttagtc taatgattgg gcacaatgag gatttaccaa gtgcttcaat 4196 gagagcagtc ctgtttcatt cacatgggat gttttcaatt ggttcttcaa ctaagctctt 4256 gcagagtgag atgacttgaa ataaaaatgt tctatgaata ttg 4299
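The CDS feature given for SEQ ID NO: 21, `<222> (2) .. (1696)`, can be checked against the listed translation with a short script. The following is a minimal sketch, not part of the patent: the helper names `translate_cds` and `codon_count` are hypothetical, and it assumes the standard genetic code and 1-based, inclusive feature coordinates as used in the listing.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with bases ordered t, c, a, g.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(seq, start, end=None):
    """Translate seq from 1-based position start through end (inclusive).

    Whitespace in seq is ignored; a trailing partial codon is dropped.
    """
    seq = "".join(seq.lower().split())
    end = len(seq) if end is None else end
    coding = seq[start - 1:end]
    usable = len(coding) - len(coding) % 3
    return "".join(CODON_TABLE[coding[i:i + 3]] for i in range(0, usable, 3))


def codon_count(start, end):
    """Number of complete codons in a 1-based, inclusive CDS range."""
    return (end - start + 1) // 3


# First 25 nucleotides of SEQ ID NO: 21, copied from the listing above.
seq21_prefix = "a tgg cag agc ccg gct gag cct tgt"
print(translate_cds(seq21_prefix, start=2))  # WQSPAEPC
print(codon_count(2, 1696))                  # 565
```

The one-letter output corresponds to the Trp Gln Ser Pro Ala Glu Pro Cys shown at the start of the listing, and the codon count agrees with the final numbered residue (Arg 565).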

Continuation of front page

(72) Inventor: Osamu Ohara, c/o Kazusa DNA Research Institute Foundation, 1532-3 Yana, Kisarazu-shi, Chiba

F-terms (reference):
4B024 AA01 AA11 BA80 CA04 CA09 CA12 CA20 DA06 DA20 EA04 GA11 HA11 HA13 HA14
4H045 AA10 AA20 AA30 BA10 CA40 EA20 EA50 FA74

Claims (5)

1. A DNA comprising a nucleotide sequence encoding the polypeptide of (a) or (b) below: (a) a polypeptide consisting of an amino acid sequence identical or substantially identical to the amino acid sequence shown in any one of SEQ ID NOs: 1 to 21 (excluding 11); (b) a polypeptide consisting of an amino acid sequence in which some amino acids are deleted, substituted or added in the amino acid sequence shown in any one of SEQ ID NOs: 1 to 21 (excluding 11), and having a biological activity substantially the same in nature as the function of the polypeptide of (a).
2. A DNA of (a) or (b) below: (a) a DNA comprising, within the nucleotide sequence shown in any one of SEQ ID NOs: 1 to 21 (excluding 11), the nucleotide sequence encoding the amino acid sequence shown in the respective sequence; (b) a DNA that hybridizes with the DNA of (a) under stringent conditions and encodes a protein having a biological activity substantially the same in nature as the function of the polypeptide of (a).
3. A gene comprising the human DNA according to claim 1 or 2.
4. A recombinant polypeptide of (a) or (b) below: (a) a polypeptide consisting of an amino acid sequence identical or substantially identical to the amino acid sequence shown in any one of SEQ ID NOs: 1 to 21 (excluding 11); (b) a polypeptide consisting of an amino acid sequence in which some amino acids are deleted, substituted or added in the amino acid sequence shown in any one of SEQ ID NOs: 1 to 21 (excluding 11), and having a biological activity substantially the same in nature as the function of the polypeptide of (a).
5. A recombinant protein encoded by the gene according to claim 3.
JP2001379607A 2001-12-13 2001-12-13 New gene and protein encoded with the same Pending JP2003180359A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2001379607A JP2003180359A (en) 2001-12-13 2001-12-13 New gene and protein encoded with the same

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2001379607A JP2003180359A (en) 2001-12-13 2001-12-13 New gene and protein encoded with the same

Publications (1)

Publication Number Publication Date
JP2003180359A true JP2003180359A (en) 2003-07-02

Family

ID=27591084

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2001379607A Pending JP2003180359A (en) 2001-12-13 2001-12-13 New gene and protein encoded with the same

Country Status (1)

Country Link
JP (1) JP2003180359A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112513263A (en) * 2018-05-29 2021-03-16 弗门尼舍有限公司 Method for producing a bryodin compound

Similar Documents

Publication Publication Date Title
AU2017267184B2 (en) Method for assessing a prognosis and predicting the response of patients with malignant diseases to immunotherapy
KR101708544B1 (en) Methods and nucleic acids for analyses of cellular proliferative disorders
KR102046668B1 (en) Methods and nucleic acids for determining the prognosis of a cancer subject
US20230056182A1 (en) Use of adeno-associated viral vectors to correct gene defects/ express proteins in hair cells and supporting cells in the inner ear
KR20180093902A (en) Detection of fetal chromosomal anomalies using differentially methylated diene regions between fetuses and pregnant women
CA2941594A1 (en) Genetic polymorphisms of the protein receptor c (procr) associated with myocardial infarction, methods of detection and uses thereof
KR20220012230A (en) Methods and compositions for modulating splicing and translation
KR20060045950A (en) Prognostic for hematological malignancy
KR20180049093A (en) New biomarkers and methods of treatment of cancer
WO2006022629A1 (en) Methods of identifying risk of type ii diabetes and treatments thereof
US6265157B1 (en) Compositions and methods for detecting altered COL1A1 gene sequences
KR102661616B1 (en) GPR156 variants and their uses
IL179831A (en) In vitro method for detecting the presence of or predisposition to autism or to an autism spectrum disorder, and an in vitro method of selecting biologically active compounds on autism or autism spectrum disorders
KR20210144822A (en) Compounds and methods for modulating UBE3A-ATS
JP2002017376A (en) Secretory protein or membrane protein
WO2006022636A1 (en) Methods for identifying risk of type ii diabetes and treatments thereof
CN100516876C (en) Methods for diagnosing RCC and other solid tumors
WO2006022638A1 (en) Methods for identifying risk of type ii diabetes and treatments thereof
JP2003180359A (en) New gene and protein encoded with the same
JP2003259875A (en) Single base polymorphism (4) in human gene
JP2003116575A (en) New gene and protein coded by the same
JP2024086735A (en) Use of adeno-associated virus vectors to correct genetic defects/express proteins in hair cells and supporting cells of the inner ear
CN114053413A (en) Application of COL4A4 gene as acute ischemic stroke treatment target
KR20220118096A (en) A Composition for diagnosis of resistance to anticancer drug
JP2003245081A (en) New gene and protein encoded thereby