CN113227364A - Cells and methods for producing ursodeoxycholic acid and precursors thereof - Google Patents

Publication number
CN113227364A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201980081514.5A
Other languages
English (en)
Inventor
玛丽亚·恩奎斯特-纽曼
艾琳·汤姆
克莱奥·何
克里斯多佛·萨维尔
阿比纳夫·库马尔
劳伦·艾瑟
安德里亚·陈
迈克尔·克莱
阿德里安娜·皮古拉
陈祥云
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Eliston Genetics
Precigen Inc
Original Assignee
Eliston Genetics
Classifications

    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00 Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004 Oxidoreductases (1.)
    • C12N9/0006 Oxidoreductases (1.) acting on CH-OH groups as donors (1.1)
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00 Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09 Recombinant DNA-technology
    • C12N15/11 DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/52 Genes encoding for enzymes or proenzymes
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12P FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P33/00 Preparation of steroids
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12P FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P33/00 Preparation of steroids
    • C12P33/06 Hydroxylating
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Y ENZYMES
    • C12Y101/00 Oxidoreductases acting on the CH-OH group of donors (1.1)
    • C12Y101/01 Oxidoreductases acting on the CH-OH group of donors (1.1) with NAD+ or NADP+ as acceptor (1.1.1)
    • C12Y101/01159 7-Alpha-hydroxysteroid dehydrogenase (1.1.1.159)

Abstract

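A genetically modified cell capable of producing ursodeoxycholic acid (UDCA), cholic acid, and/or another UDCA precursor, the genetically modified cell comprising at least one heterologous polynucleotide encoding an enzyme involved in a metabolic pathway that converts a sugar to UDCA, cholic acid, and/or another UDCA precursor. A method of making UDCA, cholic acid, and/or another UDCA precursor using such a cell. Use of UDCA or a UDCA precursor produced by such a method in the manufacture of a medicament for treating a disease or a symptom of a disease. A medicament comprising UDCA or a UDCA precursor made by such a method. A method of treating a disease or a symptom of a disease, comprising administering UDCA or a UDCA precursor made by such a method. An isolated nucleic acid encoding at least one enzyme involved in a metabolic pathway that converts a sugar to UDCA, cholic acid, and/or another UDCA precursor. A vector comprising a nucleic acid encoding at least one such enzyme. A method of making a genetically modified cell capable of synthesizing UDCA, cholic acid, and/or another UDCA precursor. A composition comprising UDCA or a UDCA precursor, a free acid or CoA form thereof, or a pharmaceutically acceptable derivative or prodrug thereof.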

Description

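Cells and methods for producing ursodeoxycholic acid and precursors thereof
Background of the Invention
The subject matter of the present invention relates to microorganisms, such as yeast and bacteria, that are genetically modified to produce ursodeoxycholic acid ("UDCA") or UDCA precursors. UDCA, also known as ursodiol, is a secondary bile acid produced in bears. Secondary bile acids are formed when primary bile acids produced by the liver are secreted into the intestine and metabolized by intestinal bacteria.
UDCA helps regulate cholesterol by reducing the rate at which the intestine absorbs cholesterol molecules while breaking up cholesterol-containing micelles. UDCA is therefore used to treat cholesterol gallstones non-surgically. UDCA is also used to relieve itching during pregnancy in some women with obstetric cholestasis. In addition, UDCA can be used to treat primary biliary cirrhosis (PBC).
UDCA has never been produced directly by any known microbial system. See, e.g., Tonin, F. and Arends, I.W.C.E., "Latest development in the synthesis of ursodeoxycholic acid (UDCA): a critical review," Beilstein J. Org. Chem. 14:470-483 (2018); see also, e.g., Russell, D.W., "The enzymes, regulation, and genetics of bile acid synthesis," Annu. Rev. Biochem. 72:134-74 (2003). UDCA is currently synthesized at high cost from animal-derived starting materials. There is therefore a need to produce UDCA more cheaply and more efficiently.
Microorganisms in the human gut are known to produce UDCA by metabolizing chenodeoxycholic acid (CDCA), one of the two primary bile acids produced by the human liver, where it is synthesized from cholesterol. Microorganisms, however, do not produce CDCA. It is therefore desirable to engineer a cell or microorganism to produce CDCA, which may be useful in its own right or as an intermediate in producing UDCA.
UDCA can also be produced chemically from cholic acid, the other primary bile acid produced by the human liver, which is likewise synthesized from cholesterol. Cholic acid itself can be used to treat patients with bile acid or peroxisomal disorders. In addition, cholic acid can serve as a starting substrate for synthesizing various chemicals other than UDCA, including the secondary bile acid deoxycholic acid, which has various medical uses, such as a fat emulsifier and a treatment for double chin.
However, cholic acid is currently obtained from slaughtered animals, and the process of isolating the compound is often difficult and/or expensive. Like CDCA, cholic acid is not known to be produced by microorganisms. It is therefore desirable to engineer a cell or microorganism to produce cholic acid, which may be useful in its own right or as an intermediate in producing other useful chemicals.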
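Summary of the Invention
The present invention relates in part to a genetically modified cell capable of producing UDCA or a UDCA precursor. The cell may comprise at least one heterologous enzyme involved in a metabolic pathway that converts a sugar to UDCA or a UDCA precursor, and/or at least one heterologous polynucleotide encoding such an enzyme.
The invention also relates to a method of making UDCA or a UDCA precursor. The method comprises contacting a substrate with the genetically modified cell described above and growing the cell to produce UDCA or a UDCA precursor.
The invention also relates to the use of UDCA or a UDCA precursor in the manufacture of a medicament for treating a disease or a symptom of a disease, and to such a medicament.
The invention also relates to a method of treating a disease or a symptom of a disease, the method comprising administering UDCA or a UDCA precursor to a subject in need thereof.
Yet another aspect of the invention is a nucleic acid encoding at least one enzyme involved in a metabolic pathway that converts a sugar to UDCA or a UDCA precursor, or a vector comprising such a nucleic acid.
Another aspect of the invention is a method of making a genetically modified cell capable of synthesizing UDCA or a UDCA precursor, the method comprising: contacting a cell with at least one heterologous polynucleotide encoding an enzyme involved in a metabolic pathway that converts a sugar to UDCA or a UDCA precursor; and growing the cell such that the enzyme is expressed in the cell.
Yet another aspect of the invention is a composition comprising UDCA or a UDCA precursor, a free acid or CoA form thereof, or a pharmaceutically acceptable derivative or prodrug thereof.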
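Brief Description of the Drawings
The novel features of the invention are set forth with particularity in the appended claims. A better understanding of the features and advantages of the invention will be obtained by reference to the following detailed description, which sets forth illustrative embodiments in which the principles of the invention are utilized, and to the accompanying drawings, in which:
FIG. 1 shows a 13-step enzymatic pathway from cholesterol to UDCA. The genes encoding this 13-step pathway (which include CYP7A1, HSD3B7, AKR1D1, AKR1C4, CYP27A1, SLC27A5, racemase, ACOX2, HSD17B4, peroxisomal thiolase 2, 7α-HSD, 7β-HSD, and choloyl-CoA hydrolase) were introduced into yeast.
FIG. 2 shows a 2-step enzymatic pathway from cholesta-5,7,24-trienol (a native yeast sterol) to cholesterol. The genes encoding this 2-step pathway include DHCR7 and DHCR24.
FIG. 3 shows the steps for preparing samples for mass spectrometry analysis. This protocol is performed on the genetically modified microorganisms described throughout in order to determine the levels of UDCA and/or UDCA precursors produced.
FIG. 4 shows two alternative methods of preparing samples for mass spectrometry analysis. This protocol is performed on the genetically modified microorganisms described throughout in order to determine the levels of UDCA and/or UDCA precursors produced.
FIG. 5 shows the relative amounts of cholesterol produced by yeast strains expressing various DHCR24 variants. The DHCR24 variants from Homo sapiens and Danio rerio (zebrafish) showed the best activity.
FIG. 6 shows the activity of CYP7A1 variants in producing 7-α-hydroxycholesterol from cholesterol. CYP7A1 from Mus musculus showed the best activity.
FIG. 7 shows the activity of HSD3B7 variants in producing 7α-hydroxy-4-cholesten-3-one from 7-α-hydroxycholesterol. HSD3B7 from Homo sapiens showed the best activity.
FIG. 8 shows the activity of AKR1D1 variants in producing 7α-hydroxy-5β-cholestan-3-one from 7α-hydroxy-4-cholesten-3-one. AKR1D1 from Homo sapiens and Mus musculus showed the best activity.
FIG. 9 shows the activity of AKR1C4 variants in producing 5β-cholestane-3α,7α-diol from 7α-hydroxy-5β-cholestan-3-one. AKR1C4 from Macaca fuscata showed the best activity.
FIG. 10 shows the activity of CYP8B1 variants in producing 7α,12α-dihydroxy-4-cholesten-3-one from 7α-hydroxy-4-cholesten-3-one. CYP8B1 from Mus musculus and Oryctolagus cuniculus showed the best activity.
FIG. 11 shows the activity of CYP27A1 variants in producing (25R)-3α,7α-dihydroxy-5β-cholestanoic acid from 5β-cholestane-3α,7α-diol. To make CYP27A1 activity easier to detect, SLC27A5 from Homo sapiens was introduced into the strains and the SLC27A5 product was measured by mass spectrometry. Most variants were able to produce the SLC27A5 product.
FIGS. 12A and 12B show CoA ligase activity on (25R)-3α,7α,12α-trihydroxy-5β-cholestan-26-oic acid when different SLC27A5 variants are expressed. FIG. 12A shows HPLC data indicating that a peak specific to the ligase-expressing strains was detected. FIG. 12B shows mass spectrometry data confirming the presence of active ligase in the expressing strains. The CoA ligase was also noted to show activity with 3α,5β,7α,12α,24E-trihydroxy-cholest-24-en-26-oic acid as a substrate.
FIGS. 13A and 13B show the activity of AMACR variants and ACOX2 variants in producing different products. FIG. 13A shows that AMACR from Homo sapiens and Rattus norvegicus displayed excellent racemization activity in converting (25R)-3α,7α-dihydroxy-5β-cholestanoyl-CoA to (25S)-3α,7α-dihydroxy-5β-cholestanoyl-CoA. FIG. 13B shows that ACOX2 from Homo sapiens, in combination with Homo sapiens AMACR, had the best activity in converting (25S)-3α,7α-dihydroxy-5β-cholestanoyl-CoA to (24E)-3α,7α-dihydroxy-5β-cholest-24-enoyl-CoA.
FIG. 14 shows the activity of ACOX2 variants in producing (24E)-3α,7α-dihydroxy-5β-cholest-24-enoyl-CoA from (25S)-3α,7α-dihydroxy-5β-cholestanoyl-CoA. ACOX2 from Homo sapiens and Oryctolagus cuniculus showed the best activity.
FIG. 15 shows the activity of HSD17B4 variants in producing 3α,7α-dihydroxy-24-oxo-5β-cholestanoyl-CoA from (24E)-3α,7α-dihydroxy-5β-cholest-24-enoyl-CoA. HSD17B4 from Rattus norvegicus, Bos taurus, and Xenopus laevis showed the best activity.
FIG. 16 shows the activity of SCP2 variants in producing 3α,7α-dihydroxy-5β-cholan-24-oyl-CoA from 3α,7α-dihydroxy-24-oxo-5β-cholestanoyl-CoA. SCP2 activity was detected by LC/MS in all samples, including the negative control. However, enhanced activity was observed in strains overexpressing the native yeast gene POT1.
FIG. 17 shows the activity of 7α-HSD variants in producing 3α-hydroxy-7-oxo-5β-cholan-24-oyl-CoA from 3α,7α-dihydroxy-5β-cholan-24-oyl-CoA. 7α-HSD from Escherichia coli and Bacteroides fragilis showed the best activity.
FIG. 18 shows the activity of 7β-HSD variants in producing 3α,7β-dihydroxy-5β-cholan-24-oyl-CoA from 3α-hydroxy-7-oxo-5β-cholan-24-oyl-CoA. 7β-HSD from Clostridium sardiniense showed the best activity.
FIG. 19 shows the activity of several combinations of thiolase/SCP2, 7α-HSD, and 7β-HSD. The strains were then tested by GC/MS for their ability to produce UDCA/UDC-CoA. The following combinations showed the best activity: POT1 thiolase, Escco (E. coli) 7α-HSD, and Closa (C. sardiniense) 7β-HSD; and POT1 thiolase, Bacfr (B. fragilis) 7α-HSD, and C. sardiniense 7β-HSD.
FIG. 20 shows the various enzymes described herein that are involved in the pathway for producing UDCA from sugar, the product of each enzyme, and (where applicable) the corresponding CoA and free acid forms of those products. The CoA and free acid forms are produced by the microorganisms and methods described throughout.
FIG. 21 shows a 12-step enzymatic pathway from cholesterol to cholic acid. The genes encoding this 12-step pathway (which include CYP7A1, HSD3B7, CYP8B1, AKR1D1, AKR1C4, CYP27A1, SLC27A5, racemase, ACOX2, HSD17B4, peroxisomal thiolase 2, and choloyl-CoA hydrolase) were introduced into yeast.
FIG. 22 shows the various enzymes described herein that are involved in the pathway for producing cholic acid from sugar, the product of each enzyme, and (where applicable) the corresponding CoA and free acid forms of those products. The CoA and free acid forms are produced by the microorganisms and methods described throughout.
FIG. 23 shows the activity of CYP8B1 variants in producing 7α,12α-dihydroxy-4-cholesten-3-one from 7α-hydroxy-4-cholesten-3-one. CYP8B1 from Mus musculus and Oryctolagus cuniculus showed the best activity.
FIG. 24 depicts a flow chart showing the steps for performing liquid chromatography and mass spectrometry on the products.
FIG. 25 shows the relative amounts of cholic acid detected from a yeast strain expressing CYP8B1 from Mus musculus and from a yeast strain not expressing CYP8B1. The results show that CYP8B1 from Mus musculus is active and that choloyl-CoA was produced (cholic acid was detected). No cholic acid was detected in the strain lacking the CYP8B1 enzyme.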
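Detailed Description of the Invention
Definitions
The term "about" and its grammatical equivalents, as used herein in relation to a reference numerical value, include the value itself and a range of values from 10% below to 10% above that value. For example, the amount "about 10" includes 10 and any amount from 9 to 11.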
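The definition of "about" can be illustrated with a short Python sketch. The function names `about_range` and `is_about` are hypothetical and introduced here for illustration only; the 10% tolerance is the one stated in the definition.

```python
def about_range(value: float, tolerance: float = 0.10) -> tuple[float, float]:
    """Return the (low, high) bounds covered by "about <value>"."""
    delta = abs(value) * tolerance  # plus or minus 10% of the reference value
    return (value - delta, value + delta)


def is_about(amount: float, value: float, tolerance: float = 0.10) -> bool:
    """True if `amount` falls inside the range covered by "about <value>"."""
    low, high = about_range(value, tolerance)
    return low <= amount <= high


low, high = about_range(10)
```

Under this reading, `is_about(9, 10)` and `is_about(11, 10)` both hold, while an amount of 11.5 falls outside "about 10".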
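The terms "genetic modification" or "genetically modified" and their grammatical equivalents, as used herein, refer to one or more alterations of a nucleic acid, or to a cell containing a modification to its genome.
The terms "operably linked," "operably coupled," and their grammatical equivalents are used interchangeably herein and refer to two or more units that function together to produce a result. For example, with respect to gene expression, a polynucleotide encoding a promoter may be operably linked to a polynucleotide encoding a gene, which under the right conditions can result in expression of the gene. With respect to a metabolic pathway, the term operably linked may refer to two or more enzymes that function in the pathway to convert a substrate into a product. The two or more enzymes may be consecutive in the pathway. In some cases, the two or more enzymes are not directly consecutive in the pathway.
The terms "and/or" and "any combination thereof" and their grammatical equivalents are used interchangeably herein and indicate that any combination is specifically contemplated. For illustrative purposes only, the phrases "A, B, and/or C" or "A, B, C, or any combination thereof" can mean "A alone; B alone; C alone; A and B; B and C; A and C; and A, B, and C."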
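The enumeration given for "A, B, and/or C" is the set of all non-empty combinations of the listed items. A minimal Python sketch follows; the function name `and_or_combinations` is hypothetical.

```python
from itertools import combinations


def and_or_combinations(items):
    """List every non-empty combination covered by "X, Y, and/or Z"."""
    result = []
    for size in range(1, len(items) + 1):
        # combinations() yields each subset of the given size exactly once
        result.extend(combinations(items, size))
    return result


combos = and_or_combinations(["A", "B", "C"])
```

For three items this yields seven combinations, matching the seven alternatives listed in the definition.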
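The term "sugar" and its grammatical equivalents, as used herein, include but are not limited to (i) simple carbohydrates, such as monosaccharides (e.g., glucose, fructose, galactose, ribose); disaccharides (e.g., maltose, sucrose, lactose); oligosaccharides (e.g., raffinose, stachyose); or (ii) complex carbohydrates, such as starch (e.g., long chains of glucose, amylose, amylopectin); glycogen; and fiber (e.g., cellulose, hemicellulose, pectin, gum, mucilage).
The term "alcohol" and its grammatical equivalents, as used herein, include but are not limited to any organic compound in which a hydroxyl functional group (-OH) is bound to a saturated carbon atom. For example, the term alcohol includes: monohydric alcohols (e.g., methanol, ethanol, isopropanol, butanol, pentanol, cetyl alcohol); polyhydric alcohols (e.g., ethylene glycol, propylene glycol, glycerol, erythritol, threitol, xylitol, mannitol, sorbitol, volemitol); unsaturated aliphatic alcohols (e.g., allyl alcohol, geraniol, propargyl alcohol); and alicyclic alcohols (e.g., inositol, menthol).
The term "fatty acid" and its grammatical equivalents, as used herein, include but are not limited to carboxylic acids having a long aliphatic chain that is either saturated or unsaturated. Examples of unsaturated fatty acids include, but are not limited to, myristoleic acid, sapienic acid, linoelaidic acid, α-linolenic acid, stearidonic acid, eicosapentaenoic acid, docosahexaenoic acid, linoleic acid, γ-linolenic acid, dihomo-γ-linolenic acid, arachidonic acid, docosatetraenoic acid, palmitoleic acid, vaccenic acid, paullinic acid, oleic acid, elaidic acid, gondoic acid, erucic acid, nervonic acid, and mead acid. Examples of saturated fatty acids include, but are not limited to, propionic acid, butyric acid, valeric acid, caproic acid, enanthic acid, caprylic acid, pelargonic acid, capric acid, undecylic acid, lauric acid, tridecylic acid, myristic acid, pentadecylic acid, palmitic acid, margaric acid, stearic acid, nonadecylic acid, arachidic acid, heneicosylic acid, behenic acid, tricosylic acid, lignoceric acid, pentacosylic acid, cerotic acid, heptacosylic acid, montanic acid, nonacosylic acid, melissic acid, hentriacontylic acid, lacceroic acid, psyllic acid, geddic acid, ceroplastic acid, hexatriacontylic acid, heptatriacontanoic acid, and octatriacontanoic acid.
The term "substantially pure" and its grammatical equivalents, as used herein, mean that a particular substance does not contain a majority of another substance. For example, "substantially pure UDCA" may mean that the substance comprises at least 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.1%, 99.2%, 99.3%, 99.4%, 99.5%, 99.6%, 99.7%, 99.8%, 99.9%, 99.99%, 99.999%, or 99.9999% UDCA.
The term "heterologous" and its grammatical equivalents, as used herein, mean that a substance is derived from a species different from that of the host microorganism. For example, a "heterologous gene" means that the gene is from a species different from the species of the host microorganism.
The term "substantially identical" and its grammatical equivalents, as used herein in relation to sequences, mean that the sequences are at least 50% identical. In some cases, the term substantially identical refers to a sequence that is at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to a reference sequence. The percent identity between two sequences is determined by aligning the two sequences, using for example the alignment method of Needleman and Wunsch (J. Mol. Biol., 1970, 48:443), as revised by Smith and Waterman (Adv. Appl. Math., 1981, 2:482), so that the highest order match is obtained between the two sequences, and by counting the number of identical amino acids/nucleotides between the two sequences. Methods of calculating the percent identity between two amino acid sequences are generally art-recognized and include, for example, those described by Carillo and Lipton (SIAM J. Applied Math., 1988, 48:1073) and those described in Computational Molecular Biology, Lesk, ed., Oxford University Press, New York, 1988, and Biocomputing: Informatics and Genomics Projects. Generally, computer programs will be employed for such calculations. Computer programs that may be used in this regard include, but are not limited to, GCG (Devereux et al., Nucleic Acids Res., 1984, 12:387), BLASTP, BLASTN, and FASTA (Altschul et al., J. Molec. Biol., 1990, 215:403). A particularly preferred method for determining the percent identity between two polypeptides involves the Clustal W algorithm (Thompson, J.D., Higgins, D.G., and Gibson, T.J., 1994, Nucleic Acids Res. 22(22):4673-4680) together with the BLOSUM 62 scoring matrix (Henikoff, S. and Henikoff, J.G., 1992, Proc. Natl. Acad. Sci. USA 89:10915-10919), using a gap opening penalty of 10 and a gap extension penalty of 0.1, so that the highest order match is obtained between the two sequences, wherein at least 50% of the total length of one of the two sequences is included in the alignment.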
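As a rough illustration of how a percent identity might be computed, the following Python sketch performs a Needleman-Wunsch style global alignment and counts identical columns. It is a simplification under stated assumptions, not the preferred Clustal W/BLOSUM 62 procedure: the flat match, mismatch, and linear gap scores are placeholder values, the denominator (alignment length) is one of several conventions, and all function names are hypothetical.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-2):
    """Needleman-Wunsch global alignment with flat (assumed) scores."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b):
    """Identical aligned positions as a percentage of the alignment length."""
    aligned_a, aligned_b = global_align(a, b)
    if not aligned_a:
        return 0.0
    same = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * same / len(aligned_a)


def is_substantially_identical(a, b, threshold=50.0):
    """Apply the 'at least 50% identical' definition (threshold adjustable)."""
    return percent_identity(a, b) >= threshold
```

Raising `threshold` to 90.0 or 99.0 corresponds to the narrower identity levels listed in the definition. A production comparison would normally rely on an established alignment program rather than this sketch.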
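The terms "UDCA intermediate," "UDCA precursor," and their grammatical equivalents are used interchangeably and refer to any substrate that can be used to produce UDCA. This includes substrates that are far removed from UDCA itself, such as sugars, desmosterol, and cholesterol. The terms also expressly include 7-α-hydroxycholesterol, 7α-hydroxy-4-cholesten-3-one, 7α-hydroxy-5β-cholestan-3-one, 5β-cholestane-3α,7α-diol, (25R)-3α,7α-dihydroxy-5β-cholestanoic acid, (25R)-3α,7α-dihydroxy-5β-cholestanoyl-CoA, (25S)-3α,7α-dihydroxy-5β-cholestanoyl-CoA, (24E)-3α,7α-dihydroxy-5β-cholest-24-enoyl-CoA, 3α,7α-dihydroxy-24-oxo-5β-cholestanoyl-CoA, 3α,7α-dihydroxy-5β-cholan-24-oyl-CoA, 3α-hydroxy-7-oxo-5β-cholan-24-oyl-CoA, 3α,7β-dihydroxy-5β-cholan-24-oyl-CoA, 7α,12α-dihydroxy-4-cholesten-3-one, 7α,12α-dihydroxy-5β-cholestan-3-one, 5β-cholestane-3α,7α,12α-triol, (25R)-3α,7α,12α-trihydroxy-5β-cholestan-26-oic acid, (25R)-3α,7α,12α-trihydroxy-5β-cholestanoyl-CoA, (25S)-3α,7α,12α-trihydroxy-5β-cholestanoyl-CoA, (24E)-3α,7α,12α-trihydroxy-5β-cholest-24-enoyl-CoA, 3α,7α,12α-trihydroxy-24-oxo-5β-cholestanoyl-CoA, 3α,7α,12α-trihydroxy-5β-cholan-24-oyl-CoA, and cholic acid.
As will be apparent to those skilled in the art upon reading this disclosure, each of the individual embodiments described and illustrated herein has discrete components and features that may be readily separated from or combined with the features of any of the other several examples without departing from the scope or spirit of the invention. Any recited method can be carried out in the order of events recited or in any other order that is logically possible.
Unless otherwise defined herein, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although any methods and materials similar or equivalent to those described herein can also be used in the practice or testing of the invention, representative illustrative methods and materials are now described.
The publications discussed herein are provided solely for their disclosure prior to the filing date of the present application. Nothing herein is to be construed as an admission that the present invention is not entitled to antedate such publications by virtue of prior invention. Further, the dates of publication provided may differ from the actual publication dates, which may need to be independently confirmed.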
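Biosynthetic Pathways
The present invention relates in part to biosynthetic pathways for producing UDCA or UDCA precursors. UDCA, also known as "ursodeoxycholic acid" or "ursodiol," is a secondary bile acid with the molecular formula C24H40O4, a molar mass of 392.56 g/mol, and CAS number 128-13-2.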
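The stated molar mass can be cross-checked from the molecular formula. The Python sketch below is illustrative only: the atomic weights are rounded standard values supplied as an assumption, and the helper name `molar_mass` is hypothetical. The parser handles only simple formulas without parentheses.

```python
import re

# Rounded standard atomic weights in g/mol (assumed values for this sketch).
ATOMIC_WEIGHTS = {"C": 12.011, "H": 1.008, "O": 15.999}


def molar_mass(formula: str) -> float:
    """Sum atomic weights for a simple formula such as 'C24H40O4'."""
    total = 0.0
    # Each match is an element symbol followed by an optional count.
    for symbol, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total += ATOMIC_WEIGHTS[symbol] * (int(count) if count else 1)
    return total


udca_mass = molar_mass("C24H40O4")
```

With these rounded weights the sum is approximately 392.58 g/mol, which agrees with the figure given in the text to within the rounding of the atomic weights used.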
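In certain embodiments, the pathway includes converting 3α,7α-dihydroxy-5β-cholanoic acid (also known as chenodeoxycholic acid or CDCA) to UDCA.
In certain embodiments, the pathway includes converting the CoA form of CDCA to UDCA. The CoA form of CDCA is 3α,7α-dihydroxy-5β-cholan-24-oyl-CoA, also known as chenodeoxycholoyl-CoA or CDC-CoA.
In certain embodiments, the conversion of CDC-CoA to UDCA involves at least one of the following reactions: conversion of CDC-CoA to 3α-hydroxy-7-oxo-5β-cholan-24-oyl-CoA; conversion of 3α-hydroxy-7-oxo-5β-cholan-24-oyl-CoA to 3α,7β-dihydroxy-5β-cholan-24-oyl-CoA; and/or conversion of 3α,7β-dihydroxy-5β-cholan-24-oyl-CoA to UDCA.
In certain embodiments, the pathway includes the conversion of cholesterol to CDCA or CDC-CoA.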
在某些实施方案中,胆固醇向CDC-CoA的转化涉及以下反应中的至少一种:胆固醇向7-α-羟基胆固醇的转化;7-α-羟基胆固醇向7α-羟基-4-胆甾烯-3-酮的转化;7α-羟基-4-胆甾烯-3-酮向7α-羟基-5β-胆甾烷-3-酮的转化;7α-羟基-5β-胆甾烷-3-酮向5β-胆甾烷-3α,7α-二醇的转化;5β-胆甾烷-3α,7α-二醇向(25R)-3α,7α-二羟基-5β-胆甾烷酸的转化;(25R)-3α,7α-二羟基-5β-胆甾烷酸向(25R)-3α,7α-二羟基-5β-胆甾烷酰基-CoA的转化;(25R)-3α,7α-二羟基-5β-胆甾烷酰基-CoA向(25S)-3α,7α-二羟基-5β-胆甾烷酰基-CoA的转化;(25S)-3α,7α-二羟基-5β-胆甾烷酰基-CoA向(24E)-3α,7α-二羟基-5β-胆甾-24-烯酰基-CoA的转化;(24E)-3α,7α-二羟基-5β-胆甾-24-烯酰基-CoA向3α,7α-二羟基-24-氧-5β-胆甾烷酰基-CoA的转化;和/或3α,7α-二羟基-24-氧-5β-胆甾烷酰基-CoA向CDC-CoA的转化。
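In certain embodiments, the pathway includes the conversion of cholesterol to cholic acid. Cholic acid can be chemically converted to UDCA.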
在某些实施方案中,胆固醇向胆酸的转化可涉及以下反应中的至少一种:胆固醇向7-α-羟基胆固醇的转化;7-α-羟基胆固醇向7α-羟基-4-胆甾烯-3-酮的转化;7α-羟基-4-胆甾烯-3-酮向7α,12α-二羟基-4-胆甾烯-3-酮的转化;7α,12α-二羟基-4-胆甾烯-3-酮向7α,12α-二羟基-5β-胆甾烷-3-酮的转化;7α,12α-二羟基-5β-胆甾烷-3-酮向5β-胆甾烷-3α,7α,12α-三醇的转化;5β-胆甾烷-3α,7α,12α-三醇向(25R)-3α,7α,12α-三羟基-5β-胆甾烷-26-酸的转化;(25R)-3α,7α,12α-三羟基-5β-胆甾烷-26-酸向(25R)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA的转化;(25R)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA向(25S)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA的转化;(25S)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA向(24E)-3α,7α,12α-三羟基-5β-胆甾-24-烯酰基-CoA的转化;(24E)-3α,7α,12α-三羟基-5β-胆甾-24-烯酰基-CoA向3α,7α,12α-三羟基-24-氧-5β-胆甾烷酰基-CoA的转化;3α,7α,12α-三羟基-24-氧-5β-胆甾烷酰基-CoA向3α,7α,12α-三羟基-5β-胆烷-24-酰基-CoA的转化;以及3α,7α,12α-三羟基-5β-胆烷-24-酰基-CoA向胆酸的转化。
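In certain embodiments, the pathway includes the conversion of cholesta-5,7,24-trienol to cholesterol. The conversion of cholesta-5,7,24-trienol to cholesterol may include the conversion of cholesta-5,7,24-trienol to desmosterol and/or the conversion of desmosterol to cholesterol. Cholesta-5,7,24-trienol is produced naturally from sugar by yeast.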
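Each of the foregoing reactions and/or conversions may be catalyzed by an enzyme. For example:
7-Dehydrocholesterol reductase (gene name: DHCR7) catalyzes the conversion of cholesta-5,7,24-trienol to desmosterol. DHCR7 may comprise the amino acid sequence of any one of SEQ ID NOs: 1, 3, 5, 7, 9, or 11, or an amino acid sequence substantially identical to any of the foregoing. DHCR7 may be encoded by a polynucleotide comprising the nucleic acid sequence of any one of SEQ ID NOs: 2, 4, 6, 8, 10, or 12, or a nucleic acid sequence substantially identical to any of the foregoing.
24-Dehydrocholesterol reductase (gene name: DHCR24) catalyzes the conversion of desmosterol to cholesterol. DHCR24 may comprise the amino acid sequence of any one of SEQ ID NOs: 13, 17, 21, 25, 29, 33, 37, 41, 43, 45, or 47, or an amino acid sequence substantially identical to any of the foregoing. DHCR24 may be encoded by a polynucleotide comprising the nucleic acid sequence of any one of SEQ ID NOs: 14, 15, 16, 18, 19, 20, 22, 23, 24, 26, 27, 28, 30, 31, 32, 34, 35, 36, 38, 39, 40, 42, 44, 46, or 48, or a nucleic acid sequence substantially identical to any of the foregoing.
Cytochrome P450 family 7 subfamily A member 1 (abbreviation and gene name: CYP7A1) catalyzes the conversion of cholesterol to 7-α-hydroxycholesterol. CYP7A1 may comprise the amino acid sequence of any one of SEQ ID NOs: 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, or 79, or an amino acid sequence substantially identical to any of the foregoing. CYP7A1 may be encoded by a polynucleotide comprising the nucleic acid sequence of any one of SEQ ID NOs: 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, or 80, or a nucleic acid sequence substantially identical to any of the foregoing.
3β-Hydroxysteroid dehydrogenase type 7 (abbreviation and gene name: HSD3B7) catalyzes the conversion of 7-α-hydroxycholesterol to 7α-hydroxy-4-cholesten-3-one. HSD3B7 may comprise the amino acid sequence of any one of SEQ ID NOs: 81, 83, 85, or 87, or an amino acid sequence substantially identical to any of the foregoing. HSD3B7 may be encoded by a polynucleotide comprising the nucleic acid sequence of any one of SEQ ID NOs: 82, 84, 86, or 88, or a nucleic acid sequence substantially identical to any of the foregoing.
Cytochrome P450 family 8 subfamily B member 1 (abbreviation and gene name: CYP8B1) catalyzes the conversion of 7α-hydroxy-4-cholesten-3-one to 7α,12α-dihydroxy-4-cholesten-3-one. CYP8B1 may comprise the amino acid sequence of any one of SEQ ID NOs: 265, 267, 269, 271, 273, 275, or 277, or an amino acid sequence substantially identical to any of the foregoing. CYP8B1 may be encoded by a polynucleotide comprising the nucleic acid sequence of any one of SEQ ID NOs: 266, 268, 270, 272, 274, 276, or 278, or a nucleic acid sequence substantially identical to any of the foregoing.
3-Oxo-5-beta(β)-steroid 4-dehydrogenase, also known as aldo-keto reductase family 1 member D1 (abbreviation and gene name: AKR1D1), catalyzes the conversion of 7α-hydroxy-4-cholesten-3-one to 7α-hydroxy-5β-cholestan-3-one. AKR1D1 also catalyzes the conversion of 7α,12α-dihydroxy-4-cholesten-3-one to 7α,12α-dihydroxy-5β-cholestan-3-one. AKR1D1 may comprise the amino acid sequence of any one of SEQ ID NOs: 89, 91, 93, or 95, or an amino acid sequence substantially identical to any of the foregoing. AKR1D1 may be encoded by a polynucleotide comprising the nucleic acid sequence of any one of SEQ ID NOs: 90, 92, 94, or 96, or a nucleic acid sequence substantially identical to any of the foregoing.
Aldo-keto reductase family 1 member C4 (abbreviation and gene name: AKR1C4) catalyzes the conversion of 7α-hydroxy-5β-cholestan-3-one to 5β-cholestane-3α,7α-diol. AKR1C4 also catalyzes the conversion of 7α,12α-dihydroxy-5β-cholestan-3-one to 5β-cholestane-3α,7α,12α-triol. AKR1C4 may comprise the amino acid sequence of any one of SEQ ID NOs: 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, or 121, or an amino acid sequence substantially identical to any of the foregoing. AKR1C4 may be encoded by a polynucleotide comprising the nucleic acid sequence of any one of SEQ ID NOs: 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, or 122, or a nucleic acid sequence substantially identical to any of the foregoing.
Cytochrome P450 family 27 subfamily A member 1 (abbreviation and gene name: CYP27A1), also known as sterol 27-hydroxylase, catalyzes the conversion of 5β-cholestane-3α,7α-diol to (25R)-3α,7α-dihydroxy-5β-cholestanoic acid. CYP27A1 also catalyzes the conversion of 5β-cholestane-3α,7α,12α-triol to (25R)-3α,7α,12α-trihydroxy-5β-cholestan-26-oic acid. CYP27A1 may comprise the amino acid sequence of any one of SEQ ID NOs: 123, 125, 127, 129, 131, 133, 135, or 137, or an amino acid sequence substantially identical to any of the foregoing. CYP27A1 may be encoded by a polynucleotide comprising the nucleic acid sequence of any one of SEQ ID NOs: 124, 126, 128, 130, 132, 134, 136, or 138, or a nucleic acid sequence substantially identical to any of the foregoing.
Solute carrier family 27 member 5 (abbreviation and gene name: SLC27A5), or its yeast homolog FAT1, catalyzes the conversion of (25R)-3α,7α-dihydroxy-5β-cholestanoic acid to (25R)-3α,7α-dihydroxy-5β-cholestanoyl-CoA. SLC27A5 and FAT1 also catalyze the conversion of (25R)-3α,7α,12α-trihydroxy-5β-cholestan-26-oic acid to (25R)-3α,7α,12α-trihydroxy-5β-cholestanoyl-CoA. SLC27A5 may comprise the amino acid sequence of SEQ ID NO: 139 or 141, or an amino acid sequence substantially identical to either of the foregoing. SLC27A5 may be encoded by a polynucleotide comprising the nucleic acid sequence of SEQ ID NO: 140 or 142, or a nucleic acid sequence substantially identical to either of the foregoing. FAT1 may comprise the amino acid sequence of SEQ ID NO: 143 or an amino acid sequence substantially identical thereto. FAT1 may be encoded by a polynucleotide comprising the nucleic acid sequence of SEQ ID NO: 144 or a nucleic acid sequence substantially identical thereto.
α-Methylacyl-CoA racemase (abbreviation and gene name: AMACR) catalyzes the conversion of (25R)-3α,7α-dihydroxy-5β-cholestanoyl-CoA to (25S)-3α,7α-dihydroxy-5β-cholestanoyl-CoA. AMACR also catalyzes the conversion of (25R)-3α,7α,12α-trihydroxy-5β-cholestanoyl-CoA to (25S)-3α,7α,12α-trihydroxy-5β-cholestanoyl-CoA. AMACR may comprise the amino acid sequence of any one of SEQ ID NOs: 145, 147, 149, 151, 153, 155, or 157, or an amino acid sequence substantially identical to any of the foregoing. AMACR may be encoded by a polynucleotide comprising the nucleic acid sequence of any one of SEQ ID NOs: 146, 148, 150, 152, 154, 156, or 158, or a nucleic acid sequence substantially identical to any of the foregoing.
Acyl-CoA oxidase 2 (abbreviation and gene name: ACOX2), or its yeast homolog POX1, catalyzes the conversion of (25S)-3α,7α-dihydroxy-5β-cholestanoyl-CoA to (24E)-3α,7α-dihydroxy-5β-cholest-24-enoyl-CoA. ACOX2 and POX1 also catalyze the conversion of (25S)-3α,7α,12α-trihydroxy-5β-cholestanoyl-CoA to (24E)-3α,7α,12α-trihydroxy-5β-cholest-24-enoyl-CoA. ACOX2 may comprise the amino acid sequence of any one of SEQ ID NOs: 159, 161, 163, 165, 167, 169, 171, or 173, or an amino acid sequence substantially identical to any of the foregoing. ACOX2 may be encoded by a polynucleotide comprising the nucleic acid sequence of any one of SEQ ID NOs: 160, 162, 164, 166, 168, 170, 172, or 174, or a nucleic acid sequence substantially identical to any of the foregoing. POX1 may comprise the amino acid sequence of SEQ ID NO: 175 or an amino acid sequence substantially identical thereto. POX1 may be encoded by a polynucleotide comprising the nucleic acid sequence of SEQ ID NO: 176 or a nucleic acid sequence substantially identical thereto.
Hydroxysteroid 17-β dehydrogenase 4 (abbreviation and gene name: HSD17B4), or its yeast homolog FOX2, catalyzes the conversion of (24E)-3α,7α-dihydroxy-5β-cholest-24-enoyl-CoA to 3α,7α-dihydroxy-24-oxo-5β-cholestanoyl-CoA. HSD17B4 and FOX2 also catalyze the conversion of (24E)-3α,7α,12α-trihydroxy-5β-cholest-24-enoyl-CoA to 3α,7α,12α-trihydroxy-24-oxo-5β-cholestanoyl-CoA. HSD17B4 may comprise the amino acid sequence of any one of SEQ ID NOs: 177, 179, 181, 183, 185, 187, 189, or 191, or an amino acid sequence substantially identical to any of the foregoing. HSD17B4 may be encoded by a polynucleotide comprising the nucleic acid sequence of any one of SEQ ID NOs: 178, 180, 182, 184, 186, 188, 190, or 192, or a nucleic acid sequence substantially identical to any of the foregoing. FOX2 may comprise the amino acid sequence of SEQ ID NO: 193 or an amino acid sequence substantially identical thereto. FOX2 may be encoded by a polynucleotide comprising the nucleic acid sequence of SEQ ID NO: 194 or a nucleic acid sequence substantially identical thereto.
Sterol carrier protein 2 (abbreviation and gene name: SCP2), or its yeast homolog POT1 or ERG10, catalyzes the conversion of 3α,7α-dihydroxy-24-oxo-5β-cholestanoyl-CoA to CDC-CoA. SCP2, POT1, and ERG10 also catalyze the conversion of 3α,7α,12α-trihydroxy-24-oxo-5β-cholestanoyl-CoA to 3α,7α,12α-trihydroxy-5β-cholan-24-oyl-CoA. SCP2 may comprise the amino acid sequence of any one of SEQ ID NOs: 195, 197, 199, or 201, or an amino acid sequence substantially identical to any of the foregoing. SCP2 may be encoded by a polynucleotide comprising the nucleic acid sequence of any one of SEQ ID NOs: 196, 198, 200, or 202, or a nucleic acid sequence substantially identical to any of the foregoing. POT1 may comprise the amino acid sequence of SEQ ID NO: 203 or an amino acid sequence substantially identical thereto. POT1 may be encoded by a polynucleotide comprising the nucleic acid sequence of SEQ ID NO: 204 or a nucleic acid sequence substantially identical thereto. ERG10 may comprise the amino acid sequence of SEQ ID NO: 205 or an amino acid sequence substantially identical thereto. ERG10 may be encoded by a polynucleotide comprising the nucleic acid sequence of SEQ ID NO: 206 or a nucleic acid sequence substantially identical thereto.
7α-Hydroxysteroid dehydrogenase (abbreviation and gene name: 7α-HSD) catalyzes the conversion of CDC-CoA to 3α-hydroxy-7-oxo-5β-cholan-24-oyl-CoA. 7α-HSD may comprise the amino acid sequence of any one of SEQ ID NOs: 207, 209, 211, or 213, or an amino acid sequence substantially identical to any of the foregoing. 7α-HSD may be encoded by a polynucleotide comprising the nucleic acid sequence of any one of SEQ ID NOs: 208, 210, 212, or 214, or a nucleic acid sequence substantially identical to any of the foregoing.
7β-Hydroxysteroid dehydrogenase (abbreviation and gene name: 7β-HSD) catalyzes the conversion of 3α-hydroxy-7-oxo-5β-cholan-24-oyl-CoA to 3α,7β-dihydroxy-5β-cholan-24-oyl-CoA. 7β-HSD may comprise the amino acid sequence of any one of SEQ ID NOs: 215, 217, 219, or 221, or an amino acid sequence substantially identical to any of the foregoing. 7β-HSD may be encoded by a polynucleotide comprising the nucleic acid sequence of any one of SEQ ID NOs: 216, 218, 220, or 222, or a nucleic acid sequence substantially identical to any of the foregoing.
Choloyl-CoA hydrolase catalyzes the conversion of 3α,7β-dihydroxy-5β-cholan-24-oyl-CoA to UDCA. Choloyl-CoA hydrolase also catalyzes the conversion of 3α,7α,12α-trihydroxy-5β-cholan-24-oyl-CoA to cholic acid. Choloyl-CoA hydrolase may comprise the amino acid sequence of any one of SEQ ID NOs: 223, 225, 227, or 229, or an amino acid sequence substantially identical to any of the foregoing. Choloyl-CoA hydrolase may be encoded by a polynucleotide comprising the nucleic acid sequence of any one of SEQ ID NOs: 224, 226, 228, or 230, or a nucleic acid sequence substantially identical to any of the foregoing. In some cases, the choloyl-CoA hydrolase has EC number 3.1.2.27.
Aldo-keto reductase family 1 member C9 (abbreviation and gene name: AKR1C9) may comprise the amino acid sequence of SEQ ID NO: 97 or an amino acid sequence substantially identical thereto. AKR1C9 may be encoded by a polynucleotide comprising the nucleic acid sequence of SEQ ID NO: 98 or a nucleic acid sequence substantially identical thereto.
Bile acid-CoA:amino acid N-acyltransferase (abbreviation: N-acyltransferase) catalyzes the conversion of 3α,7β-dihydroxy-5β-cholan-24-oyl-CoA to glycyl-ursodeoxycholic acid (glycyl-UDCA). N-acyltransferase may comprise the amino acid sequence of any one of SEQ ID NOs: 232, 234, 236, or 238, or an amino acid sequence substantially identical to any of the foregoing. N-acyltransferase may be encoded by a polynucleotide comprising the nucleic acid sequence of any one of SEQ ID NOs: 232, 234, 236, or 238, or a nucleic acid sequence substantially identical to any of the foregoing.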
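The enzyme-by-enzyme description above amounts to an ordered list of (substrate, enzyme, product) steps. As an illustrative sketch only, the cholesterol-to-UDCA branch can be written as a small Python data structure, which makes it easy to ask which enzymes are needed between two intermediates. The names `UDCA_PATHWAY` and `enzymes_between` are hypothetical; yeast homologs (FAT1, POX1, FOX2, POT1, ERG10) and the cholic acid branch are omitted for brevity.

```python
# Ordered steps of the cholesterol -> UDCA branch, as described in the text.
UDCA_PATHWAY = [
    ("cholesterol", "CYP7A1", "7α-hydroxycholesterol"),
    ("7α-hydroxycholesterol", "HSD3B7", "7α-hydroxy-4-cholesten-3-one"),
    ("7α-hydroxy-4-cholesten-3-one", "AKR1D1", "7α-hydroxy-5β-cholestan-3-one"),
    ("7α-hydroxy-5β-cholestan-3-one", "AKR1C4", "5β-cholestane-3α,7α-diol"),
    ("5β-cholestane-3α,7α-diol", "CYP27A1",
     "(25R)-3α,7α-dihydroxy-5β-cholestanoic acid"),
    ("(25R)-3α,7α-dihydroxy-5β-cholestanoic acid", "SLC27A5",
     "(25R)-3α,7α-dihydroxy-5β-cholestanoyl-CoA"),
    ("(25R)-3α,7α-dihydroxy-5β-cholestanoyl-CoA", "AMACR",
     "(25S)-3α,7α-dihydroxy-5β-cholestanoyl-CoA"),
    ("(25S)-3α,7α-dihydroxy-5β-cholestanoyl-CoA", "ACOX2",
     "(24E)-3α,7α-dihydroxy-5β-cholest-24-enoyl-CoA"),
    ("(24E)-3α,7α-dihydroxy-5β-cholest-24-enoyl-CoA", "HSD17B4",
     "3α,7α-dihydroxy-24-oxo-5β-cholestanoyl-CoA"),
    ("3α,7α-dihydroxy-24-oxo-5β-cholestanoyl-CoA", "SCP2", "CDC-CoA"),
    ("CDC-CoA", "7α-HSD", "3α-hydroxy-7-oxo-5β-cholan-24-oyl-CoA"),
    ("3α-hydroxy-7-oxo-5β-cholan-24-oyl-CoA", "7β-HSD",
     "3α,7β-dihydroxy-5β-cholan-24-oyl-CoA"),
    ("3α,7β-dihydroxy-5β-cholan-24-oyl-CoA", "choloyl-CoA hydrolase", "UDCA"),
]


def enzymes_between(start, end, steps=UDCA_PATHWAY):
    """Return the enzymes needed to convert `start` into `end`, in order."""
    substrates = [s for s, _, _ in steps]
    products = [p for _, _, p in steps]
    if start not in substrates or end not in products:
        raise ValueError("compound is not part of this pathway")
    first, last = substrates.index(start), products.index(end)
    if last < first:
        raise ValueError("end compound precedes start compound")
    return [enzyme for _, enzyme, _ in steps[first:last + 1]]


final_steps = enzymes_between("CDC-CoA", "UDCA")
```

Here `final_steps` lists the three enzymes of the CDC-CoA to UDCA segment, and the full list has thirteen entries, consistent with the 13-step pathway of FIG. 1.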
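The invention also contemplates the use of fragments of any of the foregoing enzymes. In certain embodiments, the fragment is one that retains the desired biological activity of the corresponding full-length enzyme. Such fragments are referred to herein as "biologically active" fragments.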
用于本发明的DHCR7的生物活性片段可以是保留催化胆甾-5,7,24-三烯醇向链甾醇转化的能力的生物活性片段。用于本发明的DHCR24的生物活性片段可以是保留催化链甾醇向胆固醇转化的能力的生物活性片段。用于本发明的CYP7A1的生物活性片段可以是保留催化胆固醇向7-α-羟基胆固醇转化的能力的生物活性片段。用于本发明的HSD3B7的生物活性片段可以是保留催化7-α-羟基胆固醇向7α-羟基-4-胆甾烯-3-酮转化的能力的生物活性片段。用于本发明的CYP8B1的生物活性片段可以是保留催化7α-羟基-4-胆甾烯-3-酮向7α,12α-二羟基-4-胆甾烯-3-酮转化的能力的生物活性片段。用于本发明的AKR1D1的生物活性片段可以是保留催化7α-羟基-4-胆甾烯-3-酮向7α-羟基-5β-胆甾烷-3-酮转化和/或7α,12α-二羟基-4-胆甾烯-3-酮向7α,12α-二羟基-5β-胆甾烷-3-酮转化的能力的生物活性片段。用于本发明的AKR1C4的生物活性片段可以是保留催化7α-羟基-5β-胆甾烷-3-酮向5β-胆甾烷-3α,7α-二醇转化和/或7α,12α-二羟基-5β-胆甾烷-3-酮向5β-胆甾烷-3α,7α,12α-三醇转化的能力的生物活性片段。用于本发明的CYP27A1的生物活性片段可以是保留催化5β-胆甾烷-3α,7α-二醇向(25R)-3α,7α-二羟基-5β-胆甾烷酸转化和/或5β-胆甾烷-3α,7α,12α-三醇向(25R)-3α,7α,12α-三羟基-5β-胆甾烷-26-酸转化的能力的生物活性片段。用于本发明的SLC27A5或FAT 1的生物活性片段可以是保留催化(25R)-3α,7α-二羟基-5β-胆甾烷酸向(25R)-3α,7α-二羟基-5β-胆甾烷酰基-CoA转化和/或(25R)-3α,7α,12α-三羟基-5β-胆甾烷-26-酸向(25R)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA转化的能力的生物活性片段。用于本发明的AMACR的生物活性片段可以是保留催化(25R)-3α,7α-二羟基-5β-胆甾烷酰基-CoA向(25S)-3α,7α-二羟基-5β-胆甾烷酰基-CoA转化和/或(25R)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA向(25S)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA转化的能力的生物活性片段。用于本发明的ACOX2或POX1的生物活性片段可以是保留催化(25S)-3α,7α-二羟基-5β-胆甾烷酰基-CoA向(24E)-3α,7α-二羟基-5β-胆甾-24-烯酰基-CoA转化和/或(25S)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA向(24E)-3α,7α,12α-三羟基-5β-胆甾-24-烯酰基-CoA转化的能力的生物活性片段。用于本发明的HSD17B4或FOX2的生物活性片段可以是保留催化(24E)-3α,7α-二羟基-5β-胆甾-24-烯酰基-CoA向3α,7α-二羟基-24-氧-5β-胆甾烷酰基-CoA转化和/或(24E)-3α,7α,12α-三羟基-5β-胆甾-24-烯酰基-CoA向3α,7α,12α-三羟基-24-氧-5β-胆甾烷酰基-CoA转化的能力的生物活性片段。用于本发明的SCP2、POT1或ERG10的生物活性片段可以是保留催化3α,7α-二羟基-24-氧-5β-胆甾烷酰基-CoA向CDC-CoA转化和/或3α,7α,12α-三羟基-24-氧-5β-胆甾烷酰基-CoA向3α,7α,12α-三羟基-5β-胆烷-24-酰基-CoA转化的能力的生物活性片段。用于本发明的7α-HSD的生物活性片段可以是保留催化CDC-CoA向3α-羟基-7-氧-5β-胆烷-24-酰基-CoA转化的能力的生物活性片段。用于本发明的7β-HSD的生物活性片段可以是保留催化3α-羟基-7-氧-5β-胆烷-24-酰基-CoA向3α,7β-二羟基-5β-胆烷-24-酰基-CoA转化的能力的生物活性片段。用于本发明的胆酰-CoA水解酶的生物活性片段可以是保留催化3α,7β-二羟基-5β-胆烷-24-酰基-CoA向UDCA转化和/或3α,7α,12α-三羟基-5β-胆烷-24-酰基-CoA向胆酸转化的能力的生物活性片段。用于本发明的具有生物活性的N-酰基转移酶的片段可以是保留催化3α,7β-二羟基-5β-胆烷-24-酰基-CoA向甘氨酰-UDCA转化的能力的生物活性片段。
遗传修饰的细胞
本发明部分地涉及能够产生UDCA、胆酸和/或另一种UDCA前体的遗传修饰的细胞。遗传修饰的细胞可以用于在发酵罐中发酵UDCA、胆酸和/或UDCA前体。
在某些实施方案中,该细胞包含至少一种参与产生UDCA、胆酸和/或另一种UDCA前体的生物合成途径(例如先前描述的途径)的异源酶或其生物活性片段。在某些实施方案中,该细胞包含两种或更多种、三种或更多种、四种或更多种、五种或更多种、六种或更多种、七种或更多种、八种或更多种、九种或更多种、十种或更多种、十一种或更多种、十二种或更多种、十三种或更多种、十四种或更多种、十五种或更多种或十六种或更多种这样的酶和/或其生物活性片段。在某些这样的实施方案中,这些酶或其生物活性片段沿生物合成途径可操作地连接。该异源酶可以是例如DHCR7、DHCR24、CYP7A1、HSD3B7、CYP8B1、AKR1D1、AKR1C4、CYP27A1、SLC27A5、AMACR、ACOX2、HSD17B4、SCP2、7α-HSD、7β-HSD、胆酰-CoA水解酶、AKR1C9或N-酰基转移酶。该细胞可以包含具有如先前描述的相应酶的氨基酸序列的酶。
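In embodiments in which the cell comprises a heterologous DHCR7, the enzyme may comprise the amino acid sequence of any one of SEQ ID NOs: 1, 3, 5, 7, 9, or 11, or an amino acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous DHCR24, the enzyme may comprise the amino acid sequence of any one of SEQ ID NOs: 13, 17, 21, 25, 29, 33, 37, 41, 43, 45, or 47, or an amino acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous CYP7A1, the enzyme may comprise the amino acid sequence of any one of SEQ ID NOs: 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, or 79, or an amino acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous HSD3B7, the enzyme may comprise the amino acid sequence of any one of SEQ ID NOs: 81, 83, 85, or 87, or an amino acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous AKR1D1, the enzyme may comprise the amino acid sequence of any one of SEQ ID NOs: 89, 91, 93, or 95, or an amino acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous CYP8B1, the enzyme may comprise the amino acid sequence of any one of SEQ ID NOs: 265, 267, 269, 271, 273, 275, or 277, or an amino acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous AKR1C4, the enzyme may comprise the amino acid sequence of any one of SEQ ID NOs: 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, or 121, or an amino acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous CYP27A1, the enzyme may comprise the amino acid sequence of any one of SEQ ID NOs: 123, 125, 127, 129, 131, 133, 135, or 137, or an amino acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous SLC27A5, the enzyme may comprise the amino acid sequence of SEQ ID NO: 139 or 141, or an amino acid sequence substantially identical to either of the foregoing.
In embodiments in which the cell comprises a heterologous FAT1, the enzyme may comprise the amino acid sequence of SEQ ID NO: 143 or an amino acid sequence substantially identical thereto.
In embodiments in which the cell comprises a heterologous AMACR, the enzyme may comprise the amino acid sequence of any one of SEQ ID NOs: 145, 147, 149, 151, 153, 155, or 157, or an amino acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous ACOX2, the enzyme may comprise the amino acid sequence of any one of SEQ ID NOs: 159, 161, 163, 165, 167, 169, 171, or 173, or an amino acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous POX1, the enzyme may comprise the amino acid sequence of SEQ ID NO: 175 or an amino acid sequence substantially identical thereto.
In embodiments in which the cell comprises a heterologous HSD17B4, the enzyme may comprise the amino acid sequence of any one of SEQ ID NOs: 177, 179, 181, 183, 185, 187, 189, or 191, or an amino acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous FOX2, the enzyme may comprise the amino acid sequence of SEQ ID NO: 193 or an amino acid sequence substantially identical thereto.
In embodiments in which the cell comprises a heterologous SCP2, the enzyme may comprise the amino acid sequence of any one of SEQ ID NOs: 195, 197, 199, or 201, or an amino acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous POT1, the enzyme may comprise the amino acid sequence of SEQ ID NO: 203 or an amino acid sequence substantially identical thereto.
In embodiments in which the cell comprises a heterologous ERG10, the enzyme may comprise the amino acid sequence of SEQ ID NO: 205 or an amino acid sequence substantially identical thereto.
In embodiments in which the cell comprises a heterologous 7α-HSD, the enzyme may comprise the amino acid sequence of any one of SEQ ID NOs: 207, 209, 211, or 213, or an amino acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous 7β-HSD, the enzyme may comprise the amino acid sequence of any one of SEQ ID NOs: 215, 217, 219, or 221, or an amino acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous choloyl-CoA hydrolase, the enzyme may comprise the amino acid sequence of any one of SEQ ID NOs: 223, 225, 227, or 229, or an amino acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous AKR1C9, the enzyme may comprise the amino acid sequence of SEQ ID NO: 97 or an amino acid sequence substantially identical thereto.
In embodiments in which the cell comprises a heterologous N-acyltransferase, the enzyme may comprise the amino acid sequence of any one of SEQ ID NOs: 232, 234, 236, or 238, or an amino acid sequence substantially identical to any of the foregoing.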
在某些实施方案中,该细胞包含至少一种编码参与产生UDCA、胆酸和/或另一种UDCA前体的生物合成途径(例如先前描述的途径)的酶或其生物活性片段的异源多核苷酸。在某些实施方案中,该细胞包含两种或更多种、三种或更多种、四种或更多种、五种或更多种、六种或更多种、七种或更多种、八种或更多种、九种或更多种、十种或更多种、十一种或更多种、十二种或更多种、十三种或更多种、十四种或更多种、十五种或更多种或十六种或更多种这样的多核苷酸。该异源多核苷酸可以例如编码DHCR7、DHCR24、CYP7A1、HSD3B7、CYP8B1、AKR1D1、AKR1C4、CYP27A1、SLC27A5、AMACR、ACOX2、HSD17B4、SCP2、7α-HSD、7β-HSD和/或胆酰-CoA水解酶和/或这样的酶的生物活性片段。在某些这样的实施方案中,这些酶和/或其生物活性片段沿生物合成途径可操作地连接。
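In embodiments in which the cell comprises a heterologous polynucleotide encoding DHCR7, the polynucleotide may comprise the nucleic acid sequence of any one of SEQ ID NOs: 2, 4, 6, 8, 10, or 12, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous polynucleotide encoding DHCR24, the polynucleotide may comprise the nucleic acid sequence of any one of SEQ ID NOs: 14, 15, 16, 18, 19, 20, 22, 23, 24, 26, 27, 28, 30, 31, 32, 34, 35, 36, 38, 39, 40, 42, 44, 46, or 48, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous polynucleotide encoding CYP7A1, the polynucleotide may comprise the nucleic acid sequence of any one of SEQ ID NOs: 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, or 80, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous polynucleotide encoding HSD3B7, the polynucleotide may comprise the nucleic acid sequence of any one of SEQ ID NOs: 82, 84, 86, or 88, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous polynucleotide encoding CYP8B1, the polynucleotide may comprise the nucleic acid sequence of any one of SEQ ID NOs: 266, 268, 270, 272, 274, 276, or 278, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous polynucleotide encoding AKR1D1, the polynucleotide may comprise the nucleic acid sequence of any one of SEQ ID NOs: 90, 92, 94, or 96, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous polynucleotide encoding AKR1C4, the polynucleotide may comprise the nucleic acid sequence of any one of SEQ ID NOs: 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, or 122, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous polynucleotide encoding CYP27A1, the polynucleotide may comprise the nucleic acid sequence of any one of SEQ ID NOs: 124, 126, 128, 130, 132, 134, 136, or 138, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous polynucleotide encoding SLC27A5, the polynucleotide may comprise the nucleic acid sequence of SEQ ID NO: 140 or 142, or a nucleic acid sequence substantially identical to either of the foregoing.
In embodiments in which the cell comprises a heterologous polynucleotide encoding FAT1, the polynucleotide may comprise the nucleic acid sequence of SEQ ID NO: 144 or a nucleic acid sequence substantially identical thereto.
In embodiments in which the cell comprises a heterologous polynucleotide encoding AMACR, the polynucleotide may comprise the nucleic acid sequence of any one of SEQ ID NOs: 146, 148, 150, 152, 154, 156, or 158, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous polynucleotide encoding ACOX2, the polynucleotide may comprise the nucleic acid sequence of any one of SEQ ID NOs: 160, 162, 164, 166, 168, 170, 172, or 174, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous polynucleotide encoding POX1, the polynucleotide may comprise the nucleic acid sequence of SEQ ID NO: 176 or a nucleic acid sequence substantially identical thereto.
In embodiments in which the cell comprises a heterologous polynucleotide encoding HSD17B4, the polynucleotide may comprise the nucleic acid sequence of any one of SEQ ID NOs: 178, 180, 182, 184, 186, 188, 190, or 192, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous polynucleotide encoding FOX2, the polynucleotide may comprise the nucleic acid sequence of SEQ ID NO: 194 or a nucleic acid sequence substantially identical thereto.
In embodiments in which the cell comprises a heterologous polynucleotide encoding SCP2, the polynucleotide may comprise the nucleic acid sequence of any one of SEQ ID NOs: 196, 198, 200, or 202, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous polynucleotide encoding POT1, the polynucleotide may comprise the nucleic acid sequence of SEQ ID NO: 204 or a nucleic acid sequence substantially identical thereto.
In embodiments in which the cell comprises a heterologous polynucleotide encoding ERG10, the polynucleotide may comprise the nucleic acid sequence of SEQ ID NO: 206 or a nucleic acid sequence substantially identical thereto.
In embodiments in which the cell comprises a heterologous polynucleotide encoding 7α-HSD, the polynucleotide may comprise the nucleic acid sequence of any one of SEQ ID NOs: 208, 210, 212, or 214, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous polynucleotide encoding 7β-HSD, the polynucleotide may comprise the nucleic acid sequence of any one of SEQ ID NOs: 216, 218, 220, or 222, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous polynucleotide encoding choloyl-CoA hydrolase, the polynucleotide may comprise the nucleic acid sequence of any one of SEQ ID NOs: 224, 226, 228, or 230, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the cell comprises a heterologous polynucleotide encoding AKR1C9, the polynucleotide may comprise the nucleic acid sequence of SEQ ID NO: 98 or a nucleic acid sequence substantially identical thereto.
In embodiments in which the cell comprises a heterologous polynucleotide encoding N-acyltransferase, the polynucleotide may comprise the nucleic acid sequence of any one of SEQ ID NOs: 232, 234, 236, or 238, or a nucleic acid sequence substantially identical to any of the foregoing.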
在某些实施方案中,该多核苷酸编码两种或更多种、三种或更多种、四种或更多种、五种或更多种、六种或更多种、七种或更多种、八种或更多种、九种或更多种、十种或更多种、十一种或更多种、十二种或更多种、十三种或更多种、十四种或更多种、十五种或更多种或十六种或更多种这样的酶和/或其生物活性片段。在某些这样的实施方案中,这些酶或其生物活性片段沿生物合成途径可操作地连接。
在某些实施方案中,该细胞包含至少一种异源酶或其生物活性片段,所述异源酶或其生物活性片段能够催化以下转化中的至少一种:胆甾-5,7,24-三烯醇向链甾醇的转化;链甾醇向胆固醇的转化;胆固醇向7-α-羟基胆固醇的转化;7-α-羟基胆固醇向7α-羟基-4-胆甾烯-3-酮的转化;7α-羟基-4-胆甾烯-3-酮向7α-羟基-5β-胆甾烷-3-酮的转化;7α-羟基-5β-胆甾烷-3-酮向5β-胆甾烷-3α,7α-二醇的转化;5β-胆甾烷-3α,7α-二醇向(25R)-3α,7α-二羟基-5β-胆甾烷酸的转化;(25R)-3α,7α-二羟基-5β-胆甾烷酸向(25R)-3α,7α-二羟基-5β-胆甾烷酰基-CoA的转化;(25R)-3α,7α-二羟基-5β-胆甾烷酰基-CoA向(25S)-3α,7α-二羟基-5β-胆甾烷酰基-CoA的转化;(25S)-3α,7α-二羟基-5β-胆甾烷酰基-CoA向(24E)-3α,7α-二羟基-5β-胆甾-24-烯酰基-CoA的转化;(24E)-3α,7α-二羟基-5β-胆甾-24-烯酰基-CoA向3α,7α-二羟基-24-氧-5β-胆甾烷酰基-CoA的转化;以及3α,7α-二羟基-24-氧-5β-胆甾烷酰基-CoA向CDC-CoA的转化。在某些实施方案中,该细胞包含至少一种编码这样的酶或其生物活性片段的异源多核苷酸。
在某些实施方案中,该细胞包含至少一种异源酶或其生物活性片段,所述异源酶或其生物活性片段催化以下转化中的至少一种:胆固醇向7-α-羟基胆固醇的转化;7-α-羟基胆固醇向7α-羟基-4-胆甾烯-3-酮的转化;7α-羟基-4-胆甾烯-3-酮向7α,12α-二羟基-4-胆甾烯-3-酮的转化;7α,12α-二羟基-4-胆甾烯-3-酮向7α,12α-二羟基-5β-胆甾烷-3-酮的转化;7α,12α-二羟基-5β-胆甾烷-3-酮向5β-胆甾烷-3α,7α,12α-三醇的转化;5β-胆甾烷-3α,7α,12α-三醇向(25R)-3α,7α,12α-三羟基-5β-胆甾烷-26-酸的转化;(25R)-3α,7α,12α-三羟基-5β-胆甾烷-26-酸向(25R)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA的转化;(25R)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA向(25S)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA的转化;(25S)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA向(24E)-3α,7α,12α-三羟基-5β-胆甾-24-烯酰基-CoA的转化;(24E)-3α,7α,12α-三羟基-5β-胆甾-24-烯酰基-CoA向3α,7α,12α-三羟基-24-氧-5β-胆甾烷酰基-CoA的转化;3α,7α,12α-三羟基-24-氧-5β-胆甾烷酰基-CoA向3α,7α,12α-三羟基-5β-胆烷-24-酰基-CoA的转化;以及3α,7α,12α-三羟基-5β-胆烷-24-酰基-CoA向胆酸的转化。在某些实施方案中,该细胞包含至少一种编码这样的酶或其生物活性片段的异源多核苷酸。
在某些实施方案中,该细胞包含至少一种异源酶或其生物活性片段,所述异源酶或其生物活性片段催化以下转化中的至少一种:CDC-CoA向3α-羟基-7-氧-5β-胆烷-24-酰基-CoA的转化;3α-羟基-7-氧-5β-胆烷-24-酰基-CoA向3α,7β-二羟基-5β-胆烷-24-酰基-CoA的转化;以及3α,7β-二羟基-5β-胆烷-24-酰基-CoA向UDCA的转化。在某些实施方案中,该细胞包含至少一种编码这样的酶或其生物活性片段的异源多核苷酸。
另外,水解酶或其生物活性片段可以作用于期望产物的CoA形式,以产生期望产物的游离酸形式。在一些情况下,期望产物的游离酸形式可以包括(25R)-3α,7α-二羟基-5β-胆甾烷酸、(25S)-3α,7α-二羟基-5β-胆甾烷酸、(24E)-3α,7α-二羟基-5β-胆甾-24-烯酸、3α,7α-二羟基-24-氧-5β-胆甾烷酸、3α,7α-二羟基-5β-胆烷酸(鹅去氧胆酸;CDCA)、3α-羟基-7-氧-5β-胆烷酸(海狸胆酸(nutriacholic acid);NCA)、3α,7β-二羟基-5β-胆烷酸(熊去氧胆酸;UDCA)、(25R)-3α,7α,12α-三羟基-5β-胆甾烷-26-酸、(25S)-3α,7α,12α-三羟基-5β-胆甾烷酸、(24E)-3α,7α,12α-三羟基-5β-胆甾-24-烯酸、3α,7α,12α-三羟基-24-氧-5β-胆甾烷酸、胆酸或其任何组合。
细胞也可以被工程化为表达异源酶或其生物活性片段,以改进UDCA或UDCA前体的产生。
在某些实施方案中,肾上腺皮质铁氧还蛋白还原酶(adrenodoxin reductase,ADR)或其生物活性片段可以用于改进UDCA或UDCA前体的产生。在这样的实施方案中,遗传修饰的细胞可以包含至少一种异源ADR酶或这种酶的生物活性片段。在某些实施方案中,该酶包含SEQ ID NO:239的氨基酸序列或与其基本相同的氨基酸序列。在某些实施方案中,该细胞可以包含至少一种编码ADR或其生物活性片段的异源多核苷酸。该多核苷酸可以包括SEQ ID NO:240的核酸序列或具有与其基本相同的核苷酸序列的多核苷酸。
在某些实施方案中,肾上腺皮质铁氧还蛋白(ADX)或其生物活性片段可以用于改进UDCA或UDCA前体的产生。在这样的实施方案中,遗传修饰的细胞可以包含至少一种异源ADX酶或这种酶的生物活性片段。在某些实施方案中,该酶包含SEQ ID NO:241、243、245、247、249、251、253、255、257、259或261中任一个的氨基酸序列或与任一前述序列基本相同的氨基酸序列。在某些实施方案中,该细胞可以包含至少一种编码ADX或其生物活性片段的异源多核苷酸。该多核苷酸可以包括SEQ ID NO:242、244、246、248、250、252、254、256、258、260或262中任一个的核酸序列或与任一前述序列具有基本相同的核苷酸序列的多核苷酸。
在某些实施方案中,截短型(truncated)HMG或其生物活性片段可以用于改进UDCA或UDCA前体的产生。在这样的实施方案中,遗传修饰的细胞可以包含至少一种截短型HMG或这种酶的生物活性片段。在某些实施方案中,该酶包含SEQ ID NO:263的氨基酸序列或与其基本相同的氨基酸序列。在某些实施方案中,该细胞可以包含至少一种编码截短型HMG或其生物活性片段的异源多核苷酸。该多核苷酸可以包括SEQ ID NO:264的核酸序列或具有与其基本相同的核苷酸序列的多核苷酸。
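In certain embodiments, the amino acid sequence of the enzyme is optimized to correspond to amino acid usage within the host cell.
In certain embodiments, the nucleic acid sequence of the polynucleotide is codon-optimized for use within the host cell.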
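Codon optimization can be pictured as re-encoding a protein sequence with the codons a given host is assumed to prefer. The Python sketch below shows only the mechanism. The table `PREFERRED_CODONS` is a hypothetical placeholder, not measured codon usage for any organism: each entry is a valid codon for its amino acid under the standard genetic code, but the choice among synonymous codons is arbitrary and would in practice come from the host's codon usage data. The function name `codon_optimize` is likewise hypothetical.

```python
# Hypothetical preferred-codon table (placeholder choices, NOT real usage data).
# M and W have a single codon each; the other entries pick one synonymous codon.
PREFERRED_CODONS = {
    "M": "ATG",
    "W": "TGG",
    "K": "AAG",
    "L": "TTG",
    "E": "GAA",
    "*": "TAA",  # stop
}


def codon_optimize(protein: str, table=PREFERRED_CODONS) -> str:
    """Back-translate a protein using one preferred codon per amino acid."""
    codons = []
    for residue in protein:
        if residue not in table:
            raise ValueError(f"no preferred codon defined for {residue!r}")
        codons.append(table[residue])
    return "".join(codons)


dna = codon_optimize("MKLEW*")
```

The encoded protein is unchanged by this procedure; only the choice among synonymous codons differs from one host to another.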
通篇公开的酶可以来自微生物。例如,酶可以来自细菌、古细菌、真菌、原生动物、藻类和/或病毒。酶也可以来自动物,诸如哺乳动物,例如智人和小家鼠,或来自植物,诸如拟南芥属(Arabidopsis)。
通篇描述的酶或其片段在一些情况下也可以融合或连接在一起。任何片段接头可以用于将两种或更多种酶或其片段连接在一起。在一些情况下,接头可以是氨基酸序列的任何随机阵列。
在某些实施方案中,细胞是微生物或微生物的一部分或是植物、动物或真菌的一部分。微生物可以是酵母、藻类或细菌。微生物可以是原核或真核的。在某些实施方案中,微生物是细菌或酵母。例如,微生物可以是酿酒酵母(Saccharomyces cerevisiae)、解脂耶氏酵母(Yarrowia lipolytica)或大肠杆菌或通篇公开的任何其他细胞。
在某些实施方案中,微生物是酵母。可以使用的酵母的实例包括来自酵母(Saccharomyces)属的那些酵母。在某些实施方案中,酵母属于物种酿酒酵母。
如果遗传修饰的微生物是细菌,则细菌可以来自埃希氏菌属(Escherichia),例如,大肠杆菌。
在某些实施方案中,细胞天然不能够产生UDCA、胆酸和/或其他UDCA前体,或者以低于期望的量产生UDCA、胆酸和/或其他UDCA前体。通过实施本文描述的遗传修饰,可以对细胞进行修饰,使得其中UDCA、胆酸和/或其他UDCA前体的水平高于对应的未修饰的细胞中UDCA、胆酸和/或其他UDCA前体的水平。
在某些实施方案中,细胞天然能够催化产生UDCA、胆酸和/或其他UDCA前体所必需的反应中的一些,但不是全部。例如,细胞可以天然能够催化前述产生UDCA、胆酸和/或其他UDCA前体的生物合成途径中的转化中的一些,但不是全部。
在某些实施方案中,细胞天然能够产生可以用于产生UDCA、胆酸和/或其他UDCA前体的底物。然而,该细胞不是天然地能够产生UDCA、胆酸和/或其他UDCA前体。在这样的实施方案中,遗传修饰可以用于允许细胞将底物转化为UDCA、CDCA、CDC-CoA、胆酸或其他UDCA前体。
在某些实施方案中,遗传修饰的细胞不能产生可以用于产生UDCA、胆酸和/或其他UDCA前体的底物。在这样的实施方案中,可以向细胞提供底物,例如作为细胞生长培养基的一部分。然后细胞可以将该底物转化为UDCA、胆酸和/或其他UDCA前体。
在一些情况下,遗传修饰的微生物可以从一种或更多种底物产生UDCA或UDCA前体,诸如CDC-CoA或胆酸。
分离的多核苷酸
本发明部分地涉及编码参与产生UDCA、胆酸和/或另一种UDCA前体的生物合成途径的酶的分离的多核苷酸。换言之,基因可以呈自然界中不存在的形式,从染色体中分离出来。分离的多核苷酸可以编码至少一种前述酶,并且可以包含任一种编码这种酶的相应序列。
分离的多核苷酸可以插入所使用的细胞/微生物的基因组中。在一些情况下,分离的多核苷酸被插入基因组的特定基因座处,在此处分离的多核苷酸可以以足够的量表达。
在某些实施方案中,分离的多核苷酸编码至少一种酶或其生物活性片段,所述酶或其生物活性片段能够催化以下转化中的至少一种:胆甾-5,7,24-三烯醇向链甾醇的转化;链甾醇向胆固醇的转化;胆固醇向7-α-羟基胆固醇的转化;7-α-羟基胆固醇向7α-羟基-4-胆甾烯-3-酮的转化;7α-羟基-4-胆甾烯-3-酮向7α-羟基-5β-胆甾烷-3-酮的转化;7α-羟基-5β-胆甾烷-3-酮向5β-胆甾烷-3α,7α-二醇的转化;5β-胆甾烷-3α,7α-二醇向(25R)-3α,7α-二羟基-5β-胆甾烷酸的转化;(25R)-3α,7α-二羟基-5β-胆甾烷酸向(25R)-3α,7α-二羟基-5β-胆甾烷酰基-CoA的转化;(25R)-3α,7α-二羟基-5β-胆甾烷酰基-CoA向(25S)-3α,7α-二羟基-5β-胆甾烷酰基-CoA的转化;(25S)-3α,7α-二羟基-5β-胆甾烷酰基-CoA向(24E)-3α,7α-二羟基-5β-胆甾-24-烯酰基-CoA的转化;(24E)-3α,7α-二羟基-5β-胆甾-24-烯酰基-CoA向3α,7α-二羟基-24-氧-5β-胆甾烷酰基-CoA的转化;3α,7α-二羟基-24-氧-5β-胆甾烷酰基-CoA向CDC-CoA的转化。
在某些实施方案中,分离的多核苷酸编码至少一种酶或其生物活性片段,所述酶或其生物活性片段催化以下转化中的至少一种:胆固醇向7-α-羟基胆固醇的转化;7-α-羟基胆固醇向7α-羟基-4-胆甾烯-3-酮的转化;7α-羟基-4-胆甾烯-3-酮向7α,12α-二羟基-4-胆甾烯-3-酮的转化;7α,12α-二羟基-4-胆甾烯-3-酮向7α,12α-二羟基-5β-胆甾烷-3-酮的转化;7α,12α-二羟基-5β-胆甾烷-3-酮向5β-胆甾烷-3α,7α,12α-三醇的转化;5β-胆甾烷-3α,7α,12α-三醇向(25R)-3α,7α,12α-三羟基-5β-胆甾烷-26-酸的转化;(25R)-3α,7α,12α-三羟基-5β-胆甾烷-26-酸向(25R)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA的转化;(25R)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA向(25S)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA的转化;(25S)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA向(24E)-3α,7α,12α-三羟基-5β-胆甾-24-烯酰基-CoA的转化;(24E)-3α,7α,12α-三羟基-5β-胆甾-24-烯酰基-CoA向3α,7α,12α-三羟基-24-氧-5β-胆甾烷酰基-CoA的转化;3α,7α,12α-三羟基-24-氧-5β-胆甾烷酰基-CoA向3α,7α,12α-三羟基-5β-胆烷-24-酰基-CoA的转化;以及3α,7α,12α-三羟基-5β-胆烷-24-酰基-CoA向胆酸的转化。
在某些实施方案中,分离的多核苷酸编码至少一种酶或其生物活性片段,所述酶或其生物活性片段催化以下转化中的至少一种:CDC-CoA向3α-羟基-7-氧-5β-胆烷-24-酰基-CoA的转化;3α-羟基-7-氧-5β-胆烷-24-酰基-CoA向3α,7β-二羟基-5β-胆烷-24-酰基-CoA的转化;以及3α,7β-二羟基-5β-胆烷-24-酰基-CoA向UDCA的转化。
在某些实施方案中,分离的多核苷酸编码DHCR7、DHCR24、CYP7A1、HSD3B7、CYP8B1、AKR1D1、AKR1C4、CYP27A1、SLC27A5、AMACR、ACOX2、HSD17B4、SCP2、7α-HSD、7β-HSD、胆酰-CoA水解酶、AKR1C9和/或N-酰基转移酶和/或这样的酶的生物活性片段。
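In embodiments in which the isolated polynucleotide encodes DHCR7, the isolated polynucleotide comprises the nucleic acid sequence of any one of SEQ ID NO: 2, 4, 6, 8, 10, or 12, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the isolated polynucleotide encodes DHCR24, the isolated polynucleotide comprises the nucleic acid sequence of any one of SEQ ID NO: 14, 15, 16, 18, 19, 20, 22, 23, 24, 26, 27, 28, 30, 31, 32, 34, 35, 36, 38, 39, 40, 42, 44, 46, or 48, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the isolated polynucleotide encodes CYP7A1, the isolated polynucleotide comprises the nucleic acid sequence of any one of SEQ ID NO: 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, or 80, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the isolated polynucleotide encodes HSD3B7, the isolated polynucleotide comprises the nucleic acid sequence of any one of SEQ ID NO: 82, 84, 86, or 88, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the isolated polynucleotide encodes CYP8B1, the isolated polynucleotide comprises the nucleic acid sequence of any one of SEQ ID NO: 266, 268, 270, 272, 274, 276, or 278, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the isolated polynucleotide encodes AKR1D1, the isolated polynucleotide comprises the nucleic acid sequence of any one of SEQ ID NO: 90, 92, 94, or 96, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the isolated polynucleotide encodes AKR1C4, the isolated polynucleotide comprises the nucleic acid sequence of any one of SEQ ID NO: 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, or 122, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the isolated polynucleotide encodes CYP27A1, the isolated polynucleotide comprises the nucleic acid sequence of any one of SEQ ID NO: 124, 126, 128, 130, 132, 134, 136, or 138, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the isolated polynucleotide encodes SLC27A5, the isolated polynucleotide comprises the nucleic acid sequence of SEQ ID NO: 140 or 142, or a nucleic acid sequence substantially identical to either of the foregoing.
In embodiments in which the isolated polynucleotide encodes FAT1, the isolated polynucleotide comprises the nucleic acid sequence of SEQ ID NO: 144, or a nucleic acid sequence substantially identical thereto.
In embodiments in which the isolated polynucleotide encodes AMACR, the isolated polynucleotide comprises the nucleic acid sequence of any one of SEQ ID NO: 146, 148, 150, 152, 154, 156, or 158, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the isolated polynucleotide encodes ACOX2, the isolated polynucleotide comprises the nucleic acid sequence of any one of SEQ ID NO: 160, 162, 164, 166, 168, 170, 172, or 174, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the isolated polynucleotide encodes FOX1, the isolated polynucleotide comprises the nucleic acid sequence of SEQ ID NO: 176, or a nucleic acid sequence substantially identical thereto.
In embodiments in which the isolated polynucleotide encodes HSD17B4, the isolated polynucleotide comprises the nucleic acid sequence of any one of SEQ ID NO: 178, 180, 182, 184, 186, 188, 190, or 192, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the isolated polynucleotide encodes FOX2, the isolated polynucleotide comprises the nucleic acid sequence of SEQ ID NO: 194, or a nucleic acid sequence substantially identical thereto.
In embodiments in which the isolated polynucleotide encodes SCP2, the isolated polynucleotide comprises the nucleic acid sequence of any one of SEQ ID NO: 196, 198, 200, or 202, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the isolated polynucleotide encodes POT1, the isolated polynucleotide comprises the nucleic acid sequence of SEQ ID NO: 204, or a nucleic acid sequence substantially identical thereto.
In embodiments in which the isolated polynucleotide encodes ERG10, the isolated polynucleotide comprises the nucleic acid sequence of SEQ ID NO: 206, or a nucleic acid sequence substantially identical thereto.
In embodiments in which the isolated polynucleotide encodes 7α-HSD, the isolated polynucleotide comprises the nucleic acid sequence of any one of SEQ ID NO: 208, 210, 212, or 214, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the isolated polynucleotide encodes 7β-HSD, the isolated polynucleotide comprises the nucleic acid sequence of any one of SEQ ID NO: 216, 218, 220, or 222, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the isolated polynucleotide encodes choloyl-CoA hydrolase, the isolated polynucleotide comprises the nucleic acid sequence of any one of SEQ ID NO: 224, 226, 228, or 230, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the isolated polynucleotide encodes AKR1C9, the isolated polynucleotide comprises the nucleic acid sequence of SEQ ID NO: 98, or a nucleic acid sequence substantially identical thereto.
In embodiments in which the isolated polynucleotide encodes N-acyltransferase, the isolated polynucleotide comprises the nucleic acid sequence of any one of SEQ ID NO: 232, 234, 236, or 238, or a polynucleotide having a nucleotide sequence substantially identical to any of the foregoing.
The isolated polynucleotide may also encode at least one enzyme that improves production of UDCA, cholic acid, and/or other UDCA precursors (such as ADR, ADX, and/or truncated HMG) and/or a biologically active fragment of such an enzyme.
In embodiments in which the isolated polynucleotide encodes ADR, the isolated polynucleotide comprises the nucleic acid sequence of SEQ ID NO: 240, or a polynucleotide having a nucleotide sequence substantially identical thereto.
In embodiments in which the isolated polynucleotide encodes ADX, the isolated polynucleotide comprises the nucleic acid sequence of any one of SEQ ID NO: 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, or 262, or a polynucleotide having a nucleotide sequence substantially identical to any of the foregoing.
In embodiments in which the isolated polynucleotide encodes truncated HMG, the isolated polynucleotide comprises the nucleic acid sequence of SEQ ID NO: 264, or a polynucleotide having a nucleotide sequence substantially identical thereto.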
载体
由于通篇描述的一些酶及其生物活性片段对于一些细胞和微生物来说不是天然的,因此可以使用表达载体以在大多数微生物和细胞中表达期望的酶和/或片段。因此,本发明还部分地涉及包含如先前描述的编码酶或其生物活性片段的多核苷酸的载体,所述酶或其生物活性片段参与产生UDCA、胆酸和/或另一种UDCA前体的生物合成途径。
为了引入通篇描述的宿主细胞或微生物中而制备的载体构建体通常可以(但不总是)包含被宿主识别的复制系统(即载体)。在一些情况下,载体包含编码期望的酶或其片段的预期多核苷酸片段以及任选地可操作地连接至多肽编码区段的转录和翻译起始调控序列。表达载体可以包含,例如,复制起点或自主复制序列(ARS)、表达控制序列、启动子、增强子和必要的信息处理位点(processing information site),诸如核糖体结合位点、RNA剪接位点、多腺苷酸化位点、转录终止子序列、mRNA稳定序列、与宿主染色体DNA同源的多核苷酸和/或多克隆位点。在适当的情况下,也可以包含信号肽,例如来自同一物种或亲缘物种的分泌多肽,信号肽允许蛋白质通过细胞膜和/或停留在细胞膜中或从细胞中分泌。
表达载体可以使用已确立的技术(包括但不限于电穿孔、磷酸钙沉淀法、DEAE-葡聚糖介导的转染、脂质体介导的转染、在乙酸锂存在的情况下的热激等)稳定地引入宿主细胞中或瞬时地引入宿主细胞中。为了稳定转化,核酸通常还将包含选择性标志物,例如,若干种熟知的选择性标志物中的任一种,诸如新霉素抗性、氨苄青霉素抗性、四环素抗性、氯霉素抗性、卡那霉素抗性等。在一些实施方案中,藉以对宿主细胞进行遗传修饰的核酸是包含含有编码基因产物(例如,酶、转录因子等)的核苷酸序列的核酸的表达载体。
合适的表达载体包括但不限于杆状病毒载体、噬菌体载体、质粒、噬菌粒、黏粒、福斯质粒(fosmid)、细菌人工染色体、病毒载体(例如,基于痘苗病毒、脊髓灰质炎病毒、腺病毒、腺相关病毒、SV40、单纯疱疹病毒等的病毒载体)、基于P1的人工染色体、酵母质粒、酵母人工染色体以及任何其他对感兴趣的特定宿主(诸如酵母)特异的载体。因此,例如,编码基因产物的核酸被包含在用于表达基因产物的各种表达载体中的任一种中。这些载体包含染色体序列、非染色体序列和合成的DNA序列。
在一些情况下,载体中使用的启动子可以对化学物质敏感。例如,在化学物质存在的情况下,启动子被活化或失活。在一些情况下,化学物质可以是糖,诸如葡萄糖或半乳糖。在一些情况下,化学物质可以是铜。在一些情况下,化学物质可以是稀土金属。在一些情况下,稀土金属可以是镧或铈。在一些情况下,稀土金属可以是镨或钕。
Vectors can be constructed using standard methods (see, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, N.Y., 1989; and Ausubel et al., Current Protocols in Molecular Biology, Greene Publishing Co., N.Y., 1995).
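Manipulation of polynucleotides encoding the enzymes disclosed throughout, or biologically active fragments thereof, is typically carried out in recombinant vectors. Vectors that can be used include yeast plasmids, bacterial plasmids, bacteriophages, artificial chromosomes, episomal vectors, and gene expression vectors. A vector can be selected to accommodate a polynucleotide encoding a protein of the desired size. After the selected vector has been produced, a suitable host cell (e.g., a microorganism described herein) is transfected or transformed with the vector. Each vector contains several functional components, which generally include a cloning site and an origin of replication. In some cases, the vector contains at least one selectable marker gene. A vector can additionally have one or more of the following elements: an enhancer, a promoter, a transcription termination sequence, and/or other signal sequences. These sequence elements can be optimized for the selected host species, and can be positioned near the cloning site so that they are operably linked to the gene encoding the preselected enzyme.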
载体,包括克隆载体和表达载体,可以包含使得载体能够在一种或更多种选择的微生物中复制的多核苷酸。例如,该序列可以是使得载体能够独立于宿主染色体DNA复制的序列,并且可以包括复制起点或自主复制序列。对于各种细菌、酵母和病毒,这样的序列是熟知的。例如,来自质粒pBR322的复制起点适用于大多数革兰氏阴性菌,2微米质粒(2micron plasmid)的复制起点适用于酵母,并且各种病毒复制起点(例如,SV40、腺病毒)可用于克隆载体。
克隆载体或表达载体可以包含选择基因,也称为选择性标志物。该基因编码转化的微生物在选择性培养基中存活或生长所必需的蛋白质。因此,未被含有选择基因的载体转化的微生物将不能在培养基中存活。典型的选择基因编码这样的蛋白质:赋予对抗生素和其他毒素(例如氨苄青霉素、新霉素、氨甲蝶呤、潮霉素、卡那霉素、硫链丝菌素、阿泊拉霉素或四环素)的抗性,补充营养缺陷型缺陷,或提供生长培养基中不可得的关键营养。
载体的复制可以在大肠杆菌中进行。大肠杆菌选择性标志物的一种实例是β-内酰胺酶基因,它赋予对抗生素氨苄青霉素的抗性。这些选择性标志物可以从大肠杆菌质粒诸如pBR322或pUC质粒诸如pUC18或pUC19或pUC119获得。
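In embodiments in which the vector comprises a polynucleotide encoding DHCR7, the isolated vector may comprise the nucleic acid sequence of any one of SEQ ID NO: 2, 4, 6, 8, 10, or 12, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the vector comprises a polynucleotide encoding DHCR24, the isolated vector may comprise the nucleic acid sequence of any one of SEQ ID NO: 14, 15, 16, 18, 19, 20, 22, 23, 24, 26, 27, 28, 30, 31, 32, 34, 35, 36, 38, 39, 40, 42, 44, 46, or 48, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the vector comprises a polynucleotide encoding CYP7A1, the isolated vector may comprise the nucleic acid sequence of any one of SEQ ID NO: 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, or 80, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the vector comprises a polynucleotide encoding HSD3B7, the isolated vector may comprise the nucleic acid sequence of any one of SEQ ID NO: 82, 84, 86, or 88, or a nucleic acid sequence substantially identical to any of the foregoing.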
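In embodiments in which the vector comprises a polynucleotide encoding CYP8B1, the isolated vector may comprise the nucleic acid sequence of any one of SEQ ID NO: 266, 268, 270, 272, 274, 276, or 278, or a nucleic acid sequence substantially identical to any of the foregoing.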
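In embodiments in which the vector comprises a polynucleotide encoding AKR1D1, the isolated vector may comprise the nucleic acid sequence of any one of SEQ ID NO: 90, 92, 94, or 96, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the vector comprises a polynucleotide encoding AKR1C4, the isolated vector may comprise the nucleic acid sequence of any one of SEQ ID NO: 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, or 122, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the vector comprises a polynucleotide encoding CYP27A1, the isolated vector may comprise the nucleic acid sequence of any one of SEQ ID NO: 124, 126, 128, 130, 132, 134, 136, or 138, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the vector comprises a polynucleotide encoding SLC27A5, the isolated vector may comprise the nucleic acid sequence of SEQ ID NO: 140 or 142, or a nucleic acid sequence substantially identical to either of the foregoing.
In embodiments in which the vector comprises a polynucleotide encoding FAT1, the isolated vector may comprise the nucleic acid sequence of SEQ ID NO: 144, or a nucleic acid sequence substantially identical thereto.
In embodiments in which the vector comprises a polynucleotide encoding AMACR, the isolated vector may comprise the nucleic acid sequence of any one of SEQ ID NO: 146, 148, 150, 152, 154, 156, or 158, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the vector comprises a polynucleotide encoding ACOX2, the isolated vector may comprise the nucleic acid sequence of any one of SEQ ID NO: 160, 162, 164, 166, 168, 170, 172, or 174, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the vector comprises a polynucleotide encoding FOX1, the isolated vector may comprise the nucleic acid sequence of SEQ ID NO: 176, or a nucleic acid sequence substantially identical thereto.
In embodiments in which the vector comprises a polynucleotide encoding HSD17B4, the isolated vector may comprise the nucleic acid sequence of any one of SEQ ID NO: 178, 180, 182, 184, 186, 188, 190, or 192, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the vector comprises a polynucleotide encoding FOX2, the isolated vector may comprise the nucleic acid sequence of SEQ ID NO: 194, or a nucleic acid sequence substantially identical thereto.
In embodiments in which the vector comprises a polynucleotide encoding SCP2, the isolated vector may comprise the nucleic acid sequence of any one of SEQ ID NO: 196, 198, 200, or 202, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the vector comprises a polynucleotide encoding POT1, the isolated vector may comprise the nucleic acid sequence of SEQ ID NO: 204, or a nucleic acid sequence substantially identical thereto.
In embodiments in which the vector comprises a polynucleotide encoding ERG10, the isolated vector may comprise the nucleic acid sequence of SEQ ID NO: 206, or a nucleic acid sequence substantially identical thereto.
In embodiments in which the vector comprises a polynucleotide encoding 7α-HSD, the isolated vector may comprise the nucleic acid sequence of any one of SEQ ID NO: 208, 210, 212, or 214, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the vector comprises a polynucleotide encoding 7β-HSD, the isolated vector may comprise the nucleic acid sequence of any one of SEQ ID NO: 216, 218, 220, or 222, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the vector comprises a polynucleotide encoding choloyl-CoA hydrolase, the isolated vector may comprise the nucleic acid sequence of any one of SEQ ID NO: 224, 226, 228, or 230, or a nucleic acid sequence substantially identical to any of the foregoing.
In embodiments in which the vector comprises a polynucleotide encoding AKR1C9, the isolated vector may comprise the nucleic acid sequence of SEQ ID NO: 98, or a nucleic acid sequence substantially identical thereto.
In embodiments in which the vector comprises a polynucleotide encoding N-acyltransferase, the isolated vector may comprise the nucleic acid sequence of any one of SEQ ID NO: 232, 234, 236, or 238, or a polynucleotide having a nucleotide sequence substantially identical to any of the foregoing.
In embodiments in which the vector comprises a polynucleotide encoding ADR, the isolated vector may comprise the nucleic acid sequence of SEQ ID NO: 240, or a polynucleotide having a nucleotide sequence substantially identical thereto.
In embodiments in which the vector comprises a polynucleotide encoding ADX, the isolated vector may comprise the nucleic acid sequence of any one of SEQ ID NO: 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, or 262, or a polynucleotide having a nucleotide sequence substantially identical to any of the foregoing.
In embodiments in which the vector comprises a polynucleotide encoding truncated HMG, the isolated vector may comprise the nucleic acid sequence of SEQ ID NO: 264, or a polynucleotide having a nucleotide sequence substantially identical thereto.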
启动子
载体可以包含被宿主微生物识别的启动子。启动子可以可操作地连接至感兴趣的编码序列。这样的启动子可以是诱导型、阻遏型(repressible)或组成型的。当多核苷酸处于允许它们以其预期的方式发挥功能的关系时,多核苷酸被可操作地连接。
可以使用不同的启动子来驱动基因的表达。例如,如果期望瞬时基因表达(即,非组成型表达),则表达可以由诱导型或阻遏型启动子驱动。在一些情况下,分子开关可以包括这些诱导型或阻遏型启动子。
在一些情况下,期望的基因是瞬时表达的。换言之,期望的基因不是组成型表达的。期望的基因的表达可以由起分子开关作用的诱导型或阻遏型启动子驱动。诱导型或阻遏型开关的实例包括但不限于那些可通过以下诱导或阻遏的启动子:(a)糖,诸如葡萄糖、半乳糖、阿拉伯糖和乳糖(或不可代谢的类似物,例如,异丙基β-D-1-硫代吡喃半乳糖苷(IPTG));(b)金属,诸如铜或钙(或稀土金属,诸如镧或铈);(c)温度;(d)氮源;(e)氧;(f)细胞状态(生长或静止);(g)代谢物,诸如磷酸盐/酯;(h)CRISPRi;(i)jun;(j)fos,(k)金属硫蛋白和/或(l)热激。
可以特别有用的诱导型或阻遏型开关是响应于糖、金属离子和稀土金属的开关。例如,对阿拉伯糖、葡萄糖和/或半乳糖敏感的启动子可以用作这样的开关。在一些情况下,这样的开关可以用来驱动一种或更多种基因的表达。例如,在这样的糖存在的情况下,阿拉伯糖至半乳糖开关或葡萄糖至半乳糖开关可以开启期望基因的表达。
在特定实施方案中,开关是GAL1或GAL10启动子。这些启动子在葡萄糖存在的情况下被强烈阻遏,而葡萄糖的消耗去除阻遏,但不一定触发诱导。然而,在半乳糖存在的情况下,表达被强烈诱导。为了进一步实现高水平的表达,可以敲除编码参与半乳糖介导的转录调控的转录阻遏物的GAL80基因。
在本发明中特别有用的金属离子开关是铜敏感型开关。在一些情况下,铜开关可以是诱导型开关,当环境中存在铜时,该开关可用于“开启”一种或更多种基因的表达。在培养基中不存在铜的情况下,期望的基因的组或载体不被高表达。
其他有用的开关可以是稀土金属开关,诸如镧敏感型开关(也被简称为镧开关)。在一些情况下,镧开关可以是阻遏型开关,其可用于阻遏一种或更多种基因的表达,直到阻遏物被去除(例如,在这种情况下为镧),之后基因被“开启”。例如,在稀土金属镧存在的情况下,期望的基因的组或载体可以被“关闭”。基因的表达通过从培养基中去除镧或将培养基中的镧稀释至其中镧阻遏作用降低、最小化或消除的水平来诱导。可以使用其他稀土金属开关,诸如通篇公开的那些。
组成型表达的启动子也可用于本文的载体系统。例如,一个或更多个期望基因的表达可以由组成型活性启动子控制。这些启动子的实例包括但不限于pPGK1、pTDH3、pENO1、pTEF1、pHIS4、pUGA1、pADH1、pADH2、pGAL1、pGAL10、pGAL1/10、pXoxF、pMxaF和p.Bba.J23111。
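Promoters suitable for prokaryotic hosts can include, for example, the β-lactamase and lactose promoter systems, alkaline phosphatase, the tryptophan (trp) promoter system, the erythromycin promoter, the apramycin promoter, the hygromycin promoter, the methylenomycin promoter, and hybrid promoters such as the tac promoter. Promoters for use in bacterial systems will generally also contain a Shine-Dalgarno sequence operably linked to the coding sequence.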
适用于真核宿主的启动子可以包括,例如,半乳糖启动子、铜启动子、四环素启动子、葡萄糖阻遏型启动子诸如pGAL1和pGAL10、低葡萄糖诱导型启动子诸如pADH2和pHXT7、以及高葡萄糖诱导型启动子诸如pHXT3。这些启动子通常还将包含可操作地连接至编码序列的Kozak序列。
通常,可以使用强启动子来提供期望产物的高水平转录和表达。例如,可以使用的启动子包括但不限于pMxaF、pTDH3、pPGK1、pENO2、pTEF1、pTEF2、pADH1、pCCW12、pGAL1和pGAL10。在一些情况下,突变可以增加启动子的强度,并从而导致表达水平的提高。
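However, in some cases a weaker promoter is needed, for example where overexpression of a gene causes a deleterious effect (e.g., cell death). Weak promoters can be used, for example pPHO84, pPFK1, pCDC19, pBAD, pCLN1, pCYC1, pUGA1, pRAT1, and pPFK12. In some cases a weaker promoter can also be generated by mutation. For example, the pMxaF promoter can be mutated into a weaker promoter.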
转录单位的一个或更多个启动子可以是诱导型启动子。例如,GFP可以从组成型启动子表达,而诱导型启动子用于驱动编码如本文公开的一种或更多种酶和/或可扩增的选择性标志物的基因的转录。
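Some vectors can contain sequences that facilitate propagation of the vector in the host cell. A vector can therefore have other components, such as an origin of replication (e.g., a polynucleotide that enables the vector to replicate in one or more selected microorganisms), an antibiotic resistance gene for selection, and/or an amber stop codon (which can permit translational read-through). Additional selectable genes can also be incorporated. In cloning vectors, the origin of replication is generally a sequence that enables the vector to replicate independently of the host chromosomal DNA, and may be an origin of replication or an autonomously replicating sequence. Such sequences can include the ColE1 origin of replication in bacteria, the 2 micron origin of replication in yeast, or other known sequences.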
通篇描述的基因都可以具有驱动它们表达的启动子。本文描述的方法,例如,基因组编辑,可以用于编辑启动子的多核苷酸或用于抑制启动子的有效性。抑制可以通过以下方式实现:阻断转录机制(例如,转录因子)与启动子结合或通过以转录机制不再识别启动子序列的方式改变启动子。
制备遗传修饰的细胞的方法
本发明部分地涉及用于制备前述的遗传修饰的细胞的方法。该方法包括使细胞与至少一种异源多核苷酸接触,所述异源多核苷酸编码参与产生UDCA、胆酸和/或另一种UDCA前体的生物合成途径的酶或这种酶的生物活性片段。这样的多核苷酸如先前描述。该方法可以还包括使细胞生长,从而将异源多核苷酸插入细胞中。
在某些实施方案中,使细胞与至少两种这样的异源多核苷酸接触。在这样的实施方案中,异源多核苷酸可以编码沿该途径可操作地连接的酶和/或其片段。
在某些实施方案中,异源多核苷酸被包含在载体中,如先前描述的。
通篇公开的遗传修饰的细胞和微生物可以用各种方式制备。例如,细胞或微生物可以通过任何方法进行修饰(例如,遗传工程化),以包含和/或表达一种或更多种编码途径中的酶和/或其片段的多核苷酸。例如,通篇讨论的任何基因中的一个或更多个可以插入细胞或微生物中。基因可以通过表达载体插入。基因也可以处于一个或更多个不同/相同启动子的控制下,或者一种或更多种基因可以处于开关的控制下,诸如诱导型或阻遏型启动子,例如,阿拉伯糖开关、葡萄糖至半乳糖开关、异丙基β-D-1-硫代吡喃半乳糖苷(IPTG)开关、铜开关或稀土金属开关。基因也可以稳定地整合到微生物的基因组中。在一些情况下,基因可以以附加型载体表达。
制备本文公开的遗传修饰的细胞或微生物的示例性方法是用编码至少一种前述酶或其片段的多核苷酸接触(或转化)细胞/微生物。插入微生物中的多核苷酸对于细胞/微生物本身可以是异源的。例如,如果微生物是酵母,则插入的多核苷酸可以来自细菌或不同物种的酵母。此外,多核苷酸可以是细胞/微生物的基因组的内源部分。
在一些实施方案中,本发明的方法还包括从宿主微生物和/或培养基分离UDCA、胆酸和/或其他UDCA前体。
在某些实施方案中,将使用遗传修饰的细胞/微生物产生的UDCA前体与未修饰的细胞接触,该未修饰的细胞将UDCA前体转化为另一种UDCA前体或UDCA。
在某些实施方案中,所产生的UDCA前体不是进一步反应的底物。
通常,将遗传修饰的宿主细胞/微生物在合适的培养基中培养,该培养基任选地补充有一种或更多种另外的剂,诸如诱导剂(例如,其中编码基因产物的一条或更多条核苷酸序列处于诱导型启动子的控制下)。在一些实施方案中,培养基覆盖有形成有机层的有机溶剂,例如十二烷。在这种情况下,由遗传修饰的宿主细胞/微生物产生的UDCA、胆酸和/或其他UDCA前体可以分隔到有机层中,从有机层中可以纯化UDCA、胆酸和/或其他UDCA前体。在一些实施方案中,在一个或更多个编码基因产物的核苷酸序列可操作地连接至诱导型启动子的情况下,向培养基添加诱导剂;并且,在合适的时间后,从覆盖在培养基上的有机层中分离UDCA、胆酸和/或其他UDCA前体。
在一些实施方案中,将UDCA、胆酸和/或其他UDCA前体与可能存在于有机层中的其他产物分离。这种分离可以使用例如标准色谱技术来实现。
在一些实施方案中,UDCA、胆酸和/或其他UDCA前体是基本上纯的。
遗传修饰技术
本文公开的细胞/微生物可以通过使用经典的微生物技术进行遗传工程化。一些这样的技术通常在例如Sambrook等人,1989,Molecular Cloning:A Laboratory Manual,Cold Spring Harbor Labs Press中公开。
本文公开的遗传修饰的细胞/微生物可以包含已经被插入、缺失或修饰(即,突变;例如,通过核苷酸的插入、缺失、取代和/或倒位)的多核苷酸,其方式使得这样的修饰提供在细胞/微生物中表达(例如,过表达)如本文提供的一种或更多种酶的期望效果。导致基因表达或功能增加的遗传修饰可被称为基因的扩增、过量产生、过表达、活化、增强、添加或上调。增加基因表达的基因添加可以包括使基因保留在复制型质粒上或使克隆基因整合到生产细胞/微生物的基因组中。此外,增加期望基因的表达可以包括将克隆的基因可操作地连接至天然或异源转录控制元件。
增加期望基因表达的另一种方式可以是将基因的多于一个拷贝整合到基因组中。这可以以若干种方式实现。例如,可以将相同的克隆基因插入基因组中多于一个基因座中(通常在不同的染色体上)。可选地,可以将克隆基因的不同变体,例如不同的启动子/终止子组合,引入多于一个基因座中。除了染色体表达之外,还可以使用质粒上基因表达的组合。也可以使用随机整合技术,其中整合基因的位置和拷贝数是未知的。一种不太常用的方法可以是将基因和表达机制的串联重复序列引入单个基因座中。
在期望的情况下,本文提供的一种或更多种酶或其片段的表达处于调控序列的控制下,该调控序列在发酵期间以时间依赖性方式直接或间接控制表达。诱导型启动子可用于实现这一点。
在一些情况下,用包含编码酶或其片段的异源多核苷酸序列的遗传媒介物(诸如表达载体)转化或转染细胞/微生物。在一些情况下,载体可以是附加型载体,或基因序列可以整合到微生物基因组中,或其任何组合。在一些情况下,包含编码本文提供的酶或其片段的异源多核苷酸序列的载体整合到微生物基因组中。
为了促进编码感兴趣的酶或其片段的不同基因的插入和表达,构建体或表达载体可以设计为具有至少一个克隆位点,以用于插入编码这种酶或片段的任何基因。克隆位点可以是多克隆位点,例如,包含多于一个限制性位点。
转染和转化
标准转染技术可以用于将基因插入微生物中。如本文使用的,术语“转染”或“转化”可以指将外源核酸或多核苷酸插入宿主细胞中。外源核酸或多核苷酸可以保持为非整合载体,例如质粒或附加型载体,或者可选择地,可以整合到宿主细胞基因组中。术语转染(transfecting)或转染(transfection)旨在包括将核酸或多核苷酸引入细胞/微生物中的所有常规技术。转染技术的实例包括但不限于乙酸锂介导的转化、磷酸钙沉淀法、DEAE-葡聚糖介导的转染、脂质体转染、电穿孔、显微注射、氯化铷或聚阳离子介导的转染、原生质体融合和超声波处理。优选在特定宿主细胞系和类型中提供构建体最佳转染频率和表达的转染方法。对于稳定转染子,构建体被整合以稳定地保持在宿主染色体内。在一些情况下,优选的转染是稳定转染。在一些情况下,基因的整合发生在微生物基因组的特定基因座内。
表达载体或其他核酸可以通过许多合适的方法中的任一种引入选择的细胞/微生物中。例如,载体构建体可以通过许多转化方法中的任一种引入合适的细胞中。标准的氯化钙介导的细菌转化仍然常用于将裸DNA引入细菌中(参见例如,Sambrook等人,1989,Molecular Cloning,A Laboratory Manual,Cold Spring Harbor Laboratory Press,Cold Spring Harbor,N.Y.),但是也可以使用电穿孔和接合(参见例如,Ausubel等人,1988,Current Protocols in Molecular Biology,John Wiley&Sons,Inc.,NY,N.Y.)。
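To introduce vector constructs into yeast or other fungal cells, chemical transformation methods and electroporation methods can be used (e.g., Rose et al., 1990, Methods in Yeast Genetics, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.). Transformed cells can be isolated on selective media appropriate to the selectable marker used. Alternatively, or in addition, plates, or filters lifted from plates, can be scanned for GFP fluorescence to identify transformed clones.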
为了将包含差异表达序列的载体引入某些细胞类型中,所使用的方法可以取决于载体的形式。质粒载体可以通过许多转染方法中的任一种引入,包括例如脂质介导的转染(“脂质体转染”)、DEAE-葡聚糖介导的转染、电穿孔或磷酸钙沉淀法(参见例如,Ausubel等人,1988,Current Protocols in Molecular Biology,John Wiley&Sons,Inc.,NY,N.Y.)。
适用于瞬时转染各种各样转化细胞和非转化细胞或原代细胞的脂质体转染试剂和方法广泛可得,使得脂质体转染成为将构建体引入真核细胞并且特别是培养中的哺乳动物细胞的有吸引力的方法。许多公司提供这种类型的转染试剂盒和方法。
宿主细胞可以能够表达编码期望蛋白质的构建体,加工蛋白质并将分泌型蛋白质运送到细胞表面进行分泌。加工包括共翻译修饰和翻译后修饰,诸如前导肽裂解、GPI附接、糖基化、泛素化和二硫键形成。
可以用编码本文公开的一种或更多种酶的上述表达载体或多核苷酸转化或转染细胞/微生物,并在针对特定细胞/微生物进行适当修改的营养培养基中培养,诱导启动子,选择转化体,或扩增编码期望序列的基因。在一些情况下,电穿孔方法可用于递送表达载体。
载体(和载体中包含的基因)的表达可以通过表达测定(例如,qPCR、集落PCR、基因座测序或全基因组测序)或通过测量RNA水平来验证。表达水平也可以指示拷贝数。例如,如果表达水平非常高,这可以表明,基因的多于一个拷贝整合在基因组中。可选地,高表达可以表明,基因整合在高转录区域,例如,高表达启动子附近。表达也可以通过测量蛋白质水平(诸如通过蛋白质印迹法)来验证。
CRISPR/Cas系统
通篇公开的方法可以包括基因的精确插入或基因(或部分基因)的缺失。本文描述的方法可以使用CRISPR/Cas系统。例如,双链断裂(DSB)可以使用CRISPR/Cas系统例如II类CRISPR/Cas系统来产生。在本文公开的方法中使用的Cas酶可以是催化DNA裂解的Cas9。来自酿脓链球菌(Streptococcus pyogenes)的Cas9或任何较近亲缘Cas9的酶促作用可以在靶位点序列处产生双链断裂,所述靶位点序列与引导序列的20个核苷酸杂交,并且在靶序列的20个核苷酸之后具有前间区序列邻近基序(PAM)。
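The target-site rule described above (a 20-nucleotide sequence matched by the guide, immediately followed by a PAM) can be sketched as a simple sequence scan. This is an illustrative sketch only: it assumes the commonly cited 5'-NGG-3' PAM of S. pyogenes Cas9, which the text does not spell out, it scans only the strand it is given, and `find_cas9_targets` is a hypothetical helper name.

```python
import re


def find_cas9_targets(seq, protospacer_len=20):
    """Return (start, protospacer, pam) for each candidate site on the given strand.

    Assumption: the PAM is 5'-NGG-3' (S. pyogenes Cas9) and sits directly
    3' of the protospacer. The reverse strand is not scanned.
    """
    seq = seq.upper()
    # A lookahead is used so that overlapping candidate sites are all reported.
    pattern = re.compile(r"(?=([ACGT]{%d})([ACGT]GG))" % protospacer_len)
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(seq)]


if __name__ == "__main__":
    example = "ACGT" * 5 + "AGG" + "TT"  # made-up sequence, not from the patent
    for start, protospacer, pam in find_cas9_targets(example):
        print(start, protospacer, pam)
```

In practice candidate sites would also be screened for off-target matches elsewhere in the genome, which this sketch does not attempt.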
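The vector can be operably linked to an enzyme-coding sequence encoding a CRISPR enzyme, such as a Cas protein or Mad7. Cas proteins that can be used include class 1 and class 2. Non-limiting examples of Cas proteins include Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cas5d, Cas5t, Cas5h, Cas5a, Cas6, Cas7, Cas8, Cas9 (also known as Csn1 or Csx12), Cas10, Csy1, Csy2, Csy3, Csy4, Cse1, Cse2, Cse3, Cse4, Cse5e, Csc1, Csc2, Csa5, Csn1, Csn2, Csm1, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csx1, Csx1S, Csf1, Csf2, CsO, Csf4, Csd1, Csd2, Cst1, Cst2, Csh1, Csh2, Csa1, Csa2, Csa3, Csa4, C2c1, C2c2, C2c3, Cpf1, CARF, DinG, homologs thereof, or modified forms thereof. An unmodified CRISPR enzyme, such as Cas9, can have DNA cleavage activity. The CRISPR enzyme can direct cleavage of one or both strands at a target sequence, such as within the target sequence and/or within the complement of the target sequence. For example, the CRISPR enzyme can direct cleavage of one or both strands within 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 125, 150, 175, 200, 300, 400, 500, or more base pairs from the first or last nucleotide of the target sequence. A vector can be used that encodes a CRISPR enzyme that is mutated relative to the corresponding wild-type enzyme such that the mutated CRISPR enzyme lacks the ability to cleave one or both strands of a target polynucleotide containing the target sequence.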
可以使用编码包含一个或更多个核定位序列(NLS)的CRISPR酶的载体。例如,可以使用1个、2个、3个、4个、5个、6个、7个、8个、9个、10个NLS。CRISPR酶可以在氨基末端处或氨基末端附近包含NLS(例如,1个、2个、3个、4个、5个、6个、7个、8个、9个、10个NLS),或者在羧基末端处或羧基末端附近包含NLS(例如,1个、2个、3个、4个、5个、6个、7个、8个、9个、10个NLS),或者这些的任何组合(例如,在氨基末端处包含一个或更多个NLS,并且在羧基末端处包含一个或更多个NLS)。当存在多于一个的NLS时,每个NLS可以独立于其他NLS进行选择,使得单个NLS可以存在于多于一个拷贝中和/或与存在于一个或更多个拷贝中的一个或更多个其他NLS组合。
在方法中使用的CRISPR酶可以包含最多6个NLS。当与NLS最近的氨基酸位于沿多肽链距N末端或C末端50个氨基酸以内(例如,在1个、2个、3个、4个、5个、10个、15个、20个、25个、30个、40个或50个氨基酸以内)时,则认为NLS在N末端或C末端附近。
引导RNA
如本文使用的,术语“引导RNA”及其语法等同物是指能够特异性靶向DNA序列并与Cas蛋白形成复合物的RNA。RNA/Cas复合物可以帮助将Cas蛋白“引导”至靶DNA。
本文公开的方法还可以包括将至少一种引导RNA或编码至少一种引导RNA的核酸(例如,DNA)引入细胞或胚胎中。引导RNA可以与RNA引导的内切核酸酶相互作用,以将内切核酸酶引导至特定靶位点,在该靶位点处,引导RNA的5’末端与染色体序列中的特定前间区序列碱基配对。
引导RNA可以包含两条RNA,例如,CRISPR RNA(crRNA)和反式活化crRNA(tracrRNA)。有时,引导RNA可以包含由crRNA和tracrRNA的一部分(例如,功能部分)融合形成的单链RNA或单引导RNA(sgRNA)。引导RNA也可以是包含crRNA和tracrRNA的双重RNA(dualRNA)。此外,crRNA可以与靶DNA杂交。
如上所述,引导RNA可以是表达产物。例如,编码引导RNA的DNA可以是包含编码引导RNA的序列的载体。可以通过用分离的引导RNA或包含编码引导RNA和启动子的序列的质粒DNA转染细胞或微生物,将引导RNA转移到细胞或微生物中。引导RNA也可以以其他方式转移到细胞或微生物中,诸如使用病毒介导的基因递送。
引导RNA可以是分离的。例如,引导RNA可以以分离的RNA的形式转染到细胞或微生物中。引导RNA可以使用任何体外转录系统通过体外转录来制备。引导RNA可以以分离的RNA的形式而不是以包含编码引导RNA的序列的质粒的形式转移到细胞中。
引导RNA可以包含三个区域:可以与染色体序列中的靶位点互补的在5’末端的第一区域,可以形成茎环结构的第二内部区域,和可以是单链的第三3’区域。每个引导RNA的第一区域也可以不同,使得每个引导RNA将融合蛋白引导至特定的靶位点。此外,每个引导RNA的第二区域和第三区域在所有引导RNA中可以是相同的。
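The first region of the guide RNA can be complementary to the sequence at the target site in the chromosomal sequence, so that the first region of the guide RNA can base pair with the target site. In some cases, the first region of the guide RNA can comprise from 10 to 25 nucleotides or more. For example, the region of base pairing between the first region of the guide RNA and the target site in the chromosomal sequence can be 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or more nucleotides in length. Sometimes, the first region of the guide RNA can be 19, 20, or 21 nucleotides in length.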
引导RNA还可以包含形成二级结构的第二区域。例如,由引导RNA形成的二级结构可以包含茎(或发夹)和环。环和茎的长度可以变化。例如,环的长度范围可以是3个至10个核苷酸,并且茎的长度范围可以是6个至20个碱基对。茎可以包含一个或更多个凸起(bulge),每个凸起为1个至10个核苷酸。第二区域的总长度范围可以是16个至60个核苷酸长。例如,环的长度可以是4个核苷酸,并且茎可以是12个碱基对。
引导RNA也可以在3’末端包含第三区域,第三区域可以主要是单链的。例如,第三区域有时与感兴趣细胞中的任何染色体序列都不互补,并且有时与引导RNA的其余部分不互补。此外,第三区域的长度可以变化。第三区域的长度可以多于4个核苷酸。例如,第三区域的长度范围可以是5个至60个核苷酸。
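The length ranges given above for the three guide RNA regions can be collected into a simple design check. This is a sketch under stated assumptions: the second region is modeled as a plain hairpin with no bulges (two stem strands plus the loop), the ranges are treated as typical rather than strict limits (the text allows a longer first region), and the names `RANGES`, `GuideDesign`, and `out_of_range` are hypothetical.

```python
from dataclasses import dataclass

# Length ranges quoted in the text (nucleotides, or base pairs for the stem).
RANGES = {
    "first_region_nt": (10, 25),
    "loop_nt": (3, 10),
    "stem_bp": (6, 20),
    "second_region_nt": (16, 60),
    "third_region_nt": (5, 60),
}


@dataclass
class GuideDesign:
    first_region_nt: int
    loop_nt: int
    stem_bp: int
    third_region_nt: int

    @property
    def second_region_nt(self) -> int:
        # Assumption: simple hairpin without bulges.
        return 2 * self.stem_bp + self.loop_nt


def out_of_range(design):
    """List every region whose length falls outside the typical ranges."""
    problems = []
    for name, (low, high) in RANGES.items():
        value = getattr(design, name)
        if not low <= value <= high:
            problems.append(f"{name}={value} outside {low}-{high}")
    return problems


if __name__ == "__main__":
    # Loop of 4 nt and stem of 12 bp, as in the example given in the text.
    design = GuideDesign(first_region_nt=20, loop_nt=4, stem_bp=12, third_region_nt=30)
    print(design.second_region_nt, out_of_range(design))
```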
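The guide RNA can be introduced into a cell or embryo as an RNA molecule. For example, the RNA molecule can be transcribed in vitro and/or chemically synthesized. The RNA can be transcribed from a synthetic DNA molecule (e.g., a synthetic gene fragment). The guide RNA can then be introduced into the cell or embryo as an RNA molecule. The guide RNA can also be introduced into the cell or embryo as a non-RNA nucleic acid molecule (e.g., a DNA molecule). For example, DNA encoding the guide RNA can be operably linked to a promoter control sequence for expression of the guide RNA in the cell or embryo of interest. The RNA coding sequence can be operably linked to a promoter sequence recognized by RNA polymerase III (Pol III). Plasmid vectors that can be used to express guide RNA include, but are not limited to, the px330 vector and the px333 vector. In some cases, a plasmid vector (e.g., the px333 vector) can contain two DNA sequences encoding guide RNAs.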
编码引导RNA的DNA序列也可以是载体的一部分。此外,载体可以包含另外的表达控制序列(例如,增强子序列、Kozak序列、多腺苷酸化序列、转录终止序列等)、选择性标志物序列(例如,抗生素抗性基因)、复制起点等。编码引导RNA的DNA分子也可以是线性的。编码引导RNA的DNA分子也可以是环状的。
当将编码RNA引导的内切核酸酶和引导RNA的DNA序列引入细胞中时,每个DNA序列可以是单独分子的一部分(例如,一个载体包含RNA引导的内切核酸酶编码序列,并且第二个载体包含引导RNA编码序列),或者两者可以是同一分子的一部分(例如,一个载体包含RNA引导的内切核酸酶和引导RNA二者的编码(和调控)序列)。
位点特异性插入
基因的插入可以是位点特异性的。例如,一种或更多种基因可以插入启动子附近。基因也可以插入基因组的中性位置,诸如插入非编码区或其他处,使得野生型基因功能保持完整。
细胞/微生物的靶向位点的修饰可以通过将DNA引入细胞/微生物中来产生,其中该DNA与靶向位点具有同源性。DNA可以包含一个标志物基因,从而允许选择包含整合构建体的细胞。靶载体中的同源DNA可以与DNA在靶位点处重组。标志物基因的两侧可以侧接同源DNA序列,3’重组臂和5’重组臂。
多种酶可以催化外源DNA向微生物基因组中的插入。例如,位点特异性重组酶可以分成两个具有不同生化特性的蛋白质家族,即酪氨酸重组酶(其中DNA共价附接至酪氨酸残基)和丝氨酸重组酶(其中共价附接发生在丝氨酸残基处)。在一些情况下,重组酶可以包含Cre、ΦC31整合酶(来源于链霉菌噬菌体ΦC31的丝氨酸重组酶)或噬菌体来源的位点特异性重组酶(包括Flp、λ整合酶、噬菌体HK022重组酶、噬菌体R4整合酶和噬菌体TP901-1整合酶)。
CRISPR/Cas系统可以用于进行位点特异性插入。例如,可以通过CRISPR/Cas在基因组中的插入位点上产生切口,以促进在插入位点处的转基因插入。
本文描述的方法可以利用可用于允许DNA或RNA构建体进入宿主细胞中的技术,包括但不限于磷酸钙/DNA共沉淀、将DNA微注射到细胞核中、电穿孔、细菌原生质体与完整细胞融合、转染、脂质体转染、感染、颗粒轰击、精子介导的基因转移或任何其他技术。
本文公开的某些方面可以利用载体(包括上述载体)。可以使用任何质粒和载体,条件是它们在选择的宿主微生物中可复制和存活。本领域已知的载体和那些商购可得的载体(及其变体或衍生物)可以被工程化以包含一个或更多个用于该方法的重组位点。可以使用的载体包括但不限于真核表达载体,诸如pRS、pBluSkII、pET、pFastBac、pFastBacHT、pFastBacDUAL、pSFV和pTet-Splice(Invitrogen)、pEUK-C1、pPUR、pMAM、pMAMneo、pBI101、pBI121、pDR2、pCMVEBNA和pYACneo(Clontech)、pSVK3、pSVL、pMSG、pCH110和pKK232-8(Pharmacia,Inc.)、pXT1、pSG5、pPbac、pMbac、pMClneo和pOG44(Stratagene,Inc.)和pYES2、pAC360、pBlueBa-cHis A、B和C、pVL1392、pBlueBac111、pCDM8、pcDNA1、pZeoSV、pcDNA3、pREP4、pCEP4和pEBVHis(Invitrogen,Corp.)及其变体或衍生物。
这些载体可以用于表达感兴趣的基因或基因的一部分。基因或基因的一部分可以通过使用已知方法(诸如基于限制性酶或PCR的技术)插入。
发酵
在一些实施方案中,可用于本发明的细胞/微生物应当在适于将底物转化为UDCA、胆酸和/或另一种UDCA前体的发酵条件下培养。应当考虑的反应条件包括温度、培养基流速、pH、培养基氧化还原电位、搅拌速率、接种水平、最大底物浓度、确保底物水平不变得受限的将底物引入生物反应器的速率、避免产物抑制的最大产物浓度、气流、气体组成、曝气速率、生物反应器设计和培养基组成。
最佳反应条件将部分取决于所使用的特定细胞/微生物。然而,在一些情况下,优选的是在高于环境压力的压力进行发酵。
使用加压系统可以大幅降低所需生物反应器的体积,并从而降低发酵设备的资本成本。在一些情况下,反应器体积可以与反应器运行压力的增加成线性比例减小,即在10个大气压运行的生物反应器所需的体积仅为在1个大气压运行的生物反应器的十分之一。
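The inverse scaling described above (required volume falls in proportion to operating pressure) amounts to one line of arithmetic. The sketch below assumes the idealized linear relationship stated in the text and nothing more; `required_reactor_volume` is a hypothetical helper name.

```python
def required_reactor_volume(volume_at_reference, pressure_atm, reference_pressure_atm=1.0):
    """Idealized scaling: volume needed is inversely proportional to pressure."""
    if pressure_atm <= 0 or reference_pressure_atm <= 0:
        raise ValueError("pressures must be positive")
    return volume_at_reference * reference_pressure_atm / pressure_atm


if __name__ == "__main__":
    # A reactor sized at 1000 L for 1 atm, operated instead at 10 atm.
    print(required_reactor_volume(1000.0, 10.0))
```

Real vessels would also need to account for headspace, mass transfer, and pressure-rating costs, none of which this scaling captures.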
发酵条件
在其中细胞/微生物在发酵条件下培养的那些实施方案中,培养基的pH可以基于所使用的细胞/微生物进行优化。例如,所使用的pH范围可以是4至10。在其他情况下,pH可以是5至9、6至8、6.1至7.9、6.2至7.8、6.3至7.7、6.4至7.6、6.5至7.5、6.6至7.4或5.5至7.5。例如,pH可以是6.6至7.4。在一些情况下,pH可以是5至9。在一些情况下,pH可以是6至8。在一些情况下,pH可以是6.1至7.9。在一些情况下,pH可以是6.2至7.8。在一些情况下,pH可以是6.3至7.7。在一些情况下,pH可以是6.4至7.6。在一些情况下,pH可以是6.5至7.5。在一些情况下,用于发酵的pH可以大于约6。在一些情况下,用于发酵的pH可以低于约10。
温度也可以基于所使用的细胞/微生物进行调节。例如,温度范围可以是27℃至45℃、28℃至44℃、29℃至43℃、30℃至42℃、31℃至41℃、32℃至40℃或36℃至39℃。
氧气和其他气体的可用性可以影响产率和发酵速率。例如,当考虑氧气可用性时,发酵培养基中溶解氧(DO)的百分比可以是1%至40%。在某些情况下,DO浓度可以是1.5%至35%、2%至30%、2.5%至25%、3%至20%、4%至19%、5%至18%、6%至17%、7%至16%、8%至15%、9%至14%、10%至13%或11%至12%。例如,在一些情况下,DO浓度可以是2%至30%。在其他情况下,DO可以是3%至20%。在一些情况下,DO可以是4%至10%。在一些情况下,DO可以是1.5%至35%。在一些情况下,DO可以是2.5%至25%。在一些情况下,DO可以是4%至19%。在一些情况下,DO可以是5%至18%。在一些情况下,DO可以是6%至17%。在一些情况下,DO可以是7%至16%。在一些情况下,DO可以是8%至15%。在一些情况下,DO可以是9%至14%。在一些情况下,DO可以是10%至13%。在一些情况下,DO可以是11%至12%。
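In some cases, atmospheric CO2 can help control the pH of the cell culture medium. The pH of the medium depends on the balance between dissolved CO2 and bicarbonate (HCO3−), so changes in atmospheric CO2 can change the pH of the medium. In some cases, atmospheric CO2 can be 0% to 10%, 0.01% to 9%, 0.05% to 8%, 0.1% to 7%, 0.5% to 6%, 1% to 5%, 2% to 4%, 3% to 6%, 4% to 7%, 2% to 6%, or 5% to 10%.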
在使用开关的情况下,培养基可以包含诱导或阻遏开关的分子。
当镧开关用于阻遏本文描述的一种或更多种基因的表达时,培养基可以包含镧,镧将阻遏处于开关的控制下的一种或更多种基因的表达。在镧的情况下,以下浓度中的任一种可以有效地阻遏一种或更多种基因的表达:0.1μM、0.5μM、1μM、2μM、3μM、4μM、5μM、6μM、7μM、8μM、9μM、10μM、12.5μM、15μM、17.5μM、20μM、25μM、50μM、100μM或更高。在一种情况下,0.1μM镧可以用于阻遏处于镧开关的控制下的一种或更多种基因的表达。在其他情况下,可以使用至少0.5μM镧。在其他情况下,可以使用至少1μM镧。在其他情况下,可以使用至少2μM镧。在其他情况下,至少可以使用3μM镧。在其他情况下,可以使用至少4μM镧。在其他情况下,可以使用至少5μM镧。在其他情况下,可以使用至少6μM镧。在其他情况下,至少可以使用7μM镧。在其他情况下,可以使用至少8μM镧。在其他情况下,可以使用至少9μM镧。在其他情况下,可以使用至少10μM镧。在其他情况下,可以使用至少12.5μM镧。在其他情况下,可以使用至少15μM镧。在其他情况下,可以使用至少17.5μM镧。在其他情况下,可以使用至少20μM的镧。在其他情况下,可以使用至少25μM的镧。在其他情况下,可以使用至少50μM镧。在其他情况下,可以使用至少100μM镧。在一些情况下,0.5μM镧至100μM镧的范围将有效地阻遏基因表达。在一些情况下,0.5μM镧至50μM镧的范围将阻遏基因表达。在其他情况下,1μM镧至20μM镧的范围将阻遏基因表达。在一些情况下,2μM镧至15μM镧的范围将阻遏基因表达。在一些情况下,3μM镧至12.5μM镧的范围将阻遏基因表达。在一些情况下,4μM镧至12μM镧的范围将阻遏基因表达。在一些情况下,5μM镧至11.5μM镧的范围将阻遏基因表达。在一些情况下,6μM镧至11μM镧的范围将阻遏基因表达。在一些情况下,7μM镧至10.5μM镧的范围将阻遏基因表达。在一些情况下,8μM镧至10μM镧的范围将阻遏基因表达。
在一些情况下,培养基中的镧可以被稀释以开启一个或更多个受镧阻遏的基因的表达。例如,在一些情况下,含镧培养基的稀释度可以是1:1(1份含镧培养基与1份不含镧培养基)。在一些情况下,稀释度可以是至少1:2、1:3、1:4、1:5、1:7.5、1:10、1:15、1:20、1:25、1:30、1:35、1:40、1:45、1:50、1:75、1:100、1:200、1:300、1:400、1:500、1:1,000或1:10,000。例如,在一些情况下,可以使用1:2的稀释度。在一些情况下,可以使用至少1:3的稀释度。在一些情况下,可以使用至少1:4的稀释度。在一些情况下,可以使用至少1:5的稀释度。在一些情况下,可以使用至少1:7.5的稀释度。在一些情况下,可以使用至少1:10的稀释度。在一些情况下,可以使用至少1:15的稀释度。在一些情况下,可以使用至少1:20的稀释度。在一些情况下,可以使用至少1:25的稀释度。在一些情况下,可以使用至少1:30的稀释度。在一些情况下,可以使用至少1:35的稀释度。在一些情况下,可以使用至少1:40的稀释度。在一些情况下,可以使用至少1:45的稀释度。在一些情况下,可以使用至少1:50的稀释度。在一些情况下,可以使用至少1:75的稀释度。在一些情况下,可以使用至少1:100的稀释度。在一些情况下,可以使用至少1:200的稀释度。在一些情况下,可以使用至少1:300的稀释度。在一些情况下,可以使用至少1:400的稀释度。在一些情况下,可以使用至少1:500的稀释度。在一些情况下,可以使用至少1:1,000的稀释度。在一些情况下,可以使用至少1:10,000的稀释度。
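The dilution ratios above translate directly into a final lanthanum concentration. The sketch below reads "1:N" the way the text defines 1:1, that is, 1 part lanthanum-containing medium mixed with N parts lanthanum-free medium; this reading for N greater than 1 is an assumption, as are the helper names and the idea of a target concentration below which repression is relieved (the text gives no threshold).

```python
def diluted_concentration(initial_um, parts_fresh_medium, parts_culture=1.0):
    """Concentration (µM) after mixing culture with lanthanum-free medium."""
    if initial_um < 0 or parts_fresh_medium < 0 or parts_culture <= 0:
        raise ValueError("invalid input")
    return initial_um * parts_culture / (parts_culture + parts_fresh_medium)


def min_dilution_parts(initial_um, target_um):
    """Parts of fresh medium per 1 part culture needed to reach target_um.

    target_um is a hypothetical de-repression threshold chosen by the user.
    """
    if initial_um < 0 or target_um <= 0:
        raise ValueError("invalid input")
    return max(0.0, initial_um / target_um - 1.0)


if __name__ == "__main__":
    print(diluted_concentration(10.0, 1))   # 1:1 dilution of 10 µM lanthanum
    print(min_dilution_parts(10.0, 0.1))    # dilution needed to fall to 0.1 µM
```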
在一些情况下,可以使细胞/微生物在含镧培养基中生长。然后可以稀释培养基以有效地开启镧阻遏的基因的表达。然后可以使细胞/微生物在促进产生期望产物(诸如UDCA、胆酸和/或其他UDCA前体(如通篇公开的))的条件下生长。
当葡萄糖至半乳糖开关用于阻遏本文描述的一种或更多种基因的表达时(例如,当使用GAL1或GAL10启动子时),培养基可以包含葡萄糖,葡萄糖将阻遏处于开关的控制下的一种或更多种基因的表达。在葡萄糖的情况下,以下浓度中的任一种可以有效地阻遏一种或更多种基因的表达:0.1%、0.5%、1%、2%、3%、4%、5%、6%、7%、8%、9%、10%、12.5%、15%、17.5%、20%、25%、50%、100%或更高。在一种情况下,0.1%的葡萄糖可以用于阻遏处于葡萄糖至半乳糖开关的控制下的一种或更多种基因的表达。在其他情况下,可以使用至少0.5%的葡萄糖。在其他情况下,可以使用至少1%的葡萄糖。在其他情况下,可以使用至少2%的葡萄糖。在其他情况下,可以使用至少3%的葡萄糖。在其他情况下,可以使用至少4%的葡萄糖。在其他情况下,可以使用至少5%的葡萄糖。在其他情况下,可以使用至少6%的葡萄糖。在其他情况下,可以使用至少7%的葡萄糖。在其他情况下,可以使用至少8%的葡萄糖。在其他情况下,可以使用至少9%的葡萄糖。在其他情况下,可以使用至少10%的葡萄糖。在其他情况下,可以使用至少12.5%的葡萄糖。在其他情况下,可以使用至少15%的葡萄糖。在其他情况下,可以使用至少17.5%的葡萄糖。在其他情况下,可以使用至少20%的葡萄糖。在其他情况下,可以使用至少25%的葡萄糖。在其他情况下,可以使用至少50%的葡萄糖。在其他情况下,可以使用至少100%的葡萄糖。在一些情况下,0.5%的葡萄糖至100%的葡萄糖的范围将有效地阻遏基因表达。在一些情况下,0.5%的葡萄糖至50%的葡萄糖的范围将阻遏基因表达。在其他情况下,1%的葡萄糖至20%的葡萄糖的范围将阻遏基因表达。在一些情况下,2%的葡萄糖至15%的葡萄糖的范围将阻遏基因表达。在一些情况下,3%的葡萄糖至12.5%的葡萄糖的范围将阻遏基因表达。在一些情况下,4%的葡萄糖至12%的葡萄糖的范围将阻遏基因表达。在一些情况下,5%的葡萄糖至11.5%的葡萄糖的范围将阻遏基因表达。在一些情况下,6%的葡萄糖至11%的葡萄糖的范围将阻遏基因表达。在一些情况下,7%的葡萄糖至10.5%的葡萄糖的范围将阻遏基因表达。在一些情况下,8%的葡萄糖至10%的葡萄糖的范围将阻遏基因表达。
在一些情况下,培养基中的葡萄糖可以被稀释以开启一个或更多个受葡萄糖阻遏的基因的表达。例如,在一些情况下,含葡萄糖培养基的稀释度可以是1:1(1份含葡萄糖培养基与1份不含葡萄糖培养基)。在一些情况下,稀释度可以是至少1:2、1:3、1:4、1:5、1:7.5、1:10、1:15、1:20、1:25、1:30、1:35、1:40、1:45、1:50、1:75、1:100、1:200、1:300、1:400、1:500、1:1,000或1:10,000。例如,在一些情况下,可以使用1:2的稀释度。在一些情况下,可以使用至少1:3的稀释度。在一些情况下,可以使用至少1:4的稀释度。在一些情况下,可以使用至少1:5的稀释度。在一些情况下,可以使用至少1:7.5的稀释度。在一些情况下,可以使用至少1:10的稀释度。在一些情况下,可以使用至少1:15的稀释度。在一些情况下,可以使用至少1:20的稀释度。在一些情况下,可以使用至少1:25的稀释度。在一些情况下,可以使用至少1:30的稀释度。在一些情况下,可以使用至少1:35的稀释度。在一些情况下,可以使用至少1:40的稀释度。在一些情况下,可以使用至少1:45的稀释度。在一些情况下,可以使用至少1:50的稀释度。在一些情况下,可以使用至少1:75的稀释度。在一些情况下,可以使用至少1:100的稀释度。在一些情况下,可以使用至少1:200的稀释度。在一些情况下,可以使用至少1:300的稀释度。在一些情况下,可以使用至少1:400的稀释度。在一些情况下,可以使用至少1:500的稀释度。在一些情况下,可以使用至少1:1,000的稀释度。在一些情况下,可以使用至少1:10,000的稀释度。
Where a switch is used, the medium can contain a molecule that de-represses the switch. For example, when a glucose-to-galactose switch is used to repress expression of one or more genes described herein (e.g., when the GAL1 or GAL10 promoter is used), the medium can contain raffinose, which will de-repress expression of the one or more genes under the control of the switch. In the case of raffinose, any of the following concentrations can be effective to de-repress expression of one or more genes: 0.1%, 0.5%, 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 12.5%, 15%, 17.5%, 20%, 25%, 50%, 100%, or higher. In one case, 0.1% raffinose can be used to de-repress expression of one or more genes under the control of the glucose-to-galactose switch. In other cases, at least 0.5%, 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 12.5%, 15%, 17.5%, 20%, 25%, 50%, or 100% raffinose can be used. In some cases, a range of 0.5% to 100% raffinose will effectively de-repress gene expression. In other cases, a range of 0.5% to 50%, 1% to 20%, 2% to 15%, 3% to 12.5%, 4% to 12%, 5% to 11.5%, 6% to 11%, 7% to 10.5%, or 8% to 10% raffinose will de-repress gene expression.
在使用开关的情况下,培养基可以包含诱导开关的分子。例如,当葡萄糖至半乳糖开关用于诱导一种或更多种基因的表达时(例如,当使用GAL1或GAL10启动子时),培养基可以包含半乳糖,半乳糖将诱导处于开关的控制下的一种或更多种基因的表达。在半乳糖的情况下,以下浓度中的任一种可以有效地诱导一种或更多种基因的表达:0.1%、0.5%、1%、2%、3%、4%、5%、6%、7%、8%、9%、10%、12.5%、15%、17.5%、20%、25%、50%、100%或更高。在一种情况下,0.1%的半乳糖可以用于诱导处于葡萄糖至半乳糖开关的控制下一种或更多种基因的表达。在其他情况下,可以使用至少0.5%的半乳糖。在其他情况下,可以使用至少1%的半乳糖。在其他情况下,可以使用至少2%的半乳糖。在其他情况下,可以使用至少3%的半乳糖。在其他情况下,可以使用至少4%的半乳糖。在其他情况下,可以使用至少5%的半乳糖。在其他情况下,可以使用至少6%的半乳糖。在其他情况下,可以使用至少7%的半乳糖。在其他情况下,可以使用至少8%的半乳糖。在其他情况下,可以使用至少9%的半乳糖。在其他情况下,可以使用至少10%的半乳糖。在其他情况下,可以使用至少12.5%的半乳糖。在其他情况下,可以使用至少15%的半乳糖。在其他情况下,可以使用至少17.5%的半乳糖。在其他情况下,可以使用至少20%的半乳糖。在其他情况下,可以使用至少25%的半乳糖。在其他情况下,可以使用至少50%的半乳糖。在其他情况下,可以使用至少100%的半乳糖。在一些情况下,0.5%的半乳糖至100%的半乳糖的范围将有效地诱导基因表达。在一些情况下,0.5%的半乳糖至50%的半乳糖的范围将诱导基因表达。在其他情况下,1%的半乳糖至20%的半乳糖的范围将诱导基因表达。在一些情况下,2%的半乳糖至15%的半乳糖的范围将诱导基因表达。在一些情况下,3%的半乳糖至12.5%的半乳糖的范围将诱导基因表达。在一些情况下,4%的半乳糖至12%的半乳糖的范围将诱导基因表达。在一些情况下,5%的半乳糖至11.5%的半乳糖的范围将诱导基因表达。在一些情况下,6%的半乳糖至11%的半乳糖的范围将诱导基因表达。在一些情况下,7%的半乳糖至10.5%的半乳糖的范围将诱导基因表达。在一些情况下,8%的半乳糖至10%的半乳糖的范围将诱导基因表达。
When a copper switch is used to induce expression of one or more genes described herein, the medium can contain copper, which will induce expression of the one or more genes under the control of the switch. In the case of copper, any of the following concentrations can be effective to induce expression of one or more genes: 1 μM, 2.5 μM, 5 μM, 10 μM, 25 μM, 50 μM, 75 μM, 100 μM, 150 μM, 200 μM, 300 μM, 400 μM, 500 μM, 600 μM, 700 μM, 800 μM, 900 μM, 1 mM, 10 mM, or higher. In one case, 1 μM copper can be used to induce expression of one or more genes under the control of a copper promoter. In other cases, at least 5 μM, 10 μM, 25 μM, 50 μM, 100 μM, 200 μM, 300 μM, 400 μM, 500 μM, 600 μM, 700 μM, 800 μM, 900 μM, 1 mM, 2.5 mM, 5 mM, 7.5 mM, or 10 mM copper can be used. In some cases, a range of 1 μM to 10 mM copper will effectively induce gene expression. In other cases, a range of 2.5 μM to 1 mM, 5 μM to 800 μM, 10 μM to 600 μM, 25 μM to 500 μM, 50 μM to 450 μM, 75 μM to 400 μM, 100 μM to 350 μM, 150 μM to 300 μM, or 200 μM to 250 μM copper will induce gene expression.
生物反应器
发酵反应可以在任何合适的生物反应器中进行。在一些情况下,生物反应器可以包括第一生长反应器和第二发酵反应器,在第一生长反应器中培养细胞/微生物,来自生长反应器的肉汤向第二发酵反应器进料,并且在第二发酵反应器中产生大部分发酵产物。
产物回收
本文公开的细胞/微生物的发酵可以产生包含期望产物(例如,UDCA、胆酸和/或其他UDCA前体)、一种或更多种副产物和/或细胞/微生物本身的肉汤。
在某些产生产物的方法中,发酵肉汤中的产物浓度为至少0.1g/L。例如,发酵肉汤中产生的产物的浓度可以为0.1g/L至0.5g/L、0.5g/L至1g/L、1g/L至5g/L、2g/L至6g/L、3g/L至7g/L、4g/L至8g/L、5g/L至9g/L或6g/L至10g/L。在一些情况下,产物的浓度可以为至少9g/L。在一些情况下,产物的浓度可以为0.1g/L至10g/L。在一些情况下,产物的浓度可以为0.5g/L至3g/L。在一些情况下,产物的浓度可以为1g/L至5g/L。在一些情况下,产物的浓度可以为2g/L至6g/L。在一些情况下,产物的浓度可以为3g/L至7g/L。在一些情况下,产物的浓度可以为4g/L至8g/L。在一些情况下,产物的浓度可以为5g/L至9g/L。在一些情况下,产物的浓度可以为6g/L至10g/L。在一些情况下,产物的浓度可以为1g/L至3g/L。在一些情况下,产物的浓度可以为约2g/L。
如上所述,在某些情况下,发酵反应中产生的产物被转化为不同的有机产物。例如,产生的产物可以是UDCA前体,UDCA前体用作进一步产生UDCA、胆酸或另一种UDCA前体的底物。在其他情况下,首先将产物从发酵肉汤中回收,然后转化为不同的有机产物。
在一些情况下,可以从一部分肉汤中连续取出产物,并以纯化形式回收。在特定情况下,产物的回收包括将含有产物的肉汤的取出部分通过分离单元,以从肉汤中分离细胞/微生物,以产生无细胞产物渗透物,并将微生物返回至生物反应器。然后,含有无细胞产物的渗透物可以储存或用于随后转化为不同的期望产物。
回收发酵反应中产生的期望产物和/或一种或更多种其他产物或副产物可以包括连续取出一部分肉汤,并从肉汤的取出部分中分别回收产物和一种或更多种其他产物。在一些情况下,产物和/或一种或更多种其他产物的回收包括将含有产物和/或一种或更多种其他产物的肉汤的取出部分通过分离单元,以将细胞/微生物与产物和/或一种或更多种其他产物分离,以产生含有无细胞产物和一种或更多种其他产物的渗透物,并将微生物返回至生物反应器。
在上述情况下,产物和一种或更多种其他产物的回收可以包括首先从无细胞渗透物中取出产物,然后从无细胞渗透物中取出一种或更多种其他产物。然后,也可以将无细胞渗透物返回至生物反应器。
可以从发酵肉汤中回收产物或含有产物的混合产物流。例如,可以使用的方法可以包括但不限于分馏或蒸发、渗透蒸发和萃取发酵。另外的实例包括:使用来自整个发酵肉汤的流进行回收;反渗透与蒸馏组合;涉及产物溶剂萃取的液-液萃取技术;产物在PEG/葡聚糖系统中的双水相萃取;使用醇或酯(例如乙酸乙酯、磷酸三丁酯、乙醚、正丁醇、十二醇、油醇和乙醇/磷酸酯系统)的溶剂萃取;由亲水性溶剂和无机盐组成的双水相系统。一般参见,Voloch,M.等人,(1985)和美国专利公布申请第2012/0045807号。
在一些情况下,从发酵肉汤中回收产物和/或其他副产物可以通过以下进行:从生物反应器中连续取出一部分肉汤,从肉汤中分离微生物细胞(例如,方便地通过过滤),并从肉汤中回收产物和其他物质诸如醇和酸。醇可以例如通过蒸馏方便地回收,而酸可以例如通过吸附在活性炭上回收。将分离的微生物细胞返回至发酵生物反应器。取出醇和酸后剩余的无细胞渗透物也优选返回至发酵生物反应器。在无细胞渗透物返回至生物反应器之前,可以向无细胞渗透物添加额外的营养物以补充营养培养基。
此外,如果在回收产物和/或副产物的过程中调节肉汤的pH,则返回至生物反应器之前,应当将pH重新调节至与发酵生物反应器中的肉汤相似的pH。
体外方法和步骤
在一些实施方案中,本发明部分地涉及体外制备UDCA或UDCA前体的方法。换言之,在这些实施方案中,该方法不包括微生物的使用。例如,可以使底物在培养基中与诸如先前描述的酶或其片段接触。
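In some embodiments, the method includes both in vivo and in vitro steps. For example, some reactions along the biosynthetic pathway can take place inside the cell, while other reactions along the pathway take place outside the cell. In certain such methods, a UDCA precursor can be secreted by the cell into the medium and then converted directly, enzymatically or non-enzymatically (e.g., chemically), into a different product, such as UDCA or another UDCA precursor.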
辅酶A
通篇描述的微生物和方法可以用于产生通篇描述的产物的CoA形式。在一些情况下,CoA连接酶可以用于产生通篇描述的任何产物的CoA形式。
在一些情况下,SLC27A5可以产生CoA产物,即(25R)-3α,7α-二羟基-5β-胆甾烷酰基-CoA或(25R)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA。在一些情况下,AMACR可以产生CoA产物,即(25S)-3α,7α-二羟基-5β-胆甾烷酰基-CoA或(25S)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA。在一些情况下,ACOX2可以产生CoA产物,即(24E)-3α,7α-二羟基-5β-胆甾-24-烯酰基-CoA或(24E)-3α,7α,12α-三羟基-5β-胆甾-24-烯酰基-CoA。在一些情况下,HSD17B4可以产生CoA产物,即3α,7α-二羟基-24-氧-5β-胆甾烷酰基-CoA或3α,7α,12α-三羟基-24-氧-5β-胆甾烷酰基-CoA。在一些情况下,SCP2/硫解酶可以产生CoA产物,即3α,7α-二羟基-5β-胆烷-24-酰基-CoA(CDC-CoA)或3α,7α,12α-三羟基-5β-胆烷-24-酰基-CoA。在一些情况下,7α-HSD可以产生CoA产物,即3α-羟基-7-氧-5β-胆烷-24-酰基-CoA。在一些情况下,7β-HSD可以产生CoA产物,即3α,7β-二羟基-5β-胆烷-24-酰基-CoA(UDC-CoA)。
在一些情况下,一种或更多种产物的CoA形式可以是(25R)-3α,7α-二羟基-5β-胆甾烷酰基-CoA、(25R)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA、(25S)-3α,7α-二羟基-5β-胆甾烷酰基-CoA、(25S)-3α,7α,12α-三羟基-5β-胆甾烷酰基-CoA、(24E)-3α,7α-二羟基-5β-胆甾-24-烯酰基-CoA、(24E)-3α,7α,12α-三羟基-5β-胆甾-24-烯酰基-CoA、3α,7α-二羟基-24-氧-5β-胆甾烷酰基-CoA、3α,7α,12α-三羟基-24-氧-5β-胆甾烷酰基-CoA、3α,7α-二羟基-5β-胆烷-24-酰基-CoA(CDC-CoA)、3α,7α,12α-三羟基-5β-胆烷-24-酰基-CoA、3α-羟基-7-氧-5β-胆烷-24-酰基-CoA、3α,7β-二羟基-5β-胆烷-24-酰基-CoA(UDC-CoA)或其任何组合。
如通篇公开的产物可以以其CoA形式分离。
游离酸
通篇描述的微生物和方法可以用于产生通篇描述的产物的游离酸形式。在一些情况下,水解酶可以用于产生通篇描述的任何产物的游离酸形式。
在一些情况下,CYP27A1可以产生游离酸产物,即(25R)-3α,7α-二羟基-5β-胆甾烷酸或(25R)-3α,7α,12α-三羟基-5β-胆甾烷-26-酸。在一些情况下,SLC27A5可以产生游离酸产物,即(25R)-3α,7α-二羟基-5β-胆甾烷酸或(25R)-3α,7α,12α-三羟基-5β-胆甾烷-26-酸。在一些情况下,AMACR可以产生游离酸产物,即(25S)-3α,7α-二羟基-5β-胆甾烷酸或(25S)-3α,7α,12α-三羟基-5β-胆甾烷酸。在一些情况下,ACOX2可以产生游离酸产物,即(24E)-3α,7α-二羟基-5β-胆甾-24-烯酸或(24E)-3α,7α,12α-三羟基-5β-胆甾-24-烯酸。在一些情况下,HSD17B4可以产生游离酸产物,即3α,7α-二羟基-24-氧-5β-胆甾烷酸或3α,7α,12α-三羟基-24-氧-5β-胆甾烷酸。在一些情况下,SCP2/硫解酶可以产生游离酸产物,即3α,7α-二羟基-5β-胆烷酸(鹅去氧胆酸;CDCA)或3α,7α,12α-三羟基-5β-胆烷-24-酸(胆酸)。在一些情况下,7α-HSD可以产生游离酸产物,即3α-羟基-7-氧-5β-胆烷酸(海狸胆酸;NCA)。在一些情况下,7β-HSD可以产生游离酸产物,即3α,7β-二羟基-5β-胆烷酸(熊去氧胆酸;UDCA)。在一些情况下,胆酰-CoA水解酶可以产生游离酸产物,即UDCA或3α,7α,12α-三羟基-5β-胆烷-24-酸(胆酸)。
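In some cases, the free acid form of the one or more products can be (25R)-3α,7α-dihydroxy-5β-cholestanoic acid, (25R)-3α,7α,12α-trihydroxy-5β-cholestan-26-oic acid, (25S)-3α,7α-dihydroxy-5β-cholestanoic acid, (25S)-3α,7α,12α-trihydroxy-5β-cholestanoic acid, (24E)-3α,7α-dihydroxy-5β-cholest-24-enoic acid, (24E)-3α,7α,12α-trihydroxy-5β-cholest-24-enoic acid, 3α,7α-dihydroxy-24-oxo-5β-cholestanoic acid, 3α,7α,12α-trihydroxy-24-oxo-5β-cholestanoic acid, 3α,7α-dihydroxy-5β-cholanoic acid (chenodeoxycholic acid; CDCA), 3α,7α,12α-trihydroxy-5β-cholan-24-oic acid (cholic acid), 3α-hydroxy-7-oxo-5β-cholanoic acid (nutriacholic acid; NCA), 3α,7β-dihydroxy-5β-cholanoic acid (ursodeoxycholic acid; UDCA), or any combination thereof.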
通篇公开的产物可以以其游离酸形式分离。
组合物
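The invention also relates in part to compositions comprising UDCA or a UDCA precursor, a free acid or CoA form thereof, or a pharmaceutically acceptable derivative or prodrug thereof. The composition can also contain an excipient. The composition can be in pharmaceutical form. A "pharmaceutically acceptable derivative" means any pharmaceutically acceptable salt, ester, salt of an ester, prodrug, or other derivative thereof. Pharmaceutically acceptable salts of the compounds of the invention include those derived from pharmaceutically acceptable inorganic and organic acids and bases. Examples of suitable acid salts include acetate, adipate, benzoate, benzenesulfonate, butyrate, citrate, digluconate, dodecylsulfate, formate, fumarate, glycolate, hemisulfate, heptanoate, hexanoate, hydrochloride, hydrobromide, hydroiodide, lactate, maleate, malonate, methanesulfonate, 2-naphthalenesulfonate, nicotinate, nitrate, palmitate, phosphate, picrate, pivalate, propionate, salicylate, succinate, sulfate, tartrate, tosylate, and undecanoate. Salts derived from suitable bases include alkali metal (e.g., sodium), alkaline earth metal (e.g., magnesium), ammonium, and N-(alkyl)4+ salts.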
本发明还部分地涉及将UDCA或UDCA前体配制成药物组合物的方法。
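To prepare pharmaceutical compositions from the compounds of the invention, pharmaceutically acceptable carriers include solid carriers and liquid carriers. Solid form preparations include powders, tablets, pills, capsules, cachets, suppositories, and dispersible granules. A solid carrier can be one or more substances that may also act as a diluent, flavoring agent, binder, preservative, tablet disintegrating agent, or encapsulating material. Details of formulation and administration techniques are well described in the scientific and patent literature; see, e.g., the latest edition of Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, PA.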
在粉剂中,载体是与精细的(finely divided)活性组分混合在一起的精细的固体。在片剂中,活性组分与具有必要结合特性的载体以合适的比例混合,并压制成需要的形状和尺寸。
合适的固体赋形剂为碳水化合物或蛋白质填料,包括但不限于,糖,包括乳糖、蔗糖、甘露醇或山梨醇;来自玉米、小麦、稻、马铃薯或其他植物的淀粉;纤维素,诸如甲基纤维素、羟丙基甲基纤维素或羧甲基纤维素钠;和树胶,包括阿拉伯树胶和黄蓍胶;以及蛋白质,诸如明胶和胶原蛋白。如果需要,添加崩解剂或增溶剂,诸如交联聚乙烯吡咯烷酮、琼脂、藻酸或其盐,诸如藻酸钠。
液体形式制品包括溶液、悬浮液和乳液,例如水或水/丙二醇溶液。对于肠胃外注射,液体制品可以配制在聚乙二醇水溶液的溶液中。
药物制品可以是单位剂型。以此形式,制品被细分为含有适量活性组分的单位剂量。单位剂型可以是包装的制品,包装包含离散量的制品,诸如在小瓶或安瓿中包装的片剂、胶囊和粉剂。此外,单位剂型可以是胶囊、片剂、扁囊剂或锭剂本身,或者单位剂型可以是包装形式的适当数量的任何这些。
本发明还涉及制备药物组合物的方法。在一些情况下,将UDCA或UDCA前体与赋形剂混合以产生药物组合物。
治疗疾病和疾病症状
UDCA或UDCA前体(或如通篇公开的其他游离酸或CoA产物)可以用于治疗疾病。这包括治疗一种或更多种疾病症状。例如,UDCA或UDCA前体(或如通篇公开的其他游离酸或CoA产物)可以用于治疗以下疾病中的一种或更多种:胆结石(例如,胆固醇胆结石)、原发性胆汁性肝硬化、囊性纤维化、胆汁流出障碍(impaired bile flow)、妊娠肝内胆汁淤积症和/或胆石症。
一些疾病或疾病症状可以是人类独有的,但其他疾病或疾病症状可在多于一种动物诸如所有哺乳动物中共有。
本发明部分地涉及一种治疗疾病或疾病症状的方法,该方法包括向需要这种治疗的受试者施用UDCA或UDCA前体、其游离酸或CoA、或其药学上可接受的衍生物或前药。
合适的施用途径包括但不限于口服施用、静脉内施用、直肠施用、气雾剂施用、肠胃外施用、眼部施用、肺部施用、经粘膜施用、经皮施用、阴道施用、耳部施用、鼻部施用和局部施用。另外,仅通过实例的方式,胃肠外递送包括肌内注射、皮下注射、静脉内注射、髓内注射,以及鞘内注射、直接心室内注射、腹膜内注射、淋巴管内和鼻内注射。
UDCA或UDCA前体的用途
本发明还部分地涉及使用前述方法制备的UDCA或UDCA前体在制造用于治疗疾病或疾病症状的药物中的用途。疾病或疾病症状可以是能够通过UDCA或UDCA前体治疗的任何疾病或症状。此类实例包括胆结石、原发性胆汁性肝硬化、囊性纤维化、胆汁流出障碍、妊娠肝内胆汁淤积症和胆石症。
UDCA可以用于治疗胆结石,并且是肠道细菌的副产物。
UDCA前体可以用于制备其他产物,诸如其他UDCA前体或UDCA。
实施例
尽管本文已经示出和描述了一些实例,但这些实例仅通过示例的方式提供。在不偏离本发明的情况下,本领域技术人员现在将想到许多变化、改变和替换。应当理解,在实践本发明时将采用本文描述的本发明的实例的各种替代选择。
实施例1—鉴定将糖转化为UDCA的酶并产生可以制备UDCA的菌株
13种异源酶(从酿酒酵母的角度来看)被鉴定为可能可用于从胆固醇产生UDCA的酶。参见例如,图1。另外两(2)种酶也被鉴定为可能可用于将糖转化为胆固醇的酶。参见例如,图2。
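Genes encoding these enzymes were synthesized and then cloned into yeast expression plasmids or integration constructs. These plasmids or integration constructs were subsequently transformed into Saccharomyces cerevisiae using a standard yeast chemical transformation protocol with lithium acetate and PEG (3350). The yeast to be transformed was grown to mid-log phase and then centrifuged at 4000 rpm, and the supernatant was removed. The pellet was washed with water and centrifuged again. The resulting pellet was resuspended in a master mix containing 100 mM lithium acetate, 40% PEG (MW 3,350), 0.35 mg/ml carrier DNA (sheared salmon sperm DNA), and 50 ng to 500 ng of the DNA to be transformed. The cell suspension was then incubated at 30°C for 30 minutes, followed by heat shock at 42°C for 45 minutes. At this point, cells under nutritional selection were plated directly, whereas cells under antifungal selection were allowed to recover in rich yeast media for 4 hr to overnight before being plated on agar containing the antifungal drug. The plates were then incubated at 30°C for 2-3 days. After colonies formed, correct integration was verified by colony PCR before the strains were used in experiments.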
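The master mix composition in the protocol above (100 mM lithium acetate, 40% PEG, 0.35 mg/ml carrier DNA) can be turned into pipetting volumes once stock concentrations are chosen. The stock concentrations below (1 M lithium acetate, 50% w/v PEG 3350, 10 mg/ml carrier DNA) and the 360 µl reaction volume are assumptions for illustration, not values from the example, and `master_mix_volumes` is a hypothetical helper.

```python
# Final concentrations taken from the protocol text.
TARGETS = {
    "lithium acetate (M)": 0.1,
    "PEG 3350 (% w/v)": 40.0,
    "carrier DNA (mg/ml)": 0.35,
}

# Assumed stock concentrations; adjust to the stocks actually on hand.
DEFAULT_STOCKS = {
    "lithium acetate (M)": 1.0,
    "PEG 3350 (% w/v)": 50.0,
    "carrier DNA (mg/ml)": 10.0,
}


def master_mix_volumes(total_ul, stocks=None):
    """Return the volume (µl) of each stock for one reaction of total_ul."""
    stocks = stocks or DEFAULT_STOCKS
    volumes = {name: total_ul * TARGETS[name] / stocks[name] for name in TARGETS}
    remainder = total_ul - sum(volumes.values())
    if remainder < 0:
        raise ValueError("stocks are too dilute to reach the target concentrations")
    # Whatever volume is left holds the DNA to be transformed plus water.
    volumes["DNA to transform + water"] = remainder
    return volumes


if __name__ == "__main__":
    for name, ul in master_mix_volumes(360.0).items():
        print(f"{name}: {ul:.1f} µl")
```

With a 50% PEG stock, PEG alone takes 80% of the reaction volume, so only a small volume remains for the transforming DNA; this is why the helper raises an error if the stocks chosen are too dilute.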
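Table 1 shows representative genes expressed in the yeast strains and the genetic source of the enzyme that showed the best activity. Genes from other sources were also found to be active but are not shown in Table 1.
[Table 1: provided as images in the original publication (BDA0003107961460000631, BDA0003107961460000641)]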
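Example 2 - Yeast strains with the ability to produce cholesterol
Saccharomyces cerevisiae, which does not have the native ability to produce cholesterol, was genetically modified to upregulate the mevalonate pathway by overexpressing S. cerevisiae tHMG1 driven by the pGAL1 promoter. In addition, S. cerevisiae was genetically modified to express two heterologous genes, DHCR7 and DHCR24, driven by the GAL1 or GAL10 promoter.
All strains expressed the same DHCR7 from Arabidopsis thaliana.
These different strains were tested by GC/MS for the ability to produce sterol compounds. As shown in FIG. 5, yeast strains expressing DHCR24 were able to produce cholesterol, with DHCR24 from Homo sapiens and Danio rerio (zebrafish) having the best activity. Yeast strains without a DHCR24 gene did not produce any cholesterol.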
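Example 3 - Conversion of cholesterol to 7α-hydroxycholesterol
Saccharomyces cerevisiae expressing Arabidopsis thaliana DHCR7 and Homo sapiens DHCR24 was transformed with several variants of cytochrome P450 family 7 subfamily A member 1 (CYP7A1) in combination with different adrenodoxin (ADX) variants. All strains expressed Bos taurus adrenodoxin reductase (ADR).
The strains were then tested for the ability to convert cholesterol to 7α-hydroxycholesterol, that is, to hydroxylate the C7 carbon of the cholesterol molecule. The conversion was detected by GC/MS.
As shown in FIG. 6, CYP7A1 from Mus musculus showed the best activity. Activity was also observed with CYP7A1 from H. sapiens, Rattus norvegicus, Oryctolagus cuniculus, B. taurus, and Danio rerio.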
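Example 4 - Conversion of 7α-hydroxycholesterol to 7α-hydroxy-4-cholesten-3-one
A strain expressing Arabidopsis thaliana DHCR7 and Homo sapiens DHCR24 was genetically engineered to further express Mus musculus CYP7A1, ADX from Bos taurus and Danio rerio, B. taurus adrenodoxin reductase (ADR), and 3β-hydroxysteroid dehydrogenase type 7 (HSD3B7).
The strains were then tested by GC/MS for the ability to convert 7α-hydroxycholesterol to 7α-hydroxy-4-cholesten-3-one.
As shown in FIG. 7, HSD3B7 from H. sapiens showed the best activity. Activity was also observed with HSD3B7 from M. musculus and D. rerio.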
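Example 5 - Conversion of 7α-hydroxy-4-cholesten-3-one to 7α-hydroxy-5β-cholestan-3-one
A strain expressing Arabidopsis thaliana DHCR7 and Homo sapiens DHCR24 was genetically engineered to further express Mus musculus CYP7A1, ADX from Danio rerio and Bos taurus, B. taurus ADR, H. sapiens HSD3B7, and aldo-keto reductase family 1 member D1 (AKR1D1).
The strains were then tested by GC/MS for the ability to convert 7α-hydroxy-4-cholesten-3-one to 7α-hydroxy-5β-cholestan-3-one.
As shown in FIG. 8, AKR1D1 from H. sapiens and M. musculus showed the best activity.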
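Example 6 - Conversion of 7α-hydroxy-5β-cholestan-3-one to 5β-cholestane-3α,7α-diol
A strain expressing Arabidopsis thaliana DHCR7 and Homo sapiens DHCR24 was genetically engineered to further express Mus musculus CYP7A1, ADX from Danio rerio and Bos taurus, B. taurus ADR, H. sapiens HSD3B7, M. musculus AKR1D1, and either aldo-keto reductase family 1 member C9 (AKR1C9) or aldo-keto reductase family 1 member C4 (AKR1C4).
The strains were then tested by GC/MS for the ability to convert 7α-hydroxy-5β-cholestan-3-one to 5β-cholestane-3α,7α-diol.
As shown in FIG. 9, AKR1C4 from Macaca fuscata showed the best activity. In addition, AKR1C4 from H. sapiens showed very good activity.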
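Example 7 - Conversion of 7α-hydroxy-4-cholesten-3-one to 7α,12α-dihydroxy-4-cholesten-3-one
A strain expressing Arabidopsis thaliana DHCR7 and Homo sapiens DHCR24 was genetically engineered to further express Mus musculus CYP7A1, ADX from Danio rerio and Bos taurus, B. taurus ADR, H. sapiens HSD3B7, and CYP8B1.
These strains were then tested by GC/MS for the ability to add a third hydroxyl group at C12 of the cholesterol backbone, that is, to produce 7α,12α-dihydroxy-4-cholesten-3-one from 7α-hydroxy-4-cholesten-3-one.
As shown in FIG. 10, CYP8B1 from M. musculus and Oryctolagus cuniculus showed the best activity. CYP8B1 from H. sapiens and Sus scrofa also showed activity.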
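Example 8 - Conversion of 5β-cholestane-3α,7α-diol to (25R)-3α,7α-dihydroxy-5β-cholestanoic acid (and further conversion to (25R)-3α,7α-dihydroxy-5β-cholestanoyl-CoA by coupling with SLC27A5)
A strain expressing Arabidopsis thaliana DHCR7 and Homo sapiens DHCR24, and also transformed with the other enzymes necessary to produce 5β-cholestane-3α,7α-diol, was further genetically engineered to express different CYP27A1 variants. Seven CYP27A1 variants were tested in combination with two ADX variants (Danio rerio and Bos taurus) and B. taurus ADR. In addition, H. sapiens SLC27A5 was expressed to couple to this CYP27A1 activity, so that the SLC27A5 product could be detected by LC-MS instead.
As shown in FIG. 11, most CYP27A1 variants were able to yield the SLC27A5 product.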
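Example 9 - Conversion of (25R)-3α,7α,12α-trihydroxy-5β-cholestan-26-oic acid to (25R)-3α,7α,12α-trihydroxy-5β-cholestanoyl-CoA
Variants of solute carrier family 27 member 5 (SLC27A5) were integrated into a wild-type yeast strain in which the native yeast CoA ligase FAT1 had been knocked out. With the different SLC27A5 variants expressed, the yeast strains were lysed and assayed for CoA ligase activity on (25R)-3α,7α,12α-trihydroxy-5β-cholestan-26-oic acid.
As shown in FIG. 12A, the HPLC data revealed a peak specific to the strains expressing the ligase. Furthermore, as shown in FIG. 12B, mass spectrometry data confirmed this peak, verifying that an active ligase was present in the expressing strains. In addition, the CoA ligase also showed activity with 3α,5β,7α,12α,24E-trihydroxy-cholest-24-en-26-oic acid as a substrate.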
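Example 10 - Conversion of (25R)-3α,7α-dihydroxy-5β-cholestanoyl-CoA to (25S)-3α,7α-dihydroxy-5β-cholestanoyl-CoA
A strain expressing Arabidopsis thaliana DHCR7, Homo sapiens DHCR24, Mus musculus CYP7A1, ADX from Danio rerio and Bos taurus, B. taurus ADR, H. sapiens HSD3B7, M. musculus AKR1D1, Macaca fuscata AKR1C4, Rattus norvegicus CYP27A1, H. sapiens SLC27A5, and ACOX2 (from H. sapiens or Oryctolagus cuniculus) was used as the background strain to test the activity of several α-methylacyl-CoA racemases (AMACR). Because the racemization of (25R)-3α,7α-dihydroxy-5β-cholestanoyl-CoA to (25S)-3α,7α-dihydroxy-5β-cholestanoyl-CoA is difficult to detect, the yeast strains were lysed and (24E)-3α,7α-dihydroxy-5β-cholest-24-enoyl-CoA (the product of ACOX2) was measured by LC/MS.
As shown in FIG. 13A, AMACR from both H. sapiens and R. norvegicus yielded excellent racemization activity. Furthermore, as shown in FIG. 13B, ACOX2 from H. sapiens combined with H. sapiens AMACR yielded the most (24E)-3α,7α-dihydroxy-5β-cholest-24-enoyl-CoA.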
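Example 11 - Conversion of (25S)-3α,7α-dihydroxy-5β-cholestanoyl-CoA to (24E)-3α,7α-dihydroxy-5β-cholest-24-enoyl-CoA
A strain expressing Arabidopsis thaliana DHCR7, Homo sapiens DHCR24, Mus musculus CYP7A1, ADX from Danio rerio and Bos taurus, B. taurus ADR, H. sapiens HSD3B7, M. musculus AKR1D1, Macaca fuscata AKR1C4, Rattus norvegicus CYP27A1, H. sapiens SLC27A5, and AMACR (from H. sapiens and R. norvegicus) was used as the background strain to test the activity of different acyl-CoA oxidase 2 (ACOX2) enzymes. The yeast strains were lysed and (24E)-3α,7α-dihydroxy-5β-cholest-24-enoyl-CoA was measured by LC/MS.
As shown in FIG. 14, ACOX2 from both H. sapiens and Oryctolagus cuniculus showed the best activity. ACOX2 from R. norvegicus, M. musculus, and Saccharomyces cerevisiae also showed activity.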
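Example 12 - Conversion of (24E)-3α,7α,12α-trihydroxy-5β-cholest-24-enoyl-CoA to 3α,7α,12α-trihydroxy-24-oxo-5β-cholestanoyl-CoA
A strain expressing the SLC27A5 CoA ligase was used as the background strain to test the activity of different hydroxysteroid 17-β dehydrogenase 4 (HSD17B4) enzymes. The yeast strains were lysed and assayed in vitro with added substrate, 3α,5β,7α,12α,24E-trihydroxy-cholest-24-en-26-oic acid (SLC27A5 CoA ligase activity had already been verified on this substrate).
The alcohol, which is the intermediate product of the bifunctional enzyme HSD17B4, was detected. As shown in FIG. 15, HSD17B4 from Rattus norvegicus, Bos taurus, and Xenopus laevis yielded the best activity. HSD17B4 from the remaining 6 sources also showed activity.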
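Example 13 - Conversion of 3α,7α-dihydroxy-24-oxo-5β-cholestanoyl-CoA to 3α,7α-dihydroxy-5β-cholan-24-oyl-CoA
A strain expressing Arabidopsis thaliana DHCR7, Homo sapiens DHCR24, Mus musculus CYP7A1, ADX from Danio rerio and Bos taurus, B. taurus ADR, H. sapiens HSD3B7, M. musculus AKR1D1, Macaca fuscata AKR1C4, Rattus norvegicus CYP27A1, H. sapiens SLC27A5, R. norvegicus AMACR, H. sapiens ACOX2, and R. norvegicus HSD17B4 was used as the background strain to test the activity of sterol carrier protein 2 (SCP2). The background strain also had its native yeast gene POT1 (which encodes 3-ketoacyl-CoA thiolase) knocked out and expressed Bacteroides fragilis 7α-HSD and Clostridium sardiniense 7β-HSD. Yeast pellets were extracted, and the relative amounts of UDCA/UDC-CoA product were then analyzed by LC/MS.
As shown in FIG. 16, SCP2 activity was detected by LC/MS in all samples, including the negative control; however, enhanced activity was observed in strains overexpressing the native yeast gene POT1.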
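Example 14 - Conversion of 3α,7α-dihydroxy-5β-cholan-24-oyl-CoA to 3α-hydroxy-7-oxo-5β-cholan-24-oyl-CoA and then to 3α,7β-dihydroxy-5β-cholan-24-oyl-CoA
A strain expressing Saccharomyces cerevisiae truncated HMG, Arabidopsis thaliana DHCR7, Homo sapiens DHCR24, Mus musculus CYP7A1, ADX from Danio rerio and Bos taurus, B. taurus ADR, H. sapiens HSD3B7, M. musculus AKR1D1, Macaca fuscata AKR1C4, Rattus norvegicus CYP27A1, H. sapiens SLC27A5, R. norvegicus AMACR, H. sapiens ACOX2, R. norvegicus HSD17B4, S. cerevisiae SCP2, pot1Δ, pox1Δ, and fox2Δ was used as the background strain to identify effective 7α-hydroxysteroid dehydrogenases and 7β-hydroxysteroid dehydrogenases (7α-HSD and 7β-HSD), respectively.
Four 7α-HSD variants (Escherichia coli (strain K12), Luminiphilus syltensis NOR5-1B, Bacteroides fragilis, and Comamonas testosteroni (Pseudomonas testosteroni)) were tested in the background strain (which in this case also expressed an active Clostridium sardiniense 7β-HSD) for the ability to produce UDC-CoA (also known as 3α,7β-dihydroxy-5β-cholanoyl-CoA, having the chemical formula C45H74N7O19P3S, a mass of 1141.40, and a molecular weight of 1142.10).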
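The two values quoted for UDC-CoA correspond to the monoisotopic mass (used when assigning an LC/MS peak) and the average molecular weight of C45H74N7O19P3S. The following Python sketch recomputes both from the formula. It is illustrative only; the atomic masses are standard rounded values entered by hand, and the helper names are hypothetical.

```python
import re

# Monoisotopic masses of the most abundant isotopes (rounded standard values).
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "N": 14.0030740,
                "O": 15.9949146, "P": 30.9737620, "S": 31.9720707}
# Conventional average atomic weights (rounded standard values).
AVERAGE = {"C": 12.011, "H": 1.008, "N": 14.007,
           "O": 15.999, "P": 30.974, "S": 32.06}


def parse_formula(formula):
    """Parse a simple formula such as 'C45H74N7O19P3S' into element counts.
    Parentheses and isotope labels are not supported."""
    counts = {}
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] = counts.get(element, 0) + (int(number) if number else 1)
    return counts


def formula_mass(formula, table):
    """Sum atomic masses from the given table over all atoms in the formula."""
    return sum(table[el] * n for el, n in parse_formula(formula).items())


udc_coa = "C45H74N7O19P3S"
print(f"monoisotopic mass: {formula_mass(udc_coa, MONOISOTOPIC):.2f}")
print(f"average molecular weight: {formula_mass(udc_coa, AVERAGE):.2f}")
```

Both results agree with the values stated in the text to two decimal places. The ion actually observed depends on the ionization mode and adduct, which this sketch does not model.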
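Cell pellets were collected from 25 mL of whole cell broth in 24-well deep-well plates. The cell pellets were resuspended in 2 mL of an 80% methanol/water mixture, vortexed at 4°C for 30 minutes, and centrifuged at 4000 rpm for 5 minutes at 4°C, and 1.8 mL of the supernatant was transferred to a 24-well deep-well plate. The resulting pellet was dried and resuspended in 200 μL of a 4:1 MPA (10 mM ammonium formate in water, pH 6):methanol solution. This resuspension was filtered through a 0.2 μm filter. The final filtered product was analyzed by liquid chromatography followed by mass spectrometry for the presence of UDC-CoA. A flow chart showing these steps is provided in FIG. 3.
As shown in FIG. 17, 7α-HSD from Escherichia coli and Bacteroides fragilis showed clear activity. 7α-HSD from Luminiphilus syltensis and Comamonas testosteroni also showed activity.
Four 7β-HSD variants (Pseudomonas syringae pv. atrofaciens, Pseudomonas caricapapayae, Drosophila persimilis (fruit fly), and Clostridium sardiniense) were also tested in the background strain (which in this case also expressed an active B. fragilis 7α-HSD) for the ability to produce UDC-CoA. The same procedure described above was used.
As shown in FIG. 18, 7β-HSD from C. sardiniense showed the best activity. 7β-HSD from P. caricapapayae also showed some activity.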
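Example 15 - Confirmation that UDC-CoA was produced
To verify that UDC-CoA was indeed produced in Example 14, two additional methods of processing samples for mass spectrometry were performed. As can be seen in FIG. 4, the initial pellet was split into two samples. The first sample was washed with 2 mL of 80% methanol/H2O, vortexed, centrifuged, transferred, and dried.
From this point on, the first and second samples underwent the same treatment.
750 μL of 1 N NaOH was added to the pellet, which was then incubated at 60°C for 60 minutes. The sample was then acidified with 500 μL of 2 N HCl. 4 mL of EtOAc was added, and the sample was vortexed for 20 minutes. 3 mL of the organic layer was removed and dried. It was resuspended in 200 μL of methanol and filtered through a 0.45 μm filter.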
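The acidification step can be checked with a simple equivalents balance: the added HCl must exceed the NaOH used for hydrolysis so that the bile acids are protonated before extraction into ethyl acetate. The following Python sketch is illustrative only. It assumes strong acid and strong base, additive volumes, and no buffering by the sample itself; the function name is hypothetical.

```python
def acid_excess(base_ul, base_normality, acid_ul, acid_normality):
    """Return (excess acid in meq, excess acid normality in the combined
    aqueous phase). A negative excess means the mixture is still basic."""
    base_meq = base_ul / 1000.0 * base_normality   # mL x N = meq
    acid_meq = acid_ul / 1000.0 * acid_normality
    excess_meq = acid_meq - base_meq
    total_ml = (base_ul + acid_ul) / 1000.0        # assumes additive volumes
    return excess_meq, excess_meq / total_ml


# Volumes and normalities from the hydrolysis step in the text.
excess_meq, excess_normality = acid_excess(750, 1.0, 500, 2.0)
print(f"excess acid: {excess_meq:.2f} meq")
print(f"excess acid concentration: {excess_normality:.2f} N")
```

Under these assumptions 0.25 meq of acid remains after neutralization, so the aqueous phase is strongly acidic when the ethyl acetate is added.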
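Both direct hydrolysis of the pellet and indirect hydrolysis of the steroid-CoA extract yielded detectable UDCA, CDCA, (24E)-3α,7α-dihydroxy-cholest-24-enoic acid, and 3α,7α-dihydroxy-5β-cholestanoic acid. Direct hydrolysis of the pellet appeared to give a higher yield.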
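Example 16 - Combinations of thiolase/7α-HSD/7β-HSD
A strain expressing Saccharomyces cerevisiae truncated HMG, Arabidopsis thaliana DHCR7, Homo sapiens DHCR24, Mus musculus CYP7A1, H. sapiens HSD3B7, M. musculus AKR1D1, Macaca fuscata AKR1C4, Rattus norvegicus CYP27A1, H. sapiens SLC27A5, R. norvegicus AMACR, H. sapiens ACOX2, R. norvegicus HSD17B4, pot1Δ, pox1Δ, and fox2Δ was used as the background strain to determine the best combination of thiolase/SCP2, 7α-HSD, and 7β-HSD.
The strains were then tested by GC/MS for the ability to produce UDCA/UDC-CoA. As can be seen in FIG. 19, two combinations resulted in the greatest amount of UDCA/UDC-CoA production: S. cerevisiae POT1 thiolase with Escherichia coli 7α-HSD and Clostridium sardiniense 7β-HSD, and S. cerevisiae POT1 thiolase with Bacteroides fragilis 7α-HSD and C. sardiniense 7β-HSD. As can also be seen in FIG. 19, other combinations yielded detectable levels of UDCA/UDC-CoA product.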
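Example 17 - Identification of enzymes that convert sugar to cholic acid and generation of strains that can produce cholic acid
Eleven heterologous enzymes (from the perspective of Saccharomyces cerevisiae) were identified as potentially useful for producing cholic acid from cholesterol. See, e.g., FIG. 22. Two (2) additional enzymes were also identified as potentially useful for converting sugar to cholesterol. See, e.g., FIG. 2.
Genes encoding these enzymes were synthesized and then cloned into yeast expression vectors suitable for integration into the yeast genome. These integration constructs were subsequently transformed into S. cerevisiae using a standard yeast chemical transformation protocol with lithium acetate and PEG (3350). The yeast to be transformed were grown to mid-log phase and then centrifuged at 4000 rpm, and the supernatant was removed. The pellet was washed with water and centrifuged again. The resulting pellet was resuspended in a master mix containing 100 mM lithium acetate, 40% PEG (MW 3,350), 0.35 mg/ml carrier DNA (sheared salmon sperm DNA), and 50 ng to 500 ng of the DNA to be transformed. The cell suspension was then incubated at 30°C for 30 minutes, followed by heat shock at 42°C for 45 minutes. At this point, nutritional selections were plated directly, whereas antifungal selections underwent a recovery of 4 hr to overnight in rich yeast media before being plated on agar containing the antifungal drug. Plates were then incubated at 30°C for 2 to 3 days. After colonies formed, correct integration was verified by colony PCR before the strains were used in experiments.
Table 2 shows representative genes expressed in the yeast strains and the genetic source of the enzyme that showed the best activity. Genes from other sources were also found to be active but are not shown in Table 2.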
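[Table 2: provided as an image in the original publication (BDA0003107961460000711)]
A strain with the ability to produce cholesterol was genetically engineered to further express CYP7A1, ADX (2 variants), ADR, and HSD3B7. The activities of CYP7A1 and HSD3B7 were demonstrated as described in Example 3 and Example 4.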
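Example 18 - Conversion of 7α-hydroxy-4-cholesten-3-one to 7α,12α-dihydroxy-4-cholesten-3-one
A strain expressing Arabidopsis thaliana DHCR7 and Homo sapiens DHCR24 was genetically engineered to further express Mus musculus CYP7A1, ADX (from Danio rerio and Bos taurus), B. taurus ADR, H. sapiens HSD3B7, and CYP8B1.
The strains were tested for the ability to produce 7α,12α-dihydroxy-4-cholesten-3-one from 7α-hydroxy-4-cholesten-3-one.
As shown in FIG. 23, CYP8B1 from M. musculus and Oryctolagus cuniculus showed the best activity. CYP8B1 from H. sapiens and Sus scrofa also showed activity.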
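Example 19 - Confirmation that cholyl-CoA was produced
A strain expressing Saccharomyces cerevisiae truncated HMG, Arabidopsis thaliana DHCR7, Homo sapiens DHCR24, Mus musculus CYP7A1, Bos taurus ADX, B. taurus ADR, H. sapiens HSD3B7, M. musculus AKR1D1, Macaca fuscata AKR1C4, Rattus norvegicus CYP27A1, H. sapiens SLC27A5, R. norvegicus AMACR, H. sapiens ACOX2, R. norvegicus HSD17B4, and S. cerevisiae SCP2 was used as the background strain to identify an effective CYP8B1.
One CYP8B1 variant (M. musculus) was tested in the background strain for the ability to produce cholyl-CoA (also known as 3α,7α,12α-trihydroxy-5β-cholan-24-oyl-CoA, having the chemical formula C45H74N7O20P3S, a mass of 1157.4, and a molecular weight of 1158.1). The hydrolyzed acid form of cholyl-CoA, cholic acid (also known as 3α,7α,12α-trihydroxy-5β-cholan-24-oic acid, having the chemical formula C24H40O5, a mass of 408.3, and a molecular weight of 408.58), is the measurable product.
Cell pellets were collected from 15 mL of whole cell broth in 24-well deep-well plates. The cell pellets were resuspended in 2 mL of an 80% methanol/water mixture, vortexed at 4°C for 30 minutes, and centrifuged at 4000 rpm for 5 minutes at 4°C, and 1.8 mL of the supernatant was transferred to a 24-well deep-well plate. The supernatant was dried overnight at 40°C on a centrivap. The dried extract was hydrolyzed with 750 μL of 1 N NaOH, with vortexing at 60°C for 1 hour, and then acidified with 500 μL of 2 N HCl. The acidified sample was extracted with 4 mL of ethyl acetate. 3.5 mL of the organic layer was transferred to a 24-well deep-well plate and dried at 45°C on a centrivap. The dried extract was resuspended in 200 μL of methanol and filtered through a 0.2 μm filter. The final filtered product was analyzed by liquid chromatography followed by mass spectrometry for the presence of cholic acid (hydrolyzed cholyl-CoA). A flow chart showing these steps is provided in FIG. 24.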
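The volumes in the workup above determine how concentrated the final LC/MS sample is relative to the starting broth. The following Python sketch estimates that factor. It is an upper bound under stated assumptions, none of which come from this document: complete extraction of the analyte at each step, uniform distribution within each liquid phase so that the carried fraction equals the volume fraction transferred, and no losses on drying or filtration.

```python
def concentration_factor(broth_ml, transfer_fractions, final_ml):
    """Broth-equivalent volume carried through the workup, divided by the
    final sample volume. transfer_fractions lists the fraction of liquid
    carried forward at each transfer step."""
    carried_ml = broth_ml
    for fraction in transfer_fractions:
        if not 0 < fraction <= 1:
            raise ValueError("each transfer fraction must be in (0, 1]")
        carried_ml *= fraction
    return carried_ml / final_ml


# Volumes taken from the protocol text.
fractions = [
    1.8 / 2.0,  # 1.8 mL of the 2 mL methanol/water extract transferred
    3.5 / 4.0,  # 3.5 mL of the 4 mL ethyl acetate layer transferred
]
factor = concentration_factor(15.0, fractions, 0.2)  # final volume: 200 uL
print(f"approximate concentration factor: {factor:.1f}x")
```

Under these idealized assumptions the final sample is roughly 59-fold concentrated relative to the broth; real recoveries would be lower.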
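As shown in FIG. 25, CYP8B1 from Mus musculus was active and produced cholyl-CoA (cholic acid was detected). No cholic acid was detected in strains lacking the CYP8B1 enzyme.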
Sequence Listing
<110> Intrexon Corporation
<120> Cells and methods for producing ursodeoxycholic acid and precursors thereof
<130> 75594-299652
<140>
<141>
<150> 62/743,122
<151> 2018-10-09
<160> 278
<170> PatentIn version 3.5
<210> 1
<211> 432
<212> PRT
<213> Arabidopsis thaliana
<400> 1
Met Ala Glu Thr Val His Ser Pro Ile Val Thr Tyr Ala Ser Met Leu
1 5 10 15
Ser Leu Leu Ala Phe Cys Pro Pro Phe Val Ile Leu Leu Trp Tyr Thr
20 25 30
Met Val His Gln Asp Gly Ser Val Thr Gln Thr Phe Gly Phe Phe Trp
35 40 45
Glu Asn Gly Val Gln Gly Leu Ile Asn Ile Trp Pro Arg Pro Thr Leu
50 55 60
Ile Ala Trp Lys Ile Ile Phe Cys Tyr Gly Ala Phe Glu Ala Ile Leu
65 70 75 80
Gln Leu Leu Leu Pro Gly Lys Arg Val Glu Gly Pro Ile Ser Pro Ala
85 90 95
Gly Asn Arg Pro Val Tyr Lys Ala Asn Gly Leu Ala Ala Tyr Phe Val
100 105 110
Thr Leu Ala Thr Tyr Leu Gly Leu Trp Trp Phe Gly Ile Phe Asn Pro
115 120 125
Ala Ile Val Tyr Asp His Leu Gly Glu Ile Phe Ser Ala Leu Ile Phe
130 135 140
Gly Ser Phe Ile Phe Cys Val Leu Leu Tyr Ile Lys Gly His Val Ala
145 150 155 160
Pro Ser Ser Ser Asp Ser Gly Ser Cys Gly Asn Leu Ile Ile Asp Phe
165 170 175
Tyr Trp Gly Met Glu Leu Tyr Pro Arg Ile Gly Lys Ser Phe Asp Ile
180 185 190
Lys Val Phe Thr Asn Cys Arg Phe Gly Met Met Ser Trp Ala Val Leu
195 200 205
Ala Val Thr Tyr Cys Ile Lys Gln Tyr Glu Ile Asn Gly Lys Val Ser
210 215 220
Asp Ser Met Leu Val Asn Thr Ile Leu Met Leu Val Tyr Val Thr Lys
225 230 235 240
Phe Phe Trp Trp Glu Ala Gly Tyr Trp Asn Thr Met Asp Ile Ala His
245 250 255
Asp Arg Ala Gly Phe Tyr Ile Cys Trp Gly Cys Leu Val Trp Val Pro
260 265 270
Ser Val Tyr Thr Ser Pro Gly Met Tyr Leu Val Asn His Pro Val Glu
275 280 285
Leu Gly Thr Gln Leu Ala Ile Tyr Ile Leu Val Ala Gly Ile Leu Cys
290 295 300
Ile Tyr Ile Asn Tyr Asp Cys Asp Arg Gln Arg Gln Glu Phe Arg Arg
305 310 315 320
Thr Asn Gly Lys Cys Leu Val Trp Gly Arg Ala Pro Ser Lys Ile Val
325 330 335
Ala Ser Tyr Thr Thr Thr Ser Gly Glu Thr Lys Thr Ser Leu Leu Leu
340 345 350
Thr Ser Gly Trp Trp Gly Leu Ala Arg His Phe His Tyr Val Pro Glu
355 360 365
Ile Leu Ser Ala Phe Phe Trp Thr Val Pro Ala Leu Phe Asp Asn Phe
370 375 380
Leu Ala Tyr Phe Tyr Val Ile Phe Leu Thr Leu Leu Leu Phe Asp Arg
385 390 395 400
Ala Lys Arg Asp Asp Asp Arg Cys Arg Ser Lys Tyr Gly Lys Tyr Trp
405 410 415
Lys Leu Tyr Cys Glu Lys Val Lys Tyr Arg Ile Ile Pro Gly Ile Tyr
420 425 430
<210> 2
<211> 1299
<212> DNA
<213> Arabidopsis thaliana
<400> 2
atggccgaaa ctgttcactc tcctattgta acctacgctt caatgttgtc attattggct 60
ttttgcccac cttttgttat attgttatgg tataccatgg tccatcagga tggttctgta 120
acacagacct tcggtttctt ctgggagaat ggagttcagg gattgattaa tatctggcct 180
aggccaacat tgattgcctg gaagattata ttctgctacg gagcttttga ggctatctta 240
cagttgttgt tgcctggaaa aagagtagaa ggtccaatct ctccagctgg taacagacca 300
gtctacaagg ctaacggatt ggctgcctac tttgttacct tagccaccta cttgggatta 360
tggtggttcg gtatttttaa ccctgctatt gtttatgacc atttaggtga aatattctct 420
gctttgattt ttggatcttt catattttgt gtcttgttgt acatcaaggg acatgtagca 480
ccttcttctt ctgattctgg ttcatgtggt aatttgatca ttgattttta ctggggtatg 540
gaattatatc caaggatcgg taaatcattc gatataaaag tatttacaaa ttgtaggttt 600
ggtatgatgt cttgggcagt cttagctgtc acatactgta taaaacaata tgaaattaat 660
ggtaaggttt cagattcaat gttggtaaat actattttga tgttggtata tgttacaaag 720
ttcttttggt gggaagcagg ttattggaat accatggaca tcgctcacga tagagcaggt 780
ttttacatct gttggggttg tttggtctgg gttccatctg tatatacatc accaggtatg 840
tatttggtca atcatccagt tgaattgggt actcagttgg ccatatatat cttggttgcc 900
ggaatcttat gtatatatat taattatgat tgtgacagac aaagacagga atttaggaga 960
actaatggaa agtgtttggt atggggaaga gcaccatcta agattgtcgc atcatacact 1020
actacatcag gtgaaacaaa gacatcatta ttattaactt caggatggtg gggattggcc 1080
aggcactttc actacgttcc tgagatcttg tctgctttct tctggacagt ccctgctttg 1140
tttgacaact ttttagccta tttttatgtt atatttttga ctttgttatt attcgataga 1200
gctaagagag atgacgatag atgtagatct aaatatggaa agtactggaa attatattgt 1260
gaaaaagtca aatacagaat tattccaggt atctactaa 1299
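As a quick consistency check on the paired entries in this listing, each coding DNA sequence should be three nucleotides per residue of its protein plus one stop codon, and its first codons should translate to the start of the protein. The following Python sketch checks the lengths declared in the <211> fields for the first protein/DNA pairs and translates the first 30 nucleotides of SEQ ID NO: 2. It is illustrative only; it uses the standard genetic code, and the helper names are hypothetical.

```python
# Standard genetic code, indexed by base order t, c, a, g at each position.
BASES = "tcag"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA (whitespace ignored) into one-letter amino acid codes;
    '*' marks a stop codon. Trailing partial codons are dropped."""
    dna = "".join(dna.split()).lower()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def expected_cds_length(protein_length):
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_length + 1)


# (protein length, DNA length) as declared in the <211> fields.
declared = {"SEQ 1/2": (432, 1299), "SEQ 3/4": (475, 1428), "SEQ 7/8": (471, 1416)}
for name, (protein_len, dna_len) in declared.items():
    print(name, expected_cds_length(protein_len) == dna_len)

# First 30 nucleotides of SEQ ID NO: 2, copied from the listing.
print(translate("atggccgaaa ctgttcactc tcctattgta"))
```

The translated prefix can be compared against the first ten residues of SEQ ID NO: 1 (Met Ala Glu Thr Val His Ser Pro Ile Val).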
<210> 3
<211> 475
<212> PRT
<213> Bos taurus
<400> 3
Met Ala Ala Lys Ser Gln Pro Ser Ala Pro Lys Thr Lys Ser Thr Ser
1 5 10 15
Gly Leu Thr Asn Gly Asn Ala Ala Ala Gln Gly Gln Trp Gly Arg Ala
20 25 30
Trp Glu Val Asp Trp Phe Ser Leu Ala Ser Val Ile Phe Leu Leu Leu
35 40 45
Phe Ala Pro Phe Ile Val Tyr Tyr Phe Ile Met Ala Cys Asp Gln Tyr
50 55 60
Gly Cys Ser Leu Thr Val Pro Val Ala Asp Leu Ala Thr Gly Arg Ala
65 70 75 80
Arg Leu Ala Asp Ile Trp Ala Arg Thr Pro Pro Val Thr Ala Lys Ala
85 90 95
Ala Gln Ile Tyr Thr Ala Trp Val Thr Leu Gln Val Leu Leu Tyr Met
100 105 110
Leu Leu Pro Asp Phe Cys His Lys Phe Leu Pro Gly Tyr Val Gly Gly
115 120 125
Val Gln Glu Gly Ala Val Thr Pro Ala Gly Ala Val Asn Lys Tyr Glu
130 135 140
Ile Asn Gly Leu Gln Ala Trp Leu Leu Thr His Leu Leu Trp Phe Ala
145 150 155 160
Asn Ala His Leu Leu Gly Trp Phe Ser Pro Thr Ile Ile Phe Asp Asn
165 170 175
Trp Ile Pro Leu Leu Trp Cys Ala Asn Ile Leu Gly Tyr Thr Val Ser
180 185 190
Thr Phe Ala Met Val Lys Gly Tyr Leu Phe Pro Thr Asp Ala Arg Glu
195 200 205
Cys Lys Phe Thr Gly Asn Phe Phe Tyr Asn Tyr Met Met Gly Val Glu
210 215 220
Phe Asn Pro Arg Ile Gly Lys Trp Phe Asp Phe Lys Leu Phe Phe Asn
225 230 235 240
Gly Arg Pro Gly Ile Val Ala Trp Thr Leu Ile Asn Leu Ser Phe Ala
245 250 255
Ala Lys Gln Gln Glu Leu Tyr Gly His Val Thr Asn Ser Met Val Leu
260 265 270
Val Asn Ile Leu Gln Ala Ile Tyr Val Leu Asp Phe Phe Trp Asn Glu
275 280 285
Thr Trp Tyr Leu Lys Thr Ile Asp Ile Cys His Asp His Phe Gly Trp
290 295 300
Tyr Leu Gly Trp Gly Asp Cys Val Trp Leu Pro Tyr Leu Tyr Thr Leu
305 310 315 320
Gln Gly Leu Tyr Leu Val Tyr His Pro Val Gln Leu Pro Thr Tyr Tyr
325 330 335
Ala Leu Gly Val Leu Leu Leu Gly Leu Leu Gly Tyr Tyr Ile Phe Arg
340 345 350
Met Thr Asn His Gln Lys Asp Leu Phe Arg Arg Thr Asp Gly Arg Cys
355 360 365
Leu Ile Trp Gly Arg Lys Pro Lys Ala Ile Glu Cys Ser Tyr Thr Ser
370 375 380
Ala Asp Gly Gln Arg His His Ser Lys Leu Leu Val Ser Gly Phe Trp
385 390 395 400
Gly Val Ala Arg His Phe Asn Tyr Thr Gly Asp Leu Met Gly Ser Leu
405 410 415
Ala Tyr Cys Leu Ala Cys Gly Gly Gly His Leu Leu Pro Tyr Phe Tyr
420 425 430
Ile Ile Phe Met Ala Ile Leu Leu Thr His Arg Cys Leu Arg Asp Glu
435 440 445
His Arg Cys Ala Asn Lys Tyr Gly Arg Asp Trp Glu His Tyr Thr Ala
450 455 460
Ala Val Pro Tyr Arg Leu Leu Pro Gly Ile Phe
465 470 475
<210> 4
<211> 1428
<212> DNA
<213> Bos taurus
<400> 4
atggctgcta agtctcaacc atctgctcca aaaactaaat ccacctccgg tttgaccaac 60
ggtaacgctg ctgctcaagg tcaatggggt agagcttggg aagtcgattg gttctctttg 120
gcttctgtta ttttcttgtt gttgtttgcc ccatttatcg tctactactt catcatggct 180
tgtgatcaat acggttgttc cttgactgtt ccagtcgctg acttggctac cggtagagct 240
agattggctg acatctgggc cagaacccca ccagtcaccg ctaaggccgc tcaaatctac 300
actgcttggg tcactttgca agttttgttg tacatgttgt tgccagattt ctgtcacaag 360
ttcttgccag gttacgtcgg tggtgtccaa gaaggtgccg tcaccccagc tggtgctgtc 420
aacaagtacg aaatcaacgg tttgcaagcc tggttgttga cccacttgtt gtggttcgct 480
aacgcccact tgttgggttg gttttctcca accatcatct tcgacaactg gattccattg 540
ttgtggtgtg ctaacatctt gggttacacc gtttctactt ttgctatggt taaaggttac 600
ttgttcccaa ccgacgccag agaatgtaaa ttcactggta acttctttta caactacatg 660
atgggtgttg aatttaaccc aagaattggt aaatggttcg atttcaaatt gttctttaac 720
ggtagaccag gtattgttgc ttggaccttg atcaacttgt ccttcgctgc caaacaacaa 780
gaattgtacg gtcatgttac caactctatg gtcttggtca acatcttgca agctatttac 840
gttttggact tcttctggaa cgaaacctgg tacttgaaga ctattgatat ctgtcacgac 900
cactttggtt ggtacttggg ttggggtgac tgtgtttggt tgccatactt gtacactttg 960
caaggtttgt acttggttta ccatccagtt caattgccaa cttactacgc cttgggtgtc 1020
ttgttgttgg gtttgttggg ttactacatt ttcagaatga ctaaccacca aaaggacttg 1080
ttcagaagaa ccgacggtag atgtttgatc tggggtagaa aaccaaaggc catcgaatgt 1140
tcctacacct ccgctgacgg tcaaagacat cactccaagt tgttggtctc tggtttctgg 1200
ggtgttgcta gacatttcaa ctacaccggt gacttgatgg gttccttggc ttactgtttg 1260
gcctgtggtg gtggtcattt gttgccatac ttctacatca ttttcatggc tatcttgttg 1320
acccatagat gtttgagaga tgaacacaga tgtgctaaca agtacggtag agattgggaa 1380
cactacactg ccgctgttcc atacagattg ttgccaggta tcttctaa 1428
<210> 5
<211> 475
<212> PRT
<213> Homo sapiens
<400> 5
Met Ala Ala Lys Ser Gln Pro Asn Ile Pro Lys Ala Lys Ser Leu Asp
1 5 10 15
Gly Val Thr Asn Asp Arg Thr Ala Ser Gln Gly Gln Trp Gly Arg Ala
20 25 30
Trp Glu Val Asp Trp Phe Ser Leu Ala Ser Val Ile Phe Leu Leu Leu
35 40 45
Phe Ala Pro Phe Ile Val Tyr Tyr Phe Ile Met Ala Cys Asp Gln Tyr
50 55 60
Ser Cys Ala Leu Thr Gly Pro Val Val Asp Ile Val Thr Gly His Ala
65 70 75 80
Arg Leu Ser Asp Ile Trp Ala Lys Thr Pro Pro Ile Thr Arg Lys Ala
85 90 95
Ala Gln Leu Tyr Thr Leu Trp Val Thr Phe Gln Val Leu Leu Tyr Thr
100 105 110
Ser Leu Pro Asp Phe Cys His Lys Phe Leu Pro Gly Tyr Val Gly Gly
115 120 125
Ile Gln Glu Gly Ala Val Thr Pro Ala Gly Val Val Asn Lys Tyr Gln
130 135 140
Ile Asn Gly Leu Gln Ala Trp Leu Leu Thr His Leu Leu Trp Phe Ala
145 150 155 160
Asn Ala His Leu Leu Ser Trp Phe Ser Pro Thr Ile Ile Phe Asp Asn
165 170 175
Trp Ile Pro Leu Leu Trp Cys Ala Asn Ile Leu Gly Tyr Ala Val Ser
180 185 190
Thr Phe Ala Met Val Lys Gly Tyr Phe Phe Pro Thr Ser Ala Arg Asp
195 200 205
Cys Lys Phe Thr Gly Asn Phe Phe Tyr Asn Tyr Met Met Gly Ile Glu
210 215 220
Phe Asn Pro Arg Ile Gly Lys Trp Phe Asp Phe Lys Leu Phe Phe Asn
225 230 235 240
Gly Arg Pro Gly Ile Val Ala Trp Thr Leu Ile Asn Leu Ser Phe Ala
245 250 255
Ala Lys Gln Arg Glu Leu His Ser His Val Thr Asn Ala Met Val Leu
260 265 270
Val Asn Val Leu Gln Ala Ile Tyr Val Ile Asp Phe Phe Trp Asn Glu
275 280 285
Thr Trp Tyr Leu Lys Thr Ile Asp Ile Cys His Asp His Phe Gly Trp
290 295 300
Tyr Leu Gly Trp Gly Asp Cys Val Trp Leu Pro Tyr Leu Tyr Thr Leu
305 310 315 320
Gln Gly Leu Tyr Leu Val Tyr His Pro Val Gln Leu Ser Thr Pro His
325 330 335
Ala Val Gly Val Leu Leu Leu Gly Leu Val Gly Tyr Tyr Ile Phe Arg
340 345 350
Val Ala Asn His Gln Lys Asp Leu Phe Arg Arg Thr Asp Gly Arg Cys
355 360 365
Leu Ile Trp Gly Arg Lys Pro Lys Val Ile Glu Cys Ser Tyr Thr Ser
370 375 380
Ala Asp Gly Gln Arg His His Ser Lys Leu Leu Val Ser Gly Phe Trp
385 390 395 400
Gly Val Ala Arg His Phe Asn Tyr Val Gly Asp Leu Met Gly Ser Leu
405 410 415
Ala Tyr Cys Leu Ala Cys Gly Gly Gly His Leu Leu Pro Tyr Phe Tyr
420 425 430
Ile Ile Tyr Met Ala Ile Leu Leu Thr His Arg Cys Leu Arg Asp Glu
435 440 445
His Arg Cys Ala Ser Lys Tyr Gly Arg Asp Trp Glu Arg Tyr Thr Ala
450 455 460
Ala Val Pro Tyr Arg Leu Leu Pro Gly Ile Phe
465 470 475
<210> 6
<211> 1428
<212> DNA
<213> Homo sapiens
<400> 6
atggccgcta agtctcaacc aaacattcca aaagccaaat ccttggacgg tgttaccaac 60
gacagaactg cttctcaagg tcaatggggt agagcttggg aagttgactg gttctctttg 120
gcttccgtta tctttttgtt gttgtttgcc ccattcattg tttactactt catcatggct 180
tgtgaccaat actcttgtgc tttgactggt ccagttgttg atatcgttac cggtcacgct 240
agattgtctg atatctgggc caagacccca ccaatcacta gaaaggctgc tcaattgtac 300
accttgtggg tcaccttcca agtcttgttg tacacctctt tgccagactt ctgtcacaag 360
ttcttgccag gttacgtcgg tggtattcaa gaaggtgctg ttactccagc tggtgtcgtc 420
aacaagtacc aaatcaacgg tttgcaagcc tggttgttga cccatttgtt gtggtttgct 480
aacgctcact tgttgtcttg gttctctcca accattattt tcgacaactg gattccattg 540
ttgtggtgtg ctaacatctt gggttacgct gtttctacct tcgccatggt taagggttac 600
ttcttcccaa cctccgctag agattgtaag tttactggta actttttcta caactacatg 660
atgggtattg aatttaaccc aagaattggt aagtggttcg atttcaagtt gttcttcaac 720
ggtagaccag gtattgtcgc ttggactttg atcaacttgt ctttcgccgc caagcaaaga 780
gaattgcact ctcacgtcac caacgctatg gtcttggtca acgtcttgca agccatttac 840
gttattgact tcttctggaa cgaaacctgg tacttgaaga ccatcgacat ttgtcacgac 900
cacttcggtt ggtacttggg ttggggtgac tgtgtttggt tgccatactt gtacaccttg 960
caaggtttgt acttggtcta ccacccagtc caattgtcta ctccacacgc tgttggtgtt 1020
ttgttgttgg gtttggttgg ttactacatc ttcagagtcg ctaaccacca aaaggacttg 1080
ttcagaagaa ccgatggtag atgtttgatc tggggtagaa agccaaaggt cattgaatgt 1140
tcttacacct ccgccgacgg tcaaagacac cactccaagt tgttggtttc tggtttctgg 1200
ggtgttgcta gacatttcaa ctacgttggt gacttgatgg gttctttggc ttactgtttg 1260
gcctgtggtg gtggtcactt gttgccatac ttctacatta tctacatggc tattttgttg 1320
actcacagat gtttgagaga tgaacataga tgtgcctcca agtacggtag agactgggaa 1380
agatacactg ccgctgtccc atacagattg ttgccaggta tcttctaa 1428
<210> 7
<211> 471
<212> PRT
<213> Mus musculus
<400> 7
Met Ala Ser Lys Ser Gln His Asn Ala Pro Lys Val Lys Ser Pro Asn
1 5 10 15
Gly Lys Ala Gly Ser Gln Gly Gln Trp Gly Arg Ala Trp Glu Val Asp
20 25 30
Trp Phe Ser Leu Ala Ser Ile Ile Phe Leu Leu Leu Phe Ala Pro Phe
35 40 45
Ile Val Tyr Tyr Phe Ile Met Ala Cys Asp Gln Tyr Ser Cys Ser Leu
50 55 60
Thr Ala Pro Ala Leu Asp Ile Ala Thr Gly His Ala Ser Leu Ala Asp
65 70 75 80
Ile Trp Ala Lys Thr Pro Pro Val Thr Ala Lys Ala Ala Gln Leu Tyr
85 90 95
Ala Leu Trp Val Ser Phe Gln Val Leu Leu Tyr Ser Trp Leu Pro Asp
100 105 110
Phe Cys His Arg Phe Leu Pro Gly Tyr Val Gly Gly Val Gln Glu Gly
115 120 125
Ala Ile Thr Pro Ala Gly Val Val Asn Lys Tyr Glu Val Asn Gly Leu
130 135 140
Gln Ala Trp Leu Ile Thr His Ile Leu Trp Phe Val Asn Ala Tyr Leu
145 150 155 160
Leu Ser Trp Phe Ser Pro Thr Ile Ile Phe Asp Asn Trp Ile Pro Leu
165 170 175
Leu Trp Cys Ala Asn Ile Leu Gly Tyr Ala Val Ser Thr Phe Ala Met
180 185 190
Ile Lys Gly Tyr Leu Phe Pro Thr Ser Ala Glu Asp Cys Lys Phe Thr
195 200 205
Gly Asn Phe Phe Tyr Asn Tyr Met Met Gly Ile Glu Phe Asn Pro Arg
210 215 220
Ile Gly Lys Trp Phe Asp Phe Lys Leu Phe Phe Asn Gly Arg Pro Gly
225 230 235 240
Ile Val Ala Trp Thr Leu Ile Asn Leu Ser Phe Ala Ala Lys Gln Gln
245 250 255
Glu Leu Tyr Gly His Val Thr Asn Ser Met Ile Leu Val Asn Val Leu
260 265 270
Gln Ala Ile Tyr Val Leu Asp Phe Phe Trp Asn Glu Thr Trp Tyr Leu
275 280 285
Lys Thr Ile Asp Ile Cys His Asp His Phe Gly Trp Tyr Leu Gly Trp
290 295 300
Gly Asp Cys Val Trp Leu Pro Tyr Leu Tyr Thr Leu Gln Gly Leu Tyr
305 310 315 320
Leu Val Tyr His Pro Val Gln Leu Ser Thr Pro Asn Ala Leu Gly Ile
325 330 335
Leu Leu Leu Gly Leu Val Gly Tyr Tyr Ile Phe Arg Met Thr Asn His
340 345 350
Gln Lys Asp Leu Phe Arg Arg Thr Asp Gly Arg Cys Leu Ile Trp Gly
355 360 365
Lys Lys Pro Lys Ala Ile Glu Cys Ser Tyr Thr Ser Ala Asp Gly Leu
370 375 380
Lys His His Ser Lys Leu Leu Val Ser Gly Phe Trp Gly Val Ala Arg
385 390 395 400
His Phe Asn Tyr Thr Gly Asp Leu Met Gly Ser Leu Ala Tyr Cys Leu
405 410 415
Ala Cys Gly Gly Gly His Leu Leu Pro Tyr Phe Tyr Ile Ile Tyr Met
420 425 430
Thr Ile Leu Leu Thr His Arg Cys Leu Arg Asp Glu His Arg Cys Ala
435 440 445
Asn Lys Tyr Gly Arg Asp Trp Glu Arg Tyr Thr Ala Ala Val Pro Tyr
450 455 460
Arg Leu Leu Pro Gly Ile Phe
465 470
<210> 8
<211> 1416
<212> DNA
<213> Mus musculus
<400> 8
atggcttcta agtctcaaca taacgctcca aaggtcaagt ctccaaacgg taaggctggt 60
tctcaaggtc aatggggtag agcttgggaa gtcgattggt tctccttggc ctccattatc 120
ttcttgttgt tgttcgcccc attcatcgtc tactacttca ttatggcttg tgatcaatac 180
tcctgttctt tgactgctcc agccttggac attgctactg gtcacgcctc cttggctgac 240
atctgggcta agactccacc agtcaccgcc aaggccgctc aattgtacgc tttgtgggtc 300
tccttccaag ttttgttgta ctcctggttg ccagacttct gtcatagatt cttgccaggt 360
tacgttggtg gtgttcaaga aggtgctatc accccagctg gtgtcgtcaa caagtacgaa 420
gtcaacggtt tgcaagcctg gttgattacc cacatcttgt ggttcgtcaa cgcctacttg 480
ttgtcttggt tctctccaac tatcattttc gataactgga ttccattgtt gtggtgtgcc 540
aacatcttgg gttacgctgt ttccactttc gccatgatca agggttactt gttcccaacc 600
tctgctgaag actgtaagtt caccggtaac ttcttctaca actacatgat gggtattgaa 660
tttaacccaa gaattggtaa gtggtttgac tttaagttgt ttttcaacgg tagaccaggt 720
atcgttgcct ggactttgat taacttgtcc ttcgctgcca agcaacaaga attgtacggt 780
catgttacta actccatgat tttggtcaac gtcttgcaag ccatctacgt cttggatttc 840
ttctggaacg aaacttggta cttgaagacc attgacattt gtcatgacca cttcggttgg 900
tacttgggtt ggggtgactg tgtttggttg ccatacttgt acaccttgca aggtttgtac 960
ttggtctacc acccagtcca attgtctact ccaaacgcct tgggtatctt gttgttgggt 1020
ttggttggtt actacatctt cagaatgacc aaccaccaaa aggatttgtt tagaagaact 1080
gacggtagat gtttgatctg gggtaagaag ccaaaggcta ttgaatgttc ctacacctct 1140
gctgacggtt tgaagcacca ttccaagttg ttggtctctg gtttctgggg tgttgctaga 1200
cactttaact acaccggtga cttgatgggt tccttggctt actgtttggc ctgtggtggt 1260
ggtcacttgt tgccatactt ttacattatt tacatgacta ttttgttgac ccacagatgt 1320
ttgagagacg aacacagatg tgccaacaag tacggtagag attgggaaag atacactgct 1380
gctgtcccat acagattgtt gccaggtatt ttttaa 1416
<210> 9
<211> 475
<212> PRT
<213> Otolemur garnetti
<400> 9
Met Ala Ala Lys Ser Gln Pro Ser Thr Pro Lys Thr Lys Ser Pro Gly
1 5 10 15
Ser Val Ser Asn Gly Gln Thr Thr Ser Gln Gly Gln Trp Gly Arg Ala
20 25 30
Trp Glu Val Asp Trp Phe Ser Leu Ala Ser Val Ile Phe Leu Leu Leu
35 40 45
Phe Ala Pro Phe Ile Val Tyr Tyr Phe Ile Met Thr Cys Asp Gln Tyr
50 55 60
Ser Cys Ala Leu Thr Ala Pro Val Val Asp Ile Val Thr Gly Arg Gly
65 70 75 80
Arg Leu Ser Asp Ile Trp Ala Arg Thr Pro Ser Val Thr Val Lys Ala
85 90 95
Ala Gln Val Tyr Ala Leu Trp Val Thr Phe Gln Val Leu Leu Tyr Met
100 105 110
Trp Leu Pro Asp Phe Cys His Lys Phe Leu Pro Gly Tyr Val Gly Gly
115 120 125
Ile Gln Glu Gly Ala Val Thr Pro Ala Gly Val Val Asn Lys Tyr Gly
130 135 140
Ile Asn Gly Leu Gln Ala Trp Leu Ile Thr His Leu Leu Trp Phe Ala
145 150 155 160
Asn Ser His Leu Leu Phe Trp Phe Ser Pro Thr Ile Ile Phe Asp Asn
165 170 175
Trp Ile Pro Leu Leu Trp Cys Ala Asn Ile Leu Gly Tyr Ala Val Ser
180 185 190
Thr Phe Ala Met Ile Lys Gly Tyr Phe Phe Pro Thr Ser Ala Gln Asp
195 200 205
Cys Lys Phe Thr Gly Asn Phe Phe Tyr Asn Tyr Met Met Gly Ile Glu
210 215 220
Phe Asn Pro Arg Ile Gly Lys Trp Phe Asp Phe Lys Leu Phe Phe Asn
225 230 235 240
Gly Arg Pro Gly Ile Val Ala Trp Thr Leu Ile Asn Leu Ser Phe Ala
245 250 255
Ala Lys Gln Gln Glu Leu Tyr Gly His Val Thr Asn Ser Met Val Leu
260 265 270
Val Asn Val Leu Gln Ala Ile Tyr Val Leu Asp Phe Phe Trp Asn Glu
275 280 285
Thr Trp Tyr Leu Lys Thr Met Asp Ile Cys His Asp His Phe Gly Trp
290 295 300
Tyr Leu Gly Trp Gly Asp Cys Val Trp Leu Pro Tyr Leu Tyr Thr Leu
305 310 315 320
Gln Gly Leu Tyr Leu Val Tyr His Pro Val Gln Leu Ser Pro Ala His
325 330 335
Ala Thr Gly Val Leu Leu Leu Gly Leu Leu Gly Tyr Tyr Ile Phe Arg
340 345 350
Met Ala Asn His Gln Lys Asp Leu Phe Arg Arg Thr Asp Gly Arg Cys
355 360 365
Leu Ile Trp Gly Arg Lys Pro Lys Ala Ile Glu Cys Ser Tyr Val Ser
370 375 380
Ala Asp Gly Gln Lys His His Ser Lys Leu Leu Val Ser Gly Phe Trp
385 390 395 400
Gly Leu Ala Arg His Phe Asn Tyr Thr Gly Asp Leu Met Gly Ser Leu
405 410 415
Ala Tyr Cys Leu Ala Cys Gly Gly Gly His Leu Leu Pro Tyr Phe Tyr
420 425 430
Ile Ile Tyr Met Ala Ile Leu Leu Ile His Arg Cys Leu Arg Asp Glu
435 440 445
His Arg Cys Ala Ser Lys Tyr Gly Lys Asp Trp Glu Arg Tyr Ile Ala
450 455 460
Ala Val Pro Tyr Arg Leu Leu Pro Gly Leu Phe
465 470 475
<210> 10
<211> 1428
<212> DNA
<213> Otolemur garnetti
<400> 10
atggccgcca agtctcaacc atctactcca aaaactaaat ctccaggttc tgtttccaac 60
ggtcaaacta cttcccaagg tcaatggggt agagcttggg aagttgattg gttctccttg 120
gcctccgtca tcttcttgtt gttgttcgcc ccattcattg tctactactt tatcatgact 180
tgtgaccaat actcttgtgc tttgactgct ccagttgttg acattgtcac tggtagaggt 240
agattgtccg acatctgggc cagaacccca tctgttaccg tcaaggccgc tcaagtctac 300
gccttgtggg ttaccttcca agttttgttg tacatgtggt tgccagactt ttgtcacaag 360
ttcttgccag gttacgttgg tggtatccaa gaaggtgccg ttactccagc tggtgttgtc 420
aacaagtacg gtattaacgg tttgcaagcc tggttgatca ctcacttgtt gtggtttgcc 480
aactctcact tgttgttctg gttctcccca actattattt tcgacaactg gattccattg 540
ttgtggtgtg ctaacatctt gggttacgct gtctctacct tcgctatgat caagggttac 600
ttctttccaa cctctgctca agactgtaaa ttcactggta acttcttcta caactacatg 660
atgggtattg aatttaaccc aagaattggt aagtggttcg attttaagtt gtttttcaac 720
ggtagaccag gtattgtcgc ttggactttg atcaacttgt ctttcgccgc taaacaacaa 780
gaattgtacg gtcacgttac taactccatg gttttggtca acgtcttgca agccatctac 840
gttttggatt tcttctggaa cgaaacttgg tacttgaaga ccatggatat ttgtcacgac 900
cacttcggtt ggtacttggg ttggggtgat tgtgtttggt tgccatactt gtacactttg 960
caaggtttgt acttggtcta ccacccagtc caattgtccc cagctcacgc cactggtgtt 1020
ttgttgttgg gtttgttggg ttactacatt ttcagaatgg ctaaccacca aaaggatttg 1080
ttcagaagaa ccgacggtag atgtttgatc tggggtagaa aaccaaaggc tatcgaatgt 1140
tcttacgtct ccgctgacgg tcaaaagcat cactctaaat tgttggtttc cggtttctgg 1200
ggtttggcta gacacttcaa ctacaccggt gacttgatgg gttctttggc ttactgtttg 1260
gcctgtggtg gtggtcactt gttgccatac ttctacatca tttacatggc tatcttgttg 1320
atccacagat gtttgagaga cgaacacaga tgtgcttcta agtacggtaa ggactgggaa 1380
agatacattg ccgctgttcc atacagattg ttgccaggtt tgttttaa 1428
<210> 11
<211> 471
<212> PRT
<213> Rattus norvegicus
<400> 11
Met Ala Ser Lys Ser Gln His Asn Ala Ser Lys Ala Lys Asn His Asn
1 5 10 15
Val Lys Ala Glu Ser Gln Gly Gln Trp Gly Arg Ala Trp Glu Val Asp
20 25 30
Trp Phe Ser Leu Val Ser Val Ile Phe Leu Leu Leu Phe Ala Pro Phe
35 40 45
Ile Val Tyr Tyr Phe Ile Met Ala Cys Asp Gln Tyr Ser Cys Ser Leu
50 55 60
Thr Ala Pro Ile Leu Asp Val Ala Thr Gly Arg Ala Ser Leu Ala Asp
65 70 75 80
Ile Trp Ala Lys Thr Pro Pro Val Thr Ala Lys Ala Ala Gln Leu Tyr
85 90 95
Ala Leu Trp Val Ser Phe Gln Val Leu Leu Tyr Ser Trp Leu Pro Asp
100 105 110
Phe Cys His Arg Phe Leu Pro Gly Tyr Val Gly Gly Val Gln Glu Gly
115 120 125
Ala Ile Thr Pro Ala Gly Ile Val Asn Lys Tyr Glu Val Asn Gly Leu
130 135 140
Gln Ala Trp Leu Ile Thr His Phe Leu Trp Phe Val Asn Ala Tyr Leu
145 150 155 160
Leu Ser Trp Phe Ser Pro Thr Ile Ile Phe Asp Asn Trp Ile Pro Leu
165 170 175
Leu Trp Cys Ala Asn Ile Leu Gly Tyr Ala Val Ser Thr Phe Ala Met
180 185 190
Ile Lys Gly Tyr Leu Phe Pro Thr Ser Ala Glu Asp Cys Lys Phe Thr
195 200 205
Gly Asn Phe Phe Tyr Asn Tyr Met Met Gly Ile Glu Phe Asn Pro Arg
210 215 220
Ile Gly Lys Trp Phe Asp Phe Lys Leu Phe Phe Asn Gly Arg Pro Gly
225 230 235 240
Ile Val Ala Trp Thr Leu Ile Asn Leu Ser Phe Ala Ala Lys Gln Gln
245 250 255
Glu Leu Tyr Gly His Val Thr Asn Ser Met Ile Leu Val Asn Val Leu
260 265 270
Gln Ala Ile Tyr Val Leu Asp Phe Phe Trp Asn Glu Thr Trp Tyr Leu
275 280 285
Lys Thr Ile Asp Ile Cys His Asp His Phe Gly Trp Tyr Leu Gly Trp
290 295 300
Gly Asp Cys Val Trp Leu Pro Tyr Leu Tyr Thr Leu Gln Gly Leu Tyr
305 310 315 320
Leu Val Tyr His Pro Val Gln Leu Ser Thr Pro Asn Ala Leu Gly Val
325 330 335
Leu Leu Leu Gly Leu Val Gly Tyr Tyr Ile Phe Arg Met Thr Asn His
340 345 350
Gln Lys Asp Leu Phe Arg Arg Thr Asp Gly His Cys Leu Ile Trp Gly
355 360 365
Lys Lys Pro Lys Ala Ile Glu Cys Ser Tyr Thr Ser Ala Asp Gly Leu
370 375 380
Lys His Arg Ser Lys Leu Leu Val Ser Gly Phe Trp Gly Val Ala Arg
385 390 395 400
His Phe Asn Tyr Thr Gly Asp Leu Met Gly Ser Leu Ala Tyr Cys Leu
405 410 415
Ala Cys Gly Gly Gly His Leu Leu Pro Tyr Phe Tyr Ile Ile Tyr Met
420 425 430
Thr Ile Leu Leu Thr His Arg Cys Leu Arg Asp Glu His Arg Cys Ala
435 440 445
Asn Lys Tyr Gly Arg Asp Trp Glu Arg Tyr Val Ala Ala Val Pro Tyr
450 455 460
Arg Leu Leu Pro Gly Ile Phe
465 470
<210> 12
<211> 1416
<212> DNA
<213> Rattus norvegicus
<400> 12
atggcttcta aatcccaaca taacgcttct aaggctaaga accacaacgt caaggctgaa 60
tcccaaggtc aatggggtag agcctgggaa gttgactggt tctctttggt ttccgttatt 120
ttcttgttgt tgttcgctcc attcatcgtt tactacttca ttatggcctg tgatcaatac 180
tcttgttctt tgaccgcccc aatcttggac gttgctactg gtagagcttc tttggctgat 240
atctgggcta agaccccacc agttactgct aaagccgctc aattgtacgc tttgtgggtc 300
tctttccaag ttttgttgta ctcttggttg ccagacttct gtcacagatt cttgccaggt 360
tacgtcggtg gtgttcaaga aggtgctatt accccagccg gtatcgtcaa caagtacgaa 420
gtcaacggtt tgcaagcctg gttgatcact cacttcttgt ggttcgtcaa cgcttacttg 480
ttgtcttggt tctctccaac catcatcttc gataactgga ttccattgtt gtggtgtgct 540
aacatcttgg gttacgctgt ctctaccttt gccatgatta agggttactt gttcccaact 600
tctgccgaag actgtaagtt cactggtaac ttcttttaca actacatgat gggtatcgaa 660
tttaacccaa gaattggtaa gtggtttgac tttaagttgt tcttcaacgg tagaccaggt 720
atcgtcgctt ggactttgat taacttgtcc ttcgccgcta aacaacaaga attgtacggt 780
cacgttacca actccatgat cttggtcaac gtcttgcaag ctatttacgt tttggacttc 840
ttctggaacg aaacctggta cttgaagacc atcgacatct gtcacgacca cttcggttgg 900
tacttgggtt ggggtgactg tgtttggttg ccatacttgt acactttgca aggtttgtac 960
ttggtttacc acccagtcca attgtctact ccaaacgcct tgggtgtctt gttgttgggt 1020
ttggttggtt actacatttt cagaatgact aaccaccaaa aggacttgtt cagaagaacc 1080
gacggtcact gtttgatctg gggtaagaag ccaaaagcta ttgaatgttc ctacacttct 1140
gctgatggtt tgaagcacag atccaagttg ttggtttctg gtttctgggg tgttgctaga 1200
cacttcaact acactggtga cttgatgggt tccttggctt actgtttggc ctgtggtggt 1260
ggtcacttgt tgccatactt ctacatcatt tacatgacta tcttgttgac tcatagatgt 1320
ttgagagacg aacatagatg tgctaacaaa tacggtagag actgggaaag atacgtcgcc 1380
gctgtcccat acagattgtt gccaggtatc ttctaa 1416
<210> 13
<211> 516
<212> PRT
<213> Ailuropoda melanoleuca
<400> 13
Met Glu Pro Ala Val Ser Leu Ala Val Cys Ala Leu Leu Phe Leu Leu
1 5 10 15
Trp Val Arg Val Lys Gly Leu Glu Phe Val Leu Ile His Gln Arg Trp
20 25 30
Val Phe Val Cys Leu Phe Leu Leu Pro Leu Ser Leu Ile Phe Asp Ile
35 40 45
Tyr Tyr Tyr Val Arg Ala Trp Val Val Phe Lys Leu Ser Ser Ala Pro
50 55 60
Arg Leu His Gly Gln Arg Val Arg Asp Ile Gln Lys Gln Val Arg Glu
65 70 75 80
Trp Lys Glu Gln Gly Ser Lys Thr Phe Met Cys Thr Gly Arg Pro Gly
85 90 95
Trp Leu Thr Val Ser Leu Arg Val Gly Lys Tyr Lys Lys Thr His Lys
100 105 110
Asn Ile Met Ile Asn Leu Met Asp Ile Leu Glu Val Asp Thr Lys Lys
115 120 125
Gln Ile Val Arg Val Glu Pro Leu Val Thr Met Gly Gln Val Thr Ala
130 135 140
Leu Leu Thr Ser Ile Gly Trp Thr Leu Pro Val Leu Pro Glu Leu Asp
145 150 155 160
Asp Leu Thr Val Gly Gly Leu Ile Met Gly Thr Gly Ile Glu Ser Ser
165 170 175
Ser His Lys Tyr Gly Leu Phe Gln His Ile Cys Thr Ala Tyr Glu Leu
180 185 190
Val Leu Ala Asp Gly Ser Phe Val Arg Cys Thr Pro Ser Glu Asn Ser
195 200 205
Asp Leu Phe Tyr Ala Val Pro Trp Ser Cys Gly Thr Leu Gly Phe Leu
210 215 220
Val Ala Ala Glu Ile Arg Ile Ile Pro Ala Lys Lys Tyr Val Lys Leu
225 230 235 240
Arg Phe Glu Pro Val Trp Gly Leu Glu Ala Ile Cys Glu Lys Phe Thr
245 250 255
Arg Glu Ser Gln Arg Pro Glu Asn Asp Phe Val Glu Gly Leu Leu Tyr
260 265 270
Ser Leu Asp Lys Ala Val Ile Met Thr Gly Val Met Thr Asp Glu Ala
275 280 285
Glu Pro Ser Lys Leu Asn Ser Ile Gly Asn Tyr Tyr Lys Pro Trp Phe
290 295 300
Phe Lys His Val Glu Arg Tyr Leu Lys Thr Ser Arg Glu Gly Leu Glu
305 310 315 320
Tyr Ile Pro Leu Arg His Tyr Tyr His Arg His Thr Arg Ser Ile Phe
325 330 335
Trp Glu Leu Gln Asp Ile Ile Pro Phe Gly Asn Asn Pro Val Phe Arg
340 345 350
Tyr Leu Phe Gly Trp Met Val Pro Pro Lys Ile Ser Leu Leu Lys Leu
355 360 365
Thr Gln Gly Glu Thr Leu Arg Lys Leu Tyr Glu Gln His His Val Val
370 375 380
Gln Asp Met Leu Val Pro Met Arg Cys Leu Ser Gln Ala Val His Thr
385 390 395 400
Phe His Asn Asp Ile His Val Tyr Pro Ile Trp Leu Cys Pro Phe Ile
405 410 415
Leu Pro Ser Gln Pro Gly Leu Val His Pro Lys Gly Asp Glu Thr Glu
420 425 430
Leu Tyr Val Asp Ile Gly Ala Tyr Gly Glu Pro Arg Val Lys His Phe
435 440 445
Glu Ala Arg Ser Cys Met Arg Gln Leu Glu Lys Phe Val Arg Ser Val
450 455 460
His Gly Phe Gln Met Leu Tyr Ala Asp Cys Tyr Met Ser Arg Glu Glu
465 470 475 480
Phe Trp Glu Met Phe Asp Gly Ser Leu Tyr His Arg Leu Arg Glu Arg
485 490 495
Leu Gly Cys Gln Asp Ala Phe Pro Glu Val Tyr Asp Lys Ile Cys Lys
500 505 510
Ala Ala Arg His
515
<210> 14
<211> 1551
<212> DNA
<213> Giant panda (Ailuropoda melanoleuca)
<400> 14
atggagcccg cggtgtcgct ggccgtgtgc gcgctgctct tcctgctctg ggtccgcgtg 60
aaggggctgg agttcgtgct catccaccag cgctgggtgt tcgtgtgcct cttcctcctg 120
ccgctctcgc tgatcttcga catctactac tacgtgcgcg cctgggtggt gttcaagctc 180
agcagcgcgc ctcggctgca cgggcagcgc gtgcgggaca tccagaagca ggtgcgggaa 240
tggaaggagc aggggagcaa gactttcatg tgcacgggac gccctggctg gctcactgtc 300
tcgctgcggg ttgggaagta caagaagacg cacaaaaaca tcatgatcaa cctgatggac 360
attctggagg tggacaccaa gaaacagatt gtccgtgtgg agcccttggt gactatgggt 420
caggtgactg ccctgctgac ctccattggc tggacgctgc ctgtgttgcc cgagctcgat 480
gacctcacag tggggggctt gatcatgggc acgggcatcg agtcatcgtc ccacaagtac 540
gggctgttcc agcacatttg cactgcctac gagctggtcc tggccgacgg cagctttgtg 600
cggtgcacgc cgtcggaaaa ctcggacctg ttctatgctg tgccgtggtc ctgtgggacc 660
ctgggcttcc tggtggccgc cgagatccgc atcatccccg ccaagaagta cgtcaagctg 720
cggtttgagc cagtgtgggg cctggaggct atctgcgaaa agttcacccg tgagtcccag 780
cggccggaga acgacttcgt ggaagggctg ctctactccc tggataaggc tgtcatcatg 840
acgggggtca tgacagatga ggcagagccc agcaagctga atagcattgg caactactac 900
aagccctggt tcttcaagca cgtggagcgc tacctgaaga cgagccgcga gggcctggag 960
tacatccctc tgagacacta ctaccaccgt cacacgcgca gcatcttctg ggagctccag 1020
gacatcatcc cctttggcaa caaccccgtc ttccgctacc tctttggttg gatggtgccg 1080
cccaagatct ccctgctgaa gctgacccag ggcgagaccc tgcgcaagct gtacgagcag 1140
caccacgtgg tgcaggacat gctggtgccc atgaggtgcc tgtcgcaggc ggtgcacacc 1200
ttccacaacg acatccacgt ctaccccatc tggctgtgcc cattcatcct gcccagccag 1260
ccgggcctgg tgcaccccaa gggagatgag accgagctct acgtcgacat tggagcatat 1320
ggggagccac gcgtgaagca ctttgaagca aggtcctgca tgcggcagtt ggagaagttt 1380
gtccgaagcg tgcatgggtt ccagatgctg tatgccgact gctacatgag ccgggaggag 1440
ttctgggaga tgttcgacgg ctccctgtac cacaggctgc gggagcggct cggttgccag 1500
gacgccttcc ccgaggtgta cgacaagatc tgcaaggccg ccaggcactg a 1551
<210> 15
<211> 1551
<212> DNA
<213> Giant panda (Ailuropoda melanoleuca)
<400> 15
atggaaccag ctgtctcctt ggccgtctgt gctttgttgt tcttgttgtg ggtcagagtc 60
aaaggtttgg aatttgtttt gatccaccaa agatgggtct ttgtttgttt gttcttgttg 120
ccattgtctt tgatcttcga catttactac tacgttagag cttgggttgt cttcaagttg 180
tcctctgctc caagattgca tggtcaaaga gttagagata ttcaaaagca agtcagagaa 240
tggaaggaac aaggttctaa gacttttatg tgtactggta gaccaggttg gttgactgtc 300
tctttgagag ttggtaagta caagaagact cacaagaaca tcatgattaa cttgatggac 360
attttggaag ttgataccaa gaagcaaatt gttagagttg aaccattggt tactatgggt 420
caagttaccg ctttgttgac ctctatcggt tggaccttgc cagtcttgcc agaattggat 480
gacttgactg ttggtggttt gattatgggt actggtatcg aatcttcttc tcataagtac 540
ggtttgttcc aacacatttg taccgcttac gaattggtct tggctgatgg ttccttcgtc 600
agatgtactc catccgaaaa ctctgatttg ttctacgctg tcccatggtc ttgtggtact 660
ttgggtttct tggtcgctgc tgaaattaga atcatcccag ccaagaagta cgtcaaattg 720
agatttgaac cagtctgggg tttggaagct atttgtgaaa agttcactag agaatctcaa 780
agaccagaaa acgatttcgt tgaaggtttg ttgtactctt tggacaaggc tgtcattatg 840
actggtgtta tgactgatga agctgaacca tctaagttga actccatcgg taactactac 900
aagccatggt tctttaagca tgttgaaaga tacttgaaga cttccagaga aggtttggaa 960
tacatcccat tgagacatta ctaccacaga cacactagat ccattttctg ggaattgcaa 1020
gacattatcc cattcggtaa caacccagtt ttcagatact tgttcggttg gatggttcca 1080
ccaaagattt ctttgttgaa gttgactcaa ggtgaaacct tgagaaagtt gtacgaacaa 1140
catcacgttg ttcaagatat gttggtccca atgagatgtt tgtcccaagc tgttcatacc 1200
ttccataacg atattcatgt ctacccaatc tggttgtgtc cattcatctt gccatcccaa 1260
ccaggtttgg tccatccaaa aggtgacgaa actgaattgt acgtcgatat cggtgcttac 1320
ggtgaaccaa gagttaagca ttttgaagct agatcctgta tgagacaatt ggaaaagttt 1380
gtcagatccg tccacggttt ccaaatgttg tacgctgact gttacatgtc cagagaagaa 1440
ttctgggaaa tgttcgacgg ttccttgtac cacagattga gagaaagatt gggttgtcaa 1500
gatgcttttc cagaagtcta cgacaagatt tgtaaagctg ccagacacta a 1551
<210> 16
<211> 1551
<212> DNA
<213> Giant panda (Ailuropoda melanoleuca)
<400> 16
atggaaccag ctgtttcttt ggctgtttgt gctttgttgt ttttgttgtg ggttagagtt 60
aaaggtttgg aatttgtttt gattcatcaa agatgggttt ttgtttgttt gtttttgttg 120
ccattgtctt tgatttttga tatttattat tatgttagag cttgggttgt ttttaaattg 180
tcttctgctc caagattgca tggtcaaaga gttagagata ttcaaaaaca agttagagaa 240
tggaaagaac aaggttctaa aacttttatg tgtactggta gaccaggttg gttgactgtt 300
tctttgagag ttggtaaata taaaaaaact cataaaaata ttatgattaa tttgatggat 360
attttggaag ttgatactaa aaaacaaatt gttagagttg aaccattggt tactatgggt 420
caagttactg ctttgttgac ttctattggt tggactttgc cagttttgcc agaattggat 480
gatttgacag ttggtggttt gattatgggt actggtattg aatcttcttc tcataaatat 540
ggtttgtttc aacatatttg tactgcttat gaattggttt tggctgatgg ttcttttgtt 600
agatgtactc catctgaaaa ttctgatttg ttttatgctg ttccatggtc ttgtggtact 660
ttgggttttt tggttgctgc tgaaattaga attattccag ctaaaaaata tgttaaattg 720
agatttgaac cagtttgggg tttggaagct atttgtgaaa aatttactag agaatctcaa 780
agaccagaaa atgattttgt tgaaggtttg ttgtattctt tggataaagc tgttattatg 840
actggtgtta tgacagatga agctgaacca tctaaattga attctattgg taattattat 900
aaaccatggt tttttaaaca tgttgaaaga tatttgaaaa cttctagaga aggtttggaa 960
tatattccat tgagacatta ttatcataga catactagat ctattttttg ggaattgcaa 1020
gatattattc catttggtaa taatccagtt tttagatatt tgtttggttg gatggttcca 1080
ccaaaaattt ctttgttgaa attgactcaa ggtgaaactt tgagaaaatt gtatgaacaa 1140
catcatgttg ttcaagatat gttggttcca atgagatgtt tgtctcaagc tgttcatact 1200
tttcataatg atattcatgt ttatccaatt tggttgtgtc catttatttt gccatctcaa 1260
ccaggtttgg ttcatccaaa aggtgatgaa actgaattgt atgttgatat tggtgcttat 1320
ggtgaaccaa gagttaaaca ttttgaagct agatcttgta tgagacaatt ggaaaaattt 1380
gttagatctg ttcatggttt tcaaatgttg tatgctgatt gttatatgtc tagagaagaa 1440
ttttgggaaa tgtttgatgg ttctttgtat catagattga gagaaagatt gggttgtcaa 1500
gatgcttttc cagaagttta tgataaaatt tgtaaagctg ctagacatta a 1551
<210> 17
<211> 561
<212> PRT
<213> Thale cress (Arabidopsis thaliana)
<400> 17
Met Ser Asp Leu Gln Thr Pro Leu Val Arg Pro Lys Arg Lys Lys Thr
1 5 10 15
Trp Val Asp Tyr Phe Val Lys Phe Arg Trp Ile Ile Val Ile Phe Ile
20 25 30
Val Leu Pro Phe Ser Ala Thr Phe Tyr Phe Leu Ile Tyr Leu Gly Asp
35 40 45
Met Trp Ser Glu Ser Lys Ser Phe Glu Lys Arg Gln Lys Glu His Asp
50 55 60
Glu Asn Val Lys Lys Val Ile Lys Arg Leu Lys Gly Arg Asp Ala Ser
65 70 75 80
Lys Asp Gly Leu Val Cys Thr Ala Arg Lys Pro Trp Ile Ala Val Gly
85 90 95
Met Arg Asn Val Asp Tyr Lys Arg Ala Arg His Phe Glu Val Asp Leu
100 105 110
Gly Glu Phe Arg Asn Ile Leu Glu Ile Asn Lys Glu Lys Met Thr Ala
115 120 125
Arg Val Glu Pro Leu Val Asn Met Gly Gln Ile Ser Arg Ala Thr Val
130 135 140
Pro Met Asn Leu Ser Leu Ala Val Val Ala Glu Leu Asp Asp Leu Thr
145 150 155 160
Val Gly Gly Leu Ile Asn Gly Tyr Gly Ile Glu Gly Ser Ser His Ile
165 170 175
Tyr Gly Leu Phe Ala Asp Thr Val Glu Ala Tyr Glu Ile Val Leu Ala
180 185 190
Gly Gly Glu Leu Val Arg Ala Thr Arg Asp Asn Glu Tyr Ser Asp Leu
195 200 205
Tyr Tyr Ala Ile Pro Trp Ser Gln Gly Thr Leu Gly Leu Leu Val Ala
210 215 220
Ala Glu Ile Arg Leu Ile Lys Val Lys Glu Tyr Met Arg Leu Thr Tyr
225 230 235 240
Ile Pro Val Lys Gly Asp Leu Gln Ala Leu Ala Gln Gly Tyr Ile Asp
245 250 255
Ser Phe Ala Pro Lys Asp Gly Asp Lys Ser Lys Ile Pro Asp Phe Val
260 265 270
Glu Gly Met Val Tyr Asn Pro Thr Glu Gly Val Met Met Val Gly Thr
275 280 285
Tyr Ala Ser Lys Glu Glu Ala Lys Lys Lys Gly Asn Lys Ile Asn Asn
290 295 300
Val Gly Trp Trp Phe Lys Pro Trp Phe Tyr Gln His Ala Gln Thr Ala
305 310 315 320
Leu Lys Lys Gly Gln Phe Val Glu Tyr Ile Pro Thr Arg Glu Tyr Tyr
325 330 335
His Arg His Thr Arg Cys Leu Tyr Trp Glu Gly Lys Leu Ile Leu Pro
340 345 350
Phe Gly Asp Gln Phe Trp Phe Arg Tyr Leu Leu Gly Trp Leu Met Pro
355 360 365
Pro Lys Val Ser Leu Leu Lys Ala Thr Gln Gly Glu Ala Ile Arg Asn
370 375 380
Tyr Tyr His Asp Met His Val Ile Gln Asp Met Leu Val Pro Leu Tyr
385 390 395 400
Lys Val Gly Asp Ala Leu Glu Trp Val His Arg Glu Met Glu Val Tyr
405 410 415
Pro Ile Trp Leu Cys Pro His Lys Leu Phe Lys Gln Pro Ile Lys Gly
420 425 430
Gln Ile Tyr Pro Glu Pro Gly Phe Glu Tyr Glu Asn Arg Gln Gly Asp
435 440 445
Thr Glu Asp Ala Gln Met Tyr Thr Asp Val Gly Val Tyr Tyr Ala Pro
450 455 460
Gly Cys Val Leu Arg Gly Glu Glu Phe Asp Gly Ser Glu Ala Val Arg
465 470 475 480
Arg Met Glu Lys Trp Leu Ile Glu Asn His Gly Phe Gln Pro Gln Tyr
485 490 495
Ala Val Ser Glu Leu Asp Glu Lys Ser Phe Trp Arg Met Phe Asn Gly
500 505 510
Glu Leu Tyr Glu Glu Cys Arg Lys Lys Tyr Arg Ala Ile Gly Thr Phe
515 520 525
Met Ser Val Tyr Tyr Lys Ser Lys Lys Gly Arg Lys Thr Glu Lys Glu
530 535 540
Val Arg Glu Ala Glu Gln Ala His Leu Glu Thr Ala Tyr Ala Glu Ala
545 550 555 560
Asp
<210> 18
<211> 1686
<212> DNA
<213> Thale cress (Arabidopsis thaliana)
<400> 18
atgtcggatc ttcagacacc gcttgtgagg cccaagagga agaagacttg ggttgattac 60
tttgtcaagt tcagatggat cattgtcatc ttcatcgtcc ttccattctc agccacattc 120
tacttcctca tctacctcgg ggacatgtgg tcagagtcca agtcctttga gaaacgtcag 180
aaggaacacg acgagaatgt caagaaagtc atcaaaaggc ttaagggtag ggatgcttcc 240
aaggacgggc ttgtctgcac tgctcgtaag ccctggatcg ctgttggaat gaggaacgtt 300
gactacaaga gagcccggca tttcgaggtt gacttggggg agttccgtaa catccttgag 360
atcaacaagg agaagatgac tgctagagtg gagcctcttg ttaacatggg acagatttcc 420
cgtgctaccg tcccaatgaa cctgtctctc gctgttgttg ctgagcttga tgaccttacc 480
gttggtggac ttatcaatgg atatggtatt gaaggaagct ctcacatcta cggtttgttt 540
gctgataccg ttgaggctta cgagattgtt cttgcgggtg gagagcttgt ccgcgccaca 600
agggataatg agtattctga tctttactac gcaatcccgt ggtcgcaagg aactcttgga 660
ctccttgtag ctgctgagat caggcttatt aaagtcaagg agtacatgag actcacttac 720
ataccagtca agggtgatct tcaagcctta gctcaaggtt acattgattc ttttgctccc 780
aaagacggtg acaagtcgaa aatcccggat ttcgtcgaag gcatggttta caatccaacg 840
gaaggagtga tgatggttgg aacatatgca tctaaagaag aggcaaagaa gaaagggaac 900
aaaatcaaca atgtgggatg gtggttcaag ccgtggttct accagcacgc gcagaccgcc 960
ctgaaaaagg gacagtttgt tgagtacatc ccaactcgtg aatactacca caggcacaca 1020
aggtgcttgt actgggaagg gaagcttatt cttccatttg gtgatcagtt ctggtttagg 1080
tacctcttag gttggttgat gcctccaaag gtctctcttc ttaaggccac tcaaggtgaa 1140
gctatcagga actattacca tgatatgcat gttattcagg atatgcttgt tcctctttac 1200
aaggttggcg atgcactcga atgggtccac cgcgaaatgg aggtgtatcc aatttggctt 1260
tgcccacaca aactcttcaa gcagccaatc aaaggccaaa tctacccaga gccaggcttc 1320
gagtacgaaa acagacaagg agacacagaa gatgcacaga tgtacactga tgttggagtc 1380
tactacgcac ctggctgtgt cctaagaggt gaagagtttg atggatcaga agcagtgcgt 1440
aggatggaga aatggctgat agagaaccat ggattccagc ctcagtacgc ggtgtctgag 1500
ctcgacgaga agagcttctg gagaatgttt aatggtgaat tgtatgagga gtgccgcaag 1560
aagtatagag ctattggaac gttcatgagt gtttactaca agtccaagaa aggaaggaag 1620
actgagaaag aagttagaga agccgaacaa gctcatctcg aaactgctta tgccgaggca 1680
gattaa 1686
<210> 19
<211> 1686
<212> DNA
<213> Thale cress (Arabidopsis thaliana)
<400> 19
atgtccgact tgcaaactcc attggttaga ccaaagagaa agaagacctg ggttgactac 60
ttcgttaagt ttagatggat catcgtcatc ttcatcgtct tgccattctc cgctactttc 120
tacttcttga tctacttggg tgatatgtgg tccgaatcta agtcttttga aaagagacaa 180
aaggaacacg atgaaaacgt taagaaggtt atcaagagat tgaaaggtag agacgcttcc 240
aaggacggtt tggtctgtac tgctagaaag ccatggattg ccgtcggtat gagaaacgtt 300
gattacaaaa gagccagaca ctttgaagtt gacttgggtg aatttagaaa catcttggaa 360
atcaacaagg aaaagatgac tgctagagtc gaaccattgg tcaacatggg tcaaatctct 420
agagctactg tcccaatgaa cttgtccttg gctgtcgttg ctgaattgga cgacttgacc 480
gttggtggtt tgatcaacgg ttacggtatc gaaggttctt ctcatattta cggtttgttc 540
gctgacaccg ttgaagccta cgaaatcgtc ttggccggtg gtgaattggt tagagctact 600
agagataacg aatactctga cttgtactac gctattccat ggtctcaagg tactttgggt 660
ttgttggttg ctgctgaaat cagattgatc aaggttaagg aatacatgag attgacctac 720
attccagtca agggtgactt gcaagccttg gctcaaggtt acattgactc tttcgctcca 780
aaggatggtg acaaatctaa gatcccagac ttcgttgaag gtatggtcta caacccaacc 840
gaaggtgtta tgatggtcgg tacttacgct tctaaagaag aagctaagaa gaagggtaac 900
aagatcaaca acgtcggttg gtggttcaag ccatggttct accaacacgc tcaaactgct 960
ttgaagaagg gtcaatttgt cgaatacatc ccaactagag aatactacca cagacacact 1020
agatgtttgt actgggaagg taaattgatt ttgccattcg gtgaccaatt ctggtttaga 1080
tacttgttgg gttggttgat gccaccaaag gtctctttgt tgaaggccac ccaaggtgaa 1140
gctattagaa actactacca cgatatgcac gttatccaag atatgttggt tccattgtac 1200
aaggttggtg atgctttgga atgggttcat agagaaatgg aagtctaccc aatctggttg 1260
tgtccacaca aattgttcaa gcaaccaatc aagggtcaaa tctacccaga accaggtttt 1320
gaatacgaaa acagacaagg tgacactgaa gacgctcaaa tgtacactga cgttggtgtt 1380
tactacgctc caggttgtgt tttgagaggt gaagaatttg atggttctga agccgttaga 1440
agaatggaaa agtggttgat cgaaaaccat ggttttcaac cacaatacgc tgtttccgaa 1500
ttggatgaaa agtccttctg gagaatgttc aacggtgaat tgtacgaaga atgtagaaag 1560
aaatacagag ccattggtac ttttatgtct gtctactaca agtctaagaa gggtagaaag 1620
accgaaaaag aagtcagaga agccgaacaa gctcacttgg aaactgccta cgctgaagct 1680
gattag 1686
<210> 20
<211> 1686
<212> DNA
<213> Thale cress (Arabidopsis thaliana)
<400> 20
atgtctgatt tgcaaacacc attggttaga ccaaaaagaa aaaaaacttg ggttgattat 60
tttgttaaat ttagatggat tattgttatt tttattgttt tgccattttc tgctacattt 120
tattttttga tttatttggg tgatatgtgg tctgaatcta aatcttttga aaaaagacaa 180
aaagaacatg atgaaaatgt taaaaaagtt attaaaagat tgaaaggtag agatgcttct 240
aaagatggtt tggtttgtac tgctagaaaa ccatggattg ctgttggtat gagaaatgtt 300
gattataaaa gagctagaca ttttgaagtt gatttgggtg aatttagaaa tattttggaa 360
attaataaag aaaaaatgac tgctagagtt gaaccattgg ttaatatggg tcaaatttct 420
agagctactg ttccaatgaa tttgtctttg gctgttgttg ctgaattgga tgatttgact 480
gttggtggtt tgattaatgg ttatggtatt gaaggttctt ctcatattta tggtttgttt 540
gctgatactg ttgaagctta tgaaattgtt ttggctggtg gtgaattggt tagagctaca 600
agagataatg aatattctga tttgtattat gctattccat ggtctcaagg tactttgggt 660
ttgttggttg ctgctgaaat tagattgatt aaagttaaag aatatatgag attgacttat 720
attccagtta aaggtgattt gcaagcttta gctcaaggtt atattgattc ttttgctcca 780
aaagatggtg ataaatctaa aattccagat tttgttgaag gtatggttta taatccaact 840
gaaggtgtta tgatggttgg tacatatgct tctaaagaag aagctaaaaa aaaaggtaat 900
aaaattaata atgttggttg gtggtttaaa ccatggtttt atcaacatgc tcaaactgct 960
ttgaaaaaag gtcaatttgt tgaatatatt ccaactagag aatattatca tagacataca 1020
agatgtttgt attgggaagg taaattgatt ttgccatttg gtgatcaatt ttggtttaga 1080
tatttgttag gttggttgat gccaccaaaa gtttctttgt tgaaagctac tcaaggtgaa 1140
gctattagaa attattatca tgatatgcat gttattcaag atatgttggt tccattgtat 1200
aaagttggtg atgctttgga atgggttcat agagaaatgg aagtttatcc aatttggttg 1260
tgtccacata aattgtttaa acaaccaatt aaaggtcaaa tttatccaga accaggtttt 1320
gaatatgaaa atagacaagg tgatacagaa gatgctcaaa tgtatactga tgttggtgtt 1380
tattatgctc caggttgtgt tttgagaggt gaagaatttg atggttctga agctgttaga 1440
agaatggaaa aatggttgat tgaaaatcat ggttttcaac cacaatatgc tgtttctgaa 1500
ttggatgaaa aatctttttg gagaatgttt aatggtgaat tgtatgaaga atgtagaaaa 1560
aaatatagag ctattggtac ttttatgtct gtttattata aatctaaaaa aggtagaaaa 1620
actgaaaaag aagttagaga agctgaacaa gctcatttgg aaactgctta tgctgaagct 1680
gattaa 1686
<210> 21
<211> 516
<212> PRT
<213> Domestic cattle (Bos taurus)
<400> 21
Met Glu Pro Ala Val Ser Leu Ala Val Cys Ala Leu Leu Phe Leu Leu
1 5 10 15
Trp Val Arg Val Lys Gly Leu Glu Phe Val Leu Ile His Gln Arg Trp
20 25 30
Val Phe Val Cys Leu Phe Leu Leu Pro Leu Ser Leu Ile Phe Asp Ile
35 40 45
Tyr Tyr Tyr Val Arg Ala Trp Val Val Phe Lys Leu Ser Ser Ala Pro
50 55 60
Arg Leu His Glu Gln Arg Val Arg Asp Ile Gln Lys Gln Val Arg Glu
65 70 75 80
Trp Lys Glu Gln Gly Ser Lys Thr Phe Met Cys Thr Gly Arg Pro Gly
85 90 95
Trp Leu Thr Val Ser Leu Arg Val Gly Lys Tyr Lys Lys Thr His Lys
100 105 110
Asn Ile Met Ile Asn Leu Met Asp Ile Leu Glu Val Asp Thr Lys Lys
115 120 125
Gln Ile Val Arg Val Glu Pro Leu Val Thr Met Gly Gln Val Thr Ala
130 135 140
Leu Leu Thr Ser Ile Gly Trp Thr Leu Pro Val Leu Pro Glu Leu Asp
145 150 155 160
Asp Leu Thr Val Gly Gly Leu Ile Met Gly Thr Gly Ile Glu Ser Ser
165 170 175
Ser His Arg Tyr Gly Leu Phe Gln His Ile Cys Thr Ala Tyr Glu Leu
180 185 190
Val Leu Ala Asp Gly Ser Phe Val Arg Cys Thr Pro Met Glu Asn Ser
195 200 205
Asp Leu Phe Tyr Ala Val Pro Trp Ser Cys Gly Thr Leu Gly Phe Leu
210 215 220
Val Ala Ala Glu Ile Arg Ile Ile Pro Ala Lys Lys Tyr Ile Lys Leu
225 230 235 240
Arg Phe Glu Pro Val Arg Gly Leu Glu Ala Ile Cys Asp Lys Phe Thr
245 250 255
His Glu Ser Gln Gln Pro Glu Asn His Phe Val Glu Gly Leu Leu Tyr
260 265 270
Ser Leu His Glu Ala Val Ile Met Thr Gly Val Met Thr Asp Glu Ala
275 280 285
Glu Pro Ser Lys Leu Asn Ser Ile Gly Asn Tyr Tyr Lys Pro Trp Phe
290 295 300
Phe Lys His Val Glu Asn Tyr Leu Lys Thr Asn Arg Glu Gly Leu Glu
305 310 315 320
Tyr Ile Pro Leu Arg His Tyr Tyr His Arg His Thr Arg Ser Ile Phe
325 330 335
Trp Glu Leu Gln Asp Ile Ile Pro Phe Gly Asn Asn Pro Ile Phe Arg
340 345 350
Tyr Leu Phe Gly Trp Met Val Pro Pro Lys Ile Ser Leu Leu Lys Leu
355 360 365
Thr Gln Gly Glu Thr Leu Arg Lys Leu Tyr Glu Gln His His Val Val
370 375 380
Gln Asp Met Leu Val Pro Met Lys Cys Leu Pro Gln Ala Leu His Thr
385 390 395 400
Phe His Asn Asp Ile His Val Tyr Pro Ile Trp Leu Cys Pro Phe Ile
405 410 415
Leu Pro Ser Gln Pro Gly Leu Val His Pro Lys Gly Asp Glu Ala Glu
420 425 430
Leu Tyr Val Asp Ile Gly Ala Tyr Gly Glu Pro Arg Val Lys His Phe
435 440 445
Glu Ala Arg Ser Cys Met Arg Gln Leu Glu Lys Phe Val Arg Ser Val
450 455 460
His Gly Phe Gln Met Leu Tyr Ala Asp Cys Tyr Met Asp Arg Glu Glu
465 470 475 480
Phe Trp Glu Met Phe Asp Gly Ser Leu Tyr His Arg Leu Arg Lys Gln
485 490 495
Leu Gly Cys Gln Asp Ala Phe Pro Glu Val Tyr Asp Lys Ile Cys Lys
500 505 510
Ala Ala Arg His
515
<210> 22
<211> 1551
<212> DNA
<213> Domestic cattle (Bos taurus)
<400> 22
atggagcccg ctgtgtcgct ggccgtgtgc gcgctgctct tcctgctctg ggttcgggtg 60
aaggggctgg agttcgttct catccaccag cgctgggtgt ttgtgtgcct cttcctccta 120
cctctctcgc tcatcttcga catctactac tacgtgcgcg cctgggtggt gttcaagctc 180
agcagcgcac cgcggctgca cgaacagcgc gtgcgggaca tccagaaaca ggtgcgggaa 240
tggaaggagc agggcagcaa gaccttcatg tgcacggggc gacctggctg gctcactgtt 300
tcactgcggg ttgggaagta caagaagaca cacaaaaaca taatgatcaa cctgatggac 360
attctggagg tggacaccaa gaaacagatt gtccgagtgg agcccttggt gaccatgggt 420
caggtgactg ccctgctgac ctccattggc tggactctgc ctgtgttgcc cgagctggat 480
gacctcacag tgggaggact gatcatgggc acaggcatcg agtcttcgtc ccataggtat 540
ggcttgttcc agcacatctg caccgcctat gagctggtct tggctgatgg cagctttgtg 600
cgatgtacac cgatggaaaa ctcagacctg ttctacgctg tgccctggtc ctgcgggact 660
ctgggcttcc tggtggctgc cgagatccgc atcatccctg ccaagaagta catcaagctg 720
cggtttgagc cggtgcgcgg cctggaggcc atctgtgaca agttcaccca cgagtcccag 780
cagccggaga accacttcgt ggaagggctg ctctactctc tgcacgaggc cgtcatcatg 840
acgggggtca tgacggacga ggcagagccc agcaagctga acagcattgg caactactac 900
aagccctggt tcttcaagca cgtggagaac tacctgaaga caaaccgaga gggcctggag 960
tacatcccct tgagacacta ctatcaccgc cacacgcgca gcatcttctg ggagctccag 1020
gacatcatcc cctttggcaa caaccccatc ttccgctacc tctttggttg gatggtgcct 1080
cccaagatct ccctgctgaa gctgacccag ggcgagacgc tgcgcaagct gtacgagcag 1140
caccacgtgg tacaggacat gctggtgccc atgaagtgcc tgccgcaggc cctgcacacc 1200
ttccacaacg acatccacgt ctaccccatc tggctgtgcc cattcatcct gcccagccag 1260
ccgggcctgg tgcaccccaa gggagatgag gccgagctct atgtcgatat cggtgcctac 1320
ggggagccac gtgtgaagca ttttgaagcc cggtcctgca tgaggcagtt ggagaagttt 1380
gtccgaagtg tgcacgggtt ccagatgctg tatgccgact gctacatgga ccgggaggag 1440
ttctgggaga tgttcgacgg ctccctgtac cacaggctgc ggaagcagct cggctgccag 1500
gatgccttcc ctgaggtcta cgacaagatc tgcaaggctg ccaggcactg a 1551
<210> 23
<211> 1551
<212> DNA
<213> Domestic cattle (Bos taurus)
<400> 23
atggaaccag ctgtttcctt ggctgtctgt gctttgttgt ttttgttgtg ggttagagtc 60
aaaggtttgg aatttgtttt gattcaccaa agatgggtct tcgtctgttt gttcttgttg 120
ccattgtcct tgattttcga tatttactac tacgttagag cttgggttgt cttcaagttg 180
tcttccgctc caagattgca cgaacaaaga gtcagagaca ttcaaaagca agtcagagaa 240
tggaaggaac aaggttctaa gactttcatg tgtactggta gaccaggttg gttgactgtt 300
tctttgagag ttggtaagta caagaagacc cacaagaaca ttatgatcaa cttgatggac 360
attttggaag ttgacactaa gaaacaaatt gttagagtcg aaccattggt taccatgggt 420
caagttactg ctttgttgac ttctatcggt tggactttgc cagtcttgcc agaattggat 480
gacttgaccg ttggtggttt gattatgggt actggtattg aatcttcttc tcacagatac 540
ggtttgtttc aacacatctg taccgcttac gaattggtct tggctgatgg ttctttcgtc 600
agatgtactc caatggaaaa ctctgacttg ttctacgctg ttccatggtc ttgtggtact 660
ttgggtttct tggttgccgc tgaaatcaga attattccag ccaagaagta catcaagttg 720
agatttgaac cagttagagg tttggaagcc atctgtgaca aattcaccca cgaatctcaa 780
caaccagaaa accactttgt cgaaggtttg ttgtactctt tgcatgaagc tgtcattatg 840
actggtgtca tgactgatga agccgaacca tccaaattga actctattgg taactactac 900
aagccatggt tctttaagca cgtcgaaaac tacttgaaga ctaacagaga aggtttggaa 960
tacatcccat tgagacatta ctaccacaga cacactagat ccatcttctg ggaattgcaa 1020
gacatcatcc catttggtaa caacccaatt tttagatact tgttcggttg gatggtccca 1080
ccaaagattt ctttgttgaa gttgacccaa ggtgaaactt tgagaaagtt gtacgaacaa 1140
caccatgtcg tccaagatat gttggtccca atgaagtgtt tgccacaagc cttgcacacc 1200
ttccataacg atattcatgt ctacccaatt tggttgtgtc cattcatctt gccatctcaa 1260
ccaggtttgg ttcatccaaa aggtgacgaa gccgaattgt acgtcgatat tggtgcttac 1320
ggtgaaccaa gagttaagca ctttgaagct agatcttgta tgagacaatt ggaaaagttc 1380
gttagatccg tccacggttt ccaaatgttg tacgctgact gttacatgga tagagaagaa 1440
ttttgggaaa tgttcgatgg ttctttgtac cacagattga gaaagcaatt gggttgtcaa 1500
gatgcctttc cagaagttta cgacaagatt tgtaaggctg ctagacacta a 1551
<210> 24
<211> 1551
<212> DNA
<213> Domestic cattle (Bos taurus)
<400> 24
atggaaccag ctgtttcttt ggctgtttgt gctttgttgt ttttgttgtg ggttagagtt 60
aaaggtttgg aatttgtttt gattcatcaa agatgggttt ttgtttgttt gtttttgttg 120
ccattgtctt tgatttttga tatttattat tatgttagag cttgggttgt ttttaaattg 180
tcttctgctc caagattgca tgaacaaaga gttagagata ttcaaaaaca agttagagaa 240
tggaaagaac aaggttctaa aacttttatg tgtactggta gaccaggttg gttgactgtt 300
tctttgagag ttggtaaata taaaaaaaca cataaaaata ttatgattaa tttgatggat 360
attttggaag ttgatactaa aaaacaaatt gttagagttg aaccattggt tactatgggt 420
caagttactg ctttgttgac ttctattggt tggactttgc cagttttgcc agaattggat 480
gatttgacag ttggtggttt gattatgggt acaggtattg aatcttcttc tcatagatat 540
ggtttgtttc aacatatttg tactgcttat gaattggttt tggctgatgg ttcttttgtt 600
agatgtacac caatggaaaa ttctgatttg ttttatgctg ttccatggtc ttgtggtact 660
ttgggttttt tggttgctgc tgaaattaga attattccag ctaaaaaata tattaaattg 720
agatttgaac cagttagagg tttggaagct atttgtgata aatttactca tgaatctcaa 780
caaccagaaa atcattttgt tgaaggtttg ttgtattctt tgcatgaagc tgttattatg 840
actggtgtta tgactgatga agctgaacca tctaaattga attctattgg taattattat 900
aaaccatggt tttttaaaca tgttgaaaat tatttgaaaa caaatagaga aggtttggaa 960
tatattccat tgagacatta ttatcataga catactagat ctattttttg ggaattgcaa 1020
gatattattc catttggtaa taatccaatt tttagatatt tgtttggttg gatggttcca 1080
ccaaaaattt ctttgttgaa attgactcaa ggtgaaactt tgagaaaatt gtatgaacaa 1140
catcatgttg ttcaagatat gttggttcca atgaaatgtt tgccacaagc tttgcatact 1200
tttcataatg atattcatgt ttatccaatt tggttgtgtc catttatttt gccatctcaa 1260
ccaggtttgg ttcatccaaa aggtgatgaa gctgaattgt atgttgatat tggtgcttat 1320
ggtgaaccaa gagttaaaca ttttgaagct agatcttgta tgagacaatt ggaaaaattt 1380
gttagatctg ttcatggttt tcaaatgttg tatgctgatt gttatatgga tagagaagaa 1440
ttttgggaaa tgtttgatgg ttctttgtat catagattga gaaaacaatt gggttgtcaa 1500
gatgcttttc cagaagttta tgataaaatt tgtaaagctg ctagacatta a 1551
<210> 25
<211> 516
<212> PRT
<213> Human (Homo sapiens)
<400> 25
Met Glu Pro Ala Val Ser Leu Ala Val Cys Ala Leu Leu Phe Leu Leu
1 5 10 15
Trp Val Arg Leu Lys Gly Leu Glu Phe Val Leu Ile His Gln Arg Trp
20 25 30
Val Phe Val Cys Leu Phe Leu Leu Pro Leu Ser Leu Ile Phe Asp Ile
35 40 45
Tyr Tyr Tyr Val Arg Ala Trp Val Val Phe Lys Leu Ser Ser Ala Pro
50 55 60
Arg Leu His Glu Gln Arg Val Arg Asp Ile Gln Lys Gln Val Arg Glu
65 70 75 80
Trp Lys Glu Gln Gly Ser Lys Thr Phe Met Cys Thr Gly Arg Pro Gly
85 90 95
Trp Leu Thr Val Ser Leu Arg Val Gly Lys Tyr Lys Lys Thr His Lys
100 105 110
Asn Ile Met Ile Asn Leu Met Asp Ile Leu Glu Val Asp Thr Lys Lys
115 120 125
Gln Ile Val Arg Val Glu Pro Leu Val Thr Met Gly Gln Val Thr Ala
130 135 140
Leu Leu Thr Ser Ile Gly Trp Thr Leu Pro Val Leu Pro Glu Leu Asp
145 150 155 160
Asp Leu Thr Val Gly Gly Leu Ile Met Gly Thr Gly Ile Glu Ser Ser
165 170 175
Ser His Lys Tyr Gly Leu Phe Gln His Ile Cys Thr Ala Tyr Glu Leu
180 185 190
Val Leu Ala Asp Gly Ser Phe Val Arg Cys Thr Pro Ser Glu Asn Ser
195 200 205
Asp Leu Phe Tyr Ala Val Pro Trp Ser Cys Gly Thr Leu Gly Phe Leu
210 215 220
Val Ala Ala Glu Ile Arg Ile Ile Pro Ala Lys Lys Tyr Val Lys Leu
225 230 235 240
Arg Phe Glu Pro Val Arg Gly Leu Glu Ala Ile Cys Ala Lys Phe Thr
245 250 255
His Glu Ser Gln Arg Gln Glu Asn His Phe Val Glu Gly Leu Leu Tyr
260 265 270
Ser Leu Asp Glu Ala Val Ile Met Thr Gly Val Met Thr Asp Glu Ala
275 280 285
Glu Pro Ser Lys Leu Asn Ser Ile Gly Asn Tyr Tyr Lys Pro Trp Phe
290 295 300
Phe Lys His Val Glu Asn Tyr Leu Lys Thr Asn Arg Glu Gly Leu Glu
305 310 315 320
Tyr Ile Pro Leu Arg His Tyr Tyr His Arg His Thr Arg Ser Ile Phe
325 330 335
Trp Glu Leu Gln Asp Ile Ile Pro Phe Gly Asn Asn Pro Ile Phe Arg
340 345 350
Tyr Leu Phe Gly Trp Met Val Pro Pro Lys Ile Ser Leu Leu Lys Leu
355 360 365
Thr Gln Gly Glu Thr Leu Arg Lys Leu Tyr Glu Gln His His Val Val
370 375 380
Gln Asp Met Leu Val Pro Met Lys Cys Leu Gln Gln Ala Leu His Thr
385 390 395 400
Phe Gln Asn Asp Ile His Val Tyr Pro Ile Trp Leu Cys Pro Phe Ile
405 410 415
Leu Pro Ser Gln Pro Gly Leu Val His Pro Lys Gly Asn Glu Ala Glu
420 425 430
Leu Tyr Ile Asp Ile Gly Ala Tyr Gly Glu Pro Arg Val Lys His Phe
435 440 445
Glu Ala Arg Ser Cys Met Arg Gln Leu Glu Lys Phe Val Arg Ser Val
450 455 460
His Gly Phe Gln Met Leu Tyr Ala Asp Cys Tyr Met Asn Arg Glu Glu
465 470 475 480
Phe Trp Glu Met Phe Asp Gly Ser Leu Tyr His Lys Leu Arg Glu Lys
485 490 495
Leu Gly Cys Gln Asp Ala Phe Pro Glu Val Tyr Asp Lys Ile Cys Lys
500 505 510
Ala Ala Arg His
515
<210> 26
<211> 1551
<212> DNA
<213> Human (Homo sapiens)
<400> 26
atggagcccg ccgtgtcgct ggccgtgtgc gcgctgctct tcctgctgtg ggtgcgcctg 60
aaggggctgg agttcgtgct catccaccag cgctgggtgt tcgtgtgcct cttcctcctg 120
ccgctctcgc ttatcttcga tatctactac tacgtgcgcg cctgggtggt gttcaagctc 180
agcagcgctc cgcgcctgca cgagcagcgc gtgcgggaca tccagaagca ggtgcgggaa 240
tggaaggagc agggtagcaa gaccttcatg tgcacggggc gccctggctg gctcactgtc 300
tcactacgtg tcgggaagta caagaagaca cacaaaaaca tcatgatcaa cctgatggac 360
attctggaag tggacaccaa gaaacagatt gtccgtgtgg agcccttggt gaccatgggc 420
caggtgactg ccctgctgac ctccattggc tggactctcc ccgtgttgcc tgagcttgat 480
gacctcacag tggggggctt gatcatgggc acaggcatcg agtcatcatc ccacaagtac 540
ggcctgttcc aacacatctg cactgcttac gagctggtcc tggctgatgg cagctttgtg 600
cgatgcactc cgtccgaaaa ctcagacctg ttctatgccg taccctggtc ctgtgggacg 660
ctgggtttcc tggtggccgc tgagatccgc atcatccctg ccaagaagta cgtcaagctg 720
cgtttcgagc cagtgcgggg cctggaggct atctgtgcca agttcaccca cgagtcccag 780
cggcaggaga accacttcgt ggaagggctg ctctactccc tggatgaggc tgtcattatg 840
acaggggtca tgacagatga ggcagagccc agcaagctga atagcattgg caattactac 900
aagccgtggt tctttaagca tgtggagaac tatctgaaga caaaccgaga gggcctggag 960
tacattccct tgagacacta ctaccaccgc cacacgcgca gcatcttctg ggagctccag 1020
gacattatcc cctttggcaa caaccccatc ttccgctacc tctttggctg gatggtgcct 1080
cccaagatct ccctcctgaa gctgacccag ggtgagaccc tgcgcaagct gtacgagcag 1140
caccacgtgg tgcaggacat gctggtgccc atgaagtgcc tgcagcaggc cctgcacacc 1200
ttccaaaacg acatccacgt ctaccccatc tggctgtgtc cgttcatcct gcccagccag 1260
ccaggcctag tgcaccccaa aggaaatgag gcagagctct acatcgacat tggagcatat 1320
ggggagccgc gtgtgaaaca ctttgaagcc aggtcctgca tgaggcagct ggagaagttt 1380
gtccgcagcg tgcatggctt ccagatgctg tatgccgact gctacatgaa ccgggaggag 1440
ttctgggaga tgtttgatgg ctccttgtac cacaagctgc gagagaagct gggttgccag 1500
gacgccttcc ccgaggtgta cgacaagatc tgcaaggccg ccaggcactg a 1551
<210> 27
<211> 1551
<212> DNA
<213> Human (Homo sapiens)
<400> 27
atggaaccag ctgtctcttt ggctgtttgt gctttgttgt tcttgttgtg ggttagattg 60
aaaggtttgg aatttgtttt gatccaccaa agatgggtct tcgtttgttt gttcttgttg 120
ccattgtcct tgatcttcga tatctactac tacgttagag cttgggttgt tttcaagttg 180
tcttctgctc caagattgca cgaacaaaga gttagagaca ttcaaaagca agttagagaa 240
tggaaggaac aaggttccaa gactttcatg tgtaccggta gaccaggttg gttgaccgtt 300
tctttgagag tcggtaagta caaaaagact cacaagaaca tcatgatcaa cttgatggac 360
attttggaag ttgatactaa gaagcaaatc gttagagtcg aaccattggt taccatgggt 420
caagttactg ctttgttgac ctctattggt tggaccttgc cagttttgcc agaattggac 480
gatttgactg ttggtggttt gattatgggt actggtatcg aatcttcctc tcataagtac 540
ggtttgttcc aacacatttg taccgcctac gaattggttt tggccgatgg ttcttttgtc 600
agatgtaccc catccgaaaa ctctgacttg ttttacgctg tcccatggtc ttgtggtact 660
ttgggtttct tggttgctgc tgaaatcaga atcattccag ctaagaagta cgtcaagttg 720
agatttgaac cagtcagagg tttggaagct atttgtgcta agttcactca cgaatctcaa 780
agacaagaaa accacttcgt tgaaggtttg ttgtactcct tggacgaagc tgttatcatg 840
accggtgtta tgactgatga agctgaacca tctaagttga actctatcgg taactactac 900
aagccatggt tcttcaagca tgtcgaaaac tacttgaaga ctaacagaga aggtttggaa 960
tacattccat tgagacatta ctaccataga cacactagat ctattttctg ggaattgcaa 1020
gatatcatcc catttggtaa caacccaatt ttcagatact tgttcggttg gatggttcca 1080
ccaaagatct ctttgttgaa gttgactcaa ggtgaaacct tgagaaagtt gtacgaacaa 1140
caccacgtcg ttcaagacat gttggtccca atgaagtgtt tgcaacaagc cttgcatacc 1200
tttcaaaacg atattcatgt ctacccaatc tggttgtgtc cattcatctt gccatctcaa 1260
ccaggtttgg ttcatccaaa aggtaacgaa gctgaattgt acattgatat cggtgcttac 1320
ggtgaaccaa gagttaagca ttttgaagct agatcttgta tgagacaatt ggaaaagttc 1380
gtcagatccg tccatggttt ccaaatgttg tacgccgact gttacatgaa cagagaagaa 1440
ttttgggaaa tgttcgacgg ttccttgtac cacaagttga gagaaaagtt gggttgtcaa 1500
gatgcttttc cagaagttta cgacaagatc tgtaaggccg ctagacacta a 1551
<210> 28
<211> 1551
<212> DNA
<213> Human (Homo sapiens)
<400> 28
atggaaccag ctgtttcttt ggctgtttgt gctttgttgt ttttgttgtg ggttagattg 60
aaaggtttgg aatttgtttt gattcatcaa agatgggttt ttgtttgttt gtttttgttg 120
ccattgtctt tgatttttga tatttattat tatgttagag cttgggttgt ttttaaattg 180
tcttctgctc caagattgca tgaacaaaga gttagagata ttcaaaaaca agttagagaa 240
tggaaagaac aaggttctaa aacttttatg tgtactggta gaccaggttg gttgactgtt 300
tctttgagag ttggtaaata taaaaaaaca cataaaaata ttatgattaa tttgatggat 360
attttggaag ttgatactaa aaaacaaatt gttagagttg aaccattggt tactatgggt 420
caagttactg ctttgttgac ttctattggt tggactttgc cagttttgcc agaattggat 480
gatttgacag ttggtggttt gattatgggt acaggtattg aatcttcttc tcataaatat 540
ggtttgtttc aacatatttg tactgcttat gaattggttt tggctgatgg ttcttttgtt 600
agatgtactc catctgaaaa ttctgatttg ttttatgctg ttccatggtc ttgtggtact 660
ttgggttttt tggttgctgc tgaaattaga attattccag ctaaaaaata tgttaaattg 720
agatttgaac cagttagagg tttggaagct atttgtgcta aatttactca tgaatctcaa 780
agacaagaaa atcattttgt tgaaggtttg ttgtattctt tggatgaagc tgttattatg 840
acaggtgtta tgacagatga agctgaacca tctaaattga attctattgg taattattat 900
aaaccatggt tttttaaaca tgttgaaaat tatttgaaaa caaatagaga aggtttggaa 960
tatattccat tgagacatta ttatcataga catactagat ctattttttg ggaattgcaa 1020
gatattattc catttggtaa taatccaatt tttagatatt tgtttggttg gatggttcca 1080
ccaaaaattt ctttgttgaa attgactcaa ggtgaaactt tgagaaaatt gtatgaacaa 1140
catcatgttg ttcaagatat gttggttcca atgaaatgtt tgcaacaagc tttgcatact 1200
tttcaaaatg atattcatgt ttatccaatt tggttgtgtc catttatttt gccatctcaa 1260
ccaggtttgg ttcatccaaa aggtaatgaa gctgaattgt atattgatat tggtgcttat 1320
ggtgaaccaa gagttaaaca ttttgaagct agatcttgta tgagacaatt ggaaaaattt 1380
gttagatctg ttcatggttt tcaaatgttg tatgctgatt gttatatgaa tagagaagaa 1440
ttttgggaaa tgtttgatgg ttctttgtat cataaattga gagaaaaatt gggttgtcaa 1500
gatgcttttc cagaagttta tgataaaatt tgtaaagctg ctagacatta a 1551
<210> 29
<211> 516
<212> PRT
<213> African elephant (Loxodonta africana)
<400> 29
Met Glu Pro Ala Val Ser Leu Ala Val Cys Ala Leu Leu Phe Leu Leu
1 5 10 15
Trp Ile Arg Val Lys Gly Leu Glu Phe Val Leu Ile His Gln Arg Trp
20 25 30
Val Phe Val Cys Leu Phe Leu Leu Pro Leu Ser Leu Ile Phe Asp Ile
35 40 45
Cys Tyr Tyr Val Arg Ala Trp Val Val Phe Lys Leu Ser Ser Ala Pro
50 55 60
Arg Leu His Glu Gln Arg Val Arg Asp Ile Gln Lys Gln Val Arg Glu
65 70 75 80
Trp Lys Glu Gln Gly Ser Lys Thr Phe Met Cys Thr Gly Arg Pro Gly
85 90 95
Trp Leu Thr Ile Ser Leu Arg Val Gly Lys Tyr Lys Lys Ile His Lys
100 105 110
Asn Ile Met Ile Asn Leu Met Asp Ile Leu Glu Val Asp Thr Lys Lys
115 120 125
Gln Ile Val Arg Val Glu Pro Leu Val Thr Met Gly Gln Val Thr Ala
130 135 140
Leu Leu Thr Ser Ile Gly Trp Thr Leu Pro Val Leu Pro Glu Leu Asp
145 150 155 160
Asp Leu Thr Val Gly Gly Leu Ile Met Gly Thr Gly Ile Glu Ser Ser
165 170 175
Ser His Lys Tyr Gly Leu Phe Gln His Ile Cys Thr Ala Tyr Glu Leu
180 185 190
Val Leu Ala Asp Gly Ser Phe Val Arg Cys Thr Pro Ser Glu Asn Ser
195 200 205
Asp Leu Phe Tyr Ala Val Pro Trp Ser Cys Gly Thr Leu Gly Phe Leu
210 215 220
Val Thr Ala Glu Ile Arg Ile Ile Pro Ala Lys Lys Tyr Val Lys Leu
225 230 235 240
Arg Phe Glu Pro Val Arg Gly Leu Glu Asn Ile Cys Asp Lys Phe Ser
245 250 255
Arg Glu Ser Gln Gln Leu Glu Asn His Phe Val Glu Gly Leu Leu Tyr
260 265 270
Ser Leu Asp Glu Ala Val Ile Met Thr Gly Val Met Thr Asp Glu Ala
275 280 285
Glu Pro Ser Lys Leu Asn Ser Ile Gly Asn Tyr Tyr Lys Pro Trp Phe
290 295 300
Phe Lys His Val Glu Asn Tyr Leu Lys Thr Asn Gln Glu Gly Leu Glu
305 310 315 320
Tyr Val Pro Leu Arg His Tyr Tyr His Arg His Thr Arg Ser Ile Phe
325 330 335
Trp Glu Leu Gln Asp Ile Ile Pro Phe Gly Asn Asn Pro Ile Phe Arg
340 345 350
Tyr Leu Phe Gly Trp Met Val Pro Pro Lys Ile Ser Leu Leu Lys Leu
355 360 365
Thr Gln Gly Glu Thr Leu Arg Lys Leu Tyr Glu Gln His His Val Val
370 375 380
Gln Asp Met Leu Val Pro Met Lys Cys Met Pro Gln Ala Leu His Thr
385 390 395 400
Phe His Asn Asp Ile His Val Tyr Pro Ile Trp Leu Cys Pro Phe Ile
405 410 415
Leu Pro Ser Gln Pro Gly Leu Val His Pro Lys Gly Asp Glu Ala Glu
420 425 430
Leu Tyr Val Asp Ile Gly Ala Tyr Gly Glu Pro Arg Ile Lys His Phe
435 440 445
Glu Ala Arg Ser Cys Met Arg Gln Leu Glu Lys Phe Val Arg Ser Val
450 455 460
His Gly Phe Gln Met Leu Tyr Ala Asp Cys Tyr Met Asn Arg Glu Glu
465 470 475 480
Phe Trp Glu Met Phe Asp Gly Ser Leu Tyr His Lys Leu Arg Glu Gln
485 490 495
Leu Asn Cys Gln Asp Ala Phe Pro Glu Val Tyr Asp Lys Ile Cys Lys
500 505 510
Ala Ala Arg His
515
<210> 30
<211> 1551
<212> DNA
<213> African elephant (Loxodonta africana)
<400> 30
atggagcccg ctgtgtccct ggccgtgtgc gcgctgctct tcctgctctg gatccgtgtg 60
aaggggctgg agttcgtgct catccatcag cgctgggtgt tcgtgtgcct cttcctcctg 120
ccactgtcgc taatcttcga catctgctac tacgtgcgcg cctgggtggt gttcaagctc 180
agcagcgcgc cgcggctgca cgagcagcgc gtgcgggaca tccagaagca ggtacgagaa 240
tggaaggagc agggcagcaa gacgttcatg tgcaccgggc gccctggctg gctcaccatc 300
tcactgcggg ttgggaagta caagaagata cacaaaaaca tcatgatcaa cctgatggac 360
attctggagg tggacaccaa gaaacagatt gtccgtgtgg agcccttggt gaccatgggt 420
caggtgactg ccctgctgac ttccattggc tggactctgc ctgtgttgcc cgagctcgat 480
gacctcaccg tagggggctt gatcatgggc actggcatcg agtcgtcatc ccacaagtat 540
ggtctgttcc agcacatctg tacagcctat gagctggtcc tagctgatgg cagctttgtg 600
cgatgtacgc cgtctgaaaa ctcagatctg ttctatgctg tgccctggtc ctgtgggaca 660
ctgggcttcc tggtgaccgc tgagatccgc atcatccctg ccaagaagta cgtcaagctg 720
cgctttgagc cagtacgggg cctggagaat atctgtgaca agttctcccg cgagtctcag 780
cagctggaga accacttcgt ggaagggctg ctatactccc tggatgaggc cgtcatcatg 840
acaggcgtca tgacagacga agcagagccc agcaagctga atagcattgg gaattactac 900
aagccatggt tcttcaagca cgtggagaac tacctgaaga caaaccaaga gggcctggag 960
tacgttcccc tgaggcacta ctatcaccgc cacacccgca gcatcttctg ggagctccag 1020
gacatcatcc ccttcggcaa caaccccatt ttccgctacc tctttggttg gatggtacct 1080
cctaagatct ccctcctgaa gctgacccag ggcgagaccc tgcgcaagct gtatgagcag 1140
caccacgtgg tgcaggacat gctggtgccc atgaagtgca tgccgcaggc cctgcacacc 1200
ttccacaacg acatccacgt ctaccccatc tggctgtgcc ccttcatcct gcccagccag 1260
ccaggcctgg tgcaccccaa aggagatgag gcagagctct acgtcgacat tggggcctat 1320
ggggagccac gcataaagca ctttgaagcc aggtcctgca tgaggcagtt ggagaagttc 1380
gttcgaagcg tgcatggatt ccagatgctg tatgccgatt gctacatgaa ccgggaggag 1440
ttctgggaga tgtttgacgg atccctgtac cacaagctgc gggagcagct caactgccag 1500
gacgccttcc cagaggtgta tgacaagatc tgcaaggccg ccaggcactg a 1551
<210> 31
<211> 1551
<212> DNA
<213> African elephant (Loxodonta africana)
<400> 31
atggaaccag ctgtctcttt ggccgtttgt gctttgttgt ttttgttgtg gattagagtc 60
aaaggtttgg aatttgtctt gattcaccaa agatgggtct tcgtttgttt gtttttgttg 120
ccattgtcct tgattttcga catctgttac tacgttagag cttgggtcgt ttttaagttg 180
tcttctgctc caagattgca tgaacaaaga gttagagaca ttcaaaagca agtcagagaa 240
tggaaggaac aaggttctaa gactttcatg tgtactggta gaccaggttg gttgactatc 300
tccttgagag ttggtaagta caagaaaatt cacaagaaca ttatgattaa cttgatggat 360
attttggaag ttgacactaa gaagcaaatt gtcagagtcg aaccattggt tactatgggt 420
caagtcactg ctttgttgac ctccattggt tggaccttgc cagttttgcc agaattggat 480
gacttgaccg ttggtggttt gattatgggt actggtattg aatcctcctc tcataaatac 540
ggtttgttcc aacacatctg tactgcctac gaattggttt tggctgatgg ttccttcgtt 600
agatgtaccc catccgaaaa ctctgatttg ttctacgccg tcccatggtc ttgtggtact 660
ttgggttttt tggttactgc tgaaattaga attattccag ccaagaagta cgttaagttg 720
agatttgaac cagtcagagg tttggaaaac atttgtgata agttttctag agaatctcaa 780
caattggaaa accacttcgt tgaaggtttg ttgtactctt tggatgaagc cgtcatcatg 840
accggtgtta tgaccgacga agccgaacca tccaagttga actccatcgg taactactac 900
aagccatggt tcttcaagca cgttgaaaac tacttgaaga ctaaccaaga aggtttggaa 960
tacgtcccat tgagacacta ctaccataga catactagat ctatcttctg ggaattgcaa 1020
gacattattc cattcggtaa caacccaatt ttcagatact tgtttggttg gatggttcca 1080
ccaaagatct ctttgttgaa gttgactcaa ggtgaaactt tgagaaagtt gtacgaacaa 1140
caccacgtcg ttcaagatat gttggtccca atgaagtgta tgccacaagc cttgcacacc 1200
ttccacaacg acattcacgt ctacccaatc tggttgtgtc cattcatttt gccatctcaa 1260
ccaggtttgg ttcatccaaa aggtgacgaa gctgaattgt acgttgatat cggtgcctac 1320
ggtgaaccaa gaatcaagca ctttgaagct agatcctgta tgagacaatt ggaaaagttt 1380
gttagatctg ttcacggttt tcaaatgttg tacgccgatt gttacatgaa cagagaagaa 1440
ttctgggaaa tgtttgacgg ttctttgtac cataagttga gagaacaatt gaactgtcaa 1500
gatgcctttc cagaagttta cgataagatc tgtaaggctg ctagacacta a 1551
<210> 32
<211> 1551
<212> DNA
<213> African elephant (Loxodonta africana)
<400> 32
atggaaccag ctgtttcttt ggctgtttgt gctttgttgt ttttgttgtg gattagagtt 60
aaaggtttgg aatttgtttt gattcatcaa agatgggttt ttgtttgttt gtttttgttg 120
ccattgtctt tgatttttga tatttgttat tatgttagag cttgggttgt ttttaaattg 180
tcttctgctc caagattgca tgaacaaaga gttagagata ttcaaaaaca agttagagaa 240
tggaaagaac aaggttctaa aacttttatg tgtactggta gaccaggttg gttgactatt 300
tctttgagag ttggtaaata taaaaaaatt cataaaaata ttatgattaa tttgatggat 360
attttggaag ttgatactaa aaaacaaatt gttagagttg aaccattggt tactatgggt 420
caagttactg ctttgttgac ttctattggt tggactttgc cagttttgcc agaattggat 480
gatttgactg ttggtggttt gattatgggt actggtattg aatcttcttc tcataaatat 540
ggtttgtttc aacatatttg tacagcttat gaattggttt tggctgatgg ttcttttgtt 600
agatgtactc catctgaaaa ttctgatttg ttttatgctg ttccatggtc ttgtggtaca 660
ttgggttttt tggttactgc tgaaattaga attattccag ctaaaaaata tgttaaattg 720
agatttgaac cagttagagg tttggaaaat atttgtgata aattttctag agaatctcaa 780
caattggaaa atcattttgt tgaaggtttg ttgtattctt tggatgaagc tgttattatg 840
acaggtgtta tgacagatga agctgaacca tctaaattga attctattgg taattattat 900
aaaccatggt tttttaaaca tgttgaaaat tatttgaaaa caaatcaaga aggtttggaa 960
tatgttccat tgagacatta ttatcataga catactagat ctattttttg ggaattgcaa 1020
gatattattc catttggtaa taatccaatt tttagatatt tgtttggttg gatggttcca 1080
ccaaaaattt ctttgttgaa attgactcaa ggtgaaactt tgagaaaatt gtatgaacaa 1140
catcatgttg ttcaagatat gttggttcca atgaaatgta tgccacaagc tttgcatact 1200
tttcataatg atattcatgt ttatccaatt tggttgtgtc catttatttt gccatctcaa 1260
ccaggtttgg ttcatccaaa aggtgatgaa gctgaattgt atgttgatat tggtgcttat 1320
ggtgaaccaa gaattaaaca ttttgaagct agatcttgta tgagacaatt ggaaaaattt 1380
gttagatctg ttcatggttt tcaaatgttg tatgctgatt gttatatgaa tagagaagaa 1440
ttttgggaaa tgtttgatgg ttctttgtat cataaattga gagaacaatt gaattgtcaa 1500
gatgcttttc cagaagttta tgataaaatt tgtaaagctg ctagacatta a 1551
<210> 33
<211> 518
<212> PRT
<213> House mouse (Mus musculus)
<400> 33
Met Glu Pro Ala Val Ser Leu Ala Val Cys Ala Leu Leu Phe Leu Leu
1 5 10 15
Trp Val Arg Val Lys Gly Leu Glu Phe Val Leu Ile His Gln Arg Trp
20 25 30
Val Phe Val Cys Leu Phe Leu Leu Pro Leu Ser Leu Ile Phe Asp Ile
35 40 45
Tyr Tyr Tyr Val Arg Ala Trp Val Val Phe Lys Leu Ser Ser Ala Pro
50 55 60
Arg Leu His Glu Gln Arg Val Arg Asp Ile Gln Lys Gln Val Arg Glu
65 70 75 80
Trp Lys Glu Gln Gly Ser Lys Thr Phe Met Cys Thr Gly Arg Pro Gly
85 90 95
Trp Leu Thr Val Ser Leu Arg Val Gly Lys Tyr Lys Lys Thr His Lys
100 105 110
Asn Ile Met Ile Asn Leu Met Asp Ile Leu Glu Val Asp Thr Lys Lys
115 120 125
Gln Ile Val Arg Val Glu Pro Leu Val Ser Met Gly Gln Val Thr Ala
130 135 140
Leu Leu Asn Ser Ile Gly Trp Thr Leu Pro Val Leu Pro Glu Leu Asp
145 150 155 160
Asp Leu Thr Val Gly Gly Leu Ile Met Gly Thr Gly Ile Glu Ser Ser
165 170 175
Ser His Lys Tyr Gly Leu Phe Gln His Ile Cys Thr Ala Tyr Glu Leu
180 185 190
Ile Leu Ala Asp Gly Ser Phe Val Arg Cys Thr Pro Ser Glu Asn Ser
195 200 205
Asp Leu Phe Tyr Ala Val Pro Trp Ser Cys Gly Thr Leu Gly Phe Leu
210 215 220
Val Ala Ala Glu Ile Arg Ile Ile Pro Ala Lys Lys Tyr Val Lys Leu
225 230 235 240
Arg Phe Glu Pro Val Arg Gly Leu Glu Ala Ile Cys Glu Lys Phe Thr
245 250 255
Arg Glu Ser Gln Arg Leu Glu Asn His Phe Val Glu Gly Leu Leu Tyr
260 265 270
Ser Leu Asp Glu Ala Val Ala Val Ile Met Thr Gly Val Met Thr Asp
275 280 285
Asp Val Glu Ser Ser Lys Leu Asn Ser Ile Gly Ser Tyr Tyr Lys Pro
290 295 300
Trp Phe Phe Lys His Val Glu Asn Tyr Leu Lys Thr Asn Arg Glu Gly
305 310 315 320
Leu Glu Tyr Ile Pro Leu Arg His Tyr Tyr His Arg His Thr Arg Ser
325 330 335
Ile Phe Trp Glu Leu Gln Asp Ile Ile Pro Phe Gly Asn Asn Pro Ile
340 345 350
Phe Arg Tyr Leu Phe Gly Trp Met Val Pro Pro Lys Ile Ser Leu Leu
355 360 365
Lys Leu Thr Gln Gly Glu Thr Leu Arg Lys Leu Tyr Glu Gln His His
370 375 380
Val Val Gln Asp Met Leu Val Pro Met Lys Cys Met Ser Gln Ala Leu
385 390 395 400
His Thr Phe Gln Asn Asp Ile His Val Tyr Pro Ile Trp Leu Cys Pro
405 410 415
Phe Ile Leu Pro Ser Gln Pro Gly Leu Val His Pro Lys Gly Asp Glu
420 425 430
Ala Glu Leu Tyr Val Asp Ile Gly Ala Tyr Gly Glu Pro Arg Val Lys
435 440 445
His Phe Glu Ala Arg Ser Cys Met Arg Gln Leu Glu Lys Phe Val Arg
450 455 460
Ser Val His Gly Phe Gln Met Leu Tyr Ala Asp Cys Tyr Met Asn Arg
465 470 475 480
Glu Glu Phe Trp Glu Met Phe Asp Gly Ser Leu Tyr His Lys Leu Arg
485 490 495
Lys Gln Leu Gly Cys Gln Asp Ala Phe Pro Glu Val Tyr Asp Lys Ile
500 505 510
Cys Lys Ala Ala Arg His
515
<210> 34
<211> 1557
<212> DNA
<213> House mouse (Mus musculus)
<400> 34
atggagcccg ccgtgtcgct ggccgtgtgc gcgctgctct ttctgctctg ggtgcgagtg 60
aaggggttgg agttcgttct catccaccag cgctgggtgt tcgtgtgcct cttcttgctg 120
ccgctctcgc tcatcttcga tatctactac tacgtgcgcg cctgggtggt gttcaagctg 180
agcagtgcgc cgcgcctgca cgagcagcgc gtgcgggaca tccagaaaca ggtccgggaa 240
tggaaggaac agggcagtaa gaccttcatg tgcacggggc gcccaggctg gctcactgtc 300
tcgctgcgag tcggaaagta caagaagacc cataagaaca tcatgatcaa cctgatggac 360
atcctggagg tggacaccaa gaaacagatt gttcgagtgg agcccttggt gtctatgggt 420
caggtgacag ctttgctgaa ctccattggc tggaccctgc ctgtgttgcc tgagcttgat 480
gacctcacag tggggggcct gatcatgggc acaggcatcg agtcatcgtc ccacaagtat 540
ggcctgttcc aacacatttg cactgcctac gagctgatcc tggcagacgg cagctttgtg 600
cgctgcacac cgtctgaaaa ctcagacctg ttctatgccg tgccctggtc ctgtgggacc 660
ctgggcttcc tggtggctgc cgagatccgg atcatcccgg ccaagaagta tgtcaagctg 720
cggtttgagc ctgttcgggg cctggaggcc atctgtgaaa aattcacccg cgagtcccag 780
cggctggaga accacttcgt ggaagggttg ctgtactccc tggatgaggc tgtggctgtc 840
atcatgacag gggtcatgac ggacgacgta gagtccagca agctgaatag cattggcagt 900
tactacaagc cctggttctt caagcatgtg gagaactacc tgaagacaaa ccgggagggc 960
ctcgaataca ttcccctgag acactactac caccgacaca cgcgcagcat cttctgggag 1020
ctccaggaca tcatcccttt cggcaacaac cccatcttcc gctacctctt cggctggatg 1080
gtgcctccca agatctccct cctgaagctg acccagggcg agacgctacg caagctgtac 1140
gagcagcacc acgtggtgca ggacatgctg gtgcccatga agtgcatgtc acaggccctg 1200
cataccttcc aaaatgacat ccacgtctac cccatctggc tgtgcccatt catcctgccc 1260
agccagccag gactagtgca tcccaaggga gatgaagcag agctctacgt ggacatcggg 1320
gcatacgggg agccacgtgt gaagcacttc gaggccaggt cctgcatgag gcagctggag 1380
aagtttgtgc ggagtgtgca cgggttccaa atgttatacg ccgattgcta tatgaaccgc 1440
gaggaattct gggagatgtt cgatggctcc ttgtaccaca agctgcgcaa gcagctgggc 1500
tgccaggacg ccttccctga ggtgtacgac aagatctgca aggcggcaag gcactga 1557
<210> 35
<211> 1557
<212> DNA
<213> House mouse (Mus musculus)
<400> 35
atggaaccag ctgtttcctt ggccgtttgt gctttgttgt tcttgttgtg ggtcagagtt 60
aaaggtttgg aatttgtttt gattcatcaa agatgggtct tcgtttgttt gttcttgttg 120
ccattgtcct tgattttcga catctactac tacgttagag cttgggttgt tttcaagttg 180
tcttctgccc caagattgca cgaacaaaga gttagagata tccaaaagca agttagagaa 240
tggaaggaac aaggttctaa gacttttatg tgtactggta gaccaggttg gttgactgtt 300
tctttgagag tcggtaagta caagaagacc cataagaaca tcatgattaa cttgatggat 360
attttggaag ttgatactaa gaagcaaatc gttagagttg aaccattggt ttccatgggt 420
caagttaccg ccttgttgaa ctctattggt tggactttgc cagttttgcc agaattggac 480
gacttgaccg ttggtggttt gattatgggt actggtatcg aatcctcttc tcacaaatac 540
ggtttgttcc aacacatttg taccgcctac gaattgattt tggctgatgg ttctttcgtc 600
agatgtaccc catctgaaaa ctccgacttg ttttacgccg tcccatggtc ttgtggtacc 660
ttgggtttct tggttgccgc cgaaatcaga atcattccag ccaagaagta cgtcaaattg 720
agatttgaac cagttagagg tttggaagct atttgtgaaa aattcaccag agaatctcaa 780
agattggaaa accactttgt cgaaggtttg ttgtactctt tggatgaagc tgttgccgtt 840
attatgactg gtgtcatgac cgacgacgtt gaatcttcta agttgaactc tatcggttcc 900
tactacaagc catggttctt taagcacgtc gaaaactact tgaaaaccaa cagagaaggt 960
ttggaataca tcccattgag acattactac catagacaca ctagatctat cttctgggaa 1020
ttgcaagaca tcatcccatt cggtaacaac ccaatcttca gatacttgtt cggttggatg 1080
gtcccaccaa agatctcctt gttgaagttg actcaaggtg aaactttgag aaagttgtac 1140
gaacaacatc acgttgtcca agatatgttg gttccaatga agtgtatgtc tcaagccttg 1200
cacaccttcc aaaacgacat ccacgtttac ccaatttggt tgtgtccatt catcttgcca 1260
tctcaaccag gtttggttca tccaaaaggt gacgaagccg aattgtacgt tgacattggt 1320
gcttacggtg aaccaagagt taaacacttt gaagctagat cctgtatgag acaattggaa 1380
aagttcgtta gatctgtcca cggttttcaa atgttgtacg ccgattgtta catgaacaga 1440
gaagaattct gggaaatgtt cgacggttcc ttgtaccaca agttgagaaa acaattgggt 1500
tgtcaagatg cttttccaga agtttacgac aaaatctgta aagccgctag acactaa 1557
<210> 36
<211> 1557
<212> DNA
<213> House mouse (Mus musculus)
<400> 36
atggaaccag ctgtttcttt ggctgtttgt gctttgttgt ttttgttgtg ggttagagtt 60
aaaggtttgg aatttgtttt gattcatcaa agatgggttt ttgtttgttt gtttttgttg 120
ccattgtctt tgatttttga tatttattat tatgttagag cttgggttgt ttttaaattg 180
tcttctgctc caagattgca tgaacaaaga gttagagata ttcaaaaaca agttagagaa 240
tggaaagaac aaggttctaa aacttttatg tgtactggta gaccaggttg gttgactgtt 300
tctttgagag ttggtaaata taaaaaaact cataaaaata ttatgattaa tttgatggat 360
attttggaag ttgatactaa aaaacaaatt gttagagttg aaccattggt ttctatgggt 420
caagttacag ctttgttgaa ttctattggt tggactttgc cagttttgcc agaattggat 480
gatttgacag ttggtggttt gattatgggt acaggtattg aatcttcttc tcataaatat 540
ggtttgtttc aacatatttg tactgcttat gaattgattt tggctgatgg ttcttttgtt 600
agatgtacac catctgaaaa ttctgatttg ttttatgctg ttccatggtc ttgtggtact 660
ttgggttttt tggttgctgc tgaaattaga attattccag ctaaaaaata tgttaaattg 720
agatttgaac cagttagagg tttggaagct atttgtgaaa aatttactag agaatctcaa 780
agattggaaa atcattttgt tgaaggtttg ttgtattctt tggatgaagc tgttgctgtt 840
attatgacag gtgttatgac tgatgatgtt gaatcttcta aattgaattc tattggttct 900
tattataaac catggttttt taaacatgtt gaaaattatt tgaaaacaaa tagagaaggt 960
ttggaatata ttccattgag acattattat catagacata ctagatctat tttttgggaa 1020
ttgcaagata ttattccatt tggtaataat ccaattttta gatatttgtt tggttggatg 1080
gttccaccaa aaatttcttt gttgaaattg actcaaggtg aaactttgag aaaattgtat 1140
gaacaacatc atgttgttca agatatgttg gttccaatga aatgtatgtc tcaagctttg 1200
catacttttc aaaatgatat tcatgtttat ccaatttggt tgtgtccatt tattttgcca 1260
tctcaaccag gtttggttca tccaaaaggt gatgaagctg aattgtatgt tgatattggt 1320
gcttatggtg aaccaagagt taaacatttt gaagctagat cttgtatgag acaattggaa 1380
aaatttgtta gatctgttca tggttttcaa atgttatatg ctgattgtta tatgaataga 1440
gaagaatttt gggaaatgtt tgatggttct ttgtatcata aattgagaaa acaattgggt 1500
tgtcaagatg cttttccaga agtttatgat aaaatttgta aagctgctag acattaa 1557
<210> 37
<211> 516
<212> PRT
<213> Garnett's greater galago (Otolemur garnettii)
<400> 37
Met Glu Pro Ala Val Ser Leu Ala Val Cys Ala Leu Leu Phe Leu Leu
1 5 10 15
Trp Val Arg Leu Lys Gly Leu Glu Phe Val Leu Ile His Gln Arg Trp
20 25 30
Val Phe Val Cys Leu Phe Leu Leu Pro Leu Ser Leu Ile Phe Asp Ile
35 40 45
Tyr Tyr Tyr Val Arg Ala Trp Val Val Phe Arg Leu Ser Ser Ala Pro
50 55 60
Arg Leu His Glu Gln Arg Val Arg Asp Ile Gln Lys Gln Val Arg Glu
65 70 75 80
Trp Lys Glu Gln Gly Ser Lys Thr Phe Met Cys Thr Gly Arg Pro Gly
85 90 95
Trp Leu Thr Val Ser Leu Arg Val Gly Lys Tyr Lys Lys Thr His Lys
100 105 110
Asn Ile Met Ile Asn Leu Met Asp Ile Leu Glu Val Asp Thr Lys Lys
115 120 125
Gln Ile Val Arg Val Glu Pro Leu Val Thr Met Gly Gln Val Thr Ala
130 135 140
Leu Leu Thr Ser Ile Gly Trp Thr Leu Pro Val Leu Pro Glu Leu Asp
145 150 155 160
Asp Leu Thr Val Gly Gly Leu Ile Met Gly Thr Gly Ile Glu Ser Ser
165 170 175
Ser His Lys Tyr Gly Leu Phe Gln His Ile Cys Thr Ala Tyr Glu Leu
180 185 190
Val Leu Ala Asp Gly Ser Phe Val Arg Cys Thr Pro Thr Glu Asn Ser
195 200 205
Asp Leu Phe Tyr Ala Val Pro Trp Ser Cys Gly Thr Leu Gly Phe Leu
210 215 220
Val Ala Ala Glu Ile Arg Ile Ile Pro Ala Lys Lys Tyr Val Lys Leu
225 230 235 240
Arg Phe Glu Pro Val Arg Gly Leu Glu Ala Ile Cys Asp Lys Phe Thr
245 250 255
His Glu Ser Gln Arg Leu Glu Asn His Phe Val Glu Gly Leu Leu Tyr
260 265 270
Ser Leu Asp Glu Ala Val Ile Met Thr Gly Val Met Thr Asp Glu Ala
275 280 285
Glu Pro Ser Lys Leu Asn Ser Ile Gly Asn Tyr Tyr Lys Pro Trp Phe
290 295 300
Phe Lys His Val Glu Asn Tyr Leu Lys Thr Asn Gln Glu Gly Leu Glu
305 310 315 320
Tyr Ile Pro Leu Arg His Tyr Tyr His Arg His Thr Arg Ser Ile Phe
325 330 335
Trp Glu Leu Gln Asp Ile Ile Pro Phe Gly Asn Asn Pro Ile Phe Arg
340 345 350
Tyr Leu Phe Gly Trp Met Val Pro Pro Lys Ile Ser Leu Leu Lys Leu
355 360 365
Thr Gln Gly Glu Thr Leu Arg Lys Leu Tyr Glu Gln His His Val Val
370 375 380
Gln Asp Met Leu Val Pro Met Lys Cys Leu Pro Arg Ala Leu Asn Thr
385 390 395 400
Phe His Asn Asp Ile His Val Tyr Pro Ile Trp Leu Cys Pro Phe Ile
405 410 415
Leu Pro Ser Gln Pro Gly Leu Val His Pro Lys Gly Asp Glu Thr Glu
420 425 430
Leu Tyr Val Asp Ile Gly Ala Tyr Gly Glu Pro Arg Val Lys His Phe
435 440 445
Glu Ala Arg Ser Cys Met Arg Gln Leu Glu Lys Phe Val Arg Ser Val
450 455 460
His Gly Phe Gln Met Leu Tyr Ala Asp Cys Tyr Met Asn Arg Glu Glu
465 470 475 480
Phe Trp Glu Met Phe Asp Gly Ser Leu Tyr His Glu Leu Arg Glu Lys
485 490 495
Leu Gly Cys Gln Asp Ala Phe Pro Glu Val Tyr Asp Lys Ile Cys Lys
500 505 510
Ala Ala Arg His
515
<210> 38
<211> 1551
<212> DNA
<213> Garnett's greater galago (Otolemur garnettii)
<400> 38
atggagcccg ccgtgtcgct ggcggtgtgc gcgctgctct tcctgctctg ggtgcgcttg 60
aaggggctgg agttcgtgct catccaccag cgctgggtgt tcgtatgcct cttcctcctg 120
ccgctctcgc tcatcttcga catctactac tacgtgcgcg cctgggtggt gtttaggctc 180
agcagcgcgc cgcgcctgca cgagcagaga gtgcgggaca tccagaagca ggtgcgggaa 240
tggaaggagc agggcagcaa gactttcatg tgcacgggac gccccggctg gctcacggtc 300
tcgctgaggg ttgggaagta caagaagaca cacaaaaaca ttatgatcaa cctgatggac 360
attctggagg tggataccaa gaaacagata gtccgtgtgg agcccttggt gaccatgggt 420
caggtgactg ccctgctgac ctccattggt tggaccctgc ccgtgttgcc cgagcttgat 480
gacctcacag tagggggctt gatcatgggc acaggcatag agtcgtcatc tcacaagtac 540
ggcctgttcc aacacatctg cactgcctac gagctggtcc tggccgacgg cagctttgtg 600
cggtgcacac cgactgaaaa ctcagacctg ttctatgctg tgccttggtc ctgtgggact 660
ctgggcttcc tggtggctgc cgagatccgc atcatccctg ccaagaagta cgtcaagcta 720
cgatttgagc cagtgcgggg cctggaggcc atctgtgaca agttcaccca cgagtcccag 780
cggctggaga accactttgt ggaagggctg ctctactccc tggacgaagc cgtcatcatg 840
acgggcgtca tgacagatga ggcagagcct agcaagctga atagcattgg caattactac 900
aagccgtggt tcttcaagca cgtggagaat tacctgaaga ccaaccagga gggcctggag 960
tacatcccct tgagacacta ctaccaccgc cacacgcgca gcatcttctg ggagctccag 1020
gatatcatcc cctttggcaa caaccccatc ttccgctacc tctttgggtg gatggtaccg 1080
cccaagatct ccctcctgaa gctgacccag ggcgagaccc tgcgcaagct gtacgagcag 1140
caccatgtgg tgcaggacat gctggtgccc atgaagtgcc tgccacgggc cctgaacacc 1200
ttccacaatg acatccacgt ctacccgatc tggctgtgtc cgttcatcct gcccagccag 1260
ccgggcctgg tgcaccccaa gggagacgag acagagctct atgttgacat tggcgcatat 1320
ggggagccac gcgtgaagca ctttgaagcc aggtcttgca tgaggcagtt ggagaagttt 1380
gtccgaagtg tgcatggctt ccagatgctg tacgctgact gctacatgaa ccgggaggag 1440
ttctgggaga tgtttgacgg ctccttgtac cacgagctgc gggagaagct cggttgccag 1500
gatgccttcc ctgaggtgta cgacaagatc tgcaaggccg ccaggcactg a 1551
<210> 39
<211> 1551
<212> DNA
<213> Garnett's greater galago (Otolemur garnettii)
<400> 39
atggaaccag ctgtttcttt ggctgtttgt gctttgttgt ttttgttgtg ggtcagattg 60
aaaggtttgg aatttgtttt gattcaccaa agatgggtct ttgtctgttt gttcttgttg 120
ccattgtctt tgattttcga catctactac tacgtcagag cctgggttgt tttcagattg 180
tcctccgctc caagattgca cgaacaaaga gttagagata tccaaaaaca agttagagaa 240
tggaaggaac aaggttctaa gaccttcatg tgtactggta gaccaggttg gttgaccgtc 300
tccttgagag ttggtaagta caagaagact cacaagaaca tcatgatcaa cttgatggac 360
attttggaag ttgacactaa gaagcaaatc gttagagttg aaccattggt tactatgggt 420
caagttaccg ctttgttgac ttccattggt tggaccttgc cagtcttgcc agaattggac 480
gatttgaccg ttggtggttt gatcatgggt actggtatcg aatcttcttc tcacaagtac 540
ggtttgttcc aacacatctg taccgcttac gaattggtct tggccgatgg ttctttcgtt 600
agatgtactc caactgaaaa ctctgatttg ttttacgctg ttccatggtc ttgtggtacc 660
ttgggtttct tggtcgctgc cgaaatcaga attattccag ccaaaaagta cgttaagttg 720
agatttgaac cagttagagg tttggaagct atctgtgata agttcaccca tgaatctcaa 780
agattggaaa accacttcgt cgaaggtttg ttgtactcct tggatgaagc tgttattatg 840
actggtgtta tgaccgacga agccgaacca tctaaattga actctattgg taactactac 900
aagccatggt tcttcaagca tgtcgaaaac tacttgaaaa ctaaccaaga aggtttggaa 960
tacattccat tgagacacta ctaccacaga catactagat ccatcttctg ggaattgcaa 1020
gatatcatcc cattcggtaa caacccaatc tttagatact tgtttggttg gatggtccca 1080
ccaaagattt ctttgttgaa gttgacccaa ggtgaaacct tgagaaagtt gtacgaacaa 1140
caccacgtcg tccaagacat gttggttcca atgaagtgtt tgccaagagc tttgaacacc 1200
ttccacaacg acattcatgt ctacccaatc tggttgtgtc cattcatctt gccatctcaa 1260
ccaggtttgg ttcatccaaa aggtgatgaa actgaattgt acgttgatat cggtgcttac 1320
ggtgaaccaa gagttaaaca ctttgaagcc agatcttgta tgagacaatt ggaaaagttt 1380
gttagatctg ttcacggttt tcaaatgttg tacgccgatt gttacatgaa cagagaagaa 1440
ttttgggaaa tgttcgacgg ttctttgtac cacgaattga gagaaaagtt gggttgtcaa 1500
gacgccttcc cagaagttta cgataagatc tgtaaggctg ctagacacta g 1551
<210> 40
<211> 1551
<212> DNA
<213> Garnett's greater galago (Otolemur garnettii)
<400> 40
atggaaccag ctgtttcttt ggctgtttgt gctttgttgt ttttgttgtg ggttagattg 60
aaaggtttgg aatttgtttt gattcatcaa agatgggttt ttgtttgttt gtttttgttg 120
ccattgtctt tgatttttga tatttattat tatgttagag cttgggttgt ttttagattg 180
tcttctgctc caagattgca tgaacaaaga gttagagata ttcaaaaaca agttagagaa 240
tggaaagaac aaggttctaa aacttttatg tgtactggta gaccaggttg gttgactgtt 300
tctttgagag ttggtaaata taaaaaaaca cataaaaata ttatgattaa tttgatggat 360
attttggaag ttgatactaa aaaacaaatt gttagagttg aaccattggt tactatgggt 420
caagttactg ctttgttgac ttctattggt tggactttgc cagttttgcc agaattggat 480
gatttgacag ttggtggttt gattatgggt acaggtattg aatcttcttc tcataaatat 540
ggtttgtttc aacatatttg tactgcttat gaattggttt tggctgatgg ttcttttgtt 600
agatgtacac caactgaaaa ttctgatttg ttttatgctg ttccatggtc ttgtggtact 660
ttgggttttt tggttgctgc tgaaattaga attattccag ctaaaaaata tgttaaattg 720
agatttgaac cagttagagg tttggaagct atttgtgata aatttactca tgaatctcaa 780
agattggaaa atcattttgt tgaaggtttg ttgtattctt tggatgaagc tgttattatg 840
actggtgtta tgacagatga agctgaacca tctaaattga attctattgg taattattat 900
aaaccatggt tttttaaaca tgttgaaaat tatttgaaaa ctaatcaaga aggtttggaa 960
tatattccat tgagacatta ttatcataga catactagat ctattttttg ggaattgcaa 1020
gatattattc catttggtaa taatccaatt tttagatatt tgtttggttg gatggttcca 1080
ccaaaaattt ctttgttgaa attgactcaa ggtgaaactt tgagaaaatt gtatgaacaa 1140
catcatgttg ttcaagatat gttggttcca atgaaatgtt tgccaagagc tttgaatact 1200
tttcataatg atattcatgt ttatccaatt tggttgtgtc catttatttt gccatctcaa 1260
ccaggtttgg ttcatccaaa aggtgatgaa acagaattgt atgttgatat tggtgcttat 1320
ggtgaaccaa gagttaaaca ttttgaagct agatcttgta tgagacaatt ggaaaaattt 1380
gttagatctg ttcatggttt tcaaatgttg tatgctgatt gttatatgaa tagagaagaa 1440
ttttgggaaa tgtttgatgg ttctttgtat catgaattga gagaaaaatt gggttgtcaa 1500
gatgcttttc cagaagttta tgataaaatt tgtaaagctg ctagacatta a 1551
<210> 41
<211> 568
<212> PRT
<213> Tomato (Solanum lycopersicum)
<400> 41
Met Thr Asp Val Gln Ala Pro Pro Pro Arg Pro Lys Arg Lys Lys Asn
1 5 10 15
Ile Met Asp Leu Leu Val Gln Phe Arg Trp Ile Val Val Ile Phe Val
20 25 30
Val Leu Pro Leu Ser Phe Leu Tyr Tyr Phe Ser Ile Tyr Leu Gly Asp
35 40 45
Val Arg Ser Glu Cys Lys Ser Tyr Lys Gln Arg Gln Lys Glu His Asp
50 55 60
Glu Asn Val Lys Lys Val Val Lys Arg Leu Lys Glu Arg Asn Ala Ser
65 70 75 80
Lys Asp Gly Leu Val Cys Thr Ala Arg Lys Pro Trp Val Ala Val Gly
85 90 95
Met Arg Asn Val Asp Tyr Lys Arg Ala Arg His Phe Glu Val Asp Leu
100 105 110
Ser Pro Phe Arg Asn Val Leu Asn Ile Asp Thr Glu Arg Met Ile Ala
115 120 125
Lys Val Glu Pro Leu Val Asn Met Gly Gln Ile Ser Arg Val Thr Val
130 135 140
Pro Leu Asn Val Ser Leu Ala Val Val Ala Glu Leu Asp Asp Leu Thr
145 150 155 160
Val Gly Gly Leu Ile Asn Gly Tyr Gly Ile Glu Gly Ser Ser His Ile
165 170 175
Tyr Gly Leu Phe Ser Asp Thr Val Val Ser Tyr Glu Val Val Leu Ala
180 185 190
Asp Gly Gln Val Val Arg Ala Thr Lys Asp Asn Glu Tyr Ser Asp Leu
195 200 205
Phe Tyr Ala Ile Pro Trp Ser Gln Gly Thr Leu Gly Leu Leu Val Ser
210 215 220
Ala Glu Ile Lys Leu Ile Pro Ile Lys Glu Tyr Met Lys Leu Thr Tyr
225 230 235 240
Lys Pro Val Val Gly Asn Leu Lys Glu Ile Ala Gln Ala Tyr Met Asp
245 250 255
Ser Phe Ser Pro Arg Asp Gly Asp Gln Asp Asn His Glu Lys Val Pro
260 265 270
Asp Phe Val Glu Thr Met Val Tyr Thr Pro Thr Glu Ala Val Cys Met
275 280 285
Thr Gly Arg Tyr Ala Ser Lys Glu Glu Ala Lys Lys Lys Gly Asn Val
290 295 300
Ile Asn Asn Val Gly Trp Trp Phe Lys Thr Trp Phe Tyr Gln His Ala
305 310 315 320
Gln Thr Ala Leu Lys Lys Gly Glu Phe Val Glu Tyr Ile Pro Thr Arg
325 330 335
Glu Tyr Tyr His Arg His Thr Arg Cys Leu Tyr Trp Glu Gly Lys Leu
340 345 350
Ile Leu Pro Phe Gly Asp Gln Trp Trp Phe Arg Phe Leu Phe Gly Trp
355 360 365
Ala Met Pro Pro Lys Val Ser Leu Leu Lys Ala Thr Gln Gly Glu Tyr
370 375 380
Ile Arg Asn Tyr Tyr His Glu Asn His Val Ile Gln Asp Met Leu Val
385 390 395 400
Pro Leu Tyr Lys Val Gly Asp Ala Leu Glu Trp Val His Arg Glu Met
405 410 415
Glu Val Tyr Pro Leu Trp Leu Cys Pro His Arg Leu Tyr Arg Leu Pro
420 425 430
Leu Lys Thr Met Val Tyr Pro Glu Pro Gly Phe Glu Leu Gln Lys Arg
435 440 445
Gln Gly Asp Thr Lys Tyr Ala Gln Met Tyr Thr Asp Val Gly Val Tyr
450 455 460
Tyr Ala Pro Gly Pro Ile Leu Arg Gly Glu Val Phe Asp Gly Ile Glu
465 470 475 480
Ala Val Arg Lys Leu Glu Ser Trp Leu Ile Glu Asn His Gly Phe Gln
485 490 495
Pro Gln Tyr Ala Val Ser Glu Leu Thr Glu Lys Asn Phe Trp Arg Met
500 505 510
Phe Asp Gly Ser Leu Tyr Glu Asn Cys Arg Lys Lys Tyr Arg Ala Ile
515 520 525
Gly Thr Phe Met Ser Val Tyr Tyr Lys Ser Lys Lys Gly Lys Lys Thr
530 535 540
Glu Lys Glu Val Gln Glu Ala Glu Gln Glu Thr Ala Glu Val Glu Thr
545 550 555 560
Pro Glu Val Asp Glu Pro Glu Asp
565
<210> 42
<211> 1707
<212> DNA
<213> Tomato (Solanum lycopersicum)
<400> 42
atgacagatg ttcaggctcc cccccctcgt cctaagagga agaaaaacat tatggacctt 60
cttgtccagt tcagatggat tgttgttatc ttcgtcgtcc ttcctctctc gttcttgtat 120
tatttctcca tatatcttgg ggatgttagg tctgagtgca aatcatacaa gcagcgccag 180
aaggagcatg atgaaaatgt taaaaaggtt gtgaagcgtc ttaaggagag gaatgcatct 240
aaggatggtc ttgtctgcac agctaggaag ccctgggttg ctgttggaat gagaaatgtg 300
gactacaagc gtgctcgtca ttttgaagtt gatctttctc catttagaaa tgttcttaac 360
attgacacgg agcgaatgat tgctaaagtc gagcctctag tcaatatggg acaaatctct 420
agagttactg tccctctgaa tgtttccctt gcagttgttg ctgagcttga tgatctaact 480
gttggtggtc tgatcaacgg ctatgggatt gaaggaagtt ctcacattta tggactgttc 540
tcagacactg ttgtgtctta tgaagttgtt ctagcagatg ggcaggtagt tagagctaca 600
aaggacaatg aatattctga tcttttctat gctattccat ggtctcaagg gactctaggg 660
cttctggttt cagctgagat caagctcatt ccgatcaagg aatacatgaa acttacctac 720
aaacctgtag ttggtaattt gaaagagatt gctcaggctt atatggattc tttttcacct 780
agagacgggg atcaggataa ccatgagaaa gttccagact ttgttgaaac catggtgtat 840
actcccacag aagctgtttg catgactggt agatatgctt caaaagaaga ggccaagaag 900
aagggcaatg tgatcaacaa tgttggttgg tggttcaaaa cctggtttta ccagcacgct 960
caaactgcac tcaagaaggg agaattcgta gagtacatcc caactaggga atactaccac 1020
aggcacacaa gatgcttgta ttgggaaggg aaacttatcc ttccatttgg tgatcaatgg 1080
tggtttaggt ttctctttgg atgggccatg cctcccaagg tttctctact taaagccact 1140
caaggtgaat acattaggaa ctattaccat gaaaaccatg tcattcagga tatgcttgtt 1200
cctctctaca aggttggtga tgctcttgag tgggtccacc gtgagatgga ggtgtatccc 1260
ctctggctct gcccccacag actctacagg ctgcctctta aaacaatggt gtatcctgaa 1320
ccaggttttg agctgcagaa gaggcagggt gacacaaaat atgctcaaat gtacactgat 1380
gttggtgtct actatgctcc tggacctatt ttgaggggtg aggtctttga tggtatagag 1440
gcagtccgta agttggagag ttggttgatt gagaaccatg gattccagcc acagtatgct 1500
gtctctgagc tgacggagaa gaacttctgg agaatgtttg atggaagcct atatgagaac 1560
tgcaggaaaa agtatagagc catcggaacc ttcatgagtg tgtactataa gtctaagaaa 1620
ggaaagaaga cagagaagga ggtgcaggaa gctgagcaag agactgctga agttgagacc 1680
ccagaagttg atgagcctga agattga 1707
<210> 43
<211> 562
<212> PRT
<213> Maize (Zea mays)
<400> 43
Met Ala Asp Val His Glu Pro Leu Val Arg Arg Lys Arg Lys Lys Val
1 5 10 15
Leu Val Asp Tyr Leu Val Lys Phe Arg Trp Ile Leu Val Ile Phe Val
20 25 30
Val Leu Pro Ile Ser Thr Leu Ile Tyr Phe Asn Ile Phe Leu Gly Asp
35 40 45
Met Trp Ser Ala Met Lys Ser Glu Lys Lys Arg Gln Lys Gln His Asp
50 55 60
Glu Asn Val Gln Lys Val Val Lys Arg Leu Lys Gln Arg Asn Pro Lys
65 70 75 80
Lys Asp Gly Leu Val Cys Thr Ala Arg Lys Pro Trp Ile Ala Val Gly
85 90 95
Met Arg Asn Val Asp Tyr Lys Arg Ala Arg His Phe Glu Val Asp Leu
100 105 110
Ser Ser Phe Arg Asn Ile Leu Glu Ile Asp Lys Glu Arg Met Val Ala
115 120 125
Lys Val Glu Pro Leu Val Asn Met Gly Gln Ile Thr Arg Ala Thr Cys
130 135 140
Pro Met Asn Leu Ala Leu Ala Val Val Ala Glu Leu Asp Asp Leu Thr
145 150 155 160
Val Gly Gly Leu Ile Asn Gly Tyr Gly Ile Glu Gly Ser Ser His Leu
165 170 175
Tyr Gly Leu Phe Ser Asp Thr Val Val Ala Met Glu Val Val Leu Ala
180 185 190
Asp Gly Arg Val Val Arg Ala Thr Lys Asp Asn Glu Tyr Ser Asp Leu
195 200 205
Phe Tyr Gly Ile Pro Trp Ser Gln Gly Thr Leu Gly Phe Leu Val Ser
210 215 220
Ala Glu Ile Lys Leu Ile Pro Ile Lys Glu Tyr Met Lys Leu Thr Tyr
225 230 235 240
Thr Pro Val Lys Gly Gly Leu Lys Glu Ile Ala Gln Ala Tyr Ala Asp
245 250 255
Ser Phe Ala Pro Arg Asp Gly Asp Pro Ala Lys Val Pro Asp Phe Val
260 265 270
Glu Gly Met Val Tyr Thr Glu Ser Glu Gly Val Met Met Thr Gly Val
275 280 285
Tyr Ala Ser Lys Glu Glu Ala Lys Lys Lys Gly Asn Lys Ile Asn Cys
290 295 300
Val Gly Trp Trp Phe Lys Pro Trp Phe Tyr Gln His Ala Gln Thr Ala
305 310 315 320
Leu Asn Arg Gly Glu Phe Val Glu Tyr Ile Pro Thr Arg Glu Tyr Tyr
325 330 335
His Arg His Thr Arg Cys Leu Tyr Trp Glu Gly Lys Leu Ile Leu Pro
340 345 350
Phe Gly Asp Gln Phe Trp Phe Arg Phe Leu Leu Gly Trp Leu Met Pro
355 360 365
Pro Lys Val Ser Leu Leu Lys Ala Thr Gln Gly Glu Ala Ile Arg Asn
370 375 380
Tyr Tyr His Asp Asn His Val Ile Gln Asp Met Leu Val Pro Leu Tyr
385 390 395 400
Lys Val Gly Asp Ala Leu Glu Phe Val His Arg Glu Met Glu Val Tyr
405 410 415
Pro Leu Trp Leu Cys Pro His Arg Leu Tyr Lys Leu Pro Val Lys Thr
420 425 430
Met Val Tyr Pro Glu Pro Gly Phe Glu His Gln His Arg Gln Gly Asp
435 440 445
Ala Ser Tyr Ala Gln Met Phe Thr Asp Val Gly Val Tyr Tyr Ala Pro
450 455 460
Gly Ala Val Leu Arg Gly Glu Glu Phe Asn Gly Ala Glu Ala Val His
465 470 475 480
Arg Leu Glu Gln Trp Leu Ile Glu Asn His Ser Tyr Gln Pro Gln Tyr
485 490 495
Ala Val Ser Glu Leu Asn Glu Lys Asp Ser Trp Arg Met Phe Asp Ala
500 505 510
Ser His Tyr Glu His Cys Arg Gln Lys Tyr Gly Ala Val Gly Thr Phe
515 520 525
Met Ser Val Tyr Tyr Lys Ser Lys Lys Gly Arg Lys Thr Glu Lys Glu
530 535 540
Val Gln Glu Ala Glu Ala Ala Ile Leu Glu Pro Ala Tyr Ala Asp Glu
545 550 555 560
Glu Ala
<210> 44
<211> 1689
<212> DNA
<213> Maize (Zea mays)
<400> 44
atggcggacg tgcacgaacc tttggtgcgc cgtaagagga agaaggtttt ggtggactac 60
ttggtgaagt tccgatggat cctcgtgatc ttcgtggtcc ttcctatttc aactctgatc 120
tacttcaaca tcttcctggg cgacatgtgg tccgccatga agtcggagaa gaagcgccag 180
aagcagcacg acgagaacgt gcagaaggtc gtgaagcggc tcaagcagag gaacccgaag 240
aaggacggtc ttgtttgcac ggccaggaag ccctggatcg ctgttggcat gcgcaacgtg 300
gactacaagc gtgcgaggca tttcgaggtc gacctttctt ccttcaggaa catccttgag 360
atcgacaaag agaggatggt tgccaaggtc gagccccttg tcaacatggg tcagataacc 420
agagctacct gcccaatgaa ccttgccctt gcggtcgtcg ccgagctcga cgacctcact 480
gttggtgggc tgatcaacgg ttacggcatc gaggggagct ctcacctcta tggccttttc 540
tccgacacgg ttgtcgcgat ggaggttgtt ctcgcagatg gccgggtcgt cagagccacc 600
aaggacaacg agtactctga ccttttctat ggaattccct ggtcccaggg aacactgggg 660
ttccttgtct ctgcagagat caagctgatc cccatcaagg agtacatgaa gctcacctac 720
actccagtca aggggggtct aaaggagatc gcgcaggcct acgcggattc tttcgctccg 780
agggacggtg acccggcaaa ggtccctgac tttgttgaag ggatggtgta cacagagagc 840
gagggtgtca tgatgacggg cgtgtacgct tcgaaagaag aggcgaagaa gaagggcaac 900
aagatcaact gcgtggggtg gtggtttaag ccctggttct accagcacgc tcagacggcg 960
ctgaataggg gcgagtttgt ggagtacatc ccgacgaggg agtactacca ccggcacacc 1020
cggtgcctgt actgggaggg gaagctgatc ctgcccttcg gcgaccagtt ctggttcagg 1080
ttcctgctgg gctggctgat gccaccgaag gtgtccctgc tgaaggcgac ccagggcgag 1140
gctatcagga actactacca cgacaaccat gtgatccagg acatgctggt gccgctgtac 1200
aaggttgggg atgcgctgga gttcgtgcac cgcgagatgg aggtgtatcc tctgtggctg 1260
tgccctcacc ggctgtacaa gctgccggtg aagacgatgg tgtacccgga gcctgggttc 1320
gagcaccagc acaggcaggg cgacgcgagc tacgcacaga tgttcacgga cgtgggcgtg 1380
tactacgccc ccggggcggt gctgaggggg gaggagttca acggcgcgga ggctgtgcac 1440
aggctggagc agtggctgat cgagaaccac agctaccagc cgcagtacgc ggtgtcggag 1500
ctgaacgaga aggactcctg gcgcatgttc gacgcgtcgc actacgagca ctgccgccaa 1560
aagtacgggg cggtgggcac gttcatgagc gtgtactaca agtccaagaa ggggcgcaag 1620
acggagaagg aggtgcagga ggcggaggcg gccatactgg agccggccta cgcggacgag 1680
gaggcctaa 1689
<210> 45
<211> 516
<212> PRT
<213> Zebrafish (Danio rerio)
<400> 45
Met Asp Pro Leu Leu Tyr Leu Gly Gly Leu Ala Val Leu Phe Leu Ile
1 5 10 15
Trp Ile Lys Val Lys Gly Leu Glu Tyr Val Ile Ile His Gln Arg Trp
20 25 30
Ile Phe Val Cys Leu Phe Leu Leu Pro Leu Ser Val Val Phe Asp Val
35 40 45
Tyr Tyr His Leu Arg Ala Trp Ile Ile Phe Lys Met Cys Ser Ala Pro
50 55 60
Lys Gln His Asp Gln Arg Val Arg Asp Ile Gln Arg Gln Val Arg Glu
65 70 75 80
Trp Arg Lys Asp Gly Gly Lys Lys Tyr Met Cys Thr Gly Arg Pro Gly
85 90 95
Trp Leu Thr Val Ser Leu Arg Val Gly Lys Tyr Lys Lys Thr His Lys
100 105 110
Asn Ile Met Ile Asn Met Met Asp Ile Leu Glu Val Asp Thr Lys Arg
115 120 125
Lys Val Val Arg Val Glu Pro Leu Ala Asn Met Gly Gln Val Thr Ala
130 135 140
Leu Leu Asn Ser Ile Gly Trp Thr Leu Pro Val Leu Pro Glu Leu Asp
145 150 155 160
Asp Leu Thr Val Gly Gly Leu Val Met Gly Thr Gly Ile Glu Ser Ser
165 170 175
Ser His Ile Tyr Gly Leu Phe Gln His Ile Cys Val Ala Phe Glu Leu
180 185 190
Val Leu Ala Asp Gly Ser Leu Val Arg Cys Thr Glu Lys Glu Asn Ser
195 200 205
Asp Leu Phe Tyr Ala Val Pro Trp Ser Cys Gly Thr Leu Gly Phe Leu
210 215 220
Val Ala Ala Glu Ile Arg Ile Ile Pro Ala Gln Lys Trp Val Lys Leu
225 230 235 240
His Tyr Glu Pro Val Arg Gly Leu Asp Ala Ile Cys Lys Lys Phe Ala
245 250 255
Glu Glu Ser Ala Asn Lys Glu Asn Gln Phe Val Glu Gly Leu Gln Tyr
260 265 270
Ser Arg Asp Glu Ala Val Ile Met Thr Gly Val Met Thr Asp His Ala
275 280 285
Glu Pro Asp Lys Thr Asn Cys Ile Gly Tyr Tyr Tyr Lys Pro Trp Phe
290 295 300
Phe Arg His Val Glu Ser Phe Leu Lys Gln Asn Arg Val Ala Val Glu
305 310 315 320
Tyr Ile Pro Leu Arg His Tyr Tyr His Arg His Thr Arg Ser Ile Phe
325 330 335
Trp Glu Leu Gln Asp Ile Ile Pro Phe Gly Asn Asn Pro Leu Phe Arg
340 345 350
Tyr Val Phe Gly Trp Met Val Pro Pro Lys Ile Ser Leu Leu Lys Leu
355 360 365
Thr Gln Gly Glu Thr Ile Arg Lys Leu Tyr Glu Gln His His Val Val
370 375 380
Gln Asp Met Leu Val Pro Met Lys Asp Ile Lys Ala Ala Ile Gln Arg
385 390 395 400
Phe His Glu Asp Ile His Val Tyr Pro Leu Trp Leu Cys Pro Phe Leu
405 410 415
Leu Pro Asn Gln Pro Gly Met Val His Pro Lys Gly Asp Glu Asp Glu
420 425 430
Leu Tyr Val Asp Ile Gly Ala Tyr Gly Glu Pro Lys Val Lys His Phe
435 440 445
Glu Ala Thr Ser Ser Thr Arg Gln Leu Glu Lys Phe Val Arg Asp Val
450 455 460
His Gly Phe Gln Met Leu Tyr Ala Asp Val Tyr Met Glu Arg Lys Glu
465 470 475 480
Phe Trp Glu Met Phe Asp Gly Thr Leu Tyr His Lys Leu Arg Glu Glu
485 490 495
Leu Gly Cys Lys Asp Ala Phe Pro Glu Val Phe Asp Lys Ile Cys Lys
500 505 510
Ser Ala Arg His
515
<210> 46
<211> 1551
<212> DNA
<213> Zebrafish (Danio rerio)
<400> 46
atggacccat tgttatactt gggtggttta gctgttttgt ttttaatttg gatcaaggtt 60
aaaggtttag aatatgttat tattcatcaa agatggattt ttgtttgttt atttttgttg 120
ccattgtcag ttgttttcga tgtttactac catttgagag cttggatcat ttttaagatg 180
tgttctgcac caaagcaaca tgatcaaaga gttagagata ttcaaagaca agttagagaa 240
tggagaaaag atggtggtaa aaagtacatg tgtactggta gaccaggttg gttgacagtt 300
tcattaagag ttggtaaata caagaaaact cataagaaca tcatgattaa tatgatggat 360
attttagaag ttgatacaaa gagaaaggtt gttagagttg aaccattggc taatatgggt 420
caagttactg cattgttaaa ttctatcggt tggacattgc cagttttacc agaattggat 480
gatttgactg ttggtggttt agttatgggt acaggtatcg aatcttcatc tcatatctat 540
ggtttgttcc aacatatttg tgttgctttc gaattggttt tagcagatgg ttctttagtt 600
agatgtactg aaaaggaaaa ttcagatttg ttttacgctg ttccttggtc ttgtggtaca 660
ttgggtttct tggttgctgc agaaatcaga atcatcccag ctcaaaagtg ggttaaattg 720
cattatgaac cagttagagg tttggatgca atttgtaaga aattcgctga agaatcagca 780
aataaggaaa accaattcgt tgaaggttta caatattcaa gagatgaagc tgttattatg 840
actggtgtta tgacagatca tgcagaacca gataagacta actgtatcgg ttactactac 900
aagccttggt ttttcagaca tgttgaatca tttttaaagc aaaacagagt tgcagttgaa 960
tacatcccat tgagacatta ctaccataga catacaagat caattttctg ggaattacaa 1020
gatattatcc cattcggtaa caacccattg tttagatacg tttttggttg gatggttcca 1080
ccaaaaattt cattgttgaa attgactcaa ggtgaaacaa tcagaaaatt gtacgaacaa 1140
catcatgttg ttcaagatat gttagttcca atgaaggata ttaaggctgc aatccaaaga 1200
ttccatgaag atattcatgt ttacccattg tggttgtgtc catttttgtt accaaatcaa 1260
cctggtatgg ttcatccaaa aggtgacgaa gatgaattgt acgttgatat tggtgcttat 1320
ggtgaaccaa aggttaagca tttcgaagca acttcatcta caagacaatt agaaaagttt 1380
gttagagatg ttcatggttt ccaaatgttg tacgctgatg tttacatgga aagaaaggaa 1440
ttttgggaaa tgttcgatgg tactttgtac cataaattga gagaagaatt gggttgtaaa 1500
gatgcttttc cagaagtttt tgataaaatt tgtaaatctg caagacatta a 1551
<210> 47
<211> 516
<212> PRT
<213> Human (Homo sapiens)
<400> 47
Met Glu Pro Ala Val Ser Leu Ala Val Cys Ala Leu Leu Phe Leu Leu
1 5 10 15
Trp Val Arg Leu Lys Gly Leu Glu Phe Val Leu Ile His Gln Arg Trp
20 25 30
Val Phe Val Cys Leu Phe Leu Leu Pro Leu Ser Leu Ile Phe Asp Ile
35 40 45
Tyr Tyr Tyr Val Arg Ala Trp Val Val Phe Lys Leu Ser Ser Ala Pro
50 55 60
Arg Leu His Glu Gln Arg Val Arg Asp Ile Gln Lys Gln Val Arg Glu
65 70 75 80
Trp Lys Glu Gln Gly Ser Lys Thr Phe Met Cys Thr Gly Arg Pro Gly
85 90 95
Trp Leu Thr Val Ser Leu Arg Val Gly Lys Tyr Lys Lys Thr His Lys
100 105 110
Asn Ile Met Ile Asn Leu Met Asp Ile Leu Glu Val Asp Thr Lys Lys
115 120 125
Gln Ile Val Arg Val Glu Pro Leu Val Thr Met Gly Gln Val Thr Ala
130 135 140
Leu Leu Thr Ser Ile Gly Trp Thr Leu Pro Val Leu Pro Glu Leu Asp
145 150 155 160
Asp Leu Thr Val Gly Gly Leu Ile Met Gly Thr Gly Ile Glu Ser Ser
165 170 175
Ser His Lys Tyr Gly Leu Phe Gln His Ile Cys Thr Ala Tyr Glu Leu
180 185 190
Val Leu Ala Asp Gly Ser Phe Val Arg Cys Thr Pro Ser Glu Asn Ser
195 200 205
Asp Leu Phe Tyr Ala Val Pro Trp Ser Cys Gly Thr Leu Gly Phe Leu
210 215 220
Val Ala Ala Glu Ile Arg Ile Ile Pro Ala Lys Lys Tyr Val Lys Leu
225 230 235 240
Arg Phe Glu Pro Val Arg Gly Leu Glu Ala Ile Cys Ala Lys Phe Thr
245 250 255
His Glu Ser Gln Arg Gln Glu Asn His Phe Val Glu Gly Leu Leu Tyr
260 265 270
Ser Leu Asp Glu Ala Val Ile Met Thr Gly Val Met Thr Asp Glu Ala
275 280 285
Glu Pro Ser Lys Leu Asn Ser Ile Gly Asn Tyr Tyr Lys Pro Trp Phe
290 295 300
Phe Lys His Val Glu Asn Tyr Leu Lys Thr Asn Arg Glu Gly Leu Glu
305 310 315 320
Tyr Ile Pro Leu Arg His Tyr Tyr His Arg His Thr Arg Ser Ile Phe
325 330 335
Trp Glu Leu Gln Asp Ile Ile Pro Phe Gly Asn Asn Pro Ile Phe Arg
340 345 350
Tyr Leu Phe Gly Trp Met Val Pro Pro Lys Ile Ser Leu Leu Lys Leu
355 360 365
Thr Gln Gly Glu Thr Leu Arg Lys Leu Tyr Glu Gln His His Val Val
370 375 380
Gln Asp Met Leu Val Pro Met Lys Cys Leu Gln Gln Ala Leu His Thr
385 390 395 400
Phe Gln Asn Asp Ile His Val Tyr Pro Ile Trp Leu Cys Pro Phe Ile
405 410 415
Leu Pro Ser Gln Pro Gly Leu Val His Pro Lys Gly Asn Glu Ala Glu
420 425 430
Leu Tyr Ile Asp Ile Gly Ala Tyr Gly Glu Pro Arg Val Lys His Phe
435 440 445
Glu Ala Arg Ser Cys Met Arg Gln Leu Glu Lys Phe Val Arg Ser Val
450 455 460
His Gly Phe Gln Met Leu Tyr Ala Asp Cys Tyr Met Asn Arg Glu Glu
465 470 475 480
Phe Trp Glu Met Phe Asp Gly Ser Leu Tyr His Lys Leu Arg Glu Lys
485 490 495
Leu Gly Cys Gln Asp Ala Phe Pro Glu Val Tyr Asp Lys Ile Cys Lys
500 505 510
Ala Ala Arg His
515
<210> 48
<211> 1551
<212> DNA
<213> Human (Homo sapiens)
<400> 48
atggaaccag ctgtttcatt ggctgtttgt gcattgttat ttttgttgtg ggttagattg 60
aagggtttag aatttgtttt gattcatcaa agatgggttt tcgtttgttt gtttttgtta 120
ccattgtctt taatcttcga tatatattac tacgttagag cttgggttgt ttttaaatta 180
tcttcagcac caagattgca tgaacaaaga gttagagata ttcaaaagca agttagagaa 240
tggaaggaac aaggttcaaa gacttttatg tgtacaggta gaccaggttg gttgactgtt 300
tctttaagag ttggtaaata caagaaaact cataagaaca tcatgattaa tttgatggat 360
attttagaag ttgatactaa gaaacaaatc gttagagttg aaccattggt tacaatgggt 420
caagttactg ctttgttaac atctattggt tggactttgc cagttttacc agaattggat 480
gatttgactg ttggtggttt aattatgggt acaggtatcg aatcttcatc tcataagtac 540
ggtttgttcc aacatatttg tactgcttat gaattggttt tagcagatgg ttcatttgtt 600
agatgtacac catcagaaaa ttctgatttg ttttatgctg ttccttggtc ttgtggtact 660
ttgggtttct tggttgctgc agaaattaga atcatcccag ctaagaaata cgttaaattg 720
agatttgaac cagttagagg tttggaagct atttgtgcaa agtttactca tgaatcacaa 780
agacaagaaa accatttcgt tgaaggtttg ttgtactctt tagatgaagc tgttattatg 840
actggtgtta tgacagatga agcagaacca tcaaaattaa attctatcgg taactactac 900
aagccttggt ttttcaagca tgttgaaaac tacttaaaga ctaatagaga aggtttagaa 960
tacatcccat tgagacatta ctaccataga catacaagat caattttctg ggaattgcaa 1020
gatattatcc cattcggtaa caacccaatt tttagatact tattcggttg gatggttcca 1080
ccaaaaattt ctttgttgaa attgactcaa ggtgaaacat tgagaaaatt gtacgaacaa 1140
catcatgttg ttcaagatat gttagttcca atgaagtgtt tgcaacaagc attgcataca 1200
ttccaaaacg atattcatgt ttacccaatt tggttgtgtc cttttatttt gccatcacaa 1260
ccaggtttag ttcatccaaa gggtaatgaa gctgaattgt acattgatat tggtgcttat 1320
ggtgaaccaa gagttaagca tttcgaagct agatcatgta tgagacaatt agaaaagttt 1380
gttagatcag ttcatggttt ccaaatgttg tacgcagatt gttacatgaa cagagaagaa 1440
ttttgggaaa tgttcgatgg ttctttgtac cataaattga gagaaaaatt gggttgtcaa 1500
gatgcatttc cagaagttta tgataaaatt tgtaaagctg caagacatta a 1551
<210> 49
<211> 504
<212> PRT
<213> Human (Homo sapiens)
<400> 49
Met Met Thr Thr Ser Leu Ile Trp Gly Ile Ala Ile Ala Ala Cys Cys
1 5 10 15
Cys Leu Trp Leu Ile Leu Gly Ile Arg Arg Arg Gln Thr Gly Glu Pro
20 25 30
Pro Leu Glu Asn Gly Leu Ile Pro Tyr Leu Gly Cys Ala Leu Gln Phe
35 40 45
Gly Ala Asn Pro Leu Glu Phe Leu Arg Ala Asn Gln Arg Lys His Gly
50 55 60
His Val Phe Thr Cys Lys Leu Met Gly Lys Tyr Val His Phe Ile Thr
65 70 75 80
Asn Pro Leu Ser Tyr His Lys Val Leu Cys His Gly Lys Tyr Phe Asp
85 90 95
Trp Lys Lys Phe His Phe Ala Thr Ser Ala Lys Ala Phe Gly His Arg
100 105 110
Ser Ile Asp Pro Met Asp Gly Asn Thr Thr Glu Asn Ile Asn Asp Thr
115 120 125
Phe Ile Lys Thr Leu Gln Gly His Ala Leu Asn Ser Leu Thr Glu Ser
130 135 140
Met Met Glu Asn Leu Gln Arg Ile Met Arg Pro Pro Val Ser Ser Asn
145 150 155 160
Ser Lys Thr Ala Ala Trp Val Thr Glu Gly Met Tyr Ser Phe Cys Tyr
165 170 175
Arg Val Met Phe Glu Ala Gly Tyr Leu Thr Ile Phe Gly Arg Asp Leu
180 185 190
Thr Arg Arg Asp Thr Gln Lys Ala His Ile Leu Asn Asn Leu Asp Asn
195 200 205
Phe Lys Gln Phe Asp Lys Val Phe Pro Ala Leu Val Ala Gly Leu Pro
210 215 220
Ile His Met Phe Arg Thr Ala His Asn Ala Arg Glu Lys Leu Ala Glu
225 230 235 240
Ser Leu Arg His Glu Asn Leu Gln Lys Arg Glu Ser Ile Ser Glu Leu
245 250 255
Ile Ser Leu Arg Met Phe Leu Asn Asp Thr Leu Ser Thr Phe Asp Asp
260 265 270
Leu Glu Lys Ala Lys Thr His Leu Val Val Leu Trp Ala Ser Gln Ala
275 280 285
Asn Thr Ile Pro Ala Thr Phe Trp Ser Leu Phe Gln Met Ile Arg Asn
290 295 300
Pro Glu Ala Met Lys Ala Ala Thr Glu Glu Val Lys Arg Thr Leu Glu
305 310 315 320
Asn Ala Gly Gln Lys Val Ser Leu Glu Gly Asn Pro Ile Cys Leu Ser
325 330 335
Gln Ala Glu Leu Asn Asp Leu Pro Val Leu Asp Ser Ile Ile Lys Glu
340 345 350
Ser Leu Arg Leu Ser Ser Ala Ser Leu Asn Ile Arg Thr Ala Lys Glu
355 360 365
Asp Phe Thr Leu His Leu Glu Asp Gly Ser Tyr Asn Ile Arg Lys Asp
370 375 380
Asp Ile Ile Ala Leu Tyr Pro Gln Leu Met His Leu Asp Pro Glu Ile
385 390 395 400
Tyr Pro Asp Pro Leu Thr Phe Lys Tyr Asp Arg Tyr Leu Asp Glu Asn
405 410 415
Gly Lys Thr Lys Thr Thr Phe Tyr Cys Asn Gly Leu Lys Leu Lys Tyr
420 425 430
Tyr Tyr Met Pro Phe Gly Ser Gly Ala Thr Ile Cys Pro Gly Arg Leu
435 440 445
Phe Ala Ile His Glu Ile Lys Gln Phe Leu Ile Leu Met Leu Ser Tyr
450 455 460
Phe Glu Leu Glu Leu Ile Glu Gly Gln Ala Lys Cys Pro Pro Leu Asp
465 470 475 480
Gln Ser Arg Ala Gly Leu Gly Ile Leu Pro Pro Leu Asn Asp Ile Glu
485 490 495
Phe Lys Tyr Lys Phe Lys His Leu
500
<210> 50
<211> 1515
<212> DNA
<213> Human (Homo sapiens)
<400> 50
atgatgacta catctttgat ttggggtatt gctattgctg catgttgttg tttgtggttg 60
atcttgggta ttagaagaag acaaactggt gaaccaccat tggaaaacgg tttgatccca 120
tatttgggtt gtgctttaca attcggtgca aacccattgg aattcttgag agctaaccaa 180
agaaagcatg gtcatgtttt tacttgtaag ttgatgggta aatacgttca tttcatcaca 240
aacccattgt cataccataa agttttatgt catggtaaat acttcgattg gaagaaattc 300
catttcgcta cttctgctaa ggcatttggt catagatcaa ttgatccaat ggatggtaat 360
actacagaaa acatcaacga tacttttatt aagacattgc aaggtcatgc attgaactct 420
ttgacagaat caatgatgga aaatttgcaa agaatcatga gaccaccagt ttcttcaaat 480
tctaaaactg ctgcatgggt tacagaaggc atgtactcat tctgttacag agttatgttc 540
gaagctggtt atttgactat cttcggtaga gatttgacta gaagagatac acaaaaggca 600
catatcttga acaatttgga taacttcaaa caatttgata aagtttttcc agctttggtt 660
gcaggtttac caattcatat gtttagaaca gctcataatg caagagaaaa gttggctgaa 720
tctttgagac atgaaaattt gcaaaagaga gaatctatct cagaattgat ctctttgaga 780
atgtttttga atgatacttt atcaacattc gatgatttgg aaaaggcaaa gactcatttg 840
gttgttttgt gggcttctca agcaaatact attccagcta cattctggtc attgttccaa 900
atgatcagaa acccagaagc aatgaaagct gcaactgaag aagttaagag aacattggaa 960
aacgctggtc aaaaagtttc tttggaaggt aacccaatct gtttgtcaca agcagaattg 1020
aacgatttgc cagttttgga ttctattatt aaggaatcat tgagattgtc ttcagcttct 1080
ttgaacatca gaactgcaaa ggaagatttc acattgcatt tggaagatgg ttcatacaac 1140
atcagaaagg atgatatcat tgctttatat ccacaattaa tgcacttaga tccagaaatc 1200
tatccagatc cattgacttt taaatacgat agatatttgg atgaaaatgg taaaacaaaa 1260
actacattct actgtaacgg tttgaagttg aagtattact atatgccatt tggttctggt 1320
gctacaattt gtccaggtag attgtttgca atccatgaaa ttaaacaatt cttgatcttg 1380
atgttatctt attttgaatt ggaattgatc gaaggtcagg ctaagtgtcc accattggat 1440
caatcaagag caggtttggg tattttgcca ccattgaacg atattgaatt caaatacaag 1500
tttaaacatt tgtaa 1515
<210> 51
<211> 503
<212> PRT
<213> Brown rat (Rattus norvegicus)
<400> 51
Met Met Thr Ile Ser Leu Ile Trp Gly Ile Ala Val Leu Val Ser Cys
1 5 10 15
Cys Ile Trp Phe Ile Val Gly Ile Arg Arg Arg Lys Ala Gly Glu Pro
20 25 30
Pro Leu Glu Asn Gly Leu Ile Pro Tyr Leu Gly Cys Ala Leu Lys Phe
35 40 45
Gly Ser Asn Pro Leu Glu Phe Leu Arg Ala Asn Gln Arg Lys His Gly
50 55 60
His Val Phe Thr Cys Lys Leu Met Gly Lys Tyr Val His Phe Ile Thr
65 70 75 80
Asn Ser Leu Ser Tyr His Lys Val Leu Cys His Gly Lys Tyr Phe Asp
85 90 95
Trp Lys Lys Phe His Tyr Thr Thr Ser Ala Lys Ala Phe Gly His Arg
100 105 110
Ser Ile Asp Pro Asn Asp Gly Asn Thr Thr Glu Asn Ile Asn Asn Thr
115 120 125
Phe Thr Lys Thr Leu Gln Gly Asp Ala Leu Cys Ser Leu Ser Glu Ala
130 135 140
Met Met Gln Asn Leu Gln Ser Val Met Arg Pro Pro Gly Leu Pro Lys
145 150 155 160
Ser Lys Ser Asn Ala Trp Val Thr Glu Gly Met Tyr Ala Phe Cys Tyr
165 170 175
Arg Val Met Phe Glu Ala Gly Tyr Leu Thr Leu Phe Gly Arg Asp Ile
180 185 190
Ser Lys Thr Asp Thr Gln Lys Ala Leu Ile Leu Asn Asn Leu Asp Asn
195 200 205
Phe Lys Gln Phe Asp Gln Val Phe Pro Ala Leu Val Ala Gly Leu Pro
210 215 220
Ile His Leu Phe Lys Thr Ala His Lys Ala Arg Glu Lys Leu Ala Glu
225 230 235 240
Gly Leu Lys His Lys Asn Leu Cys Val Arg Asp Gln Val Ser Glu Leu
245 250 255
Ile Arg Leu Arg Met Phe Leu Asn Asp Thr Leu Ser Thr Phe Asp Asp
260 265 270
Met Glu Lys Ala Lys Thr His Leu Ala Ile Leu Trp Ala Ser Gln Ala
275 280 285
Asn Thr Ile Pro Ala Thr Phe Trp Ser Leu Phe Gln Met Ile Arg Ser
290 295 300
Pro Glu Ala Met Lys Ala Ala Ser Glu Glu Val Ser Gly Ala Leu Gln
305 310 315 320
Ser Ala Gly Gln Glu Leu Ser Ser Gly Gly Ser Ala Ile Tyr Leu Asp
325 330 335
Gln Val Gln Leu Asn Asp Leu Pro Val Leu Asp Ser Ile Ile Lys Glu
340 345 350
Ala Leu Arg Leu Ser Ser Ala Ser Leu Asn Ile Arg Thr Ala Lys Glu
355 360 365
Asp Phe Thr Leu His Leu Glu Asp Gly Ser Tyr Asn Ile Arg Lys Asp
370 375 380
Asp Met Ile Ala Leu Tyr Pro Gln Leu Met His Leu Asp Pro Glu Ile
385 390 395 400
Tyr Pro Asp Pro Leu Thr Phe Lys Tyr Asp Arg Tyr Leu Asp Glu Ser
405 410 415
Gly Lys Ala Lys Thr Thr Phe Tyr Ser Asn Gly Asn Lys Leu Lys Cys
420 425 430
Phe Tyr Met Pro Phe Gly Ser Gly Ala Thr Ile Cys Pro Gly Arg Leu
435 440 445
Phe Ala Val Gln Glu Ile Lys Gln Phe Leu Ile Leu Met Leu Ser Cys
450 455 460
Phe Glu Leu Glu Phe Val Glu Ser Gln Val Lys Cys Pro Pro Leu Asp
465 470 475 480
Gln Ser Arg Ala Gly Leu Gly Ile Leu Pro Pro Leu His Asp Ile Glu
485 490 495
Phe Lys Tyr Lys Leu Lys His
500
<210> 52
<211> 1512
<212> DNA
<213> Brown rat (Rattus norvegicus)
<400> 52
atgatgacaa tttctttgat ttggggtatc gctgttttag tttcatgttg tatctggttc 60
atcgttggta ttagaagaag aaaggcaggt gaaccaccat tagaaaatgg tttgattcca 120
tacttaggtt gtgctttgaa gttcggttct aacccattgg aattcttgag agcaaaccaa 180
agaaagcatg gtcatgtttt tacatgcaag ttgatgggta aatacgttca tttcatcact 240
aactctttgt cataccataa agttttatgt catggtaaat acttcgattg gaagaaattc 300
cattacacta catctgctaa ggcatttggt catagatcaa tcgatccaaa cgatggtaac 360
actacagaaa acatcaacaa cacttttaca aagactttac aaggtgacgc tttatgttct 420
ttgtcagaag caatgatgca aaatttgcaa tctgttatga gaccaccagg tttaccaaaa 480
tctaaatcaa atgcttgggt tacagaaggc atgtacgcat tctgttacag agttatgttc 540
gaagctggtt atttgacttt gttcggtaga gatatctcaa aaacagatac tcaaaaagca 600
ttgatcttga acaatttgga taacttcaaa caatttgatc aagtttttcc agctttagtt 660
gcaggtttgc caatccattt gtttaaaaca gctcataagg caagagaaaa gttggctgaa 720
ggtttgaagc ataagaattt gtgtgttaga gatcaagttt ctgaattgat cagattgaga 780
atgttcttga atgatacatt atcaactttc gatgatatgg aaaaggctaa gactcatttg 840
gcaatcttgt gggcttctca agcaaataca attccagcta ctttctggtc tttgttccaa 900
atgatcagat caccagaagc aatgaaagct gcatctgaag aagtttcagg tgctttgcaa 960
tctgcaggtc aagaattatc ttcaggtggt tcagctatat atttggatca agttcaattg 1020
aacgatttgc cagttttgga ttctattatt aaggaagctt tgagattgtc ttcagcatca 1080
ttgaacatca gaacagctaa ggaagatttc actttgcatt tggaagatgg ttcttacaac 1140
atcagaaagg atgatatgat tgcattatac ccacaattga tgcacttaga tccagaaatc 1200
tatccagatc cattgacttt taaatacgat agatacttag atgaatctgg taaagctaag 1260
actacattct actcaaacgg taataagttg aagtgtttct acatgccatt tggttctggt 1320
gctactattt gtccaggtag attatttgca gttcaagaaa ttaaacaatt cttgatcttg 1380
atgttgtctt gtttcgaatt ggaattcgtt gaatcacagg ttaagtgtcc accattagat 1440
caatcaagag ctggtttggg tattttacca ccattgcatg atatcgaatt caaatataaa 1500
ttgaaacatt aa 1512
<210> 53
<211> 503
<212> PRT
<213> House mouse (Mus musculus)
<400> 53
Met Met Ser Ile Ser Leu Ile Trp Gly Ile Ala Val Val Val Ser Cys
1 5 10 15
Cys Ile Trp Phe Ile Ile Gly Ile Arg Arg Arg Lys Val Gly Glu Pro
20 25 30
Pro Leu Asp Asn Gly Leu Ile Pro Tyr Leu Gly Cys Ala Leu Lys Phe
35 40 45
Gly Ser Asn Pro Leu Glu Phe Leu Arg Ala Lys Gln Arg Lys His Gly
50 55 60
His Val Phe Thr Cys Lys Leu Met Gly Lys Tyr Val His Phe Ile Thr
65 70 75 80
Asn Ser Leu Ser Tyr His Lys Val Leu Cys His Gly Lys Tyr Phe Asp
85 90 95
Trp Lys Lys Phe His Tyr Thr Thr Ser Ala Lys Ala Phe Gly His Arg
100 105 110
Ser Ile Asp Pro Ser Asp Gly Asn Thr Thr Glu Asn Ile Asn Lys Thr
115 120 125
Phe Asn Lys Thr Leu Gln Gly Asp Ala Leu Cys Ser Leu Ser Glu Ala
130 135 140
Met Met Gln Asn Leu Gln Ser Val Met Arg Pro Pro Gly Leu Pro Lys
145 150 155 160
Ser Lys Ser Ala Val Trp Val Thr Glu Gly Met Tyr Ala Phe Cys Tyr
165 170 175
Arg Val Met Phe Glu Ala Gly Tyr Leu Thr Leu Phe Gly Lys Asp Ile
180 185 190
Ser Lys Thr Asp Ser Gln Arg Ala Phe Ile Gln Asn Asn Leu Asp Ser
195 200 205
Phe Lys Gln Phe Asp Gln Val Phe Pro Ala Leu Val Ala Gly Val Pro
210 215 220
Ile His Leu Phe Lys Thr Ala His Lys Ala Arg Glu Arg Leu Ala Glu
225 230 235 240
Ser Leu Lys His Lys Asn Leu Tyr Met Arg Asp Gln Val Ser Glu Leu
245 250 255
Ile Arg Leu Arg Met Phe Leu Asn Asp Thr Leu Ser Thr Phe Asp Asp
260 265 270
Met Glu Lys Ala Lys Thr His Leu Val Ile Leu Trp Ala Ser Gln Ala
275 280 285
Asn Thr Ile Pro Ala Thr Phe Trp Ser Leu Phe Gln Met Ile Arg Ser
290 295 300
Pro Glu Ala Met Lys Ala Ala Ser Glu Glu Val Asn Gly Ala Leu Gln
305 310 315 320
Ser Ala Gly Gln Glu Leu Ser Ser Gly Gly Asn Ala Ile Tyr Leu Asp
325 330 335
Gln Glu Gln Leu Asn Asn Leu Pro Val Leu Asp Ser Ile Ile Lys Glu
340 345 350
Ala Leu Arg Leu Ser Ser Ala Ser Leu Asn Ile Arg Thr Ala Lys Glu
355 360 365
Asp Phe Thr Leu His Leu Glu Asp Gly Ser Tyr Asn Ile Arg Lys Asp
370 375 380
Asp Ile Ile Ala Leu Tyr Pro Gln Leu Met His Leu Asp Pro Glu Ile
385 390 395 400
Tyr Pro Asp Pro Leu Thr Phe Lys Tyr Asp Arg Tyr Leu Asp Glu Ser
405 410 415
Gly Lys Ala Lys Thr Thr Phe Tyr Arg Asn Gly Asn Lys Leu Lys Tyr
420 425 430
Phe Tyr Met Pro Phe Gly Ser Gly Ala Thr Ile Cys Pro Gly Arg Leu
435 440 445
Phe Ala Val Gln Glu Ile Lys Gln Phe Leu Ile Leu Met Leu Ser Tyr
450 455 460
Phe Glu Leu Glu Leu Val Glu Ser His Thr Lys Cys Pro Pro Leu Asp
465 470 475 480
Gln Ser Arg Ala Gly Leu Gly Ile Leu Pro Pro Leu Asn Asp Ile Glu
485 490 495
Phe Lys Tyr Lys Leu Lys His
500
<210> 54
<211> 1512
<212> DNA
<213> House mouse (Mus musculus)
<400> 54
atgatgtcta tctcattgat ctggggtatc gctgttgttg tttcttgttg tatttggttc 60
atcatcggta ttagaagaag aaaggttggt gaaccaccat tggataacgg tttgatccca 120
tatttgggtt gtgcattgaa gttcggttca aacccattgg aattcttgag agctaagcaa 180
agaaagcatg gtcatgtttt tacttgtaag ttaatgggta aatacgttca tttcatcaca 240
aactctttat cataccataa ggttttgtgt catggtaaat acttcgattg gaagaaattc 300
cattacacta catctgctaa ggcatttggt catagatcta ttgatccatc agatggtaat 360
actacagaaa acatcaataa gacttttaat aagacattgc aaggtgacgc tttatgttct 420
ttgtcagaag caatgatgca aaatttgcaa tcagttatga gaccaccagg tttgccaaaa 480
tctaaatcag ctgtttgggt tactgaaggc atgtacgcat tctgttacag agttatgttc 540
gaagctggtt acttgacttt gttcggtaaa gatatctcta agacagattc acaaagagct 600
tttattcaaa acaatttgga ttcttttaaa caatttgatc aagtttttcc agctttagtt 660
gcaggtgttc caatccattt gtttaaaact gctcataaag caagagaaag attagcagaa 720
tctttgaagc ataagaattt gtacatgaga gatcaagttt cagaattaat tagattgaga 780
atgtttttaa atgatacttt gtctacattc gatgatatgg aaaaggctaa gacacatttg 840
gttattttgt gggcttcaca agcaaatact attccagcaa cattctggtc tttgttccaa 900
atgatcagat caccagaagc tatgaaagct gcatctgaag aagttaatgg tgctttacaa 960
tcagcaggtc aagaattgtc ttcaggtggt aatgctatat atttggatca agaacaattg 1020
aacaatttgc cagttttgga ttctattatt aaggaagctt tgagattgtc ttcagcatca 1080
ttgaacatca gaactgcaaa ggaagatttc acattgcatt tggaagatgg ttcttacaac 1140
atcagaaagg atgatatcat tgctttgtac ccacaattga tgcacttaga tccagaaatc 1200
tatccagatc cattgacttt taaatacgat agatacttgg atgaatctgg taaagcaaaa 1260
actacattct acagaaacgg taataagttg aagtattttt acatgccatt tggttcaggt 1320
gcaactattt gtccaggtag attgtttgct gttcaagaaa ttaaacaatt cttgatcttg 1380
atgttgtctt acttcgaatt ggaattggtt gaatcacata caaagtgtcc accattagat 1440
caatctagag ctggtttggg tattttacca ccattgaatg atattgaatt caaatataaa 1500
ttgaaacatt aa 1512
<210> 55
<211> 501
<212> PRT
<213> Domestic rabbit (Oryctolagus cuniculus)
<400> 55
Met Ile Thr Ile Phe Trp Ile Trp Gly Ile Cys Leu Ser Val Cys Cys
1 5 10 15
Cys Leu Trp Leu Ile Leu Gly Leu Arg Arg Arg Arg Met Gly Glu Pro
20 25 30
Pro Leu Glu Lys Gly Trp Ile Pro Tyr Leu Gly Cys Ala Leu Gln Phe
35 40 45
Gly Ala Asn Pro Leu Asp Phe Leu Arg Ala Asn Gln Arg Lys Tyr Gly
50 55 60
His Val Phe Thr Cys Lys Leu Met Gly Lys Tyr Val His Phe Ile Thr
65 70 75 80
Asn Ser Leu Ser Tyr His Lys Val Leu Cys His Gly Lys Tyr Phe Asp
85 90 95
Trp Lys Lys Phe His Phe Thr Thr Ser Ala Lys Ala Phe Gly His Arg
100 105 110
Ser Ile Asp Pro Arg Asp Gly Asn Thr Thr Glu Asn Ile Asn Asn Thr
115 120 125
Phe Asn Lys Thr Leu Gln Gly Asp Ala Leu Ile Ser Leu Thr Asp Ala
130 135 140
Met Met Glu Asn Leu Gln Leu Thr Leu Arg Arg Pro Glu Pro Lys Ser
145 150 155 160
Arg Ala Trp Val Thr Glu Gly Met Tyr Ser Phe Cys Tyr Arg Val Met
165 170 175
Phe Glu Ala Gly Tyr Leu Thr Leu Phe Gly Arg Glu Leu Thr Arg Gln
180 185 190
Asp Ala Gln Arg Ala Phe Ile Leu Asn Ser Leu Glu Asp Phe Lys Gln
195 200 205
Phe Asp Lys Val Phe Pro Ala Leu Val Ala Gly Leu Pro Ile His Ile
210 215 220
Phe Met Thr Ala His Asn Ala Arg Glu Lys Leu Ala Glu Gly Leu Lys
225 230 235 240
His Asp Asn Leu Arg Thr Arg Asp His Ile Ser Glu Leu Ile Arg Leu
245 250 255
Arg Met Phe Leu Asn Asp Thr Leu Ser Thr Phe Asp Ala Met Glu Lys
260 265 270
Ala Lys Thr His Leu Ala Ile Leu Trp Ala Ser Gln Ala Asn Thr Ile
275 280 285
Pro Ala Thr Phe Trp Ser Leu Phe His Met Met Arg Ser Ser Glu Ala
290 295 300
Leu Lys Ala Ala Thr Glu Glu Val Asn Lys Ala Leu Glu Asp Ala Asp
305 310 315 320
Gln Gln Ile Asn Phe Glu Gly Lys Pro Ile His Leu Asn Gln Thr Gln
325 330 335
Leu Asn Asp Met Pro Val Leu Asp Ser Ile Ile Lys Glu Ser Leu Arg
340 345 350
Leu Ser Ser Ala Ser Leu Asn Ile Arg Thr Ala Lys Glu Asp Phe Thr
355 360 365
Leu His Leu Glu Asp Gly Ser Tyr Asn Ile Arg Lys Asp Asp Ile Ile
370 375 380
Ala Leu Tyr Pro Gln Leu Met His Leu Asp Pro Glu Ile Tyr Pro Asp
385 390 395 400
Pro Met Thr Phe Lys Tyr Asp Arg Tyr Leu Asp Glu Asn Arg Lys Thr
405 410 415
Lys Thr Thr Phe Tyr Ser Lys Gly Leu Lys Leu Lys Tyr Tyr Tyr Met
420 425 430
Pro Phe Gly Ser Gly Ala Thr Ile Cys Pro Gly Arg Leu Phe Ala Ile
435 440 445
Gln Glu Ile Lys Gln Phe Leu Ile Leu Met Leu Ser Tyr Phe Glu Leu
450 455 460
Glu Phe Val Asp Ser His Val Lys Cys Pro Pro Leu Asp Gln Ser Arg
465 470 475 480
Ala Gly Leu Gly Ile Leu Pro Pro Leu Asn Asp Ile Glu Phe Lys Tyr
485 490 495
Lys Phe Lys His Leu
500
<210> 56
<211> 1506
<212> DNA
<213> Domestic rabbit (Oryctolagus cuniculus)
<400> 56
atgattacta ttttctggat ttggggtatc tgtttgtctg tttgttgttg tttgtggttg 60
atcttgggtt tgagaagaag aagaatgggt gaaccaccat tggaaaaagg ttggattcca 120
tatttgggtt gtgctttgca atttggtgca aatccattgg atttcttgag agctaaccaa 180
agaaagtacg gtcatgtttt tacttgtaag ttaatgggta aatacgttca tttcatcaca 240
aactctttat cataccataa ggttttgtgt catggtaaat acttcgattg gaagaaattc 300
catttcacta catctgctaa ggcatttggt catagatcaa ttgatccaag agatggtaat 360
actacagaaa acatcaacaa cacttttaat aagacattgc aaggtgacgc tttgatctct 420
ttgactgatg caatgatgga aaatttgcaa ttgacattga gaagaccaga accaaaatct 480
agagcttggg ttactgaagg catgtactca ttctgttaca gagttatgtt cgaagcaggt 540
tacttaactt tgttcggtag agaattgaca agacaagatg ctcaaagagc ttttattttg 600
aactcattgg aagatttcaa acaatttgat aaagtttttc cagctttagt tgcaggtttg 660
ccaatccata tttttatgac tgctcataac gcaagagaaa agttggctga aggtttgaag 720
catgataatt tgagaacaag agatcatatc tctgaattga tcagattgag aatgttcttg 780
aatgatactt tgtcaacatt tgatgctatg gaaaaggcaa agacacattt ggctatcttg 840
tgggcttctc aagcaaatac tattccagca acattctggt cattgttcca tatgatgaga 900
tcttcagaag cattgaaagc tgcaactgaa gaagttaata aggctttgga agatgcagat 960
caacaaatta atttcgaagg taaaccaatc catttgaacc aaacacaatt gaacgatatg 1020
ccagttttgg attctattat taaggaatca ttgagattgt cttcagcttc tttgaacatc 1080
agaactgcaa aggaagattt cacattgcat ttggaagatg gttcatacaa catcagaaag 1140
gatgatatca ttgctttata tccacaattg atgcacttag atccagaaat ctatccagat 1200
ccaatgactt ttaaatacga tagatatttg gatgaaaaca gaaagacaaa gactacattc 1260
tactctaaag gtttaaaatt gaaatattac tatatgccat ttggttcagg tgctacaatt 1320
tgtccaggta gattatttgc aatccaagaa attaaacaat tcttgatctt gatgttatct 1380
tattttgaat tagaatttgt tgattcacat gttaaatgtc caccattgga tcaatctaga 1440
gctggtttgg gtattttacc accattgaac gatatcgaat tcaaatacaa gtttaaacat 1500
ttgtaa 1506
<210> 57
<211> 500
<212> PRT
<213> Cattle (Bos taurus)
<400> 57
Met Met Ser Leu Ser Leu Ile Trp Gly Ile Val Ile Ala Val Cys Cys
1 5 10 15
Cys Leu Tyr Leu Leu Gly Met Arg Arg Arg Gln Met Gly Glu Pro Pro
20 25 30
Leu Glu Asn Gly Leu Ile Pro Tyr Leu Gly Cys Ala Leu Gln Phe Gly
35 40 45
Ala Asn Pro Leu Glu Phe Leu Arg Ala Asn Gln Arg Lys His Gly His
50 55 60
Val Phe Thr Cys Arg Leu Met Gly Asn Tyr Val His Phe Ile Thr Asn
65 70 75 80
Pro Leu Ser Tyr His Lys Val Leu Cys His Gly Lys Tyr Phe Asp Trp
85 90 95
Lys Lys Phe His Phe Thr Ala Ser Ala Lys Ala Phe Gly His Arg Ser
100 105 110
Ile Asp Pro Ser Asp Gly Asn Thr Thr Asp Thr Ile Ser Lys Thr Ile
115 120 125
Ile Lys Thr Leu Gln Gly Asp Ala Leu Ser Ser Leu Thr Glu Ala Met
130 135 140
Met Gly Asn Leu Gln Leu Val Leu Arg Pro Gln Gly Pro Pro Gln Pro
145 150 155 160
Pro Thr Pro Thr Trp Val Thr Glu Gly Met Tyr Ser Phe Cys Tyr Arg
165 170 175
Val Met Phe Glu Ala Gly Tyr Leu Thr Leu Phe Gly Arg Asp Leu Ala
180 185 190
Gly Gln Asp Ala Gln Lys Ala Leu Ile Leu Asn Ser Leu Asp Asn Phe
195 200 205
Lys Gln Phe Asp Lys Ile Phe Pro Ala Leu Val Ala Gly Phe Pro Ile
210 215 220
His Val Phe Lys Thr Gly His Tyr Ala Arg Glu Lys Leu Thr Glu Gly
225 230 235 240
Leu Arg Leu Gln Lys Phe Arg Glu Arg Asp His Ile Ser Glu Leu Val
245 250 255
Arg Phe Leu Asn Asp Thr Phe Ala Thr Leu Asp Asp Thr Glu Arg Ala
260 265 270
Lys Ser Leu Leu Ala Val Leu Trp Ala Ser Gln Ala Asn Thr Ile Pro
275 280 285
Ala Thr Phe Trp Ser Leu Phe Gln Met Ile Arg Asn Pro Glu Ala Met
290 295 300
Lys Ala Ala Thr Glu Glu Val Asn Lys Thr Leu Glu Asn Ala Gly Gln
305 310 315 320
Lys Val Ser Phe Glu Asp Ser Pro Ile His Leu Asn Gln Thr Gln Leu
325 330 335
Asp Asn Met Pro Val Leu Asp Ser Ile Ile Lys Glu Ser Leu Arg Leu
340 345 350
Ser Ser Ala Ser Leu Asn Ile Arg Thr Ala Lys Glu Asp Phe Thr Leu
355 360 365
His Leu Gln Asp Gly Ser Tyr Asn Ile Arg Lys Asp Asp Ile Ile Ala
370 375 380
Leu Tyr Pro Gln Leu Met His Leu Asp Pro Glu Ile Tyr Pro Asp Pro
385 390 395 400
Leu Thr Phe Lys Tyr Asp Arg Tyr Leu Asp Glu Asn Gly Lys Thr Lys
405 410 415
Thr Thr Phe Tyr Ser Asn Gly Leu Lys Leu Lys Tyr Tyr Tyr Met Pro
420 425 430
Phe Gly Ser Gly Val Thr Ile Cys Pro Gly Arg Leu Phe Ala Val Gln
435 440 445
Glu Ile Lys Gln Phe Leu Ile Leu Met Leu Ser Tyr Phe Glu Leu Glu
450 455 460
Leu Val Glu Ser Cys Val Lys Cys Pro Pro Leu Asp Gln Ser Arg Ala
465 470 475 480
Gly Leu Gly Ile Leu Pro Pro Leu Tyr Asp Thr Glu Phe Arg Tyr Lys
485 490 495
Phe Lys His Ser
500
<210> 58
<211> 1503
<212> DNA
<213> Cattle (Bos taurus)
<400> 58
atgatgtctt tgtcattgat ctggggtatc gttatcgcag tttgttgttg tttgtatttg 60
ttgggtatga gaagaagaca aatgggtgaa ccaccattag aaaatggttt gattccatac 120
ttaggttgtg cattgcaatt cggtgctaac ccattggaat tcttgagagc taaccaaaga 180
aagcatggtc atgtttttac atgtagattg atgggtaact acgttcattt catcactaac 240
ccattatctt accataaggt tttgtgtcat ggtaaatact tcgattggaa gaaattccat 300
ttcacagctt cagcaaaggc ttttggtcat agatctattg atccatcaga tggtaatact 360
acagatacta tctctaagac aattattaag actttacaag gtgacgcatt gtcttcatta 420
actgaagcta tgatgggtaa tttgcaatta gttttgagac cacaaggtcc accacaacca 480
ccaactccaa catgggttac agaaggcatg tactcattct gttacagagt tatgttcgaa 540
gcaggttatt tgactttgtt tggtagagat ttggctggtc aagatgcaca aaaagctttg 600
atcttgaact ctttggataa cttcaaacaa tttgataaaa tttttccagc attggttgct 660
ggtttcccaa tccatgtttt taaaacaggt cattacgcaa gagaaaagtt gactgaaggt 720
ttgagattgc aaaagtttag agaaagagat catatctctg aattagttag atttttgaac 780
gatactttcg ctacattgga tgatacagaa agagcaaagt ctttgttagc tgttttgtgg 840
gcatcacaag ctaatacaat tccagcaact ttctggtctt tgttccaaat gatcagaaac 900
ccagaagcta tgaaagctgc aacagaagaa gttaataaga ctttggaaaa tgctggtcaa 960
aaagtttctt tcgaagattc accaatccat ttgaaccaaa ctcaattgga taacatgcca 1020
gttttggatt ctattattaa ggaatcattg agattgtctt cagcatcttt gaacatcaga 1080
acagctaagg aagatttcac tttgcatttg caagatggtt catacaacat cagaaaggat 1140
gatatcatcg ctttgtaccc acaattgatg cacttagatc cagaaatcta tccagatcca 1200
ttgactttta aatacgatag atatttggat gaaaatggta aaactaaaac tacattctac 1260
tctaacggtt tgaagttgaa gtattactat atgccatttg gttcaggtgt tacaatttgt 1320
ccaggtagat tatttgcagt tcaagaaatt aaacaattct tgatcttgat gttgtcttac 1380
tttgaattgg aattagttga atcatgtgtt aagtgtccac cattggatca atcaagagct 1440
ggtttgggta ttttaccacc attgtacgat actgaattca gatataagtt taaacattct 1500
taa 1503
<210> 59
<211> 507
<212> PRT
<213> Tasmanian devil (Sarcophilus harrisii)
<400> 59
Met Leu Thr Ile Ser Ile Ser Leu Ile Trp Gly Phe Val Val Ala Val
1 5 10 15
Cys Cys Cys Leu Trp Leu Ile Ile Gly Ile Arg Arg Arg Arg Leu Gly
20 25 30
Glu Pro Pro Leu Asp Asn Gly Leu Ile Pro Tyr Val Gly Cys Ala Leu
35 40 45
Gln Phe Gly Ala Asn Pro Leu Glu Phe Leu Arg Thr Lys Lys Arg Lys
50 55 60
Tyr Gly His Ile Phe Thr Cys Lys Leu Met Gly Lys Tyr Val His Phe
65 70 75 80
Ile Thr Asn Pro Phe Ser Tyr Asn Thr Val Leu Arg His Gly Lys Tyr
85 90 95
Phe Asp Trp Lys Lys Ile Asn Tyr Ala Thr Ser Ala Lys Ala Phe Gly
100 105 110
His Arg Ser Ile Asp Pro Ser Asp Gly Asn Thr Thr Glu Asn Val His
115 120 125
Glu Thr Leu Ile Lys Thr Leu Gln Gly Asp Ala Leu Asn Ser Leu Thr
130 135 140
Glu Ala Met Met Glu Asn Leu Gln Tyr Val Met Lys Pro Ser Val Leu
145 150 155 160
Ser Lys Thr Asn Pro Asp Ser Trp Val Thr Glu Gly Met Cys Ser Phe
165 170 175
Cys Tyr Arg Val Met Phe Glu Ala Gly Tyr Leu Thr Leu Phe Gly Lys
180 185 190
Asp Leu Thr Arg Gln Glu Val Gln Arg Thr Phe Ile Leu Asn Ser Leu
195 200 205
Asn Asn Phe Lys Gln Phe Asp Lys Ile Phe Pro Ala Leu Val Ala Gly
210 215 220
Leu Pro Ile His Val Phe Lys Asn Ala His Asn Ala Arg Glu Lys Leu
225 230 235 240
Ala Glu Thr Leu Arg His Glu Asn Leu Gln Lys Arg Asp Asn Ile Ser
245 250 255
Glu Leu Ile Thr Thr Arg Met Phe Leu Asn Asp Thr Leu Ser Thr Phe
260 265 270
Asp Asp Met Glu Lys Ala Lys Thr His Leu Ala Leu Leu Trp Ala Ala
275 280 285
Gln Ala Asn Thr Leu Pro Ala Thr Phe Trp Cys Leu Phe His Thr Ile
290 295 300
Ser Arg Ser Pro Glu Ala Met Lys Thr Ala Thr Glu Glu Val Arg Lys
305 310 315 320
Thr Leu Glu Asn Ser Gly Gln Lys Ile Ser Phe Glu Gly Lys Pro Ile
325 330 335
Ser Leu Ser Gln Met Gln Leu Asn Asp Met Pro Val Leu Asp Ser Ile
340 345 350
Ile Lys Glu Ala Leu Arg Leu Cys Ser Ala Ser Leu Asn Ile Arg Ala
355 360 365
Ala Lys Glu Asp Phe Thr Leu His Leu Glu Glu Gly Ser Tyr Ser Ile
370 375 380
Arg Lys Asp Asp Ile Ile Ala Phe Tyr Pro Gln Leu Leu His Phe Asp
385 390 395 400
Pro Glu Ile Tyr Pro Asp Pro Leu Val Phe Lys Tyr Asp Arg Tyr Leu
405 410 415
Asp Glu Asn Gly Lys Pro Lys Thr Asn Phe Tyr Tyr Asn Gly Ile Lys
420 425 430
Leu Lys Tyr Tyr Tyr Met Pro Phe Gly Ser Gly Leu Ser Leu Cys Pro
435 440 445
Gly Arg Leu Phe Ala Val His Glu Ile Lys Gln Phe Leu Ile Leu Met
450 455 460
Leu Ser Tyr Phe Glu Met Lys Leu Val Asp Ser Gln Val Lys Tyr Pro
465 470 475 480
Pro Leu Asp Gln Ser Arg Leu Gly Leu Gly Ile Leu Pro Pro Thr Asn
485 490 495
Asp Ile Asp Phe Lys Tyr Lys Leu Lys His Leu
500 505
<210> 60
<211> 1524
<212> DNA
<213> Tasmanian devil (Sarcophilus harrisii)
<400> 60
atgttgacaa tctctatctc attgatctgg ggtttcgttg ttgctgtttg ttgttgtttg 60
tggttgatca tcggtattag aagaagaaga ttgggtgaac caccattaga taatggtttg 120
attccatatg ttggttgtgc tttgcaattc ggtgcaaacc cattggaatt cttgagaact 180
aagaaaagaa agtacggtca tatttttact tgtaagttga tgggtaaata cgttcatttc 240
atcactaacc cattttctta caacacagtt ttgagacatg gtaaatactt cgattggaag 300
aaaattaatt acgctacatc agctaaggca tttggtcata gatctattga tccatcagat 360
ggtaacacta cagaaaacgt tcatgaaact ttgattaaaa cattgcaagg tgacgcttta 420
aattctttga ctgaagcaat gatggaaaat ttgcaatacg ttatgaagcc atctgttttg 480
tcaaagacta atccagattc ttgggttaca gaaggcatgt gttcattctg ttacagagtt 540
atgttcgaag ctggttattt gactttgttc ggtaaagatt tgacaagaca agaagttcaa 600
agaactttta ttttgaactc attgaacaac ttcaaacaat ttgataaaat ttttccagct 660
ttagttgcag gtttgccaat ccatgttttt aaaaacgctc ataacgcaag agaaaagttg 720
gcagaaacat tgagacatga aaatttgcaa aagagagata acatctctga attgattact 780
acaagaatgt ttttaaatga tactttgtca acattcgatg atatggaaaa ggctaagact 840
catttggcat tgttgtgggc tgcacaagct aatactttac cagcaacatt ctggtgtttg 900
ttccatacaa tctctagatc accagaagct atgaaaactg caacagaaga agttagaaag 960
actttggaaa attctggtca aaagatttca ttcgaaggta aaccaatctc tttgtcacaa 1020
atgcaattga acgatatgcc agttttggat tctattatta aggaagcttt gagattgtgt 1080
tctgcatcat tgaacatcag agctgcaaag gaagatttca cattgcattt ggaagaaggt 1140
tcttactcaa tcagaaagga tgatatcatc gctttctatc cacaattgtt gcatttcgat 1200
ccagaaatct atccagatcc attagttttt aaatacgata gatatttgga tgaaaatggt 1260
aaaccaaaga ctaacttcta ctacaacggt attaaattga aatattacta tatgccattt 1320
ggttctggtt tgtcattatg tccaggtaga ttatttgcag ttcatgaaat taaacaattc 1380
ttgatcttga tgttgtctta ttttgaaatg aaattagttg attcacaggt taagtaccca 1440
ccattggatc aatctagatt aggtttgggt attttaccac caactaatga tattgatttt 1500
aaatataaat taaaacattt gtaa 1524
<210> 61
<211> 513
<212> PRT
<213> Red junglefowl (Gallus gallus)
<400> 61
Met Ile Thr Thr Ser Trp Ile Trp Gly Thr Val Ile Ile Val Cys Cys
1 5 10 15
Ser Phe Trp Phe Leu Phe Gly Arg Arg Arg Arg Arg Arg Gln Gly Glu
20 25 30
Pro Pro Leu Glu Asn Gly Phe Leu Pro Tyr Leu Gly Cys Ala Leu Gln
35 40 45
Phe Gly Ala Asn Pro Leu Lys Phe Leu Arg Glu Lys Gln Lys Lys His
50 55 60
Gly His Ile Phe Thr Cys Gln Val Ala Gly Lys Tyr Ile His Phe Leu
65 70 75 80
Thr Asp Pro Phe Ser Tyr His Ser Leu Ile Arg Gln Gly Lys Tyr Leu
85 90 95
Asp Trp Lys Lys Phe His Phe Ala Thr Ser Ala Lys Ala Phe Gly His
100 105 110
Gly Ser Ile Asp Pro Ala Glu Gly Asn Thr Thr Glu Asn Phe His His
115 120 125
Thr Phe Ile Arg Thr Leu Gln Gly Asn Ala Leu Asp Ala Leu Ile Lys
130 135 140
Ala Met Met Glu Asn Leu Gln Tyr Val Met Leu Gln Ser Arg Ala Ser
145 150 155 160
Lys Phe Gln Pro Asn Thr Trp Val Thr Glu Gly Leu Tyr Thr Phe Cys
165 170 175
Cys Gln Val Met Phe Glu Ser Gly Phe Leu Thr Leu Phe Gly Lys Glu
180 185 190
Phe Asn Ser Asn His Asp Lys Asn Leu Ser Lys Arg Glu Thr Glu Arg
195 200 205
Ala Arg Ile Leu Asn Ala Leu Glu Asn Phe Lys Glu Phe Asp Lys Ile
210 215 220
Phe Pro Ala Leu Val Ala Gly Leu Pro Ile His Leu Phe Lys Ser Ala
225 230 235 240
His Ser Ala Arg Glu Lys Leu Gly Glu Ala Leu Leu His Lys Asn Leu
245 250 255
Leu Lys Arg Asp Asn Leu Ser Glu Leu Val Met Leu Arg Met Phe Leu
260 265 270
Asn Asp Thr Leu Ser Thr Phe Asp Asp Met Glu Lys Ala Lys Thr His
275 280 285
Val Ala Val Leu Trp Ala Ser Gln Ala Asn Thr Ile Pro Ala Thr Phe
290 295 300
Trp Ser Leu Phe Thr Phe Leu Arg Asn Pro Glu Ala Met Arg Ala Ala
305 310 315 320
Thr Lys Glu Val Gln Ser Val Leu Glu Ser Ala Gly Glu Lys Ile Ser
325 330 335
Leu Asp Gly Asn Tyr Ile Ser Leu Asn Arg Lys Gln Leu Asp Asn Met
340 345 350
Pro Val Leu Asp Ser Ile Ile Lys Glu Ala Met Arg Leu Ser Ser Ala
355 360 365
Ser Met Thr Phe Arg Val Ala Lys Glu Asp Phe Thr Leu His Leu Glu
370 375 380
Asn Ser Phe Tyr Asn Ile Arg Lys Asp Asp Ile Val Ala Leu Tyr Pro
385 390 395 400
Gln Leu Leu His Phe Asp Pro Glu Ile Tyr Ala Asp Pro Leu Thr Phe
405 410 415
Lys Tyr Asp Arg Tyr Leu Asn Glu Asn Lys Glu Glu Lys Thr Asp Phe
420 425 430
Tyr Arg Asn Gly Arg Lys Leu Lys Tyr Tyr Tyr Met Pro Phe Gly Ala
435 440 445
Gly Ile Ala Lys Cys Pro Gly Arg Leu Phe Ala Val His Glu Ile Lys
450 455 460
Gln Phe Leu Val Leu Ile Phe Ser Tyr Phe Glu Ile Asp Leu Val Asp
465 470 475 480
Ser Asn Val Gln Cys Pro Ser Leu Asp Gln Ser Arg Ala Gly Leu Gly
485 490 495
Ile Leu Gln Pro Ser Asn Asp Ile Asp Phe Arg Tyr Arg Leu Lys Cys
500 505 510
Leu
<210> 62
<211> 1542
<212> DNA
<213> Red junglefowl (Gallus gallus)
<400> 62
atgattacta catcttggat ttggggtact gttattatcg tttgttgttc attctggttc 60
ttgttcggta gaagaagaag aagaagacaa ggtgaaccac cattggaaaa tggtttcttg 120
ccatatttgg gttgtgcttt acaattcggt gcaaacccat tgaagttctt gagagaaaag 180
caaaagaaac atggtcatat ttttacttgt caagttgctg gtaaatacat ccatttcttg 240
acagatccat tttcttacca ttcattgatc agacagggta aatatttgga ttggaagaaa 300
ttccatttcg ctacatctgc taaggcattt ggtcatggtt caattgatcc agcagaaggt 360
aatactacag aaaacttcca tcatactttt attagaacat tacagggtaa tgctttggat 420
gcattgatta aagctatgat ggaaaatttg caatacgtta tgttgcaatc tagagcatca 480
aagttccaac caaacacttg ggttacagaa ggtttgtaca ctttctgttg tcaagttatg 540
ttcgaatctg gtttcttgac attgttcggt aaagaattca attctaacca tgataagaat 600
ttgtcaaaga gagaaactga aagagctaga attttgaatg cattggaaaa cttcaaggaa 660
ttcgataaga tttttccagc tttagttgca ggtttgccaa ttcatttgtt taaatctgct 720
cattcagcaa gagaaaagtt gggtgaagct ttgttgcata agaatttgtt gaagagagat 780
aatttgtctg aattagttat gttgagaatg tttttgaatg atactttatc aacattcgat 840
gatatggaaa aggctaagac acatgttgca gttttgtggg cttctcaagc aaatactatt 900
ccagctacat tctggtcatt gtttacattt ttgagaaacc cagaagcaat gagagctgca 960
acaaaagaag ttcaatctgt tttggaatca gctggtgaaa agatttcttt agatggtaac 1020
tacatctcat tgaacagaaa gcaattggat aacatgccag ttttggattc tattattaag 1080
gaagctatga gattgtcttc agcatcaatg acttttagag ttgctaagga agatttcaca 1140
ttgcatttgg aaaactcttt ctacaacatc agaaaggatg atatcgttgc tttgtaccca 1200
caattgttgc atttcgatcc agaaatctat gcagatccat tgacttttaa atacgataga 1260
tatttgaacg aaaataagga agaaaagact gatttctaca gaaacggtag aaagttgaag 1320
tattactata tgccatttgg tgctggtatt gcaaaatgtc caggtagatt atttgctgtt 1380
catgaaatta aacaattctt ggttttgatt ttctcttatt ttgaaattga tttggttgat 1440
tcaaatgttc aatgtccatc tttagatcaa tcaagagcag gtttgggtat tttgcaacca 1500
tctaacgata ttgattttag atacagatta aaatgtttgt aa 1542
<210> 63
<211> 512
<212> PRT
<213> Zebrafish (Danio rerio)
<400> 63
Met Ile Leu Thr Ile Ser Phe Ile Trp Ala Ile Val Val Gly Leu Cys
1 5 10 15
Cys Cys Leu Trp Leu Ile Thr Gly Ile Arg Arg Arg His Pro Ala Glu
20 25 30
Pro Pro Leu Glu Asn Gly Trp Ile Pro Phe Leu Gly Cys Ala Leu Gln
35 40 45
Phe Gly Ala Asn Pro Leu Glu Phe Leu Arg Ser Arg Gln Lys Lys His
50 55 60
Gly His Ile Phe Thr Cys Lys Ile Ala Gly Gln Tyr Val His Phe Leu
65 70 75 80
Cys Asp Pro Phe Ser Tyr His Ala Val Ile Arg Gln Gly Arg His Leu
85 90 95
Asp Trp Lys Lys Phe His Phe Asp Ala Ser Ala Lys Ala Phe Gly His
100 105 110
Glu Ser Met Asp Pro Ser Gln Gly Tyr Thr Thr Glu Asn Leu His Gln
115 120 125
Thr Phe Leu Lys Thr Leu Gln Gly Asp Ala Leu Ser Ser Leu Ile Glu
130 135 140
Thr Met Met Glu Asn Leu Gln Gly Thr Met Leu Gln Ser Gly Met Leu
145 150 155 160
Lys Ala Thr Thr Ser Glu Trp Gln Ser Asp Gly Ile Tyr Ala Phe Cys
165 170 175
Tyr Lys Val Met Phe Glu Ala Gly Tyr Leu Thr Leu Phe Gly Lys Glu
180 185 190
Leu Asp Gly Asp Gln Ser Ile Ala Arg Gln Gln Ala Gln Lys Ala Leu
195 200 205
Val Leu Asn Ala Leu Asp Asn Phe Lys Glu Phe Asp Lys Ile Phe Pro
210 215 220
Ala Leu Ile Ala Gly Leu Pro Ile His Val Phe Lys Ser Ala Tyr Ser
225 230 235 240
Ala Arg Glu Lys Leu Ala Lys Thr Met Leu His Glu Asn Leu Ser Arg
245 250 255
Arg Ala Asn Val Ser Asp Leu Ile Ser Leu Arg Met Leu Leu Asn Asp
260 265 270
Thr Leu Ser Thr Phe Asn Glu Leu Ser Lys Ala Arg Thr His Val Ala
275 280 285
Ile Leu Trp Ala Ser Gln Ala Asn Thr Leu Pro Ala Thr Phe Trp Thr
290 295 300
Leu Phe His Met Ile Arg Cys Pro Ala Ala Met Lys Ala Ala Ser Glu
305 310 315 320
Glu Val Arg Arg Thr Phe Glu Ser Ser Asn Gln Lys Val Asp Pro Thr
325 330 335
Asn Ser Arg Leu Val Leu Thr Arg Glu Gln Leu Asp Asn Met Pro Val
340 345 350
Leu Asp Ser Ile Ile Lys Glu Ala Met Arg Leu Ser Ser Ala Ser Leu
355 360 365
Asn Val Arg Met Ala Lys Ser Asp Phe Leu Leu Gln Leu Asp Asn Lys
370 375 380
Glu Ser Tyr His Ile Arg Lys Asp Asp Val Ile Ala Met Tyr Pro Pro
385 390 395 400
Met Ile His Phe Asp Pro Glu Ile Tyr Asp Asp Pro Leu Glu Phe Lys
405 410 415
Tyr Asp Arg Tyr Ile Asp Glu Asn Gly Gln Glu Lys Thr Thr Phe Tyr
420 425 430
Arg Asn Gly Arg Lys Leu Arg Tyr Tyr Tyr Met Pro Phe Gly Ser Gly
435 440 445
Val Thr Lys Cys Pro Gly Arg Phe Phe Ala Val His Glu Ile Lys Gln
450 455 460
Phe Leu Ser Leu Leu Leu Ser Tyr Phe Glu Met Glu Leu Leu Asp Ser
465 470 475 480
Asp Val Lys Glu Pro Pro Leu Asp Gln Ser Arg Ala Gly Leu Gly Val
485 490 495
Leu Gln Pro Thr Tyr Asp Val Asp Phe Arg Tyr Arg Leu Lys Ser Leu
500 505 510
<210> 64
<211> 1539
<212> DNA
<213> Zebrafish (Danio rerio)
<400> 64
atgatcttga ctatctcttt tatttgggca atcgttgttg gtttgtgttg ttgtttgtgg 60
ttgatcacag gtattagaag aagacatcca gctgaaccac cattggaaaa tggttggatt 120
ccatttttag gttgtgcatt gcaattcggt gctaacccat tggaattctt gagatcaaga 180
caaaagaaac atggtcatat ttttacttgt aagatcgcag gtcaatacgt tcatttcttg 240
tgtgatccat tttcttatca tgctgttatt agacaaggta gacatttgga ttggaagaaa 300
ttccatttcg atgcttcagc aaaggctttt ggtcatgaat ctatggaccc atcacaaggt 360
tacactacag aaaatttgca tcaaacattt ttgaagacat tgcaaggtga cgcattatct 420
tcattgatcg aaactatgat ggaaaatttg caaggtacaa tgttgcaatc tggcatgtta 480
aaagctacta catctgaatg gcaatcagat ggtatctatg cattctgtta caaagttatg 540
tttgaagctg gttatttgac tttgttcggt aaagaattgg atggtgacca atcaattgca 600
agacaacaag cacaaaaagc tttagttttg aatgctttgg ataacttcaa ggaattcgat 660
aagatcttcc cagcattgat cgctggtttg ccaatccatg tttttaaatc tgcatactca 720
gctagagaaa agttggcaaa gacaatgttg catgaaaatt tgtctagaag agctaacgtt 780
tctgatttga tctcattgag aatgttgttg aacgatactt tgtctacttt taatgaatta 840
tcaaaagcaa gaactcatgt tgctatttta tgggcatctc aagctaatac attgccagct 900
actttctgga cattgttcca tatgatcaga tgtccagctg caatgaaagc tgcatcagaa 960
gaagttagaa gaacattcga atcttcaaac caaaaggttg atccaactaa ctctagatta 1020
gttttgacaa gagaacaatt ggataacatg ccagttttgg attcaattat taaggaagca 1080
atgagattgt cttcagcttc tttgaacgtt agaatggcaa agtcagattt cttgttgcaa 1140
ttggataata aggaatctta ccatatcaga aaggatgatg ttattgctat gtatccacca 1200
atgatccatt tcgatccaga aatctatgat gatccattgg aattcaaata cgatagatac 1260
atcgatgaaa acggtcaaga aaagactaca ttctacagaa acggtagaaa gttgagatat 1320
tactatatgc catttggttc tggtgttact aaatgtccag gtagattttt cgctgttcat 1380
gaaattaaac aattcttgtc tttgttgttg tcatacttcg aaatggaatt gttggattct 1440
gatgttaaag aaccaccatt agatcaatca agagctggtt taggtgtttt gcaaccaaca 1500
tacgatgttg atttcagata cagattaaaa tctttgtaa 1539
<210> 65
<211> 503
<212> PRT
<213> House mouse (Mus musculus)
<400> 65
Met Met Ser Ile Ser Leu Ile Trp Gly Ile Ala Val Val Val Ser Cys
1 5 10 15
Cys Ile Trp Phe Ile Ile Gly Ile Arg Arg Arg Lys Val Gly Glu Pro
20 25 30
Pro Leu Asp Asn Gly Leu Ile Pro Tyr Leu Gly Cys Ala Leu Lys Phe
35 40 45
Gly Ser Asn Pro Leu Glu Phe Leu Arg Ala Lys Gln Arg Lys His Gly
50 55 60
His Val Phe Thr Cys Lys Leu Met Gly Lys Tyr Val His Phe Ile Thr
65 70 75 80
Asn Ser Leu Ser Tyr His Lys Val Leu Cys His Gly Lys Tyr Phe Asp
85 90 95
Trp Lys Lys Phe His Tyr Thr Thr Ser Ala Lys Ala Phe Gly His Arg
100 105 110
Ser Ile Asp Pro Ser Asp Gly Asn Thr Thr Glu Asn Ile Asn Lys Thr
115 120 125
Phe Asn Lys Thr Leu Gln Gly Asp Ala Leu Cys Ser Leu Ser Glu Ala
130 135 140
Met Met Gln Asn Leu Gln Ser Val Met Arg Pro Pro Gly Leu Pro Lys
145 150 155 160
Ser Lys Ser Ala Val Trp Val Thr Glu Gly Met Tyr Ala Phe Cys Tyr
165 170 175
Arg Val Met Phe Glu Ala Gly Tyr Leu Thr Leu Phe Gly Lys Asp Ile
180 185 190
Ser Lys Thr Asp Ser Gln Arg Ala Phe Ile Gln Asn Asn Leu Asp Ser
195 200 205
Phe Lys Gln Phe Asp Gln Val Phe Pro Ala Leu Val Ala Gly Val Pro
210 215 220
Ile His Leu Phe Lys Thr Ala His Lys Ala Arg Glu Arg Leu Ala Glu
225 230 235 240
Ser Leu Lys His Lys Asn Leu Tyr Met Arg Asp Gln Val Ser Glu Leu
245 250 255
Ile Arg Leu Arg Met Phe Leu Asn Asp Thr Leu Ser Thr Phe Asp Asp
260 265 270
Met Glu Lys Ala Lys Thr His Leu Val Ile Leu Trp Ala Ser Gln Ala
275 280 285
Asn Thr Ile Pro Ala Thr Phe Trp Ser Leu Phe Gln Met Ile Arg Ser
290 295 300
Pro Glu Ala Met Lys Ala Ala Ser Glu Glu Val Asn Gly Ala Leu Gln
305 310 315 320
Ser Ala Gly Gln Glu Leu Ser Ser Gly Gly Asn Ala Ile Tyr Leu Asp
325 330 335
Gln Glu Gln Leu Asn Asn Leu Pro Val Leu Asp Ser Ile Ile Lys Glu
340 345 350
Ala Leu Arg Leu Ser Ser Gly Ser Leu Asn Ile Arg Thr Ala Lys Glu
355 360 365
Asp Phe Thr Leu His Leu Glu Asp Gly Ser Tyr Asn Ile Arg Lys Asp
370 375 380
Asp Ile Ile Ala Leu Tyr Pro Gln Leu Met His Leu Asp Pro Glu Ile
385 390 395 400
Tyr Pro Asp Pro Leu Thr Phe Lys Tyr Asp Arg Tyr Leu Asp Glu Ser
405 410 415
Gly Lys Ala Lys Thr Thr Phe Tyr Arg Asn Gly Asn Lys Leu Lys Tyr
420 425 430
Phe Tyr Met Pro Phe Gly Ser Gly Ala Thr Ile Cys Pro Gly Arg Leu
435 440 445
Phe Ala Val Gln Glu Ile Lys Gln Phe Leu Ile Leu Met Leu Ser Tyr
450 455 460
Phe Glu Leu Glu Leu Val Glu Ser His Thr Lys Cys Pro Pro Leu Asp
465 470 475 480
Gln Ser Arg Ala Gly Leu Gly Ile Leu Pro Pro Leu Asn Asp Ile Glu
485 490 495
Phe Lys Tyr Lys Leu Lys His
500
<210> 66
<211> 1512
<212> DNA
<213> House mouse (Mus musculus)
<400> 66
atgatgtcta tctcattgat ctggggtatc gctgttgttg tttcttgttg tatttggttc 60
atcatcggta ttagaagaag aaaggttggt gaaccaccat tggataacgg tttgatccca 120
tatttgggtt gtgcattgaa gttcggttca aacccattgg aattcttgag agctaagcaa 180
agaaagcatg gtcatgtttt tacttgtaag ttaatgggta aatacgttca tttcatcaca 240
aactctttat cataccataa ggttttgtgt catggtaaat acttcgattg gaagaaattc 300
cattacacta catctgctaa ggcatttggt catagatcta ttgatccatc agatggtaat 360
actacagaaa acatcaataa gacttttaat aagacattgc aaggtgacgc tttatgttct 420
ttgtcagaag caatgatgca aaatttgcaa tcagttatga gaccaccagg tttgccaaaa 480
tctaaatcag ctgtttgggt tactgaaggc atgtacgcat tctgttacag agttatgttc 540
gaagctggtt acttgacttt gttcggtaaa gatatctcta agacagattc acaaagagct 600
tttattcaaa acaatttgga ttcttttaaa caatttgatc aagtttttcc agctttagtt 660
gcaggtgttc caatccattt gtttaaaact gctcataaag caagagaaag attagcagaa 720
tctttgaagc ataagaattt gtacatgaga gatcaagttt cagaattaat tagattgaga 780
atgtttttaa atgatacttt gtctacattc gatgatatgg aaaaggctaa gacacatttg 840
gttattttgt gggcttcaca agcaaatact attccagcaa cattctggtc tttgttccaa 900
atgatcagat caccagaagc tatgaaagct gcatctgaag aagttaatgg tgctttacaa 960
tcagcaggtc aagaattgtc ttcaggtggt aatgctatat atttggatca agaacaattg 1020
aacaatttgc cagttttgga ttctattatt aaggaagctt tgagattgtc ttcaggatca 1080
ttgaacatca gaactgcaaa ggaagatttc acattgcatt tggaagatgg ttcttacaac 1140
atcagaaagg atgatatcat tgctttgtac ccacaattga tgcacttaga tccagaaatc 1200
tatccagatc cattgacttt taaatacgat agatacttgg atgaatctgg taaagcaaaa 1260
actacattct acagaaacgg taataagttg aagtattttt acatgccatt tggttcaggt 1320
gcaactattt gtccaggtag attgtttgct gttcaagaaa ttaaacaatt cttgatcttg 1380
atgttgtctt acttcgaatt ggaattggtt gaatcacata caaagtgtcc accattagat 1440
caatctagag ctggtttggg tattttacca ccattgaatg atattgaatt caaatataaa 1500
ttgaaacatt aa 1512
<210> 67
<211> 503
<212> PRT
<213> House mouse (Mus musculus)
<400> 67
Met Met Ser Ile Ser Leu Ile Trp Gly Ile Ala Val Val Val Ser Cys
1 5 10 15
Cys Ile Trp Phe Ile Ile Gly Ile Arg Arg Arg Lys Val Gly Glu Pro
20 25 30
Pro Leu Asp Asn Gly Leu Ile Pro Tyr Leu Gly Cys Ala Leu Lys Phe
35 40 45
Gly Ser Asn Pro Leu Glu Phe Leu Arg Ala Lys Gln Arg Lys His Gly
50 55 60
His Val Phe Thr Cys Lys Leu Met Gly Lys Tyr Val His Phe Ile Thr
65 70 75 80
Asn Ser Leu Ser Tyr His Lys Val Leu Cys His Gly Lys Tyr Phe Asp
85 90 95
Trp Lys Lys Phe His Tyr Thr Thr Ser Ala Lys Ala Phe Gly His Arg
100 105 110
Ser Ile Asp Pro Ser Asp Gly Asn Thr Thr Glu Asn Ile Asn Lys Thr
115 120 125
Phe Asn Lys Thr Leu Gln Gly Asp Ala Leu Cys Ser Leu Ser Glu Ala
130 135 140
Met Met Gln Asn Leu Gln Ser Val Met Arg Pro Pro Gly Leu Pro Lys
145 150 155 160
Ser Lys Ser Ala Val Trp Val Thr Glu Gly Met Tyr Ala Phe Cys Tyr
165 170 175
Arg Val Met Phe Glu Ala Gly Tyr Leu Thr Leu Phe Gly Lys Asp Ile
180 185 190
Ser Lys Thr Asp Ser Gln Arg Ala Phe Ile Gln Asn Asn Leu Asp Ser
195 200 205
Phe Lys Gln Phe Asp Gln Val Phe Pro Ala Leu Val Ala Gly Val Pro
210 215 220
Ile His Leu Phe Lys Thr Ala His Lys Ala Arg Glu Arg Leu Ala Glu
225 230 235 240
Ser Leu Lys His Lys Asn Leu Tyr Met Arg Asp Gln Val Ser Glu Leu
245 250 255
Ile Arg Leu Arg Met Phe Leu Asn Asp Thr Leu Ser Thr Phe Asp Asp
260 265 270
Met Glu Lys Ala Lys Thr His Leu Val Ile Leu Trp Ala Ser Gln Ala
275 280 285
Asn Thr Ile Pro Ala Thr Phe Trp Ser Leu Phe Gln Met Ile Arg Ser
290 295 300
Pro Glu Ala Met Lys Ala Ala Ser Glu Glu Val Asn Gly Ala Leu Gln
305 310 315 320
Ser Ala Gly Gln Glu Leu Ser Ser Gly Gly Asn Ala Ile Tyr Leu Asp
325 330 335
Gln Glu Gln Leu Asn Asn Leu Pro Val Leu Asp Ser Ile Ile Lys Glu
340 345 350
Ala Leu Arg Leu Ser Ser Val Ser Leu Asn Ile Arg Thr Ala Lys Glu
355 360 365
Asp Phe Thr Leu His Leu Glu Asp Gly Ser Tyr Asn Ile Arg Lys Asp
370 375 380
Asp Ile Ile Ala Leu Tyr Pro Gln Leu Met His Leu Asp Pro Glu Ile
385 390 395 400
Tyr Pro Asp Pro Leu Thr Phe Lys Tyr Asp Arg Tyr Leu Asp Glu Ser
405 410 415
Gly Lys Ala Lys Thr Thr Phe Tyr Arg Asn Gly Asn Lys Leu Lys Tyr
420 425 430
Phe Tyr Met Pro Phe Gly Ser Gly Ala Thr Ile Cys Pro Gly Arg Leu
435 440 445
Phe Ala Val Gln Glu Ile Lys Gln Phe Leu Ile Leu Met Leu Ser Tyr
450 455 460
Phe Glu Leu Glu Leu Val Glu Ser His Thr Lys Cys Pro Pro Leu Asp
465 470 475 480
Gln Ser Arg Ala Gly Leu Gly Ile Leu Pro Pro Leu Asn Asp Ile Glu
485 490 495
Phe Lys Tyr Lys Leu Lys His
500
<210> 68
<211> 1512
<212> DNA
<213> House mouse (Mus musculus)
<400> 68
atgatgtcta tctcattgat ctggggtatc gctgttgttg tttcttgttg tatttggttc 60
atcatcggta ttagaagaag aaaggttggt gaaccaccat tggataacgg tttgatccca 120
tatttgggtt gtgcattgaa gttcggttca aacccattgg aattcttgag agctaagcaa 180
agaaagcatg gtcatgtttt tacttgtaag ttaatgggta aatacgttca tttcatcaca 240
aactctttat cataccataa ggttttgtgt catggtaaat acttcgattg gaagaaattc 300
cattacacta catctgctaa ggcatttggt catagatcta ttgatccatc agatggtaat 360
actacagaaa acatcaataa gacttttaat aagacattgc aaggtgacgc tttatgttct 420
ttgtcagaag caatgatgca aaatttgcaa tcagttatga gaccaccagg tttgccaaaa 480
tctaaatcag ctgtttgggt tactgaaggc atgtacgcat tctgttacag agttatgttc 540
gaagctggtt acttgacttt gttcggtaaa gatatctcta agacagattc acaaagagct 600
tttattcaaa acaatttgga ttcttttaaa caatttgatc aagtttttcc agctttagtt 660
gcaggtgttc caatccattt gtttaaaact gctcataaag caagagaaag attagcagaa 720
tctttgaagc ataagaattt gtacatgaga gatcaagttt cagaattaat tagattgaga 780
atgtttttaa atgatacttt gtctacattc gatgatatgg aaaaggctaa gacacatttg 840
gttattttgt gggcttcaca agcaaatact attccagcaa cattctggtc tttgttccaa 900
atgatcagat caccagaagc tatgaaagct gcatctgaag aagttaatgg tgctttacaa 960
tcagcaggtc aagaattgtc ttcaggtggt aatgctatat atttggatca agaacaattg 1020
aacaatttgc cagttttgga ttctattatt aaggaagctt tgagattgtc ttcagtatca 1080
ttgaacatca gaactgcaaa ggaagatttc acattgcatt tggaagatgg ttcttacaac 1140
atcagaaagg atgatatcat tgctttgtac ccacaattga tgcacttaga tccagaaatc 1200
tatccagatc cattgacttt taaatacgat agatacttgg atgaatctgg taaagcaaaa 1260
actacattct acagaaacgg taataagttg aagtattttt acatgccatt tggttcaggt 1320
gcaactattt gtccaggtag attgtttgct gttcaagaaa ttaaacaatt cttgatcttg 1380
atgttgtctt acttcgaatt ggaattggtt gaatcacata caaagtgtcc accattagat 1440
caatctagag ctggtttggg tattttacca ccattgaatg atattgaatt caaatataaa 1500
ttgaaacatt aa 1512
<210> 69
<211> 503
<212> PRT
<213> House mouse (Mus musculus)
<400> 69
Met Met Ser Ile Ser Leu Ile Trp Gly Ile Ala Val Val Val Ser Cys
1 5 10 15
Cys Ile Trp Phe Ile Ile Gly Ile Arg Arg Arg Lys Val Gly Glu Pro
20 25 30
Pro Leu Asp Asn Gly Leu Ile Pro Tyr Leu Gly Cys Ala Leu Lys Phe
35 40 45
Gly Ser Asn Pro Leu Glu Phe Leu Arg Ala Lys Gln Arg Lys His Gly
50 55 60
His Val Phe Thr Cys Lys Leu Met Gly Lys Tyr Val His Phe Ile Thr
65 70 75 80
Asn Ser Leu Ser Tyr His Lys Val Leu Cys His Gly Lys Tyr Phe Asp
85 90 95
Trp Lys Lys Phe His Tyr Thr Thr Ser Ala Lys Ala Phe Gly His Arg
100 105 110
Ser Ile Asp Pro Ser Asp Gly Asn Thr Thr Glu Asn Ile Asn Lys Thr
115 120 125
Phe Asn Lys Thr Leu Gln Gly Asp Ala Leu Cys Ser Leu Ser Glu Ala
130 135 140
Met Met Gln Asn Leu Gln Ser Val Met Arg Pro Pro Gly Leu Pro Lys
145 150 155 160
Ser Lys Ser Ala Val Trp Val Thr Glu Gly Met Tyr Ala Phe Cys Tyr
165 170 175
Arg Val Met Phe Glu Ala Gly Tyr Leu Thr Leu Phe Gly Lys Asp Ile
180 185 190
Ser Lys Thr Asp Ser Gln Arg Ala Phe Ile Gln Asn Asn Leu Asp Ser
195 200 205
Phe Lys Gln Phe Asp Gln Val Phe Pro Ala Leu Val Ala Gly Val Pro
210 215 220
Ile His Leu Phe Lys Thr Ala His Lys Ala Arg Glu Arg Leu Ala Glu
225 230 235 240
Ser Leu Lys His Lys Asn Leu Tyr Met Arg Asp Gln Val Ser Glu Leu
245 250 255
Ile Arg Leu Arg Met Phe Leu Asn Asp Thr Leu Ser Thr Phe Asp Asp
260 265 270
Met Glu Lys Ala Lys Thr His Leu Val Ile Leu Trp Ala Ser Gln Ala
275 280 285
Asn Thr Ile Pro Ala Thr Phe Trp Ser Leu Phe Gln Met Ile Arg Ser
290 295 300
Pro Glu Ala Met Lys Ala Ala Ser Glu Glu Val Asn Gly Ala Leu Gln
305 310 315 320
Ser Ala Gly Gln Glu Leu Ser Ser Gly Gly Asn Ala Ile Tyr Leu Asp
325 330 335
Gln Glu Gln Leu Asn Asn Leu Pro Val Leu Asp Ser Ile Ile Lys Glu
340 345 350
Ala Leu Arg Leu Ser Ser Leu Ser Leu Asn Ile Arg Thr Ala Lys Glu
355 360 365
Asp Phe Thr Leu His Leu Glu Asp Gly Ser Tyr Asn Ile Arg Lys Asp
370 375 380
Asp Ile Ile Ala Leu Tyr Pro Gln Leu Met His Leu Asp Pro Glu Ile
385 390 395 400
Tyr Pro Asp Pro Leu Thr Phe Lys Tyr Asp Arg Tyr Leu Asp Glu Ser
405 410 415
Gly Lys Ala Lys Thr Thr Phe Tyr Arg Asn Gly Asn Lys Leu Lys Tyr
420 425 430
Phe Tyr Met Pro Phe Gly Ser Gly Ala Thr Ile Cys Pro Gly Arg Leu
435 440 445
Phe Ala Val Gln Glu Ile Lys Gln Phe Leu Ile Leu Met Leu Ser Tyr
450 455 460
Phe Glu Leu Glu Leu Val Glu Ser His Thr Lys Cys Pro Pro Leu Asp
465 470 475 480
Gln Ser Arg Ala Gly Leu Gly Ile Leu Pro Pro Leu Asn Asp Ile Glu
485 490 495
Phe Lys Tyr Lys Leu Lys His
500
<210> 70
<211> 1512
<212> DNA
<213> House mouse (Mus musculus)
<400> 70
atgatgtcta tctcattgat ctggggtatc gctgttgttg tttcttgttg tatttggttc 60
atcatcggta ttagaagaag aaaggttggt gaaccaccat tggataacgg tttgatccca 120
tatttgggtt gtgcattgaa gttcggttca aacccattgg aattcttgag agctaagcaa 180
agaaagcatg gtcatgtttt tacttgtaag ttaatgggta aatacgttca tttcatcaca 240
aactctttat cataccataa ggttttgtgt catggtaaat acttcgattg gaagaaattc 300
cattacacta catctgctaa ggcatttggt catagatcta ttgatccatc agatggtaat 360
actacagaaa acatcaataa gacttttaat aagacattgc aaggtgacgc tttatgttct 420
ttgtcagaag caatgatgca aaatttgcaa tcagttatga gaccaccagg tttgccaaaa 480
tctaaatcag ctgtttgggt tactgaaggc atgtacgcat tctgttacag agttatgttc 540
gaagctggtt acttgacttt gttcggtaaa gatatctcta agacagattc acaaagagct 600
tttattcaaa acaatttgga ttcttttaaa caatttgatc aagtttttcc agctttagtt 660
gcaggtgttc caatccattt gtttaaaact gctcataaag caagagaaag attagcagaa 720
tctttgaagc ataagaattt gtacatgaga gatcaagttt cagaattaat tagattgaga 780
atgtttttaa atgatacttt gtctacattc gatgatatgg aaaaggctaa gacacatttg 840
gttattttgt gggcttcaca agcaaatact attccagcaa cattctggtc tttgttccaa 900
atgatcagat caccagaagc tatgaaagct gcatctgaag aagttaatgg tgctttacaa 960
tcagcaggtc aagaattgtc ttcaggtggt aatgctatat atttggatca agaacaattg 1020
aacaatttgc cagttttgga ttctattatt aaggaagctt tgagattgtc ttcactatca 1080
ttgaacatca gaactgcaaa ggaagatttc acattgcatt tggaagatgg ttcttacaac 1140
atcagaaagg atgatatcat tgctttgtac ccacaattga tgcacttaga tccagaaatc 1200
tatccagatc cattgacttt taaatacgat agatacttgg atgaatctgg taaagcaaaa 1260
actacattct acagaaacgg taataagttg aagtattttt acatgccatt tggttcaggt 1320
gcaactattt gtccaggtag attgtttgct gttcaagaaa ttaaacaatt cttgatcttg 1380
atgttgtctt acttcgaatt ggaattggtt gaatcacata caaagtgtcc accattagat 1440
caatctagag ctggtttggg tattttacca ccattgaatg atattgaatt caaatataaa 1500
ttgaaacatt aa 1512
<210> 71
<211> 503
<212> PRT
<213> House mouse (Mus musculus)
<400> 71
Met Met Ser Ile Ser Leu Ile Trp Gly Ile Ala Val Val Val Ser Cys
1 5 10 15
Cys Ile Trp Phe Ile Ile Gly Ile Arg Arg Arg Lys Val Gly Glu Pro
20 25 30
Pro Leu Asp Asn Gly Leu Ile Pro Tyr Leu Gly Cys Ala Leu Lys Phe
35 40 45
Gly Ser Asn Pro Leu Glu Phe Leu Arg Ala Lys Gln Arg Lys His Gly
50 55 60
His Val Phe Thr Cys Lys Leu Met Gly Lys Tyr Val His Phe Ile Thr
65 70 75 80
Asn Ser Leu Ser Tyr His Lys Val Leu Cys His Gly Lys Tyr Phe Asp
85 90 95
Trp Lys Lys Phe His Tyr Thr Thr Ser Ala Lys Ala Phe Gly His Arg
100 105 110
Ser Ile Asp Pro Ser Asp Gly Asn Thr Thr Glu Asn Ile Asn Lys Thr
115 120 125
Phe Asn Lys Thr Leu Gln Gly Asp Ala Leu Cys Ser Leu Ser Glu Ala
130 135 140
Met Met Gln Asn Leu Gln Ser Val Met Arg Pro Pro Gly Leu Pro Lys
145 150 155 160
Ser Lys Ser Ala Val Trp Val Thr Glu Gly Met Tyr Ala Phe Cys Tyr
165 170 175
Arg Val Met Phe Glu Ala Gly Tyr Leu Thr Leu Phe Gly Lys Asp Ile
180 185 190
Ser Lys Thr Asp Ser Gln Arg Ala Phe Ile Gln Asn Asn Leu Asp Ser
195 200 205
Phe Lys Gln Phe Asp Gln Val Phe Pro Ala Leu Val Ala Gly Val Pro
210 215 220
Ile His Leu Phe Lys Thr Ala His Lys Ala Arg Glu Arg Leu Ala Glu
225 230 235 240
Ser Leu Lys His Lys Asn Leu Tyr Met Arg Asp Gln Val Ser Glu Leu
245 250 255
Ile Arg Leu Arg Met Phe Leu Asn Asp Thr Leu Ser Thr Phe Asp Asp
260 265 270
Met Glu Lys Ala Lys Thr His Leu Val Ile Leu Trp Ala Ser Gln Ala
275 280 285
Asn Thr Ile Pro Ala Thr Phe Trp Ser Leu Phe Gln Met Ile Arg Ser
290 295 300
Pro Glu Ala Met Lys Ala Ala Ser Glu Glu Val Asn Gly Ala Leu Gln
305 310 315 320
Ser Ala Gly Gln Glu Leu Ser Ser Gly Gly Asn Ala Ile Tyr Leu Asp
325 330 335
Gln Glu Gln Leu Asn Asn Leu Pro Val Leu Asp Ser Ile Ile Lys Glu
340 345 350
Ala Leu Arg Leu Ser Ser Ile Ser Leu Asn Ile Arg Thr Ala Lys Glu
355 360 365
Asp Phe Thr Leu His Leu Glu Asp Gly Ser Tyr Asn Ile Arg Lys Asp
370 375 380
Asp Ile Ile Ala Leu Tyr Pro Gln Leu Met His Leu Asp Pro Glu Ile
385 390 395 400
Tyr Pro Asp Pro Leu Thr Phe Lys Tyr Asp Arg Tyr Leu Asp Glu Ser
405 410 415
Gly Lys Ala Lys Thr Thr Phe Tyr Arg Asn Gly Asn Lys Leu Lys Tyr
420 425 430
Phe Tyr Met Pro Phe Gly Ser Gly Ala Thr Ile Cys Pro Gly Arg Leu
435 440 445
Phe Ala Val Gln Glu Ile Lys Gln Phe Leu Ile Leu Met Leu Ser Tyr
450 455 460
Phe Glu Leu Glu Leu Val Glu Ser His Thr Lys Cys Pro Pro Leu Asp
465 470 475 480
Gln Ser Arg Ala Gly Leu Gly Ile Leu Pro Pro Leu Asn Asp Ile Glu
485 490 495
Phe Lys Tyr Lys Leu Lys His
500
<210> 72
<211> 1512
<212> DNA
<213> House mouse (Mus musculus)
<400> 72
atgatgtcta tctcattgat ctggggtatc gctgttgttg tttcttgttg tatttggttc 60
atcatcggta ttagaagaag aaaggttggt gaaccaccat tggataacgg tttgatccca 120
tatttgggtt gtgcattgaa gttcggttca aacccattgg aattcttgag agctaagcaa 180
agaaagcatg gtcatgtttt tacttgtaag ttaatgggta aatacgttca tttcatcaca 240
aactctttat cataccataa ggttttgtgt catggtaaat acttcgattg gaagaaattc 300
cattacacta catctgctaa ggcatttggt catagatcta ttgatccatc agatggtaat 360
actacagaaa acatcaataa gacttttaat aagacattgc aaggtgacgc tttatgttct 420
ttgtcagaag caatgatgca aaatttgcaa tcagttatga gaccaccagg tttgccaaaa 480
tctaaatcag ctgtttgggt tactgaaggc atgtacgcat tctgttacag agttatgttc 540
gaagctggtt acttgacttt gttcggtaaa gatatctcta agacagattc acaaagagct 600
tttattcaaa acaatttgga ttcttttaaa caatttgatc aagtttttcc agctttagtt 660
gcaggtgttc caatccattt gtttaaaact gctcataaag caagagaaag attagcagaa 720
tctttgaagc ataagaattt gtacatgaga gatcaagttt cagaattaat tagattgaga 780
atgtttttaa atgatacttt gtctacattc gatgatatgg aaaaggctaa gacacatttg 840
gttattttgt gggcttcaca agcaaatact attccagcaa cattctggtc tttgttccaa 900
atgatcagat caccagaagc tatgaaagct gcatctgaag aagttaatgg tgctttacaa 960
tcagcaggtc aagaattgtc ttcaggtggt aatgctatat atttggatca agaacaattg 1020
aacaatttgc cagttttgga ttctattatt aaggaagctt tgagattgtc ttcaatatca 1080
ttgaacatca gaactgcaaa ggaagatttc acattgcatt tggaagatgg ttcttacaac 1140
atcagaaagg atgatatcat tgctttgtac ccacaattga tgcacttaga tccagaaatc 1200
tatccagatc cattgacttt taaatacgat agatacttgg atgaatctgg taaagcaaaa 1260
actacattct acagaaacgg taataagttg aagtattttt acatgccatt tggttcaggt 1320
gcaactattt gtccaggtag attgtttgct gttcaagaaa ttaaacaatt cttgatcttg 1380
atgttgtctt acttcgaatt ggaattggtt gaatcacata caaagtgtcc accattagat 1440
caatctagag ctggtttggg tattttacca ccattgaatg atattgaatt caaatataaa 1500
ttgaaacatt aa 1512
<210> 73
<211> 503
<212> PRT
<213> House mouse (Mus musculus)
<400> 73
Met Met Ser Ile Ser Leu Ile Trp Gly Ile Ala Val Val Val Ser Cys
1 5 10 15
Cys Ile Trp Phe Ile Ile Gly Ile Arg Arg Arg Lys Val Gly Glu Pro
20 25 30
Pro Leu Asp Asn Gly Leu Ile Pro Tyr Leu Gly Cys Ala Leu Lys Phe
35 40 45
Gly Ser Asn Pro Leu Glu Phe Leu Arg Ala Lys Gln Arg Lys His Gly
50 55 60
His Val Phe Thr Cys Lys Leu Met Gly Lys Tyr Val His Phe Ile Thr
65 70 75 80
Asn Ser Leu Ser Tyr His Lys Val Leu Cys His Gly Lys Tyr Phe Asp
85 90 95
Trp Lys Lys Phe His Tyr Thr Thr Ser Ala Lys Ala Phe Gly His Arg
100 105 110
Ser Ile Asp Pro Ser Asp Gly Asn Thr Thr Glu Asn Ile Asn Lys Thr
115 120 125
Phe Asn Lys Thr Leu Gln Gly Asp Ala Leu Cys Ser Leu Ser Glu Ala
130 135 140
Met Met Gln Asn Leu Gln Ser Val Met Arg Pro Pro Gly Leu Pro Lys
145 150 155 160
Ser Lys Ser Ala Val Trp Val Thr Glu Gly Met Tyr Ala Phe Cys Tyr
165 170 175
Arg Val Met Phe Glu Ala Gly Tyr Leu Thr Leu Phe Gly Lys Asp Ile
180 185 190
Ser Lys Thr Asp Ser Gln Arg Ala Phe Ile Gln Asn Asn Leu Asp Ser
195 200 205
Phe Lys Gln Phe Asp Gln Val Phe Pro Ala Leu Val Ala Gly Val Pro
210 215 220
Ile His Leu Phe Lys Thr Ala His Lys Ala Arg Glu Arg Leu Ala Glu
225 230 235 240
Ser Leu Lys His Lys Asn Leu Tyr Met Arg Asp Gln Val Ser Glu Leu
245 250 255
Ile Arg Leu Arg Met Phe Leu Asn Asp Thr Leu Ser Thr Phe Asp Asp
260 265 270
Met Glu Lys Ala Lys Thr His Leu Val Ile Leu Trp Ala Ser Gln Ala
275 280 285
Asn Thr Ile Pro Ala Thr Phe Trp Ser Leu Phe Gln Met Ile Arg Ser
290 295 300
Pro Glu Ala Met Lys Ala Ala Ser Glu Glu Val Asn Gly Ala Leu Gln
305 310 315 320
Ser Ala Gly Gln Glu Leu Ser Ser Gly Gly Asn Ala Ile Tyr Leu Asp
325 330 335
Gln Glu Gln Leu Asn Asn Leu Pro Val Leu Asp Ser Ile Ile Lys Glu
340 345 350
Ala Leu Arg Leu Ser Ser Phe Ser Leu Asn Ile Arg Thr Ala Lys Glu
355 360 365
Asp Phe Thr Leu His Leu Glu Asp Gly Ser Tyr Asn Ile Arg Lys Asp
370 375 380
Asp Ile Ile Ala Leu Tyr Pro Gln Leu Met His Leu Asp Pro Glu Ile
385 390 395 400
Tyr Pro Asp Pro Leu Thr Phe Lys Tyr Asp Arg Tyr Leu Asp Glu Ser
405 410 415
Gly Lys Ala Lys Thr Thr Phe Tyr Arg Asn Gly Asn Lys Leu Lys Tyr
420 425 430
Phe Tyr Met Pro Phe Gly Ser Gly Ala Thr Ile Cys Pro Gly Arg Leu
435 440 445
Phe Ala Val Gln Glu Ile Lys Gln Phe Leu Ile Leu Met Leu Ser Tyr
450 455 460
Phe Glu Leu Glu Leu Val Glu Ser His Thr Lys Cys Pro Pro Leu Asp
465 470 475 480
Gln Ser Arg Ala Gly Leu Gly Ile Leu Pro Pro Leu Asn Asp Ile Glu
485 490 495
Phe Lys Tyr Lys Leu Lys His
500
<210> 74
<211> 1512
<212> DNA
<213> house mouse (Mus musculus)
<400> 74
atgatgtcta tctcattgat ctggggtatc gctgttgttg tttcttgttg tatttggttc 60
atcatcggta ttagaagaag aaaggttggt gaaccaccat tggataacgg tttgatccca 120
tatttgggtt gtgcattgaa gttcggttca aacccattgg aattcttgag agctaagcaa 180
agaaagcatg gtcatgtttt tacttgtaag ttaatgggta aatacgttca tttcatcaca 240
aactctttat cataccataa ggttttgtgt catggtaaat acttcgattg gaagaaattc 300
cattacacta catctgctaa ggcatttggt catagatcta ttgatccatc agatggtaat 360
actacagaaa acatcaataa gacttttaat aagacattgc aaggtgacgc tttatgttct 420
ttgtcagaag caatgatgca aaatttgcaa tcagttatga gaccaccagg tttgccaaaa 480
tctaaatcag ctgtttgggt tactgaaggc atgtacgcat tctgttacag agttatgttc 540
gaagctggtt acttgacttt gttcggtaaa gatatctcta agacagattc acaaagagct 600
tttattcaaa acaatttgga ttcttttaaa caatttgatc aagtttttcc agctttagtt 660
gcaggtgttc caatccattt gtttaaaact gctcataaag caagagaaag attagcagaa 720
tctttgaagc ataagaattt gtacatgaga gatcaagttt cagaattaat tagattgaga 780
atgtttttaa atgatacttt gtctacattc gatgatatgg aaaaggctaa gacacatttg 840
gttattttgt gggcttcaca agcaaatact attccagcaa cattctggtc tttgttccaa 900
atgatcagat caccagaagc tatgaaagct gcatctgaag aagttaatgg tgctttacaa 960
tcagcaggtc aagaattgtc ttcaggtggt aatgctatat atttggatca agaacaattg 1020
aacaatttgc cagttttgga ttctattatt aaggaagctt tgagattgtc ttcattctca 1080
ttgaacatca gaactgcaaa ggaagatttc acattgcatt tggaagatgg ttcttacaac 1140
atcagaaagg atgatatcat tgctttgtac ccacaattga tgcacttaga tccagaaatc 1200
tatccagatc cattgacttt taaatacgat agatacttgg atgaatctgg taaagcaaaa 1260
actacattct acagaaacgg taataagttg aagtattttt acatgccatt tggttcaggt 1320
gcaactattt gtccaggtag attgtttgct gttcaagaaa ttaaacaatt cttgatcttg 1380
atgttgtctt acttcgaatt ggaattggtt gaatcacata caaagtgtcc accattagat 1440
caatctagag ctggtttggg tattttacca ccattgaatg atattgaatt caaatataaa 1500
ttgaaacatt aa 1512
<210> 75
<211> 503
<212> PRT
<213> house mouse (Mus musculus)
<400> 75
Met Met Ser Ile Ser Leu Ile Trp Gly Ile Ala Val Val Val Ser Cys
1 5 10 15
Cys Ile Trp Phe Ile Ile Gly Ile Arg Arg Arg Lys Val Gly Glu Pro
20 25 30
Pro Leu Asp Asn Gly Leu Ile Pro Tyr Leu Gly Cys Ala Leu Lys Phe
35 40 45
Gly Ser Asn Pro Leu Glu Phe Leu Arg Ala Lys Gln Arg Lys His Gly
50 55 60
His Val Phe Thr Cys Lys Leu Met Gly Lys Tyr Val His Phe Ile Thr
65 70 75 80
Asn Ser Leu Ser Tyr His Lys Val Leu Cys His Gly Lys Tyr Phe Asp
85 90 95
Trp Lys Lys Phe His Tyr Thr Thr Ser Ala Lys Ala Phe Gly His Arg
100 105 110
Ser Ile Asp Pro Ser Asp Gly Asn Thr Thr Glu Asn Ile Asn Lys Thr
115 120 125
Phe Asn Lys Thr Leu Gln Gly Asp Ala Leu Cys Ser Leu Ser Glu Ala
130 135 140
Met Met Gln Asn Leu Gln Ser Val Met Arg Pro Pro Gly Leu Pro Lys
145 150 155 160
Ser Lys Ser Ala Val Trp Val Thr Glu Gly Met Tyr Ala Phe Cys Tyr
165 170 175
Arg Val Met Phe Glu Ala Gly Tyr Leu Thr Leu Phe Gly Lys Asp Ile
180 185 190
Ser Lys Thr Asp Ser Gln Arg Ala Phe Ile Gln Asn Asn Leu Asp Ser
195 200 205
Phe Lys Gln Phe Asp Gln Val Phe Pro Ala Leu Val Ala Gly Val Pro
210 215 220
Ile His Leu Phe Lys Thr Ala His Lys Ala Arg Glu Arg Leu Ala Glu
225 230 235 240
Ser Leu Lys His Lys Asn Leu Tyr Met Arg Asp Gln Val Ser Glu Leu
245 250 255
Ile Arg Leu Arg Met Phe Leu Asn Asp Thr Leu Ser Thr Phe Asp Asp
260 265 270
Met Glu Lys Ala Lys Thr His Leu Val Ile Leu Trp Ala Ser Gln Ala
275 280 285
Asn Thr Ile Pro Ala Thr Phe Trp Ser Leu Phe Gln Met Ile Arg Ser
290 295 300
Pro Glu Ala Met Lys Ala Ala Ser Glu Glu Val Asn Gly Ala Leu Gln
305 310 315 320
Ser Ala Gly Gln Glu Leu Ser Ser Gly Gly Asn Ala Ile Tyr Leu Asp
325 330 335
Gln Glu Gln Leu Asn Asn Leu Pro Val Leu Asp Ser Ile Ile Lys Glu
340 345 350
Ala Leu Arg Leu Ser Ser Ala Thr Leu Asn Ile Arg Thr Ala Lys Glu
355 360 365
Asp Phe Thr Leu His Leu Glu Asp Gly Ser Tyr Asn Ile Arg Lys Asp
370 375 380
Asp Ile Ile Ala Leu Tyr Pro Gln Leu Met His Leu Asp Pro Glu Ile
385 390 395 400
Tyr Pro Asp Pro Leu Thr Phe Lys Tyr Asp Arg Tyr Leu Asp Glu Ser
405 410 415
Gly Lys Ala Lys Thr Thr Phe Tyr Arg Asn Gly Asn Lys Leu Lys Tyr
420 425 430
Phe Tyr Met Pro Phe Gly Ser Gly Ala Thr Ile Cys Pro Gly Arg Leu
435 440 445
Phe Ala Val Gln Glu Ile Lys Gln Phe Leu Ile Leu Met Leu Ser Tyr
450 455 460
Phe Glu Leu Glu Leu Val Glu Ser His Thr Lys Cys Pro Pro Leu Asp
465 470 475 480
Gln Ser Arg Ala Gly Leu Gly Ile Leu Pro Pro Leu Asn Asp Ile Glu
485 490 495
Phe Lys Tyr Lys Leu Lys His
500
<210> 76
<211> 1512
<212> DNA
<213> house mouse (Mus musculus)
<400> 76
atgatgtcta tctcattgat ctggggtatc gctgttgttg tttcttgttg tatttggttc 60
atcatcggta ttagaagaag aaaggttggt gaaccaccat tggataacgg tttgatccca 120
tatttgggtt gtgcattgaa gttcggttca aacccattgg aattcttgag agctaagcaa 180
agaaagcatg gtcatgtttt tacttgtaag ttaatgggta aatacgttca tttcatcaca 240
aactctttat cataccataa ggttttgtgt catggtaaat acttcgattg gaagaaattc 300
cattacacta catctgctaa ggcatttggt catagatcta ttgatccatc agatggtaat 360
actacagaaa acatcaataa gacttttaat aagacattgc aaggtgacgc tttatgttct 420
ttgtcagaag caatgatgca aaatttgcaa tcagttatga gaccaccagg tttgccaaaa 480
tctaaatcag ctgtttgggt tactgaaggc atgtacgcat tctgttacag agttatgttc 540
gaagctggtt acttgacttt gttcggtaaa gatatctcta agacagattc acaaagagct 600
tttattcaaa acaatttgga ttcttttaaa caatttgatc aagtttttcc agctttagtt 660
gcaggtgttc caatccattt gtttaaaact gctcataaag caagagaaag attagcagaa 720
tctttgaagc ataagaattt gtacatgaga gatcaagttt cagaattaat tagattgaga 780
atgtttttaa atgatacttt gtctacattc gatgatatgg aaaaggctaa gacacatttg 840
gttattttgt gggcttcaca agcaaatact attccagcaa cattctggtc tttgttccaa 900
atgatcagat caccagaagc tatgaaagct gcatctgaag aagttaatgg tgctttacaa 960
tcagcaggtc aagaattgtc ttcaggtggt aatgctatat atttggatca agaacaattg 1020
aacaatttgc cagttttgga ttctattatt aaggaagctt tgagattgtc ttcagcaaca 1080
ttgaacatca gaactgcaaa ggaagatttc acattgcatt tggaagatgg ttcttacaac 1140
atcagaaagg atgatatcat tgctttgtac ccacaattga tgcacttaga tccagaaatc 1200
tatccagatc cattgacttt taaatacgat agatacttgg atgaatctgg taaagcaaaa 1260
actacattct acagaaacgg taataagttg aagtattttt acatgccatt tggttcaggt 1320
gcaactattt gtccaggtag attgtttgct gttcaagaaa ttaaacaatt cttgatcttg 1380
atgttgtctt acttcgaatt ggaattggtt gaatcacata caaagtgtcc accattagat 1440
caatctagag ctggtttggg tattttacca ccattgaatg atattgaatt caaatataaa 1500
ttgaaacatt aa 1512
<210> 77
<211> 503
<212> PRT
<213> house mouse (Mus musculus)
<400> 77
Met Met Ser Ile Ser Leu Ile Trp Gly Ile Ala Val Val Val Ser Cys
1 5 10 15
Cys Ile Trp Phe Ile Ile Gly Ile Arg Arg Arg Lys Val Gly Glu Pro
20 25 30
Pro Leu Asp Asn Gly Leu Ile Pro Tyr Leu Gly Cys Ala Leu Lys Phe
35 40 45
Gly Ser Asn Pro Leu Glu Phe Leu Arg Ala Lys Gln Arg Lys His Gly
50 55 60
His Val Phe Thr Cys Lys Leu Met Gly Lys Tyr Val His Phe Ile Thr
65 70 75 80
Asn Ser Leu Ser Tyr His Lys Val Leu Cys His Gly Lys Tyr Phe Asp
85 90 95
Trp Lys Lys Phe His Tyr Thr Thr Ser Ala Lys Ala Phe Gly His Arg
100 105 110
Ser Ile Asp Pro Ser Asp Gly Asn Thr Thr Glu Asn Ile Asn Lys Thr
115 120 125
Phe Asn Lys Thr Leu Gln Gly Asp Ala Leu Cys Ser Leu Ser Glu Ala
130 135 140
Met Met Gln Asn Leu Gln Ser Val Met Arg Pro Pro Gly Leu Pro Lys
145 150 155 160
Ser Lys Ser Ala Val Trp Val Thr Glu Gly Met Tyr Ala Phe Cys Tyr
165 170 175
Arg Val Met Phe Glu Ala Gly Tyr Leu Thr Leu Phe Gly Lys Asp Ile
180 185 190
Ser Lys Thr Asp Ser Gln Arg Ala Phe Ile Gln Asn Asn Leu Asp Ser
195 200 205
Phe Lys Gln Phe Asp Gln Val Phe Pro Ala Leu Val Ala Gly Val Pro
210 215 220
Ile His Leu Phe Lys Thr Ala His Lys Ala Arg Glu Arg Leu Ala Glu
225 230 235 240
Ser Leu Lys His Lys Asn Leu Tyr Met Arg Asp Gln Val Ser Glu Leu
245 250 255
Ile Arg Leu Arg Met Phe Leu Asn Asp Thr Leu Ser Thr Phe Asp Asp
260 265 270
Met Glu Lys Ala Lys Thr His Leu Val Ile Leu Trp Ala Ser Gln Ala
275 280 285
Asn Thr Ile Pro Ala Thr Phe Trp Ser Leu Phe Gln Met Ile Arg Ser
290 295 300
Pro Glu Ala Met Lys Ala Ala Ser Glu Glu Val Asn Gly Ala Leu Gln
305 310 315 320
Ser Ala Gly Gln Glu Leu Ser Ser Gly Gly Asn Ala Ile Tyr Leu Asp
325 330 335
Gln Glu Gln Leu Asn Asn Leu Pro Val Leu Asp Ser Ile Ile Lys Glu
340 345 350
Ala Leu Arg Leu Ser Ser Ala Val Leu Asn Ile Arg Thr Ala Lys Glu
355 360 365
Asp Phe Thr Leu His Leu Glu Asp Gly Ser Tyr Asn Ile Arg Lys Asp
370 375 380
Asp Ile Ile Ala Leu Tyr Pro Gln Leu Met His Leu Asp Pro Glu Ile
385 390 395 400
Tyr Pro Asp Pro Leu Thr Phe Lys Tyr Asp Arg Tyr Leu Asp Glu Ser
405 410 415
Gly Lys Ala Lys Thr Thr Phe Tyr Arg Asn Gly Asn Lys Leu Lys Tyr
420 425 430
Phe Tyr Met Pro Phe Gly Ser Gly Ala Thr Ile Cys Pro Gly Arg Leu
435 440 445
Phe Ala Val Gln Glu Ile Lys Gln Phe Leu Ile Leu Met Leu Ser Tyr
450 455 460
Phe Glu Leu Glu Leu Val Glu Ser His Thr Lys Cys Pro Pro Leu Asp
465 470 475 480
Gln Ser Arg Ala Gly Leu Gly Ile Leu Pro Pro Leu Asn Asp Ile Glu
485 490 495
Phe Lys Tyr Lys Leu Lys His
500
<210> 78
<211> 1512
<212> DNA
<213> house mouse (Mus musculus)
<400> 78
atgatgtcta tctcattgat ctggggtatc gctgttgttg tttcttgttg tatttggttc 60
atcatcggta ttagaagaag aaaggttggt gaaccaccat tggataacgg tttgatccca 120
tatttgggtt gtgcattgaa gttcggttca aacccattgg aattcttgag agctaagcaa 180
agaaagcatg gtcatgtttt tacttgtaag ttaatgggta aatacgttca tttcatcaca 240
aactctttat cataccataa ggttttgtgt catggtaaat acttcgattg gaagaaattc 300
cattacacta catctgctaa ggcatttggt catagatcta ttgatccatc agatggtaat 360
actacagaaa acatcaataa gacttttaat aagacattgc aaggtgacgc tttatgttct 420
ttgtcagaag caatgatgca aaatttgcaa tcagttatga gaccaccagg tttgccaaaa 480
tctaaatcag ctgtttgggt tactgaaggc atgtacgcat tctgttacag agttatgttc 540
gaagctggtt acttgacttt gttcggtaaa gatatctcta agacagattc acaaagagct 600
tttattcaaa acaatttgga ttcttttaaa caatttgatc aagtttttcc agctttagtt 660
gcaggtgttc caatccattt gtttaaaact gctcataaag caagagaaag attagcagaa 720
tctttgaagc ataagaattt gtacatgaga gatcaagttt cagaattaat tagattgaga 780
atgtttttaa atgatacttt gtctacattc gatgatatgg aaaaggctaa gacacatttg 840
gttattttgt gggcttcaca agcaaatact attccagcaa cattctggtc tttgttccaa 900
atgatcagat caccagaagc tatgaaagct gcatctgaag aagttaatgg tgctttacaa 960
tcagcaggtc aagaattgtc ttcaggtggt aatgctatat atttggatca agaacaattg 1020
aacaatttgc cagttttgga ttctattatt aaggaagctt tgagattgtc ttcagcagta 1080
ttgaacatca gaactgcaaa ggaagatttc acattgcatt tggaagatgg ttcttacaac 1140
atcagaaagg atgatatcat tgctttgtac ccacaattga tgcacttaga tccagaaatc 1200
tatccagatc cattgacttt taaatacgat agatacttgg atgaatctgg taaagcaaaa 1260
actacattct acagaaacgg taataagttg aagtattttt acatgccatt tggttcaggt 1320
gcaactattt gtccaggtag attgtttgct gttcaagaaa ttaaacaatt cttgatcttg 1380
atgttgtctt acttcgaatt ggaattggtt gaatcacata caaagtgtcc accattagat 1440
caatctagag ctggtttggg tattttacca ccattgaatg atattgaatt caaatataaa 1500
ttgaaacatt aa 1512
<210> 79
<211> 503
<212> PRT
<213> house mouse (Mus musculus)
<400> 79
Met Met Ser Ile Ser Leu Ile Trp Gly Ile Ala Val Val Val Ser Cys
1 5 10 15
Cys Ile Trp Phe Ile Ile Gly Ile Arg Arg Arg Lys Val Gly Glu Pro
20 25 30
Pro Leu Asp Asn Gly Leu Ile Pro Tyr Leu Gly Cys Ala Leu Lys Phe
35 40 45
Gly Ser Asn Pro Leu Glu Phe Leu Arg Ala Lys Gln Arg Lys His Gly
50 55 60
His Val Phe Thr Cys Lys Leu Met Gly Lys Tyr Val His Phe Ile Thr
65 70 75 80
Asn Ser Leu Ser Tyr His Lys Val Leu Cys His Gly Lys Tyr Phe Asp
85 90 95
Trp Lys Lys Phe His Tyr Thr Thr Ser Ala Lys Ala Phe Gly His Arg
100 105 110
Ser Ile Asp Pro Ser Asp Gly Asn Thr Thr Glu Asn Ile Asn Lys Thr
115 120 125
Phe Asn Lys Thr Leu Gln Gly Asp Ala Leu Cys Ser Leu Ser Glu Ala
130 135 140
Met Met Gln Asn Leu Gln Ser Val Met Arg Pro Pro Gly Leu Pro Lys
145 150 155 160
Ser Lys Ser Ala Val Trp Val Thr Glu Gly Met Tyr Ala Phe Cys Tyr
165 170 175
Arg Val Met Phe Glu Ala Gly Tyr Leu Thr Leu Phe Gly Lys Asp Ile
180 185 190
Ser Lys Thr Asp Ser Gln Arg Ala Phe Ile Gln Asn Asn Leu Asp Ser
195 200 205
Phe Lys Gln Phe Asp Gln Val Phe Pro Ala Leu Val Ala Gly Val Pro
210 215 220
Ile His Leu Phe Lys Thr Ala His Lys Ala Arg Glu Arg Leu Ala Glu
225 230 235 240
Ser Leu Lys His Lys Asn Leu Tyr Met Arg Asp Gln Val Ser Glu Leu
245 250 255
Ile Arg Leu Arg Met Phe Leu Asn Asp Thr Leu Ser Thr Phe Asp Asp
260 265 270
Met Glu Lys Ala Lys Thr His Leu Val Ile Leu Trp Ala Ser Gln Ala
275 280 285
Asn Thr Ile Pro Ala Thr Phe Trp Ser Leu Phe Gln Met Ile Arg Ser
290 295 300
Pro Glu Ala Met Lys Ala Ala Ser Glu Glu Val Asn Gly Ala Leu Gln
305 310 315 320
Ser Ala Gly Gln Glu Leu Ser Ser Gly Gly Asn Ala Ile Tyr Leu Asp
325 330 335
Gln Glu Gln Leu Asn Asn Leu Pro Val Leu Asp Ser Ile Ile Lys Glu
340 345 350
Ala Leu Arg Leu Ser Ser Ala Ala Leu Asn Ile Arg Thr Ala Lys Glu
355 360 365
Asp Phe Thr Leu His Leu Glu Asp Gly Ser Tyr Asn Ile Arg Lys Asp
370 375 380
Asp Ile Ile Ala Leu Tyr Pro Gln Leu Met His Leu Asp Pro Glu Ile
385 390 395 400
Tyr Pro Asp Pro Leu Thr Phe Lys Tyr Asp Arg Tyr Leu Asp Glu Ser
405 410 415
Gly Lys Ala Lys Thr Thr Phe Tyr Arg Asn Gly Asn Lys Leu Lys Tyr
420 425 430
Phe Tyr Met Pro Phe Gly Ser Gly Ala Thr Ile Cys Pro Gly Arg Leu
435 440 445
Phe Ala Val Gln Glu Ile Lys Gln Phe Leu Ile Leu Met Leu Ser Tyr
450 455 460
Phe Glu Leu Glu Leu Val Glu Ser His Thr Lys Cys Pro Pro Leu Asp
465 470 475 480
Gln Ser Arg Ala Gly Leu Gly Ile Leu Pro Pro Leu Asn Asp Ile Glu
485 490 495
Phe Lys Tyr Lys Leu Lys His
500
<210> 80
<211> 1512
<212> DNA
<213> house mouse (Mus musculus)
<400> 80
atgatgtcta tctcattgat ctggggtatc gctgttgttg tttcttgttg tatttggttc 60
atcatcggta ttagaagaag aaaggttggt gaaccaccat tggataacgg tttgatccca 120
tatttgggtt gtgcattgaa gttcggttca aacccattgg aattcttgag agctaagcaa 180
agaaagcatg gtcatgtttt tacttgtaag ttaatgggta aatacgttca tttcatcaca 240
aactctttat cataccataa ggttttgtgt catggtaaat acttcgattg gaagaaattc 300
cattacacta catctgctaa ggcatttggt catagatcta ttgatccatc agatggtaat 360
actacagaaa acatcaataa gacttttaat aagacattgc aaggtgacgc tttatgttct 420
ttgtcagaag caatgatgca aaatttgcaa tcagttatga gaccaccagg tttgccaaaa 480
tctaaatcag ctgtttgggt tactgaaggc atgtacgcat tctgttacag agttatgttc 540
gaagctggtt acttgacttt gttcggtaaa gatatctcta agacagattc acaaagagct 600
tttattcaaa acaatttgga ttcttttaaa caatttgatc aagtttttcc agctttagtt 660
gcaggtgttc caatccattt gtttaaaact gctcataaag caagagaaag attagcagaa 720
tctttgaagc ataagaattt gtacatgaga gatcaagttt cagaattaat tagattgaga 780
atgtttttaa atgatacttt gtctacattc gatgatatgg aaaaggctaa gacacatttg 840
gttattttgt gggcttcaca agcaaatact attccagcaa cattctggtc tttgttccaa 900
atgatcagat caccagaagc tatgaaagct gcatctgaag aagttaatgg tgctttacaa 960
tcagcaggtc aagaattgtc ttcaggtggt aatgctatat atttggatca agaacaattg 1020
aacaatttgc cagttttgga ttctattatt aaggaagctt tgagattgtc ttcagcagca 1080
ttgaacatca gaactgcaaa ggaagatttc acattgcatt tggaagatgg ttcttacaac 1140
atcagaaagg atgatatcat tgctttgtac ccacaattga tgcacttaga tccagaaatc 1200
tatccagatc cattgacttt taaatacgat agatacttgg atgaatctgg taaagcaaaa 1260
actacattct acagaaacgg taataagttg aagtattttt acatgccatt tggttcaggt 1320
gcaactattt gtccaggtag attgtttgct gttcaagaaa ttaaacaatt cttgatcttg 1380
atgttgtctt acttcgaatt ggaattggtt gaatcacata caaagtgtcc accattagat 1440
caatctagag ctggtttggg tattttacca ccattgaatg atattgaatt caaatataaa 1500
ttgaaacatt aa 1512
<210> 81
<211> 369
<212> PRT
<213> human (Homo sapiens)
<400> 81
Met Ala Asp Ser Ala Gln Ala Gln Lys Leu Val Tyr Leu Val Thr Gly
1 5 10 15
Gly Cys Gly Phe Leu Gly Glu His Val Val Arg Met Leu Leu Gln Arg
20 25 30
Glu Pro Arg Leu Gly Glu Leu Arg Val Phe Asp Gln His Leu Gly Pro
35 40 45
Trp Leu Glu Glu Leu Lys Thr Gly Pro Val Arg Val Thr Ala Ile Gln
50 55 60
Gly Asp Val Thr Gln Ala His Glu Val Ala Ala Ala Val Ala Gly Ala
65 70 75 80
His Val Val Ile His Thr Ala Gly Leu Val Asp Val Phe Gly Arg Ala
85 90 95
Ser Pro Lys Thr Ile His Glu Val Asn Val Gln Gly Thr Arg Asn Val
100 105 110
Ile Glu Ala Cys Val Gln Thr Gly Thr Arg Phe Leu Val Tyr Thr Ser
115 120 125
Ser Met Glu Val Val Gly Pro Asn Thr Lys Gly His Pro Phe Tyr Arg
130 135 140
Gly Asn Glu Asp Thr Pro Tyr Glu Ala Val His Arg His Pro Tyr Pro
145 150 155 160
Cys Ser Lys Ala Leu Ala Glu Trp Leu Val Leu Glu Ala Asn Gly Arg
165 170 175
Lys Val Arg Gly Gly Leu Pro Leu Val Thr Cys Ala Leu Arg Pro Thr
180 185 190
Gly Ile Tyr Gly Glu Gly His Gln Ile Met Arg Asp Phe Tyr Arg Gln
195 200 205
Gly Leu Arg Leu Gly Gly Trp Leu Phe Arg Ala Ile Pro Ala Ser Val
210 215 220
Glu His Gly Arg Val Tyr Val Gly Asn Val Ala Trp Met His Val Leu
225 230 235 240
Ala Ala Arg Glu Leu Glu Gln Arg Ala Thr Leu Met Gly Gly Gln Val
245 250 255
Tyr Phe Cys Tyr Asp Gly Ser Pro Tyr Arg Ser Tyr Glu Asp Phe Asn
260 265 270
Met Glu Phe Leu Gly Pro Cys Gly Leu Arg Leu Val Gly Ala Arg Pro
275 280 285
Leu Leu Pro Tyr Trp Leu Leu Val Phe Leu Ala Ala Leu Asn Ala Leu
290 295 300
Leu Gln Trp Leu Leu Arg Pro Leu Val Leu Tyr Ala Pro Leu Leu Asn
305 310 315 320
Pro Tyr Thr Leu Ala Val Ala Asn Thr Thr Phe Thr Val Ser Thr Asp
325 330 335
Lys Ala Gln Arg His Phe Gly Tyr Glu Pro Leu Phe Ser Trp Glu Asp
340 345 350
Ser Arg Thr Arg Thr Ile Leu Trp Val Gln Ala Ala Thr Gly Ser Ala
355 360 365
Gln
<210> 82
<211> 1110
<212> DNA
<213> human (Homo sapiens)
<400> 82
atggctgatt ctgcacaagc tcaaaaattg gtttacttag ttactggtgg ttgtggtttc 60
ttgggtgaac atgttgttag aatgttgtta caaagagaac caagattggg tgaattaaga 120
gtttttgatc aacatttggg tccatggttg gaagaattaa aaactggtcc agttagagtt 180
acagcaattc aaggtgacgt tactcaagct catgaagttg ctgcagctgt tgcaggtgct 240
catgttgtta ttcatacagc aggtttggtt gatgtttttg gtagagcttc accaaagact 300
atccatgaag ttaacgttca aggtacaaga aacgttattg aagcatgtgt tcaaactggt 360
acaagatttt tagtttacac ttcttcaatg gaagttgttg gtccaaatac aaaaggtcat 420
ccattctacc gtggtaacga agatactcca tacgaagctg ttcatagaca tccatatcca 480
tgttctaaag cattggctga atggttggtt ttagaagcaa atggtagaaa agttagaggt 540
ggtttgccat tagttacttg tgctttaaga ccaacaggta tctatggtga aggtcatcaa 600
atcatgagag atttctacag acaaggtttg agattaggtg gttggttgtt tagagcaatt 660
ccagcttcag ttgaacatgg tagagtttat gttggtaatg ttgcatggat gcatgttttg 720
gcagctagag aattagaaca aagagctaca ttgatgggtg gtcaagttta cttctgttac 780
gatggttctc catacagatc atacgaagat ttcaacatgg aattcttggg tccatgtggt 840
ttgagattag ttggtgctag accattgtta ccatactggt tgttggtttt cttggcagct 900
ttgaacgcat tgttgcaatg gttgttgaga ccattggttt tgtacgctcc attgttgaac 960
ccatacactt tagcagttgc taacactact tttactgttt ctacagataa agcacaaaga 1020
catttcggtt acgaaccatt gttttcttgg gaagattcaa gaactagaac aattttatgg 1080
gttcaagcag ctacaggttc agctcaataa 1110
<210> 83
<211> 338
<212> PRT
<213> brown rat (Rattus norvegicus)
<400> 83
Met Ala Asp Ser Ala Gln Val Pro Ala Leu Val Tyr Leu Val Thr Gly
1 5 10 15
Gly Cys Gly Phe Leu Gly Glu His Ile Val Arg Met Leu Leu Glu Trp
20 25 30
Glu Pro Arg Leu Arg Glu Leu Arg Val Phe Asp Leu His Leu Ser Ser
35 40 45
Trp Leu Glu Glu Leu Lys Thr Gly Pro Val Gln Val Thr Ala Ile Gln
50 55 60
Gly Asp Val Thr Gln Ala His Glu Val Ala Ala Ala Met Ala Gly Ser
65 70 75 80
His Val Val Ile His Thr Ala Gly Leu Val Asp Val Phe Gly Lys Ala
85 90 95
Ser Pro Glu Thr Ile His Lys Val Asn Val Gln Gly Thr Gln Asn Val
100 105 110
Ile Asp Ala Cys Val Gln Thr Gly Thr Arg Leu Leu Val Tyr Thr Ser
115 120 125
Ser Met Glu Val Val Gly Pro Asn Val Lys Gly His Pro Phe Tyr Arg
130 135 140
Gly Asn Glu Asp Thr Pro Tyr Glu Ala Ile His Arg His Pro Tyr Pro
145 150 155 160
Cys Ser Lys Ala Leu Ala Glu Gln Leu Val Leu Glu Ala Asn Gly Arg
165 170 175
Lys Gly Leu Arg Phe Gly Gly Arg Leu Phe Arg Ala Ile Pro Ala Ser
180 185 190
Val Glu His Gly Arg Val Tyr Val Gly Asn Val Ala Trp Met His Ile
195 200 205
Leu Val Ala Arg Glu Leu Glu Gln Arg Ala Ala Leu Met Gly Gly Gln
210 215 220
Val Tyr Phe Cys Tyr Asp Lys Ser Pro Tyr Lys Ser Tyr Glu Asp Phe
225 230 235 240
Asn Met Glu Phe Leu Ser Pro Cys Gly Leu Arg Leu Ile Gly Thr His
245 250 255
Pro Leu Leu Pro Tyr Trp Leu Leu Val Leu Leu Thr Ala Leu Asn Ala
260 265 270
Leu Leu Gln Trp Leu Leu Arg Pro Leu Val Leu Tyr Thr Pro Leu Leu
275 280 285
Asn Pro Tyr Thr Leu Ala Val Ala Asn Thr Thr Phe Thr Val Ser Thr
290 295 300
Asn Lys Ala Gln Arg His Phe Gly Tyr Lys Pro Leu Phe Ser Trp Glu
305 310 315 320
Glu Ser Arg Ala Arg Thr Ile His Trp Val Gln Ala Met Glu Gly Ser
325 330 335
Ala Trp
<210> 84
<211> 1017
<212> DNA
<213> brown rat (Rattus norvegicus)
<400> 84
atggcagatt ctgctcaagt tccagctttg gtttacttag ttactggtgg ttgtggtttc 60
ttgggtgaac atatcgttag aatgttgttg gaatgggaac caagattgag agaattgaga 120
gttttcgatt tgcatttgtc ttcatggttg gaagaattga agactggtcc agttcaagtt 180
acagcaattc aaggtgacgt tactcaagct catgaagttg ctgcagctat ggcaggttct 240
catgttgtta ttcatacagc tggtttggtt gatgtttttg gtaaagcatc accagaaact 300
atccataagg ttaacgttca aggtacacaa aatgttattg atgcttgtgt tcaaactggt 360
acaagattgt tagtttacac ttcttcaatg gaagttgttg gtccaaatgt taaaggtcat 420
ccattctacc gtggtaacga agatacacca tacgaagcta ttcatagaca tccatatcca 480
tgttctaaag cattagctga acaattggtt ttagaagcta atggtagaaa aggtttgaga 540
tttggtggta gattgtttag agcaattcca gcttcagttg aacatggtag agtttatgtt 600
ggtaatgttg catggatgca tattttggtt gctagagaat tggaacaaag agcagctttg 660
atgggtggtc aagtttactt ctgttacgat aagtctccat acaagtcata cgaagatttc 720
aacatggaat tcttgtcacc atgtggtttg agattaattg gtactcatcc attgttacca 780
tactggttgt tagttttgtt aacagcattg aatgctttgt tacaatggtt gttgagacca 840
ttggttttgt acactccatt gttgaaccca tacacattag cagttgctaa cactactttt 900
actgtttcta caaataaggc tcaaagacat ttcggttaca agccattgtt ttcttgggaa 960
gaatcaagag caagaacaat tcattgggtt caagcaatgg aaggttcagc ttggtaa 1017
<210> 85
<211> 369
<212> PRT
<213> house mouse (Mus musculus)
<400> 85
Met Ala Asp Ser Ala Gln Val Pro Thr Leu Val Tyr Leu Val Thr Gly
1 5 10 15
Gly Cys Gly Phe Leu Gly Glu His Ile Val Arg Met Leu Leu Glu Arg
20 25 30
Glu Pro Arg Leu Arg Glu Leu Arg Val Phe Asp Leu His Leu Ser Ser
35 40 45
Trp Leu Glu Glu Leu Lys Ala Gly Pro Val Gln Val Thr Ala Ile Gln
50 55 60
Gly Asp Val Thr Gln Ala His Glu Val Ala Ala Ala Met Ser Gly Ser
65 70 75 80
His Val Val Ile His Thr Ala Gly Leu Val Asp Val Phe Gly Lys Ala
85 90 95
Ser Pro Lys Thr Ile His Lys Val Asn Val Gln Gly Thr Gln Asn Val
100 105 110
Ile Asp Ala Cys Val Gln Thr Gly Thr Gln Tyr Leu Val Tyr Thr Ser
115 120 125
Ser Met Glu Val Val Gly Pro Asn Ile Lys Gly His Pro Phe Tyr Arg
130 135 140
Gly Asn Glu Asp Thr Pro Tyr Glu Ala Val His Ser His Pro Tyr Pro
145 150 155 160
Cys Ser Lys Ala Leu Ala Glu Gln Leu Val Leu Glu Ala Asn Gly Arg
165 170 175
Lys Val Asn Gly Gly Leu Pro Leu Val Thr Cys Ala Leu Arg Pro Thr
180 185 190
Gly Ile Tyr Gly Glu Gly His Gln Val Met Arg Asp Phe Tyr Tyr Gln
195 200 205
Gly Leu Arg Phe Gly Gly Arg Leu Phe Arg Ala Val Pro Ala Ser Val
210 215 220
Glu His Gly Arg Val Tyr Val Gly Asn Val Ala Trp Met His Ile Leu
225 230 235 240
Val Ala Arg Glu Leu Glu Gln Arg Ala Ala Leu Met Gly Gly Gln Val
245 250 255
Tyr Phe Cys Tyr Asp Lys Ser Pro Tyr Lys Ser Tyr Glu Asp Phe Asn
260 265 270
Met Glu Phe Leu Ser Pro Cys Gly Leu Arg Leu Ile Gly Ala His Pro
275 280 285
Leu Leu Pro Tyr Trp Leu Leu Val Leu Leu Ala Thr Leu Asn Ala Leu
290 295 300
Leu Gln Trp Leu Leu Arg Pro Leu Val Leu Tyr Thr Pro Leu Leu Asn
305 310 315 320
Pro Tyr Thr Leu Ala Met Ala Asn Thr Thr Phe Thr Val Ser Thr Asn
325 330 335
Lys Ala Gln Arg His Phe Gly Tyr Lys Pro Leu Phe Ser Trp Glu Glu
340 345 350
Ser Arg Thr Arg Thr Ile Gln Trp Val Gln Ala Met Glu Gly Ser Ala
355 360 365
Arg
<210> 86
<211> 1110
<212> DNA
<213> house mouse (Mus musculus)
<400> 86
atggcagatt ctgctcaagt tccaactttg gtttatttgg ttacaggtgg ttgtggtttc 60
ttgggtgaac atatcgttag aatgttgttg gaaagagaac caagattgag agaattgaga 120
gttttcgatt tgcatttgtc ttcatggttg gaagaattaa aagcaggtcc agttcaagtt 180
actgctattc aaggtgacgt tacacaagct catgaagttg ctgcagctat gtctggttca 240
catgttgtta ttcatactgc aggtttagtt gatgtttttg gtaaagcttc tccaaagact 300
atccataagg ttaacgttca aggtacacaa aatgttattg atgcatgtgt tcaaactggt 360
acacaatatt tggtttacac ttcttcaatg gaagttgttg gtccaaacat caagggtcat 420
ccattctacc gtggtaacga agatacacca tacgaagctg ttcattctca tccatatcca 480
tgttcaaaag cattagctga acaattggtt ttggaagcaa acggtagaaa ggttaacggt 540
ggtttgccat tagttacttg tgctttgaga ccaacaggta tctatggtga aggtcatcaa 600
gttatgagag atttctacta ccaaggtttg agattcggtg gtagattgtt tagagcagtt 660
ccagcttctg ttgaacatgg tagagtttat gttggtaatg ttgcatggat gcatattttg 720
gttgctagag aattggaaca aagagcagct ttgatgggtg gtcaagttta cttctgttac 780
gataagtctc catacaagtc atacgaagat ttcaacatgg aattcttgtc accatgtggt 840
ttgagattaa ttggtgctca tccattgtta ccatactggt tgttagtttt gttagcaaca 900
ttgaatgctt tgttacaatg gttgttgaga ccattggttt tgtacactcc attgttgaac 960
ccatacacat tagcaatggc taacactact tttactgttt ctacaaataa ggctcaaaga 1020
catttcggtt acaagccatt gttttcttgg gaagaatcaa gaactagaac aattcaatgg 1080
gttcaagcaa tggaaggttc agctagataa 1110
<210> 87
<211> 368
<212> PRT
<213> zebrafish (Danio rerio)
<400> 87
Met Ser Asn Asn Asn Lys Ser Lys Leu Thr Tyr Val Ile Thr Gly Gly
1 5 10 15
Cys Gly Phe Leu Gly Gln His Leu Leu Arg Val Leu Leu Glu Lys Glu
20 25 30
Lys Asn Val Lys Glu Ile Arg Leu Phe Asp Lys Asn Val Phe Pro Ser
35 40 45
Leu Gln Ser Glu Ser Thr Glu Asp Val Lys Val Val Ile Ile Gln Gly
50 55 60
Asp Ile Thr Lys Tyr Glu Asp Val Arg Asn Ala Phe Leu Gly Ala Asp
65 70 75 80
Leu Val Phe His Ala Ala Ser Leu Val Asp Val Trp Tyr Lys Ile Pro
85 90 95
Glu Lys Val Ile Phe Ala Val Asn Val Gln Gly Thr Glu Asn Ala Ile
100 105 110
Lys Ala Cys Val Asp Ile Gly Ile Gln Tyr Leu Val Tyr Thr Ser Ser
115 120 125
Met Glu Val Val Gly Pro Asn Val Lys Gly Asp Glu Phe Val Arg Gly
130 135 140
Asn Glu Asp Thr Pro Tyr Asn Ile Phe His Glu Met Pro Tyr Pro Lys
145 150 155 160
Ser Lys Ala Ala Ala Glu Lys Ile Val Leu Glu Ala Asn Gly Thr Lys
165 170 175
Val Glu Gly Gly Asn Ile Leu Tyr Thr Cys Cys Leu Arg Pro Thr Gly
180 185 190
Ile Tyr Gly Glu Gln His Gln Leu Met Lys Asp Phe Tyr Leu Asn Ser
195 200 205
Val Arg Asn Gly Gly Trp Val Met Arg Gly Val Pro Pro His Thr Glu
210 215 220
His Gly Arg Val Tyr Ala Gly Asn Val Ala Trp Met His Leu Leu Ala
225 230 235 240
Ala Arg Ala Leu Gln Glu His Pro Asn Arg Leu Gly Gly Glu Cys Tyr
245 250 255
Phe Cys Tyr Asp Asp Ser Pro Tyr Lys Pro Tyr Asp Glu Phe Asn Met
260 265 270
Gln Phe Leu Ser Ala Phe Asn Phe Arg Ser Leu Arg Leu Pro Val Trp
275 280 285
Met Leu Trp Ile Ile Ala Trp Met Asn Asp Met Val Arg Trp Val Leu
290 295 300
Lys Pro Ile Tyr Asn Tyr Thr Pro Leu Leu Asn Lys Tyr Thr Leu Ala
305 310 315 320
Val Ala Cys Thr Ser Phe Thr Val Ser Thr Asp Lys Ala Phe Arg His
325 330 335
Phe Gln Tyr Gln Pro Leu Tyr Ser Trp Gln Gln Cys Leu Ser Arg Thr
340 345 350
Gln Ser Trp Val Asn Thr Phe Pro Phe Glu Thr Ser Thr Lys Asp Lys
355 360 365
<210> 88
<211> 1107
<212> DNA
<213> zebrafish (Danio rerio)
<400> 88
atgtctaaca acaataagtc aaagttgaca tacgttatta ctggtggttg tggtttcttg 60
ggtcaacatt tgttaagagt tttgttggaa aaggaaaaga atgttaagga aatcagattg 120
tttgataaaa atgtttttcc atctttgcaa tctgaatcaa cagaagatgt taaggttgtt 180
attatccaag gtgacatcac taagtacgaa gatgttagaa acgcattttt gggtgctgat 240
ttggtttttc atgctgcatc attggttgat gtttggtaca agatcccaga aaaagttatt 300
tttgcagtta acgttcaagg tacagaaaac gcaattaaag cttgtgttga tatcggtatt 360
caatatttgg tttacacttc ttcaatggaa gttgttggtc caaatgttaa aggtgacgaa 420
tttgttcgtg gtaacgaaga tacaccatac aacatcttcc atgaaatgcc atacccaaaa 480
tctaaagctg cagctgaaaa gattgttttg gaagctaatg gtactaaggt tgaaggtggt 540
aacatcttgt acacatgttg tttgagacca actggtatct atggtgaaca acatcaattg 600
atgaaggatt tctatttgaa ctcagttaga aatggtggtt gggttatgag aggtgttcca 660
ccacatacag aacatggtag agtttacgct ggtaatgttg cttggatgca tttgttagca 720
gctagagcat tgcaagaaca tccaaacaga ttaggtggtg aatgttactt ctgttacgat 780
gattctccat acaagccata cgatgaattc aatatgcaat tcttgtctgc ttttaatttc 840
agatcattga gattaccagt ttggatgttg tggattattg cttggatgaa cgatatggtt 900
agatgggttt tgaagccaat ctataactac acaccattgt tgaataagta cactttggca 960
gttgcttgta cttcttttac agtttcaact gataaggctt ttagacattt ccaataccaa 1020
ccattgtact cttggcaaca atgtttatct agaacacaat catgggttaa cactttccca 1080
ttcgaaactt caacaaaaga taaataa 1107
<210> 89
<211> 326
<212> PRT
<213> human (Homo sapiens)
<400> 89
Met Asp Leu Ser Ala Ala Ser His Arg Ile Pro Leu Ser Asp Gly Asn
1 5 10 15
Ser Ile Pro Ile Ile Gly Leu Gly Thr Tyr Ser Glu Pro Lys Ser Thr
20 25 30
Pro Lys Gly Ala Cys Ala Thr Ser Val Lys Val Ala Ile Asp Thr Gly
35 40 45
Tyr Arg His Ile Asp Gly Ala Tyr Ile Tyr Gln Asn Glu His Glu Val
50 55 60
Gly Glu Ala Ile Arg Glu Lys Ile Ala Glu Gly Lys Val Arg Arg Glu
65 70 75 80
Asp Ile Phe Tyr Cys Gly Lys Leu Trp Ala Thr Asn His Val Pro Glu
85 90 95
Met Val Arg Pro Thr Leu Glu Arg Thr Leu Arg Val Leu Gln Leu Asp
100 105 110
Tyr Val Asp Leu Tyr Ile Ile Glu Val Pro Met Ala Phe Lys Pro Gly
115 120 125
Asp Glu Ile Tyr Pro Arg Asp Glu Asn Gly Lys Trp Leu Tyr His Lys
130 135 140
Ser Asn Leu Cys Ala Thr Trp Glu Ala Met Glu Ala Cys Lys Asp Ala
145 150 155 160
Gly Leu Val Lys Ser Leu Gly Val Ser Asn Phe Asn Arg Arg Gln Leu
165 170 175
Glu Leu Ile Leu Asn Lys Pro Gly Leu Lys His Lys Pro Val Ser Asn
180 185 190
Gln Val Glu Cys His Pro Tyr Phe Thr Gln Pro Lys Leu Leu Lys Phe
195 200 205
Cys Gln Gln His Asp Ile Val Ile Thr Ala Tyr Ser Pro Leu Gly Thr
210 215 220
Ser Arg Asn Pro Ile Trp Val Asn Val Ser Ser Pro Pro Leu Leu Lys
225 230 235 240
Asp Ala Leu Leu Asn Ser Leu Gly Lys Arg Tyr Asn Lys Thr Ala Ala
245 250 255
Gln Ile Val Leu Arg Phe Asn Ile Gln Arg Gly Val Val Val Ile Pro
260 265 270
Lys Ser Phe Asn Leu Glu Arg Ile Lys Glu Asn Phe Gln Ile Phe Asp
275 280 285
Phe Ser Leu Thr Glu Glu Glu Met Lys Asp Ile Glu Ala Leu Asn Lys
290 295 300
Asn Val Arg Phe Val Glu Leu Leu Met Trp Arg Asp His Pro Glu Tyr
305 310 315 320
Pro Phe His Asp Glu Tyr
325
<210> 90
<211> 981
<212> DNA
<213> human (Homo sapiens)
<400> 90
atggatttgt ctgctgcatc acatagaatt ccattgtctg atggtaactc aatcccaatc 60
atcggtttgg gtacttattc tgaaccaaaa tcaacaccaa aaggtgcttg tgcaacttct 120
gttaaagttg ctattgatac aggttacaga catatcgatg gtgcatacat ctatcaaaac 180
gaacatgaag ttggtgaagc tattagagaa aagattgcag agggtaaagt tagaagagaa 240
gatattttct attgtggtaa attgtgggct actaatcatg ttccagaaat ggttagacca 300
actttggaaa gaacattgag agttttgcaa ttggattacg ttgatttgta catcatcgaa 360
gttccaatgg cttttaaacc aggtgacgaa atctatccaa gagatgaaaa cggtaaatgg 420
ttgtaccata agtctaattt gtgtgctaca tgggaagcta tggaagcttg taaggatgca 480
ggtttagtta aatctttggg tgtttcaaac ttcaacagaa gacaattgga attgatcttg 540
aataagccag gtttgaagca taagccagtt tcaaaccaag ttgaatgtca tccatacttc 600
actcaaccaa agttgttgaa gttttgtcaa caacatgata tcgttatcac agcttactct 660
ccattgggta cttcaagaaa tccaatttgg gttaatgttt cttcaccacc attgttgaag 720
gatgcattgt tgaactcttt aggtaaaaga tacaataaga cagctgcaca aatcgttttg 780
agattcaata tccaaagagg tgttgttgtt attccaaaat cttttaattt ggaaagaatt 840
aaagaaaact tccaaatctt cgatttttca ttaactgaag aagaaatgaa ggatatcgaa 900
gctttgaata agaacgttag attcgttgaa ttgttaatgt ggagagatca tccagaatat 960
ccatttcatg atgaatacta a 981
<210> 91
<211> 325
<212> PRT
<213> house mouse (Mus musculus)
<400> 91
Met Asn Leu Ser Ala Ala His His Gln Ile Ser Leu Ser Asp Gly Asn
1 5 10 15
Asn Ile Pro Leu Ile Gly Leu Gly Thr Tyr Ser Asp Pro Arg Pro Val
20 25 30
Pro Gly Lys Thr Tyr Val Ala Val Lys Thr Ala Ile Asp Glu Gly Tyr
35 40 45
Arg His Ile Asp Gly Ala Tyr Val Tyr His Asn Glu His Glu Val Gly
50 55 60
Glu Ala Ile Arg Glu Lys Ile Ala Glu Gly Lys Val Lys Arg Glu Glu
65 70 75 80
Ile Phe Tyr Cys Gly Lys Leu Trp Asn Thr Glu His Val Pro Ser Met
85 90 95
Val Leu Pro Ala Leu Glu Arg Thr Leu Lys Ala Leu Lys Leu Asp Tyr
100 105 110
Ile Asp Leu Tyr Ile Ile Glu Leu Pro Met Ala Phe Lys Pro Gly Lys
115 120 125
Glu Ile Tyr Pro Arg Asp Glu Asn Gly Arg Ile Ile Tyr Asp Lys Thr
130 135 140
Asn Leu Cys Ala Thr Trp Glu Ala Leu Glu Ala Cys Lys Asp Ala Gly
145 150 155 160
Leu Val Lys Ser Leu Gly Val Ser Asn Phe Asn Arg Arg Gln Leu Glu
165 170 175
Leu Ile Leu Asn Lys Pro Gly Leu Lys Tyr Lys Pro Val Thr Asn Gln
180 185 190
Val Glu Cys His Pro Tyr Phe Thr Gln Thr Lys Leu Leu Lys Phe Cys
195 200 205
Gln Gln His Asp Ile Val Ile Val Ala His Ser Pro Leu Gly Thr Cys
210 215 220
Arg Asn Pro Ser Trp Val Asn Val Ser Ser Pro Pro Leu Leu Asn Asp
225 230 235 240
Glu Leu Leu Thr Ser Leu Gly Lys Lys Tyr Asn Lys Thr Gln Ala Gln
245 250 255
Ile Val Leu Arg Phe Asn Ile Gln Arg Gly Ile Val Val Ile Pro Lys
260 265 270
Ser Phe Thr Pro Glu Arg Ile Lys Glu Asn Phe Gln Ile Phe Asp Phe
275 280 285
Ser Leu Thr Glu Glu Glu Met Lys Asp Ile Asp Ala Leu Asn Lys Asn
290 295 300
Val Arg Tyr Val Glu Leu Leu Met Trp Ser Asp His Pro Glu Tyr Pro
305 310 315 320
Phe His Asp Glu Tyr
325
<210> 92
<211> 978
<212> DNA
<213> Mus musculus
<400> 92
atgaatttgt ctgctgcaca tcatcaaatc tctttgtcag atggtaacaa catcccattg 60
atcggtttgg gtacttattc agatccaaga ccagttccag gtaaaactta cgttgctgtt 120
aaaacagcaa ttgatgaagg ttacagacat atcgatggtg cttacgttta ccataatgaa 180
catgaagttg gtgaagctat tagagaaaag attgcagagg gtaaagttaa gagagaagaa 240
attttctatt gtggtaaatt gtggaacact gaacatgttc catctatggt tttaccagct 300
ttggaaagaa cattgaaggc attgaagttg gattacatcg atttgtacat catcgaattg 360
ccaatggctt ttaaacctgg taaagaaatc tatccaagag atgaaaacgg tagaatcatc 420
tatgataaga ctaatttgtg tgctacatgg gaagctttgg aagcttgtaa ggatgcaggt 480
ttagttaaat ctttgggtgt ttcaaacttc aacagaagac aattggaatt gatcttgaat 540
aagccaggtt taaagtacaa gccagttact aaccaagttg aatgtcatcc atacttcact 600
caaacaaagt tgttgaagtt ttgtcaacaa catgatatcg ttatcgttgc tcattctcca 660
ttgggtacat gtagaaatcc atcatgggtt aatgtttctt caccaccatt gttgaacgat 720
gaattgttga cttctttggg taaaaagtac aataagacac aagcacaaat cgttttgaga 780
ttcaatatcc aaagaggtat cgttgttatt ccaaagtctt ttactccaga aagaattaaa 840
gaaaacttcc aaatcttcga tttttcatta acagaagaag aaatgaagga tatcgatgct 900
ttgaataaga acgttagata cgttgaattg ttaatgtggt cagatcatcc agaatatcca 960
tttcatgatg aatactaa 978
<210> 93
<211> 326
<212> PRT
<213> Rattus norvegicus
<400> 93
Met Asn Leu Ser Thr Ala Asn His His Ile Pro Leu Asn Asp Gly Asn
1 5 10 15
Ser Ile Pro Ile Ile Gly Leu Gly Thr Tyr Ser Asp Pro Arg Pro Val
20 25 30
Pro Gly Lys Thr Phe Ile Ala Val Lys Thr Ala Ile Asp Glu Gly Tyr
35 40 45
Arg His Ile Asp Gly Ala Tyr Val Tyr Arg Asn Glu His Glu Val Gly
50 55 60
Glu Ala Ile Arg Glu Lys Val Ala Glu Gly Lys Val Lys Arg Glu Glu
65 70 75 80
Ile Phe Tyr Cys Gly Lys Leu Trp Ser Thr Asp His Asp Pro Glu Met
85 90 95
Val Arg Pro Ala Leu Glu Arg Thr Leu Gln Thr Leu Lys Leu Asp Tyr
100 105 110
Ile Asp Leu Tyr Ile Ile Glu Met Pro Met Ala Phe Lys Pro Gly Glu
115 120 125
Glu Phe Tyr Pro Lys Asp Glu Asn Gly Arg Val Ile Tyr His Lys Ser
130 135 140
Asn Leu Cys Ala Thr Trp Glu Ala Leu Glu Ala Cys Lys Asp Ala Gly
145 150 155 160
Leu Val Lys Ser Leu Gly Val Ser Asn Phe Asn Arg Arg Gln Leu Glu
165 170 175
Val Ile Leu Asn Lys Pro Gly Leu Lys Tyr Lys Pro Val Thr Asn Gln
180 185 190
Val Glu Cys His Pro Tyr Phe Thr Gln Thr Lys Leu Leu Glu Val Ser
195 200 205
Ala Ser Ser Met Thr Ser Phe Ile Val Ala Tyr Ser Pro Leu Gly Thr
210 215 220
Cys Arg Asn Pro Leu Trp Val Asn Val Ser Ser Pro Pro Leu Leu Lys
225 230 235 240
Asp Glu Leu Leu Thr Ser Leu Gly Lys Lys Tyr Asn Lys Thr Gln Ala
245 250 255
Gln Ile Val Leu Arg Phe Asp Ile Gln Arg Gly Leu Val Val Ile Pro
260 265 270
Lys Ser Thr Thr Pro Glu Arg Ile Lys Glu Asn Phe Gln Ile Phe Asp
275 280 285
Phe Ser Leu Thr Lys Glu Glu Met Lys Asp Ile Glu Ala Leu Asn Lys
290 295 300
Asn Val Arg Phe Val Glu Met Leu Met Trp Ser Asp His Pro Glu Tyr
305 310 315 320
Pro Phe His Asp Glu Tyr
325
<210> 94
<211> 981
<212> DNA
<213> Rattus norvegicus
<400> 94
atgaatttgt ctactgctaa ccatcatatc ccattgaacg atggtaactc aatcccaatc 60
attggtttgg gtacatattc tgatccaaga ccagttccag gtaaaacttt tattgctgtt 120
aagacagcaa tcgatgaagg ttacagacat attgatggtg cttatgttta cagaaatgaa 180
catgaagttg gtgaagctat tagagaaaaa gttgcagagg gtaaagttaa gagagaagaa 240
attttctatt gtggtaaatt gtggtcaact gatcatgatc cagaaatggt tagaccagct 300
ttggaaagaa ctttacaaac attgaagttg gattacatcg atttgtacat catcgaaatg 360
ccaatggctt ttaaaccagg tgaagaattc tacccaaaag atgaaaacgg tagagttata 420
tatcataagt ctaatttgtg tgctacttgg gaagctttag aagcatgtaa agatgcaggt 480
ttggttaagt ctttgggtgt ttcaaacttc aacagaagac aattggaagt tattttgaat 540
aagccaggtt taaagtacaa gccagttaca aaccaagttg aatgtcatcc atacttcact 600
caaacaaagt tgttggaagt ttctgcttct tcaatgactt cttttattgt tgcatattct 660
ccattgggta catgtagaaa tccattatgg gttaatgttt cttcaccacc attgttgaag 720
gatgaattgt tgacttcatt gggtaaaaag tacaataaga cacaagctca aatcgttttg 780
agattcgata tccaaagagg tttagttgtt attccaaagt caactacacc agaaagaatt 840
aaagaaaact tccaaatctt cgatttttct ttaacaaaag aagaaatgaa agatattgaa 900
gcattgaata agaatgttag atttgttgaa atgttaatgt ggtctgatca tccagaatat 960
ccatttcatg atgaatacta a 981
<210> 95
<211> 326
<212> PRT
<213> Oryctolagus cuniculus
<400> 95
Met Asp Leu Ser Ala Thr Asn His Arg Ile Pro Leu Gly Asp Gly Asn
1 5 10 15
Ser Ile Pro Ile Ile Gly Leu Gly Thr Tyr Ser Glu Pro Lys Thr Thr
20 25 30
Pro Lys Gly Ser Cys Ala Thr Ser Val Lys Ile Ala Ile Asp Thr Gly
35 40 45
Tyr Arg His Ile Asp Gly Ala Tyr Ile Tyr Gln Asn Glu His Glu Val
50 55 60
Gly Glu Thr Phe Arg Glu Lys Ile Ala Glu Gly Lys Val Arg Arg Glu
65 70 75 80
Asp Ile Phe Tyr Cys Gly Lys Leu Trp Ala Thr Asn His Asp Pro Val
85 90 95
Met Val Arg Pro Thr Leu Glu Arg Thr Leu Lys Val Leu Lys Leu Asp
100 105 110
Tyr Ile Asp Leu Tyr Ile Ile Glu Ile Pro Met Ala Phe Lys Pro Gly
115 120 125
Asp Val Val Tyr Pro Arg Asp Glu Asn Gly Lys Trp Leu Tyr His Lys
130 135 140
Thr Asn Leu Cys Ala Thr Trp Glu Ala Leu Glu Ala Cys Lys Asp Ala
145 150 155 160
Gly Leu Val Lys Ser Leu Gly Val Ser Asn Phe Asn Arg Gln Gln Leu
165 170 175
Glu Leu Leu Leu Asn Lys Pro Gly Leu Lys His Lys Pro Val Cys Asn
180 185 190
Gln Val Glu Cys His Pro Tyr Phe Thr Gln Pro Lys Leu Leu Lys Phe
195 200 205
Cys Gln Gln His Asp Ile Ile Ile Val Ala Tyr Ser Pro Leu Gly Thr
210 215 220
Cys Arg Asn Pro Met Trp Val Asn Thr Ser Leu Pro Pro Leu Leu Lys
225 230 235 240
Asp Thr Leu Leu Asn Ser Leu Gly Lys Lys Tyr Lys Lys Thr Ala Ala
245 250 255
Gln Ile Val Leu Arg Phe Asn Val Gln Arg Gly Val Val Val Ile Pro
260 265 270
Lys Ser Phe Asn Pro Glu Arg Ile Lys Glu Asn Phe Gln Ile Phe Asp
275 280 285
Phe Ser Leu Thr Glu Glu Glu Met Lys Asp Ile Glu Ala Leu Asn Lys
290 295 300
Asn Val Arg Tyr Val Glu Leu Leu Met Trp Arg Asp His Pro Glu Tyr
305 310 315 320
Pro Phe Asn Asp Glu Tyr
325
<210> 96
<211> 981
<212> DNA
<213> Oryctolagus cuniculus
<400> 96
atggatttgt ctgctacaaa tcatagaatt ccattgggtg acggtaactc tatcccaatc 60
atcggtttgg gtacttattc agaaccaaaa actacaccaa aaggttcttg tgctacttca 120
gttaagatcg caatcgatac aggttacaga catatcgatg gtgcatacat ctatcaaaac 180
gaacatgaag ttggtgaaac ttttagagaa aagattgctg agggtaaagt tagaagagaa 240
gatattttct attgtggtaa attgtgggca actaatcatg atccagttat ggttagacca 300
actttggaaa gaacattgaa ggttttgaag ttggattata ttgatttgta catcatcgaa 360
atcccaatgg cttttaaacc aggtgacgtt gtttacccaa gagatgaaaa cggtaaatgg 420
ttgtaccata agactaattt gtgtgctaca tgggaagctt tggaagcttg taaggatgca 480
ggtttagtta aatctttggg tgtttcaaac ttcaacagac aacaattgga attgttgttg 540
aataagccag gtttgaagca taagccagtt tgtaaccaag ttgaatgtca tccatacttc 600
acacaaccaa agttattgaa gttttgtcaa caacatgata tcatcatcgt tgcttactca 660
ccattaggta cttgtagaaa tccaatgtgg gttaacacat ctttaccacc attattgaag 720
gatactttgt tgaactcatt gggtaaaaag tacaagaaaa ctgctgcaca aatcgttttg 780
agattcaatg ttcaaagagg tgttgttgtt attccaaaat cttttaatcc agaaagaatt 840
aaagaaaact tccaaatctt cgatttttca ttaactgaag aagaaatgaa ggatatcgaa 900
gcattgaata agaacgttag atacgttgaa ttattgatgt ggagagatca tccagaatac 960
ccttttaatg atgaatacta a 981
<210> 97
<211> 322
<212> PRT
<213> Rattus norvegicus
<400> 97
Met Asp Ser Ile Ser Leu Arg Val Ala Leu Asn Asp Gly Asn Phe Ile
1 5 10 15
Pro Val Leu Gly Phe Gly Thr Thr Val Pro Glu Lys Val Ala Lys Asp
20 25 30
Glu Val Ile Lys Ala Thr Lys Ile Ala Ile Asp Asn Gly Phe Arg His
35 40 45
Phe Asp Ser Ala Tyr Leu Tyr Glu Val Glu Glu Glu Val Gly Gln Ala
50 55 60
Ile Arg Ser Lys Ile Glu Asp Gly Thr Val Lys Arg Glu Asp Ile Phe
65 70 75 80
Tyr Thr Ser Lys Leu Trp Ser Thr Phe His Arg Pro Glu Leu Val Arg
85 90 95
Thr Cys Leu Glu Lys Thr Leu Lys Ser Thr Gln Leu Asp Tyr Val Asp
100 105 110
Leu Tyr Ile Ile His Phe Pro Met Ala Leu Gln Pro Gly Asp Ile Phe
115 120 125
Phe Pro Arg Asp Glu His Gly Lys Leu Leu Phe Glu Thr Val Asp Ile
130 135 140
Cys Asp Thr Trp Glu Ala Met Glu Lys Cys Lys Asp Ala Gly Leu Ala
145 150 155 160
Lys Ser Ile Gly Val Ser Asn Phe Asn Cys Arg Gln Leu Glu Arg Ile
165 170 175
Leu Asn Lys Pro Gly Leu Lys Tyr Lys Pro Val Cys Asn Gln Val Glu
180 185 190
Cys His Leu Tyr Leu Asn Gln Ser Lys Met Leu Asp Tyr Cys Lys Ser
195 200 205
Lys Asp Ile Ile Leu Val Ser Tyr Cys Thr Leu Gly Ser Ser Arg Asp
210 215 220
Lys Thr Trp Val Asp Gln Lys Ser Pro Val Leu Leu Asp Asp Pro Val
225 230 235 240
Leu Cys Ala Ile Ala Lys Lys Tyr Lys Gln Thr Pro Ala Leu Val Ala
245 250 255
Leu Arg Tyr Gln Leu Gln Arg Gly Val Val Pro Leu Ile Arg Ser Phe
260 265 270
Asn Ala Lys Arg Ile Lys Glu Leu Thr Gln Val Phe Glu Phe Gln Leu
275 280 285
Ala Ser Glu Asp Met Lys Ala Leu Asp Gly Leu Asn Arg Asn Phe Arg
290 295 300
Tyr Asn Asn Ala Lys Tyr Phe Asp Asp His Pro Asn His Pro Phe Thr
305 310 315 320
Asp Glu
<210> 98
<211> 969
<212> DNA
<213> Rattus norvegicus
<400> 98
atggattcta tctcattgag agttgctttg aacgatggta acttcatccc agttttgggt 60
tttggtacta cagttccaga aaaggttgca aaggatgaag ttattaaagc tactaagatt 120
gcaattgata acggttttag acatttcgat tctgcttatt tgtacgaagt tgaagaagaa 180
gttggtcaag caatcagatc aaagatcgaa gatggtactg ttaagagaga agatattttc 240
tatacttcta agttgtggtc aacattccat agaccagaat tagttagaac atgtttggaa 300
aagactttga agtctacaca attggattac gttgatttgt acatcatcca tttcccaatg 360
gctttgcaac caggtgacat tttctttcca agagatgaac atggtaaatt gttgttcgaa 420
actgttgata tctgtgatac atgggaagca atggaaaagt gtaaggatgc tggtttggca 480
aagtctatcg gtgtttcaaa cttcaactgt agacaattgg aaagaatttt aaataagcca 540
ggtttgaagt acaagccagt ttgtaaccaa gttgaatgtc atttgtattt gaatcaatct 600
aaaatgttgg attactgtaa gtctaaggat atcattttgg tttcatactg tactttaggt 660
tcttcaagag ataaaacatg ggttgatcaa aaatcaccag ttttgttaga tgatccagtt 720
ttgtgtgcta tcgctaagaa atacaagcaa actccagctt tggttgcatt aagataccaa 780
ttgcaaagag gtgttgttcc attgatcaga tcttttaatg ctaagagaat taaagaattg 840
acacaagttt tcgaattcca attggcttca gaagatatga aggcattgga tggtttgaac 900
agaaacttca gatacaacaa tgctaaatac tttgatgatc atccaaatca tccttttact 960
gatgaataa 969
<210> 99
<211> 323
<212> PRT
<213> Homo sapiens
<400> 99
Met Asp Pro Lys Tyr Gln Arg Val Glu Leu Asn Asp Gly His Phe Met
1 5 10 15
Pro Val Leu Gly Phe Gly Thr Tyr Ala Pro Pro Glu Val Pro Arg Asn
20 25 30
Arg Ala Val Glu Val Thr Lys Leu Ala Ile Glu Ala Gly Phe Arg His
35 40 45
Ile Asp Ser Ala Tyr Leu Tyr Asn Asn Glu Glu Gln Val Gly Leu Ala
50 55 60
Ile Arg Ser Lys Ile Ala Asp Gly Ser Val Lys Arg Glu Asp Ile Phe
65 70 75 80
Tyr Thr Ser Lys Leu Trp Cys Thr Phe Phe Gln Pro Gln Met Val Gln
85 90 95
Pro Ala Leu Glu Ser Ser Leu Lys Lys Leu Gln Leu Asp Tyr Val Asp
100 105 110
Leu Tyr Leu Leu His Phe Pro Met Ala Leu Lys Pro Gly Glu Thr Pro
115 120 125
Leu Pro Lys Asp Glu Asn Gly Lys Val Ile Phe Asp Thr Val Asp Leu
130 135 140
Ser Ala Thr Trp Glu Val Met Glu Lys Cys Lys Asp Ala Gly Leu Ala
145 150 155 160
Lys Ser Ile Gly Val Ser Asn Phe Asn Cys Arg Gln Leu Glu Met Ile
165 170 175
Leu Asn Lys Pro Gly Leu Lys Tyr Lys Pro Val Cys Asn Gln Val Glu
180 185 190
Cys His Pro Tyr Leu Asn Gln Ser Lys Leu Leu Asp Phe Cys Lys Ser
195 200 205
Lys Asp Ile Val Leu Val Ala His Ser Ala Leu Gly Thr Gln Arg His
210 215 220
Lys Leu Trp Val Asp Pro Asn Ser Pro Val Leu Leu Glu Asp Pro Val
225 230 235 240
Leu Cys Ala Leu Ala Lys Lys His Lys Gln Thr Pro Ala Leu Ile Ala
245 250 255
Leu Arg Tyr Gln Leu Gln Arg Gly Val Val Val Leu Ala Lys Ser Tyr
260 265 270
Asn Glu Gln Arg Ile Arg Glu Asn Ile Gln Val Phe Glu Phe Gln Leu
275 280 285
Thr Ser Glu Asp Met Lys Val Leu Asp Gly Leu Asn Arg Asn Tyr Arg
290 295 300
Tyr Val Val Met Asp Phe Leu Met Asp His Pro Asp Tyr Pro Phe Ser
305 310 315 320
Asp Glu Tyr
<210> 100
<211> 972
<212> DNA
<213> Homo sapiens
<400> 100
atggacccaa agtaccaaag agttgaattg aacgatggtc atttcatgcc agttttaggt 60
tttggtactt acgctccacc agaagttcca agaaacagag cagttgaagt tacaaaattg 120
gctattgaag caggttttag acatatcgat tctgcttatt tgtacaacaa cgaagaacaa 180
gttggtttag ctatcagatc aaagattgca gatggttcag ttaagagaga agatattttc 240
tatacttcaa aattgtggtg tactttcttt caaccacaaa tggttcaacc agctttggaa 300
tcttctttga agaaattgca attggattat gttgatttgt acttgttaca ttttccaatg 360
gcattgaaac caggtgaaac tccattacca aaggatgaaa acggtaaagt tattttcgat 420
actgttgatt tgtctgcaac atgggaagtt atggaaaagt gtaaggatgc tggtttagca 480
aagtctatcg gtgtttcaaa cttcaactgt agacaattgg aaatgatctt gaataagcca 540
ggtttgaagt acaaaccagt ttgtaaccaa gttgaatgtc atccatactt aaaccaatct 600
aaattgttag atttttgtaa gtcaaaggat attgttttgg ttgctcattc tgcattaggt 660
actcaaagac ataaattgtg ggttgatcca aattcaccag ttttgttaga agatccagtt 720
ttgtgtgctt tggctaagaa acataaacaa acaccagctt tgattgcatt aagataccaa 780
ttgcaaagag gtgttgttgt tttagctaaa tcttacaatg aacaaagaat tagagaaaac 840
atccaagttt tcgaatttca attgacatca gaagatatga aagttttgga tggtttgaac 900
agaaactata gatacgttgt tatggatttc ttgatggatc atccagatta tccattttca 960
gatgaatact aa 972
<210> 101
<211> 323
<212> PRT
<213> Macaca fuscata
<400> 101
Met Asp Pro Lys Tyr Gln Arg Val Ala Leu Asn Asp Gly His Phe Met
1 5 10 15
Pro Val Leu Gly Phe Gly Ser Tyr Ala Pro Pro Glu Val Pro Arg Asn
20 25 30
Arg Val Val Glu Val Thr Lys Leu Ala Ile Glu Ala Gly Phe Arg His
35 40 45
Ile Asp Ser Ala Tyr Leu Tyr Asn Asn Glu Glu Gln Val Gly Leu Ala
50 55 60
Ile Arg Ser Lys Ile Ala Asp Gly Ser Val Lys Arg Glu Asp Ile Phe
65 70 75 80
Tyr Thr Ser Lys Leu Trp Cys Thr Phe Phe Arg Pro Gln Leu Val Gln
85 90 95
Pro Ala Leu Glu Ser Ser Leu Lys Lys Leu Gln Leu Asp Tyr Val Asp
100 105 110
Leu Tyr Leu Ile His Phe Pro Met Ala Leu Lys Pro Gly Glu Thr Pro
115 120 125
Leu Pro Lys Asp Glu Asn Gly Lys Val Met Phe Asp Thr Val Asp Leu
130 135 140
Cys Ala Ile Trp Glu Ala Met Glu Lys Cys Lys Asp Ala Gly Leu Ala
145 150 155 160
Lys Ser Ile Gly Val Ser Asn Phe Asn Arg Arg Gln Leu Glu Met Ile
165 170 175
Leu Asn Asn Pro Gly Leu Lys Tyr Lys Pro Val Cys Asn Gln Val Glu
180 185 190
Cys His Pro Tyr Leu Asn Gln Ser Lys Leu Leu Asp Phe Cys Lys Ser
195 200 205
Lys Asp Ile Val Leu Val Ala His Ser Ala Leu Gly Thr Gln Arg His
210 215 220
Lys Leu Trp Val Asp Gln Asn Ser Pro Ala Leu Leu Glu Asp Pro Val
225 230 235 240
Leu Cys Ala Leu Ala Lys Lys His Lys Arg Ser Pro Ala Leu Ile Ala
245 250 255
Leu Arg Tyr Gln Leu Gln Arg Gly Val Val Val Leu Ala Lys Ser Tyr
260 265 270
Asn Glu Gln Arg Ile Arg Glu Asn Val Gln Val Phe Glu Phe Gln Leu
275 280 285
Thr Ser Glu Asp Met Lys Val Leu Asp Asp Leu Asn Arg Asn Phe Arg
290 295 300
Tyr Val Val Met Asp Phe Leu Val Asp His Pro Asp Tyr Pro Phe Ser
305 310 315 320
Asp Glu Tyr
<210> 102
<211> 972
<212> DNA
<213> Macaca fuscata
<400> 102
atggacccaa aatatcaaag agttgctttg aatgatggtc attttatgcc agttttaggt 60
tttggttctt acgcaccacc agaagttcca agaaacagag ttgttgaagt tactaaattg 120
gctattgaag caggttttag acatatcgat tcagcttatt tgtacaacaa cgaagaacaa 180
gttggtttag ctatcagatc aaagattgca gatggttcag ttaagagaga agatattttc 240
tatacttcta aattgtggtg tactttcttt agaccacaat tagttcaacc agctttggaa 300
tcttctttga agaaattgca attggattac gttgatttgt acttaatcca tttcccaatg 360
gcattgaagc caggtgaaac tccattacca aaggatgaaa acggtaaagt tatgttcgat 420
acagttgatt tgtgtgctat ttgggaagca atggaaaagt gtaaggatgc tggtttagca 480
aagtctattg gtgtttcaaa ttttaataga agacaattgg aaatgatctt gaacaaccca 540
ggtttgaagt acaaaccagt ttgtaaccaa gttgaatgtc atccatactt aaaccaatct 600
aaattgttag atttttgtaa gtcaaaggat attgttttgg ttgctcattc tgcattaggt 660
actcaaagac ataaattgtg ggttgatcaa aattcaccag ctttgttaga agatccagtt 720
ttgtgtgctt tggctaagaa acataaaaga tcaccagctt tgattgcatt aagataccaa 780
ttgcaaagag gtgttgttgt tttagcaaaa tcttacaatg aacaaagaat tagagaaaac 840
gttcaagttt tcgaatttca attgacatca gaagatatga aggttttgga tgatttgaac 900
agaaacttca gatacgttgt tatggatttc ttggttgatc atccagatta tccattttca 960
gatgaatact aa 972
<210> 103
<211> 323
<212> PRT
<213> Bos taurus
<400> 103
Met Asp Pro Lys Gly Gln Lys Val Lys Leu Asn Asp Gly His Phe Ile
1 5 10 15
Pro Val Leu Gly Phe Gly Thr Tyr Ala Pro Gln Glu Val Ala Lys Arg
20 25 30
Asp Ala Leu Glu Phe Thr Pro Phe Ala Ile Glu Val Gly Phe Arg His
35 40 45
Ile Asp Cys Ala His Ala Tyr Gln Asn Glu Glu Gln Ile Gly Gln Ala
50 55 60
Ile Arg Ser Lys Met Ala Asp Gly Thr Val Lys Arg Glu Asp Ile Phe
65 70 75 80
Cys Thr Ser Lys Leu Trp Cys Thr Ser Phe Arg Pro Glu Leu Val Arg
85 90 95
Pro Ala Leu Glu Lys Ser Leu Lys Ser Leu Gln Leu Asp Tyr Val Asp
100 105 110
Leu Tyr Ile Met His Tyr Pro Leu Ala Leu Lys Pro Gly Glu Glu Leu
115 120 125
Tyr Pro Lys Asp Glu Asn Gly Lys Leu Ile Ala Asp Ser Val Asp Phe
130 135 140
Cys Leu Thr Trp Glu Ala Leu Glu Lys Cys Lys Asp Ala Gly Leu Ala
145 150 155 160
Lys Ser Ile Gly Val Ser Asn Phe Asn His Lys Gln Leu Glu Lys Ile
165 170 175
Leu Asn Lys Pro Gly Leu Lys Tyr Lys Pro Val Cys Asn Gln Val Glu
180 185 190
Cys His Pro Tyr Leu Asn Gln Arg Lys Leu Leu Asp Phe Cys Lys Ser
195 200 205
His Asp Ile Val Leu Val Ala Tyr Ser Ala Leu Gly Ser Gln Arg Val
210 215 220
Lys Gly Trp Val Asn Pro Asn His Pro Val Leu Leu Glu Asp Pro Val
225 230 235 240
Leu Ser Ala Ile Ala Gln Lys His Lys Lys Thr Ala Ala Leu Val Ala
245 250 255
Leu Arg Tyr Gln Ile Gln Arg Gly Val Val Val Leu Ala Lys Gly Asn
260 265 270
Asn Lys Glu Trp Ile Lys Glu Asn Met Gln Val Phe Asp Phe Glu Leu
275 280 285
Thr Pro Glu Asp Met Lys Ala Ile Asp Gly Leu Asn Arg Asn Ile Arg
290 295 300
Tyr Cys Asp Phe His Pro Gly Val Gly His Pro Glu Phe Pro Phe Ser
305 310 315 320
Glu Glu Tyr
<210> 104
<211> 972
<212> DNA
<213> Bos taurus
<400> 104
atggacccaa agggtcaaaa ggttaaattg aacgatggtc atttcattcc agttttgggt 60
ttcggtactt acgctccaca agaagttgct aaaagagatg ctttggagtt tactccattc 120
gcaatcgaag ttggttttag acatatcgat tgtgctcatg catatcaaaa cgaagaacaa 180
atcggtcaag ctatcagatc aaagatggca gatggtactg ttaagagaga agatattttc 240
tgtacttcta aattgtggtg tacttctttt agaccagaat tagttagacc agctttggaa 300
aaatctttaa aatcattgca attggattat gttgatttgt acatcatgca ttacccattg 360
gctttgaagc caggtgaaga attgtaccca aaggatgaaa acggtaaatt aatcgctgat 420
tcagttgatt tttgtttgac atgggaagca ttagaaaagt gtaaggatgc tggtttagca 480
aagtctattg gtgtttcaaa cttcaaccat aagcaattgg aaaagatttt gaataagcca 540
ggtttgaagt acaaaccagt ttgtaaccaa gttgaatgtc atccatattt gaaccaaaga 600
aaattgttag atttttgtaa gtctcatgat attgttttgg ttgcttactc tgcattaggt 660
tcacaaagag ttaaaggttg ggttaatcca aatcatccag ttttgttaga agatccagtt 720
ttgtcagcta ttgcacaaaa acataagaaa actgctgctt tggttgcttt aagataccaa 780
attcaaagag gtgttgttgt tttagcaaag ggtaacaata aggaatggat caaggaaaac 840
atgcaagttt tcgatttcga attgacacca gaagatatga aagctatcga tggtttgaac 900
agaaacatca gatactgtga ttttcatcca ggtgttggtc atccagaatt tccattttct 960
gaagaatatt aa 972
<210> 105
<211> 323
<212> PRT
<213> Callithrix jacchus
<400> 105
Met Asp Ser Lys His Arg Cys Met Lys Leu Asn Asp Gly His Phe Met
1 5 10 15
Pro Val Leu Gly Phe Gly Thr Tyr Ala Pro Ala Glu Val Pro Lys Ser
20 25 30
Lys Ala Ala Glu Ala Thr Lys Trp Ala Ile Glu Ala Gly Phe Arg His
35 40 45
Ile Asp Ser Ala His Cys Tyr Asn Asn Glu Glu His Val Gly Leu Ala
50 55 60
Ile Arg Asn Lys Ile Ala Asp Gly Ser Val Lys Arg Asp Asp Ile Phe
65 70 75 80
Tyr Thr Ser Lys Leu Trp Cys Thr Ser His Arg Pro Glu Leu Val Arg
85 90 95
Pro Ala Leu Glu Arg Ser Leu Arg Lys Leu Gln Leu Asp Tyr Val Asp
100 105 110
Leu Tyr Leu Ile His Phe Pro Val Ser Leu Lys Pro Ser Glu Glu Leu
115 120 125
Ile Pro Lys Asp Glu Asn Gly Lys Ile Leu Leu Asp Thr Val Asp Leu
130 135 140
Cys Ala Thr Trp Glu Ala Met Glu Lys Cys Lys Asp Ala Gly Leu Ala
145 150 155 160
Lys Ser Ile Gly Val Ser Asn Phe Asn Arg Arg Gln Leu Glu Met Ile
165 170 175
Leu Asn Lys Pro Gly Leu Arg Tyr Lys Pro Val Cys Asn Gln Val Glu
180 185 190
Cys His Pro Tyr Leu Asn Gln Arg Lys Leu Leu Asp Phe Cys Lys Ser
195 200 205
Lys Asp Ile Val Leu Val Ala Tyr Ser Ala Leu Gly Ser His Arg Glu
210 215 220
Lys Ala Trp Val Asp Gln Asn Cys Pro Val Leu Leu Glu Asp Pro Val
225 230 235 240
Leu Cys Ala Leu Ala Lys Lys His Lys Arg Ser Pro Ala Leu Ile Ala
245 250 255
Leu Arg Tyr Gln Leu Gln Arg Gly Ala Val Val Leu Ala Lys Ser Tyr
260 265 270
Asn Glu Gln Arg Ile Arg Glu Asn Met Gln Val Phe Glu Phe Gln Leu
275 280 285
Thr Ser Glu Asp Met Lys Thr Ile Asp Gly Leu Asn Lys Asn Val Arg
290 295 300
Tyr Ile Thr Leu His Val Leu Ala Asp His Pro Ser Tyr Pro Phe Ser
305 310 315 320
Asp Glu Tyr
<210> 106
<211> 972
<212> DNA
<213> Callithrix jacchus
<400> 106
atggattcta agcatagatg tatgaaattg aacgatggtc atttcatgcc agttttaggt 60
tttggtactt atgctccagc agaagttcca aaatctaaag ctgcagaagc tacaaaatgg 120
gctattgaag caggttttag acatatcgat tcagcacatt gttacaacaa cgaagaacat 180
gttggtttgg ctattagaaa taagattgca gatggttctg ttaagagaga tgatattttc 240
tatacttcta aattgtggtg tacatcacat agaccagaat tagttagacc agctttggaa 300
agatcattaa gaaaattgca attggattat gttgatttgt acttaatcca tttcccagtt 360
tctttgaagc catcagaaga attaatccca aaggatgaaa acggtaaaat tttgttagat 420
actgttgatt tgtgtgctac atgggaagca atggaaaagt gtaaggatgc tggtttagca 480
aagtctatcg gtgtttcaaa cttcaacaga agacaattgg aaatgatctt gaataagcca 540
ggtttgagat acaaaccagt ttgtaaccaa gttgaatgtc atccatactt aaaccaaaga 600
aaattgttag atttttgtaa atctaaagat attgttttgg ttgcttattc tgcattaggt 660
tcacatagag aaaaagcatg ggttgatcaa aattgtccag ttttgttaga agatccagtt 720
ttgtgtgctt tggctaagaa acataaaaga tcaccagctt tgattgcatt aagataccaa 780
ttgcaaagag gtgctgttgt tttagcaaaa tcttacaacg aacaaagaat tagagaaaac 840
atgcaagttt tcgaatttca attgacttca gaagatatga aaacaattga tggtttgaat 900
aagaacgtta gatacatcac tttgcatgtt ttagctgatc atccatctta tccattttca 960
gatgaatact aa 972
<210> 107
<211> 326
<212> PRT
<213> Bactrocera latifrons
<400> 107
Met Ala Phe Asn Lys Phe Leu Arg Leu Ser Asn Gly Pro Asp Met Pro
1 5 10 15
Ala Phe Gly Leu Arg Leu Tyr Gln Val Lys Arg Asp Asp Val Ser Val
20 25 30
Val Leu Asn Asp Ala Ile Glu Ala Gly Tyr Arg Leu Phe Glu Thr Ser
35 40 45
Pro Ser Tyr Asn Asn Gln Asn Asp Val Gly Asp Val Leu Thr Ala Trp
50 55 60
Leu Lys Gly Asn Lys Ile Lys Arg Glu Glu Leu Phe Ile Val Thr Asn
65 70 75 80
Leu Pro Val Ser Asn Asn Arg Pro His Glu Val Glu Asp Thr Leu Lys
85 90 95
Glu Ser Leu Arg Lys Leu Gln Leu Asp Tyr Val Asp Leu Tyr Leu Val
100 105 110
Glu Ala Pro Phe Ala Ile Lys Met Glu Asn Glu Glu Val Phe Lys Arg
115 120 125
Asp Ser Ala Gly Asn Ala Leu Leu Glu Glu Ala Thr Asp His Val Ala
130 135 140
Ile Trp Glu Ile Met Glu Glu Leu Met Ser Thr Gly Leu Thr Lys Ser
145 150 155 160
Ile Gly Leu Gly Asn Phe Asn Val Asp Gln Ile Gln His Ile Val Glu
165 170 175
Thr Arg Lys Met Ile Pro His Val Leu Gln Ile Glu Tyr His Val Tyr
180 185 190
Leu Gln Gln Pro Glu Leu Ile Asp Tyr Cys Arg Ser Thr Asn Ile Thr
195 200 205
Leu Leu Thr Tyr Ala Ala Leu Gly Ala Val Asn Lys Pro Asp Lys Tyr
210 215 220
Gln Arg Val Ser Val Leu Gly Lys Asp Glu Ile Pro Ile Leu Asp Leu
225 230 235 240
Pro Glu Leu Arg Glu Ile Ala Ala Thr His Lys Lys Thr Pro Ala Gln
245 250 255
Val Ala Phe Arg Trp Val Ile Asp Lys Lys Met Ala Leu Thr Val Lys
260 265 270
Ser Ser Asn Ala Glu Arg Ile Arg Ser Asn Ile Asp Ile Phe Asp Phe
275 280 285
Ser Leu Thr Lys Glu Glu Met Glu Lys Leu Asn Ala Leu Asn Arg Asn
290 295 300
Arg Arg Phe Val Asp Phe Ser Gln Tyr Lys Gly Ile Glu Lys His Pro
305 310 315 320
Asp Tyr Pro Phe His Met
325
<210> 108
<211> 981
<212> DNA
<213> Bactrocera latifrons
<400> 108
atggctttta ataagttctt gagattgtct aacggtccag atatgccagc attcggtttg 60
agattgtacc aagttaagag agatgatgtt tcagttgttt tgaatgatgc tatcgaagca 120
ggttatagat tattcgaaac atctccatca tacaacaacc aaaacgatgt tggtgacgtt 180
ttgactgctt ggttaaaagg taataagatt aaaagagaag aattgtttat tgttacaaat 240
ttgccagttt ctaataatag accacatgaa gttgaagata ctttgaagga atcattaaga 300
aaattgcaat tagattatgt tgatttgtac ttagttgaag ctccatttgc aattaaaatg 360
gaaaacgaag aagtttttaa aagagattct gctggtaatg cattgttaga agaagctaca 420
gatcatgttg caatttggga aattatggaa gaattgatgt ctactggttt gacaaagtca 480
atcggtttgg gtaacttcaa cgttgatcaa atccaacata ttgttgaaac tagaaaaatg 540
attccacatg ttttgcaaat cgaataccat gtttacttgc aacaaccaga attaatcgat 600
tactgtagat caactaacat cacattgttg acttacgctg cattgggtgc tgttaataag 660
cctgataagt accaaagagt ttcagttttg ggtaaagatg aaatcccaat tttggatttg 720
ccagaattaa gagaaattgc tgcaacacat aagaaaactc cagctcaagt tgcttttaga 780
tgggttattg ataagaaaat ggctttgaca gttaaatctt caaacgcaga aagaattaga 840
tcaaacatcg atattttcga tttttcatta actaaagaag aaatggaaaa attgaatgca 900
ttaaatagaa atagaagatt tgttgatttt tcacaataca agggtatcga aaagcatcca 960
gattacccat ttcacatgta a 981
<210> 109
<211> 323
<212> PRT
<213> Callithrix jacchus
<400> 109
Met Asp Pro Arg Cys Gln Arg Val Glu Leu Asn Asp Gly His Phe Met
1 5 10 15
Pro Val Leu Gly Phe Gly Thr Tyr Ala Pro Pro Glu Val Pro Arg Asn
20 25 30
Arg Val Val Glu Val Thr Lys Phe Ala Ile Glu Ala Gly Phe Arg His
35 40 45
Leu Asp Ser Ala Tyr Ile Tyr Asn Asn Glu Glu Gln Val Gly Leu Ala
50 55 60
Ile Gln Ser Lys Ile Ala Asp Gly Ser Val Lys Arg Glu Asp Ile Phe
65 70 75 80
Cys Thr Ser Lys Leu Trp Cys Thr Ser His Arg Pro Glu Leu Val Gln
85 90 95
Ser Ala Leu Glu Ser Ser Leu Lys Gln Leu Gln Leu Asp Tyr Val Asp
100 105 110
Leu Tyr Leu Val His Phe Pro Val Ala Leu Lys Pro Gly Glu Asp Ile
115 120 125
Leu Pro Lys Asp Glu Asn Gly Lys Val Ile Phe Asp Thr Val Asp Leu
130 135 140
Cys Ala Thr Trp Glu Ala Met Glu Lys Cys Lys Asp Ala Gly Leu Ala
145 150 155 160
Lys Ser Ile Gly Val Ser Asn Phe Asn Arg Arg Gln Leu Glu Met Ile
165 170 175
Leu Asn Lys Pro Gly Leu Arg Tyr Lys Pro Val Cys Asn Gln Val Glu
180 185 190
Cys His Pro Tyr Leu Asn Gln Ser Lys Leu Leu Asp Phe Cys Lys Ser
195 200 205
Lys Asp Ile Val Leu Val Ala His Ser Ala Leu Gly Thr Gln Arg His
210 215 220
Glu Leu Trp Val Asp Gln Ser Ser Pro Val Leu Leu Glu Asp Pro Val
225 230 235 240
Leu Cys Ala Leu Ala Lys Lys His Lys Arg Ser Pro Ala Leu Ile Ala
245 250 255
Leu Arg Tyr Gln Leu Gln Arg Gly Val Val Val Leu Ala Lys Ser Tyr
260 265 270
Asn Glu Gln Arg Ile Arg Glu Asn Val Gln Val Ser Glu Phe Gln Leu
275 280 285
Ser Ser Ala Asp Met Lys Val Leu Asp Gly Leu Asn Arg Asn Phe Arg
290 295 300
Tyr Val Thr Leu Asp Tyr Leu Ala Gly His Pro Asn Tyr Pro Phe Arg
305 310 315 320
Asp Phe Phe
<210> 110
<211> 972
<212> DNA
<213> Callithrix jacchus
<400> 110
atggacccaa gatgtcaaag agttgaattg aacgatggtc atttcatgcc agttttaggt 60
tttggtactt atgctccacc agaagttcca agaaacagag ttgttgaagt tacaaagttc 120
gctatcgaag caggttttag acatttggat tctgcataca tctataacaa cgaagaacaa 180
gttggtttag ctatccaatc taagatcgca gatggttcag ttaagagaga agatattttc 240
tgtacttcta aattgtggtg tacatcacat agaccagaat tagttcaatc tgctttggaa 300
tcttcattaa agcaattgca attggattat gttgatttgt acttagttca ttttccagtt 360
gcattgaaac caggtgaaga tattttacca aaggatgaaa acggtaaagt tattttcgat 420
actgttgatt tgtgtgctac atgggaagca atggaaaagt gtaaggatgc tggtttagca 480
aagtctatcg gtgtttcaaa cttcaacaga agacaattgg aaatgatctt gaataagcca 540
ggtttgagat acaaaccagt ttgtaaccaa gttgaatgtc atccatactt aaaccaatct 600
aaattgttag atttttgtaa gtcaaaggat attgttttgg ttgctcattc tgcattaggt 660
actcaaagac atgaattgtg ggttgatcaa tcttcaccag ttttgttaga agatccagtt 720
ttgtgtgctt tggctaagaa acataaaaga tcaccagctt tgattgcatt aagataccaa 780
ttgcaaagag gtgttgttgt tttagctaaa tcttacaatg aacaaagaat tagagaaaac 840
gttcaagttt cagaatttca attatcttca gctgatatga aagttttgga tggtttgaac 900
agaaacttca gatacgttac attggattac ttagcaggtc atccaaatta cccttttaga 960
gatttctttt aa 972
<210> 111
<211> 297
<212> PRT
<213> Macaca mulatta
<400> 111
Met Asp Ser Lys His Gln Arg Val Lys Leu Asn Asp Gly His Phe Met
1 5 10 15
Pro Val Leu Gly Phe Gly Thr Tyr Ala Pro Val Glu Val Pro Lys Asp
20 25 30
Lys Ala Leu Glu Ala Thr Lys Leu Ala Ile Glu Val Gly Phe Arg His
35 40 45
Val Asp Cys Ala Tyr Ala Tyr Asn Asn Glu Glu Tyr Val Gly Leu Ala
50 55 60
Ile Arg Ser Lys Ile Ala Asp Gly Thr Val Lys Arg Glu Asp Ile Phe
65 70 75 80
Tyr Thr Ser Lys Leu Trp Cys Asn Ser His Arg Pro Glu Leu Val Arg
85 90 95
Pro Ala Leu Glu Arg Ser Leu Lys Asn Leu Gln Leu Asp Tyr Val Asp
100 105 110
Leu Tyr Leu Ile His Ser Pro Val Ser Leu Lys Ala Met Glu Lys Cys
115 120 125
Lys Asp Ala Gly Leu Ala Lys Ser Ile Gly Val Ser Asn Phe Asn Arg
130 135 140
Arg Gln Leu Glu Met Ile Leu Asn Lys Pro Gly Leu Lys Tyr Lys Pro
145 150 155 160
Val Cys Asn Gln Val Glu Cys His Pro Tyr Phe Asn Gln Arg Lys Leu
165 170 175
Leu Asp Phe Cys Lys Ser Lys Asp Ile Val Leu Val Ala Phe Ser Ala
180 185 190
Leu Gly Ser His Arg Glu Lys Gln Trp Val Asp Gln Asn Ser Pro Val
195 200 205
Leu Leu Glu Asp Pro Val Leu Cys Ala Leu Ala Lys Lys His Lys Gln
210 215 220
Thr Pro Ala Leu Ile Ala Leu Arg Tyr Gln Leu Gln Arg Gly Val Val
225 230 235 240
Val Leu Ala Lys Ser Tyr Thr Glu Gln Arg Ile Arg Glu Asn Met Lys
245 250 255
Val Phe Glu Phe Gln Leu Thr Ser Glu Asp Met Lys Ala Ile Asp Gly
260 265 270
Leu Asp Arg Asn Ile Arg Tyr Leu Thr Leu Asp Ile Leu Ala Asp Ser
275 280 285
Pro Asn Tyr Pro Tyr Ser Asp Glu Tyr
290 295
<210> 112
<211> 894
<212> DNA
<213> Macaca mulatta
<400> 112
atggattcta agcatcaaag agttaaattg aacgatggtc atttcatgcc agttttaggt 60
tttggtactt atgctccagt tgaagttcca aaggataaag cattggaagc aacaaaatta 120
gcaatcgaag ttggttttag acatgttgat tgtgcttatg catacaacaa cgaagaatac 180
gttggtttgg ctatcagatc aaagattgca gatggtactg ttaagagaga agatattttc 240
tatacatcta aattgtggtg taactcacat agaccagaat tagttagacc agctttggaa 300
agatcattga aaaatttgca attggattat gttgatttgt acttaatcca ttctccagtt 360
tcattgaagg caatggaaaa gtgtaaggat gctggtttag caaagtctat cggtgtttca 420
aacttcaaca gaagacaatt ggaaatgatc ttgaataagc caggtttgaa gtacaaacca 480
gtttgtaacc aagttgaatg tcatccatac ttcaaccaaa gaaaattgtt agatttttgt 540
aaatctaaag atattgtttt ggttgctttt tctgcattag gttcacatag agaaaagcaa 600
tgggttgatc aaaattcacc agttttgtta gaagatccag ttttgtgtgc tttggctaag 660
aaacataaac aaactccagc tttgattgca ttaagatacc aattgcaaag aggtgttgtt 720
gttttagcta aatcttacac tgaacaaaga attagagaaa acatgaaggt tttcgaattt 780
caattgacat cagaagatat gaaggctatc gatggtttag atagaaacat cagatatttg 840
acattagata ttttggcaga ttctccaaac tatccatact cagatgaata ctaa 894
<210> 113
<211> 309
<212> PRT
<213> Macaca mulatta
<400> 113
Met Asp Ser Lys His Gln Arg Val Lys Leu Asn Asp Gly His Phe Met
1 5 10 15
Pro Val Leu Gly Phe Gly Thr Tyr Ala Pro Val Glu Val Pro Lys Asp
20 25 30
Lys Ala Leu Glu Ala Thr Lys Leu Ala Ile Glu Val Gly Phe Arg His
35 40 45
Val Asp Cys Ala Tyr Ala Tyr Asn Asn Glu Glu Tyr Val Gly Leu Ala
50 55 60
Ile Arg Ser Lys Ile Ala Asp Gly Thr Val Lys Arg Glu Asp Ile Phe
65 70 75 80
Tyr Thr Ser Lys Leu Trp Cys Asn Ser His Arg Pro Glu Leu Val Arg
85 90 95
Pro Ala Leu Glu Arg Ser Leu Lys Asn Leu Gln Leu Asp Tyr Val Asp
100 105 110
Leu Tyr Leu Ile His Ser Pro Val Ser Leu Lys Pro Gly Glu Glu Leu
115 120 125
Ile Pro Lys Asp Glu Asn Gly Lys Val Leu Phe Asp Thr Val Asp Leu
130 135 140
Cys Ala Thr Trp Glu Ala Met Glu Lys Cys Lys Asp Ala Gly Leu Ala
145 150 155 160
Lys Ser Ile Gly Val Ser Asn Phe Asn Arg Arg Gln Leu Glu Met Ile
165 170 175
Leu Asn Lys Pro Gly Leu Lys Tyr Lys Pro Val Cys Asn Gln Val Glu
180 185 190
Cys His Pro Tyr Phe Asn Gln Arg Lys Leu Leu Asp Phe Cys Lys Ser
195 200 205
Lys Asp Ile Val Leu Val Ala Phe Ser Ala Leu Gly Ser His Arg Glu
210 215 220
Lys Gln Trp Val Asp Gln Asn Ser Pro Val Leu Leu Glu Asp Pro Val
225 230 235 240
Leu Cys Ala Leu Ala Lys Lys His Lys Gln Thr Pro Ala Leu Ile Ala
245 250 255
Leu Arg Tyr Gln Leu Gln Arg Gly Val Val Val Leu Ala Lys Ser Tyr
260 265 270
Thr Glu Gln Arg Ile Arg Glu Asn Met Lys Val Phe Glu Phe Gln Leu
275 280 285
Thr Ser Glu Asp Met Lys Ala Ile Asp Gly Leu Asp Arg Asn Ile Arg
290 295 300
Tyr Leu Thr Leu Asp
305
<210> 114
<211> 927
<212> DNA
<213> Rhesus macaque (Macaca mulatta)
<400> 114
atggattcta agcatcaaag agttaaattg aacgatggtc atttcatgcc agttttaggt 60
tttggtactt atgctccagt tgaagttcca aaggataaag cattggaagc aacaaaatta 120
gcaatcgaag ttggttttag acatgttgat tgtgcttatg catacaacaa cgaagaatac 180
gttggtttgg ctatcagatc aaagattgca gatggtactg ttaagagaga agatattttc 240
tatacatcta aattgtggtg taactcacat agaccagaat tagttagacc agctttggaa 300
agatcattga aaaatttgca attggattat gttgatttgt acttaatcca ttctccagtt 360
tcattgaagc caggtgaaga attaatccca aaggatgaaa acggtaaagt tttgttcgat 420
actgttgatt tgtgtgctac atgggaagca atggaaaaat gtaaagatgc tggtttggca 480
aagtctatcg gtgtttcaaa cttcaacaga agacaattgg aaatgatctt gaataagcca 540
ggtttgaagt acaaaccagt ttgtaaccaa gttgaatgtc atccatactt caaccaaaga 600
aaattgttag atttttgtaa atctaaagat attgttttgg ttgctttttc tgcattaggt 660
tcacatagag aaaagcaatg ggttgatcaa aattcaccag ttttgttaga agatccagtt 720
ttgtgtgctt tggctaagaa acataaacaa actccagctt tgattgcatt aagataccaa 780
ttgcaaagag gtgttgttgt tttagctaaa tcttacactg aacaaagaat tagagaaaac 840
atgaaggttt tcgaatttca attgacatca gaagatatga aggcaatcga tggtttagat 900
agaaatatta gatacttgac attagat 927
<210> 115
<211> 323
<212> PRT
<213> Rhesus macaque (Macaca mulatta)
<400> 115
Met Asp Pro Lys Tyr Gln Arg Val Ala Leu Asn Asp Gly His Phe Met
1 5 10 15
Pro Val Leu Gly Phe Gly Ser Tyr Ala Pro Pro Glu Val Pro Arg Asn
20 25 30
Arg Val Val Glu Val Thr Lys Leu Ala Ile Glu Ala Gly Phe Arg His
35 40 45
Ile Asp Ser Ala Tyr Leu Tyr Asn Asn Glu Glu Gln Val Gly Leu Ala
50 55 60
Ile Arg Ser Lys Ile Ala Asp Gly Ser Val Lys Arg Glu Asp Ile Phe
65 70 75 80
Tyr Thr Ser Lys Leu Trp Cys Thr Phe Phe Arg Pro Gln Leu Val Gln
85 90 95
Pro Ala Leu Glu Ser Ser Leu Lys Lys Leu Gln Leu Asp Tyr Val Asp
100 105 110
Leu Tyr Leu Ile His Phe Pro Met Ala Leu Lys Pro Gly Glu Thr Pro
115 120 125
Leu Pro Lys Asp Glu Asn Gly Lys Val Met Phe Asp Thr Val Asp Leu
130 135 140
Cys Ala Ile Trp Glu Ala Met Glu Lys Cys Lys Asp Ala Gly Met Ala
145 150 155 160
Lys Ser Ile Gly Val Ser Asn Phe Asn Arg Arg Gln Leu Glu Met Ile
165 170 175
Leu Asn Asn Pro Gly Leu Lys Tyr Lys Pro Val Cys Asn Gln Val Glu
180 185 190
Cys His Pro Tyr Phe Asn Gln Arg Lys Leu Leu Asp Phe Cys Lys Ser
195 200 205
Lys Asp Ile Val Leu Val Ala Phe Ser Ala Leu Gly Ser His Arg Glu
210 215 220
Lys Gln Trp Val Asp Gln Asn Ser Pro Val Leu Leu Glu Asp Pro Val
225 230 235 240
Leu Cys Ala Leu Ala Lys Lys His Lys Gln Thr Pro Ala Leu Ile Ala
245 250 255
Leu Arg Tyr Gln Leu Gln Arg Gly Val Val Val Leu Ala Lys Ser Tyr
260 265 270
Thr Glu Gln Arg Ile Arg Glu Asn Met Lys Val Phe Glu Phe Gln Leu
275 280 285
Thr Ser Glu Asp Met Lys Ala Ile Asp Gly Leu Asp Arg Asn Ile Arg
290 295 300
Tyr Leu Thr Leu Asp Ile Leu Ala Asp Ser Pro Asn Tyr Pro Tyr Ser
305 310 315 320
Asp Glu Tyr
<210> 116
<211> 975
<212> DNA
<213> Rhesus macaque (Macaca mulatta)
<400> 116
atggacccaa aatatcaaag agttgctttg aatgatggtc attttatgcc agttttaggt 60
tttggttctt acgcaccacc agaagttcca agaaacagag ttgttgaagt tactaaattg 120
gctattgaag caggttttag acatatcgat tcagcttatt tgtacaacaa cgaagaacaa 180
gttggtttag ctatcagatc aaagattgca gatggttcag ttaagagaga agatattttc 240
tatacttcta aattgtggtg tactttcttt agaccacaat tagttcaacc agctttggaa 300
tcttctttga agaaattgca attggattac gttgatttgt acttaatcca tttcccaatg 360
gcattgaagc caggtgaaac tccattacca aaggatgaaa acggtaaagt tatgttcgat 420
acagttgatt tgtgtgctat ttgggaagca atggaaaaat gtaaagatgc tggtatggca 480
aaatctattg gtgtttcaaa ttttaataga agacaattgg aaatgatctt gaacaaccca 540
ggtttgaagt acaaaccagt ttgtaaccaa gttgaatgtc atccatactt caaccaaaga 600
aaattgttag atttttgtaa atctaaagat attgttttgg ttgctttttc tgcattaggt 660
tcacatagag aaaagcaatg ggttgatcaa aattcaccag ttttgttaga agatccagtt 720
ttgtgtgctt tggctaagaa acataaacaa actccagctt tgattgcatt aagataccaa 780
ttgcaaagag gtgttgttgt tttagctaaa tcttacactg aacaaagaat tagagaaaac 840
atgaaggttt tcgaatttca attgacatca gaagatatga aggctatcga tggtttagat 900
agaaacatca gatatttgac attagatatt ttggcagatt ctccaaacta cccatactca 960
gatgaatact aataa 975
<210> 117
<211> 139
<212> PRT
<213> Rhesus macaque (Macaca mulatta)
<400> 117
Met Asp Ser Lys His Gln Arg Val Lys Leu Asn Asp Gly His Phe Met
1 5 10 15
Pro Val Leu Gly Phe Gly Thr Tyr Ala Pro Val Glu Val Pro Lys Asp
20 25 30
Lys Ala Leu Glu Ala Thr Lys Leu Ala Ile Glu Val Gly Phe Arg His
35 40 45
Val Asp Cys Ala Tyr Ala Tyr Asn Asn Glu Glu Tyr Val Gly Leu Ala
50 55 60
Ile Arg Ser Lys Ile Ala Asp Gly Thr Val Lys Arg Glu Asp Ile Phe
65 70 75 80
Tyr Thr Ser Lys Leu Trp Cys Asn Ser His Arg Pro Glu Leu Val Arg
85 90 95
Pro Ala Leu Glu Arg Ser Leu Lys Asn Leu Gln Leu Asp Tyr Val Asp
100 105 110
Leu Tyr Leu Ile His Ser Pro Val Ser Leu Lys Glu Asp Ile Gly Ile
115 120 125
Ile Met Trp Lys Lys Ser Pro Lys His Asn Ser
130 135
<210> 118
<211> 420
<212> DNA
<213> Rhesus macaque (Macaca mulatta)
<400> 118
atggattcta agcatcaaag agttaaattg aacgatggtc atttcatgcc agttttaggt 60
tttggtactt atgctccagt tgaagttcca aaggataaag cattggaagc aacaaaatta 120
gcaatcgaag ttggttttag acatgttgat tgtgcttatg catacaacaa cgaagaatac 180
gttggtttgg ctatcagatc aaagattgca gatggtactg ttaagagaga agatattttc 240
tatacatcta aattgtggtg taactcacat agaccagaat tagttagacc agctttggaa 300
agatcattga aaaatttgca attggattat gttgatttgt acttaatcca ttctccagtt 360
tcattaaagg aagatattgg tatcatcatg tggaagaaat ctccaaagca taattcataa 420
<210> 119
<211> 323
<212> PRT
<213> Sumatran orangutan (Pongo abelii)
<400> 119
Met Asp Ser Lys His Gln Cys Val Lys Leu Asn Asp Gly His Phe Met
1 5 10 15
Pro Val Leu Gly Phe Gly Thr Tyr Ala Pro Pro Glu Val Pro Arg Asn
20 25 30
Arg Ala Val Glu Val Thr Lys Leu Ala Ile Glu Ala Gly Phe Arg His
35 40 45
Ile Asp Ser Ala Tyr Leu Tyr Asp Asn Glu Glu Gln Val Gly Leu Ala
50 55 60
Ile Arg Ser Lys Ile Ala Asp Gly Ser Val Lys Arg Glu Asp Ile Phe
65 70 75 80
Tyr Thr Ser Lys Leu Trp Cys Thr Phe Phe Gln Pro Gln Met Val Gln
85 90 95
Pro Ala Leu Glu Ser Ser Leu Lys Lys Leu Gln Leu Asp Tyr Val Asp
100 105 110
Leu Tyr Leu Leu His Phe Pro Met Ala Leu Lys Pro Gly Glu Met Leu
115 120 125
Leu Pro Lys Asp Glu Asn Gly Lys Val Ile Phe Asp Thr Val Asp Leu
130 135 140
Cys Ala Thr Trp Glu Val Met Glu Lys Cys Lys Asp Ala Gly Leu Ala
145 150 155 160
Lys Ser Ile Gly Val Ser Asn Phe Asn Arg Arg Gln Leu Glu Met Ile
165 170 175
Leu Asn Lys Pro Gly Leu Lys Tyr Lys Pro Val Cys Asn Gln Val Glu
180 185 190
Cys His Pro Tyr Leu Asn Gln Ser Lys Leu Leu Asp Phe Cys Lys Ser
195 200 205
Lys Asp Ile Ala Leu Val Ala Tyr Ser Ala Leu Gly Thr Gln Arg His
210 215 220
Glu Leu Trp Val Asp Pro Asn Ser Pro Val Leu Leu Glu Asp Pro Val
225 230 235 240
Leu Cys Ala Leu Ala Lys Lys His Lys Arg Thr Pro Ala Leu Ile Ala
245 250 255
Leu Arg Tyr Gln Leu Gln Arg Gly Val Val Val Leu Ala Lys Ser Tyr
260 265 270
Asn Glu Gln Arg Ile Arg Glu Asn Ile Gln Val Phe Glu Phe Gln Leu
275 280 285
Thr Ser Glu Asp Met Lys Val Leu Asp Gly Leu Asn Arg Asn Tyr Arg
290 295 300
Tyr Ile Val Met Asp Phe Leu Met Asp His Pro Asp Tyr Pro Phe Ser
305 310 315 320
Asp Glu Tyr
<210> 120
<211> 972
<212> DNA
<213> Sumatran orangutan (Pongo abelii)
<400> 120
atggattcta agcatcaatg tgttaaattg aacgatggtc atttcatgcc agttttaggt 60
tttggtactt atgcaccacc agaagttcca agaaacagag ctgttgaagt tacaaaattg 120
gctattgaag caggttttag acatatcgat tctgcatatt tgtacgataa cgaagaacaa 180
gttggtttag caatcagatc aaagattgct gatggttcag ttaagagaga agatattttc 240
tatacttcaa aattgtggtg tactttcttt caaccacaaa tggttcaacc agctttggaa 300
tcttctttga agaaattgca attggattat gttgatttgt acttgttgca tttcccaatg 360
gctttgaagc caggtgaaat gttgttgcca aaggatgaaa acggtaaagt tattttcgat 420
actgttgatt tgtgtgctac atgggaagtt atggaaaagt gtaaggatgc tggtttagca 480
aagtctatcg gtgtttcaaa cttcaacaga agacaattgg aaatgatctt gaataagcca 540
ggtttgaagt acaaaccagt ttgtaaccaa gttgaatgtc atccatactt aaaccaatct 600
aaattgttag atttttgtaa gtcaaaggat attgctttgg ttgcatattc tgctttaggt 660
actcaaagac atgaattgtg ggttgatcca aattcaccag ttttgttaga agatccagtt 720
ttgtgtgctt tggctaagaa acataaaaga acaccagcat tgattgcttt aagataccaa 780
ttgcaaagag gtgttgttgt tttagctaaa tcttacaatg aacaaagaat tagagaaaac 840
atccaagttt tcgaatttca attgacttca gaagatatga aagttttgga tggtttgaac 900
agaaactata gatacatcgt tatggatttc ttgatggatc atccagatta tccattttca 960
gatgaatact aa 972
<210> 121
<211> 322
<212> PRT
<213> Wild boar (Sus scrofa)
<400> 121
Met Ala Leu Asn Arg Cys Val Lys Leu Asn Asp Gly His Leu Met Pro
1 5 10 15
Val Leu Gly Leu Gly Thr Leu Val Ser Glu Gly Val Pro Lys Ser Lys
20 25 30
Ala Gly Glu Ala Thr Arg Val Ala Ile Glu Val Gly Tyr Arg His Ile
35 40 45
Asp Ala Ala Tyr Val Tyr Glu Asn Glu Glu Glu Val Gly Ser Ala Leu
50 55 60
Arg Glu Lys Ile Ala Asp Gly Thr Val Lys Arg Glu Glu Leu Phe Tyr
65 70 75 80
Thr Thr Lys Leu Trp Ala Thr Phe Phe Arg Pro Glu Leu Val Arg Pro
85 90 95
Ala Leu Glu Arg Ser Leu Lys Lys Leu Arg Leu Asp Tyr Val Asp Leu
100 105 110
Phe Ile Ile His Val Pro Ile Thr Met Lys Pro Gly Glu Glu Leu Leu
115 120 125
Pro Lys Asp Ala Ser Gly Lys Val Ile Phe Asp Thr Val Asp Leu Arg
130 135 140
Asp Thr Trp Ala Ala Leu Glu Lys Cys Lys Asp Ala Gly Leu Thr Lys
145 150 155 160
Ser Ile Gly Val Ser Asn Phe Asn His Lys Gln Leu Glu Met Ile Leu
165 170 175
Asn Lys Pro Gly Leu Lys Tyr Lys Pro Val Cys Asn Gln Val Glu Cys
180 185 190
His Pro Tyr Leu Asn Gln Ser Lys Leu Leu Glu Phe Cys Lys Ser Lys
195 200 205
Asp Ile Val Leu Val Ala Tyr Ser Ala Leu Gly Ser Gln Arg Asn Ser
210 215 220
Lys Trp Val Glu Glu Ser Asn Pro Tyr Leu Leu Glu Asp Pro Val Leu
225 230 235 240
Asn Ala Ile Ala Lys Lys His Asn Arg Ser Pro Ala Gln Val Ala Leu
245 250 255
Arg Tyr Gln Leu Gln Arg Gly Val Val Val Leu Ala Lys Ser Phe Asn
260 265 270
Glu Gln Arg Ile Lys Glu Asn Phe Gln Val Phe Asp Phe Glu Leu Pro
275 280 285
Ser Glu Asp Met Lys Thr Ile Asp Gly Leu Asn Gln Asn Leu Arg Tyr
290 295 300
Phe Lys Leu Leu Phe Ala Val Asp His Pro Tyr Tyr Pro Tyr Ser Glu
305 310 315 320
Glu Tyr
<210> 122
<211> 969
<212> DNA
<213> Wild boar (Sus scrofa)
<400> 122
atggctttga atagatgtgt taaattgaac gatggtcatt tgatgccagt tttgggttta 60
ggtactttag tttcagaagg tgttccaaaa tctaaagctg gtgaagcaac aagagttgca 120
attgaagttg gttatagaca tatcgatgct gcttatgttt acgaaaatga agaagaagtt 180
ggttcagctt tgagagaaaa gattgcagat ggtactgtta agagagaaga attgttttat 240
actacaaaat tgtgggctac tttctttaga ccagaattgg ttagaccagc attggaaaga 300
tcattgaaga aattgagatt agattacgtt gatttgttta ttatccatgt tccaattact 360
atgaaaccag gtgaagaatt gttaccaaag gatgcttctg gtaaagttat tttcgatact 420
gttgatttga gagatacatg ggctgcatta gaaaagtgta aggatgcagg tttgacaaag 480
tctattggtg tttcaaactt caaccataag caattggaaa tgatcttgaa taagccaggt 540
ttgaagtaca aaccagtttg taaccaagtt gaatgtcatc catacttaaa ccaatcaaaa 600
ttgttagaat tttgtaaatc taaagatatt gttttggttg cttattctgc attaggttca 660
caaagaaatt ctaagtgggt tgaagaatca aatccatact tgttagaaga tccagttttg 720
aacgctatcg ctaagaaaca taatagatca ccagctcaag ttgcattgag ataccaatta 780
caaagaggtg ttgttgtttt ggctaaatct tttaatgaac aaagaattaa agaaaacttt 840
caagtttttg attttgaatt accatctgaa gatatgaaga ctatcgatgg tttgaaccaa 900
aatttgagat acttcaaatt gttgttcgct gttgatcatc catattaccc atattctgaa 960
gaatactaa 969
<210> 123
<211> 531
<212> PRT
<213> Human (Homo sapiens)
<400> 123
Met Ala Ala Leu Gly Cys Ala Arg Leu Arg Trp Ala Leu Arg Gly Ala
1 5 10 15
Gly Arg Gly Leu Cys Pro His Gly Ala Arg Ala Lys Ala Ala Ile Pro
20 25 30
Ala Ala Leu Pro Ser Asp Lys Ala Thr Gly Ala Pro Gly Ala Gly Pro
35 40 45
Gly Val Arg Arg Arg Gln Arg Ser Leu Glu Glu Ile Pro Arg Leu Gly
50 55 60
Gln Leu Arg Phe Phe Phe Gln Leu Phe Val Gln Gly Tyr Ala Leu Gln
65 70 75 80
Leu His Gln Leu Gln Val Leu Tyr Lys Ala Lys Tyr Gly Pro Met Trp
85 90 95
Met Ser Tyr Leu Gly Pro Gln Met His Val Asn Leu Ala Ser Ala Pro
100 105 110
Leu Leu Glu Gln Val Met Arg Gln Glu Gly Lys Tyr Pro Val Arg Asn
115 120 125
Asp Met Glu Leu Trp Lys Glu His Arg Asp Gln His Asp Leu Thr Tyr
130 135 140
Gly Pro Phe Thr Thr Glu Gly His His Trp Tyr Gln Leu Arg Gln Ala
145 150 155 160
Leu Asn Gln Arg Leu Leu Lys Pro Ala Glu Ala Ala Leu Tyr Thr Asp
165 170 175
Ala Phe Asn Glu Val Ile Asp Asp Phe Met Thr Arg Leu Asp Gln Leu
180 185 190
Arg Ala Glu Ser Ala Ser Gly Asn Gln Val Ser Asp Met Ala Gln Leu
195 200 205
Phe Tyr Tyr Phe Ala Leu Glu Ala Ile Cys Tyr Ile Leu Phe Glu Lys
210 215 220
Arg Ile Gly Cys Leu Gln Arg Ser Ile Pro Glu Asp Thr Val Thr Phe
225 230 235 240
Val Arg Ser Ile Gly Leu Met Phe Gln Asn Ser Leu Tyr Ala Thr Phe
245 250 255
Leu Pro Lys Trp Thr Arg Pro Val Leu Pro Phe Trp Lys Arg Tyr Leu
260 265 270
Asp Gly Trp Asn Ala Ile Phe Ser Phe Gly Lys Lys Leu Ile Asp Glu
275 280 285
Lys Leu Glu Asp Met Glu Ala Gln Leu Gln Ala Ala Gly Pro Asp Gly
290 295 300
Ile Gln Val Ser Gly Tyr Leu His Phe Leu Leu Ala Ser Gly Gln Leu
305 310 315 320
Ser Pro Arg Glu Ala Met Gly Ser Leu Pro Glu Leu Leu Met Ala Gly
325 330 335
Val Asp Thr Thr Ser Asn Thr Leu Thr Trp Ala Leu Tyr His Leu Ser
340 345 350
Lys Asp Pro Glu Ile Gln Glu Ala Leu His Glu Glu Val Val Gly Val
355 360 365
Val Pro Ala Gly Gln Val Pro Gln His Lys Asp Phe Ala His Met Pro
370 375 380
Leu Leu Lys Ala Val Leu Lys Glu Thr Leu Arg Leu Tyr Pro Val Val
385 390 395 400
Pro Thr Asn Ser Arg Ile Ile Glu Lys Glu Ile Glu Val Asp Gly Phe
405 410 415
Leu Phe Pro Lys Asn Thr Gln Phe Val Phe Cys His Tyr Val Val Ser
420 425 430
Arg Asp Pro Thr Ala Phe Ser Glu Pro Glu Ser Phe Gln Pro His Arg
435 440 445
Trp Leu Arg Asn Ser Gln Pro Ala Thr Pro Arg Ile Gln His Pro Phe
450 455 460
Gly Ser Val Pro Phe Gly Tyr Gly Val Arg Ala Cys Leu Gly Arg Arg
465 470 475 480
Ile Ala Glu Leu Glu Met Gln Leu Leu Leu Ala Arg Leu Ile Gln Lys
485 490 495
Tyr Lys Val Val Leu Ala Pro Glu Thr Gly Glu Leu Lys Ser Val Ala
500 505 510
Arg Ile Val Leu Val Pro Asn Lys Lys Val Gly Leu Gln Phe Leu Gln
515 520 525
Arg Gln Cys
530
<210> 124
<211> 1596
<212> DNA
<213> Human (Homo sapiens)
<400> 124
atggctgcat tgggttgtgc tagattaaga tgggcattga gaggtgctgg tagaggtttg 60
tgtccacatg gtgctagagc aaaagctgca attccagctg cattaccatc tgataaagct 120
actggtgcac caggtgctgg tccaggtgtt agaagaagac aaagatcatt ggaagaaatc 180
ccaagattgg gtcaattgag atttttcttt caattgttcg ttcaaggtta cgcattgcaa 240
ttgcatcaat tgcaagtttt gtacaaggct aagtacggtc caatgtggat gtcttactta 300
ggtccacaaa tgcatgttaa tttggcttca gcaccattgt tagaacaagt tatgagacaa 360
gagggtaaat acccagttag aaacgatatg gaattgtgga aagaacatag agatcaacat 420
gatttgacat atggtccttt tactacagaa ggtcatcatt ggtaccaatt gagacaagct 480
ttgaaccaaa gattgttaaa accagcagaa gctgcattgt acactgatgc ttttaatgaa 540
gttattgatg attttatgac aagattagat caattgagag cagaatctgc ttcaggtaat 600
caagtttctg atatggctca attgttttat tacttcgcat tggaagctat ctgttacatc 660
ttgttcgaaa agagaattgg ttgtttgcaa agatcaattc cagaagatac tgttacattc 720
gttagatcta tcggtttgat gttccaaaac tcattgtatg ctacattttt gccaaaatgg 780
acaagaccag ttttaccatt ttggaaaaga tacttggatg gttggaacgc aattttctct 840
ttcggtaaaa agttgatcga tgaaaagttg gaagatatgg aagctcaatt acaagctgca 900
ggtccagatg gtattcaagt ttctggttat ttgcatttct tgttagcatc tggtcaattg 960
tcaccaagag aagctatggg ttcattacca gaattgttaa tggcaggtgt tgatactaca 1020
tctaatactt tgacatgggc tttgtaccat ttgtcaaaag atccagaaat tcaagaagca 1080
ttacatgaag aagttgttgg tgttgttcca gctggtcaag ttccacaaca taaggatttc 1140
gcacatatgc cattgttgaa ggctgttttg aaggaaactt tgagattgta cccagttgtt 1200
ccaacaaact ctagaatcat cgaaaaggaa atcgaagttg atggtttctt gttccctaaa 1260
aatactcaat tcgttttctg tcattacgtt gtttcaagag atccaacagc attttctgaa 1320
ccagaatcat ttcaaccaca tagatggttg agaaattctc aaccagctac tccaagaatt 1380
caacatccat ttggttcagt tccatttggt tatggtgtta gagcatgttt aggtagaaga 1440
atcgctgaat tggaaatgca attgttgttg gctagattga tccaaaagta caaggttgtt 1500
ttggcaccag aaacaggcga attgaagtct gttgctagaa tcgttttagt tccaaataag 1560
aaagttggtt tacaattctt gcaaagacaa tgttaa 1596
<210> 125
<211> 533
<212> PRT
<213> Brown rat (Rattus norvegicus)
<400> 125
Met Ala Val Leu Ser Arg Met Arg Leu Arg Trp Ala Leu Leu Asp Thr
1 5 10 15
Arg Val Met Gly His Gly Leu Cys Pro Gln Gly Ala Arg Ala Lys Ala
20 25 30
Ala Ile Pro Ala Ala Leu Arg Asp His Glu Ser Thr Glu Gly Pro Gly
35 40 45
Thr Gly Gln Asp Arg Pro Arg Leu Arg Ser Leu Ala Glu Leu Pro Gly
50 55 60
Pro Gly Thr Leu Arg Phe Leu Phe Gln Leu Phe Leu Arg Gly Tyr Val
65 70 75 80
Leu His Leu His Glu Leu Gln Ala Leu Asn Lys Ala Lys Tyr Gly Pro
85 90 95
Met Trp Thr Thr Thr Phe Gly Thr Arg Thr Asn Val Asn Leu Ala Ser
100 105 110
Ala Pro Leu Leu Glu Gln Val Met Arg Gln Glu Gly Lys Tyr Pro Ile
115 120 125
Arg Asp Ser Met Glu Gln Trp Lys Glu His Arg Asp His Lys Gly Leu
130 135 140
Ser Tyr Gly Ile Phe Ile Thr Gln Gly Gln Gln Trp Tyr His Leu Arg
145 150 155 160
His Ser Leu Asn Gln Arg Met Leu Lys Pro Ala Glu Ala Ala Leu Tyr
165 170 175
Thr Asp Ala Leu Asn Glu Val Ile Ser Asp Phe Ile Ala Arg Leu Asp
180 185 190
Gln Val Arg Thr Glu Ser Ala Ser Gly Asp Gln Val Pro Asp Val Ala
195 200 205
His Leu Leu Tyr His Leu Ala Leu Glu Ala Ile Cys Tyr Ile Leu Phe
210 215 220
Glu Lys Arg Val Gly Cys Leu Glu Pro Ser Ile Pro Glu Asp Thr Ala
225 230 235 240
Thr Phe Ile Arg Ser Val Gly Leu Met Phe Lys Asn Ser Val Tyr Val
245 250 255
Thr Phe Leu Pro Lys Trp Ser Arg Pro Leu Leu Pro Phe Trp Lys Arg
260 265 270
Tyr Met Asn Asn Trp Asp Asn Ile Phe Ser Phe Gly Glu Lys Met Ile
275 280 285
His Gln Lys Val Gln Glu Ile Glu Ala Gln Leu Gln Ala Ala Gly Pro
290 295 300
Asp Gly Val Gln Val Ser Gly Tyr Leu His Phe Leu Leu Thr Lys Glu
305 310 315 320
Leu Leu Ser Pro Gln Glu Thr Val Gly Thr Phe Pro Glu Leu Ile Leu
325 330 335
Ala Gly Val Asp Thr Thr Ser Asn Thr Leu Thr Trp Ala Leu Tyr His
340 345 350
Leu Ser Lys Asn Pro Glu Ile Gln Glu Ala Leu His Lys Glu Val Thr
355 360 365
Gly Val Val Pro Phe Gly Lys Val Pro Gln Asn Lys Asp Phe Ala His
370 375 380
Met Pro Leu Leu Lys Ala Val Ile Lys Glu Thr Leu Arg Leu Tyr Pro
385 390 395 400
Val Val Pro Thr Asn Ser Arg Ile Ile Thr Glu Lys Glu Thr Glu Ile
405 410 415
Asn Gly Phe Leu Phe Pro Lys Asn Thr Gln Phe Val Leu Cys His Tyr
420 425 430
Val Val Ser Arg Asp Pro Ser Val Phe Pro Glu Pro Glu Ser Phe Gln
435 440 445
Pro His Arg Trp Leu Arg Lys Arg Glu Asp Asp Asn Ser Gly Ile Gln
450 455 460
His Pro Phe Gly Ser Val Pro Phe Gly Tyr Gly Val Arg Ser Cys Leu
465 470 475 480
Gly Arg Arg Ile Ala Glu Leu Glu Met Gln Leu Leu Leu Ser Arg Leu
485 490 495
Ile Gln Lys Tyr Glu Val Val Leu Ser Pro Gly Met Gly Glu Val Lys
500 505 510
Ser Val Ser Arg Ile Val Leu Val Pro Ser Lys Lys Val Ser Leu Arg
515 520 525
Phe Leu Gln Arg Gln
530
<210> 126
<211> 1599
<212> DNA
<213> Brown rat (Rattus norvegicus)
<400> 126
atggctgttt tgtctagaat gagattaaga tgggcattgt tagatacaag agttatgggt 60
catggtttgt gtccacaagg tgctagagca aaagctgcaa ttccagctgc attaagagat 120
catgaatcta cagaaggtcc aggtactggt caagatagac caagattaag atcattggct 180
gaattaccag gtccaggtac tttgagattt ttattccaat tatttttgag aggttatgtt 240
ttgcatttgc atgaattgca agctttgaat aaggcaaagt acggtccaat gtggactaca 300
actttcggta caagaactaa cgttaatttg gcttcagcac cattgttaga acaagttatg 360
agacaagagg gtaaataccc aatcagagat tctatggaac aatggaagga acatagagat 420
cataagggtt tatcatacgg tattttcatt acacaaggtc aacaatggta ccatttgaga 480
cattctttga accaaagaat gttgaaacca gctgaagctg cattgtacac agatgcattg 540
aacgaagtta tttcagattt cattgctaga ttagatcaag ttagaactga atctgcttca 600
ggtgaccaag ttccagatgt tgcacatttg ttatatcatt tggctttgga agcaatctgt 660
tacatcttgt tcgaaaagag agttggttgt ttggaaccat ctattccaga agatacagca 720
acttttatta gatccgttgg tttgatgttc aagaactcag tttacgttac atttttgcca 780
aagtggtcta gaccattgtt gccattctgg aagagataca tgaacaactg ggataacatt 840
ttctctttcg gtgaaaagat gatccatcaa aaggttcaag aaatcgaagc tcaattgcaa 900
gctgcaggtc cagatggtgt tcaagtttct ggttatttgc atttcttgtt gacaaaggaa 960
ttgttgtcac cacaagaaac agttggtact ttcccagaat tgatcttggc tggtgttgat 1020
acaacttcta atacattgac ttgggcattg taccatttgt ctaaaaatcc agaaatccaa 1080
gaagctttgc ataaggaagt tactggtgtt gttccattcg gtaaagttcc acaaaataag 1140
gattttgctc atatgccatt gttgaaggca gttattaaag aaacattaag attgtatcca 1200
gttgttccaa ctaattctag aatcatcaca gaaaaggaaa ctgaaattaa tggtttcttg 1260
tttcctaaaa atacacaatt cgttttgtgt cattacgttg tttctagaga tccatcagtt 1320
tttccagaac cagaatcttt tcaaccacat agatggttga gaaagagaga agatgataac 1380
tctggtattc aacatccatt tggttcagtt ccatttggtt atggtgttag atcatgtttg 1440
ggtagaagaa tcgctgaatt ggaaatgcaa ttgttgttgt ctagattgat ccaaaagtac 1500
gaagttgttt tgtcacctgg tatgggtgag gttaagtctg tttcaagaat cgttttagtt 1560
ccatctaaga aagtttcttt gagattttta caaagacaa 1599
<210> 127
<211> 535
<212> PRT
<213> Domestic rabbit (Oryctolagus cuniculus)
<400> 127
Met Ala Ala Leu Gly Cys Ala Arg Leu Arg Trp Ala Leu Leu Gly Pro
1 5 10 15
Arg Val Ala Gly Cys Gly Leu Cys Pro Gln Gly Ala Arg Ala Lys Ala
20 25 30
Ala Ile Pro Thr Ala Leu Pro Ala Asp Glu Ala Ala Gln Ala Pro Gly
35 40 45
Ala Gly Pro Gly Asp Arg Arg Arg Arg Arg Ser Leu Glu Glu Leu Pro
50 55 60
Arg Leu Gly Gln Leu Arg Phe Phe Tyr Gln Ala Phe Val Gln Gly Tyr
65 70 75 80
Leu Leu His Leu His Lys Leu Gln Val Leu Asn Lys Ala Arg Tyr Gly
85 90 95
Pro Met Trp Val Ser Tyr Leu Gly Pro Gln Leu Phe Val Asn Leu Ala
100 105 110
Ser Ala Pro Leu Val Glu Thr Val Met Arg Gln Glu Gly Lys Tyr Pro
115 120 125
Val Arg Asn Asp Met Gln Leu Trp Lys Glu His Arg Asp His Gln Asp
130 135 140
Leu Ala Tyr Gly Val Phe Thr Thr Asp Gly His Asp Trp Tyr Gln Leu
145 150 155 160
Arg Gln Ala Leu Asn Gln Arg Leu Leu Lys Pro Ala Glu Ala Ala Leu
165 170 175
Tyr Thr Asp Ala Leu Asn Glu Val Ile Asp Ser Phe Val Val Arg Leu
180 185 190
Asp Gln Leu Arg Ala Glu Ser Ala Ser Gly Asp Gln Val Pro Asp Met
195 200 205
Ala Asp Leu Leu Tyr His Phe Ala Leu Glu Ala Ile Cys Tyr Ile Leu
210 215 220
Phe Glu Lys Arg Ile Gly Cys Leu Glu Ala Ser Ile Pro Lys Asp Thr
225 230 235 240
Glu Asn Phe Ile Arg Ser Val Gly Leu Met Phe Gln Asn Ser Val Tyr
245 250 255
Val Thr Phe Leu Pro Lys Trp Thr Arg Pro Leu Leu Pro Phe Trp Lys
260 265 270
Arg Tyr Leu Asp Gly Trp Asp Thr Ile Phe Ser Phe Gly Lys Asn Leu
275 280 285
Ile Asp Gln Lys Leu Gln Glu Val Val Ala Gln Leu Gln Ser Ala Gly
290 295 300
Ser Asp Gly Val Gln Val Ser Gly Tyr Leu His Ser Leu Leu Thr Ser
305 310 315 320
Gly Gln Leu Ser Pro Arg Glu Ala Leu Gly Ser Leu Pro Glu Leu Leu
325 330 335
Leu Ala Gly Val Asp Thr Thr Ser Asn Thr Leu Thr Trp Ala Leu Tyr
340 345 350
His Leu Ser Lys Asn Pro Glu Ile Gln Ala Ala Leu Arg Lys Glu Val
355 360 365
Val Gly Val Val Ala Ala Gly Gln Val Pro Gln His Lys Asp Phe Ala
370 375 380
His Met Pro Leu Leu Lys Ala Val Leu Lys Glu Thr Leu Arg Leu Tyr
385 390 395 400
Pro Val Ile Pro Ala Asn Ser Arg Ile Ile Val Asp Lys Glu Ile Glu
405 410 415
Val Gly Gly Phe Leu Phe Pro Lys Asn Thr Gln Phe Val Phe Cys His
420 425 430
Tyr Val Thr Ser Arg Asp Pro Ser Thr Phe Ser Glu Pro Asp Thr Phe
435 440 445
Trp Pro Tyr Arg Trp Leu Arg Lys Gly Gln Pro Glu Thr Ser Lys Thr
450 455 460
Gln His Pro Phe Gly Ser Val Pro Phe Gly Tyr Gly Val Arg Ala Cys
465 470 475 480
Leu Gly Arg Arg Ile Ala Glu Leu Glu Met Gln Leu Leu Leu Ala Arg
485 490 495
Leu Ile Gln Arg Tyr Glu Leu Met Leu Ala Pro Glu Thr Gly Glu Val
500 505 510
Gln Ser Val Ala Arg Ile Val Leu Val Pro Asn Lys Lys Val Gly Leu
515 520 525
Arg Phe Leu Pro Thr Gln Arg
530 535
<210> 128
<211> 1608
<212> DNA
<213> Domestic rabbit (Oryctolagus cuniculus)
<400> 128
atggctgcat tgggttgtgc tagattaaga tgggcattgt taggtccaag agttgctggt 60
tgtggtttgt gtccacaagg tgctagagca aaagctgcaa ttccaacagc tttaccagca 120
gatgaagctg cacaagctcc aggtgcaggt ccaggtgaca gaagaagaag aagatctttg 180
gaagaattgc caagattggg tcaattgaga tttttctatc aagctttcgt tcaaggttac 240
ttgttgcatt tgcataagtt gcaagttttg aataaggcaa gatatggtcc aatgtgggtt 300
tcatacttag gtccacaatt gttcgttaat ttggcttctg caccattggt tgaaacagtt 360
atgagacaag agggtaaata cccagttaga aacgatatgc aattgtggaa agaacataga 420
gatcatcaag atttggctta tggtgttttt actacagatg gtcatgattg gtaccaattg 480
agacaagcat tgaaccaaag attgttgaag ccagctgaag ctgcattgta cactgatgca 540
ttgaacgaag ttattgattc attcgttgtt agattagatc aattgagagc tgaatctgca 600
tcaggtgacc aagttccaga tatggctgat ttgttgtacc atttcgcttt ggaagcaatc 660
tgttacatct tgttcgaaaa gagaattggt tgtttagaag cttctatccc aaaggatact 720
gaaaacttca ttagatcagt tggtttgatg ttccaaaact ctgtttacgt tacatttttg 780
ccaaagtgga caagaccatt gttaccattt tggaaaagat acttggatgg ttgggataca 840
attttctctt tcggtaaaaa tttgatcgat caaaagttgc aagaagttgt tgctcaatta 900
caatctgcag gttcagatgg tgttcaagtt tcaggttatt tgcattcttt gttgacttca 960
ggtcaattat ctccaagaga agctttaggt tctttgccag aattgttatt ggcaggtgtt 1020
gatactacat caaatacttt gacatgggct ttgtaccatt tgtctaaaaa tccagaaatt 1080
caagctgcat taagaaaaga agttgttggt gttgttgctg caggtcaagt tccacaacat 1140
aaggatttcg ctcatatgcc attgttgaag gcagttttga aggaaacatt gagattgtac 1200
ccagttattc cagctaactc aagaatcatc gttgataagg aaattgaagt tggtggtttc 1260
ttgtttccta aaaatactca attcgttttc tgtcattatg ttacttctag agatccatct 1320
acattttcag aaccagatac tttttggcca tacagatggt tgagaaaagg tcaaccagaa 1380
acttcaaaaa cacaacatcc atttggttct gttccatttg gttatggtgt tagagcttgt 1440
ttgggtagaa gaatcgcaga attggaaatg caattgttgt tggctagatt gatccaaaga 1500
tacgaattga tgttagctcc agaaacaggt gaagttcaat ctgttgcaag aatcgtttta 1560
gttccaaata agaaagttgg tttaagattt ttgccaactc aaagataa 1608
<210> 129
<211> 533
<212> PRT
<213> House mouse (Mus musculus)
<400> 129
Met Ala Ala Trp Ser Arg Thr Arg Leu Arg Trp Thr Leu Leu Asp Pro
1 5 10 15
Arg Val Val Gly Arg Gly Leu Cys Pro Gln Gly Ala Arg Ala Lys Ala
20 25 30
Thr Ile Pro Ala Ala Leu Gln Ala Gln Glu Ser Thr Glu Gly Pro Gly
35 40 45
Thr Gly Gln Asp Arg Pro Arg Leu Arg Ser Pro Ala Glu Leu Pro Gly
50 55 60
Thr Gly Thr Leu Gln Phe Leu Phe Gln Leu Phe Leu Gln Gly Tyr Val
65 70 75 80
Leu His Leu Pro Asp Leu Gln Val Leu Asn Lys Thr Lys Tyr Gly Pro
85 90 95
Met Trp Thr Thr Ser Phe Gly Thr Tyr Thr Asn Val Asn Leu Ala Ser
100 105 110
Ala Pro Leu Leu Glu Gln Val Met Arg Gln Glu Gly Lys Tyr Pro Ile
115 120 125
Arg Asp His Met Asp Gln Trp Lys Asp His Arg Asp His Lys Gly Leu
130 135 140
Thr Tyr Gly Ile Phe Ile Ala Gln Gly Glu Gln Trp Tyr His Leu Arg
145 150 155 160
Gln Ala Leu Lys Gln Arg Leu Leu Lys Pro Asp Glu Ala Ala Leu Tyr
165 170 175
Thr Asp Ala Leu Asn Glu Val Ile Ser Asp Phe Ile Thr Arg Leu Asp
180 185 190
Gln Val Arg Ala Glu Ser Glu Ser Gly Asp Gln Val Pro Asp Met Ala
195 200 205
His Leu Leu Tyr His Leu Ala Leu Glu Ala Ile Thr Tyr Ile Leu Phe
210 215 220
Glu Lys Arg Ile Gly Cys Leu Lys Pro Ser Ile Pro Glu Asp Thr Ala
225 230 235 240
Ala Phe Ile Arg Ser Val Ala Ile Met Phe Gln Asn Ser Val Tyr Ile
245 250 255
Thr Phe Leu Pro Lys Trp Thr Arg Pro Leu Leu Pro Phe Trp Lys Arg
260 265 270
Tyr Leu Asn Gly Trp Asp Asn Ile Phe Ser Phe Gly Lys Lys Leu Ile
275 280 285
Asp Glu Lys Val Gln Glu Leu Lys Ala Gln Leu Gln Glu Thr Gly Pro
290 295 300
Asp Gly Val Arg Val Ser Gly Tyr Leu His Phe Leu Leu Thr Asn Glu
305 310 315 320
Leu Leu Ser Thr Gln Glu Thr Ile Gly Thr Phe Pro Glu Leu Leu Leu
325 330 335
Ala Gly Val Asp Thr Thr Ser Asn Thr Leu Thr Trp Ala Leu Tyr His
340 345 350
Leu Ser Lys Ser Pro Glu Ile Gln Glu Ala Leu His Lys Glu Val Thr
355 360 365
Gly Val Val Pro Phe Gly Lys Val Pro Gln His Lys Asp Phe Ala His
370 375 380
Met Pro Leu Leu Lys Ala Val Ile Lys Glu Thr Leu Arg Leu Tyr Pro
385 390 395 400
Val Val Pro Thr Asn Ser Arg Ile Ile Thr Glu Lys Glu Thr Glu Ile
405 410 415
Asn Gly Phe Leu Phe Pro Lys Asn Thr Gln Phe Val Leu Cys His Tyr
420 425 430
Val Val Ser Arg Asp Pro Ser Val Phe Pro Glu Pro Asn Ser Phe Gln
435 440 445
Pro His Arg Trp Leu Arg Lys Lys Glu Ala Asp Asn Pro Gly Ile Leu
450 455 460
His Pro Phe Gly Ser Val Pro Phe Gly Tyr Gly Val Arg Ser Cys Leu
465 470 475 480
Gly Arg Arg Ile Ala Glu Leu Glu Met Gln Leu Met Leu Ser Arg Leu
485 490 495
Val Gln Lys Tyr Glu Ile Ala Leu Ala Pro Gly Met Gly Glu Val Lys
500 505 510
Thr Val Ser Arg Ile Val Leu Val Pro Ser Lys Lys Val Arg Leu His
515 520 525
Phe Leu Gln Arg Gln
530
<210> 130
<211> 1602
<212> DNA
<213> House mouse (Mus musculus)
<400> 130
atggctgcat ggtctagaac aagattaaga tggactttgt tagatccaag agttgttggt 60
agaggtttgt gtccacaagg tgctagagca aaagctacaa ttccagctgc attacaagca 120
caagaatcta cagaaggtcc aggtactggt caagatagac caagattaag atcaccagct 180
gaattgccag gtactggtac attgcaattc ttgttccaat tatttttgca aggttatgtt 240
ttacatttgc cagatttgca agttttgaat aagacaaagt acggtccaat gtggactaca 300
tctttcggta cttacacaaa cgttaatttg gcatcagctc cattgttaga acaagttatg 360
agacaagagg gtaaataccc aattagagat catatggatc aatggaaaga tcatagagat 420
cataagggtt tgacttacgg tattttcatt gcacaaggtg aacaatggta ccatttgaga 480
caagctttga agcaaagatt gttgaagcca gatgaagctg cattgtacac agatgctttg 540
aacgaagtta tttctgattt catcactaga ttggatcaag ttagagcaga atctgaatca 600
ggtgaccaag ttccagatat ggctcatttg ttatatcatt tggcattgga agctatcaca 660
tacatcttgt tcgaaaagag aattggttgt ttgaaaccat ctattccaga agatactgct 720
gcttttatta gatctgttgc aatcatgttc caaaactcag tttacatcac atttttacca 780
aaatggacta gaccattgtt gccattctgg aagagatact taaacggttg ggataacatt 840
ttctctttcg gtaaaaagtt gattgatgaa aaagttcaag aattgaaggc tcaattgcaa 900
gaaacaggtc cagatggtgt tagagtttct ggttatttgc atttcttgtt gactaacgaa 960
ttgttgtcaa ctcaagaaac aatcggtact ttcccagaat tgttgttggc aggtgttgat 1020
actacatcta acactttgac atgggctttg taccatttgt ctaagtcacc agaaatccaa 1080
gaagctttgc ataaggaagt tacaggtgtt gttccattcg gtaaagttcc acaacataag 1140
gatttcgcac atatgccatt gttgaaggct gttattaaag aaacattaag attgtatcca 1200
gttgttccaa ctaattctag aatcatcaca gaaaaggaaa ctgaaattaa tggtttcttg 1260
ttccctaaaa atactcaatt cgttttgtgt cattacgttg tttctagaga tccatcagtt 1320
tttccagaac caaattcatt tcaaccacat agatggttaa gaaagaaaga agcagataat 1380
ccaggtattt tgcatccatt tggttctgtt ccatttggtt atggtgttag atcatgttta 1440
ggtagaagaa tcgctgaatt ggaaatgcaa ttgatgttgt caagattagt tcaaaagtac 1500
gaaatcgcat tggctcctgg tatgggtgaa gttaagactg tttctagaat cgttttagtt 1560
ccatctaaga aagttagatt gcatttcttg caaagacaat aa 1602
<210> 131
<211> 534
<212> PRT
<213> Domestic cattle (Bos taurus)
<400> 131
Met Gly Ala Leu Gly Ser Ala Arg Leu Arg Trp Ala Leu Leu Gly Arg
1 5 10 15
Arg Ala Ala Leu Pro Gly Leu Gly Ser Phe Gly Ala Arg Ala Lys Ala
20 25 30
Ala Ile Pro Ser Ala Leu Pro Ala Ala Gln Ala Ala Glu Ala Pro Gly
35 40 45
Thr Gly Pro Gly Asp Arg Arg Leu Arg Ser Leu Asp Glu Leu Ser Gly
50 55 60
Pro Gly Gln Leu Arg Leu Leu Phe Gln Leu Leu Val Gln Gly Tyr Val
65 70 75 80
Leu His Leu His Gln Leu Gln Val Leu Asn Lys Ala Lys Tyr Gly Pro
85 90 95
Ile Trp Ile Asn Arg Val Gly Pro Gln Met His Val His Leu Ala Ser
100 105 110
Ala Pro Leu Leu Glu Gln Val Met Arg Gln Glu Gly Lys Tyr Pro Val
115 120 125
Arg Asp Asp Met Lys Leu Trp Lys Glu His Arg Asp Gln Gln Gly Leu
130 135 140
Ser Tyr Gly Pro Phe Thr Thr Met Gly Glu Gln Trp Tyr Arg Leu Arg
145 150 155 160
Gln Thr Leu Asn Gln Arg Met Leu Lys Pro Ala Glu Ala Ala Leu Tyr
165 170 175
Thr Asp Ala Leu Asn Glu Val Ile Asn Asp Phe Met Asp Gln Leu Lys
180 185 190
Gln Leu Arg Ala Glu Ser Ala Ser Gly Asp His Val Pro Asp Ile Ala
195 200 205
His Gln Phe Tyr Phe Phe Ala Leu Glu Ala Ile Ser Tyr Ile Leu Phe
210 215 220
Glu Lys Arg Ile Gly Cys Leu Glu Arg Ser Ile Pro Lys Asp Thr Glu
225 230 235 240
Thr Phe Val Arg Ser Val Gly Leu Met Phe His Asn Ser Leu Phe Val
245 250 255
Thr Phe Leu Pro Thr Trp Thr Arg Pro Leu Leu Pro Phe Trp Lys Arg
260 265 270
Tyr Leu Asp Gly Trp Asn Thr Ile Phe Ser Phe Gly Lys Lys Leu Ile
275 280 285
Asp Gln Lys Leu Glu Glu Ile Glu Ala Gln Leu Lys Thr Glu Asn Pro
290 295 300
Glu Lys Thr Gln Ile Ser Gly Tyr Leu His Phe Leu Leu Thr Ser Gly
305 310 315 320
Gln Leu Ser Pro Arg Glu Ala Glu Gly Ser Leu Pro Glu Leu Leu Leu
325 330 335
Ala Gly Val Asp Thr Thr Ser Asn Thr Leu Thr Trp Ala Leu Tyr His
340 345 350
Leu Ser Lys Asn Pro Glu Ile Gln Ala Ala Leu His Lys Glu Val Val
355 360 365
Gly Val Val Pro Ala Gly Gln Val Pro Gln His Lys Asp Leu Ala Arg
370 375 380
Met Pro Leu Leu Lys Ala Val Leu Lys Glu Thr Leu Arg Leu Tyr Pro
385 390 395 400
Val Val Pro Val Asn Ser Arg Val Val Val Asp Lys Glu Ile Glu Val
405 410 415
Gly Gly Phe Leu Phe Pro Lys Asn Thr Gln Phe Val Leu Cys His Tyr
420 425 430
Val Ile Ser Arg Asp Pro Asp Ile Tyr Pro Glu Pro Asp Ser Phe Gln
435 440 445
Pro Gln Arg Trp Leu Arg Lys Asn Gln Pro Asp Ala Leu Lys Thr Gln
450 455 460
His Pro Phe Gly Ser Val Pro Phe Gly Tyr Gly Val Arg Ala Cys Leu
465 470 475 480
Gly Arg Arg Ile Ala Glu Leu Glu Met Gln Leu Leu Leu Thr Arg Leu
485 490 495
Ile Gln His Tyr Glu Val Val Leu Ala Pro Glu Thr Gly Glu Val Thr
500 505 510
Ser Val Ala Arg Ile Val Leu Val Pro Asn Lys Lys Val Gly Leu Arg
515 520 525
Phe Leu Gln Arg Gln Ser
530
<210> 132
<211> 1605
<212> DNA
<213> Domestic cattle (Bos taurus)
<400> 132
atgggtgctt taggttctgc aagattgaga tgggctttgt taggtagaag agctgcattg 60
ccaggtttag gttcttttgg tgctagagca aaagctgcaa ttccatcagc tttgccagct 120
gcacaagctg cagaagcacc aggtactggt ccaggtgaca gaagattgag atctttagat 180
gaattgtcag gtccaggtca attgagattg ttgttccaat tgttagttca aggttacgtt 240
ttgcatttgc atcaattgca agttttgaat aaggctaagt acggtccaat ttggattaat 300
agagttggtc cacaaatgca tgttcatttg gcttctgcac cattgttaga acaagttatg 360
agacaagagg gtaaataccc agttagagat gatatgaagt tgtggaaaga acatagagat 420
caacaaggtt tatcatatgg tccttttact actatgggtg aacaatggta cagattgaga 480
caaactttga accaaagaat gttaaaacca gctgaagctg cattatatac agatgcattg 540
aatgaagtta ttaatgattt catggatcaa ttgaaacaat tgagagctga atctgcatca 600
ggtgaccatg ttccagatat cgctcatcaa ttctatttct ttgctttgga agcaatctct 660
tacattttgt ttgaaaagag aattggttgt ttggaaagat caatcccaaa ggatactgaa 720
acattcgtta gatctgttgg tttaatgttc cataactcat tgttcgttac atttttgcca 780
acttggacaa gaccattgtt gccattctgg aagagatatt tggatggttg gaacacaatt 840
ttctctttcg gtaaaaagtt gatcgatcaa aagttggaag aaatcgaagc tcaattgaag 900
actgaaaacc cagaaaagac tcaaatctct ggttacttac atttcttgtt gacatctggt 960
caattgtcac caagagaagc tgaaggttca ttaccagaat tgttattggc aggtgttgat 1020
actacatcta acactttgac atgggctttg taccatttgt ctaaaaatcc agaaattcaa 1080
gctgcattac ataaagaagt tgttggtgtt gttccagcag gtcaagttcc acaacataaa 1140
gatttggcta gaatgccatt gttgaaggca gttttgaagg aaactttgag attataccca 1200
gttgttccag ttaactctag agttgttgtt gataaggaaa tcgaagttgg tggtttcttg 1260
tttcctaaaa atacacaatt cgttttgtgt cattacgtta tctctagaga tccagatatc 1320
tatccagaac cagattcatt tcaaccacaa agatggttaa gaaagaatca accagatgct 1380
ttgaaaactc aacatccatt tggttcagtt ccatttggtt atggtgttag agcttgttta 1440
ggtagaagaa tcgcagaatt ggaaatgcaa ttgttgttga caagattgat ccaacattac 1500
gaagttgttt tggctccaga aactggtgaa gttacatctg ttgcaagaat cgttttggtt 1560
ccaaataaga aagttggttt aagatttttg caaagacaat cataa 1605
<210> 133
<211> 522
<212> PRT
<213> Zebrafish (Danio rerio)
<400> 133
Met Ala Val Ser Phe Ala Leu Ser Ser Ala Glu Arg Leu Gly Trp Cys
1 5 10 15
Phe Leu Arg Pro Thr Thr Ala Ala Thr Gly Phe Arg Arg Ala Ala Gly
20 25 30
Asn Ser Ala Ala Ala Ser Val Ser Val Gln Asp Gly His Arg Lys Leu
35 40 45
Lys Thr Glu Ala Asp Leu Pro Glu Ile Lys Ile Phe Thr Met Leu Tyr
50 55 60
Gln Met Leu Phe Lys Gly Tyr Leu Asn Ser Val His Glu Leu Gln Leu
65 70 75 80
Tyr Gln Lys Gln Val Tyr Gly Pro Leu Trp Lys Ile Asn Ala Gly Asn
85 90 95
Leu Gln Gly Ile Ser Ile Thr Ser Val Glu Leu Leu Glu Glu Leu Leu
100 105 110
Arg Lys Asp Glu Lys Tyr Pro Cys Arg Gly Tyr Met Thr Leu Trp Thr
115 120 125
Glu His Arg Asp Leu Arg Gly Ile Ser Tyr Gly Pro Phe Thr Glu Glu
130 135 140
Gly Glu Lys Trp Tyr Lys Leu Arg Ala Val Leu Asn Lys Arg Met Leu
145 150 155 160
His Pro Lys Asp Ser Leu Gln Tyr Gly Asp Val Val Asn Ala Val Ile
165 170 175
Thr Asp Phe Ile Lys Arg Ile Tyr Tyr Leu Arg Glu Met Ser Pro Thr
180 185 190
Gly Asp Leu Val Ser Asn Leu Thr Asn Glu Leu Tyr Arg Phe Ser Leu
195 200 205
Glu Gly Ile Ala Ser Ile Leu Phe Glu Thr Arg Ile Gly Cys Leu Glu
210 215 220
Lys Glu Ile Pro Ala Glu Thr Gln Glu Phe Ile Asn Ser Ile Ala Gln
225 230 235 240
Met Phe Thr Tyr Asn Met His Val Ala Leu Leu Pro Asn Trp Thr Arg
245 250 255
Asn Tyr Leu Pro Phe Trp Gln Lys Tyr Ile Asp Gly Trp Asp Gly Ile
260 265 270
Phe Lys Phe Gly Thr Lys Met Ile Asn Leu Lys Met Glu Ala Ile Gln
275 280 285
Thr Arg Leu Asp Thr Asn Gln Glu Val Ala Gly Glu Tyr Leu Thr Tyr
290 295 300
Leu Leu Ser Ser Gly Lys Met Ser Cys Lys Asp Val Tyr Gly Ser Val
305 310 315 320
Ser Glu Val Leu Leu Ala Gly Val Asp Thr Thr Ser Asn Thr Met Leu
325 330 335
Trp Ala Leu Tyr Leu Leu Ser Lys Asp Pro Ala Ala Gln Glu Thr Leu
340 345 350
His Gln Glu Val Thr Lys Val Leu Lys Gly Asp Arg Ile Pro Thr Ala
355 360 365
Glu Glu Val Asn Ser Met Pro Phe Leu Lys Ala Val Ile Lys Glu Thr
370 375 380
Leu Arg Leu Tyr Pro Val Val Pro Val Asn Ser Arg Leu Ile Ala Glu
385 390 395 400
Ser Glu Val Ile Ile Gly Glu Tyr Leu Phe Pro Lys Lys Thr Thr Phe
405 410 415
Asn Leu Phe His Tyr Ala Ile Ser His Asp Glu Lys Val Phe Pro Glu
420 425 430
Pro Gln Lys Phe Lys Pro Glu Arg Trp Leu Arg Asp Gly Arg Thr Arg
435 440 445
Pro Asn Pro Phe Gly Ser Ile Pro Phe Gly Phe Gly Val Arg Ala Cys
450 455 460
Val Gly Arg Arg Ile Ala Glu Leu Glu Met His Leu Ala Leu Ala Arg
465 470 475 480
Leu Ile Lys Leu Phe Glu Met Arg Pro Asp Pro Thr Val Gly Glu Val
485 490 495
Lys Ala Asn Phe Arg Ser Val Leu Val Pro Asn Lys Lys Val Asn Leu
500 505 510
His Phe Val Glu Arg Gln Lys Thr Glu Thr
515 520
<210> 134
<211> 1569
<212> DNA
<213> Zebrafish (Danio rerio)
<400> 134
atggcagttt cttttgcttt atcttcagca gaaagattag gttggtgttt cttaagacca 60
actacagctg caacaggttt tagaagagct gctggtaatt cagctgcagc ttctgtttca 120
gttcaagatg gtcatagaaa gttgaagact gaagcagatt tgccagaaat taaaattttt 180
acaatgttgt accaaatgtt gtttaaaggt tacttgaact ctgttcatga attgcaattg 240
taccaaaagc aagtttacgg tccattatgg aagattaatg ctggtaattt gcaaggtatc 300
tctatcactt cagttgaatt gttggaagaa ttgttgagaa aggatgaaaa gtacccatgt 360
agaggttaca tgactttatg gacagaacat agagatttga gaggtatttc ttatggtcct 420
tttactgaag aaggtgaaaa gtggtacaag ttgagagcag ttttgaataa gagaatgttg 480
catccaaagg attcattgca atatggtgac gttgttaatg ctgttattac tgatttcatc 540
aagagaatct attacttaag agaaatgtct ccaactggtg acttggtttc taatttgaca 600
aacgaattgt acagattttc tttggaaggt atcgcttcaa tcttgttcga aacaagaatc 660
ggttgtttag aaaaagaaat tccagctgaa actcaagaat tcattaactc tatcgcacaa 720
atgttcacat acaacatgca tgttgctttg ttgccaaact ggactagaaa ctatttgcca 780
ttctggcaaa agtacattga tggttgggat ggtattttta agttcggtac aaagatgatc 840
aatttgaaaa tggaagcaat ccaaactaga ttggatacaa accaagaagt tgctggtgaa 900
tatttgactt acttgttgtc ttcaggtaaa atgtcttgta aggatgttta cggttctgtt 960
tcagaagttt tgttagcagg tgttgatact acatctaata ctatgttgtg ggctttatac 1020
ttgttatcaa aagatccagc agctcaagaa actttgcatc aagaagttac aaaggttttg 1080
aaaggtgaca gaattccaac agcagaagaa gttaattcaa tgccattttt aaaggctgtt 1140
attaaagaaa ctttgagatt atatccagtt gttccagtta attctagatt gatcgcagaa 1200
tcagaagtta ttatcggtga atatttgttt ccaaagaaaa ctacttttaa tttgttccat 1260
tacgctatct ctcatgatga aaaggttttc ccagaaccac aaaagtttaa accagaaaga 1320
tggttaagag atggtagaac aagaccaaat ccatttggtt caattccatt tggttttggt 1380
gttagagctt gtgttggtag aagaattgca gaattggaaa tgcatttggc attggctaga 1440
ttgattaaat tgttcgaaat gaggccagat ccaactgttg gtgaagttaa agctaacttc 1500
agatctgttt tagttccaaa taagaaagtt aatttgcatt ttgttgaaag acaaaagact 1560
gaaacataa 1569
<210> 135
<211> 531
<212> PRT
<213> Crab-eating macaque (Macaca fascicularis)
<400> 135
Met Ala Ala Leu Gly Cys Ala Arg Leu Arg Trp Val Leu Arg Gly Ala
1 5 10 15
Gly Arg Gly Leu Cys Pro His Gly Ala Arg Ala Lys Ala Thr Ile Pro
20 25 30
Thr Ala Leu Pro Ser Asp Lys Ala Thr Glu Ala Pro Gly Ala Gly Pro
35 40 45
Gly Ile Arg Arg Arg Gln Arg Ser Leu Lys Glu Ile Pro Arg Leu Gly
50 55 60
Gln Leu Arg Phe Phe Phe Gln Leu Phe Val Gln Gly Tyr Ala Leu Gln
65 70 75 80
Leu His Gln Leu Gln Val Leu Tyr Lys Ala Lys Tyr Gly Pro Met Trp
85 90 95
Met Ser Tyr Leu Gly Pro Gln Met His Val Asn Leu Ala Ser Ala Pro
100 105 110
Leu Leu Glu Gln Val Met Arg Gln Glu Gly Lys Tyr Pro Val Arg Asn
115 120 125
Asp Met Glu Leu Trp Lys Glu His Arg Asp Leu His Asp Leu Thr Tyr
130 135 140
Gly Pro Phe Thr Thr Glu Gly His His Trp Tyr Gln Leu Arg Gln Ala
145 150 155 160
Leu Asn Gln Arg Leu Leu Lys Pro Ala Glu Ala Ala Leu Tyr Thr Asp
165 170 175
Ala Phe Asn Glu Val Ile Asp Asp Phe Met Ile Arg Leu Asp Gln Leu
180 185 190
Arg Ala Glu Ser Ala Ser Gly Asn Gln Val Ser Asp Thr Ala Gln Leu
195 200 205
Phe Tyr Tyr Phe Ala Leu Glu Ala Ile Cys Tyr Ile Leu Phe Glu Lys
210 215 220
Arg Ile Gly Cys Leu Gln Arg Ser Ile Pro Glu Asp Thr Val Thr Phe
225 230 235 240
Val Arg Ser Ile Gly Leu Met Phe Gln Asn Ser Leu Tyr Ala Thr Phe
245 250 255
Leu Pro Lys Trp Thr Arg Pro Val Leu Pro Phe Trp Lys Arg Tyr Leu
260 265 270
Asp Gly Trp Asn Ala Ile Phe Ser Phe Gly Lys Lys Leu Ile Asp Glu
275 280 285
Lys Leu Glu Asp Met Glu Ala Gln Leu Gln Ala Glu Gly Pro Asp Gly
290 295 300
Val Gln Val Ser Gly Tyr Leu His Phe Leu Leu Ala Ser Gly Gln Leu
305 310 315 320
Ser Pro Arg Glu Ala Met Gly Ser Leu Pro Glu Leu Leu Met Ala Gly
325 330 335
Val Asp Thr Thr Ser Asn Thr Leu Thr Trp Ala Leu Tyr His Leu Ser
340 345 350
Lys Asp Pro Glu Ile Gln Glu Ala Leu His Glu Glu Val Val Gly Val
355 360 365
Val Pro Ala Gly Gln Val Pro Gln His Lys Asp Phe Ala His Leu Pro
370 375 380
Leu Leu Lys Ala Val Leu Lys Glu Thr Leu Arg Leu Tyr Pro Val Val
385 390 395 400
Pro Thr Asn Ser Arg Ile Ile Glu Lys Glu Ile Glu Val Asp Gly Phe
405 410 415
Leu Phe Pro Lys Asn Thr Gln Phe Val Phe Cys His Tyr Val Val Ser
420 425 430
Arg Asp Pro Thr Thr Phe Ser Glu Pro Glu Ser Phe Gln Pro His Arg
435 440 445
Trp Leu Arg Asn Ser Gln Pro Ala Thr Pro Arg Ile Gln His Pro Phe
450 455 460
Gly Ser Val Pro Phe Gly Tyr Gly Val Arg Ala Cys Leu Gly Arg Arg
465 470 475 480
Ile Ala Glu Leu Glu Met Gln Leu Leu Leu Ala Arg Leu Ile Gln Lys
485 490 495
Tyr Lys Val Val Leu Ala Pro Glu Thr Gly Glu Leu Lys Ser Val Ala
500 505 510
Arg Ile Val Leu Val Pro Asn Lys Lys Val Gly Leu Gln Phe Leu Gln
515 520 525
Arg Gln Cys
530
<210> 136
<211> 1596
<212> DNA
<213> Crab-eating macaque (Macaca fascicularis)
<400> 136
atggctgcat tgggttgtgc tagattaaga tgggttttga gaggtgcagg tagaggtttg 60
tgtccacatg gtgctagagc aaaagctact attccaacag ctttaccatc tgataaagca 120
actgaagctc caggtgcagg tccaggtatt agaagaagac aaagatcatt gaaggaaatc 180
ccaagattgg gtcaattgag atttttcttt caattgttcg ttcaaggtta cgctttgcaa 240
ttgcatcaat tgcaagtttt gtacaaggca aagtacggtc caatgtggat gtcttactta 300
ggtccacaaa tgcatgttaa tttggcatca gctccattgt tagaacaagt tatgagacaa 360
gagggtaaat acccagttag aaacgatatg gaattgtgga aagaacatag agatttgcat 420
gatttgacat atggtccttt tactacagaa ggtcatcatt ggtaccaatt gagacaagct 480
ttgaaccaaa gattgttaaa accagctgaa gctgcattat atactgatgc ttttaatgaa 540
gttattgatg atttcatgat tagattagat caattgagag ctgaatctgc atcaggtaat 600
caagtttctg atacagctca attgttttat tacttcgctt tggaagcaat ctgttacatc 660
ttgttcgaaa agagaattgg ttgtttgcaa agatcaattc cagaagatac tgttacattc 720
gttagatcta tcggtttgat gttccaaaac tcattgtatg ctacattttt gccaaaatgg 780
acaagaccag ttttaccatt ttggaaaaga tacttggatg gttggaacgc aattttctct 840
ttcggtaaaa agttgatcga tgaaaagttg gaagatatgg aagctcaatt acaagcagaa 900
ggtccagatg gtgttcaagt ttctggttat ttgcatttct tgttagcttc tggtcaattg 960
tcaccaagag aagcaatggg ttcattacca gaattgttaa tggctggtgt tgatactaca 1020
tctaatactt tgacatgggc attgtaccat ttgtcaaaag atccagaaat tcaagaagct 1080
ttacatgaag aagttgttgg tgttgttcca gcaggtcaag ttccacaaca taaggatttc 1140
gctcatttgc cattgttgaa ggcagttttg aaggaaactt tgagattgta cccagttgtt 1200
ccaacaaact ctagaatcat cgaaaaggaa atcgaagttg atggtttctt gttccctaaa 1260
aatactcaat tcgttttctg tcattacgtt gtttcaagag atccaactac attttctgaa 1320
ccagaatcat ttcaaccaca tagatggtta agaaattctc aaccagctac accaagaatt 1380
caacatccat ttggttcagt tccatttggt tatggtgtta gagcttgttt aggtagaaga 1440
atcgcagaat tggaaatgca attgttgttg gctagattga tccaaaagta caaggttgtt 1500
ttggctccag aaactggtga attgaagtct gttgcaagaa tcgttttagt tccaaataag 1560
aaagttggtt tacaattctt gcaaagacaa tgttaa 1596
<210> 137
<211> 542
<212> PRT
<213> African clawed frog (Xenopus laevis)
<400> 137
Met Ser Arg Gly Gly Leu Leu Leu Lys Thr Cys Arg Val Ala Val Ser
1 5 10 15
Gln Gly Arg Ala Val Thr Gly Gly Pro Pro Ala Ser Arg Leu His Cys
20 25 30
Val Pro Gln Gly Ser Gly Tyr Leu Gln Ala Gly Arg Gly Val Ser Val
35 40 45
Ser Gln Gly Arg Ala Val Thr Gly Ala Ala Val Glu Thr Ala Asp Gly
50 55 60
Arg Lys Glu Met Lys Glu Phe Asp Asp Leu Pro Gly Pro Ser Leu Leu
65 70 75 80
Lys Asn Leu Tyr Tyr Tyr Phe Val Arg Gly Tyr Leu Leu His Thr His
85 90 95
Glu Leu Gln Leu Asn Tyr Lys Lys Met Tyr Gly Pro Leu Trp Arg Ser
100 105 110
Glu Ile Gly Lys Tyr Lys Met Val Asn Ile Gly Asp Pro Glu Ala Leu
115 120 125
Gln Gln Leu Leu Arg Gln Glu Gly Lys Tyr Pro Met Arg Asn Lys Glu
130 135 140
Asp Ile Trp Lys Ala His Arg Asp Gln Arg Lys Leu Ala Tyr Gly Pro
145 150 155 160
Phe Thr Glu Glu Gly Tyr His Trp Tyr Arg Ile Arg Ser Val Leu Asn
165 170 175
Lys Lys Met Leu Lys Pro Ser Glu Ala Ser Ser Tyr Ala Gly Gly Ile
180 185 190
Asn Glu Val Val Thr Asp Phe Met Asn Lys Leu Gln Tyr Met Arg Lys
195 200 205
Ala Ser Pro Ser Gly Asp Met Val Asn Asp Val Ala Asn Ala Leu Tyr
210 215 220
Arg Phe Ala Phe Glu Gly Ile Ser Asn Ile Leu Phe Glu Thr Arg Ile
225 230 235 240
Gly Cys Leu Glu Lys Gln Thr Pro Pro Glu Thr Gln Lys Phe Ile Asp
245 250 255
Ser Ile Gly Tyr Met Phe Lys Asn Ser Val Tyr Val Thr Phe Leu Pro
260 265 270
Gln Trp Thr Lys Gly Ile Leu Pro Tyr Trp Asp Arg Tyr Ile Glu Gly
275 280 285
Trp Asp Asn Ile Phe Asp Phe Gly Lys Gln Leu Val Asp Lys Lys Met
290 295 300
Ser Glu Ile Gln Ser Arg Leu Asp Arg Gly Glu Glu Val Glu Gly Glu
305 310 315 320
Tyr Leu Thr Tyr Leu Leu Ser Ser Ala Asn Leu Asn Ile Gly Glu Val
325 330 335
Tyr Gly Ser Val Cys Glu Leu Leu Leu Ala Gly Val Asp Thr Thr Ser
340 345 350
Asn Thr Leu Cys Trp Ser Met Tyr His Leu Ala Arg Asp Pro Glu Leu
355 360 365
Gln Gln Ala Val Tyr Glu Glu Val Ser Ser Ala Val Pro Met Asp Arg
370 375 380
Ile Pro Val Ala Glu Asp Ile Ser Lys Met Pro Leu Leu Arg Gly Val
385 390 395 400
Ile Lys Glu Thr Leu Arg Leu Tyr Pro Val Val Pro Thr Asn Gly Arg
405 410 415
Ile Val Ser Glu Lys Asp Val Lys Ile Gly Glu Tyr Arg Phe Pro Lys
420 425 430
Asn Thr Leu Phe Val Leu Cys His Phe Ala Ile Ala Arg Asp Glu Glu
435 440 445
Asn Phe Glu Asp Pro Leu Lys Phe Gln Pro Gln Arg Trp Leu Arg Asp
450 455 460
Gly Gly Met Lys His His Pro Phe Ser Ser Ile Pro Phe Gly Tyr Gly
465 470 475 480
Val Arg Ala Cys Val Gly Lys Arg Ile Ala Gln Leu Glu Met His Leu
485 490 495
Ala Leu Ser Arg Ile Ile Arg Ile Phe Glu Leu Arg Pro Asp Pro Lys
500 505 510
Gly Gly Asp Ile Lys Thr Ile Ala Arg Ile Leu Leu Thr Pro Asn Lys
515 520 525
Pro Val Asn Leu Gln Phe Leu Glu Arg Asn Ala His Gln Gly
530 535 540
<210> 138
<211> 1629
<212> DNA
<213> African clawed frog (Xenopus laevis)
<400> 138
atgtctagag gtggtttgtt attgaaaact tgtagagttg ctgtttcaca aggtagagca 60
gttacaggtg gtccaccagc ttctagattg cattgtgttc cacaaggttc aggttatttg 120
caagcaggta gaggtgtttc tgtttcacag ggcagagcag ttactggtgc tgcagttgaa 180
acagcagatg gtagaaagga aatgaaggaa ttcgatgatt tgccaggtcc atctttgttg 240
aaaaatttgt actactactt cgttagaggt tacttattgc atactcatga attgcaattg 300
aactacaaga aaatgtacgg tccattgtgg agatcagaaa tcggtaaata caagatggtt 360
aacattggtg acccagaagc tttgcaacaa ttgttgagac aagagggtaa atacccaatg 420
agaaataagg aagatatttg gaaagcacat agagatcaaa gaaaattggc ttatggtcct 480
tttactgaag aaggttacca ttggtacaga attagatctg ttttgaataa gaaaatgttg 540
aagccatcag aagcttcttc atacgcaggt ggtattaatg aagttgttac agatttcatg 600
aataagttgc aatacatgag aaaggcttct ccatcaggtg acatggttaa cgatgttgct 660
aacgcattgt acagatttgc atttgaaggt atctctaaca tcttgttcga aacaagaatt 720
ggttgtttag aaaaacaaac tccaccagaa acacaaaagt ttattgattc tattggttac 780
atgttcaaga actcagttta cgttacattt ttgccacaat ggacaaaagg tattttacca 840
tactgggata gatacatcga aggttgggat aacatcttcg atttcggtaa acaattggtt 900
gataagaaaa tgtctgaaat ccaatcaaga ttagatagag gtgaagaagt tgagggtgaa 960
tacttgactt atttgttgtc ttcagctaat ttgaacatcg gtgaagttta cggttctgtt 1020
tgtgaattgt tgttagcagg tgttgatact acatctaaca cattgtgttg gtcaatgtac 1080
catttggcaa gagatccaga attgcaacaa gctgtttacg aagaagtttc ttcagcagtt 1140
ccaatggata gaattccagt tgctgaagat atctctaaga tgccattgtt gagaggtgtt 1200
attaaagaaa ctttgagatt gtacccagtt gttccaacaa acggtagaat cgtttcagaa 1260
aaggatgtta aaattggtga atatagattt cctaaaaata ctttgttcgt tttgtgtcat 1320
ttcgctatcg caagagatga agaaaacttc gaagatccat tgaagtttca accacaaaga 1380
tggttaagag atggtggtat gaagcatcat ccattttctt caatcccatt cggttatggt 1440
gttagagctt gtgttggtaa aagaatcgca caattggaaa tgcatttggc tttgtctaga 1500
atcatcagaa tcttcgaatt aagaccagat ccaaaaggtg gtgacatcaa gactatcgct 1560
agaattttgt tgacaccaaa taagccagtt aatttgcaat tcttggaaag aaatgctcat 1620
caaggttaa 1629
<210> 139
<211> 690
<212> PRT
<213> Human (Homo sapiens)
<400> 139
Met Gly Val Arg Gln Gln Leu Ala Leu Leu Leu Leu Leu Leu Leu Leu
1 5 10 15
Leu Trp Gly Leu Gly Gln Pro Val Trp Pro Val Ala Val Ala Leu Thr
20 25 30
Leu Arg Trp Leu Leu Gly Asp Pro Thr Cys Cys Val Leu Leu Gly Leu
35 40 45
Ala Met Leu Ala Arg Pro Trp Leu Gly Pro Trp Val Pro His Gly Leu
50 55 60
Ser Leu Ala Ala Ala Ala Leu Ala Leu Thr Leu Leu Pro Ala Arg Leu
65 70 75 80
Pro Pro Gly Leu Arg Trp Leu Pro Ala Asp Val Ile Phe Leu Ala Lys
85 90 95
Ile Leu His Leu Gly Leu Lys Ile Arg Gly Cys Leu Ser Arg Gln Pro
100 105 110
Pro Asp Thr Phe Val Asp Ala Phe Glu Arg Arg Ala Arg Ala Gln Pro
115 120 125
Gly Arg Ala Leu Leu Val Trp Thr Gly Pro Gly Ala Gly Ser Val Thr
130 135 140
Phe Gly Glu Leu Asp Ala Arg Ala Cys Gln Ala Ala Trp Ala Leu Lys
145 150 155 160
Ala Glu Leu Gly Asp Pro Ala Ser Leu Cys Ala Gly Glu Pro Thr Ala
165 170 175
Leu Leu Val Leu Ala Ser Gln Ala Val Pro Ala Leu Cys Met Trp Leu
180 185 190
Gly Leu Ala Lys Leu Gly Cys Pro Thr Ala Trp Ile Asn Pro His Gly
195 200 205
Arg Gly Met Pro Leu Ala His Ser Val Leu Ser Ser Gly Ala Arg Val
210 215 220
Leu Val Val Asp Pro Asp Leu Arg Glu Ser Leu Glu Glu Ile Leu Pro
225 230 235 240
Lys Leu Gln Ala Glu Asn Ile Arg Cys Phe Tyr Leu Ser His Thr Ser
245 250 255
Pro Thr Pro Gly Val Gly Ala Leu Gly Ala Ala Leu Asp Ala Ala Pro
260 265 270
Ser His Pro Val Pro Ala Asp Leu Arg Ala Gly Ile Thr Trp Arg Ser
275 280 285
Pro Ala Leu Phe Ile Tyr Thr Ser Gly Thr Thr Gly Leu Pro Lys Pro
290 295 300
Ala Ile Leu Thr His Glu Arg Val Leu Gln Met Ser Lys Met Leu Ser
305 310 315 320
Leu Ser Gly Ala Thr Ala Asp Asp Val Val Tyr Thr Val Leu Pro Leu
325 330 335
Tyr His Val Met Gly Leu Val Val Gly Ile Leu Gly Cys Leu Asp Leu
340 345 350
Gly Ala Thr Cys Val Leu Ala Pro Lys Phe Ser Thr Ser Cys Phe Trp
355 360 365
Asp Asp Cys Arg Gln His Gly Val Thr Val Ile Leu Tyr Val Gly Glu
370 375 380
Leu Leu Arg Tyr Leu Cys Asn Ile Pro Gln Gln Pro Glu Asp Arg Thr
385 390 395 400
His Thr Val Arg Leu Ala Met Gly Asn Gly Leu Arg Ala Asp Val Trp
405 410 415
Glu Thr Phe Gln Gln Arg Phe Gly Pro Ile Arg Ile Trp Glu Val Tyr
420 425 430
Gly Ser Thr Glu Gly Asn Met Gly Leu Val Asn Tyr Val Gly Arg Cys
435 440 445
Gly Ala Leu Gly Lys Met Ser Cys Leu Leu Arg Met Leu Ser Pro Phe
450 455 460
Glu Leu Val Gln Phe Asp Met Glu Ala Ala Glu Pro Val Arg Asp Asn
465 470 475 480
Gln Gly Phe Cys Ile Pro Val Gly Leu Gly Glu Pro Gly Leu Leu Leu
485 490 495
Thr Lys Val Val Ser Gln Gln Pro Phe Val Gly Tyr Arg Gly Pro Arg
500 505 510
Glu Leu Ser Glu Arg Lys Leu Val Arg Asn Val Arg Gln Ser Gly Asp
515 520 525
Val Tyr Tyr Asn Thr Gly Asp Val Leu Ala Met Asp Arg Glu Gly Phe
530 535 540
Leu Tyr Phe Arg Asp Arg Leu Gly Asp Thr Phe Arg Trp Lys Gly Glu
545 550 555 560
Asn Val Ser Thr His Glu Val Glu Gly Val Leu Ser Gln Val Asp Phe
565 570 575
Leu Gln Gln Val Asn Val Tyr Gly Val Cys Val Pro Gly Cys Glu Gly
580 585 590
Lys Val Gly Met Ala Ala Val Gln Leu Ala Pro Gly Gln Thr Phe Asp
595 600 605
Gly Glu Lys Leu Tyr Gln His Val Arg Ala Trp Leu Pro Ala Tyr Ala
610 615 620
Thr Pro His Phe Ile Arg Ile Gln Asp Ala Met Glu Val Thr Ser Thr
625 630 635 640
Phe Lys Leu Met Lys Thr Arg Leu Val Arg Glu Gly Phe Asn Val Gly
645 650 655
Ile Val Val Asp Pro Leu Phe Val Leu Asp Asn Arg Ala Gln Ser Phe
660 665 670
Arg Pro Leu Thr Ala Glu Met Tyr Gln Ala Val Cys Glu Gly Thr Trp
675 680 685
Arg Leu
690
<210> 140
<211> 2073
<212> DNA
<213> Human (Homo sapiens)
<400> 140
atgggtgtta gacaacaatt ggctttgtta ttattgttgt tgttgttgtt gtggggttta 60
ggtcaaccag tttggccagt tgctgttgca ttgactttaa gatggttatt gggtgaccca 120
acatgttgtg ttttattggg tttagctatg ttggcaagac catggttagg tccatgggtt 180
ccacatggtt tgtctttagc tgcagctgca ttggcattaa ctttattgcc agctagattg 240
ccaccaggtt taagatggtt gccagcagat gttattttct tggctaagat cttgcatttg 300
ggtttgaaga tcagaggttg tttgtctaga caaccaccag atacatttgt tgatgctttt 360
gaaagaagag ctagagcaca accaggtaga gcattattgg tttggactgg tccaggtgct 420
ggttcagtta catttggtga attagatgct agagcatgtc aagctgcatg ggctttaaaa 480
gcagaattgg gtgacccagc atctttgtgt gctggtgaac caactgcttt attggttttg 540
gcttcacaag cagttccagc tttatgtatg tggttaggtt tggcaaaatt gggttgtcca 600
acagcttgga ttaatccaca tggtcgtggt atgccattag cacattctgt tttgtcttca 660
ggtgctagag ttttagttgt tgatccagat ttgagagaat cattggaaga aatcttgcca 720
aagttgcaag ctgaaaacat cagatgtttc tatttgtctc atacttcacc aacaccaggt 780
gttggtgctt taggtgctgc attggatgct gcaccatctc atccagttcc agcagatttg 840
agagctggta ttacttggag atctccagca ttgtttatat atacatcagg tactacaggt 900
ttaccaaaac cagctatctt gactcatgaa agagttttgc aaatgtcaaa gatgttgtct 960
ttgtcaggtg ctactgcaga tgatgttgtt tacacagttt tgccattgta ccatgttatg 1020
ggtttagttg ttggtatttt gggttgttta gatttgggtg caacttgtgt tttggctcca 1080
aaattttcta catcatgttt ctgggatgat tgtagacaac atggtgttac agttattttg 1140
tacgttggtg aattgttgag atacttatgt aacattccac aacaaccaga agatagaact 1200
catacagtta gattggcaat gggtaatggt ttaagagctg atgtttggga aactttccaa 1260
caaagattcg gtccaatcag aatttgggaa gtttatggtt ctacagaagg taatatgggt 1320
ttggttaatt acgttggtag atgtggtgct ttgggtaaaa tgtcttgttt gttgagaatg 1380
ttgtcaccat tcgaattggt tcaattcgat atggaagctg cagaaccagt tagagataat 1440
caaggttttt gtattccagt tggtttaggt gaaccaggtt tattgttgac taaggttgtt 1500
tctcaacaac cattcgttgg ttatagaggt ccaagagaat tgtctgaaag aaaattggtt 1560
agaaacgtta gacaatcagg tgacgtttat tacaatacag gtgacgtttt ggctatggat 1620
agagaaggtt tcttgtactt cagagataga ttgggtgaca cttttagatg gaaaggtgaa 1680
aatgtttcta cacatgaagt tgaaggtgtt ttgtcacaag ttgatttctt gcaacaagtt 1740
aacgtttacg gtgtttgtgt tccaggttgt gagggtaaag ttggtatggc tgcagttcaa 1800
ttggcaccag gtcaaacttt cgatggtgaa aagttgtacc aacatgttag agcttggtta 1860
ccagcttacg caacaccaca tttcatcaga attcaagatg caatggaagt tacttctact 1920
tttaaattga tgaagactag attggttaga gaaggtttta atgttggtat cgttgttgat 1980
ccattgttcg ttttggataa cagagctcaa tcttttagac cattgactgc agaaatgtac 2040
caagctgttt gtgaaggtac atggagattg taa 2073
<210> 141
<211> 690
<212> PRT
<213> Brown rat (Rattus norvegicus)
<400> 141
Met Gly Val Trp Lys Lys Leu Thr Phe Leu Leu Leu Ser Leu Leu Leu
1 5 10 15
Leu Val Gly Leu Gly Gln Pro Leu Trp Pro Ala Ala Thr Ala Leu Ala
20 25 30
Leu Arg Trp Phe Leu Gly Asp Pro Thr Cys Phe Val Leu Leu Gly Leu
35 40 45
Ala Phe Leu Gly Arg Pro Trp Ile Ser Ser Trp Ile Pro His Trp Leu
50 55 60
Ser Leu Ala Ala Ala Ala Leu Thr Leu Ser Leu Leu Pro Pro Arg Pro
65 70 75 80
Pro Pro Glu Leu Arg Trp Leu His Lys Asp Val Ala Phe Ala Phe Lys
85 90 95
Leu Leu Phe Tyr Gly Leu Asn Leu Arg Arg Arg Leu Asn Arg His Pro
100 105 110
Pro Glu Leu Phe Val Asp Ala Leu Glu Gln Gln Ala Gln Ala Arg Pro
115 120 125
Asp Gln Val Ala Leu Val Cys Thr Gly Ser Glu Gly Cys Ser Ile Thr
130 135 140
Asn Arg Glu Leu Asn Ala Lys Ala Cys Gln Ala Ala Trp Ala Leu Lys
145 150 155 160
Ala Lys Leu Lys Glu Ala Thr Ile Gln Glu Asp Lys Gly Ala Thr Ala
165 170 175
Ile Leu Val Leu Pro Ser Lys Ser Ile Ser Ala Leu Ser Val Phe Leu
180 185 190
Gly Leu Ala Lys Leu Gly Cys Pro Val Ala Trp Ile Asn Pro His Ser
195 200 205
Arg Gly Met Pro Leu Leu His Ser Val Gln Ser Ser Gly Ala Ser Val
210 215 220
Leu Ile Val Asp Pro Asp Leu Gln Glu Asn Leu Glu Glu Val Leu Pro
225 230 235 240
Lys Leu Leu Ala Glu Asn Ile Arg Cys Phe Tyr Leu Gly His Ser Ser
245 250 255
Pro Thr Pro Gly Val Glu Ala Leu Gly Ala Ala Leu Asp Ala Ala Pro
260 265 270
Ser Asp Pro Val Pro Ala Lys Leu Arg Ala Asn Ile Lys Trp Lys Ser
275 280 285
Pro Ala Ile Phe Ile Tyr Thr Ser Gly Thr Thr Gly Leu Pro Lys Pro
290 295 300
Ala Ile Leu Ser His Glu Arg Val Ile Gln Met Ser Asn Val Leu Ser
305 310 315 320
Phe Cys Gly Arg Thr Ala Asp Asp Val Val Tyr Asn Val Leu Pro Leu
325 330 335
Tyr His Ser Met Gly Leu Val Leu Gly Val Leu Gly Cys Leu Gln Leu
340 345 350
Gly Ala Thr Cys Val Leu Ala Pro Lys Phe Ser Ala Ser Arg Tyr Trp
355 360 365
Ala Glu Cys Arg Gln Tyr Ser Val Thr Val Val Leu Tyr Val Gly Glu
370 375 380
Val Leu Arg Tyr Leu Cys Asn Val Pro Gly Gln Pro Glu Asp Lys Lys
385 390 395 400
His Thr Val Arg Phe Ala Leu Gly Asn Gly Leu Arg Ala Asp Val Trp
405 410 415
Glu Asn Phe Gln Gln Arg Phe Gly Pro Ile Gln Ile Trp Glu Leu Tyr
420 425 430
Gly Ser Thr Glu Gly Asn Val Gly Leu Met Asn Tyr Val Gly His Cys
435 440 445
Gly Ala Val Gly Lys Thr Ser Cys Phe Ile Arg Met Leu Thr Pro Leu
450 455 460
Glu Leu Val Gln Phe Asp Ile Glu Thr Ala Glu Pro Val Arg Asp Lys
465 470 475 480
Gln Gly Phe Cys Ile Pro Val Glu Thr Gly Lys Pro Gly Leu Leu Leu
485 490 495
Thr Lys Ile Arg Lys Asn Gln Pro Phe Leu Gly Tyr Arg Gly Ser Gln
500 505 510
Asp Glu Thr Lys Arg Lys Leu Val Ala Asn Val Arg Gln Val Gly Asp
515 520 525
Leu Tyr Tyr Asn Thr Gly Asp Val Leu Ala Leu Asp Gln Glu Gly Phe
530 535 540
Phe Tyr Phe Arg Asp Arg Leu Gly Asp Thr Phe Arg Trp Lys Gly Glu
545 550 555 560
Asn Val Ser Thr Arg Glu Val Glu Gly Val Leu Ser Ile Leu Asp Phe
565 570 575
Leu Glu Glu Val Asn Val Tyr Gly Val Thr Val Pro Gly Cys Glu Gly
580 585 590
Lys Val Gly Met Ala Ala Val Lys Leu Ala Pro Gly Lys Thr Phe Asp
595 600 605
Gly Gln Lys Leu Tyr Gln His Val Arg Ser Trp Leu Pro Ala Tyr Ala
610 615 620
Thr Pro His Phe Ile Arg Ile Gln Asp Ser Leu Glu Ile Thr Asn Thr
625 630 635 640
Tyr Lys Leu Val Lys Ser Gln Leu Ala Arg Glu Gly Phe Asp Val Gly
645 650 655
Val Ile Ala Asp Pro Leu Tyr Ile Leu Asp Asn Lys Ala Glu Thr Phe
660 665 670
Arg Ser Leu Met Pro Asp Val Tyr Gln Ala Val Cys Glu Gly Thr Trp
675 680 685
Lys Leu
690
<210> 142
<211> 2073
<212> DNA
<213> Brown rat (Rattus norvegicus)
<400> 142
atgggtgttt ggaagaaatt gacatttttg ttgttgtctt tgttgttatt ggttggttta 60
ggtcaaccat tgtggccagc tgcaactgct ttggcattaa gatggttttt aggtgaccca 120
acatgtttcg ttttgttggg tttggcattt ttgggtagac catggatttc ttcatggatt 180
ccacattggt tgtcattagc tgcagctgca ttgacattat ctttattgcc accaagacca 240
ccaccagaat tgagatggtt acataaagat gttgcttttg cttttaaatt gttgttttat 300
ggtttgaatt tgagaagaag attgaacaga catccaccag aattgtttgt tgatgcatta 360
gaacaacaag ctcaagcaag accagatcaa gttgctttgg tttgtactgg ttcagaaggt 420
tgttctatca caaacagaga attgaacgct aaggcatgtc aagctgcatg ggctttgaag 480
gcaaagttga aggaagcaac tatccaagaa gataaaggtg ctacagcaat tttggttttg 540
ccatctaagt caatctctgc tttgtcagtt ttcttgggtt tagctaaatt aggttgtcca 600
gttgcatgga ttaatccaca ttctcgtggt atgccattat tgcattcagt tcaatcttca 660
ggtgcttctg ttttgattgt tgatccagat ttgcaagaaa atttggaaga agttttgcca 720
aagttgttgg cagaaaacat cagatgtttc tatttgggtc attcttcacc aactccaggt 780
gttgaagctt tgggtgctgc attagatgct gcaccatcag atccagttcc tgctaagttg 840
agagcaaaca tcaagtggaa gtcaccagca atttttatct atacttctgg tactacaggt 900
ttgccaaaac cagctatctt gtcacatgaa agagttattc aaatgtcaaa cgttttgtct 960
ttctgtggta gaacagctga tgatgttgtt tacaacgttt tgccattgta ccattctatg 1020
ggtttggttt taggtgtctt gggttgtttg caattaggtg ctacttgtgt tttagcacca 1080
aaattttcag cttctagata ttgggcagaa tgtagacaat actctgttac agttgttttg 1140
tatgttggtg aagttttgag atacttatgt aatgttccag gtcaaccaga agataagaaa 1200
catactgtta gattcgcttt gggtaatggt ttaagagcag atgtttggga aaacttccaa 1260
caaagattcg gtccaatcca aatttgggaa ttgtatggtt caacagaggg taacgttggt 1320
ttaatgaact acgttggtca ttgtggtgca gttggtaaaa cttcttgttt catcagaatg 1380
ttgacaccat tggaattagt tcaattcgat atcgaaactg ctgaaccagt tagagataag 1440
caaggtttct gtatcccagt tgaaactggt aaaccaggtt tattgttgac aaagatcaga 1500
aagaatcaac catttttagg ttatagaggt tctcaagatg aaactaagag aaaattggtt 1560
gcaaacgtta gacaagttgg tgacttgtat tacaatacag gtgacgtttt ggctttggat 1620
caagaaggtt tcttttattt cagagataga ttgggtgaca cttttagatg gaaaggtgaa 1680
aatgtttcaa caagagaagt tgaaggtgtt ttgtctatct tggatttctt ggaagaagtt 1740
aacgtttacg gtgttacagt tccaggttgt gagggtaaag ttggtatggc tgcagttaaa 1800
ttggctccag gtaaaacttt cgatggtcaa aagttgtacc aacatgttag atcatggtta 1860
ccagcttacg caacaccaca tttcatcaga attcaagatt cattggaaat tactaataca 1920
tacaaattgg ttaagtctca attggctaga gaaggttttg atgttggtgt tattgcagat 1980
ccattgtaca tcttggataa taaggctgaa acttttagat ctttgatgcc agatgtttac 2040
caagctgttt gtgaaggtac atggaaatta taa 2073
<210> 143
<211> 669
<212> PRT
<213> Brewer's yeast (Saccharomyces cerevisiae)
<400> 143
Met Ser Pro Ile Gln Val Val Val Phe Ala Leu Ser Arg Ile Phe Leu
1 5 10 15
Leu Leu Phe Arg Leu Ile Lys Leu Ile Ile Thr Pro Ile Gln Lys Ser
20 25 30
Leu Gly Tyr Leu Phe Gly Asn Tyr Phe Asp Glu Leu Asp Arg Lys Tyr
35 40 45
Arg Tyr Lys Glu Asp Trp Tyr Ile Ile Pro Tyr Phe Leu Lys Ser Val
50 55 60
Phe Cys Tyr Ile Ile Asp Val Arg Arg His Arg Phe Gln Asn Trp Tyr
65 70 75 80
Leu Phe Ile Lys Gln Val Gln Gln Asn Gly Asp His Leu Ala Ile Ser
85 90 95
Tyr Thr Arg Pro Met Ala Glu Lys Gly Glu Phe Gln Leu Glu Thr Phe
100 105 110
Thr Tyr Ile Glu Thr Tyr Asn Ile Val Leu Arg Leu Ser His Ile Leu
115 120 125
His Phe Asp Tyr Asn Val Gln Ala Gly Asp Tyr Val Ala Ile Asp Cys
130 135 140
Thr Asn Lys Pro Leu Phe Val Phe Leu Trp Leu Ser Leu Trp Asn Ile
145 150 155 160
Gly Ala Ile Pro Ala Phe Leu Asn Tyr Asn Thr Lys Gly Thr Pro Leu
165 170 175
Val His Ser Leu Lys Ile Ser Asn Ile Thr Gln Val Phe Ile Asp Pro
180 185 190
Asp Ala Ser Asn Pro Ile Arg Glu Ser Glu Glu Glu Ile Lys Asn Ala
195 200 205
Leu Pro Asp Val Lys Leu Asn Tyr Leu Glu Glu Gln Asp Leu Met His
210 215 220
Glu Leu Leu Asn Ser Gln Ser Pro Glu Phe Leu Gln Gln Asp Asn Val
225 230 235 240
Arg Thr Pro Leu Gly Leu Thr Asp Phe Lys Pro Ser Met Leu Ile Tyr
245 250 255
Thr Ser Gly Thr Thr Gly Leu Pro Lys Ser Ala Ile Met Ser Trp Arg
260 265 270
Lys Ser Ser Val Gly Cys Gln Val Phe Gly His Val Leu His Met Thr
275 280 285
Asn Glu Ser Thr Val Phe Thr Ala Met Pro Leu Phe His Ser Thr Ala
290 295 300
Ala Leu Leu Gly Ala Cys Ala Ile Leu Ser His Gly Gly Cys Leu Ala
305 310 315 320
Leu Ser His Lys Phe Ser Ala Ser Thr Phe Trp Lys Gln Val Tyr Leu
325 330 335
Thr Gly Ala Thr His Ile Gln Tyr Val Gly Glu Val Cys Arg Tyr Leu
340 345 350
Leu His Thr Pro Ile Ser Lys Tyr Glu Lys Met His Lys Val Lys Val
355 360 365
Ala Tyr Gly Asn Gly Leu Arg Pro Asp Ile Trp Gln Asp Phe Arg Lys
370 375 380
Arg Phe Asn Ile Glu Val Ile Gly Glu Phe Tyr Ala Ala Thr Glu Ala
385 390 395 400
Pro Phe Ala Thr Thr Thr Phe Gln Lys Gly Asp Phe Gly Ile Gly Ala
405 410 415
Cys Arg Asn Tyr Gly Thr Ile Ile Gln Trp Phe Leu Ser Phe Gln Gln
420 425 430
Thr Leu Val Arg Met Asp Pro Asn Asp Asp Ser Val Ile Tyr Arg Asn
435 440 445
Ser Lys Gly Phe Cys Glu Val Ala Pro Val Gly Glu Pro Gly Glu Met
450 455 460
Leu Met Arg Ile Phe Phe Pro Lys Lys Pro Glu Thr Ser Phe Gln Gly
465 470 475 480
Tyr Leu Gly Asn Ala Lys Glu Thr Lys Ser Lys Val Val Arg Asp Val
485 490 495
Phe Arg Arg Gly Asp Ala Trp Tyr Arg Cys Gly Asp Leu Leu Lys Ala
500 505 510
Asp Glu Tyr Gly Leu Trp Tyr Phe Leu Asp Arg Met Gly Asp Thr Phe
515 520 525
Arg Trp Lys Ser Glu Asn Val Ser Thr Thr Glu Val Glu Asp Gln Leu
530 535 540
Thr Ala Ser Asn Lys Glu Gln Tyr Ala Gln Val Leu Val Val Gly Ile
545 550 555 560
Lys Val Pro Lys Tyr Glu Gly Arg Ala Gly Phe Ala Val Ile Lys Leu
565 570 575
Thr Asp Asn Ser Leu Asp Ile Thr Ala Lys Thr Lys Leu Leu Asn Asp
580 585 590
Ser Leu Ser Arg Leu Asn Leu Pro Ser Tyr Ala Met Pro Leu Phe Val
595 600 605
Lys Phe Val Asp Glu Ile Lys Met Thr Asp Asn His Lys Ile Leu Lys
610 615 620
Lys Val Tyr Arg Glu Gln Lys Leu Pro Lys Gly Leu Asp Gly Asn Asp
625 630 635 640
Thr Ile Phe Trp Leu Lys Asn Tyr Lys Arg Tyr Glu Val Leu Thr Ala
645 650 655
Ala Asp Trp Glu Ala Ile Asp Ala Gln Thr Ile Lys Leu
660 665
<210> 144
<211> 2010
<212> DNA
<213> Brewer's yeast (Saccharomyces cerevisiae)
<400> 144
atgtctccca tacaggttgt tgtctttgcc ttgtcaagga ttttcctgct attattcaga 60
cttatcaagc taattataac ccctatccag aaatcactgg gttatctatt tggtaattat 120
tttgatgaat tagaccgtaa atatagatac aaggaggatt ggtatattat tccttacttt 180
ttgaaaagcg tgttttgtta tatcattgat gtgagaagac ataggtttca aaactggtac 240
ttatttatta aacaggtcca acaaaatggt gaccatttag cgattagtta cacccgtccc 300
atggccgaaa agggagaatt tcaactcgaa acctttacgt atattgaaac ttataacata 360
gtgttgagat tgtctcatat tttgcatttt gattataacg ttcaggccgg tgactacgtg 420
gcaatcgatt gtactaataa acctcttttc gtatttttat ggctttcttt gtggaacatt 480
ggggctattc cagctttttt aaactataat actaaaggca ctccgctggt tcactcccta 540
aagatttcca atattacgca ggtatttatt gaccctgatg ccagtaatcc gatcagagaa 600
tcggaagaag aaatcaaaaa cgcacttcct gatgttaaat taaactatct tgaagaacaa 660
gacttaatgc atgaactttt aaattcgcaa tcaccggaat tcttacaaca agacaacgtt 720
aggacaccac taggcttgac cgattttaaa ccctctatgt taatttatac atctggaacc 780
actggtttgc ctaaatccgc tattatgtct tggagaaaat cctccgtagg ttgtcaagtt 840
tttggtcatg ttttacatat gactaatgaa agcactgtgt tcacagccat gccattgttc 900
cattcaactg ctgccttatt aggtgcgtgc gccattctat ctcacggtgg ttgccttgcg 960
ttatcgcata aattttctgc cagtacattt tggaagcaag tttatttaac aggagccacg 1020
cacatccaat atgtcggaga agtctgtaga tacctgttac atacgccaat ttctaagtat 1080
gaaaagatgc ataaggtgaa ggttgcttat ggtaacgggc tgagacctga catctggcag 1140
gacttcagga agaggttcaa catagaagtt attggtgaat tctatgccgc aactgaagct 1200
ccttttgcta caactacctt ccagaaaggt gactttggaa ttggcgcatg taggaactat 1260
ggtactataa ttcaatggtt tttgtcattc caacaaacat tggtaaggat ggacccaaat 1320
gacgattccg ttatatatag aaattccaag ggtttctgcg aagtggcccc tgttggcgaa 1380
ccaggagaaa tgttaatgag aatctttttc cctaaaaaac cagaaacatc ttttcaaggt 1440
tatcttggta atgccaagga aacaaagtcc aaagttgtga gggatgtctt cagacgtggc 1500
gatgcttggt atagatgtgg agatttatta aaagcggacg aatatggatt atggtatttc 1560
cttgatagaa tgggtgatac tttcagatgg aaatctgaaa atgtttccac tactgaagta 1620
gaagatcagt tgacggccag taacaaagaa caatatgcac aagttctagt tgttggtatt 1680
aaagtaccta aatatgaagg tagagctggt tttgcagtta ttaaactaac tgacaactct 1740
cttgacatca ctgcaaagac caaattatta aatgattcct tgagccggtt aaatctaccg 1800
tcttatgcta tgcccctatt tgttaaattt gttgatgaaa ttaaaatgac agataatcat 1860
aaaattttga agaaggttta tagagagcaa aaattaccaa agggtttgga tggaaatgac 1920
actatttttt ggctcaagaa ttacaagcgc tatgaagtct tgaccgctgc tgattgggaa 1980
gccatcgatg cacaaacaat taaattatga 2010
<210> 145
<211> 382
<212> PRT
<213> Rattus norvegicus
<400> 145
Met Ala Leu Arg Gly Val Arg Val Leu Glu Leu Ala Gly Leu Ala Pro
1 5 10 15
Gly Pro Phe Cys Gly Met Ile Leu Ala Asp Phe Gly Ala Glu Val Val
20 25 30
Leu Val Asp Arg Leu Gly Ser Val Asn His Pro Ser His Leu Ala Arg
35 40 45
Gly Lys Arg Ser Leu Ala Leu Asp Leu Lys Arg Ser Pro Gly Ala Ala
50 55 60
Val Leu Arg Arg Met Cys Ala Arg Ala Asp Val Leu Leu Glu Pro Phe
65 70 75 80
Arg Cys Gly Val Met Glu Lys Leu Gln Leu Gly Pro Glu Thr Leu Arg
85 90 95
Gln Asp Asn Pro Lys Leu Ile Tyr Ala Arg Leu Ser Gly Phe Gly Gln
100 105 110
Ser Gly Ile Phe Ser Lys Val Ala Gly His Asp Ile Asn Tyr Val Ala
115 120 125
Leu Ser Gly Val Leu Ser Lys Ile Gly Arg Ser Gly Glu Asn Pro Tyr
130 135 140
Pro Pro Leu Asn Leu Leu Ala Asp Phe Gly Gly Gly Gly Leu Met Cys
145 150 155 160
Thr Leu Gly Ile Leu Leu Ala Leu Phe Glu Arg Thr Arg Ser Gly Leu
165 170 175
Gly Gln Val Ile Asp Ala Asn Met Val Glu Gly Thr Ala Tyr Leu Ser
180 185 190
Thr Phe Leu Trp Lys Thr Gln Ala Met Gly Leu Trp Ala Gln Pro Arg
195 200 205
Gly Gln Asn Leu Leu Asp Gly Gly Ala Pro Phe Tyr Thr Thr Tyr Lys
210 215 220
Thr Ala Asp Gly Glu Phe Met Ala Val Gly Ala Ile Glu Pro Gln Phe
225 230 235 240
Tyr Thr Leu Leu Leu Lys Gly Leu Gly Leu Glu Ser Glu Glu Leu Pro
245 250 255
Ser Gln Met Ser Ile Glu Asp Trp Pro Glu Met Lys Lys Lys Phe Ala
260 265 270
Asp Val Phe Ala Arg Lys Thr Lys Ala Glu Trp Cys Gln Ile Phe Asp
275 280 285
Gly Thr Asp Ala Cys Val Thr Pro Val Leu Thr Leu Glu Glu Ala Leu
290 295 300
His His Gln His Asn Arg Glu Arg Gly Ser Phe Ile Thr Asp Glu Glu
305 310 315 320
Gln His Ala Cys Pro Arg Pro Ala Pro Gln Leu Ser Arg Thr Pro Ala
325 330 335
Val Pro Ser Ala Lys Arg Asp Pro Ser Val Gly Glu His Thr Val Glu
340 345 350
Val Leu Lys Asp Tyr Gly Phe Ser Gln Glu Glu Ile His Gln Leu His
355 360 365
Ser Asp Arg Ile Ile Glu Ser Asn Lys Leu Lys Ala Asn Leu
370 375 380
<210> 146
<211> 1149
<212> DNA
<213> Rattus norvegicus
<400> 146
atggctttga gaggtgttag agttttagaa ttggctggtt tggcaccagg tccattttgt 60
ggtatgattt tagctgattt tggtgcagaa gttgttttgg ttgatagatt aggttctgtt 120
aatcatccat cacatttggc tagaggtaaa agatctttag cattggattt gaagagatca 180
ccaggtgctg cagttttaag aagaatgtgt gctagagcag atgttttgtt agaacctttt 240
agatgtggtg ttatggaaaa attgcaatta ggtccagaaa cattgagaca agataaccca 300
aagttgatct atgctagatt gtctggtttt ggtcaatctg gtattttctc taaggttgct 360
ggtcatgata tcaactacgt tgcattgtct ggtgttttgt caaagatcgg tagatcaggt 420
gaaaatccat acccaccatt aaatttgtta gcagattttg gtggtggtgg tttgatgtgt 480
acattgggta ttttgttggc tttgttcgaa agaactagat ctggtttagg tcaagttatt 540
gatgctaata tggttgaagg tactgcatat ttgtcaacat ttttatggaa gactcaagct 600
atgggtttgt gggcacaacc aagaggtcaa aatttgttag atggtggtgc tccattttat 660
actacataca aaacagcaga tggtgaattc atggctgttg gtgcaatcga accacaattc 720
tacactttgt tgttgaaggg tttgggttta gaatctgaag aattgccatc tcaaatgtca 780
atcgaagatt ggccagaaat gaaaaagaaa ttcgctgatg ttttcgcaag aaagacaaag 840
gctgaatggt gtcaaatctt tgatggtact gatgcatgtg ttactccagt tttgacatta 900
gaagaagctt tgcatcatca acataacaga gaaagaggtt cttttattac tgatgaagaa 960
caacatgctt gtccaagacc agcaccacaa ttatcaagaa caccagctgt tccatctgca 1020
aaaagagatc catcagttgg tgaacatact gttgaagttt tgaaggatta cggtttttct 1080
caagaagaaa tccatcaatt gcattctgat agaattattg aatcaaataa gttgaaagct 1140
aatttgtaa 1149
<210> 147
<211> 382
<212> PRT
<213> Homo sapiens
<400> 147
Met Ala Leu Gln Gly Ile Ser Val Val Glu Leu Ser Gly Leu Ala Pro
1 5 10 15
Gly Pro Phe Cys Ala Met Val Leu Ala Asp Phe Gly Ala Arg Val Val
20 25 30
Arg Val Asp Arg Pro Gly Ser Arg Tyr Asp Val Ser Arg Leu Gly Arg
35 40 45
Gly Lys Arg Ser Leu Val Leu Asp Leu Lys Gln Pro Arg Gly Ala Ala
50 55 60
Val Leu Arg Arg Leu Cys Lys Arg Ser Asp Val Leu Leu Glu Pro Phe
65 70 75 80
Arg Arg Gly Val Met Glu Lys Leu Gln Leu Gly Pro Glu Ile Leu Gln
85 90 95
Arg Glu Asn Pro Arg Leu Ile Tyr Ala Arg Leu Ser Gly Phe Gly Gln
100 105 110
Ser Gly Ser Phe Cys Arg Leu Ala Gly His Asp Ile Asn Tyr Leu Ala
115 120 125
Leu Ser Gly Val Leu Ser Lys Ile Gly Arg Ser Gly Glu Asn Pro Tyr
130 135 140
Ala Pro Leu Asn Leu Leu Ala Asp Phe Ala Gly Gly Gly Leu Met Cys
145 150 155 160
Ala Leu Gly Ile Ile Met Ala Leu Phe Asp Arg Thr Arg Thr Gly Lys
165 170 175
Gly Gln Val Ile Asp Ala Asn Met Val Glu Gly Thr Ala Tyr Leu Ser
180 185 190
Ser Phe Leu Trp Lys Thr Gln Lys Leu Ser Leu Trp Glu Ala Pro Arg
195 200 205
Gly Gln Asn Met Leu Asp Gly Gly Ala Pro Phe Tyr Thr Thr Tyr Arg
210 215 220
Thr Ala Asp Gly Glu Phe Met Ala Val Gly Ala Ile Glu Pro Gln Phe
225 230 235 240
Tyr Glu Leu Leu Ile Lys Gly Leu Gly Leu Lys Ser Asp Glu Leu Pro
245 250 255
Asn Gln Met Ser Met Asp Asp Trp Pro Glu Met Lys Lys Lys Phe Ala
260 265 270
Asp Val Phe Ala Glu Lys Thr Lys Ala Glu Trp Cys Gln Ile Phe Asp
275 280 285
Gly Thr Asp Ala Cys Val Thr Pro Val Leu Thr Phe Glu Glu Val Val
290 295 300
His His Asp His Asn Lys Glu Arg Gly Ser Phe Ile Thr Ser Glu Glu
305 310 315 320
Gln Asp Val Ser Pro Arg Pro Ala Pro Leu Leu Leu Asn Thr Pro Ala
325 330 335
Ile Pro Ser Phe Lys Arg Asp Pro Phe Ile Gly Glu His Thr Glu Glu
340 345 350
Ile Leu Glu Glu Phe Gly Phe Ser Arg Glu Glu Ile Tyr Gln Leu Asn
355 360 365
Ser Asp Lys Ile Ile Glu Ser Asn Lys Val Lys Ala Ser Leu
370 375 380
<210> 148
<211> 1149
<212> DNA
<213> Homo sapiens
<400> 148
atggcattac aaggtatttc tgttgttgaa ttgtcaggtt tagctccagg tccattttgt 60
gctatggttt tggcagattt tggtgctaga gttgttagag ttgatagacc aggttctaga 120
tatgatgttt caagattggg tagaggtaaa agatcattag ttttggattt gaagcaacca 180
agaggtgctg cagttttgag aagattgtgt aagagatctg atgttttgtt ggaacctttt 240
agaagaggtg ttatggaaaa attgcaatta ggtccagaaa tcttgcaaag agaaaaccca 300
agattgatct atgcaagatt gtcaggtttt ggtcaatctg gttcattttg tagattggca 360
ggtcatgata tcaactattt ggctttgtct ggtgttttgt caaagatcgg tagatctggt 420
gaaaatccat acgctccatt gaatttgtta gctgattttg ctggtggtgg tttgatgtgt 480
gcattgggta tcatcatggc tttgttcgat agaactagaa ctggtaaagg tcaagttatt 540
gatgcaaata tggttgaagg tactgcttat ttgtcttcat ttttatggaa gacacaaaag 600
ttgtcattat gggaagctcc aagaggtcaa aatatgttag atggtggtgc accattttat 660
actacataca gaactgctga tggtgaattc atggctgttg gtgcaatcga accacaattc 720
tacgaattgt tgattaaagg tttgggttta aagtcagatg aattgccaaa ccaaatgtct 780
atggatgatt ggccagaaat gaaaaagaaa ttcgcagatg ttttcgctga aaagactaaa 840
gcagaatggt gtcaaatttt tgatggtaca gatgcttgtg ttactccagt tttgacattc 900
gaagaagttg ttcatcatga tcataataag gaaagaggtt cttttattac atcagaagaa 960
caagatgttt caccaagacc agcaccattg ttattgaata ctccagctat cccatctttt 1020
aaaagagatc cttttattgg tgaacataca gaagaaatct tggaagaatt cggtttttct 1080
agagaagaaa tctatcaatt aaattctgat aaaattattg aatcaaataa ggttaaagct 1140
tctttgtaa 1149
<210> 149
<211> 381
<212> PRT
<213> Mus musculus
<400> 149
Met Val Leu Arg Gly Val Arg Val Val Glu Leu Ala Gly Leu Ala Pro
1 5 10 15
Gly Pro Phe Cys Gly Met Val Leu Ala Asp Phe Gly Ala Glu Val Val
20 25 30
Arg Val Asn Arg Leu Gly Ser Thr Gly Glu Asn Phe Leu Ala Arg Gly
35 40 45
Lys Arg Ser Leu Ala Leu Asp Leu Lys Arg Ser Gln Gly Val Thr Val
50 55 60
Leu Arg Arg Met Cys Ala Arg Ala Asp Val Leu Leu Glu Pro Phe Arg
65 70 75 80
Cys Gly Val Met Glu Lys Leu Gln Leu Gly Pro Glu Thr Leu Leu Gln
85 90 95
Asp Asn Pro Lys Leu Ile Tyr Ala Arg Leu Ser Gly Phe Gly Gln Ser
100 105 110
Gly Ile Phe Ser Lys Val Ala Gly His Asp Ile Asn Tyr Leu Ala Leu
115 120 125
Ser Gly Val Leu Ser Lys Ile Gly Arg Ser Gly Glu Asn Pro Tyr Pro
130 135 140
Pro Leu Asn Leu Leu Ala Asp Phe Gly Gly Gly Gly Leu Met Cys Thr
145 150 155 160
Leu Gly Ile Val Leu Ala Leu Phe Glu Arg Thr Arg Ser Gly Arg Gly
165 170 175
Gln Val Ile Asp Ser Ser Met Val Glu Gly Thr Ala Tyr Leu Ser Ser
180 185 190
Phe Leu Trp Lys Thr Gln Pro Met Gly Leu Trp Lys Gln Pro Arg Gly
195 200 205
Gln Asn Ile Leu Asp Gly Gly Ala Pro Phe Tyr Thr Thr Tyr Lys Thr
210 215 220
Ala Asp Gly Glu Phe Met Ala Val Gly Ala Ile Glu Pro Gln Phe Tyr
225 230 235 240
Ala Leu Leu Leu Lys Gly Leu Gly Leu Glu Ser Glu Glu Leu Pro Ser
245 250 255
Gln Met Ser Ser Ala Asp Trp Pro Glu Met Lys Lys Lys Phe Ala Asp
260 265 270
Val Phe Ala Lys Lys Thr Lys Ala Glu Trp Cys Gln Ile Phe Asp Gly
275 280 285
Thr Asp Ala Cys Val Thr Pro Val Leu Thr Phe Glu Glu Ala Leu His
290 295 300
His Gln His Asn Arg Glu Arg Ala Ser Phe Ile Thr Asp Gly Glu Gln
305 310 315 320
Leu Pro Ser Pro Arg Pro Ala Pro Leu Leu Ser Arg Thr Pro Ala Val
325 330 335
Pro Ser Ala Lys Arg Asp Pro Ser Val Gly Glu His Thr Val Glu Val
340 345 350
Leu Arg Glu Tyr Gly Phe Ser Gln Glu Glu Ile Leu Gln Leu His Ser
355 360 365
Asp Arg Ile Val Glu Ser Asp Lys Leu Lys Ala Asn Leu
370 375 380
<210> 150
<211> 1146
<212> DNA
<213> Mus musculus
<400> 150
atggttttga gaggtgttag agttgttgaa ttggctggtt tagcaccagg tccattttgt 60
ggtatggttt tagctgattt tggtgcagaa gttgttagag ttaatagatt gggttctact 120
ggtgaaaatt tcttggctag aggtaaaaga tcattggctt tggatttgaa aagatcacaa 180
ggtgtcacag ttttgagaag aatgtgtgct agagcagatg ttttgttaga accttttaga 240
tgtggtgtta tggaaaaatt gcaattaggt ccagaaactt tgttgcaaga taacccaaaa 300
ttgatctatg ctagattgtc tggttttggt caatctggta ttttctctaa ggttgctggt 360
catgatatta actatttggc attgtctggt gttttgtcaa aaattggtag atcaggtgaa 420
aatccatacc caccattaaa tttgttagct gattttggtg gtggtggttt gatgtgtact 480
ttgggtatcg ttttggcatt gttcgaaaga actagatcag gtagaggtca agttattgat 540
tcttcaatgg ttgaaggtac tgcttatttg tcttcatttt tatggaagac acaaccaatg 600
ggtttgtgga agcaaccaag aggtcaaaac attttagatg gtggtgctcc attttatact 660
acatacaaaa ctgcagatgg cgagtttatg gctgttggtg caatcgaacc acaattctac 720
gctttgttgt tgaagggttt gggtttagaa tctgaagaat taccatcaca aatgtcttca 780
gcagattggc cagaaatgaa aaagaaattc gctgatgttt tcgctaagaa aactaaggct 840
gaatggtgtc aaatttttga tggtacagat gcatgtgtta ctccagtttt gacatttgaa 900
gaagcattgc atcatcaaca taacagagaa agagcatctt ttattacaga tggtgaacaa 960
ttgccatctc caagaccagc tccattattg tcaagaactc cagctgttcc atctgcaaaa 1020
agagatccat cagttggtga acatacagtt gaagttttga gagaatacgg tttttcacaa 1080
gaagaaattt tgcaattgca ttctgataga attgttgaat cagataaatt gaaagctaat 1140
ttgtaa 1146
<210> 151
<211> 382
<212> PRT
<213> Ictalurus furcatus
<400> 151
Met Ala Leu Ala Gly Val Arg Val Ile Glu Leu Ala Gly Leu Ala Pro
1 5 10 15
Ala Pro Phe Cys Gly Met Ile Leu Ser Asp Phe Gly Ala Arg Val Ile
20 25 30
Arg Val Asp Arg Thr Lys Val Thr Met Ala Met Asp Ala Gln Ala Arg
35 40 45
Gly Lys Gln Ser Val Ala Leu Asn Leu Lys Ser Pro Gln Gly Val Ala
50 55 60
Val Leu Lys Lys Leu Cys Leu Gln Ser Asp Ile Val Leu Glu Pro Phe
65 70 75 80
Arg Lys Gly Val Met Glu Lys Leu Gly Leu Gly Pro Glu Glu Leu Leu
85 90 95
Lys Glu Asn Pro Arg Leu Ile Tyr Ala Arg Leu Thr Gly Tyr Gly Gln
100 105 110
Ser Gly Ser Tyr Ala Lys Ser Ala Gly His Asp Ile Asn Tyr Leu Ala
115 120 125
Met Ser Gly Leu Leu Ser Met Leu Gly Arg Ser Ser Glu Lys Pro Tyr
130 135 140
Ala Pro Leu Asn Leu Val Ala Asp Phe Ala Gly Gly Gly Leu Met Cys
145 150 155 160
Ala Leu Gly Ile Val Leu Ala Leu Leu Glu Arg Asn Glu Ser Gly Gln
165 170 175
Gly Gln Ile Ile Asp Ala Ser Met Val Glu Gly Ala Ala Tyr Val Gly
180 185 190
Ser Phe Met Trp Lys Ser Arg Ser Leu Gly Leu Trp Asn Arg Pro Arg
195 200 205
Gly Glu Asn Met Leu Asp Ser Gly Ala Pro Phe Tyr Asp Thr Tyr Gln
210 215 220
Thr Ser Asp Gly Lys His Met Ala Val Gly Ala Ile Glu Pro Gln Phe
225 230 235 240
Tyr Asp His Leu Ile Lys Gly Leu Gly Leu Asp Ala Ala Ser Leu Pro
245 250 255
Ala Gln Met Ser Ile Ser Asp Trp Thr Glu Leu Arg Arg Thr Phe Thr
260 265 270
Gln Val Phe Ala Gln Lys Thr Gln Ala Glu Trp Ser Arg Ile Phe Asp
275 280 285
Gly Thr Asp Ala Cys Val Thr Pro Val Leu Pro Leu Asp Glu Ala Gly
290 295 300
Ser His Pro His Asn Arg Glu Arg Gly Ser Phe Leu Lys Asp Ala Gln
305 310 315 320
Gly Glu Val Ser Pro Arg Pro Ala Pro Val Leu Ser Arg Thr Pro Ala
325 330 335
Arg Pro Cys Leu Ser Arg Asp Pro Val Val Gly Glu His Thr Arg Ser
340 345 350
Val Leu Gly Glu Tyr Gly Phe Asp Pro Asp His Ile Glu Gln Leu Leu
355 360 365
Ser Ala Gly Val Val Glu Cys Asn Glu Ala Lys Ala Arg Leu
370 375 380
<210> 152
<211> 1149
<212> DNA
<213> Ictalurus furcatus
<400> 152
atggctttag caggtgttag agttattgaa ttggctggtt tagctccagc accattttgt 60
ggtatgattt tgtctgattt tggtgctaga gttattagag ttgatagaac taaggttaca 120
atggcaatgg atgctcaagc aagaggtaaa caatctgttg ctttgaattt gaagtcacca 180
caaggtgttg cagttttgaa gaaattgtgt ttgcaatctg atattgtttt ggaacctttt 240
agaaagggtg ttatggaaaa attgggtttg ggtccagaag aattgttgaa ggaaaaccca 300
agattgatct atgctagatt gactggttac ggtcaatctg gttcttatgc taagtcagca 360
ggtcatgata ttaactactt agctatgtct ggtttgttat caatgttggg tagatcatct 420
gaaaaaccat acgctccatt gaatttggtt gctgattttg ctggtggtgg tttgatgtgt 480
gctttaggta tcgttttggc attgttagaa agaaacgaat ctggtcaagg tcaaattatt 540
gatgcttcaa tggttgaagg tgctgcatac gttggttctt ttatgtggaa atcaagatca 600
ttgggtttat ggaatagacc aagaggtgaa aatatgttag attctggtgc accattttat 660
gatacttacc aaacatcaga tggtaaacac atggctgttg gtgcaatcga accacaattc 720
tacgatcatt tgattaaagg tttgggttta gatgctgcat ctttgccagc tcaaatgtct 780
atttcagatt ggactgaatt aagaagaact tttacacaag ttttcgctca aaagactcaa 840
gcagaatggt caagaatttt tgatggtact gatgcttgtg ttacaccagt tttgccatta 900
gatgaagcag gttctcatcc acataacaga gaaagaggtt catttttgaa agatgctcaa 960
ggtgaagttt ctccaagacc agctccagtt ttatcaagaa ctccagcaag accatgtttg 1020
tcaagagatc cagttgttgg tgaacataca agatcagttt tgggtgaata cggtttcgat 1080
ccagatcata tcgaacaatt gttatctgct ggtgttgttg aatgtaatga agctaaagca 1140
agattgtaa 1149
<210> 153
<211> 378
<212> PRT
<213> Pediculus humanus subsp. corporis
<400> 153
Met Pro Leu Lys Gly Ile Lys Val Leu Glu Leu Ala Gly Leu Ala Pro
1 5 10 15
Ser Pro Phe Cys Gly Ala Ile Leu Ala Asp Phe Gly Ala Ser Val Ile
20 25 30
Arg Ile Asp Lys Ile Ser Ser Ser Ser Thr Ala Asp Cys Leu Ser Asn
35 40 45
Gly Lys Lys Ser Leu Ala Leu Asn Leu Lys Asp Glu Glu Gly Lys Asn
50 55 60
Ile Phe Lys Lys Leu Ser Ser Asn Ala Asp Val Leu Leu Glu Pro Phe
65 70 75 80
Arg Lys Gly Val Met Glu Ser Leu Glu Leu Gly Pro Glu Asn Leu Met
85 90 95
Lys Ser Asn Pro Arg Leu Ile Tyr Ala Arg Leu Ser Gly Phe Gly Gln
100 105 110
Tyr Gly Leu Tyr Ser Ser Arg Ala Gly His Asp Ile Asn Phe Leu Ser
115 120 125
Val Ser Gly Val Leu Ser Phe Leu Gly Arg Tyr Asn Glu Lys Pro Thr
130 135 140
Pro Pro Val Asn Leu Leu Ala Asp Phe Gly Gly Gly Gly Leu Leu Cys
145 150 155 160
Ala Leu Gly Ile Val Leu Ala Leu Phe Glu Arg Thr Lys Ser Asn Lys
165 170 175
Gly Gln Ile Ile Asp Cys Ser Met Val Glu Gly Val Ala Tyr Leu Ser
180 185 190
Ser Trp Leu Phe Arg Ser Gln Lys Leu Pro Ile Trp Gly Asn Glu Arg
195 200 205
Gly Leu Asn Ile Leu Asp Thr Gly Ser His Phe Tyr Asp Thr Tyr Glu
210 215 220
Thr Lys Asp Gly Lys Phe Leu Ala Val Gly Ala Leu Glu Thr Gln Phe
225 230 235 240
Tyr Lys Ile Leu Thr Asp His Leu Lys Ser Asn Asp Leu Ser Asp Gln
245 250 255
Trp Ser Asp Phe Ser Lys Lys Lys Lys Ile Ile Thr Asp Ile Phe Lys
260 265 270
Thr Lys Asn Arg Asp Glu Trp Cys Glu Ile Phe Asp Asn Val Asp Ala
275 280 285
Cys Val Thr Pro Val Leu Asp Lys Thr Glu Val Gly Asp His Val His
290 295 300
Asn Lys Glu Arg Glu Ser Phe Thr Arg Leu Thr Asp Gly Thr Met Ile
305 310 315 320
Pro Asn Pro Ala Pro Lys Leu Ser Arg Thr Pro Gly Val Thr Lys Ala
325 330 335
Lys Val Ser His Val Glu Asn Gly Phe Asn Ser Glu Glu Ile Leu Leu
340 345 350
Glu Leu Gly Tyr Asn Lys Glu Glu Ile Lys Glu Leu Asp Leu Asn Gly
355 360 365
Val Ile Lys Ile Ile Thr Ser Ser Lys Leu
370 375
<210> 154
<211> 1137
<212> DNA
<213> Pediculus humanus subsp. corporis
<400> 154
atgccattga agggtattaa agttttggaa ttagctggtt tagcaccatc tccattttgt 60
ggtgctattt tggcagattt tggtgcttca gttattagaa tcgataaaat ttcttcatct 120
tcaacagctg attgtttgtc taacggtaaa aagtctttgg ctttgaattt gaaggatgaa 180
gaaggtaaaa atatttttaa gaaattgtct tctaacgcag atgttttgtt agaacctttt 240
agaaagggtg ttatggaatc tttagaattg ggtccagaaa atttgatgaa gtctaaccca 300
agattgatct atgctagatt gtcaggtttt ggtcaatatg gtttatactc ttcaagagca 360
ggtcatgata ttaatttctt gtctgtttca ggtgttttgt catttttggg tagatacaac 420
gaaaaaccaa caccaccagt taatttgtta gctgattttg gtggtggtgg tttgttatgt 480
gctttgggta tcgttttagc attgttcgaa agaactaagt ctaataaggg tcaaatcatt 540
gattgttcaa tggttgaagg tgttgcatat ttgtcttctt ggttgtttag atcacaaaaa 600
ttgccaattt ggggtaacga aagaggtttg aacattttgg atacaggttc acatttctac 660
gatacttacg aaacaaagga tggtaaattc ttggctgttg gtgcattgga aacacaattc 720
tacaaaattt tgactgatca tttgaagtct aacgatttgt cagatcaatg gtctgatttc 780
tctaagaaaa agaaaattat cacagatatt tttaaaacta aaaatagaga tgaatggtgt 840
gaaatttttg ataacgttga tgcttgtgtt acaccagttt tggataaaac tgaagttggt 900
gaccatgttc ataataagga aagagaatct tttactagat tgacagatgg tactatgatt 960
ccaaatccag ctccaaaatt gtcaagaaca ccaggtgtta ctaaggcaaa agtttctcat 1020
gttgaaaacg gttttaattc agaagaaatt ttgttagaat taggttataa taaggaagaa 1080
attaaagaat tagatttgaa tggtgttatt aaaattatta cttcttcaaa attgtaa 1137
<210> 155
<211> 386
<212> PRT
<213> 肩突硬蜱(Ixodes scapularis)
<400> 155
Met Val Met Ala Leu Lys Gly Ile Lys Val Leu Glu Met Ala Gly Leu
1 5 10 15
Ala Pro Gly Pro Phe Cys Gly Met Val Leu Arg Asp Phe Gly Ala Thr
20 25 30
Val Ile Arg Val Asp Arg Val Ser Pro Ile Arg Asn Leu Ser Asp Asn
35 40 45
Ile Pro Ala Cys Leu Ser Lys Cys Arg Asp Lys Gly Gly Thr Tyr Arg
50 55 60
Cys Ala Val Pro Ser Pro Arg Gln Asp Cys Ser Arg Gln Cys Leu Ala
65 70 75 80
Asn Ala Gly Val Met Glu Arg Val Gly Leu Gly Pro Asp Val Leu Leu
85 90 95
Gln Thr Asn Pro Arg Leu Val Tyr Ala Arg Ile Thr Gly Phe Gly Gln
100 105 110
Thr Gly Pro Phe Ser Met Met Ala Gly His Asp Ile Asn Tyr Leu Ala
115 120 125
Leu Ser Gly Val Leu Ser Met Leu Gly Glu His Gly Arg Lys Pro Ile
130 135 140
Phe Pro Val Asn Val Ile Ala Asp Phe Gly Gly Gly Gly Leu Leu Ala
145 150 155 160
Ala Leu Gly Ile Cys Met Ala Leu Leu Glu Arg Thr Arg Ser Gly Arg
165 170 175
Gly Gln Val Val Asp Thr Ser Met Ala Ser Thr Arg Ser Ala Tyr Leu
180 185 190
Ser Ser Phe Leu Trp Arg Thr Arg Ser Ser Asn Met Ala Val Pro Ile
195 200 205
Trp Ile Asp Glu Arg Gly Lys Asn Ile Leu Asp Gly Gly Thr His Phe
210 215 220
Tyr Asn Val Tyr Glu Thr Lys Asp Arg Lys Tyr Met Ser Val Gly Ala
225 230 235 240
Leu Glu Pro Asn Phe Tyr Lys Glu Leu Ser Trp Val Arg Leu Gly Leu
245 250 255
Glu Pro Asp Thr Val Pro Gln Met Gly Asp Trp Glu Glu Ser Lys Arg
260 265 270
Val Phe Ala Glu Ile Phe Ala Thr Lys Thr Gln Asp Glu Trp Cys Arg
275 280 285
Val Phe Asp Gln Lys Asp Ala Cys Val Val Pro Val Leu Asp His Asp
290 295 300
Thr Ala His Lys His Pro His Asn Ala Ser Arg Glu Ala Phe His Glu
305 310 315 320
Cys Ser Asp Gly Pro Pro Ile Pro Arg Pro Ala Pro Arg Leu Asp Arg
325 330 335
Thr Pro Ala Glu Pro Asp Tyr Lys Glu Pro Leu Val Gly Glu His Ser
340 345 350
Val Glu Val Leu Lys Glu Ala Gly Leu Ser Asp Gly Glu Ile Arg Thr
355 360 365
Leu Leu Gln Ser Gly Thr Val Glu Ala Pro Cys Phe Asp Pro Asn Leu
370 375 380
Arg Leu
385
<210> 156
<211> 1161
<212> DNA
<213> Ixodes scapularis
<400> 156
atggttatgg ctttgaaagg tattaaagtt ttggaaatgg ctggtttagc accaggtcca 60
ttttgtggta tggttttgag agatttcggt gcaactgtta ttagagttga tagagtttca 120
ccaatcagaa atttgtctga taacatccca gcttgtttgt ctaagtgtag agataaaggt 180
ggtacatata gatgtgcagt tccatcacca agacaagatt gttcaagaca atgtttggct 240
aatgcaggtg ttatggaaag agttggttta ggtccagatg ttttgttgca aactaaccca 300
agattggttt atgctagaat tactggtttt ggtcaaacag gtccattttc tatgatggct 360
ggtcatgata ttaactactt ggcattgtca ggtgttttgt ctatgttagg tgaacatggt 420
agaaagccaa tcttcccagt taacgttatc gctgattttg gtggtggtgg tttgttagct 480
gcattgggta tttgtatggc attgttagaa agaactagat caggtagagg tcaagttgtt 540
gatacttcta tggcttctac tagatcagca tacttgtctt catttttatg gagaactaga 600
tcatctaaca tggctgttcc aatttggatc gatgaaagag gtaaaaatat tttagatggt 660
ggtactcatt tctacaacgt ttacgaaaca aaggatagaa aatatatgtc tgttggtgct 720
ttggaaccaa acttctacaa ggaattatca tgggttagat tgggtttaga accagataca 780
gttccacaaa tgggtgactg ggaagaatct aaaagagttt tcgctgaaat ttttgcaact 840
aagacacaag atgaatggtg tagagttttt gatcaaaaag atgcttgtgt tgttccagtt 900
ttagatcatg atactgcaca taaacatcca cataatgctt caagagaagc atttcatgaa 960
tgttctgatg gtccaccaat tccaagacca gctccaagat tggatagaac accagcagaa 1020
ccagattaca aggaaccatt agttggtgaa cattcagttg aagttttgaa agaagctggt 1080
ttgtctgatg gtgaaatcag aactttgtta caatcaggta cagttgaagc accatgtttt 1140
gatccaaatt tgagattata a 1161
<210> 157
<211> 382
<212> PRT
<213> Bos taurus
<400> 157
Met Ala Leu Arg Gly Ile Thr Val Val Glu Leu Ala Gly Leu Ala Pro
1 5 10 15
Val Pro Phe Cys Gly Met Val Leu Ala Asp Phe Gly Ala Gln Val Val
20 25 30
Arg Val Asp Arg Pro Ala Ala Arg Ser Gly Pro Ser Arg Leu Ala Arg
35 40 45
Gly Lys Arg Ser Leu Val Val Asp Leu Lys Gln Pro Arg Gly Ala Ala
50 55 60
Val Leu Arg Arg Leu Cys Ala Arg Ala Asp Val Met Leu Glu Pro Phe
65 70 75 80
Arg Pro Gly Val Met Glu Lys Leu Gln Leu Gly Pro Glu Ile Leu Gln
85 90 95
Lys Glu Asn Pro Arg Leu Ile Tyr Ala Arg Leu Ser Gly Phe Gly Gln
100 105 110
Ser Gly Arg Phe Ser Lys Met Ala Gly His Asp Ile Asn Tyr Leu Ala
115 120 125
Leu Ser Gly Val Leu Ser Arg Ile Gly Arg Ser Gly Glu Asn Pro Tyr
130 135 140
Ala Pro Leu Asn Leu Leu Ala Asp Phe Gly Gly Gly Gly Leu Met Cys
145 150 155 160
Ala Met Gly Ile Ile Met Ala Leu Phe Glu Arg Thr Arg Ser Gly Lys
165 170 175
Gly Gln Val Ile Asp Ala Ser Met Val Glu Gly Thr Ala Tyr Leu Ser
180 185 190
Ser Phe Met Trp Lys Thr Gln Glu Thr Gly Leu Trp Glu Gln Pro Arg
195 200 205
Gly Gln Asn Met Leu Asp Gly Gly Ala Pro Phe Tyr Thr Thr Tyr Arg
210 215 220
Thr Ala Asp Gly Gly Phe Met Ala Val Gly Ala Ile Glu Pro Gln Phe
225 230 235 240
Tyr Glu Leu Leu Ile Lys Gly Leu Gly Leu Lys Ser Asp Glu Leu Pro
245 250 255
Asn Gln Leu Ser Met Lys Asp Trp Pro Glu Met Lys Lys Lys Phe Ala
260 265 270
Asp Ile Phe Ala Lys Lys Thr Lys Ala Glu Trp Cys Gln Ile Phe Asp
275 280 285
Gly Thr Asp Ala Cys Val Thr Pro Val Leu Thr Phe Glu Glu Val Thr
290 295 300
His His Gly His Asn Lys Asp Arg Gly Ser Phe Ile Thr Asp Thr Glu
305 310 315 320
Gln Arg Val Ser Pro Arg Pro Ala Pro Leu Leu Ser Asn Thr Pro Ala
325 330 335
Leu Pro Ser Ile Lys Arg Asp Pro Phe Val Gly Glu His Thr Glu Glu
340 345 350
Ile Leu Lys Glu Phe Gly Phe Ser Gln Lys Glu Ile Asn Gln Leu Lys
355 360 365
Leu Asp Asn Ile Ile Glu Ile His Lys Leu Arg Val Asn Leu
370 375 380
<210> 158
<211> 1149
<212> DNA
<213> Bos taurus
<400> 158
atggctttga gaggtattac tgttgttgaa ttggctggtt tagcaccagt tccattttgt 60
ggtatggttt tagctgattt tggtgcacaa gttgttagag ttgatagacc agctgcaaga 120
tcaggtccat caagattggc tagaggtaaa agatcattgg ttgttgattt gaagcaacca 180
agaggtgctg cagttttgag aagattatgt gctagagcag atgttatgtt ggaacctttt 240
agaccaggtg ttatggaaaa attgcaattg ggtccagaaa ttttacaaaa ggaaaaccca 300
agattgatct atgctagatt gtctggtttc ggtcaatctg gtagattttc aaagatggct 360
ggtcatgata ttaactattt ggcattgtct ggtgttttgt caagaattgg tagatcaggt 420
gaaaatccat acgctccatt aaatttgtta gcagattttg gtggtggtgg tttgatgtgt 480
gctatgggta tcatcatggc attgttcgaa agaactagat caggtaaagg tcaagttatt 540
gatgcttcaa tggttgaagg tacagcatat ttgtcttctt ttatgtggaa gactcaagaa 600
acaggtttgt gggaacaacc aagaggtcaa aatatgttag atggtggtgc tccattttat 660
actacataca gaactgcaga tggtggtttt atggctgttg gtgcaatcga accacaattc 720
tacgaattgt tgattaaagg tttgggtttg aagtctgatg aattgccaaa ccaattgtca 780
atgaaggatt ggccagaaat gaaaaagaaa ttcgctgata ttttcgctaa gaaaactaag 840
gctgaatggt gtcaaatttt tgatggtaca gatgcatgtg ttactccagt tttgacattc 900
gaagaagtta cacatcatgg tcataataag gatagaggtt cttttattac tgatacagaa 960
caaagagttt caccaagacc agctccattg ttatctaata ctccagcatt gccatcaatt 1020
aaaagagatc catttgttgg tgaacataca gaagaaattt taaaagaatt tggtttttct 1080
caaaaagaaa ttaatcaatt gaaattggat aatattattg aaattcataa attgagagtt 1140
aatttgtaa 1149
<210> 159
<211> 681
<212> PRT
<213> Homo sapiens
<400> 159
Met Gly Ser Pro Val His Arg Val Ser Leu Gly Asp Thr Trp Ser Arg
1 5 10 15
Gln Met His Pro Asp Ile Glu Ser Glu Arg Tyr Met Gln Ser Phe Asp
20 25 30
Val Glu Arg Leu Thr Asn Ile Leu Asp Gly Gly Ala Gln Asn Thr Ala
35 40 45
Leu Arg Arg Lys Val Glu Ser Ile Ile His Ser Tyr Pro Glu Phe Ser
50 55 60
Cys Lys Asp Asn Tyr Phe Met Thr Gln Asn Glu Arg Tyr Lys Ala Ala
65 70 75 80
Met Arg Arg Ala Phe His Ile Arg Leu Ile Ala Arg Arg Leu Gly Trp
85 90 95
Leu Glu Asp Gly Arg Glu Leu Gly Tyr Ala Tyr Arg Ala Leu Ser Gly
100 105 110
Asp Val Ala Leu Asn Ile His Arg Val Phe Val Arg Ala Leu Arg Ser
115 120 125
Leu Gly Ser Glu Glu Gln Ile Ala Lys Trp Asp Pro Leu Cys Lys Asn
130 135 140
Ile Gln Ile Ile Ala Thr Tyr Ala Gln Thr Glu Leu Gly His Gly Thr
145 150 155 160
Tyr Leu Gln Gly Leu Glu Thr Glu Ala Thr Tyr Asp Ala Ala Thr Gln
165 170 175
Glu Phe Val Ile His Ser Pro Thr Leu Thr Ala Thr Lys Trp Trp Pro
180 185 190
Gly Asp Leu Gly Arg Ser Ala Thr His Ala Leu Val Gln Ala Gln Leu
195 200 205
Ile Cys Ser Gly Ala Arg Arg Gly Met His Ala Phe Ile Val Pro Ile
210 215 220
Arg Ser Leu Gln Asp His Thr Pro Leu Pro Gly Ile Ile Ile Gly Asp
225 230 235 240
Ile Gly Pro Lys Met Asp Phe Asp Gln Thr Asp Asn Gly Phe Leu Gln
245 250 255
Leu Asn His Val Arg Val Pro Arg Glu Asn Met Leu Ser Arg Phe Ala
260 265 270
Gln Val Leu Pro Asp Gly Thr Tyr Val Lys Leu Gly Thr Ala Gln Ser
275 280 285
Asn Tyr Leu Pro Met Val Val Val Arg Val Glu Leu Leu Ser Gly Glu
290 295 300
Ile Leu Pro Ile Leu Gln Lys Ala Cys Val Ile Ala Met Arg Tyr Ser
305 310 315 320
Val Ile Arg Arg Gln Ser Arg Leu Arg Pro Ser Asp Pro Glu Ala Lys
325 330 335
Val Leu Asp Tyr Gln Thr Gln Gln Gln Lys Leu Phe Pro Gln Leu Ala
340 345 350
Ile Ser Tyr Ala Phe His Phe Leu Ala Val Ser Leu Leu Glu Phe Phe
355 360 365
Gln His Ser Tyr Thr Ala Ile Leu Asn Gln Asp Phe Ser Phe Leu Pro
370 375 380
Glu Leu His Ala Leu Ser Thr Gly Met Lys Ala Met Met Ser Glu Phe
385 390 395 400
Cys Thr Gln Gly Ala Glu Met Cys Arg Arg Ala Cys Gly Gly His Gly
405 410 415
Tyr Ser Lys Leu Ser Gly Leu Pro Ser Leu Val Thr Lys Leu Ser Ala
420 425 430
Ser Cys Thr Tyr Glu Gly Glu Asn Thr Val Leu Tyr Leu Gln Val Ala
435 440 445
Arg Phe Leu Val Lys Ser Tyr Leu Gln Thr Gln Met Ser Pro Gly Ser
450 455 460
Thr Pro Gln Arg Ser Leu Ser Pro Ser Val Ala Tyr Leu Thr Ala Pro
465 470 475 480
Asp Leu Ala Arg Cys Pro Ala Gln Arg Ala Ala Asp Phe Leu Cys Pro
485 490 495
Glu Leu Tyr Thr Thr Ala Trp Ala His Val Ala Val Arg Leu Ile Lys
500 505 510
Asp Ser Val Gln His Leu Gln Thr Leu Thr Gln Ser Gly Ala Asp Gln
515 520 525
His Glu Ala Trp Asn Gln Thr Thr Val Ile His Leu Gln Ala Ala Lys
530 535 540
Val His Cys Tyr Tyr Val Thr Val Lys Gly Phe Thr Glu Ala Leu Glu
545 550 555 560
Lys Leu Glu Asn Glu Pro Ala Ile Gln Gln Val Leu Lys Arg Leu Cys
565 570 575
Asp Leu His Ala Ile His Gly Ile Leu Thr Asn Ser Gly Asp Phe Leu
580 585 590
His Asp Ala Phe Leu Ser Gly Ala Gln Val Asp Met Ala Arg Thr Ala
595 600 605
Tyr Leu Asp Leu Leu Arg Leu Ile Arg Lys Asp Ala Ile Leu Leu Thr
610 615 620
Asp Ala Phe Asp Phe Thr Asp Gln Cys Leu Asn Ser Ala Leu Gly Cys
625 630 635 640
Tyr Asp Gly Asn Val Tyr Glu Arg Leu Phe Gln Trp Ala Gln Lys Ser
645 650 655
Pro Thr Asn Thr Gln Glu Asn Pro Ala Tyr Glu Glu Tyr Ile Arg Pro
660 665 670
Leu Leu Gln Ser Trp Arg Ser Lys Leu
675 680
<210> 160
<211> 2046
<212> DNA
<213> Homo sapiens
<400> 160
atgggttcac cagttcatag agtttcttta ggtgacacat ggtcaagaca aatgcatcca 60
gatatcgaat ctgaaagata catgcaatca ttcgatgttg aaagattgac aaacatcttg 120
gatggtggtg ctcaaaacac tgcattgaga agaaaggttg aatcaattat tcattcttat 180
ccagaatttt cttgtaagga taactacttc atgactcaaa atgaaagata caaagctgca 240
atgagaagag ctttccatat cagattgatc gcaagaagat tgggttggtt agaagatggt 300
agagaattgg gttatgctta cagagcatta tctggtgacg ttgctttgaa catccataga 360
gttttcgtta gagcattgag atcattaggt tctgaagaac aaattgctaa atgggaccca 420
ttgtgtaaga acatccaaat catcgctaca tacgcacaaa ctgaattggg tcatggtaca 480
tacttgcaag gtttagaaac agaagctact tatgatgctg caactcaaga attcgttatc 540
cattctccaa ctttgacagc tactaaatgg tggcctggtg acttgggtag atctgctact 600
catgcattag ttcaagctca attgatttgt tcaggtgcta gacgtggtat gcatgctttt 660
attgttccaa tcagatcttt acaagatcat acaccattgc caggtatcat catcggtgac 720
atcggtccaa agatggattt cgatcaaact gataacggtt tcttgcaatt gaaccatgtt 780
agagttccaa gagaaaacat gttgtcaaga ttcgctcaag ttttgccaga tggtacatac 840
gttaagttgg gtactgcaca atctaactat ttgccaatgg ttgttgttag agttgaattg 900
ttgtcaggtg aaatcttgcc aatcttgcaa aaggcttgtg ttatcgcaat gagatactct 960
gttattagaa gacaatcaag attaagacca tctgatccag aagctaaagt tttggattac 1020
caaacacaac aacaaaagtt gttcccacaa ttggctatct cttacgcatt ccatttcttg 1080
gctgtttctt tgttggaatt tttccaacat tcatacactg caatcttgaa ccaagatttc 1140
tcatttttgc cagaattgca tgctttgtct actggtatga aagcaatgat gtcagaattt 1200
tgtactcaag gtgctgaaat gtgtagaaga gcatgtggtg gtcatggtta ctcaaagttg 1260
tctggtttgc catctttagt tacaaagttg tctgcttcat gtacatacga aggtgaaaac 1320
actgttttgt acttacaagt tgcaagattt ttagtcaagt catacttgca aacacaaatg 1380
tcaccaggtt ctactccaca aagatctttg tcaccatctg ttgcttattt gactgcacca 1440
gatttggcta gatgtccagc acaaagagct gcagatttct tgtgtccaga attgtacact 1500
acagcttggg cacatgttgc tgttagattg attaaagatt ctgttcaaca tttgcaaaca 1560
ttaactcaat caggtgctga tcaacatgaa gcatggaatc aaactacagt tattcatttg 1620
caagctgcaa aggttcattg ttactacgtt acagttaagg gttttactga agctttggaa 1680
aagttggaaa acgaaccagc aatccaacaa gttttgaaga gattgtgtga tttgcatgct 1740
atccatggta ttttaacaaa ctctggtgac tttttgcatg atgcattttt gtcaggtgca 1800
caagttgata tggctagaac tgcatatttg gatttgttga gattgatcag aaaggatgct 1860
atcttgttga cagatgcatt cgatttcact gatcaatgtt tgaactctgc tttgggttgt 1920
tacgatggta acgtttacga aagattgttt caatgggctc aaaaatcacc aacaaacact 1980
caagaaaacc cagcatacga agaatacatc agaccattgt tgcaatcatg gagatctaaa 2040
ttgtaa 2046
<210> 161
<211> 681
<212> PRT
<213> Rattus norvegicus
<400> 161
Met Gly Ser Pro Met His Arg Val Ser Leu Gly Asp His Trp Ser Trp
1 5 10 15
Gln Val His Pro Asp Ile Asp Ser Glu Arg His Ser Pro Ser Phe Ser
20 25 30
Val Glu Arg Leu Thr Asn Ile Leu Asp Gly Gly Leu Pro Asn Thr Val
35 40 45
Leu Arg Arg Lys Val Glu Ser Ile Ile Gln Ser Asp Pro Val Phe Asn
50 55 60
Leu Lys Lys Leu Tyr Phe Met Thr Arg Glu Glu Leu Tyr Glu Asp Ala
65 70 75 80
Ile Gln Lys Arg Phe His Leu Glu Lys Leu Ala Trp Ser Leu Gly Trp
85 90 95
Ser Glu Asp Gly Pro Glu Arg Ile Tyr Ala Asn Arg Val Leu Asp Gly
100 105 110
Asn Val Asn Leu Ser Leu His Gly Val Ala Met Asn Ala Ile Arg Ser
115 120 125
Leu Gly Ser Asp Glu Gln Ile Ala Lys Trp Gly Gln Leu Cys Lys Asn
130 135 140
Phe Gln Ile Ile Thr Thr Tyr Ala Gln Thr Glu Leu Gly His Gly Thr
145 150 155 160
Tyr Leu Gln Gly Leu Glu Thr Glu Ala Thr Tyr Asp Glu Ala Arg Gln
165 170 175
Glu Leu Val Ile His Ser Pro Thr Met Thr Ser Thr Lys Trp Trp Pro
180 185 190
Gly Asp Leu Gly Trp Ser Val Thr His Ala Val Val Leu Ala Gln Leu
195 200 205
Thr Cys Leu Gly Val Arg His Gly Met His Ala Phe Ile Val Pro Ile
210 215 220
Arg Ser Leu Glu Asp His Thr Pro Leu Pro Gly Ile Thr Val Gly Asp
225 230 235 240
Ile Gly Pro Lys Met Gly Leu Glu His Ile Asp Asn Gly Phe Leu Gln
245 250 255
Leu Asn His Val Arg Val Pro Arg Glu Asn Met Leu Ser Arg Phe Ala
260 265 270
Glu Val Leu Pro Asp Gly Thr Tyr Gln Arg Leu Gly Thr Pro Gln Ser
275 280 285
Asn Tyr Leu Gly Met Leu Val Thr Arg Val Gln Leu Leu Cys Lys Gly
290 295 300
Ile Leu Pro Ser Leu Gln Lys Ala Cys Ile Ile Ala Thr Arg Tyr Ser
305 310 315 320
Val Ile Arg His Gln Ser Arg Leu Arg Pro Ser Asp Pro Glu Ala Lys
325 330 335
Ile Leu Glu Tyr Gln Thr Gln Gln Gln Lys Leu Leu Pro Gln Leu Ala
340 345 350
Val Ser Tyr Ala Phe His Phe Thr Ala Thr Ser Leu Ser Glu Phe Phe
355 360 365
His Ser Ser Tyr Ser Ala Ile Leu Lys Arg Asp Phe Ser Leu Leu Pro
370 375 380
Glu Leu His Ala Leu Ser Thr Gly Met Lys Ala Thr Phe Ala Asp Phe
385 390 395 400
Cys Ala Gln Gly Ala Glu Ile Cys Arg Arg Ala Cys Gly Gly His Gly
405 410 415
Tyr Ser Lys Leu Ser Gly Leu Pro Thr Leu Val Ala Arg Ala Thr Ala
420 425 430
Ser Cys Thr Tyr Glu Gly Glu Asn Thr Val Leu Tyr Leu Gln Val Ala
435 440 445
Arg Phe Leu Met Lys Ser Tyr Leu Gln Ala Gln Ala Ser Pro Gly Ala
450 455 460
Thr Pro Gln Lys Pro Leu Pro Gln Ser Val Met Tyr Ile Ala Thr Gln
465 470 475 480
Arg Pro Ala Arg Cys Ser Ala Gln Thr Ala Ala Asp Phe Arg Cys Pro
485 490 495
Asp Val Tyr Thr Thr Ala Trp Ala Tyr Val Ser Thr Arg Leu Ile Arg
500 505 510
Asp Ala Ala His Arg Thr Gln Thr Leu Met Lys Ser Gly Val Asp Gln
515 520 525
His Asp Ala Trp Asn Gln Thr Thr Val Ile His Leu Gln Ala Ala Lys
530 535 540
Ala His Cys Tyr Phe Ile Thr Val Lys Asn Phe Lys Glu Ala Val Glu
545 550 555 560
Lys Leu Asp Lys Glu Pro Glu Ile Gln Arg Val Leu Gln Arg Leu Cys
565 570 575
Asp Leu Tyr Ala Leu His Gly Val Leu Thr Asn Ser Gly Asp Phe Leu
580 585 590
His Asp Gly Phe Leu Ser Gly Ala Gln Val Asp Met Ala Arg Glu Ala
595 600 605
Phe Leu Asp Leu Leu Pro Leu Ile Arg Lys Asp Ala Ile Leu Leu Thr
610 615 620
Asp Ala Phe Asp Phe Ser Asp His Cys Leu Asn Ser Ala Leu Gly Cys
625 630 635 640
Tyr Asp Gly His Val Tyr Glu Arg Leu Phe Glu Trp Ala Gln Lys Tyr
645 650 655
Pro Ala Asn Thr Gln Glu Asn Pro Ala Tyr Lys Lys Tyr Ile Arg Pro
660 665 670
Leu Met Leu Gly Trp Arg His Lys Met
675 680
<210> 162
<211> 2046
<212> DNA
<213> Rattus norvegicus
<400> 162
atgggttcac caatgcatag agtttcttta ggtgaccatt ggtcatggca agttcatcca 60
gatattgatt ctgaaagaca ttctccatca ttttctgttg aaagattgac aaacatcttg 120
gatggtggtt tgccaaacac tgttttgaga agaaaggttg aatcaatcat tcaatctgat 180
ccagttttta atttgaagaa attgtacttc atgacaagag aagaattgta cgaagatgca 240
atccaaaaga gattccattt ggaaaagttg gcttggtcat taggttggtc tgaagatggt 300
ccagaaagaa tctatgctaa cagagttttg gatggtaacg ttaatttgtc tttacatggt 360
gttgcaatga atgctatcag atcattaggt tctgatgaac aaattgcaaa atggggtcaa 420
ttgtgtaaga acttccaaat catcactaca tatgctcaaa cagaattggg tcatggtact 480
tacttgcaag gtttagaaac agaagcaact tacgatgaag ctagacaaga attagttatt 540
cattcaccaa ctatgacatc tactaaatgg tggcctggtg acttgggttg gtctgttaca 600
catgcagttg ttttggctca attaacttgt ttgggtgtta gacatggtat gcatgctttt 660
attgttccaa tcagatcatt agaagatcat acaccattgc caggtattac tgttggtgac 720
attggtccaa agatgggttt agaacatatc gataacggtt tcttgcaatt gaaccatgtt 780
agagttccaa gagaaaacat gttgtctaga ttcgctgaag ttttgccaga tggtacatac 840
caaagattgg gtactccaca atcaaactat ttgggcatgt tggttacaag agttcaattg 900
ttgtgtaagg gtattttacc atctttgcaa aaggcatgta tcatcgctac tagatactca 960
gttattagac atcaatcaag attaagacca tctgatccag aagctaagat cttggaatac 1020
caaactcaac aacaaaagtt gttgccacaa ttggcagttt cttacgcttt ccatttcaca 1080
gcaacttcat tgtctgaatt tttccattct tcatactctg ctatcttgaa gagagatttc 1140
tcattgttgc cagaattgca tgcattgtct actggtatga aagcaacttt tgctgatttt 1200
tgtgcacaag gtgctgaaat ttgtagaaga gcttgtggtg gtcatggtta ctcaaagttg 1260
tctggtttgc caacattagt tgctagagca acagcttctt gtacttacga aggtgaaaac 1320
actgttttgt acttacaagt tgctagattt ttaatgaagt cttacttgca agcacaagct 1380
tcaccaggtg caacaccaca aaaaccattg ccacaatctg ttatgtatat tgctacacaa 1440
agaccagcaa gatgttcagc tcaaactgct gcagatttta gatgtccaga tgtttatact 1500
acagcatggg cttacgtttc tactagatta attagagatg ctgcacatag aacacaaact 1560
ttgatgaaat caggtgttga tcaacatgat gcttggaatc aaactacagt tattcatttg 1620
caagctgcaa aagcacattg ttacttcatc acagttaaaa atttcaaaga agctgttgaa 1680
aagttggata aggaaccaga aatccaaaga gttttgcaaa gattgtgtga tttgtacgca 1740
ttgcatggtg ttttgactaa ctctggtgac tttttgcatg atggtttctt gtcaggtgct 1800
caagttgata tggcaagaga agcatttttg gatttgttgc cattgatcag aaaggatgca 1860
atcttgttga cagatgcttt cgatttctct gatcattgtt tgaactcagc attgggttgt 1920
tatgatggtc atgtttacga aagattgttt gaatgggcac aaaagtaccc agctaacact 1980
caagaaaacc cagcttacaa gaaatacatc agaccattga tgttaggttg gagacataaa 2040
atgtaa 2046
<210> 163
<211> 681
<212> PRT
<213> Mus musculus
<400> 163
Met Gly Asn Pro Gly Asp Arg Val Ser Leu Gly Glu Thr Trp Ser Arg
1 5 10 15
Glu Val His Pro Asp Ile Asp Ser Glu Arg His Ser Pro Ser Phe Ser
20 25 30
Val Glu Arg Leu Thr Asn Ile Leu Asp Gly Gly Ile Pro Asn Thr Glu
35 40 45
Leu Arg Arg Arg Val Glu Ser Leu Ile Gln Arg Asp Pro Val Phe Asn
50 55 60
Leu Lys His Leu Tyr Phe Met Thr Arg Asp Glu Leu Tyr Glu Asp Ala
65 70 75 80
Val Gln Lys Arg Phe His Leu Glu Lys Leu Ala Trp Ser Leu Gly Trp
85 90 95
Ser Glu Asp Gly Pro Glu Arg Ile Tyr Ala Asp Arg Val Leu Ala Gly
100 105 110
Tyr Asn Asn Leu Asn Leu His Gly Ile Ala Met Asn Ala Ile Arg Ser
115 120 125
Leu Gly Ser Asp Glu Gln Ile Ala Lys Trp Gly Gln Leu Gly Lys Asn
130 135 140
Phe Gln Ile Ile Thr Thr Tyr Ala Gln Thr Glu Leu Gly His Gly Thr
145 150 155 160
Tyr Leu Gln Gly Leu Glu Thr Glu Ala Thr Tyr Asp Ala Thr Thr Gln
165 170 175
Glu Phe Val Ile His Ser Pro Thr Met Thr Ser Ile Lys Trp Trp Pro
180 185 190
Gly Asp Leu Gly Arg Thr Val Thr His Ala Val Val Leu Ala His Leu
195 200 205
Ile Cys Leu Gly Ala Arg His Gly Met His Ala Phe Ile Val Pro Ile
210 215 220
Arg Ser Leu Glu Asp His Thr Pro Leu Pro Gly Ile Thr Val Gly Asp
225 230 235 240
Ile Gly Pro Lys Met Gly Phe Glu Asn Ile Asp Asn Gly Phe Leu Arg
245 250 255
Leu Asn His Val Arg Val Pro Arg Glu Asn Met Leu Ser Arg Phe Ala
260 265 270
Glu Val Leu Pro Asp Gly Thr Tyr Gln Arg Leu Gly Thr Pro Gln Ser
275 280 285
Asn Tyr Leu Gly Met Leu Val Thr Arg Val Gln Leu Leu Tyr Lys Gly
290 295 300
Phe Leu Pro Thr Leu Gln Lys Ala Cys Thr Ile Ala Val Arg Tyr Ala
305 310 315 320
Val Ile Arg His Gln Ser Arg Leu Arg Pro Ser Asp Pro Glu Ala Lys
325 330 335
Ile Leu Glu Tyr Gln Thr Gln Gln Gln Lys Leu Leu Pro Gln Leu Ala
340 345 350
Val Ser Tyr Ala Leu His Phe Met Thr Thr Ser Leu Leu Gln Phe Phe
355 360 365
His Ser Ser Tyr Ser Asp Ile Leu Lys Arg Asp Phe Ser Leu Leu Pro
370 375 380
Glu Leu His Ala Leu Ser Thr Gly Met Lys Ala Met Ser Ser Asp Phe
385 390 395 400
Cys Ala Gln Gly Thr Glu Ile Cys Arg Arg Ala Cys Gly Gly His Gly
405 410 415
Tyr Ser Lys Leu Ser Gly Leu Pro Thr Leu Val Thr Gln Ala Ile Ala
420 425 430
Ser Cys Thr Tyr Glu Gly Glu Asn Thr Val Leu Tyr Leu Gln Val Ala
435 440 445
Arg Phe Leu Met Lys Ser Tyr Leu Gln Ala Gln Val Ser Pro Gly Ser
450 455 460
Ile Pro Gln Lys Pro Leu Pro Gln Ser Val Met Tyr Leu Ala Thr Pro
465 470 475 480
Arg Pro Ala Arg Cys Pro Ala Gln Thr Ala Ala Asp Phe Arg Cys Pro
485 490 495
Glu Val Tyr Thr Thr Ala Trp Ala Tyr Val Ser Ala Arg Leu Ile Arg
500 505 510
Asp Ala Thr Gln His Thr Gln Thr Leu Met Arg Ser Gly Val Asp Gln
515 520 525
Tyr Asp Ala Trp Asn Gln Thr Ser Val Ile His Leu Gln Ala Ala Lys
530 535 540
Ala His Cys Tyr Phe Leu Thr Val Arg Asn Phe Lys Glu Ala Val Glu
545 550 555 560
Lys Leu Asp Asn Glu Pro Glu Ile Gln Arg Val Leu Gln Asn Leu Cys
565 570 575
Asp Leu Tyr Ala Leu Asn Gly Ile Leu Thr Asn Ser Gly Asp Phe Leu
580 585 590
His Asp Gly Phe Leu Ser Gly Ala Gln Val Asp Met Ala Arg Thr Ala
595 600 605
Phe Leu Asp Leu Leu Pro Leu Ile Arg Lys Asp Ala Ile Leu Leu Thr
610 615 620
Asp Ala Phe Asp Phe Ser Asp His Cys Leu Asn Ser Ala Leu Gly Cys
625 630 635 640
Tyr Asp Gly His Val Tyr Gln Arg Leu Phe Glu Trp Ala Gln Lys Ser
645 650 655
Pro Ala Asn Thr Gln Glu Asn Pro Ala Tyr Lys Lys Tyr Ile Arg Pro
660 665 670
Leu Met Gln Ser Trp Lys Pro Lys Leu
675 680
<210> 164
<211> 2046
<212> DNA
<213> Mus musculus
<400> 164
atgggtaatc caggtgacag agtttctttg ggtgaaactt ggtctagaga agttcatcca 60
gatattgatt cagaaagaca ttctccatca ttttctgttg aaagattgac taacatcttg 120
gatggtggta ttccaaacac agaattgaga agaagagttg aatctttgat ccaaagagat 180
ccagttttta atttgaagca tttgtacttc atgacaagag atgaattata cgaagatgct 240
gttcaaaaga gattccattt ggaaaagttg gcatggtcat tgggttggtc tgaagatggt 300
ccagaaagaa tctatgcaga tagagttttg gctggttaca acaatttgaa tttgcatggt 360
attgctatga atgcaattag atcattgggt tctgatgaac aaattgctaa atggggtcaa 420
ttgggtaaaa atttccaaat catcactaca tatgcacaaa ctgaattggg tcatggtaca 480
tacttgcaag gtttagaaac tgaagctaca tacgatgcaa ctacacaaga attcgttatc 540
cattcaccaa ctatgacatc tattaaatgg tggcctggtg acttgggtag aactgttaca 600
catgctgttg ttttggcaca tttgatttgt ttgggtgcaa gacatggtat gcatgctttt 660
attgttccaa tcagatcttt ggaagatcat actccattac caggtattac agttggtgac 720
atcggtccaa agatgggttt cgaaaacatc gataacggtt tcttgagatt gaaccatgtt 780
agagttccaa gagaaaacat gttgtcaaga ttcgctgaag ttttaccaga tggtacttac 840
caaagattgg gtacaccaca atctaactat ttgggcatgt tggttactag agttcaattg 900
ttgtacaagg gtttcttgcc aactttgcaa aaagcttgta caattgcagt tagatacgct 960
gttattagac atcaatcaag attaagacca tctgatccag aagctaagat cttggaatac 1020
caaacacaac aacaaaagtt gttgccacaa ttggcagttt catacgcttt gcatttcatg 1080
actacatctt tgttgcaatt tttccattct tcatactcag atatcttgaa gagagatttc 1140
tctttgttgc cagaattgca tgcattgtca actggtatga aagctatgtc ttcagatttt 1200
tgtgcacaag gtacagaaat ttgtagaaga gcttgtggtg gtcatggtta ctcaaagttg 1260
tctggtttgc caactttagt tacacaagct attgcatctt gtacttacga aggtgaaaac 1320
acagttttgt acttacaagt tgctagattt ttgatgaagt catacttaca agcacaagtt 1380
tcaccaggtt ctattccaca aaaaccattg ccacaatctg ttatgtattt ggctactcca 1440
agaccagcaa gatgtccagc tcaaacagct gcagatttta gatgtccaga agtttatact 1500
acagcttggg catacgtttc tgcaagattg attagagatg ctactcaaca tactcaaaca 1560
ttaatgagat caggtgttga tcaatacgat gcttggaatc aaacttctgt tattcatttg 1620
caagctgcaa aagctcattg ttactttttg acagttagaa acttcaagga agcagttgaa 1680
aagttggata acgaaccaga aatccaaaga gttttgcaaa atttgtgtga tttgtacgct 1740
ttgaacggta ttttaacaaa ctctggtgac tttttgcatg atggtttctt gtctggtgca 1800
caagttgata tggctagaac tgcatttttg gatttgttgc cattgatcag aaaggatgca 1860
atcttgttga cagatgcttt cgatttctca gatcattgtt tgaactctgc tttaggttgt 1920
tatgatggtc atgtttacca aagattgttt gaatgggcac aaaaatcacc agctaacact 1980
caagaaaacc cagcttacaa gaaatacatc agaccattga tgcaatcttg gaaaccaaaa 2040
ttataa 2046
<210> 165
<211> 681
<212> PRT
<213> Oryctolagus cuniculus
<400> 165
Met Gly Ile Pro Val His Arg Val Ser Leu Gly Asp Ala Trp Ser Ser
1 5 10 15
Arg Met His Pro Asp Met Glu Ser Glu Arg Cys Ala Gln Ser Phe Ser
20 25 30
Val Glu Arg Leu Thr Asn Ile Leu Asp Gly Gly Ala Gln His Thr Ala
35 40 45
Leu Arg Arg Lys Val Glu Ser Ile Ile His Gly Asn Pro Gln Phe Ser
50 55 60
Ser Lys Asp Asn Tyr Phe Met Ser Gln Asn Glu Leu Tyr Glu Ala Ala
65 70 75 80
Thr Arg Lys Arg Tyr His Leu Gln Lys Ile Ala Gln Arg Met Gly Trp
85 90 95
Thr Glu Glu Gly Arg Glu Leu Glu Tyr Ala His Arg Ala Leu Ser Ala
100 105 110
Asp Leu Asn Leu Asn Leu Gln Gly Ile Phe Leu Lys Ala Leu Arg Ser
115 120 125
Leu Gly Ser Glu Glu Gln Ile Ala Lys Trp Glu Pro Leu Gly Lys Thr
130 135 140
Phe Gln Ile Ile Ser Thr Tyr Ala Gln Thr Glu Leu Gly His Gly Thr
145 150 155 160
Tyr Leu Gln Gly Leu Glu Thr Glu Ala Thr Tyr Asp Ala Ala Thr Gln
165 170 175
Glu Phe Val Ile His Ser Pro Thr Val Thr Ala Thr Lys Trp Trp Pro
180 185 190
Gly Asp Leu Gly Arg Ser Ala Thr His Ala Leu Ile Leu Ala Gln Leu
195 200 205
Ile Cys Ser Gly Ala Arg Arg Gly Met His Ala Phe Ile Val Pro Val
210 215 220
Arg Ser Leu Gln Asp His Thr Pro Leu Pro Gly Ile Thr Ile Gly Asp
225 230 235 240
Ile Gly Pro Lys Met Gly Leu Gln His Ile Asp Asn Gly Phe Leu Lys
245 250 255
Met Asp His Val Arg Val Pro Arg Glu Asn Met Leu Ser Arg Phe Ala
260 265 270
Gln Val Leu Pro Asp Gly Ala Tyr Ile Lys Leu Gly Thr Ala Gln Ser
275 280 285
Asn Tyr Leu Gly Met Leu Val Thr Arg Val His Leu Leu Leu Gly Ala
290 295 300
Ile Leu Ser Pro Leu Gln Lys Ala Cys Val Ile Ala Thr Arg Tyr Ser
305 310 315 320
Val Ile Arg His Gln Cys Arg Leu Arg Pro Ser Asp Pro Glu Val Lys
325 330 335
Ile Leu Glu His Gln Thr Gln Gln Gln Lys Leu Phe Pro Gln Leu Ala
340 345 350
Met Cys Tyr Ala Phe His Phe Leu Ala Thr Gly Leu Leu Glu Phe Phe
355 360 365
Gln Gln Ala Tyr Lys Asn Ile Leu Asp Arg Asp Phe Thr Leu Leu Pro
370 375 380
Glu Leu His Ala Leu Ser Thr Gly Thr Lys Ala Met Met Ser Asp Phe
385 390 395 400
Cys Thr Gln Gly Ala Glu Gln Cys Arg Arg Ala Cys Gly Gly His Gly
405 410 415
Tyr Ser Lys Leu Ser Gly Leu Pro Ser Leu Val Thr Ser Val Thr Ala
420 425 430
Ser Cys Thr Tyr Glu Gly Glu Asn Thr Val Leu Tyr Leu Gln Val Ala
435 440 445
Arg Phe Leu Val Lys Ser Cys Leu Gln Ala Gln Gly Phe Pro Gly Ser
450 455 460
Thr Ser Gln Arg Ser Leu Pro Arg Ser Val Ser Tyr Leu Ala Leu Pro
465 470 475 480
Asp Leu Ala Arg Cys Pro Ala Gln Thr Ala Ala Asp Phe Phe Cys Pro
485 490 495
Ala Leu Tyr Thr Ala Ala Trp Ala His Val Ala Ala Arg Leu Thr Lys
500 505 510
Asp Ser Val His His Leu Gln Ala Leu Arg Gln Ser Gly Ala Asp Glu
515 520 525
His Glu Ala Trp Asn Gln Thr Thr Ile Ile His Leu Gln Ala Ala Lys
530 535 540
Ala His Cys Tyr Tyr Ile Ser Val Lys Ser Phe Lys Glu Ala Leu Glu
545 550 555 560
Lys Leu Glu Asn Glu Pro Ala Ile Gln Gln Val Leu Lys Arg Leu Cys
565 570 575
Asp Leu His Ala Leu His Gly Ile Leu Thr Asn Ser Gly Asp Phe Leu
580 585 590
His Asp Gly Phe Leu Ser Gly Ala Gln Val Asp Met Ala Arg Thr Ala
595 600 605
Tyr Met Asp Leu Leu Pro Leu Ile Arg Lys Asp Ala Ile Leu Leu Thr
610 615 620
Asp Ala Phe Asp Phe Thr Asp Gln Cys Leu Asn Ser Ala Leu Gly Cys
625 630 635 640
Tyr Asp Gly Asn Val Tyr Glu Arg Leu Phe Glu Trp Ala Gln Arg Ser
645 650 655
Pro Thr Asn Thr Gln Glu Asn Pro Ala Tyr Lys Lys Tyr Ile Gln Pro
660 665 670
Leu Leu Gln Ser Trp Arg Ser Asn Leu
675 680
<210> 166
<211> 2046
<212> DNA
<213> Oryctolagus cuniculus
<400> 166
atgggtattc cagttcatag agtttcattg ggtgacgctt ggtcttcaag aatgcatcca 60
gatatggaat ctgaaagatg tgcacaatct ttttcagttg aaagattgac aaacatctta 120
gatggtggtg cacaacatac tgctttgaga agaaaggttg aatcaatcat ccatggtaac 180
ccacaatttt cttcaaagga taactacttc atgtctcaaa atgaattgta cgaagctgca 240
actagaaaga gataccattt gcaaaagatt gctcaaagaa tgggttggac agaagaaggt 300
agagaattgg aatatgctca tagagcattg tctgctgatt tgaatttgaa tttgcaaggt 360
attttcttga aggctttgag atctttgggt tcagaagaac aaattgcaaa atgggaacca 420
ttgggtaaaa ctttccaaat catctctaca tacgctcaaa ctgaattagg tcatggtact 480
tatttgcaag gtttggaaac agaagcaact tacgatgctg caacacaaga attcgttatc 540
cattctccaa ctgttacagc tactaaatgg tggcctggtg acttgggtag atcagcaaca 600
catgctttga tcttggcaca attgatctgt tctggtgcaa gacgtggtat gcatgctttt 660
attgttccag ttagatcttt gcaagatcat acaccattac caggtattac tattggtgac 720
attggtccaa aaatgggttt gcaacatatc gataacggtt tcttgaagat ggatcatgtt 780
agagttccaa gagaaaacat gttgtcaaga ttcgcacaag ttttgccaga tggtgcttac 840
atcaagttgg gtacagcaca atctaactac ttaggcatgt tggttactag agttcatttg 900
ttgttgggtg ctattttatc accattgcaa aaggcatgtg ttatcgctac tagatactca 960
gttattagac atcaatgtag attaagacca tctgatccag aagttaagat cttggaacat 1020
caaacacaac aacaaaagtt gttcccacaa ttggcaatgt gttacgcttt ccatttcttg 1080
gctactggtt tattggaatt tttccaacaa gcatacaaga acatcttgga tagagatttc 1140
acattgttgc cagaattgca tgctttgtca acaggtacta aagcaatgat gtctgatttt 1200
tgtactcaag gtgcagaaca atgtagaaga gcttgtggtg gtcatggtta ctctaaatta 1260
tcaggtttgc catctttagt tacatcagtt actgcttctt gtacatacga aggtgaaaac 1320
actgttttgt atttgcaagt tgctagattt ttggttaagt catgtttgca agcacaaggt 1380
tttccaggtt ctacttcaca aagatctttg ccaagatctg tttcatattt ggctttacca 1440
gatttggcaa gatgtccagc tcaaacagct gcagatttct tttgtccagc tttgtacact 1500
gctgcatggg cacatgttgc tgcaagattg acaaaggatt cagttcatca tttgcaagct 1560
ttaagacaat ctggtgcaga tgaacatgaa gcttggaacc aaactacaat catccatttg 1620
caagctgcaa aagctcattg ttactacatc tctgttaagt cttttaaaga agctttggaa 1680
aagttggaaa acgaaccagc aatccaacaa gttttgaaga gattgtgtga tttgcatgct 1740
ttgcatggta ttttgactaa ttctggtgac tttttgcatg atggtttctt gtctggtgca 1800
caagttgata tggcaagaac agcttacatg gatttgttgc cattgatcag aaaggatgca 1860
atcttgttga cagatgcttt cgatttcact gatcaatgtt tgaactctgc tttaggttgt 1920
tacgatggta acgtttacga aagattgttc gaatgggctc aaagatcacc aacaaacact 1980
caagaaaacc cagcatacaa gaaatacatc caaccattgt tgcaatcttg gagatcaaat 2040
ttgtaa 2046
<210> 167
<211> 661
<212> PRT
<213> Caenorhabditis elegans
<400> 167
Met Ala Asn Arg Ser Ile Arg Asp Gly Asp Asn Pro Glu Leu Leu Glu
1 5 10 15
Glu Arg Arg Met Ala Thr Phe Asp Thr Asp Lys Met Ala Ala Val Ile
20 25 30
Tyr Gly Ser Glu Glu Phe Ala Arg Arg Arg Arg Glu Ile Thr Asp Ala
35 40 45
Val Ser Lys Ile Pro Glu Leu Ala Asp Ile Lys Pro Tyr Pro Phe Leu
50 55 60
Thr Arg Glu Glu Lys Val Thr Glu Gly Thr Arg Lys Ile Ser Ile Leu
65 70 75 80
Thr Lys Tyr Leu Asn Gln Leu Ile Asp Arg Asp Asn Glu Glu Glu Ser
85 90 95
Leu His Leu His Arg Glu Val Ile Gly Tyr Glu Gly His Pro Phe Ala
100 105 110
Leu His Asp Ala Leu Phe Ile Pro Thr Leu Gln Ser Gln Ala Ser Asp
115 120 125
Glu Gln Gln Glu Lys Trp Leu Glu Arg Ala Arg Arg Arg Glu Ile Ile
130 135 140
Gly Cys Tyr Ala Gln Thr Glu Leu Gly His Gly Ser Asn Leu Arg Asn
145 150 155 160
Leu Glu Thr Thr Ala Val Tyr Asp Ile Ala Ser Gln Glu Phe Val Leu
165 170 175
His Thr Pro Thr Thr Thr Ala Leu Lys Trp Trp Pro Gly Ala Leu Gly
180 185 190
Lys Ser Cys Asn Tyr Ala Leu Val Val Ala Glu Leu Ile Ile Lys Arg
195 200 205
Asn Asn Tyr Gly Pro His Phe Phe Met Val Gln Leu Arg Asp Glu Lys
210 215 220
Thr His Ile Pro Leu Lys Gly Val Thr Val Gly Asp Ile Gly Pro Lys
225 230 235 240
Met Asn Phe Asn Ala Ala Asp Asn Gly Tyr Leu Gly Leu Asn Asn Leu
245 250 255
Arg Val Pro Arg Thr Asn Leu Leu Met Arg His Cys Lys Val Glu Ala
260 265 270
Asp Gly Thr Tyr Val Lys Pro Pro His Ala Lys Ile Gly Tyr Ser Gly
275 280 285
Met Val Lys Ile Arg Ser Gln Met Ala Met Glu Gln Gly Leu Phe Leu
290 295 300
Ala His Ala Leu Thr Ile Ala Ala Arg Tyr Ser Ala Val Arg Arg Gln
305 310 315 320
Gly His Leu Asp Asp Lys Gln Val Glu Val Lys Val Leu Asp Tyr Gln
325 330 335
Thr Gln Gln His Arg Leu Phe Pro Ser Leu Ala Arg Ala Tyr Ala Phe
340 345 350
Ile Phe Thr Gly Phe Glu Thr Ile His Leu Tyr Ser Gln Leu Leu Lys
355 360 365
Asp Val Asp Met Gly Asn Thr Ser Gly Met Ala Asp Leu His Ala Leu
370 375 380
Thr Ser Gly Leu Lys Ser Val Val Ala His Glu Thr Gly Glu Gly Ile
385 390 395 400
Glu Gln Ala Arg Met Ala Cys Gly Gly His Gly Tyr Ser Met Ala Ser
405 410 415
Tyr Ile Ser Val Val Tyr Gly Ile Ala Ile Gly Gly Cys Thr Tyr Glu
420 425 430
Gly Glu Asn Met Val Met Leu Leu Gln Leu Ala Arg Tyr Leu Val Lys
435 440 445
Ser Val Glu Leu Ile Lys Ala Gly Lys Ala Lys Lys Leu Gly Pro Val
450 455 460
Ala Ser Tyr Leu Ala Asp Lys Ser Asp Glu Thr Asp Leu Thr Ser Leu
465 470 475 480
Asn Gly Tyr Val Lys Met Phe Glu Asn Met Ala Arg Arg Gln Ala Trp
485 490 495
Lys Ala Thr Glu Lys Phe Leu Lys Leu Met Glu Ser Gly Glu Ser Arg
500 505 510
Glu Val Ala Trp Asn Lys Ser Ala Val Glu Leu Thr Arg Ala Ser Arg
515 520 525
Leu His Thr Arg Leu Phe Ile Ile Glu Ala Phe Met Arg Arg Val Ser
530 535 540
Arg Ile Glu Asp Ile Pro Val Lys Glu Val Leu Thr Asp Leu Leu His
545 550 555 560
Leu His Val Asn Tyr Glu Leu Leu Asp Val Ala Thr Tyr Ala Leu Glu
565 570 575
Phe Met Ser Phe Thr Gln Leu Asp Tyr Val Arg Asp Gln Leu Tyr Leu
580 585 590
Tyr Leu Glu Lys Ile Arg Pro Asn Ala Val Ser Leu Val Asp Ser Phe
595 600 605
Gln Ile Ser Asp Met Gln Leu Arg Ser Val Leu Gly Arg Arg Asp Gly
610 615 620
His Val Tyr Glu Asn Leu Phe Lys Trp Ala Lys Ser Ser Pro Leu Asn
625 630 635 640
Asn Ala Asp Val Leu Pro Ser Val Glu Lys Tyr Leu Lys Pro Met Met
645 650 655
Glu Lys Ala Lys Leu
660
<210> 168
<211> 1986
<212> DNA
<213> Caenorhabditis elegans
<400> 168
atggctaata gatctattag agatggtgac aatccagaat tgttagaaga aagaagaatg 60
gcaacattcg atactgataa gatggctgct gttatatatg gttctgaaga attcgctaga 120
agaagaagag aaatcacaga tgcagtttca aagatcccag aattggctga tatcaagcca 180
tacccatttt tgacaagaga agaaaaggtt acagaaggta ctagaaagat ctctatcttg 240
actaagtatt tgaatcaatt gattgataga gataacgaag aagaatcatt gcatttgcat 300
agagaagtta ttggttatga aggtcatcca tttgcattgc atgatgcttt gtttattcca 360
actttgcaat ctcaagcttc agatgaacaa caagaaaaat ggttggaaag agcaagaaga 420
agagaaatta ttggttgtta cgctcaaaca gaattgggtc atggttctaa tttgagaaat 480
ttggaaacta cagcagttta cgatatcgct tcacaagaat tcgttttgca tactccaact 540
acaactgcat taaaatggtg gccaggtgct ttgggtaaat cttgtaatta cgcattagtt 600
gttgctgaat tgattattaa gagaaacaac tacggtccac atttctttat ggttcaattg 660
agagatgaaa agactcatat cccattgaaa ggtgttactg ttggtgacat tggtccaaag 720
atgaacttca acgctgcaga taacggttat ttgggtttaa acaatttgag agttccaaga 780
acaaatttgt tgatgagaca ttgtaaagtt gaagcagatg gtacttacgt taaaccacca 840
catgctaaga tcggttactc tggtatggtt aagatcagat cacaaatggc aatggaacaa 900
ggtttgtttt tagctcatgc attgacaatt gctgcaagat actctgctgt tagaagacaa 960
ggtcatttgg atgataagca agttgaagtt aaggttttgg attaccaaac tcaacaacat 1020
agattgttcc catctttggc tagagcatac gcttttattt ttacaggttt cgaaactatc 1080
catttgtact ctcaattgtt gaaggatgtt gatatgggta acacatcagg catggcagat 1140
ttgcatgctt tgacttcagg tttgaaatct gttgttgctc atgaaacagg tgaaggtatt 1200
gaacaagcaa gaatggcttg tggtggtcat ggttattcta tggcatcata catctctgtt 1260
gtttacggta tcgctattgg tggttgtact tacgaaggtg aaaacatggt tatgttgttg 1320
caattggcaa gatatttggt taagtctgtt gaattgatta aagctggtaa agctaagaaa 1380
ttaggtccag ttgcatctta cttggctgat aagtcagatg aaacagattt gacttcattg 1440
aacggttacg ttaagatgtt cgaaaatatg gctagaagac aagcatggaa ggctacagaa 1500
aagttcttga agttgatgga atctggtgaa tctagagaag ttgcatggaa taagtctgct 1560
gttgaattga caagagcatc aagattgcat actagattgt ttattattga agcttttatg 1620
agaagagttt ctagaatcga agatatccca gttaaggaag ttttgactga tttgttgcat 1680
ttgcatgtta actacgaatt gttggatgtt gcaacatacg ctttggaatt catgtctttt 1740
actcaattgg attacgttag agatcaattg tatttgtact tggaaaagat tagaccaaac 1800
gctgtttcat tagttgattc tttccaaatc tcagatatgc aattaagatc tgttttgggt 1860
agaagagatg gtcatgttta cgaaaatttg tttaaatggg caaagtcttc accattaaac 1920
aacgctgatg ttttgccatc agttgaaaag tatttgaagc caatgatgga aaaagctaaa 1980
ttgtaa 1986
<210> 169
<211> 692
<212> PRT
<213> Arabidopsis thaliana
<400> 169
Met Glu Ser Arg Arg Glu Lys Asn Pro Met Thr Glu Glu Glu Ser Asp
1 5 10 15
Gly Leu Ile Ala Ala Arg Arg Ile Gln Arg Leu Ser Leu His Leu Ser
20 25 30
Pro Ser Leu Thr Pro Ser Pro Ser Leu Pro Leu Val Gln Thr Glu Thr
35 40 45
Cys Ser Ala Arg Ser Lys Lys Leu Asp Val Asn Gly Glu Ala Leu Ser
50 55 60
Leu Tyr Met Arg Gly Lys His Ile Asp Ile Gln Glu Lys Ile Phe Asp
65 70 75 80
Phe Phe Asn Ser Arg Pro Asp Leu Gln Thr Pro Ile Glu Ile Ser Lys
85 90 95
Asp Asp His Arg Glu Leu Cys Met Asn Gln Leu Ile Gly Leu Val Arg
100 105 110
Glu Ala Gly Val Arg Pro Phe Arg Tyr Val Ala Asp Asp Pro Glu Lys
115 120 125
Tyr Phe Ala Ile Met Glu Ala Val Gly Ser Val Asp Met Ser Leu Gly
130 135 140
Ile Lys Met Gly Val Gln Tyr Ser Leu Trp Gly Gly Ser Val Ile Asn
145 150 155 160
Leu Gly Thr Lys Lys His Arg Asp Lys Tyr Phe Asp Gly Ile Asp Asn
165 170 175
Leu Asp Tyr Thr Gly Cys Phe Ala Met Thr Glu Leu His His Gly Ser
180 185 190
Asn Val Gln Gly Leu Gln Thr Thr Ala Thr Phe Asp Pro Leu Lys Asp
195 200 205
Glu Phe Val Ile Asp Thr Pro Asn Asp Gly Ala Ile Lys Trp Trp Ile
210 215 220
Gly Asn Ala Ala Val His Gly Lys Phe Ala Thr Val Phe Ala Arg Leu
225 230 235 240
Ile Leu Pro Thr His Asp Ser Lys Gly Val Ser Asp Met Gly Val His
245 250 255
Ala Phe Ile Val Pro Ile Arg Asp Met Lys Thr His Gln Thr Leu Pro
260 265 270
Gly Val Glu Ile Gln Asp Cys Gly His Lys Val Gly Leu Asn Gly Val
275 280 285
Asp Asn Gly Ala Leu Arg Phe Arg Ser Val Arg Ile Pro Arg Asp Asn
290 295 300
Leu Leu Asn Arg Phe Gly Asp Val Ser Arg Asp Gly Thr Tyr Thr Ser
305 310 315 320
Ser Leu Pro Thr Ile Asn Lys Arg Phe Gly Ala Thr Leu Gly Glu Leu
325 330 335
Val Gly Gly Arg Val Gly Leu Ala Tyr Ala Ser Val Gly Val Leu Lys
340 345 350
Ile Ser Ala Thr Ile Ala Ile Arg Tyr Ser Leu Leu Arg Gln Gln Phe
355 360 365
Gly Pro Pro Lys Gln Pro Glu Val Ser Ile Leu Asp Tyr Gln Ser Gln
370 375 380
Gln His Lys Leu Met Pro Met Leu Ala Ser Thr Tyr Ala Tyr His Phe
385 390 395 400
Ala Thr Val Tyr Leu Val Glu Lys Tyr Ser Glu Met Lys Lys Thr His
405 410 415
Asp Glu Gln Leu Val Ala Asp Val His Ala Leu Ser Ala Gly Leu Lys
420 425 430
Ser Tyr Val Thr Ser Tyr Thr Ala Lys Ala Leu Ser Val Cys Arg Glu
435 440 445
Ala Cys Gly Gly His Gly Tyr Ala Ala Val Asn Arg Phe Gly Ser Leu
450 455 460
Arg Asn Asp His Asp Ile Phe Gln Thr Phe Glu Gly Asp Asn Thr Val
465 470 475 480
Leu Leu Gln Gln Val Ala Ala Asp Leu Leu Lys Arg Tyr Lys Glu Lys
485 490 495
Phe Gln Gly Gly Thr Leu Thr Val Thr Trp Ser Tyr Leu Arg Glu Ser
500 505 510
Met Asn Thr Tyr Leu Ser Gln Pro Asn Pro Val Thr Ala Arg Trp Glu
515 520 525
Gly Glu Asp His Leu Arg Asp Pro Lys Phe Gln Leu Asp Ala Phe Arg
530 535 540
Tyr Arg Thr Ser Arg Leu Leu Gln Asn Val Ala Ala Arg Leu Gln Lys
545 550 555 560
His Ser Lys Thr Leu Gly Gly Phe Gly Ala Trp Asn Arg Cys Leu Asn
565 570 575
His Leu Leu Thr Leu Ala Glu Ser His Ile Glu Thr Val Ile Leu Ala
580 585 590
Lys Phe Ile Glu Ala Val Lys Asn Cys Pro Asp Pro Ser Ala Lys Ala
595 600 605
Ala Leu Lys Leu Ala Cys Asp Leu Tyr Ala Leu Asp Arg Ile Trp Lys
610 615 620
Asp Ile Gly Thr Tyr Arg Asn Val Asp Tyr Val Ala Pro Asn Lys Ala
625 630 635 640
Lys Ala Ile His Lys Leu Thr Glu Tyr Leu Ser Phe Gln Val Arg Asn
645 650 655
Val Ala Lys Glu Leu Val Asp Ala Phe Glu Leu Pro Asp His Val Thr
660 665 670
Arg Ala Pro Ile Ala Met Gln Ser Asp Ala Tyr Ser Gln Tyr Thr Gln
675 680 685
Val Val Gly Phe
690
<210> 170
<211> 2079
<212> DNA
<213> Arabidopsis thaliana
<400> 170
atggaatcta gaagagaaaa gaatccaatg acagaagaag aatcagatgg tttgattgct 60
gcaagaagaa ttcaaagatt gtctttgcat ttgtctccat cattaactcc atctccatca 120
ttaccattgg ttcaaactga aacatgttct gcaagatcta agaaattgga tgttaacggt 180
gaagctttgt cattgtacat gagaggtaaa catatcgata tccaagaaaa gatttttgat 240
ttctttaact ctagaccaga tttgcaaaca ccaatcgaaa tctcaaagga tgatcataga 300
gaattgtgta tgaaccaatt gatcggtttg gttagagaag caggtgttag accttttaga 360
tatgttgctg atgatccaga aaagtacttc gctatcatgg aagcagttgg ttctgttgat 420
atgtcattgg gtattaaaat gggtgttcaa tactctttgt ggggtggttc agttattaat 480
ttgggtacta agaaacatcg tgataagtac ttcgatggta tcgataattt ggattacaca 540
ggttgttttg caatgactga attacatcat ggttctaatg ttcaaggttt gcaaactaca 600
gctacattcg atccattgaa ggatgaattc gttattgata ctccaaatga tggtgctatt 660
aaatggtgga ttggtaatgc tgcagttcat ggtaaattcg ctacagtttt cgcaagattg 720
atcttgccaa ctcatgattc taaaggtgtt tcagatatgg gtgttcatgc ttttattgtt 780
ccaatcagag atatgaagac acatcaaact ttgccaggtg ttgaaattca agattgtggt 840
cataaggttg gtttaaacgg tgttgataat ggtgctttga gattcagatc tgttagaatt 900
ccaagagata atttgttgaa cagattcggt gacgtttcaa gagatggtac ttacacatct 960
tcattgccaa ctattaataa gagattcggt gctactttgg gtgaattggt tggtggtaga 1020
gttggtttag cttatgcatc tgttggtgtt ttgaagatct cagctacaat cgcaatcaga 1080
tactctttgt tgagacaaca atttggtcca ccaaagcaac cagaagtttc tatcttggat 1140
taccaatcac aacaacataa gttgatgcca atgttggctt ctacatacgc ataccatttc 1200
gctactgttt atttggttga aaagtactca gaaatgaaga aaactcatga tgaacaatta 1260
gttgcagatg ttcatgcttt atctgcaggt ttgaagtctt acgttacatc atacactgct 1320
aaggcattgt cagtttgtag agaagcttgt ggtggtcatg gttatgctgc agttaataga 1380
tttggttctt taagaaacga tcatgatatc ttccaaacat tcgaaggtga caacactgtt 1440
ttgttacaac aagttgctgc agatttgttg aagagataca aggaaaagtt ccaaggtggt 1500
actttgacag ttacttggtc ttatttgaga gaatcaatga acacatactt gtctcaacca 1560
aatccagtta ctgcaagatg ggaaggtgaa gatcatttga gagatccaaa gttccaattg 1620
gatgctttta gatacagaac atctagattg ttgcaaaacg ttgctgcaag attgcaaaag 1680
cattcaaaga ctttgggtgg ttttggtgca tggaacagat gtttgaacca tttgttgaca 1740
ttggctgaat ctcatatcga aactgttatt ttggcaaagt ttattgaagc tgttaaaaat 1800
tgtccagatc catcagcaaa agctgcattg aagttggcat gtgatttgta cgctttggat 1860
agaatctgga aggatatcgg tacatacaga aacgttgatt acgttgctcc aaataaggct 1920
aaggcaatcc ataagttgac tgaatacttg tctttccaag ttagaaacgt tgcaaaggaa 1980
ttagttgatg ctttcgaatt gccagatcat gttacaagag ctccaattgc aatgcaatct 2040
gatgcttatt cacaatacac tcaagttgtt ggtttttaa 2079
<210> 171
<211> 700
<212> PRT
<213> Yarrowia lipolytica
<400> 171
Met Asn Pro Asn Asn Thr Gly Thr Ile Glu Ile Asn Gly Lys Glu Tyr
1 5 10 15
Asn Thr Phe Thr Glu Pro Pro Val Ala Met Ala Gln Glu Arg Ala Lys
20 25 30
Thr Ser Phe Pro Val Arg Glu Met Thr Tyr Phe Leu Asp Gly Gly Glu
35 40 45
Lys Asn Thr Leu Lys Asn Glu Gln Ile Met Glu Glu Ile Glu Arg Asp
50 55 60
Pro Leu Phe Asn Asn Asp Asn Tyr Tyr Asp Leu Asn Lys Glu Gln Ile
65 70 75 80
Arg Glu Leu Thr Met Glu Arg Val Ala Lys Leu Ser Leu Phe Val Arg
85 90 95
Asp Gln Pro Glu Asp Asp Ile Lys Lys Arg Phe Ala Leu Ile Gly Ile
100 105 110
Ala Asp Met Gly Thr Tyr Thr Arg Leu Gly Val His Tyr Gly Leu Phe
115 120 125
Phe Gly Ala Val Arg Gly Thr Gly Thr Ala Glu Gln Phe Gly His Trp
130 135 140
Ile Ser Lys Gly Ala Gly Asp Leu Arg Lys Phe Tyr Gly Cys Phe Ser
145 150 155 160
Met Thr Glu Leu Gly His Gly Ser Asn Leu Ala Gly Leu Glu Thr Thr
165 170 175
Ala Ile Tyr Asp Glu Glu Thr Asp Glu Phe Ile Ile Asn Thr Pro His
180 185 190
Ile Ala Ala Thr Lys Trp Trp Ile Gly Gly Ala Ala His Thr Ala Thr
195 200 205
His Thr Val Val Phe Ala Arg Leu Ile Val Lys Gly Lys Asp Tyr Gly
210 215 220
Val Lys Thr Phe Val Val Gln Leu Arg Asn Ile Asn Asp His Ser Leu
225 230 235 240
Lys Val Gly Ile Ser Ile Gly Asp Ile Gly Lys Lys Met Gly Arg Asp
245 250 255
Gly Ile Asp Asn Gly Trp Ile Gln Phe Thr Asn Val Arg Ile Pro Arg
260 265 270
Gln Asn Leu Leu Met Lys Tyr Thr Lys Val Asp Arg Glu Gly Asn Val
275 280 285
Thr Gln Pro Pro Leu Ala Gln Leu Thr Tyr Gly Ser Leu Ile Thr Gly
290 295 300
Arg Val Ser Met Ala Ser Asp Ser His Gln Val Gly Lys Arg Phe Ile
305 310 315 320
Thr Ile Ala Leu Arg Tyr Ala Cys Ile Arg Arg Gln Phe Ser Thr Thr
325 330 335
Pro Gly Gln Pro Glu Thr Lys Ile Ile Asp Tyr Pro Tyr His Gln Arg
340 345 350
Arg Leu Leu Pro Leu Leu Ala Tyr Val Tyr Ala Leu Lys Met Thr Ala
355 360 365
Asp Glu Val Gly Ala Leu Phe Ser Arg Thr Met Leu Lys Met Asp Asp
370 375 380
Leu Lys Pro Asp Asp Lys Ala Gly Leu Asn Glu Val Val Ser Asp Val
385 390 395 400
Lys Glu Leu Phe Ser Val Ser Ala Gly Leu Lys Ala Phe Ser Thr Trp
405 410 415
Ala Cys Ala Asp Val Ile Asp Lys Thr Arg Gln Ala Cys Gly Gly His
420 425 430
Gly Tyr Ser Gly Tyr Asn Gly Phe Gly Gln Ala Tyr Ala Asp Trp Val
435 440 445
Val Gln Cys Thr Trp Glu Gly Asp Asn Asn Ile Leu Thr Leu Ser Ala
450 455 460
Gly Arg Ala Leu Ile Gln Ser Ala Val Ala Leu Arg Lys Gly Glu Pro
465 470 475 480
Val Gly Asn Ala Val Ser Tyr Leu Lys Arg Tyr Lys Asp Leu Ala Asn
485 490 495
Ala Lys Leu Asn Gly Arg Ser Leu Thr Asp Pro Lys Val Leu Val Glu
500 505 510
Ala Trp Glu Val Ala Ala Gly Asn Ile Ile Asn Arg Ala Thr Asp Gln
515 520 525
Tyr Glu Lys Leu Ile Gly Glu Gly Leu Asn Ala Asp Gln Ala Phe Glu
530 535 540
Val Leu Ser Gln Gln Arg Phe Gln Ala Ala Lys Val His Thr Arg Arg
545 550 555 560
His Leu Ile Ala Ala Phe Phe Ser Arg Ile Asp Thr Glu Ala Gly Glu
565 570 575
Ala Ile Lys Gln Pro Leu Leu Asn Leu Ala Leu Leu Phe Ala Leu Trp
580 585 590
Ser Ile Glu Glu Asp Ser Gly Leu Phe Leu Arg Glu Gly Phe Leu Glu
595 600 605
Pro Lys Asp Ile Asp Thr Val Thr Glu Leu Val Asn Lys Tyr Cys Thr
610 615 620
Thr Val Arg Glu Glu Val Ile Gly Tyr Thr Asp Ala Phe Asn Leu Ser
625 630 635 640
Asp Tyr Phe Ile Asn Ala Pro Ile Gly Cys Tyr Asp Gly Asp Ala Tyr
645 650 655
Arg His Tyr Phe Gln Lys Val Asn Glu Gln Asn Pro Ala Arg Asp Pro
660 665 670
Arg Pro Pro Tyr Tyr Ala Ser Thr Leu Lys Pro Phe Leu Phe Arg Glu
675 680 685
Glu Glu Asp Asp Asp Ile Cys Glu Leu Asp Glu Glu
690 695 700
<210> 172
<211> 2103
<212> DNA
<213> Yarrowia lipolytica
<400> 172
atgaatccaa ataatactgg tacaattgaa attaatggta aagaatacaa cacttttaca 60
gaaccaccag ttgctatggc acaagaaaga gctaaaacat ctttcccagt tagagaaatg 120
acttactttt tggatggtgg tgaaaagaat acattgaaaa atgaacaaat catggaagaa 180
atcgaaagag atccattgtt taataacgat aactactacg atttgaataa ggaacaaatt 240
agagaattga ctatggaaag agttgctaag ttgtctttgt tcgttagaga tcaaccagaa 300
gatgatatta agaaaagatt cgctttgatt ggtattgcag atatgggtac ttatacaaga 360
ttaggtgttc attacggttt gtttttcggt gctgttagag gtactggtac agcagaacaa 420
tttggtcatt ggatttcaaa aggtgctggt gacttgagaa agttctacgg ttgtttctct 480
atgacagaat tgggtcatgg ttcaaatttg gctggtttag aaactacagc aatctatgat 540
gaagaaactg atgaattcat tattaataca ccacatattg ctgcaactaa atggtggatt 600
ggtggtgctg cacatactgc tacacatact gttgttttcg caagattgat cgttaagggt 660
aaagattacg gtgttaagac attcgttgtt caattgagaa acattaatga tcattctttg 720
aaagttggta tctcaatcgg tgacatcggt aaaaagatgg gtagagatgg tatcgataac 780
ggttggattc aattcactaa cgttagaatt ccaagacaaa atttgttgat gaagtacaca 840
aaggttgata gagagggtaa cgttactcaa ccaccattgg ctcaattgac atacggttct 900
ttaatcactg gtagagtttc aatggcatct gattcacatc aagttggtaa aagattcatt 960
acaatcgctt tgagatacgc atgtatcaga agacaatttt ctactacacc aggtcaacca 1020
gaaactaaga tcatcgatta cccataccat caaagaagat tgttgccatt gttggcttac 1080
gtttacgcat tgaagatgac agctgatgaa gttggtgcat tgttttcaag aactatgttg 1140
aagatggatg atttgaagcc agatgataag gctggtttga atgaagttgt ttctgatgtt 1200
aaggaattat tttctgtttc agctggtttg aaagcatttt caacatgggc ttgtgcagat 1260
gttattgata aaactagaca agcttgtggt ggtcatggtt attctggtta caatggtttt 1320
ggtcaagctt acgcagattg ggttgttcaa tgtacatggg aaggtgacaa caacatcttg 1380
actttgtctg ctggtagagc attaattcaa tcagctgttg cattgagaaa aggtgaacca 1440
gttggtaacg ctgtttctta tttgaagaga tacaaggatt tggctaacgc aaagttgaac 1500
ggtagatcat tgacagatcc aaaagttttg gttgaagctt gggaagttgc tgctggtaac 1560
attattaaca gagcaactga tcaatatgaa aaattaattg gtgaaggttt gaatgctgat 1620
caagcattcg aagttttgtc tcaacaaaga ttccaagctg caaaagttca tacaagaaga 1680
catttgattg ctgctttctt ttctagaatt gatactgaag ctggtgaagc aattaaacaa 1740
ccattgttga atttggcttt gttgttcgca ttgtggtcta tcgaagaaga ttcaggtttg 1800
tttttaagag aaggtttctt ggaaccaaag gatatcgata cagttactga attggttaat 1860
aagtactgta ctacagttag agaagaagtt attggttaca ctgatgcttt taatttgtct 1920
gattacttca tcaacgctcc aatcggttgt tacgatggtg acgcatacag acattacttc 1980
caaaaggtta acgaacaaaa cccagctaga gatccaagac caccatacta cgcatcaact 2040
ttgaagccat ttttgtttag agaagaagaa gatgatgata tctgtgaatt ggatgaagaa 2100
taa 2103
<210> 173
<211> 724
<212> PRT
<213> Candida tropicalis
<400> 173
Met Ala Met Leu Ser Gln Pro Asn Asp Gly His Asp His Pro Glu Lys
1 5 10 15
Lys Asp Pro Asp Thr Thr Pro Lys Gln Val Ala Gly Val Ile Ser Ser
20 25 30
Gln Asp Pro Pro His Pro Ala Lys Asp Val Ala Glu Glu Arg Ala Arg
35 40 45
Thr Asp Trp Asp Leu Lys Glu Met His Glu Phe Leu Glu Gly Asp Glu
50 55 60
Ala Lys Ser Glu Gln Ile Leu Arg Leu Tyr Gln Ser Ile Glu Arg Asp
65 70 75 80
Pro Ile Leu Gln Thr Arg Pro Glu Gln Phe Asp Tyr Thr Gln Lys Gln
85 90 95
Glu Arg Glu Leu Val Ala Asn Arg Ile Asn Gln Met Thr Lys Phe Leu
100 105 110
Glu Thr Glu Pro Tyr Gly Lys Phe Arg Arg Arg Leu Gln Leu Met Thr
115 120 125
Val Ile Asp Pro Ser Leu Gly Ile Arg Met Leu Val Asn Ile Gly Leu
130 135 140
Phe Leu Asn Cys Val Arg Gly Asn Gly Thr Gln Lys Gln Phe Asp Phe
145 150 155 160
Trp Ser Asn Lys Lys Glu Ala Gly Ile Val Lys Gln Leu Tyr Gly Cys
165 170 175
Phe Gly Met Thr Glu Leu Gly His Gly Ser Asn Val Ala Gly Cys Glu
180 185 190
Thr Thr Ala Thr Phe Asp Glu Lys Thr Asp Glu Phe Ile Ile Asp Thr
195 200 205
Pro His Ile Gly Ala Thr Lys Trp Trp Ile Gly Gly Ala Ala His Ser
210 215 220
Ala Thr His Thr Val Cys Tyr Ala Arg Leu Ile Val Lys Asp Val Asp
225 230 235 240
Tyr Gly Val Lys Thr Phe Ile Val Pro Leu Arg Asp Ser Arg His Ser
245 250 255
Leu Leu Pro Gly Ile Ala Ile Gly Asp Ile Gly Ala Lys Met Gly Arg
260 265 270
Gln Gly Val Asp Asn Gly Trp Ile Gln Phe Thr Glu Val Arg Val Pro
275 280 285
Arg Phe Phe Met Leu Gln Arg Trp Cys Lys Val Asp Arg Gln Gly Asn
290 295 300
Val Thr Leu Pro Pro Leu Glu Gln Leu Ser Tyr Ile Ser Leu Leu Glu
305 310 315 320
Gly Arg Val Gly Met Ala Thr Asp Ser Tyr Arg Ile Gly Ala Arg Tyr
325 330 335
Thr Thr Ile Ala Leu Arg Tyr Ala Val Gly Arg Arg Gln Phe Ser Lys
340 345 350
Lys Ala Gly Glu Pro Glu Thr Lys Leu Ile Asp Tyr Thr Leu His Gln
355 360 365
Arg Arg Leu Leu Pro Tyr Leu Ala Leu Thr Tyr Ala Ala Ala Val Gly
370 375 380
Thr Asp Arg Leu Glu Arg Gln His Glu Glu Leu Leu Ala Asn Leu Asp
385 390 395 400
Ile Ala Leu Ala Lys Lys Asp Lys Leu Leu Leu Lys Asn Thr Ile Thr
405 410 415
Gly Thr Lys Ser Met Phe Val Asp Ser Gly Ser Leu Lys Ser Thr Leu
420 425 430
Thr Trp Leu Ala Ala Asp Leu Ile Asn Glu Thr Arg Gln Ala Cys Gly
435 440 445
Gly His Gly Tyr Ser Ser Tyr Asn Gly Phe Gly Lys Thr Tyr Asp Asp
450 455 460
Trp Val Val Gln Cys Thr Trp Glu Gly Asp Asn Asn Val Leu Ala Met
465 470 475 480
Ser Ala Gly Lys Thr Ile Ile Lys Thr Val Gln Gln Val Leu Asn Gly
485 490 495
Lys Glu Leu Lys Asp Ser Thr Leu Glu Phe Leu Asn Ala Ala Pro Glu
500 505 510
Leu Ser Lys Ala Lys Lys Ala Val Ile Arg Ile Arg Asp His Val Asp
515 520 525
Asp Val Asp Arg Val Leu Lys Ala Ile Ala Gly Leu Ile Ser Lys Phe
530 535 540
Ser Lys Asp Leu Ile Pro Ile Ser Tyr Gln Ser Trp Asp Ser Ile Gly
545 550 555 560
Ala Gln Arg Val Ile Leu Ser Lys Leu Arg Cys His Tyr Tyr Leu Leu
565 570 575
Glu Thr Phe Asn Glu Arg Leu Asn Asp Lys Ile Lys Ala Lys Ser Pro
580 585 590
Ala Arg Pro His Leu Glu Asn Ile Ile Lys Leu Tyr Tyr Val Thr Asn
595 600 605
Ile Leu Gly Pro Phe Ile Asp Glu Phe Leu Arg Phe Gly Val Ile Ser
610 615 620
Pro Gln Val Ala Lys Tyr Ile Thr Tyr Glu Tyr Pro Gln Lys Leu Cys
625 630 635 640
Ala Asn Ile Arg Pro Tyr Val Ile Gly Leu Thr Asp Ser Phe Gln Gln
645 650 655
Pro Asp Asn Phe Ile Asn Ser Leu Ile Gly Lys Tyr Asp Gly Asn Ile
660 665 670
Tyr Thr Asn Tyr Leu Glu Ser Val Lys Asp Val Asn Asp Pro Ser Asn
675 680 685
Tyr Lys Ala Pro Tyr Ser Glu Ala Leu Glu Ala Met Leu Asn Arg Ser
690 695 700
Ala Leu Glu Asn Arg Glu Arg Ser Glu Arg Gly Lys Ala Ala Ala Asp
705 710 715 720
Ile Leu Ser Lys
<210> 174
<211> 2175
<212> DNA
<213> Candida tropicalis
<400> 174
atggcaatgt tgtctcaacc aaatgatggt catgatcatc cagaaaagaa agatccagat 60
actacaccaa aacaagttgc tggtgttatt tcttcacaag atccaccaca tccagctaaa 120
gatgttgcag aagaaagagc tagaactgat tgggatttga aggaaatgca tgaattcttg 180
gaaggtgacg aagcaaaatc agaacaaatc ttgagattgt accaatctat cgaaagagat 240
ccaatcttgc aaacaagacc agaacaattc gattacactc aaaagcaaga aagagaattg 300
gttgctaaca gaattaatca aatgacaaag ttcttggaaa ctgaaccata cggtaaattc 360
agaagaagat tgcaattgat gacagttatt gatccatcat tgggtattag aatgttggtt 420
aacatcggtt tatttttgaa ttgtgttcgt ggtaacggta ctcaaaagca attcgatttc 480
tggtcaaata agaaagaagc tggtatcgtt aagcaattgt acggttgttt tggtatgaca 540
gaattaggtc atggttctaa tgttgcaggt tgtgaaacta cagctacatt cgatgaaaag 600
actgatgaat tcattatcga tacaccacat attggtgcta ctaaatggtg gattggtggt 660
gctgcacatt ctgcaactca tacagtttgt tacgctagat tgatcgttaa ggatgttgat 720
tacggtgtta agacttttat tgttccattg agagattcta gacattcatt gttaccaggt 780
attgcaattg gtgacattgg tgctaaaatg ggtagacaag gtgttgataa tggttggatt 840
caattcactg aagttagagt tccaagattt ttcatgttgc aaagatggtg taaggttgat 900
agacagggta acgttacatt accaccattg gaacaattgt cttacatctc attgttagaa 960
ggtagagttg gtatggcaac tgattcatat agaattggtg ctagatacac tacaattgca 1020
ttgagatatg ctgttggtag aagacaattt tctaagaaag ctggtgaacc agaaacaaag 1080
ttgatcgatt acactttgca tcaaagaaga ttgttgccat atttggcatt gacatacgct 1140
gcagctgttg gtactgatag attggaaaga caacatgaag aattgttggc taatttggat 1200
atcgctttag ctaagaaaga caagttgttg ttgaaaaata ctatcacagg tactaagtca 1260
atgttcgttg attctggttc attgaaatct acattgactt ggttagcagc tgatttgatt 1320
aatgaaacta gacaagcttg tggtggtcat ggttactctt catacaacgg tttcggtaaa 1380
acatacgatg attgggttgt tcaatgtact tgggaaggtg acaataatgt tttggctatg 1440
tctgcaggta aaacaattat taagactgtt caacaagttt tgaatggtaa agaattgaag 1500
gattcaacat tggaattctt gaacgcagct ccagaattgt ctaaggctaa gaaagcagtt 1560
attagaatta gagatcatgt tgatgatgtt gatagagttt tgaaagctat tgcaggttta 1620
atctctaaat tttcaaagga tttgattcca atttcttacc aatcatggga ttctattggt 1680
gctcaaagag ttattttgtc aaaattaaga tgtcattatt acttattaga aacttttaat 1740
gaaagattga acgataagat taaagcaaaa tctccagcta gaccacattt ggaaaacatt 1800
attaagttgt actacgttac aaacatcttg ggtcctttta ttgatgaatt cttgagattc 1860
ggtgttattt ctccacaagt tgcaaagtac atcacatacg aatacccaca aaagttgtgt 1920
gctaacatca gaccatacgt tatcggttta actgattcat tccaacaacc agataacttc 1980
atcaactctt tgatcggtaa atatgatggt aatatctata ctaattactt agaatcagtt 2040
aaggatgtta acgatccatc aaactacaag gcaccatact ctgaagcttt ggaagcaatg 2100
ttgaacagat cagctttgga aaacagagaa agatctgaac gtggtaaagc agctgcagat 2160
attttatcta aataa 2175
<210> 175
<211> 748
<212> PRT
<213> Saccharomyces cerevisiae
<400> 175
Met Thr Arg Arg Thr Thr Ile Asn Pro Asp Ser Val Val Leu Asn Pro
1 5 10 15
Gln Lys Phe Ile Gln Lys Glu Arg Ala Asp Ser Lys Ile Lys Val Asp
20 25 30
Gln Val Asn Thr Phe Leu Glu Ser Ser Pro Glu Arg Arg Thr Leu Thr
35 40 45
His Ala Leu Ile Asp Gln Ile Val Asn Asp Pro Ile Leu Lys Thr Asp
50 55 60
Thr Asp Tyr Tyr Asp Ala Lys Lys Met Gln Glu Arg Glu Ile Thr Ala
65 70 75 80
Lys Lys Ile Ala Arg Leu Ala Ser Tyr Met Glu His Asp Ile Lys Thr
85 90 95
Val Arg Lys His Phe Arg Asp Thr Asp Leu Met Lys Glu Leu Gln Ala
100 105 110
Asn Asp Pro Asp Lys Ala Ser Pro Leu Thr Asn Lys Asp Leu Phe Ile
115 120 125
Phe Asp Lys Arg Leu Ser Leu Val Ala Asn Ile Asp Pro Gln Leu Gly
130 135 140
Thr Arg Val Gly Val His Leu Gly Leu Phe Gly Asn Cys Ile Lys Gly
145 150 155 160
Asn Gly Thr Asp Glu Gln Ile Arg Tyr Trp Leu Gln Glu Arg Gly Ala
165 170 175
Thr Leu Met Lys Gly Ile Tyr Gly Cys Phe Ala Met Thr Glu Leu Gly
180 185 190
His Gly Ser Asn Val Ala Gln Leu Gln Thr Arg Ala Val Tyr Asp Lys
195 200 205
Gln Asn Asp Thr Phe Val Ile Asp Thr Pro Asp Leu Thr Ala Thr Lys
210 215 220
Trp Trp Ile Gly Gly Ala Ala His Ser Ala Thr His Ala Ala Val Tyr
225 230 235 240
Ala Arg Leu Ile Val Glu Gly Lys Asp Tyr Gly Val Lys Thr Phe Val
245 250 255
Val Pro Leu Arg Asp Pro Ser Thr Phe Gln Leu Leu Ala Gly Val Ser
260 265 270
Ile Gly Asp Ile Gly Ala Lys Met Gly Arg Asp Gly Ile Asp Asn Gly
275 280 285
Trp Ile Gln Phe Arg Asn Val Val Ile Pro Arg Glu Phe Met Leu Ser
290 295 300
Arg Phe Thr Lys Val Val Arg Ser Pro Asp Gly Ser Val Thr Val Lys
305 310 315 320
Thr Glu Pro Gln Leu Asp Gln Ile Ser Gly Tyr Ser Ala Leu Leu Ser
325 330 335
Gly Arg Val Asn Met Val Met Asp Ser Phe Arg Phe Gly Ser Lys Phe
340 345 350
Ala Thr Ile Ala Val Arg Tyr Ala Val Gly Arg Gln Gln Phe Ala Pro
355 360 365
Arg Lys Gly Leu Ser Glu Thr Gln Leu Ile Asp Tyr Pro Leu His Gln
370 375 380
Tyr Arg Val Leu Pro Gln Leu Cys Val Pro Tyr Leu Val Ser Pro Val
385 390 395 400
Ala Phe Lys Leu Met Asp Asn Tyr Tyr Ser Thr Leu Asp Glu Leu Tyr
405 410 415
Asn Ala Ser Ser Ser Ala Tyr Lys Ala Ala Leu Val Thr Val Ser Lys
420 425 430
Lys Leu Lys Asn Leu Phe Ile Asp Ser Ala Ser Leu Lys Ala Thr Asn
435 440 445
Thr Trp Leu Ile Ala Thr Leu Ile Asp Glu Leu Arg Gln Thr Cys Gly
450 455 460
Gly His Gly Tyr Ser Gln Tyr Asn Gly Phe Gly Lys Gly Tyr Asp Asp
465 470 475 480
Trp Val Val Gln Cys Thr Trp Glu Gly Asp Asn Asn Val Leu Ser Leu
485 490 495
Thr Ser Ala Lys Ser Ile Leu Lys Lys Phe Ile Asp Ser Ala Thr Lys
500 505 510
Gly Arg Phe Asp Asn Thr Leu Asp Val Asp Ser Phe Ser Tyr Leu Lys
515 520 525
Pro Gln Tyr Ile Gly Ser Val Val Ser Gly Glu Ile Lys Ser Gly Leu
530 535 540
Lys Glu Leu Gly Asp Tyr Thr Glu Ile Trp Ser Ile Thr Leu Ile Lys
545 550 555 560
Leu Leu Ala His Ile Gly Thr Leu Val Glu Lys Ser Arg Ser Ile Asp
565 570 575
Ser Val Ser Lys Leu Leu Val Leu Val Ser Lys Phe His Ala Leu Arg
580 585 590
Cys Met Leu Lys Thr Tyr Tyr Asp Lys Leu Asn Ser Arg Asp Ser His
595 600 605
Ile Ser Asp Glu Ile Thr Lys Glu Ser Met Trp Asn Val Tyr Lys Leu
610 615 620
Phe Ser Leu Tyr Phe Ile Asp Lys His Ser Gly Glu Phe Gln Gln Phe
625 630 635 640
Lys Ile Phe Thr Pro Asp Gln Ile Ser Lys Val Val Gln Pro Gln Leu
645 650 655
Leu Ala Leu Leu Pro Ile Val Arg Lys Asp Cys Ile Gly Leu Thr Asp
660 665 670
Ser Phe Glu Leu Pro Asp Ala Met Leu Asn Ser Pro Ile Gly Tyr Phe
675 680 685
Asp Gly Asp Ile Tyr His Asn Tyr Phe Asn Glu Val Cys Arg Asn Asn
690 695 700
Pro Val Glu Ala Asp Gly Ala Gly Lys Pro Ser Tyr His Ala Leu Leu
705 710 715 720
Ser Ser Met Leu Gly Arg Gly Phe Glu Phe Asp Gln Lys Leu Gly Gly
725 730 735
Ala Ala Asn Ala Glu Ile Leu Ser Lys Ile Asn Lys
740 745
<210> 176
<211> 2247
<212> DNA
<213> Saccharomyces cerevisiae
<400> 176
atgacgagac gtactactat taatcccgat tcggtggttc tgaatcctca aaaatttatc 60
cagaaagaaa gggcggattc gaaaatcaaa gttgaccaag ttaacacatt tttagagtca 120
tccccggaga ggagaactct gacgcacgcc ttaatagacc aaatagtgaa tgatcctata 180
ttgaaaactg atacggacta ttacgatgct aaaaaaatgc aagagagaga aattactgcc 240
aaaaaaatag ctaggcttgc tagttatatg gagcacgata tcaaaacagt gcgcaaacac 300
tttcgcgaca ctgacctgat gaaagagttg caagcaaatg atccagacaa agcttcgcct 360
ttaacaaaca aagacctttt tatattcgat aagagattgt cacttgtagc aaatattgat 420
cctcaattgg gtacgcgcgt gggtgtacac ttggggctat ttggtaattg tatcaagggc 480
aatggtactg atgagcaaat ccggtattgg ttgcaggaga gaggtgccac tttgatgaaa 540
ggtatatatg gctgttttgc aatgactgag ttaggacatg gttccaatgt tgcccagctg 600
cagactaggg ctgtgtacga taagcaaaat gatacttttg taattgatac acctgatcta 660
actgccacca aatggtggat tggtggggct gcccattctg ccacgcacgc tgccgtgtac 720
gccagattga tcgttgaagg taaagactac ggtgtaaaaa cattcgttgt tcctctgaga 780
gacccttcga ctttccaact gttagctggt gtttccatag gggatattgg agcgaagatg 840
ggtcgtgacg gtattgataa tggctggatc cagttcagaa acgtagttat ccctagagaa 900
tttatgctaa gtagatttac caaagttgtc cgttctccag atggttcagt caccgtcaaa 960
actgagccac aattggatca aatttctggt tatagtgcat tgttaagtgg tagagttaac 1020
atggtcatgg attcatttag gtttggctcc aaatttgcta ctattgctgt acgttacgcg 1080
gttggtcgtc agcaattcgc acctagaaag ggattgtctg aaacacaatt aatcgactat 1140
ccccttcacc aatatcgtgt tttaccacaa ttgtgtgttc catatttggt gtcacctgta 1200
gcttttaagt taatggacaa ctattattcc actttggacg agttatacaa cgcttcctca 1260
tctgcataca aagctgctct ggttaccgtg agtaaaaagt tgaagaattt atttattgat 1320
agcgccagct tgaaagccac caatacttgg ttaattgcta cactgattga tgagttgaga 1380
cagacttgcg gaggacatgg gtattcacag tataacggat ttggtaaagg ctatgacgac 1440
tgggtggttc agtgcacatg ggagggtgat aataatgttt tatctttaac ttcagcaaaa 1500
tcaatattga aaaaatttat cgattcagcc acaaagggta gatttgacaa cacactggat 1560
gtggactcat tctcttactt aaaacctcag tacataggat ctgtggtttc tggagaaata 1620
aagagtggtt taaaggagtt gggtgattat actgaaattt ggtctatcac cttaatcaaa 1680
ttactggcac atattggtac tttagttgaa aaatcaagaa gtattgatag cgtttctaag 1740
cttttagtct tagtatccaa atttcatgcc ttgcgctgca tgttgaaaac ctattacgac 1800
aagttaaact ctcgtgattc acatatttcc gatgaaatta caaaggaatc tatgtggaat 1860
gtttataagt tattttcctt gtattttatt gacaagcatt ccggagaatt ccaacaattc 1920
aagatcttca ctcctgatca gatctctaaa gttgtgcagc cacaactatt ggctcttttg 1980
ccaattgtga ggaaagactg tataggtctg acagactcct ttgaattacc tgacgcgatg 2040
ttaaattctc ctataggtta ctttgatggc gatatctatc acaattactt caatgaagtt 2100
tgccgcaata atccagtgga ggcagatggg gcagggaagc cttcttatca tgcgctgttg 2160
agcagcatgc tcggtagagg tttcgaattt gaccaaaagt taggtggtgc agctaatgcg 2220
gaaattttat cgaaaataaa caagtga 2247
<210> 177
<211> 736
<212> PRT
<213> Homo sapiens
<400> 177
Met Gly Ser Pro Leu Arg Phe Asp Gly Arg Val Val Leu Val Thr Gly
1 5 10 15
Ala Gly Ala Gly Leu Gly Arg Ala Tyr Ala Leu Ala Phe Ala Glu Arg
20 25 30
Gly Ala Leu Val Val Val Asn Asp Leu Gly Gly Asp Phe Lys Gly Val
35 40 45
Gly Lys Gly Ser Leu Ala Ala Asp Lys Val Val Glu Glu Ile Arg Arg
50 55 60
Arg Gly Gly Lys Ala Val Ala Asn Tyr Asp Ser Val Glu Glu Gly Glu
65 70 75 80
Lys Val Val Lys Thr Ala Leu Asp Ala Phe Gly Arg Ile Asp Val Val
85 90 95
Val Asn Asn Ala Gly Ile Leu Arg Asp Arg Ser Phe Ala Arg Ile Ser
100 105 110
Asp Glu Asp Trp Asp Ile Ile His Arg Val His Leu Arg Gly Ser Phe
115 120 125
Gln Val Thr Arg Ala Ala Trp Glu His Met Lys Lys Gln Lys Tyr Gly
130 135 140
Arg Ile Ile Met Thr Ser Ser Ala Ser Gly Ile Tyr Gly Asn Phe Gly
145 150 155 160
Gln Ala Asn Tyr Ser Ala Ala Lys Leu Gly Leu Leu Gly Leu Ala Asn
165 170 175
Ser Leu Ala Ile Glu Gly Arg Lys Ser Asn Ile His Cys Asn Thr Ile
180 185 190
Ala Pro Asn Ala Gly Ser Arg Met Thr Gln Thr Val Met Pro Glu Asp
195 200 205
Leu Val Glu Ala Leu Lys Pro Glu Tyr Val Ala Pro Leu Val Leu Trp
210 215 220
Leu Cys His Glu Ser Cys Glu Glu Asn Gly Gly Leu Phe Glu Val Gly
225 230 235 240
Ala Gly Trp Ile Gly Lys Leu Arg Trp Glu Arg Thr Leu Gly Ala Ile
245 250 255
Val Arg Gln Lys Asn His Pro Met Thr Pro Glu Ala Val Lys Ala Asn
260 265 270
Trp Lys Lys Ile Cys Asp Phe Glu Asn Ala Ser Lys Pro Gln Ser Ile
275 280 285
Gln Glu Ser Thr Gly Ser Ile Ile Glu Val Leu Ser Lys Ile Asp Ser
290 295 300
Glu Gly Gly Val Ser Ala Asn His Thr Ser Arg Ala Thr Ser Thr Ala
305 310 315 320
Thr Ser Gly Phe Ala Gly Ala Ile Gly Gln Lys Leu Pro Pro Phe Ser
325 330 335
Tyr Ala Tyr Thr Glu Leu Glu Ala Ile Met Tyr Ala Leu Gly Val Gly
340 345 350
Ala Ser Ile Lys Asp Pro Lys Asp Leu Lys Phe Ile Tyr Glu Gly Ser
355 360 365
Ser Asp Phe Ser Cys Leu Pro Thr Phe Gly Val Ile Ile Gly Gln Lys
370 375 380
Ser Met Met Gly Gly Gly Leu Ala Glu Ile Pro Gly Leu Ser Ile Asn
385 390 395 400
Phe Ala Lys Val Leu His Gly Glu Gln Tyr Leu Glu Leu Tyr Lys Pro
405 410 415
Leu Pro Arg Ala Gly Lys Leu Lys Cys Glu Ala Val Val Ala Asp Val
420 425 430
Leu Asp Lys Gly Ser Gly Val Val Ile Ile Met Asp Val Tyr Ser Tyr
435 440 445
Ser Glu Lys Glu Leu Ile Cys His Asn Gln Phe Ser Leu Phe Leu Val
450 455 460
Gly Ser Gly Gly Phe Gly Gly Lys Arg Thr Ser Asp Lys Val Lys Val
465 470 475 480
Ala Val Ala Ile Pro Asn Arg Pro Pro Asp Ala Val Leu Thr Asp Thr
485 490 495
Thr Ser Leu Asn Gln Ala Ala Leu Tyr Arg Leu Ser Gly Asp Trp Asn
500 505 510
Pro Leu His Ile Asp Pro Asn Phe Ala Ser Leu Ala Gly Phe Asp Lys
515 520 525
Pro Ile Leu His Gly Leu Cys Thr Phe Gly Phe Ser Ala Arg Arg Val
530 535 540
Leu Gln Gln Phe Ala Asp Asn Asp Val Ser Arg Phe Lys Ala Ile Lys
545 550 555 560
Ala Arg Phe Ala Lys Pro Val Tyr Pro Gly Gln Thr Leu Gln Thr Glu
565 570 575
Met Trp Lys Glu Gly Asn Arg Ile His Phe Gln Thr Lys Val Gln Glu
580 585 590
Thr Gly Asp Ile Val Ile Ser Asn Ala Tyr Val Asp Leu Ala Pro Thr
595 600 605
Ser Gly Thr Ser Ala Lys Thr Pro Ser Glu Gly Gly Lys Leu Gln Ser
610 615 620
Thr Phe Val Phe Glu Glu Ile Gly Arg Arg Leu Lys Asp Ile Gly Pro
625 630 635 640
Glu Val Val Lys Lys Val Asn Ala Val Phe Glu Trp His Ile Thr Lys
645 650 655
Gly Gly Asn Ile Gly Ala Lys Trp Thr Ile Asp Leu Lys Ser Gly Ser
660 665 670
Gly Lys Val Tyr Gln Gly Pro Ala Lys Gly Ala Ala Asp Thr Thr Ile
675 680 685
Ile Leu Ser Asp Glu Asp Phe Met Glu Val Val Leu Gly Lys Leu Asp
690 695 700
Pro Gln Lys Ala Phe Phe Ser Gly Arg Leu Lys Ala Arg Gly Asn Ile
705 710 715 720
Met Leu Ser Gln Lys Leu Gln Met Ile Leu Lys Asp Tyr Ala Lys Leu
725 730 735
<210> 178
<211> 2211
<212> DNA
<213> Homo sapiens
<400> 178
atgggttctc cattgagatt tgatggtaga gttgttttag ttacaggtgc tggtgcaggt 60
ttgggtagag cttatgcatt agcttttgca gaaagaggtg ctttggttgt tgttaatgat 120
ttgggtggtg actttaaagg tgttggtaaa ggttctttgg ctgcagataa ggttgttgaa 180
gaaatcagaa gaagaggtgg taaagctgtt gcaaattacg attcagttga agaaggtgaa 240
aaagttgtta aaactgcttt ggatgcattc ggtagaatcg atgttgttgt taacaacgca 300
ggtattttaa gagatagatc attcgctaga atctctgatg aagattggga tatcatccat 360
agagttcatt tgagaggttc ttttcaagtt acaagagctg catgggaaca tatgaagaaa 420
caaaagtacg gtagaatcat tatgacttct tcagcatcag gtatctatgg taacttcggt 480
caagctaact actctgctgc aaagttgggt ttgttgggtt tagcaaactc attggctatc 540
gaaggtagaa agtctaacat ccattgtaac acaattgctc caaatgcagg ttctagaatg 600
actcaaacag ttatgccaga agatttggtt gaagcattga aaccagaata cgttgctcca 660
ttggttttat ggttgtgtca tgaatcatgt gaagaaaatg gtggtttgtt tgaagttggt 720
gcaggttgga ttggtaaatt gagatgggaa agaacattag gtgctattgt tagacaaaag 780
aatcatccaa tgactccaga agctgttaag gcaaactgga agaaaatttg tgatttcgaa 840
aacgcttcta agccacaatc aatccaagaa tctacaggtt caatcatcga agttttgtct 900
aagatcgatt cagaaggtgg tgtttctgct aatcatactt ctagagcaac ttcaacagct 960
acttctggtt ttgctggtgc aatcggtcaa aagttaccac cattttctta cgcatacaca 1020
gaattggaag caattatgta tgctttaggt gttggtgctt caattaaaga tccaaaggat 1080
ttgaagttta tatatgaagg ttcttcagat ttctcatgtt tgccaacttt cggtgttatt 1140
atcggtcaaa aatctatgat gggtggtggt ttggcagaaa ttccaggttt atcaattaat 1200
ttcgctaagg ttttgcatgg tgaacaatat ttggaattgt acaagccatt gccaagagct 1260
ggtaaattaa aatgtgaagc tgttgttgca gatgttttgg ataaaggttc tggtgttgtt 1320
attattatgg atgtttattc ttactcagaa aaggaattga tctgtcataa ccaattttca 1380
ttatttttgg ttggttctgg tggtttcggt ggtaaaagaa catcagataa ggttaaggtt 1440
gctgttgcaa ttccaaatag accaccagat gctgttttga ctgatactac atcattgaac 1500
caagctgcat tgtacagatt gtctggtgac tggaatccat tgcatatcga tccaaacttc 1560
gcttctttgg caggtttcga taagccaatc ttgcatggtt tgtgtacttt cggtttttct 1620
gcaagaagag ttttgcaaca attcgctgat aacgatgttt caagattcaa agctattaaa 1680
gcaagattcg ctaagccagt ttatccaggt caaacattac aaactgaaat gtggaaggag 1740
ggtaacagaa ttcatttcca aacaaaggtt caagaaactg gtgacatcgt tatctcaaac 1800
gcatacgttg atttggctcc aacatctggt acttcagcta aaacaccatc agaaggtggt 1860
aaattgcaat ctactttcgt tttcgaagaa atcggtagaa gattgaagga tatcggtcca 1920
gaagttgtta agaaagttaa cgcagttttc gaatggcata tcacaaaagg tggtaacatc 1980
ggtgctaagt ggactatcga tttgaaatct ggttcaggta aagtttatca aggtccagct 2040
aaaggtgctg cagatactac aatcatcttg tctgatgaag atttcatgga agttgttttg 2100
ggtaaattag atccacaaaa ggctttcttt tctggtagat tgaaggctcg tggtaacatc 2160
atgttatctc aaaaattgca aatgatttta aaagattacg ctaaattata a 2211
<210> 179
<211> 735
<212> PRT
<213> Mus musculus
<400> 179
Met Ala Ser Pro Leu Arg Phe Asp Gly Arg Val Val Leu Val Thr Gly
1 5 10 15
Ala Gly Gly Gly Leu Gly Arg Ala Tyr Ala Leu Ala Phe Ala Glu Arg
20 25 30
Gly Ala Leu Val Ile Val Asn Asp Leu Gly Gly Asp Phe Lys Gly Ile
35 40 45
Gly Lys Gly Ser Ser Ala Ala Asp Lys Val Val Ala Glu Ile Arg Arg
50 55 60
Lys Gly Gly Lys Ala Val Ala Asn Tyr Asp Ser Val Glu Ala Gly Glu
65 70 75 80
Lys Leu Val Lys Thr Ala Leu Asp Thr Phe Gly Arg Ile Asp Val Val
85 90 95
Val Asn Asn Ala Gly Ile Leu Arg Asp Arg Ser Phe Ser Arg Ile Ser
100 105 110
Asp Glu Asp Trp Asp Ile Ile His Arg Val His Leu Arg Gly Ser Phe
115 120 125
Gln Val Thr Arg Ala Ala Trp Asp His Met Lys Lys Gln Asn Tyr Gly
130 135 140
Arg Ile Leu Met Thr Ser Ser Ala Ser Gly Ile Tyr Gly Asn Phe Gly
145 150 155 160
Gln Ala Asn Tyr Ser Ala Ala Lys Leu Gly Ile Leu Gly Leu Cys Asn
165 170 175
Thr Leu Ala Ile Glu Gly Arg Lys Asn Asn Ile His Cys Asn Thr Ile
180 185 190
Ala Pro Asn Ala Gly Ser Arg Met Thr Glu Thr Val Leu Pro Glu Asp
195 200 205
Leu Val Glu Ala Leu Lys Pro Glu Tyr Val Ala Pro Leu Val Leu Trp
210 215 220
Leu Cys His Glu Ser Cys Glu Glu Asn Gly Gly Leu Phe Glu Val Gly
225 230 235 240
Ala Gly Trp Ile Gly Lys Leu Arg Trp Glu Arg Thr Leu Gly Ala Ile
245 250 255
Val Arg Lys Arg Asn Gln Pro Met Thr Pro Glu Ala Val Arg Asp Asn
260 265 270
Trp Glu Lys Ile Cys Asp Phe Ser Asn Ala Ser Lys Pro Gln Thr Ile
275 280 285
Gln Glu Ser Thr Gly Gly Ile Val Glu Val Leu His Lys Val Asp Ser
290 295 300
Glu Gly Ile Ser Pro Asn Arg Thr Ser His Ala Ala Pro Ala Ala Thr
305 310 315 320
Ser Gly Phe Val Gly Ala Val Gly His Lys Leu Pro Ser Phe Ser Ser
325 330 335
Ser Tyr Thr Glu Leu Gln Ser Ile Met Tyr Ala Leu Gly Val Gly Ala
340 345 350
Ser Val Lys Asn Pro Lys Asp Leu Lys Phe Val Tyr Glu Gly Ser Ala
355 360 365
Asp Phe Ser Cys Leu Pro Thr Phe Gly Val Ile Val Ala Gln Lys Ser
370 375 380
Met Met Asn Gly Gly Leu Ala Glu Val Pro Gly Leu Ser Phe Asn Phe
385 390 395 400
Ala Lys Ala Leu His Gly Glu Gln Tyr Leu Glu Leu Tyr Lys Pro Leu
405 410 415
Pro Arg Ser Gly Glu Leu Lys Cys Glu Ala Val Ile Ala Asp Ile Leu
420 425 430
Asp Lys Gly Ser Gly Val Val Ile Val Met Asp Val Tyr Ser Tyr Ser
435 440 445
Gly Lys Glu Leu Ile Cys Tyr Asn Gln Phe Ser Val Phe Val Val Gly
450 455 460
Ser Gly Gly Phe Gly Gly Lys Arg Thr Ser Glu Lys Leu Lys Ala Ala
465 470 475 480
Val Ala Val Pro Asn Arg Pro Pro Asp Ala Val Leu Arg Asp Ala Thr
485 490 495
Ser Leu Asn Gln Ala Ala Leu Tyr Arg Leu Ser Gly Asp Trp Asn Pro
500 505 510
Leu His Ile Asp Pro Asp Phe Ala Ser Val Ala Gly Phe Glu Lys Pro
515 520 525
Ile Leu His Gly Leu Cys Thr Phe Gly Phe Ser Ala Arg His Val Leu
530 535 540
Gln Gln Phe Ala Asp Asn Asp Val Ser Arg Phe Lys Ala Ile Lys Val
545 550 555 560
Arg Phe Ala Lys Pro Val Tyr Pro Gly Gln Thr Leu Gln Thr Glu Met
565 570 575
Trp Lys Glu Gly Asn Arg Ile His Phe Gln Thr Lys Val His Glu Thr
580 585 590
Gly Asp Val Val Ile Ser Asn Ala Tyr Val Asp Leu Val Pro Ala Ser
595 600 605
Gly Val Ser Thr Gln Thr Pro Ser Glu Gly Gly Glu Leu Gln Ser Ala
610 615 620
Leu Val Phe Gly Glu Ile Gly Arg Arg Leu Lys Ser Val Gly Arg Glu
625 630 635 640
Val Val Lys Lys Ala Asn Ala Val Phe Glu Trp His Ile Thr Lys Gly
645 650 655
Gly Thr Val Ala Ala Lys Trp Thr Ile Asp Leu Lys Ser Gly Ser Gly
660 665 670
Glu Val Tyr Gln Gly Pro Ala Lys Gly Ser Ala Asp Val Thr Ile Ile
675 680 685
Ile Ser Asp Glu Asp Phe Met Glu Val Val Phe Gly Lys Leu Asp Pro
690 695 700
Gln Lys Ala Phe Phe Ser Gly Arg Leu Lys Ala Arg Gly Asn Ile Met
705 710 715 720
Leu Ser Gln Lys Leu Gln Met Ile Leu Lys Asp Tyr Ala Lys Leu
725 730 735
<210> 180
<211> 2208
<212> DNA
<213> Mus musculus
<400> 180
atggcttctc cattaagatt tgatggtaga gttgttttgg ttactggtgc tggtggtggt 60
ttgggtagag cttatgcatt ggcttttgca gaaagaggtg ctttagttat tgttaacgat 120
ttgggtggtg actttaaagg tattggtaaa ggttcttcag ctgcagataa ggttgttgca 180
gaaatcagaa gaaaaggtgg taaagctgtt gcaaattacg attctgttga agctggtgaa 240
aaattagtta agactgcatt ggatacattc ggtagaatcg atgttgttgt taacaacgct 300
ggtattttaa gagatagatc attttctaga atctctgatg aagattggga tatcatccat 360
agagttcatt tgagaggttc atttcaagtt actagagctg catgggatca tatgaagaaa 420
caaaactacg gtagaatttt aatgacatct tcagcttctg gtatctatgg taacttcggt 480
caagcaaact actcagctgc aaagttgggt attttgggtt tatgtaacac tttggctatc 540
gaaggtagaa agaataacat ccattgtaac acaattgctc caaatgcagg ttctagaatg 600
actgaaacag ttttaccaga agatttggtt gaagctttaa aaccagaata cgttgcacca 660
ttggttttat ggttgtgtca tgaatcatgt gaagaaaacg gtggtttgtt tgaagttggt 720
gctggttgga ttggtaaatt aagatgggaa agaactttgg gtgctatcgt tagaaagaga 780
aaccaaccaa tgacaccaga agcagttaga gataactggg aaaagatttg tgatttctca 840
aacgcttcta agccacaaac tattcaagaa tctacaggtg gtatcgttga agttttgcat 900
aaggttgatt cagaaggtat ctctccaaat agaacttcac atgctgcacc agctgcaaca 960
tctggttttg ttggtgctgt tggtcataag ttgccatcat tttcttcatc ttacactgaa 1020
ttgcaatcta tcatgtacgc tttgggtgtt ggtgcatcag ttaaaaatcc aaaggatttg 1080
aagttcgttt acgaaggttc agctgatttc tcttgtttgc caacattcgg tgttattgtt 1140
gctcaaaaat ctatgatgaa tggtggttta gcagaagttc caggtttgtc ttttaatttc 1200
gctaaggcat tgcatggtga acaatatttg gaattgtaca agccattgcc aagatctggt 1260
gaattgaagt gtgaagctgt tattgcagat atcttggata agggttcagg tgttgttatt 1320
gttatggatg tttactcata ctctggtaaa gaattgatct gttacaacca attttcagtt 1380
tttgttgttg gttctggtgg tttcggtggt aaaagaactt ctgaaaagtt aaaagctgca 1440
gttgctgttc caaatagacc accagatgct gttttaagag atgcaacatc tttgaatcaa 1500
gctgcattat acagattgtc tggtgactgg aatccattgc atattgatcc agattttgct 1560
tctgttgcag gtttcgaaaa gccaatcttg catggtttgt gtacattcgg tttttctgct 1620
agacatgttt tgcaacaatt cgcagataac gatgtttcaa gattcaaagc tattaaagtt 1680
agattcgcaa agccagttta tccaggtcaa actttgcaaa cagaaatgtg gaaggagggt 1740
aacagaattc atttccaaac taaggttcat gaaacaggtg acgttgttat ttctaatgct 1800
tacgttgatt tggttccagc atcaggtgtt tctactcaaa caccatcaga aggtggtgaa 1860
ttgcaatctg ctttagtttt cggtgaaatc ggtagaagat tgaagtcagt tggtagagaa 1920
gttgttaaga aagctaacgc agttttcgaa tggcatatca ctaaaggtgg tacagttgct 1980
gcaaaatgga ctattgattt gaaatcaggt tctggtgaag tttatcaagg tccagctaag 2040
ggttcagcag atgttacaat catcatctct gatgaagatt tcatggaagt tgttttcggt 2100
aaattagatc cacaaaaggc tttcttttct ggtagattga aggcacgtgg taacatcatg 2160
ttgtcacaaa aattacaaat gattttgaaa gattacgcta aattgtaa 2208
<210> 181
<211> 751
<212> PRT
<213> Rattus norvegicus
<400> 181
Met Ala Ser Pro Leu Arg Phe Asp Gly Arg Val Val Leu Val Thr Gly
1 5 10 15
Ala Gly Gly Gly Leu Gly Arg Ala Tyr Ala Leu Ala Phe Ala Glu Arg
20 25 30
Gly Ala Leu Val Val Val Asn Asp Leu Gly Gly Asp Phe Lys Gly Val
35 40 45
Gly Lys Gly Ser Ser Ala Ala Asp Lys Val Val Glu Glu Ile Arg Arg
50 55 60
Arg Gly Gly Lys Ala Val Ala Asn Tyr Asp Ser Val Glu Ala Gly Glu
65 70 75 80
Lys Leu Val Lys Thr Ala Leu Asp Thr Phe Gly Arg Ile Asp Val Val
85 90 95
Val Asn Asn Ala Gly Ile Leu Arg Asp Arg Ser Phe Ser Arg Ile Ser
100 105 110
Asp Glu Asp Trp Asp Ile Ile Gln Arg Val His Leu Arg Gly Ser Phe
115 120 125
Gln Val Thr Arg Ala Ala Trp Asp His Met Lys Lys Gln Asn Tyr Gly
130 135 140
Arg Ile Ile Met Thr Ala Ser Ala Ser Gly Ile Tyr Gly Asn Phe Gly
145 150 155 160
Gln Ala Asn Tyr Ser Ala Ala Lys Leu Gly Leu Leu Gly Leu Ala Asn
165 170 175
Thr Leu Val Ile Glu Gly Arg Lys Asn Asn Ile His Cys Asn Thr Ile
180 185 190
Ala Pro Asn Ala Gly Ser Arg Met Thr Glu Thr Val Met Pro Glu Asp
195 200 205
Leu Val Glu Ala Leu Lys Pro Glu Tyr Val Ala Pro Leu Val Leu Trp
210 215 220
Leu Cys His Glu Ser Cys Glu Glu Asn Gly Gly Leu Phe Glu Val Gly
225 230 235 240
Ala Gly Trp Ile Gly Lys Leu Arg Trp Glu Arg Thr Leu Gly Ala Ile
245 250 255
Val Arg Lys Arg Asn Gln Pro Met Thr Pro Glu Ala Val Arg Asp Asn
260 265 270
Trp Val Lys Ile Cys Asp Phe Ser Asn Ala Ser Lys Pro Lys Ser Ile
275 280 285
Gln Glu Ser Thr Gly Gly Ile Ile Glu Val Leu His Lys Ile Asp Ser
290 295 300
Glu Gly Ile Ser Gln Asn His Thr Gly Gln Val Ala Ser Ala Asp Ala
305 310 315 320
Ser Gly Phe Ala Gly Val Val Gly His Lys Leu Pro Ser Phe Ser Ser
325 330 335
Ser Tyr Thr Glu Leu Gln Cys Ile Met Tyr Ala Leu Gly Val Gly Ala
340 345 350
Ser Val Lys Asn Pro Lys Asp Leu Lys Phe Val Tyr Glu Gly Ser Ala
355 360 365
Asp Phe Ser Cys Leu Pro Thr Phe Gly Val Ile Val Ala Gln Lys Ser
370 375 380
Leu Met Ser Gly Gly Leu Ala Glu Val Pro Gly Leu Ser Ile Asn Phe
385 390 395 400
Ala Lys Val Leu His Gly Glu Gln Tyr Leu Glu Leu Tyr Lys Pro Leu
405 410 415
Pro Arg Ser Gly Glu Leu Lys Cys Glu Ala Val Ile Ala Asp Ile Leu
420 425 430
Asp Lys Gly Ser Gly Ile Val Ile Val Met Asp Val Tyr Ser Tyr Ser
435 440 445
Gly Lys Glu Leu Ile Cys Tyr Asn Gln Phe Ser Val Phe Val Val Gly
450 455 460
Ser Gly Gly Phe Gly Gly Lys Arg Thr Ser Glu Lys Leu Lys Ala Ala
465 470 475 480
Val Ala Val Pro Ser Arg Pro Pro Asp Ala Val Leu Arg Asp Thr Thr
485 490 495
Ser Leu Asn Gln Ala Ala Leu Tyr Arg Leu Ser Gly Asp Ser Asn Pro
500 505 510
Leu His Ile Asp Pro Ser Phe Ala Ser Ile Ala Gly Phe Glu Lys Pro
515 520 525
Ile Leu His Gly Leu Cys Thr Phe Gly Phe Ser Ala Arg His Val Leu
530 535 540
Gln Gln Phe Ala Asp Asn Asp Val Ser Arg Phe Lys Ala Ile Lys Val
545 550 555 560
Arg Phe Ala Lys Pro Val Tyr Pro Gly Gln Thr Leu Gln Thr Glu Met
565 570 575
Trp Lys Glu Gly Asn Arg Ile His Phe Gln Thr Lys Val Gln Glu Thr
580 585 590
Gly Asp Ile Val Ile Ser Asn Ala Tyr Val Asp Leu Val Pro Thr Ser
595 600 605
Gly Val Ser Ala Gln Thr Pro Ser Glu Gly Gly Ala Leu Gln Ser Ala
610 615 620
Leu Val Phe Gly Glu Ile Gly Arg Arg Leu Lys Asp Val Gly Arg Glu
625 630 635 640
Val Val Lys Lys Val Asn Ala Val Phe Glu Trp His Ile Thr Lys Asn
645 650 655
Gly Asn Val Ala Ala Lys Trp Met Glu Leu Thr Ile Ser Phe Ser Val
660 665 670
Ser Ser Leu Leu Pro Ala Asn Ala Ile Asp Leu Lys Asn Gly Ser Gly
675 680 685
Glu Val Tyr Gln Gly Pro Ala Lys Gly Ser Ala Asp Thr Thr Ile Thr
690 695 700
Ile Ser Asp Glu Asp Phe Met Glu Val Val Leu Gly Lys Leu Asn Pro
705 710 715 720
Gln Asn Ala Phe Phe Ser Gly Arg Leu Lys Ala Arg Gly Asn Ile Met
725 730 735
Leu Ser Gln Lys Leu Gln Met Ile Leu Lys Asp Tyr Ala Lys Leu
740 745 750
<210> 182
<211> 2256
<212> DNA
<213> Rattus norvegicus
<400> 182
atggcttcac cattgagatt tgatggtaga gttgttttag ttactggtgc tggtggtggt 60
ttgggtagag cttatgcatt agcttttgca gaaagaggtg ctttggttgt tgttaatgat 120
ttgggtggtg actttaaagg tgttggtaaa ggttcttcag ctgcagataa ggttgttgaa 180
gaaatcagaa gaagaggtgg taaagctgtt gcaaattacg attctgttga agcaggtgaa 240
aaattggtta agactgcttt ggatacattc ggtagaatcg atgttgttgt taacaacgct 300
ggtattttga gagatagatc attttctaga atctcagatg aagattggga tatcatccaa 360
agagttcatt tgagaggttc ttttcaagtt actagagctg catgggatca tatgaagaaa 420
caaaactacg gtagaatcat tatgacagct tcagcatctg gtatctatgg taacttcggt 480
caagctaact actctgctgc aaaattgggt ttgttgggtt tagcaaacac tttggttatt 540
gaaggtagaa agaataacat ccattgtaac acaattgcac caaatgctgg ttcaagaatg 600
actgaaacag ttatgccaga agatttggtt gaagcattga aaccagaata cgttgctcca 660
ttggttttat ggttgtgtca tgaatcttgt gaagaaaacg gtggtttatt tgaagttggt 720
gctggttgga ttggtaaatt gagatgggaa agaactttgg gtgctatcgt tagaaagaga 780
aaccaaccaa tgacaccaga agcagttaga gataactggg ttaagatctg tgatttctca 840
aacgcttcta agccaaagtc aatccaagaa tctactggtg gtatcatcga agttttgcat 900
aagatcgatt cagaaggtat ctctcaaaat catacaggtc aagttgcttc agcagatgct 960
tctggttttg ctggtgttgt tggtcataag ttgccatcat tttcttcatc ttacactgaa 1020
ttgcaatgta tcatgtacgc attaggtgtt ggtgcttctg ttaaaaatcc aaaggatttg 1080
aagttcgttt acgaaggttc agctgatttc tcttgtttgc caacattcgg tgttattgtt 1140
gcacaaaaat cattgatgtc tggtggttta gctgaagttc caggtttgtc tattaatttc 1200
gcaaaggttt tgcatggtga acaatatttg gaattgtaca agccattgcc aagatctggt 1260
gaattgaagt gtgaagcagt tattgctgat atcttggata agggttctgg tatcgttatc 1320
gttatggatg tttactcata ctctggtaaa gaattgatct gttacaacca attttcagtt 1380
ttcgttgttg gttctggtgg tttcggtggt aaaagaactt cagaaaagtt gaaagctgca 1440
gttgcagttc catctagacc accagatgct gttttgagag atactacatc tttgaaccaa 1500
gctgcattgt acagattgtc tggtgactct aacccattgc atatcgatcc atcattcgca 1560
tctatcgctg gtttcgaaaa gccaatcttg catggtttgt gtacattcgg tttttcagca 1620
agacatgttt tgcaacaatt cgctgataac gatgtttcta gattcaaagc aattaaagtt 1680
agattcgcta agccagttta tccaggtcaa actttacaaa cagaaatgtg gaaggagggt 1740
aacagaattc atttccaaac taaggttcaa gaaacaggtg acatcgttat ctctaacgct 1800
tacgttgatt tggttccaac ttcaggtgtt tctgcacaaa caccatcaga aggtggtgca 1860
ttacaatctg ctttggtttt cggtgaaatc ggtagaagat tgaaggatgt tggtagagaa 1920
gttgttaaga aagttaacgc tgttttcgaa tggcatatca ctaaaaatgg taacgttgct 1980
gcaaagtgga tggaattgac aatctcattt tctgtttcat ctttgttgcc agcaaacgct 2040
atcgatttga aaaatggttc tggtgaagtt tatcaaggtc cagcaaaagg ttcagctgat 2100
actacaatca caatctctga tgaagatttc atggaagttg ttttgggtaa attgaaccca 2160
caaaacgctt tcttttctgg tagattgaag gctcgtggta acatcatgtt atctcaaaaa 2220
ttgcaaatga ttttaaaaga ttacgctaaa ttataa 2256
<210> 183
<211> 736
<212> PRT
<213> Bos taurus
<400> 183
Met Ala Ser Thr Leu Arg Phe Asn Gly Arg Val Val Leu Val Thr Gly
1 5 10 15
Ala Gly Gly Gly Leu Gly Arg Ala Tyr Ala Leu Ala Phe Ala Glu Arg
20 25 30
Gly Ala Ser Val Val Val Asn Asp Leu Gly Gly Asp Phe Thr Gly Val
35 40 45
Gly Lys Gly Ser Leu Ala Ala Asp Lys Val Val Glu Glu Ile Arg Arg
50 55 60
Lys Gly Gly Lys Ala Val Ala Asn Tyr Asp Ser Val Glu Glu Gly Glu
65 70 75 80
Lys Ile Val Lys Thr Ala Leu Asp Ala Phe Gly Arg Ile Asp Ile Val
85 90 95
Ile Asn Asn Ala Gly Ile Leu Arg Asp Arg Ser Phe Ser Arg Ile Ser
100 105 110
Asp Glu Asp Trp Asp Lys Ile Gln Arg Val His Leu Arg Gly Ser Phe
115 120 125
Leu Val Thr Arg Ala Ala Trp Asp His Met Lys Lys Gln Lys Phe Gly
130 135 140
Arg Ile Ile Met Thr Ser Ser Ala Ser Gly Ile Tyr Gly Asn Phe Gly
145 150 155 160
Gln Ala Asn Tyr Cys Ala Ala Lys Leu Gly Leu Leu Gly Leu Ser Asn
165 170 175
Cys Leu Ala Val Glu Gly Lys Lys Asn Asn Ile His Cys Asn Thr Ile
180 185 190
Ala Pro Thr Ala Gly Ser Arg Met Thr Gln Ser Ile Leu Pro Glu Asp
195 200 205
Leu Val Glu Ala Leu Lys Pro Asp Tyr Val Ala Pro Leu Val Leu Trp
210 215 220
Leu Cys His Glu Ser Cys Glu Glu Asn Gly Gly Leu Phe Glu Val Gly
225 230 235 240
Ala Gly Trp Ile Gly Lys Leu Arg Trp Glu Arg Ser Leu Gly Ala Leu
245 250 255
Val Arg Gln Arg Thr Gln Pro Met Thr Pro Glu Ala Val Lys Ala Asn
260 265 270
Trp Thr Lys Ile Cys Asp Phe Asp Asn Ala Thr Lys Pro Lys Ser Ile
275 280 285
Gln Glu Ser Ile Gly Ser Ile Val Glu Ala Leu Asn Lys Ile Asn Ser
290 295 300
Gly Gly Glu Val Ser Ala Asn Pro Thr Ser Arg Ala Thr Ser Ala Thr
305 310 315 320
Thr Ser Glu Phe Ala Arg Ala Ile Gly His Lys Phe Pro Pro Leu Tyr
325 330 335
Ser Ser Tyr Ala Glu Leu Asp Thr Ile Met Tyr Ala Leu Gly Val Gly
340 345 350
Ala Ser Ile Lys Glu Pro Lys Asp Met Lys Phe Ile Tyr Glu Gly Ser
355 360 365
Ser Asp Phe Ser Cys Leu Pro Thr Phe Gly Val Ile Leu Ala Gln Lys
370 375 380
Ser Ile Met Asn Gly Gly Leu Ala Glu Ile Pro Gly Leu Ser Ile Asn
385 390 395 400
Leu Ala Lys Ile Leu His Gly Glu Gln Tyr Leu Glu Leu His Lys Pro
405 410 415
Ile Pro Arg Ala Gly Lys Leu Arg Cys Glu Ala Ile Val Ala Asp Ile
420 425 430
Leu Asp Lys Gly Ser Gly Leu Val Ile Leu Val Asp Val Tyr Thr Tyr
435 440 445
Ser Gly Glu Glu Leu Ile Cys Tyr Asn Gln Phe Ser Ile Phe Val Val
450 455 460
Gly Ser Gly Gly Ser Gly Gly Lys Arg Thr Ser Asp Lys Ala Lys Ala
465 470 475 480
Ala Val Ala Ile Pro Asn Arg Pro Pro Asp Ala Val Leu Thr Asp Thr
485 490 495
Thr Ser Leu Asn Gln Ala Ala Leu Tyr Arg Leu Ser Gly Asp Trp Asn
500 505 510
Pro Leu His Ile Asp Pro Asn Phe Ala Ser Leu Ala Gly Phe Asp Lys
515 520 525
Pro Ile Leu His Gly Leu Cys Thr Phe Gly Phe Ser Ala Arg His Val
530 535 540
Leu Gln Gln Phe Ala Asp Asn Asp Val Ser Arg Phe Lys Ala Ile Lys
545 550 555 560
Val Arg Phe Ala Lys Pro Val Tyr Pro Gly Gln Thr Leu Gln Thr Glu
565 570 575
Met Trp Lys Glu Gly Asn Arg Ile His Phe Gln Thr Lys Val Gln Glu
580 585 590
Thr Gly Gly Ile Val Ile Ser Asn Ala Tyr Val Asp Leu Val Pro Ala
595 600 605
Ser Ala Ile Ser Ala Lys Thr Pro Ser Glu Gly Ala Gly Leu Gln Ser
610 615 620
Thr Leu Val Phe Glu Glu Ile Gly Arg Arg Leu Gln Gly Ile Gly Glu
625 630 635 640
Glu Val Val Lys Lys Val Arg Ala Val Phe Glu Trp His Ile Thr Lys
645 650 655
Gly Glu Asn Thr Ala Ala Lys Trp Thr Ile Asp Leu Lys Thr Gly Ser
660 665 670
Gly Lys Val Tyr Gln Gly Pro Ala Lys Gly Ser Ala Asp Val Thr Ile
675 680 685
Thr Leu Ser Asp Glu Asp Phe Met Glu Val Val Leu Gly Lys Leu Asp
690 695 700
Pro Gln Lys Ala Val Phe Ser Gly Arg Leu Lys Ala Arg Gly Asn Ile
705 710 715 720
Leu Leu Ser Gln Lys Leu Gln Met Ile Leu Lys Asp Tyr Ala Lys Leu
725 730 735
<210> 184
<211> 2211
<212> DNA
<213> Cattle (Bos taurus)
<400> 184
atggcatcta ctttgagatt caatggtaga gttgttttag ttacaggtgc tggtggtggt 60
ttgggtagag cttatgcatt ggcttttgca gaaagaggtg cttctgttgt tgttaatgat 120
ttgggtggtg actttactgg tgttggtaaa ggttcattag ctgcagataa ggttgttgaa 180
gaaatcagaa gaaaaggtgg taaagctgtt gcaaattacg attctgttga agaaggtgaa 240
aagattgtta aaacagcttt ggatgcattc ggtagaatcg atatcgttat taataatgct 300
ggtattttaa gagatagatc tttttcaaga atctctgatg aagattggga taagatccaa 360
agagttcatt tgagaggttc atttttggtt actagagctg catgggatca tatgaagaaa 420
caaaagttcg gtagaatcat tatgacatct tcagcttctg gtatctatgg taacttcggt 480
caagcaaact actgtgctgc aaagttgggt ttgttgggtt tatcaaactg tttggctgtt 540
gaaggtaaaa agaataacat ccattgtaac actattgctc caacagcagg ttcaagaatg 600
actcaatcta tcttgccaga agatttggtt gaagctttaa aaccagatta tgttgcacca 660
ttggttttat ggttgtgtca tgaatcttgt gaagaaaacg gtggtttgtt tgaagttggt 720
gcaggttgga tcggtaaatt gagatgggaa agatcattag gtgctttggt tagacaaaga 780
actcaaccaa tgacaccaga agctgttaag gcaaactgga ctaagatctg tgatttcgat 840
aatgctacaa agccaaagtc tatccaagaa tctatcggtt caatcgttga agcattgaat 900
aagattaatt caggtggtga agtttctgct aatccaactt ctagagctac atcagcaact 960
acatctgaat ttgctagagc aatcggtcat aagtttccac cattatactc ttcatacgct 1020
gaattggata ctattatgta tgctttaggt gttggtgcat caattaaaga accaaaggat 1080
atgaagttta tatatgaagg ttcttcagat ttttcatgtt tgccaacatt tggtgttatt 1140
ttggctcaaa aatctatcat gaatggtggt ttggcagaaa ttccaggttt gtctattaat 1200
ttggctaaga tcttgcatgg tgaacaatat ttggaattgc ataagccaat cccaagagct 1260
ggtaaattga gatgtgaagc tatcgttgca gatatcttgg ataagggttc tggtttagtt 1320
attttggttg atgtttacac ttactcaggt gaagaattga tctgttacaa ccaattttct 1380
atctttgttg ttggttctgg tggttcaggt ggtaaaagaa cttcagataa agctaaagct 1440
gcagttgcaa ttccaaatag accaccagat gctgttttga cagatactac atcattgaac 1500
caagctgcat tgtacagatt gtctggtgac tggaatccat tgcatatcga tccaaacttc 1560
gcttctttgg caggtttcga taagccaatc ttgcatggtt tgtgtacatt cggtttttct 1620
gctagacatg ttttgcaaca attcgcagat aacgatgttt caagattcaa agctattaaa 1680
gttagattcg caaagccagt ttatccaggt caaactttac aaacagaaat gtggaaggag 1740
ggtaacagaa ttcatttcca aactaaggtt caagaaacag gtggtatcgt tatctctaac 1800
gcatacgttg atttggttcc agcttctgca atttcagcta aaactccatc agaaggtgca 1860
ggtttacaat ctacattggt tttcgaagaa atcggtagaa gattgcaagg tattggtgaa 1920
gaagttgtta agaaagttag agctgttttc gaatggcata tcactaaggg tgaaaataca 1980
gctgcaaaat ggactattga tttgaaaaca ggttctggta aagtttatca aggtccagct 2040
aaaggttcag cagatgttac tatcacattg tctgatgaag atttcatgga agttgttttg 2100
ggtaaattag atccacaaaa agctgttttc tctggtagat tgaaggcacg tggtaacatc 2160
ttgttatctc aaaaattaca aatgattttg aaagattacg ctaaattata a 2211
<210> 185
<211> 736
<212> PRT
<213> Tasmanian devil (Sarcophilus harrisii)
<400> 185
Met Asp Gly Gln Leu Arg Phe Asp Gly Arg Val Val Leu Val Thr Gly
1 5 10 15
Ala Gly Gly Gly Leu Gly Arg Ala Tyr Ala Leu Ala Phe Ala Glu Arg
20 25 30
Gly Ala Ser Val Val Val Asn Asp Leu Gly Gly Asp Phe Lys Gly Ala
35 40 45
Gly Lys Ser Ser Ser Pro Ala Asn Asn Val Val Glu Glu Ile Arg Lys
50 55 60
Lys Gly Gly Lys Ala Val Ala Asn Tyr Asp Ser Val Glu Ala Gly Glu
65 70 75 80
Lys Val Val Lys Thr Ala Leu Glu Ala Phe Gly Lys Ile Asp Ile Val
85 90 95
Ile Asn Asn Ala Gly Ile Leu Arg Asp Arg Ser Phe Val Arg Ile Ser
100 105 110
Asp Glu Asp Trp Asp Val Ile His Lys Val His Leu Arg Gly Ser Phe
115 120 125
Gln Val Thr Arg Ala Ala Trp Asp His Met Lys Lys Gln Lys Phe Gly
130 135 140
Arg Ile Ile Met Thr Ser Ser Ala Ser Gly Ile Tyr Gly Asn Phe Gly
145 150 155 160
Gln Ala Asn Tyr Ser Ala Ala Lys Leu Gly Leu Leu Gly Leu Ser Asn
165 170 175
Thr Leu Ala Ile Glu Gly Arg Lys Phe Asn Ile His Cys Asn Thr Ile
180 185 190
Ala Pro Thr Ala Gly Ser Arg Met Thr Lys Thr Ile Leu Pro Pro Asp
195 200 205
Leu Leu Asp Ser Leu Lys Pro Asp Tyr Val Ala Pro Leu Val Leu Trp
210 215 220
Leu Cys His Glu Ser Cys Glu Glu Asn Gly Gly Leu Phe Glu Val Gly
225 230 235 240
Ala Gly Trp Ile Gly Lys Leu Arg Trp Glu Arg Thr Leu Gly Ala Ile
245 250 255
Val Arg Gln Lys Asn Gln Pro Met Thr Pro Glu Ala Val Lys Ala Asn
260 265 270
Trp Arg Lys Ile Cys Asp Phe Asp Asn Ala Ser Lys Pro Gln Thr Ile
275 280 285
Gln Glu Ser Thr Ala Gly Val Ile Glu Val Leu Ser Lys Ile Asp Ser
290 295 300
Gln Gly Glu Ile Ser Met Asn His Thr Ser His Ala Ala Ser Ala Thr
305 310 315 320
Thr Ser Asp Phe Thr Arg Ala Ile Gly Tyr Lys Leu Pro Gln Arg Thr
325 330 335
Phe Ser Tyr Thr Glu Leu Glu Ala Ile Met Tyr Ala Leu Gly Val Gly
340 345 350
Ala Ser Val Lys His Pro Glu Asn Leu Lys Phe Val Tyr Glu Gly Ser
355 360 365
Ser Asp Phe Ser Cys Leu Pro Thr Phe Gly Val Ile Pro Ala Gln Lys
370 375 380
Cys Met Met Glu Gly Gly Leu Ser Glu Val Pro Gly Leu Asn Ile Asp
385 390 395 400
Phe Ala Lys Val Leu His Gly Glu Gln Tyr Leu Glu Leu Tyr Lys Pro
405 410 415
Leu Pro Arg Thr Gly Gln Leu Thr Asn Glu Ser Ile Ile Val Asp Ile
420 425 430
Leu Asp Lys Gly Ser Gly Leu Val Ile Leu Leu Asp Val Tyr Ser Tyr
435 440 445
Ser Gly Lys Glu Leu Ile Cys Phe Asn Gln Phe Ser Val Phe Val Val
450 455 460
Gly Ser Gly Gly Phe Gly Gly Lys Lys Thr Ser Asn Lys Ala Lys Val
465 470 475 480
Thr Val Pro Pro Pro Lys Arg Ser Pro Asp Ala Val Leu Val Asp Thr
485 490 495
Thr Ser Leu Asn Gln Ala Val Leu Tyr Arg Leu Ser Gly Asp Trp Asn
500 505 510
Pro Leu His Ile Asp Pro Ser Phe Ala Ser Leu Gly Gly Phe Asp Lys
515 520 525
Pro Ile Leu His Gly Leu Cys Ser Phe Gly Phe Ser Ala Arg His Val
530 535 540
Leu Gln Gln Phe Gly Asn Asn Asp Val Ser Arg Phe Lys Ala Ile Lys
545 550 555 560
Ala Arg Phe Ala Lys Pro Val Tyr Pro Gly Gln Thr Leu Leu Thr Glu
565 570 575
Met Trp Lys Glu Gly Asn Arg Ile His Phe Gln Thr Lys Val Gln Glu
580 585 590
Thr Gly Asp Ile Val Leu Ser Asn Ala Tyr Val Asp Leu Val Pro Thr
595 600 605
Ser Asp Phe Ser Ala Thr Val Ser Ser Lys Asp Gly Val Leu Gln Ser
610 615 620
Thr Leu Val Phe Glu Glu Ile Gly Arg Arg Ile Lys Asp Leu Gly Lys
625 630 635 640
Glu Leu Val Lys Lys Val Asn Ala Val Phe Glu Trp Asn Ile Thr Lys
645 650 655
Gln Gly Gln Thr Ala Ala Gln Trp Thr Ile Asp Leu Lys Asn Gly Ser
660 665 670
Gly Glu Leu Tyr Gln Gly Pro Ala Arg Gly Ser Ala Asp Thr Ala Phe
675 680 685
Thr Leu Ser Asp Glu Asp Phe Met Glu Val Val Leu Gly Lys Leu Asn
690 695 700
Pro Gln Lys Ala Phe Phe Ser Gly Lys Leu Arg Val Lys Gly Asn Ile
705 710 715 720
Met Leu Ser Gln Lys Leu Glu Met Ile Leu Lys Asp Tyr Ala Lys Leu
725 730 735
<210> 186
<211> 2211
<212> DNA
<213> Tasmanian devil (Sarcophilus harrisii)
<400> 186
atggatggtc aattgagatt tgatggtaga gttgttttag ttacaggtgc tggtggtggt 60
ttgggtagag cttatgcatt agcttttgca gaaagaggtg cttcagttgt tgttaatgat 120
ttgggtggtg actttaaagg tgctggtaaa tcttcatctc cagctaacaa cgttgttgaa 180
gaaatcagaa agaaaggtgg taaagctgtt gcaaattacg attctgttga agcaggtgaa 240
aaagttgtta aaactgcttt ggaagcattc ggtaaaatcg atatcgttat taataatgct 300
ggtattttaa gagatagatc tttcgttaga atctcagatg aagattggga tgttatccat 360
aaggttcatt tgagaggttc atttcaagtt acaagagctg catgggatca tatgaagaaa 420
caaaagttcg gtagaatcat tatgacttca tctgcatctg gtatctatgg taacttcggt 480
caagctaact actctgctgc aaagttgggt ttgttgggtt tatcaaacac tttggctatc 540
gaaggtagaa agtttaatat ccattgtaac actattgctc caacagcagg ttcaagaatg 600
actaaaacaa ttttgccacc agatttgttg gattctttga agccagatta cgttgctcca 660
ttggttttat ggttgtgtca tgaatcttgt gaagaaaatg gtggtttatt tgaagttggt 720
gcaggttgga ttggtaaatt gagatgggaa agaacattag gtgctattgt tagacaaaag 780
aatcaaccaa tgactccaga agctgttaag gcaaactgga gaaagatctg tgatttcgat 840
aacgcatcta agccacaaac tattcaagaa tcaacagctg gtgttattga agttttgtca 900
aagatcgatt ctcaaggtga aatctctatg aaccatacat cacatgctgc atctgcaact 960
acatcagatt tcactagagc tatcggttac aagttgccac aaagaacttt ttcatacaca 1020
gaattggaag caatcatgta cgctttaggt gttggtgctt ctgttaagca tccagaaaat 1080
ttgaagttcg tttacgaagg ttcatctgat ttctcatgtt tgccaacttt cggtgttatt 1140
ccagcacaaa aatgtatgat ggaaggtggt ttgtctgaag ttccaggttt aaacatcgat 1200
ttcgctaagg ttttgcatgg tgaacaatat ttggaattgt acaagccatt gccaagaact 1260
ggtcaattga caaacgaatc tatcatcgtt gatatcttgg ataagggttc aggtttagtt 1320
attttgttgg atgtttactc atactctggt aaagaattga tctgtttcaa ccaattttct 1380
gtttttgttg ttggttcagg tggtttcggt ggtaaaaaga cttctaataa ggctaaggtt 1440
actgttccac caccaaaaag atctccagat gcagttttgg ttgatactac atcattgaac 1500
caagctgtct tgtacagatt gtctggtgac tggaatccat tgcatattga tccatcattt 1560
gcatctttag gtggtttcga taagccaatc ttgcatggtt tgtgttcttt cggtttttca 1620
gctagacatg ttttgcaaca attcggtaac aacgatgttt ctagattcaa agctattaaa 1680
gcaagattcg ctaagccagt ttatccaggt caaactttgt taacagaaat gtggaaggag 1740
ggtaacagaa ttcatttcca aactaaggtt caagaaacag gtgacattgt tttgtcaaat 1800
gcatacgttg atttggttcc aacatcagat ttttctgcta ctgtttcatc taaggatggt 1860
gttttgcaat ctactttggt tttcgaagaa atcggtagaa gaattaaaga tttgggtaaa 1920
gaattggtta agaaagttaa cgctgttttc gaatggaaca tcactaaaca aggtcaaaca 1980
gctgcacaat ggactatcga tttgaaaaat ggttctggtg aattatatca aggtccagct 2040
agaggttctg cagatactgc ttttacattg tcagatgaag atttcatgga agttgttttg 2100
ggtaaattga acccacaaaa ggctttcttt tctggtaaat tgagagttaa aggtaatatt 2160
atgttatctc aaaaattgga aatgatttta aaagattacg ctaaattata a 2211
<210> 187
<211> 725
<212> PRT
<213> Zebrafish (Danio rerio)
<400> 187
Met Ser Val Pro Leu Arg Phe Asp Gly Lys Val Val Leu Val Thr Gly
1 5 10 15
Ala Gly Gly Gly Leu Gly Arg Glu Tyr Ala Leu Ala Phe Gly Gln Arg
20 25 30
Gly Ala Ala Val Ile Val Asn Asp Leu Gly Gly Asp Ile Lys Gly Gly
35 40 45
Gly Lys Ser Ser Ala Ala Ala Asp Lys Val Val Glu Glu Ile Arg Ala
50 55 60
Ala Gly Gly Lys Ala Val Ala Asn Tyr Asp Ser Val Glu Asp Gly Glu
65 70 75 80
Lys Leu Ile Gln Thr Ala Leu Asp Ala Phe Gly Arg Ile Asp Val Val
85 90 95
Val Asn Asn Ala Gly Ile Leu Arg Asp Arg Ser Phe Ala Arg Thr Ser
100 105 110
Asp Val Asp Trp Asp Leu Ile Gln Arg Val His Leu Arg Gly Ser Phe
115 120 125
Leu Val Thr Arg Ala Ala Trp Asn His Met Lys Gln Gln Lys Phe Gly
130 135 140
Arg Ile Ile Met Thr Ser Ser Ala Ala Gly Ile Tyr Gly Asn Phe Gly
145 150 155 160
Gln Ala Asn Tyr Ser Ala Ala Lys Leu Gly Leu Leu Gly Leu Ala Asn
165 170 175
Thr Leu Ala Ile Glu Gly Gln Lys Tyr Asn Ile His Cys Asn Thr Ile
180 185 190
Ala Pro Thr Ala Gly Ser Arg Leu Thr Glu Thr Val Met Pro Pro Asp
195 200 205
Leu Val Gln Ser Leu Lys Ala Glu Tyr Val Ala Pro Leu Val Leu Trp
210 215 220
Leu Cys His Glu Ser Cys Gln Glu Asn Ser Gly Leu Phe Glu Val Gly
225 230 235 240
Ala Gly Trp Ile Gly Lys Leu Arg Trp Glu Arg Ser Leu Gly Arg Ile
245 250 255
Val Arg Gln Lys Ser Glu Cys Val Thr Pro Glu Ala Val Arg Asp Ala
260 265 270
Trp Arg Asp Ile Cys Asp Phe Thr Asn Ala Thr Lys Pro Ala Ser Ile
275 280 285
Gln Glu Ser Leu Gln Thr Leu Val Glu Val Leu Ser Arg Val Glu Asp
290 295 300
Glu Arg Lys Ile Gly Ala Asn Pro Thr Ala Val Ala Thr Asn Pro Ala
305 310 315 320
Gln Ala Ile Gly His Val Leu Pro Asp Met Thr Phe Thr His Thr His
325 330 335
Met Asn Cys Ile Leu Tyr Ala Leu Gly Val Gly Val Ser Ser Arg Asp
340 345 350
Pro Gln Gln Leu Gln Phe Leu Tyr Glu Gly His Thr His Phe Ser Cys
355 360 365
Leu Pro Thr Phe Gly Val Ile Pro Ala Gln Gly Ala Leu Leu Gly Leu
370 375 380
Gly Ser Ile Pro Gly Leu Asp Ile Asp Phe Thr Arg Leu Leu His Gly
385 390 395 400
Glu Gln Tyr Leu Glu Leu Tyr Lys Pro Leu Pro Thr Ser Gly Thr Leu
405 410 415
Thr Ser Arg Ala Thr Val Ala Asp Val Leu Asp Lys Gly Ser Gly Met
420 425 430
Leu Ile Leu Leu Asp Val His Thr Tyr Ser Glu Gln Glu Leu Leu Cys
435 440 445
Tyr Asn Gln Phe Ser Val Phe Ile Val Gly Ser Gly Gly Phe Gly Gly
450 455 460
Lys Arg Val Ser Gln Lys Ala Val Ala Pro Ala Ala Pro Pro Asp Arg
465 470 475 480
Pro Ala Asp Ala Val Val Val Glu Glu Thr Ser Arg Asp Gln Ala Ala
485 490 495
Leu Tyr Arg Leu Ser Gly Asp Trp Asn Pro Leu His Ile Asp Pro Asn
500 505 510
Phe Ala Ala Met Gly Gly Phe Gln Ser Pro Ile Leu His Gly Leu Cys
515 520 525
Ser Phe Gly Phe Ala Ala Arg His Val Leu Lys Gln Phe Ala Gly Asn
530 535 540
Asp Val Ser Arg Phe Lys Ala Met Lys Val Arg Phe Val Lys Pro Val
545 550 555 560
Tyr Pro Gly Gln Ser Leu Gln Thr Glu Met Trp Lys Glu Asn Ser Arg
565 570 575
Val His Ile Gln Cys Thr Val Lys Glu Ser Gly Ala Val Val Leu Ser
580 585 590
Gly Ala Tyr Ile Asp Leu His Pro Ala Ala Ser Val Asn Thr Gly Pro
595 600 605
Pro Gln Thr Glu Leu Gln Ser Asp Leu Val Phe Ala Glu Ile Glu Arg
610 615 620
Arg Ile Lys Asp Ser Gly Glu Glu Leu Val Lys Lys Val Asn Ala Val
625 630 635 640
Phe Gly Trp Glu Ile Thr Thr Asp Gly Glu Thr Arg Arg His Trp Thr
645 650 655
Val Asp Leu Lys Thr Gly Arg Gly Ser Val Gln Arg Ala Ala Ala Lys
660 665 670
Ala Asp Val Thr Phe Thr Val Ser Asp Gln Asp Phe Met Glu Val Val
675 680 685
Met Gly Lys Leu Asn Pro Gln Lys Ala Phe Phe Ala Gly Lys Leu Lys
690 695 700
Val Lys Gly Asn Ile Met Leu Ser Gln Lys Leu Glu Ala Val Leu Lys
705 710 715 720
Asp Gln Ala Arg Leu
725
<210> 188
<211> 2211
<212> DNA
<213> Zebrafish (Danio rerio)
<400> 188
atggatggtc aattgagatt tgatggtaga gttgttttag ttacaggtgc tggtggtggt 60
ttgggtagag cttatgcatt agcttttgca gaaagaggtg cttcagttgt tgttaatgat 120
ttgggtggtg actttaaagg tgctggtaaa tcttcatctc cagctaacaa cgttgttgaa 180
gaaatcagaa agaaaggtgg taaagctgtt gcaaattacg attctgttga agcaggtgaa 240
aaagttgtta aaactgcttt ggaagcattc ggtaaaatcg atatcgttat taataatgct 300
ggtattttaa gagatagatc tttcgttaga atctcagatg aagattggga tgttatccat 360
aaggttcatt tgagaggttc atttcaagtt acaagagctg catgggatca tatgaagaaa 420
caaaagttcg gtagaatcat tatgacttca tctgcatctg gtatctatgg taacttcggt 480
caagctaact actctgctgc aaagttgggt ttgttgggtt tatcaaacac tttggctatc 540
gaaggtagaa agtttaatat ccattgtaac actattgctc caacagcagg ttcaagaatg 600
actaaaacaa ttttgccacc agatttgttg gattctttga agccagatta cgttgctcca 660
ttggttttat ggttgtgtca tgaatcttgt gaagaaaatg gtggtttatt tgaagttggt 720
gcaggttgga ttggtaaatt gagatgggaa agaacattag gtgctattgt tagacaaaag 780
aatcaaccaa tgactccaga agctgttaag gcaaactgga gaaagatctg tgatttcgat 840
aacgcatcta agccacaaac tattcaagaa tcaacagctg gtgttattga agttttgtca 900
aagatcgatt ctcaaggtga aatctctatg aaccatacat cacatgctgc atctgcaact 960
acatcagatt tcactagagc tatcggttac aagttgccac aaagaacttt ttcatacaca 1020
gaattggaag caatcatgta cgctttaggt gttggtgctt ctgttaagca tccagaaaat 1080
ttgaagttcg tttacgaagg ttcatctgat ttctcatgtt tgccaacttt cggtgttatt 1140
ccagcacaaa aatgtatgat ggaaggtggt ttgtctgaag ttccaggttt aaacatcgat 1200
ttcgctaagg ttttgcatgg tgaacaatat ttggaattgt acaagccatt gccaagaact 1260
ggtcaattga caaacgaatc tatcatcgtt gatatcttgg ataagggttc aggtttagtt 1320
attttgttgg atgtttactc atactctggt aaagaattga tctgtttcaa ccaattttct 1380
gtttttgttg ttggttcagg tggtttcggt ggtaaaaaga cttctaataa ggctaaggtt 1440
actgttccac caccaaaaag atctccagat gcagttttgg ttgatactac atcattgaac 1500
caagctgtct tgtacagatt gtctggtgac tggaatccat tgcatattga tccatcattt 1560
gcatctttag gtggtttcga taagccaatc ttgcatggtt tgtgttcttt cggtttttca 1620
gctagacatg ttttgcaaca attcggtaac aacgatgttt ctagattcaa agctattaaa 1680
gcaagattcg ctaagccagt ttatccaggt caaactttgt taacagaaat gtggaaggag 1740
ggtaacagaa ttcatttcca aactaaggtt caagaaacag gtgacattgt tttgtcaaat 1800
gcatacgttg atttggttcc aacatcagat ttttctgcta ctgtttcatc taaggatggt 1860
gttttgcaat ctactttggt tttcgaagaa atcggtagaa gaattaaaga tttgggtaaa 1920
gaattggtta agaaagttaa cgctgttttc gaatggaaca tcactaaaca aggtcaaaca 1980
gctgcacaat ggactatcga tttgaaaaat ggttctggtg aattatatca aggtccagct 2040
agaggttctg cagatactgc ttttacattg tcagatgaag atttcatgga agttgttttg 2100
ggtaaattga acccacaaaa ggctttcttt tctggtaaat tgagagttaa aggtaatatt 2160
atgttatctc aaaaattgga aatgatttta aaagattacg ctaaattata a 2211
<210> 189
<211> 741
<212> PRT
<213> African clawed frog (Xenopus laevis)
<400> 189
Met Asp Ser Gln Val Leu Arg Phe Asp Gly Arg Val Val Leu Val Thr
1 5 10 15
Gly Ala Gly Gly Gly Leu Gly Arg Thr Tyr Ala Leu Ala Phe Ala Glu
20 25 30
Arg Gly Ala Ser Val Val Val Asn Asp Leu Gly Gly Asp Ile Lys Gly
35 40 45
Glu Gly Lys Ser Ser Phe Ala Ala Asp Lys Val Val Glu Glu Ile Arg
50 55 60
Ala Lys Gly Gly Lys Ala Val Ala Asn Tyr Asp Ser Val Glu Ala Gly
65 70 75 80
Glu Lys Leu Val Gln Ser Ala Leu Asp Ala Phe Gly Arg Ile Asp Ile
85 90 95
Ile Ile Asn Asn Ala Gly Ile Leu Arg Asp Arg Ser Phe Ala Arg Ile
100 105 110
Ser Asp Ala Asp Trp Asp Ile Ile His Arg Val His Leu Lys Gly Ser
115 120 125
Phe Leu Ile Thr Arg Ala Ala Trp Asn His Met Lys Asn Gln Lys Phe
130 135 140
Gly Arg Ile Ile Met Thr Ser Ser Ala Ala Gly Ile Tyr Gly Asn Phe
145 150 155 160
Gly Gln Ala Asn Tyr Ser Ala Ala Lys Leu Gly Leu Val Gly Leu Ser
165 170 175
Asn Thr Leu Ala Ile Glu Gly Thr Lys Tyr Asn Ile Gln Ser Asn Cys
180 185 190
Ile Ala Pro Thr Ala Gly Ser Arg Leu Thr Gln Thr Val Met Pro Gln
195 200 205
Asp Leu Leu Asp Ala Leu Lys Pro Glu Tyr Val Thr Pro Leu Val Leu
210 215 220
Trp Leu Cys His Glu Arg Cys Gln Glu Thr Gly Ser Leu Phe Glu Val
225 230 235 240
Gly Ala Gly Trp Val Gly Lys Leu Arg Trp Glu Arg Ser Leu Gly Ala
245 250 255
Ile Ile Arg Gln Thr Asn Arg Pro Met Thr Pro Glu Ala Val Arg Asp
260 265 270
Glu Trp Ala Lys Ile Cys Asp Phe Asp Asn Ala Asp Lys Pro Gln Thr
275 280 285
Ile Gln Asp Ser Ile Asn Pro Leu Tyr Gln Val Leu Ser Gln Val Asp
290 295 300
Ser Glu Lys Gly Val Ser Met Asn Pro Thr Ser His Gly Thr Ser Leu
305 310 315 320
Ser Ser Ser Ser Ile Asp Pro Ala Lys Ala Ile Gly Gln Lys Leu Pro
325 330 335
Val Thr Leu Tyr Lys Tyr Ser His Leu Glu Pro Ile Leu Tyr Ala Leu
340 345 350
Gly Val Gly Met Ser Thr Arg Asp Pro Asp His Leu Lys Phe Leu Tyr
355 360 365
Glu Gly Ser Glu Asp Phe Ser Cys Leu Pro Ser Phe Gly Val Val Val
370 375 380
Ser Gln Ala Ala Phe Met Ser Gly Gly Leu Ala Ser Val Pro Gly Leu
385 390 395 400
Asn Ile Asp Phe Thr Arg Val Leu His Gly Glu Gln Tyr Leu Glu Val
405 410 415
Tyr Lys Pro Leu Pro Thr Ser Gly Glu Met Thr Ser His Ala Thr Val
420 425 430
Ala Asp Ile Met Asp Lys Gly Ser Gly Ala Ile Ile Leu Leu Asp Val
435 440 445
His Thr Tyr His Gly Ala Asp Leu Ile Cys Tyr Asn Gln Phe Ser Val
450 455 460
Phe Val Val Gly Ala Gly Gly Phe Gly Gly Lys Arg Ser Ser Ser Lys
465 470 475 480
Ala Lys Ala Thr Glu Asn Pro Pro Ser Arg Pro Pro Asp Val Val Glu
485 490 495
Ile Asp Val Thr Asn Ala Asp Gln Ala Ala Leu Tyr Arg Leu Ser Gly
500 505 510
Asp Trp Asn Pro Leu His Ile Asp Pro Ser Phe Ala Ala Leu Gly Gly
515 520 525
Phe Glu Arg Pro Ile Leu His Gly Leu Cys Ser Phe Gly Phe Ser Ala
530 535 540
Arg His Val Leu Lys His Phe Ala Asn Asn Asp Val Thr Lys Phe Lys
545 550 555 560
Ala Ile Lys Val Arg Phe Ala Lys Pro Val Leu Pro Gly Gln Thr Leu
565 570 575
Gln Thr Glu Met Trp Lys Glu Gly Asn Arg Ile Phe Leu Gln Thr Lys
580 585 590
Val Lys Glu Thr Gly Glu Ile Ala Ile Ala Gly Ala Tyr Val Asp Leu
595 600 605
Ala Ser Thr Val Asn Asn Pro Glu Ser Lys Ala Ala Val Gln Asp Gly
610 615 620
Gly Leu Gln Ser Asp Leu Val Phe Glu Glu Ile Ser Arg Arg Val Lys
625 630 635 640
Asp Val Gly Gly Gln Leu Val Lys Lys Val Asn Ala Val Phe Gln Trp
645 650 655
Asp Ile Thr Lys Asp Gly Lys Thr Ala Ser Gln Trp Thr Ile Asp Leu
660 665 670
Lys Ser Gly Gly Ser Gly Glu Val Tyr Arg Gly Lys Ala Arg Gly Arg
675 680 685
Ala Asp Thr Ser Phe Thr Leu Ser Asp Glu Asp Phe Met Glu Leu Val
690 695 700
Leu Gly Lys Val Asn Pro Gln Lys Ala Phe Phe Ala Gly Lys Leu Lys
705 710 715 720
Val Lys Gly Asn Ile Met Leu Ser Gln Lys Leu Glu Met Ile Leu Lys
725 730 735
Asp Tyr Ala Lys Leu
740
<210> 190
<211> 2226
<212> DNA
<213> African clawed frog (Xenopus laevis)
<400> 190
atggattctc aagttttgag attcgatggt agagttgttt tggttacagg tgctggtggt 60
ggtttgggta gaacttatgc tttagcattt gctgaaagag gtgcatcagt tgttgttaat 120
gatttgggtg gtgacattaa aggtgagggt aaatcttcat ttgctgcaga taaggttgtt 180
gaagaaatca gagctaaagg tggtaaagca gttgctaatt acgattctgt tgaagcaggt 240
gaaaaattgg ttcaatcagc tttagatgca ttcggtagaa tcgatatcat tattaacaat 300
gctggtattt tgagagatag atctttcgct agaatttcag atgcagattg ggatatcatc 360
catagagttc atttgaaggg ttcatttttg atcacaagag ctgcatggaa tcatatgaag 420
aaccaaaagt tcggtagaat cattatgact tcttcagctg caggtatcta tggtaacttc 480
ggtcaagcta actactctgc tgcaaagttg ggtttagttg gtttatcaaa cacattggca 540
attgaaggta ctaagtacaa catccaatct aactgtattg ctccaacagc aggttcaaga 600
ttaactcaaa cagttatgcc acaagatttg ttagatgctt tgaaaccaga atacgttaca 660
ccattggttt tatggttgtg tcatgaaaga tgtcaagaaa ctggttcttt atttgaagtt 720
ggtgctggtt gggttggtaa attgagatgg gaaagatcat taggtgcaat catcagacaa 780
actaacagac caatgacacc agaagctgtt agagatgaat gggcaaagat ctgtgatttc 840
gataacgctg ataagccaca aactatccaa gattctatta atccattgta ccaagttttg 900
tcacaagttg attctgaaaa aggtgtttca atgaatccaa cttcacatgg tacatcttta 960
tcttcatctt caattgatcc agcaaaagct attggtcaaa agttgccagt tacattgtac 1020
aagtactctc atttggaacc aatcttgtat gctttgggtg ttggcatgtc aactagagat 1080
ccagatcatt tgaagttctt gtacgaaggt tcagaagatt tctcttgttt gccatcattt 1140
ggtgttgttg tttctcaagc tgcttttatg tctggtggtt tggcttcagt tccaggttta 1200
aacatcgatt tcacaagagt tttgcatggt gaacaatatt tggaagttta caagccattg 1260
ccaacttctg gtgaaatgac ttcacatgct acagttgcag atattatgga taaaggttct 1320
ggtgctatca tcttgttgga tgttcatact taccatggtg cagatttgat ctgttacaac 1380
caattttcag tttttgttgt tggtgctggt ggttttggtg gtaaaagatc ttcatctaaa 1440
gcaaaagcta cagaaaatcc accatctaga ccaccagatg ttgttgaaat tgatgttact 1500
aatgcagatc aagctgcatt atatagattg tctggtgact ggaatccatt acatattgat 1560
ccatcatttg ctgcattggg tggttttgaa agaccaatct tgcatggttt gtgttctttc 1620
ggtttttcag ctagacatgt tttgaagcat ttcgcaaaca acgatgttac aaagtttaaa 1680
gctattaaag ttagattcgc aaagccagtt ttgccaggtc aaactttaca aacagaaatg 1740
tggaaggagg gtaacagaat tttcttgcaa actaaggtta aggaaacagg tgaaatcgca 1800
attgctggtg catacgttga tttggcttct actgttaata atccagaatc aaaagctgca 1860
gttcaagatg gtggtttaca atctgatttg gttttcgaag aaatttcaag aagagttaaa 1920
gatgttggtg gtcaattggt taagaaagtt aacgctgttt tccaatggga tatcactaag 1980
gatggtaaaa cagcatctca atggactatt gatttgaaat caggtggttc tggtgaagtt 2040
tatcgtggta aagctagagg tagagcagat acttctttta cattgtcaga tgaagatttc 2100
atggaattag ttttgggtaa agttaaccca caaaaagctt tctttgctgg taaattaaaa 2160
gttaaaggta atattatgtt atctcaaaaa ttggaaatga ttttaaaaga ttacgctaaa 2220
ttgtaa 2226
<210> 191
<211> 731
<212> PRT
<213> Common hydra (Hydra vulgaris)
<400> 191
Met Ser Ser Leu Ser Phe Ala Gly Arg Val Ala Val Ile Thr Gly Ala
1 5 10 15
Gly Gly Gly Leu Gly Arg Glu Tyr Ala Leu Glu Phe Ala Lys Arg Gly
20 25 30
Ala Gln Val Val Val Asn Asp Leu Gly Gly Ser Phe Lys Gly Glu Gly
35 40 45
Ser Ser Thr Leu Leu Ala Asp Gln Val Val Lys Glu Ile Ile Asn Ala
50 55 60
Gly Gly Lys Ala Val Ala Asn Tyr Asp Ser Val Glu Asn Gly Glu Gln
65 70 75 80
Ile Ile Lys Thr Ala Ile Gln Glu Phe Gly Lys Val Asp Ile Leu Ile
85 90 95
Asn Asn Ala Gly Ile Leu Arg Asp Arg Ser Phe Ser Lys Met Ser Asp
100 105 110
Lys Asp Trp Glu Gln Ile Phe Lys Val His Val Asp Gly Ala Phe Lys
115 120 125
Cys Thr Gln Ala Val Trp Pro Tyr Met Gln Lys Gln Lys Phe Gly Arg
130 135 140
Ile Ile Met Thr Ser Ser Pro Ala Gly Leu Tyr Gly Asn Phe Gly Gln
145 150 155 160
Ala Asn Tyr Ser Ala Ala Lys Ala Ala Leu Ile Gly Leu Met Asn Thr
165 170 175
Leu Ser Ile Glu Gly Lys Lys Ala Asn Ile Asn Val Asn Val Ile Ala
180 185 190
Pro Leu Ala Glu Thr Arg Met Thr Ala Asp Ile Leu Pro Gly Ala Gly
195 200 205
Leu Leu Pro Glu His Val Ala Pro Phe Val Val Phe Met Cys His Glu
210 215 220
Ser Cys Val Asp Thr Gly Ile Ile Leu Glu Ala Ala Gly Gly Phe Ala
225 230 235 240
Cys Lys Thr Arg Leu Gln Arg Ser Gln Gly Ile Gln Leu Arg Lys Tyr
245 250 255
Ile Gly Asp Lys Pro Thr Val Glu Cys Val Gln Lys Asn Trp Thr Lys
260 265 270
Ile Ser Asp Phe Ser Leu Ser Cys Asn Pro Arg Ser Val Gln Glu Ala
275 280 285
Ser Asn Lys Ile Met Glu Ser Ile Gly Asp Leu Pro Ser Glu Pro Leu
290 295 300
Ser Thr Ser Ala Ser Leu Leu Glu Lys Val Arg Ser Tyr Lys Phe Pro
305 310 315 320
Ser Ile Thr Val Ile Tyr Asp Gln Asn Asp Ile Ile Lys Tyr Ala Leu
325 330 335
Ser Val Gly Ser Ser Leu Pro Asp Asp Ser Gln Phe Leu Tyr Glu Gly
340 345 350
His Ala Asn Phe Ser Ala Ile Pro Thr Phe Ala Ala Ile Leu Ser Gln
355 360 365
Lys Ala Val Phe Ser Glu Leu Ala Glu Gly Asn Ile Pro Gly Met Asp
370 375 380
Met Ile Asp Leu Ser Lys Val Leu His Gly Glu Gln Phe Ile Glu Ile
385 390 395 400
Phe Lys Pro Ile Pro Thr Ser Gly Gln Phe Thr Val Lys Gly Gln Ile
405 410 415
Arg Asp Ile Leu Asp Lys His Lys Phe Cys Gln Phe Ile Ile Asp Val
420 425 430
Asn Val Phe Asp Ala Lys Asn Glu Leu Val Cys Met Ser Gln Phe Val
435 440 445
Leu Leu Phe Ile Gly Ser Lys Gly Ile Gly His Arg Gly Lys Tyr Asp
450 455 460
Gly Gln Lys Pro Thr Leu Phe Pro Pro Lys Arg Lys Pro Asp His Val
465 470 475 480
Val Glu Glu Val Thr Ser Ile Asn Gln Ala Ala Leu Tyr Arg Leu Asn
485 490 495
Gly Asp Phe Asn Pro Leu His Ile Asp Pro Gln Ile Ser Ser Met Leu
500 505 510
Gly Phe Glu Lys Pro Leu Leu His Gly Leu Cys Thr Tyr Gly Tyr Ala
515 520 525
Leu Arg His Val Leu Lys Ala Tyr Ala Asn Asn Asp Ala Ser Phe Phe
530 535 540
Lys Ser Ile Lys Ala Gln Phe Ser Lys Pro Val Ile Pro Gly Gln Thr
545 550 555 560
Ile Met Thr Glu Met Trp His Glu Ala Asn Arg Val Tyr Tyr Gln Val
565 570 575
Lys Val Lys Glu Thr Gly Asp Val Val Ile Lys Gly Gly Tyr Val Asp
580 585 590
Phe His Lys Glu Leu Lys Gly Gln Ser Ser Val Ser Ala Ser Ala His
595 600 605
Ser Tyr Gly Ile Asp Ser Ser Leu Gln Ser Ser His Ala Met Lys Lys
610 615 620
Ile Glu Asp Ser Leu Lys Thr Ala Asp Glu Ala Val Leu Lys Gln Ile
625 630 635 640
Asn Gly Ser Phe Leu Phe Gln Ile Thr Lys Glu Asn Lys Leu Ala Gly
645 650 655
Glu Trp Leu Leu Asn Phe Asn Gln Phe Pro Val Thr Val Thr Tyr Gly
660 665 670
Val Pro Ile Thr Lys Pro Asp Val Thr Ile Thr Ile Asn Asp Asp Asp
675 680 685
Phe Val Leu Ile Ala Thr Gly Lys Leu Asn Pro Met Gln Ala Phe Ser
690 695 700
Gln Gly Lys Leu Lys Ala Phe Gly Lys Val Ile Leu Ala Leu Lys Leu
705 710 715 720
Gly Asp Ile Phe Lys Ser Val Ser Ser Lys Leu
725 730
<210> 192
<211> 2196
<212> DNA
<213> Common hydra (Hydra vulgaris)
<400> 192
atgtcttcat tatcttttgc tggtagagtt gcagttatta ctggtgctgg tggtggtttg 60
ggtagagaat atgctttgga atttgcaaaa agaggtgctc aagttgttgt taatgatttg 120
ggtggttctt ttaaaggtga aggttcttca actttgttgg cagatcaagt tgttaaggaa 180
attattaacg ctggtggtaa agctgttgca aattacgatt ctgttgaaaa cggtgaacaa 240
attattaaga cagcaatcca agaattcggt aaagttgata tcttgattaa taacgctggt 300
attttgagag atagatcttt ttcaaagatg tctgataagg attgggaaca aatttttaag 360
gttcatgttg atggtgcttt taaatgtact caagctgttt ggccatacat gcaaaagcaa 420
aagttcggta gaatcatcat gacatcttca ccagcaggtt tatacggtaa cttcggtcaa 480
gctaactact cagctgcaaa agctgcattg atcggtttga tgaacacatt gtctatcgaa 540
ggtaaaaagg ctaacatcaa cgttaacgtt atcgctccat tggcagaaac tagaatgaca 600
gcagatattt taccaggtgc tggtttgtta ccagaacatg ttgcaccatt cgttgttttt 660
atgtgtcatg aatcatgtgt tgatactggt atcatcttag aagctgcagg tggtttcgct 720
tgtaagacaa gattgcaaag atctcaaggt attcaattga gaaagtacat cggtgacaaa 780
ccaactgttg aatgtgttca aaagaattgg acaaagatct ctgatttctc tttgtcatgt 840
aatccaagat cagttcaaga agcatctaat aagatcatgg aatcaatcgg tgacttgcca 900
tctgaaccat tgtctacttc agcttctttg ttagaaaaag ttagatcata taaatttcca 960
tctattactg ttatatatga tcaaaacgat attattaagt acgctttgtc agttggttct 1020
tcattgccag atgattctca attcttgtac gaaggtcatg caaacttctc tgctattcca 1080
acttttgctg caattttgtc acaaaaagca gttttctctg aattagctga gggtaacatc 1140
cctggtatgg atatgatcga tttgtcaaag gttttgcatg gtgaacaatt cattgaaatt 1200
tttaagccaa tcccaacttc tggtcaattc actgttaaag gtcaaatcag agatatcttg 1260
gataagcata agttttgtca attcattatt gatgttaatg tttttgatgc taaaaatgaa 1320
ttggtttgta tgtcacaatt cgttttgttg tttattggtt ctaaaggtat tggtcatcgt 1380
ggtaaatacg atggtcaaaa gccaactttg tttccaccaa aaagaaaacc agatcatgtt 1440
gttgaagaag ttacatctat taatcaagct gcattgtaca gattgaatgg tgacttcaac 1500
ccattgcata tcgatccaca aatctcttca atgttgggtt tcgaaaagcc attgttgcat 1560
ggtttgtgta cttatggtta cgctttgaga catgttttga aggcttacgc aaacaacgat 1620
gcatctttct ttaagtctat taaagctcaa ttttcaaagc cagttattcc aggtcaaact 1680
attatgacag aaatgtggca tgaagctaac agagtttact accaagttaa ggttaaagaa 1740
acaggtgacg ttgttattaa aggtggttac gttgatttcc ataaggaatt gaaaggtcaa 1800
tcttcagttt cagcttctgc acattcttac ggtattgatt cttcattgca atcttcacat 1860
gcaatgaaga aaattgaaga ttcattgaag actgctgatg aagcagtttt gaagcaaatt 1920
aatggttcat ttttgttcca aatcacaaag gaaaataagt tggctggtga atggttgttg 1980
aacttcaacc aattcccagt tactgttaca tatggtgttc caatcactaa gccagatgtt 2040
actatcacaa ttaatgatga tgatttcgtt ttgatcgcaa ctggtaaatt gaacccaatg 2100
caagcttttt cacagggtaa attgaaggca ttcggtaaag ttattttggc tttgaagttg 2160
ggtgacattt ttaagtctgt ttcttcaaaa ttgtaa 2196
<210> 193
<211> 900
<212> PRT
<213> Brewer's yeast (Saccharomyces cerevisiae)
<400> 193
Met Pro Gly Asn Leu Ser Phe Lys Asp Arg Val Val Val Ile Thr Gly
1 5 10 15
Ala Gly Gly Gly Leu Gly Lys Val Tyr Ala Leu Ala Tyr Ala Ser Arg
20 25 30
Gly Ala Lys Val Val Val Asn Asp Leu Gly Gly Thr Leu Gly Gly Ser
35 40 45
Gly His Asn Ser Lys Ala Ala Asp Leu Val Val Asp Glu Ile Lys Lys
50 55 60
Ala Gly Gly Ile Ala Val Ala Asn Tyr Asp Ser Val Asn Glu Asn Gly
65 70 75 80
Glu Lys Ile Ile Glu Thr Ala Ile Lys Glu Phe Gly Arg Val Asp Val
85 90 95
Leu Ile Asn Asn Ala Gly Ile Leu Arg Asp Val Ser Phe Ala Lys Met
100 105 110
Thr Glu Arg Glu Phe Ala Ser Val Val Asp Val His Leu Thr Gly Gly
115 120 125
Tyr Lys Leu Ser Arg Ala Ala Trp Pro Tyr Met Arg Ser Gln Lys Phe
130 135 140
Gly Arg Ile Ile Asn Thr Ala Ser Pro Ala Gly Leu Phe Gly Asn Phe
145 150 155 160
Gly Gln Ala Asn Tyr Ser Ala Ala Lys Met Gly Leu Val Gly Leu Ala
165 170 175
Glu Thr Leu Ala Lys Glu Gly Ala Lys Tyr Asn Ile Asn Val Asn Ser
180 185 190
Ile Ala Pro Leu Ala Arg Ser Arg Met Thr Glu Asn Val Leu Pro Pro
195 200 205
His Ile Leu Lys Gln Leu Gly Pro Glu Lys Ile Val Pro Leu Val Leu
210 215 220
Tyr Leu Thr His Glu Ser Thr Lys Val Ser Asn Ser Ile Phe Glu Leu
225 230 235 240
Ala Ala Gly Phe Phe Gly Gln Leu Arg Trp Glu Arg Ser Ser Gly Gln
245 250 255
Ile Phe Asn Pro Asp Pro Lys Thr Tyr Thr Pro Glu Ala Ile Leu Asn
260 265 270
Lys Trp Lys Glu Ile Thr Asp Tyr Arg Asp Lys Pro Phe Asn Lys Thr
275 280 285
Gln His Pro Tyr Gln Leu Ser Asp Tyr Asn Asp Leu Ile Thr Lys Ala
290 295 300
Lys Lys Leu Pro Pro Asn Glu Gln Gly Ser Val Lys Ile Lys Ser Leu
305 310 315 320
Cys Asn Lys Val Val Val Val Thr Gly Ala Gly Gly Gly Leu Gly Lys
325 330 335
Ser His Ala Ile Trp Phe Ala Arg Tyr Gly Ala Lys Val Val Val Asn
340 345 350
Asp Ile Lys Asp Pro Phe Ser Val Val Glu Glu Ile Asn Lys Leu Tyr
355 360 365
Gly Glu Gly Thr Ala Ile Pro Asp Ser His Asp Val Val Thr Glu Ala
370 375 380
Pro Leu Ile Ile Gln Thr Ala Ile Ser Lys Phe Gln Arg Val Asp Ile
385 390 395 400
Leu Val Asn Asn Ala Gly Ile Leu Arg Asp Lys Ser Phe Leu Lys Met
405 410 415
Lys Asp Glu Glu Trp Phe Ala Val Leu Lys Val His Leu Phe Ser Thr
420 425 430
Phe Ser Leu Ser Lys Ala Val Trp Pro Ile Phe Thr Lys Gln Lys Ser
435 440 445
Gly Phe Ile Ile Asn Thr Thr Ser Thr Ser Gly Ile Tyr Gly Asn Phe
450 455 460
Gly Gln Ala Asn Tyr Ala Ala Ala Lys Ala Ala Ile Leu Gly Phe Ser
465 470 475 480
Lys Thr Ile Ala Leu Glu Gly Ala Lys Arg Gly Ile Ile Val Asn Val
485 490 495
Ile Ala Pro His Ala Glu Thr Ala Met Thr Lys Thr Ile Phe Ser Glu
500 505 510
Lys Glu Leu Ser Asn His Phe Asp Ala Ser Gln Val Ser Pro Leu Val
515 520 525
Val Leu Leu Ala Ser Glu Glu Leu Gln Lys Tyr Ser Gly Arg Arg Val
530 535 540
Ile Gly Gln Leu Phe Glu Val Gly Gly Gly Trp Cys Gly Gln Thr Arg
545 550 555 560
Trp Gln Arg Ser Ser Gly Tyr Val Ser Ile Lys Glu Thr Ile Glu Pro
565 570 575
Glu Glu Ile Lys Glu Asn Trp Asn His Ile Thr Asp Phe Ser Arg Asn
580 585 590
Thr Ile Asn Pro Ser Ser Thr Glu Glu Ser Ser Met Ala Thr Leu Gln
595 600 605
Ala Val Gln Lys Ala His Ser Ser Lys Glu Leu Asp Asp Gly Leu Phe
610 615 620
Lys Tyr Thr Thr Lys Asp Cys Ile Leu Tyr Asn Leu Gly Leu Gly Cys
625 630 635 640
Thr Ser Lys Glu Leu Lys Tyr Thr Tyr Glu Asn Asp Pro Asp Phe Gln
645 650 655
Val Leu Pro Thr Phe Ala Val Ile Pro Phe Met Gln Ala Thr Ala Thr
660 665 670
Leu Ala Met Asp Asn Leu Val Asp Asn Phe Asn Tyr Ala Met Leu Leu
675 680 685
His Gly Glu Gln Tyr Phe Lys Leu Cys Thr Pro Thr Met Pro Ser Asn
690 695 700
Gly Thr Leu Lys Thr Leu Ala Lys Pro Leu Gln Val Leu Asp Lys Asn
705 710 715 720
Gly Lys Ala Ala Leu Val Val Gly Gly Phe Glu Thr Tyr Asp Ile Lys
725 730 735
Thr Lys Lys Leu Ile Ala Tyr Asn Glu Gly Ser Phe Phe Ile Arg Gly
740 745 750
Ala His Val Pro Pro Glu Lys Glu Val Arg Asp Gly Lys Arg Ala Lys
755 760 765
Phe Ala Val Gln Asn Phe Glu Val Pro His Gly Lys Val Pro Asp Phe
770 775 780
Glu Ala Glu Ile Ser Thr Asn Lys Asp Gln Ala Ala Leu Tyr Arg Leu
785 790 795 800
Ser Gly Asp Phe Asn Pro Leu His Ile Asp Pro Thr Leu Ala Lys Ala
805 810 815
Val Lys Phe Pro Thr Pro Ile Leu His Gly Leu Cys Thr Leu Gly Ile
820 825 830
Ser Ala Lys Ala Leu Phe Glu His Tyr Gly Pro Tyr Glu Glu Leu Lys
835 840 845
Val Arg Phe Thr Asn Val Val Phe Pro Gly Asp Thr Leu Lys Val Lys
850 855 860
Ala Trp Lys Gln Gly Ser Val Val Val Phe Gln Thr Ile Asp Thr Thr
865 870 875 880
Arg Asn Val Ile Val Leu Asp Asn Ala Ala Val Lys Leu Ser Gln Ala
885 890 895
Lys Ser Lys Leu
900
<210> 194
<211> 2703
<212> DNA
<213> Saccharomyces cerevisiae
<400> 194
atgcctggaa atttatcctt caaagataga gttgttgtaa tcacgggcgc tggagggggc 60
ttaggtaagg tgtatgcact agcttacgca agcagaggtg caaaagtggt cgtcaatgat 120
ctaggtggca ctttgggtgg ttcaggacat aactccaaag ctgcagactt agtggtggat 180
gagataaaaa aagccggagg tatagctgtg gcaaattacg actctgttaa tgaaaatgga 240
gagaaaataa ttgaaacggc tataaaagaa ttcggcaggg ttgatgtact aattaacaac 300
gctggaatat taagggatgt ttcatttgca aagatgacag aacgtgagtt tgcatctgtg 360
gtagatgttc atttgacagg tggctataag ctatcgcgtg ctgcttggcc ttatatgcgc 420
tctcagaaat ttggtagaat cattaacacc gcttcccctg ccggtctatt tggaaatttt 480
ggtcaagcta attattcagc agctaaaatg ggcttagttg gtttggcgga aaccctcgcg 540
aaggagggtg ccaaatacaa cattaatgtt aattcaattg cgccattggc tagatcacgt 600
atgacagaaa acgtgttacc accacatatc ttgaaacagt taggaccgga aaaaattgtt 660
cccttagtac tctatttgac acacgaaagt acgaaagtgt caaactccat ttttgaactc 720
gctgctggat tctttggaca gctcagatgg gagaggtctt ctggacaaat tttcaatcca 780
gaccccaaga catatactcc tgaagcaatt ttaaataagt ggaaggaaat cacagactat 840
agggacaagc catttaacaa aactcagcat ccatatcaac tctcggatta taatgattta 900
atcaccaaag caaaaaaatt acctcccaat gaacaaggct cagtgaaaat caagtcgctt 960
tgcaacaaag tcgtagtagt tacgggtgca ggaggtggtc ttgggaagtc tcatgcaatc 1020
tggtttgcac ggtacggtgc gaaggtagtt gtaaatgaca tcaaggatcc tttttcagtt 1080
gttgaagaaa taaataaact atatggtgaa ggcacagcca ttccagattc ccatgatgtg 1140
gtcaccgaag ctcctctcat tatccaaact gcaataagta agtttcagag agtagacatc 1200
ttggtcaata acgctggtat tttgcgtgac aaatcttttt taaaaatgaa agatgaggaa 1260
tggtttgctg tcctgaaagt ccaccttttt tccacatttt cattgtcaaa agcagtatgg 1320
ccaatattta ccaaacaaaa gtctggattt attatcaata ctacttctac ctcaggaatt 1380
tatggtaatt ttggacaggc caattatgcc gctgcaaaag ccgccatttt aggattcagt 1440
aaaactattg cactggaagg tgccaagaga ggaattattg ttaatgttat cgctcctcat 1500
gcagaaacgg ctatgacaaa gactatattc tcggagaagg aattatcaaa ccactttgat 1560
gcatctcaag tctccccact tgttgttttg ttggcatctg aagaactaca aaagtattct 1620
ggaagaaggg ttattggcca attattcgaa gttggcggtg gttggtgtgg gcaaaccaga 1680
tggcaaagaa gttccggtta tgtttctatt aaagagacta ttgaaccgga agaaattaaa 1740
gaaaattgga accacatcac tgatttcagt cgcaacacta tcaacccgag ctccacagag 1800
gagtcttcta tggcaacctt gcaagccgtg caaaaagcgc actcttcaaa ggagttggat 1860
gatggattat tcaagtacac taccaaggat tgtatcttgt acaatttagg acttggatgc 1920
acaagcaaag agcttaagta cacctacgag aatgatccag acttccaagt tttgcccacg 1980
ttcgccgtca ttccatttat gcaagctact gccacactag ctatggacaa tttagtcgat 2040
aacttcaatt atgcaatgtt actgcatgga gaacaatatt ttaagctctg cacgccgaca 2100
atgccaagta atggaactct aaagacactt gctaaacctt tacaagtact tgacaagaat 2160
ggtaaagccg ctttagttgt tggtggcttc gaaacttatg acattaaaac taagaaactc 2220
atagcttata acgaaggatc gttcttcatc aggggcgcac atgtacctcc agaaaaggaa 2280
gtgagggatg ggaaaagagc caagtttgct gtccaaaatt ttgaagtgcc acatggaaag 2340
gtaccagatt ttgaggcgga gatttctacg aataaagatc aagccgcatt gtacaggtta 2400
tctggcgatt tcaatccttt acatatcgat cccacgctag ccaaagcagt taaatttcct 2460
acgccaattc tgcatgggct ttgtacatta ggtattagtg cgaaagcatt gtttgaacat 2520
tatggtccat atgaggagtt gaaagtgaga tttaccaatg ttgttttccc aggtgatact 2580
ctaaaggtta aagcttggaa gcaaggctcg gttgtcgttt ttcaaacaat tgatacgacc 2640
agaaacgtca ttgtattgga taacgccgct gtaaaactat cgcaggcaaa atctaaacta 2700
taa 2703
<210> 195
<211> 462
<212> PRT
<213> Arabidopsis thaliana
<400> 195
Met Glu Lys Ala Ile Glu Arg Gln Arg Val Leu Leu Glu His Leu Arg
1 5 10 15
Pro Ser Ser Ser Ser Ser His Asn Tyr Glu Ala Ser Leu Ser Ala Ser
20 25 30
Ala Cys Leu Ala Gly Asp Ser Ala Ala Tyr Gln Arg Thr Ser Leu Tyr
35 40 45
Gly Asp Asp Val Val Ile Val Ala Ala His Arg Thr Pro Leu Cys Lys
50 55 60
Ser Lys Arg Gly Asn Phe Lys Asp Thr Tyr Pro Asp Asp Leu Leu Ala
65 70 75 80
Pro Val Leu Arg Ala Leu Ile Glu Lys Thr Asn Leu Asn Pro Ser Glu
85 90 95
Val Gly Asp Ile Val Val Gly Thr Val Leu Ala Pro Gly Ser Gln Arg
100 105 110
Ala Ser Glu Cys Arg Met Ala Ala Phe Tyr Ala Gly Phe Pro Glu Thr
115 120 125
Val Ala Val Arg Thr Val Asn Arg Gln Cys Ser Ser Gly Leu Gln Ala
130 135 140
Val Ala Asp Val Ala Ala Ala Ile Lys Ala Gly Phe Tyr Asp Ile Gly
145 150 155 160
Ile Gly Ala Gly Leu Glu Ser Met Thr Thr Asn Pro Met Ala Trp Glu
165 170 175
Gly Ser Val Asn Pro Ala Val Lys Lys Phe Ala Gln Ala Gln Asn Cys
180 185 190
Leu Leu Pro Met Gly Val Thr Ser Glu Asn Val Ala Gln Arg Phe Gly
195 200 205
Val Ser Arg Gln Glu Gln Asp Gln Ala Ala Val Asp Ser His Arg Lys
210 215 220
Ala Ala Ala Ala Thr Ala Ala Gly Lys Phe Lys Asp Glu Ile Ile Pro
225 230 235 240
Val Lys Thr Lys Leu Val Asp Pro Lys Thr Gly Asp Glu Lys Pro Ile
245 250 255
Thr Val Ser Val Asp Asp Gly Ile Arg Pro Thr Thr Thr Leu Ala Ser
260 265 270
Leu Gly Lys Leu Lys Pro Val Phe Lys Lys Asp Gly Thr Thr Thr Ala
275 280 285
Gly Asn Ser Ser Gln Val Ser Asp Gly Ala Gly Ala Val Leu Leu Met
290 295 300
Lys Arg Ser Val Ala Met Gln Lys Gly Leu Pro Val Leu Gly Val Phe
305 310 315 320
Arg Thr Phe Ala Ala Val Gly Val Asp Pro Ala Ile Met Gly Ile Gly
325 330 335
Pro Ala Val Ala Ile Pro Ala Ala Val Lys Ala Ala Gly Leu Glu Leu
340 345 350
Asp Asp Ile Asp Leu Phe Glu Ile Asn Glu Ala Phe Ala Ser Gln Phe
355 360 365
Val Tyr Cys Arg Asn Lys Leu Gly Leu Asp Pro Glu Lys Ile Asn Val
370 375 380
Asn Gly Gly Ala Met Ala Ile Gly His Pro Leu Gly Ala Thr Gly Ala
385 390 395 400
Arg Cys Val Ala Thr Leu Leu His Glu Met Lys Arg Arg Gly Lys Asp
405 410 415
Cys Arg Phe Gly Val Val Ser Met Cys Ile Gly Thr Gly Met Gly Ala
420 425 430
Ala Ala Val Phe Glu Arg Gly Asp Gly Val Asp Glu Leu Arg Asn Ala
435 440 445
Arg Lys Val Glu Ala Gln Gly Leu Leu Ser Lys Asp Ala Arg
450 455 460
<210> 196
<211> 1389
<212> DNA
<213> Arabidopsis thaliana
<400> 196
atggaaaagg ctatcgaaag acaaagagtt ttgttggaac atttgagacc atcttcatct 60
tcatctcata actacgaagc ttcattatct gcttcagcat gtttggctgg tgactctgct 120
gcatatcaaa gaacatcatt atacggtgac gatgttgtta ttgttgctgc acatagaaca 180
ccattgtgta agtctaagcg tggtaacttc aaggatactt acccagatga tttgttagct 240
ccagttttga gagcattgat cgaaaagact aatttgaatc catcagaagt tggtgacatt 300
gttgttggta ctgttttggc tccaggttct caaagagcat cagaatgtag aatggctgca 360
ttttatgctg gttttccaga aactgttgca gttagaacag ttaatagaca atgttcatct 420
ggtttacaag ctgttgcaga tgttgctgca gctattaaag ctggtttcta cgatatcggt 480
attggtgcag gtttggaatc tatgactaca aatccaatgg cttgggaagg ttcagttaat 540
ccagcagtta agaaattcgc tcaagcacaa aactgtttgt tgccaatggg tgttacatct 600
gaaaatgttg ctcaaagatt tggtgtttca agacaagaac aagatcaagc agctgttgat 660
tctcatagaa aagcagctgc agctactgca gctggtaaat tcaaagatga aatcatccca 720
gttaaaacta aattagttga tccaaaaaca ggtgacgaaa aaccaattac tgtttctgtt 780
gatgatggta ttagaccaac tacaactttg gcttcattgg gtaaattgaa gccagttttt 840
aagaaagatg gtacaactac agctggtaat tcatctcaag tttctgatgg tgctggtgca 900
gttttgttga tgaagagatc agttgctatg caaaagggtt taccagtttt gggtgttttt 960
agaacatttg cagctgttgg tgttgatcca gctattatgg gtattggtcc agctgttgca 1020
attccagcag ctgttaaagc agctggtttg gaattggatg atatcgattt gttcgaaatt 1080
aatgaagctt tcgcatctca attcgtttac tgtagaaata agttgggttt agatccagaa 1140
aagattaatg ttaacggtgg tgctatggca attggtcatc cattgggtgc tacaggtgca 1200
agatgtgttg ctactttgtt gcatgaaatg aagagacgtg gtaaagattg tagattcggt 1260
gttgtttcta tgtgtattgg tactggtatg ggtgcagctg cagtttttga aagaggtgac 1320
ggtgttgatg aattgagaaa tgctagaaaa gttgaagcac aaggtttgtt atcaaaagat 1380
gctagataa 1389
<210> 197
<211> 389
<212> PRT
<213> Rhodobacteraceae bacterium HTCC2083
<400> 197
Met Lys Gln Ala Val Ile Val Ser Thr Ala Arg Ser Gly Leu Ala Lys
1 5 10 15
Ser Phe Arg Gly Ser Leu Asn Gln Thr His Gly Ala Thr Leu Gly Ala
20 25 30
His Ser Val Gln Asn Ala Ile Ser Arg Ala Gly Ile Asp Pro Ala Ser
35 40 45
Val Glu Asp Val Leu Ile Gly Cys Ala Thr Pro Glu Gly Ala Thr Gly
50 55 60
Gly Asn Ile Ala Arg Gln Ile Ala Leu Arg Ala Gly Leu Pro Val Ser
65 70 75 80
Val Cys Gly Ala Thr Val Asn Arg Phe Cys Ser Ser Gly Leu Gln Thr
85 90 95
Ile Ala Met Ala Ala Gln Ser Ile Gln Asn Gly Ala Gly Pro Met Val
100 105 110
Ala Gly Gly Val Glu Ser Ile Ser Leu Thr Gly Asn His Ala Val Pro
115 120 125
Ser His Asp Pro Trp Ile Lys Glu His Lys Pro Ala Val Tyr Met Thr
130 135 140
Met Ile Glu Thr Ala Asp Asn Val Ala Glu Arg Tyr Lys Ile Ser Arg
145 150 155 160
Asp Ala Gln Asp Glu Tyr Gly Leu Arg Ser Gln Leu Arg Met Ala Ala
165 170 175
Ala Gln Ala Ala Gly Lys Phe Ala Asp Glu Ile Val Pro Met Ala Ala
180 185 190
Thr Met Ala Val Lys Asp Lys Glu Thr Gly Glu Ile Ser Gln His Glu
195 200 205
Val Thr Val Asp Arg Asp Glu Cys Asn Arg Pro Gln Thr Asn Ile Glu
210 215 220
Gly Leu Thr Gly Leu Ser Pro Val Arg Glu Gly Gly Tyr Val Thr Ala
225 230 235 240
Gly Asn Ala Ser Gln Leu Ser Asp Gly Ser Ala Ala Val Val Leu Met
245 250 255
Glu Ala Ser Glu Ala Glu Arg Gln Gly Ile Glu Pro Leu Gly Ala Phe
260 265 270
Lys Gly Phe Ala Val Ala Gly Cys Glu Pro Asp Glu Met Gly Ile Gly
275 280 285
Pro Val Tyr Ala Val Pro Arg Leu Leu Glu Arg His Gly Leu Lys Val
290 295 300
Asp Asp Ile Asp Leu Trp Glu Leu Asn Glu Ala Phe Ala Ser Gln Ala
305 310 315 320
Leu Tyr Ser Arg Asp Arg Leu Gly Ile Asp Asp Glu Lys Cys Asn Val
325 330 335
Asn Gly Gly Ser Ile Ala Ile Gly His Pro Phe Gly Met Ser Gly Thr
340 345 350
Arg Met Thr Gly His Leu Leu Leu Glu Gly Lys Arg Arg Gly Ala Lys
355 360 365
Leu Gly Val Val Thr Met Cys Ile Gly Gly Gly Met Gly Ala Ala Gly
370 375 380
Leu Phe Glu Ile Phe
385
<210> 198
<211> 1170
<212> DNA
<213> Rhodobacteraceae bacterium HTCC2083
<400> 198
atgaaacaag ctgttattgt ttcaactgca agatctggtt tggctaagtc ttttagaggt 60
tctttgaacc aaactcatgg tgcaacatta ggtgctcatt cagttcaaaa tgcaatttct 120
agagctggta ttgatccagc atcagttgaa gatgttttga ttggttgtgc aactccagaa 180
ggtgctacag gtggtaatat tgctagacaa attgcattaa gagctggttt gccagtttca 240
gtttgtggtg caactgttaa cagattctgt tcttcaggtt tgcaaacaat tgctatggct 300
gcacaatcta ttcaaaatgg tgcaggtcca atggttgctg gtggtgttga atctatctca 360
ttgacaggta accatgcagt tccatctcat gatccatgga tcaaggaaca taagccagct 420
gtttacatga ctatgatcga aacagcagat aacgttgctg aaagatacaa gatctcaaga 480
gatgctcaag atgaatacgg tttaagatct caattgagaa tggctgcagc tcaagcagct 540
ggtaaatttg cagatgaaat tgttccaatg gcagctacta tggctgttaa ggataaggaa 600
acaggtgaaa tctcacaaca tgaagttact gttgatagag atgaatgtaa cagaccacaa 660
actaacatcg aaggtttgac aggtttgtct ccagttagag aaggtggtta cgttacagct 720
ggtaatgctt ctcaattgtc agatggttct gcagctgttg ttttaatgga agcatctgaa 780
gctgaaagac aaggtattga accattgggt gcttttaaag gttttgcagt tgctggttgt 840
gaaccagatg aaatgggtat tggtccagtt tatgctgttc caagattgtt ggaaagacat 900
ggtttgaagg ttgatgatat cgatttgtgg gaattgaatg aagcatttgc ttcacaagct 960
ttatactcta gagatagatt gggtatcgat gatgaaaagt gtaacgttaa cggtggttca 1020
attgctattg gtcatccatt tggcatgtct ggtactagaa tgacaggtca tttgttattg 1080
gaaggtaaaa gaagaggtgc taaattgggt gttgttacta tgtgtattgg tggtggtatg 1140
ggtgcagctg gtttatttga aattttctaa 1170
<210> 199
<211> 419
<212> PRT
<213> Madurella mycetomatis
<400> 199
Met Ala Val Leu Pro Arg Gly Ile Lys Ala Val Leu Thr Lys Ala Pro
1 5 10 15
Thr Asp Val Val Ile Val Ser Ser Leu Arg Thr Pro Ile Cys Arg Ser
20 25 30
Tyr Arg Gly Gln Leu Lys Asp Ala Tyr Pro Glu Glu Leu Leu Ser Val
35 40 45
Val Leu Arg Ala Thr Leu Asp Lys Asn Pro Gln Leu Asp Pro Ala Ala
50 55 60
Val Asp Asp Val Ala Val Gly Val Val Leu Ser Glu Leu Gly Gly Ser
65 70 75 80
Lys Ala Ala Arg Met Ala Met Asn His Val Gly Phe Pro Ser Thr Thr
85 90 95
Ser Leu Tyr Thr Thr Asn Arg Ala Cys Ala Ser Ser Met Gln Ser Ile
100 105 110
Ala Leu Val Ala Ala Gln Ile Arg Thr Glu Met Ile Asp Val Gly Ile
115 120 125
Gly Ala Gly Met Glu Ser Met Thr Arg Asn Tyr Gly Ser Lys Ala Ile
130 135 140
Pro Val Asp Ala Trp Pro Ala Leu Lys Glu Ser Pro Val Lys Asp Ala
145 150 155 160
Arg Asp Cys Val Met Pro Met Gly Leu Thr Ser Glu Asn Val Ala Ser
165 170 175
Arg Tyr Gly Val Ser Arg Ala Asp Gln Asp Ala Phe Ala Val Glu Ser
180 185 190
His Leu Arg Ala Ala Arg Ala Arg Asp Ala Gly Ala Phe Asp Ala Glu
195 200 205
Ile Val Ala Val Thr Thr Arg Phe Gln Glu Val Asp Lys Gln Gly Asn
210 215 220
Lys Val Gly Asp Glu Gln Thr Val Thr Val Thr Arg Asp Asp Gly Ile
225 230 235 240
Arg Thr Asn Ala Ser Leu Glu Gly Leu Ala Lys Leu Lys Pro Ala Phe
245 250 255
Lys Pro Asp Gly Ala Ser Thr Ala Gly Asn Ser Ser Gln Val Ser Asp
260 265 270
Gly Ala Ala Ala Thr Leu Leu Met Arg Arg Ser Thr Ala Thr Arg Leu
275 280 285
Gly Leu Ala Asp Ser Ile Met Gly Lys Phe Val Gly Ala Ala Val Ala
290 295 300
Gly Cys Ala Pro Asp Glu Met Gly Ile Gly Pro Ala Leu Ala Ile Pro
305 310 315 320
Lys Val Leu Asn Gln Leu Gly Leu Thr Asn Ala Asp Val His Arg Trp
325 330 335
Glu Ile Asn Glu Ala Phe Ala Ser Gln Ala Ile His Cys Val His Glu
340 345 350
Leu Gly Leu Glu Lys Ala Trp Gln Asp Gly Arg Val Asn Pro Asp Gly
355 360 365
Gly Ala Ile Ala Leu Gly His Pro Leu Gly Ala Thr Gly Ala Arg Met
370 375 380
Val Ser Thr Leu Met His Gly Met Arg Arg Ser Gly Asp Glu Ile Gly
385 390 395 400
Val Val Ser Met Cys Ile Gly Thr Gly Met Gly Met Ala Gly Val Phe
405 410 415
Val Arg Glu
<210> 200
<211> 1260
<212> DNA
<213> Madurella mycetomatis
<400> 200
atggctgttt taccaagagg tattaaagca gttttgacaa aagctccaac tgatgttgtt 60
attgtttctt cattgagaac accaatctgt agatcataca gaggtcaatt gaaagatgca 120
tacccagaag aattgttgtc tgttgttttg agagctactt tggataagaa tccacaatta 180
gatccagctg cagttgatga tgttgcagtt ggtgttgttt tgtctgaatt aggtggttca 240
aaagctgcaa gaatggctat gaatcatgtt ggtttcccat ctactacatc attgtacact 300
acaaacagag catgtgcttc ttcaatgcaa tctattgctt tggttgctgc acaaatcaga 360
acagaaatga tcgatgttgg tattggtgct ggtatggaat caatgactag aaactacggt 420
tctaaggcta ttccagttga tgcatggcca gctttaaaag aatcaccagt taaagatgca 480
agagattgtg ttatgccaat gggtttgaca tctgaaaatg ttgcatcaag atacggtgtt 540
tctagagctg atcaagatgc atttgctgtt gaatctcatt tgagagctgc aagagctaga 600
gatgcaggtg cttttgatgc agaaattgtt gctgttacta caagattcca agaagttgat 660
aagcaaggta ataaggttgg tgacgaacaa actgttacag ttactagaga tgatggtatt 720
agaactaatg cttctttgga aggtttagca aaattgaaac cagcttttaa accagatggt 780
gcatcaacag ctggtaattc ttcacaagtt tctgatggtg ctgcagctac tttgttaatg 840
agaagatcaa cagcaactag attgggtttg gctgattcta tcatgggtaa attcgttggt 900
gcagctgttg caggttgtgc tccagatgaa atgggtattg gtccagcatt ggctatccca 960
aaggttttga accaattggg tttgacaaat gctgatgttc atagatggga aattaatgaa 1020
gcatttgctt cacaagcaat tcattgtgtt catgaattgg gtttagaaaa agcttggcaa 1080
gatggtagag ttaatccaga tggtggtgca attgctttag gtcatccatt gggtgcaaca 1140
ggtgctagaa tggtttctac tttaatgcat ggtatgagaa gatcaggtga cgaaattggt 1200
gttgtttcta tgtgtattgg tactggtatg ggtatggctg gtgtttttgt tagagaataa 1260
<210> 201
<211> 440
<212> PRT
<213> Cicer arietinum
<400> 201
Met Glu Lys Ala Ile Glu Arg Gln Arg Val Leu Leu Glu His Leu Gln
1 5 10 15
Pro Asn Ser Ser Asn Ser Ala Phe Leu Ser His Thr His Gln Ser Thr
20 25 30
Asp Leu Ser Ala Ser Phe Cys Ser Ala Gly Gln Thr Gly Gly Ser Glu
35 40 45
Asn Asp Val Val Ile Val Ala Ala Tyr Arg Thr Ala Ile Cys Lys Ala
50 55 60
Lys Arg Gly Gly Phe Lys Asp Thr Leu Pro Asp Asp Leu Leu Ala Pro
65 70 75 80
Val Leu Lys Ala Val Ile Glu Lys Thr Asn Val Glu Pro Ser Glu Val
85 90 95
Gly Asp Ile Ile Val Gly Thr Val Leu Gly Pro Gly Ser Glu Lys Ala
100 105 110
Thr Glu Cys Arg Met Ala Ala Phe Tyr Ala Gly Phe Pro Glu Thr Val
115 120 125
Pro Leu Arg Thr Val Asn Arg Gln Cys Ser Ser Gly Leu Gln Ala Val
130 135 140
Ala Asp Val Ala Ala Tyr Ile Lys Ala Gly Phe Tyr Asp Ile Gly Ile
145 150 155 160
Gly Ala Gly Leu Glu Cys Met Ser Gln Asp Asn Ile Ser Arg Leu Arg
165 170 175
Asn Ile Asn Pro Lys Val Glu Thr Phe Ala Gln Ala Arg Asp Cys Leu
180 185 190
Leu Pro Met Gly Ile Thr Ser Glu Asn Val Ala Glu Arg Tyr Gly Val
195 200 205
Thr Arg Gln Glu Gln Asp Gln Ala Ala Val Glu Ser His Arg Arg Ala
210 215 220
Ala Ala Ala Thr Ala Ala Gly Lys Phe Lys Glu Glu Ile Ile Pro Val
225 230 235 240
Ser Thr Lys Ile Val Asp Pro Lys Thr Gly Glu Glu Lys Gln Ile Ile
245 250 255
Val Ser Val Asp Asp Gly Phe Arg Pro Asn Ala Asn Leu Thr Asp Leu
260 265 270
Ala Lys Leu Lys Pro Ala Phe Lys Lys Asp Gly Thr Thr Thr Ala Gly
275 280 285
Asn Ala Ser Gln Ile Ser Asp Gly Ala Ala Ala Val Leu Leu Met Lys
290 295 300
Arg Ser Val Ala Val Gln Lys Gly Leu Pro Ile Leu Gly Ile Phe Arg
305 310 315 320
Ser Phe Ala Ala Val Gly Val Asp Pro Ala Val Met Gly Val Gly Pro
325 330 335
Ala Phe Ala Ile Pro Ala Ala Val Lys Ser Ala Gly Leu Glu Leu Gly
340 345 350
Asn Ile Asp Leu Phe Glu Ile Asn Glu Ala Phe Ala Ser Gln Phe Val
355 360 365
Tyr Ser Cys Lys Lys Leu Gly Leu Asp Arg Ser Lys Val Asn Val Asn
370 375 380
Gly Gly Ala Ile Ala Leu Gly His Pro Leu Gly Ala Thr Gly Ala Arg
385 390 395 400
Ser Val Ala Thr Leu Leu Asn Glu Met Lys Arg Arg Gly Lys Asp Cys
405 410 415
Arg Tyr Gly Val Ile Ser Met Cys Ile Gly Ser Gly Met Gly Ala Ala
420 425 430
Ala Val Phe Glu Arg Gly Asp Phe
435 440
<210> 202
<211> 1323
<212> DNA
<213> Cicer arietinum
<400> 202
atggaaaagg ctatcgaaag acaaagagtt ttgttggaac atttgcaacc aaactcttca 60
aactctgcat ttttgtcaca tacacatcaa tctactgatt tgtctgcttc attttgttct 120
gcaggtcaaa caggtggttc agaaaacgat gttgttattg ttgctgcata cagaacagct 180
atctgtaagg caaagagagg tggttttaaa gatactttgc cagatgattt gttagctcca 240
gttttgaagg cagttattga aaagactaac gttgaaccat ctgaagttgg tgacattatt 300
gttggtacag ttttgggtcc aggttcagaa aaagctactg aatgtagaat ggctgcattt 360
tacgcaggtt ttccagaaac agttccattg agaactgtta acagacaatg ttcttcaggt 420
ttgcaagctg ttgcagatgt tgctgcatac atcaaggctg gtttctacga tattggtatt 480
ggtgcaggtt tagaatgtat gtctcaagat aacatctcaa gattgagaaa catcaatcca 540
aaagttgaaa catttgctca agcaagagat tgtttgttac caatgggtat cacatctgaa 600
aacgttgctg aaagatatgg tgttactaga caagaacaag atcaagctgc agttgaatca 660
catagaagag ctgcagctgc aacagctgct ggtaaattca aagaagaaat catcccagtt 720
tctacaaaga tcgttgatcc aaagactggt gaagaaaagc aaatcatcgt ttcagttgat 780
gatggtttta gaccaaacgc taatttgact gatttggcta agttgaaacc agcttttaag 840
aaagatggta ctacaactgc tggtaatgca tctcaaattt cagatggtgc tgcagctgtt 900
ttgttgatga agagatctgt tgctgttcaa aagggtttgc caatcttggg tatttttaga 960
tcatttgcag ctgttggtgt tgatccagct gttatgggtg ttggtccagc ttttgcaatt 1020
ccagcagctg ttaaatctgc aggtttggaa ttgggtaaca tcgatttgtt cgaaattaat 1080
gaagctttcg catctcaatt cgtttactct tgtaagaaat tgggtttgga tagatctaag 1140
gttaacgtta atggtggtgc tattgcatta ggtcatccat tgggtgctac aggtgcaaga 1200
tcagttgcta ctttgttgaa cgaaatgaag agacgtggta aagattgtag atacggtgtt 1260
atttctatgt gtattggttc aggcatgggt gcagctgcag tttttgaaag aggtgacttt 1320
taa 1323
<210> 203
<211> 417
<212> PRT
<213> Saccharomyces cerevisiae
<400> 203
Met Ser Gln Arg Leu Gln Ser Ile Lys Asp His Leu Val Glu Ser Ala
1 5 10 15
Met Gly Lys Gly Glu Ser Lys Arg Lys Asn Ser Leu Leu Glu Lys Arg
20 25 30
Pro Glu Asp Val Val Ile Val Ala Ala Asn Arg Ser Ala Ile Gly Lys
35 40 45
Gly Phe Lys Gly Ala Phe Lys Asp Val Asn Thr Asp Tyr Leu Leu Tyr
50 55 60
Asn Phe Leu Asn Glu Phe Ile Gly Arg Phe Pro Glu Pro Leu Arg Ala
65 70 75 80
Asp Leu Asn Leu Ile Glu Glu Val Ala Cys Gly Asn Val Leu Asn Val
85 90 95
Gly Ala Gly Ala Thr Glu His Arg Ala Ala Cys Leu Ala Ser Gly Ile
100 105 110
Pro Tyr Ser Thr Pro Phe Val Ala Leu Asn Arg Gln Cys Ser Ser Gly
115 120 125
Leu Thr Ala Val Asn Asp Ile Ala Asn Lys Ile Lys Val Gly Gln Ile
130 135 140
Asp Ile Gly Leu Ala Leu Gly Val Glu Ser Met Thr Asn Asn Tyr Lys
145 150 155 160
Asn Val Asn Pro Leu Gly Met Ile Ser Ser Glu Glu Leu Gln Lys Asn
165 170 175
Arg Glu Ala Lys Lys Cys Leu Ile Pro Met Gly Ile Thr Asn Glu Asn
180 185 190
Val Ala Ala Asn Phe Lys Ile Ser Arg Lys Asp Gln Asp Glu Phe Ala
195 200 205
Ala Asn Ser Tyr Gln Lys Ala Tyr Lys Ala Lys Asn Glu Gly Leu Phe
210 215 220
Glu Asp Glu Ile Leu Pro Ile Lys Leu Pro Asp Gly Ser Ile Cys Gln
225 230 235 240
Ser Asp Glu Gly Pro Arg Pro Asn Val Thr Ala Glu Ser Leu Ser Ser
245 250 255
Ile Arg Pro Ala Phe Ile Lys Asp Arg Gly Thr Thr Thr Ala Gly Asn
260 265 270
Ala Ser Gln Val Ser Asp Gly Val Ala Gly Val Leu Leu Ala Arg Arg
275 280 285
Ser Val Ala Asn Gln Leu Asn Leu Pro Val Leu Gly Arg Tyr Ile Asp
290 295 300
Phe Gln Thr Val Gly Val Pro Pro Glu Ile Met Gly Val Gly Pro Ala
305 310 315 320
Tyr Ala Ile Pro Lys Val Leu Glu Ala Thr Gly Leu Gln Val Gln Asp
325 330 335
Ile Asp Ile Phe Glu Ile Asn Glu Ala Phe Ala Ala Gln Ala Leu Tyr
340 345 350
Cys Ile His Lys Leu Gly Ile Asp Leu Asn Lys Val Asn Pro Arg Gly
355 360 365
Gly Ala Ile Ala Leu Gly His Pro Leu Gly Cys Thr Gly Ala Arg Gln
370 375 380
Val Ala Thr Ile Leu Arg Glu Leu Lys Lys Asp Gln Ile Gly Val Val
385 390 395 400
Ser Met Cys Ile Gly Thr Gly Met Gly Ala Ala Ala Ile Phe Ile Lys
405 410 415
Glu
<210> 204
<211> 1254
<212> DNA
<213> Saccharomyces cerevisiae
<400> 204
atgtctcaaa gactacaaag tatcaaggat catttggtgg agagcgccat gggtaagggt 60
gaatcgaaga ggaagaactc gttgctggag aaaagacccg aagatgtagt tattgtggct 120
gctaacaggt ctgccatcgg taaaggtttt aaaggtgcct tcaaagatgt aaacacagac 180
tacttattat acaactttct caatgagttc atcgggaggt ttccggaacc tttgagggct 240
gatttgaact taatcgaaga agttgcctgt ggaaatgttc tcaatgttgg agccggtgct 300
acagaacaca gggctgcatg cttggcaagt gggattccct actcgacgcc atttgtcgct 360
ttaaacagac aatgttcttc aggtttaacg gcggtgaacg atattgccaa caagattaag 420
gttgggcaaa ttgatattgg tttggcgctg ggagtggaat caatgaccaa taactacaaa 480
aacgtcaatc ccttgggcat gatctcctct gaagagctgc aaaaaaaccg agaagcgaag 540
aaatgtctaa taccaatggg cattactaat gagaatgttg ccgctaattt caagatcagt 600
agaaaggatc aagacgagtt cgctgcgaat tcatatcaaa aagcttacaa ggcgaaaaat 660
gaggggcttt tcgaagatga aattttacct ataaaattac cagatggctc aatttgccag 720
tcggacgaag ggccacgccc taacgtcact gcggagtcgc tttcaagcat caggcctgcc 780
tttatcaaag acagaggaac cacaactgcg ggcaatgcat cccaggtctc cgatggtgtg 840
gcaggtgtct tgttagcccg caggtccgta gccaaccagt taaatctgcc tgtgctaggt 900
cgctacatcg attttcaaac agtgggggtt ccccctgaaa tcatgggtgt gggccctgca 960
tacgccatac caaaagtcct ggaagctact ggcttgcaag tccaagatat cgatattttt 1020
gaaataaatg aagcattcgc ggcccaagca ttatactgca tccataaact gggcatcgat 1080
ttgaataaag taaatccaag aggtggtgca atcgcgttag gccatccctt gggttgtact 1140
ggcgcaaggc aagtagctac catactaaga gaactgaaaa aggatcaaat cggggttgtt 1200
agtatgtgta tcggtactgg tatgggtgcc gccgccatct ttattaaaga atag 1254
<210> 205
<211> 398
<212> PRT
<213> Saccharomyces cerevisiae
<400> 205
Met Ser Gln Asn Val Tyr Ile Val Ser Thr Ala Arg Thr Pro Ile Gly
1 5 10 15
Ser Phe Gln Gly Ser Leu Ser Ser Lys Thr Ala Val Glu Leu Gly Ala
20 25 30
Val Ala Leu Lys Gly Ala Leu Ala Lys Val Pro Glu Leu Asp Ala Ser
35 40 45
Lys Asp Phe Asp Glu Ile Ile Phe Gly Asn Val Leu Ser Ala Asn Leu
50 55 60
Gly Gln Ala Pro Ala Arg Gln Val Ala Leu Ala Ala Gly Leu Ser Asn
65 70 75 80
His Ile Val Ala Ser Thr Val Asn Lys Val Cys Ala Ser Ala Met Lys
85 90 95
Ala Ile Ile Leu Gly Ala Gln Ser Ile Lys Cys Gly Asn Ala Asp Val
100 105 110
Val Val Ala Gly Gly Cys Glu Ser Met Thr Asn Ala Pro Tyr Tyr Met
115 120 125
Pro Ala Ala Arg Ala Gly Ala Lys Phe Gly Gln Thr Val Leu Val Asp
130 135 140
Gly Val Glu Arg Asp Gly Leu Asn Asp Ala Tyr Asp Gly Leu Ala Met
145 150 155 160
Gly Val His Ala Glu Lys Cys Ala Arg Asp Trp Asp Ile Thr Arg Glu
165 170 175
Gln Gln Asp Asn Phe Ala Ile Glu Ser Tyr Gln Lys Ser Gln Lys Ser
180 185 190
Gln Lys Glu Gly Lys Phe Asp Asn Glu Ile Val Pro Val Thr Ile Lys
195 200 205
Gly Phe Arg Gly Lys Pro Asp Thr Gln Val Thr Lys Asp Glu Glu Pro
210 215 220
Ala Arg Leu His Val Glu Lys Leu Arg Ser Ala Arg Thr Val Phe Gln
225 230 235 240
Lys Glu Asn Gly Thr Val Thr Ala Ala Asn Ala Ser Pro Ile Asn Asp
245 250 255
Gly Ala Ala Ala Val Ile Leu Val Ser Glu Lys Val Leu Lys Glu Lys
260 265 270
Asn Leu Lys Pro Leu Ala Ile Ile Lys Gly Trp Gly Glu Ala Ala His
275 280 285
Gln Pro Ala Asp Phe Thr Trp Ala Pro Ser Leu Ala Val Pro Lys Ala
290 295 300
Leu Lys His Ala Gly Ile Glu Asp Ile Asn Ser Val Asp Tyr Phe Glu
305 310 315 320
Phe Asn Glu Ala Phe Ser Val Val Gly Leu Val Asn Thr Lys Ile Leu
325 330 335
Lys Leu Asp Pro Ser Lys Val Asn Val Tyr Gly Gly Ala Val Ala Leu
340 345 350
Gly His Pro Leu Gly Cys Ser Gly Ala Arg Val Val Val Thr Leu Leu
355 360 365
Ser Ile Leu Gln Gln Glu Gly Gly Lys Ile Gly Val Ala Ala Ile Cys
370 375 380
Asn Gly Gly Gly Gly Ala Ser Ser Ile Val Ile Glu Lys Ile
385 390 395
<210> 206
<211> 1197
<212> DNA
<213> Saccharomyces cerevisiae
<400> 206
atgtctcaga acgtttacat tgtatcgact gccagaaccc caattggttc attccagggt 60
tctctatcct ccaagacagc agtggaattg ggtgctgttg ctttaaaagg cgccttggct 120
aaggttccag aattggatgc atccaaggat tttgacgaaa ttatttttgg taacgttctt 180
tctgccaatt tgggccaagc tccggccaga caagttgctt tggctgccgg tttgagtaat 240
catatcgttg caagcacagt taacaaggtc tgtgcatccg ctatgaaggc aatcattttg 300
ggtgctcaat ccatcaaatg tggtaatgct gatgttgtcg tagctggtgg ttgtgaatct 360
atgactaacg caccatacta catgccagca gcccgtgcgg gtgccaaatt tggccaaact 420
gttcttgttg atggtgtcga aagagatggg ttgaacgatg cgtacgatgg tctagccatg 480
ggtgtacacg cagaaaagtg tgcccgtgat tgggatatta ctagagaaca acaagacaat 540
tttgccatcg aatcctacca aaaatctcaa aaatctcaaa aggaaggtaa attcgacaat 600
gaaattgtac ctgttaccat taagggattt agaggtaagc ctgatactca agtcacgaag 660
gacgaggaac ctgctagatt acacgttgaa aaattgagat ctgcaaggac tgttttccaa 720
aaagaaaacg gtactgttac tgccgctaac gcttctccaa tcaacgatgg tgctgcagcc 780
gtcatcttgg tttccgaaaa agttttgaag gaaaagaatt tgaagccttt ggctattatc 840
aaaggttggg gtgaggccgc tcatcaacca gctgatttta catgggctcc atctcttgca 900
gttccaaagg ctttgaaaca tgctggcatc gaagacatca attctgttga ttactttgaa 960
ttcaatgaag ccttttcggt tgtcggtttg gtgaacacta agattttgaa gctagaccca 1020
tctaaggtta atgtatatgg tggtgctgtt gctctaggtc acccattggg ttgttctggt 1080
gctagagtgg ttgttacact gctatccatc ttacagcaag aaggaggtaa gatcggtgtt 1140
gccgccattt gtaatggtgg tggtggtgct tcctctattg tcattgaaaa gatatga 1197
<210> 207
<211> 255
<212> PRT
<213> Escherichia coli
<400> 207
Met Phe Asn Ser Asp Asn Leu Arg Leu Asp Gly Lys Cys Ala Ile Ile
1 5 10 15
Thr Gly Ala Gly Ala Gly Ile Gly Lys Glu Ile Ala Ile Thr Phe Ala
20 25 30
Thr Ala Gly Ala Ser Val Val Val Ser Asp Ile Asn Ala Asp Ala Ala
35 40 45
Asn His Val Val Asp Glu Ile Gln Gln Leu Gly Gly Gln Ala Phe Ala
50 55 60
Cys Arg Cys Asp Ile Thr Ser Glu Gln Glu Leu Ser Ala Leu Ala Asp
65 70 75 80
Phe Ala Ile Ser Lys Leu Gly Lys Val Asp Ile Leu Val Asn Asn Ala
85 90 95
Gly Gly Gly Gly Pro Lys Pro Phe Asp Met Pro Met Ala Asp Phe Arg
100 105 110
Arg Ala Tyr Glu Leu Asn Val Phe Ser Phe Phe His Leu Ser Gln Leu
115 120 125
Val Ala Pro Glu Met Glu Lys Asn Gly Gly Gly Val Ile Leu Thr Ile
130 135 140
Thr Ser Met Ala Ala Glu Asn Lys Asn Ile Asn Met Thr Ser Tyr Ala
145 150 155 160
Ser Ser Lys Ala Ala Ala Ser His Leu Val Arg Asn Met Ala Phe Asp
165 170 175
Leu Gly Glu Lys Asn Ile Arg Val Asn Gly Ile Ala Pro Gly Ala Ile
180 185 190
Leu Thr Asp Ala Leu Lys Ser Val Ile Thr Pro Glu Ile Glu Gln Lys
195 200 205
Met Leu Gln His Thr Pro Ile Arg Arg Leu Gly Gln Pro Gln Asp Ile
210 215 220
Ala Asn Ala Ala Leu Phe Leu Cys Ser Pro Ala Ala Ser Trp Val Ser
225 230 235 240
Gly Gln Ile Leu Thr Val Ser Gly Gly Gly Val Gln Glu Leu Asn
245 250 255
<210> 208
<211> 768
<212> DNA
<213> Escherichia coli
<400> 208
atgttcaact ctgataattt gagattggat ggtaaatgtg ctatcatcac tggtgcaggt 60
gctggtatcg gtaaagaaat cgcaatcact tttgcaacag ctggtgcatc tgttgttgtt 120
tcagatatta atgctgatgc tgcaaaccat gttgttgatg aaatccaaca attgggtggt 180
caagcttttg catgtagatg tgatatcaca tctgaacaag aattgtcagc tttggcagat 240
ttcgctatct ctaagttggg taaagttgat attttagtta ataatgctgg tggtggtggt 300
cctaaaccat ttgatatgcc aatggctgat ttcagaagag catacgaatt gaacgttttc 360
tctttctttc atttgtcaca attagttgct ccagaaatgg aaaagaatgg tggtggtgtt 420
attttgacta tcacatctat ggctgcagaa aataagaaca tcaacatgac ttcttacgct 480
tcttcaaaag ctgcagcttc acatttggtt agaaacatgg cattcgattt gggtgaaaag 540
aatatcagag ttaacggtat cgctccaggt gcaatcttga ctgatgcttt gaagtcagtt 600
attacaccag aaatcgaaca aaagatgttg caacatactc caattagaag attaggtcaa 660
ccacaagata tcgctaacgc agctttgttt ttatgttctc cagcagcttc ttgggtttca 720
ggtcaaattt tgactgtttc tggtggtggt gttcaagaat taaattaa 768
<210> 209
<211> 253
<212> PRT
<213> Luminiphilus syltensis
<400> 209
Met Asp Leu Gly Ile Lys Gly Lys Val Ala Leu Ile Thr Gly Ser Thr
1 5 10 15
Lys Gly Ile Gly Arg Gly Ile Ala Glu Ala Phe Ala Ala Glu Gly Cys
20 25 30
His Val Gly Ile Cys Ala Arg Asn Ser Asp Glu Val Asp Ala Ala Val
35 40 45
Lys Glu Leu Ser Ala Ser Gly Val Lys Val Ala Gly Gly Val Val Asp
50 55 60
Val Ala Asp Pro Ala Ser Leu Glu Thr Trp Val Ser Gln Cys Val Ala
65 70 75 80
Glu Leu Gly Gly Val Asp Phe Phe Val Pro Asn Val Ser Ala Gly Gly
85 90 95
Ala Asp Ala Ser Glu Asp Gly Trp Arg Ala Asn Phe Glu Ala Asp Leu
100 105 110
Leu Ser Thr Trp Arg Gly Val Gln Leu Thr Gln Pro His Ile Glu Lys
115 120 125
Ser Glu Cys Gly Ala Ile Val Val Ile Ser Ser Thr Ala Ala Ile Glu
130 135 140
Ala Phe Ala Gly Ala Thr Pro Tyr Gly Ala Met Lys Ala Ala Leu Leu
145 150 155 160
Asn Tyr Ala Gly Asn Leu Ala His Asp Leu Ala Pro Lys Gly Ile Arg
165 170 175
Val Asn Ser Val Ser Pro Gly Pro Ile Phe Ile Glu Gly Gly Ala Trp
180 185 190
Asp Gln Ile Lys Glu Ala Met Pro Glu Ile Tyr Glu Gly Thr Val Ala
195 200 205
Ala Ile Pro Met Gly Arg Met Gly Ser Ala Gln Glu Val Ala Asp Gln
210 215 220
Val Val Phe Leu Cys Ser Pro Arg Ala Ser Phe Thr Thr Gly Thr Asn
225 230 235 240
Val Val Leu Asp Gly Ala Phe Thr Lys Gly Leu Gln Phe
245 250
<210> 210
<211> 762
<212> DNA
<213> Luminiphilus syltensis
<400> 210
atggatttgg gtattaaagg taaagttgct ttgatcactg gttctacaaa aggtattggt 60
agaggtattg ctgaagcatt tgctgcagaa ggttgtcatg ttggtatttg tgctagaaat 120
tcagatgaag ttgatgctgc agttaaagaa ttgtctgctt caggtgttaa agttgcaggt 180
ggtgttgttg atgttgcaga tccagcttct ttggaaactt gggtttcaca atgtgttgct 240
gaattaggtg gtgttgattt ctttgttcca aatgtttctg caggtggtgc tgatgcatca 300
gaagatggtt ggagagcaaa cttcgaagct gatttgttgt ctacttggag aggtgttcaa 360
ttgacacaac cacatatcga aaaatcagaa tgtggtgcta ttgttgttat ttcttcaact 420
gctgcaattg aagcttttgc aggtgctaca ccatatggtg ctatgaaagc tgcattgtta 480
aattacgctg gtaatttggc tcatgatttg gcaccaaaag gtattagagt taattctgtt 540
tcaccaggtc caattttcat tgaaggtggt gcttgggatc aaattaaaga agcaatgcca 600
gaaatctatg aaggtacagt tgctgcaatt ccaatgggta gaatgggttc tgctcaagaa 660
gttgcagatc aagttgtttt cttgtgttct ccaagagctt cttttactac aggtactaac 720
gttgttttgg atggtgcttt tactaaaggt ttacaatttt aa 762
<210> 211
<211> 259
<212> PRT
<213> Bacteroides fragilis
<400> 211
Met Asn Arg Phe Glu Asn Lys Ile Ile Ile Ile Thr Gly Ala Ala Gly
1 5 10 15
Gly Ile Gly Ala Ser Thr Thr Arg Arg Ile Val Ser Glu Gly Gly Lys
20 25 30
Val Val Ile Ala Asp Tyr Ser Arg Glu Lys Ala Asp Gln Phe Ala Ala
35 40 45
Glu Leu Ser Asn Ser Gly Ala Asp Val Arg Pro Val Tyr Phe Ser Ala
50 55 60
Thr Glu Leu Lys Ser Cys Lys Glu Leu Ile Thr Phe Thr Met Lys Glu
65 70 75 80
Tyr Gly Gln Ile Asp Val Leu Val Asn Asn Val Gly Gly Thr Asn Pro
85 90 95
Arg Arg Asp Thr Asn Ile Glu Thr Leu Asp Met Asp Tyr Phe Asp Glu
100 105 110
Ala Phe His Leu Asn Leu Ser Cys Thr Met Tyr Leu Ser Gln Leu Val
115 120 125
Ile Pro Ile Met Ser Thr Gln Gly Gly Gly Asn Ile Val Asn Val Ala
130 135 140
Ser Ile Ser Gly Ile Thr Ala Asp Ser Asn Gly Thr Leu Tyr Gly Ala
145 150 155 160
Ser Lys Ala Gly Val Ile Asn Leu Thr Lys Tyr Ile Ala Thr Gln Thr
165 170 175
Gly Lys Lys Asn Ile Arg Cys Asn Ala Val Ala Pro Gly Leu Ile Leu
180 185 190
Thr Pro Ala Ala Leu Asn Asn Leu Asn Glu Glu Val Arg Lys Ile Phe
195 200 205
Leu Gly Gln Cys Ala Thr Pro Tyr Leu Gly Glu Pro Gln Asp Val Ala
210 215 220
Ala Thr Ile Ala Phe Leu Ala Ser Glu Asp Ala Arg Tyr Ile Thr Gly
225 230 235 240
Gln Thr Ile Val Val Asp Gly Gly Leu Thr Ile His Asn Pro Thr Ile
245 250 255
Asn Leu Val
<210> 212
<211> 780
<212> DNA
<213> Bacteroides fragilis
<400> 212
atgaacagat tcgaaaataa gatcatcatc atcactggtg ctgcaggtgg tattggtgct 60
tctactacaa gaagaattgt ttcagaaggt ggtaaagttg ttattgctga ttactctaga 120
gaaaaggcag atcaatttgc tgcagaattg tctaattcag gtgctgatgt tagaccagtt 180
tacttctctg caactgaatt gaagtcttgt aaggaattga tcacttttac aatgaaggaa 240
tacggtcaaa tcgatgtttt ggttaacaac gttggtggta caaatccaag aagagatact 300
aacatcgaaa cattggatat ggattacttt gatgaagctt tccatttgaa tttgtcttgt 360
actatgtact tgtcacaatt agttattcca atcatgtcaa cacaaggtgg tggtaacatc 420
gttaacgttg cttctatctc aggtattact gcagattcta atggtacatt gtatggtgct 480
tcaaaggcag gtgttattaa tttgactaag tacatcgcta ctcaaacagg taaaaagaat 540
atcagatgta acgctgttgc accaggtttg attttaacac cagctgcatt gaacaatttg 600
aacgaagaag ttagaaagat tttcttgggt caatgtgcta ctccatattt gggtgaacca 660
caagatgttg ctgcaacaat tgcatttttg gcatctgaag atgcaagata cattactggt 720
caaacaattg ttgttgatgg tggtttgact attcataatc caactattaa tttggtttaa 780
<210> 213
<211> 256
<212> PRT
<213> Comamonas testosteroni
<400> 213
Met Asn Glu Ile Phe Arg Gln Phe Ser Leu Glu Gly Lys Val Ala Val
1 5 10 15
Val Thr Gly Ala Gly Lys Gly Ile Gly Arg Ala Cys Ala Val Thr Leu
20 25 30
Ala Lys Ala Gly Ala Asp Val Ala Leu Phe Ala Arg Thr Glu Ala Asp
35 40 45
Leu Gln Ala Val Lys Ala Glu Ile Glu Ala Leu Gly Arg Arg Ala Ile
50 55 60
Ala Val Gln Gly Asp Val Asn Lys Glu Glu Asp Leu Asp Lys Leu Ile
65 70 75 80
Val Arg Thr Val Glu Glu Leu Gly Lys Ile Asn Val Leu Ile Asn Asn
85 90 95
Val Gly Gly Gly Gly Pro Asn Asp Pro Arg Lys Val Ala Gly Lys Ala
100 105 110
Val Gly Asp Met Leu Ala Phe Asn Val Val Pro Ala Tyr Thr Leu Ile
115 120 125
Gln Lys Ala Ala Ala Ala Met Glu Ala Ala Gly Gly Gly Ala Val Val
130 135 140
Asn Ile Ser Ser Thr Ala Ala Arg Tyr Ser Gln Lys Tyr Phe Ser Ala
145 150 155 160
Tyr Gly Ala Ala Lys Ala Ala Leu Asn Gln Met Thr Arg Cys Leu Ala
165 170 175
Gln Asp Phe Gly Pro Lys Val Arg Ile Asn Ala Ile Glu Pro Gly Thr
180 185 190
Ile Met Thr Asp Ala Leu Ala Pro Phe Leu Thr Pro Asp Arg Lys Glu
195 200 205
Arg Met Glu Lys Thr Thr Pro Met Ala Arg Met Gly Gln Pro Glu Asp
210 215 220
Ile Ala Asn Ala Ala Leu Phe Leu Ala Ser Pro Ala Ser Ser Trp Val
225 230 235 240
Thr Gly Lys Val Leu Gly Val Asp Gly Gly Val Glu Ala Pro Asn Phe
245 250 255
<210> 214
<211> 771
<212> DNA
<213> Comamonas testosteroni
<400> 214
atgaacgaaa tttttagaca attttctttg gagggtaaag ttgcagttgt tactggtgct 60
ggtaaaggta ttggtagagc ttgtgcagtt acattagcta aagcaggtgc tgatgttgca 120
ttgtttgcta gaacagaagc agatttgcaa gcagttaaag ctgaaattga agctttgggt 180
agaagagcaa ttgctgttca aggtgacgtt aataaggaag aagatttgga taagttgatt 240
gttagaactg ttgaagaatt gggtaaaatt aatgttttga ttaataatgt tggtggtggt 300
ggtccaaatg atccaagaaa agttgctggt aaagctgttg gtgacatgtt ggcttttaat 360
gttgttccag cttacacttt gattcaaaaa gctgcagctg caatggaagc tgctggtggt 420
ggtgctgttg ttaacatctc ttcaacagct gcaagatact ctcaaaagta cttctcagct 480
tatggtgctg caaaagctgc attgaatcaa atgactagat gtttagcaca agatttcggt 540
ccaaaagtta gaattaatgc tatcgaacca ggtactatca tgacagatgc attggctcca 600
tttttaacac cagatagaaa ggaaagaatg gaaaagacta caccaatggc aagaatgggt 660
caaccagaag atattgctaa tgctgcattg tttttagcat caccagcttc ttcatgggtt 720
actggtaaag ttttaggtgt tgatggtggt gttgaagctc caaattttta a 771
<210> 215
<211> 272
<212> PRT
<213> Pseudomonas syringae
<400> 215
Met Pro Ile Ala Leu Ile Thr Gly Cys Ser Ser Gly Ile Gly Arg Ala
1 5 10 15
Leu Ala Asp Ala Phe Lys Ala Thr Gly Tyr Glu Val Trp Ala Thr Ala
20 25 30
Arg Lys Ala Asp Asp Val Ala Ala Leu Ser Ala Ala Gly Phe Ile Ala
35 40 45
Val Gln Leu Asp Val Asn Asp Ser Met Ala Leu Glu Gln Leu Ala Ala
50 55 60
Gly Leu Glu His Ser Gly Leu Asp Val Leu Ile Asn Asn Ala Gly Tyr
65 70 75 80
Gly Ala Met Gly Pro Leu Leu Asp Gly Gly Val Gln Ala Leu Gln Arg
85 90 95
Gln Phe Glu Thr Asn Val Phe Ser Val Ile Gly Val Thr Arg Ala Leu
100 105 110
Phe Pro Ala Leu Arg Arg Asn Lys Gly Leu Val Val Asn Ile Gly Ser
115 120 125
Val Ser Gly Val Leu Val Thr Pro Phe Ala Gly Ala Tyr Cys Ala Ser
130 135 140
Lys Ala Ala Val His Ala Leu Ser Asp Ala Leu Arg Leu Glu Leu Ala
145 150 155 160
Pro Phe Gly Val Gln Val Met Glu Val Gln Pro Gly Ala Ile Ala Ser
165 170 175
Ser Phe Ala Lys Asn Ala Ser His Glu Ala Glu Gln Leu Ile Ser Glu
180 185 190
Gln Ser Pro Trp Trp Pro Ile Arg Glu Gly Ile Arg Ala Arg Ala Arg
195 200 205
Ala Ser Leu Asp Asn Pro Thr Pro Val Thr Glu Phe Ala Arg Asp Leu
210 215 220
Leu Lys Ala Val Gln His Thr Arg Pro Pro Arg Leu Leu Arg Leu Gly
225 230 235 240
Asn Gly Ser Arg Leu Leu Pro Leu Met Ala Trp Leu Leu Pro Lys Gly
245 250 255
Leu Leu Asp Met Ala Leu Arg Lys Arg Phe Gly Leu Asn Ala Asp Leu
260 265 270
<210> 216
<211> 819
<212> DNA
<213> Pseudomonas syringae
<400> 216
atgccaattg ctttaattac tggttgttct tcaggtattg gtagagcttt ggcagatgct 60
tttaaagcta ctggttatga agtttgggca acagctagaa aagcagatga tgttgctgca 120
ttatctgctg caggttttat tgctgttcaa ttggatgtta acgattctat ggcattggaa 180
caattggctg caggtttaga acattcaggt ttggatgttt tgattaataa cgctggttac 240
ggtgcaatgg gtccattgtt agatggtggt gttcaagctt tgcaaagaca attcgaaact 300
aacgttttct ctgttattgg tgttacaaga gctttatttc cagcattgag aagaaataag 360
ggtttagttg ttaacatcgg ttctgtttca ggtgttttgg ttactccatt tgcaggtgct 420
tattgtgctt ctaaagctgc agttcatgct ttatcagatg cattgagatt agaattggca 480
ccatttggtg ttcaagttat ggaagttcaa ccaggtgcaa ttgcttcttc atttgctaaa 540
aatgcatctc atgaagctga acaattgatc tctgaacaat caccatggtg gccaattaga 600
gaaggtatta gagcaagagc tagagcatca ttggataatc caactccagt tacagaattc 660
gctagagatt tgttgaaagc agttcaacat acaagaccac caagattgtt gagattgggt 720
aacggttcta gattgttgcc attgatggct tggttgttac caaaaggttt gttggatatg 780
gctttgagaa agagattcgg tttgaatgca gatttgtaa 819
<210> 217
<211> 272
<212> PRT
<213> Pseudomonas caricapapayae
<400> 217
Met Pro Ile Ala Leu Ile Thr Gly Cys Ser Ser Gly Ile Gly Arg Ala
1 5 10 15
Leu Ala Asp Ala Phe Lys Ala Thr Gly Tyr Glu Val Trp Ala Thr Ala
20 25 30
Arg Lys Ala Asp Asp Val Ala Ala Leu Ser Ala Ala Gly Phe Ile Ala
35 40 45
Val Gln Leu Asp Val Asn Asp Ser Leu Thr Val Glu Gln Leu Ala Ala
50 55 60
Gly Leu Glu His Ser Gly Leu Asp Val Leu Ile Asn Asn Ala Gly Tyr
65 70 75 80
Gly Ala Met Gly Pro Leu Leu Asp Gly Gly Val Asp Ala Leu Gln Arg
85 90 95
Gln Phe Glu Thr Asn Val Phe Ser Val Val Gly Val Thr Arg Ala Leu
100 105 110
Phe Pro Ala Leu Arg Arg Asn Lys Gly Leu Val Val Asn Ile Gly Ser
115 120 125
Val Ser Gly Val Leu Val Thr Pro Phe Ala Gly Ala Tyr Cys Ala Ser
130 135 140
Lys Ala Ala Val His Ala Leu Ser Asp Ala Leu Arg Leu Glu Leu Ala
145 150 155 160
Pro Phe Gly Val Gln Val Met Glu Val Gln Pro Gly Ala Ile Ala Ser
165 170 175
Ser Phe Ala Lys Asn Ala Ser Gln Gln Ala Glu Gln Leu Ile Ser Glu
180 185 190
Gln Ser Pro Trp Trp Pro Ile Arg Glu Gly Ile Arg Ala Arg Ala Arg
195 200 205
Ala Ser Leu Asp Asn Pro Thr Pro Ala Thr Glu Phe Ala Arg Asp Leu
210 215 220
Leu Lys Ala Ala Gln Gln Ala His Pro Pro Arg Leu Leu Arg Leu Gly
225 230 235 240
Asn Gly Ser Arg Leu Leu Pro Leu Ile Ala Trp Leu Leu Pro Lys Ala
245 250 255
Leu Leu Glu Asn Val Leu Arg Lys Arg Phe Gly Leu Asn Ala Asp Leu
260 265 270
<210> 218
<211> 819
<212> DNA
<213> Pseudomonas caricapapayae
<400> 218
atgccaattg ctttaattac tggttgttct tcaggtattg gtagagcttt ggcagatgct 60
tttaaagcta ctggttatga agtttgggca acagctagaa aagcagatga tgttgctgca 120
ttatctgctg caggttttat tgctgttcaa ttggatgtta acgattcttt gacagttgaa 180
caattagctg caggtttgga acattcaggt ttggatgttt tgattaataa cgctggttac 240
ggtgcaatgg gtccattgtt agatggtggt gttgatgctt tgcaaagaca attcgaaact 300
aacgttttct ctgttgttgg tgttacaaga gctttatttc cagcattgag aagaaataag 360
ggtttagttg ttaacatcgg ttctgtttca ggtgttttgg ttactccatt tgcaggtgct 420
tattgtgctt ctaaagctgc agttcatgct ttatcagatg cattgagatt agaattggca 480
ccatttggtg ttcaagttat ggaagttcaa ccaggtgcaa ttgcttcttc atttgctaaa 540
aatgcatctc aacaagctga acaattgatc tctgaacaat caccatggtg gccaattaga 600
gaaggtatta gagcaagagc tagagcatca ttggataatc caactccagc tacagaattt 660
gcaagagatt tgttaaaagc tgcacaacaa gctcatccac caagattgtt gagattgggt 720
aacggttcta gattgttgcc attgattgct tggttgttgc caaaggcatt gttggaaaac 780
gttttgagaa aaagatttgg tttaaatgca gatttgtaa 819
<210> 219
<211> 255
<212> PRT
<213> Drosophila persimilis
<400> 219
Met Ile Lys Asn Ala Val Thr Leu Val Thr Gly Gly Ala Ser Gly Leu
1 5 10 15
Gly Arg Ala Thr Ala Glu Arg Leu Ala Arg Gln Gly Ala Ser Val Val
20 25 30
Leu Ala Asp Leu Pro Ser Ser Lys Gly Asn Glu Val Ala Lys Glu Leu
35 40 45
Gly Asp Lys Val Val Phe Val Pro Val Asp Val Thr Ser Glu Lys Asp
50 55 60
Val Ser Ala Ala Leu Gln Ile Ala Lys Asp Lys Phe Gly Arg Leu Asp
65 70 75 80
Leu Thr Val Asn Cys Ala Gly Thr Ala Thr Ala Val Lys Thr Phe Asn
85 90 95
Phe Asn Lys Asn Val Ala His Arg Leu Glu Asp Phe Gln Arg Val Ile
100 105 110
Asn Ile Asn Thr Val Gly Thr Phe Asn Val Ile Arg Leu Ser Ala Gly
115 120 125
Leu Met Gly Ala Asn Glu Pro Asn Gln Asp Gly Gln Arg Gly Val Ile
130 135 140
Val Asn Thr Ala Ser Val Ala Ala Phe Asp Gly Gln Ile Gly Gln Ala
145 150 155 160
Ala Tyr Ala Ala Ser Lys Ala Ala Val Val Gly Met Thr Leu Pro Ile
165 170 175
Ala Arg Asp Leu Ser Thr Gln Gly Ile Arg Ile Cys Thr Ile Ala Pro
180 185 190
Gly Leu Phe Asn Thr Pro Met Leu Ala Ala Leu Pro Glu Lys Val Arg
195 200 205
Thr Phe Leu Ala Lys Ser Ile Pro Phe Pro Gln Arg Leu Gly Glu Pro
210 215 220
Ser Glu Tyr Ala His Leu Val Gln Ser Ile Phe Glu Asn Pro Leu Leu
225 230 235 240
Asn Gly Glu Val Ile Arg Ile Asp Gly Ala Leu Arg Met Met Pro
245 250 255
<210> 220
<211> 768
<212> DNA
<213> Drosophila persimilis
<400> 220
atgattaaga atgctgttac tttggttaca ggtggtgcat ctggtttagg tagagctact 60
gcagaaagat tggctagaca aggtgcatca gttgttttgg ctgatttgcc atcttcaaag 120
ggtaacgaag ttgcaaagga attgggtgac aaggttgttt tcgttccagt tgatgttaca 180
tctgaaaaag atgtttcagc tgcattgcaa atcgctaagg ataagttcgg tagattggat 240
ttgactgtta attgtgcagg tactgctaca gcagttaaga cttttaattt caataagaac 300
gttgctcata gattggaaga tttccaaaga gttattaata tcaacactgt tggtactttt 360
aatgttatca gattgtcagc tggtttaatg ggtgcaaatg aaccaaatca agatggtcaa 420
agaggtgtta ttgttaatac tgcttctgtt gctgcatttg atggtcaaat tggtcaagct 480
gcatatgctg catcaaaagc tgcagttgtt ggtatgacat tgccaattgc tagagatttg 540
tctactcaag gtattagaat ctgtacaatc gcaccaggtt tgtttaatac tccaatgttg 600
gctgcattgc cagaaaaagt tagaacattt ttggctaagt ctatcccatt tccacaaaga 660
ttaggtgaac catctgaata cgcacatttg gttcaatcaa tcttcgaaaa cccattgttg 720
aacggtgaag ttattagaat cgatggtgct ttgagaatga tgccataa 768
<210> 221
<211> 261
<212> PRT
<213> Clostridium sardiniensis
<400> 221
Met Asn Phe Arg Glu Lys Tyr Gly Gln Trp Gly Ile Val Leu Gly Ala
1 5 10 15
Thr Glu Gly Ile Gly Lys Ala Ser Ala Phe Glu Leu Ala Lys Arg Gly
20 25 30
Met Asp Val Ile Leu Val Gly Arg Arg Lys Glu Ala Leu Glu Glu Leu
35 40 45
Ala Lys Ala Ile His Glu Glu Thr Gly Lys Glu Ile Arg Val Leu Pro
50 55 60
Gln Asp Leu Ser Glu Tyr Asp Ala Ala Glu Arg Leu Ile Glu Ala Thr
65 70 75 80
Lys Asp Leu Asp Met Gly Val Ile Glu Tyr Val Ala Cys Leu His Ala
85 90 95
Met Gly Gln Tyr Asn Lys Val Asp Tyr Ala Lys Tyr Glu Gln Met Tyr
100 105 110
Arg Val Asn Ile Arg Thr Phe Ser Lys Leu Leu His His Tyr Ile Gly
115 120 125
Glu Phe Lys Glu Arg Asp Arg Gly Ala Phe Ile Thr Ile Gly Ser Leu
130 135 140
Ser Gly Trp Thr Ser Leu Pro Phe Cys Ala Glu Tyr Ala Ala Glu Lys
145 150 155 160
Ala Tyr Met Met Thr Val Thr Glu Gly Val Ala Tyr Glu Cys Ala Asn
165 170 175
Thr Asn Val Asp Val Met Leu Leu Ser Ala Gly Ser Thr Ile Thr Pro
180 185 190
Thr Trp Leu Lys Asn Lys Pro Ser Asp Pro Lys Ala Val Ala Ala Ala
195 200 205
Met Tyr Pro Glu Asp Val Ile Lys Asp Gly Phe Glu Gln Leu Gly Lys
210 215 220
Lys Phe Thr Tyr Leu Ala Gly Glu Leu Asn Arg Glu Lys Met Lys Glu
225 230 235 240
Asn Asn Ala Met Asp Arg Asn Asp Leu Ile Ala Lys Leu Gly Lys Met
245 250 255
Phe Asp His Met Ala
260
<210> 222
<211> 786
<212> DNA
<213> Clostridium sardiniensis
<400> 222
atgaacttca gagaaaagta cggtcaatgg ggtattgttt tgggtgcaac agaaggtatt 60
ggtaaagcat ctgcttttga attggctaaa cgtggtatgg atgttatttt agttggtaga 120
agaaaggaag ctttggaaga attggcaaag gctatccatg aagaaactgg taaagaaatc 180
agagttttgc cacaagattt gtcagaatac gatgctgcag aaagattgat cgaagcaact 240
aaggatttgg atatgggtgt tattgaatac gttgcatgtt tgcatgctat gggtcaatac 300
aataaggttg attacgctaa gtacgaacaa atgtacagag ttaacatcag aactttttct 360
aaattgttgc atcattacat cggtgaattc aaagaaagag atagaggtgc ttttattaca 420
attggttctt tgtcaggttg gacttcatta ccattttgtg cagaatatgc tgcagaaaaa 480
gcttacatga tgactgttac agaaggtgtt gcatacgaat gtgctaacac aaacgttgat 540
gttatgttgt tgtctgcagg ttcaactatc acaccaactt ggttgaaaaa taagccatct 600
gatccaaaag ctgttgctgc agctatgtac ccagaagatg ttattaaaga tggtttcgaa 660
caattgggta aaaagtttac ttacttggct ggtgaattga acagagaaaa gatgaaggaa 720
aacaacgcaa tggatagaaa cgatttgatc gctaagttag gtaaaatgtt tgatcatatg 780
gcttaa 786
<210> 223
<211> 319
<212> PRT
<213> Homo sapiens
<400> 223
Met Ser Ser Pro Gln Ala Pro Glu Asp Gly Gln Gly Cys Gly Asp Arg
1 5 10 15
Gly Asp Pro Pro Gly Asp Leu Arg Ser Val Leu Val Thr Thr Val Leu
20 25 30
Asn Leu Glu Pro Leu Asp Glu Asp Leu Phe Arg Gly Arg His Tyr Trp
35 40 45
Val Pro Ala Lys Arg Leu Phe Gly Gly Gln Ile Val Gly Gln Ala Leu
50 55 60
Val Ala Ala Ala Lys Ser Val Ser Glu Asp Val His Val His Ser Leu
65 70 75 80
His Cys Tyr Phe Val Arg Ala Gly Asp Pro Lys Leu Pro Val Leu Tyr
85 90 95
Gln Val Glu Arg Thr Arg Thr Gly Ser Ser Phe Ser Val Arg Ser Val
100 105 110
Lys Ala Val Gln His Gly Lys Pro Ile Phe Ile Cys Gln Ala Ser Phe
115 120 125
Gln Gln Ala Gln Pro Ser Pro Met Gln His Gln Phe Ser Met Pro Thr
130 135 140
Val Pro Pro Pro Glu Glu Leu Leu Asp Cys Glu Thr Leu Ile Asp Gln
145 150 155 160
Tyr Leu Arg Asp Pro Asn Leu Gln Lys Arg Tyr Pro Leu Ala Leu Asn
165 170 175
Arg Ile Ala Ala Gln Glu Val Pro Ile Glu Ile Lys Pro Val Asn Pro
180 185 190
Ser Pro Leu Ser Gln Leu Gln Arg Met Glu Pro Lys Gln Met Phe Trp
195 200 205
Val Arg Ala Arg Gly Tyr Ile Gly Glu Gly Asp Met Lys Met His Cys
210 215 220
Cys Val Ala Ala Tyr Ile Ser Asp Tyr Ala Phe Leu Gly Thr Ala Leu
225 230 235 240
Leu Pro His Gln Trp Gln His Lys Val His Phe Met Val Ser Leu Asp
245 250 255
His Ser Met Trp Phe His Ala Pro Phe Arg Ala Asp His Trp Met Leu
260 265 270
Tyr Glu Cys Glu Ser Pro Trp Ala Gly Gly Ser Arg Gly Leu Val His
275 280 285
Gly Arg Leu Trp Arg Gln Asp Gly Val Leu Ala Val Thr Cys Ala Gln
290 295 300
Glu Gly Val Ile Arg Val Lys Pro Gln Val Ser Glu Ser Lys Leu
305 310 315
<210> 224
<211> 960
<212> DNA
<213> Homo sapiens
<400> 224
atgtcttcac cacaagctcc agaagatggt caaggttgtg gtgacagagg tgacccacca 60
ggtgacttga gatcagtttt agttactaca gttttgaatt tggaaccatt ggatgaagat 120
ttgtttagag gtagacatta ctgggttcca gcaaaaagat tatttggtgg tcaaattgtt 180
ggtcaagctt tggttgctgc agctaaatct gtttcagaag atgttcatgt tcattctttg 240
cattgttact tcgttagagc aggtgaccca aaattgccag ttttatacca agttgaaaga 300
actagaacag gttcttcatt ttctgttaga tcagttaaag ctgttcaaca tggtaaacca 360
attttcattt gtcaagcatc tttccaacaa gctcaaccat caccaatgca acatcaattt 420
tctatgccaa ctgttccacc accagaagaa ttgttggatt gtgaaacatt gatcgatcaa 480
tatttgagag atccaaattt gcaaaagaga tacccattgg cattaaatag aattgcagct 540
caagaagttc caatcgaaat taaaccagtt aacccatctc cattgtcaca attgcaaaga 600
atggaaccaa agcaaatgtt ttgggttaga gctagaggtt atattggtga aggtgacatg 660
aaaatgcatt gttgtgttgc agcttatatt tctgattacg catttttggg tactgctttg 720
ttaccacatc aatggcaaca taaggttcat ttcatggttt ctttggatca ttcaatgtgg 780
tttcatgcac cttttagagc tgatcattgg atgttgtacg aatgtgaatc tccatgggct 840
ggtggttcaa gaggtttagt tcatggtaga ttgtggagac aagatggtgt tttagcagtt 900
acatgtgctc aagaaggtgt tattagagtt aagccacaag tttctgaatc aaagttgtaa 960
<210> 225
<211> 320
<212> PRT
<213> Mus musculus
<400> 225
Met Ser Ala Pro Glu Gly Leu Gly Asp Ala His Gly Asp Ala Asp Arg
1 5 10 15
Gly Asp Leu Ser Gly Asp Leu Arg Ser Val Leu Val Thr Ser Val Leu
20 25 30
Asn Leu Glu Pro Leu Asp Glu Asp Leu Tyr Arg Gly Arg His Tyr Trp
35 40 45
Val Pro Thr Ser Gln Arg Leu Phe Gly Gly Gln Ile Met Gly Gln Ala
50 55 60
Leu Val Ala Ala Ala Lys Ser Val Ser Glu Asp Val His Val His Ser
65 70 75 80
Leu His Cys Tyr Phe Val Arg Ala Gly Asp Pro Lys Val Pro Val Leu
85 90 95
Tyr His Val Glu Arg Ile Arg Thr Gly Ala Ser Phe Ser Val Arg Ala
100 105 110
Val Lys Ala Val Gln His Gly Lys Ala Ile Phe Ile Cys Gln Ala Ser
115 120 125
Phe Gln Gln Met Gln Pro Ser Pro Leu Gln His Gln Phe Ser Met Pro
130 135 140
Ser Val Pro Pro Pro Glu Asp Leu Leu Asp His Glu Ala Leu Ile Asp
145 150 155 160
Gln Tyr Leu Arg Asp Pro Asn Leu His Lys Lys Tyr Arg Val Gly Leu
165 170 175
Asn Arg Val Ala Ala Gln Glu Val Pro Ile Glu Ile Lys Val Val Asn
180 185 190
Pro Pro Thr Leu Thr Gln Leu Gln Ala Leu Glu Pro Lys Gln Met Phe
195 200 205
Trp Val Arg Ala Arg Gly Tyr Ile Gly Glu Gly Asp Ile Lys Met His
210 215 220
Cys Cys Val Ala Ala Tyr Ile Ser Asp Tyr Ala Phe Leu Gly Thr Ala
225 230 235 240
Leu Leu Pro His Gln Ser Lys Tyr Lys Val Asn Phe Met Ala Ser Leu
245 250 255
Asp His Ser Met Trp Phe His Ala Pro Phe Arg Ala Asp His Trp Met
260 265 270
Leu Tyr Glu Cys Glu Ser Pro Trp Ala Gly Gly Ser Arg Gly Leu Val
275 280 285
His Gly Arg Leu Trp Arg Arg Asp Gly Val Leu Ala Val Thr Cys Ala
290 295 300
Gln Glu Gly Val Ile Arg Leu Lys Pro Gln Val Ser Glu Ser Lys Leu
305 310 315 320
<210> 226
<211> 963
<212> DNA
<213> Mus musculus
<400> 226
atgtctgcac cagaaggttt aggtgacgca catggtgacg ctgatagagg tgacttgtct 60
ggtgacttga gatcagtttt agttacttct gttttgaatt tggaaccatt ggatgaagat 120
ttgtatagag gtagacatta ctgggttcca acatcacaaa gattgttcgg tggtcaaatc 180
atgggtcaag ctttggttgc tgcagctaaa tctgtttcag aagatgttca tgttcattct 240
ttgcattgtt acttcgttag agcaggtgac ccaaaagttc cagttttgta ccatgttgaa 300
agaattagaa ctggtgcttc tttttcagtt agagcagtta aagctgttca acatggtaaa 360
gcaattttca tttgtcaagc ttcattccaa caaatgcaac catctccatt acaacatcaa 420
ttttctatgc catcagttcc accaccagaa gatttgttgg atcatgaagc tttgatcgat 480
caatatttga gagatccaaa tttgcataag aaatacagag ttggtttgaa tagagttgca 540
gctcaagaag ttccaatcga aattaaagtt gttaatccac caactttgac acaattacaa 600
gcattggaac caaaacaaat gttttgggtt agagctagag gttatattgg tgaaggtgac 660
attaaaatgc attgttgtgt tgcagcttac atctcagatt acgcattttt aggtactgct 720
ttgttaccac atcaatctaa gtacaaggtt aacttcatgg catctttaga tcattcaatg 780
tggtttcatg caccttttag agctgatcat tggatgttgt acgaatgtga atcaccatgg 840
gctggtggtt ctagaggttt agttcatggt agattgtgga gaagagatgg tgttttggca 900
gttacatgtg ctcaagaagg tgttattaga ttgaagccac aagtttctga atcaaaattg 960
taa 963
<210> 227
<211> 320
<212> PRT
<213> Rattus norvegicus
<400> 227
Met Ser Lys Pro Glu Asp Leu Gly Asp Ala Asn Gly Asp Ala Asp Arg
1 5 10 15
Gly Asp Leu Ser Gly Asp Leu Arg Ser Val Leu Val Thr Ser Val Leu
20 25 30
Asn Leu Glu Pro Leu Asp Glu Asp Leu Tyr Arg Gly Arg His Tyr Trp
35 40 45
Val Pro Thr Ser Gln Arg Leu Phe Gly Gly Gln Ile Val Gly Gln Ala
50 55 60
Leu Val Ala Ala Ala Lys Ser Val Ser Glu Asp Val His Val His Ser
65 70 75 80
Leu His Cys Tyr Phe Val Arg Ala Gly Asp Pro Lys Val Pro Val Leu
85 90 95
Tyr His Val Glu Arg Thr Arg Thr Gly Ala Ser Phe Ser Val Arg Ala
100 105 110
Val Lys Ala Val Gln His Gly Lys Ala Ile Phe Ile Cys Gln Ala Ser
115 120 125
Phe Gln Gln Met Gln Pro Ser Pro Leu Gln His Gln Phe Ser Met Pro
130 135 140
Thr Val Pro Pro Pro Glu Glu Leu Leu Asp His Glu Ala Leu Ile Asp
145 150 155 160
Gln Tyr Leu Arg Asp Pro Asn Leu His Glu Lys Tyr Arg Val Gly Leu
165 170 175
Asn Arg Ile Ala Ala Arg Glu Val Pro Ile Glu Ile Lys Leu Val Asn
180 185 190
Pro Pro Ala Leu Asn Gln Leu Gln Thr Leu Glu Pro Lys Gln Met Phe
195 200 205
Trp Val Arg Ala Arg Gly Tyr Ile Gly Glu Gly Asp Ile Lys Met His
210 215 220
Cys Cys Val Ala Ala Tyr Ile Ser Asp Tyr Ala Phe Leu Gly Thr Ala
225 230 235 240
Leu Leu Pro His Gln Ser Lys Tyr Lys Val Asn Phe Met Val Ser Leu
245 250 255
Asp His Ser Met Trp Phe His Ala Pro Phe Arg Ala Asp His Trp Met
260 265 270
Leu Tyr Glu Cys Glu Ser Pro Trp Ala Gly Gly Ser Arg Gly Leu Val
275 280 285
His Gly Arg Leu Trp Arg Arg Asp Gly Val Leu Ala Val Thr Cys Ala
290 295 300
Gln Glu Gly Val Ile Arg Ser Lys Pro Arg Val Ser Glu Ser Lys Leu
305 310 315 320
<210> 228
<211> 963
<212> DNA
<213> Rattus norvegicus
<400> 228
atgtctaaac cagaagattt gggtgacgca aatggtgacg ctgatagagg tgacttgtct 60
ggtgacttga gatcagtttt agttacttct gttttgaatt tggaaccatt ggatgaagat 120
ttgtatagag gtagacatta ctgggttcca acatctcaaa gattatttgg tggtcaaatt 180
gttggtcaag cattggttgc tgcagctaaa tctgtttcag aagatgttca tgttcattca 240
ttgcattgtt acttcgttag agcaggtgac ccaaaagttc cagttttgta ccatgttgaa 300
agaactagaa caggtgcttc tttttcagtt agagcagtta aagctgttca acatggtaaa 360
gcaattttca tttgtcaagc ttcattccaa caaatgcaac catctccatt acaacatcaa 420
ttttcaatgc caactgttcc accaccagaa gaattgttgg atcatgaagc tttgatcgat 480
caatatttga gagatccaaa tttgcatgaa aagtacagag ttggtttgaa cagaattgca 540
gctagagaag ttccaatcga aattaaattg gttaatccac cagcattgaa ccaattgcaa 600
actttggaac caaagcaaat gttttgggtt agagctagag gttatattgg tgaaggtgac 660
attaaaatgc attgttgtgt tgcagcttat atttctgatt acgcattttt gggtacagct 720
ttgttaccac atcaatcaaa gtacaaggtt aacttcatgg tttctttaga tcattcaatg 780
tggtttcatg caccttttag agctgatcat tggatgttgt acgaatgtga atcaccatgg 840
gctggtggtt ctagaggttt agttcatggt agattgtgga gaagagatgg tgttttagca 900
gttacatgtg ctcaagaagg tgttattaga tctaagccaa gagtttctga atcaaagttg 960
taa 963
<210> 229
<211> 290
<212> PRT
<213> Streptomyces sp.
<400> 229
Met Thr Asn Pro Ala Glu Arg Leu Val Asp Leu Leu Asp Leu Glu Arg
1 5 10 15
Ile Glu Val Asp Ile Phe Arg Gly Arg Ser Pro Glu Glu Ser Leu Gln
20 25 30
Arg Val Phe Gly Gly Gln Val Ala Gly Gln Ala Leu Val Ala Ala Gly
35 40 45
Arg Thr Thr Asp Gly Asp Arg Pro Val His Ser Leu His Ala Tyr Phe
50 55 60
Leu Arg Pro Gly Arg Pro Gly Val Pro Ile Val Tyr Gln Val Glu Arg
65 70 75 80
Val Arg Asp Gly Arg Ser Phe Thr Thr Arg Arg Val Thr Ala Val Gln
85 90 95
Gln Gly Arg Thr Ile Phe Asn Leu Thr Ala Ser Phe His Arg Pro Glu
100 105 110
Glu Ala Gly Phe Glu His Gln Leu Pro Pro Ala Arg Ile Val Pro Asp
115 120 125
Pro Glu Glu Leu Pro Thr Val Ala Glu Glu Val Arg Glu His Leu Gly
130 135 140
Val Leu Pro Glu Ala Leu Glu Arg Met Ala Arg Arg Gln Pro Phe Asp
145 150 155 160
Ile Arg Tyr Val Asp Arg Leu Arg Trp Thr Lys Asp Glu Val Arg Asp
165 170 175
Ala Asp Pro Arg Ser Ala Val Trp Met Arg Ala Val Gly Pro Leu Gly
180 185 190
Asp Asp Pro Leu Val His Thr Cys Ala Leu Thr Tyr Ala Ser Asp Met
195 200 205
Thr Leu Leu Asp Ala Val Arg Ile Pro Val Glu Pro Leu Trp Gly Pro
210 215 220
Arg Gly Phe Asp Met Ala Ser Leu Asp His Ala Met Trp Phe His Arg
225 230 235 240
Pro Phe Arg Ala Asp Glu Trp Phe Leu Tyr Asp Gln Glu Ser Pro Ile
245 250 255
Ala Thr Gly Gly Arg Gly Leu Ala Arg Gly Arg Ile Tyr Asp Arg Ser
260 265 270
Gly Gln Leu Leu Val Ser Val Val Gln Glu Gly Leu Phe Arg Arg Leu
275 280 285
Glu Gly
290
<210> 230
<211> 873
<212> DNA
<213> Streptomyces sp.
<400> 230
atgactaatc cagctgaaag attggttgat ttgttggatt tggaaagaat cgaagttgat 60
atttttagag gtagatctcc agaagaatca ttgcaaagag tttttggtgg tcaagttgct 120
ggtcaagcat tagttgctgc aggtagaact acagatggtg acagaccagt tcattctttg 180
catgcatact ttttgagacc aggtagacca ggtgttccaa ttgtctacca agttgaaaga 240
gttagagatg gtagatcttt tactacaaga agagttacag ctgttcaaca aggtagaact 300
atttttaatt tgacagcatc atttcataga ccagaagaag ctggttttga acatcaattg 360
ccaccagcaa gaattgttcc agatccagaa gaattaccaa ctgttgctga agaagttaga 420
gaacatttgg gtgttttacc agaagctttg gaaagaatgg caagaagaca accattcgat 480
atcagatacg ttgatagatt gagatggaca aaggatgaag ttagagatgc tgatccaaga 540
tctgcagttt ggatgagagc tgttggtcca ttgggtgacg atccattagt tcatacttgt 600
gctttaacat acgcatcaga tatgactttg ttagatgcag ttagaattcc agttgaacca 660
ttgtggggtc caagaggttt tgatatggca tctttagatc atgctatgtg gtttcataga 720
ccttttagag ctgatgaatg gtttttgtat gatcaagaat caccaattgc aacaggtggt 780
agaggtttag ctagaggtag aatctatgat agatctggtc aattgttagt ttcagttgtt 840
caagaaggtt tgtttagaag attagaaggt taa 873
<210> 231
<211> 418
<212> PRT
<213> Homo sapiens
<400> 231
Met Ile Gln Leu Thr Ala Thr Pro Val Ser Ala Leu Val Asp Glu Pro
1 5 10 15
Val His Ile Arg Ala Thr Gly Leu Ile Pro Phe Gln Met Val Ser Phe
20 25 30
Gln Ala Ser Leu Glu Asp Glu Asn Gly Asp Met Phe Tyr Ser Gln Ala
35 40 45
His Tyr Arg Ala Asn Glu Phe Gly Glu Val Asp Leu Asn His Ala Ser
50 55 60
Ser Leu Gly Gly Asp Tyr Met Gly Val His Pro Met Gly Leu Phe Trp
65 70 75 80
Ser Leu Lys Pro Glu Lys Leu Leu Thr Arg Leu Leu Lys Arg Asp Val
85 90 95
Met Asn Arg Pro Phe Gln Val Gln Val Lys Leu Tyr Asp Leu Glu Leu
100 105 110
Ile Val Asn Asn Lys Val Ala Ser Ala Pro Lys Ala Ser Leu Thr Leu
115 120 125
Glu Arg Trp Tyr Val Ala Pro Gly Val Thr Arg Ile Lys Val Arg Glu
130 135 140
Gly Arg Leu Arg Gly Ala Leu Phe Leu Pro Pro Gly Glu Gly Leu Phe
145 150 155 160
Pro Gly Val Ile Asp Leu Phe Gly Gly Leu Gly Gly Leu Leu Glu Phe
165 170 175
Arg Ala Ser Leu Leu Ala Ser Arg Gly Phe Ala Ser Leu Ala Leu Ala
180 185 190
Tyr His Asn Tyr Glu Asp Leu Pro Arg Lys Pro Glu Val Thr Asp Leu
195 200 205
Glu Tyr Phe Glu Glu Ala Ala Asn Phe Leu Leu Arg His Pro Lys Val
210 215 220
Phe Gly Ser Gly Val Gly Val Val Ser Val Cys Gln Gly Val Gln Ile
225 230 235 240
Gly Leu Ser Met Ala Ile Tyr Leu Lys Gln Val Thr Ala Thr Val Leu
245 250 255
Ile Asn Gly Thr Asn Phe Pro Phe Gly Ile Pro Gln Val Tyr His Gly
260 265 270
Gln Ile His Gln Pro Leu Pro His Ser Ala Gln Leu Ile Ser Thr Asn
275 280 285
Ala Leu Gly Leu Leu Glu Leu Tyr Arg Thr Phe Glu Thr Thr Gln Val
290 295 300
Gly Ala Ser Gln Tyr Leu Phe Pro Ile Glu Glu Ala Gln Gly Gln Phe
305 310 315 320
Leu Phe Ile Val Gly Glu Gly Asp Lys Thr Ile Asn Ser Lys Ala His
325 330 335
Ala Glu Gln Ala Ile Gly Gln Leu Lys Arg His Gly Lys Asn Asn Trp
340 345 350
Thr Leu Leu Ser Tyr Pro Gly Ala Gly His Leu Ile Glu Pro Pro Tyr
355 360 365
Ser Pro Leu Cys Cys Ala Ser Thr Thr His Asp Leu Arg Leu His Trp
370 375 380
Gly Gly Glu Val Ile Pro His Ala Ala Ala Gln Glu His Ala Trp Lys
385 390 395 400
Glu Ile Gln Arg Phe Leu Arg Lys His Leu Ile Pro Asp Val Thr Ser
405 410 415
Gln Leu
<210> 232
<211> 1257
<212> DNA
<213> Homo sapiens
<400> 232
atgattcaat tgactgcaac accagtttct gctttagttg atgaaccagt tcatattaga 60
gcaactggtt tgatcccatt ccaaatggtt tctttccaag cttcattgga agatgaaaac 120
ggtgacatgt tctattctca agcacattac agagctaacg aattcggtga agttgatttg 180
aaccatgctt cttcattggg tggtgactat atgggtgttc atccaatggg tttgttttgg 240
tcattgaagc cagaaaagtt gttgacaaga ttgttaaaaa gagatgttat gaacagacca 300
ttccaagttc aggttaagtt gtacgatttg gaattgatcg ttaataataa ggttgcatct 360
gctccaaaag catcattgac tttagaaaga tggtatgttg ctccaggtgt tacaagaatt 420
aaagttagag aaggtagatt gagaggtgca ttgtttttac caccaggtga aggtttattt 480
ccaggtgtta ttgatttgtt tggtggttta ggtggtttgt tagaattcag agcatctttg 540
ttagcttcaa gaggttttgc ttctttggca ttagcttacc ataactacga agatttgcca 600
agaaaaccag aagttactga tttggaatac tttgaagaag ctgcaaattt cttgttgaga 660
catccaaagg tttttggttc tggtgttggt gttgtttcag tttgtcaagg tgttcaaatc 720
ggtttgtcaa tggcaatata tttgaagcaa gttactgcta cagttttgat taatggtaca 780
aacttcccat tcggtattcc acaagtttac catggtcaaa ttcatcaacc attgccacat 840
tctgcacaat tgatctcaac taacgctttg ggtttgttgg aattgtacag aacattcgaa 900
actacacaag ttggtgcatc tcaatacttg tttccaattg aagaagctca aggtcaattt 960
ttgtttattg ttggtgaagg tgacaagact attaattcta aggcacatgc tgaacaagct 1020
attggtcaat tgaagagaca tggtaaaaat aactggacat tgttatcata tccaggtgca 1080
ggtcatttga ttgaaccacc atactctcca ttatgttgtg cttcaactac acatgatttg 1140
agattacatt ggggtggtga agttattcca catgctgcag ctcaagaaca tgcttggaag 1200
gaaatccaaa gatttttgag aaagcatttg atcccagatg ttacttcaca attataa 1257
<210> 233
<211> 420
<212> PRT
<213> Mus musculus
<400> 233
Met Ala Lys Leu Thr Ala Val Pro Leu Ser Ala Leu Val Asp Glu Pro
1 5 10 15
Val His Ile Gln Val Thr Gly Leu Ala Pro Phe Gln Val Val Cys Leu
20 25 30
Gln Ala Ser Leu Lys Asp Glu Lys Gly Asn Leu Phe Ser Ser Gln Ala
35 40 45
Phe Tyr Arg Ala Ser Glu Val Gly Glu Val Asp Leu Glu His Asp Pro
50 55 60
Ser Leu Gly Gly Asp Tyr Met Gly Val His Pro Met Gly Leu Phe Trp
65 70 75 80
Ser Leu Lys Pro Glu Lys Leu Leu Gly Arg Leu Ile Lys Arg Asp Val
85 90 95
Met Asn Ser Pro Tyr Gln Ile His Ile Lys Ala Cys His Pro Tyr Phe
100 105 110
Pro Leu Gln Asp Ile Val Val Ser Pro Pro Leu Asp Ser Leu Thr Leu
115 120 125
Glu Arg Trp Tyr Val Ala Pro Gly Val Lys Arg Ile Gln Val Lys Glu
130 135 140
Ser Arg Ile Arg Gly Ala Leu Phe Leu Pro Pro Gly Glu Gly Pro Phe
145 150 155 160
Pro Gly Val Ile Asp Leu Phe Gly Gly Ala Gly Gly Leu Met Glu Phe
165 170 175
Arg Ala Ser Leu Leu Ala Ser Arg Gly Phe Ala Thr Leu Ala Leu Ala
180 185 190
Tyr Trp Asn Tyr Asp Asp Leu Pro Ser Arg Leu Glu Lys Val Asp Leu
195 200 205
Glu Tyr Phe Glu Glu Gly Val Glu Phe Leu Leu Arg His Pro Lys Val
210 215 220
Leu Gly Pro Gly Val Gly Ile Leu Ser Val Cys Ile Gly Ala Glu Ile
225 230 235 240
Gly Leu Ser Met Ala Ile Asn Leu Lys Gln Ile Arg Ala Thr Val Leu
245 250 255
Ile Asn Gly Pro Asn Phe Val Ser Gln Ser Pro His Val Tyr His Gly
260 265 270
Gln Val Tyr Pro Pro Val Pro Ser Asn Glu Glu Phe Val Val Thr Asn
275 280 285
Ala Leu Gly Leu Val Glu Phe Tyr Arg Thr Phe Gln Glu Thr Ala Asp
290 295 300
Lys Asp Ser Lys Tyr Cys Phe Pro Ile Glu Lys Ala His Gly His Phe
305 310 315 320
Leu Phe Val Val Gly Glu Asp Asp Lys Asn Leu Asn Ser Lys Val His
325 330 335
Ala Asn Gln Ala Ile Ala Gln Leu Met Lys Asn Gly Lys Lys Asn Trp
340 345 350
Thr Leu Leu Ser Tyr Pro Gly Ala Gly His Leu Ile Glu Pro Pro Tyr
355 360 365
Thr Pro Leu Cys Gln Ala Ser Arg Met Pro Ile Leu Ile Pro Ser Leu
370 375 380
Ser Trp Gly Gly Glu Val Ile Pro His Ala Ala Ala Gln Glu His Ser
385 390 395 400
Trp Lys Glu Ile Gln Lys Phe Leu Lys Gln His Leu Leu Pro Asp Leu
405 410 415
Ser Ser Gln Leu
420
<210> 234
<211> 1263
<212> DNA
<213> Mus musculus
<400> 234
atggcaaaat tgactgctgt tccattgtct gctttagttg atgaaccagt tcatattcaa 60
gttacaggtt tggcaccatt tcaagttgtt tgtttgcaag cttcattgaa ggatgaaaag 120
ggtaatttgt tttcttcaca agcattctac agagcttctg aagttggtga agtcgatttg 180
gaacatgatc catcattggg tggtgactac atgggtgttc atccaatggg tttgttttgg 240
tctttgaagc cagaaaagtt gttgggtaga ttgattaaaa gagatgttat gaactcacca 300
taccaaatcc atatcaaggc ttgtcatcca tactttccat tgcaagatat tgttgtttct 360
ccaccattag attcattgac tttggaaaga tggtatgttg caccaggtgt taagagaatt 420
caagttaagg aatctagaat tagaggtgct ttgtttttac caccaggtga aggtccattt 480
ccaggtgtta ttgatttgtt tggtggtgca ggtggtttaa tggaattcag agcatctttg 540
ttagcttcaa gaggttttgc tacattggca ttagcttatt ggaattacga tgatttgcca 600
tcaagattgg aaaaggttga tttggaatac ttcgaagaag gtgttgaatt cttgttgaga 660
catccaaagg ttttgggtcc aggtgttggt attttatctg tttgtatcgg tgcagaaatc 720
ggtttgtcaa tggctattaa tttgaagcaa atcagagcaa ctgttttgat taatggtcca 780
aacttcgttt ctcaatcacc acatgtttat catggtcaag tttacccacc agttccatct 840
aacgaagaat tcgttgttac aaacgctttg ggtttagttg aattctacag aactttccaa 900
gaaacagcag ataaggattc taagtactgt ttcccaatcg aaaaggctca tggtcatttc 960
ttgtttgttg ttggtgaaga tgataagaat ttgaactcaa aggttcatgc taaccaagca 1020
atcgctcaat tgatgaagaa cggtaaaaag aattggactt tgttatctta tccaggtgca 1080
ggtcatttga ttgaaccacc atacacacca ttatgtcaag cttcaagaat gccaattttg 1140
attccatctt tatcatgggg tggtgaagtt attccacatg ctgcagctca agaacattct 1200
tggaaggaaa tccaaaagtt cttgaagcaa catttgttac cagatttgtc ttcacaatta 1260
taa 1263
<210> 235
<211> 420
<212> PRT
<213> Brown rat (Rattus norvegicus)
<400> 235
Met Ala Lys Leu Thr Ala Val Pro Leu Ser Ala Leu Val Asp Glu Pro
1 5 10 15
Val His Ile Arg Val Thr Gly Leu Thr Pro Phe Gln Val Val Cys Leu
20 25 30
Gln Ala Ser Leu Lys Asp Asp Lys Gly Asn Leu Phe Asn Ser Gln Ala
35 40 45
Phe Tyr Arg Ala Ser Glu Val Gly Glu Val Asp Leu Glu Arg Asp Ser
50 55 60
Ser Leu Gly Gly Asp Tyr Met Gly Val His Pro Met Gly Leu Phe Trp
65 70 75 80
Ser Met Lys Pro Glu Lys Leu Leu Thr Arg Leu Val Lys Arg Asp Val
85 90 95
Met Asn Arg Pro His Lys Val His Ile Lys Leu Cys His Pro Tyr Phe
100 105 110
Pro Val Glu Gly Lys Val Ile Ser Ser Ser Leu Asp Ser Leu Ile Leu
115 120 125
Glu Arg Trp Tyr Val Ala Pro Gly Val Thr Arg Ile His Val Lys Glu
130 135 140
Gly Arg Ile Arg Gly Ala Leu Phe Leu Pro Pro Gly Glu Gly Pro Phe
145 150 155 160
Pro Gly Val Ile Asp Leu Phe Gly Gly Ala Gly Gly Leu Phe Glu Phe
165 170 175
Arg Ala Ser Leu Leu Ala Ser His Gly Phe Ala Thr Leu Ala Leu Ala
180 185 190
Tyr Trp Gly Tyr Asp Asp Leu Pro Ser Arg Leu Glu Lys Val Asp Leu
195 200 205
Glu Tyr Phe Glu Glu Gly Val Glu Phe Leu Leu Arg His Pro Lys Val
210 215 220
Leu Gly Pro Gly Val Gly Ile Leu Ser Val Cys Ile Gly Ala Glu Ile
225 230 235 240
Gly Leu Ser Met Ala Ile Asn Leu Lys Gln Ile Thr Ala Thr Val Leu
245 250 255
Ile Asn Gly Pro Asn Phe Val Ser Ser Asn Pro His Val Tyr Arg Gly
260 265 270
Lys Val Phe Gln Pro Thr Pro Cys Ser Glu Glu Phe Val Thr Thr Asn
275 280 285
Ala Leu Gly Leu Val Glu Phe Tyr Arg Thr Phe Glu Glu Thr Ala Asp
290 295 300
Lys Asp Ser Lys Tyr Cys Phe Pro Ile Glu Lys Ala His Gly His Phe
305 310 315 320
Leu Phe Val Val Gly Glu Asp Asp Lys Asn Leu Asn Ser Lys Val His
325 330 335
Ala Lys Gln Ala Ile Ala Gln Leu Met Lys Ser Gly Lys Lys Asn Trp
340 345 350
Thr Leu Leu Ser Tyr Pro Gly Ala Gly His Leu Ile Glu Pro Pro Tyr
355 360 365
Ser Pro Leu Cys Ser Ala Ser Arg Met Pro Phe Val Ile Pro Ser Ile
370 375 380
Asn Trp Gly Gly Glu Val Ile Pro His Ala Ala Ala Gln Glu His Ser
385 390 395 400
Trp Lys Glu Ile Gln Lys Phe Leu Lys Gln His Leu Asn Pro Gly Phe
405 410 415
Asn Ser Gln Leu
420
<210> 236
<211> 1263
<212> DNA
<213> Brown rat (Rattus norvegicus)
<400> 236
atggcaaaat tgactgctgt tccattgtca gcattagttg atgaaccagt tcatattaga 60
gttactggtt tgacaccatt tcaagttgtt tgtttgcaag cttctttgaa ggatgataag 120
ggtaatttgt ttaattcaca agcattctac agagcttctg aagttggtga agtcgatttg 180
gaaagagatt cttcattggg tggtgactac atgggtgttc atccaatggg tttgttttgg 240
tctatgaagc cagaaaagtt gttgactaga ttagttaaga gagatgttat gaacagacca 300
cataaggttc atatcaagtt gtgtcatcca tacttcccag ttgagggtaa agttatttct 360
tcatctttgg attctttgat tttagaaaga tggtatgttg caccaggtgt tacaagaatt 420
catgttaagg aaggtagaat tagaggtgct ttgtttttac caccaggtga aggtccattt 480
ccaggtgtta ttgatttgtt tggtggtgca ggtggtttat ttgaattcag agcatcattg 540
ttagcttctc atggttttgc tacattggca ttagcttatt ggggttacga tgatttgcca 600
tctagattgg aaaaggttga tttggaatac ttcgaagaag gtgttgaatt cttgttgaga 660
catccaaagg ttttgggtcc aggtgttggt attttatcag tttgtatcgg tgcagaaatc 720
ggtttgtcta tggctattaa tttgaagcaa atcactgcaa cagttttgat taatggtcca 780
aacttcgttt catctaaccc acatgtttac cgtggtaaag tttttcaacc aactccatgt 840
tcagaagaat tcgttactac aaacgctttg ggtttagttg aattctacag aactttcgaa 900
gaaacagcag ataaggattc taagtactgt ttcccaatcg aaaaggctca tggtcatttc 960
ttgtttgttg ttggtgaaga tgataagaat ttgaactcaa aggttcatgc taagcaagca 1020
atcgctcaat tgatgaagtc aggtaaaaag aattggacat tgttgtctta tccaggtgca 1080
ggtcatttga ttgaaccacc atactcacca ttatgttcag cttctagaat gccattcgtt 1140
atcccatcta ttaattgggg tggtgaagtt attccacatg ctgcagctca agaacattca 1200
tggaaggaaa tccaaaagtt cttgaagcaa catttgaacc caggttttaa ttctcaatta 1260
taa 1263
<210> 237
<211> 626
<212> PRT
<213> Ralstonia pickettii
<400> 237
Met Gly Thr Phe Ala Leu Ser Val Thr Pro Ala Asp Asp Leu Ile Asp
1 5 10 15
Val Ser Arg Gly Ile Val Val Thr Gly Leu Ala Pro Gly Thr Gln Val
20 25 30
Gly Ile Val Ala Gln Thr Arg Arg Gly Asn Asp Val Leu Trp His Ser
35 40 45
Arg Ala Ala Phe Val Ala Asp Ala Gln Gly Thr Val Asp Leu Thr Arg
50 55 60
Asp Ala Pro Val Ser Gly Asp Tyr Ala Gly Val Ser Ala Met Gly Ile
65 70 75 80
Val Trp Ser Gln Arg Pro Glu Asp Gly Lys Ala Arg Glu Val Phe Pro
85 90 95
Gln Pro Val Ala Glu Pro Leu Thr Thr Thr Leu Thr Ala Thr Ala Asn
100 105 110
Gly Glu Ser Val His Ala Ser Phe Val Gln Arg Leu Ala Ala Pro Gly
115 120 125
Val Thr Arg His Asp Val Arg Asp Asp Gly Leu Val Gly Thr Leu Tyr
130 135 140
Leu Pro Asp Pro Tyr Ala His Pro Gly Pro Arg Pro Ala Val Leu Ile
145 150 155 160
Leu Asn Gly Ser Gly Gly Gly Ile Asn Glu Pro Arg Ala Ala Leu Tyr
165 170 175
Ala Ser His Gly Tyr Ala Ala Phe Ala Leu Ala Tyr Phe Lys Ala Pro
180 185 190
Gly Leu Pro Asp Tyr Ile Ser Asn Thr Pro Leu Glu Tyr Phe Glu Arg
195 200 205
Ala Leu Ala Trp Leu Arg Lys Arg Val Glu Pro Leu His Asp Phe Val
210 215 220
Ala Val Ser Gly Gln Ser Arg Gly Gly Glu Leu Ala Leu Leu Leu Gly
225 230 235 240
Ala Thr Phe Pro Glu Ala Val Ser Ala Val Ile Gly Tyr Val Pro Gly
245 250 255
Ala Val Val His Ser Gly Gln Asn Ala Ala Asp Pro Ala Val Gly Arg
260 265 270
Glu Gly Pro Thr Trp Leu Tyr Arg Gly Gln Pro Leu Pro His Leu Trp
275 280 285
Glu Gly Asn Arg Thr Ala Ser Trp Ala Pro Phe Asp Glu Gly Pro Ala
290 295 300
Pro His Arg His Glu Arg Ala Ile Arg Thr Ala Leu Gln Asp Thr Asp
305 310 315 320
Ala Val Ala Arg Ala Arg Ile Arg Ile Glu Arg Ala Arg Gly Pro Val
325 330 335
Leu Leu Leu Ser Ala Thr Asp Asp Gly Ser Trp Pro Ser Ser Asp Tyr
340 345 350
Ser Arg Met Val Thr Thr Lys Leu Ala Glu Val Arg His Pro Tyr Pro
355 360 365
Val Gln His Phe Asp Tyr Glu Gly Ala Gly His Ala Ile Val Phe Pro
370 375 380
Tyr Val Pro Thr Thr Gln Leu Val Tyr Ala His Pro Val Ser Gly Arg
385 390 395 400
Ile Ser Thr Gly Gly Gly Glu Pro Arg Ala Asn Ala Arg Ala Asp Ala
405 410 415
Gln Ser Trp Ala Ala Val Leu Arg Phe Leu Ala Ser Ala Val Ala Ala
420 425 430
Arg Gly Ala Ser Val Pro Asp Ser Arg Ser Leu Ser Ser Met Asp Phe
435 440 445
Thr Pro Ala His Asp Val Ala Asp Gln Val Ala Gly Leu Asp Asp Gly
450 455 460
Ser Pro Thr His Ala Leu Arg His Ala Arg Glu Lys Val Ala Thr Ala
465 470 475 480
Thr Gln Gly Ser Tyr Asn Ala Leu Phe Asp Ala Gly Leu Pro Gly Leu
485 490 495
Thr Leu Gly Glu Arg Leu Leu Val Ala Leu Tyr Ala Cys Arg Leu Thr
500 505 510
Pro Ala Pro Glu Leu Ala Glu His Tyr Arg Ala Arg Leu Ala Ser Thr
515 520 525
Pro Val Asp Ala Asp Ala Leu Gln Ala Val Asp His Gly Asp Ile Asp
530 535 540
Thr Leu Thr Asp Ala Arg Leu Arg Ala Ile Leu Thr Phe Thr Arg Thr
545 550 555 560
Leu Val Glu Arg Pro Ile Glu Gly Asp Arg Asp Ala Leu Leu Arg Leu
565 570 575
Pro Ala Ala Gly Leu Ala Thr Ala Asp Val Val Thr Leu Ala Gln Leu
580 585 590
Ile Ala Phe Leu Ser Tyr Gln Thr Arg Leu Val Ala Gly Leu Arg Ala
595 600 605
Leu Arg Glu Ala Ala Gly Ser Gly Ser Ala Thr Ala Ser Thr Glu Thr
610 615 620
Ala Ala
625
<210> 238
<211> 1881
<212> DNA
<213> Ralstonia pickettii
<400> 238
atgggtactt ttgctttgtc agttacacca gcagatgatt tgattgatgt ttctagaggt 60
attgttgtta ctggtttggc tccaggtaca caagttggta ttgttgcaca aactagacgt 120
ggtaatgatg ttttgtggca ttctagagct gcatttgttg ctgatgcaca aggtactgtt 180
gatttgacaa gagatgctcc agtttctggt gactatgcag gtgtttcagc tatgggtatt 240
gtttggtctc aaagaccaga agatggtaaa gctagagaag tttttccaca accagttgca 300
gaaccattga ctacaacttt aacagctact gcaaatggtg aatctgttca tgcttcattt 360
gttcaaagat tggctgcacc aggtgttact agacatgatg ttagagatga tggtttagtt 420
ggtacattgt atttgccaga tccatacgct catccaggtc caagaccagc agttttgatt 480
ttaaatggtt ctggtggtgg tattaatgaa ccaagagctg cattgtatgc ttctcatggt 540
tacgctgcat ttgctttggc atactttaaa gctccaggtt taccagatta catctctaac 600
actccattag aatactttga aagagctttg gcatggttaa gaaaaagagt tgaaccattg 660
catgattttg ttgctgtttc tggtcaatca agaggtggtg aattagcttt gttattgggt 720
gcaacatttc cagaagcagt ttcagctgtt attggttatg ttccaggtgc tgttgttcat 780
tctggtcaaa atgctgcaga tccagctgtt ggtagagaag gtccaacttg gttgtacaga 840
ggtcaaccat tgccacattt gtgggaaggt aatagaacag catcatgggc tccatttgat 900
gaaggtccag ctccacatag acatgaaaga gcaattagaa ctgctttaca agatacagat 960
gctgttgcaa gagctagaat tagaattgaa agagctagag gtccagtttt attgttatct 1020
gcaactgatg atggttcatg gccatcttca gattactcta gaatggttac aactaagttg 1080
gctgaagtta gacatccata tccagttcaa cattttgatt acgaaggtgc aggtcatgct 1140
attgtttttc catatgttcc aacaactcaa ttagtttacg ctcatccagt ttctggtaga 1200
atttcaactg gtggtggtga accaagagca aatgctagag cagatgctca atcatgggct 1260
gcagttttga gatttttagc atcagctgtt gctgcaagag gtgcttctgt tccagattct 1320
agatcattgt cttcaatgga tttcactcca gcacatgatg ttgctgatca agttgcaggt 1380
ttggatgatg gttcaccaac acatgcttta agacatgcaa gagaaaaagt tgcaacagct 1440
actcaaggtt cttataatgc tttgtttgat gcaggtttac caggtttgac tttaggtgaa 1500
agattgttag ttgcattgta tgcttgtaga ttaacaccag ctccagaatt ggcagaacat 1560
tacagagcaa gattagcttc tactccagtt gatgcagatg ctttgcaagc tgttgatcat 1620
ggtgacattg atacattaac tgatgctaga ttgagagcaa tcttgacttt tactagaaca 1680
ttagttgaaa gaccaattga aggtgacaga gatgctttgt taagattgcc agctgcaggt 1740
ttagcaactg ctgatgttgt tacattggct caattgatcg catttttgtc ataccaaact 1800
agattagttg ctggtttgag agcattaaga gaagctgcag gttctggttc agcaacagct 1860
tctactgaaa cagctgcata a 1881
<210> 239
<211> 461
<212> PRT
<213> Cattle (Bos taurus)
<400> 239
Met Ser Thr Gln Glu Gln Thr Pro Gln Ile Cys Val Val Gly Ser Gly
1 5 10 15
Pro Ala Gly Phe Tyr Thr Ala Gln His Leu Leu Lys His His Ser Arg
20 25 30
Ala His Val Asp Ile Tyr Glu Lys Gln Leu Val Pro Phe Gly Leu Val
35 40 45
Arg Phe Gly Val Ala Pro Asp His Pro Glu Val Lys Asn Val Ile Asn
50 55 60
Thr Phe Thr Gln Thr Ala Arg Ser Asp Arg Cys Ala Phe Tyr Gly Asn
65 70 75 80
Val Glu Val Gly Arg Asp Val Thr Val Gln Glu Leu Arg Asp Ala Tyr
85 90 95
His Ala Val Val Leu Ser Tyr Gly Ala Glu Asp His Gln Ala Leu Asp
100 105 110
Ile Pro Gly Glu Glu Leu Pro Gly Val Phe Ser Ala Arg Ala Phe Val
115 120 125
Gly Trp Tyr Asn Gly Leu Pro Glu Asn Arg Glu Leu Ala Pro Asp Leu
130 135 140
Ser Cys Asp Thr Ala Val Ile Leu Gly Gln Gly Asn Val Ala Leu Asp
145 150 155 160
Val Ala Arg Ile Leu Leu Thr Pro Pro Asp His Leu Glu Lys Thr Asp
165 170 175
Ile Thr Glu Ala Ala Leu Gly Ala Leu Arg Gln Ser Arg Val Lys Thr
180 185 190
Val Trp Ile Val Gly Arg Arg Gly Pro Leu Gln Val Ala Phe Thr Ile
195 200 205
Lys Glu Leu Arg Glu Met Ile Gln Leu Pro Gly Thr Arg Pro Met Leu
210 215 220
Asp Pro Ala Asp Phe Leu Gly Leu Gln Asp Arg Ile Lys Glu Ala Ala
225 230 235 240
Arg Pro Arg Lys Arg Leu Met Glu Leu Leu Leu Arg Thr Ala Thr Glu
245 250 255
Lys Pro Gly Val Glu Glu Ala Ala Arg Arg Ala Ser Ala Ser Arg Ala
260 265 270
Trp Gly Leu Arg Phe Phe Arg Ser Pro Gln Gln Val Leu Pro Ser Pro
275 280 285
Asp Gly Arg Arg Ala Ala Gly Ile Arg Leu Ala Val Thr Arg Leu Glu
290 295 300
Gly Ile Gly Glu Ala Thr Arg Ala Val Pro Thr Gly Asp Val Glu Asp
305 310 315 320
Leu Pro Cys Gly Leu Val Leu Ser Ser Ile Gly Tyr Lys Ser Arg Pro
325 330 335
Ile Asp Pro Ser Val Pro Phe Asp Pro Lys Leu Gly Val Val Pro Asn
340 345 350
Met Glu Gly Arg Val Val Asp Met Pro Gly Leu Tyr Cys Ser Gly Trp
355 360 365
Val Lys Arg Gly Pro Thr Gly Val Ile Thr Thr Thr Met Thr Asp Ser
370 375 380
Phe Leu Thr Gly Gln Ile Leu Leu Gln Asp Leu Lys Ala Gly His Leu
385 390 395 400
Pro Ser Gly Pro Arg Pro Gly Ser Thr Phe Ile Lys Ala Leu Leu Asp
405 410 415
Ser Arg Gly Ala Trp Pro Val Ser Phe Ser Asp Trp Glu Lys Leu Asp
420 425 430
Ala Glu Glu Val Ser Arg Gly Gln Ala Ser Gly Lys Pro Arg Glu Lys
435 440 445
Leu Leu Asp Pro Gln Glu Met Leu Arg Leu Leu Gly His
450 455 460
<210> 240
<211> 1386
<212> DNA
<213> Cattle (Bos taurus)
<400> 240
atgtccacac aggagcagac cccccagatc tgtgtggtgg gcagtggccc agctggcttt 60
tacacggccc agcacctgct aaagcaccac tcccgggccc acgtggatat ctacgagaaa 120
cagctggtgc ccttcggcct ggtgcgcttt ggtgtggcgc ctgaccaccc cgaggtcaag 180
aatgttatca acacctttac ccagacggcc cgctctgacc gctgtgcctt ctatggcaac 240
gtggaggtgg gcagggatgt gactgtgcag gagctgcggg acgcctacca cgccgtggtg 300
ctgagctatg gggcagagga ccatcaggcc ctggatatcc ctggtgagga gttgcccggc 360
gtgttctcgg cccgggcctt tgtgggctgg tacaatgggc ttcctgagaa ccgggagctg 420
gccccggacc tgagctgtga cacagccgtg attctggggc aggggaatgt ggctctggac 480
gtggcccgga tcctgctgac cccccccgac cacctggaga aaacggacat cactgaggcc 540
gccctgggag ccctgagaca gagtcgggtg aagacggtgt ggatcgtggg ccgacgtgga 600
cccctacaag tggccttcac cataaaggag cttcgggaga tgattcagtt accaggaact 660
cggcccatgt tggatcctgc ggatttcttg ggtctccagg acagaatcaa ggaggccgct 720
cgcccgagga agcggctgat ggaactgctg cttcgaacag ccacggagaa gccaggggtg 780
gaggaggctg cccgccgggc atcagcctcc cgtgcctggg gcctccgctt cttccgaagc 840
ccacagcagg tcctgccctc gccagatggg cggcgggcgg caggcatccg cctggcagtc 900
accagactgg agggcattgg agaggccacc cgggcagtgc ccactgggga tgtggaggac 960
ctcccctgtg ggctggtgct gagcagcatt gggtataaga gccgccccat cgaccccagt 1020
gtgccctttg accccaagct cggggtcgtc cccaatatgg agggccgggt tgtggatatg 1080
ccaggcctct actgcagcgg ctgggtgaag cggggaccca caggtgtcat caccaccacc 1140
atgaccgaca gcttcctcac tggccagatt ctgctacagg acctgaaggc cgggcacctg 1200
ccgtctggcc ccaggccggg ctctacattc atcaaggccc tgctggacag ccgaggggcc 1260
tggcccgtgt ctttctcgga ctgggagaaa ctggatgctg aggaggtgtc ccggggccag 1320
gcctcgggga agcccagaga gaagctgctg gatcctcagg agatgctgcg gctgctgggg 1380
cactga 1386
<210> 241
<211> 129
<212> PRT
<213> Cattle (Bos taurus)
<400> 241
Met Ser Ser Ser Glu Asp Lys Ile Thr Val His Phe Ile Asn Arg Asp
1 5 10 15
Gly Glu Thr Leu Thr Thr Lys Gly Lys Ile Gly Asp Ser Leu Leu Asp
20 25 30
Val Val Val Gln Asn Asn Leu Asp Ile Asp Gly Phe Gly Ala Cys Glu
35 40 45
Gly Thr Leu Ala Cys Ser Thr Cys His Leu Ile Phe Glu Gln His Ile
50 55 60
Phe Glu Lys Leu Glu Ala Ile Thr Asp Glu Glu Asn Asp Met Leu Asp
65 70 75 80
Leu Ala Tyr Gly Leu Thr Asp Arg Ser Arg Leu Gly Cys Gln Ile Cys
85 90 95
Leu Thr Lys Ala Met Asp Asn Met Thr Val Arg Val Pro Asp Ala Val
100 105 110
Ser Asp Ala Arg Glu Ser Ile Asp Met Gly Met Asn Ser Ser Lys Ile
115 120 125
Glu
<210> 242
<211> 390
<212> DNA
<213> Cattle (Bos taurus)
<400> 242
atgagcagct cagaagataa aataacagtc cactttataa accgtgatgg tgaaacatta 60
acaaccaaag gaaaaattgg tgactctctg ctagatgttg tggttcaaaa taatctagat 120
attgatggtt ttggtgcatg tgagggaacc ttggcttgtt ctacctgtca cctcatcttt 180
gaacagcaca tatttgagaa attggaagca atcactgatg aggagaatga catgcttgat 240
ctggcatatg gactaacaga tagatcgcgg ttgggctgcc agatctgttt gacaaaggct 300
atggacaata tgactgttcg agtacctgat gccgtgtctg atgccagaga gtccattgat 360
atgggcatga actcctcaaa gatagaataa 390
<210> 243
<211> 129
<212> PRT
<213> Zebrafish (Danio rerio)
<400> 243
Met Leu Arg Ala Glu Glu Lys Val Thr Val His Phe Leu Asn Arg Asp
1 5 10 15
Gly Lys Arg Ile Thr Val Lys Ala Ser Ile Gly Glu Ser Leu Leu Asp
20 25 30
Val Val Val Asp Arg Asp Leu Asp Ile Asp Gly Phe Gly Ala Cys Glu
35 40 45
Gly Thr Leu Ala Cys Ser Thr Cys His Leu Ile Phe Glu Glu Asp Val
50 55 60
Tyr Lys Lys Leu Gly Pro Val Ser Asp Glu Glu Met Asp Met Leu Asp
65 70 75 80
Leu Ala Tyr Gly Leu Thr Asp Thr Ser Arg Leu Gly Cys Gln Val Cys
85 90 95
Leu Arg Lys Asp Leu Asp Gly Met Ile Leu Arg Val Pro Asp Val Ile
100 105 110
Ser Asp Ala Arg Ala Asp Ser Glu Lys Glu Ser Ser Thr Ala Pro Pro
115 120 125
Lys
<210> 244
<211> 390
<212> DNA
<213> Zebrafish (Danio rerio)
<400> 244
atgttaagag ctgaagagaa agttactgtt catttcttga atagggatgg taagagaatc 60
actgttaagg cttcaatcgg tgaatcatta ttggacgttg tcgtagatag agacttggac 120
atagatggtt ttggtgcttg tgaaggaaca ttggcttgtt ctacttgtca cttaatattc 180
gaggaagatg tctataagaa attaggtcca gtctcagatg aggaaatgga tatgttagac 240
ttggcttatg gtttaactga tacctctagg ttaggttgcc aagtatgttt aagaaaggac 300
ttagatggta tgatattgag agttccagac gttatttcag atgcaagagc tgattcagag 360
aaggagtctt ctactgcacc accaaaatga 390
<210> 245
<211> 124
<212> PRT
<213> Human (Homo sapiens)
<400> 245
Met Ser Ser Ser Glu Asp Lys Ile Thr Val His Phe Ile Asn Arg Asp
1 5 10 15
Gly Glu Thr Leu Thr Thr Lys Gly Lys Val Gly Asp Ser Leu Leu Asp
20 25 30
Val Val Val Glu Asn Asn Leu Asp Ile Asp Gly Phe Gly Ala Cys Glu
35 40 45
Gly Thr Leu Ala Cys Ser Thr Cys His Leu Ile Phe Glu Asp His Ile
50 55 60
Tyr Glu Lys Leu Asp Ala Ile Thr Asp Glu Glu Asn Asp Met Leu Asp
65 70 75 80
Leu Ala Tyr Gly Leu Thr Asp Arg Ser Arg Leu Gly Cys Gln Ile Cys
85 90 95
Leu Thr Lys Ser Met Asp Asn Met Thr Val Arg Val Pro Glu Thr Val
100 105 110
Ala Asp Ala Arg Gln Ser Ile Asp Val Gly Lys Thr
115 120
<210> 246
<211> 375
<212> DNA
<213> Human (Homo sapiens)
<400> 246
atgtcatcat cagaagataa gattactgtc cattttatca acagagatgg tgaaactttg 60
actaccaaag gtaaagtagg agactcatta ttagacgtag tcgtcgaaaa taatttggat 120
atagatggtt tcggtgcttg tgaaggaaca ttggcatgtt ctacctgtca cttaatattc 180
gaggaccaca tttatgagaa gttagatgct attaccgatg aagaaaatga tatgttagat 240
ttggcttacg gtttgacaga tagatcaaga ttgggatgtc aaatctgctt gactaaatct 300
atggataata tgactgttag ggttccagaa acagtcgcag atgctagaca gtcaatagat 360
gttggaaaaa cttga 375
<210> 247
<211> 124
<212> PRT
<213> House mouse (Mus musculus)
<400> 247
Met Ser Ser Ser Glu Asp Lys Ile Thr Val His Phe Lys Asn Arg Asp
1 5 10 15
Gly Glu Thr Leu Thr Thr Lys Gly Lys Ile Gly Asp Ser Leu Leu Asp
20 25 30
Val Val Ile Glu Asn Asn Leu Asp Ile Asp Gly Phe Gly Ala Cys Glu
35 40 45
Gly Thr Leu Ala Cys Ser Thr Cys His Leu Ile Phe Glu Asp His Ile
50 55 60
Tyr Glu Lys Leu Asp Ala Ile Thr Asp Glu Glu Asn Asp Met Leu Asp
65 70 75 80
Leu Ala Phe Gly Leu Thr Asp Arg Ser Arg Leu Gly Cys Gln Val Cys
85 90 95
Leu Thr Lys Ala Met Asp Asn Met Thr Val Arg Val Pro Glu Ala Val
100 105 110
Ala Asp Val Arg Gln Ser Val Asp Met Ser Lys Asn
115 120
<210> 248
<211> 375
<212> DNA
<213> House mouse (Mus musculus)
<400> 248
atgtcttcat cagaagacaa gataaccgtt cacttcaaaa acagagacgg tgagacattg 60
actaccaagg gtaagatcgg tgattcatta ttagacgttg tcattgaaaa taatttagat 120
attgatggtt tcggagcatg tgaaggaaca ttggcatgtt ctacctgtca cttgatcttc 180
gaggatcata tatacgaaaa attggacgca attacagatg aggagaacga tatgttagac 240
ttggcctttg gattaactga taggtctaga ttgggttgcc aggtttgttt gactaaagca 300
atggacaaca tgactgtaag agttccagaa gccgttgcag acgttagaca atctgtagat 360
atgtctaaaa attga 375
<210> 249
<211> 128
<212> PRT
<213> Wild boar (Sus scrofa)
<400> 249
Met Ser Ser Ser Glu Asp Lys Ile Thr Val His Phe Ile Asn Arg Asp
1 5 10 15
Gly Lys Thr Leu Thr Thr Gln Gly Lys Val Gly Asp Ser Leu Leu Asp
20 25 30
Val Val Ile Glu Asn Asn Leu Asp Ile Asp Gly Phe Gly Ala Cys Glu
35 40 45
Gly Thr Leu Ala Cys Ser Thr Cys His Leu Ile Phe Glu Asp His Ile
50 55 60
Phe Glu Lys Leu Glu Ala Ile Thr Asp Glu Glu Asn Asp Met Leu Asp
65 70 75 80
Leu Ala Tyr Gly Leu Thr Asp Arg Ser Arg Leu Gly Cys Gln Ile Cys
85 90 95
Leu Thr Lys Ala Met Asp Asn Met Thr Val Arg Val Pro Glu Ala Val
100 105 110
Ala Asp Ala Arg Glu Ser Ile Asp Leu Gly Lys Asn Ser Ser Lys Leu
115 120 125
<210> 250
<211> 387
<212> DNA
<213> Wild boar (Sus scrofa)
<400> 250
atgtcatcat cagaagataa aattactgtt cactttataa acagagacgg taagaccttg 60
acaactcaag gtaaggtagg tgattcatta ttagatgttg ttatagagaa taacttagac 120
atcgacggtt ttggtgcttg tgaaggtact ttggcttgtt ctacttgtca tttgattttt 180
gaagaccata tctttgaaaa attggaagct attactgatg aagagaatga tatgttggac 240
ttagcctacg gattgactga tagatctaga ttgggttgtc agatatgttt aacaaaggca 300
atggataata tgacagtcag agtcccagag gctgtcgctg acgcaagaga gtcaatagac 360
ttaggtaaaa attcatctaa attgtga 387
<210> 251
<211> 124
<212> PRT
<213> Gray short-tailed opossum (Monodelphis domestica)
<400> 251
Met Arg Ser Ser Glu Asp Lys Val Thr Ile His Phe Val Asn Arg Asp
1 5 10 15
Gly Glu Lys Leu Thr Thr Gln Gly Lys Val Gly Asp Thr Leu Leu Asp
20 25 30
Ile Val Val Asn Asn Asn Leu Asp Ile Asp Gly Phe Gly Ala Cys Glu
35 40 45
Gly Thr Leu Ala Cys Ser Thr Cys His Leu Val Phe Glu Glu His Ile
50 55 60
Phe Gly Lys Leu Glu Ala Ile Thr Asp Glu Glu Asn Asp Met Leu Asp
65 70 75 80
Leu Ala Tyr Gly Leu Thr Asp Thr Ser Arg Leu Gly Cys Gln Ile Cys
85 90 95
Leu Thr Lys Ser Met Asn Asn Met Thr Val Arg Val Pro Glu Ala Val
100 105 110
Ala Asp Ala Arg Gln Ser Ile Asp Leu Gly Lys Asn
115 120
<210> 252
<211> 375
<212> DNA
<213> Gray short-tailed opossum (Monodelphis domestica)
<400> 252
atgagatcat ctgaagacaa agtcaccatc cattttgtca acagagacgg agaaaagttg 60
accacccaag gtaaagttgg tgataccttg ttggatattg tcgtcaataa taatttagat 120
atagacggtt ttggtgcttg tgaaggtact ttagcttgct ctacttgtca tttagttttt 180
gaagaacaca tttttggtaa attggaagct attaccgatg aagaaaacga tatgttagac 240
ttagcttacg gtttgactga tacatcaaga ttgggttgcc aaatatgctt aactaaatca 300
atgaataaca tgactgttag ggttccagaa gcagttgcag acgctagaca atctattgat 360
ttaggaaaga actga 375
<210> 253
<211> 124
<212> PRT
<213> Giant panda (Ailuropoda melanoleuca)
<400> 253
Met Ser Ser Ser Glu Asp Lys Ile Thr Val His Phe Val Asn Arg Asp
1 5 10 15
Gly Glu Thr Leu Thr Ala Lys Gly Arg Val Gly Asp Ser Leu Leu Asp
20 25 30
Val Val Ile Glu Asn Asn Leu Asp Ile Asp Gly Phe Gly Ala Cys Glu
35 40 45
Gly Thr Leu Ala Cys Ser Thr Cys His Leu Ile Phe Glu Glu His Ile
50 55 60
Phe Glu Lys Leu Glu Ala Val Thr Asp Glu Glu Asn Asp Met Leu Asp
65 70 75 80
Leu Ala Tyr Gly Leu Thr Asp Arg Ser Arg Leu Gly Cys Gln Ile Tyr
85 90 95
Leu Thr Lys Ser Met Asp Asn Met Thr Val Arg Val Pro Asp Val Val
100 105 110
Ala Asp Ala Arg Gln Ser Met Asp Val Gly Lys Asn
115 120
<210> 254
<211> 375
<212> DNA
<213> Giant panda (Ailuropoda melanoleuca)
<400> 254
atgtcatctt cagaagataa aataaccgta catttcgtca atagggatgg tgagaccttg 60
acagcaaaag gtagggtagg tgattcatta ttggatgtag tcattgagaa caatttagat 120
attgacggat ttggagcttg tgaaggtact ttggcatgtt caacatgtca cttgatcttc 180
gaggaacata tttttgaaaa attggaagct gttacagacg aagagaatga tatgttggat 240
ttggcttatg gattgacaga taggtctaga ttaggttgtc aaatatactt gactaaatca 300
atggataata tgacagtcag agtacctgat gttgtagctg acgccaggca atctatggat 360
gtcggtaaga actga 375
<210> 255
<211> 123
<212> PRT
<213> Helmeted guineafowl (Numida meleagris)
<400> 255
Met Ser Ser Glu Asp Lys Ile Thr Val His Phe Ile Asn Arg Asp Gly
1 5 10 15
Asp Lys Leu Thr Ala Lys Gly Lys Pro Gly Asp Ser Leu Leu Asp Val
20 25 30
Val Val Asp Asn Asn Leu Asp Ile Asp Gly Phe Gly Ala Cys Glu Gly
35 40 45
Thr Leu Ala Cys Ser Thr Cys His Leu Ile Phe Glu Asp His Ile Phe
50 55 60
Glu Lys Leu Asp Ala Ile Thr Asp Glu Glu Met Asp Met Leu Asp Leu
65 70 75 80
Ala Tyr Gly Leu Thr Glu Thr Ser Arg Leu Gly Cys Gln Ile Cys Leu
85 90 95
Lys Lys Ser Met Asp Asn Met Thr Val Arg Val Pro Glu Ala Val Ala
100 105 110
Asp Ala Arg Gln Ser Val Asp Leu Ser Lys Asn
115 120
<210> 256
<211> 372
<212> DNA
<213> Helmeted guineafowl (Numida meleagris)
<400> 256
atgtcatcag aagataagat tacagtacat ttcattaata gggacggtga caaattaacc 60
gctaaaggaa aaccaggaga ctcattatta gatgttgttg tagacaataa tttagatata 120
gatggtttcg gagcttgtga gggtacatta gcatgttcaa catgccactt aatctttgaa 180
gatcacatat ttgaaaaatt agatgctatt acagatgagg aaatggacat gttggattta 240
gcctatggtt taactgagac ttcaagatta ggttgtcaga tttgcttgaa aaagtctatg 300
gataatatga ctgtcagagt tccagaagct gtagctgatg caagacagtc agtagattta 360
tcaaagaact ga 372
<210> 257
<211> 124
<212> PRT
<213> Guinea pig (Cavia porcellus)
<400> 257
Met Ser Ser Ser Glu Asp Lys Ile Thr Ile His Phe Ile Asn Arg Asp
1 5 10 15
Gly Glu Lys Leu Thr Thr Gln Gly Lys Ile Gly Asp Ser Leu Leu Asp
20 25 30
Val Val Val Glu Asn Asn Leu Asp Ile Asp Gly Phe Gly Ala Cys Glu
35 40 45
Gly Thr Leu Ala Cys Ser Thr Cys His Leu Ile Phe Glu Asp His Ile
50 55 60
Tyr Glu Lys Leu Asp Ala Ile Thr Asp Glu Glu Asn Asp Met Leu Asp
65 70 75 80
Leu Ala Tyr Gly Leu Thr Asp Arg Ser Arg Leu Gly Cys Gln Ile Tyr
85 90 95
Leu Thr Lys Ser Met Asp Asn Met Thr Val Arg Val Pro Asp Ala Val
100 105 110
Ala Asp Ala Arg Gln Ser Val Asp Val Gly Lys Asn
115 120
<210> 258
<211> 375
<212> DNA
<213> Guinea pig (Cavia porcellus)
<400> 258
atgtcttctt cagaagataa gattactatt cactttatca atagagacgg agaaaagttg 60
acaacccaag gtaagattgg agattctttg ttagacgttg tcgtagagaa taatttagac 120
attgatggtt ttggagcctg cgaaggaacc ttagcttgtt ctacctgtca tttgattttc 180
gaggatcaca tctatgagaa gttagatgca attaccgacg aggagaatga catgttagat 240
ttagcctatg gtttaaccga cagatcaagg ttaggttgtc agatctactt gactaaatct 300
atggataaca tgactgttag ggttccagat gccgttgcag atgctagaca gtctgttgac 360
gttggtaaaa actga 375
<210> 259
<211> 124
<212> PRT
<213> Garnett's greater galago (Otolemur garnetti)
<400> 259
Met Ser Ser Ser Glu Asp Lys Val Thr Val His Phe Val Asn Arg Asp
1 5 10 15
Gly Glu Thr Ile Thr Ala Lys Gly Lys Val Gly Asp Ser Leu Leu Asp
20 25 30
Val Val Val Glu Asn Asn Leu Asp Ile Asp Gly Phe Gly Ala Cys Glu
35 40 45
Gly Thr Leu Ala Cys Ser Thr Cys His Leu Ile Phe Glu Glu His Ile
50 55 60
Phe Glu Lys Leu Asp Ala Ile Thr Asp Glu Glu Asn Asp Met Leu Asp
65 70 75 80
Leu Ala Phe Gly Leu Thr Asp Arg Ser Arg Leu Gly Cys Gln Val Cys
85 90 95
Leu Thr Lys Ser Met Asp Asn Met Thr Val Arg Val Pro Glu Ala Val
100 105 110
Ala Asp Ala Arg Gln Ser Met Asp Met Gly Lys Thr
115 120
<210> 260
<211> 375
<212> DNA
<213> Garnett's greater galago (Otolemur garnetti)
<400> 260
atgtcttctt ctgaggataa ggtcacagtt cattttgtaa acagagacgg agaaacaata 60
acagctaaag gaaaagttgg tgattcattg ttagatgtcg ttgtagaaaa taacttggat 120
attgacggtt ttggtgcatg tgaaggtaca ttagcctgct caacatgcca cttgattttt 180
gaagaacata ttttcgagaa attggacgcc ataactgacg aggaaaatga tatgttagat 240
ttggccttcg gtttgacaga tagatctaga ttgggttgcc aagtttgttt aactaaatca 300
atggataaca tgactgttag agtaccagaa gccgttgctg atgcaagaca gtctatggat 360
atgggtaaga cttga 375
<210> 261
<211> 122
<212> PRT
<213> Southern platyfish (Xiphophorus maculatus)
<400> 261
Met Leu Arg Ser Asp Ser Lys Val Thr Val His Phe Ile Asn Arg Asp
1 5 10 15
Gly Glu Lys Ile Thr Ala Lys Ala Ser Pro Gly Asp Ser Leu Leu Asp
20 25 30
Val Val Ile Asn Glu Asp Leu Asp Phe Asp Gly Phe Gly Ala Cys Glu
35 40 45
Gly Thr Leu Ala Cys Ser Thr Cys His Leu Ile Phe Asp Glu Glu Met
50 55 60
Tyr Lys Lys Leu Gly Pro Val Thr Asp Glu Glu Met Asp Met Leu Asp
65 70 75 80
Leu Ala Tyr Gly Leu Thr Glu Thr Ser Arg Leu Gly Cys Gln Ile Cys
85 90 95
Leu Thr Lys Ser Leu Glu Gly Met Val Ala Arg Val Pro Glu Ser Val
100 105 110
Ala Asp Ile Arg Gln Thr Lys Asp Gly Ser
115 120
<210> 262
<211> 369
<212> DNA
<213> Southern platyfish (Xiphophorus maculatus)
<400> 262
atgttgaggt ctgattcaaa ggttactgtt cattttatca atagagacgg tgagaagatc 60
acagctaaag cctcacctgg tgattcatta ttagatgttg taattaacga agatttggat 120
ttcgatggat ttggtgcttg cgagggaaca ttggcctgct ctacctgcca tttgatattt 180
gatgaagaaa tgtacaagaa gttgggacca gtaacagacg aagagatgga catgttggat 240
ttagcttatg gtttaacaga aacttcaaga ttgggatgtc aaatctgttt gaccaagtct 300
ttagaaggta tggtagcaag agttcctgaa tctgttgccg atattagaca gacaaaggat 360
ggatcttga 369
<210> 263
<211> 526
<212> PRT
<213> Brewer's yeast (Saccharomyces cerevisiae)
<400> 263
Met Ala Asp Gln Leu Val Lys Thr Glu Val Thr Lys Lys Ser Phe Thr
1 5 10 15
Ala Pro Val Gln Lys Ala Ser Thr Pro Val Leu Thr Asn Lys Thr Val
20 25 30
Ile Ser Gly Ser Lys Val Lys Ser Leu Ser Ser Ala Gln Ser Ser Ser
35 40 45
Ser Gly Pro Ser Ser Ser Ser Glu Glu Asp Asp Ser Arg Asp Ile Glu
50 55 60
Ser Leu Asp Lys Lys Ile Arg Pro Leu Glu Glu Leu Glu Ala Leu Leu
65 70 75 80
Ser Ser Gly Asn Thr Lys Gln Leu Lys Asn Lys Glu Val Ala Ala Leu
85 90 95
Val Ile His Gly Lys Leu Pro Leu Tyr Ala Leu Glu Lys Lys Leu Gly
100 105 110
Asp Thr Thr Arg Ala Val Ala Val Arg Arg Lys Ala Leu Ser Ile Leu
115 120 125
Ala Glu Ala Pro Val Leu Ala Ser Asp Arg Leu Pro Tyr Lys Asn Tyr
130 135 140
Asp Tyr Asp Arg Val Phe Gly Ala Cys Cys Glu Asn Val Ile Gly Tyr
145 150 155 160
Met Pro Leu Pro Val Gly Val Ile Gly Pro Leu Val Ile Asp Gly Thr
165 170 175
Ser Tyr His Ile Pro Met Ala Thr Thr Glu Gly Cys Leu Val Ala Ser
180 185 190
Ala Met Arg Gly Cys Lys Ala Ile Asn Ala Gly Gly Gly Ala Thr Thr
195 200 205
Val Leu Thr Lys Asp Gly Met Thr Arg Gly Pro Val Val Arg Phe Pro
210 215 220
Thr Leu Lys Arg Ser Gly Ala Cys Lys Ile Trp Leu Asp Ser Glu Glu
225 230 235 240
Gly Gln Asn Ala Ile Lys Lys Ala Phe Asn Ser Thr Ser Arg Phe Ala
245 250 255
Arg Leu Gln His Ile Gln Thr Cys Leu Ala Gly Asp Leu Leu Phe Met
260 265 270
Arg Phe Arg Thr Thr Thr Gly Asp Ala Met Gly Met Asn Met Ile Ser
275 280 285
Lys Gly Val Glu Tyr Ser Leu Lys Gln Met Val Glu Glu Tyr Gly Trp
290 295 300
Glu Asp Met Glu Val Val Ser Val Ser Gly Asn Tyr Cys Thr Asp Lys
305 310 315 320
Lys Pro Ala Ala Ile Asn Trp Ile Glu Gly Arg Gly Lys Ser Val Val
325 330 335
Ala Glu Ala Thr Ile Pro Gly Asp Val Val Arg Lys Val Leu Lys Ser
340 345 350
Asp Val Ser Ala Leu Val Glu Leu Asn Ile Ala Lys Asn Leu Val Gly
355 360 365
Ser Ala Met Ala Gly Ser Val Gly Gly Phe Asn Ala His Ala Ala Asn
370 375 380
Leu Val Thr Ala Val Phe Leu Ala Leu Gly Gln Asp Pro Ala Gln Asn
385 390 395 400
Val Glu Ser Ser Asn Cys Ile Thr Leu Met Lys Glu Val Asp Gly Asp
405 410 415
Leu Arg Ile Ser Val Ser Met Pro Ser Ile Glu Val Gly Thr Ile Gly
420 425 430
Gly Gly Thr Val Leu Glu Pro Gln Gly Ala Met Leu Asp Leu Leu Gly
435 440 445
Val Arg Gly Pro His Ala Thr Ala Pro Gly Thr Asn Ala Arg Gln Leu
450 455 460
Ala Arg Ile Val Ala Cys Ala Val Leu Ala Gly Glu Leu Ser Leu Cys
465 470 475 480
Ala Ala Leu Ala Ala Gly His Leu Val Gln Ser His Met Thr His Asn
485 490 495
Arg Lys Pro Ala Glu Pro Thr Lys Pro Asn Asn Leu Asp Ala Thr Asp
500 505 510
Ile Asn Arg Leu Lys Asp Gly Ser Val Thr Cys Ile Lys Ser
515 520 525
<210> 264
<211> 1581
<212> DNA
<213> Brewer's yeast (Saccharomyces cerevisiae)
<400> 264
atggcagacc aattggtgaa aactgaagtc accaagaagt cttttactgc tcctgtacaa 60
aaggcttcta caccagtttt aaccaataaa acagtcattt ctggatcgaa agtcaaaagt 120
ttatcatctg cgcaatcgag ctcatcagga ccttcatcat ctagtgagga agatgattcc 180
cgcgatattg aaagcttgga taagaaaata cgtcctttag aagaattaga agcattatta 240
agtagtggaa atacaaaaca attgaagaac aaagaggtcg ctgccttggt tattcacggt 300
aagttacctt tgtacgcttt ggagaaaaaa ttaggtgata ctacgagagc ggttgcggta 360
cgtaggaagg ctctttcaat tttggcagaa gctcctgtat tagcatctga tcgtttacca 420
tataaaaatt atgactacga ccgcgtattt ggcgcttgtt gtgaaaatgt tataggttac 480
atgcctttgc ccgttggtgt tataggcccc ttggttatcg atggtacatc ttatcatata 540
ccaatggcaa ctacagaggg ttgtttggta gcttctgcca tgcgtggctg taaggcaatc 600
aatgctggcg gtggtgcaac aactgtttta actaaggatg gtatgacaag aggcccagta 660
gtccgtttcc caactttgaa aagatctggt gcctgtaaga tatggttaga ctcagaagag 720
ggacaaaacg caattaaaaa agcttttaac tctacatcaa gatttgcacg tctgcaacat 780
attcaaactt gtctagcagg agatttactc ttcatgagat ttagaacaac tactggtgac 840
gcaatgggta tgaatatgat ttctaaaggt gtcgaatact cattaaagca aatggtagaa 900
gagtatggct gggaagatat ggaggttgtc tccgtttctg gtaactactg taccgacaaa 960
aaaccagctg ccatcaactg gatcgaaggt cgtggtaaga gtgtcgtcgc agaagctact 1020
attcctggtg atgttgtcag aaaagtgtta aaaagtgatg tttccgcatt ggttgagttg 1080
aacattgcta agaatttggt tggatctgca atggctgggt ctgttggtgg atttaacgca 1140
catgcagcta atttagtgac agctgttttc ttggcattag gacaagatcc tgcacaaaat 1200
gttgaaagtt ccaactgtat aacattgatg aaagaagtgg acggtgattt gagaatttcc 1260
gtatccatgc catccatcga agtaggtacc atcggtggtg gtactgttct agaaccacaa 1320
ggtgccatgt tggacttatt aggtgtaaga ggcccgcatg ctaccgctcc tggtaccaac 1380
gcacgtcaat tagcaagaat agttgcctgt gccgtcttgg caggtgaatt atccttatgt 1440
gctgccctag cagccggcca tttggttcaa agtcatatga cccacaacag gaaacctgct 1500
gaaccaacaa aacctaacaa tttggacgcc actgatataa atcgtttgaa agatgggtcc 1560
gtcacctgca ttaaatccta a 1581
<210> 265
<211> 501
<212> PRT
<213> Human (Homo sapiens)
<400> 265
Met Val Leu Trp Gly Pro Val Leu Gly Ala Leu Leu Val Val Ile Ala
1 5 10 15
Gly Tyr Leu Cys Leu Pro Gly Met Leu Arg Gln Arg Arg Pro Trp Glu
20 25 30
Pro Pro Leu Asp Lys Gly Thr Val Pro Trp Leu Gly His Ala Met Ala
35 40 45
Phe Arg Lys Asn Met Phe Glu Phe Leu Lys Arg Met Arg Thr Lys His
50 55 60
Gly Asp Val Phe Thr Val Gln Leu Gly Gly Gln Tyr Phe Thr Phe Val
65 70 75 80
Met Asp Pro Leu Ser Phe Gly Ser Ile Leu Lys Asp Thr Gln Arg Lys
85 90 95
Leu Asp Phe Gly Gln Tyr Ala Lys Lys Leu Val Leu Lys Val Phe Gly
100 105 110
Tyr Arg Ser Val Gln Gly Asp His Glu Met Ile His Ser Ala Ser Thr
115 120 125
Lys His Leu Arg Gly Asp Gly Leu Lys Asp Leu Asn Glu Thr Met Leu
130 135 140
Asp Ser Leu Ser Phe Val Met Leu Thr Ser Lys Gly Trp Ser Leu Asp
145 150 155 160
Ala Ser Cys Trp His Glu Asp Ser Leu Phe Arg Phe Cys Tyr Tyr Ile
165 170 175
Leu Phe Thr Ala Gly Tyr Leu Ser Leu Phe Gly Tyr Thr Lys Asp Lys
180 185 190
Glu Gln Asp Leu Leu Gln Ala Gly Glu Leu Phe Met Glu Phe Arg Lys
195 200 205
Phe Asp Leu Leu Phe Pro Arg Phe Val Tyr Ser Leu Leu Trp Pro Arg
210 215 220
Glu Trp Leu Glu Val Gly Arg Leu Gln Arg Leu Phe His Lys Met Leu
225 230 235 240
Ser Val Ser His Ser Gln Glu Lys Glu Gly Ile Ser Asn Trp Leu Gly
245 250 255
Asn Met Leu Gln Phe Leu Arg Glu Gln Gly Val Pro Ser Ala Met Gln
260 265 270
Asp Lys Phe Asn Phe Met Met Leu Trp Ala Ser Gln Gly Asn Thr Gly
275 280 285
Pro Thr Ser Phe Trp Ala Leu Leu Tyr Leu Leu Lys His Pro Glu Ala
290 295 300
Ile Arg Ala Val Arg Glu Glu Ala Thr Gln Val Leu Gly Glu Ala Arg
305 310 315 320
Leu Glu Thr Lys Gln Ser Phe Ala Phe Lys Leu Gly Ala Leu Gln His
325 330 335
Thr Pro Val Leu Asp Ser Val Val Glu Glu Thr Leu Arg Leu Arg Ala
340 345 350
Ala Pro Thr Leu Leu Arg Leu Val His Glu Asp Tyr Thr Leu Lys Met
355 360 365
Ser Ser Gly Gln Glu Tyr Leu Phe Arg His Gly Asp Ile Leu Ala Leu
370 375 380
Phe Pro Tyr Leu Ser Val His Met Asp Pro Asp Ile His Pro Glu Pro
385 390 395 400
Thr Val Phe Lys Tyr Asp Arg Phe Leu Asn Pro Asn Gly Ser Arg Lys
405 410 415
Val Asp Phe Phe Lys Thr Gly Lys Lys Ile His His Tyr Thr Met Pro
420 425 430
Trp Gly Ser Gly Val Ser Ile Cys Pro Gly Arg Phe Phe Ala Leu Ser
435 440 445
Glu Val Lys Leu Phe Ile Leu Leu Met Val Thr His Phe Asp Leu Glu
450 455 460
Leu Val Asp Pro Asp Thr Pro Leu Pro His Val Asp Pro Gln Arg Trp
465 470 475 480
Gly Phe Gly Thr Met Gln Pro Ser His Asp Val Arg Phe Arg Tyr Arg
485 490 495
Leu His Pro Thr Glu
500
<210> 266
<211> 1506
<212> DNA
<213> Homo sapiens
<400> 266
atggttttgt ggggtccagt tttaggtgct ttgttagttg ttattgcagg ttatttgtgt 60
ttgccaggca tgttgagaca aagaagacca tgggaaccac cattggataa aggtactgtt 120
ccatggttag gtcatgctat ggcttttaga aagaatatgt tcgaattctt gaaaagaatg 180
agaactaaac atggtgacgt ttttacagtt caattaggtg gtcaatactt cactttcgtt 240
atggacccat tgtcatttgg ttctatcttg aaggatacac aaagaaagtt ggatttcggt 300
caatacgcta agaaattggt tttgaaggtt ttcggttaca gatctgttca aggtgaccat 360
gaaatgatcc attctgcatc aacaaagcat ttgagaggtg acggtttgaa ggatttgaac 420
gaaactatgt tggattcttt gtcattcgtt atgttgacat caaaaggttg gtctttagat 480
gcatcatgtt ggcatgaaga ttctttgttt agattctgtt actacatctt gtttactgct 540
ggttatttgt cattgttcgg ttacacaaag gataaggaac aagatttgtt acaagctggt 600
gaattgttta tggaattcag aaagttcgat ttgttatttc caagatttgt ttattctttg 660
ttatggccaa gagaatggtt ggaagttggt agattgcaaa gattgttcca taagatgttg 720
tctgtttcac attctcaaga aaaggaaggt atctctaact ggttgggtaa catgttgcaa 780
ttcttgagag aacaaggtgt tccatcagct atgcaggata agtttaattt catgatgttg 840
tgggcatctc aaggtaatac tggtccaaca tcattctggg ctttgttgta cttgttgaag 900
catccagaag ctatcagagc agttagagaa gaagctactc aagttttggg tgaagcaaga 960
ttggaaacaa agcaatcttt cgcttttaaa ttgggtgcat tacaacatac tccagttttg 1020
gattcagttg ttgaagaaac tttgagattg agagctgcac caacattgtt aagattggtt 1080
catgaagatt acacattgaa gatgtcttca ggtcaagaat acttgtttag acatggtgac 1140
atcttggctt tgttcccata tttgtctgtt catatggacc cagatatcca tccagaacca 1200
actgttttta aatacgatag atttttaaac ccaaacggtt caagaaaggt tgatttcttt 1260
aagactggta aaaagattca tcattacaca atgccatggg gttcaggtgt ttctatttgt 1320
ccaggtagat ttttcgcttt gtctgaggtt aagttgttta ttttgttgat ggttactcat 1380
ttcgatttgg aattagttga tccagataca ccattgccac atgttgatcc acaaagatgg 1440
ggttttggta ctatgcaacc atcacatgat gttagattca gatacagatt acatccaaca 1500
gaataa 1506
<210> 267
<211> 500
<212> PRT
<213> Oryctolagus cuniculus
<400> 267
Met Val Leu Trp Gly Leu Leu Gly Ala Leu Leu Met Val Met Val Gly
1 5 10 15
Trp Leu Cys Leu Pro Gly Leu Leu Arg Gln Arg Arg Pro Gln Glu Pro
20 25 30
Pro Leu Asp Lys Gly Ser Ile Pro Trp Leu Gly His Ala Met Thr Phe
35 40 45
Arg Lys Asn Met Leu Glu Phe Leu Lys His Met Arg Ser Lys His Gly
50 55 60
Asp Val Phe Thr Val Gln Leu Gly Gly Gln Tyr Phe Thr Phe Val Met
65 70 75 80
Asp Pro Val Ser Phe Gly Pro Ile Leu Lys Asp Gly Gln Arg Lys Leu
85 90 95
Asp Phe Val Glu Tyr Ala Lys Gly Leu Val Leu Lys Val Phe Gly Tyr
100 105 110
Gln Ser Ile Glu Gly Asp His Arg Met Ile His Leu Ala Ser Thr Lys
115 120 125
His Leu Met Gly His Gly Leu Glu Glu Leu Asn Lys Ala Met Leu Asp
130 135 140
Ser Leu Ser Leu Val Met Leu Gly Pro Glu Gly Arg Ser Pro Asp Ala
145 150 155 160
Ser Arg Trp His Glu Asp Gly Leu Phe His Phe Cys Tyr Gly Val Met
165 170 175
Phe Lys Ala Gly Tyr Leu Ser Leu Phe Gly His Thr Ser Asp Lys Arg
180 185 190
Gln Asp Leu Leu Gln Ala Glu Glu Ile Phe Ile Lys Phe Arg Arg Phe
195 200 205
Asp Leu Leu Phe Pro Arg Phe Val Tyr Ser Leu Leu Gly Pro Arg Glu
210 215 220
Trp Arg Glu Val Gly Arg Leu Gln Gln Leu Phe His Glu Leu Leu Ser
225 230 235 240
Val Lys His Asn Pro Glu Lys Asp Gly Met Ser Asn Trp Ile Gly His
245 250 255
Met Leu Gln Tyr Leu Ser Glu Gln Gly Val Ala Pro Ala Met Gln Asp
260 265 270
Lys Phe Asn Phe Met Met Leu Trp Ala Ser Gln Gly Asn Thr Gly Pro
275 280 285
Ala Ser Phe Trp Ala Leu Ile Tyr Leu Leu Lys His Pro Glu Ala Met
290 295 300
Arg Ala Val Lys Glu Glu Ala Thr Arg Val Leu Gly Glu Pro Arg Leu
305 310 315 320
Glu Ala Lys Gln Ser Phe Thr Val Gln Leu Ser Ala Leu Gln His Ile
325 330 335
Pro Val Leu Asp Ser Val Met Glu Glu Thr Leu Arg Leu Gly Ala Ala
340 345 350
Pro Thr Leu Tyr Arg Val Val Gln Lys Asp Ile Leu Leu Lys Met Ala
355 360 365
Ser Gly Gln Glu Cys Leu Leu Arg Gln Gly Asp Ile Val Thr Leu Phe
370 375 380
Pro Tyr Leu Ser Val His Met Asp Pro Asp Ile His Pro Glu Pro Thr
385 390 395 400
Thr Phe Lys Tyr Asp Arg Phe Leu Asn Pro Asn Gly Ser Arg Lys Val
405 410 415
Asp Phe Tyr Lys Ala Gly Gln Lys Ile His His Tyr Thr Met Pro Trp
420 425 430
Gly Ser Gly Val Ser Ile Cys Pro Gly Arg Phe Phe Ala Leu Ser Glu
435 440 445
Met Lys Leu Phe Val Leu Leu Met Val Gln Tyr Phe Asp Leu Glu Leu
450 455 460
Val Asp Pro Asn Thr Pro Val Pro Pro Ile Asp Pro Arg Arg Trp Gly
465 470 475 480
Phe Gly Thr Met Gln Pro Thr His Asp Val Arg Ile Arg Tyr Arg Leu
485 490 495
Lys Pro Leu Glu
500
<210> 268
<211> 1503
<212> DNA
<213> Oryctolagus cuniculus
<400> 268
atggttttgt ggggtttgtt aggtgcattg ttaatggtta tggttggttg gttgtgttta 60
ccaggtttgt taagacaaag aagaccacaa gaaccaccat tggataaagg ttcaattcca 120
tggttaggtc atgctatgac ttttagaaag aatatgttgg aattcttgaa acatatgaga 180
tctaaacatg gtgacgtttt tactgttcaa ttaggtggtc aatacttcac attcgttatg 240
gacccagttt cttttggtcc aattttgaaa gatggtcaaa gaaagttgga tttcgttgaa 300
tacgctaagg gtttggtttt gaaggttttc ggttaccaat caatcgaagg tgaccataga 360
atgatccatt tggcttctac taagcatttg atgggtcatg gtttggaaga attgaataag 420
gcaatgttgg attctttgtc attagttatg ttaggtccag aaggtagatc tccagatgct 480
tcaagatggc atgaagatgg tttgttccat ttctgttacg gtgttatgtt caaggcaggt 540
tacttgtctt tgttcggtca tacatcagat aagagacaag atttgttgca agctgaagaa 600
attttcatta agtttagaag attcgatttg ttatttccaa gatttgttta ttctttgtta 660
ggtccaagag aatggagaga agttggtaga ttgcaacaat tgttccatga attgttgtct 720
gttaagcata acccagaaaa ggatggcatg tcaaactgga tcggtcatat gttgcaatat 780
ttgtctgaac aaggtgttgc tccagcaatg caggataagt ttaatttcat gatgttgtgg 840
gcatctcaag gtaatactgg tccagcttca ttctgggcat tgatatattt gttgaagcat 900
ccagaagcta tgagagcagt taaagaagaa gctactagag ttttgggtga accaagattg 960
gaagctaagc aatcttttac agttcaattg tcagcattac aacatattcc agttttggat 1020
tctgttatgg aagaaacttt gagattaggt gctgcaccaa cattatacag agttgttcaa 1080
aaggatatct tgttgaagat ggcttcaggt caagaatgtt tgttaagaca aggtgacatc 1140
gttacattgt tcccatattt gtctgttcat atggacccag atatccatcc agaaccaact 1200
acttttaaat acgatagatt tttaaatcca aacggttcta gaaaggttga tttctacaag 1260
gcaggtcaaa agattcatca ttacactatg ccatggggtt ctggtgtttc aatttgtcca 1320
ggtagatttt tcgctttgtc agaaatgaag ttgttcgttt tgttgatggt tcaatacttt 1380
gatttggaat tagttgatcc aaatacacca gttccaccaa ttgatccaag aagatggggt 1440
tttggtacta tgcaaccaac acatgatgtt agaattagat acagattgaa accattagaa 1500
taa 1503
<210> 269
<211> 500
<212> PRT
<213> Mus musculus
<400> 269
Met Thr Leu Trp Cys Thr Val Leu Gly Ala Leu Leu Thr Val Val Gly
1 5 10 15
Cys Leu Cys Leu Ser Leu Leu Leu Arg His Arg Arg Pro Trp Glu Pro
20 25 30
Pro Leu Asp Lys Gly Phe Val Pro Trp Leu Gly His Ser Met Ala Phe
35 40 45
Arg Lys Asn Met Phe Glu Phe Leu Lys Gly Met Arg Ala Lys His Gly
50 55 60
Asp Val Phe Thr Val Gln Leu Gly Gly Gln Tyr Phe Thr Phe Val Met
65 70 75 80
Asp Pro Leu Ser Phe Gly Pro Ile Ile Lys Asn Thr Glu Lys Ala Leu
85 90 95
Asp Phe Gln Ser Tyr Ala Lys Glu Leu Val Leu Lys Val Phe Gly Tyr
100 105 110
Gln Ser Val Asp Gly Asp His Arg Met Ile His Leu Ala Ser Thr Lys
115 120 125
His Leu Met Gly Gln Gly Leu Glu Glu Leu Asn Gln Ala Met Leu Asp
130 135 140
Ser Leu Ser Leu Val Met Leu Gly Pro Lys Gly Ser Ser Leu Gly Ala
145 150 155 160
Ser Ser Trp Cys Glu Asp Gly Leu Phe His Phe Cys Tyr Arg Ile Leu
165 170 175
Phe Lys Ala Gly Phe Leu Ser Leu Phe Gly Tyr Thr Lys Asp Lys Gln
180 185 190
Gln Asp Leu Asp Glu Ala Asp Glu Leu Phe Arg Lys Phe Arg Arg Phe
195 200 205
Asp Phe Leu Phe Pro Arg Phe Val Tyr Ser Leu Leu Gly Pro Arg Glu
210 215 220
Trp Val Glu Val Ser Gln Leu Gln Arg Leu Phe His Gln Arg Leu Ser
225 230 235 240
Val Glu Gln Asn Leu Glu Lys Asp Gly Ile Ser Cys Trp Leu Gly Tyr
245 250 255
Met Leu Gln Phe Leu Arg Glu Gln Gly Ile Ala Ser Ser Met Gln Asp
260 265 270
Lys Phe Asn Phe Met Met Leu Trp Ala Ser Gln Gly Asn Thr Gly Pro
275 280 285
Thr Cys Phe Trp Val Leu Leu Phe Leu Leu Lys His Gln Asp Ala Met
290 295 300
Lys Ala Val Arg Glu Glu Ala Thr Arg Val Met Gly Lys Ala Arg Leu
305 310 315 320
Glu Ala Lys Lys Ser Phe Thr Phe Thr Pro Ser Ala Leu Lys His Thr
325 330 335
Pro Val Leu Asp Ser Val Met Glu Glu Ser Leu Arg Leu Cys Ala Thr
340 345 350
Pro Thr Leu Leu Arg Val Val Gln Glu Asp Tyr Val Leu Lys Met Ala
355 360 365
Ser Gly Gln Glu Tyr Gln Ile Arg Arg Gly Asp Lys Val Ala Leu Phe
370 375 380
Pro Tyr Leu Ser Val His Met Asp Pro Asp Ile His Pro Glu Pro Thr
385 390 395 400
Ala Phe Lys Tyr Asp Arg Phe Leu Asn Pro Asp Gly Thr Arg Lys Val
405 410 415
Asp Phe Tyr Lys Ser Gly Lys Lys Ile His His Tyr Ser Met Pro Trp
420 425 430
Gly Ser Gly Val Ser Lys Cys Pro Gly Arg Phe Phe Ala Leu Ser Glu
435 440 445
Met Lys Thr Phe Val Leu Leu Met Ile Met Tyr Phe Asp Phe Lys Leu
450 455 460
Val Asp Pro Asp Ile Pro Val Pro Pro Ile Asp Pro Arg Arg Trp Gly
465 470 475 480
Phe Gly Thr Ser Gln Pro Ser His Glu Val Arg Phe Leu Tyr Arg Leu
485 490 495
Lys Pro Val Gln
500
<210> 270
<211> 1503
<212> DNA
<213> Mus musculus
<400> 270
atgactttat ggtgtacagt tttgggtgct ttgttgactg ttgttggttg tttgtgtttg 60
tctttgttgt tgagacatag aagaccatgg gaaccaccat tagataaagg ttttgttcca 120
tggttgggtc attcaatggc ttttagaaag aatatgttcg aattcttgaa gggtatgaga 180
gcaaaacatg gtgacgtttt tactgttcaa ttaggtggtc aatacttcac attcgttatg 240
gacccattgt ctttcggtcc aattattaag aatactgaaa aggctttgga tttccaatca 300
tacgcaaagg aattagtttt gaaagttttt ggttaccaat ctgttgatgg tgaccataga 360
atgatccatt tggcttcaac aaagcatttg atgggtcaag gtttggaaga attgaaccaa 420
gcaatgttgg attctttgtc attggttatg ttgggtccaa aaggttcttc attgggtgct 480
tcttcatggt gtgaagatgg tttgttccat ttctgttaca gaattttgtt taaagcaggt 540
ttcttgtctt tgttcggtta cacaaaggat aagcaacaag atttggatga agctgatgaa 600
ttgtttagaa agtttagaag attcgatttc ttgttcccaa gattcgttta ctctttgttg 660
ggtccaagag aatgggttga agtttcacaa ttgcaaagat tgttccatca aagattgtct 720
gttgaacaaa atttggaaaa ggatggtatc tcatgttggt tgggttacat gttgcaattc 780
ttgagagaac aaggtatcgc ttcttcaatg caggataagt ttaatttcat gatgttgtgg 840
gcatctcaag gtaatactgg tccaacatgt ttctgggttt tgttgttttt attgaaacat 900
caagatgcta tgaaagcagt tagagaagaa gctactagag ttatgggtaa agctagattg 960
gaagctaaga aatcttttac ttttactcca tcagcattga agcatacacc agttttggat 1020
tctgttatgg aagaatcatt gagattgtgt gctactccaa cattgttgag agttgttcaa 1080
gaagattacg ttttgaagat ggcttctggt caagaatacc aaattagaag aggtgacaag 1140
gttgcattgt tcccatattt gtcagttcat atggacccag atattcatcc agaaccaact 1200
gcttttaaat acgatagatt tttgaatcca gatggtacaa gaaaggttga tttctacaag 1260
tctggtaaaa agattcatca ttactcaatg ccatggggtt ctggtgtttc aaaatgtcca 1320
ggtagatttt tcgctttatc tgaaatgaaa acttttgttt tgttgatgat catgtacttc 1380
gatttcaaat tggttgatcc agatattcca gttccaccaa ttgatccaag aagatggggt 1440
tttggtacat ctcaaccatc acatgaagtt agatttttat acagattgaa accagttcaa 1500
taa 1503
<210> 271
<211> 501
<212> PRT
<213> Sus scrofa
<400> 271
Met Val Leu Trp Gly Pro Val Leu Gly Val Leu Leu Val Ala Ile Val
1 5 10 15
Gly Tyr Leu Cys Leu Gln Gly Leu Leu Arg Gln Arg Arg Pro Glu Glu
20 25 30
Pro Pro Leu Asp Lys Gly Ser Val Pro Trp Leu Gly His Ala Met Thr
35 40 45
Phe Arg Lys Asn Met Leu Glu Phe Leu Lys His Met Trp Ala Arg His
50 55 60
Gly Asp Ile Phe Thr Val Gln Leu Gly Gly Gln Tyr Phe Thr Phe Val
65 70 75 80
Met Asp Pro Leu Ser Phe Gly Pro Ile Leu Lys Asp Ala Lys Arg Lys
85 90 95
Leu Asp Phe Val Glu Tyr Ala Glu Lys Leu Val Leu Lys Val Phe Gly
100 105 110
Tyr Arg Ser Met Gln Gly Asp His Arg Met Ile His Ser Ala Ser Thr
115 120 125
Lys His Leu Met Gly Asp Gly Leu Glu Glu Leu Asn Lys Ala Met Leu
130 135 140
Asp Asn Leu Ser Leu Val Met Leu Gly Pro Lys Gly Pro Ser Pro Asp
145 150 155 160
Ala Ser Cys Trp Arg Glu Asp Gly Leu Phe His Phe Cys Tyr Asp Ile
165 170 175
Leu Phe Lys Ala Gly Tyr Leu Ser Leu Phe Gly Arg Thr Glu Asp Lys
180 185 190
Glu Gln Asp Leu Leu Gln Ala Glu Glu Leu Phe Met Gln Phe Arg Lys
195 200 205
Phe Asp Arg Met Phe Pro Arg Phe Val Tyr Ser Leu Leu Gly Pro Arg
210 215 220
Glu Trp Leu Glu Val Gly Arg Leu Gln Cys Leu Phe His Lys Met Leu
225 230 235 240
Ser Val Glu His Ser Leu Glu Arg His Gly Ile Ser Ser Trp Ile Thr
245 250 255
Asp Met Leu Gln Val Leu Arg Glu Gln Gly Val Ala Pro Ala Met Gln
260 265 270
Asp Lys Phe Asn Phe Met Met Leu Trp Ala Ser Gln Gly Asn Thr Gly
275 280 285
Pro Thr Thr Phe Trp Ala Leu Leu Phe Leu Leu Lys His Pro Glu Ala
290 295 300
Met Arg Ala Val Arg Glu Glu Ala Thr Arg Val Leu Gly Glu Ala Arg
305 310 315 320
Leu Glu Asp Lys Gln Ser Phe Asp Val Glu Val Ser Ala Leu Asn His
325 330 335
Met Pro Val Leu Asp Ser Val Met Glu Glu Thr Leu Arg Leu Gly Ala
340 345 350
Ala Pro Thr Leu Leu Arg Val Val Asn Ser Asp Gln Ile Leu Lys Met
355 360 365
Ala Ser Gly Gln Glu Tyr Arg Leu Arg His Gly Asp Ile Leu Ala Leu
370 375 380
Phe Pro Tyr Leu Ser Val His Met Asp Pro Asp Ile His Pro Glu Pro
385 390 395 400
Thr Thr Phe Lys Tyr Asp Arg Phe Leu Thr Pro Ser Gly Ser Arg Lys
405 410 415
Val Asn Phe Tyr Lys Ala Gly Lys Lys Ile His His Tyr Thr Met Pro
420 425 430
Trp Gly Ser Gly Ile Ser Ile Cys Pro Gly Arg Phe Phe Ala Leu Thr
435 440 445
Glu Met Lys Leu Phe Val Leu Leu Met Val Thr His Phe Asp Leu Glu
450 455 460
Leu Val Asp Pro Asp Thr Pro Val Pro Pro Val Asp Pro Gln Arg Trp
465 470 475 480
Gly Phe Gly Thr Met Gln Pro Ser Tyr Glu Val Arg Phe Arg Tyr Arg
485 490 495
Leu Arg Pro Thr Glu
500
<210> 272
<211> 1506
<212> DNA
<213> Sus scrofa
<400> 272
atggttttgt ggggtccagt tttaggtgtt ttgttagttg ctatcgttgg ttatttgtgt 60
ttgcaaggtt tgttgagaca aagaagacca gaagaaccac cattggataa aggttctgtt 120
ccatggttag gtcatgctat gacttttaga aagaatatgt tggaattctt gaaacatatg 180
tgggcaagac atggtgacat ttttactgtt caattgggtg gtcaatactt tacatttgtt 240
atggacccat tgtcttttgg tccaatcttg aaggatgcta agagaaagtt ggattttgtt 300
gaatatgcag aaaaattggt tttaaaagtt tttggttaca gatcaatgca aggtgaccat 360
agaatgatcc attctgcttc aacaaagcat ttgatgggtg acggtttgga agaattgaat 420
aaggcaatgt tggataattt gtcattagtt atgttgggtc caaaaggtcc atctccagat 480
gcttcatgtt ggagagaaga tggtttgttc catttctgtt acgatatctt gtttaaagca 540
ggttacttgt ctttgttcgg tagaactgaa gataaggaac aagatttgtt gcaagctgaa 600
gaattgttta tgcaattcag aaagttcgat agaatgttcc caagattcgt ttactcattg 660
ttgggtccaa gagaatggtt ggaagttggt agattgcaat gtttgttcca taagatgttg 720
tctgttgaac attcattgga aagacatggt atctcttcat ggatcactga tatgttgcaa 780
gttttgagag aacaaggtgt tgctccagca atgcaggata agtttaattt catgatgttg 840
tgggcttctc aaggtaatac aggtccaact acattctggg cattgttatt tttgttgaag 900
catccagaag ctatgagagc agttagagaa gaagctacta gagttttggg tgaagcaaga 960
ttggaagata agcaatcttt cgatgttgaa gtttcagctt tgaatcatat gccagttttg 1020
gattctgtta tggaagaaac tttgagatta ggtgctgcac caacattgtt aagagttgtt 1080
aactctgatc aaatcttgaa gatggcttca ggtcaagaat acagattgag acatggtgac 1140
atcttggcat tatttccata cttgtcagtt catatggacc cagatatcca tccagaacca 1200
actactttta aatacgatag atttttaaca ccatctggtt caagaaaggt taacttctac 1260
aaggcaggta aaaagattca tcattacact atgccatggg gttctggtat ttcaatttgt 1320
ccaggtagat ttttcgcttt gactgaaatg aagttgttcg ttttgttgat ggttacacat 1380
ttcgatttgg aattagttga tccagatact ccagttccac cagttgatcc acaaagatgg 1440
ggttttggta caatgcaacc atcttacgaa gttagattca gatacagatt gagaccaact 1500
gaataa 1506
<210> 273
<211> 499
<212> PRT
<213> Rattus norvegicus
<400> 273
Met Leu Trp Gly Ser Val Leu Gly Ala Leu Leu Met Ala Val Gly Cys
1 5 10 15
Leu Cys Leu Ser Leu Leu Pro Arg His Arg Arg Pro Trp Glu Pro Pro
20 25 30
Leu Asp Lys Gly Phe Val Pro Trp Leu Gly His Thr Met Ala Phe Arg
35 40 45
Lys Asn Met Phe Glu Phe Leu Lys Gly Met Arg Ala Lys His Gly Asp
50 55 60
Val Phe Thr Leu Gln Leu Gly Gly Gln Tyr Phe Thr Phe Val Met Asp
65 70 75 80
Pro Leu Ser Phe Gly Pro Ile Ile Lys Ser Thr Gln Lys Val Leu Asp
85 90 95
Phe Val Thr Tyr Ala Arg Glu Leu Val Phe Lys Val Phe Gly Tyr Gln
100 105 110
Ser Met Asp Glu Asp His Gln Met Leu His Val Ala Ser Thr Lys His
115 120 125
Leu Met Gly Gln Gly Leu Glu Asp Leu Asn Arg Ala Met Leu Asp Ser
130 135 140
Leu Ser Leu Val Met Leu Gly Pro Lys Gly Arg Ser Leu Gly Ala Arg
145 150 155 160
Ser Trp Cys Glu Asp Gly Leu Phe His Phe Cys Tyr Ser Ile Leu Phe
165 170 175
Lys Ala Gly Phe Leu Ser Leu Phe Gly Cys Thr Lys Asp Lys Glu Gln
180 185 190
Asp Leu Asp Glu Ala Asp Glu Leu Phe Arg Lys Phe Arg Arg Phe Asp
195 200 205
Leu Leu Phe Pro Arg Phe Val Tyr Ser Leu Leu Gly Pro Leu Glu Trp
210 215 220
Val Glu Val Ser Gln Leu Gln Arg Leu Phe His Gln Arg Leu Ser Val
225 230 235 240
Glu Gln Asn Leu Glu Lys Asp Gly Ile Ser Asn Trp Leu Gly Phe Met
245 250 255
Leu Arg Phe Leu Arg Glu Arg Gly Met Ala Ser Ser Met Gln Asp Lys
260 265 270
Phe Asn Phe Met Met Leu Trp Ala Ser Gln Gly Asn Thr Gly Pro Thr
275 280 285
Cys Phe Trp Ala Leu Leu Phe Leu Leu Lys His Gln Asp Ala Met Lys
290 295 300
Ala Val Arg Glu Glu Ala Thr Arg Val Leu Gly Glu Ala Arg Leu Glu
305 310 315 320
Ala Glu Thr Ser Phe Ala Phe Thr Leu Ser Ala Leu Lys Cys Thr Pro
325 330 335
Val Leu Asp Ser Val Met Glu Glu Thr Leu Arg Leu Cys Ala Thr Pro
340 345 350
Thr Leu Leu Gly Val Val Gln Glu Asp Tyr Val Leu Lys Met Ala Ser
355 360 365
Gly Gln Glu Tyr Gln Ile Arg Arg Gly Asp Lys Val Ala Leu Phe Pro
370 375 380
Tyr Leu Ser Val His Met Asp Pro Asp Ile His Pro Glu Pro Thr Thr
385 390 395 400
Phe Lys Tyr Asn Arg Phe Leu Asn Pro Asp Gly Thr Arg Lys Val Asp
405 410 415
Phe Tyr Lys Ser Gly Lys Lys Ile His His Tyr Asn Met Pro Trp Gly
420 425 430
Ser Gly Val Ser Ile Cys Pro Gly Arg Phe Phe Ala Pro Ser Glu Met
435 440 445
Lys Thr Phe Val Leu Leu Met Val Met Tyr Phe Asp Phe Glu Leu Val
450 455 460
Asp Pro Asp Met Pro Val Pro Pro Ile Asp Pro Arg Arg Trp Gly Phe
465 470 475 480
Gly Thr Ser Gln Pro Ser His Glu Val Arg Phe Arg Tyr Arg Leu Lys
485 490 495
Pro Met Gln
<210> 274
<211> 1500
<212> DNA
<213> Rattus norvegicus
<400> 274
atgttgtggg gttctgtttt aggtgctttg ttaatggcag ttggttgttt gtgtttatca 60
ttgttaccaa gacatagaag accatgggaa ccaccattgg ataaaggttt tgttccatgg 120
ttaggtcata ctatggcttt tagaaagaat atgttcgaat tcttgaaggg tatgagagca 180
aagcatggtg acgtttttac tttgcaatta ggtggtcaat acttcacatt cgttatggac 240
ccattgtctt tcggtccaat tattaagtca actcaaaagg ttttggattt cgttacatac 300
gcaagagaat tagtttttaa agtttttggt taccaatcta tggatgaaga tcatcaaatg 360
ttgcatgttg cttcaactaa acatttgatg ggtcaaggtt tggaagattt gaatagagca 420
atgttggatt ctttgtcatt agttatgttg ggtccaaaag gtagatcttt aggtgctaga 480
tcatggtgtg aagatggttt gttccatttc tgttactcta tcttgtttaa agcaggtttc 540
ttgtcattgt tcggttgtac aaaggataag gaacaagatt tggatgaagc tgatgaattg 600
tttagaaagt ttagaagatt cgatttgtta tttccaagat ttgtttactc tttgttaggt 660
ccattagaat gggttgaagt ttcacaattg caaagattgt tccatcaaag attgtctgtt 720
gaacaaaatt tggaaaagga tggtatctca aactggttgg gttttatgtt gagattttta 780
agagaacgtg gtatggcttc ttcaatgcag gataagttta atttcatgat gttgtgggct 840
tctcaaggta atactggtcc aacatgtttc tgggcattgt tatttttgtt gaagcatcaa 900
gatgctatga aagcagttag agaagaagca actagagttt tgggtgaagc tagattagaa 960
gcagaaactt ctttcgcttt tacattgtca gcattgaaat gtactccagt tttggattct 1020
gttatggaag aaacattgag attgtgtgct actccaacat tgttaggtgt tgttcaagaa 1080
gattacgttt tgaagatggc ttctggtcaa gaataccaaa ttagaagagg tgacaaagtt 1140
gcattgtttc catatttgtc agttcatatg gacccagata tccatccaga accaactact 1200
tttaaataca acagattttt gaatccagat ggtacaagaa aggttgattt ctacaagtct 1260
ggtaaaaaga ttcatcatta caacatgcca tggggttctg gtgtttcaat ttgtccaggt 1320
agatttttcg ctccatcaga aatgaagact ttcgttttgt tgatggttat gtacttcgat 1380
ttcgaattgg ttgatccaga tatgccagtt ccaccaattg atccaagaag atggggtttt 1440
ggtacatctc aaccatcaca tgaagttaga ttcagataca gattgaagcc aatgcaataa 1500
<210> 275
<211> 496
<212> PRT
<213> Homo sapiens
<400> 275
Met Val Leu Trp Gly Pro Val Leu Gly Ala Leu Leu Val Val Ile Ala
1 5 10 15
Gly Tyr Leu Cys Leu Pro Gly Met Leu Arg Gln Arg Arg Pro Trp Glu
20 25 30
Pro Pro Leu Asp Lys Gly Thr Val Pro Trp Leu Gly His Ala Met Ala
35 40 45
Phe Arg Lys Asn Met Phe Glu Phe Leu Lys Arg Met Arg Thr Lys His
50 55 60
Gly Asp Val Phe Thr Val Gln Leu Gly Gly Gln Tyr Phe Thr Phe Val
65 70 75 80
Met Asp Pro Leu Ser Phe Gly Ser Ile Leu Lys Asp Thr Gln Arg Lys
85 90 95
Leu Asp Phe Gly Gln Tyr Ala Lys Lys Leu Val Leu Lys Val Phe Gly
100 105 110
Tyr Arg Ser Val Gln Gly Asp His Glu Met Ile His Ser Ala Ser Thr
115 120 125
Lys His Leu Arg Gly Asp Gly Leu Lys Asp Leu Asn Glu Thr Met Leu
130 135 140
Asp Ser Leu Ser Phe Val Met Leu Thr Ser Lys Gly Trp Ser Leu Asp
145 150 155 160
Ala Ser Cys Trp His Glu Asp Ser Leu Phe Arg Phe Cys Tyr Tyr Ile
165 170 175
Leu Phe Thr Ala Gly Tyr Leu Ser Leu Phe Gly Tyr Thr Lys Asp Lys
180 185 190
Glu Gln Asp Leu Leu Gln Ala Gly Glu Leu Phe Met Glu Phe Arg Lys
195 200 205
Phe Asp Leu Leu Phe Pro Arg Phe Val Tyr Ser Leu Leu Trp Pro Arg
210 215 220
Glu Trp Leu Glu Val Gly Arg Leu Gln Arg Leu Phe His Lys Met Leu
225 230 235 240
Ser Val Ser His Ser Gln Glu Lys Glu Gly Ile Ser Asn Trp Leu Gly
245 250 255
Asn Met Leu Gln Phe Leu Arg Glu Gln Gly Val Pro Ser Ala Met Gln
260 265 270
Asp Lys Phe Asn Phe Met Met Leu Trp Ala Ser Gln Gly Asn Thr Gly
275 280 285
Pro Thr Ser Phe Trp Ala Leu Leu Tyr Leu Leu Lys His Pro Glu Ala
290 295 300
Ile Arg Ala Val Arg Glu Glu Ala Thr Gln Val Leu Gly Glu Ala Arg
305 310 315 320
Leu Glu Thr Lys Gln Ser Phe Ala Phe Lys Leu Gly Ala Leu Gln His
325 330 335
Thr Pro Val Leu Asp Ser Val Val Glu Glu Thr Leu Arg Leu Arg Ala
340 345 350
Ala Pro Thr Leu Leu Arg Leu Val His Glu Asp Tyr Thr Leu Lys Met
355 360 365
Ser Ser Gly Gln Glu Tyr Leu Phe Arg His Gly Asp Ile Leu Ala Leu
370 375 380
Phe Pro Tyr Leu Ser Val His Met Asp Pro Asp Ile His Pro Glu Pro
385 390 395 400
Thr Val Phe Lys Tyr Asp Arg Phe Leu Asn Pro Asn Gly Ser Arg Lys
405 410 415
Val Asp Phe Phe Lys Thr Gly Lys Lys Ile His His Tyr Thr Met Pro
420 425 430
Trp Gly Ser Gly Val Ser Ile Cys Pro Gly Arg Phe Phe Ala Leu Ser
435 440 445
Glu Ala Asp Phe Glu Arg Asn Val Asp Ser Gly Pro Gln Leu Thr Gln
450 455 460
Ser Ile Tyr Asp Thr Met Ile Ser Trp Leu Met Glu Leu Phe Ser Ala
465 470 475 480
Ala Glu Thr Glu Pro Leu Leu Arg Glu Pro Trp Ser Pro Pro Thr Leu
485 490 495
<210> 276
<211> 1491
<212> DNA
<213> Homo sapiens
<400> 276
atggttttgt ggggtccagt tttaggtgct ttgttagttg ttattgcagg ttatttgtgt 60
ttgccaggca tgttgagaca aagaagacca tgggaaccac cattggataa aggtactgtt 120
ccatggttag gtcatgctat ggcttttaga aagaatatgt tcgaattctt gaaaagaatg 180
agaactaaac atggtgacgt ttttacagtt caattaggtg gtcaatactt cactttcgtt 240
atggacccat tgtcatttgg ttctatcttg aaggatacac aaagaaagtt ggatttcggt 300
caatacgcta agaaattggt tttgaaggtt ttcggttaca gatctgttca aggtgaccat 360
gaaatgatcc attctgcttc aacaaagcat ttgagaggtg acggtttgaa ggatttgaac 420
gaaactatgt tggattcttt gtcattcgtt atgttgacat caaaaggttg gtctttagat 480
gcatcatgtt ggcatgaaga ttctttgttt agattctgtt actacatctt gtttactgct 540
ggttatttgt cattgttcgg ttacacaaag gataaggaac aagatttgtt acaagctggt 600
gaattgttta tggaattcag aaagttcgat ttgttatttc caagatttgt ttattctttg 660
ttatggccaa gagaatggtt ggaagttggt agattgcaaa gattgttcca taagatgttg 720
tctgtttcac attctcaaga aaaggaaggt atctctaact ggttgggtaa catgttgcaa 780
ttcttgagag aacaaggtgt tccatcagct atgcaggata agtttaattt catgatgttg 840
tgggcatctc aaggtaatac tggtccaaca tcattctggg ctttgttgta cttgttgaag 900
catccagaag ctatcagagc agttagagaa gaagctactc aagttttggg tgaagcaaga 960
ttggaaacaa agcaatcttt cgcttttaaa ttgggtgcat tacaacatac tccagttttg 1020
gattcagttg ttgaagaaac tttgagattg agagctgcac caacattgtt aagattggtt 1080
catgaagatt acacattgaa gatgtcttca ggtcaagaat acttgtttag acatggtgac 1140
atcttggctt tgttcccata tttgtctgtt catatggacc cagatatcca tccagaacca 1200
actgttttta aatacgatag atttttaaac ccaaacggtt caagaaaggt tgatttcttt 1260
aagactggta aaaagattca tcattacaca atgccatggg gttcaggtgt ttctatttgt 1320
ccaggtagat ttttcgcttt gtctgaagca gattttgaaa gaaatgttga ttctggtcca 1380
caattgactc aatcaatcta tgatacaatg atctcttggt tgatggaatt attttcagct 1440
gcagaaactg aaccattgtt aagagaacca tggtcaccac caacattgta a 1491
<210> 277
<211> 510
<212> PRT
<213> Danio rerio
<400> 277
Met Ala Leu Val Gln Ile Leu Leu Ala Leu Leu Ile Ser Val Ile Gly
1 5 10 15
Ala Leu Tyr Leu Leu Gly Ser Phe Arg Arg Arg Arg Thr Gly Glu Pro
20 25 30
Pro Leu Glu Lys Gly Pro Ile Pro Trp Leu Gly His Val Leu Glu Phe
35 40 45
Arg Lys Asp Thr Ala Lys Phe Leu Asn Arg Met Lys Ala Lys His Gly
50 55 60
Asp Ile Phe Thr Val Gln Leu Gly Gly Phe Tyr Phe Thr Phe Ile Thr
65 70 75 80
Asp Pro Leu Ser Phe Gly Ala Val Val Lys Glu Ala Arg Ala Lys Leu
85 90 95
Asp Phe Thr Lys Phe Ala Glu Gln Leu Val Gln Arg Val Phe Gly Tyr
100 105 110
His Ser Ile Gln Ser Glu His Lys Val Leu Gln Ala Ser Ser Thr Lys
115 120 125
His Leu Met Gly Asp Gly Leu Val Val Met Thr Gln Ala Met Met Tyr
130 135 140
Asn Leu Gln Asn Leu Met Leu His Ser Val Gly Ser Gly Asn Gly Lys
145 150 155 160
Val Trp Gln Glu Ser Gly Leu Phe Ala Tyr Ser Tyr Asn Ile Val Phe
165 170 175
Arg Ala Gly Tyr Leu Ser Leu Phe Gly Asn Glu Ser Pro Lys Gly Thr
180 185 190
Gly Lys Glu Ser Val Glu Lys Ala Lys Glu Ile Asp Arg Gln Glu Ser
195 200 205
Asn Asp Leu Phe Trp Glu Phe Arg Lys Tyr Asp Gln Leu Phe Pro Asn
210 215 220
Leu Ala Tyr Gly Val Leu Gly Pro Ser Glu Lys Met Glu Ala Glu Arg
225 230 235 240
Leu Lys Arg Leu Phe Trp Ser Thr Leu Ser Val Gln Lys Met Arg Ala
245 250 255
Arg Asp Asn Ile Ser Gly Trp Val Ser Asp Gln Gln Gln Val Arg Ala
260 265 270
Glu His Gly Met Gln Glu Phe Met Gln Asp Arg Tyr Met Phe Leu Leu
275 280 285
Leu Trp Ala Ser Gln Gly Asn Thr Gly Pro Ser Ala Phe Trp Leu Leu
290 295 300
Leu Tyr Leu Met Lys His Pro Glu Ala Met Ser Ala Val Arg Lys Glu
305 310 315 320
Val Glu Glu Ile Leu Lys Glu Ala Gly Gln Glu Val Lys Pro Gly Gly
325 330 335
Pro Leu Ile Asp Leu Ser Arg Asp Met Leu Leu Lys Thr Pro Ile Leu
340 345 350
Asp Ser Ala Val Glu Glu Thr Leu Arg Leu Thr Ala Ala Pro Ile Leu
355 360 365
Thr Arg Ala Val Met Gln Asp Met Thr Ile Ile Met Ala Asn Gly Gln
370 375 380
Glu Tyr Lys Ile Arg Glu Gly Asp Arg Val Ala Val Phe Pro Tyr Val
385 390 395 400
Val His Val Asp Pro Glu Val His Pro Asp Pro Leu Thr Phe Lys Tyr
405 410 415
Asp Arg Phe Leu Asn Ala Asp Gly Ser Arg Lys Thr Asp Phe Tyr Lys
420 425 430
Gly Gly Lys Lys Leu Lys Tyr Tyr Ser Met Pro Trp Gly Ala Gly Thr
435 440 445
Thr Met Cys Pro Gly Arg Phe Phe Ala Thr Asn Glu Leu Lys Gln Phe
450 455 460
Val Phe Leu Met Leu Ser Tyr Phe Asp Phe Glu Leu Thr Asn Pro Asn
465 470 475 480
Glu Gln Ile Pro Gly Ile Asp Ile Arg Arg Trp Gly Phe Gly Ser Met
485 490 495
Gln Ser Asp Arg Asp Ile Gln Phe Arg Tyr Arg Pro Arg Ile
500 505 510
<210> 278
<211> 1533
<212> DNA
<213> Danio rerio
<400> 278
atggctttgg ttcaaatctt gttggcattg ttgatctctg ttattggtgc tttgtatttg 60
ttgggttctt ttagaagaag aagaactggt gaaccaccat tagaaaaagg tccaattcca 120
tggttgggtc atgttttaga attcagaaag gatactgcta agttcttgaa cagaatgaaa 180
gcaaagcatg gtgacatttt tacagttcaa ttgggtggtt tctacttcac ttttattaca 240
gatccattat cttttggtgc tgttgttaag gaagctagag caaagttgga tttcacaaag 300
ttcgcagaac aattagttca aagagttttc ggttaccatt ctatccaatc agaacataag 360
gttttgcaag cttcttcaac taagcatttg atgggtgacg gtttagttgt tatgacacaa 420
gcaatgatgt acaatttgca aaatttgatg ttacattctg ttggttcagg caatggtaaa 480
gtttggcaag aatctggttt gttcgcttac tcatacaaca tcgtttttag agcaggttat 540
ttgtctttgt tcggtaacga atcaccaaaa ggtactggta aagaatctgt tgaaaaggct 600
aaggaaatcg atagacaaga atcaaacgat ttgttttggg aattcagaaa gtacgatcaa 660
ttgttcccaa atttggctta cggtgtttta ggtccatctg aaaagatgga agcagaaaga 720
ttgaagagat tattttggtc tactttgtca gttcaaaaga tgagagctag agataacatc 780
tctggttggg tttcagatca acaacaagtt agagcagaac atggtatgca agaattcatg 840
caagatagat acatgttctt gttgttgtgg gcttctcaag gtaatacagg tccatcagca 900
ttctggttgt tgttgtactt gatgaagcat ccagaagcta tgtcagcagt tagaaaggaa 960
gttgaagaaa tcttgaagga agctggtcaa gaagttaaac caggtggtcc attgatcgat 1020
ttgtctagag atatgttgtt gaagacacca atcttggatt cagcagttga agaaactttg 1080
agattaacag ctgcaccaat cttgactaga gctgttatgc aagatatgac aatcatcatg 1140
gcaaacggtc aagaatacaa gatcagagaa ggtgacagag ttgctgtttt tccatacgtt 1200
gttcatgttg atccagaagt tcatccagat ccattgactt ttaaatacga tagattttta 1260
aacgctgatg gttctagaaa gacagatttc tacaaaggtg gtaaaaagtt gaagtactac 1320
tcaatgccat ggggtgctgg tactacaatg tgtccaggta gatttttcgc aactaatgaa 1380
ttgaaacaat ttgttttctt gatgttgtct tacttcgatt tcgaattgac aaacccaaac 1440
gaacaaatcc caggtatcga tatcagaaga tggggttttg gttctatgca atcagataga 1500
gatatccaat tcagatacag accaagaatt tga 1533
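Each protein entry in SEQ ID NO: 265-278 is followed by a DNA entry that appears to be its coding sequence, so the two <211> lengths should satisfy nt = 3 × (aa + 1), with the extra codon being the stop codon. The following is a minimal sketch of that consistency check, assuming the standard genetic code; the names `translate`, `lengths_consistent`, and `LENGTHS` are illustrative and are not part of the filing.

```python
from itertools import product

# Standard genetic code, codons ordered t, c, a, g at each position.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# <211> lengths copied from the listing: {(protein ID, DNA ID): (residues, nucleotides)}.
LENGTHS = {
    (265, 266): (501, 1506),
    (267, 268): (500, 1503),
    (269, 270): (500, 1503),
    (271, 272): (501, 1506),
    (273, 274): (499, 1500),
    (275, 276): (496, 1491),
    (277, 278): (510, 1533),
}


def translate(dna: str) -> str:
    """Translate a DNA string into one-letter amino acids ('*' marks a stop).

    Spaces and the position counters of the listing layout are ignored.
    """
    seq = "".join(ch for ch in dna.lower() if ch in BASES)
    usable = len(seq) - len(seq) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def lengths_consistent(residues: int, nucleotides: int) -> bool:
    """True if the DNA length equals three bases per residue plus one stop codon."""
    return nucleotides == 3 * (residues + 1)


# First line of SEQ ID NO: 266, exactly as printed in the listing.
SEQ266_FIRST_LINE = (
    "atggttttgt ggggtccagt tttaggtgct ttgttagttg ttattgcagg ttatttgtgt 60"
)

if __name__ == "__main__":
    print(translate(SEQ266_FIRST_LINE))
    for ids, (aa, nt) in LENGTHS.items():
        print(ids, lengths_consistent(aa, nt))
```

Translating the first line of SEQ ID NO: 266 reproduces the first 20 residues of SEQ ID NO: 265 (Met Val Leu Trp Gly Pro Val Leu Gly Ala Leu Leu Val Val Ile Ala Gly Tyr Leu Cys), and all seven length pairs pass the check.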

Claims (44)

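1. A genetically modified cell capable of producing UDCA or a UDCA precursor, the genetically modified cell comprising at least one heterologous polynucleotide encoding an enzyme that participates in a metabolic pathway converting a sugar to UDCA or a UDCA precursor.
2. The cell of claim 1, comprising at least two heterologous polynucleotides, each encoding an enzyme that participates in a metabolic pathway converting a sugar to UDCA or a UDCA precursor, wherein the enzymes encoded by the at least two heterologous polynucleotides are operably linked along the metabolic pathway.
3. The cell of claim 1 or 2, wherein the UDCA precursor is desmosterol, cholesterol, 7α-hydroxycholesterol, 7α-hydroxy-4-cholesten-3-one, 7α-hydroxy-5β-cholestan-3-one, 5β-cholestane-3α,7α-diol, (25R)-3α,7α-dihydroxy-5β-cholestanoic acid, (25R)-3α,7α-dihydroxy-5β-cholestanoyl-CoA, (25S)-3α,7α-dihydroxy-5β-cholestanoyl-CoA, (24E)-3α,7α-dihydroxy-5β-cholest-24-enoyl-CoA, 3α,7α-dihydroxy-24-oxo-5β-cholestanoyl-CoA, 3α,7α-dihydroxy-5β-cholan-24-oyl-CoA, 3α-hydroxy-7-oxo-5β-cholan-24-oyl-CoA, 3α,7β-dihydroxy-5β-cholan-24-oyl-CoA, 7α,12α-dihydroxy-4-cholesten-3-one, 7α,12α-dihydroxy-5β-cholestan-3-one, 5β-cholestane-3α,7α,12α-triol, (25R)-3α,7α,12α-trihydroxy-5β-cholestan-26-oic acid, (25R)-3α,7α,12α-trihydroxy-5β-cholestanoyl-CoA, (25S)-3α,7α,12α-trihydroxy-5β-cholestanoyl-CoA, (24E)-3α,7α,12α-trihydroxy-5β-cholest-24-enoyl-CoA, 3α,7α,12α-trihydroxy-24-oxo-5β-cholestanoyl-CoA, 3α,7α,12α-trihydroxy-5β-cholan-24-oyl-CoA, or cholic acid.
4. The cell of any one of claims 1-3, wherein the encoded enzyme is DHCR7, DHCR24, CYP7A1, HSD3B7, CYP8B1, AKR1D1, AKR1C9, AKR1C4, CYP27A1, SLC27A5, FAT1, AMACR, ACOX2, POX1, HSD17B4, FOX2, SCP2, POT1, ERG10, 7α-HSD, 7β-HSD, or choloyl-CoA hydrolase.
5. The cell of any one of claims 1-4, wherein the encoded enzyme participates in a metabolic pathway converting a sugar to cholesterol.
6. The cell of any one of claims 1-4, wherein the encoded enzyme participates in a metabolic pathway converting cholesterol to CDC-CoA.
7. The cell of any one of claims 1-4, wherein the encoded enzyme participates in a metabolic pathway converting cholesterol to cholic acid.
8. The cell of any one of claims 1-4, wherein the encoded enzyme participates in a metabolic pathway converting CDC-CoA to UDCA.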
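9. The cell of any one of claims 1-5, wherein the encoded enzyme is:
DHCR7, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to any one of SEQ ID NO: 2, 4, 6, 8, 10, or 12; or
DHCR24, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to any one of SEQ ID NO: 14, 15, 16, 18, 19, 20, 22, 23, 24, 26, 27, 28, 30, 31, 32, 34, 35, 36, 38, 39, 40, 42, 44, 46, or 48.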
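10. The cell of any one of claims 1-4 or 6-7, wherein the encoded enzyme is:
CYP7A1, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to any one of SEQ ID NO: 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, or 80;
HSD3B7, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to any one of SEQ ID NO: 82, 84, 86, or 88;
CYP8B1, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to any one of SEQ ID NO: 266, 268, 270, 272, 274, 276, or 278;
AKR1D1, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to any one of SEQ ID NO: 90, 92, 94, or 96;
AKR1C9, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially similar to SEQ ID NO: 98;
AKR1C4, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to any one of SEQ ID NO: 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, or 122;
CYP27A1, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to any one of SEQ ID NO: 124, 126, 128, 130, 132, 134, 136, or 138;
SLC27A5, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to SEQ ID NO: 140 or 142;
FAT1, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to SEQ ID NO: 144;
AMACR, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to any one of SEQ ID NO: 146, 148, 150, 152, 154, 156, or 158;
ACOX2, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to any one of SEQ ID NO: 160, 162, 164, 166, 168, 170, 172, or 174;
POX1, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to SEQ ID NO: 176;
HSD17B4, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to any one of SEQ ID NO: 178, 180, 182, 184, 186, 188, 190, or 192;
FOX2, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to SEQ ID NO: 194;
SCP2, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to any one of SEQ ID NO: 196, 198, 200, or 202;
POT1, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to SEQ ID NO: 204; or
ERG10, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to SEQ ID NO: 206.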
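11. The cell of claim 8, wherein the encoded enzyme is:
7α-HSD, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to any one of SEQ ID NO: 208, 210, 212, or 214;
7β-HSD, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to any one of SEQ ID NO: 216, 218, 220, or 222; and
choloyl-CoA hydrolase, and is encoded by a polynucleotide comprising a nucleic acid sequence substantially identical to any one of SEQ ID NO: 224, 226, 228, or 230.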
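12. The cell of any one of claims 1-11, further comprising a heterologous polynucleotide encoding ADR, ADX, and/or truncated HMG.
13. The cell of any one of claims 1-12, wherein the cell is a microorganism or a part of a microorganism.
14. The cell of any one of claims 1-13, wherein the cell is a bacterium or a yeast.
15. The cell of any one of claims 1-14, wherein the cell is Saccharomyces cerevisiae.
16. A method of preparing UDCA or a UDCA precursor, the method comprising:
(a) contacting a substrate with the genetically modified cell of any one of claims 1-15; and
(b) growing the cell to produce UDCA or a UDCA precursor.
17. The method of claim 16, further comprising isolating the UDCA or UDCA precursor from the cell.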
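18. Use of UDCA or a UDCA precursor prepared using the method of claim 16 or 17 for the manufacture of a medicament for treating a disease or a symptom of a disease.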
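19. The use of claim 18, wherein the disease or symptom of a disease is gallstones, primary biliary cirrhosis, cystic fibrosis, impaired bile flow, intrahepatic cholestasis of pregnancy, and/or cholelithiasis.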
20. A medicament comprising UDCA or a UDCA precursor prepared using the method of claim 16 or 17.
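21. A method of treating a disease or a symptom of a disease, the method comprising administering to a subject in need thereof UDCA or a UDCA precursor made by the method of claim 16 or 17.
22. The method of claim 21, wherein the disease or symptom of a disease is gallstones, primary biliary cirrhosis, cystic fibrosis, impaired bile flow, intrahepatic cholestasis of pregnancy, and/or cholelithiasis.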
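23. An isolated polynucleotide encoding at least one enzyme that participates in a metabolic pathway that converts a sugar to UDCA or a UDCA precursor.
24. The polynucleotide of claim 23, wherein the encoded enzyme is DHCR7, DHCR24, CYP7A1, HSD3B7, CYP8B1, AKR1D1, AKR1C9, AKR1C4, CYP27A1, SLC27A5, FAT1, AMACR, ACOX2, POX1, HSD17B4, FOX2, SCP2, POT1, ERG10, 7α-HSD, 7β-HSD, or choloyl-CoA hydrolase.
25. The polynucleotide of claim 23 or 24, wherein the encoded enzyme participates in a metabolic pathway that converts a sugar to cholesterol.
26. The polynucleotide of claim 23 or 24, wherein the encoded enzyme participates in a metabolic pathway that converts cholesterol to CDC-CoA.
27. The polynucleotide of claim 23 or 24, wherein the encoded enzyme participates in a metabolic pathway that converts cholesterol to cholic acid.
28. The polynucleotide of claim 23 or 24, wherein the encoded enzyme participates in a metabolic pathway that converts CDC-CoA to UDCA.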
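29. The polynucleotide of any one of claims 23-25, wherein the encoded enzyme is:
DHCR7, and the polynucleotide comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 2, 4, 6, 8, 10, or 12; or
DHCR24, and the polynucleotide comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 14, 15, 16, 18, 19, 20, 22, 23, 24, 26, 27, 28, 30, 31, 32, 34, 35, 36, 38, 39, 40, 42, 44, 46, or 48.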
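30. The polynucleotide of any one of claims 23-24 and 26-27, wherein the encoded enzyme is:
CYP7A1, and the polynucleotide comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, or 80;
HSD3B7, and the polynucleotide comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 82, 84, 86, or 88;
CYP8B1, and the polynucleotide comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 266, 268, 270, 272, 274, 276, or 278;
AKR1D1, and the polynucleotide comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 90, 92, 94, or 96;
AKR1C9, and the polynucleotide comprises a nucleic acid sequence substantially similar to SEQ ID NO: 98;
AKR1C4, and the polynucleotide comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, or 122;
CYP27A1, and the polynucleotide comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 124, 126, 128, 130, 132, 134, 136, or 138;
SLC27A5, and the polynucleotide comprises a nucleic acid sequence substantially identical to SEQ ID NO: 140 or 142;
FAT1, and the polynucleotide comprises a nucleic acid sequence substantially identical to SEQ ID NO: 144;
AMACR, and the polynucleotide comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 146, 148, 150, 152, 154, 156, or 158;
ACOX2, and the polynucleotide comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 160, 162, 164, 166, 168, 170, 172, or 174;
POX1, and the polynucleotide comprises a nucleic acid sequence substantially identical to SEQ ID NO: 176;
HSD17B4, and the polynucleotide comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 178, 180, 182, 184, 186, 188, 190, or 192;
FOX2, and the polynucleotide comprises a nucleic acid sequence substantially identical to SEQ ID NO: 194;
SCP2, and the polynucleotide comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 196, 198, 200, or 202;
POT1, and the polynucleotide comprises a nucleic acid sequence substantially identical to SEQ ID NO: 204; or
ERG10, and the polynucleotide comprises a nucleic acid sequence substantially identical to SEQ ID NO: 206.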
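31. The polynucleotide of any one of claims 23-24 and 28, wherein the encoded enzyme is:
7α-HSD, and the polynucleotide comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 208, 210, 212, or 214;
7β-HSD, and the polynucleotide comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 216, 218, 220, or 222; and
choloyl-CoA hydrolase, and the polynucleotide comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 224, 226, 228, or 230.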
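32. A vector comprising a nucleic acid encoding at least one enzyme that participates in a metabolic pathway that converts a sugar to UDCA or a UDCA precursor.
33. The vector of claim 32, wherein the encoded enzyme is DHCR7, DHCR24, CYP7A1, HSD3B7, CYP8B1, AKR1D1, AKR1C9, AKR1C4, CYP27A1, SLC27A5, FAT1, AMACR, ACOX2, POX1, HSD17B4, FOX2, SCP2, POT1, ERG10, 7α-HSD, 7β-HSD, or choloyl-CoA hydrolase.
34. The vector of claim 32 or 33, wherein the encoded enzyme participates in a metabolic pathway that converts a sugar to cholesterol.
35. The vector of claim 32 or 33, wherein the encoded enzyme participates in a metabolic pathway that converts cholesterol to CDC-CoA.
36. The vector of claim 32 or 33, wherein the encoded enzyme participates in a metabolic pathway that converts cholesterol to cholic acid.
37. The vector of claim 32 or 33, wherein the encoded enzyme participates in a metabolic pathway that converts CDC-CoA to UDCA.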
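38. The vector of any one of claims 32-34, wherein the encoded enzyme is:
DHCR7, and the vector comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 2, 4, 6, 8, 10, or 12; or
DHCR24, and the vector comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 14, 15, 16, 18, 19, 20, 22, 23, 24, 26, 27, 28, 30, 31, 32, 34, 35, 36, 38, 39, 40, 42, 44, 46, or 48.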
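39. The vector of any one of claims 32-33 and 35-36, wherein the encoded enzyme is:
CYP7A1, and the vector comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, or 80;
HSD3B7, and the vector comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 82, 84, 86, or 88;
CYP8B1, and the vector comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 266, 268, 270, 272, 274, 276, or 278;
AKR1D1, and the vector comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 90, 92, 94, or 96;
AKR1C9, and the vector comprises a nucleic acid sequence substantially identical to SEQ ID NO: 98;
AKR1C4, and the vector comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, or 122;
CYP27A1, and the vector comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 124, 126, 128, 130, 132, 134, 136, or 138;
SLC27A5, and the vector comprises a nucleic acid sequence substantially identical to SEQ ID NO: 140 or 142;
FAT1, and the vector comprises a nucleic acid sequence substantially identical to SEQ ID NO: 144;
AMACR, and the vector comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 146, 148, 150, 152, 154, 156, or 158;
ACOX2, and the vector comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 160, 162, 164, 166, 168, 170, 172, or 174;
POX1, and the vector comprises a nucleic acid sequence substantially identical to SEQ ID NO: 176;
HSD17B4, and the vector comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 178, 180, 182, 184, 186, 188, 190, or 192;
FOX2, and the vector comprises a nucleic acid sequence substantially identical to SEQ ID NO: 194;
SCP2, and the vector comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 196, 198, 200, or 202;
POT1, and the vector comprises a nucleic acid sequence substantially identical to SEQ ID NO: 204; or
ERG10, and the vector comprises a nucleic acid sequence substantially identical to SEQ ID NO: 206.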
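40. The vector of any one of claims 32-33 and 37, wherein the encoded enzyme is:
7α-HSD, and the vector comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 208, 210, 212, or 214;
7β-HSD, and the vector comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 216, 218, 220, or 222; and
choloyl-CoA hydrolase, and the vector comprises a nucleic acid sequence substantially identical to any one of SEQ ID NO: 224, 226, 228, or 230.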
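41. A method of making a genetically modified cell capable of synthesizing UDCA or a UDCA precursor, the method comprising:
(a) contacting a cell with at least one heterologous polynucleotide encoding an enzyme that participates in a metabolic pathway that converts a sugar to UDCA or a UDCA precursor; and
(b) growing the cell such that the polynucleotide is inserted into the microorganism.
42. The method of claim 41, wherein the cell is a bacterial or yeast cell.
43. The method of claim 41 or 42, wherein the cell is a Saccharomyces cerevisiae cell.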
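44. A composition comprising UDCA or a UDCA precursor, a free acid or CoA thereof, or a pharmaceutically acceptable derivative or prodrug thereof, wherein the UDCA, UDCA precursor, free acid or CoA thereof, or pharmaceutically acceptable derivative or prodrug thereof is produced by the method of claim 16 or 17.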
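
The enzyme and SEQ ID NO assignments recited in claims 29-31 (and mirrored in claims 9-11 and 38-40) can be tabulated for quick lookup. The following Python sketch is an illustration only and is not part of the patent: the names `SEQ_IDS`, `STAGES`, `enzymes_for_seq_id`, and `stage_of` are hypothetical, and the grouping into pathway stages is inferred from the claim dependencies (claim 29 on claim 25, claim 30 on claims 26-27, claim 31 on claim 28).

```python
# Illustrative lookup table built from claims 29-31.
# All identifiers here are hypothetical; none appear in the patent.

def evens(first, last):
    """Even numbers from first to last, inclusive (most claimed lists step by 2)."""
    return list(range(first, last + 1, 2))

# Enzyme -> SEQ ID NOs recited for that enzyme.
SEQ_IDS = {
    "DHCR7": evens(2, 12),
    "DHCR24": [14, 15, 16, 18, 19, 20, 22, 23, 24, 26, 27, 28, 30,
               31, 32, 34, 35, 36, 38, 39, 40, 42, 44, 46, 48],
    "CYP7A1": evens(50, 80),
    "HSD3B7": evens(82, 88),
    "CYP8B1": evens(266, 278),
    "AKR1D1": evens(90, 96),
    "AKR1C9": [98],
    "AKR1C4": evens(100, 122),
    "CYP27A1": evens(124, 138),
    "SLC27A5": [140, 142],
    "FAT1": [144],
    "AMACR": evens(146, 158),
    "ACOX2": evens(160, 174),
    "POX1": [176],
    "HSD17B4": evens(178, 192),
    "FOX2": [194],
    "SCP2": evens(196, 202),
    "POT1": [204],
    "ERG10": [206],
    "7α-HSD": evens(208, 214),
    "7β-HSD": evens(216, 222),
    "choloyl-CoA hydrolase": evens(224, 230),
}

# Pathway stage -> enzymes, inferred from which claims each list depends on.
STAGES = {
    "sugar -> cholesterol": ["DHCR7", "DHCR24"],
    "cholesterol -> CDC-CoA or cholic acid": [
        "CYP7A1", "HSD3B7", "CYP8B1", "AKR1D1", "AKR1C9", "AKR1C4",
        "CYP27A1", "SLC27A5", "FAT1", "AMACR", "ACOX2", "POX1",
        "HSD17B4", "FOX2", "SCP2", "POT1", "ERG10",
    ],
    "CDC-CoA -> UDCA": ["7α-HSD", "7β-HSD", "choloyl-CoA hydrolase"],
}

def enzymes_for_seq_id(seq_id):
    """Return the enzymes whose claimed list contains the given SEQ ID NO."""
    return [name for name, ids in SEQ_IDS.items() if seq_id in ids]

def stage_of(enzyme):
    """Return the pathway stage an enzyme is grouped under, or None if unknown."""
    for stage, names in STAGES.items():
        if enzyme in names:
            return stage
    return None

if __name__ == "__main__":
    print(enzymes_for_seq_id(216))
    print(stage_of("CYP8B1"))
```

A table of this kind only records which sequence identifiers each claim names. It says nothing about whether a given sequence is "substantially identical" to a listed one, which depends on the definition given in the specification.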
CN201980081514.5A 2018-10-09 2019-10-08 Cells and methods for the production of ursodeoxycholic acid and precursors thereof Pending CN113227364A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201862743122P 2018-10-09 2018-10-09
US62/743,122 2018-10-09
PCT/US2019/055180 WO2020076819A1 (en) 2018-10-09 2019-10-08 Cells and methods for the production of ursodeoxycholic acid and precursors thereof

Publications (1)

Publication Number Publication Date
CN113227364A true CN113227364A (zh) 2021-08-06

Family

ID=68318983

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201980081514.5A Pending CN113227364A (zh) 2018-10-09 2019-10-08 Cells and methods for the production of ursodeoxycholic acid and precursors thereof

Country Status (4)

Country Link
US (1) US20210340504A1 (zh)
EP (1) EP3864144A1 (zh)
CN (1) CN113227364A (zh)
WO (1) WO2020076819A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114231508A (zh) * 2021-12-28 2022-03-25 宋建芳 7β-hydroxysteroid dehydrogenase mutant and application thereof

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3933036A1 (en) * 2020-07-02 2022-01-05 Herbrand PharmaChemicals GmbH Process for 7-beta-hydroxylation of bile acid derivatives
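CN112852652A (zh) * 2021-01-15 2021-05-28 江南大学 Recombinant yeast strain for efficiently converting chenodeoxycholic acid to synthesize ursodeoxycholic acid, and construction and application thereof
CN112725212A (zh) * 2021-01-15 2021-04-30 江南大学 Engineering of a recombinant yeast chassis cell for efficient conversion of chenodeoxycholic acid, and construction and application of the recombinant strain
CN112779175A (zh) * 2021-02-10 2021-05-11 上海中医药大学 Engineered Saccharomyces cerevisiae and method for preparing artificial bear bile powder
CN114134067A (zh) * 2021-10-19 2022-03-04 山东睿智医药科技有限公司 Escherichia coli and application thereof
CN115287330B (zh) * 2022-08-03 2023-09-01 四川大学 Method for detecting the enzymatic activity of cytochrome CYP8B1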

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5166374A (en) * 1989-04-17 1992-11-24 Giuliani S.P.A. Bile acid derivatives, processes for the preparation thereof and pharmaceutical compositions containing them
CN103097400A (zh) * 2010-05-27 2013-05-08 细胞制药有限公司 Novel 7α-hydroxysteroid dehydrogenase knockout mutants and use thereof

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103415618A (zh) 2010-08-19 2013-11-27 新西兰郎泽科技公司 Method for producing chemicals using microbial fermentation of a carbon monoxide-containing substrate

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114231508A (zh) * 2021-12-28 2022-03-25 宋建芳 7β-hydroxysteroid dehydrogenase mutant and application thereof
CN114231508B (zh) * 2021-12-28 2022-11-11 宋建芳 7β-hydroxysteroid dehydrogenase mutant and application thereof

Also Published As

Publication number Publication date
US20210340504A1 (en) 2021-11-04
WO2020076819A1 (en) 2020-04-16
EP3864144A1 (en) 2021-08-18

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination