CN1199995C - 具有抑癌功能的新的人蛋白及其编码序列 - Google Patents
具有抑癌功能的新的人蛋白及其编码序列 Download PDFInfo
- Publication number
- CN1199995C CN1199995C CN 01145279 CN01145279A CN1199995C CN 1199995 C CN1199995 C CN 1199995C CN 01145279 CN01145279 CN 01145279 CN 01145279 A CN01145279 A CN 01145279A CN 1199995 C CN1199995 C CN 1199995C
- Authority
- CN
- China
- Prior art keywords
- pro
- leu
- ser
- ala
- ctg
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Landscapes
- Peptides Or Proteins (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
本发明公开了一类新的具有抑癌功能的人蛋白,编码此多肽的多核苷酸和经重组技术产生该多肽的方法。本发明还公开了此多肽用于治疗多种疾病如癌症等的方法。本发明还公开了抗此多肽的拮抗剂及其治疗作用。本发明还公开了编码这类新的具有抑癌功能的人蛋白的多核苷酸的用途。
Description
技术领域
本发明属于生物技术领域,具体地说,本发明涉及新的编码具有抑癌功能的人蛋白的多核苷酸和此多核苷酸编码的多肽。本发明还涉及此多核苷酸和多肽的用途和制备。
背景技术
人基因组学研究目前是国际上的热点,除人染色体DNA大规模测序,表达序列测序(EST)的方法外,还缺少从功能开始的筛选具有功能基因的高通量的方法。
癌症是危害人类健康的主要疾病之一。为了有效地治疗和预防肿瘤,目前人们已越来越关注肿瘤的基因治疗。因此,本领域迫切需要开发研究具有抑癌功能的人蛋白及其激动剂/抑制剂。
发明内容
本发明的目的是提供一类新的具有抑癌功能的人蛋白多肽以及其片段、类似物和衍生物。
本发明的另一目的是提供编码这些多肽的多核苷酸。
本发明的另一目的是提供生产这些多肽的方法以及该多肽和编码序列的用途。
在本发明的第一方面,提供新颖的分离出的具有抑癌功能的蛋白多肽,它包含具有选自下组的氨基酸序列的多肽:SEQ ID NO:3、6、9、12、15、18、21、24;或其保守性变异多肽、或其活性片段、或其活性衍生物。
较佳地,该多肽是具有选自下组的氨基酸序列的多肽:SEQ ID NO:3、6、9、12、15、18、21、24。
在本发明的第二方面,提供了一种分离的多核苷酸,它包含一核苷酸序列,该核苷酸序列与选自下组的一种核苷酸序列有至少85%相同性:(a)编码上述的具有抑癌功能的蛋白多肽的多核苷酸;(b)与多核苷酸(a)互补的多核苷酸。较佳地,该多核苷酸编码的多肽具有选自下组的氨基酸序列:SEQ ID NO:3、6、9、12、15、18、21、24。更佳地,该多核苷酸的序列选自下组:SEQ ID NO:2、5、8、11、14、17、20、23的编码区序列或全长序列。
在本发明的第三方面,提供了含有上述多核苷酸的载体,以及被该载体转化或转导的宿主细胞或者被上述多核苷酸直接转化或转导的宿主细胞。
在本发明的第四方面,提供了制备具有抑癌功能的蛋白活性的多肽的制备方法,该方法包含:(a)在适合表达具有抑癌功能的蛋白的条件下,培养上述被转化或转导的宿主细胞:(b)从培养物中分离出具有抑癌功能的蛋白活性的多肽。
在本发明的第五方面,提供了与上述的具有抑癌功能的蛋白多肽特异性结合的抗体。还提供了可用于检测的核酸分子,它含有上述的多核苷酸中连续10个核苷酸至全长核苷酸,较佳地它含有连续的约10-800个核苷酸。
在本发明的第六方面,提供了一种药物组合物,它含有安全有效量的本发明的具有抑癌功能的蛋白多肽以及药学上可接受的载体。这些药物组合物可治疗癌症以及细胞异常增殖等病症。
本发明的其它方面由于本文的公开内容,对本领域的技术人员而言是显而易见的。
具体实施方式
3T3细胞是一种小鼠成纤维细胞(J.Cell.Biol.,17:299,1963)(也称为NIH/3T3细胞)。在癌症研究领域中,常将外源基因(尤其是人基因)引入3T3细胞,观察其对3T3细胞生长的影响情况。通常认为,对3T3细胞生长有影响的基因是癌症相关基因,其中对3T3细胞生长有抑制作用的基因大多是抑癌基因,而对3T3细胞生长有促进作用的基因大多是(原)癌基因。
本发明采用大规模cDNA克隆转染小鼠胚胎成纤维细胞,在获得具有抑癌作用的基础上,经测序证明为新的基因,进一步得到全长cDNA克隆。DNA转染试验证明,本发明的具有抑癌功能的蛋白对3T3细胞具有抑制克隆形成的作用,其抑制率≥50%。
如本文所用,“分离的”是指物质从其原始环境中分离出来(如果是天然的物质,原始环境即是天然环境)。如活体细胞内的天然状态下的多聚核苷酸和多肽是没有分离纯化的,但同样的多聚核苷酸或多肽如从天然状态中同存在的其他物质中分开,则为分离纯化的。
如本文所用,“分离的具有抑癌功能的蛋白或多肽”是指具有抑癌功能的蛋白多肽基本上不含天然与其相关的其它蛋白、脂类、糖类或其它物质。本领域的技术人员能用标准的蛋白质纯化技术纯化具有抑癌功能的蛋白。基本上纯的多肽在非还原聚丙烯酰胺凝胶上能产生单一的主带。
本发明的多肽可以是重组多肽、天然多肽、合成多肽,优选重组多肽。本发明的多肽可以是天然纯化的产物,或是化学合成的产物,或使用重组技术从原核或真核宿主(例如,细菌、酵母、高等植物、昆虫和哺乳动物细胞)中产生。根据重组生产方案所用的宿主,本发明的多肽可以是糖基化的,或可以是非糖基化的。本发明的多肽还可包括或不包括起始的甲硫氨酸残基。
本发明还包括具有抑癌功能的人蛋白的片段、衍生物和类似物。如本文所用,术语“片段”、“衍生物”和“类似物”是指基本上保持本发明的天然具有抑癌功能的人蛋白相同的生物学功能或活性的多肽。本发明的多肽片段、衍生物或类似物可以是(i)有一个或多个保守或非保守性氨基酸残基(优选保守性氨基酸残基)被取代的多肽,而这样的取代的氨基酸残基可以是也可以不是由遗传密码编码的,或(ii)在一个或多个氨基酸残基中具有取代基团的多肽,或(iii)成熟多肽与另一个化合物(比如延长多肽半衰期的化合物,例如聚乙二醇)融合所形成的多肽,或(iv)附加的氨基酸序列融合到此多肽序列而形成的多肽(如前导序列或分泌序列或用来纯化此多肽的序列或蛋白原序列)。根据本文的教导,这些片段、衍生物和类似物属于本领域熟练技术人员公知的范围。
本发明的多核苷酸可以是DNA形式或RNA形式。DNA形式包括cDNA、基因组DNA或人工合成的DNA。DNA可以是单链的或是双链的。DNA可以是编码链或非编码链。以PP11303蛋白(在本申请中,蛋白质的命名采用其克隆编号)为例,编码成熟多肽的编码区序列可以与SEQ ID NO:2所示的编码区序列相同或者是简并的变异体。如本文所用,“简并的变异体”对于PP11303而言是指编码具有SEQ ID NO:3的蛋白质,但与SEQ ID NO:2所示的编码区序列有差别的核酸序列。再以PP12899蛋白为例,编码成熟多肽的编码区序列可以与SEQ ID NO:5所示的编码区序列相同或者是简并的变异体;“简并的变异体”对于PP12899而言是指编码具有SEQ ID NO:6的蛋白质,但与SEQ ID NO:5所示的编码区序列有差别的核酸序列。对于本发明的其他具有抑癌功能的蛋白,可依此类推。
编码成熟多肽的多核苷酸包括:只编码成熟多肽的编码序列;成熟多肽的编码序列和各种附加编码序列;成熟多肽的编码序列(和任选的附加编码序列)以及非编码序列。
术语“编码多肽的多核苷酸”可以是包括编码此多肽的多核苷酸,也可以是还包括附加编码和/或非编码序列的多核苷酸。
本发明还涉及上述多核苷酸的变异体,其编码与本发明有相同的氨基酸序列的多肽或多肽的片段、类似物和衍生物。此多核苷酸的变异体可以是天然发生的等位变异体或非天然发生的变异体。这些核苷酸变异体包括取代变异体、缺失变异体和插入变异体。如本领域所知的,等位变异体是一个多核苷酸的替换形式,它可能是一个或多个核苷酸的取代、缺失或插入,但不会从实质上改变其编码的多肽的功能。
本发明还涉及与上述的序列杂交且两个序列之间具有至少50%,较佳地至少70%,更佳地至少80%相同性的多核苷酸。本发明特别涉及在严格条件下与本发明所述多核苷酸可杂交的多核苷酸。在本发明中,“严格条件”是指:(1)在较低离子强度和较高温度下的杂交和洗脱,如0.2×SSC,0.1%SDS,60℃;或(2)杂交时加有变性剂,如50%(v/v)甲酰胺,0.1%小牛血清/0.1%Ficoll,42℃等;或(3)仅在两条序列之间的相同性至少在95%以上,更好是97%以上时才发生杂交。并且,可杂交的多核苷酸编码的多肽与SEQ IDNO:3所示的成熟多肽有相同的生物学功能(以PP11303蛋白为例)和活性。
本发明还涉及与上述的序列杂交的核酸片段。如本文所用,“核酸片段”的长度至少含15个核苷酸,较好是至少30个核苷酸,更好是至少50个核苷酸,最好是至少100个核苷酸以上。核酸片段可用于核酸的扩增技术(如PCR)以确定和/或分离编码具有抑癌功能的蛋白的多聚核苷酸。
本发明中的多肽和多核苷酸优选以分离的形式提供,更佳地被纯化至均质。
本发明的DNA序列能用几种方法获得。例如,用本领域熟知的杂交技术分离DNA。这些技术包括但不局限于:1)用探针与基因组或cDNA文库杂交以检出同源性核苷酸序列,和2)表达文库的抗体筛选以检出具有共同结构特征的克隆的DNA片段。
编码具有抑癌功能的蛋白的特异DNA片段序列产生也能用下列方法获得:1)从基因组DNA分离双链DNA序列:2)化学合成DNA序列以获得所需多肽的双链DNA。
当需要的多肽产物的整个氨基酸序列已知时,DNA序列的直接化学合成是经常选用的方法。如果所需的氨基酸的整个序列不清楚时,DNA序列的直接化学合成是不可能的,选用的方法是cDNA序列的分离。分离感兴趣的cDNA的标准方法是从高表达该基因的供体细胞分离mRNA并进行逆转录,形成质粒或噬菌体cDNA文库。提取mRNA的方法已有多种成熟的技术,试剂盒也可从商业途径获得(Qiagene)。而构建cDNA文库也是通常的方法(Sambrook,et al.,Molecular Cloning,A Laboratory Manual,Cold SpringHarbor Laboratory.New York,1989)。还可得到商业供应的cDNA文库,如Clontech公司的不同cDNA文库。当结合使用聚合酶反应技术时,即使极少的表达产物也能克隆。
可用常规方法从这些cDNA文库中筛选本发明的基因。这些方法包括(但不限于):(1)DNA-DNA或DNA-RNA杂交;(2)标志基因的功能出现或丧失;(3)测定具有抑癌功能的蛋白的转录本的水平;(4)通过免疫学技术或测定生物学活性,来检测基因表达的蛋白产物。上述方法可单用,也可多种方法联合应用。
在第(1)种方法中,杂交所用的探针是与本发明的多核苷酸的任何一部分同源,其长度至少15个核苷酸,较好是至少30个核苷酸,更好是至少50个核苷酸,最好是至少100个核苷酸。此外,探针的长度通常在2kb之内,较佳地为1kb之内。此处所用的探针通常是在本发明的基因DNA序列信息的基础上化学合成的DNA序列。本发明的基因本身或者片段当然可以用作探针。DNA探针的标记可用放射性同位素,荧光素或酶(如碱性磷酸酶)等。
在第(4)种方法中,检测具有抑癌功能的蛋白基因表达的蛋白产物可用免疫学技术如Western印迹法,放射免疫沉淀法,酶联免疫吸附法(ELISA)等。
应用PCR技术扩增DNA/RNA的方法(Saiki,et al.Science 1985;230:1350-1354)被优选用于获得本发明的基因。特别是很难从文库中得到全长的cDNA时,可优选使用RACE法(RACE-cDNA末端快速扩增法),用于PCR的引物可根据本文所公开的本发明的序列信息适当地选择,并可用常规方法合成。可用常规方法如通过凝胶电泳分离和纯化扩增的DNA/RNA片段。
如上所述得到的本发明的基因,或者各种DNA片段等的核苷酸序列的测定可用常规方法如双脱氧链终止法(Sanger et al.PNAS,1977,74:5463-5467)。这类核苷酸序列测定也可用商业测序试剂盒等。为了获得全长的cDNA序列,测序需反复进行。有时需要测定多个克隆的cDNA序列,才能拼接成全长的cDNA序列。
本发明也涉及包含本发明多核苷酸的载体,以及用本发明载体或具有抑癌功能的蛋白编码序列经基因工程产生的宿主细胞,以及经重组技术产生本发明所述多肽的方法。
通过常规的重组DNA技术(Science,1984;224:1431),可利用本发明的多聚核苷酸序列可用来表达或生产重组的具有抑癌功能的蛋白多肽。一般来说有以下步骤:
(1).用本发明的编码具有抑癌功能的人蛋白的多核苷酸(或变异体),或用含有该多核苷酸的重组表达载体转化或转导合适的宿主细胞;
(2).在合适的培养基中培养的宿主细胞;
(3).从培养基或细胞中分离、纯化蛋白质。
本发明中,具有抑癌功能的人蛋白多核苷酸序列可插入到重组表达载体中。术语“重组表达载体”指本领域熟知的细菌质粒、噬菌体、酵母质粒、植物细胞病毒、哺乳动物细胞病毒如腺病毒、逆转录病毒或其他载体。在本发明中适用的载体包括但不限于:在细菌中表达的基于T7的表达载体(Rosenberg,et al.Gene,1987,56:125);在哺乳动物细胞中表达的pMSXND表达载体(Lee and Nathans,J Bio Chem.263:3521,1988)和在昆虫细胞中表达的来源于杆状病毒的载体。总之,只要能在宿主体内复制和稳定,任何质粒和载体都可以用。表达载体的一个重要特征是通常含有复制起点、启动子、标记基因和翻译控制元件。
本领域的技术人员熟知的方法能用于构建含具有抑癌功能的人蛋白编码DNA序列和合适的转录/翻译控制信号的表达载体。这些方法包括体外重组DNA技术、DNA合成技术、体内重组技术等(Sambroook,et al.)。所述的DNA序列可有效连接到表达载体中的适当启动子上,以指导mRNA合成。这些启动子的代表性例子有:大肠杆菌的lac或trp启动子;λ噬菌体PL启动子;真核启动子包括CMV立即早期启动子、早期和晚期SV40启动子、反转录病毒的LTRs和其他一些已知的可控制基因在原核或真核细胞或其病毒中表达的启动子。表达载体还包括翻译起始用的核糖体结合位点和转录终止子。
此外,表达载体优选地包含一个或多个选择性标记基因,以提供用于选择转化的宿主细胞的表型性状,如真核细胞培养用的二氢叶酸还原酶、新霉素抗性以及绿色荧光蛋白(GFP),或用于大肠杆菌的四环素或氨苄青霉素抗性。
包含上述的适当DNA序列以及适当启动子或者控制序列的载体,可以用于转化适当的宿主细胞,以使其能够表达蛋白质。
宿主细胞可以是原核细胞,如细菌细胞;或是低等真核细胞,如酵母细胞;或是高等真核细胞,如哺乳动物细胞。代表性例子有:大肠杆菌,链霉菌属;鼠伤寒沙门氏菌的细菌细胞;真菌细胞如酵母;植物细胞;果蝇S2或Sf9的昆虫细胞;CHO、COS或Bowes黑素瘤细胞的动物细胞等。
本发明的多核苷酸在高等真核细胞中表达时,如果在载体中插入增强子序列时将会使转录得到增强。增强子是DNA的顺式作用因子,通常大约有10到300个碱基对,作用于启动子以增强基因的转录。可举的例子包括在复制起始点晚期一侧的100到270个碱基对的SV40增强子、在复制起始点晚期一侧的多瘤增强子以及腺病毒增强子等。
本领域一般技术人员都清楚如何选择适当的载体、启动子、增强子和宿主细胞。
用重组DNA转化宿主细胞可用本领域技术人员熟知的常规技术进行。当宿主为原核生物如大肠杆菌时,能吸收DNA的感受态细胞可在指数生长期后收获,用CaCl,法处理,所用的步骤在本领域众所周知。可供选择的是用MgCl2。如果需要,转化也可用电穿孔的方法进行。当宿主是真核生物,可选用如下的DNA转染方法:磷酸钙共沉淀法,常规机械方法如显微注射、电穿孔、脂质体包装等。
获得的转化子可以用常规方法培养,表达本发明的基因所编码的多肽。根据所用的宿主细胞,培养中所用的培养基可选自各种常规培养基。在适于宿主细胞生长的条件下进行培养。当宿主细胞生长到适当的细胞密度后,用合适的方法(如温度转换或化学诱导)诱导选择的启动子,将细胞再培养一段时间。
在上面的方法中的重组多肽可包被于细胞内、细胞外或在细胞膜上表达或分泌到细胞外。如果需要,可利用其物理的、化学的和其它特性通过各种分离方法分离和纯化重组的蛋白。这些方法是本领域技术人员所熟知的。这些方法的例子包括但并不限于:常规的复性处理、用蛋白沉淀剂处理(盐析方法)、离心、渗透破菌、超处理、超离心、分子筛层析(凝胶过滤)、吸附层析、离子交换层析、高效液相层析(HPLC)和其它各种液相层析技术及这些方法的结合。
重组的具有抑癌功能的人蛋白或多肽有多方面的用途。这些用途包括(但不限于):直接做为药物治疗具有抑癌功能的蛋白功能低下或丧失所致的疾病,和用于筛选促进或对抗具有抑癌功能的蛋白功能的抗体、多肽或其它配体。例如,抗体可用于激活或抑制具有抑癌功能的人蛋白的功能。用表达的重组具有抑癌功能的人蛋白筛选多肽库可用于寻找有治疗价值的能抑制或刺激具有抑癌功能的人蛋白功能的多肽分子。
本发明也提供了筛选药物以鉴定提高(激动剂)或阻遏(拮抗剂)具有抑癌功能的人蛋白的药剂的方法。激动剂提高具有抑癌功能的人蛋白刺激细胞增殖等生物功能,而拮抗剂阻止和治疗与细胞过度增殖有关的紊乱如各种癌症。例如,能在药物的存在下,将哺乳动物细胞或表达具有抑癌功能的人蛋白的膜制剂与标记的具有抑癌功能的人蛋白一起培养。然后测定药物提高或阻遏此相互作用的能力。
具有抑癌功能的人蛋白的拮抗剂包括筛选出的抗体、化合物、受体缺失物和类似物等。具有抑癌功能的人蛋白的拮抗剂可以与具有抑癌功能的人蛋白结合并消除其功能,或是抑制具有抑癌功能的人蛋白的产生,或是与多肽的活性位点结合使多肽不能发挥生物学功能。具有抑癌功能的人蛋白的拮抗剂可用于治疗用途。
在筛选作为拮抗剂的化合物时,可以将本发明蛋白加入生物分析测定中,通过测定化合物影响具有抑癌功能的蛋白和其受体之间的相互作用来确定化合物是否是拮抗剂。用上述筛选化合物的同样方法,可以筛选出起拮抗剂作用的受体缺失物和类似物。
本发明的多肽可直接用于疾病治疗,例如,各种恶性肿瘤、和细胞异常增殖等。
本发明的多肽,及其片段、衍生物、类似物或它们的细胞可以用来作为抗原以生产抗体。这些抗体可以是多克隆或单克隆抗体。多克隆抗体可以通过将此多肽直接注射动物的方法得到。制备单克隆抗体的技术包括杂交瘤技术,三瘤技术,人B-细胞杂交瘤技术,EBV-杂交瘤技术等。
可以将本发明的多肽和拮抗剂与合适的药物载体组合后使用。这些载体可以是水、葡萄糖、乙醇、盐类、缓冲液、甘油以及它们的组合。组合物包含安全有效量的多肽或拮抗剂以及不影响药物效果的载体和赋形剂。这些组合物可以作为药物用于疾病治疗。
本发明还提供含有一种或多种容器的药盒或试剂盒,容器中装有一种或多种本发明的药用组合物成分。与这些容器一起,可以有由制造、使用或销售药品或生物制品的政府管理机构所给出的指示性提示,该提示反映出生产、使用或销售的政府管理机构许可其在人体上施用。此外,本发明的多肽可以与其它的治疗化合物结合使用。
药物组合物可以以方便的方式给药,如通过局部、静脉内、腹膜内、肌内、皮下、鼻内或皮内的给药途径。具有抑癌功能的蛋白以有效地治疗和/或预防具体的适应症的量来给药。施用于患者的具有抑癌功能的蛋白的量和剂量范围将取决于许多因素,如给药方式、待治疗者的健康条件和诊断医生的判断。
具有抑癌功能的人蛋白的多聚核苷酸也可用于多种治疗目的。基因治疗技术可用于治疗由于具有抑癌功能的蛋白的无表达或异常/无活性的具有抑癌功能的蛋白的表达所致的细胞增殖、发育或代谢异常。重组的基因治疗载体可用于治疗具有抑癌功能的蛋白表达或活性异常所致的疾病。来源于病毒的表达载体如逆转录病毒、腺病毒、腺病毒相关病毒、单纯疱疹病毒、细小病毒等可用于将具有抑癌功能的蛋白基因转移至细胞内。构建携带具有抑癌功能的蛋白基因的重组病毒载体的方法可见于已有文献(Sambrook,etal.)。另外重组具有抑癌功能的人蛋白基因可包装到脂质体中转移至细胞内。
抑制具有抑癌功能的人蛋白mRNA的寡聚核苷酸(包括反义RNA和DNA)以及核酶也在本发明的范围之内。核酶是一种能特异性分解特定RNA的酶样RNA分子,其作用机制是核酶分子与互补的靶RNA特异性杂交后进行核酸内切作用。反义的RNA和DNA及核酶可用已有的任何RNA或DNA合成技术获得,如固相磷酸酰胺化学合成法合成寡核苷酸的技术已广泛应用。反义RNA分子可通过编码该RNA的DNA序列在体外或体内转录获得。这种DNA序列已整合到载体的RNA聚合酶启动子的下游。为了增加核酸分子的稳定性,可用多种方法对其进行修饰,如增加两侧的序列长度,核糖核苷之间的连接应用磷酸硫酯键或肽键而非磷酸二酯键。
多聚核苷酸导入组织或细胞内的方法包括:将多聚核苷酸直接注入到体内组织中;或在体外通过载体(如病毒、噬菌体或质粒等)先将多聚核苷酸导入细胞中,再将细胞移植到体内等。
本发明的多肽还可用作肽谱分析,例如,多肽可用物理的、化学或酶进行特异性切割,并进行一维或二维或三维的凝胶电泳分析。
本发明还提供了针对具有抑癌功能的人蛋白抗原决定簇的抗体。这些抗体包括(但不限于):多克隆抗体、单克隆抗体、嵌合抗体、单链抗体、Fab片段和Fab表达文库产生的片段。这些抗体可用常规方法制备。抗具有抑癌功能的人蛋白的抗体可用于免疫组织化学技术中,检测活检标本中的具有抑癌功能的人蛋白。
与具有抑癌功能的人蛋白结合的单克隆抗体也可用放射性同位素标记,注入体内可跟踪其位置和分布。本发明中的抗体可用于治疗或预防与具有抑癌功能的人蛋白相关的疾病。给予适当剂量的抗体可以刺激或阻断具有抑癌功能的人蛋白的产生或活性。
抗体也可用于设计针对体内某一特殊部位的免疫毒素。如具有抑癌功能的人蛋白高亲和性的单克隆抗体可与细菌或植物毒素(如白喉毒素,蓖麻蛋白,红豆碱等)共价结合。
多克隆抗体的生产可用具有抑癌功能的人蛋白或多肽免疫动物,如家兔,小鼠,大鼠等。多种佐剂可用于增强免疫反应,包括但不限于弗氏佐剂等。
具有抑癌功能的人蛋白单克隆抗体可用杂交瘤技术生产(Kohler and Milstein.Nature,1975,256:495-497)。将人恒定区和非人源的可变区结合的嵌合抗体可用已有的技术生产(Morrison et al,PNAS,1985,81:6851)。而已有的生产单链抗体的技术(U.S.PatNo.4946778)也可用于生产抗具有抑癌功能的人蛋白的单链抗体。
能与本发明蛋白结合的多肽分子可通过筛选由各种可能组合的氨基酸结合于固相物组成的随机多肽库而获得。筛选时,必须对具有抑癌功能的人蛋白分子进行标记。
本发明还涉及定量和定位检测具有抑癌功能的人蛋白水平的诊断试验方法。这些试验是本领域所熟知的,且包括FISH测定和放射免疫测定。试验中所检测的具有抑癌功能的人蛋白水平,可以用作解释具有抑癌功能的人蛋白在各种疾病中的重要性和用于诊断具有抑癌功能的蛋白起作用的疾病。
具有抑癌功能的蛋白的多聚核苷酸可用于具有抑癌功能的蛋白相关疾病的诊断和治疗。在诊断方面,具有抑癌功能的蛋白的多聚核苷酸可用于检测具有抑癌功能的蛋白的表达与否或在疾病状态下具有抑癌功能的蛋白的异常表达。如具有抑癌功能的蛋白DNA序列可用于对活检标本的杂交以判断具有抑癌功能的蛋白的表达异常。杂交技术包括Southern印迹法,Northern印迹法、原位杂交等。这些技术方法都是公开的成熟技术,相关的试剂盒都可从商业途径得到。本发明的多核苷酸的一部分或全部可作为探针固定在微阵列(Microarray)或DNA芯片(又称为“基因芯片”)上,用于分析组织中基因的差异表达分析和基因诊断。用具有抑癌功能的蛋白特异的引物进行RNA-聚合酶链反应(RT-PCR)体外扩增也可检测具有抑癌功能的蛋白的转录产物。
检测具有抑癌功能的蛋白基因的突变也可用于诊断具有抑癌功能的蛋白相关的疾病。具有抑癌功能的蛋白突变的形式包括与正常野生型具有抑癌功能的蛋白DNA序列相比的点突变、易位、缺失、重组和其它任何异常等。可用已有的技术如Southern印迹法、DNA序列分析、PCR和原位杂交检测突变。另外,突变有可能影响蛋白的表达,因此用Northern印迹法、Western印迹法可间接判断基因有无突变。
本发明的序列对染色体鉴定也是有价值的。这些序列会特异性地针对某条人染色体具体位置且并可以与其杂交。目前,需要鉴定染色体上的各基因的具体位点。然而现在只有很少的基于实际序列数据(重复多态性)的染色体标记物可用于标记染色体位置。为了将这些序列与疾病相关基因相关联。第一步就是将本发明DNA序列定位于染色体上。
简而言之,根据cDNA制备PCR引物(优选15-35bp),可以将序列定位于染色体上。然后,将这些引物用于PCR筛选含各条人染色体的体细胞杂合细胞。只有那些含有相应于引物的人基因的杂合细胞会产生扩增的片段。
体细胞杂合细胞的PCR定位法,是将DNA定位到具体染色体的快捷方法。使用本发明的的寡核苷酸引物,通过类似方法,可利用一组来自特定染色体的片段或大量基因组克隆而实现亚定位。可用于染色体定位的其它类似策略包括原位杂交、用标记的流式分选的染色体预筛选和杂交预选,从而构建染色体特异的cDNA库。
将cDNA克隆与中期染色体进行荧光原位杂交(FISH),可以在一个步骤中精确地进行染色体定位。此技术的综述,参见Verma等,Human Chromosomes:a Manual of BasicTechniques,Pergamon Press,New York(1988)。
一旦序列被定位到准确的染色体位置,此序列在染色体上的物理位置就可以与基因图数据相关联。这些数据可见于例如,V.Mckusick,Mendelian Inheritance in Man(可通过与Johns Hopkins University Welch Medical Library联机获得)。然后可通过连锁分析,确定基因与业已定位到染色体区域上的疾病之间的关系。
接着,需要测定患病和未患病个体间的cDNA或基因组序列差异。如果在一些或所有的患病个体中观察到某突变,而该突变在任何正常个体中未观察到,则该突变可能是疾病的病因。比较患病和未患病个体,通常涉及首先寻找染色体中结构的变化,如从染色体水平可见的或用基于cDNA序列的PCR可检测的缺失或易位。
本发明的具有抑癌功能的蛋白核苷酸全长序列或其片段通常可以用PCR扩增法、重组法或人工合成的方法获得。对于PCR扩增法,可根据本发明所公开的有关核苷酸序列,尤其是开放阅读框序列来设计引物,并用市售的cDNA库或按本领域技术人员已知的常规方法所制备的cDNA库作为模板,扩增而得有关序列。当序列较长时,常常需要进行两次或多次PCR扩增,然后再将各次扩增出的片段按正确次序拼接在一起。
一旦获得了有关的序列,就可以用重组法来大批量地获得有关序列。这通常是将其克隆入载体,再转入细胞,然后通过常规方法从增殖后的宿主细胞中分离得到有关序列。
此外,还可用人工合成的方法来合成有关序列,尤其是片段长度较短时。通常,通过先合成多个小片段,然后再进行连接可获得序列很长的片段。
目前,已经可以完全通过化学合成来编码本发明蛋白(或其片段,或其衍生物)的DNA序列。然后可将该DNA序列引入本领域中的各种DNA分子(如载体)和细胞中。此外,还可通过化学合成将突变引入本发明蛋白序列中。
此外,由于本发明的具有抑癌功能的蛋白具有源自人的天然氨基酸序列,因此,与来源于其他物种的同族蛋白相比,预计在施用于人时将具有更高的活性和/或更低的副作用(例如在人体内的免疫原性更低或没有)。
下面结合具体实施例,进一步阐述本发明。应理解,这些实施例仅用于说明本发明而不用于限制本发明的范围。下列实施例中未注明具体条件的实验方法,通常按照常规条件如Sambrook等人,分子克隆:实验室手册(New York:Cold Spring Harbor LaboratoryPress,1989)中所述的条件,或按照制造厂商所建议的条件。
实施例1:cDNA基因的获得及对小鼠NIH/3T3细胞克隆形成的抑制作用
PP11303、PP12899和PP14183是通过用常规方法构建人胎盘cDNA文库获得的;FP504、FP972、FP6628、FP6651和FP7162是通过用常规方法构建人胎儿cDNA文库获得的。取3、6、9月龄的胎盘组织(PP克隆)或胎儿组织(FP克隆),用Trizol试剂(GIBCO BRL公司)按厂方说明书提取总RNA,用mRNA提纯试剂盒(Pharmacia公司)提取mRNA。用pCMV-script TMXR cDNA文库构建试剂盒(Stratagene公司)构建上述mRNA的cDNA文库。其中反转录酶改用MMLV-RT-Superscript II(GIBCO BRL),反转录反应在42℃进行。转化XL 10-Gold感受细胞,获得了1×106cfu/μg滴度的cDNA文库。第一轮随机挑取cDNA克隆,其后以高丰度cDNA克隆和已证明有抑制癌细胞生长功能的cDNA克隆为探针,杂交筛选cDNA文库,挑取弱阳性及阴性克隆。用Qiagen 96孔板质粒抽提试剂盒,按厂家说明书进行质粒DNA的提取。质粒DNA和空载体同时转染小鼠NIH/3T3细胞。100ng DNA酒精沉淀干燥后,加6μl H2O溶解,待转染。每份DNA样品中加0.74μl脂质体及9.3μl无血清培液,混匀后,室温放置10分钟。每管中加150μl无血清培液,均分加入3孔生长于96孔板的小鼠NIH/3T3细胞中,37℃放置2小时,每孔再加50μl无血清培液,37℃24小时。每孔换100μl全培液,37℃24小时,换含G418的全培液100μl,37℃24-48小时,边观察,边换G418浓度不等的培液。约2-3次后,直到镜检细胞有克隆形成,计数。发现上述克隆有抑制NIH/3T3细胞克隆形成作用,结果如下表所示。
cDNA克隆转染细胞(3T3)克隆形成情况
cDNA克隆名称 | cDNA克隆数(三个重复) | 空载体克隆数(三个重复) | ||||
PP11303 | 0 | 0 | 1 | 36 | 34 | 30 |
PP12899 | 6 | 10 | 6 | 36 | 34 | 30 |
PP14183 | 4 | 4 | 3 | 36 | 34 | 30 |
FP504 | 3 | 7 | 4 | 36 | 34 | 30 |
FP972 | 7 | 7 | 9 | 36 | 34 | 30 |
FP6628 | 7 | 10 | 6 | 40 | 38 | 30 |
FP6651 | 1 | 0 | 1 | 40 | 38 | 30 |
FP7162 | 0 | 2 | 1 | 11 | 27 | 16 |
对cDNA克隆采用双脱氧终止法,在ABI377 DNA自动测序仪上测定其一端近500bp的核苷酸序列。分析后,确定为新基因克隆,进行另一端测序,仍未获得全长cDNA序列,设计引物,再次进行测序,直到获得全长序列(SEQ ID NO:1、4、7、10、13、16、19、22)。
实施例2:从胎盘或胎儿cDNA中PCR获得全长基因:
取3、6、9月龄的胎盘组织(PP克隆)或胎儿组织(即克隆),用Trizol试剂(GIBCOBRL公司)按厂方说明书提取总RNA,用mRNA提纯试剂盒(Pharmacia公司)提取mRNA。用MMLV-RT-Superscript II(GIBCO BRL),反转录酶在42℃进行反转录反应,获得胎盘或胎儿cDNA。利用各个基因的特异引物(如下表所示),按97℃3′,1个循环。94℃30″,60℃ 30″,72℃1′,共35个循环;72℃10′,1个循环进行PCR扩增,获得含有完整开放阅读框序列的各蛋白基因的扩增产物。扩增产物经测序验证,与实施例1测得的序列相符,随后用常规技术将扩增产物转入宿主细胞,获得重组蛋白(SEQ ID NO:SEQ ID NO:3、6、9、12、15、18、21、24)。
基因特异引物
克隆名称 | 特异引物1(5′→3′) | 特异引物2(3′→5′) |
PP11303 | (146)GTGCTGGGAGCTGTCGTAA | ATGGGCGTAGGTGAAGGA(2599) |
PP12899 | (76)GGAATGAAGCGATGTAGC | GTATCAACAGGACGACCG(3245) |
PP14183 | (5)GAATCTCACAGCCCTCAC | ACATATTTGACATGTCTGGCAC(2102) |
FP504 | (81)ACCCATCGTATTTGTAAAGC | CCGAGGAAGTGGATGGAGT(4804) |
FP972 | (154)ACCGCTATGTCTGCTCTG | TTATTTCTGACTCAACGGTC(3058) |
FP6628 | (27)CATGCCACCATACTACAG | TCTCGCTCTTAGACAGAG(3083) |
FP6651 | (24)GGTGCTATGAAGAGCGTGTA | CGTTGTCTCCCTCTGAGGT(2432) |
FP7162 | (23)AGGCTGGTCTAGAACTCC | TTTGCACTGCACCACTCG(2385) |
实施例3:cDNA克隆序列分析
1.PP11303
A:核苷酸序列(SEQ ID NO:1)长度:2662个碱基
1 GTGACAGTCC ACGGCCCCGC TGGGATGGAG CCCTGCTGGG TGCCCGCACC GTGCTCAGTG
61 TGGCATGCGG CCCGGGTGTG GAGGGAGACG GTGGAGCATC CCGTGCCTAG CGTGGTGCCA
121 GCCAAGGGCG GGTGGCTGGG GAGCTGTGCT GGGAGCTGTC GTAAACCCGT GGTGGCTTTG
181 ATCCTAGGGC CGTCTTTCTG CTCCACTTCC CGGGCACTGT TCCGAGGGAG GCTCAGGTGG
241 GGAAGCGAGT GAGCCTAAAG CCCAGGCTTG TCTCCTTGGT GCCAGGCCCT GCTTGCTGGA
301 ATCTGGTGAT CTTAGGAGGT CACTGTTGCA AGGGAGGGGA CCCAGGAGCC ACCTAGTCGA
361 ACCTCTTTGT GGTACAGATG GAGAAACCAA GGCCCAAACA GTGGCCCCCT TGATCAGCCA
421 GAAGCAGAGC TGGGTGGGGC AGCAGGGGAT CCCCACCACC AGGCTCAGAC TCCTTCAGGA
481 TGCTTTTGCT TCGCAGATGA GGAGACCGAG GCTTAGAGAA GAGTAGAGAC TTGCTACCCG
541 TTGAGGTGGT AACAGCCAGG CTAGAATCTC CTGAAACGGG GCAGGGTGGG GAGGTCTGGT
601 TGGGCTACCT GGGGCCGGGC GCCTTTCCCC CAGGATGGGG TGTACTGCCC GCCCTCCCCC
661 AGTCATGGTG CTGGTGCCAG CTGGTGCAGG GGGAGGGCTC TGCAGGCCTT AGCACTGAGG
721 CAGGTGGCGA GCAGCAGGGG AAGGGTCTTC TCCACCCACC CCAACTGCCC AAGGTTCCGT
781 GGCTCCTCCT TAGACAGCAG TGAGGGTTGG GGGTGACAGG CAAGCCACTG AGCCTCAGCA
841 CCGCGACTCA CCCCTCCCAC TCAGCAGTCC AGCCAGGGTC ATCCCCAGCC TCAGAGGAGC
901 CTGGGAACAA GGGCAGCGGC AGGGCCGGCG GGGGCCTGGA GGGTGAGCAG GGGCCTTTCT
961 TCCTGCAGAC AGCCCTCAGC GCCTTTTTCA GGAGACCAAC ATCCCCTACA GCCACCATCA
1021 CCACCAGATG GTAAGTGTCC CCGGAGTCCC CAGTTCTGGA TTGGGCGGAA GGAGGCCGAG
1081 CTAGTTCTGT GTATAAGCAG CCCCTGGCCC CGGTCTACGA GGGCGCTGGT GCAGGCGGGG
1141 CTCGACCTCT TTGGAGATGG GTCAGCAGGA GTCCCGGCTC CATGGGTCCT GCACTTAATC
1201 TTGCCTGTGC CAGCTCCCCC TGAGACCTGG GGGGCGCTGG CCTCTGGGGC AATGAAGCTC
1261 CTTACCCTAC AGCCCCCGGG GATGCTGTGG CTGATGGAAA GGGGTGGGCT GGGAAAGCCT
1321 CGTGGCCCCA GGCACCGTGG GCTCCTGAGA GTGAGGCTGG GTCGGTTCAT CTCAAGGCTT
1381 CTCCTCTGGG AACCCCTGGG CGGCGGACAG GCTTGGGGAT CTGGGGAAGG AACACAGAGC
1441 CTTCCGAGAA TGGGCCAGCC ACGCATCTCC CCTTGGGAGG CAGTGGGGGC CCCTCCAGGA
1501 AGGGGTGCTC ACCCCATCTC TCCTCTCTTC CCCTCACAGA TGTGCACCCC CGCCAATACC
1561 CCTGCTACAC CCCCCAACTT CCCTGACGCT CTCACCATGT TCTCCCGTCT CAAGGCCTCC
1621 GAGAGCTTCC ACAGCGGTGG CAGCGGCAGC CCGATGGCCG CGACAGCCAC GTCACCCCCG
1681 CCACACTTCC CCCATGCCGC CACCAGCAGC TCTGCGGCCT CGAGCTGGCC CACGGCGGCC
1741 TCGCCCCCGG GGGGCCCACA GCACCACCAG CCACAGCCGC CCCTGTGGAC TCCAACACCC
1801 CCTTCTCCGG CTTCAGACTG GCCACCCCTG GCCCCCCAAC AGGCCACCTC AGAACCCAGG
1861 GCCCACCCTG CCATGGAGGC AGAGAGATAA GGGAGGCCCC TCCCCCCTCC CGGAGGCCAG
1921 GACCCGTGGG GCGGGGGAGA GGACGTCTCT GCGGGCCCCC TTCACCCCTT TTCTGTCTGC
1981 ACCCCTTGTT CCCCGGAGCC CTGGAGGGGA GAGCGCGGAC TCTAGCCAGG CAGGGACACG
2041 TCTGGTGCCA GAACACGCAG CTGCCCACAC GCAAGGTCAT GGCCCCAGCG GCCCCGGCAC
2101 ATGGAGTGGT TCAGAGCGGC CTGGGTGCCT GGCGGACAGA ACTTCAGAGA CCACGCAGCC
2161 TTCCTTCGAA GACGCACCTG CCCAGCCCAG CCCAGGGGTG CCGTGGAGGA CCACCCTGGC
2221 GGAGACATTG CTGATCCCTG GCTTGGAGCT CCTTGGGGGC CGGCAGGCCT CGAACCCCCA
2281 CCCTAGGGAA TGCAGAGCCT CTCCGCATGT GTGCGCGTGG CCGTGTCTGT GTATTTCTAC
2341 GTGTGTCGCT CTTCAGAAGC AACCTAGTTC CTGGGGCAGC TGGACTTTGC ATGTTAGTGT
2401 GAGCCCCCAG CCCCCTGCCC GCCGCCCCCT CCCCAGGGCC CTGCCTCCTC CCCACCCCCT
2461 CGTCAGCCAG CGTTGCTGTT CCTTGCAGAG AAAAGGATTG TGGGAAACTC CAGGACTCTT
2521 CCCACCGCCT CCCAGCGCCT GCCTGCTGGG GCTGCCTGCA TGCCTCCCCT GCACCTGGGG
2581 GTACCCGCAT CCACTTCCTT TCCCCCTTTT AACAAAAGAG AAGAACGAAT TCCAAAAAAA
2641 AAAAAAAAAA AAAAAAAAAA AA
B:核苷酸序列(SEQ ID NO:3)长度:212个氨基酸
1 MKLLTLQPPG MLWLMERGGL GKPRGPRHRG LLRVRLGRFI SRLLLWEPLG GGQAWGSGEG
61 TQSLPRMGQP RISPWEAVGA PPGRGAHPIS PLFPSQMCTP ANTPATPPNF PDALTMFSRL
121 KASESFHSGG SGSPMAATAT SPPPHFPHAA TSSSAASSWP TAASPPGGPQ HHQPQPPLWT
181 PTPPSPASDW PPLAPQQATS EPRAHPAMEA ER
C.核苷酸及氨基酸组合序列(SEQ ID NO:2)克隆号和蛋白名称:PP11303
起始编码子:1252 ATG终止编码子:1888 TAA蛋白质分子量:22392.19
1 GTG ACA GTC CAC GGC CCC GCT GGG ATG GAG CCC TGC TGG GTG CCC GCA CCG TGC TCA GTG 60
61 TGG CAT GCG GCC CGG GTG TGG AGG GAG ACG GTG GAG CAT CCC GTG CCT AGC GTG GTG CCA 120
121 CCC AAC GGC GGG TGG CTG GGG AGC TGT GCT GGG AGC TGT CGT AAA CCC GTG GTG GCT TTG 180
181 ATC CTA GGG CCG TCT TTC TGC TCC ACT TCC CGG CCA CTC TTC CGA GGG AGG CTC AGG TGG 240
241 GGA AGC GAG TGA GCC TAA AGC CCA GGC TTG TCT CCT TGG TGC CAG GCC CTG CTT GCT GGA 300
301 ATC TGG TGA TCT TAG GAG GTC ACT GTT GCA AGG GAG GGG ACC CAG GAG CCA CCT AGT CGA 360
361 ACC TCT TTG TGG TAC AGA TGG AGA AAC CAA GGC CCA AAC AGT GGC CCC CTT GAT CAG CCA 420
421 GAA GCA GAG CTG GGT GGG GCA GCA GGG GAT CCC CAC CAC CAG GCT CAG ACT CCT TCA GGA 480
481 TGC TTT TGC TTC GCA GAT GAG GAG ACC GAG GCT TAG AGA AGA GTA GAG ACT TGC TAC CCG 540
541 TTG AGG TGG TAA CAG CCA GGC TAG AAT CTC CTG AAA CGG GGC AGG GTG GGG AGG TCT GGT 600
601 TGG GCT ACC TGG GGC CGG GCG CCT TTC CCC CAG GAT GGG GTG TAC TGC CCG CCC TCC CCC 660
661 AGT CAT GGT GCT GGT GCC AGC TGG TGC AGG GGG AGG GCT CTG CAG GCC TTA GCA CTG AGG 720
721 CAG GTG GCG AGC AGC AGG GGA AGG GTC TTC TCC ACC CAC CCC AAC TGC CCA AGG TTC CGT 780
781 GGC TCC TCC TTA GAC AGC AGT GAG GGT TGG GGG TGA CAG GCA AGC CAC TGA GCC TCA GCA 840
841 CCG CGA CTC ACC CCT CCC ACT CAG CAG TCC AGC CAG GGT CAT CCC CAG CCT CAG AGG AGC 900
901 CTG GGA ACA AGG GCA GCG GCA GGG CCG GCG GGG GCC TGG AGG GTG AGC AGG GGC CTT TCT 960
961 TCC TGC AGA CAG CCC TCA GCG CCT TTT TCA GGA GAC CAA CAT CCC CTA CAG CCA CCA TCA 10201021 CCA CCA GAT GGT AAG TGT CCC CGG AGT CCC CAG TTC TGG ATT GGG CGG AAG GAG GCC GAG 10801081 CTA GTT CTG TGT ATA AGC AGC CCC TGG CCC CGG TGT ACG AGG GCG CTG GTG CAG GCG GGG 11401141 CTC GAC CTC TTT GGA GAT GGG TCA GCA GGA GTC CCG GCT CCA TGG GTC CTG CAC TTA ATC 12001201 TTG CCT GTG CCA GCT CCC CCT GAG ACC TGG GGG GCG CTG GCC TCT GGG GCA ATG AAG CTC 1260
1 Met Lys Leu 31261 CTT ACC CTA CAG CCC CCG GGG ATG CTG TGG CTG ATG GAA AGG GGT GGG CTG GGA AAG CCT 1320
4 Leu Thr Leu Gln Pro Pro Gly Met Leu Trp Leu Met Glu Arg Gly Gly Leu Gly Lys Pro 231321 CGT GGC CCC AGG CAC CGT GGG CTC CTG AGA GTG AGG CTG GGT CGG TTC ATC TCA AGG CTT 1380
24 Arg Gly Pro Arg His Arg Gly Leu Leu Arg Val Arg Leu Gly Arg Phe Ile Ser Arg Leu 431381 CTC CTC TGG GAA CCC CTG GGC GGC GGA CAG GCT TGG GGA TCT GGG GAA GGA ACA CAG AGC 1440
44 Leu Leu Trp Glu Pro Leu Gly Gly Gly Gln Ala Trp Gly Ser Gly Glu Gly Thr Gln Ser 631441 CTT CCG AGA ATG GGC CAG CCA CGC ATC TCC CCT TGG GAG GCA GTG GGG GCC CCT CCA GGA 1500
64 Leu Pro Arg Met Gly Gln Pro Arg Ile Ser Pro Trp Glu Ala Val Gly Ala Pro Pro Gly 831501 AGG GGT GCT CAC CCC ATC TCT CCT CTC TTC CCC TCA CAG ATG TGC ACC CCC GCC AAT ACC 1560
84 Arg Gly Ala His Pro Ile Ser Pro Leu Phe Pro Ser Gln Met Cys Thr Pro Ala Asn Thr 1031561 CCT GCT ACA CCC CCC AAC TTC CCT GAC GCT CTC ACC ATG TTC TCC CGT CTC AAG GCC TCC 1620
104 Pro Ala Thr Pro Pro Asn Phe Pro Asp Ala Leu Thr Met Phe Ser Arg Leu Lys Ala Ser 1231621 GAG AGC TTC CAC AGC GGT GGC AGC GGC AGC CCG ATG GCC GCG ACA GCC ACG TCA CCC CCG 1680
124 Glu Ser Phe His Ser Gly Gly Ser Gly Ser Pro Met Ala Ala Thr Ala Thr Ser Pro Pro 1431681 CCA CAC TTC CCC CAT GCC GCC AGC AGC AGC TCT GCG GCC TCC AGC TGG CCC ACG GCG GCC 1740
144 Pro His Phe Pro His Ala Ala Thr Ser Ser Ser Ala Ala Ser Ser Trp Pro Thr Ala Ala 1631741 TCG CCC CCG GGG GGC CCA CAG CAC CAC CAG CCA CAG CCG CCC CTG TGG ACT CCA ACA CCC 1800
164 Ser Pro Pro Gly Gly Pro Gln His His Gln Pro Gln Pro Pro Leu Trp Thr Pro Thr Pro 183
1801 CCT TCT CCG GCT TCA GAC TGG CCA CCC CTG GCC CCC CAA CAG GCC ACC TCA GAA CCC AGG 1860
184 Pro Ser Pro Ala Ser Asp Trp Pro Pro Leu Ala Pro Gln Gln Ala Thr Ser Glu Pro Arg 203
1861 GCC CAC CCT GCC ATG GAG GCA CAG AGA TAA GGG AGG CCC CTC CCC CCT CCC GGA GGC CAG 1920
204 Ala His Pro Ala Met Glu Ala Glu Arg *** 213
1921 GAC CCG TGG GGC GGG GGA GAG GAC GTC TCT GCG GGC CCC CTT CAC CCC TTT TCT GTC TGC 1980
1981 ACC CCT TGT TCC CCG GAG CCC TGG AGG GGA GAG CGC GGA CTC TAG CCA GGC AGG GAC ACG 2040
2041 TCT GGT GCC AGA ACA CGC AGC TGC CCA CAC GCA AGG TCA TGG CCC CAG CGG CCC CGG CAC 2100
2101 ATG GAG TGG TTC AGA GCG GCC TGG GTG CCT GGC GGA CAG AAC TTC AGA GAC CAC GCA GCC 2160
2161 TTC CTT CGA AGA CGC ACC TGC CCA GCC CAG CCC AGG GGT GCC GTG GAG GAC CAC CCT GGC 2220
2221 GCA GAC ATT GCT GAT CCC TGG CTT CGA GCT CCT TGG GGG CCG GCA GGC CTC GAA CCC CCA 2280
2281 CCC TAG GGA ATG CAG AGC CTC TCC GCA TGT GTG CGC GTG GCC GTG TCT GTG TAT TTC TAC 2340
2341 GTG TGT CGC TCT TCA GAA GCA ACC TAG TTC CTG GGG CAG CTG GAC TTT GCA TGT TAG TGT 2400
2401 GAG CCC CCA GCC CCC TGC CCG CCG CCC CCT CCC CAG GGC CCT GCC TCC TCC CCA CCC CCT 2460
2461 CGT CAG CCA GCG TTG CTG TTC CTT GCA GAG AAA AGG ATT GTG GGA AAC TCC AGG ACT CTT 2520
2521 CCC ACC GCC TCC CAG CGC CTG CCT GCT GGG GCT GCC TGC ATG CCT CCC CTG CAC CTG GGG 2580
2581 GTA CCC GCA TCC ACT TCC TTT CCC CCT TTT AAC AAA AGA GAA GAA CGA ATT CCA AAA AAA 2640
2641 AAA AAA AAA AAA AAA AAA AAA A 2662
2. PP12899
A:核苷酸序列(SEQ ID NO:4)长度:3325个碱基
1 GGCCGCGCGA GGGTGGTGGG CATCGAGGTC CCAGCAGCGG ACGAGGGAGG TGCCGCCGTC
61 GCCCAGGATG GGCTGGGAAT GAAGCGATGT AGCCTTTTAA GAGATTTGCT CTGACCCATC
121 TGAAGTCCAT ATGGCTCTGT ATGATGAAGA CCTCCTGAAA AATCCTTTCT ATCTGGCTCT
181 GCAAAAGTGC CGCCCTGACT TGTGCAGCAA AGTGGCCCAA ATCCATGGCA TTGTCTTAGT
241 ACCCTGCAAA GGAAGCCTGT CGAGCAGCAT CCAGTCTACT TGTCAGTTTG AGTCCTACAT
301 TTTGATACCT GTGGAAGAGC ATTTTCAGAC CTTAAATGGA AAGGATGTCT TTATTCAAGG
361 GAACAGGATT AAATTAGGAG CTGGTTTTGC CTGTCTTCTC TCAGTGCCCA TTCTCTTTGA
421 AGAAACTTTC TACAATGAAA AAGAAGAGAG TTTCAGCATC CTGTGTATAG CCCATCCTTT
481 GGAAAACAGA GAGAGTTCAG AAGAGCCTTT GGCACCCTCA GATCCCTTTT CCCTGAAAAC
541 CATTGAAGAT GTGAGAGAGT TCTTGGGAAG ACACTCCGAG CGATTTGACA GGAACATCGC
601 CTCTTTCCTA ATCGAACATT CCGAGAATGC GAGAGAAAGA CCCTCCGTCA CCACATAGAC
661 TCAGCGAATG CTCTCTACAC CAAATGCCTC CAGCAGCTTC TGAGGGACTC TCACCTGAAA
721 ATGCTCGCCA AGCAGGAGGC CCAGATGAAC CTGATGAAGC AGGCAGTGGA GATATACGTC
781 CATCATGAAA TTTACAACCT GATCTTTAAA TACGTGGGGA CCATGGAGGC AAGTGAGGAT
841 GCGGCCTTTA ACAAAAATCA CAAGAAGCCT TCAAGATCTT CAGCAGAAAG ATATTGGTGT
901 GAAACCGGAG TTCAGCTTTA ACATACCTCG TGCCAAAAGA GAGCTGGCTC AGCTGAACAA
961 ATGCACCTCC CCACAGCAGA AGCTTGTCTG CTTGCGAAAA GTGGTGCAGC TCATTACACA
1021 GTCTCCAAGC CAGAGAGTGA ACCTGGAGAC CATGTGTGCT GATGATCTGC TATCAGTCCT
1081 GTTATACTTG CTTGTGAAAA CGGAGATCCC TAATTGGATG GCAAATTTGA GTTACATCAA
1141 AAACTTCAGG TTTAGCAGCT TGGCAAAGGA TGAACTGGGA TACTGCCTGA CCTCATTCGA
1201 AGCTGCCATT GAATATATTC GGCAAGGAAG CCTCTCTGCT AAACCCCCTG TAAGATCTCA
1261 CCCCTGCCCT GGCCTTCCTT TGTGGGCATC ATGGTTCCCT TGATAGGGTG CTGGGGTTGG
1321 TATGTGGGCA GACGGATTCT TAAATTGCCT CCCAGGAATG GGGCCTCAGC TGTTTGAGGG
1381 CTGTGAGTCT TAAAAATCAC TCAGTGAAGA GAACACCAAG CCCCCAATTG GTGGTAAAAA
1441 TTGGTGGGTT ATCATTGGGA TTTACATTGT TAATATCCTA CTTCATTAGT CCCCATCCTC
1501 TCCAAAGACA TGTGGGTGCA AAGGGAAGCC AGAAGTAGGG AATTTGGATT TCTTGACCTT
1561 GATAGTCAAG AAGTGATGTC ACGGGATCCC TGGACTGTCG CTTTTCCAGC CGGAAACCTC
1621 TGTGGCTGGT GGCTCCTTTG CCTGAGTTTT GTTCGGGCCT GCTGGGCTCA TTTCACGCTC
1681 TTGGCCTGGC AGGCTGCGCT CGGCTTGTGC TACTGGCCTG GATCCCATGC CTGCCAAGGG
1741 CGAGCCAGGT GTGGAGTGGC GAGGGGTATG TGAGCAAGTG CAGGGTCTGG CCACTGCACA
1801 CAACCAGGTG TGCCGACTGA GGTGGGGTGG GCAGCTCCAA GTTGCTTGTA CAGGGTCCTG
1861 CTCCATGCAA GGCTGCAGCT AGAGCAGGCG TACTGTAGGC CGCTTCCACG GTGGGCACTG
1921 GGGAACACAG TGGGGCCTGG AAGCTTGGAG ACACCAGGAA CTGCAGAGCC CCAAAGAGGG
1981 TGTCATAGCC CTGGCTCGGG GAACTCCTAG GTTGGGCTCC CTGAAGGGCC AGAGCTCTTC
2041 TCTCCTCTCG TCACCTGCAA TGTAGTGAGT CGGGAGCATG TTTTAGCTCT CTTTATGTTA
2101 CAGCTCTTTC AGTCCTGCCA TTTGGTGGGT CCCGAGTTCT TGTCCCATGT CGAGGAAGAA
2161 TGAGGTACGT AGACTAGTGG AGGGTGAGCA AGGCAGAGAG GAGCTTTACT GAACGGCAGA
2221 ATAGCTCTCA GGAGACCCAC AGTGGGCAGC TTCTTTCCAC AGGCAGGTCG TCCTGACGAG
2281 TTAAAGAGGC CTGACGTAGG TAGCTCCTTC CTGCAGTTGG TAGTCCCGAC ATCTGTCTGA
2341 GTCTGGCTGA GTCCGGGGTT TTTTATGGCT CAGAAGGGAG GGAGTATGTG CTGATTGGTC
2401 CATAGGTGGG CCTGGAGAAA GCACCATGAG TTCTCAGTCT GGGCCGTGGA CTCCACTTGG
2461 AACTGACAGC CCAGCCCCCA GGCTTTAGGC TGTCCCTGTC TTGAAGGTGG GGCTTCACTG
2521 GGCACCTGCA CCTTTCCACC CAGAAGCGTG TCTGCCTTCT GCCACCATCA ACATGCTGGC
2581 CAGTGCATCC AGGCTGTTTG TGCCAAGGGG CATCTGCAGG CCTGCACTGA GCTGCCCTCA
2641 GCCCCTACCT TGACTACTCT CCCATGCTCA TCAGCGCCCA AAATCTTGGA GGGGCTGAGG
2701 CATCAGGAGG CTGGTGTGTC AGTGTCACAC CAAGCATGTG CACACATGGC TGGGTTGCAA
2761 CAGTACCCGG GCTTGGCCTC AGCTTTGCTC TGAAATTGAA GTCGGTGCCA GGAGTGGGGA
2821 GGAGCGGGAG CAGGCACTTA CGAGCCTGCG GCGGCAGGGA TGCTTCCTGG GCCCCTGAGA
2881 GTGCAGAGAT TCCTGGATCC AGAGCTGCGG CTGGGCGGCT GCAGCTGCGC CTGGGAGTGC
2941 AGGGCTCCCG CCCTGCCAGC TCAGTAGGAG ATGGGGGCTC CTGCCTATTC CTGGCTCCTG
3001 TTGGCCCTGC AGAGTGCACA ACCCTGGCCG CGCTTCCTCC ACTGCAGCTT ACGTCTTTGC
3061 AGCAGCCACT CCCGATGGGC TGCCACTGCC ATCTGTGAGA CAATTAATGT GTGCAATTTG
3121 AGGACTCAGT GGCCTTGCCA TTGTTTCCCT TGGTTTTTAT TGAGCATTGG CTGGGGTCGG
3181 CGAGGGGATG TGATTATATT TCTATGTGAA TCGTGAGAAT CTTGAACCAT AGTTGTCCTG
3241 CTGGCCTGTT TTACTACATA CCAATGAGTA AAATGTGATC ATACAGAAAT CACAAAGTTG
3301 AAATCCTAAA AAAAAAAAAA AAAAA
B:核苷酸序列(SEQ ID NO:6)长度:175个氨基酸
1 MALYDEDLLK NPFYLALQKC RPDLCSKVAQ IHGIVLVPCK GSLSSSIQST CQPESYILIP
61 VEEHFQTLNG KDVFIQGNRI KLGAGFACLL SVPILFEETF YNEKEESFSI LCIAHPLEKR
121 ESSEEPLAPS DPFSLKTIED VREFLGRHSE RFDRNIASFL IEHSENARER ASVTT
C.核苷酸及氨基酸组合序列(SEQ ID NO:5)克隆号和蛋白名称:PP12899
起始编码子:131 ATG终止编码子:656 TAG蛋白质分子量:19828.54
1 G GCC GCG CGA GGG TGG TGG GCA TCG AGG TCC CAG CAG CGG ACG AGG GAG GTG CCG CCG 58
59 TCG CCC AGG ATG GGC TGG GAA TGA AGC GAT GTA GCC TTT TAA GAG ATT TGC TCT GAC CCA 118
119 TCT GAA GTC CAT ATG GCT CTG TAT GAT GAA GAC CTC CTG AAA AAT CCT TTC TAT CTG GCT 178
1 Met Ala Leu Tyr Asp Glu Asp Leu Leu Lys Asn Pro Phe Tyr Leu Ala 16
179 CTG CAA AAG TGC CGC CCT GAC TTG TGC AGC AAA GTG GCC CAA ATC CAT GGC ATT GTC TTA 238
17 Leu Gln Lys Cys Arg Pro Asp Leu Cys Ser Lys Val Ala Gln Ile His Gly Ile Val Leu 36
239 GTA CCC TGC AAA GGA AGC CTG TCG AGC AGC ATC CAG TCT ACT TGT CAG TTT GAG TCC TAC 298
37 Val Pro Cys Lys Gly Ser Leu Ser Ser Ser Ile Gln Ser Thr Cys Gln Phe Glu Ser Tyr 56
299 ATT TTG ATA CCT GTG GAA GAG CAT TTT CAG ACC TTA AAT GGA AAG GAT GTC TTT ATT CAA 358
57 Ile Leu Ile Pro Val Glu Glu His Phe Gln Thr Leu Asn Gly Lys Asp Val Phe Ile Gln 76
359 GGG AAC AGG ATT AAA TTA GGA GCT GGT TTT GCC TGT CTT CTC TCA GTG CCC ATT CTC TTT 418
77 Gly Asn Arg Ile Lys Leu Gly Ala Gly Phe Ala Cys Leu Leu Ser Val Pro Ile Leu Phe 96
419 GAA GAA ACT TTC TAC AAT GAA AAA GAA GAG AGT TTC AGC ATC CTG TGT ATA GCC CAT CCT 478
97 Glu Glu Thr Phe Tyr Asn Glu Lys Glu Glu Ser Phe Ser Ile Leu Cys Ile Ala His Pro 116
479 TTG GAA AAG AGA GAG AGT TCA GAA GAG CCT TTG GCA CCC TCA GAT CCC TTT TCC CTG AAA 538
117 Leu Glu Lys Arg Glu Ser Ser Glu Glu Pro Leu Ala Pro Ser Asp Pro Phe Ser Leu Lys 136
539 ACC ATT GAA GAT GTG AGA GAG TTC TTG GGA AGA CAC TCC GAG CGA TTT GAC AGG AAC ATC 598
137 Thr Ile Glu Asp Val Arg Glu Phe Leu Gly Arg His Ser Glu Arg Phe Asp Arg Asn Ile 156
599 GCC TCT TTC CTA ATC GAA CAT TCC GAG AAT GCG AGA GAA AGA GCC TCC GTC ACC ACA TAG 658
157 Ala Ser Phe Leu Ile Glu His Ser Glu Asn Ala Arg Glu Arg Ala Ser Val Thr Thr *** 176
659 ACT CAG CGA ATG CTC TCT ACA CCA AAT GCC TCC AGC AGC TTC TGA GGG ACT CTC ACC TGA 718
719 AAA TGC TCG CCA AGC AGG AGG CCC AGA TGA ACC TGA TGA AGC AGG CAG TGG AGA TAT ACG 778
779 TCC ATC ATG AAA TTT ACA ACC TGA TCT TTA AAT ACG TGG GGA CCA TGG AGG CAA GTG AGG 838
839 ATG CGG CCT TTA ACA AAA ATC ACA AGA AGC CTT CAA GAT CTT CAG CAG AAA GAT ATT GGT 898
899 GTG AAA CCG GAG TTC AGC TTT AAC ATA CCT CGT GCC AAA AGA GAG CTG GCT CAG CTG AAC 958
959 AAA TGC ACC TCC CCA CAG CAG AAG CTT GTC TGC TTG CGA AAA GTG GTG CAG CTC ATT ACA 1018
1019 CAG TCT CCA AGC CAG AGA GTG AAC CTG GAG ACC ATG TGT GCT GAT GAT CTG CTA TCA GTC 1078
1079 CTG TTA TAC TTG CTT GTG AAA ACG GAG ATC CCT AAT TGG ATG GCA AAT TTG AGT TAC ATC 1138
1139 AAA AAC TTC AGG TTT AGC AGC TTG GCA AAG GAT GTG CTG GGA TAC TGC CTG ACC TCA TTC 1198
1199 GAA GCT GCC ATT GAA TAT ATT CGG CAA GGA AGC CTC TCT GCT AAA CCC CCT GTA AGA TCT 1258
1259 CAC CCC TGC CCT GGC CTT CCT TTG TGG GCA TCA TGG TTC CCT TGA TAG GGT GCT GGG GTT 1318
1319 GGT ATG TGG GCA GAC GGA TTC TTA AAT TGC CTC CCA GGA ATG GGG CCT CAG CTG TTT GAG 1378
1379 GGC TGT GAG TCT TAA AAA TCA CTC AGT GAA GAG AAC ACC AAG CCC CCA ATT GGT GGT AAA 1438
1439 AAT TGG TGG GTT ATC ATT GGG ATT TAC ATT GTT AAT ATC CTA CTT CAT TAG TCC CCA TCC 1498
1499 TCT CCA AAG ACA TGT GGG TGC AAA GGG AAG CCA GTA GTA GGG AAT TTG GAT TTC TTG ACC 1558
1559 TTG ATA GTC AAG AAG TGA TGT CAC GGG ATC CCT GGA CTG TCG CTT TTC CAG CCG GAA ACC 1618
1619 TCT GTG GCT GGT GGC TCC TTT GCC TGA GTT TTG TTC GGG CCT GCT GGG CTC ATT TCA CGC 1678
1679 TCT TGG CCT GGC AGG CTG CGC TCG GCT TGT GCT ACT GGC CTG GAT CCC ATG CCT GCC AAG 1738
1739 GGC GAG CCA GGT GTG GAG TGG CGA GGG GTA TGT GAG CAA GTG CAG GGT CTG GCC ACT GCA 1798
1799 CAC AAC CAG GTG TGC CGA CTG AGG TGG GGT GGG CAG CTC CAA GTT GCT TGT ACA GGG TCC 1858
1859 TGC TCC ATG CAA GGC TGC AGC TAG AGC AGG CGT ACT GTA GGC CGC TTC CAC GGT GGG CAC 1918
1919 TGG GGA ACA CAG TGG GGC CTG GAA GCT TGG AGA CAC CAG GAA CTG CAG AGC CCC AAA GAG 1978
1979 GGT GTC ATA GCC CTG GCT CGG GGA ACT CCT AGG TTG GGC TCC CTG AAG GGC CAG AGC TCT 2038
2039 TCT CTC CTC TCG TCA CCT GCA ATG TAG TGA GTC GGG AGC ATG TTT TAG CTC TCT TTA TGT 2098
2099 TAC AGC TCT TTC AGT CCT GCC ATT TGG TGG GTC CCG AGT TCT TGT CCC ATG TCG AGG AAG 2158
2159 AAT GAG GTA CGT AGA CTA GTG GAG GCT GAG CAA GGC AGA GAG GAG CTT TAC TGA ACG GCA 2218
2219 GAA TAG CTC TCA GGA GAC CCA CAG TGG GCA GCT TCT TTC CAC AGG CAG GTC GTC CTG ACG 2278
2279 AGT TAA AGA GGC CTG ACG TAG GTA GCT CCT TCC TGC AGT TGG TAG TCC CGA CAT CTG TCT 2338
2339 GAG TCT GGC TGA GTC CGG GGT TTT TTA TGG CTC AGA AGG GAG GGA GTA TGT GCT GAT TGG 2398
2399 TCC ATA GGT GGG CCT GGA GAA AGC ACC ATG AGT TCT CAG TCT GGG CCG TGG ACT CCA CTT 2458
2459 GGA ACT GAC AGC CCA GCC CCC AGG CTT TAG GCT GTC CCT GTC TTG AAG GTG GGG CTT CAC 2518
2519 TGG GCA CCT GCA CCT TTC CAC CCA GAA GCG TGT CTG CCT TCT GCC ACC ATC AAC ATG CTG 2578
2579 GCC AGT GCA TCC AGG CTG TTT GTG CCA AGG GGC ATC TGC AGG CCT GCA CTG AGC TGC CCT 2638
2639 CAG CCC CTA CCT TGA CTA CTC TCC CAT GCT CAT CAG CGC CCA AAA TCT TGG AGG GGC TGA 2698
2699 GGC ATC AGG AGG CTG GTG TGT CAG TGT CAC ACC AAG CAT GTG CAC ACA TGG CTG GGT TGC 2758
2759 AAC AGT ACC CGG GCT TGG CCT CAG CTT TGC TCT GAA ATT GAA GTC GGT GCC AGG AGT GGG 2818
2819 GAG GAG CGG GAG CAG GCA CTT ACG AGC CTG CGG CGG CAG GGA TGC TTC CTG GGC CCC TGA 2878
2879 GAG TGC AGA GAT TCC TGG ATC CAG AGC TGC GGC TGG GCG GCT GCA GCT GCG CCT GGG AGT 2938
2939 GCA GGG CTC CCG CCC TGC CAG CTC AGT AGG AGA TGG GGG CTC CTG CCT ATT CCT GGC TCC 2998
2999 TGT TGG CCC TGC AGA GTG CAC AAC CCT GGC CGC GCT TCC TCC ACT GCA GCT TAC GTC TTT 3058
3059 GCA GCA GCC ACT CCC GAT GGG CTG CCA CTG CCA TCT GTG AGA CAA TTA ATG TGT GCA ATT 3118
3119 TGA GGA CTC AGT GGC CTT GCC ATT GTT TCC CTT GGT TTT TAT TGA GCA TTG GCT GGG GTC 3178
3179 GGC GAG GGG ATG TGA TTA TAT TTC TAT GTG AAT CGT GAG AAT CTT GAA CCA TAG TTG TCC 3238
3239 TGC TGG CCT GTT TTA CTA CAT ACC AAT GAG TAA AAT GTG ATC ATA CAG AAA TCA CAA AGT 3298
3299 TGA AAT CCT AAA AAA AAA AAA AAA AAA 3325
3. PP14183
A:核苷酸序列(SEQ ID NO:7)长度:2154个碱基
1 GGGGGAATCT CACAGCCCTC ACCTACCTCA ACCTCAGCCG AAACCAGCTG TCGCTGCTGC
61 CACCCTACAT CTGCCAGCTG CCCCTGAGGG TCCTCATCGT CAGCAACAAC AAGCTGGGAG
121 CCCTGCCCCC TGACATCGGC ACCCTGGGAA GCCTGCGACA GCTTGACGTG AGCAGCAACG
181 AGCTCCAATC CCTGCCCTCG GAACTGTGTG GCCTCTCTTC CCTGCGGGAC CTCAATGTCC
241 GGAGGAACCA GCTCAGTACG CTGCCCGAAG AGCTGGGGGA CCTCCCTCTG GTCCCCTGGA
301 TTTCTCCTGT AACCGCGTCT CCCGAATCCC AGTCTCCTTC TGCCGCCTGA GGCACCTGCA
361 GGTCATTCTG CTGGACAGCA ACCCTCTGCA GAGTCCACCT GCCCAGGTCT GCCTGAAGGG
421 GAAACTTCAC ATCTTCAAGT ATTTGTCCAC AGAGGCCGGG CAGCGTGGGT CGGCCCTGGG
481 GGACCTGGCC CCTTCTCGGC CCCCGAGTTT CAGTCCCTGC CCTGCAGAGG ATCTATTTCC
541 GGGACATCGG TACGATGGTG GGCTGGACTC AGGCTTCCAC AGCGTTGATA GTGGCAGCAA
601 GAGGTGGTCT GGAAATGAGT CAACAGATGA ATTTTCAGAG CTGTCATTCC GGATCTCAGA
661 GCTGGCCCGG GAGCCCCGGG GACCCAGAGA ACGCAAGGAG GATGGCTCAG CGGACGGAGA
721 CCCTGTGCAG ATTGACTTCA TCGACAGCCA TGTCCCCGGG GAGGATGAAG AGCGAGGCAC
781 TGTGGAGGAG CAGCGACCAC CCGAATTAAG CCCTGGGGCA GGGGACAGGG AGAGGGCACC
841 AAGCAGCAGG CGGGAGGAGC CGGCAGGGGA GGAGCGGCGG CGCCCGGACA CCTTGCAGCT
901 GTGGCAGGAG CGGGAACGGC GGCAGCAGCA GCAGAGCGGG GCGTGGGGGG CCCCGAGGAA
961 GGATAGCGGC TCGCCTAAGT CCAGTGCCTC CCAAGCAGGG GCTGCAGCGG GGCAGGGAGC
1021 CCCCGCCCCT GCCCCTGCCT CCCAAGAGCC CCTTCCCATA GCTGGACCAG CGACAGCACC
1081 CTGCTCCACG GCCACTTGGC TCCATTCAGA GACCAAACAG CTTCCTCTTC CGTTCCTCCT
1141 CTCAGAGTGG CTCAGGCCCT TCCTCACCAG ACTCTGTCCT GAGACCTCGG CGGTACCCCC
1201 AGGTTCCAGA TGAGAAGGAC TTAATGACTC AGCTGCGCCA GGTCCTTGAG TCCCGGCTGC
1261 AGCGGCCCCT GCCTGAGGAC CTGGCGAGGC TCTGGCCAAG TGGGGTCATC CTGTGCCAGC
1321 TGGCCAACCA GCTACGGCCG CGCTCCGTGC CCTTCATCCA TGTGCCCTCC CCTGCTGTGC
1381 CAAAACTCAG TGCCCTCAAG GCTCGGAAGA ATGTGGAGAG TTTTCTAGAA GCCTGTCGAA
1441 AAATGGGGGT GCCTGAGGCT GACCTGTGCT GGCCCTCGGA TCTCCTCCAG GGCACTGCCC
1501 GGGGGCTGCG GACCGCGCTG GAGGCCGTGA AGCGGGTGGG GGGCAAGGCC CTACCGCCCC
1561 TCTGGCCCCC CTCTGGTCTG GGCGGCTTCG TCGTCTTCTA CGTGGTCCTC ATGCTGCTGC
1621 TCTATGTCAC CTACACTCGG CTCCTGGGTT CCTAGGCCCC AAAATCGGCC CTCCCTCACC
1681 CCTTTCCCTT CCTCTCTATT TATAAGGTCC CTGCTCCACC CGACCCCACC TGCGGTGCCT
1741 TCAGCCCCAA CCAAAGACAC TAGTGCACCC CCTTCACAGA CACTGACCTC AGAGGCCCCA
1801 CTCTGGTGCC CCCAGACCCT GGGCCCCCAG CCTCTGGCCT CCCTCCAGTA GCCCCACGAG
1861 TCCCCACCTC TCAGTGCTGA CGGTGCCTTC ATGTCCCCGC CGGCCCTGCC CCTGCCCTCT
1921 GTACCCCGTG AGGGGTGGCA GGAGCTGGAG TCTCCCCCTT CCTCCTGTGC CCTCCCCTTC
1981 CCCCCCCAAC AGCTGCTATG GGGGGGCTAA ATTATCTCTA TTTTGTAGAG AGGATCTATA
2041 TTTGTAGGGG TTCGGGGCCC AGGCCGGGTC CCTATCTCTG TGTATAAACT GTACAGACCG
2101 TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA
B:核苷酸序列(SEQ ID NO:9)长度:143个氨基酸
1 MTQLRQVLES RLQRPLPEDL ARLWPSGVIL CQLANQLRPR SVPFIHVPSP AVPKLSALKA
61 RKNVESFLEA CRKMGVPEAD LCSPSDLLQG TARGLRTALE AVKRVGGKAL PPLWPPSGLG
121 GFVVFYVVLM LLLYVTYTRL LGS
C.核苷酸及氨基酸组合序列(SEQID NO:8)克隆号和蛋白名称:PP14183
起始编码子:1224 ATG终止编码子:1653 TAG蛋白质分子量:15712.83
1 GG GGG AAT CTC ACA GCC CTC ACC TAC CTC AAC CTC AGC CGA AAC CAG CTG TCG CTG CTG 59
60 CCA CCC TAC ATC TGC CAG CTG CCC CTG AGG GTC CTC ATC GTC AGC AAC AAC AAG CTG GGA 119
120 GCC CTG CCC CCT GAC ATC GGC ACC CTG GGA AGC CTG CGA CAG CTT GAC GTG AGC AGC AAC 179
180 GAG CTC CAA TCC CTG CCC TCG GAA CTG TGT GGC CTC TCT TCC CTG CGG GAC CTC AAT GTC 239
240 CGG AGG AAC CAG CTC AGT ACG CTG CCC GAA GAG CTG GGG GAC CTC CCT CTG GTC CCC TGG 299
300 ATT TCT CCT GTA ACC GCG TCT CCC GAA TCC CAG TCT CCT TCT GCC GCC TGA GGC ACC TGC 359
360 AGG TCA TTC TGC TGG ACA GCA ACC CTC TGC AGA GTC CAC CTG CCC AGG TCT GCC TGA AGG 419
420 GGA AAC TTC ACA TCT TCA AGT ATT TGT CCA CAG AGG CCG GGC AGC GTG GGT CGG CCC TGG 479
480 GGG ACC TGG CCC CTT CTC GGC CCC CGA GTT TCA GTC CCT GCC CTG CAG AGG ATC TAT TTC 539
540 CGG GAC ATC GGT ACG ATG GTG GGC TGG ACT CAG GCT TCC ACA GCG TTG ATA GTG GCA GCA 599
600 AGA GGT GGT CTG GAA ATG AGT CAA CAG ATG AAT TTT CAG AGC TGT CAT TCC GGA TCT CAG 659
660 AGC TGG CCC GGG AGC CCC GGG GAC CCA GAG AAC GCA AGG AGG ATG GCT CAG CGG ACG GAG 719
720 ACC CTG TGC AGA TTG ACT TCA TCG ACA GCC ATG TCC CCG GGG AGG ATG AAG AGC GAG GCA 779
780 CTG TGG AGG AGC AGC GAC CAC CCG AAT TAA GCC CTG GGG CAG GGG ACA GGG AGA GGG CAC 839
840 CAA GCA GCA GGC GGG AGG AGC CGG CAG GGG AGG AGC GGC GGC GCC CGG ACA CCT TGC AGC 899
900 TGT GGC AGG AGC GGG AAC GGC GGC AGC AGC AGC AGA GCG GGG CGT GGG GGG CCC CGA GGA 959
960 AGG ATA GCG GCT CGC CTA AGT CCA GTG CCT CCC AAG CAG GGG CTG CAG CGG GGC AGG GAG 1019
1020 CCC CCG CCC CTG CCC CTG CCT CCC AAG AGC CCC TTC CCA TAG CTG GAC CAG CGA CAG CAC 1079
1080 CCT GCT CCA CGG CCA CTT GGC TCC ATT CAG AGA CCA AAC AGC TTC CTC TTC CGT TCC TCC 1139
1140 TCT CAG AGT GGC TCA GGC CCT TCC TCA CCA GAC TCT GTC CTG AGA CCT CGG CGG TAC CCC 1199
1200 CAG GTT CCA GAT GAG AAG GAC TTA ATG ACT CAG CTG CGC CAG GTC CTT GAG TCC CGG CTG 1259
1 Met Thr Gln Leu Arg Gln Val Leu Glu Ser Arg Leu 12
1260 CAG CGG CCC CTG CCT GAG GAC CTG GCG AGG CTC TGG CCA AGT GGG GTC ATC CTG TGC CAG 1319
13 Gln Arg Pro Leu Pro Glu Asp Leu Ala Arg Leu Trp Pro Ser Gly Val Ile Leu Cys Gln 32
1320 CTG GCC AAC CAG CTA CGG CCG CGC TCC GTG CCC TTC ATC CAT GTG CCC TCC CCT GCT GTG 1379
33 Leu Ala Asn Gln Leu Arg Pro Arg Ser Val Pro Phe Ile His Val Pro Ser Pro Ala Val 52
1380 CCA AAA CTC AGT GCC CTC AAG GCT CGG AAG AAT GTG GAG AGT TTT CTA GAA GCC TGT CGA 1439
53 Pro Lys Leu Ser Ala Leu Lys Ala Arg Lys Asn Val Glu Ser Phe Leu Glu Ala Cys Arg 72
1440 AAA ATG GGG GTG CCT GAG GCT GAC CTG TGC TCG CCC TCG GAT CTC CTC CAG GGC ACT GCC 1499
73 Lys Met Gly Val Pro Glu Ala Asp Leu Cys Ser Pro Ser Asp Leu Leu Gln Gly Thr Ala 92
1500 CGG GGG CTG CGG ACC GCG CTG GAG GCC GTG AAG CGG GTG GGG GGC AAG GCC CTA CCG CCC 1559
93 Arg Gly Leu Arg Thr Ala Leu Glu Ala Val Lys Arg Val Gly Gly Lys Ala Leu Pro Pro 112
1560 CTC TGG CCC CCC TCT GGT CTG GGC GGC TTC GTC GTC TTC TAC GTG GTC CTC ATG CTG CTG 1619
113 Leu Trp Pro Pro Ser Gly Leu Gly Gly Phe Val Val Phe Tyr Val Val Leu Met Leu Leu 132
1620 CTC TAT GTC ACC TAC ACT CGG CTC CTG GGT TCC TAG GCC CCA AAA TCG GCC CTC CCT CAC 1679
133 Leu Tyr Val Thr Tyr Thr Arg Leu Leu Gly Ser *** 144
1680 CCC TTT CCC TTC CTC TCT ATT TAT AAG GTC CCT GCT CCA CCC GAC CCC ACC TGC GGT GCC 1739
1740 TTC AGC CCC AAC CAA AGA CAC TAG TGC ACC CCC TTC ACA GAC ACT GAC CTC AGA GGC CCC 1799
1800 ACT CTG GTG CCC CCA GAC CCT GGG CCC CCA GCC TCT GGC CTC CCT CCA GTA GCC CCA CGA 1859
1860 GTC CCC ACC TCT CAG TGC TGA CGG TGC CTT CAT GTC CCC GCC GGC CCT GCC CCT GCC CTC 1919
1920 TGT ACC CCG TGA GGG GTG GCA GGA GCT GGA GTC TCC CCC TTC CTC CTG TGC CCT CCC CTT 1979
1980 CCC CCC CCA ACA GCT GCT ATG GGG GGG CTA AAT TAT CTC TAT TTT GTA GAG AGG ATC TAT 2039
2040 ATT TGT AGG GGT TCG GGG CCC AGG CCG GGT CCC TAT CTC TGT GTA TAA ACT GTA CAG ACC 2099
2100 GTG AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA A 2154
4.FP504
A:核苷酸序列(SEQ ID NO:10)长度:4952个碱基
1 GCTAAGCAGT AAACTAAAGG ATTATATATT ATTAGTCTCA GTGGTTTTCA GATTTATTTT
61 TAAAGGGGAA AACAGGGAAA ACCCATCGTA TTTGTAAAGC ACTTTAGGAT TTTGCCGTTT
121 GTTTCTGATT GTTTGAAGAT TAGGGCTTTT TGGTGCGTGG TCACCTTTCA CCTCTCCTTT
181 TAGGATTTAG TCCTTTCCAG TCTGCTCTTT TTGTGCGTGT CACAACCATA TTCTTGTGGT
241 TCTGGCTCAT ATTGTAGAAC TGCTGAACAT AAGGAGAGGT AGCCAGCTGT ATGGTCGGAT
301 TTAATATATA ATGTTATATG TTGGGATATC TTAGTGGTTT GTTTTCTGAG GTAAGTTTCT
361 TAGTGTTGTG TTTGAGACAT TGTGTTTGCG TTTATGGCGA CACTGTCATT CATGCACTTG
421 GCCATCTGAG CGTGGATACA GCGGGCACTC GGGTCTCTCT GCCAGATGGA TGAAAGCAGT
481 GTACATTCCA GTGTGGGAGA CAGACATGTG GACAGGTAAA TTACAAGGCA GTGTGATAAA
541 GAGTAGAGAG TTGGTTGAGA GAGATCTTAG ACACCATCCT CATTGTGTAG ATGAGAAAGC
601 AAAAGTCACC AAGGCAGCCT GGCAGCTGGG ACCCAAGAAG CCTAGGGTGC CAGTCCTTGG
661 GCAGTGCGGG GTTAGGCACA CCCAGGGCCC TCCTGGTTCC TGGCTGACTC TTGGACTCTT
721 TGTCTCTAAT TGGAGGCCAT GATGCCCAGC TGTAAGGTGG TCAGCTTCAT TTGAGACACT
781 ATATCCTTTA GCACAGCGGG GTAATTTCTT CCCTCCTGTT TCATTCATTT ACCAAATGGC
841 CTCCTAAATG ATCTAAAATC ACTTGGATCT TTTGTCTTTG TGGACCTAAC ACCTGGCTTT
901 TAAAGTTTAA CTTTCTGTCC CCTCTTCAGC TTGCTAAAAT TGAAAAGTGT TGCAGCCCAA
961 CCTCCACAAA TCTTGTCTCA GGAAATAAGA GACATTTGTT AACATTTGTT TTGTACCTCT
1021 CAGCAGCTTA GTTGACAAGG GCACCGTGTG GGATTTCCTG TTCTTGCTCA TTTGGAAAGA
1081 GAATGTTCTT TGTTCTTAGA CCCTCAGCTC TCATGTGAGA GCCATAGAAT GTTGCGAGGT
1141 GGAGTTCTGT GGATACAGAA GGAATGTTTT CAAGTTAGAC TTACTGCCAA TGTTAGGATT
1201 TGGGACTTTG CATGATTGGG AGGGAGAGGG AGTGCTGGAG AACAGGTTAA AAGTTGTCCC
1261 GCTGAGCTTG GAGCATCTCC TGCCAACCCG GAGTGCTTCC CAGGAACCCT GCCAGTGTCA
1321 CTTGGGGTTA TGTTTTCTGA TTTGGAAACA TTAAGCCGTA TGCAGGTCTC TTCAGAACTG
1381 GTTCTTCAGC CGGATTGCCC TGGAAAGCAG AGATTGCAGC TCTTCTAAAA CTGCCTCTCA
1441 CAGAAGTTCC AAGGCCAGGC TAAATATTGA ATGCAGTACT CAGCAGCTGG GACACCTGAT
1501 GCTTTGGTGG CCATCCCTTT CTTCCATCCA AAAGGGCCCC CACTGGAAGG CATCTGTTGT
1561 TTTAAAAATA TTTTAGGACT CATTTTTACT TCCCCCACTC CCTCAAGATC ACATACACTC
1621 CCCAGTGGGG GTTACAGCCT CTTAGGAGGA ATCGCTGCTT CATGACTTCC TCAGGCATTT
1681 ACATTTTTCT CTTCTGTTGA CTTAAATCAT GAACTAAAAT TTATCCCTAG AGGAAAAAAG
1741 AATGCTTCCT CCATTCTGGG CTCTTCTCAC TGTACCCAGA CTATGTCTTC AGGACTCTCA
1801 TCTCTTGTCA GTTCTGTTGT GCTAGAAAGA CTGGTTTGAA AAAATTCAGC TCGTGTAAAC
1861 CTGTGCCCTC CACCCTGTGG GGAACCCATG TGGGGAGCCT TTGAAAATAT CACTTATCAG
1921 CTGGGCGCAG TGGCTCATGC CTGTAATCCC AGCACGTTGG GAGGATGAGG TGGGCGGATC
1981 ATGAGGTCAG GAGTTCAAGA CCAGCCTGGC CAACATAGTG AAACCCCGTC TCTACTAAAA
2041 ATACAAAAAA AAAAAAAAAA AAAACCGAGA CTAGTTCTCT CTCTGTCTCC TGCCTGAACC
2101 CTCCTCCTCT TTTTGTTCTG ATCTTTGAGC TCCCTAGAGC CCATAATTCT TTAGAGCAGG
2161 TATGTCCCGA GTCTGAAACA TGCCCTTATT TGTCCCAAGC TCTGGACATT TCTCACCCCA
2221 AGGCGGATCA ATCATGATTA AATCACTCCA ATTAAACTTT AGGCTCCAGT CAGACCTTCA
2281 GCCAAATGGA AAAAAAAACT AGGGGATAAG GGAGGTAGTT GGAGCAAGAA AATGTTATTA
2341 GTTGAAACCT TACGGGACCT TCCTCCCTTA GTGAGTCTGT TGGCTAAAGG TTCTCTGGCT
2401 TCGTGAATTA GAATTGGATA CTGTTTCCAA GTTAGCAAAA CCAACTCTAC CCCAGCACCC
2461 CACGAGGAAG AATGTGGAAG GATCTCCCAT TGGCCCGTTG GGGCAAAAGC CTGAGGCAAT
2521 CTTTCATCCC CTTTTGCCAA GGCGAGACTT TCCCAGTGAC GGTGATGTAG TTGGCCACTC
2581 TGACTATGGG TGGACTCGGG TGTAGACCTC TGAAGCTGAG ATCACACGAA AACCTGGCCT
2641 CCCCGCCATG TAGCTGTTGG AGAGTAGAAA AATAGAGCAC GCCTGATGTT TCTAAATGAG
2701 AAGACTTTCA ATAGTAATGA AGAATCCATG GCACTCTCCT CACCCTCAAA CACATGGCAG
2761 TCATTCACAT ACAGGCCCCA AAGCCACTGT TAGTGCTGCA GTAGCTCCTG TGGACATTGG
2821 AAAGCCCGGA GAGGGCGTGG AAGAAATCAG CTGGCCCCCG GCAGGTTCTC TGGGGTTTTG
2881 TGCCCAAGGC TCCTGGAGCC CTAAAAACTT TCAAAAGTTA ACTCCCCACG TCCCCATCCT
2941 GCTTGGGTTT CTGGACTTTT CTGAGGCACC GGCAGAGGGG TCTCGTTGCT CCCTTGAGTG
3001 TAGGGGCAGC CCTTTAACCT GGCTCCTTGA GTCCCTGCTT TTTCTGCTTC TGTTGCCTTC
3061 TTCCTCGTCT TCCTCTCTCT CAATATCTCC CTCTCTTTGT CCCTCCCCAG TTCCTGACCT
3121 GGCCATCCCG GGGTGCCCTT GACCAGCCCC GTGCCTCCTC AGGGTGTCCC AGCACCAGCC
3181 TGGCACAGAG TGGGGCTCAG TTAGAGTATG TGGGATGTTG GTTTCGCCAG GTGAGTGAAT
3241 GAAAGGACTC GACCACCACA GCTGAGCCAC TAGCTGGGCC ATGCGAAGAG TTCTAGGTGC
3301 AAAGGCTGGA GGGTGGAATT CATTTTTGAG AGGTGTGTGA GCAGCTTCCG ACCCCTGCCC
3361 CATTTGAACG GGGGCCTTGC TGGTCGCGTC CCTGCATTCA CCTGCGCGGC CATCCCGTCA
3421 TCCAACAGTT GATCCTAACT GAGCACGCCC ACGGCCCTGG TCTGGCCTGG GCACCGGCCA
3481 CCGTAGCCCA TCCCTTGATG GCCTCTGTGT CCCCAGGAGG GCGGGCCGGG GGGTTGCCCA
3541 GGGGCTGGAG CAGTGGACTG TGGCTCCATA GAGGTAGGCT GGAGGGTGTG AGGGCAGATT
3601 CAAGCTATCC CCAGGGCTCT GCTCTGGTCG GAGCCAGCCC CTTCTCCCTC TCTGCCTTCC
3661 CCGCCCCATT CCTGATGCTG AACTGTTCTG GACCCCTGGC CCTGAGTCTC TCAGGACCAA
3721 AGTGGGCACG GGAACAGCTG TAGTGTGTGC CCCCCCGGGC TTTGGCACAG GTCTCCCTCT
3781 CGAGGTGTGG TTGTGACTGC GACCCTTCCC TTGCCGTGAT GCCTTCCTCC CCCGGGGCTT
3841 GGTCCAGCTC CTTCACTCTC TAGCAGCTGC TGGGGCCCAC CTCCCATGCC GAGGACCAGC
3901 AGGGGAAACC TCCAGGGAGC ATCTGCAGGC TCTGCTTCTG CCCGGCTGCT GGCTTGCTCT
3961 CCCTGGTGGC TCTCCAGCGG CCAGCTTCCT CACCCACCCG GCACTCTGCT TTGCTCTGTC
4021 TCCTGAGGTG GGCCTGACCA ACCTCCCCTT CTCTGCCTCA GTCCCTGGGC TCCAGGGCTC
4081 AGCTCCACAG CCCTCTGCCT AGCAGGCTGG TTCTCCCTGC CAAGCCCATA CCTGTGGTCA
4141 CCTGGCCCTC CTGTGGTCTG AGTACCACTC CCCTGCCCCA GGAGCCACTC CCACTCCAGC
4201 TGCCTGTTTC CAGCAGGTTC CCAGTGTCCC CGACAAGCCC CTGCTGGTGT CTCCATCTCC
4261 TGCCAAGCAT CCTCCAGTGC CTCCTCCTGT GGGCCTGGCC TCAGGGCTAT GGACAGACTC
4321 CTGTCCCATC CCAGAGACCC CTCGTGATCG TGCCCTGGCA CGTGGGCCGT GGCCCGGCTG
4381 GGTCGGCTGA AGAACTGCGG ATGGAAGCTG CGGAAGAGGC CCTGATGGGG CCCACCATCC
4441 CGGACCCAAG TCTTCTTCCT GGCGGGCCTC TCGTCTCCTT CCTGGTTTGG GCGGAAGCCA
4501 TCACCTGGAT GCCTACGTGG GAAGGGACCT CGAATGTGGG ACCCCAGCCC CTCTCCAGCT
4561 CGAAATCCCT CCACAGCCAC GGGGACACCC TGCACCTATT CCCACGGGAC AGGCTGGACC
4621 CAAAGACTCT GGACCCGGGG CCTCCCCTTG AGTAGAGACC CGCCCTCTGA CTGATGGACG
4681 CCGCTGACCT GGGGTCAGAC CCGTGGGCTG GACCCCTGCC CACCCCGCAG GAACCCTGAG
4741 GCCTAGGGGA GCTGTTGAGC CTTCAGTGTC TGCATGTGGG AAGTGGGCTC CTTCACCTAC
4801 CTCACAGGGC TGTTGTGAGG GGCGCTGTGA TGCGGTTCCA AAGCACAGGG CTTGGCGCAC
4861 CCCCCTGTGC TCTCAATAAA TGTGTTTCCT GTCTTAAAAA AAAAAAAAAA AAAAAAAAAA
4921 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA
B:核苷酸序列(SEQ ID NO:12)长度:148个氨基酸
1 MRRLSIVMKN PWHSPHPQTH GSHSHTGPKA TVSAAVAPVD IGKPGEGVEE ISWPPAGSLG
61 FCAQGSWSPK NFQKLTPHVP ILLGFLDFSE APAEGSRCSL ECRGSPLTWL LESLLFLLLL
121 PSSSSSSLSI SPSLCPSPVP DLAIPGCP
C.核苷酸及氨基酸组合序列(SEQ ID NO:11)克隆号和蛋白名称:FP504
起始编码子:2696 ATG终止编码子:3140 TGA蛋白质分子量:15611.03
1 G CTA AGC AGT AAA CTA AAG GAT TAT ATA TTA TTA GTC TCA GTG GTT TTC AGA TTT ATT 58
59 TTT AAA GGG GAA AAC AGG GAA AAC CCA TCG TAT TTG TAA AGC ACT TTA GGA TTT TGC CGT 118
119 TTG TTT CTG ATT GTT TGA AGA TTA GGG CTT TTT GGT GCG TGG TCA CCT TTC ACC TCT CCT 178
179 TTT AGG ATT TAG TCC TTT CCA GTC TGC TCT TTT TGT GCG TGT CAC AAC CAT ATT CTT GTG 238
239 GTT CTG GCT CAT ATT GTA GAA CTG CTG AAC ATA AGG AGA GGT AGC CAG CTG TAT GGT CGG 298
299 ATT TAA TAT ATA ATG TTA TAT GTT GGG ATA TCT TAG TGG TTT GTT TTC TGA GGT AAG TTT 358
359 CTT AGT GTT GTG TTT GAG ACA TTG TGT TTG CGT TTA TGG CGA CAC TGT CAT TCA TGC ACT 418
419 TGG CCA TCT GAG CGT GGA TAC AGC GGG CAC TCG GGT CTC TCT GCC AGA TGG ATG AAA GCA 478
479 GTG TAC ATT CCA GTG TGG GAG ACA GAC ATG TGG ACA GGT AAA TTA CAA GGC AGT GTG ATA 538
539 AAG AGT AGA GAG TTG GTT GAG AGA GAT CTT AGA CAC CAT CCT CAT TGT GTA GAT GAG AAA 598
599 GCA AAA GTC ACC AAG GCA GCC TGG CAG CTG GGA CCC AAG AAG CCT AGG GTG CCA GTC CTT 658
659 GGG CAG TGC GGG GTT AGG CAC ACC CAG GGC CCT CCT GGT TCC TGG CTG ACT CTT GGA CTC 718
719 TTT GTC TCT AAT TGG AGG CCA TGA TGC CCA GCT GTA AGG TGG TCA GCT TCA TTT GAG ACA 778
779 CTA TAT CCT TTA GCA CAG CGG GGT AAT TTC TTC CCT CCT GTT TCA TTC ATT TAC CAA ATG 838
839 GCC TCC TAA ATG ATC TAA AAT CAC TTG GAT CTT TTG TCT TTG TGG ACC TAA CAC CTG GCT 898
899 TTT AAA GTT TAA CTT TCT GTC CCC TCT TCA GCT TGC TAA AAT TGA AAA GTG TTG CAG CCC 958
959 AAC CTC CAC AAA TCT TGT CTC AGG AAA TAA GAG ACA TTT GTT AAC ATT TGT TTT GTA CCT 1018
1019 CTC AGC AGC TTA GTT GAC AAG GGC ACC GTG TGG GAT TTC CTG TTC TTG CTC ATT TGG AAA 1078
1079 GAG AAT GTT CTT TGT TCT TAG ACC CTC AGC TCT CAT GTG AGA GCC ATA GAA TGT TGC GAG 1138
1139 GTG GAG TTC TGT GGA TAC AGA AGG AAT GTT TTC AAG TTA GAC TTA CTG CCA ATG TTA GGA 1198
1199 TTT GGG ACT TTG CAT GAT TGG GAG GGA GAG GGA GTG CTG GAG AAC AGG TTA AAA GTT GTC 1258
1259 CCG CTG AGC TTG GAG CAT CTC CTG CCA ACC CGG AGT GCT TCC CAG GAA CCC TGC CAG TGT 1318
1319 CAC TTG GGG TTA TGT TTT CTG ATT TGG AAA CAT TAA GCC GTA TGC AGG TCT CTT CAG AAC 1378
1379 TGG TTC TTC AGC CGG ATT GCC CTG GAA AGC AGA GAT TGC AGC TCT TCT AAA ACT GCC TCT 1438
1439 CAC AGA AGT TCC AAG GCC AGG CTA AAT ATT GAA TGC AGT ACT CAG CAG CTG GGA CAC CTG 1498
1499 ATG CTT TGG TGG CCA TCC CTT TCT TCC ATC CAA AAG GGC CCC CAC TGG AAG GCA TCT GTT 1558
1559 GTT TTA AAA ATA TTT TAG GAC TCA TTT TTA CTT CCC CCA CTC CCT CAA GAT CAC ATA CAC 1618
1619 TCC CCA GTG GGG GTT ACA GCC TCT TAG GAG GAA TCG CTG CTT CAT GAC TTC CTC AGG CAT 1678
1679 TTA CAT TTT TCT CTT CTG TTG ACT TAA ATC ATG AAC TAA AAT TTA TCC CTA GAG GAA AAA 1738
1739 AGA ATG CTT CCT CCA TTC TGG GCT CTT CTC ACT GTA CCC AGA CTA TGT CTT CAG GAC TCT 1798
1799 CAT CTC TTG TCA GTT CTG TTG TGC TAG AAA GAC TGG TTT GAA AAA ATT CAG CTC GTG TAA 1858
1859 ACC TGT GCC CTC CAC CCT GTG GGG AAC CCA TGT GGG GAG CCT TTG AAA ATA TCA CTT ATC 1918
1919 AGC TGG GCG CAG TGG CTC ATG CCT GTA ATC CCA GCA CGT TGG GAG GAT GAG GTG GGC GGA 1978
1979 TCA TGA GGT CAG GAG TTC AAG ACC AGC CTG GCC AAC ATA GTG AAA CCC CGT CTC TAC TAA 2038
2039 AAA TAC AAA AAA AAA AAA AAA AAA AAC CGA GAC TAG TTC TCT CTC TGT CTC CTG CCT GAA 2098
2099 CCC TCC TCC TCT TTT TGT TCT GAT CTT TGA GCT CCC TAG AGC CCA TAA TTC TTT AGA GCA 2158
2159 GGT ATG TCC CGA GTC TGA AAC ATG CCC TTA TTT GTC CCA AGC TCT GGA CAT TTC TCA CCC 2218
2219 CAA GGC GGA TCA ATC ATG ATT AAA TCA CTC CAA TTA AAC TTT AGG CTC CAG TCA GAC CTT 2278
2279 CAG CCA AAT GGA AAA AAA AAC TAG GGG ATA AGG GAG GTA GTT GGA GCA AGA AAA TGT TAT 2338
2339 TAG AAG AAA CCT TAC GGG ACC TTC CTC CCT TAG TGA GTC TGT TGG CTA AAG GTT CTC TGG 2398
2399 CTT CGT GAA TTA GAA TTG GAT ACT GTT TCC AAG TTA GCA AAA CCA ACT CTA CCC CAG CAC 2458
2459 CCC ACG AGG AAG AAT GTG GAA GGA TCT CCC ATT GGC CGG TTG GGG CAA AAG CCT GAG GCA 2518
2519 ATC TTT CAT CCC CTT TTG CCA AGG CGA GAC TTT CCC AGT GAC GGT GAT GTA GTT GGC CAC 2578
2579 TCT GAC TAT GGG TGG ACT CGG GTG TAG ACC TCT GAA GCT GAG ATC ACA CGA AAA CCT GGC 2638
2639 CTC CCC GCC ATG TAG CTG TTG GAG AGT AGA AAA ATA GAG CAC GCC TGA TGT TTC TAA ATG 2698
1 Met 1
2699 AGA AGA CTT TCA ATA GTA ATG AAG AAT CCA TGG CAC TCT CCT CAC CCT CAA ACA CAT GGC 2758
2 Arg Arg Leu Ser Ile Val Met Lys Asn Pro Trp His Ser Pro His Pro Gln Thr His Gly 21
2759 AGT CAT TCA CAT ACA GGC CCC AAA GCC ACT GTT AGT GCT GCA GTA GCT CCT GTG GAC ATT 2818
22 Ser His Ser His Thr Gly Pro Lys Ala Thr Val Ser Ala Ala Val Ala Pro Val Asp Ile 41
2819 GGA AAG CCC GGA GAG GGC GTG GAA GAA ATC AGC TGG CCC CCG GCA GGT TCT CTG GGG TTT 2878
42 Gly Lys Pro Gly Glu Gly Val Glu Glu Ile Ser Trp Pro Pro Ala Gly Ser Leu Gly Phe 61
2879 TGT GCC CAA GGC TCC TGG AGC CCT AAA AAC TTT CAA AAG TTA ACT CCC CAC GTC CCC ATC 2938
62 Cys Ala Gln Gly Ser Trp Ser Pro Lys Asn Phe Gln Lys Leu Thr Pro His Val Pro Ile 81
2939 CTG CTT GGG TTT CTG GAC TTT TCT GAG GCA CCG GCA GAG GGG TCT CGT TGC TCC CTT GAG 2998
82 Leu Leu Gly Phe Leu Asp Phe Ser Glu Ala Pro Ala Glu Gly Ser Arg Cys Ser Leu Glu 101
2999 TGT AGG GGC AGC CCT TTA ACC TGG CTC CTT GAG TCC CTG CTT TTT CTG CTT CTG TTG CCT 3058
102 Cys Arg Gly Ser Pro Leu Thr Trp Leu Leu Glu Ser Leu Leu Phe Leu Leu Leu Leu Pro 121
3059 TCT TCC TCG TCT TCC TCT CTC TCA ATA TCT CCC TCT CTT TGT CCC TCC CCA GTT CCT GAC 3118
122 Ser Ser Ser Ser Ser Ser Leu Ser Ile Ser Pro Ser Leu Cys Pro Ser Pro Val Pro Asp 141
3119 CTG GCC ATC CCG GGG TGC CCT TGA CCA GCC CCG TGC CTC CTC AGG GTG TCC CAG CAC CAG 3178
142 Leu Ala Ile Pro Gly Cys Pro *** 149
3179 CCT GGC ACA GAG TGG GGC TCA GTT AGA GTA TGT GGG ATG TTG GTT TCG CCA GGT GAG TGA 3238
3239 ATG AAA GGA CTC GAC CAC CAC AGC TGA GCC ACT AGC TGG GCC ATG CGA AGA GTT CTA GGT 3298
3299 GCA AAG GCT GGA GGG TGG AAT TCA TTT TTG AGA GGT GTG TGA GCA GCT TCC GAC CCC TGC 3358
3359 CCC ATT TGA ACG GGG GCC TTG CTG GTC GCG TCC CTG CAT TCA CCT GCG CGG CCA TCC CGT 3418
3419 CAT CCA ACA GTT GAT CCT AAC TGA GCA CGC CCA CGG CCC TGG TCT GGC CTG GGC ACC GGC 3478
3479 CAC CGT AGC CCA TCC CTT GAT GGC CTC TGT GTC CCC AGG AGG GCG GGC CGG GGG GTT GCC 3538
3539 CAG GGG CTG GAG CAG TGG ACT GTG GCT CCA TAG AGG TAG GCT GGA GGG TGT GAG GGC AGA 3598
3599 TTC AAG CTA TCC CCA GGG CTC TGC TCT GGT CGG AGC CAG CCC CTT CTC CCT CTC TGC CTT 3658
3659 CCC CGC CCC ATT CCT GAT GCT GAA CTG TTC TGG ACC CCT GGC CCT GAG TCT CTC AGG ACC 3718
3719 AAA GTG GGC ACG GGA ACA GCT GTA GTG TGT GCC CCC CCG GGC TTT GGC ACA GGT CTC CCT 3778
3779 CTC GAG GTG TGG TTG TGA CTG CGA CCC TTC CCT TGC CGT GAT GCC TTC CTC CCC CGG GGC 3838
3839 TTG GTC CAG CTC CTT CAC TCT CTA GCA GCT GCT GGG GCC CAC CTC CCA TGC CGA GGA CCA 3898
3899 GCA GGG GAA ACC TCC AGG GAG CAT CTG CAG GCT CTG CTT CTG CCC GGC TGC TGG CTT GCT 3958
3959 CTC CCT GGT GGC TCT CCA GCG GCC AGC TTC CTC ACC CAC CCG GCA CTC TGC TTT GCT CTG 4018
4019 TCT CCT GAG GTG GGC CTG ACC AAC CTC CCC TTC TCT GCC TCA GTC CCT GGG CTC CAG GGC 4078
4079 TCA GCT CCA CAG CCC TCT GCC TAG CAG GCT GGT TCT CCC TGC CAA GCC CAT ACC TGT GGT 4138
4139 CAC CTG GCC CTC CTG TGG TCT GAG TAC CAC TCC CCT GCC CCA GGA GCC ACT CCC ACT CCA 4198
4199 GCT GCC TGT TTC CAG CAG GTT CCC AGT GTC CCC GAC AAG CCC CTG CTG GTG TCT CCA TCT 4258
4259 CCT GCC AAG CAT CCT CCA GTG CCT CCT CCT GTG GGC CTG GCC TCA GGG CTA TGG ACA GAC 4318
4319 TCC TGT CCC ATC CCA GAG ACC CCT CGT GAT CGT GCC CTG GCA CGT GGG CCG TGG CCC GGC 4378
4379 TGG GTC GGC TGA AGA ACT GCG GAT GGA AGC TGC GGA AGA GGC CCT GAT GGG GCC CAC CAT 4438
4439 CCC GGA CCC AAG TCT TCT TCC TGG CGG GCC TCT CGT CTC CTT CCT GGT TTG GGC GGA AGC 4498
4499 CAT CAC CTG GAT GCC TAC GTG GGA AGG GAC CTC GAA TGT GGG ACC CCA GCC CCT CTC CAG 4558
4559 CTC GAA ATC CCT CCA CAG CCA CGG GGA CAC CCT GCA CCT ATT CCC ACG GGA CAG GCT GGA 4618
4619 CCC AAA GAC TCT GGA CCC GGG GCC TCC CCT TGA GTA GAG ACC CGC CCT CTG ACT GAT GGA 4678
4679 CGC CGC TGA CCT GGG GTC AGA CCC GTG GGC TGG ACC CCT GCC CAC CCC GCA GGA ACC CTG 4738
4739 AGG CCT AGG GGA GCT GTT GAG CCT TCA GTG TCT GCA TGT GGG AAG TGG GCT CCT TCA CCT 4798
4799 ACC TCA CAG GGC TGT TGT GAG GGG CGC TGT GAT GCG GTT CCA AAG CAC AGG GCT TGG CGC 4858
4859 ACC CCC CTG TGC TCT CAA TAA ATG TGT TTC CTG TCT TAA AAA AAA AAA AAA AAA AAA AAA 4918
4919 AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA A 4952
5.FP972
A:核苷酸序列(SEQ ID NO:13)长度:3112个碱基
1 GCGACGGCGA GAGCTAGAGC GGGCGCAGCG TTAGGGTGGC CGTGCAAGGG GAGCCGTGGC
61 CCGGGCCCGG GGCGTGCGAG ACGGCGGAAG CAGCCCAGGG CCTTGCTGCC GCCATGACTG
121 AGGAATCAGA GGAGACAGTC CTGTACATTG AGCACCGCTA TGTCTGCTCT GAGTGCAACC
181 AGCTGTATGG ATCACTGGAA GAGGTGCTTA TGCACCAAAA CTCCCACGTG CCCCAGCAGC
241 ACTTTGAGCT GGTGGGCGTG GCTGATCCCG GAGTCACTGT GGCCACAGAC ACAGCTTCAG
301 GCACGGGCCT CTATCAGACC CTTGTGCAGG AGAGCCAGTA CCAGTGCCTG GAGTGTGGTC
361 AACTGCTGAT GTCACCCAGC CAGCTCCTGG AGCACCAGGA GCTGCACCTG AAGATGATGG
421 CACCCCAGGA GGCAGTGCCA GCTGAGCCAT CACCTAAGGC ACCACCCCTG AGCTCCAGCA
481 CCATCCACTA CGAGTGTGTG GATTGCAAGG CTCTCTTTGC CAGCCAGGAG CTCTGGCTGA
541 ACCACCGGCA GACGCACCTC CGGGCCACAC CCACCAAGGC TCCTGCCCCT GTTGTCCTGG
601 GGTCCCCAGT TGTTCTAGGG CCTCCTGTGG GCCAGGCCCG AGTGGCTGTG GAGCACTCAT
661 ACCGAAAGGC AGAAGAGGGT GGGGAAGGGG CGACTGTCCC ATCTGCCGCT GCCACCACCA
721 CTGAGGTAGT GACTGAGGTG GAGCTGCTCC TCTACAAGTG CTCTGAGTGC TCCCAGCTCT
781 TCCAGCTGCC GGCGGATTTC CTGGAGCACC AGGCCACTCA CTTCCCTGCT CCTGTACCCG
841 AGTCTCAGGA GCCTGCCTTA CAGCAGGAGG TGCAGGCCTC GTCACCTGCA GAGGTGCCTG
901 TGTCTCAGCC TGACCCCTTG CCAGCTTCTG ACCACAGTTA CGAGCTGCGC AATGGTGAAG
961 CCATTGGGCG GGATCGCCGG GGGCGCAGGG CCCGGAGGAA CAACAGTGGA GAAGCAGGCG
1021 GGGCAGCCAC ACAGGAGCTC TTCTGCTCAG CCTGTGACCA GCTCTTTCTC TCACCCCACC
1081 AGCTACAGCA GCACCTGCGG AGTCACCGGG AGGGCGTCTT TAAGTGCCCC CTGTGCAGTC
1141 GTGTCTTCCC TAGCCCTTCC AGTCTGGACC AGCACCTTGG AGACCATAGC AGCGAGTCAC
1201 ACTTCCTGTG TGTAGACTGT GGCCTGGCCT TCGGCACAGA GGCCCTCCTC CTGGCCCACC
1261 GGCGAGCCCA CACCCCGAAT CCTCTGCATT CATGTCCATG TGGGAAGACC TTTGTCAACC
1321 TTACCAAGTT CCTTTATCAC CGGCGTACTC ATGGGGTAGG GGGGGTGTCC CTCTGCCCAC
1381 AACACCAGTC CCACCAGAGG AACCTGTCAT TGGTTTCCCT GAGCCAGCCC CAGCAGAGAC
1441 TGGAGAGCCA GAGGCCCCTG AGCCCCCTGT GTCTGAGGAG ACCTCAGCAG GGCCCGCTGC
1501 CCCAGGCACC TACCGCTGCC TCCTGTGCAG CCGTGAATTT GGAAAGGCCT TGCAGCTGAC
1561 CCGGCACCAA CGTTTTGTGC ATCGGCTGGA GCGGCGCCAT AAATGCAGCA TTTGTGGCAA
1621 GATGTTCAAG AAGAAGTCTC ACGTGCGTAA CCACCTGCGC ACACACACAG GGGAGCGGCC
1681 CTTCCCCTGC CCTGACTGCT CCAAGCCCTT CAACTCACCT GCCAACCTGG CCCGCCACCG
1741 GCTCACACAC ACAGGAGAGC GGCCCTACCG GTGTGGGGAC TGTGGCAAGG CTTTCACGCA
1801 AAGCTCCACA CTGAGGCAGC ACCGCTTGGT GCATGCCCAG CACTTTCCCT ACCGCTGCCA
1861 GGAATGTGGG GTGCGTTTTC ACCGTCCTTA CCGGCTGCTC ATGCACCGCT ACCATCACAC
1921 AGGTGAATAC CCCTACAAGT GTCGCGAGTG CCCCCGCTCC TTCTTGCTGC GTCGGCTGCT
1981 GGAGGTGCAC CAGCTCGTGG TCCATGCCGG GCGCCAGCCC CACCGCTGCC CATCCTGTGG
2041 GGCTGCCTTC CCCTCCTCAC TGCGGCTCCG GGAGCACCGC TGTGCAGCCG CTGCTGCCCA
2101 GGCCCCACGG CGCTTTGAGT GTGGCACCTG TGGCAAGAAA GTGGGCTCAG CTGCTCGACT
2161 GCAGGCACAC GAGGCGGCCC ATGCAGCTGC TGGGCCTGGA GAGGTCCTGG CTAAGGAGCC
2221 CCCTGCCCCT CGAGCCCCAC GGGCCACTCG TGCACCAGTT GCCTCTCCAG CAGCCCTTGG
2281 AAGCACTGCT ACAGCATCCC CTGCGGCCCC TGCCCGCCGC CGGGGTCTAG AGTGCAGCGA
2341 GTGCAAGAAG CTGTTCAGCA CAGAGACGTC ACTGCAGGTG CACCGGCGCA TCCACACAGG
2401 TGAGCGGCCA TACCCATGTC CAGACTGTGG CAAAGCGTTC CGTCAGAGTA CCCACCTGAA
2461 AGACACCGGC GCCTGCACAC AGGTGAGCGG CCCTTTGCCT GTGAAGTGTG TGGCAAGGCC
2521 TTTGCCATCT CCATGCGCCT GGCAGAACAT CGCCGCATCC ACACAGGCGA ACGACCCTAC
2581 TCCTGCCCTG ACTGTGGCAA GAGCTACCGC TCCTTCTCCA ACCTCTGGAA GCACCGCAAG
2641 ACCCATCAGC AGCAGCATCA GGCAGCTGTG CGGCAGCAGC TGGCAGAGGC GGAGGCTGCC
2701 GTTGGCCTGG CCGTCATGGA GACTGCTGTG GAGGCGCTAC CCCTGGTGGA AGCCATTGAG
2761 ATCTACCCTC TGGCCGAGGC TGAGGGGGTC CAGATCAGTG GCTGACTCTG CCCGACTTCC
2821 TCTTTGGCAC CTCCATTCCC TGTTGCTGAA GGCCCTCCAG CATCCCCTTA AGCATCTGTA
2881 CATACTGTGT CCCTTCCTCT TCCCATCCCC ACCACCTTGT AAGTTCTAAA TTGGATTTAT
2941 TCTCTCGTGA GGGGGGTGCT CTGGGGTCCT TGACACACAT AAAGGTGCCC CCCCACCTTC
3001 CACCTCTTAG CACTGGTGAC CCCAAAAATG AAACCATCAA TAAAGACTGA GTTGCCAGCA
3061 GTGTGTAGAG TGGAAAAAAA AAAAAAAAAA AAAAAAAAA AAAAAAAAAAA AA
B:核苷酸序列(SEQ ID NO:15)长度:545个氨基酸
1 MSMWEDLCQP YQVPLSPAYS WGRGGVPLPT TPVPPEEPVI GFPEPAPAET GEPEAPEPPV
61 SEETSAGPAA PGTYRCLLCS REFGKALQLT RHQRFVHRLE RRHKCSICGK MFKKKSHVRN
121 HLRTHTGERP FPCPDCSKPF NSPANLARHR LTHTGERPYR CGDCGKAFTQ SSTLRQHRLV
181 HAQHFPYRCQ ECGVRFHRPY RLLMHRYHHT GEYPYKCREC PRSFLLRRLL EVHQLVVHAG
241 RQPHRCPSCG AAFPSSLRLR EHRCAAAAAQ APRRFECGTC GKKVGSAARL QAHEAAHAAA
301 GPGEVLAKEP PAPRAPRATR APVASPAALG STATASPAAP ARRRGLECSE CKKLFSTETS
361 LQVHRRIHTG ERPYPCPDCG KAFRQSTHLK DTGACTQVSG PLPVKCVARP LPSPCAWQNI
421 AASTQANDPT PALTVARATA PSPTSGSTAR PISSSIRQLC GSSWQRRRLP LAWPSWRLLW
481 RRYPWWKPLR STLWPRLRGS RSVADSARLP LWHLHSLLLK ALQHPLKHLY ILCPFLFPSP
541 PPCKF
C.核苷酸及氨基酸组合序列(SEQ ID NO:14)克隆号和蛋白名称:FP972
起始编码子:1292 ATG终止编码子:2927 TAA蛋白质分子量:60743.81
1 G CGA CGG CGA GAG CTA GAG CGG GCG CAG CGT TAG GGT GGC CGT GCA AGG GGA GCC GTG 58
59 GCC CGG GCC CGG GGC GTG CGA GAC GGC GGA AGC AGC CCA GGG CCT TGC TGC CGC CAT GAC 118
119 TGA GGA ATC AGA GGA GAC AGT CCT GTA CAT TGA GCA CCG CTA TGT CTG CTC TGA GTG CAA 178
179 CCA GCT GTA TGG ATC ACT GGA AGA GGT GCT TAT GCA CCA AAA CTC CCA CGT GCC CCA GCA 238
239 GCA CTT TGA GCT GGT GGG CGT GGC TGA TCC CGG AGT CAC TGT GGC CAC AGA CAC AGC TTC 298
299 AGG CAC GGG CCT CTA TCA GAC CCT TGT GCA GGA GAG CCA GTA CCA GTG CCT GGA GTG TGG 358
359 TCA ACT GCT GAT GTC ACC CAG CCA GCT CCT GGA GCA CCA GGA GCT GCA CCT GAA GAT GAT 418
419 GGC ACC CCA GGA GGC AGT GCC AGC TGA GCC ATC ACC TAA GGC ACC ACC CCT GAG CTC CAG 478
479 CAC CAT CCA CTA CGA GTG TGT GGA TTG CAA GGC TCT CTT TGC CAG CCA GGA GCT CTG GCT 538
539 GAA CCA CCG GCA GAC GCA CCT CCG GGC CAC ACC CAC CAA GGC TCC TGC CCC TGT TGT CCT 598
599 GGG GTC CCC AGT TGT TCT AGG GCC TCC TGT GGG CCA GGC CCG AGT GGC TGT GGA GCA CTC 658
659 ATA CCG AAA GGC AGA AGA GGG TGG GGA AGG GGC GAC TGT CCC ATC TGC CGC TGC CAC CAC 718
719 CAC TGA GGT AGT GAC TGA GGT GGA GCT GCT CCT CTA CAA GTG CTC TGA GTG CTC CCA GCT 778
779 CTT CCA GCT GCC GGC GGA TTT CCT GGA GCA CCA GGC CAC TCA CTT CCC TGC TCC TGT ACC 838
839 CGA GTC TCA GGA GCC TGC CTT ACA GCA GGA GGT GCA GGC CTC GTC ACC TGC AGA GGT GCC 898
899 TGT GTC TCA GCC TGA CCC CTT GCC AGC TTC TGA CCA CAG TTA CGA GCT GCG CAA TGG TGA 958
959 AGC CAT TGG GCG GGA TCG CCG GGG GCG CAG GGC CCG GAG GAA CAA CAG TGG AGA AGC AGG 1018
1019 CGG GGC AGC CAC ACA GGA GCT CTT CTG CTC AGC CTG TGA CCA GCT CTT TCT CTC ACC CCA 1078
1079 CCA GCT ACA GCA GCA CCT GCG GAG TCA CCG GGA GGG CGT CTT TAA GTG CCC CCT GTG CAG 1138
1139 TCG TGT CTT CCC TAG CCC TTC CAG TCT GGA CCA GCA CCT TGG AGA CCA TAG CAG CGA GTC 1198
1199 ACA CTT CCT GTG TGT AGA CTG TGG CCT GGC CTT CGG CAC AGA GGC CCT CCT CCT GGC CCA 1258
1259 CCG GCG AGC CCA CAC CCC GAA TCC TCT GCA TTC ATG TCC ATG TGG GAA GAC CTT TGT CAA 1318
1 Met Ser Met Trp Glu Asp Leu Cys Gln 9
1319 CCT TAC CAA GTT CCT TTA TCA CCG GCG TAC TCA TGG GGT AGG GGG GGT GTC CCT CTG CCC 1378
10 Pro Tyr Gln Val Pro Leu Ser Pro Ala Tyr Ser Trp Gly Arg Gly Gly Val Pro Leu Pro 29
1379 ACA ACA CCA GTC CCA CCA GAG GAA CCT GTC ATT GGT TTC CCT GAG CCA GCC CCA GCA GAG 1438
30 Thr Thr Pro Val Pro Pro Glu Glu Pro Val Ile Gly Phe Pro Glu Pro Ala Pro Ala Glu 49
1439 ACT GGA GAG CCA GAG GCC CCT GAG CCC CCT GTG TCT GAG GAG ACC TCA GCA GGG CCC GCT 1498
50 Thr Gly Glu Pro Glu Ala Pro Glu Pro Pro Val Ser Glu Glu Thr Ser Ala Gly Pro Ala 69
1499 GCC CCA GGC ACC TAC CGC TGC CTC CTG TGC AGC CGT GAA TTT GGA AAG GCC TTG CAG CTG 1558
70 Ala Pro Gly Thr Tyr Arg Cys Leu Leu Cys Ser Arg Glu Phe Gly Lys Ala Leu Gln Leu 89
1559 ACC CGG CAC CAA CGT TTT GTG CAT CGG CTG GAG CGG CGC CAT AAA TGC AGC ATT TGT GGC 1618
90 Thr Arg His Gln Arg Phe Val His Arg Leu Glu Arg Arg His Lys Cys Ser Ile Cys Gly 109
1619 AAG ATG TTC AAG AAG AAG TCT CAC GTG CGT AAC CAC CTG CGC ACA CAC ACA GGG GAG CGG 1678
110 Lys Met Phe Lys Lys Lys Ser His Val Arg Asn His Leu Arg Thr His Thr Gly Glu Arg 129
1679 CCC TTC CCC TGC CCT GAC TGC TCC AAG CCC TTC AAC TCA CCT GCC AAC CTG GCC CGC CAC 1738
130 Pro Phe Pro Cys Pro Asp Cys Ser Lys Pro Phe Asn Ser Pro Ala Asn Leu Ala Arg His 149
1739 CGG CTC ACA CAC ACA GGA GAG CGG CCC TAC CGG TGT GGG GAC TGT GGC AAG GCT TTC ACG 1798
150 Arg Leu Thr His Thr Gly Glu Arg Pro Tyr Arg Cys Gly Asp Cys Gly Lys Ala Phe Thr 169
1799 CAA AGC TCC ACA CTG AGG CAG CAC CGC TTG GTG CAT GCC CAG CAC TTT CCC TAC CGC TGC 1858
170 Gln Ser Ser Thr Leu Arg Gln His Arg Leu Val His Ala Gln His Phe Pro Tyr Arg Cys 189
1859 CAG GAA TGT GGG GTG CGT TTT CAC CGT CCT TAC CGG CTG CTC ATG CAC CGC TAC CAT CAC 1918
190 Gln Glu Cys Gly Val Arg Phe His Arg Pro Tyr Arg Leu Leu Met His Arg Tyr His His 209
1919 ACA GGT GAA TAC CCC TAC AAG TGT CGC GAG TGC CCC CGC TCC TTC TTG CTG CGT CGG CTG 1978
210 Thr Gly Glu Tyr Pro Tyr Lys Cys Arg Glu Cys Pro Arg Ser Phe Leu Leu Arg Arg Leu 229
1979 CTG GAG GTG CAC CAG CTC GTG GTC CAT GCC GGG CGC CAG CCC CAC CGC TGC CCA TCC TGT 2038
230 Leu Glu Val His Gln Leu Val Val His Ala Gly Arg Gln Pro His Arg Cys Pro Ser Cys 249
2039 GGG GCT GCC TTC CCC TCC TCA CTG CGG CTC CGG GAG CAC CGC TGT GCA GCC GCT GCT GCC 2098
250 Gly Ala Ala Phe Pro Ser Ser Leu Arg Leu Arg Glu His Arg Cys Ala Ala Ala Ala Ala 269
2099 CAG GCC CCA CGG CGC TTT GAG TGT GGC ACC TGT GGC AAG AAA GTG GGC TCA GCT GCT CGA 2158
270 Gln Ala Pro Arg Arg Phe Glu Cys Gly Thr Cys Gly Lys Lys Val Gly Ser Ala Ala Arg 289
2159 CTG CAG GCA CAC GAG GCG GCC CAT GCA GCT GCT GGG CCT GGA GAG GTC CTG GCT AAG GAG 2218
290 Leu Gln Ala His Glu Ala Ala His Ala Ala Ala Gly Pro Gly Glu Val Leu Ala Lys Glu 309
2219 CCC CCT GCC CCT CGA GCC CCA CGG GCC ACT CGT GCA CCA GTT GCC TCT CCA GCA GCC CTT 2278
310 Pro Pro Ala Pro Arg Ala Pro Arg Ala Thr Arg Ala Pro Val Ala Ser Pro Ala Ala Leu 329
2279 GGA AGC ACT GCTACA GCA TCC CCT GCG GCC CCT GCC CGC CGC CGG GGT CTA GAG TGC AGC 2338
330 Gly Ser Thr Ala Thr Ala Ser Pro Ala Ala Pro Ala Arg Arg Arg Gly Leu Glu Cys Ser 349
2339 GAG TGC AAG AAG CTG TTC AGC ACA GAG ACG TCA CTG CAG GTG CAC CGG CGC ATC CAC ACA 2398
350 Glu Cys Lys Lys Leu Phe Ser Thr Glu Thr Ser Leu Gln Val His Arg Arg Ile His Thr 369
2399 GGT GAG CGG CCA TAC CCA TGT CCA GAC TGT GGC AAA GCG TTC CGT CAG AGT ACC CAC CTG 2458
370 Gly Glu Arg Pro Tyr Pro Cys Pro Asp Cys Gly Lys Ala Phe Arg Gln Ser Thr His Leu 389
2459 AAA GAC ACC GGC GCC TGC ACA CAG GTG AGC GGC CCT TTG CCT GTG AAG TGT GTG GCA AGG 2518
390 Lys Asp Thr Gly Ala Cys Thr Gln Val Ser Gly Pro Leu Pro Val Lys Cys Val Ala Arg 409
2519 CCT TTG CCA TCT CCA TGC GCC TGG CAG AAC ATC GCC GCA TCC ACA CAG GCG AAC GAC CCT 2578
410 Pro Leu Pro Ser Pro Cys Ala Trp Gln Asn Ile Ala Ala Ser Thr Gln Ala Asn Asp Pro 429
2579 ACT CCT GCC CTG ACT GTG GCA AGA GCT ACC GCT CCT TCT CCA ACC TCT GGA AGC ACC GCA 2638
430 Thr Pro Ala Leu Thr Val Ala Arg Ala Thr Ala Pro Ser Pro Thr Ser Gly Ser Thr Ala 449
2639 AGA CCC ATC AGC AGC AGC ATC AGG CAG CTG TGC GGC AGC AGC TGG CAG AGG CGG AGG CTG 2698
450 Arg Pro Ile Ser Ser Ser Ile Arg Gln Leu Cys Gly Ser Ser Trp Gln Arg Arg Arg Leu 469
2699 CCG TTG GCC TGG CCG TCA TGG AGA CTG CTG TGG AGG CGC TAC CCC TGG TGG AAG CCA TTG 2758
470 Pro Leu Ala Trp Pro Ser Trp Arg Leu Leu Trp Arg Arg Tyr Pro Trp Trp Lys Pro Leu 489
2759 AGA TCT ACC CTC TGG CCG AGG CTG AGG GGG TCC AGA TCA GTG GCT GAC TCT GCC CGA CTT 2818
490 Arg Ser Thr Leu Trp Pro Arg Leu Arg Gly Ser Arg Ser Val Ala Asp Ser Ala Arg Leu 509
2819 CCT CTT TGG CAC CTC CAT TCC CTG TTG CTG AAG GCC CTC CAG CAT CCC CTT AAG CAT CTG 2878
510 Pro Leu Trp His Leu His Ser Leu Leu Leu Lys Ala Leu Gln His Pro Leu Lys His Leu 529
2879 TAC ATA CTG TGT CCC TTC CTC TTC CCA TCC CCA CCA CCT TGT AAG TTC TAA ATT GGA TTT 2938
530 Tyr Ile Leu Cys Pro Phe Leu Phe Pro Ser Pro Pro Pro Cys Lys Phe *** 546
2939 ATT CTC TCG TGA GGG GGG TGC TCT GGG GTC CTT GAC ACA CAT AAA GGT GCC CCC CCA CCT 2998
2999 TCC ACC TCT TAG CAC TGG TGA CCC CAA AAA TGA AAC CAT CAA TAA AGA CTG AGT TGC CAG 3058
3059 CAG TGT GTA GAG TGG AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA 3112
6.FP6628
A:核苷酸序列(SEQ ID NO:16)长度:3102个碱基
1 GGGCAGAGGT TGCAGTAACC CAAGATCATG CCACCATACT ACAGACTGTG TGACAGAGCG
61 AGACTCTGTC TCAAAACAAC AACAAAAAAA CAAACTCACC ATTGTACCTG TGCTTATGCA
121 AGGTTTAGTA GGAACGTAAA TTGGTTTAAC CTTTGTGGAC AGAAGTTTTA AAAATATATA
181 TTAAAATTAA AAGTATGCTC TGAAGGAGGA ACTCCACTTC TGGTAATTTA TCTCAAGAGA
241 ATAACTGGGC CAGCACAAAG GCTGCTGTTT AACAATGTGT AATGATGCAG TGACAGCTAC
301 AATTGCAAAA ATAACCTAGA CATTCACCAA TGAGGACTGG TTAAATGAAC TAGTATAACC
361 ATACTGCAGA ATATCATAAA GATAACAAAA AAATGATATG GATCTGTTTC TTGGCATAAA
421 TATATCCATA AGTTTTAAGA AGAGATGCTA TATATACGGT GGTCCCATTG ATGTATAACT
481 GTTAGGACTA AAAATAGTAC CTTCCTCATA ATGATGTTTT GAGGAATTAA TGAGTTTATT
541 CATGCAAAAT GCTTAGAATG GTACCTGGCA CACAGACAAT GTTTAAGAAA TGTTTGTTAT
601 TGTTATTACT ATGTCTCTGT ATATATGCAT AGGAAAAATC TGGAAGGATA AAATAAAAAA
661 TGAATATTTT TGGGTGGTGA GACTAAATTT TTGTCTAATT TTATGGATAA GTTTTATGAT
721 TTATATTTAT AATAAAAATA AAGCTATAAA AATTAATTAT GATGTTTCTT GCTCATGTCA
781 GCTACTTCAC TACATACTGA GTTCCCATCC CCATTTGTTA CAGGAGCAAC TCCTGGTTAA
841 GTACCTTTTT TGTAACTGTG AAATTCCCTT GACATTCATC ATATACTGAT GACTTTTCCT
901 AATACATGGA AACAAACAGG ATTGTGATTT TTCTCTCATT TTGTACACTA AGTTCTATGC
961 CAGCCGATTT CAGAGAGACA CTCTGCAAAG TTCCTATGAA AAGTCTTCAA AAATGTATTA
1021 CCTTGCTGTT TAATACCAAT ACCAAAATTC AAATGGACTT ATCAATTAAA CTCACCTCAA
1081 ACACAGTAAT GCACTCACAG TTATGAGCAG TGCTCACTAC TGCCAATCAT TTCTGCTTCC
1141 AGAATGGTTA AAGGAGCCAC AAACTCTGCC CTTATCAGAA GCAGTAGCCT GATAACAGGT
1201 AAGAATAGGA ATGTTCCGTT TCTCCCCAAA TTAAGAGTGG TATCAATAAT CTGACTTTTC
1261 CAGGCATTTA TCTCACAGAA ATGTTTATGA GACATGCTAA GATCAACATG GTAATATCTG
1321 ACTATTGTTT TTATTAGAAA TAAGGGGGCC AGCCAGGCAC AGTACCTTAC ACCTGTAATC
1381 CCAGCACCTG GGGAGGTTGA GGTGGGAGGA TTGCTTGAGC CCAGGAGTTT GAGACAAGCC
1441 TGGGCAACAC AGGGAGACAC CAGCTCTATT AAAAAAAAAA AAAGTAAGGG GGCTATAATG
1501 TAACCCTTAT TGACTGATCT TTGAGGCTAC TGTTGTGAGA TTTCTACATC CCTCTTTATT
1561 ATAAAAGATC CCAAATGCGG CTTTACTTGG AAAGGAAGCA ATTTGACAGT GATGAGGAAT
1621 GATGTGCAGA ATGGAGATTC AGAACCCTAA CAGACTCTGG TATTGATATC TAGTGCTCAT
1681 ATTTCTGGGA GTCTGCTAGG GTTATGGGAG TTTGCATTTA AATTGTAGGT TGTTGCAGAA
1741 AACAGAATTT ATATGTGGAA AATTGTAACG AATCCACTAA AAAACTATTA GAACTAATAA
1801 TCAAGTTTGG CAAGGTTGTA AGACATAAGT CAGTATACAA AAATCAACTG TATTTCTATA
1861 CATTTGTGAC AATCTGAAAA TGAAATTAGG AAAACAAATC CATTTACGAT AGCAACAAGA
1921 AGTATAAAAT ACTTAGGAAG AAGTATAACA AAAGATGTGC ACAATTTATA TTCTGAAAAC
1981 TACAAATAGT GTTTAAAGAA ATTAAAGAAT ATTAAAATAA ATGGAAAAAT ATCCCATGTT
2041 CATGGACTGG AAGAATTACT CTTAAGATGT CAATACTCCT CAAATTGATC TACATATTTG
2101 ATACAATCCT TGTAAGAACC CGAACTGACT TCTTTGTAGA AATTGACAAA TTGATTCTAA
2161 GATTCATACA GGATTGCCAT AGATCCAGAA TAGCCACATC AATTTTAAAA AAAGAAGAAA
2221 GTACAAAGAC TCACATTACC TGATTTAAAA ACATACCATA AAGCAATGTT AGGACAGTGT
2281 GGTATTGACA TAAGGATAGA CACATAGATC AATGAAAAGG AAAGGGAGCC CAGAAGTAAA
2341 ACCACATCAA CTGATTTTCA ACAAAGATGC CAAGACCATT CAATTGAGGA AAGAATAGTC
2401 CCTTCAACAA ATGGTGCTGC AACCAGACAG TCATATGCAA AAGAATGAAA TTTAACCTTT
2461 ACAAAATTTA ACCATATATA AAAATTAATT CAAATGGATC AAAGACATAT AAGGGCTGAA
2521 ACTATAAAAT TGTTAAGAGA ACATAGGAAT AAATATTCAT GACCTTGGAT TTGGCAGTGG
2581 ATTCTTAGCT ATAACATCAA AGCACAAGTA AGAAAAGAGA GATAAATTGG ATTTCATGAA
2641 AATTAAAAAC CTGTGCTTCA AAGACACTAT CAAGAAAGTG ACAAGGCAAC CCACAGAATG
2701 GGAAAAACTG CAGATTATCT GATAAGGGAC TTCTATCTAG AATATATAAA AATCTCTCAC
2761 AACTCAGAAA TAAGACAATC CAGTTAAAAT AAGGGTAAAG GAGCCGGGCA TGGTGGCTCA
2821 CGCCTGTAAT CCCAGAGCTT TGGGAGGTGG AGGTGGGCAG ATCACCTGAG GTCAGGAGTT
2881 CACGACCAGC CTGGCCAACA TGGTAAAACC CCATCTCTAC TAAAAATACA AAAATTAGCC
2941 GGGTGTGGTG GTGCATGCCT GTAATCCCAG CTACTTGGGA GGCTGAGGCA GAAGAATCAC
3001 TTGAACCTGG GAGGTGGAGG TTGCAGTGAG CCGAGATCGC GCCACTGCAC TCCAGCCTGG
3061 GCGACAGAGC GAGAATCTGT CTCGAAAAA AAAAAAAAAAA AA
B:核苷酸序列(SEQ ID NO:18)长度:99个氨基酸
1 MFVIVITMSL YICIGKIWKD KIKNEYFWVV RLNFCLILWI SFMIYIYNKN KAIKINYDVS
61 CSCQLLHYIL SSHPHLLQEQ LLVKYLFCNC EIPLTFIIY
C.核苷酸及氨基酸组合序列(SEQ ID NO:17)克隆号和蛋白名称:FP6628
起始编码子:590 ATG终止编码子:887 TGA蛋白质分子量:11979.94
1 G GGC AGA GGT TGC AGT AAC CCA AGA TCA TGC CAC CAT ACT ACA GAC TGT GTG ACA GAG 58
59 CGA GAC TCT GTC TCA AAA CAA CAA CAA AAA AAC AAA CTC ACC ATT GTA CCT GTG CTT ATG 118
119 CAA GGT TTA GTA GGA ACG TAA ATT GGT TTA ACC TTT GTG GAC AGA AGT TTT AAA AAT ATA 178
179 TAT TAA AAT TAA AAG TAT GCT CTG AAG GAG GAA CTC CAC TTC TGG TAA TTT ATC TCA AGA 238
239 GAA TAA CTG GGC CAG CAC AAA GGC TGC TGT TTA ACA ATG TGT AAT GAT GCA GTG ACA GCT 298
299 ACA ATT GCA AAA ATA ACC TAG ACA TTC ACC AAT GAG GAC TGG TTA AAT GAA CTA GTA TAA 358
359 CCA TAC TGC AGA ATA TCA TAA AGA TAA CAA AAA AAT GAT ATG GAT CTG TTT CTT GGC ATA 418
419 AAT ATA TCC ATA AGT TTT AAG AAG AGA TGC TAT ATA TAC GGT GGT CCC ATT GAT GTA TAA 478
479 CTG TTA GGA CTA AAA ATA GTA CCT TCC TCA TAA TGA TGT TTT GAG GAA TTA ATG AGT TTA 538
539 TTC ATG CAA AAT GCT TAG AAT GGT ACC TGG CAC ACA GAC AAT GTT TAA GAA ATG TTT GTT 598
1 Met Phe Val 3
599 ATT GTT ATT ACT ATG TCT CTG TAT ATA TGC ATA GGA AAA ATC TGG AAG GAT AAA ATA AAA 658
4 Ile Val Ile Thr Met Ser Leu Tyr Ile Cys Ile Gly Lys Ile Trp Lys Asp Lys Ile Lys 23
659 AAT GAA TAT TTT TGG GTG GTG AGA CTA AAT TTT TGT CTA ATT TTA TGG ATA AGT TTT ATG 718
24 Asn Glu Tyr Phe Trp Val Val Arg Leu Asn Phe Cys Leu Ile Leu Trp Ile Ser Phe Met 43
719 ATT TAT ATT TAT AAT AAA AAT AAA GCT ATA AAA ATT AAT TAT GAT GTT TCT TGC TCA TGT 778
44 Ile Tyr Ile Tyr Asn Lys Asn Lys Ala Ile Lys Ile Asn Tyr Asp Val Ser Cys Ser Cys 63
779 CAG CTA CTT CAC TAC ATA CTG AGT TCC CAT CCC CAT TTG TTA CAG GAG CAA CTC CTG GTT 838
64 Gln Leu Leu His Tyr Ile Leu Ser Ser His Pro His Leu Leu Gln Glu Gln Leu Leu Val 83
839 AAG TAC CTT TTT TGT AAC TGT GAA ATT CCC TTG ACA TTC ATC ATA TAC TGA TGA CTT TTC 898
84 Lys Tyr Leu Phe Cys Asn Cys Glu Ile Pro Leu Thr Phe Ile Ile Tyr *** 100
899 CTA ATA CAT GGA AAC AAA CAG GAT TGT GAT TTT TCT CTC ATT TTG TAC ACT AAG TTC TAT 958
959 GCC AGC CGA TTT CAG AGA GAC ACT CTG CAA AGT TCC TAT GAA AAG TCT TCA AAA ATG TAT 1018
1019 TAC CTT GCT GTT TAA TAC CAA TAC CAA AAT TCA AAT GGA CTT ATC AAT TAA ACT CAC CTC 1078
1079 AAA CAC AGT AAT GCA CTC ACA GTT ATG AGC AGT GCT CAC TAC TGC CAA TCA TTT CTG CTT 1138
1139 CCA GAA TGG TTA AAG GAG CCA CAA ACT CTG CCC TTA TCA GAA GCA GTA GCC TGA TAA CAG 1198
1199 GTA AGA ATA GGA ATG TTC CGT TTC TCC CCA AAT TAA GAG TGG TAT CAA TAA TCT GAC TTT 1258
1259 TCC AGG CAT TTA TCT CAC AGA AAT GTT TAT GAG ACA TGC TAA GAT CAA CAT GGT AAT ATC 1318
1319 TGA CTA TTG TTT TTA TTA GAA ATA AGG GGG CCA GCC AGG CAC AGT AGC TTA CAC CTG TAA 1378
1379 TCC CAG CAC CTG GGG AGG TTG AGG TGG GAG GAT TGC TTG AGC CCA GGA GTT TGA GAC AAG 1438
1439 CCT GGG CAA CAC AGG GAG ACA CCA GCT CTA TTA AAA AAA AAA AAA GTA AGG GGG CTA TAA 1498
1499 TGT AAC CCT TAT TGA CTG ATC TTT GAG GCT ACT GTT GTG AGA TTT CTA CAT CCC TCT TTA 1558
1559 TTA TAA AAG ATC CCA AAT GCG GCT TTA CTT GGA AAG GAA GCA ATT TGA CAG TGA TGA GGA 1618
1619 ATG ATG TGC AGA ATG GAG ATT CAG AAC CCT AAC AGA CTC TGG TAT TGA TAT CTA GTG CTC 1678
1679 ATA TTT CTG GGA GTC TGC TAG GGT TAT GGG AGT TTG CAT TTA AAT TGT AGG TTG TTG CAG 1738
1739 AAA ACA GAA TTT ATA TGT GGA AAA TTG TAA CGA ATC CAC TAA AAA ACT ATT AGA ACT AAT 1798
1799 AAT CAA GTT TGG CAA GGT TGT AAG ACA TAA GTC AGT ATA CAA AAA TCA ACT GTA TTT CTA 1858
1859 TAC ATT TGT GAC AAT CTG AAA ATG AAA TTA GGA AAA CAA ATC CAT TTA CGA TAG CAA CAA 1918
1919 GAA GTA TAA AAT ACT TAG GAA GAA GTA TAA CAA AAG ATG TGC ACA ATT TAT ATT CTG AAA 1978
1979 ACT ACA AAT AGT GTT TAA AGA AAT TAA AGA ATA TTA AAA TAA ATG GAA AAA TAT CCC ATG 2038
2039 TTC ATG GAC TGG AAG AAT TAC TCT TAA GAT GTC AAT ACT CCT CAA ATT GAT CTA CAT ATT 2098
2099 TGA TAC AAT CCT TGT AAG AAC CCG AAC TGA CTT CTT TGT AGA AAT TGA CAA ATT GAT TCT 2158
2159 AAG ATT CAT ACA GGA TTG CCA TAG ATC CAG AAT AGC CAC ATC AAT TTT AAA AAA AGA AGA 2218
2219 AAG TAC AAA GAC TCA CAT TAC CTG ATT TAA AAA CAT ACC ATA AAG CAA TGT TAG GAC AGT 2278
2279 GTG GTA TTG ACA TAA GGA TAG ACA CAT AGA TCA ATG AAA AGG AAA GGG AGC CCA GAA GTA 2338
2339 AAA CCA CAT CAA CTG ATT TTC AAC AAA GAT GCC AAG ACC ATT CAA TTG AGG AAA GAA TAG 2398
2399 TCC CTT CAA CAA ATG GTG CTG CAA CCA GAC AGT CAT ATG CAA AAG AAT GAA ATT TAA CCT 2458
2459 TTA CAA AAT TTA ACC ATA TAT AAA AAT TAA TTC AAA TGG ATC AAA GAC ATA TAA GGG CTG 2518
2519 AAA CTA TAA AAT TGT TAA GAG AAC ATA GGA ATA AAT ATT CAT GAC CTT GGA TTT GGC AGT 2578
2579 GGA TTC TTA GCT ATA ACA TCA AAG CAC AAG TAA GAA AAG AGA GAT AAA TTG GAT TTC ATG 2638
2639 AAA ATT AAA AAC CTG TGC TTC AAA GAC ACT ATC AAG AAA GTG ACA AGG CAA CCC ACA GAA 2698
2699 TGG GAA AAA CTG CAG ATT ATC TGA TAA GGG ACT TCT ATC TAG AAT ATA TAA AAA TCT CTC 2758
2759 ACA ACT CAG AAA TAA GAC AAT CCA GTT AAA ATA AGG GTA AAG GAG CCG GGC ATG GTG GCT 2818
2819 CAC GCC TGT AAT CCC AGA GCT TTG GGA GGT GGA GGT GGG CAG ATC ACC TGA GGT CAG GAG 2878
2879 TTC ACG ACC AGC CTG GCC AAC ATG GTA AAA CCC CAT CTC TAC TAA AAA TAC AAA AAT TAG 2938
2939 CCG GGT GTG GTG GTG CAT GCC TGT AAT CCC AGC TAC TTG GGA GGC TGA GGC AGA AGA ATC 2998
2999 ACT TGA ACC TGG GAG GTG GAG GTT GCA GTG AGC CGA GAT CGC GCC ACT GCA CTC CAG CCT 3058
3059 GGG CGA CAG AGC GAG AAT CTG TCT CGA AAA AAA AAA AAA AAA AA 3102
7.FP6651
A:核苷酸序列(SEQ ID NO:19)长度:2455个碱基
1 GTTCTAGGTA GTAGAAAGCA AAGGGTGCTA TGAAGAGCGT GTACACAGAC TCCCAACTGT
61 TTTGGGAGTT AAGGAAGGTT TCTTGGAGGA AGTGGCATTC AAGCTATAAG ACCTGATGAT
121 CAGGTGGAGT TAGCTGGAGA GCAGGGACAG AGAGAATAGC CTGTGCAAAA GGCCTATTCT
181 TCAGGAGAGA ATGACACATG AATGGGACTG AAGAAGTAAA CTGGTATCTC ATATGAAGGA
241 CCTTTTATAT CTTGTTAAGG ATTTTGAACT TCCTCCTTTT TTTTTTTTTG AGACAGAGTT
301 TCTCTCTGTC ACCCAGGCTG AAGTGCATTG GCGTGATCTC GGCTCATGGC AGCCTCCACC
361 TACCAGGTTC AAGCTATTCT CCTGCCTCAG CTTCCCAGAT AGCTGGGATT ACAGTCATGT
421 GCCACCACGC CGGGCTAATT TTTGTATTTT TAGTAGAGAC AGGGTTTCAC CGTGTTGGCC
481 AGGCTGGTCT CGATTTCCTG ACCTCAAGTG ATCTGCCTGC CTTGGCCTGC CCCAGTGCCG
541 GAATTACAGG AGTGAGCCCC CGCGCCTGGC CTGGACTTCT GCTTAAAGGC AATAAGGAAG
601 CCTTTACTAG ATTTAAAATA GGAGCTCAGT TTAAATTAGT AAGGATTTGT ATTTCATCAA
661 GAGCTCTCTT TGGCCCTAGT CTGGTAGAAG ATGAGTCGAA GTAGAGAGAC TAGTTACAAA
721 GCTGTTCCCA ATAATCCAGG TGAAAAATAG TGGTGACCCT AGATTAAGGT AGTATTGGTG
781 TGGGTAGGGA GAAGTGGACA GTCATATTTG AGAGGTACCT AGGGAATAGA ATTGCAAAGA
841 CCTGGGAGTA GATTGGATAT TCAGTGGGAG GAAGGGAGAG AAGTAATCTC TCAAGTGTTG
901 CTCAAGCCAT AACCTTGGAT GGTACTGTCC ACTGATACAG TAGGAGGAAA ATGTTTGAGG
961 GAAAGTAGTG ATGAATTTGT GGTGCACTAA CATGGCCAAC ACTAAATATT AGAAAGATTA
1021 ATGTGGTCAT GTAGAAGATG AATGAAAAGA AGATACCTCA GAAGTGGAGA GATAGTTAAA
1081 TGGCTTTTGT AGGAATCTCA GCTAGAAGTG TCAGTATTCT TAAGTGCAGA ACTAACAGGT
1141 GTGGGAAAGT AATGGGAAGT AGACACCAAA CAAATAGTTC CCCAAAGATG GTATCAAATA
1201 TCCCAGTGAC AGCTTGCAGC CTGCTCAGCT TTATGATATG CCCCTGAGAT CATTTTTCAG
1261 GACAAAAAGT AGTGAAACTA CCTTTATTTA CTTCTCAAAT TTACCTTTAT TTACTTCTCA
1321 AATATACATA GAAAGTAATA TTGTAAAAAG CAGCTCTGGC TGGGCGCTGT GGCTCAAGCC
1381 TATAGTCCCA GCACTTTGGG AGGCTGAGGG GGGCAGATGA CTTGAGGTCA AGAGTTCAAG
1441 ACCATCCTGG CCAACATGGC AAAACCGCAT TTCTACTAAA AATACAAAAA TTCGCGTGGC
1501 AGCACGTGCC TGTAACCCCA GCTACTCTGG AGGCTGAGGT ACAAGAGTCG CTTGAATTTG
1561 GGAGGTGGAG ACTGCAGCGA GCCGAGATCC TACCACTGCA CTCCAGCTTG GGGGACAGTG
1621 CGAGACTCTG TCTTAAAAAA CAGTGGCCTG GCGCACTGGC TCACGCTTGT AATCCCAGCA
1681 CTTTGGGAGG CCGAGGTGGG CGGGGGTGGA TCATTGAGGT CAGGAGATCA AGACCATCCC
1741 GGCCAACGTG GTGCAACCCC GTCTCTACTA CAAATACAAA AATTAGCTGG ACATGGTGGT
1801 GTACGCCTGT AGTCCCAGCT ACTCGGGAGA GTAAGACGGG AATCGCTTGA ACCTGGGAGG
1861 TGGGAGGTTG CAGTGAGCCA AGATTGTGCC ACTGCACTCC AGCCTGGCGA CAGAGCAAGA
1921 CTGTCTTAAA AAAAAAAAAA AAAAAAAAAG GATATTTTCA CTCTTGGGAC TTGATAAAGC
1981 TAGTTTATTT TGATTATCTC CTATATCCTA TACATATTTA ATTGGCCCCT ATGAACAATG
2041 TTACCTCTTT ATGAGGGGAC CCAAAGAAGT AGCTGCTGGT GTGAGAGTGA GAGATCATCC
2101 ATCTTTTTTA TTGTGCTTTT TGTTGTTTCT TTGTCCTGCT ATGTGTTATA AGTAAGGCCG
2161 GGCACGGTGG CTCATGCCTG TAATCCCAGC ACTTAGGGAG GCCAAGGCCA GATCCCTGAG
2221 GTCAAGAGTT TGAGACCAGC CTAGCCAACA TGGTGAAACC TTGTCTTTAC TGAAAATACA
2281 AAAAAATTAG CTGGGCAGGG TGGCATGCGC CTGTAGTCCC AGCTACTCGC AGAGGCTGAG
2341 GCAGGAGAAT TGCTTGAACC TGGGAGGCGG AGGTTGCGGT GAGCCAAGAT CCTGCCACTG
2401 CACTCCAGCC TGGGCAACAG AGGGAGACTC CATCTCAA AAAAAAAAAAA AAAAAA
B:核苷酸序列(SEQ ID NO:21)长度:95个氨基酸
1 MCHHAGLIFV FLVETGFHRV GQAGLDFLTS SDLPALACPS AGITGVSPRA WPGLLLKGNK
61 EAFTRFKIGA QFKLVRICIS SRALFGPSLV EDESK
C.核苷酸及氨基酸组合序列(SEQ ID NO:20)克隆号和蛋白名称:FP6651
起始编码子:417 ATG终止编码子:702 TAG蛋白质分子量:10202.37
1 GT TCT AGG TAG TAG AAA GCA AAG GGT GCT ATG AAG AGC GTG TAC ACA GAC TCC CAA CTG 59
60 TTT TGG GAG TTA AGG AAG GTT TCT TGG AGG AAG TGG CAT TCA AGC TAT AAG ACC TGA TGA 119
120 TCA GGT GGA GTT AGC TGG AGA GCA GGG ACA GAG AGA ATA GCC TGT GCA AAA GGC CTA TTC 179
180 TTC AGG AGA GAA TGA CAC ATG AAT GGG ACT GAA GAA GTA AAC TGG TAT CTC ATA TGA AGG 239
240 ACC TTT TAT ATC TTG TTA AGG ATT TTG AAC TTC CTC CTT TTT TTT TTT TTG AGA CAG AGT 299
300 TTC TCT CTG TCA CCC AGG CTG AAG TGC ATT GGC GTG ATC TCG GCT CAT GGC AGC CTC CAC 359
360 CTA CCA GGT TCA AGC TAT TCT CCT GCC TCA GCT TCC CAG ATA GCT GGG ATT ACA GTC ATG 419
1 Met 1
420 TGC CAC CAC GCC GGG CTA ATT TTT GTA TTT TTA GTA GAG ACA GGG TTT CAC CGT GTT GGC 479
2 Cys His His Ala Gly Leu Ile Phe Val Phe Leu Val Glu Thr Gly Phe His Arg Val Gly 21
480 CAG GCT GGT CTC GAT TTC CTG ACC TCA AGT GAT CTG CCT GCC TTG GCC TGC CCC AGT GCC 539
22 Gln Ala Gly Leu Asp Phe Leu Thr Ser Ser Asp Leu Pro Ala Leu Ala Cys Pro Ser Ala 41
540 GGA ATT ACA GGA GTG AGC CCC CGC GCC TGG CCT GGA CTT CTG CTT AAA GGC AAT AAG GAA 599
42 Gly Ile Thr Gly Val Ser Pro Arg Ala Trp Pro Gly Leu Leu Leu Lys Gly Asn Lys Glu 61
600 GCC TTT ACT AGA TTT AAA ATA GGA GCT CAG TTT AAA TTA GTA AGG ATT TGT ATT TCA TCA 659
62 Ala Phe Thr Arg Phe Lys Ile Gly Ala Gln Phe Lys Leu Val Arg Ile Cys Ile Ser Ser 81
660 AGA GCT CTC TTT GGC CCT AGT CTG GTA GAA GAT GAG TCG AAG TAG AGA GAC TAG TTA CAA 719
82 Arg Ala Leu Phe Gly Pro Ser Leu Val Glu Asp Glu Ser Lys *** 96
720 AGC TGT TCC CAA TCC TCC AGG TGA AAA ATA GTG GTG ACC CTA GAT TAA GGT AGT ATT GGT 779
780 GTG GGT AGG GAG AAG TGG ACA GTC ATA TTT GAG AGG TAC CTA GGG AAT AGA ATT GCA AAG 839
840 ACC TGG GAG TAG ATT GGA TAT TCA GTG GGA GGA AGG GAG AGA AGT AAT CTC TCA AGT GTT 899
900 GCT CAA GCC ATA ACC TTG GAT GGT ACT GTC CAC TGA TAC AGT AGG AGG AAA ATG TTT GAG 959
960 GGA AAG TAG TGA TGA ATT TGT GGT GCA CTA ACA TGG CCA ACA CTA AAT ATT AGA AAG ATT 1019
1020 AAT GTG GTC ATG TAG AAG ATG AAT GAA AAG AAG ATA CCT CAG AAG TGG AGA GAT AGT TAA 1079
1080 ATG GCT TTT GTA GGA ATC TCA GCT AGA AGT GTC AGT ATT CTT AAG TGC AGA ACT AAC AGG 1139
1140 TGT GGG AAA GTA ATG GGA AGT AGA CAC CAA ACA AAT AGT TCC CCA AAG ATG GTA TCA AAT 1199
1200 ATC CCA GTG ACA GCT TGC AGC CTG CTC AGC TTT ATG ATA TGC CCC TGA GAT CAT TTT TCA 1259
1260 GGA CAA AAA GTA GTG AAA CTA CCT TTA TTT ACT TCT CAA ATT TAC CTT TAT TTA CTT CTC 1319
1320 AAA TAT ACA TAG AAA GTA ATA TTG TAA AAA GCA GCT CTG GCT GGG CGC TGT GGC TCA AGC 1379
1380 CTA TAG TCC CAG CAC TTT GGG AGG CTG AGG GGG GCA GAT GAC TTG AGG TCA AGA GTT CAA 1439
1440 GAC CAT CCT GGC CAA CAT GGC AAA ACC GCA TTT CTA CTA AAA ATA CAA AAA TTC GCG TGG 1499
1500 CAG CAC GTG CCT GTA ACC CCA GCT ACT CTG GAG GCT GAG GTA CAA GAG TCG CTT GAA TTT 1559
1560 GGG AGG TGG AGA CTG CAG CGA GCC GAG ATC CTA CCA CTG CAC TCC AGC TTG GGG GAC AGT 1619
1620 GCG AGA CTC TGT CTT AAA AAA CAG TGG CCT GGC GCA CTG GCT CAC GCT TGT AAT CCC AGC 1679
1680 ACT TTG GGA GGC CGA GGT GGG CGG GGG TGG ATC ATT GAG GTC AGG AGA TCA AGA CCA TCC 1739
1740 CGG CCA ACG TGG TGC AAC CCC GTC TCT ACT ACA AAT ACA AAA ATT AGC TGG ACA TGG TGG 1799
1800 TGT ACG CCT GTA GTC CCA GCT ACT CGG GAG AGT AAG ACG GGA ATC GCT TGA ACC TGG GAG 1859
1860 GTG GGA GGT TGC AGT GAG CCA AGA TTG TGC CAC TGC ACT CCA GCC TGG CGA CAG AGC AAG 1919
1920 ACT GTC TTA AAA AAA AAA AAA AAA AAA AAA GGA TAT TTT CAC TCT TGG GAC TTG ATA AAG 1979
1980 CTA GTT TAT TTT GAT TAT CTC CTA TAT CCT ATA CAT ATT TAA TTG GCC CCT ATG AAC AAT 2039
2040 GTT ACC TCT TTA TGA GGG GAC CCA AAG AAG TAG CTG CTG GTG TGA GAG TGA GAG ATC ATC 2099
2100 CAT CTT TTT TAT TGT GCT TTT TGT TGT TTC TTT GTC CTG CTA TGT GTT ATA AGT AAG GCC 2159
2160 GGG CAC GGT GGC TCA TGC CTG TAA TCC CAG CAC TTA GGG AGG CCA AGG CCA GAT CCC TGA 2219
2220 GGT CAA GAG TTT GAG ACC AGC CTA GCC AAC ATG GTG AAA CCT TGT CTT TAC TGA AAA TAC 2279
2280 AAA AAA ATT AGC TGG GCA GGG TGG CAT GCG CCT GTA GTC CCA GCT ACT CGC AGA GGC TGA 2339
2340 GGC AGG AGA ATT GCT TGA ACC TGG GAG GCG GAG GTT GCG GTG AGC CAA GAT CCT GCC ACT 2399
2400 GCA CTC CAG CCT GGG CAA CAG AGG GAG ACT CCA TCT CAA AAA AAA AAA AAA AAA AA 2455
8.FP7162
A:核苷酸序列(SEQ ID NO:22)长度:2572个碱基
1 GCGGGGTTTC ACTATGTTGG CCAGGCTGGT CTAGAACTCC TGACCTCAAG TGATCTGCCC
61 GCCTCGGCCT CCCAAAGTGC TGGGATTGCA GGCGTGAGAC ACTGCACCCG GACAATTTTC
121 CTTTTCTTAC AAGAACACTG CTCACACTGC ATTCAGGGCC AACCCTAACC CAGTATCGCC
181 TCATCCTGGT TTGATTATAT CGGCACAGAC CTTGCTTCCG AGCGAGGCCA CTTTCTCAGG
241 TACTGGTGGA CATGAGTCTT CGGAGACGCT GCTCAACCCA CAGTGCTCCT CCAGCTTGGT
301 TTCTGTGACT TGCCTTCCCC AGAGGAGGGG TGCCCTGAGA GGTCTCCACT CCCTGACCGG
361 CTCCTTGGTG CCGCGCACTC TGAGAGGCTT CCCAGGGAAC AGAGCACACA GGACCGCCCT
421 CCTGGGTAGA CCAATCAGCA TCTGAGCTCA CAATTTCCCA GCAGGGCAGT GGGGTGGAGA
481 GAGAAGCCTG GGCTGGGCTG GGCTGGGCTG GGCTGGGGAA GCTTCTCCGG GCGGGGGGAC
541 GTCAGAGCAG GATCTGGGGC TGATAAAAGC CCGCCCCTGG GTGGGGGCTG AGTGGTGCGG
601 AAGCTGAGCC CGACACGTGG GGATGGAGGA CAGGCTGTGG GAGGGTGTGA ACCGGATACT
661 GCTTGAAGGG GTGCTGGGGA CTTTGAGAGA GGGCGGCTGG CCCTGTCTGG TCGGGGATGC
721 TGGCCCAGAC ACAGGCCATG GCTGGGATGG GGTTCAGAAA CAGGACCGCT GTCTCTCCCG
781 GGCCAGGGCC CTCCCCAGCT GCTCCTGGCT TTCTGGTTCT TGGGGTCAGG GGCAGGCCTG
841 TGCCATGACC CCGCCACTGA GGCTGTGAGG AGGCTGTCGG TGCCCAAGGG CACCAAGGCA
901 CACCCCTACT CTTGCACCCC ATGTGTGGGC CCGAGCACCT GCTCTGCTGC CCCAAAGATC
961 TGGCGATGTT TCCCAGGCAA CTGTCTCTCA CAGCCTGTCT GCCTGGCACT CCCGTATCCC
1021 ATAAATGCCA CCACATCTGG CTATGGGTGG GCGTGCCTGC CTGGCATCCA CGGGCCAGCA
1081 GGTGTGGTGG AGCACAGCCC AGTTCCTGGC TGCGTCAGAA GGCTGCCCGG GCCTTTTGGC
1141 TGTCCTTGCC AGCAGGTGAG CACTGCCAGG GCACCGTGTG TGGGTGCTGG GCCATTTAGC
1201 CACATGGGAA GGGGTGGAGG CAGCCCAGTG CCTTCAGCAT GTGCCCAGGG TGCCTGTCGG
1261 CCACAGGTCT CATTTGGAAA TTGGGAGGGT GCACGGCCAC CGGGCTGCTT AGGCCTGCCA
1321 GCCTCAGGGC CCGTCACCGC TGTCTTAGCC TGATTTGCAG GGTGTCAACG CTGGGCAGAG
1381 ATGAACATTT GGGTGACTCT GAGGATGCCA GTGGCTGGGA CACTTGTTCT TCCGCGGTGG
1441 AAGGAGTTGG AGAGGCCTGG CTCCCTGACC TACGGCCAGC CTGGCTTCTG AAACCAGCTC
1501 AGTGGGCTGG GGCCTGATTC ATCATCCATA AATGTGTCCT TTTTTGCCAC AGAGGGTAAG
1561 GGGCCTCCTA GCCCACCGGT CTGCAGGTGC GGGAGTAGGA GATGGGTGGC TCTGATGCCC
1621 CCACCCACTC GATCACCTTC TGCTCTGCCT GGGATGCAAA CTCCCACAGC TGAAACGTTC
1681 TTTTGTAAAC ATGAATTTTG GCTTAGAAAA AACTCATTTC CACTGTGCAC GTGTCAGTCC
1741 CAACCAGAAA TTATTTTCCA ATAAAGCAAA ACTCCGTCAC CACAGCAGCA GATGGCTCCG
1801 AAGAAGTGGA GCGTTTTCAT CAGGTTCAAC TTTGAAACCT CCACCATCAC CATCACCAGC
1861 ACCGCTGTGT CATGCTGATA ACTTGAGGAC AGGCAGGACA AGGCCTTCTG GCGGCCGCCC
1921 CTGGTTTCTC CTGGGGGGTG ATGAGCGGGA GCGGCTCTGG GCCGAGCTAC TGCGCACGGT
1981 GAGCCCGGAG CTGATCCTGG ATCACGAGGT GCCTTCACTG CCCGCCTTCC CAGGACAGGA
2041 GCCCAGGTGC GGCCCGGAGC CCACTGAAGT CTTCACTGTC GGACCCAAGA CCTTTTCCTG
2101 GACACCCTTT CCGCCGGACC TGTGGGGCCC GGGCCGTTCC TACCGGCTGC TTCACGGGGC
2161 AGGAGGGCAC CTGGAATCCC CCGCCAGGTC CCTGCCCCAG CGCCCGGCAC CTGATCCCTG
2221 CAGGGCCCCC AGGGTGGAGC AGCAACCGTC TGTGGAGGGT GCCGCGGCCC TGCGCAACTG
2281 CCCCATGTGC CAGAAGGAGT TTGCCCCCAG GCTGACCCAG CTGGATGTTG ACAGCCACCT
2341 GGCCCAGTGC TTGGCCGAAA GCACAAAAAA CGTGACGTGG TGAGCGCCAT CCAAGAGCCC
2401 TGCGCAGAGT GCAGCGCCCG GACACGCTTT CCCCCGCCAG CAGCCCCGCC TCTCGGCTCC
2461 CCCGCCAACA GCCCCGCCTT TCGGCTCCCC CGCATGGGCA TTAAAACAGG GCGGGCTCCT
2521 GTCTGTCTCT GTGTTGTGAT GAAAAAAAAA AAAAAAAAA AAAAAAAAAAA AA
B:核苷酸序列(SEQ ID NO:24)长度:230个氨基酸
1 MNFGLEKTHF HCARVSPNQK LF5NKAKLRH HSSRWLRRSG AFS5GSTLKP PPSPSPAPLC
61 HADNLRTGRT RPSGGRPWFL LGGDERERLW AELLRTVSPE LILDHEVPSL PAFPGQEPRC
121 GPEPTEVFTV GPKTFSWTPF PPDLWGPGRS YRLLHGAGGH LESPARSLPQ RPAPDPCRAP
181 RVEQQPSVEG AAALRNCPMC QKEFAPRLTQ LDVDSHLAQC LAESTKNVTW
C.核苷酸及氨基酸组合序列(SEQ ID NO:23)克隆号和蛋白名称:FP7162
起始编码子:1691 ATG终止编码子:2381 TGA蛋白质分子量:25496.60
1 G CGG GGT TTC ACT ATG TTG GCC AGG CTG GTC TAG AAC TCC TGA CCT CAA GTG ATC TGC 58
59 CCG CCT CGG CCT CCC AAA GTG CTG GGA TTG CAG GCG TGA GAC ACT GCA CCC GGA CAA TTT 118
119 TCC TTT TCT TAC AAG AAC ACT GCT CAC ACT GCA TTC AGG GCC AAC CCT AAC CCA GTA TCG 178
179 CCT CAT CCT GGT TTG ATT ATA TCG GCA CAG ACC TTG CTT CCG AGC GAG GCC ACT TTC TCA 238
239 GGT ACT GGT GGA CAT GAG TCT TCG GAG ACG CTG CTC AAC CCA CAG TGC TCC TCC AGC TTG 298
299 GTT TCT GTG ACT TGC CTT CCC CAG AGG AGG GGT GCC CTG AGA GGT CTC CAC TCC CTG ACC 358
359 GGC TCC TTG GTG CCG CGC ACT CTG AGA GGC TTC CCA GGG AAC AGA GCA CAC AGG ACC GCC 418
419 CTC CTG GGT AGA CCA ATC AGC ATC TGA GCT CAC AAT TTC CCA GCA GGG CAG TGG GGT GGA 478
479 GAG AGA AGC CTG GGC TGG GCT GGG CTG GGC TGG GCT GGG GAA GCT TCT CCG GGC GGG GGG 538
539 ACG TCA GAG CAG GAT CTG GGG CTG ATA AAA GCC CGC CCC TGG GTG GGG GCT GAG TGG TGC 598
599 GGA AGC TGA GCC CGA CAC GTG GGG ATG GAG GAC AGG CTG TGG GAG GGT GTG AAC CGG ATA 658
659 CTG CTT GAA GGG GTG CTG GGG ACT TTG AGA GAG GGC GGC TGG CCC TGT CTG GTC GGG GAT 718
719 GCT GGC CCA GAC ACA GGC CAT GGC TGG GAT GGG GTT CAG AAA CAG GAC CGC TGT CTC TCC 778
779 CGG GCC AGG GCC CTC CCC AGC TGC TCC TGG CTT TCT GGT TCT TGG GGT CAG GGG CAG GCC 838
839 TGT GCC ATG ACC CCG CCA CTG AGG CTG TGA GGA GGC TGT CGG TGC CCA AGG GCA CCA AGG 898
899 CAC ACC CCT ACT CTT GCA CCC CAT GTG TGG GCC CGA GCA CCT GCT CTG CTG CCC CAA AGA 958
959 TCT GGC GAT GTT TCC CAG GCA ACT GTC TCT CAC AGC CTG TCT GCC TGG CAC TCC CGT ATC 1018
1019 CCA TAA ATG CCA CCA CAT CTG GCT ATG GGT GGG CGT GCC TGC CTG GCA TCC ACG GGC CAG 1078
1079 CAG GTG TGG TGG AGC ACA GCC CAG TTC CTG GCT GCG TCA GAA GGC TGC CCG GGC CTT TTG 1138
1139 GCT GTC CTT GCC AGC AGG TGA GCA CTG CCA GGG CAC CGT GTG TGG GTG CTG GGC CAT TTA 1198
1199 GCC ACA TGG GAA GGG GTG GAG GCA GCC CAG TGC CTT CAG CAT GTG CCC AGG GTG CCT GTC 1258
1259 GGC CAC AGG TCT CAT TTG GAA ATT GGG AGG GTG CAC GGC CAC CGG GCT GCT TAG GCC TGC 1318
1319 CAG CCT CAG GGC CCG TCA CCG CTG TCT TAG CCT GAT TTG CAG GGT GTC AAC GCT GGG CAG 1378
1379 AGA TGA ACA TTT GGG TGA CTC TGA GGA TGC CAG TGG CTG GGA CAC TTG TTC TTC CGC GGT 1438
1439 GGA AGG AGT TGG AGA GGC CTG GCT CCC TGA CCT ACG GCC AGC CTG GCT TCT GAA ACC AGC 1498
1499 TCA GTG GGC TGG GGC CTG ATT CAT CAT CCA TAA ATG TGT CCT TTT TTG CCA CAG AGG GTA 1558
1559 AGG GGC CTC CTA GCC CAC CGG TCT GCA GGT GCG GGA GTA GGA GAT GGG TGG CTC TGA TGC 1618
1619 CCC CAC CCA CTC GAT CAC CTT CTG CTC TGC CTG GGA TGC AAA CTC CCA CAG CTG AAA CGT 1678
1679 TCT TTT GTA AAC ATG TTT TTT GGC TTA GAA AAA ACT CAT TTC CAC TGT GCA CGT GTC AGT 1738
1 Met Asn Phe Gly Leu Glu Lys Thr His Phe His Cys Ala Arg Val Ser 16
1739 CCC AAC CAG AAA TTA TTT TCC AAT AAA GCA AAA CTC CGT CAC CAC AGC AGC AGA TGG CTC 1798
17 Pro Asn Gln Lys Leu Phe Ser Asn Lys Ala Lys Leu Arg His His Ser Ser Arg Trp Leu 36
1799 CGA AGA AGT GGA GCG TTT TCA TCA GGT TCA ACT TTG AAA CCT CCA CCA TCA CCA TCA CCA 1858
37 Arg Arg Ser Gly Ala Phe Ser Ser Gly Ser Thr Leu Lys Pro Pro Pro Ser Pro Ser Pro 56
1859 GCA CCG CTG TGT CAT GCT GAT AAC TTG AGG ACA GGC AGG ACA AGG CCT TCT GGC GGC CGC 1918
57 Ala Pro Leu Cys His Ala Asp Asn Leu Arg Thr Gly Arg Thr Arg Pro Ser Gly Gly Arg 76
1919 CCC TGG TTT CTC CTG GGG GGT GAT GAG CGG GAG CGG CTC TGG GCC GAG CTA CTG CGC ACG 1978
77 Pro Trp Phe Leu Leu Gly Gly Asp Glu Arg Glu Arg Leu Trp Ala Glu Leu Leu Arg Thr 96
1979 GTG AGC CCG GAG CTG ATC CTG GAT CAC GAG GTG CCT TCA CTG CCC GCC TTC CCA GGA CAG 2038
97 Val Ser Pro Glu Leu Ile Leu Asp His Glu Val Pro Ser Leu Pro Ala Phe Pro Gly Gln 116
2039 GAG CCC AGG TGC GGC CCG GAG CCC ACT GAA GTC TTC ACT GTC GGA CCC AAG ACC TTT TCC 2098
117 Glu Pro Arg Cys Gly Pro Glu Pro Thr Glu Val Phe Thr Val Gly Pro Lys Thr Phe Ser 136
2099 TGG ACA CCC TTT CCG CCG GAC CTG TGG GGC CCG GGC CGT TCC TAC CGG CTG CTT CAC GGG 2158
137 Trp Thr Pro Phe Pro Pro Asp Leu Trp Gly Pro Gly Arg Ser Tyr Arg Leu Leu His Gly 156
2159 GCA GGA GGG CAC CTG GAA TCC CCC GCC AGG TCC CTG CCC CAG CGC CCG GCA CCT GAT CCC 2218
157 Ala Gly Gly His Leu Glu Ser Pro Ala Arg Ser Leu Pro Gln Arg Pro Ala Pro Asp Pro 176
2219 TGC AGG GCC CCC AGG GTG GAG CAG CAA CCG TCT GTG GAG GGT GCC GCG GCC CTG CGC AAC 2278
177 Cys Arg Ala Pro Arg Val Glu Gln Gln Pro Ser Val Glu Gly Ala Ala Ala Leu Arg Asn 196
2279 TGC CCC ATG TGC CAG AAG GAG TTT GCC CCC AGG CTG ACC CAG CTG GAT GTT GAC AGC CAC 2338
197 Cys Pro Met Cys Gln Lys Glu Phe Ala Pro Arg Leu Thr Gln Leu Asp Val Asp Ser His 216
2339 CTG GCC CAG TGC TTG GCC GAA AGC ACA AAA AAC GTG ACG TGG TGA GCG CCA TCC AAG AGC 2398
217 Leu Ala Gln Cys Leu Ala Glu Ser Thr Lys Asn Val Thr Trp *** 231
2399 CCT GCG CAG AGT GCA GCG CCC GGA CAC GCT TTC CCC CGC CAG CAG CCC CGC CTC TCG GCT 2458
2459 CCC CCG CCA ACA GCC CCG CCT TTC GGC TCC CCC GCA TGG GCA TTA AAA CAG GGC GGG CTC 2518
2519 CTG TCT GTC TCT GTG TTG TGA TGA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA 2572
序列表
<110>上海新世界基因技术开发有限公司
<120>具有抑癌功能的新的人蛋白及其编码序列
<130>017519
<160>24
<170>PatentIn version 3.0
<210>1
<211>2662
<212>DNA
<213>智人(Homo sapiens)
<400>1
gtgacagtcc acggccccgc tgggatggag ccctgctggg tgcccgcacc gtgctcagtg 60
tggcatgcgg cccgggtgtg gagggagacg gtggagcatc ccgtgcctag cgtggtgcca 120
gccaagggcg ggtggctggg gagctgtgct gggagctgtc gtaaacccgt ggtggctttg 180
atcctagggc cgtctttctg ctccacttcc cgggcactgt tccgagggag gctcaggtgg 240
ggaagcgagt gagcctaaag cccaggcttg tctccttggt gccaggccct gcttgctgga 300
atctggtgat cttaggaggt cactgttgca agggagggga cccaggagcc acctagtcga 360
acctctttgt ggtacagatg gagaaaccaa ggcccaaaca gtggccccct tgatcagcca 420
gaagcagagc tgggtggggc agcaggggat ccccaccacc aggctcagac tccttcagga 480
tgcttttgct tcgcagatga ggagaccgag gcttagagaa gagtagagac ttgctacccg 540
ttgaggtggt aacagccagg ctagaatctc ctgaaacggg gcagggtggg gaggtctggt 600
tgggctacct ggggccgggc gcctttcccc caggatgggg tgtactgccc gccctccccc 660
agtcatggtg ctggtgccag ctggtgcagg gggagggctc tgcaggcctt agcactgagg 720
caggtggcga gcagcagggg aagggtcttc tccacccacc ccaactgccc aaggttccgt 780
ggctcctcct tagacagcag tgagggttgg gggtgacagg caagccactg agcctcagca 840
ccgcgactca cccctcccac tcagcagtcc agccagggtc atccccagcc tcagaggagc 900
ctgggaacaa gggcagcggc agggccggcg ggggcctgga gggtgagcag gggcctttct 960
tcctgcagac agccctcagc gcctttttca ggagaccaac atcccctaca gccaccatca 1020
ccaccagatg gtaagtgtcc ccggagtccc cagttctgga ttgggcggaa ggaggccgag 1080
ctagttctgt gtataagcag cccctggccc cggtgtacga gggcgctggt gcaggcgggg 1140
ctcgacctct ttggagatgg gtcagcagga gtcccggctc catgggtcct gcacttaatc 1200
ttgcctgtgc cagctccccc tgagacctgg ggggcgctgg cctctggggc aatgaagctc 1260
cttaccctac agcccccggg gatgctgtgg ctgatggaaa ggggtgggct gggaaagcct 1320
cgtggcccca ggcaccgtgg gctcctgaga gtgaggctgg gtcggttcat ctcaaggctt 1380
ctcctctggg aacccctggg cggcggacag gcttggggat ctggggaagg aacacagagc 1440
cttccgagaa tgggccagcc acgcatctcc ccttgggagg cagtgggggc ccctccagga 1500
aggggtgctc accccatctc tcctctcttc ccctcacaga tgtgcacccc cgccaatacc 1560
cctgctacac cccccaactt ccctgacgct ctcaccatgt tctcccgtct caaggcctcc 1620
gagagcttcc acagcggtgg cagcggcagc ccgatggccg cgacagccac gtcacccccg 1680
ccacacttcc cccatgccgc caccagcagc tctgcggcct ccagctggcc cacggcggcc 1740
tcgcccccgg ggggcccaca gcaccaccag ccacagccgc ccctgtggac tccaacaccc 1800
ccttctccgg cttcagactg gccacccctg gccccccaac aggccacctc agaacccagg 1860
gcccaccctg ccatggaggc agagagataa gggaggcccc tcccccctcc cggaggccag 1920
gacccgtggg gcgggggaga ggacgtctct gcgggccccc ttcacccctt ttctgtctgc 1980
accccttgtt ccccggagcc ctggagggga gagcgcggac tctagccagg cagggacacg 2040
tctggtgcca gaacacgcag ctgcccacac gcaaggtcat ggccccagcg gccccggcac 2100
atggagtggt tcagagcggc ctgggtgcct ggcggacaga acttcagaga ccacgcagcc 2160
ttccttcgaa gacgcacctg cccagcccag cccaggggtg ccgtggagga ccaccctggc 2220
ggagacattg ctgatccctg gcttggagct ccttgggggc cggcaggcct cgaaccccca 2280
ccctagggaa tgcagagcct ctccgcatgt gtgcgcgtgg ccgtgtctgt gtatttctac 2340
gtgtgtcgct cttcagaagc aacctagttc ctggggcagc tggactttgc atgttagtgt 2400
gagcccccag ccccctgccc gccgccccct ccccagggcc ctgcctcctc cccaccccct 2460
cgtcagccag cgttgctgtt ccttgcagag aaaaggattg tgggaaactc caggactctt 2520
cccaccgcct cccagcgcct gcctgctggg gctgcctgca tgcctcccct gcacctgggg 2580
gtacccgcat ccacttcctt tccccctttt aacaaaagag aagaacgaat tccaaaaaaa 2640
aaaaaaaaaa aaaaaaaaaa aa 2662
<210>2
<211>2662
<212>DNA
<213>智人(Homo sapiens)
<220>
<221>CDS
<222>(1252)..(1887)
<400>2
gtgacagtcc acggccccgc tgggatggag ccctgctggg tgcccgcacc gtgctcagtg 60
tggcatgcgg cccgggtgtg gagggagacg gtggagcatc ccgtgcctag cgtggtgcca 120
gccaagggcg ggtggctggg gagctgtgct gggagctgtc gtaaacccgt ggtggctttg 180
atcctagggc cgtctttctg ctccacttcc cgggcactgt tccgagggag gctcaggtgg 240
ggaagcgagt gagcctaaag cccaggcttg tctccttggt gccaggccct gcttgctgga 300
atctggtgat cttaggaggt cactgttgca agggagggga cccaggagcc acctagtcga 360
acctctttgt ggtacagatg gagaaaccaa ggcccaaaca gtggccccct tgatcagcca 420
gaagcagagc tgggtggggc agcaggggat ccccaccacc aggctcagac tccttcagga 480
tgcttttgct tcgcagatga ggagaccgag gcttagagaa gagtagagac ttgctacccg 540
ttgaggtggt aacagccagg ctagaatctc ctgaaacggg gcagggtggg gaggtctggt 600
tgggctacct ggggccgggc gcctttcccc caggatgggg tgtactgccc gccctccccc 660
agtcatggtg ctggtgccag ctggtgcagg gggagggctc tgcaggcctt agcactgagg 720
caggtggcga gcagcagggg aagggtcttc tccacccacc ccaactgccc aaggttccgt 780
ggctcctcct tagacagcag tgagggttgg gggtgacagg caagccactg agcctcagca 840
ccgcgactca cccctcccac tcagcagtcc agccagggtc atccccagcc tcagaggagc 900
ctgggaacaa gggcagcggc agggccggcg ggggcctgga gggtgagcag gggcctttct 960
tcctgcagac agccctcagc gcctttttca ggagaccaac atcccctaca gccaccatca 1020
ccaccagatg gtaagtgtcc ccggagtccc cagttctgga ttgggcggaa ggaggccgag 1080
ctagttctgt gtataagcag cccctggccc cggtgtacga gggcgctggt gcaggcgggg 1140
ctcgacctct ttggagatgg gtcagcagga gtcccggctc catgggtcct gcacttaatc 1200
ttgcctgtgc cagctccccc tgagacctgg ggggcgctgg cctctggggc a atg aag 1257
Met Lys
1
ctc ctt acc cta cag ccc ccg ggg atg ctg tgg ctg atg gaa agg ggt 1305
Leu Leu Thr Leu Gln Pro Pro Gly Met Leu Trp Leu Met Glu Arg Gly
5 10 15
ggg ctg gga aag cct cgt ggc ccc agg cac cgt ggg ctc ctg aga gtg 1353
Gly Leu Gly Lys Pro Arg Gly Pro Arg His Arg Gly Leu Leu Arg Val
20 25 30
agg ctg ggt cgg ttc atc tca agg ctt ctc ctc tgg gaa ccc ctg ggc 1401
Arg Leu Gly Arg Phe Ile Ser Arg Leu Leu Leu Trp Glu Pro Leu Gly
35 40 45 50
ggc gga cag gct tgg gga tct ggg gaa gga aca cag agc ctt ccg aga 1449
Gly Gly Gln Ala Trp Gly Ser Gly Glu Gly Thr Gln Ser Leu Pro Arg
55 60 65
atg ggc cag cca cgc atc tcc cct tgg gag gca gtg ggg gcc cct cca 1497
Met Gly Gln Pro Arg Ile Ser Pro Trp Glu Ala Val Gly Ala Pro Pro
70 75 80
gga agg ggt gct cac ccc atc tct cct ctc ttc ccc tca cag atg tgc 1545
Gly Arg Gly Ala His Pro Ile Ser Pro Leu Phe Pro Ser Gln Met Cys
85 90 95
acc ccc gcc aat acc cct gct aca ccc ccc aac ttc cct gac gct ctc 1593
Thr Pro Ala Asn Thr Pro Ala Thr Pro Pro Asn Phe Pro Asp Ala Leu
100 105 110
acc atg ttc tcc cgt ctc aag gcc tcc gag agc ttc cac agc ggt ggc 1641
Thr Met Phe Ser Arg Leu Lys Ala Ser Glu Ser Phe His Ser Gly Gly
115 120 125 130
agc ggc agc ccg atg gcc gcg aca gcc acg tca ccc ccg cca cac ttc 1689
Ser Gly Ser Pro Met Ala Ala Thr Ala Thr Ser Pro Pro Pro His Phe
135 140 145
ccc cat gcc gcc acc agc agc tct gcg gcc tcc agc tgg ccc acg gcg 1737
Pro His Ala Ala Thr Ser Ser Ser Ala Ala Ser Ser Trp Pro Thr Ala
150 155 160
gcc tcg ccc ccg ggg ggc cca cag cac cac cag cca cag ccg ccc ctg 1785
Ala Ser Pro Pro Gly Gly Pro Gln His His Gln Pro Gln Pro Pro Leu
165 170 175
tgg act cca aca ccc cct tct ccg gct tca gac tgg cca ccc ctg gcc 1833
Trp Thr Pro Thr Pro Pro Ser Pro Ala Ser Asp Trp Pro Pro Leu Ala
180 185 190
ccc caa cag gcc acc tca gaa ccc agg gcc cac cct gcc atg gag gca 1881
Pro Gln Gln Ala Thr Ser Glu Pro Arg Ala His Pro Ala Met Glu Ala
195 200 205 210
gag aga taagggaggc ccctcccccc tcccggaggc caggacccgt ggggcggggg 1937
Glu Arg
agaggacgtc tctgcgggcc cccttcaccc cttttctgtc tgcacccctt gttccccgga 1997
gccctggagg ggagagcgcg gactctagcc aggcagggac acgtctggtg ccagaacacg 2057
cagctgccca cacgcaaggt catggcccca gcggccccgg cacatggagt ggttcagagc 2117
ggcctgggtg cctggcggac agaacttcag agaccacgca gccttccttc gaagacgcac 2177
ctgcccagcc cagcccaggg gtgccgtgga ggaccaccct ggcggagaca ttgctgatcc 2237
ctggcttgga gctccttggg ggccggcagg cctcgaaccc ccaccctagg gaatgcagag 2297
cctctccgca tgtgtgcgcg tggccgtgtc tgtgtatttc tacgtgtgtc gctcttcaga 2357
agcaacctag ttcctggggc agctggactt tgcatgttag tgtgagcccc cagccccctg 2417
cccgccgccc cctccccagg gccctgcctc ctccccaccc cctcgtcagc cagcgttgct 2477
gttccttgca gagaaaagga ttgtgggaaa ctccaggact cttcccaccg cctcccagcg 2537
cctgcctgct ggggctgcct gcatgcctcc cctgcacctg ggggtacccg catccacttc 2597
ctttccccct tttaacaaaa gagaagaacg aattccaaaa aaaaaaaaaa aaaaaaaaaa 2657
aaaaa 2662
<210>3
<211>212
<212>PRT
<213>智人(Homo sapiens)
<400>3
Met Lys Leu Leu Thr Leu Gln Pro Pro Gly Met Leu Trp Leu Met Glu
1 5 10 15
Arg Gly Gly Leu Gly Lys Pro Arg Gly Pro Arg His Arg Gly Leu Leu
20 25 30
Arg Val Arg Leu Gly Arg Phe Ile Ser Arg Leu Leu Leu Trp Glu Pro
35 40 45
Leu Gly Gly Gly Gln Ala Trp Gly Ser Gly Glu Gly Thr Gln Ser Leu
50 55 60
Pro Arg Met Gly Gln Pro Arg Ile Ser Pro Trp Glu Ala ValGly Ala
65 70 75 80
Pro Pro Gly Arg Gly Ala His Pro Ile Ser Pro Leu Phe Pro Ser Gln
85 90 95
Met Cys Thr Pro Ala Asn Thr Pro Ala Thr Pro Pro Asn Phe Pro Asp
100 105 110
Ala Leu Thr Met Phe Ser Arg Leu Lys Ala Ser Glu Ser Phe His Ser
115 120 125
Gly Gly Ser Gly Ser Pro Met Ala Ala Thr Ala Thr Ser Pro Pro Pro
130 135 140
His Phe Pro His Ala Ala Thr Ser Ser Ser Ala Ala Ser Ser Trp Pro
145 150 155 160
Thr Ala Ala Ser Pro Pro Gly Gly Pro Gln His His Gln Pro Gln Pro
165 170 175
Pro Leu Trp Thr Pro Thr Pro Pro Ser Pro Ala Ser Asp Trp Pro Pro
180 185 190
Leu Ala Pro Gln Gln Ala Thr Ser Glu Pro Arg Ala His Pro Ala Met
195 200 205
Glu Ala Glu Arg
210
<210>4
<211>3325
<212>DNA
<213>智人(Homo sapiens)
<400>4
ggccgcgcga gggtggtggg catcgaggtc ccagcagcgg acgagggagg tgccgccgtc 60
gcccaggatg ggctgggaat gaagcgatgt agccttttaa gagatttgct ctgacccatc 120
tgaagtccat atggctctgt atgatgaaga cctcctgaaa aatcctttct atctggctct 180
gcaaaagtgc cgccctgact tgtgcagcaa agtggcccaa atccatggca ttgtcttagt 240
accctgcaaa ggaagcctgt cgagcagcat ccagtctact tgtcagtttg agtcctacat 300
tttgatacct gtggaagagc attttcagac cttaaatgga aaggatgtct ttattcaagg 360
gaacaggatt aaattaggag ctggttttgc ctgtcttctc tcagtgccca ttctctttga 420
agaaactttc tacaatgaaa aagaagagag tttcagcatc ctgtgtatag cccatccttt 480
ggaaaagaga gagagttcag aagagccttt ggcaccctca gatccctttt ccctgaaaac 540
cattgaagat gtgagagagt tcttgggaag acactccgag cgatttgaca ggaacatcgc 600
ctctttccta atcgaacatt ccgagaatgc gagagaaaga gcctccgtca ccacatagac 660
tcagcgaatg ctctctacac caaatgcctc cagcagcttc tgagggactc tcacctgaaa 720
atgctcgcca agcaggaggc ccagatgaac ctgatgaagc aggcagtgga gatatacgtc 780
catcatgaaa tttacaacct gatctttaaa tacgtgggga ccatggaggc aagtgaggat 840
gcggccttta acaaaaatca caagaagcct tcaagatctt cagcagaaag atattggtgt 900
gaaaccggag ttcagcttta acatacctcg tgccaaaaga gagctggctc agctgaacaa 960
atgcacctcc ccacagcaga agcttgtctg cttgcgaaaa gtggtgcagc tcattacaca 1020
gtctccaagc cagagagtga acctggagac catgtgtgct gatgatctgc tatcagtcct 1080
gttatacttg cttgtgaaaa cggagatccc taattggatg gcaaatttga gttacatcaa 1140
aaacttcagg tttagcagct tggcaaagga tgaactggga tactgcctga cctcattcga 1200
agctgccatt gaatatattc ggcaaggaag cctctctgct aaaccccctg taagatctca 1260
cccctgccct ggccttcctt tgtgggcatc atggttccct tgatagggtg ctggggttgg 1320
tatgtgggca gacggattct taaattgcct cccaggaatg gggcctcagc tgtttgaggg 1380
ctgtgagtct taaaaatcac tcagtgaaga gaacaccaag cccccaattg gtggtaaaaa 1440
ttggtgggtt atcattggga tttacattgt taatatccta cttcattagt ccccatcctc 1500
tccaaagaca tgtgggtgca aagggaagcc agaagtaggg aatttggatt tcttgacctt 1560
gatagtcaag aagtgatgtc acgggatccc tggactgtcg cttttccagc cggaaacctc 1620
tgtggctggt ggctcctttg cctgagtttt gttcgggcct gctgggctca tttcacgctc 1680
ttggcctggc aggctgcgct cggcttgtgc tactggcctg gatcccatgc ctgccaaggg 1740
cgagccaggt gtggagtggc gaggggtatg tgagcaagtg cagggtctgg ccactgcaca 1800
caaccaggtg tgccgactga ggtggggtgg gcagctccaa gttgcttgta cagggtcctg 1860
ctccatgcaa ggctgcagct agagcaggcg tactgtaggc cgcttccacg gtgggcactg 1920
gggaacacag tggggcctgg aagcttggag acaccaggaa ctgcagagcc ccaaagaggg 1980
tgtcatagcc ctggctcggg gaactcctag gttgggctcc ctgaagggcc agagctcttc 2040
tctcctctcg tcacctgcaa tgtagtgagt cgggagcatg ttttagctct ctttatgtta 2100
cagctctttc agtcctgcca tttggtgggt cccgagttct tgtcccatgt cgaggaagaa 2160
tgaggtacgt agactagtgg agggtgagca aggcagagag gagctttact gaacggcaga 2220
atagctctca ggagacccac agtgggcagc ttctttccac aggcaggtcg tcctgacgag 2280
ttaaagaggc ctgacgtagg tagctccttc ctgcagttgg tagtcccgac atctgtctga 2340
gtctggctga gtccggggtt ttttatggct cagaagggag ggagtatgtg ctgattggtc 2400
cataggtggg cctggagaaa gcaccatgag ttctcagtct gggccgtgga ctccacttgg 2460
aactgacagc ccagccccca ggctttaggc tgtccctgtc ttgaaggtgg ggcttcactg 2520
ggcacctgca cctttccacc cagaagcgtg tctgccttct gccaccatca acatgctggc 2580
cagtgcatcc aggctgtttg tgccaagggg catctgcagg cctgcactga gctgccctca 2640
gcccctacct tgactactct cccatgctca tcagcgccca aaatcttgga ggggctgagg 2700
catcaggagg ctggtgtgtc agtgtcacac caagcatgtg cacacatggc tgggttgcaa 2760
cagtacccgg gcttggcctc agctttgctc tgaaattgaa gtcggtgcca ggagtgggga 2820
ggagcgggag caggcactta cgagcctgcg gcggcaggga tgcttcctgg gcccctgaga 2880
gtgcagagat tcctggatcc agagctgcgg ctgggcggct gcagctgcgc ctgggagtgc 2940
agggctcccg ccctgccagc tcagtaggag atgggggctc ctgcctattc ctggctcctg 3000
ttggccctgc agagtgcaca accctggccg cgcttcctcc actgcagctt acgtctttgc 3060
agcagccact cccgatgggc tgccactgcc atctgtgaga caattaatgt gtgcaatttg 3120
aggactcagt ggccttgcca ttgtttccct tggtttttat tgagcattgg ctggggtcgg 3180
cgaggggatg tgattatatt tctatgtgaa tcgtgagaat cttgaaccat agttgtcctg 3240
ctggcctgtt ttactacata ccaatgagta aaatgtgatc atacagaaat cacaaagttg 3300
aaatcctaaa aaaaaaaaaa aaaaa 3325
<210>5
<211>3325
<212>DNA
<213>智人(Homo sapiens)
<220>
<221>CDS
<222>(131)..(655)
<400>5
ggccgcgcga gggtggtggg catcgaggtc ccagcagcgg acgagggagg tgccgccgtc 60
gcccaggatg ggctgggaat gaagcgatgt agccttttaa gagatttgct ctgacccatc 120
tgaagtccat atg gct ctg tat gat gaa gac ctc ctg aaa aat cct ttc 169
Met Ala Leu Tyr Asp Glu Asp Leu Leu Lys Asn Pro Phe
1 5 10
tat ctg gct ctg caa aag tgc cgc cct gac ttg tgc agc aaa gtg gcc 217
Tyr Leu Ala Leu Gln Lys Cys Arg Pro Asp Leu Cys Ser Lys Val Ala
15 20 25
caa atc cat ggc att gtc tta gta ccc tgc aaa gga agc ctg tcg agc 265
Gln Ile His Gly Ile Val Leu Val Pro Cys Lys Gly Ser Leu Ser Ser
30 35 40 45
agc atc cag tct act tgt cag ttt gag tcc tac att ttg ata cct gtg 313
Ser Ile Gln Ser Thr Cys Gln Phe Glu Ser Tyr Ile Leu Ile Pro Val
50 55 60
gaa gag cat ttt cag acc tta aat gga aag gat gtc ttt att caa ggg 361
Glu Glu His Phe Gln Thr Leu Asn Gly Lys Asp Val Phe Ile Gln Gly
65 70 75
aac agg att aaa tta gga gct ggt ttt gcc tgt ctt ctc tca gtg ccc 409
Asn Arg Ile Lys Leu Gly Ala Gly Phe Ala Cys Leu Leu Ser Val Pro
80 85 90
att ctc ttt gaa gaa act ttc tac aat gaa aaa gaa gag agt ttc agc 457
Ile Leu Phe Glu Glu Thr Phe Tyr Asn Glu Lys Glu Glu Ser Phe Ser
95 100 105
atc ctg tgt ata gcc cat cct ttg gaa aag aga gag agt tca gaa gag 505
Ile Leu Cys Ile Ala His Pro Leu Glu Lys Arg Glu Ser Ser Glu Glu
110 115 120 125
cct ttg gca ccc tca gat ccc ttt tcc ctg aaa acc att gaa gat gtg 553
Pro Leu Ala Pro Ser Asp Pro Phe Ser Leu Lys Thr Ile Glu Asp Val
130 135 140
aga gag ttc ttg gga aga cac tcc gag cga ttt gac agg aac atc gcc 601
Arg Glu Phe Leu Gly Arg His Ser Glu Arg Phe Asp Arg Asn Ile Ala
145 150 155
tct ttc cta atc gaa cat tcc gag aat gcg aga gaa aga gcc tcc gtc 649
Ser Phe Leu Ile Glu His Ser Glu Asn Ala Arg Glu Arg Ala Ser Val
160 165 170
acc aca tagactcagc gaatgctctc tacaccaaat gcctccagca gcttctgagg 705
Thr Thr
175
gactctcacc tgaaaatgct cgccaagcag gaggcccaga tgaacctgat gaagcaggca 765
gtggagatat acgtccatca tgaaatttac aacctgatct ttaaatacgt ggggaccatg 825
gaggcaagtg aggatgcggc ctttaacaaa aatcacaaga agccttcaag atcttcagca 885
gaaagatatt ggtgtgaaac cggagttcag ctttaacata cctcgtgcca aaagagagct 945
ggctcagctg aacaaatgca cctccccaca gcagaagctt gtctgcttgc gaaaagtggt 1005
gcagctcatt acacagtctc caagccagag agtgaacctg gagaccatgt gtgctgatga 1065
tctgctatca gtcctgttat acttgcttgt gaaaacggag atccctaatt ggatggcaaa 1125
tttgagttac atcaaaaact tcaggtttag cagcttggca aaggatgaac tgggatactg 1185
cctgacctca ttcgaagctg ccattgaata tattcggcaa ggaagcctct ctgctaaacc 1245
ccctgtaaga tctcacccct gccctggcct tcctttgtgg gcatcatggt tcccttgata 1305
gggtgctggg gttggtatgt gggcagacgg attcttaaat tgcctcccag gaatggggcc 1365
tcagctgttt gagggctgtg agtcttaaaa atcactcagt gaagagaaca ccaagccccc 1425
aattggtggt aaaaattggt gggttatcat tgggatttac attgttaata tcctacttca 1485
ttagtcccca tcctctccaa agacatgtgg gtgcaaaggg aagccagaag tagggaattt 1545
ggatttcttg accttgatag tcaagaagtg atgtcacggg atccctggac tgtcgctttt 1605
ccagccggaa acctctgtgg ctggtggctc ctttgcctga gttttgttcg ggcctgctgg 1665
gctcatttca cgctcttggc ctggcaggct gcgctcggct tgtgctactg gcctggatcc 1725
catgcctgcc aagggcgagc caggtgtgga gtggcgaggg gtatgtgagc aagtgcaggg 1785
tctggccact gcacacaacc aggtgtgccg actgaggtgg ggtgggcagc tccaagttgc 1845
ttgtacaggg tcctgctcca tgcaaggctg cagctagagc aggcgtactg taggccgctt 1905
ccacggtggg cactggggaa cacagtgggg cctggaagct tggagacacc aggaactgca 1965
gagccccaaa gagggtgtca tagccctggc tcggggaact cctaggttgg gctccctgaa 2025
gggccagagc tcttctctcc tctcgtcacc tgcaatgtag tgagtcggga gcatgtttta 2085
gctctcttta tgttacagct ctttcagtcc tgccatttgg tgggtcccga gttcttgtcc 2145
catgtcgagg aagaatgagg tacgtagact agtggagggt gagcaaggca gagaggagct 2205
ttactgaacg gcagaatagc tctcaggaga cccacagtgg gcagcttctt tccacaggca 2265
ggtcgtcctg acgagttaaa gaggcctgac gtaggtagct ccttcctgca gttggtagtc 2325
ccgacatctg tctgagtctg gctgagtccg gggtttttta tggctcagaa gggagggagt 2385
atgtgctgat tggtccatag gtgggcctgg agaaagcacc atgagttctc agtctgggcc 2445
gtggactcca cttggaactg acagcccagc ccccaggctt taggctgtcc ctgtcttgaa 2505
ggtggggctt cactgggcac ctgcaccttt ccacccagaa gcgtgtctgc cttctgccac 2565
catcaacatg ctggccagtg catccaggct gtttgtgcca aggggcatct gcaggcctgc 2625
actgagctgc cctcagcccc taccttgact actctcccat gctcatcagc gcccaaaatc 2685
ttggaggggc tgaggcatca ggaggctggt gtgtcagtgt cacaccaagc atgtgcacac 2745
atggctgggt tgcaacagta cccgggcttg gcctcagctt tgctctgaaa ttgaagtcgg 2805
tgccaggagt ggggaggagc gggagcaggc acttacgagc ctgcggcggc agggatgctt 2865
cctgggcccc tgagagtgca gagattcctg gatccagagc tgcggctggg cggctgcagc 2925
tgcgcctggg agtgcagggc tcccgccctg ccagctcagt aggagatggg ggctcctgcc 2985
tattcctggc tcctgttggc cctgcagagt gcacaaccct ggccgcgctt cctccactgc 3045
agcttacgtc tttgcagcag ccactcccga tgggctgcca ctgccatctg tgagacaatt 3105
aatgtgtgca atttgaggac tcagtggcct tgccattgtt tcccttggtt tttattgagc 3165
attggctggg gtcggcgagg ggatgtgatt atatttctat gtgaatcgtg agaatcttga 3225
accatagttg tcctgctggc ctgttttact acataccaat gagtaaaatg tgatcataca 3285
gaaatcacaa agttgaaatc ctaaaaaaaa aaaaaaaaaa 3325
<210>6
<211>175
<212>PRT
<213>智人(Homo sapiens)
<400>6
Met Ala Leu Tyr Asp Glu Asp Leu Leu Lys Asn Pro Phe Tyr Leu Ala
1 5 10 15
Leu Gln Lys Cys Arg Pro Asp Leu Cys Ser Lys Val Ala Gln Ile His
20 25 30
Gly Ile Val Leu Val Pro Cys Lys Gly Ser Leu Ser Ser Ser Ile Gln
35 40 45
Ser Thr Cys Gln Phe Glu Ser Tyr Ile Leu Ile Pro Val Glu Glu His
50 55 60
Phe Gln Thr Leu Asn Gly Lys Asp Val Phe Ile Gln Gly Asn Arg Ile
65 70 75 80
Lys Leu Gly Ala Gly Phe Ala Cys Leu Leu Ser Val Pro Ile Leu Phe
85 90 95
Glu Glu Thr Phe Tyr Asn Glu Lys Glu Glu Ser Phe Ser Ile Leu Cys
100 105 110
Ile Ala His Pro Leu Glu Lys Arg Glu Ser Ser Glu Glu Pro Leu Ala
115 120 125
Pro Ser Asp Pro Phe Ser Leu Lys Thr Ile Glu Asp Val Arg Glu Phe
130 135 140
Leu Gly Arg His Ser Glu Arg Phe Asp Arg Asn Ile Ala Ser Phe Leu
145 150 155 160
Ile Glu His Ser Glu Asn Ala Arg Glu Arg Ala Ser Val Thr Thr
165 170 175
<210>7
<211>2154
<212>DNA
<213>智人(Homo sapiens)
<400>7
gggggaatct cacagccctc acctacctca acctcagccg aaaccagctg tcgctgctgc 60
caccctacat ctgccagctg cccctgaggg tcctcatcgt cagcaacaac aagctgggag 120
ccctgccccc tgacatcggc accctgggaa gcctgcgaca gcttgacgtg agcagcaacg 180
agctccaatc cctgccctcg gaactgtgtg gcctctcttc cctgcgggac ctcaatgtcc 240
ggaggaacca gctcagtacg ctgcccgaag agctggggga cctccctctg gtcccctgga 300
tttctcctgt aaccgcgtct cccgaatccc agtctccttc tgccgcctga ggcacctgca 360
ggtcattctg ctggacagca accctctgca gagtccacct gcccaggtct gcctgaaggg 420
gaaacttcac atcttcaagt atttgtccac agaggccggg cagcgtgggt cggccctggg 480
ggacctggcc ccttctcggc ccccgagttt cagtccctgc cctgcagagg atctatttcc 540
gggacatcgg tacgatggtg ggctggactc aggcttccac agcgttgata gtggcagcaa 600
gaggtggtct ggaaatgagt caacagatga attttcagag ctgtcattcc ggatctcaga 660
gctggcccgg gagccccggg gacccagaga acgcaaggag gatggctcag cggacggaga 720
ccctgtgcag attgacttca tcgacagcca tgtccccggg gaggatgaag agcgaggcac 780
tgtggaggag cagcgaccac ccgaattaag ccctggggca ggggacaggg agagggcacc 840
aagcagcagg cgggaggagc cggcagggga ggagcggcgg cgcccggaca ccttgcagct 900
gtggcaggag cgggaacggc ggcagcagca gcagagcggg gcgtgggggg ccccgaggaa 960
ggatagcggc tcgcctaagt ccagtgcctc ccaagcaggg gctgcagcgg ggcagggagc 1020
ccccgcccct gcccctgcct cccaagagcc ccttcccata gctggaccag cgacagcacc 1080
ctgctccacg gccacttggc tccattcaga gaccaaacag cttcctcttc cgttcctcct 1140
ctcagagtgg ctcaggccct tcctcaccag actctgtcct gagacctcgg cggtaccccc 1200
aggttccaga tgagaaggac ttaatgactc agctgcgcca ggtccttgag tcccggctgc 1260
agcggcccct gcctgaggac ctggcgaggc tctggccaag tggggtcatc ctgtgccagc 1320
tggccaacca gctacggccg cgctccgtgc ccttcatcca tgtgccctcc cctgctgtgc 1380
caaaactcag tgccctcaag gctcggaaga atgtggagag ttttctagaa gcctgtcgaa 1440
aaatgggggt gcctgaggct gacctgtgct cgccctcgga tctcctccag ggcactgccc 1500
gggggctgcg gaccgcgctg gaggccgtga agcgggtggg gggcaaggcc ctaccgcccc 1560
tctggccccc ctctggtctg ggcggcttcg tcgtcttcta cgtggtcctc atgctgctgc 1620
tctatgtcac ctacactcgg ctcctgggtt cctaggcccc aaaatcggcc ctccctcacc 1680
cctttccctt cctctctatt tataaggtcc ctgctccacc cgaccccacc tgcggtgcct 1740
tcagccccaa ccaaagacac tagtgcaccc ccttcacaga cactgacctc agaggcccca 1800
ctctggtgcc cccagaccct gggcccccag cctctggcct ccctccagta gccccacgag 1860
tccccacctc tcagtgctga cggtgccttc atgtccccgc cggccctgcc cctgccctct 1920
gtaccccgtg aggggtggca ggagctggag tctccccctt cctcctgtgc cctccccttc 1980
cccccccaac agctgctatg ggggggctaa attatctcta ttttgtagag aggatctata 2040
tttgtagggg ttcggggccc aggccgggtc cctatctctg tgtataaact gtacagaccg 2100
tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 2154
<210>8
<211>2154
<212>DNA
<213>智人(Homo sapiens)
<220>
<221>CDS
<222>(1224)..(1652)
<400>8
gggggaatct cacagccctc acctacctca acctcagccg aaaccagctg tcgctgctgc 60
caccctacat ctgccagctg cccctgaggg tcctcatcgt cagcaacaac aagctgggag 120
ccctgccccc tgacatcggc accctgggaa gcctgcgaca gcttgacgtg agcagcaacg 180
agctccaatc cctgccctcg gaactgtgtg gcctctcttc cctgcgggac ctcaatgtcc 240
ggaggaacca gctcagtacg ctgcccgaag agctggggga cctccctctg gtcccctgga 300
tttctcctgt aaccgcgtct cccgaatccc agtctccttc tgccgcctga ggcacctgca 360
ggtcattctg ctggacagca accctctgca gagtccacct gcccaggtct gcctgaaggg 420
gaaacttcac atcttcaagt atttgtccac agaggccggg cagcgtgggt cggccctggg 480
ggacctggcc ccttctcggc ccccgagttt cagtccctgc cctgcagagg atctatttcc 540
gggacatcgg tacgatggtg ggctggactc aggcttccac agcgttgata gtggcagcaa 600
gaggtggtct ggaaatgagt caacagatga attttcagag ctgtcattcc ggatctcaga 660
gctggcccgg gagccccggg gacccagaga acgcaaggag gatggctcag cggacggaga 720
ccctgtgcag attgacttca tcgacagcca tgtccccggg gaggatgaag agcgaggcac 780
tgtggaggag cagcgaccac ccgaattaag ccctggggca ggggacaggg agagggcacc 840
aagcagcagg cgggaggagc cggcagggga ggagcggcgg cgcccggaca ccttgcagct 900
gtggcaggag cgggaacggc ggcagcagca gcagagcggg gcgtgggggg ccccgaggaa 960
ggatagcggc tcgcctaagt ccagtgcctc ccaagcaggg gctgcagcgg ggcagggagc 1020
ccccgcccct gcccctgcct cccaagagcc ccttcccata gctggaccag cgacagcacc 1080
ctgctccacg gccacttggc tccattcaga gaccaaacag cttcctcttc cgttcctcct 1140
ctcagagtgg ctcaggccct tcctcaccag actctgtcct gagacctcgg cggtaccccc 1200
aggttccaga tgagaaggac tta atg act cag ctg cgc cag gtc ctt gag tcc 1253
Met Thr Gln Leu Arg Gln Val Leu Glu Ser
1 5 10
cgg ctg cag cgg ccc ctg cct gag gac ctg gcg agg ctc tgg cca agt 1301
Arg Leu Gln Arg Pro Leu Pro Glu Asp Leu Ala Arg Leu Trp Pro Ser
15 20 25
ggg gtc atc ctg tgc cag ctg gcc aac cag cta cgg ccg cgc tcc gtg 1349
Gly Val Ile Leu Cys Gln Leu Ala Asn Gln Leu Arg Pro Arg Ser Val
30 35 40
ccc ttc atc cat gtg ccc tcc cct gct gtg cca aaa ctc agt gcc ctc 1397
Pro Phe Ile His Val Pro Ser Pro Ala Val Pro Lys Leu Ser Ala Leu
45 50 55
aag gct cgg aag aat gtg gag agt ttt cta gaa gcc tgt cga aaa atg 1445
Lys Ala Arg Lys Asn Val Glu Ser Phe Leu Glu Ala Cys Arg Lys Met
60 65 70
ggg gtg cct gag gct gac ctg tgc tcg ccc tcg gat ctc ctc cag ggc 1493
Gly Val Pro Glu Ala Asp Leu Cys Ser Pro Ser Asp Leu Leu Gln Gly
75 80 85 90
act gcc cgg ggg ctg cgg acc gcg ctg gag gcc gtg aag cgg gtg ggg 1541
Thr Ala Arg Gly Leu Arg Thr Ala Leu Glu Ala Val Lys Arg Val Gly
95 100 105
ggc aag gcc cta ccg ccc ctc tgg ccc ccc tct ggt ctg ggc ggc ttc 1589
Gly Lys Ala Leu Pro Pro Leu Trp Pro Pro Ser Gly Leu Gly Gly Phe
110 115 120
gtc gtc ttc tac gtg gtc ctc atg ctg ctg ctc tat gtc acc tac act 1637
Val Val Phe Tyr Val Val Leu Met Leu Leu Leu Tyr Val Thr Tyr Thr
125 130 135
cgg ctc ctg ggt tcc taggccccaa aatcggccct ccctcacccc tttcccttcc 1692
Arg Leu Leu Gly Ser
140
tctctattta taaggtccct gctccacccg accccacctg cggtgccttc agccccaacc 1752
aaagacacta gtgcaccccc ttcacagaca ctgacctcag aggccccact ctggtgcccc 1812
cagaccctgg gcccccagcc tctggcctcc ctccagtagc cccacgagtc cccacctctc 1872
agtgctgacg gtgccttcat gtccccgccg gccctgcccc tgccctctgt accccgtgag 1932
gggtggcagg agctggagtc tcccccttcc tcctgtgccc tccccttccc cccccaacag 1992
ctgctatggg ggggctaaat tatctctatt ttgtagagag gatctatatt tgtaggggtt 2052
cggggcccag gccgggtccc tatctctgtg tataaactgt acagaccgtg aaaaaaaaaa 2112
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 2154
<210>9
<211>143
<212>PRT
<213>智人(Homo sapiens)
<400>9
Met Thr Gln Leu Arg Gln Val Leu Glu Ser Arg Leu Gln Arg Pro Leu
1 5 10 15
Pro Glu Asp Leu Ala Arg Leu Trp Pro Ser Gly Val Ile Leu Cys Gln
20 25 30
Leu Ala Asn Gln Leu Arg Pro Arg Ser Val Pro Phe Ile His Val Pro
35 40 45
Ser Pro Ala Val Pro Lys Leu Ser Ala Leu Lys Ala Arg Lys Asn Val
50 55 60
Glu Ser Phe Leu Glu Ala Cys Arg Lys Met Gly Val Pro Glu Ala Asp
65 70 75 80
Leu Cys Ser Pro Ser Asp Leu Leu Gln Gly Thr Ala Arg Gly Leu Arg
85 90 95
Thr Ala Leu Glu Ala Val Lys Arg Val Gly Gly Lys Ala Leu Pro Pro
100 105 110
Leu Trp Pro Pro Ser Gly Leu Gly Gly Phe Val Val Phe Tyr Val Val
115 120 125
Leu Met Leu Leu Leu Tyr Val Thr Tyr Thr Arg Leu Leu Gly Ser
130 135 140
<210>10
<211>4952
<212>DNA
<213>智人(Homo sapiens)
<400>10
gctaagcagt aaactaaagg attatatatt attagtctca gtggttttca gatttatttt 60
taaaggggaa aacagggaaa acccatcgta tttgtaaagc actttaggat tttgccgttt 120
gtttctgatt gtttgaagat tagggctttt tggtgcgtgg tcacctttca cctctccttt 180
taggatttag tcctttccag tctgctcttt ttgtgcgtgt cacaaccata ttcttgtggt 240
tctggctcat attgtagaac tgctgaacat aaggagaggt agccagctgt atggtcggat 300
ttaatatata atgttatatg ttgggatatc ttagtggttt gttttctgag gtaagtttct 360
tagtgttgtg tttgagacat tgtgtttgcg tttatggcga cactgtcatt catgcacttg 420
gccatctgag cgtggataca gcgggcactc gggtctctct gccagatgga tgaaagcagt 480
gtacattcca gtgtgggaga cagacatgtg gacaggtaaa ttacaaggca gtgtgataaa 540
gagtagagag ttggttgaga gagatcttag acaccatcct cattgtgtag atgagaaagc 600
aaaagtcacc aaggcagcct ggcagctggg acccaagaag cctagggtgc cagtccttgg 660
gcagtgcggg gttaggcaca cccagggccc tcctggttcc tggctgactc ttggactctt 720
tgtctctaat tggaggccat gatgcccagc tgtaaggtgg tcagcttcat ttgagacact 780
atatccttta gcacagcggg gtaatttctt ccctcctgtt tcattcattt accaaatggc 840
ctcctaaatg atctaaaatc acttggatct tttgtctttg tggacctaac acctggcttt 900
taaagtttaa ctttctgtcc cctcttcagc ttgctaaaat tgaaaagtgt tgcagcccaa 960
cctccacaaa tcttgtctca ggaaataaga gacatttgtt aacatttgtt ttgtacctct 1020
cagcagctta gttgacaagg gcaccgtgtg ggatttcctg ttcttgctca tttggaaaga 1080
gaatgttctt tgttcttaga ccctcagctc tcatgtgaga gccatagaat gttgcgaggt 1140
ggagttctgt ggatacagaa ggaatgtttt caagttagac ttactgccaa tgttaggatt 1200
tgggactttg catgattggg agggagaggg agtgctggag aacaggttaa aagttgtccc 1260
gctgagcttg gagcatctcc tgccaacccg gagtgcttcc caggaaccct gccagtgtca 1320
cttggggtta tgttttctga tttggaaaca ttaagccgta tgcaggtctc ttcagaactg 1380
gttcttcagc cggattgccc tggaaagcag agattgcagc tcttctaaaa ctgcctctca 1440
cagaagttcc aaggccaggc taaatattga atgcagtact cagcagctgg gacacctgat 1500
gctttggtgg ccatcccttt cttccatcca aaagggcccc cactggaagg catctgttgt 1560
tttaaaaata ttttaggact catttttact tcccccactc cctcaagatc acatacactc 1620
cccagtgggg gttacagcct cttaggagga atcgctgctt catgacttcc tcaggcattt 1680
acatttttct cttctgttga cttaaatcat gaactaaaat ttatccctag aggaaaaaag 1740
aatgcttcct ccattctggg ctcttctcac tgtacccaga ctatgtcttc aggactctca 1800
tctcttgtca gttctgttgt gctagaaaga ctggtttgaa aaaattcagc tcgtgtaaac 1860
ctgtgccctc caccctgtgg ggaacccatg tggggagcct ttgaaaatat cacttatcag 1920
ctgggcgcag tggctcatgc ctgtaatccc agcacgttgg gaggatgagg tgggcggatc 1980
atgaggtcag gagttcaaga ccagcctggc caacatagtg aaaccccgtc tctactaaaa 2040
atacaaaaaa aaaaaaaaaa aaaaccgaga ctagttctct ctctgtctcc tgcctgaacc 2100
ctcctcctct ttttgttctg atctttgagc tccctagagc ccataattct ttagagcagg 2160
tatgtcccga gtctgaaaca tgcccttatt tgtcccaagc tctggacatt tctcacccca 2220
aggcggatca atcatgatta aatcactcca attaaacttt aggctccagt cagaccttca 2280
gccaaatgga aaaaaaaact aggggataag ggaggtagtt ggagcaagaa aatgttatta 2340
gttgaaacct tacgggacct tcctccctta gtgagtctgt tggctaaagg ttctctggct 2400
tcgtgaatta gaattggata ctgtttccaa gttagcaaaa ccaactctac cccagcaccc 2460
cacgaggaag aatgtggaag gatctcccat tggccggttg gggcaaaagc ctgaggcaat 2520
ctttcatccc cttttgccaa ggcgagactt tcccagtgac ggtgatgtag ttggccactc 2580
tgactatggg tggactcggg tgtagacctc tgaagctgag atcacacgaa aacctggcct 2640
ccccgccatg tagctgttgg agagtagaaa aatagagcac gcctgatgtt tctaaatgag 2700
aagactttca atagtaatga agaatccatg gcactctcct caccctcaaa cacatggcag 2760
tcattcacat acaggcccca aagccactgt tagtgctgca gtagctcctg tggacattgg 2820
aaagcccgga gagggcgtgg aagaaatcag ctggcccccg gcaggttctc tggggttttg 2880
tgcccaaggc tcctggagcc ctaaaaactt tcaaaagtta actccccacg tccccatcct 2940
gcttgggttt ctggactttt ctgaggcacc ggcagagggg tctcgttgct cccttgagtg 3000
taggggcagc cctttaacct ggctccttga gtccctgctt tttctgcttc tgttgccttc 3060
ttcctcgtct tcctctctct caatatctcc ctctctttgt ccctccccag ttcctgacct 3120
ggccatcccg gggtgccctt gaccagcccc gtgcctcctc agggtgtccc agcaccagcc 3180
tggcacagag tggggctcag ttagagtatg tgggatgttg gtttcgccag gtgagtgaat 3240
gaaaggactc gaccaccaca gctgagccac tagctgggcc atgcgaagag ttctaggtgc 3300
aaaggctgga gggtggaatt catttttgag aggtgtgtga gcagcttccg acccctgccc 3360
catttgaacg ggggccttgc tggtcgcgtc cctgcattca cctgcgcggc catcccgtca 3420
tccaacagtt gatcctaact gagcacgccc acggccctgg tctggcctgg gcaccggcca 3480
ccgtagccca tcccttgatg gcctctgtgt ccccaggagg gcgggccggg gggttgccca 3540
ggggctggag cagtggactg tggctccata gaggtaggct ggagggtgtg agggcagatt 3600
caagctatcc ccagggctct gctctggtcg gagccagccc cttctccctc tctgccttcc 3660
ccgccccatt cctgatgctg aactgttctg gacccctggc cctgagtctc tcaggaccaa 3720
agtgggcacg ggaacagctg tagtgtgtgc ccccccgggc tttggcacag gtctccctct 3780
cgaggtgtgg ttgtgactgc gacccttccc ttgccgtgat gccttcctcc cccggggctt 3840
ggtccagctc cttcactctc tagcagctgc tggggcccac ctcccatgcc gaggaccagc 3900
aggggaaacc tccagggagc atctgcaggc tctgcttctg cccggctgct ggcttgctct 3960
ccctggtggc tctccagcgg ccagcttcct cacccacccg gcactctgct ttgctctgtc 4020
tcctgaggtg ggcctgacca acctcccctt ctctgcctca gtccctgggc tccagggctc 4080
agctccacag ccctctgcct agcaggctgg ttctccctgc caagcccata cctgtggtca 4140
cctggccctc ctgtggtctg agtaccactc ccctgcccca ggagccactc ccactccagc 4200
tgcctgtttc cagcaggttc ccagtgtccc cgacaagccc ctgctggtgt ctccatctcc 4260
tgccaagcat cctccagtgc ctcctcctgt gggcctggcc tcagggctat ggacagactc 4320
ctgtcccatc ccagagaccc ctcgtgatcg tgccctggca cgtgggccgt ggcccggctg 4380
ggtcggctga agaactgcgg atggaagctg cggaagaggc cctgatgggg cccaccatcc 4440
cggacccaag tcttcttcct ggcgggcctc tcgtctcctt cctggtttgg gcggaagcca 4500
tcacctggat gcctacgtgg gaagggacct cgaatgtggg accccagccc ctctccagct 4560
cgaaatccct ccacagccac ggggacaccc tgcacctatt cccacgggac aggctggacc 4620
caaagactct ggacccgggg cctccccttg agtagagacc cgccctctga ctgatggacg 4680
ccgctgacct ggggtcagac ccgtgggctg gacccctgcc caccccgcag gaaccctgag 4740
gcctagggga gctgttgagc cttcagtgtc tgcatgtggg aagtgggctc cttcacctac 4800
ctcacagggc tgttgtgagg ggcgctgtga tgcggttcca aagcacaggg cttggcgcac 4860
ccccctgtgc tctcaataaa tgtgtttcct gtcttaaaaa aaaaaaaaaa aaaaaaaaaa 4920
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 4952
<210>11
<211>4952
<212>DNA
<213>智人(Homo sapiens)
<220>
<221>CDS
<222>(2696)..(3139)
<400>11
gctaagcagt aaactaaagg attatatatt attagtctca gtggttttca gatttatttt 60
taaaggggaa aacagggaaa acccatcgta tttgtaaagc actttaggat tttgccgttt 120
gtttctgatt gtttgaagat tagggctttt tggtgcgtgg tcacctttca cctctccttt 180
taggatttag tcctttccag tctgctcttt ttgtgcgtgt cacaaccata ttcttgtggt 240
tctggctcat attgtagaac tgctgaacat aaggagaggt agccagctgt atggtcggat 300
ttaatatata atgttatatg ttgggatatc ttagtggttt gttttctgag gtaagtttct 360
tagtgttgtg tttgagacat tgtgtttgcg tttatggcga cactgtcatt catgcacttg 420
gccatctgag cgtggataca gcgggcactc gggtctctct gccagatgga tgaaagcagt 480
gtacattcca gtgtgggaga cagacatgtg gacaggtaaa ttacaaggca gtgtgataaa 540
gagtagagag ttggttgaga gagatcttag acaccatcct cattgtgtag atgagaaagc 600
aaaagtcacc aaggcagcct ggcagctggg acccaagaag cctagggtgc cagtccttgg 660
gcagtgcggg gttaggcaca cccagggccc tcctggttcc tggctgactc ttggactctt 720
tgtctctaat tggaggccat gatgcccagc tgtaaggtgg tcagcttcat ttgagacact 780
atatccttta gcacagcggg gtaatttctt ccctcctgtt tcattcattt accaaatggc 840
ctcctaaatg atctaaaatc acttggatct tttgtctttg tggacctaac acctggcttt 900
taaagtttaa ctttctgtcc cctcttcagc ttgctaaaat tgaaaagtgt tgcagcccaa 960
cctccacaaa tcttgtctca ggaaataaga gacatttgtt aacatttgtt ttgtacctct 1020
cagcagctta gttgacaagg gcaccgtgtg ggatttcctg ttcttgctca tttggaaaga 1080
gaatgttctt tgttcttaga ccctcagctc tcatgtgaga gccatagaat gttgcgaggt 1140
ggagttctgt ggatacagaa ggaatgtttt caagttagac ttactgccaa tgttaggatt 1200
tgggactttg catgattggg agggagaggg agtgctggag aacaggttaa aagttgtccc 1260
gctgagcttg gagcatctcc tgccaacccg gagtgcttcc caggaaccct gccagtgtca 1320
cttggggtta tgttttctga tttggaaaca ttaagccgta tgcaggtctc ttcagaactg 1380
gttcttcagc cggattgccc tggaaagcag agattgcagc tcttctaaaa ctgcctctca 1440
cagaagttcc aaggccaggc taaatattga atgcagtact cagcagctgg gacacctgat 1500
gctttggtgg ccatcccttt cttccatcca aaagggcccc cactggaagg catctgttgt 1560
tttaaaaata ttttaggact catttttact tcccccactc cctcaagatc acatacactc 1620
cccagtgggg gttacagcct cttaggagga atcgctgctt catgacttcc tcaggcattt 1680
acatttttct cttctgttga cttaaatcat gaactaaaat ttatccctag aggaaaaaag 1740
aatgcttcct ccattctggg ctcttctcac tgtacccaga ctatgtcttc aggactctca 1800
tctcttgtca gttctgttgt gctagaaaga ctggtttgaa aaaattcagc tcgtgtaaac 1860
ctgtgccctc caccctgtgg ggaacccatg tggggagcct ttgaaaatat cacttatcag 1920
ctgggcgcag tggctcatgc ctgtaatccc agcacgttgg gaggatgagg tgggcggatc 1980
atgaggtcag gagttcaaga ccagcctggc caacatagtg aaaccccgtc tctactaaaa 2040
atacaaaaaa aaaaaaaaaa aaaaccgaga ctagttctct ctctgtctcc tgcctgaacc 2100
ctcctcctct ttttgttctg atctttgagc tccctagagc ccataattct ttagagcagg 2160
tatgtcccga gtctgaaaca tgcccttatt tgtcccaagc tctggacatt tctcacccca 2220
aggcggatca atcatgatta aatcactcca attaaacttt aggctccagt cagaccttca 2280
gccaaatgga aaaaaaaact aggggataag ggaggtagtt ggagcaagaa aatgttatta 2340
gttgaaacct tacgggacct tcctccctta gtgagtctgt tggctaaagg ttctctggct 2400
tcgtgaatta gaattggata ctgtttccaa gttagcaaaa ccaactctac cccagcaccc 2460
cacgaggaag aatgtggaag gatctcccat tggccggttg gggcaaaagc ctgaggcaat 2520
ctttcatccc cttttgccaa ggcgagactt tcccagtgac ggtgatgtag ttggccactc 2580
tgactatggg tggactcggg tgtagacctc tgaagctgag atcacacgaa aacctggcct 2640
ccccgccatg tagctgttgg agagtagaaa aatagagcac gcctgatgtt tctaa atg 2698
Met
1
aga aga ctt tca ata gta atg aag aat cca tgg cac tct cct cac cct 2746
Arg Arg Leu Ser Ile Val Met Lys Asn Pro Trp His Ser Pro His Pro
5 10 15
caa aca cat ggc agt cat tca cat aca ggc ccc aaa gcc act gtt agt 2794
Gln Thr His Gly Ser His Ser His Thr Gly Pro Lys Ala Thr Val Ser
20 25 30
gct gca gta gct cct gtg gac att gga aag ccc gga gag ggc gtg gaa 2842
Ala Ala Val Ala Pro Val Asp Ile Gly Lys Pro Gly Glu Gly Val Glu
35 40 45
gaa atc agc tgg ccc ccg gca ggt tct ctg ggg ttt tgt gcc caa ggc 2890
Glu Ile Ser Trp Pro Pro Ala Gly Ser Leu Gly Phe Cys Ala Gln Gly
50 55 60 65
tcc tgg agc cct aaa aac ttt caa aag tta act ccc cac gtc ccc atc 2938
Ser Trp Ser Pro Lys Asn Phe Gln Lys Leu Thr Pro His Val Pro Ile
70 75 80
ctg ctt ggg ttt ctg gac ttt tct gag gca ccg gca gag ggg tct cgt 2986
Leu Leu Gly Phe Leu Asp Phe Ser Glu Ala Pro Ala Glu Gly Ser Arg
85 90 95
tgc tcc ctt gag tgt agg ggc agc cct tta acc tgg ctc ctt gag tcc 3034
Cys Ser Leu Glu Cys Arg Gly Ser Pro Leu Thr Trp Leu Leu Glu Ser
100 105 110
ctg ctt ttt ctg ctt ctg ttg cct tct tcc tcg tct tcc tct ctc tca 3082
Leu Leu Phe Leu Leu Leu Leu Pro Ser Ser Ser Ser Ser Ser Leu Ser
115 120 125
ata tct ccc tct ctt tgt ccc tcc cca gtt cct gac ctg gcc atc ccg 3130
Ile Ser Pro Ser Leu Cys Pro Ser Pro Val Pro Asp Leu Ala Ile Pro
130 135 140 145
ggg tgc cct tgaccagccc cgtgcctcct cagggtgtcc cagcaccagc 3179
Gly Cys Pro
ctggcacaga gtggggctca gttagagtat gtgggatgtt ggtttcgcca ggtgagtgaa 3239
tgaaaggact cgaccaccac agctgagcca ctagctgggc catgcgaaga gttctaggtg 3299
caaaggctgg agggtggaat tcatttttga gaggtgtgtg agcagcttcc gacccctgcc 3359
ccatttgaac gggggccttg ctggtcgcgt ccctgcattc acctgcgcgg ccatcccgtc 3419
atccaacagt tgatcctaac tgagcacgcc cacggccctg gtctggcctg ggcaccggcc 3479
accgtagccc atcccttgat ggcctctgtg tccccaggag ggcgggccgg ggggttgccc 3539
aggggctgga gcagtggact gtggctccat agaggtaggc tggagggtgt gagggcagat 3599
tcaagctatc cccagggctc tgctctggtc ggagccagcc ccttctccct ctctgccttc 3659
cccgccccat tcctgatgct gaactgttct ggacccctgg ccctgagtct ctcaggacca 3719
aagtgggcac gggaacagct gtagtgtgtg cccccccggg ctttggcaca ggtctccctc 3779
tcgaggtgtg gttgtgactg cgacccttcc cttgccgtga tgccttcctc ccccggggct 3839
tggtccagct ccttcactct ctagcagctg ctggggccca cctcccatgc cgaggaccag 3899
caggggaaac ctccagggag catctgcagg ctctgcttct gcccggctgc tggcttgctc 3959
tccctggtgg ctctccagcg gccagcttcc tcacccaccc ggcactctgc tttgctctgt 4019
ctcctgaggt gggcctgacc aacctcccct tctctgcctc agtccctggg ctccagggct 4079
cagctccaca gccctctgcc tagcaggctg gttctccctg ccaagcccat acctgtggtc 4139
acctggccct cctgtggtct gagtaccact cccctgcccc aggagccact cccactccag 4199
ctgcctgttt ccagcaggtt cccagtgtcc ccgacaagcc cctgctggtg tctccatctc 4259
ctgccaagca tcctccagtg cctcctcctg tgggcctggc ctcagggcta tggacagact 4319
cctgtcccat cccagagacc cctcgtgatc gtgccctggc acgtgggccg tggcccggct 4379
gggtcggctg aagaactgcg gatggaagct gcggaagagg ccctgatggg gcccaccatc 4439
ccggacccaa gtcttcttcc tggcgggcct ctcgtctcct tcctggtttg ggcggaagcc 4499
atcacctgga tgcctacgtg ggaagggacc tcgaatgtgg gaccccagcc cctctccagc 4559
tcgaaatccc tccacagcca cggggacacc ctgcacctat tcccacggga caggctggac 4619
ccaaagactc tggacccggg gcctcccctt gagtagagac ccgccctctg actgatggac 4679
gccgctgacc tggggtcaga cccgtgggct ggacccctgc ccaccccgca ggaaccctga 4739
ggcctagggg agctgttgag ccttcagtgt ctgcatgtgg gaagtgggct ccttcaccta 4799
cctcacaggg ctgttgtgag gggcgctgtg atgcggttcc aaagcacagg gcttggcgca 4859
cccccctgtg ctctcaataa atgtgtttcc tgtcttaaaa aaaaaaaaaa aaaaaaaaaa 4919
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 4952
<210>12
<211>148
<212>PRT
<213>智人(Homo sapiens)
<400>12
Met Arg Arg Leu Ser Ile Val Met Lys Asn Pro Trp His Ser Pro His
1 5 10 15
Pro Gln Thr His Gly Ser His Ser His Thr Gly Pro Lys Ala Thr Val
20 25 30
Ser Ala Ala Val Ala Pro Val Asp Ile Gly Lys Pro Gly Glu Gly Val
35 40 45
Glu Glu Ile Ser Trp Pro Pro Ala Gly Ser Leu Gly Phe Cys Ala Gln
50 55 60
Gly Ser Trp Ser Pro Lys Asn Phe Gln Lys Leu Thr Pro His Val Pro
65 70 75 80
Ile Leu Leu Gly Phe Leu Asp Phe Ser Glu Ala Pro Ala Glu Gly Ser
85 90 95
Arg Cys Ser Leu Glu Cys Arg Gly Ser Pro Leu Thr Trp Leu Leu Glu
100 105 110
Ser Leu Leu Phe Leu Leu Leu Leu Pro Ser Ser Ser Ser Ser Ser Leu
115 120 125
Ser Ile Ser Pro Ser Leu Cys Pro Ser Pro Val Pro Asp Leu Ala Ile
130 135 140
Pro Gly Cys Pro
145
<210>13
<211>3112
<212>DNA
<213>智人(Homo sapiens)
<400>13
gcgacggcga gagctagagc gggcgcagcg ttagggtggc cgtgcaaggg gagccgtggc 60
ccgggcccgg ggcgtgcgag acggcggaag cagcccaggg ccttgctgcc gccatgactg 120
aggaatcaga ggagacagtc ctgtacattg agcaccgcta tgtctgctct gagtgcaacc 180
agctgtatgg atcactggaa gaggtgctta tgcaccaaaa ctcccacgtg ccccagcagc 240
actttgagct ggtgggcgtg gctgatcccg gagtcactgt ggccacagac acagcttcag 300
gcacgggcct ctatcagacc cttgtgcagg agagccagta ccagtgcctg gagtgtggtc 360
aactgctgat gtcacccagc cagctcctgg agcaccagga gctgcacctg aagatgatgg 420
caccccagga ggcagtgcca gctgagccat cacctaaggc accacccctg agctccagca 480
ccatccacta cgagtgtgtg gattgcaagg ctctctttgc cagccaggag ctctggctga 540
accaccggca gacgcacctc cgggccacac ccaccaaggc tcctgcccct gttgtcctgg 600
ggtccccagt tgttctaggg cctcctgtgg gccaggcccg agtggctgtg gagcactcat 660
accgaaaggc agaagagggt ggggaagggg cgactgtccc atctgccgct gccaccacca 720
ctgaggtagt gactgaggtg gagctgctcc tctacaagtg ctctgagtgc tcccagctct 780
tccagctgcc ggcggatttc ctggagcacc aggccactca cttccctgct cctgtacccg 840
agtctcagga gcctgcctta cagcaggagg tgcaggcctc gtcacctgca gaggtgcctg 900
tgtctcagcc tgaccccttg ccagcttctg accacagtta cgagctgcgc aatggtgaag 960
ccattgggcg ggatcgccgg gggcgcaggg cccggaggaa caacagtgga gaagcaggcg 1020
gggcagccac acaggagctc ttctgctcag cctgtgacca gctctttctc tcaccccacc 1080
agctacagca gcacctgcgg agtcaccggg agggcgtctt taagtgcccc ctgtgcagtc 1140
gtgtcttccc tagcccttcc agtctggacc agcaccttgg agaccatagc agcgagtcac 1200
acttcctgtg tgtagactgt ggcctggcct tcggcacaga ggccctcctc ctggcccacc 1260
ggcgagccca caccccgaat cctctgcatt catgtccatg tgggaagacc tttgtcaacc 1320
ttaccaagtt cctttatcac cggcgtactc atggggtagg gggggtgtcc ctctgcccac 1380
aacaccagtc ccaccagagg aacctgtcat tggtttccct gagccagccc cagcagagac 1440
tggagagcca gaggcccctg agccccctgt gtctgaggag acctcagcag ggcccgctgc 1500
cccaggcacc taccgctgcc tcctgtgcag ccgtgaattt ggaaaggcct tgcagctgac 1560
ccggcaccaa cgttttgtgc atcggctgga gcggcgccat aaatgcagca tttgtggcaa 1620
gatgttcaag aagaagtctc acgtgcgtaa ccacctgcgc acacacacag gggagcggcc 1680
cttcccctgc cctgactgct ccaagccctt caactcacct gccaacctgg cccgccaccg 1740
gctcacacac acaggagagc ggccctaccg gtgtggggac tgtggcaagg ctttcacgca 1800
aagctccaca ctgaggcagc accgcttggt gcatgcccag cactttccct accgctgcca 1860
ggaatgtggg gtgcgttttc accgtcctta ccggctgctc atgcaccgct accatcacac 1920
aggtgaatac ccctacaagt gtcgcgagtg cccccgctcc ttcttgctgc gtcggctgct 1980
ggaggtgcac cagctcgtgg tccatgccgg gcgccagccc caccgctgcc catcctgtgg 2040
ggctgccttc ccctcctcac tgcggctccg ggagcaccgc tgtgcagccg ctgctgccca 2100
ggccccacgg cgctttgagt gtggcacctg tggcaagaaa gtgggctcag ctgctcgact 2160
gcaggcacac gaggcggccc atgcagctgc tgggcctgga gaggtcctgg ctaaggagcc 2220
ccctgcccct cgagccccac gggccactcg tgcaccagtt gcctctccag cagcccttgg 2280
aagcactgct acagcatccc ctgcggcccc tgcccgccgc cggggtctag agtgcagcga 2340
gtgcaagaag ctgttcagca cagagacgtc actgcaggtg caccggcgca tccacacagg 2400
tgagcggcca tacccatgtc cagactgtgg caaagcgttc cgtcagagta cccacctgaa 2460
agacaccggc gcctgcacac aggtgagcgg ccctttgcct gtgaagtgtg tggcaaggcc 2520
tttgccatct ccatgcgcct ggcagaacat cgccgcatcc acacaggcga acgaccctac 2580
tcctgccctg actgtggcaa gagctaccgc tccttctcca acctctggaa gcaccgcaag 2640
acccatcagc agcagcatca ggcagctgtg cggcagcagc tggcagaggc ggaggctgcc 2700
gttggcctgg ccgtcatgga gactgctgtg gaggcgctac ccctggtgga agccattgag 2760
atctaccctc tggccgaggc tgagggggtc cagatcagtg gctgactctg cccgacttcc 2820
tctttggcac ctccattccc tgttgctgaa ggccctccag catcccctta agcatctgta 2880
catactgtgt cccttcctct tcccatcccc accaccttgt aagttctaaa ttggatttat 2940
tctctcgtga ggggggtgct ctggggtcct tgacacacat aaaggtgccc ccccaccttc 3000
cacctcttag cactggtgac cccaaaaatg aaaccatcaa taaagactga gttgccagca 3060
gtgtgtagag tggaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 3112
<210>14
<211>3112
<212>DNA
<213>智人(Homo sapiens)
<220>
<221>CDS
<222>(1292)..(2926)
<400>14
gcgacggcga gagctagagc gggcgcagcg ttagggtggc cgtgcaaggg gagccgtggc 60
ccgggcccgg ggcgtgcgag acggcggaag cagcccaggg ccttgctgcc gccatgactg 120
aggaatcaga ggagacagtc ctgtacattg agcaccgcta tgtctgctct gagtgcaacc 180
agctgtatgg atcactggaa gaggtgctta tgcaccaaaa ctcccacgtg ccccagcagc 240
actttgagct ggtgggcgtg gctgatcccg gagtcactgt ggccacagac acagcttcag 300
gcacgggcct ctatcagacc cttgtgcagg agagccagta ccagtgcctg gagtgtggtc 360
aactgctgat gtcacccagc cagctcctgg agcaccagga gctgcacctg aagatgatgg 420
caccccagga ggcagtgcca gctgagccat cacctaaggc accacccctg agctccagca 480
ccatccacta cgagtgtgtg gattgcaagg ctctctttgc cagccaggag ctctggctga 540
accaccggca gacgcacctc cgggccacac ccaccaaggc tcctgcccct gttgtcctgg 600
ggtccccagt tgttctaggg cctcctgtgg gccaggcccg agtggctgtg gagcactcat 660
accgaaaggc agaagagggt ggggaagggg cgactgtccc atctgccgct gccaccacca 720
ctgaggtagt gactgaggtg gagctgctcc tctacaagtg ctctgagtgc tcccagctct 780
tccagctgcc ggcggatttc ctggagcacc aggccactca cttccctgct cctgtacccg 840
agtctcagga gcctgcctta cagcaggagg tgcaggcctc gtcacctgca gaggtgcctg 900
tgtctcagcc tgaccccttg ccagcttctg accacagtta cgagctgcgc aatggtgaag 960
ccattgggcg ggatcgccgg gggcgcaggg cccggaggaa caacagtgga gaagcaggcg 1020
gggcagccac acaggagctc ttctgctcag cctgtgacca gctctttctc tcaccccacc 1080
agctacagca gcacctgcgg agtcaccggg agggcgtctt taagtgcccc ctgtgcagtc 1140
gtgtcttccc tagcccttcc agtctggacc agcaccttgg agaccatagc agcgagtcac 1200
acttcctgtg tgtagactgt ggcctggcct tcggcacaga ggccctcctc ctggcccacc 1260
ggcgagccca caccccgaat cctctgcatt c atg tcc atg tgg gaa gac ctt 1312
Met Ser Met Trp Glu Asp Leu
1 5
tgt caa cct tac caa gtt cct tta tca ccg gcg tac tca tgg ggt agg 1360
Cys Gln Pro Tyr Gln Val Pro Leu Ser Pro Ala Tyr Ser Trp Gly Arg
10 15 20
ggg ggt gtc cct ctg ccc aca aca cca gtc cca cca gag gaa cct gtc 1408
Gly Gly Val Pro Leu Pro Thr Thr Pro Val Pro Pro Glu Glu Pro Val
25 30 35
att ggt ttc cct gag cca gcc cca gca gag act gga gag cca gag gcc 1456
Ile Gly Phe Pro Glu Pro Ala Pro Ala Glu Thr Gly Glu Pro Glu Ala
40 45 50 55
cct gag ccc cct gtg tct gag gag acc tca gca ggg ccc gct gcc cca 1504
Pro Glu Pro Pro Val Ser Glu Glu Thr Ser Ala Gly Pro Ala Ala Pro
60 65 70
ggc acc tac cgc tgc ctc ctg tgc agc cgt gaa ttt gga aag gcc ttg 1552
Gly Thr Tyr Arg Cys Leu Leu Cys Ser Arg Glu Phe Gly Lys Ala Leu
75 80 85
cag ctg acc cgg cac caa cgt ttt gtg cat cgg ctg gag cgg cgc cat 1600
Gln Leu Thr Arg His Gln Arg Phe Val His Arg Leu Glu Arg Arg His
90 95 100
aaa tgc agc att tgt ggc aag atg ttc aag aag aag tct cac gtg cgt 1648
Lys Cys Ser Ile Cys Gly Lys Met Phe Lys Lys Lys Ser His Val Arg
105 110 115
aac cac ctg cgc aca cac aca ggg gag cgg ccc ttc ccc tgc cct gac 1696
Asn His Leu Arg Thr His Thr Gly Glu Arg Pro Phe Pro Cys Pro Asp
120 125 130 135
tgc tcc aag ccc ttc aac tca cct gcc aac ctg gcc cgc cac cgg ctc 1744
Cys Ser Lys Pro Phe Asn Ser Pro Ala Asn Leu Ala Arg His Arg Leu
140 145 150
aca cac aca gga gag cgg ccc tac cgg tgt ggg gac tgt ggc aag gct 1792
Thr His Thr Gly Glu Arg Pro Tyr Arg Cys Gly Asp Cys Gly Lys Ala
155 160 165
ttc acg caa agc tcc aca ctg agg cag cac cgc ttg gtg cat gcc cag 1840
Phe Thr Gln Ser Ser Thr Leu Arg Gln His Arg Leu Val His Ala Gln
170 175 180
cac ttt ccc tac cgc tgc cag gaa tgt ggg gtg cgt ttt cac cgt cct 1888
His Phe Pro Tyr Arg Cys Gln Glu Cys Gly Val Arg Phe His Arg Pro
185 190 195
tac cgg ctg ctc atg cac cgc tac cat cac aca ggt gaa tac ccc tac 1936
Tyr Arg Leu Leu Met His Arg Tyr His His Thr Gly Glu Tyr Pro Tyr
200 205 210 215
aag tgt cgc gag tgc ccc cgc tcc ttc ttg ctg cgt cgg ctg ctg gag 1984
Lys Cys Arg Glu Cys Pro Arg Ser Phe Leu Leu Arg Arg Leu Leu Glu
220 225 230
gtg cac cag ctc gtg gtc cat gcc ggg cgc cag ccc cac cgc tgc cca 2032
Val His Gln Leu Val Val His Ala Gly Arg Gln Pro His Arg Cys Pro
235 240 245
tcc tgt ggg gct gcc ttc ccc tcc tca ctg cgg ctc cgg gag cac cgc 2080
Ser Cys Gly Ala Ala Phe Pro Ser Ser Leu Arg Leu Arg Glu His Arg
250 255 260
tgt gca gcc gct gct gcc cag gcc cca cgg cgc ttt gag tgt ggc acc 2128
Cys Ala Ala Ala Ala Ala Gln Ala Pro Arg Arg Phe Glu Cys Gly Thr
265 270 275
tgt ggc aag aaa gtg ggc tca gct gct cga ctg cag gca cac gag gcg 2176
Cys Gly Lys Lys Val Gly Ser Ala Ala Arg Leu Gln Ala His Glu Ala
280 285 290 295
gcc cat gca gct gct ggg cct gga gag gtc ctg gct aag gag ccc cct 2224
Ala His Ala Ala Ala Gly Pro Gly Glu Val Leu Ala Lys Glu Pro Pro
300 305 310
gcc cct cga gcc cca cgg gcc act cgt gca cca gtt gcc tct cca gca 2272
Ala Pro Arg Ala Pro Arg Ala Thr Arg Ala Pro Val Ala Ser Pro Ala
315 320 325
gcc ctt gga agc act gct aca gca tcc cct gcg gcc cct gcc cgc cgc 2320
Ala Leu Gly Ser Thr Ala Thr Ala Ser Pro Ala Ala Pro Ala Arg Arg
330 335 340
cgg ggt cta gag tgc agc gag tgc aag aag ctg ttc agc aca gag acg 2368
Arg Gly Leu Glu Cys Ser Glu Cys Lys Lys Leu Phe Ser Thr Glu Thr
345 350 355
tca ctg cag gtg cac cgg cgc atc cac aca ggt gag cgg cca tac cca 2416
Ser Leu Gln Val His Arg Arg Ile His Thr Gly Glu Arg Pro Tyr Pro
360 365 370 375
tgt cca gac tgt ggc aaa gcg ttc cgt cag agt acc cac ctg aaa gac 2464
Cys Pro Asp Cys Gly Lys Ala Phe Arg Gln Ser Thr His Leu Lys Asp
380 385 390
acc ggc gcc tgc aca cag gtg agc ggc cct ttg cct gtg aag tgt gtg 2512
Thr Gly Ala Cys Thr Gln Val Ser Gly Pro Leu Pro Val Lys Cys Val
395 400 405
gca agg cct ttg cca tct cca tgc gcc tgg cag aac atc gcc gca tcc 2560
Ala Arg Pro Leu Pro Ser Pro Cys Ala Trp Gln Asn Ile Ala Ala Ser
410 415 420
aca cag gcg aac gac cct act cct gcc ctg act gtg gca aga gct acc 2608
Thr Gln Ala Asn Asp Pro Thr Pro Ala Leu Thr Val Ala Arg Ala Thr
425 430 435
gct cct tct cca acc tct gga agc acc gca aga ccc atc agc agc agc 2656
Ala Pro Ser Pro Thr Ser Gly Ser Thr Ala Arg Pro Ile Ser Ser Ser
440 445 450 455
atc agg cag ctg tgc ggc agc agc tgg cag agg cgg agg ctg ccg ttg 2704
Ile Arg Gln Leu Cys Gly Ser Ser Trp Gln Arg Arg Arg Leu Pro Leu
460 465 470
gcc tgg ccg tca tgg aga ctg ctg tgg agg cgc tac ccc tgg tgg aag 2752
Ala Trp Pro Ser Trp Arg Leu Leu Trp Arg Arg Tyr Pro Trp Trp Lys
475 480 485
cca ttg aga tct acc ctc tgg ccg agg ctg agg ggg tcc aga tca gtg 2800
Pro Leu Arg Ser Thr Leu Trp Pro Arg Leu Arg Gly Ser Arg Ser Val
490 495 500
gct gac tct gcc cga ctt cct ctt tgg cac ctc cat tcc ctg ttg ctg 2848
Ala Asp Ser Ala Arg Leu Pro Leu Trp His Leu His Ser Leu Leu Leu
505 510 515
aag gcc ctc cag cat ccc ctt aag cat ctg tac ata ctg tgt ccc ttc 2896
Lys Ala Leu Gln His Pro Leu Lys His Leu Tyr Ile Leu Cys Pro Phe
520 525 530 535
ctc ttc cca tcc cca cca cct tgt aag ttc taaattggat ttattctctc 2946
Leu Phe Pro Ser Pro Pro Pro Cys Lys Phe
540 545
gtgagggggg tgctctgggg tccttgacac acataaaggt gcccccccac cttccacctc 3006
ttagcactgg tgaccccaaa aatgaaacca tcaataaaga ctgagttgcc agcagtgtgt 3066
agagtggaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 3112
<210>15
<211>545
<212>PRT
<213>智人(Homo sapiens)
<400>15
Met Ser Met Trp Glu Asp Leu Cys Gln Pro Tyr Gln Val Pro Leu Ser
1 5 10 15
Pro Ala Tyr Ser Trp Gly Arg Gly Gly Val Pro Leu Pro Thr Thr Pro
20 25 30
Val Pro Pro Glu Glu Pro Val Ile Gly Phe Pro Glu Pro Ala Pro Ala
35 40 45
Glu Thr Gly Glu Pro Glu Ala Pro Glu Pro Pro Val Ser Glu Glu Thr
50 55 60
Ser Ala Gly Pro Ala Ala Pro Gly Thr Tyr Arg Cys Leu Leu Cys Ser
65 70 75 80
Arg Glu Phe Gly Lys Ala Leu Gln Leu Thr Arg His Gln Arg Phe Val
85 90 95
His Arg Leu Glu Arg Arg His Lys Cys Ser Ile Cys Gly Lys Met Phe
100 105 110
Lys Lys Lys Ser His Val Arg Asn His Leu Arg Thr His Thr Gly Glu
115 120 125
Arg Pro Phe Pro Cys Pro Asp Cys Ser Lys Pro Phe Asn Ser Pro Ala
130 135 140
Asn Leu Ala Arg His Arg Leu Thr His Thr Gly Glu Arg Pro Tyr Arg
145 150 155 160
Cys Gly Asp Cys Gly Lys Ala Phe Thr Gln Ser Ser Thr Leu Arg Gln
165 170 175
His Arg Leu Val His Ala Gln His Phe Pro Tyr Arg Cys Gln Glu Cys
180 185 190
Gly Val Arg Phe His Arg Pro Tyr Arg Leu Leu Met His Arg Tyr His
195 200 205
His Thr Gly Glu Tyr Pro Tyr Lys Cys Arg Glu Cys Pro Arg Ser Phe
210 215 220
Leu Leu Arg Arg Leu Leu Glu Val His Gln Leu Val Val His Ala Gly
225 230 235 240
Arg Gln Pro His Arg Cys Pro Ser Cys Gly Ala Ala Phe Pro Ser Ser
245 250 255
Leu Arg Leu Arg Glu His Arg Cys Ala Ala Ala Ala Ala Gln Ala Pro
260 265 270
Arg Arg Phe Glu Cys Gly Thr Cys Gly Lys Lys Val Gly Ser Ala Ala
275 280 285
Arg Leu Gln Ala His Glu Ala Ala His Ala Ala Ala Gly Pro Gly Glu
290 295 300
Val Leu Ala Lys Glu Pro Pro Ala Pro Arg Ala Pro Arg Ala Thr Arg
305 310 315 320
Ala Pro Val Ala Ser Pro Ala Ala Leu Gly Ser Thr Ala Thr Ala Ser
325 330 335
Pro Ala Ala Pro Ala Arg Arg Arg Gly Leu Glu Cys Ser Glu Cys Lys
340 345 350
Lys Leu Phe Ser Thr Glu Thr Ser Leu Gln Val His Arg Arg Ile His
355 360 365
Thr Gly Glu Arg Pro Tyr Pro Cys Pro Asp Cys Gly Lys Ala Phe Arg
370 375 380
Gln Ser Thr His Leu Lys Asp Thr Gly Ala Cys Thr Gln Val Ser Gly
385 390 395 400
Pro Leu Pro Val Lys Cys Val Ala Arg Pro Leu Pro Ser Pro Cys Ala
405 410 415
Trp Gln Ash Ile Ala Ala Ser Thr Gln Ala Asn Asp Pro Thr Pro Ala
420 425 430
Leu Thr Val Ala Arg Ala Thr Ala Pro Ser Pro Thr Ser Gly Ser Thr
435 440 445
Ala Arg Pro Ile Ser Ser Ser Ile Arg Gln Leu Cys Gly Ser Ser Trp
450 455 460
Gln Arg Arg Arg Leu Pro Leu Ala Trp Pro Ser Trp Arg Leu Leu Trp
465 470 475 480
Arg Arg Tyr Pro Trp Trp Lys Pro Leu Arg Ser Thr Leu Trp Pro Arg
485 490 495
Leu Arg Gly Ser Arg Ser Val Ala Asp Ser Ala Arg Leu Pro Leu Trp
500 505 510
His Leu His Ser Leu Leu Leu Lys Ala Leu Gln His Pro Leu Lys His
515 520 525
Leu Tyr Ile Leu Cys Pro Phe Leu Phe Pro Ser Pro Pro Pro Cys Lys
530 535 540
Phe
545
<210>16
<211>3102
<212>DNA
<213>智人(Homo sapiens)
<400>16
gggcagaggt tgcagtaacc caagatcatg ccaccatact acagactgtg tgacagagcg 60
agactctgtc tcaaaacaac aacaaaaaaa caaactcacc attgtacctg tgcttatgca 120
aggtttagta ggaacgtaaa ttggtttaac ctttgtggac agaagtttta aaaatatata 180
ttaaaattaa aagtatgctc tgaaggagga actccacttc tggtaattta tctcaagaga 240
ataactgggc cagcacaaag gctgctgttt aacaatgtgt aatgatgcag tgacagctac 300
aattgcaaaa ataacctaga cattcaccaa tgaggactgg ttaaatgaac tagtataacc 360
atactgcaga atatcataaa gataacaaaa aaatgatatg gatctgtttc ttggcataaa 420
tatatccata agttttaaga agagatgcta tatatacggt ggtcccattg atgtataact 480
gttaggacta aaaatagtac cttcctcata atgatgtttt gaggaattaa tgagtttatt 540
catgcaaaat gcttagaatg gtacctggca cacagacaat gtttaagaaa tgtttgttat 600
tgttattact atgtctctgt atatatgcat aggaaaaatc tggaaggata aaataaaaaa 660
tgaatatttt tgggtggtga gactaaattt ttgtctaatt ttatggataa gttttatgat 720
ttatatttat aataaaaata aagctataaa aattaattat gatgtttctt gctcatgtca 780
gctacttcac tacatactga gttcccatcc ccatttgtta caggagcaac tcctggttaa 840
gtaccttttt tgtaactgtg aaattccctt gacattcatc atatactgat gacttttcct 900
aatacatgga aacaaacagg attgtgattt ttctctcatt ttgtacacta agttctatgc 960
cagccgattt cagagagaca ctctgcaaag ttcctatgaa aagtcttcaa aaatgtatta 1020
ccttgctgtt taataccaat accaaaattc aaatggactt atcaattaaa ctcacctcaa 1080
acacagtaat gcactcacag ttatgagcag tgctcactac tgccaatcat ttctgcttcc 1140
agaatggtta aaggagccac aaactctgcc cttatcagaa gcagtagcct gataacaggt 1200
aagaatagga atgttccgtt tctccccaaa ttaagagtgg tatcaataat ctgacttttc 1260
caggcattta tctcacagaa atgtttatga gacatgctaa gatcaacatg gtaatatctg 1320
actattgttt ttattagaaa taagggggcc agccaggcac agtagcttac acctgtaatc 1380
ccagcacctg gggaggttga ggtgggagga ttgcttgagc ccaggagttt gagacaagcc 1440
tgggcaacac agggagacac cagctctatt aaaaaaaaaa aaagtaaggg ggctataatg 1500
taacccttat tgactgatct ttgaggctac tgttgtgaga tttctacatc cctctttatt 1560
ataaaagatc ccaaatgcgg ctttacttgg aaaggaagca atttgacagt gatgaggaat 1620
gatgtgcaga atggagattc agaaccctaa cagactctgg tattgatatc tagtgctcat 1680
atttctggga gtctgctagg gttatgggag tttgcattta aattgtaggt tgttgcagaa 1740
aacagaattt atatgtggaa aattgtaacg aatccactaa aaaactatta gaactaataa 1800
tcaagtttgg caaggttgta agacataagt cagtatacaa aaatcaactg tatttctata 1860
catttgtgac aatctgaaaa tgaaattagg aaaacaaatc catttacgat agcaacaaga 1920
agtataaaat acttaggaag aagtataaca aaagatgtgc acaatttata ttctgaaaac 1980
tacaaatagt gtttaaagaa attaaagaat attaaaataa atggaaaaat atcccatgtt 2040
catggactgg aagaattact cttaagatgt caatactcct caaattgatc tacatatttg 2100
atacaatcct tgtaagaacc cgaactgact tctttgtaga aattgacaaa ttgattctaa 2160
gattcataca ggattgccat agatccagaa tagccacatc aattttaaaa aaagaagaaa 2220
gtacaaagac tcacattacc tgatttaaaa acataccata aagcaatgtt aggacagtgt 2280
ggtattgaca taaggataga cacatagatc aatgaaaagg aaagggagcc cagaagtaaa 2340
accacatcaa ctgattttca acaaagatgc caagaccatt caattgagga aagaatagtc 2400
ccttcaacaa atggtgctgc aaccagacag tcatatgcaa aagaatgaaa tttaaccttt 2460
acaaaattta accatatata aaaattaatt caaatggatc aaagacatat aagggctgaa 2520
actataaaat tgttaagaga acataggaat aaatattcat gaccttggat ttggcagtgg 2580
attcttagct ataacatcaa agcacaagta agaaaagaga gataaattgg atttcatgaa 2640
aattaaaaac ctgtgcttca aagacactat caagaaagtg acaaggcaac ccacagaatg 2700
ggaaaaactg cagattatct gataagggac ttctatctag aatatataaa aatctctcac 2760
aactcagaaa taagacaatc cagttaaaat aagggtaaag gagccgggca tggtggctca 2820
cgcctgtaat cccagagctt tgggaggtgg aggtgggcag atcacctgag gtcaggagtt 2880
cacgaccagc ctggccaaca tggtaaaacc ccatctctac taaaaataca aaaattagcc 2940
gggtgtggtg gtgcatgcct gtaatcccag ctacttggga ggctgaggca gaagaatcac 3000
ttgaacctgg gaggtggagg ttgcagtgag ccgagatcgc gccactgcac tccagcctgg 3060
gcgacagagc gagaatctgt ctcgaaaaaa aaaaaaaaaa aa 3102
<210>17
<211>3102
<212>DNA
<213>智人(Homo sapiens)
<220>
<221>CDS
<222>(590)..(886)
<400>17
gggcagaggt tgcagtaacc caagatcatg ccaccatact acagactgtg tgacagagcg 60
agactctgtc tcaaaacaac aacaaaaaaa caaactcacc attgtacctg tgcttatgca 120
aggtttagta ggaacgtaaa ttggtttaac ctttgtggac agaagtttta aaaatatata 180
ttaaaattaa aagtatgctc tgaaggagga actccacttc tggtaattta tctcaagaga 240
ataactgggc cagcacaaag gctgctgttt aacaatgtgt aatgatgcag tgacagctac 300
aattgcaaaa ataacctaga cattcaccaa tgaggactgg ttaaatgaac tagtataacc 360
atactgcaga atatcataaa gataacaaaa aaatgatatg gatctgtttc ttggcataaa 420
tatatccata agttttaaga agagatgcta tatatacggt ggtcccattg atgtataact 480
gttaggacta aaaatagtac cttcctcata atgatgtttt gaggaattaa tgagtttatt 540
catgcaaaat gcttagaatg gtacctggca cacagacaat gtttaagaa atg ttt gtt 598
Met Phe Val
1
att gtt att act atg tct ctg tat ata tgc ata gga aaa atc tgg aag 646
Ile Val Ile Thr Met Ser Leu Tyr Ile Cys Ile Gly Lys Ile Trp Lys
5 10 15
gat aaa ata aaa aat gaa tat ttt tgg gtg gtg aga cta aat ttt tgt 694
Asp Lys Ile Lys Asn Glu Tyr Phe Trp Val Val Arg Leu Asn Phe Cys
20 25 30 35
cta att tta tgg ata agt ttt atg att tat att tat aat aaa aat aaa 742
Leu Ile Leu Trp Ile Ser Phe Met Ile Tyr Ile Tyr Asn Lys Asn Lys
40 45 50
gct ata aaa att aat tat gat gtt tct tgc tca tgt cag cta ctt cac 790
Ala Ile Lys Ile Asn Tyr Asp Val Ser Cys Ser Cys Gln Leu Leu His
55 60 65
tac ata ctg agt tcc cat ccc cat ttg tta cag gag caa ctc ctg gtt 838
Tyr Ile Leu Ser Ser His Pro His Leu Leu Gln Glu Gln Leu Leu Val
70 75 80
aag tac ctt ttt tgt aac tgt gaa att ccc ttg aca ttc atc ata tac 886
Lys Tyr Leu Phe Cys Asn Cys Glu Ile Pro Leu Thr Phe Ile Ile Tyr
85 90 95
tgatgacttt tcctaataca tggaaacaaa caggattgtg atttttctct cattttgtac 946
actaagttct atgccagccg atttcagaga gacactctgc aaagttccta tgaaaagtct 1006
tcaaaaatgt attaccttgc tgtttaatac caataccaaa attcaaatgg acttatcaat 1066
taaactcacc tcaaacacag taatgcactc acagttatga gcagtgctca ctactgccaa 1126
tcatttctgc ttccagaatg gttaaaggag ccacaaactc tgcccttatc agaagcagta 1186
gcctgataac aggtaagaat aggaatgttc cgtttctccc caaattaaga gtggtatcaa 1246
taatctgact tttccaggca tttatctcac agaaatgttt atgagacatg ctaagatcaa 1306
catggtaata tctgactatt gtttttatta gaaataaggg ggccagccag gcacagtagc 1366
ttacacctgt aatcccagca cctggggagg ttgaggtggg aggattgctt gagcccagga 1426
gtttgagaca agcctgggca acacagggag acaccagctc tattaaaaaa aaaaaaagta 1486
agggggctat aatgtaaccc ttattgactg atctttgagg ctactgttgt gagatttcta 1546
catccctctt tattataaaa gatcccaaat gcggctttac ttggaaagga agcaatttga 1606
cagtgatgag gaatgatgtg cagaatggag attcagaacc ctaacagact ctggtattga 1666
tatctagtgc tcatatttct gggagtctgc tagggttatg ggagtttgca tttaaattgt 1726
aggttgttgc agaaaacaga atttatatgt ggaaaattgt aacgaatcca ctaaaaaact 1786
attagaacta ataatcaagt ttggcaaggt tgtaagacat aagtcagtat acaaaaatca 1846
actgtatttc tatacatttg tgacaatctg aaaatgaaat taggaaaaca aatccattta 1906
cgatagcaac aagaagtata aaatacttag gaagaagtat aacaaaagat gtgcacaatt 1966
tatattctga aaactacaaa tagtgtttaa agaaattaaa gaatattaaa ataaatggaa 2026
aaatatccca tgttcatgga ctggaagaat tactcttaag atgtcaatac tcctcaaatt 2086
gatctacata tttgatacaa tccttgtaag aacccgaact gacttctttg tagaaattga 2146
caaattgatt ctaagattca tacaggattg ccatagatcc agaatagcca catcaatttt 2206
aaaaaaagaa gaaagtacaa agactcacat tacctgattt aaaaacatac cataaagcaa 2266
tgttaggaca gtgtggtatt gacataagga tagacacata gatcaatgaa aaggaaaggg 2326
agcccagaag taaaaccaca tcaactgatt ttcaacaaag atgccaagac cattcaattg 2386
aggaaagaat agtcccttca acaaatggtg ctgcaaccag acagtcatat gcaaaagaat 2446
gaaatttaac ctttacaaaa tttaaccata tataaaaatt aattcaaatg gatcaaagac 2506
atataagggc tgaaactata aaattgttaa gagaacatag gaataaatat tcatgacctt 2566
ggatttggca gtggattctt agctataaca tcaaagcaca agtaagaaaa gagagataaa 2626
ttggatttca tgaaaattaa aaacctgtgc ttcaaagaca ctatcaagaa agtgacaagg 2686
caacccacag aatgggaaaa actgcagatt atctgataag ggacttctat ctagaatata 2746
taaaaatctc tcacaactca gaaataagac aatccagtta aaataagggt aaaggagccg 2806
ggcatggtgg ctcacgcctg taatcccaga gctttgggag gtggaggtgg gcagatcacc 2866
tgaggtcagg agttcacgac cagcctggcc aacatggtaa aaccccatct ctactaaaaa 2926
tacaaaaatt agccgggtgt ggtggtgcat gcctgtaatc ccagctactt gggaggctga 2986
ggcagaagaa tcacttgaac ctgggaggtg gaggttgcag tgagccgaga tcgcgccact 3046
gcactccagc ctgggcgaca gagcgagaat ctgtctcgaa aaaaaaaaaa aaaaaa 3102
<210>18
<211>99
<212>PRT
<213>智人(Homo sapiens)
<400>18
Met Phe Val Ile Val Ile Thr Met Ser Leu Tyr Ile Cys Ile Gly Lys
1 5 10 15
Ile Trp Lys Asp Lys Ile Lys Asn Glu Tyr Phe Trp Val Vál Arg Leu
20 25 30
Asn Phe Cys Leu Ile Leu Trp Ile Ser Phe Met Ile Tyr Ile Tyr Asn
35 40 45
Lys Asn Lys Ala Ile Lys Ile Asn Tyr Asp Val Ser Cys Ser Cys Gln
50 55 60
Leu Leu His Tyr Ile Leu Ser Ser His Pro His Leu Leu Gln Glu Gln
65 70 75 80
Leu Leu Val Lys Tyr Leu Phe Cys Asn Cys Glu Ile Pro Leu Thr Phe
85 90 95
Ile Ile Tyr
<210>19
<211>2455
<212>DNA
<213>智人(Homo sapiens)
<400>19
gttctaggta gtagaaagca aagggtgcta tgaagagcgt gtacacagac tcccaactgt 60
tttgggagtt aaggaaggtt tcttggagga agtggcattc aagctataag acctgatgat 120
caggtggagt tagctggaga gcagggacag agagaatagc ctgtgcaaaa ggcctattct 180
tcaggagaga atgacacatg aatgggactg aagaagtaaa ctggtatctc atatgaagga 240
ccttttatat cttgttaagg attttgaact tcctcctttt tttttttttg agacagagtt 300
tctctctgtc acccaggctg aagtgcattg gcgtgatctc ggctcatggc agcctccacc 360
taccaggttc aagctattct cctgcctcag cttcccagat agctgggatt acagtcatgt 420
gccaccacgc cgggctaatt tttgtatttt tagtagagac agggtttcac cgtgttggcc 480
aggctggtct cgatttcctg acctcaagtg atctgcctgc cttggcctgc cccagtgccg 540
gaattacagg agtgagcccc cgcgcctggc ctggacttct gcttaaaggc aataaggaag 600
cctttactag atttaaaata ggagctcagt ttaaattagt aaggatttgt atttcatcaa 660
gagctctctt tggccctagt ctggtagaag atgagtcgaa gtagagagac tagttacaaa 720
gctgttccca ataatccagg tgaaaaatag tggtgaccct agattaaggt agtattggtg 780
tgggtaggga gaagtggaca gtcatatttg agaggtacct agggaataga attgcaaaga 840
cctgggagta gattggatat tcagtgggag gaagggagag aagtaatctc tcaagtgttg 900
ctcaagccat aaccttggat ggtactgtcc actgatacag taggaggaaa atgtttgagg 960
gaaagtagtg atgaatttgt ggtgcactaa catggccaac actaaatatt agaaagatta 1020
atgtggtcat gtagaagatg aatgaaaaga agatacctca gaagtggaga gatagttaaa 1080
tggcttttgt aggaatctca gctagaagtg tcagtattct taagtgcaga actaacaggt 1140
gtgggaaagt aatgggaagt agacaccaaa caaatagttc cccaaagatg gtatcaaata 1200
tcccagtgac agcttgcagc ctgctcagct ttatgatatg cccctgagat catttttcag 1260
gacaaaaagt agtgaaacta cctttattta cttctcaaat ttacctttat ttacttctca 1320
aatatacata gaaagtaata ttgtaaaaag cagctctggc tgggcgctgt ggctcaagcc 1380
tatagtccca gcactttggg aggctgaggg gggcagatga cttgaggtca agagttcaag 1440
accatcctgg ccaacatggc aaaaccgcat ttctactaaa aatacaaaaa ttcgcgtggc 1500
agcacgtgcc tgtaacccca gctactctgg aggctgaggt acaagagtcg cttgaatttg 1560
ggaggtggag actgcagcga gccgagatcc taccactgca ctccagcttg ggggacagtg 1620
cgagactctg tcttaaaaaa cagtggcctg gcgcactggc tcacgcttgt aatcccagca 1680
ctttgggagg ccgaggtggg cgggggtgga tcattgaggt caggagatca agaccatccc 1740
ggccaacgtg gtgcaacccc gtctctacta caaatacaaa aattagctgg acatggtggt 1800
gtacgcctgt agtcccagct actcgggaga gtaagacggg aatcgcttga acctgggagg 1860
tgggaggttg cagtgagcca agattgtgcc actgcactcc agcctggcga cagagcaaga 1920
ctgtcttaaa aaaaaaaaaa aaaaaaaaag gatattttca ctcttgggac ttgataaagc 1980
tagtttattt tgattatctc ctatatccta tacatattta attggcccct atgaacaatg 2040
ttacctcttt atgaggggac ccaaagaagt agctgctggt gtgagagtga gagatcatcc 2100
atctttttta ttgtgctttt tgttgtttct ttgtcctgct atgtgttata agtaaggccg 2160
ggcacggtgg ctcatgcctg taatcccagc acttagggag gccaaggcca gatccctgag 2220
gtcaagagtt tgagaccagc ctagccaaca tggtgaaacc ttgtctttac tgaaaataca 2280
aaaaaattag ctgggcaggg tggcatgcgc ctgtagtccc agctactcgc agaggctgag 2340
gcaggagaat tgcttgaacc tgggaggcgg aggttgcggt gagccaagat cctgccactg 2400
cactccagcc tgggcaacag agggagactc catctcaaaa aaaaaaaaaa aaaaa 2455
<210>20
<211>2455
<212>DNA
<213>智人(Homo sapiens)
<220>
<221>CDS
<222>(417)..(701)
<400>20
gttctaggta gtagaaagca aagggtgcta tgaagagcgt gtacacagac tcccaactgt 60
tttgggagtt aaggaaggtt tcttggagga agtggcattc aagctataag acctgatgat 120
caggtggagt tagctggaga gcagggacag agagaatagc ctgtgcaaaa ggcctattct 180
tcaggagaga atgacacatg aatgggactg aagaagtaaa ctggtatctc atatgaagga 240
ccttttatat cttgttaagg attttgaact tcctcctttt tttttttttg agacagagtt 300
tctctctgtc acccaggctg aagtgcattg gcgtgatctc ggctcatggc agcctccacc 360
taccaggttc aagctattct cctgcctcag cttcccagat agctgggatt acagtc atg 419
Met
1
tgc cac cac gcc ggg cta att ttt gta ttt tta gta gag aca ggg ttt 467
Cys His His Ala Gly Leu Ile Phe Val Phe Leu Val Glu Thr Gly Phe
5 10 15
cac cgt gtt ggc cag gct ggt ctc gat ttc ctg acc tca agt gat ctg 515
His Arg Val Gly Gln Ala Gly Leu Asp Phe Leu Thr Ser Ser Asp Leu
20 25 30
cct gcc ttg gcc tgc ccc agt gcc gga att aca gga gtg agc ccc cgc 563
Pro Ala Leu Ala Cys Pro Ser Ala Gly Ile Thr Gly Val Ser Pro Arg
35 40 45
gcc tgg cct gga ctt ctg ctt aaa ggc aat aag gaa gcc ttt act aga 611
Ala Trp Pro Gly Leu Leu Leu Lys Gly Asn Lys Glu Ala Phe Thr Arg
50 55 60 65
ttt aaa ata gga gct cag ttt aaa tta gta agg att tgt att tca tca 659
Phe Lys Ile Gly Ala Gln Phe Lys Leu Val Arg Ile Cys Ile Ser Ser
70 75 80
aga gct ctc ttt ggc cct agt ctg gta gaa gat gag tcg aag 701
Arg Ala Leu Phe Gly Pro Ser Leu Val Glu Asp Glu Ser Lys
85 90 95
tagagagact agttacaaag ctgttcccaa taatccaggt gaaaaatagt ggtgacccta 761
gattaaggta gtattggtgt gggtagggag aagtggacag tcatatttga gaggtaccta 821
gggaatagaa ttgcaaagac ctgggagtag attggatatt cagtgggagg aagggagaga 881
agtaatctct caagtgttgc tcaagccata accttggatg gtactgtcca ctgatacagt 941
aggaggaaaa tgtttgaggg aaagtagtga tgaatttgtg gtgcactaac atggccaaca 1001
ctaaatatta gaaagattaa tgtggtcatg tagaagatga atgaaaagaa gatacctcag 1061
aagtggagag atagttaaat ggcttttgta ggaatctcag ctagaagtgt cagtattctt 1121
aagtgcagaa ctaacaggtg tgggaaagta atgggaagta gacaccaaac aaatagttcc 1181
ccaaagatgg tatcaaatat cccagtgaca gcttgcagcc tgctcagctt tatgatatgc 1241
ccctgagatc atttttcagg acaaaaagta gtgaaactac ctttatttac ttctcaaatt 1301
tacctttatt tacttctcaa atatacatag aaagtaatat tgtaaaaagc agctctggct 1361
gggcgctgtg gctcaagcct atagtcccag cactttggga ggctgagggg ggcagatgac 1421
ttgaggtcaa gagttcaaga ccatcctggc caacatggca aaaccgcatt tctactaaaa 1481
atacaaaaat tcgcgtggca gcacgtgcct gtaaccccag ctactctgga ggctgaggta 1541
caagagtcgc ttgaatttgg gaggtggaga ctgcagcgag ccgagatcct accactgcac 1601
tccagcttgg gggacagtgc gagactctgt cttaaaaaac agtggcctgg cgcactggct 1661
cacgcttgta atcccagcac tttgggaggc cgaggtgggc gggggtggat cattgaggtc 1721
aggagatcaa gaccatcccg gccaacgtgg tgcaaccccg tctctactac aaatacaaaa 1781
attagctgga catggtggtg tacgcctgta gtcccagcta ctcgggagag taagacggga 1841
atcgcttgaa cctgggaggt gggaggttgc agtgagccaa gattgtgcca ctgcactcca 1901
gcctggcgac agagcaagac tgtcttaaaa aaaaaaaaaa aaaaaaaagg atattttcac 1961
tcttgggact tgataaagct agtttatttt gattatctcc tatatcctat acatatttaa 2021
ttggccccta tgaacaatgt tacctcttta tgaggggacc caaagaagta gctgctggtg 2081
tgagagtgag agatcatcca tcttttttat tgtgcttttt gttgtttctt tgtcctgcta 2141
tgtgttataa gtaaggccgg gcacggtggc tcatgcctgt aatcccagca cttagggagg 2201
ccaaggccag atccctgagg tcaagagttt gagaccagcc tagccaacat ggtgaaacct 2261
tgtctttact gaaaatacaa aaaaattagc tgggcagggt ggcatgcgcc tgtagtccca 2321
gctactcgca gaggctgagg caggagaatt gcttgaacct gggaggcgga ggttgcggtg 2381
agccaagatc ctgccactgc actccagcct gggcaacaga gggagactcc atctcaaaaa 2441
aaaaaaaaaa aaaa 2455
<210>21
<211>95
<212>PRT
<213>智人(Homo sapiens)
<400>21
Met Cys His His Ala Gly Leu Ile Phe Val Phe Leu Val Glu Thr Gly
1 5 10 15
Phe His Arg Val Gly Gln Ala Gly Leu Asp Phe Leu Thr Ser Ser Asp
20 25 30
Leu Pro Ala Leu Ala Cys Pro Ser Ala Gly Ile Thr Gly Val Ser Pro
35 40 45
Arg Ala Trp Pro Gly Leu Leu Leu Lys Gly Asn Lys Glu Ala Phe Thr
50 55 60
Arg Phe Lys Ile Gly Ala Gln Phe Lys Leu Val Arg Ile Cys Ile Ser
65 70 75 80
Ser Arg Ala Leu Phe Gly Pro Ser Leu Val Glu Asp Glu Ser Lys
85 90 95
<210>22
<211>2572
<212>DNA
<213>智人(Homo sapiens)
<400>22
gcggggtttc actatgttgg ccaggctggt ctagaactcc tgacctcaag tgatctgccc 60
gcctcggcct cccaaagtgc tgggattgca ggcgtgagac actgcacccg gacaattttc 120
cttttcttac aagaacactg ctcacactgc attcagggcc aaccctaacc cagtatcgcc 180
tcatcctggt ttgattatat cggcacagac cttgcttccg agcgaggcca ctttctcagg 240
tactggtgga catgagtctt cggagacgct gctcaaccca cagtgctcct ccagcttggt 300
ttctgtgact tgccttcccc agaggagggg tgccctgaga ggtctccact ccctgaccgg 360
ctccttggtg ccgcgcactc tgagaggctt cccagggaac agagcacaca ggaccgccct 420
cctgggtaga ccaatcagca tctgagctca caatttccca gcagggcagt ggggtggaga 480
gagaagcctg ggctgggctg ggctgggctg ggctggggaa gcttctccgg gcggggggac 540
gtcagagcag gatctggggc tgataaaagc ccgcccctgg gtgggggctg agtggtgcgg 600
aagctgagcc cgacacgtgg ggatggagga caggctgtgg gagggtgtga accggatact 660
gcttgaaggg gtgctgggga ctttgagaga gggcggctgg ccctgtctgg tcggggatgc 720
tggcccagac acaggccatg gctgggatgg ggttcagaaa caggaccgct gtctctcccg 780
ggccagggcc ctccccagct gctcctggct ttctggttct tggggtcagg ggcaggcctg 840
tgccatgacc ccgccactga ggctgtgagg aggctgtcgg tgcccaaggg caccaaggca 900
cacccctact cttgcacccc atgtgtgggc ccgagcacct gctctgctgc cccaaagatc 960
tggcgatgtt tcccaggcaa ctgtctctca cagcctgtct gcctggcact cccgtatccc 1020
ataaatgcca ccacatctgg ctatgggtgg gcgtgcctgc ctggcatcca cgggccagca 1080
ggtgtggtgg agcacagccc agttcctggc tgcgtcagaa ggctgcccgg gccttttggc 1140
tgtccttgcc agcaggtgag cactgccagg gcaccgtgtg tgggtgctgg gccatttagc 1200
cacatgggaa ggggtggagg cagcccagtg ccttcagcat gtgcccaggg tgcctgtcgg 1260
ccacaggtct catttggaaa ttgggagggt gcacggccac cgggctgctt aggcctgcca 1320
gcctcagggc ccgtcaccgc tgtcttagcc tgatttgcag ggtgtcaacg ctgggcagag 1380
atgaacattt gggtgactct gaggatgcca gtggctggga cacttgttct tccgcggtgg 1440
aaggagttgg agaggcctgg ctccctgacc tacggccagc ctggcttctg aaaccagctc 1500
agtgggctgg ggcctgattc atcatccata aatgtgtcct tttttgccac agagggtaag 1560
gggcctccta gcccaccggt ctgcaggtgc gggagtagga gatgggtggc tctgatgccc 1620
ccacccactc gatcaccttc tgctctgcct gggatgcaaa ctcccacagc tgaaacgttc 1680
ttttgtaaac atgaattttg gcttagaaaa aactcatttc cactgtgcac gtgtcagtcc 1740
caaccagaaa ttattttcca ataaagcaaa actccgtcac cacagcagca gatggctccg 1800
aagaagtgga gcgttttcat caggttcaac tttgaaacct ccaccatcac catcaccagc 1860
accgctgtgt catgctgata acttgaggac aggcaggaca aggccttctg gcggccgccc 1920
ctggtttctc ctggggggtg atgagcggga gcggctctgg gccgagctac tgcgcacggt 1980
gagcccggag ctgatcctgg atcacgaggt gccttcactg cccgccttcc caggacagga 2040
gcccaggtgc ggcccggagc ccactgaagt cttcactgtc ggacccaaga ccttttcctg 2100
gacacccttt ccgccggacc tgtggggccc gggccgttcc taccggctgc ttcacggggc 2160
aggagggcac ctggaatccc ccgccaggtc cctgccccag cgcccggcac ctgatccctg 2220
cagggccccc agggtggagc agcaaccgtc tgtggagggt gccgcggccc tgcgcaactg 2280
ccccatgtgc cagaaggagt ttgcccccag gctgacccag ctggatgttg acagccacct 2340
ggcccagtgc ttggccgaaa gcacaaaaaa cgtgacgtgg tgagcgccat ccaagagccc 2400
tgcgcagagt gcagcgcccg gacacgcttt cccccgccag cagccccgcc tctcggctcc 2460
cccgccaaca gccccgcctt tcggctcccc cgcatgggca ttaaaacagg gcgggctcct 2520
gtctgtctct gtgttgtgat gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 2572
<210>23
<211>2572
<212>DNA
<213>智人(Homo sapiens)
<220>
<221>CDS
<222>(1691)..(2380)
<400>23
gcggggtttc actatgttgg ccaggctggt ctagaactcc tgacctcaag tgatctgccc 60
gcctcggcct cccaaagtgc tgggattgca ggcgtgagac actgcacccg gacaattttc 120
cttttcttac aagaacactg ctcacactgc attcagggcc aaccctaacc cagtatcgcc 180
tcatcctggt ttgattatat cggcacagac cttgcttccg agcgaggcca ctttctcagg 240
tactggtgga catgagtctt cggagacgct gctcaaccca cagtgctcct ccagcttggt 300
ttctgtgact tgccttcccc agaggagggg tgccctgaga ggtctccact ccctgaccgg 360
ctccttggtg ccgcgcactc tgagaggctt cccagggaac agagcacaca ggaccgccct 420
cctgggtaga ccaatcagca tctgagctca caatttccca gcagggcagt ggggtggaga 480
gagaagcctg ggctgggctg ggctgggctg ggctggggaa gcttctccgg gcggggggac 540
gtcagagcag gatctggggc tgataaaagc ccgcccctgg gtgggggctg agtggtgcgg 600
aagctgagcc cgacacgtgg ggatggagga caggctgtgg gagggtgtga accggatact 660
gcttgaaggg gtgctgggga ctttgagaga gggcggctgg ccctgtctgg tcggggatgc 720
tggcccagac acaggccatg gctgggatgg ggttcagaaa caggaccgct gtctctcccg 780
ggccagggcc ctccccagct gctcctggct ttctggttct tggggtcagg ggcaggcctg 840
tgccatgacc ccgccactga ggctgtgagg aggctgtcgg tgcccaaggg caccaaggca 900
cacccctact cttgcacccc atgtgtgggc ccgagcacct gctctgctgc cccaaagatc 960
tggcgatgtt tcccaggcaa ctgtctctca cagcctgtct gcctggcact cccgtatccc 1020
ataaatgcca ccacatctgg ctatgggtgg gcgtgcctgc ctggcatcca cgggccagca 1080
ggtgtggtgg agcacagccc agttcctggc tgcgtcagaa ggctgcccgg gccttttggc 1140
tgtccttgcc agcaggtgag cactgccagg gcaccgtgtg tgggtgctgg gccatttagc 1200
cacatgggaa ggggtggagg cagcccagtg ccttcagcat gtgcccaggg tgcctgtcgg 1260
ccacaggtct catttggaaa ttgggagggt gcacggccac cgggctgctt aggcctgcca 1320
gcctcagggc ccgtcaccgc tgtcttagcc tgatttgcag ggtgtcaacg ctgggcagag 1380
atgaacattt gggtgactct gaggatgcca gtggctggga cacttgttct tccgcggtgg 1440
aaggagttgg agaggcctgg ctccctgacc tacggccagc ctggcttctg aaaccagctc 1500
agtgggctgg ggcctgattc atcatccata aatgtgtcct tttttgccac agagggtaag 1560
gggcctccta gcccaccggt ctgcaggtgc gggagtagga gatgggtggc tctgatgccc 1620
ccacccactc gatcaccttc tgctctgcct gggatgcaaa ctcccacagc tgaaacgttc 1680
ttttgtaaac atg aat ttt ggc tta gaa aaa act cat ttc cac tgt gca 1729
Met Asn Phe Gly Leu Glu Lys Thr His Phe His Cys Ala
1 5 10
cgt gtc agt ccc aac cag aaa tta ttt tcc aat aaa gca aaa ctc cgt 1777
Arg Val Ser Pro Asn Gln Lys Leu Phe Ser Asn Lys Ala Lys Leu Arg
15 20 25
cac cac agc agc aga tgg ctc cga aga agt gga gcg ttt tca tca ggt 1825
His His Ser Ser Arg Trp Leu Arg Arg Ser Gly Ala Phe Ser Ser Gly
30 35 40 45
tca act ttg aaa cct cca cca tca cca tca cca gca ccg ctg tgt cat 1873
Ser Thr Leu Lys Pro Pro Pro Ser Pro Ser Pro Ala Pro Leu Cys His
50 55 60
gct gat aac ttg agg aca ggc agg aca agg cct tct ggc ggc cgc ccc 1921
Ala Asp Asn Leu Arg Thr Gly Arg Thr Arg Pro Ser Gly Gly Arg Pro
65 70 75
tgg ttt ctc ctg ggg ggt gat gag cgg gag cgg ctc tgg gcc gag cta 1969
Trp Phe Leu Leu Gly Gly Asp Glu Arg Glu Arg Leu Trp Ala Glu Leu
80 85 90
ctg cgc acg gtg agc ccg gag ctg atc ctg gat cac gag gtg cct tca 2017
Leu Arg Thr Val Ser Pro Glu Leu Ile Leu Asp His Glu Val Pro Ser
95 100 105
ctg ccc gcc ttc cca gga cag gag ccc agg tgc ggc ccg gag ccc act 2065
Leu Pro Ala Phe Pro Gly Gln Glu Pro Arg Cys Gly Pro Glu Pro Thr
110 115 120 125
gaa gtc ttc act gtc gga ccc aag acc ttt tcc tgg aca ccc ttt ccg 2113
Glu Val Phe Thr Val Gly Pro Lys Thr Phe Ser Trp Thr Pro Phe Pro
130 135 140
ccg gac ctg tgg ggc ccg ggc cgt tcc tac cgg ctg ctt cac ggg gca 2161
Pro Asp Leu Trp Gly Pro Gly Arg Ser Tyr Arg Leu Leu His Gly Ala
145 150 155
gga ggg cac ctg gaa tcc ccc gcc agg tcc ctg ccc cag cgc ccg gca 2209
Gly Gly His Leu Glu Ser Pro Ala Arg Ser Leu Pro Gln Arg Pro Ala
160 165 170
cct gat ccc tgc agg gcc ccc agg gtg gag cag caa ccg tct gtg gag 2257
Pro Asp Pro Cys Arg Ala Pro Arg Val Glu Gln Gln Pro Ser Val Glu
175 180 185
ggt gcc gcg gcc ctg cgc aac tgc ccc atg tgc cag aag gag ttt gcc 2305
Gly Ala Ala Ala Leu Arg Asn Cys Pro Met Cys Gln Lys Glu Phe Ala
190 195 200 205
ccc agg ctg acc cag ctg gat gtt gac agc cac ctg gcc cag tgc ttg 2353
Pro Arg Leu Thr Gln Leu Asp Val Asp Ser His Leu Ala Gln Cys Leu
210 215 220
gcc gaa agc aca aaa aac gtg acg tgg tgagcgccat ccaagagccc 2400
Ala Glu Ser Thr Lys Asn Val Thr Trp
225 230
tgcgcagagt gcagcgcccg gacacgcttt cccccgccag cagccccgcc tctcggctcc 2460
cccgccaaca gccccgcctt tcggctcccc cgcatgggca ttaaaacagg gcgggctcct 2520
gtctgtctct gtgttgtgat gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 2572
<210>24
<211>230
<212>PRT
<213>智人(Homo sapiens)
<400>24
Met Asn Phe Gly Leu Glu Lys Thr His Phe His Cys Ala Arg Val Ser
1 5 10 15
Pro Asn Gln Lys Leu Phe Ser Asn Lys Ala Lys Leu Arg His His Ser
20 25 30
Ser Arg Trp Leu Arg Arg Ser Gly Ala Phe Ser Ser Gly Ser Thr Leu
35 40 45
Lys Pro Pro Pro Ser Pro Ser Pro Ala Pro Leu Cys His Ala Asp Asn
50 55 60
Leu Arg Thr Gly Arg Thr Arg Pro Ser Gly Gly Arg Pro Trp Phe Leu
65 70 75 80
Leu Gly Gly Asp Glu Arg Glu Arg Leu Trp Ala Glu Leu Leu Arg Thr
85 90 95
Val Ser Pro Glu Leu Ile Leu Asp His Glu Val Pro Ser Leu Pro Ala
100 105 110
Phe Pro Gly Gln Glu Pro Arg Cys Gly Pro Glu Pro Thr Glu Val Phe
115 120 125
Thr Val Gly Pro Lys Thr Phe Ser Trp Thr Pro Phe Pro Pro Asp Leu
130 135 140
Trp Gly Pro Gly Arg Ser Tyr Arg Leu Leu His Gly Ala Gly Gly His
145 150 155 160
Leu Glu Ser Pro Ala Arg Ser Leu Pro Gln Arg Pro Ala Pro Asp Pro
165 170 175
Cys Arg Ala Pro Arg Val Glu Gln Gln Pro Ser Val Glu Gly Ala Ala
180 185 190
Ala Leu Arg Asn Cys Pro Met Cys Gln Lys Glu Phe Ala Pro Arg Leu
195 200 205
Thr Gln Leu Asp Val Asp Ser His Leu Ala Gln Cys Leu Ala Glu Ser
210 215 220
Thr Lys Asn Val Thr Trp
225 230
Claims (6)
1.一种分离的多核苷酸,其特征在于,它包含一核苷酸序列,该核苷酸序列选自下组:
(a)编码具有抑癌功能的多肽的多核苷酸,所述的多肽具有选自下组的氨基酸序列:SEQ ID NO:3、6、9、12、15、18、21、24;
(b)与多核苷酸(a)互补的多核苷酸。
2.如权利要求1所述的多核苷酸,其特征在于,该多核苷酸编码的多肽具有选自下组的氨基酸序列:SEQ ID NO:3、6、9、12、15、18、21、24。
3.如权利要求1所述的多核苷酸,其特征在于,该多核苷酸的序列选自下组:
SEQ ID NO:2、5、8、11、14、17、20、23的编码区序列或全长序列。
4.一种载体,其特征在于,它含有权利要求1所述的多核苷酸。
5.一种遗传工程化的宿主细胞,其特征在于,它是选自下组的一种宿主细胞:
(a)用权利要求4所述的载体转化或转导的宿主细胞;
(b)用权利要求1所述的多核苷酸转化或转导的宿主细胞。
6.一种抑制癌细胞克隆形成的组合物,其特征在于,它含有安全有效量的权利要求1所述的多核苷酸,以及可接受的载体。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 01145279 CN1199995C (zh) | 2001-12-30 | 2001-12-30 | 具有抑癌功能的新的人蛋白及其编码序列 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 01145279 CN1199995C (zh) | 2001-12-30 | 2001-12-30 | 具有抑癌功能的新的人蛋白及其编码序列 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1429839A CN1429839A (zh) | 2003-07-16 |
CN1199995C true CN1199995C (zh) | 2005-05-04 |
Family
ID=4678108
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 01145279 Expired - Fee Related CN1199995C (zh) | 2001-12-30 | 2001-12-30 | 具有抑癌功能的新的人蛋白及其编码序列 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1199995C (zh) |
-
2001
- 2001-12-30 CN CN 01145279 patent/CN1199995C/zh not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
CN1429839A (zh) | 2003-07-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1199995C (zh) | 具有抑癌功能的新的人蛋白及其编码序列 | |
CN1177864C (zh) | 在肝癌组织中具有表达差异的新的人蛋白及其编码序列 | |
CN1199997C (zh) | 具有促进小鼠nih/3t3细胞转化功能的新的人蛋白及其编码序列 | |
CN1155615C (zh) | 具有抑制癌细胞生长功能的新的人蛋白及其编码序列 | |
CN1169954C (zh) | 编码具有抑制癌细胞生长功能的人蛋白的多核苷酸 | |
CN1177048C (zh) | 编码具有抑制癌细胞生长功能的人蛋白的多核苷酸 | |
CN1169833C (zh) | 具有抑癌功能的新的人蛋白及其编码序列 | |
CN1932016A (zh) | 影响sre活性的多核苷酸及其编码多肽和用途 | |
CN1231496C (zh) | 具有抑制癌细胞生长功能的新的人蛋白及其编码序列 | |
CN1199998C (zh) | 具有抑制癌细胞生长功能的新的人蛋白及其编码序列 | |
CN1177049C (zh) | 编码具有抑制癌细胞生长功能的人蛋白的多核苷酸 | |
CN1222616C (zh) | 具有抑癌功能的新的人蛋白及其编码序列 | |
CN1209373C (zh) | 具有抑制癌细胞生长功能的新的人蛋白及其编码序列 | |
CN1177047C (zh) | 编码具有抑癌功能的人蛋白的多核苷酸 | |
CN1177050C (zh) | 编码具有抑制癌细胞生长功能的人蛋白的多核苷酸 | |
CN1246457C (zh) | 人tsc403基因和人ing1l基因 | |
CN1199999C (zh) | 具有促进3t3细胞转化功能的新的人蛋白及其编码序列 | |
CN1229386C (zh) | 具有抑癌功能的新的人蛋白及其编码序列 | |
CN1230445C (zh) | 具有促进小鼠nih/3t3细胞转化功能的新的人蛋白及其编码序列 | |
CN1169956C (zh) | 编码具有抑制癌细胞生长功能的人蛋白的多核苷酸 | |
CN1231497C (zh) | 具有促进小鼠nih/3t3细胞转化功能的新的人蛋白及其编码序列 | |
CN1155616C (zh) | 具有促进癌细胞生长功能的新的人蛋白及其编码序列 | |
CN1170929C (zh) | 编码具有抑制癌细胞生长功能的人蛋白的多核苷酸 | |
CN1205225C (zh) | 具有抑癌功能的新的人蛋白及其编码序列 | |
CN1169955C (zh) | 编码具有抑制癌细胞生长功能的人蛋白的多核苷酸 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C19 | Lapse of patent right due to non-payment of the annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |