CN117343957A - 一种表达重组人源化胶原蛋白的工程细胞构建方法 - Google Patents

一种表达重组人源化胶原蛋白的工程细胞构建方法 Download PDF

Info

Publication number
CN117343957A
CN117343957A CN202210783483.1A CN202210783483A CN117343957A CN 117343957 A CN117343957 A CN 117343957A CN 202210783483 A CN202210783483 A CN 202210783483A CN 117343957 A CN117343957 A CN 117343957A
Authority
CN
China
Prior art keywords
gly
pro
ala
glu
lys
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210783483.1A
Other languages
English (en)
Inventor
尹鸿萍
张军华
卢春阳
刘永军
乐龙
张健
丁志英
冯瑞茹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Dongwan Biotechnology Co ltd
Original Assignee
Nanjing Dongwan Biotechnology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Dongwan Biotechnology Co ltd filed Critical Nanjing Dongwan Biotechnology Co ltd
Priority to CN202210783483.1A priority Critical patent/CN117343957A/zh
Publication of CN117343957A publication Critical patent/CN117343957A/zh
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/78Connective tissue peptides, e.g. collagen, elastin, laminin, fibronectin, vitronectin or cold insoluble globulin [CIG]
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/113Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
    • C12N15/1137Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing against enzymes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N5/00Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
    • C12N5/06Animal cells or tissues; Human cells or tissues
    • C12N5/0602Vertebrate cells
    • C12N5/0684Cells of the urinary tract or kidneys
    • C12N5/0686Kidney cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/90Isomerases (5.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y503/00Intramolecular oxidoreductases (5.3)
    • C12Y503/04Intramolecular oxidoreductases (5.3) transposing S-S bonds (5.3.4)
    • C12Y503/04001Protein disulfide-isomerase (5.3.4.1), i.e. disufide bond-forming enzyme
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2310/00Structure or type of the nucleic acid
    • C12N2310/10Type of nucleic acid
    • C12N2310/20Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2510/00Genetically modified cells
    • C12N2510/02Cells for production
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/10Plasmid DNA
    • C12N2800/106Plasmid DNA for vertebrates
    • C12N2800/107Plasmid DNA for vertebrates for mammalian
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y304/00Hydrolases acting on peptide bonds, i.e. peptidases (3.4)
    • C12Y304/24Metalloendopeptidases (3.4.24)
    • C12Y304/24014Procollagen N-endopeptidase (3.4.24.14)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y304/00Hydrolases acting on peptide bonds, i.e. peptidases (3.4)
    • C12Y304/24Metalloendopeptidases (3.4.24)
    • C12Y304/24019Procollagen C-endopeptidase (3.4.24.19)

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Organic Chemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Wood Science & Technology (AREA)
  • Biomedical Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Biotechnology (AREA)
  • Biochemistry (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Microbiology (AREA)
  • Medicinal Chemistry (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Toxicology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Urology & Nephrology (AREA)
  • Virology (AREA)
  • Cell Biology (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)

Abstract

本发明属于细胞工程领域,更具体地,涉及一种表达重组人源化胶原蛋白的工程细胞构建方法。通过上述方法制备的工程细胞,能够表达具有高级结构的胶原蛋白,即天然三螺旋结构的胶原蛋白。相比原核细胞及毕赤酵母发酵表达体系,HEK293细胞属于人源细胞系,可以准确表达目的蛋白,所得重组蛋白更有可能具有与内源性人类蛋白一致的翻译后修饰,且应用与人体不会出现严重的免疫排异反应。

Description

一种表达重组人源化胶原蛋白的工程细胞构建方法
技术领域
本发明属于细胞工程领域,更具体地,涉及一种表达重组人源化胶原蛋白的工程细胞构建方法。
背景技术
胶原蛋白是生物高分子,动物结缔组织中的主要成分,也是哺乳动物体内含量最多、分布最广的功能性蛋白,占蛋白质总量的25%~30%,某些生物体甚至高达80%以上。胶原蛋白是一类蛋白质家族,已至少发现了30余种胶原蛋白链的编码基因,可以形成16种以上的胶原蛋白分子,根据其结构,可以分为纤维胶原、基膜胶原、微纤维胶原、锚定胶原、六边网状胶原、非纤维胶原、跨膜胶原等。根据它们在体内的分布和功能特点,可以将胶原分成间质胶原、基底膜胶原和细胞外周胶原。间质型胶原蛋白分子占整个机体胶原的绝大部分,包括Ⅰ、Ⅱ、Ⅲ型胶原蛋白分子,Ⅰ型胶原蛋白主要分布于皮肤、肌腱等组织,也是水产品加工废弃物(皮、骨和鳞)含量最多的蛋白质,占全部胶原蛋白含量的80%~90%左右,在医学上的应用最为广泛。Ⅰ型胶原在鱼类胶原中一个最显著的的特点是热稳定性比较低,并呈现有鱼种的特异性。Ⅱ型胶原蛋白由软骨细胞产生;基底膜胶原蛋白通常是指Ⅳ型胶原蛋白,其主要分布于基底膜;细胞外周胶原蛋白通常中指Ⅴ型胶原蛋白,在结缔组织中大量存在。按功能,可将胶原分为两组,第一组是成纤维胶原,包括第Ⅰ、Ⅱ、Ⅲ、Ⅺ、ⅩⅩⅣ和ⅩⅩⅦ型胶原;其余是第二组,非成纤维胶原。非成纤维胶原的α-链既含有三螺旋域(胶原域,COL),还含有非三螺旋域(非胶原域,NC),其中成纤维胶原约占胶原总数的90%。
目前,胶原蛋白根据来源可以分为两大类,即组织提取胶原蛋白以及生物发酵胶原蛋白。组织提取胶原蛋白即为从各类动物组织提取的胶原蛋白分子,但因其外源性以及其他病原微生物等风险,其应用受到了非常大的限制。生物发酵胶原蛋白即通过工程细胞发酵获得胶原蛋白,也叫重组胶原蛋白,目前市售所有类型的重组胶原蛋白都是短链胶原蛋白肽,其虽无外源性风险,但其生物学性能也大打折扣。同时,由于目前已有重组胶原蛋白均为大肠杆菌或酵母菌发酵表达,其不具备完整的哺乳动物细胞完整的胶原蛋白发酵表达体系,因此这类产物无高级结构。
因此,现有技术需求一种哺乳动物细胞的胶原蛋白发酵表达体系,能够完整复刻生物体内具有高级结构的胶原蛋白的合成,从而获得具备体内相同高级结构的胶原蛋白。
发明内容
有鉴于此,本发明提供一种表达重组人源化胶原蛋白的工程细胞构建方法,包括:
失活HEK 293细胞或CHO细胞中的Procollagen N-endopeptidase、ProcollagenC-endopeptidase基因;
将PDI和Bip基因导入上述失活后的HEK 293细胞或CHO细胞中共表达;将胶原蛋白基因导入上述失活后的HEK 293细胞或CHO细胞;以及
筛选获得携带有胶原蛋白基因的工程细胞。
通过上述方法制备的工程细胞,能够表达具有高级结构的胶原蛋白,即天然三螺旋结构的胶原蛋白。相比原核细胞及毕赤酵母发酵表达体系,HEK293细胞属于人源细胞系,可以准确表达目的蛋白,所得重组蛋白更有可能具有与内源性人类蛋白一致的翻译后修饰,且应用与人体不会出现严重的免疫排异反应;CHO细胞是目前生物制药领域应用广泛的生产细胞,它具有良好的类似人源细胞中准确的翻译后加工修饰作用和蛋白表达稳定整合功能,同时其内源性蛋白分泌较少,可生产出安全性高、低毒性、低免疫原性的目的蛋白。这两种细胞产生的目的蛋白可直接分泌至培养基中,便于收集纯化,更有利于进行大规模的培养和生产。
在一些具体的实施方案中,一种表达重组人源化胶原蛋白的工程细胞构建方法,包括:
S1:失活HEK 293细胞或CHO细胞中的Procollagen N-endopeptidase、Procollagen C-endopeptidase基因;
S2:将PDI和Bip基因导入经步骤S1获得的HEK 293细胞或CHO细胞中共表达;
S3:将胶原蛋白基因导入经步骤S2获得的HEK 293细胞或CHO细胞;以及
S4:筛选获得携带有胶原蛋白基因的工程细胞。
在一些具体的实施方案中,所述“失活”是指使得目标基因不表达。例如,通过基因敲除,或者siRNA。更具体地,所述基因敲除通过基因编辑技术实现。
在一些具体的实施方案中,所述基因敲除通过基因编辑技术实现的步骤包括:
1、针对HEK 293细胞或CHO细胞设计靶向Procollagen N-endopeptidase、Procollagen C-endopeptidase基因的sgRNA;
2、将步骤1得到的成对sgRNA退火、配对,获得带有粘性末端的双链DNA片段;
3、将步骤2得到的双链DNA片段与酶切后的载体链接,获得重组表达载体;
4、将步骤3得到的重组表达载体转染HEK 293细胞或者CHO细胞,培养,获得基因敲除的靶标细胞。
在一些具体的实施方案中,步骤1所得的靶向Procollagen N-endopeptidase基因的sgRNA如SEQ ID NO:1~3所示;靶向Procollagen C-endopeptidase基因的sgRNA如SEQID NO:4~6所示。
在一些具体的实施方案中,所述步骤1得到的成对sgRNA是SEQ ID NO:1~3中的任意两个。
在一些具体的实施方案中,所述步骤1得到的成对sgRNA是SEQ ID NO:4~6中的任意两个。
在一些具体的实施方案中,所述PDI基因的蛋白序列如SEQ ID NO:7或8所示,其核苷酸序列如SEQ ID NO:9或10所示
在一些具体的实施方案中,所述Bip基因的蛋白序列如SEQ ID NO:11所示,其核苷酸序列如SEQ ID NO:12所示。
所述胶原蛋白基因导入HEK 293细胞或CHO细胞的步骤包括:
将载体去除抗生素基因,并导入缺陷型筛选基因,然后将胶原蛋白基因导入载体,获得携带有胶原蛋白基因的载体;
将上述携带有胶原蛋白基因的载体导入HEK 293细胞或CHO细胞。
进一步地,所述载体为pcDNA3.1质粒。
进一步地,所述缺陷型筛选基因为GS缺陷基因。
进一步地,所述胶原蛋白基因编码Ⅰ型、Ⅱ型、Ⅲ型、Ⅴ型和X型胶原蛋白。
进一步地,所述胶原蛋白基因编码I型胶原蛋白。进一步地,所述胶原蛋白基因编码的I型胶原蛋白具备三螺旋结构。
进一步地,所述胶原蛋白基因编码Ⅱ型胶原蛋白。进一步地,所述胶原蛋白基因编码的Ⅱ型胶原蛋白具备三螺旋结构。
进一步地,所述胶原蛋白基因编码Ⅲ型胶原蛋白。进一步地,所述胶原蛋白基因编码的Ⅲ型胶原蛋白具备三螺旋结构。
进一步地,所述胶原蛋白基因编码Ⅴ型胶原蛋白。进一步地,所述胶原蛋白基因编码的Ⅴ型胶原蛋白具备三螺旋结构。
进一步地,所述胶原蛋白基因编码X型胶原蛋白。进一步地,所述胶原蛋白基因编码的Ⅺ型胶原蛋白具备三螺旋结构。
在一些具体的实施方案中,其三螺旋结构的蛋白占整个表达蛋白的90%以上。
在一些具体的实施方案中,其三螺旋结构的蛋白占整个表达蛋白的100%。
进一步地,所述胶原蛋白基因的核苷酸序列如SEQ ID NO:13~16所示,其编码的蛋白序列如SEQ ID NO:17~20所示。
在一些具体的实施方案中,所述筛选获得携带有胶原蛋白基因的工程细胞包括:根据载体构建中的缺陷型筛选基因,选择适宜的培养基,进行单克隆筛选。
又一方面,提供一种表达重组人源化胶原蛋白的工程细胞,其根据上述工程细胞的构建方法获得。
又一方面,提供一种重组人源化胶原蛋白,其由上述的工程细胞表达纯化而获得。
附图说明
图1为重组人源化胶原蛋白色谱试验图;
图2为人胶原HPLC图谱;
图3为重组人源化胶原蛋白HPLC图谱;
图4为重组人源化胶原蛋白扫描电镜图;
图5为Procollagen N-endopeptidase、Procollagen C-endopeptidase敲除/失活重组胶原蛋白还原电泳图;
图6为PDI,Bip导入,共表达还原电泳及非还原电泳图。
具体实施方式
下文将结合具体实施方案和实施例,具体阐述本发明,本发明的优点和各种效果将由此更加清楚地呈现。本领域技术人员应理解,这些具体实施方案和实施例是用于说明本发明,而非限制本发明。
实施例1、本发明所使用的各种载体的构建
胶原蛋白基因载体构建:
将pcDNA3.1质粒去除抗生素基因,并导入缺陷型筛选基因,然后将人源Ⅰ型胶原蛋白基因(或II型、III型、V型、XI型)导入质粒,质粒名称为pcDNA_DW。
缺陷型筛选基因:
MTTSASSHLNKGIKQVYMSLPQGEKVQAMYIWIDGTGEGLRCKTRTLDSEPKCVEELPEWNFDGSSTLQSEGSNSDMYLVPAAMFRDPFRKDPNKLVLCEVFKYNRRPAETNLRHTCKRIMDMVSNQHPWFGMEQEYTLMGTDGHPFGWPSNGFPGPQGPYYCGVGADRAYGRDIVEAHYRACLYAGVKIAGTNAEVMPAQWEFQIGPCEGISMGDHLWVARFILHRVCEDFGVIATFDPKPIPGNWNGAGCHTNFSTKAMREENGLKYIEEAIEKLSKRHQYHIRAYDPKGGLDNARRLTGFHETSNINDFSAGVANRSASIRIPRTVGQEKKGYFEDRRPSANCDPFSVTEALIRTCLLNETGDEPFQYK
失活HEK 293细胞的载体构建:
针对HEK 293细胞基因设计靶向Procollagen N-endopeptidase、Procollagen C-endopeptidase基因的sgRNA的靶向识别区域;
将得到的成对sgRNA退火、配对,获得带有粘性末端的双链DNA片段;
将得到的双链DNA片段与酶切后的载体链接,获得重组表达载体。
sgRNA序列:
N:
sgRNA1:atttgtggtggctcacgaga(SEQ ID NO:1)
sgRNA2:tcacgagacgggccatgtgc(SEQ ID NO:2)
sgRNA3:gacgggccatgtgctgggca(SEQ ID NO:3);
C:
sgRNA1:tggcatcgtggtccatgagc(SEQ ID NO:4)
sgRNA2:tgggccatgtcattggcttc(SEQ ID NO:5)
sgRNA3:catgagctgggccatgtcat(SEQ ID NO:6);
重组表达载体序列(SEQ ID NO:21):
GACGGATCGGGAGATCTCCCGATCCCCTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGTATCTGCTCCCTGCTTGTGTGTTGGAGGTCGCTGAGTAGTGCGCGAGCAAAATTTAAGCTACAACAAGGCAAGGCTTGACCGACAATTGCATGAAGAATCTGCTTAGGGTTAGGCGTTTTGCGCTGCTTCGCGATGTACGGGCCAGATATACGCGTTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCTCTGGCTAACTAGAGAACCCACTGCTTACTGGCTTATCGAAATTAATACGACTCACTATAGGGAGACCCAAGCTGGCTAGCGTTTAAACTTAAGCTTGGTACCGAGCTCGGATCCACTAGTCCAGTGTGGTGGAATTCatgttcagctttgtggacctccggctcctgctcctcttagcggccaccgccctcctgacgcacggccaagaggaaggccaagtcgagggccaagacgaagacatcccaccaatcacctgcgtacagaacggcctcaggtaccatgaccgagacgtgtggaaacccgagccctgccggatctgcgtctgcgacaacggcaaggtgttgtgcgatgacgtgatctgtgacgagaccaagaactgccccggcgccgaagtccccgagggcgagtgctgtcccgtctgccccgacggctcagagtcacccaccgaccaagaaaccaccggcgtcgagggacccaagggagacactggcccccgaggcccaaggggacccgcaggcccccctggccgagatggcatccctggacagcctggacttcccggaccccccggaccccccggacctcccggaccccctggcctcggaggaaactttgctccccagctgtcttatggctatgatgagaaatcaaccggaggaatttccgtgcctggccccatgggtccctctggtcctcgtggtctccctggcccccctggtgcacctggtccccaaggcttccaaggtccccctggtgagcctggcgagcctggagcttcaggtcccatgggtccccgaggtcccccaggtccccctggaaagaatggagatgatggggaagctggaaaacctggtcgtcctggtgagcgtgggcctcctgggcctcagggtgctcgaggattgcccggaacagctggcctccctggaatgaagggacacagaggtttcagtggtttggatggtgccaagggagatgctggtcctgctggtcctaagggtgagcctggcagccctggtgaaaatggagctcctggtcagatgggcccccgtggcctgcctggtgagagaggtcgccctggagcccctggccctgctggtgctcgtggaaatgatggtgctactggtgctgccgggccccctggtcccaccggccccgctggtcctcctggcttccctggtgctgttggtgctaagggtgaagctggtccccaagggccccgaggctctgaaggtccccagggtgtgcgtggtgagcctggcccccctggccctgctggtgctgctggccctgctggaaaccctggtgctgatggacagcctggtgctaaaggtgccaatggtgctcctggtattgctggtgctcctggcttccctggtgcccgaggcccctctggaccccagggccccggcggccctcctggtcccaagggtaacagcggtgaacctggtgctcctggcagcaaaggagacactggtgctaagggagagcctggccctgttggtgttcaaggaccccctggccctgctggagaggaaggaaagcgaggagctcgaggtgaacccggacccactggcctgcccggaccccctggcgagcgtggtggacctggtagccgtggtttccctggcgcagatggtgttgctggtcccaagggtcccgctggtgaacgtggttctcctggccctgctggccccaaaggatctcctggtgaagctggtcgtcccggtgaagctggtctgcctggtgccaagggtctgactggaagccctggcagccctggtcctgatggcaaaactggcccccctggtcccgccggtcaagatggtcgccccggacccccaggcccacctggtgcccgtggtcaggctggtgtgatgggattccctggacctaaaggtgctgctggagagcccggcaaggctggagagcgaggtgttcccggaccccctggcgctgtcggtcctgctggcaaagatggagaggctggagctcagggaccccctggccctgctggtcccgctggcgagagaggtgaacaaggccctgctggctcccccggattccagggtctccctggtcctgctggtcctccaggtgaagcaggcaaacctggtgaacagggtgttcctggagaccttggcgcccctggcccctctggagcaagaggcgagagaggtttccctggcgagcgtggtgtgcaaggtccccctggtcctgctggtccccgaggggccaacggtgctcccggcaacgatggtgctaagggtgatgctggtgcccctggagctcccggtagccagggcgcccctggccttcagggaatgcctggtgaacgtggtgcagctggtcttccagggcctaagggtgacagaggtgatgctggtcccaaaggtgctgatggctctcctggcaaagatggcgtccgtggtctgactggccccattggtcctcctggccctgctggtgcccctggtgacaagggtgaaagtggtcccagcggccctgctggtcccactggagctcgtggtgcccccggagaccgtggtgagcctggtccccccggccctgctggctttgctggcccccctggtgctgacggccaacctggtgctaaaggcgaacctggtgatgctggtgctaaaggcgatgctggtccccctggccctgccggacccgctggaccccctggccccattggtaatgttggtgctcctggagccaaaggtgctcgcggcagcgctggtccccctggtgctactggtttccctggtgctgctggccgagtcggtcctcctggcccctctggaaatgctggaccccctggccctcctggtcctgctggcaaagaaggcggcaaaggtccccgtggtgagactggccctgctggacgtcctggtgaagttggtccccctggtccccctggccctgctggcgagaaaggatcccctggtgctgatggtcctgctggtgctcctggtactcccgggcctcaaggtattgctggacagcgtggtgtggtcggcctgcctggtcagagaggagagagaggcttccctggtcttcctggcccctctggtgaacctggcaaacaaggtccctctggagcaagtggtgaacgtggtccccctggtcccatgggcccccctggattggctggaccccctggtgaatctggacgtgagggggctcctggtgccgaaggttcccctggacgagacggttctcctggcgccaagggtgaccgtggtgagaccggccccgctggaccccctggtgctcctggtgctcctggtgcccctggccccgttggccctgctggcaagagtggtgatcgtggtgagactggtcctgctggtcccgccggtcctgtcggccctgttggcgcccgtggccccgccggaccccaaggcccccgtggtgacaagggtgagacaggcgaacagggcgacagaggcataaagggtcaccgtggcttctctggcctccagggtccccctggccctcctggctctcctggtgaacaaggtccctctggagcctctggtcctgctggtccccgaggtccccctggctctgctggtgctcctggcaaagatggactcaacggtctccctggccccattgggccccctggtcctcgcggtcgcactggtgatgctggtcctgttggtccccccggccctcctggacctcctggtccccctggtcctcccagcgctggtttcgacttcagcttcctgccccagccacctcaagagaaggctcacgatggtggccgctactaccgggctgatgatgccaatgtggttcgtgaccgtgacctcgaggtggacaccaccctcaagagcctgagccagcagatcgagaacatccggagcccagagggcagccgcaagaaccccgcccgcacctgccgtgacctcaagatgtgccactctgactggaagagtggagagtactggattgaccccaaccaaggctgcaacctggatgccatcaaagtcttctgcaacatggagactggtgagacctgcgtgtaccccactcagcccagtgtggcccagaagaactggtacatcagcaagaaccccaaggacaagaggcatgtctggttcggcgagagcatgaccgatggattccagttcgagtatggcggccagggctccgaccctgccgatgtggccatccagctgaccttcctgcgcctgatgtccaccgaggcctcccagaacatcacctaccactgcaagaacagcgtggcctacatggaccagcagactggcaacctcaagaaggccctgctcctccagggctccaacgagatcgagatccgcgccgagggcaacagccgcttcacctacagcgtcactgtcgatggctgcacgagtcacaccggagcctggggcaagacagtgattgaatacaaaaccaccaagacctcccgcctgcccatcatcgatgtggcccccttggacgttggtgccccagaccaggaattcggcttcgacgttggccctgtctgcttcctgtaaTGCAGATATCCAGCACAGTGGCGGCCGCTCGAGTCTAGAGGGCCCGTTTAAACCCGCTGATCAGCCTCGACTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCTTCCTTGACCCTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAATTGCATCGCATTGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCAAGGGGGAGGATTGGGAAGACAATAGCAGGCATGCTGGGGATGCGGTGGGCTCTATGGCTTCTGAGGCGGAAAGAACCAGCTGGGGCTCTAGGGGGTATCCCCACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTAATTCTGTGGAATGTGTGTCAGTTAGGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTAGTCAGCAACCAGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGACTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCTGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTCCCGGGAGCTTGTATATCCATTTTCGGATCTGATCAAGAGACAGGATGAGGATCGTTTCGCATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCCGGTGCCCTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAGGGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTATCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGGGCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTCTTCTGAGCGGGACTCTGGGGTTCGAAATGACCGACCAAGCGACGCCCAACCTGCCATCACGAGATTTCGATTCCACCGCCGCCTTCTATGAAAGGTTGGGCTTCGGAATCGTTTTCCGGGACGCCGGCTGGATGATCCTCCAGCGCGGGGATCTCATGCTGGAGTTCTTCGCCCACCCCAACTTGTTTATTGCAGCTTATAATGGTTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGCATTTTTTTCACTGCATTCTAGTTGTGGTTTGTCCAAACTCATCAATGTATCTTATCATGTCTGTATACCGTCGACCTCTAGCTAGAGCTTGGCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACAACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGAACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTC
实施例2、工程细胞的构建
1、通过实施例1制备的失活HEK 293细胞的载体使得293细胞中的Procollagen N-endopeptidase、Procollagen C-endopeptidase基因不表达,获得失活HEK 293细胞;
2、将实施例1制备的胶原蛋白基因载体导入上述失活HEK 293细胞;
3、将上述细胞放置在GS缺陷培养基中进行培养,筛选单克隆,获得目的工程细胞。
实施例3、胶原蛋白的制备
扩增表达鉴定:(1)取状态良好的细胞进行传代发酵。1*106个/mL密度的时候进行传代至32瓶摇瓶,每个摇瓶细胞密度控制在2*105-5*105个/mL,进行培养发酵蛋白。
(2)细胞传代后3-4天可达到最高密度(对数生长期)。继续培养至5-7天后细胞活力会下降至80%左右,此时需要收集发酵液。
(3)取相应个数的50mL离心管,把每个摇瓶里的细胞发酵液分装到离心管。
(4)根据吊杯中离心管孔数将离心管配平,低温高速离心机中水平转子离心,离心条件:500-3500rpm 10-60min 4℃;结束后取50mL离心管收集上清,重新配平,于低温高速离心机角转子离心,离心条件:5000-12000rpm/min 4℃离心10-60min。
(5)两次离心后弃去细胞沉淀,收集发酵液上清,-20℃冰箱保存。
使用SDS-PAGE电泳对切向流浓缩后的样品进行纯度和杂质的检测,电泳分离胶浓度选择为6%或8%,浓缩胶浓度选择为5%。
分离纯化:将发酵产物浓缩1-50倍后,过0.22um滤膜,加入NaCl使其终浓度为1-5M,高速离心,取沉淀;将沉淀用纯化水复溶后,用胃蛋白酶或胰蛋白水解;将水解产物体系pH值调至大于6.8,再次离心,保留沉淀。
实施例4、胶原蛋白的表征
将实施例3中获得沉淀用大量水清洗后,用1-3%的醋酸复溶,复溶后冻干,对冻干产物进行CD色谱表征,具体结果如图1所示。
从图中可以看出,在远紫外区域,胶原蛋白的CD光谱中,在222nm处具有最大正峰值,在196nm处具有最大负峰值,并且交叉点在约214nm处。正峰与负峰的比率(Rpn)是三螺旋结构构象所独有的,可用于鉴定三螺旋构象。有报道指出,具有三螺旋结构的胶原蛋白的Rpn范围为0.09~0.15。以摩尔椭圆率为纵坐标(1dmol=0.1mol),色谱图符合胶原蛋白的特征,其Rpn值为0.1312,表明Ⅰ型胶原蛋白三螺旋结构的完整性。
将实施例3中获得沉淀用大量水清洗后,用1-3%的醋酸复溶,复溶后冻干,将冻干产物配制成5mg/ml的溶液,取人I型胶原蛋白标准品,配置成5mg/ml的溶液,进行HPLC试验,具体结果如图2~3所示(图2为人胶原图谱;图3为重组I型胶原图谱)。从图中可以据面积归一化法,计算重组人源化胶原蛋白纯度为90%以上。
将实施例3中获得沉淀用大量水清洗后,用1-3%的醋酸复溶,复溶后冻干,将冻干产物配制成5mg/ml的溶液,进行扫描电镜试验,结果如图4所示。从图中可以看出,不同放大倍数下,可以观察到重组人源化胶原蛋白在溶液中以“发丝”状纤维,其纤维的粗细不一,和理论分泌出的胶原蛋白相比,更加“粗壮”。在现有的分辨率和检测下对这种现象可能的解释是,胶原蛋白是纤维状的,在浓缩,复溶等操作下,纤维类蛋白之间会强迫靠近,靠近之后纤维相互交联,形成更加粗壮的纤维,粗壮的纤维也会继续交联形成网状结构。
实施例5、对比工程细胞的蛋白表达
首先构建了未敲除Procollagen N-endopeptidase、Procollagen C-endopeptidase基因的工程细胞,对其表达的重组蛋白与本申请实施例2所构建的工程细胞的蛋白进行产量对比。实验结果如图5所示(A:MARK;B:敲除/失活Procollagen N-endopeptidase;C:未敲除/失活Procollagen N-endopeptidase、Procollagen C-endopeptidase;D:敲除/失活Procollagen C-endopeptidase;E:敲除/失活ProcollagenN-endopeptidase、Procollagen C-endopeptidase),从图中明显可以看出,未敲除的工程细胞,几乎不表达重组蛋白,而敲除的工程细胞表达大量的重组蛋白。
其次,构建了未导入PDI、Bip,仅导入PDI,以及仅导入Bip基因的工程细胞,对其表达的重组蛋白与本申请实施例2所构建的工程细胞的蛋白进行还原电泳及非还原电泳。实验结果如图6所示(A:MARK;B:PDI、Bip导入共表达还原电泳;C:单独导入PDI非还原电泳;D:单独导入BIP非还原电泳;E:未导入PDI、BIP非还原电泳;F:PDI、Bip导入共表达非还原电泳),从图中可以看出,由于重组胶原蛋白分子量为390kd,但其三条单链全都形成高级结构(三螺旋结构)时,其非还原电泳因分子量过大而不能在胶上显示斑点,如图6所示,当共表达PDI、BIP时,其还原电泳显示有条带,非还原电泳没有条带,标示此时所有单链蛋白均形成高级结构;而单独导入PDI或BIP时,其形成高级结构的单链蛋白相对较少,当未导入PDI、BIP时,其单链蛋白量明显高于单独导入。故,当PDI、BIP共表达时,能帮助其高级结构的形成。
序列表
<110> 南京东万生物技术有限公司
<120> 一种表达重组人源化胶原蛋白的工程细胞构建方法
<160> 21
<170> PatentIn version 3.5
<210> 1
<211> 20
<212> DNA
<213> 人工序列
<400> 1
atttgtggtg gctcacgaga 20
<210> 2
<211> 20
<212> DNA
<213> 人工序列
<400> 2
tcacgagacg ggccatgtgc 20
<210> 3
<211> 20
<212> DNA
<213> 人工序列
<400> 3
gacgggccat gtgctgggca 20
<210> 4
<211> 20
<212> DNA
<213> 人工序列
<400> 4
tggcatcgtg gtccatgagc 20
<210> 5
<211> 20
<212> DNA
<213> 人工序列
<400> 5
tgggccatgt cattggcttc 20
<210> 6
<211> 20
<212> DNA
<213> 人工序列
<400> 6
catgagctgg gccatgtcat 20
<210> 7
<211> 509
<212> PRT
<213> 智人
<400> 7
Met Leu Ser Arg Ser Leu Leu Cys Leu Ala Leu Ala Trp Val Ala Arg
1 5 10 15
Val Gly Ala Asp Ala Pro Glu Glu Glu Asp Asn Val Leu Val Leu Lys
20 25 30
Lys Ser Asn Phe Ala Glu Ala Leu Ala Ala His Asn Tyr Leu Leu Val
35 40 45
Glu Phe Tyr Ala Pro Trp Cys Gly His Cys Lys Ala Leu Ala Pro Glu
50 55 60
Tyr Ala Lys Ala Ala Ala Lys Leu Lys Ala Glu Gly Ser Glu Ile Arg
65 70 75 80
Leu Ala Lys Val Asp Ala Thr Glu Glu Ser Asp Leu Ala Gln Gln Tyr
85 90 95
Gly Val Arg Gly Tyr Pro Thr Ile Lys Phe Phe Lys Asn Gly Asp Thr
100 105 110
Ala Ser Pro Lys Glu Tyr Thr Ala Gly Arg Glu Ala Asp Asp Ile Val
115 120 125
Asn Trp Leu Lys Lys Arg Thr Gly Pro Ala Ala Thr Thr Leu Ser Asp
130 135 140
Thr Ala Ala Ala Glu Thr Leu Ile Asp Ser Ser Glu Val Ala Val Ile
145 150 155 160
Gly Phe Phe Lys Asp Val Glu Ser Asp Ser Ala Lys Gln Phe Leu Leu
165 170 175
Ala Ala Glu Ala Val Asp Asp Ile Pro Phe Gly Ile Thr Ser Asn Ser
180 185 190
Gly Val Phe Ser Lys Tyr Gln Leu Asp Lys Asp Gly Val Val Leu Phe
195 200 205
Lys Lys Phe Asp Glu Gly Arg Asn Asn Phe Glu Gly Glu Val Thr Lys
210 215 220
Glu Lys Leu Leu Asp Phe Ile Lys His Asn Gln Leu Pro Leu Val Ile
225 230 235 240
Glu Phe Thr Glu Gln Thr Ala Pro Lys Ile Phe Gly Gly Glu Ile Lys
245 250 255
Thr His Ile Leu Leu Phe Leu Pro Lys Ser Val Ser Asp Tyr Asp Gly
260 265 270
Lys Leu Gly Asn Phe Lys Lys Ala Ala Glu Gly Phe Lys Gly Lys Ile
275 280 285
Leu Phe Ile Phe Ile Asp Ser Asp His Thr Asp Asn Gln Arg Ile Leu
290 295 300
Glu Phe Phe Gly Leu Lys Lys Glu Glu Cys Pro Ala Val Arg Leu Ile
305 310 315 320
Thr Leu Glu Glu Glu Met Thr Lys Tyr Lys Pro Glu Ser Asp Glu Leu
325 330 335
Thr Ala Glu Lys Ile Thr Glu Phe Cys His Arg Phe Leu Glu Gly Lys
340 345 350
Ile Lys Pro His Leu Met Ser Gln Glu Leu Pro Glu Asp Trp Asp Lys
355 360 365
Gln Pro Val Lys Val Leu Val Gly Lys Asn Phe Glu Glu Val Ala Phe
370 375 380
Asp Glu Lys Lys Asn Val Phe Val Glu Phe Tyr Ala Pro Trp Cys Gly
385 390 395 400
His Cys Lys Gln Leu Ala Pro Ile Trp Asp Lys Leu Gly Glu Thr Tyr
405 410 415
Lys Asp His Glu Asn Ile Ile Ile Ala Lys Met Asp Ser Thr Ala Asn
420 425 430
Glu Val Glu Ala Val Lys Val His Ser Phe Pro Thr Leu Lys Phe Phe
435 440 445
Pro Ala Thr Ala Asp Arg Thr Val Ile Asp Tyr Asn Gly Glu Arg Thr
450 455 460
Leu Asp Gly Phe Lys Lys Phe Leu Glu Ser Gly Gly Gln Asp Gly Ala
465 470 475 480
Gly Asp Asp Asp Asp Val Asp Leu Glu Glu Ala Leu Glu Pro Asp Met
485 490 495
Glu Glu Asp Asp Asp Gln Lys Ala Val Lys Asp Glu Leu
500 505
<210> 8
<211> 508
<212> PRT
<213> 智人
<400> 8
Met Leu Arg Arg Ala Leu Leu Cys Leu Ala Val Ala Ala Leu Val Arg
1 5 10 15
Ala Asp Ala Pro Glu Glu Glu Asp His Val Leu Val Leu Arg Lys Ser
20 25 30
Asn Phe Ala Glu Ala Leu Ala Ala His Lys Tyr Leu Leu Val Glu Phe
35 40 45
Tyr Ala Pro Trp Cys Gly His Cys Lys Ala Leu Ala Pro Glu Tyr Ala
50 55 60
Lys Ala Ala Gly Lys Leu Lys Ala Glu Gly Ser Glu Ile Arg Leu Ala
65 70 75 80
Lys Val Asp Ala Thr Glu Glu Ser Asp Leu Ala Gln Gln Tyr Gly Val
85 90 95
Arg Gly Tyr Pro Thr Ile Lys Phe Phe Arg Asn Gly Asp Thr Ala Ser
100 105 110
Pro Lys Glu Tyr Thr Ala Gly Arg Glu Ala Asp Asp Ile Val Asn Trp
115 120 125
Leu Lys Lys Arg Thr Gly Pro Ala Ala Thr Thr Leu Pro Asp Gly Ala
130 135 140
Ala Ala Glu Ser Leu Val Glu Ser Ser Glu Val Ala Val Ile Gly Phe
145 150 155 160
Phe Lys Asp Val Glu Ser Asp Ser Ala Lys Gln Phe Leu Gln Ala Ala
165 170 175
Glu Ala Ile Asp Asp Ile Pro Phe Gly Ile Thr Ser Asn Ser Asp Val
180 185 190
Phe Ser Lys Tyr Gln Leu Asp Lys Asp Gly Val Val Leu Phe Lys Lys
195 200 205
Phe Asp Glu Gly Arg Asn Asn Phe Glu Gly Glu Val Thr Lys Glu Asn
210 215 220
Leu Leu Asp Phe Ile Lys His Asn Gln Leu Pro Leu Val Ile Glu Phe
225 230 235 240
Thr Glu Gln Thr Ala Pro Lys Ile Phe Gly Gly Glu Ile Lys Thr His
245 250 255
Ile Leu Leu Phe Leu Pro Lys Ser Val Ser Asp Tyr Asp Gly Lys Leu
260 265 270
Ser Asn Phe Lys Thr Ala Ala Glu Ser Phe Lys Gly Lys Ile Leu Phe
275 280 285
Ile Phe Ile Asp Ser Asp His Thr Asp Asn Gln Arg Ile Leu Glu Phe
290 295 300
Phe Gly Leu Lys Lys Glu Glu Cys Pro Ala Val Arg Leu Ile Thr Leu
305 310 315 320
Glu Glu Glu Met Thr Lys Tyr Lys Pro Glu Ser Glu Glu Leu Thr Ala
325 330 335
Glu Arg Ile Thr Glu Phe Cys His Arg Phe Leu Glu Gly Lys Ile Lys
340 345 350
Pro His Leu Met Ser Gln Glu Leu Pro Glu Asp Trp Asp Lys Gln Pro
355 360 365
Val Lys Val Leu Val Gly Lys Asn Phe Glu Asp Val Ala Phe Asp Glu
370 375 380
Lys Lys Asn Val Phe Val Glu Phe Tyr Ala Pro Trp Cys Gly His Cys
385 390 395 400
Lys Gln Leu Ala Pro Ile Trp Asp Lys Leu Gly Glu Thr Tyr Lys Asp
405 410 415
His Glu Asn Ile Val Ile Ala Lys Met Asp Ser Thr Ala Asn Glu Val
420 425 430
Glu Ala Val Lys Val His Ser Phe Pro Thr Leu Lys Phe Phe Pro Ala
435 440 445
Ser Ala Asp Arg Thr Val Ile Asp Tyr Asn Gly Glu Arg Thr Leu Asp
450 455 460
Gly Phe Lys Lys Phe Leu Glu Ser Gly Gly Gln Asp Gly Ala Gly Asp
465 470 475 480
Asp Asp Asp Leu Glu Asp Leu Glu Glu Ala Glu Glu Pro Asp Met Glu
485 490 495
Glu Asp Asp Asp Gln Lys Ala Val Lys Asp Glu Leu
500 505
<210> 9
<211> 1530
<212> DNA
<213> 智人
<400> 9
atgctgagcc gttctctgct gtgcctggcc ctggcctggg tggccagggt gggcgccgac 60
gccccggagg aggaggacaa cgtcctggtg ctgaagaaga gcaacttcgc agaggcgctg 120
gcggcacaca actacctgct ggtggagttc tatgccccgt ggtgtggcca ctgcaaagct 180
ctggcccctg agtatgccaa agccgctgca aagctgaagg cagaaggctc cgagattcga 240
ctcgccaagg tggacgccac cgaagagtct gacctggctc agcagtatgg cgtccgcggc 300
taccctacaa tcaagttctt caagaatgga gacacagcgt cccctaagga gtacacagct 360
ggcagggaag ctgacgacat tgtgaactgg ctgaagaagc gaacaggtcc tgctgccaca 420
accctgtctg acaccgcagc ggccgagacc ttgatagact caagcgaggt ggctgtcatc 480
ggtttcttca aggatgtaga gtcagactct gccaagcagt tcttgctggc ggctgaggct 540
gtcgatgaca taccttttgg gatcacgtcc aacagtggtg tgttctccaa gtaccagctg 600
gacaaggatg gggtggtcct ctttaagaag tttgatgaag gccgaaacaa ctttgaaggt 660
gaggtcacca aggagaagct gctagacttc attaagcaca accaactgcc tttggtcatc 720
gagttcactg aacagacagc cccgaagatt tttggaggtg aaatcaagac acacattctg 780
ctattcctgc ccaagagtgt atctgactac gatggcaagt tgggcaactt caagaaagca 840
gccgaaggct tcaagggcaa gatcctgttt atcttcatcg acagcgacca cactgacaac 900
cagcgtatcc tggagttctt tggcctgaag aaggaggagt gtccagctgt gcggcttatt 960
accctagagg aagaaatgac caagtacaaa ccagagtcag atgagctaac agctgagaag 1020
atcacagaat tttgccaccg cttcctggag ggcaagatca agccccactt gatgagccag 1080
gaactgcctg aagactggga taaacagcca gtgaaagtgc tagttgggaa aaactttgaa 1140
gaagttgctt ttgatgagaa aaagaacgtc tttgtggaat tctatgcccc ctggtgtggc 1200
cactgcaagc agctagcccc catttgggac aaactgggag agacatacaa ggatcatgaa 1260
aatatcatca ttgctaagat ggactcaaca gctaatgagg tggaagccgt caaagtgcac 1320
agtttcccca ccctgaagtt cttcccagca actgcagaca gaacggtcat tgattataat 1380
ggtgagcgga cgctagacgg ttttaagaaa ttcctggaga gtggtggcca ggatggtgca 1440
ggagacgatg atgatgtgga cctagaggaa gctttagagc cagacatgga ggaggatgat 1500
gaccagaaag ctgtaaagga cgaattgtag 1530
<210> 10
<211> 900
<212> DNA
<213> 智人
<400> 10
gctctcgtcg cccccgctgt cccggcggcg ccaaccgaag cgccccgcct gatccgtgtc 60
cgacatgctg cgccgcgctc tgctgtgcct ggccgtggcc gccctggtgc gcgccgacgc 120
ccccgaggag gaggaccacg tcctggtgct gcggaaaagc aacttcgcgg aggcgctggc 180
ggcccacaag tacctgctgg tggagttcta tgccccttgg tgtggccact gcaaggctct 240
ggcccctgag tatgccaaag ccgctgggaa gctgaaggca gaaggttccg agatcaggtt 300
ggccaaggtg gacgccacgg aggagtctga cctggcccag cagtacggcg tgcgcggcta 360
tcccaccatc aagttcttca ggaatggaga cacggcttcc cccaaggaat atacagctgg 420
cagagaggct gatgacatcg tgaactggct gaagaagcgc acgggcccgg ctgccaccac 480
cctgcctgac ggcgcagctg cagagtcctt ggtggagtcc agcgaggtgg ctgtcatcgg 540
cttcttcaag gacgtggagt cggactctgc caagcagttt ttgcaggcag cagaggccat 600
cgatgacata ccatttggga tcacttccaa cagtgacgtg ttctccaaat accagctcga 660
caaagatggg gttgtcctct ttaagaagtt tgatgaaggc cggaacaact ttgaagggga 720
ggtcaccaag gagaacctgc tggactttat caaacacaac cagctgcccc ttgtcatcga 780
gttcaccgag cagacagccc cgaagatttt tggaggtgaa atcaagactc acatcctgct 840
gttcttgccc aagagtgtgt ctgactatga cggcaaactg agcaacttca aaacagcagc 900
<210> 11
<211> 654
<212> PRT
<213> 智人
<400> 11
Met Lys Phe Pro Met Val Ala Ala Ala Leu Leu Leu Leu Cys Ala Val
1 5 10 15
Arg Ala Glu Glu Glu Asp Lys Lys Glu Asp Val Gly Thr Val Val Gly
20 25 30
Ile Asp Leu Gly Thr Thr Tyr Ser Cys Val Gly Val Phe Lys Asn Gly
35 40 45
Arg Val Glu Ile Ile Ala Asn Asp Gln Gly Asn Arg Ile Thr Pro Ser
50 55 60
Tyr Val Ala Phe Thr Pro Glu Gly Glu Arg Leu Ile Gly Asp Ala Ala
65 70 75 80
Lys Asn Gln Leu Thr Ser Asn Pro Glu Asn Thr Val Phe Asp Ala Lys
85 90 95
Arg Leu Ile Gly Arg Thr Trp Asn Asp Pro Ser Val Gln Gln Asp Ile
100 105 110
Lys Phe Leu Pro Phe Lys Val Val Glu Lys Lys Thr Lys Pro Tyr Ile
115 120 125
Gln Val Asp Ile Gly Gly Gly Gln Thr Lys Thr Phe Ala Pro Glu Glu
130 135 140
Ile Ser Ala Met Val Leu Thr Lys Met Lys Glu Thr Ala Glu Ala Tyr
145 150 155 160
Leu Gly Lys Lys Val Thr His Ala Val Val Thr Val Pro Ala Tyr Phe
165 170 175
Asn Asp Ala Gln Arg Gln Ala Thr Lys Asp Ala Gly Thr Ile Ala Gly
180 185 190
Leu Asn Val Met Arg Ile Ile Asn Glu Pro Thr Ala Ala Ala Ile Ala
195 200 205
Tyr Gly Leu Asp Lys Arg Glu Gly Glu Lys Asn Ile Leu Val Phe Asp
210 215 220
Leu Gly Gly Gly Thr Phe Asp Val Ser Leu Leu Thr Ile Asp Asn Gly
225 230 235 240
Val Phe Glu Val Val Ala Thr Asn Gly Asp Thr His Leu Gly Gly Glu
245 250 255
Asp Phe Asp Gln Arg Val Met Glu His Phe Ile Lys Leu Tyr Lys Lys
260 265 270
Lys Thr Gly Lys Asp Val Arg Lys Asp Asn Arg Ala Val Gln Lys Leu
275 280 285
Arg Arg Glu Val Glu Lys Ala Lys Arg Ala Leu Ser Ser Gln His Gln
290 295 300
Ala Arg Ile Glu Ile Glu Ser Phe Phe Glu Gly Glu Asp Phe Ser Glu
305 310 315 320
Thr Leu Thr Arg Ala Lys Phe Glu Glu Leu Asn Met Asp Leu Phe Arg
325 330 335
Ser Thr Met Lys Pro Val Gln Lys Val Leu Glu Asp Ser Asp Leu Lys
340 345 350
Lys Ser Asp Ile Asp Glu Ile Val Leu Val Gly Gly Ser Thr Arg Ile
355 360 365
Pro Lys Ile Gln Gln Leu Val Lys Glu Phe Phe Asn Gly Lys Glu Pro
370 375 380
Ser Arg Gly Ile Asn Pro Asp Glu Ala Val Ala Tyr Gly Ala Ala Val
385 390 395 400
Gln Ala Gly Val Leu Ser Gly Asp Gln Asp Thr Gly Asp Leu Val Leu
405 410 415
Leu Asp Val Cys Pro Leu Thr Leu Gly Ile Glu Thr Val Gly Gly Val
420 425 430
Met Thr Lys Leu Ile Pro Arg Asn Thr Val Val Pro Thr Lys Lys Ser
435 440 445
Gln Ile Phe Ser Thr Ala Ser Asp Asn Gln Pro Thr Val Thr Ile Lys
450 455 460
Val Tyr Glu Gly Glu Arg Pro Leu Thr Lys Asp Asn His Leu Leu Gly
465 470 475 480
Thr Phe Asp Leu Thr Gly Ile Pro Pro Ala Pro Arg Gly Val Pro Gln
485 490 495
Ile Glu Val Thr Phe Glu Ile Asp Val Asn Gly Ile Leu Arg Val Thr
500 505 510
Ala Glu Asp Lys Gly Thr Gly Asn Lys Asn Lys Ile Thr Ile Thr Asn
515 520 525
Asp Gln Asn Arg Leu Thr Pro Glu Glu Ile Glu Arg Met Val Asn Asp
530 535 540
Ala Glu Lys Phe Ala Glu Glu Asp Lys Lys Leu Lys Glu Arg Ile Asp
545 550 555 560
Thr Arg Asn Glu Leu Glu Ser Tyr Ala Tyr Ser Leu Lys Asn Gln Ile
565 570 575
Gly Asp Lys Glu Lys Leu Gly Gly Lys Leu Ser Ser Glu Asp Lys Glu
580 585 590
Thr Met Glu Lys Ala Val Glu Glu Lys Ile Glu Trp Leu Glu Ser His
595 600 605
Gln Asp Ala Asp Ile Glu Asp Phe Lys Ala Lys Lys Lys Glu Leu Glu
610 615 620
Glu Ile Val Gln Pro Ile Ile Ser Lys Leu Tyr Gly Ser Ala Gly Pro
625 630 635 640
Pro Pro Thr Gly Glu Glu Asp Thr Ser Glu Lys Asp Glu Leu
645 650
<210> 12
<211> 1965
<212> DNA
<213> 智人
<400> 12
atgaagttcc ctatggtggc ggcggcgctg ctgctgctct gcgcggtgcg ggccgaggag 60
gaggacaaga aggaggatgt gggcacggtg gtcggcatcg acctggggac cacctattcc 120
tgcgttggtg tgttcaagaa cggccgcgtg gagatcatag ccaacgatca gggcaaccgc 180
atcacgccgt cgtatgtggc cttcactcct gaaggcgagc gtctgattgg cgatgcggcc 240
aagaaccagc tcacctccaa tcccgagaac acggtcttcg acgccaagcg cctcatcgga 300
cgcacttgga atgacccttc agtgcagcag gacatcaagt tcttgccttt caaggtggtt 360
gaaaagaaaa ctaaaccata cattcaagtt gatattggag gtgggcaaac caaaacattt 420
gccccagaag aaatttctgc catggttctc actaaaatga aagaaactgc tgaagcatat 480
ttgggaaaga aggttaccca tgcagttgtt actgtgccgg cttacttcaa tgatgcccag 540
cgccaagcaa ccaaagatgc tggcaccatt gctggactga atgtcatgcg gatcatcaat 600
gagcccacag cagctgctat tgcgtatggc ctggataaga gagagggcga gaagaacatc 660
ctcgtttttg acctgggcgg tggaaccttc gatgtgtctc ttctgaccat tgacaatggt 720
gtctttgaag tggtggccac gaatggagac actcatctcg gtggggaaga ctttgatcag 780
cgggttatgg aacacttcat caagctgtac aaaaagaaaa ctgggaaaga cgttagaaaa 840
gacaacagag ctgtgcagaa acttcgtcgt gaggtggaaa aggctaagcg agccctgtct 900
tctcagcatc aagcaagaat tgagatagag tccttctttg aaggagaaga cttctctgag 960
accctgactc gggccaaatt tgaagagttg aacatggacc tgttccgatc taccatgaag 1020
ccagtccaga aagtgttgga agactctgat ctgaagaaat cagacattga tgaaattgtt 1080
cttgtcggtg ggtctactcg gattcccaag attcagcagc tggtgaaaga gttcttcaat 1140
ggcaaggagc catcccgtgg cataaaccca gatgaggctg tagcatacgg tgctgctgtc 1200
caggctggtg tcctctctgg tgatcaagat acaggtgatc tggtactgct tgatgtatgt 1260
cctcttacac ttggtattga aacagtggga ggtgtcatga ccaaactgat tccaaggaac 1320
actgtggtac ccaccaagaa gtctcagatc ttttccacag cttctgataa tcagccaact 1380
gtaacaatca aggtctatga aggtgaacga cccctaacaa aagacaacca tcttctgggt 1440
acatttgatc tgactggaat tcctcctgct cctcgtgggg taccccagat tgaagtcacc 1500
tttgagatag atgttaatgg tattcttcga gtgacagctg aagacaaagg tacagggaac 1560
aaaaacaaaa tcacaattac caatgaccaa aatcgcctga cacctgaaga aattgaaagg 1620
atggttaatg atgcagagaa gtttgctgag gaagacaaaa agctcaaaga gcgcattgat 1680
accaggaacg agttggaaag ctatgcttac tctctcaaga accagattgg agataaagaa 1740
aagctgggcg gtaaactttc ctctgaagat aaagaaacca tggagaaagc tgtagaggaa 1800
aagattgaat ggctggaaag ccaccaggat gcagacattg aagactttaa agctaaaaag 1860
aaggaactag aggaaattgt tcagcctatt attagcaaac tctatggaag tgcaggccct 1920
cccccaactg gtgaagagga tacatcagaa aaagatgagt tgtag 1965
<210> 13
<211> 5914
<212> DNA
<213> 智人
<400> 13
gcagacggga gtttctcctc ggggtcggag caggaggcac gcggagtgtg aggccacgca 60
tgagcggacg ctaaccccct ccccagccac aaagagtcta catgtctagg gtctagacat 120
gttcagcttt gtggacctcc ggctcctgct cctcttagcg gccaccgccc tcctgacgca 180
cggccaagag gaaggccaag tcgagggcca agacgaagac atcccaccaa tcacctgcgt 240
acagaacggc ctcaggtacc atgaccgaga cgtgtggaaa cccgagccct gccggatctg 300
cgtctgcgac aacggcaagg tgttgtgcga tgacgtgatc tgtgacgaga ccaagaactg 360
ccccggcgcc gaagtccccg agggcgagtg ctgtcccgtc tgccccgacg gctcagagtc 420
acccaccgac caagaaacca ccggcgtcga gggacccaag ggagacactg gcccccgagg 480
cccaagggga cccgcaggcc cccctggccg agatggcatc cctggacagc ctggacttcc 540
cggacccccc ggaccccccg gacctcccgg accccctggc ctcggaggaa actttgctcc 600
ccagctgtct tatggctatg atgagaaatc aaccggagga atttccgtgc ctggccccat 660
gggtccctct ggtcctcgtg gtctccctgg cccccctggt gcacctggtc cccaaggctt 720
ccaaggtccc cctggtgagc ctggcgagcc tggagcttca ggtcccatgg gtccccgagg 780
tcccccaggt ccccctggaa agaatggaga tgatggggaa gctggaaaac ctggtcgtcc 840
tggtgagcgt gggcctcctg ggcctcaggg tgctcgagga ttgcccggaa cagctggcct 900
ccctggaatg aagggacaca gaggtttcag tggtttggat ggtgccaagg gagatgctgg 960
tcctgctggt cctaagggtg agcctggcag ccctggtgaa aatggagctc ctggtcagat 1020
gggcccccgt ggcctgcctg gtgagagagg tcgccctgga gcccctggcc ctgctggtgc 1080
tcgtggaaat gatggtgcta ctggtgctgc cgggccccct ggtcccaccg gccccgctgg 1140
tcctcctggc ttccctggtg ctgttggtgc taagggtgaa gctggtcccc aagggccccg 1200
aggctctgaa ggtccccagg gtgtgcgtgg tgagcctggc ccccctggcc ctgctggtgc 1260
tgctggccct gctggaaacc ctggtgctga tggacagcct ggtgctaaag gtgccaatgg 1320
tgctcctggt attgctggtg ctcctggctt ccctggtgcc cgaggcccct ctggacccca 1380
gggccccggc ggccctcctg gtcccaaggg taacagcggt gaacctggtg ctcctggcag 1440
caaaggagac actggtgcta agggagagcc tggccctgtt ggtgttcaag gaccccctgg 1500
ccctgctgga gaggaaggaa agcgaggagc tcgaggtgaa cccggaccca ctggcctgcc 1560
cggaccccct ggcgagcgtg gtggacctgg tagccgtggt ttccctggcg cagatggtgt 1620
tgctggtccc aagggtcccg ctggtgaacg tggttctcct ggccctgctg gccccaaagg 1680
atctcctggt gaagctggtc gtcccggtga agctggtctg cctggtgcca agggtctgac 1740
tggaagccct ggcagccctg gtcctgatgg caaaactggc ccccctggtc ccgccggtca 1800
agatggtcgc cccggacccc caggcccacc tggtgcccgt ggtcaggctg gtgtgatggg 1860
attccctgga cctaaaggtg ctgctggaga gcccggcaag gctggagagc gaggtgttcc 1920
cggaccccct ggcgctgtcg gtcctgctgg caaagatgga gaggctggag ctcagggacc 1980
ccctggccct gctggtcccg ctggcgagag aggtgaacaa ggccctgctg gctcccccgg 2040
attccagggt ctccctggtc ctgctggtcc tccaggtgaa gcaggcaaac ctggtgaaca 2100
gggtgttcct ggagaccttg gcgcccctgg cccctctgga gcaagaggcg agagaggttt 2160
ccctggcgag cgtggtgtgc aaggtccccc tggtcctgct ggtccccgag gggccaacgg 2220
tgctcccggc aacgatggtg ctaagggtga tgctggtgcc cctggagctc ccggtagcca 2280
gggcgcccct ggccttcagg gaatgcctgg tgaacgtggt gcagctggtc ttccagggcc 2340
taagggtgac agaggtgatg ctggtcccaa aggtgctgat ggctctcctg gcaaagatgg 2400
cgtccgtggt ctgactggcc ccattggtcc tcctggccct gctggtgccc ctggtgacaa 2460
gggtgaaagt ggtcccagcg gccctgctgg tcccactgga gctcgtggtg cccccggaga 2520
ccgtggtgag cctggtcccc ccggccctgc tggctttgct ggcccccctg gtgctgacgg 2580
ccaacctggt gctaaaggcg aacctggtga tgctggtgct aaaggcgatg ctggtccccc 2640
tggccctgcc ggacccgctg gaccccctgg ccccattggt aatgttggtg ctcctggagc 2700
caaaggtgct cgcggcagcg ctggtccccc tggtgctact ggtttccctg gtgctgctgg 2760
ccgagtcggt cctcctggcc cctctggaaa tgctggaccc cctggccctc ctggtcctgc 2820
tggcaaagaa ggcggcaaag gtccccgtgg tgagactggc cctgctggac gtcctggtga 2880
agttggtccc cctggtcccc ctggccctgc tggcgagaaa ggatcccctg gtgctgatgg 2940
tcctgctggt gctcctggta ctcccgggcc tcaaggtatt gctggacagc gtggtgtggt 3000
cggcctgcct ggtcagagag gagagagagg cttccctggt cttcctggcc cctctggtga 3060
acctggcaaa caaggtccct ctggagcaag tggtgaacgt ggtccccctg gtcccatggg 3120
cccccctgga ttggctggac cccctggtga atctggacgt gagggggctc ctggtgccga 3180
aggttcccct ggacgagacg gttctcctgg cgccaagggt gaccgtggtg agaccggccc 3240
cgctggaccc cctggtgctc ctggtgctcc tggtgcccct ggccccgttg gccctgctgg 3300
caagagtggt gatcgtggtg agactggtcc tgctggtccc gccggtcctg tcggccctgt 3360
tggcgcccgt ggccccgccg gaccccaagg cccccgtggt gacaagggtg agacaggcga 3420
acagggcgac agaggcataa agggtcaccg tggcttctct ggcctccagg gtccccctgg 3480
ccctcctggc tctcctggtg aacaaggtcc ctctggagcc tctggtcctg ctggtccccg 3540
aggtccccct ggctctgctg gtgctcctgg caaagatgga ctcaacggtc tccctggccc 3600
cattgggccc cctggtcctc gcggtcgcac tggtgatgct ggtcctgttg gtccccccgg 3660
ccctcctgga cctcctggtc cccctggtcc tcccagcgct ggtttcgact tcagcttcct 3720
gccccagcca cctcaagaga aggctcacga tggtggccgc tactaccggg ctgatgatgc 3780
caatgtggtt cgtgaccgtg acctcgaggt ggacaccacc ctcaagagcc tgagccagca 3840
gatcgagaac atccggagcc cagagggcag ccgcaagaac cccgcccgca cctgccgtga 3900
cctcaagatg tgccactctg actggaagag tggagagtac tggattgacc ccaaccaagg 3960
ctgcaacctg gatgccatca aagtcttctg caacatggag actggtgaga cctgcgtgta 4020
ccccactcag cccagtgtgg cccagaagaa ctggtacatc agcaagaacc ccaaggacaa 4080
gaggcatgtc tggttcggcg agagcatgac cgatggattc cagttcgagt atggcggcca 4140
gggctccgac cctgccgatg tggccatcca gctgaccttc ctgcgcctga tgtccaccga 4200
ggcctcccag aacatcacct accactgcaa gaacagcgtg gcctacatgg accagcagac 4260
tggcaacctc aagaaggccc tgctcctcca gggctccaac gagatcgaga tccgcgccga 4320
gggcaacagc cgcttcacct acagcgtcac tgtcgatggc tgcacgagtc acaccggagc 4380
ctggggcaag acagtgattg aatacaaaac caccaagacc tcccgcctgc ccatcatcga 4440
tgtggccccc ttggacgttg gtgccccaga ccaggaattc ggcttcgacg ttggccctgt 4500
ctgcttcctg taaactccct ccatcccaac ctggctccct cccacccaac caactttccc 4560
cccaacccgg aaacagacaa gcaacccaaa ctgaaccccc tcaaaagcca aaaaatggga 4620
gacaatttca catggacttt ggaaaatatt tttttccttt gcattcatct ctcaaactta 4680
gtttttatct ttgaccaacc gaacatgacc aaaaaccaaa agtgcattca accttaccaa 4740
aaaaaaaaaa aaaaaaagaa taaataaata actttttaaa aaaggaagct tggtccactt 4800
gcttgaagac ccatgcgggg gtaagtccct ttctgcccgt tgggcttatg aaaccccaat 4860
gctgcccttt ctgctccttt ctccacaccc cccttggggc ctcccctcca ctccttccca 4920
aatctgtctc cccagaagac acaggaaaca atgtattgtc tgcccagcaa tcaaaggcaa 4980
tgctcaaaca cccaagtggc ccccaccctc agcccgctcc tgcccgccca gcacccccag 5040
gccctggggg acctggggtt ctcagactgc caaagaagcc ttgccatctg gcgctcccat 5100
ggctcttgca acatctcccc ttcgtttttg agggggtcat gccgggggag ccaccagccc 5160
ctcactgggt tcggaggaga gtcaggaagg gccacgacaa agcagaaaca tcggatttgg 5220
ggaacgcgtg tcaatccctt gtgccgcagg gctgggcggg agagactgtt ctgttccttg 5280
tgtaactgtg ttgctgaaag actacctcgt tcttgtcttg atgtgtcacc ggggcaactg 5340
cctgggggcg gggatggggg cagggtggaa gcggctcccc attttatacc aaaggtgcta 5400
catctatgtg atgggtgggg tggggaggga atcactggtg ctatagaaat tgagatgccc 5460
ccccaggcca gcaaatgttc ctttttgttc aaagtctatt tttattcctt gatatttttc 5520
tttttttttt tttttttttg tggatgggga cttgtgaatt tttctaaagg tgctatttaa 5580
catgggagga gagcgtgtgc ggctccagcc cagcccgctg ctcactttcc accctctctc 5640
cacctgcctc tggcttctca ggcctctgct ctccgacctc tctcctctga aaccctcctc 5700
cacagctgca gcccatcctc ccggctccct cctagtctgt cctgcgtcct ctgtccccgg 5760
gtttcagaga caacttccca aagcacaaag cagtttttcc ccctaggggt gggaggaagc 5820
aaaagactct gtacctattt tgtatgtgta taataatttg agatgttttt aattattttg 5880
attgctggaa taaagcatgt ggaaatgacc caaa 5914
<210> 14
<211> 5072
<212> DNA
<213> 智人
<400> 14
agcaccacgg cagcaggagg tttcggctaa gttggaggta ctggccacga ctgcatgccc 60
gcgcccgcca ggtgatacct ccgccggtga cccaggggct ctgcgacaca aggagtctgc 120
atgtctaagt gctagacatg ctcagctttg tggatacgcg gactttgttg ctgcttgcag 180
taaccttatg cctagcaaca tgccaatctt tacaagagga aactgtaaga aagggcccag 240
ccggagatag aggaccacgt ggagaaaggg gtccaccagg ccccccaggc agagatggtg 300
aagatggtcc cacaggccct cctggtccac ctggtcctcc tggcccccct ggtctcggtg 360
ggaactttgc tgctcagtat gatggaaaag gagttggact tggccctgga ccaatgggct 420
taatgggacc tagaggccca cctggtgcag ctggagcccc aggccctcaa ggtttccaag 480
gacctgctgg tgagcctggt gaacctggtc aaactggtcc tgcaggtgct cgtggtccag 540
ctggccctcc tggcaaggct ggtgaagatg gtcaccctgg aaaacccgga cgacctggtg 600
agagaggagt tgttggacca cagggtgctc gtggtttccc tggaactcct ggacttcctg 660
gcttcaaagg cattagggga cacaatggtc tggatggatt gaagggacag cccggtgctc 720
ctggtgtgaa gggtgaacct ggtgcccctg gtgaaaatgg aactccaggt caaacaggag 780
cccgtgggct tcctggtgag agaggacgtg ttggtgcccc tggcccagct ggtgcccgtg 840
gcagtgatgg aagtgtgggt cccgtgggtc ctgctggtcc cattgggtct gctggccctc 900
caggcttccc aggtgcccct ggccccaagg gtgaaattgg agctgttggt aacgctggtc 960
ctgctggtcc cgccggtccc cgtggtgaag tgggtcttcc aggcctctcc ggccccgttg 1020
gacctcctgg taatcctgga gcaaacggcc ttactggtgc caagggtgct gctggccttc 1080
ccggcgttgc tggggctccc ggcctccctg gaccccgcgg tattcctggc cctgttggtg 1140
ctgccggtgc tactggtgcc agaggacttg ttggtgagcc tggtccagct ggctccaaag 1200
gagagagcgg taacaagggt gagcccggct ctgctgggcc ccaaggtcct cctggtccca 1260
gtggtgaaga aggaaagaga ggccctaatg gggaagctgg atctgccggc cctccaggac 1320
ctcctgggct gagaggtagt cctggttctc gtggtcttcc tggagctgat ggcagagctg 1380
gcgtcatggg ccctcctggt agtcgtggtg caagtggccc tgctggagtc cgaggaccta 1440
atggagatgc tggtcgccct ggggagcctg gtctcatggg acccagaggt cttcctggtt 1500
cccctggaaa tatcggcccc gctggaaaag aaggtcctgt cggcctccct ggcatcgacg 1560
gcaggcctgg cccaattggc ccagctggag caagaggaga gcctggcaac attggattcc 1620
ctggacccaa aggccccact ggtgatcctg gcaaaaacgg tgataaaggt catgctggtc 1680
ttgctggtgc tcggggtgct ccaggtcctg atggaaacaa tggtgctcag ggacctcctg 1740
gaccacaggg tgttcaaggt ggaaaaggtg aacagggtcc ccctggtcct ccaggcttcc 1800
agggtctgcc tggcccctca ggtcccgctg gtgaagttgg caaaccagga gaaaggggtc 1860
tccatggtga gtttggtctc cctggtcctg ctggtccaag aggggaacgc ggtcccccag 1920
gtgagagtgg tgctgccggt cctactggtc ctattggaag ccgaggtcct tctggacccc 1980
cagggcctga tggaaacaag ggtgaacctg gtgtggttgg tgctgtgggc actgctggtc 2040
catctggtcc tagtggactc ccaggagaga ggggtgctgc tggcatacct ggaggcaagg 2100
gagaaaaggg tgaacctggt ctcagaggtg aaattggtaa ccctggcaga gatggtgctc 2160
gtggtgctcc tggtgctgta ggtgcccctg gtcctgctgg agccacaggt gaccggggcg 2220
aagctggggc tgctggtcct gctggtcctg ctggtcctcg gggaagccct ggtgaacgtg 2280
gtgaggtcgg tcctgctggc cccaatggat ttgctggtcc tgctggtgct gctggtcaac 2340
ctggtgctaa aggagaaaga ggagccaaag ggcctaaggg tgaaaacggt gttgttggtc 2400
ccacaggccc cgttggagct gctggcccag ctggtccaaa tggtcccccc ggtcctgctg 2460
gaagtcgtgg tgatggaggc ccccctggta tgactggttt ccctggtgct gctggacgga 2520
ctggtccccc aggaccctct ggtatttctg gccctcctgg tccccctggt cctgctggga 2580
aagaagggct tcgtggtcct cgtggtgacc aaggtccagt tggccgaact ggagaagtag 2640
gtgcagttgg tccccctggc ttcgctggtg agaagggtcc ctctggagag gctggtactg 2700
ctggacctcc tggcactcca ggtcctcagg gtcttcttgg tgctcctggt attctgggtc 2760
tccctggctc gagaggtgaa cgtggtctac caggtgttgc tggtgctgtg ggtgaacctg 2820
gtcctcttgg cattgccggc cctcctgggg cccgtggtcc tcctggtgct gtgggtagtc 2880
ctggagtcaa cggtgctcct ggtgaagctg gtcgtgatgg caaccctggg aacgatggtc 2940
ccccaggtcg cgatggtcaa cccggacaca agggagagcg cggttaccct ggcaatattg 3000
gtcccgttgg tgctgcaggt gcacctggtc ctcatggccc cgtgggtcct gctggcaaac 3060
atggaaaccg tggtgaaact ggtccttctg gtcctgttgg tcctgctggt gctgttggcc 3120
caagaggtcc tagtggccca caaggcattc gtggcgataa gggagagccc ggtgaaaagg 3180
ggcccagagg tcttcctggc ttaaagggac acaatggatt gcaaggtctg cctggtatcg 3240
ctggtcacca tggtgatcaa ggtgctcctg gctccgtggg tcctgctggt cctaggggcc 3300
ctgctggtcc ttctggccct gctggaaaag atggtcgcac tggacatcct ggtacagttg 3360
gacctgctgg cattcgaggc cctcagggtc accaaggccc tgctggcccc cctggtcccc 3420
ctggccctcc tggacctcca ggtgtaagcg gtggtggtta tgactttggt tacgatggag 3480
acttctacag ggctgaccag cctcgctcag caccttctct cagacccaag gactatgaag 3540
ttgatgctac tctgaagtct ctcaacaacc agattgagac ccttcttact cctgaaggct 3600
ctagaaagaa cccagctcgc acatgccgtg acttgagact cagccaccca gagtggagca 3660
gtggttacta ctggattgac cctaaccaag gatgcactat ggatgctatc aaagtatact 3720
gtgatttctc tactggcgaa acctgtatcc gggcccaacc tgaaaacatc ccagccaaga 3780
actggtatag gagctccaag gacaagaaac acgtctggct aggagaaact atcaatgctg 3840
gcagccagtt tgaatataat gtagaaggag tgacttccaa ggaaatggct acccaacttg 3900
ccttcatgcg cctgctggcc aactatgcct ctcagaacat cacctaccac tgcaagaaca 3960
gcattgcata catggatgag gagactggca acctgaaaaa ggctgtcatt ctacagggct 4020
ctaatgatgt tgaacttgtt gctgagggca acagcaggtt cacttacact gttcttgtag 4080
atggctgctc taaaaagaca aatgaatggg gaaagacaat cattgaatac aaaacaaata 4140
agccatcacg cctgcccttc cttgatattg cacctttgga catcggtggt gctgaccagg 4200
aattctttgt ggacattggc ccagtctgtt tcaaataaat gaactcaatc taaattaaaa 4260
aagaaagaaa tttgaaaaaa ctttctcttt gccatttctt cttcttcttt tttaactgaa 4320
agctgaatcc ttccatttct tctgcacatc tacttgctta aattgtgggc aaaagagaaa 4380
aagaaggatt gatcagagca ttgtgcaata cagtttcatt aactccttcc cccgctcccc 4440
caaaaatttg aatttttttt tcaacactct tacacctgtt atggaaaatg tcaacctttg 4500
taagaaaacc aaaataaaaa ttgaaaaata aaaaccataa acatttgcac cacttgtggc 4560
ttttgaatat cttccacaga gggaagttta aaacccaaac ttccaaaggt ttaaactacc 4620
tcaaaacact ttcccatgag tgtgatccac attgttaggt gctgacctag acagagatga 4680
actgaggtcc ttgttttgtt ttgttcataa tacaaaggtg ctaattaata gtatttcaga 4740
tacttgaaga atgttgatgg tgctagaaga atttgagaag aaatactcct gtattgagtt 4800
gtatcgtgtg gtgtattttt taaaaaattt gatttagcat tcatattttc catcttattc 4860
ccaattaaaa gtatgcagat tatttgccca aatcttcttc agattcagca tttgttcttt 4920
gccagtctca ttttcatctt cttccatggt tccacagaag ctttgtttct tgggcaagca 4980
gaaaaattaa attgtaccta ttttgtatat gtgagatgtt taaataaatt gtgaaaaaaa 5040
tgaaataaag catgtttggt tttccaaaag aa 5072
<210> 15
<211> 5490
<212> DNA
<213> 智人
<400> 15
ggctgagttt tatgacgggc ccggtgctga agggcaggga acaacttgat ggtgctactt 60
tgaactgctt ttcttttctc ctttttgcac aaagagtctc atgtctgata tttagacatg 120
atgagctttg tgcaaaaggg gagctggcta cttctcgctc tgcttcatcc cactattatt 180
ttggcacaac aggaagctgt tgaaggagga tgttcccatc ttggtcagtc ctatgcggat 240
agagatgtct ggaagccaga accatgccaa atatgtgtct gtgactcagg atccgttctc 300
tgcgatgaca taatatgtga cgatcaagaa ttagactgcc ccaacccaga aattccattt 360
ggagaatgtt gtgcagtttg cccacagcct ccaactgctc ctactcgccc tcctaatggt 420
caaggacctc aaggccccaa gggagatcca ggccctcctg gtattcctgg gagaaatggt 480
gaccctggta ttccaggaca accagggtcc cctggttctc ctggcccccc tggaatctgt 540
gaatcatgcc ctactggtcc tcagaactat tctccccagt atgattcata tgatgtcaag 600
tctggagtag cagtaggagg actcgcaggc tatcctggac cagctggccc cccaggccct 660
cccggtcccc ctggtacatc tggtcatcct ggttcccctg gatctccagg ataccaagga 720
ccccctggtg aacctgggca agctggtcct tcaggccctc caggacctcc tggtgctata 780
ggtccatctg gtcctgctgg aaaagatgga gaatcaggta gacccggacg acctggagag 840
cgaggattgc ctggacctcc aggtatcaaa ggtccagctg ggatacctgg attccctggt 900
atgaaaggac acagaggctt cgatggacga aatggagaaa agggtgaaac aggtgctcct 960
ggattaaagg gtgaaaatgg tcttccaggc gaaaatggag ctcctggacc catgggtcca 1020
agaggggctc ctggtgagcg aggacggcca ggacttcctg gggctgcagg tgctcggggt 1080
aatgacggtg ctcgaggcag tgatggtcaa ccaggccctc ctggtcctcc tggaactgcc 1140
ggattccctg gatcccctgg tgctaagggt gaagttggac ctgcagggtc tcctggttca 1200
aatggtgccc ctggacaaag aggagaacct ggacctcagg gacacgctgg tgctcaaggt 1260
cctcctggcc ctcctgggat taatggtagt cctggtggta aaggcgaaat gggtcccgct 1320
ggcattcctg gagctcctgg actgatggga gcccggggtc ctccaggacc agccggtgct 1380
aatggtgctc ctggactgcg aggtggtgca ggtgagcctg gtaagaatgg tgccaaagga 1440
gagcccggac cacgtggtga acgcggtgag gctggtattc caggtgttcc aggagctaaa 1500
ggcgaagatg gcaaggatgg atcacctgga gaacctggtg caaatgggct tccaggagct 1560
gcaggagaaa ggggtgcccc tgggttccga ggacctgctg gaccaaatgg catcccagga 1620
gaaaagggtc ctgctggaga gcgtggtgct ccaggccctg cagggcccag aggagctgct 1680
ggagaacctg gcagagatgg cgtccctgga ggtccaggaa tgaggggcat gcccggaagt 1740
ccaggaggac caggaagtga tgggaaacca gggcctcccg gaagtcaagg agaaagtggt 1800
cgaccaggtc ctcctgggcc atctggtccc cgaggtcagc ctggtgtcat gggcttcccc 1860
ggtcctaaag gaaatgatgg tgctcctggt aagaatggag aacgaggtgg ccctggagga 1920
cctggccctc agggtcctcc tggaaagaat ggtgaaactg gacctcaggg acccccaggg 1980
cctactgggc ctggtggtga caaaggagac acaggacccc ctggtccaca aggattacaa 2040
ggcttgcctg gtacaggtgg tcctccagga gaaaatggaa aacctgggga accaggtcca 2100
aagggtgatg ccggtgcacc tggagctcca ggaggcaagg gtgatgctgg tgcccctggt 2160
gaacgtggac ctcctggatt ggcaggggcc ccaggactta gaggtggagc tggtccccct 2220
ggtcccgaag gaggaaaggg tgctgctggt cctcctgggc cacctggtgc tgctggtact 2280
cctggtctgc aaggaatgcc tggagaaaga ggaggtcttg gaagtcctgg tccaaagggt 2340
gacaagggtg aaccaggcgg tccaggtgct gatggtgtcc cagggaaaga tggcccaagg 2400
ggtcctactg gtcctattgg tcctcctggc ccagctggcc agcctggaga taagggtgaa 2460
ggtggtgccc ccggacttcc aggtatagct ggacctcgtg gtagccctgg tgagagaggt 2520
gaaactggcc ctccaggacc tgctggtttc cctggtgctc ctggacagaa tggtgaacct 2580
ggtggtaaag gagaaagagg ggctccgggt gagaaaggtg aaggaggccc tcctggagtt 2640
gcaggacccc ctggaggttc tggacctgct ggtcctcctg gtccccaagg tgtcaaaggt 2700
gaacgtggca gtcctggtgg acctggtgct gctggcttcc ctggtgctcg tggtcttcct 2760
ggtcctcctg gtagtaatgg taacccagga cccccaggtc ccagcggttc tccaggcaag 2820
gatgggcccc caggtcctgc gggtaacact ggtgctcctg gcagccctgg agtgtctgga 2880
ccaaaaggtg atgctggcca accaggagag aagggatcgc ctggtgccca gggcccacca 2940
ggagctccag gcccacttgg gattgctggg atcactggag cacggggtct tgcaggacca 3000
ccaggcatgc caggtcctag gggaagccct ggccctcagg gtgtcaaggg tgaaagtggg 3060
aaaccaggag ctaacggtct cagtggagaa cgtggtcccc ctggacccca gggtcttcct 3120
ggtctggctg gtacagctgg tgaacctgga agagatggaa accctggatc agatggtctt 3180
ccaggccgag atggatctcc tggtggcaag ggtgatcgtg gtgaaaatgg ctctcctggt 3240
gcccctggcg ctcctggtca tccaggccca cctggtcctg tcggtccagc tggaaagagt 3300
ggtgacagag gagaaagtgg ccctgctggc cctgctggtg ctcccggtcc tgctggttcc 3360
cgaggtgctc ctggtcctca aggcccacgt ggtgacaaag gtgaaacagg tgaacgtgga 3420
gctgctggca tcaaaggaca tcgaggattc cctggtaatc caggtgcccc aggttctcca 3480
ggccctgctg gtcagcaggg tgcaatcggc agtccaggac ctgcaggccc cagaggacct 3540
gttggaccca gtggacctcc tggcaaagat ggaaccagtg gacatccagg tcccattgga 3600
ccaccagggc ctcgaggtaa cagaggtgaa agaggatctg agggctcccc aggccaccca 3660
gggcaaccag gccctcctgg acctcctggt gcccctggtc cttgctgtgg tggtgttgga 3720
gccgctgcca ttgctgggat tggaggtgaa aaagctggcg gttttgcccc gtattatgga 3780
gatgaaccaa tggatttcaa aatcaacacc gatgagatta tgacttcact caagtctgtt 3840
aatggacaaa tagaaagcct cattagtcct gatggttctc gtaaaaaccc cgctagaaac 3900
tgcagagacc tgaaattctg ccatcctgaa ctcaagagtg gagaatactg ggttgaccct 3960
aaccaaggat gcaaattgga tgctatcaag gtattctgta atatggaaac tggggaaaca 4020
tgcataagtg ccaatccttt gaatgttcca cggaaacact ggtggacaga ttctagtgct 4080
gagaagaaac acgtttggtt tggagagtcc atggatggtg gttttcagtt tagctacggc 4140
aatcctgaac ttcctgaaga tgtccttgat gtgcatctgg cattccttcg acttctctcc 4200
agccgagctt cccagaacat cacatatcac tgcaaaaata gcattgcata catggatcag 4260
gccagtggaa atgtaaagaa ggccctgaag ctgatggggt caaatgaagg tgaattcaag 4320
gctgaaggaa atagcaaatt cacctacaca gttctggagg atggttgcac gaaacacact 4380
ggggaatgga gcaaaacagt ctttgaatat cgaacacgca aggctgtgag actacctatt 4440
gtagatattg caccctatga cattggtggt cctgatcaag aatttggtgt ggacgttggc 4500
cctgtttgct ttttataaac caaactctat ctgaaatccc aacaaaaaaa atttaactcc 4560
atatgtgttc ctcttgttct aatcttgtca accagtgcaa gtgaccgaca aaattccagt 4620
tatttatttc caaaatgttt ggaaacagta taatttgaca aagaaaaatg atacttctct 4680
ttttttgctg ttccaccaaa tacaattcaa atgctttttg ttttattttt ttaccaattc 4740
caatttcaaa atgtctcaat ggtgctataa taaataaact tcaacactct ttatgataac 4800
aacactgtgt tatattcttt gaatcctagc ccatctgcag agcaatgact gtgctcacca 4860
gtaaaagata acctttcttt ctgaaatagt caaatacgaa attagaaaag ccctccctat 4920
tttaactacc tcaactggtc agaaacacag attgtattct atgagtccca gaagatgaaa 4980
aaaattttat acgttgataa aacttataaa tttcattgat taatctcctg gaagattggt 5040
ttaaaaagaa aagtgtaatg caagaattta aagaaatatt tttaaagcca caattatttt 5100
aatattggat atcaactgct tgtaaaggtg ctcctctttt ttcttgtcat tgctggtcaa 5160
gattactaat atttgggaag gctttaaaga cgcatgttat ggtgctaatg tactttcact 5220
tttaaactct agatcagaat tgttgacttg cattcagaac ataaatgcac aaaatctgta 5280
catgtctccc atcagaaaga ttcattggca tgccacaggg gattctcctc cttcatcctg 5340
taaaggtcaa caataaaaac caaattatgg ggctgctttt gtcacactag catagagaat 5400
gtgttgaaat ttaactttgt aagcttgtat gtggttgttg atcttttttt tccttacaga 5460
cacccataat aaaatatcat attaaaattc 5490
<210> 16
<211> 3302
<212> DNA
<213> 智人
<400> 16
accttctgca ctgctcatct gggcagagga agcttcagaa agctgccaag gcaccatctc 60
caggaactcc cagcacgcag aatccatctg agaatatgct gccacaaata ccctttttgc 120
tgctagtatc cttgaacttg gttcatggag tgttttacgc tgaacgatac caaatgccca 180
caggcataaa aggcccacta cccaacacca agacacagtt cttcattccc tacaccataa 240
agagtaaagg tatagcagta agaggagagc aaggtactcc tggtccacca ggccctgctg 300
gacctcgagg gcacccaggt ccttctggac caccaggaaa accaggctac ggaagtcctg 360
gactccaagg agagccaggg ttgccaggac caccgggacc atcagctgta gggaaaccag 420
gtgtgccagg actcccagga aaaccaggag agagaggacc atatggacca aaaggagatg 480
ttggaccagc tggcctacca ggaccccggg gcccaccagg accacctgga atccctggac 540
cggctggaat ttctgtgcca ggaaaacctg gacaacaggg acccacagga gccccaggac 600
ccaggggctt tcctggagaa aagggtgcac caggagtccc tggtatgaat ggacagaaag 660
gggaaatggg atatggtgct cctggtcgtc caggtgagag gggtcttcca ggccctcagg 720
gtcccacagg accatctggc cctcctggag tgggaaaaag aggtgaaaat ggggttccag 780
gacagccagg catcaaaggt gatagaggtt ttccgggaga aatgggacca attggcccac 840
caggtcccca aggccctcct ggggaacgag ggccagaagg cattggaaag ccaggagctg 900
ctggagcccc aggccagcca gggattccag gaacaaaagg tctccctggg gctccaggaa 960
tagctgggcc cccagggcct cctggctttg ggaaaccagg cttgccaggc ctgaagggag 1020
aaagaggacc tgctggcctt cctgggggtc caggtgccaa aggggaacaa gggccagcag 1080
gtcttcctgg gaagccaggt ctgactggac cccctgggaa tatgggaccc caaggaccaa 1140
aaggcatccc gggtagccat ggtctcccag gccctaaagg tgagacaggg ccagctgggc 1200
ctgcaggata ccctggggct aagggtgaaa ggggttcccc tgggtcagat ggaaaaccag 1260
ggtacccagg aaaaccaggt ctcgatggtc ctaagggtaa cccagggtta ccaggtccaa 1320
aaggtgatcc tggagttgga ggacctcctg gtctcccagg ccctgtgggc ccagcaggag 1380
caaagggaat gcccggacac aatggagagg ctggcccaag aggtgcccct ggaataccag 1440
gtactagagg ccctattggg ccaccaggca ttccaggatt ccctgggtct aaaggggatc 1500
caggaagtcc cggtcctcct ggcccagctg gcatagcaac taagggcctc aatggaccca 1560
ccgggccacc agggcctcca ggtccaagag gccactctgg agagcctggt cttccagggc 1620
cccctgggcc tccaggccca ccaggtcaag cagtcatgcc tgagggtttt ataaaggcag 1680
gccaaaggcc cagtctttct gggacccctc ttgttagtgc caaccagggg gtaacaggaa 1740
tgcctgtgtc tgcttttact gttattctct ccaaagctta cccagcaata ggaactccca 1800
taccatttga taaaattttg tataacaggc aacagcatta tgacccaagg actggaatct 1860
ttacttgtca gataccagga atatactatt tttcatacca cgtgcatgtg aaagggactc 1920
atgtttgggt aggcctgtat aagaatggca cccctgtaat gtacacctat gatgaataca 1980
ccaaaggcta cctggatcag gcttcaggga gtgccatcat cgatctcaca gaaaatgacc 2040
aggtgtggct ccagcttccc aatgccgagt caaatggcct atactcctct gagtatgtcc 2100
actcctcttt ctcaggattc ctagtggctc caatgtgagt acacacagag ctaatctaaa 2160
tcttgtgcta gaaaaagcat tctctaactc taccccaccc tacaaaatgc atatggaggt 2220
aggctgaaaa gaatgtaatt tttattttct gaaatacaga tttgagctat cagaccaaca 2280
aaccttcccc ctgaaaagtg agcagcaacg taaaaacgta tgtgaagcct ctcttgaatt 2340
tctagttagc aatcttaagg ctctttaagg ttttctccaa tattaaaaaa tatcaccaaa 2400
gaagtcctgc tatgttaaaa acaaacaaca aaaaacaaac aacaaaaaaa aaattaaaaa 2460
aaaaaacaga aatagagctc taagttatgt gaaatttgat ttgagaaact cggcatttcc 2520
tttttaaaaa agcctgtttc taactatgaa tatgagaact tctaggaaac atccaggagg 2580
tatcatataa ctttgtagaa cttaaatact tgaatattca aatttaaaag acactgtatc 2640
ccctaaaata tttctgatgg tgcactactc tgaggcctgt atggcccctt tcatcaatat 2700
ctattcaaat atacaggtgc atatatactt gttaaagctc ttatataaaa aagccccaaa 2760
atattgaagt tcatctgaaa tgcaaggtgc tttcatcaat gaaccttttc aaacttttct 2820
atgattgcag agaagctttt tatataccca gcataacttg gaaacaggta tctgacctat 2880
tcttatttag ttaacacaag tgtgattaat ttgatttctt taattcctta ttgaatctta 2940
tgtgatatga ttttctggat ttacagaaca ttagcacatg taccttgtgc ctcccattca 3000
agtgaagtta taatttacac tgagggtttc aaaattcgac tagaagtgga gatatattat 3060
ttatttatgc actgtactgt atttttatat tgctgtttaa aacttttaag ctgtgcctca 3120
cttattaaag cacaaaatgt tttacctact ccttatttac gacgcaataa aataacatca 3180
atagattttt aggctgaatt aatttgaaag cagcaatttg ctgttctcaa ccattctttc 3240
aaggcttttc attgttcaaa gttaataaaa aagtaggaca ataaagtgat gggtggcttt 3300
ta 3302
<210> 17
<211> 1464
<212> PRT
<213> 智人
<400> 17
Met Phe Ser Phe Val Asp Leu Arg Leu Leu Leu Leu Leu Ala Ala Thr
1 5 10 15
Ala Leu Leu Thr His Gly Gln Glu Glu Gly Gln Val Glu Gly Gln Asp
20 25 30
Glu Asp Ile Pro Pro Ile Thr Cys Val Gln Asn Gly Leu Arg Tyr His
35 40 45
Asp Arg Asp Val Trp Lys Pro Glu Pro Cys Arg Ile Cys Val Cys Asp
50 55 60
Asn Gly Lys Val Leu Cys Asp Asp Val Ile Cys Asp Glu Thr Lys Asn
65 70 75 80
Cys Pro Gly Ala Glu Val Pro Glu Gly Glu Cys Cys Pro Val Cys Pro
85 90 95
Asp Gly Ser Glu Ser Pro Thr Asp Gln Glu Thr Thr Gly Val Glu Gly
100 105 110
Pro Lys Gly Asp Thr Gly Pro Arg Gly Pro Arg Gly Pro Ala Gly Pro
115 120 125
Pro Gly Arg Asp Gly Ile Pro Gly Gln Pro Gly Leu Pro Gly Pro Pro
130 135 140
Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Leu Gly Gly Asn Phe Ala
145 150 155 160
Pro Gln Leu Ser Tyr Gly Tyr Asp Glu Lys Ser Thr Gly Gly Ile Ser
165 170 175
Val Pro Gly Pro Met Gly Pro Ser Gly Pro Arg Gly Leu Pro Gly Pro
180 185 190
Pro Gly Ala Pro Gly Pro Gln Gly Phe Gln Gly Pro Pro Gly Glu Pro
195 200 205
Gly Glu Pro Gly Ala Ser Gly Pro Met Gly Pro Arg Gly Pro Pro Gly
210 215 220
Pro Pro Gly Lys Asn Gly Asp Asp Gly Glu Ala Gly Lys Pro Gly Arg
225 230 235 240
Pro Gly Glu Arg Gly Pro Pro Gly Pro Gln Gly Ala Arg Gly Leu Pro
245 250 255
Gly Thr Ala Gly Leu Pro Gly Met Lys Gly His Arg Gly Phe Ser Gly
260 265 270
Leu Asp Gly Ala Lys Gly Asp Ala Gly Pro Ala Gly Pro Lys Gly Glu
275 280 285
Pro Gly Ser Pro Gly Glu Asn Gly Ala Pro Gly Gln Met Gly Pro Arg
290 295 300
Gly Leu Pro Gly Glu Arg Gly Arg Pro Gly Ala Pro Gly Pro Ala Gly
305 310 315 320
Ala Arg Gly Asn Asp Gly Ala Thr Gly Ala Ala Gly Pro Pro Gly Pro
325 330 335
Thr Gly Pro Ala Gly Pro Pro Gly Phe Pro Gly Ala Val Gly Ala Lys
340 345 350
Gly Glu Ala Gly Pro Gln Gly Pro Arg Gly Ser Glu Gly Pro Gln Gly
355 360 365
Val Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Ala Ala Gly Pro
370 375 380
Ala Gly Asn Pro Gly Ala Asp Gly Gln Pro Gly Ala Lys Gly Ala Asn
385 390 395 400
Gly Ala Pro Gly Ile Ala Gly Ala Pro Gly Phe Pro Gly Ala Arg Gly
405 410 415
Pro Ser Gly Pro Gln Gly Pro Gly Gly Pro Pro Gly Pro Lys Gly Asn
420 425 430
Ser Gly Glu Pro Gly Ala Pro Gly Ser Lys Gly Asp Thr Gly Ala Lys
435 440 445
Gly Glu Pro Gly Pro Val Gly Val Gln Gly Pro Pro Gly Pro Ala Gly
450 455 460
Glu Glu Gly Lys Arg Gly Ala Arg Gly Glu Pro Gly Pro Thr Gly Leu
465 470 475 480
Pro Gly Pro Pro Gly Glu Arg Gly Gly Pro Gly Ser Arg Gly Phe Pro
485 490 495
Gly Ala Asp Gly Val Ala Gly Pro Lys Gly Pro Ala Gly Glu Arg Gly
500 505 510
Ser Pro Gly Pro Ala Gly Pro Lys Gly Ser Pro Gly Glu Ala Gly Arg
515 520 525
Pro Gly Glu Ala Gly Leu Pro Gly Ala Lys Gly Leu Thr Gly Ser Pro
530 535 540
Gly Ser Pro Gly Pro Asp Gly Lys Thr Gly Pro Pro Gly Pro Ala Gly
545 550 555 560
Gln Asp Gly Arg Pro Gly Pro Pro Gly Pro Pro Gly Ala Arg Gly Gln
565 570 575
Ala Gly Val Met Gly Phe Pro Gly Pro Lys Gly Ala Ala Gly Glu Pro
580 585 590
Gly Lys Ala Gly Glu Arg Gly Val Pro Gly Pro Pro Gly Ala Val Gly
595 600 605
Pro Ala Gly Lys Asp Gly Glu Ala Gly Ala Gln Gly Pro Pro Gly Pro
610 615 620
Ala Gly Pro Ala Gly Glu Arg Gly Glu Gln Gly Pro Ala Gly Ser Pro
625 630 635 640
Gly Phe Gln Gly Leu Pro Gly Pro Ala Gly Pro Pro Gly Glu Ala Gly
645 650 655
Lys Pro Gly Glu Gln Gly Val Pro Gly Asp Leu Gly Ala Pro Gly Pro
660 665 670
Ser Gly Ala Arg Gly Glu Arg Gly Phe Pro Gly Glu Arg Gly Val Gln
675 680 685
Gly Pro Pro Gly Pro Ala Gly Pro Arg Gly Ala Asn Gly Ala Pro Gly
690 695 700
Asn Asp Gly Ala Lys Gly Asp Ala Gly Ala Pro Gly Ala Pro Gly Ser
705 710 715 720
Gln Gly Ala Pro Gly Leu Gln Gly Met Pro Gly Glu Arg Gly Ala Ala
725 730 735
Gly Leu Pro Gly Pro Lys Gly Asp Arg Gly Asp Ala Gly Pro Lys Gly
740 745 750
Ala Asp Gly Ser Pro Gly Lys Asp Gly Val Arg Gly Leu Thr Gly Pro
755 760 765
Ile Gly Pro Pro Gly Pro Ala Gly Ala Pro Gly Asp Lys Gly Glu Ser
770 775 780
Gly Pro Ser Gly Pro Ala Gly Pro Thr Gly Ala Arg Gly Ala Pro Gly
785 790 795 800
Asp Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Phe Ala Gly Pro
805 810 815
Pro Gly Ala Asp Gly Gln Pro Gly Ala Lys Gly Glu Pro Gly Asp Ala
820 825 830
Gly Ala Lys Gly Asp Ala Gly Pro Pro Gly Pro Ala Gly Pro Ala Gly
835 840 845
Pro Pro Gly Pro Ile Gly Asn Val Gly Ala Pro Gly Ala Lys Gly Ala
850 855 860
Arg Gly Ser Ala Gly Pro Pro Gly Ala Thr Gly Phe Pro Gly Ala Ala
865 870 875 880
Gly Arg Val Gly Pro Pro Gly Pro Ser Gly Asn Ala Gly Pro Pro Gly
885 890 895
Pro Pro Gly Pro Ala Gly Lys Glu Gly Gly Lys Gly Pro Arg Gly Glu
900 905 910
Thr Gly Pro Ala Gly Arg Pro Gly Glu Val Gly Pro Pro Gly Pro Pro
915 920 925
Gly Pro Ala Gly Glu Lys Gly Ser Pro Gly Ala Asp Gly Pro Ala Gly
930 935 940
Ala Pro Gly Thr Pro Gly Pro Gln Gly Ile Ala Gly Gln Arg Gly Val
945 950 955 960
Val Gly Leu Pro Gly Gln Arg Gly Glu Arg Gly Phe Pro Gly Leu Pro
965 970 975
Gly Pro Ser Gly Glu Pro Gly Lys Gln Gly Pro Ser Gly Ala Ser Gly
980 985 990
Glu Arg Gly Pro Pro Gly Pro Met Gly Pro Pro Gly Leu Ala Gly Pro
995 1000 1005
Pro Gly Glu Ser Gly Arg Glu Gly Ala Pro Gly Ala Glu Gly Ser
1010 1015 1020
Pro Gly Arg Asp Gly Ser Pro Gly Ala Lys Gly Asp Arg Gly Glu
1025 1030 1035
Thr Gly Pro Ala Gly Pro Pro Gly Ala Pro Gly Ala Pro Gly Ala
1040 1045 1050
Pro Gly Pro Val Gly Pro Ala Gly Lys Ser Gly Asp Arg Gly Glu
1055 1060 1065
Thr Gly Pro Ala Gly Pro Ala Gly Pro Val Gly Pro Val Gly Ala
1070 1075 1080
Arg Gly Pro Ala Gly Pro Gln Gly Pro Arg Gly Asp Lys Gly Glu
1085 1090 1095
Thr Gly Glu Gln Gly Asp Arg Gly Ile Lys Gly His Arg Gly Phe
1100 1105 1110
Ser Gly Leu Gln Gly Pro Pro Gly Pro Pro Gly Ser Pro Gly Glu
1115 1120 1125
Gln Gly Pro Ser Gly Ala Ser Gly Pro Ala Gly Pro Arg Gly Pro
1130 1135 1140
Pro Gly Ser Ala Gly Ala Pro Gly Lys Asp Gly Leu Asn Gly Leu
1145 1150 1155
Pro Gly Pro Ile Gly Pro Pro Gly Pro Arg Gly Arg Thr Gly Asp
1160 1165 1170
Ala Gly Pro Val Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Pro
1175 1180 1185
Pro Gly Pro Pro Ser Ala Gly Phe Asp Phe Ser Phe Leu Pro Gln
1190 1195 1200
Pro Pro Gln Glu Lys Ala His Asp Gly Gly Arg Tyr Tyr Arg Ala
1205 1210 1215
Asp Asp Ala Asn Val Val Arg Asp Arg Asp Leu Glu Val Asp Thr
1220 1225 1230
Thr Leu Lys Ser Leu Ser Gln Gln Ile Glu Asn Ile Arg Ser Pro
1235 1240 1245
Glu Gly Ser Arg Lys Asn Pro Ala Arg Thr Cys Arg Asp Leu Lys
1250 1255 1260
Met Cys His Ser Asp Trp Lys Ser Gly Glu Tyr Trp Ile Asp Pro
1265 1270 1275
Asn Gln Gly Cys Asn Leu Asp Ala Ile Lys Val Phe Cys Asn Met
1280 1285 1290
Glu Thr Gly Glu Thr Cys Val Tyr Pro Thr Gln Pro Ser Val Ala
1295 1300 1305
Gln Lys Asn Trp Tyr Ile Ser Lys Asn Pro Lys Asp Lys Arg His
1310 1315 1320
Val Trp Phe Gly Glu Ser Met Thr Asp Gly Phe Gln Phe Glu Tyr
1325 1330 1335
Gly Gly Gln Gly Ser Asp Pro Ala Asp Val Ala Ile Gln Leu Thr
1340 1345 1350
Phe Leu Arg Leu Met Ser Thr Glu Ala Ser Gln Asn Ile Thr Tyr
1355 1360 1365
His Cys Lys Asn Ser Val Ala Tyr Met Asp Gln Gln Thr Gly Asn
1370 1375 1380
Leu Lys Lys Ala Leu Leu Leu Gln Gly Ser Asn Glu Ile Glu Ile
1385 1390 1395
Arg Ala Glu Gly Asn Ser Arg Phe Thr Tyr Ser Val Thr Val Asp
1400 1405 1410
Gly Cys Thr Ser His Thr Gly Ala Trp Gly Lys Thr Val Ile Glu
1415 1420 1425
Tyr Lys Thr Thr Lys Thr Ser Arg Leu Pro Ile Ile Asp Val Ala
1430 1435 1440
Pro Leu Asp Val Gly Ala Pro Asp Gln Glu Phe Gly Phe Asp Val
1445 1450 1455
Gly Pro Val Cys Phe Leu
1460
<210> 18
<211> 1366
<212> PRT
<213> 智人
<400> 18
Met Leu Ser Phe Val Asp Thr Arg Thr Leu Leu Leu Leu Ala Val Thr
1 5 10 15
Leu Cys Leu Ala Thr Cys Gln Ser Leu Gln Glu Glu Thr Val Arg Lys
20 25 30
Gly Pro Ala Gly Asp Arg Gly Pro Arg Gly Glu Arg Gly Pro Pro Gly
35 40 45
Pro Pro Gly Arg Asp Gly Glu Asp Gly Pro Thr Gly Pro Pro Gly Pro
50 55 60
Pro Gly Pro Pro Gly Pro Pro Gly Leu Gly Gly Asn Phe Ala Ala Gln
65 70 75 80
Tyr Asp Gly Lys Gly Val Gly Leu Gly Pro Gly Pro Met Gly Leu Met
85 90 95
Gly Pro Arg Gly Pro Pro Gly Ala Ala Gly Ala Pro Gly Pro Gln Gly
100 105 110
Phe Gln Gly Pro Ala Gly Glu Pro Gly Glu Pro Gly Gln Thr Gly Pro
115 120 125
Ala Gly Ala Arg Gly Pro Ala Gly Pro Pro Gly Lys Ala Gly Glu Asp
130 135 140
Gly His Pro Gly Lys Pro Gly Arg Pro Gly Glu Arg Gly Val Val Gly
145 150 155 160
Pro Gln Gly Ala Arg Gly Phe Pro Gly Thr Pro Gly Leu Pro Gly Phe
165 170 175
Lys Gly Ile Arg Gly His Asn Gly Leu Asp Gly Leu Lys Gly Gln Pro
180 185 190
Gly Ala Pro Gly Val Lys Gly Glu Pro Gly Ala Pro Gly Glu Asn Gly
195 200 205
Thr Pro Gly Gln Thr Gly Ala Arg Gly Leu Pro Gly Glu Arg Gly Arg
210 215 220
Val Gly Ala Pro Gly Pro Ala Gly Ala Arg Gly Ser Asp Gly Ser Val
225 230 235 240
Gly Pro Val Gly Pro Ala Gly Pro Ile Gly Ser Ala Gly Pro Pro Gly
245 250 255
Phe Pro Gly Ala Pro Gly Pro Lys Gly Glu Ile Gly Ala Val Gly Asn
260 265 270
Ala Gly Pro Ala Gly Pro Ala Gly Pro Arg Gly Glu Val Gly Leu Pro
275 280 285
Gly Leu Ser Gly Pro Val Gly Pro Pro Gly Asn Pro Gly Ala Asn Gly
290 295 300
Leu Thr Gly Ala Lys Gly Ala Ala Gly Leu Pro Gly Val Ala Gly Ala
305 310 315 320
Pro Gly Leu Pro Gly Pro Arg Gly Ile Pro Gly Pro Val Gly Ala Ala
325 330 335
Gly Ala Thr Gly Ala Arg Gly Leu Val Gly Glu Pro Gly Pro Ala Gly
340 345 350
Ser Lys Gly Glu Ser Gly Asn Lys Gly Glu Pro Gly Ser Ala Gly Pro
355 360 365
Gln Gly Pro Pro Gly Pro Ser Gly Glu Glu Gly Lys Arg Gly Pro Asn
370 375 380
Gly Glu Ala Gly Ser Ala Gly Pro Pro Gly Pro Pro Gly Leu Arg Gly
385 390 395 400
Ser Pro Gly Ser Arg Gly Leu Pro Gly Ala Asp Gly Arg Ala Gly Val
405 410 415
Met Gly Pro Pro Gly Ser Arg Gly Ala Ser Gly Pro Ala Gly Val Arg
420 425 430
Gly Pro Asn Gly Asp Ala Gly Arg Pro Gly Glu Pro Gly Leu Met Gly
435 440 445
Pro Arg Gly Leu Pro Gly Ser Pro Gly Asn Ile Gly Pro Ala Gly Lys
450 455 460
Glu Gly Pro Val Gly Leu Pro Gly Ile Asp Gly Arg Pro Gly Pro Ile
465 470 475 480
Gly Pro Ala Gly Ala Arg Gly Glu Pro Gly Asn Ile Gly Phe Pro Gly
485 490 495
Pro Lys Gly Pro Thr Gly Asp Pro Gly Lys Asn Gly Asp Lys Gly His
500 505 510
Ala Gly Leu Ala Gly Ala Arg Gly Ala Pro Gly Pro Asp Gly Asn Asn
515 520 525
Gly Ala Gln Gly Pro Pro Gly Pro Gln Gly Val Gln Gly Gly Lys Gly
530 535 540
Glu Gln Gly Pro Pro Gly Pro Pro Gly Phe Gln Gly Leu Pro Gly Pro
545 550 555 560
Ser Gly Pro Ala Gly Glu Val Gly Lys Pro Gly Glu Arg Gly Leu His
565 570 575
Gly Glu Phe Gly Leu Pro Gly Pro Ala Gly Pro Arg Gly Glu Arg Gly
580 585 590
Pro Pro Gly Glu Ser Gly Ala Ala Gly Pro Thr Gly Pro Ile Gly Ser
595 600 605
Arg Gly Pro Ser Gly Pro Pro Gly Pro Asp Gly Asn Lys Gly Glu Pro
610 615 620
Gly Val Val Gly Ala Val Gly Thr Ala Gly Pro Ser Gly Pro Ser Gly
625 630 635 640
Leu Pro Gly Glu Arg Gly Ala Ala Gly Ile Pro Gly Gly Lys Gly Glu
645 650 655
Lys Gly Glu Pro Gly Leu Arg Gly Glu Ile Gly Asn Pro Gly Arg Asp
660 665 670
Gly Ala Arg Gly Ala Pro Gly Ala Val Gly Ala Pro Gly Pro Ala Gly
675 680 685
Ala Thr Gly Asp Arg Gly Glu Ala Gly Ala Ala Gly Pro Ala Gly Pro
690 695 700
Ala Gly Pro Arg Gly Ser Pro Gly Glu Arg Gly Glu Val Gly Pro Ala
705 710 715 720
Gly Pro Asn Gly Phe Ala Gly Pro Ala Gly Ala Ala Gly Gln Pro Gly
725 730 735
Ala Lys Gly Glu Arg Gly Ala Lys Gly Pro Lys Gly Glu Asn Gly Val
740 745 750
Val Gly Pro Thr Gly Pro Val Gly Ala Ala Gly Pro Ala Gly Pro Asn
755 760 765
Gly Pro Pro Gly Pro Ala Gly Ser Arg Gly Asp Gly Gly Pro Pro Gly
770 775 780
Met Thr Gly Phe Pro Gly Ala Ala Gly Arg Thr Gly Pro Pro Gly Pro
785 790 795 800
Ser Gly Ile Ser Gly Pro Pro Gly Pro Pro Gly Pro Ala Gly Lys Glu
805 810 815
Gly Leu Arg Gly Pro Arg Gly Asp Gln Gly Pro Val Gly Arg Thr Gly
820 825 830
Glu Val Gly Ala Val Gly Pro Pro Gly Phe Ala Gly Glu Lys Gly Pro
835 840 845
Ser Gly Glu Ala Gly Thr Ala Gly Pro Pro Gly Thr Pro Gly Pro Gln
850 855 860
Gly Leu Leu Gly Ala Pro Gly Ile Leu Gly Leu Pro Gly Ser Arg Gly
865 870 875 880
Glu Arg Gly Leu Pro Gly Val Ala Gly Ala Val Gly Glu Pro Gly Pro
885 890 895
Leu Gly Ile Ala Gly Pro Pro Gly Ala Arg Gly Pro Pro Gly Ala Val
900 905 910
Gly Ser Pro Gly Val Asn Gly Ala Pro Gly Glu Ala Gly Arg Asp Gly
915 920 925
Asn Pro Gly Asn Asp Gly Pro Pro Gly Arg Asp Gly Gln Pro Gly His
930 935 940
Lys Gly Glu Arg Gly Tyr Pro Gly Asn Ile Gly Pro Val Gly Ala Ala
945 950 955 960
Gly Ala Pro Gly Pro His Gly Pro Val Gly Pro Ala Gly Lys His Gly
965 970 975
Asn Arg Gly Glu Thr Gly Pro Ser Gly Pro Val Gly Pro Ala Gly Ala
980 985 990
Val Gly Pro Arg Gly Pro Ser Gly Pro Gln Gly Ile Arg Gly Asp Lys
995 1000 1005
Gly Glu Pro Gly Glu Lys Gly Pro Arg Gly Leu Pro Gly Leu Lys
1010 1015 1020
Gly His Asn Gly Leu Gln Gly Leu Pro Gly Ile Ala Gly His His
1025 1030 1035
Gly Asp Gln Gly Ala Pro Gly Ser Val Gly Pro Ala Gly Pro Arg
1040 1045 1050
Gly Pro Ala Gly Pro Ser Gly Pro Ala Gly Lys Asp Gly Arg Thr
1055 1060 1065
Gly His Pro Gly Thr Val Gly Pro Ala Gly Ile Arg Gly Pro Gln
1070 1075 1080
Gly His Gln Gly Pro Ala Gly Pro Pro Gly Pro Pro Gly Pro Pro
1085 1090 1095
Gly Pro Pro Gly Val Ser Gly Gly Gly Tyr Asp Phe Gly Tyr Asp
1100 1105 1110
Gly Asp Phe Tyr Arg Ala Asp Gln Pro Arg Ser Ala Pro Ser Leu
1115 1120 1125
Arg Pro Lys Asp Tyr Glu Val Asp Ala Thr Leu Lys Ser Leu Asn
1130 1135 1140
Asn Gln Ile Glu Thr Leu Leu Thr Pro Glu Gly Ser Arg Lys Asn
1145 1150 1155
Pro Ala Arg Thr Cys Arg Asp Leu Arg Leu Ser His Pro Glu Trp
1160 1165 1170
Ser Ser Gly Tyr Tyr Trp Ile Asp Pro Asn Gln Gly Cys Thr Met
1175 1180 1185
Asp Ala Ile Lys Val Tyr Cys Asp Phe Ser Thr Gly Glu Thr Cys
1190 1195 1200
Ile Arg Ala Gln Pro Glu Asn Ile Pro Ala Lys Asn Trp Tyr Arg
1205 1210 1215
Ser Ser Lys Asp Lys Lys His Val Trp Leu Gly Glu Thr Ile Asn
1220 1225 1230
Ala Gly Ser Gln Phe Glu Tyr Asn Val Glu Gly Val Thr Ser Lys
1235 1240 1245
Glu Met Ala Thr Gln Leu Ala Phe Met Arg Leu Leu Ala Asn Tyr
1250 1255 1260
Ala Ser Gln Asn Ile Thr Tyr His Cys Lys Asn Ser Ile Ala Tyr
1265 1270 1275
Met Asp Glu Glu Thr Gly Asn Leu Lys Lys Ala Val Ile Leu Gln
1280 1285 1290
Gly Ser Asn Asp Val Glu Leu Val Ala Glu Gly Asn Ser Arg Phe
1295 1300 1305
Thr Tyr Thr Val Leu Val Asp Gly Cys Ser Lys Lys Thr Asn Glu
1310 1315 1320
Trp Gly Lys Thr Ile Ile Glu Tyr Lys Thr Asn Lys Pro Ser Arg
1325 1330 1335
Leu Pro Phe Leu Asp Ile Ala Pro Leu Asp Ile Gly Gly Ala Asp
1340 1345 1350
Gln Glu Phe Phe Val Asp Ile Gly Pro Val Cys Phe Lys
1355 1360 1365
<210> 19
<211> 1466
<212> PRT
<213> 智人
<400> 19
Met Met Ser Phe Val Gln Lys Gly Ser Trp Leu Leu Leu Ala Leu Leu
1 5 10 15
His Pro Thr Ile Ile Leu Ala Gln Gln Glu Ala Val Glu Gly Gly Cys
20 25 30
Ser His Leu Gly Gln Ser Tyr Ala Asp Arg Asp Val Trp Lys Pro Glu
35 40 45
Pro Cys Gln Ile Cys Val Cys Asp Ser Gly Ser Val Leu Cys Asp Asp
50 55 60
Ile Ile Cys Asp Asp Gln Glu Leu Asp Cys Pro Asn Pro Glu Ile Pro
65 70 75 80
Phe Gly Glu Cys Cys Ala Val Cys Pro Gln Pro Pro Thr Ala Pro Thr
85 90 95
Arg Pro Pro Asn Gly Gln Gly Pro Gln Gly Pro Lys Gly Asp Pro Gly
100 105 110
Pro Pro Gly Ile Pro Gly Arg Asn Gly Asp Pro Gly Ile Pro Gly Gln
115 120 125
Pro Gly Ser Pro Gly Ser Pro Gly Pro Pro Gly Ile Cys Glu Ser Cys
130 135 140
Pro Thr Gly Pro Gln Asn Tyr Ser Pro Gln Tyr Asp Ser Tyr Asp Val
145 150 155 160
Lys Ser Gly Val Ala Val Gly Gly Leu Ala Gly Tyr Pro Gly Pro Ala
165 170 175
Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly Thr Ser Gly His Pro Gly
180 185 190
Ser Pro Gly Ser Pro Gly Tyr Gln Gly Pro Pro Gly Glu Pro Gly Gln
195 200 205
Ala Gly Pro Ser Gly Pro Pro Gly Pro Pro Gly Ala Ile Gly Pro Ser
210 215 220
Gly Pro Ala Gly Lys Asp Gly Glu Ser Gly Arg Pro Gly Arg Pro Gly
225 230 235 240
Glu Arg Gly Leu Pro Gly Pro Pro Gly Ile Lys Gly Pro Ala Gly Ile
245 250 255
Pro Gly Phe Pro Gly Met Lys Gly His Arg Gly Phe Asp Gly Arg Asn
260 265 270
Gly Glu Lys Gly Glu Thr Gly Ala Pro Gly Leu Lys Gly Glu Asn Gly
275 280 285
Leu Pro Gly Glu Asn Gly Ala Pro Gly Pro Met Gly Pro Arg Gly Ala
290 295 300
Pro Gly Glu Arg Gly Arg Pro Gly Leu Pro Gly Ala Ala Gly Ala Arg
305 310 315 320
Gly Asn Asp Gly Ala Arg Gly Ser Asp Gly Gln Pro Gly Pro Pro Gly
325 330 335
Pro Pro Gly Thr Ala Gly Phe Pro Gly Ser Pro Gly Ala Lys Gly Glu
340 345 350
Val Gly Pro Ala Gly Ser Pro Gly Ser Asn Gly Ala Pro Gly Gln Arg
355 360 365
Gly Glu Pro Gly Pro Gln Gly His Ala Gly Ala Gln Gly Pro Pro Gly
370 375 380
Pro Pro Gly Ile Asn Gly Ser Pro Gly Gly Lys Gly Glu Met Gly Pro
385 390 395 400
Ala Gly Ile Pro Gly Ala Pro Gly Leu Met Gly Ala Arg Gly Pro Pro
405 410 415
Gly Pro Ala Gly Ala Asn Gly Ala Pro Gly Leu Arg Gly Gly Ala Gly
420 425 430
Glu Pro Gly Lys Asn Gly Ala Lys Gly Glu Pro Gly Pro Arg Gly Glu
435 440 445
Arg Gly Glu Ala Gly Ile Pro Gly Val Pro Gly Ala Lys Gly Glu Asp
450 455 460
Gly Lys Asp Gly Ser Pro Gly Glu Pro Gly Ala Asn Gly Leu Pro Gly
465 470 475 480
Ala Ala Gly Glu Arg Gly Ala Pro Gly Phe Arg Gly Pro Ala Gly Pro
485 490 495
Asn Gly Ile Pro Gly Glu Lys Gly Pro Ala Gly Glu Arg Gly Ala Pro
500 505 510
Gly Pro Ala Gly Pro Arg Gly Ala Ala Gly Glu Pro Gly Arg Asp Gly
515 520 525
Val Pro Gly Gly Pro Gly Met Arg Gly Met Pro Gly Ser Pro Gly Gly
530 535 540
Pro Gly Ser Asp Gly Lys Pro Gly Pro Pro Gly Ser Gln Gly Glu Ser
545 550 555 560
Gly Arg Pro Gly Pro Pro Gly Pro Ser Gly Pro Arg Gly Gln Pro Gly
565 570 575
Val Met Gly Phe Pro Gly Pro Lys Gly Asn Asp Gly Ala Pro Gly Lys
580 585 590
Asn Gly Glu Arg Gly Gly Pro Gly Gly Pro Gly Pro Gln Gly Pro Pro
595 600 605
Gly Lys Asn Gly Glu Thr Gly Pro Gln Gly Pro Pro Gly Pro Thr Gly
610 615 620
Pro Gly Gly Asp Lys Gly Asp Thr Gly Pro Pro Gly Pro Gln Gly Leu
625 630 635 640
Gln Gly Leu Pro Gly Thr Gly Gly Pro Pro Gly Glu Asn Gly Lys Pro
645 650 655
Gly Glu Pro Gly Pro Lys Gly Asp Ala Gly Ala Pro Gly Ala Pro Gly
660 665 670
Gly Lys Gly Asp Ala Gly Ala Pro Gly Glu Arg Gly Pro Pro Gly Leu
675 680 685
Ala Gly Ala Pro Gly Leu Arg Gly Gly Ala Gly Pro Pro Gly Pro Glu
690 695 700
Gly Gly Lys Gly Ala Ala Gly Pro Pro Gly Pro Pro Gly Ala Ala Gly
705 710 715 720
Thr Pro Gly Leu Gln Gly Met Pro Gly Glu Arg Gly Gly Leu Gly Ser
725 730 735
Pro Gly Pro Lys Gly Asp Lys Gly Glu Pro Gly Gly Pro Gly Ala Asp
740 745 750
Gly Val Pro Gly Lys Asp Gly Pro Arg Gly Pro Thr Gly Pro Ile Gly
755 760 765
Pro Pro Gly Pro Ala Gly Gln Pro Gly Asp Lys Gly Glu Gly Gly Ala
770 775 780
Pro Gly Leu Pro Gly Ile Ala Gly Pro Arg Gly Ser Pro Gly Glu Arg
785 790 795 800
Gly Glu Thr Gly Pro Pro Gly Pro Ala Gly Phe Pro Gly Ala Pro Gly
805 810 815
Gln Asn Gly Glu Pro Gly Gly Lys Gly Glu Arg Gly Ala Pro Gly Glu
820 825 830
Lys Gly Glu Gly Gly Pro Pro Gly Val Ala Gly Pro Pro Gly Gly Ser
835 840 845
Gly Pro Ala Gly Pro Pro Gly Pro Gln Gly Val Lys Gly Glu Arg Gly
850 855 860
Ser Pro Gly Gly Pro Gly Ala Ala Gly Phe Pro Gly Ala Arg Gly Leu
865 870 875 880
Pro Gly Pro Pro Gly Ser Asn Gly Asn Pro Gly Pro Pro Gly Pro Ser
885 890 895
Gly Ser Pro Gly Lys Asp Gly Pro Pro Gly Pro Ala Gly Asn Thr Gly
900 905 910
Ala Pro Gly Ser Pro Gly Val Ser Gly Pro Lys Gly Asp Ala Gly Gln
915 920 925
Pro Gly Glu Lys Gly Ser Pro Gly Ala Gln Gly Pro Pro Gly Ala Pro
930 935 940
Gly Pro Leu Gly Ile Ala Gly Ile Thr Gly Ala Arg Gly Leu Ala Gly
945 950 955 960
Pro Pro Gly Met Pro Gly Pro Arg Gly Ser Pro Gly Pro Gln Gly Val
965 970 975
Lys Gly Glu Ser Gly Lys Pro Gly Ala Asn Gly Leu Ser Gly Glu Arg
980 985 990
Gly Pro Pro Gly Pro Gln Gly Leu Pro Gly Leu Ala Gly Thr Ala Gly
995 1000 1005
Glu Pro Gly Arg Asp Gly Asn Pro Gly Ser Asp Gly Leu Pro Gly
1010 1015 1020
Arg Asp Gly Ser Pro Gly Gly Lys Gly Asp Arg Gly Glu Asn Gly
1025 1030 1035
Ser Pro Gly Ala Pro Gly Ala Pro Gly His Pro Gly Pro Pro Gly
1040 1045 1050
Pro Val Gly Pro Ala Gly Lys Ser Gly Asp Arg Gly Glu Ser Gly
1055 1060 1065
Pro Ala Gly Pro Ala Gly Ala Pro Gly Pro Ala Gly Ser Arg Gly
1070 1075 1080
Ala Pro Gly Pro Gln Gly Pro Arg Gly Asp Lys Gly Glu Thr Gly
1085 1090 1095
Glu Arg Gly Ala Ala Gly Ile Lys Gly His Arg Gly Phe Pro Gly
1100 1105 1110
Asn Pro Gly Ala Pro Gly Ser Pro Gly Pro Ala Gly Gln Gln Gly
1115 1120 1125
Ala Ile Gly Ser Pro Gly Pro Ala Gly Pro Arg Gly Pro Val Gly
1130 1135 1140
Pro Ser Gly Pro Pro Gly Lys Asp Gly Thr Ser Gly His Pro Gly
1145 1150 1155
Pro Ile Gly Pro Pro Gly Pro Arg Gly Asn Arg Gly Glu Arg Gly
1160 1165 1170
Ser Glu Gly Ser Pro Gly His Pro Gly Gln Pro Gly Pro Pro Gly
1175 1180 1185
Pro Pro Gly Ala Pro Gly Pro Cys Cys Gly Gly Val Gly Ala Ala
1190 1195 1200
Ala Ile Ala Gly Ile Gly Gly Glu Lys Ala Gly Gly Phe Ala Pro
1205 1210 1215
Tyr Tyr Gly Asp Glu Pro Met Asp Phe Lys Ile Asn Thr Asp Glu
1220 1225 1230
Ile Met Thr Ser Leu Lys Ser Val Asn Gly Gln Ile Glu Ser Leu
1235 1240 1245
Ile Ser Pro Asp Gly Ser Arg Lys Asn Pro Ala Arg Asn Cys Arg
1250 1255 1260
Asp Leu Lys Phe Cys His Pro Glu Leu Lys Ser Gly Glu Tyr Trp
1265 1270 1275
Val Asp Pro Asn Gln Gly Cys Lys Leu Asp Ala Ile Lys Val Phe
1280 1285 1290
Cys Asn Met Glu Thr Gly Glu Thr Cys Ile Ser Ala Asn Pro Leu
1295 1300 1305
Asn Val Pro Arg Lys His Trp Trp Thr Asp Ser Ser Ala Glu Lys
1310 1315 1320
Lys His Val Trp Phe Gly Glu Ser Met Asp Gly Gly Phe Gln Phe
1325 1330 1335
Ser Tyr Gly Asn Pro Glu Leu Pro Glu Asp Val Leu Asp Val His
1340 1345 1350
Leu Ala Phe Leu Arg Leu Leu Ser Ser Arg Ala Ser Gln Asn Ile
1355 1360 1365
Thr Tyr His Cys Lys Asn Ser Ile Ala Tyr Met Asp Gln Ala Ser
1370 1375 1380
Gly Asn Val Lys Lys Ala Leu Lys Leu Met Gly Ser Asn Glu Gly
1385 1390 1395
Glu Phe Lys Ala Glu Gly Asn Ser Lys Phe Thr Tyr Thr Val Leu
1400 1405 1410
Glu Asp Gly Cys Thr Lys His Thr Gly Glu Trp Ser Lys Thr Val
1415 1420 1425
Phe Glu Tyr Arg Thr Arg Lys Ala Val Arg Leu Pro Ile Val Asp
1430 1435 1440
Ile Ala Pro Tyr Asp Ile Gly Gly Pro Asp Gln Glu Phe Gly Val
1445 1450 1455
Asp Val Gly Pro Val Cys Phe Leu
1460 1465
<210> 20
<211> 680
<212> PRT
<213> 智人
<400> 20
Met Leu Pro Gln Ile Pro Phe Leu Leu Leu Val Ser Leu Asn Leu Val
1 5 10 15
His Gly Val Phe Tyr Ala Glu Arg Tyr Gln Met Pro Thr Gly Ile Lys
20 25 30
Gly Pro Leu Pro Asn Thr Lys Thr Gln Phe Phe Ile Pro Tyr Thr Ile
35 40 45
Lys Ser Lys Gly Ile Ala Val Arg Gly Glu Gln Gly Thr Pro Gly Pro
50 55 60
Pro Gly Pro Ala Gly Pro Arg Gly His Pro Gly Pro Ser Gly Pro Pro
65 70 75 80
Gly Lys Pro Gly Tyr Gly Ser Pro Gly Leu Gln Gly Glu Pro Gly Leu
85 90 95
Pro Gly Pro Pro Gly Pro Ser Ala Val Gly Lys Pro Gly Val Pro Gly
100 105 110
Leu Pro Gly Lys Pro Gly Glu Arg Gly Pro Tyr Gly Pro Lys Gly Asp
115 120 125
Val Gly Pro Ala Gly Leu Pro Gly Pro Arg Gly Pro Pro Gly Pro Pro
130 135 140
Gly Ile Pro Gly Pro Ala Gly Ile Ser Val Pro Gly Lys Pro Gly Gln
145 150 155 160
Gln Gly Pro Thr Gly Ala Pro Gly Pro Arg Gly Phe Pro Gly Glu Lys
165 170 175
Gly Ala Pro Gly Val Pro Gly Met Asn Gly Gln Lys Gly Glu Met Gly
180 185 190
Tyr Gly Ala Pro Gly Arg Pro Gly Glu Arg Gly Leu Pro Gly Pro Gln
195 200 205
Gly Pro Thr Gly Pro Ser Gly Pro Pro Gly Val Gly Lys Arg Gly Glu
210 215 220
Asn Gly Val Pro Gly Gln Pro Gly Ile Lys Gly Asp Arg Gly Phe Pro
225 230 235 240
Gly Glu Met Gly Pro Ile Gly Pro Pro Gly Pro Gln Gly Pro Pro Gly
245 250 255
Glu Arg Gly Pro Glu Gly Ile Gly Lys Pro Gly Ala Ala Gly Ala Pro
260 265 270
Gly Gln Pro Gly Ile Pro Gly Thr Lys Gly Leu Pro Gly Ala Pro Gly
275 280 285
Ile Ala Gly Pro Pro Gly Pro Pro Gly Phe Gly Lys Pro Gly Leu Pro
290 295 300
Gly Leu Lys Gly Glu Arg Gly Pro Ala Gly Leu Pro Gly Gly Pro Gly
305 310 315 320
Ala Lys Gly Glu Gln Gly Pro Ala Gly Leu Pro Gly Lys Pro Gly Leu
325 330 335
Thr Gly Pro Pro Gly Asn Met Gly Pro Gln Gly Pro Lys Gly Ile Pro
340 345 350
Gly Ser His Gly Leu Pro Gly Pro Lys Gly Glu Thr Gly Pro Ala Gly
355 360 365
Pro Ala Gly Tyr Pro Gly Ala Lys Gly Glu Arg Gly Ser Pro Gly Ser
370 375 380
Asp Gly Lys Pro Gly Tyr Pro Gly Lys Pro Gly Leu Asp Gly Pro Lys
385 390 395 400
Gly Asn Pro Gly Leu Pro Gly Pro Lys Gly Asp Pro Gly Val Gly Gly
405 410 415
Pro Pro Gly Leu Pro Gly Pro Val Gly Pro Ala Gly Ala Lys Gly Met
420 425 430
Pro Gly His Asn Gly Glu Ala Gly Pro Arg Gly Ala Pro Gly Ile Pro
435 440 445
Gly Thr Arg Gly Pro Ile Gly Pro Pro Gly Ile Pro Gly Phe Pro Gly
450 455 460
Ser Lys Gly Asp Pro Gly Ser Pro Gly Pro Pro Gly Pro Ala Gly Ile
465 470 475 480
Ala Thr Lys Gly Leu Asn Gly Pro Thr Gly Pro Pro Gly Pro Pro Gly
485 490 495
Pro Arg Gly His Ser Gly Glu Pro Gly Leu Pro Gly Pro Pro Gly Pro
500 505 510
Pro Gly Pro Pro Gly Gln Ala Val Met Pro Glu Gly Phe Ile Lys Ala
515 520 525
Gly Gln Arg Pro Ser Leu Ser Gly Thr Pro Leu Val Ser Ala Asn Gln
530 535 540
Gly Val Thr Gly Met Pro Val Ser Ala Phe Thr Val Ile Leu Ser Lys
545 550 555 560
Ala Tyr Pro Ala Ile Gly Thr Pro Ile Pro Phe Asp Lys Ile Leu Tyr
565 570 575
Asn Arg Gln Gln His Tyr Asp Pro Arg Thr Gly Ile Phe Thr Cys Gln
580 585 590
Ile Pro Gly Ile Tyr Tyr Phe Ser Tyr His Val His Val Lys Gly Thr
595 600 605
His Val Trp Val Gly Leu Tyr Lys Asn Gly Thr Pro Val Met Tyr Thr
610 615 620
Tyr Asp Glu Tyr Thr Lys Gly Tyr Leu Asp Gln Ala Ser Gly Ser Ala
625 630 635 640
Ile Ile Asp Leu Thr Glu Asn Asp Gln Val Trp Leu Gln Leu Pro Asn
645 650 655
Ala Glu Ser Asn Gly Leu Tyr Ser Ser Glu Tyr Val His Ser Ser Phe
660 665 670
Ser Gly Phe Leu Val Ala Pro Met
675 680
<210> 21
<211> 9823
<212> DNA
<213> 人工序列
<400> 21
gacggatcgg gagatctccc gatcccctat ggtgcactct cagtacaatc tgctctgatg 60
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420
attgacgtca atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt 480
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagc 900
gtttaaactt aagcttggta ccgagctcgg atccactagt ccagtgtggt ggaattcatg 960
ttcagctttg tggacctccg gctcctgctc ctcttagcgg ccaccgccct cctgacgcac 1020
ggccaagagg aaggccaagt cgagggccaa gacgaagaca tcccaccaat cacctgcgta 1080
cagaacggcc tcaggtacca tgaccgagac gtgtggaaac ccgagccctg ccggatctgc 1140
gtctgcgaca acggcaaggt gttgtgcgat gacgtgatct gtgacgagac caagaactgc 1200
cccggcgccg aagtccccga gggcgagtgc tgtcccgtct gccccgacgg ctcagagtca 1260
cccaccgacc aagaaaccac cggcgtcgag ggacccaagg gagacactgg cccccgaggc 1320
ccaaggggac ccgcaggccc ccctggccga gatggcatcc ctggacagcc tggacttccc 1380
ggaccccccg gaccccccgg acctcccgga ccccctggcc tcggaggaaa ctttgctccc 1440
cagctgtctt atggctatga tgagaaatca accggaggaa tttccgtgcc tggccccatg 1500
ggtccctctg gtcctcgtgg tctccctggc ccccctggtg cacctggtcc ccaaggcttc 1560
caaggtcccc ctggtgagcc tggcgagcct ggagcttcag gtcccatggg tccccgaggt 1620
cccccaggtc cccctggaaa gaatggagat gatggggaag ctggaaaacc tggtcgtcct 1680
ggtgagcgtg ggcctcctgg gcctcagggt gctcgaggat tgcccggaac agctggcctc 1740
cctggaatga agggacacag aggtttcagt ggtttggatg gtgccaaggg agatgctggt 1800
cctgctggtc ctaagggtga gcctggcagc cctggtgaaa atggagctcc tggtcagatg 1860
ggcccccgtg gcctgcctgg tgagagaggt cgccctggag cccctggccc tgctggtgct 1920
cgtggaaatg atggtgctac tggtgctgcc gggccccctg gtcccaccgg ccccgctggt 1980
cctcctggct tccctggtgc tgttggtgct aagggtgaag ctggtcccca agggccccga 2040
ggctctgaag gtccccaggg tgtgcgtggt gagcctggcc cccctggccc tgctggtgct 2100
gctggccctg ctggaaaccc tggtgctgat ggacagcctg gtgctaaagg tgccaatggt 2160
gctcctggta ttgctggtgc tcctggcttc cctggtgccc gaggcccctc tggaccccag 2220
ggccccggcg gccctcctgg tcccaagggt aacagcggtg aacctggtgc tcctggcagc 2280
aaaggagaca ctggtgctaa gggagagcct ggccctgttg gtgttcaagg accccctggc 2340
cctgctggag aggaaggaaa gcgaggagct cgaggtgaac ccggacccac tggcctgccc 2400
ggaccccctg gcgagcgtgg tggacctggt agccgtggtt tccctggcgc agatggtgtt 2460
gctggtccca agggtcccgc tggtgaacgt ggttctcctg gccctgctgg ccccaaagga 2520
tctcctggtg aagctggtcg tcccggtgaa gctggtctgc ctggtgccaa gggtctgact 2580
ggaagccctg gcagccctgg tcctgatggc aaaactggcc cccctggtcc cgccggtcaa 2640
gatggtcgcc ccggaccccc aggcccacct ggtgcccgtg gtcaggctgg tgtgatggga 2700
ttccctggac ctaaaggtgc tgctggagag cccggcaagg ctggagagcg aggtgttccc 2760
ggaccccctg gcgctgtcgg tcctgctggc aaagatggag aggctggagc tcagggaccc 2820
cctggccctg ctggtcccgc tggcgagaga ggtgaacaag gccctgctgg ctcccccgga 2880
ttccagggtc tccctggtcc tgctggtcct ccaggtgaag caggcaaacc tggtgaacag 2940
ggtgttcctg gagaccttgg cgcccctggc ccctctggag caagaggcga gagaggtttc 3000
cctggcgagc gtggtgtgca aggtccccct ggtcctgctg gtccccgagg ggccaacggt 3060
gctcccggca acgatggtgc taagggtgat gctggtgccc ctggagctcc cggtagccag 3120
ggcgcccctg gccttcaggg aatgcctggt gaacgtggtg cagctggtct tccagggcct 3180
aagggtgaca gaggtgatgc tggtcccaaa ggtgctgatg gctctcctgg caaagatggc 3240
gtccgtggtc tgactggccc cattggtcct cctggccctg ctggtgcccc tggtgacaag 3300
ggtgaaagtg gtcccagcgg ccctgctggt cccactggag ctcgtggtgc ccccggagac 3360
cgtggtgagc ctggtccccc cggccctgct ggctttgctg gcccccctgg tgctgacggc 3420
caacctggtg ctaaaggcga acctggtgat gctggtgcta aaggcgatgc tggtccccct 3480
ggccctgccg gacccgctgg accccctggc cccattggta atgttggtgc tcctggagcc 3540
aaaggtgctc gcggcagcgc tggtccccct ggtgctactg gtttccctgg tgctgctggc 3600
cgagtcggtc ctcctggccc ctctggaaat gctggacccc ctggccctcc tggtcctgct 3660
ggcaaagaag gcggcaaagg tccccgtggt gagactggcc ctgctggacg tcctggtgaa 3720
gttggtcccc ctggtccccc tggccctgct ggcgagaaag gatcccctgg tgctgatggt 3780
cctgctggtg ctcctggtac tcccgggcct caaggtattg ctggacagcg tggtgtggtc 3840
ggcctgcctg gtcagagagg agagagaggc ttccctggtc ttcctggccc ctctggtgaa 3900
cctggcaaac aaggtccctc tggagcaagt ggtgaacgtg gtccccctgg tcccatgggc 3960
ccccctggat tggctggacc ccctggtgaa tctggacgtg agggggctcc tggtgccgaa 4020
ggttcccctg gacgagacgg ttctcctggc gccaagggtg accgtggtga gaccggcccc 4080
gctggacccc ctggtgctcc tggtgctcct ggtgcccctg gccccgttgg ccctgctggc 4140
aagagtggtg atcgtggtga gactggtcct gctggtcccg ccggtcctgt cggccctgtt 4200
ggcgcccgtg gccccgccgg accccaaggc ccccgtggtg acaagggtga gacaggcgaa 4260
cagggcgaca gaggcataaa gggtcaccgt ggcttctctg gcctccaggg tccccctggc 4320
cctcctggct ctcctggtga acaaggtccc tctggagcct ctggtcctgc tggtccccga 4380
ggtccccctg gctctgctgg tgctcctggc aaagatggac tcaacggtct ccctggcccc 4440
attgggcccc ctggtcctcg cggtcgcact ggtgatgctg gtcctgttgg tccccccggc 4500
cctcctggac ctcctggtcc ccctggtcct cccagcgctg gtttcgactt cagcttcctg 4560
ccccagccac ctcaagagaa ggctcacgat ggtggccgct actaccgggc tgatgatgcc 4620
aatgtggttc gtgaccgtga cctcgaggtg gacaccaccc tcaagagcct gagccagcag 4680
atcgagaaca tccggagccc agagggcagc cgcaagaacc ccgcccgcac ctgccgtgac 4740
ctcaagatgt gccactctga ctggaagagt ggagagtact ggattgaccc caaccaaggc 4800
tgcaacctgg atgccatcaa agtcttctgc aacatggaga ctggtgagac ctgcgtgtac 4860
cccactcagc ccagtgtggc ccagaagaac tggtacatca gcaagaaccc caaggacaag 4920
aggcatgtct ggttcggcga gagcatgacc gatggattcc agttcgagta tggcggccag 4980
ggctccgacc ctgccgatgt ggccatccag ctgaccttcc tgcgcctgat gtccaccgag 5040
gcctcccaga acatcaccta ccactgcaag aacagcgtgg cctacatgga ccagcagact 5100
ggcaacctca agaaggccct gctcctccag ggctccaacg agatcgagat ccgcgccgag 5160
ggcaacagcc gcttcaccta cagcgtcact gtcgatggct gcacgagtca caccggagcc 5220
tggggcaaga cagtgattga atacaaaacc accaagacct cccgcctgcc catcatcgat 5280
gtggccccct tggacgttgg tgccccagac caggaattcg gcttcgacgt tggccctgtc 5340
tgcttcctgt aatgcagata tccagcacag tggcggccgc tcgagtctag agggcccgtt 5400
taaacccgct gatcagcctc gactgtgcct tctagttgcc agccatctgt tgtttgcccc 5460
tcccccgtgc cttccttgac cctggaaggt gccactccca ctgtcctttc ctaataaaat 5520
gaggaaattg catcgcattg tctgagtagg tgtcattcta ttctgggggg tggggtgggg 5580
caggacagca agggggagga ttgggaagac aatagcaggc atgctgggga tgcggtgggc 5640
tctatggctt ctgaggcgga aagaaccagc tggggctcta gggggtatcc ccacgcgccc 5700
tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt 5760
gccagcgccc tagcgcccgc tcctttcgct ttcttccctt cctttctcgc cacgttcgcc 5820
ggctttcccc gtcaagctct aaatcggggg ctccctttag ggttccgatt tagtgcttta 5880
cggcacctcg accccaaaaa acttgattag ggtgatggtt cacgtagtgg gccatcgccc 5940
tgatagacgg tttttcgccc tttgacgttg gagtccacgt tctttaatag tggactcttg 6000
ttccaaactg gaacaacact caaccctatc tcggtctatt cttttgattt ataagggatt 6060
ttgccgattt cggcctattg gttaaaaaat gagctgattt aacaaaaatt taacgcgaat 6120
taattctgtg gaatgtgtgt cagttagggt gtggaaagtc cccaggctcc ccagcaggca 6180
gaagtatgca aagcatgcat ctcaattagt cagcaaccag gtgtggaaag tccccaggct 6240
ccccagcagg cagaagtatg caaagcatgc atctcaatta gtcagcaacc atagtcccgc 6300
ccctaactcc gcccatcccg cccctaactc cgcccagttc cgcccattct ccgccccatg 6360
gctgactaat tttttttatt tatgcagagg ccgaggccgc ctctgcctct gagctattcc 6420
agaagtagtg aggaggcttt tttggaggcc taggcttttg caaaaagctc ccgggagctt 6480
gtatatccat tttcggatct gatcaagaga caggatgagg atcgtttcgc atgattgaac 6540
aagatggatt gcacgcaggt tctccggccg cttgggtgga gaggctattc ggctatgact 6600
gggcacaaca gacaatcggc tgctctgatg ccgccgtgtt ccggctgtca gcgcaggggc 6660
gcccggttct ttttgtcaag accgacctgt ccggtgccct gaatgaactg caggacgagg 6720
cagcgcggct atcgtggctg gccacgacgg gcgttccttg cgcagctgtg ctcgacgttg 6780
tcactgaagc gggaagggac tggctgctat tgggcgaagt gccggggcag gatctcctgt 6840
catctcacct tgctcctgcc gagaaagtat ccatcatggc tgatgcaatg cggcggctgc 6900
atacgcttga tccggctacc tgcccattcg accaccaagc gaaacatcgc atcgagcgag 6960
cacgtactcg gatggaagcc ggtcttgtcg atcaggatga tctggacgaa gagcatcagg 7020
ggctcgcgcc agccgaactg ttcgccaggc tcaaggcgcg catgcccgac ggcgaggatc 7080
tcgtcgtgac ccatggcgat gcctgcttgc cgaatatcat ggtggaaaat ggccgctttt 7140
ctggattcat cgactgtggc cggctgggtg tggcggaccg ctatcaggac atagcgttgg 7200
ctacccgtga tattgctgaa gagcttggcg gcgaatgggc tgaccgcttc ctcgtgcttt 7260
acggtatcgc cgctcccgat tcgcagcgca tcgccttcta tcgccttctt gacgagttct 7320
tctgagcggg actctggggt tcgaaatgac cgaccaagcg acgcccaacc tgccatcacg 7380
agatttcgat tccaccgccg ccttctatga aaggttgggc ttcggaatcg ttttccggga 7440
cgccggctgg atgatcctcc agcgcgggga tctcatgctg gagttcttcg cccaccccaa 7500
cttgtttatt gcagcttata atggttacaa ataaagcaat agcatcacaa atttcacaaa 7560
taaagcattt ttttcactgc attctagttg tggtttgtcc aaactcatca atgtatctta 7620
tcatgtctgt ataccgtcga cctctagcta gagcttggcg taatcatggt catagctgtt 7680
tcctgtgtga aattgttatc cgctcacaat tccacacaac atacgagccg gaagcataaa 7740
gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca ttaattgcgt tgcgctcact 7800
gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat taatgaatcg gccaacgcgc 7860
ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg 7920
ctcggtcgtt cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc 7980
cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag 8040
gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca 8100
tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca 8160
ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg 8220
atacctgtcc gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag 8280
gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt 8340
tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca 8400
cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg 8460
cggtgctaca gagttcttga agtggtggcc taactacggc tacactagaa gaacagtatt 8520
tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc 8580
cggcaaacaa accaccgctg gtagcggttt ttttgtttgc aagcagcaga ttacgcgcag 8640
aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 8700
cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 8760
ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 8820
tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 8880
atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 8940
tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 9000
aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 9060
catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt 9120
gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc 9180
ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa 9240
aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt 9300
atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg 9360
cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 9420
gagttgctct tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa 9480
agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt 9540
gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 9600
caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag 9660
ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 9720
tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 9780
aggggttccg cgcacatttc cccgaaaagt gccacctgac gtc 9823

Claims (10)

1.一种表达重组人源化胶原蛋白的工程细胞构建方法,包括:
失活HEK 293细胞或CHO细胞中的Procollagen N-endopeptidase、Procollagen C-endopeptidase基因;
将PDI和Bip基因导入经失活的HEK 293细胞或CHO细胞中共表达;
将胶原蛋白基因导入经失活的HEK 293细胞或CHO细胞;以及
筛选获得携带有胶原蛋白基因的工程细胞。
2.根据权利要求1所述的方法,其中,所述基因敲除通过基因编辑技术实现。
3.根据权利要求2所述的方法,其中,所述基因敲除通过基因编辑技术实现的步骤包括:针对HEK 293细胞或CHO细胞设计靶向Procollagen N-endopeptidase、Procollagen C-endopeptidase基因的sgRNA。
4.根据权利要求3所述的方法,其中,靶向Procollagen N-endopeptidase基因的sgRNA如SEQ ID NO:1~3所示;靶向Procollagen C-endopeptidase基因的sgRNA如SEQ ID NO:4~6所示。
5.根据权利要求1所述的方法,其中,所述PDI基因的蛋白序列如SEQ ID NO:7或8所示。
6.根据权利要求1所述的方法,其中,所述Bip基因的蛋白序列如SEQ ID NO:11所示。
7.根据权利要求1所述的方法,其中,所述胶原蛋白基因导入HEK 293细胞或CHO细胞的步骤包括:
将载体去除抗生素基因,并导入缺陷型筛选基因,然后将胶原蛋白基因导入载体,获得携带有胶原蛋白基因的载体。
8.根据权利要求1所述的方法,其中,所述胶原蛋白基因编码的蛋白序列如SEQ ID NO:17~20所示。
9.根据权利要求1~8任一项所述的方法,其中,表达的重组人源化胶原蛋白具有天然三螺旋结构。
10.一种表达重组人源化胶原蛋白的工程细胞,其根据如权利要求1~9任一项所述的构建方法获得。
CN202210783483.1A 2022-06-27 2022-06-27 一种表达重组人源化胶原蛋白的工程细胞构建方法 Pending CN117343957A (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210783483.1A CN117343957A (zh) 2022-06-27 2022-06-27 一种表达重组人源化胶原蛋白的工程细胞构建方法

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210783483.1A CN117343957A (zh) 2022-06-27 2022-06-27 一种表达重组人源化胶原蛋白的工程细胞构建方法

Publications (1)

Publication Number Publication Date
CN117343957A true CN117343957A (zh) 2024-01-05

Family

ID=89360117

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210783483.1A Pending CN117343957A (zh) 2022-06-27 2022-06-27 一种表达重组人源化胶原蛋白的工程细胞构建方法

Country Status (1)

Country Link
CN (1) CN117343957A (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117384959A (zh) * 2023-12-05 2024-01-12 南京东万生物技术有限公司 一种生产胶原细胞株的构建及其生产方法

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117384959A (zh) * 2023-12-05 2024-01-12 南京东万生物技术有限公司 一种生产胶原细胞株的构建及其生产方法

Similar Documents

Publication Publication Date Title
CA2353086A1 (en) Pgc-1, a novel brown fat ppar.gamma. coactivator
WO2012002582A1 (ja) REIC/Dkk-3タンパク質の部分領域ポリペプチド
Lee et al. Lipid raft–associated stomatin enhances cell fusion
CN117343957A (zh) 一种表达重组人源化胶原蛋白的工程细胞构建方法
EP1870460B9 (en) Methods of producing proteins having triple-helix structure
US11628206B2 (en) Method for inhibiting STAT3 activity comprising administering Ssu72
CN107312782B (zh) 鸭坦布苏病毒e蛋白截短体基因、重组鸭瘟病毒及构建方法和应用
CN110317806B (zh) 一种组成型剪接报告基因影像探针及其制备方法
CN112680462B (zh) 一种人乳头瘤病毒35型/hpv35型l1/l2及其制备与应用
CN104096239B (zh) hALR基因微环真核表达载体在降低肝脏胶原蛋白合成方面的应用
CN111358959B (zh) Roquin1蛋白及其编码基因在制备抑制肿瘤的药物中的应用
CN110747204A (zh) 一种监测pre-mRNA剪接效率的双报告基因探针及制备方法
WO2012150676A1 (ja) 2型tnf受容体の発現を誘導する合成ペプチド及びその利用
EP2221066A1 (en) Use of VgII3 activity modulator for the modulation of adipogenesis
EP1736546A1 (en) Using nonhuman animal model, method of measuring transcription activity, method of measuring cell quantity and method of measuring tumor volume
AU2006248487A1 (en) Int6 protein involved in hypoxia stress response and use thereof
CN114177274A (zh) Tdrd10蛋白及其编码基因在制备肿瘤治疗药物中的应用、及肿瘤治疗药物
CN114149496A (zh) 肿瘤血管发生调控蛋白及其应用
CN115920056A (zh) Roquin2蛋白及其编码基因在抗血管发生治疗肿瘤中的应用
KR100917705B1 (ko) 카스파제-7 절단에서 유래하는 세포사멸과 관련된폴리뉴클레오티드 서열, 세포사멸의 억제, 유도 인자 및이의 스크리닝 방법
WO2004015103A1 (ja) Akt2結合蛋白質

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination