CN115873126A - 人生长激素融合蛋白及其制备和用途 - Google Patents

人生长激素融合蛋白及其制备和用途 Download PDF

Info

Publication number
CN115873126A
CN115873126A CN202210395418.1A CN202210395418A CN115873126A CN 115873126 A CN115873126 A CN 115873126A CN 202210395418 A CN202210395418 A CN 202210395418A CN 115873126 A CN115873126 A CN 115873126A
Authority
CN
China
Prior art keywords
ser
leu
glu
val
pro
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210395418.1A
Other languages
English (en)
Inventor
秦锁富
鄢成伟
潘志福
吴凤梅
周冬梅
李娟�
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Kexing Pharmaceutical Co ltd
Original Assignee
Shenzhen Kexing Pharmaceutical Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Kexing Pharmaceutical Co ltd filed Critical Shenzhen Kexing Pharmaceutical Co ltd
Priority to PCT/CN2022/100113 priority Critical patent/WO2023093020A1/en
Publication of CN115873126A publication Critical patent/CN115873126A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/575Hormones
    • C07K14/61Growth hormones [GH] (Somatotropin)
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K47/00Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
    • A61K47/50Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates
    • A61K47/51Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent
    • A61K47/68Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an antibody, an immunoglobulin or a fragment thereof, e.g. an Fc-fragment
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K47/00Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
    • A61K47/50Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates
    • A61K47/51Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent
    • A61K47/68Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an antibody, an immunoglobulin or a fragment thereof, e.g. an Fc-fragment
    • A61K47/6801Drug-antibody or immunoglobulin conjugates defined by the pharmacologically or therapeutically active agent
    • A61K47/6803Drugs conjugated to an antibody or immunoglobulin, e.g. cisplatin-antibody conjugates
    • A61K47/6811Drugs conjugated to an antibody or immunoglobulin, e.g. cisplatin-antibody conjugates the drug being a protein or peptide, e.g. transferrin or bleomycin
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P5/00Drugs for disorders of the endocrine system
    • A61P5/06Drugs for disorders of the endocrine system of the anterior pituitary hormones, e.g. TSH, ACTH, FSH, LH, PRL, GH
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/62DNA sequences coding for fusion proteins
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2317/00Immunoglobulins specific features
    • C07K2317/50Immunoglobulins specific features characterized by immunoglobulin fragments
    • C07K2317/52Constant or Fc region; Isotype
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/30Non-immunoglobulin-derived peptide or protein having an immunoglobulin constant or Fc region, or a fragment thereof, attached thereto

Abstract

本发明涉及一种Fc突变体、融合蛋白、核酸分子、表达载体、重组细胞、药物组合物及其用途。Fc突变体具有如SEQ ID NO:2~5任一项所示的氨基酸序列;融合蛋白包含:第二肽段,第二肽段为生物活性分子功能区;第三肽段,第三肽段包括前述的Fc突变体,第二肽段与第三肽段相连。本发明的Fc突变体有利于消除抗体Fc区与FcR结合引起的ADCC效应,且可使Fc突变体比原始IgG4分子更稳定;融合蛋白具有更高的体内外活性。

Description

人生长激素融合蛋白及其制备和用途
技术领域
本发明涉及分子生物学及药物领域,具体地,本发明涉及一种人生长激素融合蛋白及其制备和用途,更具体地,本发明涉及一种Fc突变体、融合蛋白、核酸分子、表达载体、重组细胞、药物组合物及其用途。
背景技术
生长激素缺乏症是一种公认的临床综合征,与许多代谢异常相关,包括身体成分异常、体能下降、脂质代谢改变、骨量减少、胰岛素抵抗增加和生活质量降低。大多数与生长激素缺乏症相关的代谢异常可通过重组人生长激素(rhGH)替代物逆转。传统的生长激素缺乏症治疗包括每天皮下注射rhGH,但是,该治疗方法较为繁琐,需每天进行注射,对许多患者来说并不方便,这引起了对治疗依从性差的担忧,这可能导致疗效降低。长效rhGH制剂或长效生长激素制剂不仅可以减少注射次数,从而提高依从性,并有助于提高GH治疗的疗效。
因此,仍亟需开发一种新的有效治疗生长激素缺乏症的融合蛋白。
发明内容
本发明旨在至少在一定程度上解决现有技术中存在的技术问题至少之一。为此,本发明提供了一种Fc突变体、融合蛋白、核酸分子、表达载体、重组细胞、药物组合物及其用途,本发明的融合蛋白稳定性强,可有效治疗生长激素缺乏症。
本发明是基于发明人的下列发现而完成的:
发明人通过实验发现,对野生型IgG4的Fc片段的特定位点进行突变,例如S228P、F234A/F234V、L235E、R409K和K447 Delete,有利于消除抗体Fc区与FcR结合引起的ADCC效应、提高IgG4的结构更稳定、减少表达产物的异质性以及提高表达量。发明人经过实验进一步发现,采用上述位点突变后得到的得到的Fc突变体和生长激素得到的融合蛋白稳定性高,且具有更长的半衰期和良好的体内生物学活性。
为此,在本发明的一个方面,本发明提出了一种Fc突变体。根据本发明的实施例,包括第一肽段,所述第一肽段相较于野生型IgG4的Fc片段,具有如下位点的突变:第228位、第235位、第409位、第447位和第234位。发明人经过大量实验发现,上述突变有利于消除抗体Fc区与FcR结合引起的ADCC效应和减少表达产物的异质性,且有利于抑制Fab-armexchange的发生,使Fc突变体比原始IgG4分子更稳定;此外,采用该Fc突变体制得的融合蛋白具有更高的体内外活性。
需要说明的是,本文中所指的抗体IgG的Fc片段是指抗体IgG的CH2区和CH3区。
例如,野生型IgG4的Fc片段的氨基酸序列如下所示:
APEFLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSRLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLGK(SEQID NO:51)。
野生型IgG4的铰链区(Hinge)的氨基酸序列如下所示:
ESKYGPPCPSCP(SEQ ID NO:52)。
在本发明的第二方面,本发明提出了一种融合蛋白。根据本发明的实施例,所述融合蛋白包含:第二肽段,所述第二肽段包括生物活性分子功能区;第三肽段,所述第三肽段包括Fc突变体,所述Fc突变体具有如第一方面所限定的Fc突变体,所述第二肽段与所述第三肽段相连。根据本发明实施例的融合蛋白具有更高体内外活性,且后续可通过ProteinA纯化,简化了纯化工艺。
在本发明的第三方面,本发明提出了一种融合蛋白。根据本发明的实施例,所述融合蛋白的结构通式表示为X-L-Y,其中X是第一种生物活性分子;L是不存在或为连接肽;Y是第二种生物活性分子;-为肽键;且所述活性分子X或Y选自蛋白或蛋白结构域、多肽、抗体或抗体片段。
需要说明的是,所述“X-L-Y”是指X的C端与L的N端相连、L的C端与Y的N端相连。
根据本发明的实施例,第二方面或第三方面所述的融合蛋白还可以进一步包含如下附加技术特征的至少之一:
根据本发明的实施例,所述人生长激素的氨基酸序列包括SEQ ID NO:1或者与其具有至少80%-99%同一性的氨基酸序列或者SEQ ID NO:1的至少一部分。
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQI FKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF(SEQ ID NO:1)。
根据本发明的实施例,所述人生长激素的核苷酸序列包括SEQ ID NO:33-38任一项或者与其具有至少80%-99%同一性的核苷酸序列或者SEQ ID NO:33-38任一项的至少一部分或其密码子优化序列。
TTCCCCACCATTCCTCTGAGCCGGCTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCACCAGCTGGCCTTCGACACCTACCAAGAGTTCGAGGAAGCCTACATTCCCAAAGAGCAGAAGTACAGCTTCCTGCAGAACCCTCAGACCAGCCTGTGCTTCAGCGAGAGCATCCCCACACCTAGCAACAGAGAGGAAACCCAGCAGAAGTCCAACCTGGAACTGCTGCGGATCAGCCTGCTGCTGATCCAGTCTTGGCTGGAACCCGTGCAGTTCCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCCAGCGACAGCAACGTTTACGACCTGCTGAAGGACCTGGAAGAGGGCATCCAGACACTGATGGGCAGACTGGAAGATGGCAGCCCTAGAACCGGCCAGATCTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAAAACTACGGCCTGCTGTACTGCTTTCGGAAGGACATGGACAAGGTGGAAACCTTCCTGCGGATCGTGCAGTGCAGAAGCGTGGAAGGCTCTTGCGGATTT(SEQ ID NO:33)。
TTCCCCACCATCCCCCTGAGCAGACTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCATCAGCTGGCCTTCGACACCTACCAAGAGTTCGAGGAGGCCTACATCCCCAAGGAGCAGAAGTACAGCTTCCTGCAGAACCCTCAGACAAGCCTGTGCTTCAGCGAGAGCATCCCCACCCCTAGCAACAGAGAGGAGACACAGCAGAAGAGCAACCTGGAGCTGCTGAGAATCAGCCTGCTCCTGATTCAGAGCTGGCTGGAGCCCGTGCAGTTCCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCCTCCGACAGCAACGTGTACGACCTGCTGAAGGACCTGGAGGAGGGCATTCAGACCCTGATGGGCAGACTGGAGGACGGCAGCCCTAGAACCGGGCAGATCTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAGAACTACGGCCTGCTGTACTGCTTCAGAAAGGACATGGACAAGGTGGAGACCTTCCTGAGAATCGTGCAGTGCAGATCCGTGGAAGGGTCCTGCGGGTTC(SEQ ID NO:34)。
TTCCCCACCATCCCCCTGAGCAGACTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCATCAGCTGGCCTTCGACACCTACCAAGAGTTCGAGGAGGCCTACATCCCCAAGGAGCAGAAGTACAGCTTCCTGCAGAACCCTCAGACAAGCCTGTGCTTCAGCGAGAGCATCCCCACCCCTAGCAACAGAGAGGAGACACAGCAGAAGAGCAACCTGGAGCTGCTGAGAATCAGCCTGCTCCTGATTCAGAGCTGGCTGGAGCCCGTGCAGTTCCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCCTCCGACAGCAACGTGTACGACCTGCTGAAGGACCTGGAGGAGGGCATTCAGACCCTGATGGGCAGACTGGAGGACGGCAGCCCTAGAACCGGGCAGATCTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAGAACTACGGCCTGCTGTACTGCTTCAGAAAGGACATGGACAAGGTGGAGACCTTCCTGAGAATTGTGCAGTGCCGGTCCGTGGAAGGCAGCTGTGGCTTT(SEQ ID NO:35)。
TTCCCCACCATCCCCCTGAGCAGACTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCATCAGCTGGCCTTCGACACCTACCAAGAGTTCGAGGAGGCCTACATCCCCAAGGAGCAGAAGTACAGCTTCCTGCAGAACCCTCAGACAAGCCTGTGCTTCAGCGAGAGCATCCCCACCCCTAGCAACAGAGAGGAGACACAGCAGAAGAGCAACCTGGAGCTGCTGAGAATCAGCCTGCTCCTGATTCAGAGCTGGCTGGAGCCCGTGCAGTTCCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCCTCCGACAGCAACGTGTACGACCTGCTGAAGGACCTGGAGGAGGGCATTCAGACCCTGATGGGCAGACTGGAGGACGGCAGCCCTAGAACCGGGCAGATCTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAGAACTACGGCCTGCTGTACTGCTTCAGAAAGGACATGGACAAGGTGGAGACCTTCCTGAGAATCGTGCAATGTAGAAGCGTCGAAGGGAGCTGCGGCTTC(SEQ ID NO:36)。
TTCCCCACCATCCCCCTGTCTAGACTGTTTGATAACGCCATGCTGCGGGCCCACAGACTCCATCAGCTGGCCTTCGACACCTACCAGGAGTTCGAGGAAGCCTACATCCCTAAGGAACAAAAATACTCCTTCCTGCAGAACCCCCAAACAAGCCTGTGCTTCAGCGAGTCCATCCCTACCCCTTCTAACAGAGAGGAAACACAGCAGAAATCTAATCTGGAACTGCTGAGGATCAGCCTGCTGCTGATCCAGAGCTGGCTGGAACCTGTGCAATTTCTGCGGAGCGTGTTTGCCAACAGCCTGGTGTATGGAGCCAGCGACAGCAATGTGTACGACCTGCTGAAAGATCTGGAAGAGGGAATCCAGACCCTGATGGGCAGACTGGAAGATGGCAGCCCTAGAACCGGCCAGATCTTCAAGCAGACCTACAGCAAGTTCGATACAAACAGCCACAACGACGACGCCCTGCTGAAGAACTACGGCCTGCTGTACTGTTTTAGAAAGGACATGGACAAGGTGGAGACATTCCTCCGGATCGTGCAGTGCAGATCTGTGGAGGGCAGCTGCGGCTTC(SEQ ID NO:37)。
TTCCCCACGATCCCTCTGTCTAGACTGTTCGACAACGCCATGCTGCGGGCCCACAGACTGCACCAGCTGGCCTTCGACACCTACCAGGAGTTCGAGGAGGCCTACATCCCCAAAGAGCAAAAATACAGCTTCCTGCAGAACCCCCAGACATCTTTGTGCTTTAGCGAGAGCATCCCTACCCCTAGCAATAGAGAAGAGACACAGCAGAAGAGCAACCTGGAACTGCTGCGGATCAGCCTGCTGCTGATCCAGAGCTGGCTGGAACCCGTGCAATTTCTGCGCAGCGTCTTCGCCAACAGCCTGGTGTACGGCGCCTCTGATAGCAACGTGTACGACCTGCTGAAAGATCTGGAAGAGGGCATCCAGACCCTGATGGGAAGACTGGAGGACGGCTCTCCAAGAACAGGCCAAATCTTCAAGCAGACCTACAGCAAATTCGATACAAACAGCCACAACGACGACGCCCTGCTGAAGAACTACGGCCTGCTGTACTGCTTCAGAAAGGACATGGACAAGGTGGAAACCTTCCTGCGGATCGTGCAGTGCAGATCTGTGGAAGGCAGCTGTGGCTTC(SEQ ID NO:38)。
根据本发明的实施例,所述Y为Fc片段变体(与“Fc突变体”同义)。
根据本发明的实施例,所述Fc片段变体为IgG4亚型Fc突变体。
根据本发明的实施例,所述Fc片段变体的氨基酸序列包括选自SEQ ID NO:2、SEQID NO:3、SEQ ID NO:4、SEQ ID NO:5中的至少之一或者与其中任意一序列具有至少80%-99%同一性的氨基酸序列或者其中任意一序列的至少一部分。
ESKYGPPCPPCPAPEAEGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLG(SEQ ID NO:2)。
ESKYGPPSPPCPAPEAEGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLG(SEQ ID NO:3)。
ESKYGPPCPPCPAPPVEGGPSVFLFPPKPKDTLMISRTPEVTCVVVAVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLG(SEQ ID NO:4)。
ESKYGPPSPPCPAPPVEGGPSVFLFPPKPKDTLMISRTPEVTCVVVAVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLG(SEQ ID NO:5)。
在一些实施例中,所述Fc片段变体相比上述野生型IgG4的氨基酸序列,存在如下选自以下组合突变中的至少一种:
(1)F234A/F234V和L235E位点突变,有利于消除抗体Fc区与FcR结合引起的ADCC效应;
(2)S228P和R409K位点突变,有利于抑制Fab-arm exchange的发生,使IgG4的结构更稳定;
(3)K477 Delete是指删除K477位点,有利于减少表达产物的异质性;
(4)C226位点是IgG4铰链区的第一个二硫键位点,该位点突变为S后,有利于使突变体比原始IgG4分子更稳定。
相比野生型IgG4亚型的Fc片段,SEQ ID NO:2的第一肽段存在突变:S228P、F234A、L235E、R409K和K447 Delete。
相比野生型IgG4亚型的Fc片段,SEQ ID NO:3的第一肽段存在突变:C226S、S228P、F234A、L235E、R409K和K447 Delete。
相比野生型IgG4亚型的Fc片段,SEQ ID NO:4的第一肽段存在突变:S228P、E233P、F234V、L235E、D265A、R409K和K447 Delete。
相比野生型IgG4亚型的Fc片段,SEQ ID NO:5的第一肽段存在突变:C226S、S228P、E233P、F234V、L235E、D265A、R409K和K447 Delete。
根据本发明的实施例,所述连接肽包括选自(GGGGS)n多肽或者与其中任意一序列具有至少80%-99%同一性的氨基酸序列或者其中任意一序列的至少一部分,其中n为大于或等于1的整数,或者n为1、2、3、4、5、6、7、8、9或10。
根据本发明的实施例,所述连接肽包括选自SEQ ID NO:6、SEQ ID NO:7、SEQ IDNO:8中的至少之一或者与其中任意一序列具有至少80%-99%同一性的氨基酸序列或者其中任意一序列的至少一部分。
GGGGSGGGGSGGGGS(SEQ ID NO:6)。
GGGGSGGGGSGGGGSGGGGS(SEQ ID NO:7)。
GGGGSGGGGSGGGGSGGGGSGGGGS(SEQ ID NO:8)。
根据本发明的实施例,所述融合蛋白的氨基酸序列包括选自:SEQ ID NO:21、SEQID NO:22、SEQ ID NO:23、SEQ ID NO:24、SEQ ID NO:25、SEQ ID NO:26、SEQ ID NO:27、SEQID NO:28、SEQ ID NO:29、SEQ ID NO:30、SEQ ID NO:31、SEQ ID NO:32中的至少之一或者与其中任意一序列具有至少80%-99%同一性的氨基酸序列或者其中任意一序列的至少一部分。
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFGGGGSGGGGSGGGGSESKYGPPCPPCPAPEAEGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLG(SEQ ID NO:21)。
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFGGGGSGGGGSGGGGSESKYGPPSPPCPAPEAEGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLG(SEQ ID NO:22)。
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFGGGGSGGGGSGGGGSESKYGPPCPPCPAPPVEGGPSVFLFPPKPKDTLMISRTPEVTCVVVAVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLG(SEQ ID NO:23)。
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFGGGGSGGGGSGGGGSESKYGPPSPPCPAPPVEGGPSVFLFPPKPKDTLMISRTPEVTCVVVAVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLG(SEQ ID NO:24)。
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFGGGGSGGGGSGGGGSGGGGSESKYGPPCPPCPAPEAEGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLG(SEQ IDNO:25)。
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFGGGGSGGGGSGGGGSGGGGSESKYGPPSPPCPAPEAEGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLG(SEQ IDNO:26)。
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFGGGGSGGGGSGGGGSGGGGSESKYGPPCPPCPAPPVEGGPSVFLFPPKPKDTLMISRTPEVTCVVVAVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLG(SEQ IDNO:27)。
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFGGGGSGGGGSGGGGSGGGGSESKYGPPSPPCPAPPVEGGPSVFLFPPKPKDTLMISRTPEVTCVVVAVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLG(SEQ IDNO:28)。
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFGGGGSGGGGSGGGGSGGGGSGGGGSESKYGPPCPPCPAPEAEGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLG(SEQID NO:29)。
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFGGGGSGGGGSGGGGSGGGGSGGGGSESKYGPPSPPCPAPEAEGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLG(SEQID NO:30)。
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFGGGGSGGGGSGGGGSGGGGSGGGGSESKYGPPCPPCPAPPVEGGPSVFLFPPKPKDTLMISRTPEVTCVVVAVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLG(SEQID NO:31)。
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFGGGGSGGGGSGGGGSGGGGSGGGGSESKYGPPSPPCPAPPVEGGPSVFLFPPKPKDTLMISRTPEVTCVVVAVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLG(SEQID NO:32)。
根据本发明的实施例,SEQ ID NO:21-32所示的氨基酸序列中的第一生物分子、第二生物分子和连接肽如下表所示:
Figure BDA0003597163400000061
在本发明的第四方面,本发明提出了一种核酸分子。根据本发明的实施例,所述核酸分子编码第一方面所述的Fc突变体或第二方面或第三方面所述的融合蛋白。
根据本发明的实施例,所述核苷酸序列包括选自SEQ ID NO:39、SEQ ID NO:40、SEQ ID NO:41、SEQ ID NO:42、SEQ ID NO:43、SEQ ID NO:44、SEQ ID NO:45、SEQ ID NO:46、SEQ ID NO:47、SEQ ID NO:48、SEQ ID NO:49、SEQ ID NO:50中的至少之一或者与其中任意一序列具有至少80%-99%同一性的核苷酸序列或者其中任意一序列的至少一部分或其中任一序列的密码子优化序列。
GAGTCTAAGTACGGCCCTCCTTGTCCTCCATGTCCAGCTCCAGAAGCTGAAGGCGGCCCTTCCGTGTTCCTGTTTCCTCCAAAGCCTAAGGACACCCTGATGATCAGCAGAACCCCTGAAGTGACCTGCGTGGTGGTGGACGTGTCCCAAGAGGATCCTGAGGTGCAGTTCAATTGGTACGTGGACGGCGTGGAAGTGCACAACGCCAAGACCAAGCCTAGAGAGGAACAGTTCAACAGCACCTATAGAGTGGTGTCCGTGCTGACCGTGCTGCACCAGGATTGGCTGAACGGCAAAGAGTACAAGTGCAAGGTGTCCAACAAGGGCCTGCCTAGCAGCATCGAGAAAACCATCAGCAAGGCCAAGGGCCAGCCAAGGGAACCCCAGGTTTACACACTGCCTCCAAGCCAAGAGGAAATGACCAAGAACCAGGTGTCCCTGACCTGCCTGGTCAAGGGCTTCTACCCTTCCGATATCGCCGTGGAATGGGAGAGCAATGGCCAGCCTGAGAACAACTACAAGACCACACCTCCTGTGCTGGACAGCGACGGCTCATTCTTCCTGTACAGCAAGCTGACAGTGGACAAGAGCCGGTGGCAAGAGGGCAACGTGTTCAGCTGTAGCGTGATGCACGAGGCCCTGCACAACCACTACACCCAGAAGTCTCTGAGCCTGAGCCTGGGA(SEQ ID NO:39)。
GAGTCTAAGTACGGACCTCCTTGTCCTCCATGTCCAGCTCCAGAAGCTGAAGGCGGCCCTTCCGTGTTCCTGTTTCCTCCAAAGCCTAAGGACACCCTGATGATCAGCAGAACCCCTGAAGTGACCTGCGTGGTGGTGGACGTGTCCCAAGAGGATCCTGAGGTGCAGTTCAATTGGTACGTGGACGGCGTGGAAGTGCACAACGCCAAGACCAAGCCTAGAGAGGAACAGTTCAACAGCACCTATAGAGTGGTGTCCGTGCTGACCGTGCTGCACCAGGATTGGCTGAACGGCAAAGAGTACAAGTGCAAGGTGTCCAACAAGGGCCTGCCTAGCAGCATCGAGAAAACCATCAGCAAGGCCAAGGGCCAGCCAAGGGAACCCCAGGTTTACACACTGCCTCCAAGCCAAGAGGAAATGACCAAGAACCAGGTGTCCCTGACCTGCCTGGTCAAGGGCTTCTACCCTTCCGATATCGCCGTGGAATGGGAGAGCAATGGCCAGCCTGAGAACAACTACAAGACCACACCTCCTGTGCTGGACAGCGACGGCTCATTCTTCCTGTACAGCAAGCTGACAGTGGACAAGAGCCGGTGGCAAGAGGGCAACGTGTTCAGCTGTAGCGTGATGCACGAGGCCCTGCACAACCACTACACCCAGAAGTCTCTGAGCCTGAGCCTGGGA(SEQ ID NO:40)。
GAATCCAAATACGGCCCCCCCTGTCCCCCTTGCCCTGCCCCCGAGGCTGAAGGCGGGCCTAGCGTGTTCCTGTTCCCCCCCAAGCCCAAGGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTCGTGGACGTGAGCCAAGAGGACCCCGAGGTGCAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAACGCCAAGACCAAGCCTAGAGAGGAGCAGTTCAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGACTGGCTGAACGGCAAGGAGTACAAGTGCAAGGTGAGCAACAAGGGCCTGCCTAGCAGCATCGAGAAGACCATCAGCAAGGCCAAGGGGCAGCCTAGAGAGCCCCAAGTGTACACCCTGCCCCCTAGCCAAGAGGAGATGACCAAGAACCAAGTGAGCCTGACCTGCCTGGTGAAGGGCTTCTACCCTAGCGACATCGCCGTGGAGTGGGAGAGCAACGGGCAGCCCGAGAACAACTACAAGACCACCCCCCCCGTGCTGGACAGCGACGGCAGCTTCTTCCTGTACAGCAAGCTGACCGTGGACAAGAGCAGATGGCAAGAGGGCAACGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCACAACCACTACACACAGAAGAGCCTGAGCCTGAGCCTGGGC(SEQ ID NO:41)。
GAGTCTAAGTACGGCCCTCCTTCTCCACCATGTCCTGCTCCAGAAGCTGAAGGCGGCCCTTCCGTGTTCCTGTTTCCTCCAAAGCCTAAGGACACCCTGATGATCAGCAGAACCCCTGAAGTGACCTGCGTGGTGGTGGACGTGTCCCAAGAGGATCCTGAGGTGCAGTTCAATTGGTACGTGGACGGCGTGGAAGTGCACAACGCCAAGACCAAGCCTAGAGAGGAACAGTTCAACAGCACCTATAGAGTGGTGTCCGTGCTGACCGTGCTGCACCAGGATTGGCTGAACGGCAAAGAGTACAAGTGCAAGGTGTCCAACAAGGGCCTGCCTAGCAGCATCGAGAAAACCATCAGCAAGGCCAAGGGCCAGCCAAGGGAACCCCAGGTTTACACACTGCCTCCAAGCCAAGAGGAAATGACCAAGAACCAGGTGTCCCTGACCTGCCTGGTCAAGGGCTTCTACCCTTCCGATATCGCCGTGGAATGGGAGAGCAATGGCCAGCCTGAGAACAACTACAAGACCACACCTCCTGTGCTGGACAGCGACGGCTCATTCTTCCTGTACAGCAAGCTGACAGTGGACAAGAGCCGGTGGCAAGAGGGCAACGTGTTCAGCTGTAGCGTGATGCACGAGGCCCTGCACAACCACTACACCCAGAAGTCTCTGAGCCTGAGCCTGGGA(SEQ ID NO:42)。
GAGTCTAAGTACGGACCTCCTTCTCCACCATGTCCTGCTCCAGAAGCTGAAGGCGGCCCTTCCGTGTTCCTGTTTCCTCCAAAGCCTAAGGACACCCTGATGATCAGCAGAACCCCTGAAGTGACCTGCGTGGTGGTGGACGTGTCCCAAGAGGATCCTGAGGTGCAGTTCAATTGGTACGTGGACGGCGTGGAAGTGCACAACGCCAAGACCAAGCCTAGAGAGGAACAGTTCAACAGCACCTATAGAGTGGTGTCCGTGCTGACCGTGCTGCACCAGGATTGGCTGAACGGCAAAGAGTACAAGTGCAAGGTGTCCAACAAGGGCCTGCCTAGCAGCATCGAGAAAACCATCAGCAAGGCCAAGGGCCAGCCAAGGGAACCCCAGGTTTACACACTGCCTCCAAGCCAAGAGGAAATGACCAAGAACCAGGTGTCCCTGACCTGCCTGGTCAAGGGCTTCTACCCTTCCGATATCGCCGTGGAATGGGAGAGCAATGGCCAGCCTGAGAACAACTACAAGACCACACCTCCTGTGCTGGACAGCGACGGCTCATTCTTCCTGTACAGCAAGCTGACAGTGGACAAGAGCCGGTGGCAAGAGGGCAACGTGTTCAGCTGTAGCGTGATGCACGAGGCCCTGCACAACCACTACACCCAGAAGTCTCTGAGCCTGAGCCTGGGA(SEQ ID NO:43)。
GAGTCTAAGTACGGCCCTCCTTCTCCACCATGTCCTGCTCCAGAAGCTGAAGGCGGCCCTTCCGTGTTCCTGTTTCCTCCAAAGCCTAAGGACACCCTGATGATCAGCAGAACCCCTGAAGTGACCTGCGTGGTGGTGGACGTGTCCCAAGAGGATCCTGAGGTGCAGTTCAATTGGTACGTGGACGGCGTGGAAGTGCACAACGCCAAGACCAAGCCTAGAGAGGAACAGTTCAACAGCACCTATAGAGTGGTGTCCGTGCTGACCGTGCTGCACCAGGATTGGCTGAACGGCAAAGAGTACAAGTGCAAGGTGTCCAACAAGGGCCTGCCTAGCAGCATCGAGAAAACCATCAGCAAGGCCAAGGGCCAGCCAAGGGAACCCCAGGTTTACACACTGCCTCCAAGCCAAGAGGAAATGACCAAGAACCAGGTGTCCCTGACCTGCCTGGTCAAGGGCTTCTACCCTTCCGATATCGCCGTGGAATGGGAGAGCAATGGCCAGCCTGAGAACAACTACAAGACCACACCTCCTGTGCTGGACAGCGACGGCTCATTCTTCCTGTACAGCAAGCTGACAGTGGACAAGAGCCGGTGGCAAGAGGGCAACGTGTTCAGCTGTAGCGTGATGCACGAGGCCCTGCACAACCACTACACCCAGAAGTCTCTGAGCCTGAGCCTGGGA(SEQ ID NO:44)。
GAGTCCAAGTATGGCCCCCCCTGCCCTCCTTGTCCTGCTCCCCCCGTGGAGGGGGGCCCTAGCGTGTTCCTGTTCCCTCCCAAGCCCAAGGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTCGTGGCCGTGAGCCAAGAGGACCCCGAGGTGCAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAACGCCAAGACCAAGCCTAGAGAGGAGCAGTTCAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGACTGGCTGAACGGCAAGGAGTACAAGTGCAAGGTGAGCAACAAGGGCCTGCCTAGCAGCATCGAGAAGACCATCAGCAAGGCCAAGGGGCAGCCTAGAGAGCCCCAAGTGTACACCCTGCCCCCTAGCCAAGAGGAGATGACCAAGAACCAAGTGAGCCTGACCTGCCTGGTGAAGGGCTTCTACCCTAGCGACATCGCCGTGGAGTGGGAGAGCAACGGGCAGCCCGAGAACAACTACAAGACCACCCCCCCCGTGCTGGACAGCGACGGCAGCTTCTTCCTGTACAGCAAGCTGACCGTGGACAAGAGCAGATGGCAAGAGGGCAACGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCACAACCACTACACACAGAAGAGCCTGAGCCTGAGCCTGGGC(SEQ ID NO:45)。
GAGAGCAAGTACGGGCCCCCCTGCCCTCCTTGTCCTGCTCCCCCCGTGGAGGGGGGCCCTAGCGTGTTCCTGTTCCCCCCCAAGCCCAAGGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTCGTGGCCGTGAGCCAAGAGGACCCCGAGGTGCAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAACGCCAAGACCAAGCCTAGAGAGGAGCAGTTCAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGACTGGCTGAACGGCAAGGAGTACAAGTGCAAGGTGAGCAACAAGGGCCTGCCTAGCAGCATCGAGAAGACCATCAGCAAGGCCAAGGGGCAGCCTAGAGAGCCCCAAGTGTACACCCTGCCCCCTAGCCAAGAGGAGATGACCAAGAACCAAGTGAGCCTGACCTGCCTGGTGAAGGGCTTCTACCCTAGCGACATCGCCGTGGAGTGGGAGAGCAACGGGCAGCCCGAGAACAACTACAAGACCACCCCCCCCGTGCTGGACAGCGACGGCAGCTTCTTCCTGTACAGCAAGCTGACCGTGGACAAGAGCAGATGGCAAGAGGGCAACGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCACAACCACTACACACAGAAGAGCCTGAGCCTGAGCCTGGGC(SEQ ID NO:46)。
GAGAGCAAGTACGGCCCCCCTTGTCCTCCTTGCCCCGCCCCTCCCGTGGAAGGCGGACCTAGTGTGTTCCTCTTCCCTCCAAAACCTAAGGATACCCTGATGATCAGCCGGACACCTGAGGTTACATGCGTGGTCGTGGCTGTGAGCCAGGAAGATCCTGAGGTGCAGTTCAACTGGTACGTGGACGGCGTGGAAGTGCATAATGCTAAGACCAAGCCTCGGGAAGAGCAGTTTAACTCCACCTATAGAGTGGTGTCCGTTCTGACCGTGCTGCACCAGGACTGGCTGAACGGCAAGGAATACAAGTGCAAGGTGTCCAACAAGGGCCTGCCCAGTTCTATTGAGAAGACAATTAGCAAGGCCAAGGGCCAGCCTAGAGAGCCTCAGGTGTACACCCTGCCTCCCAGCCAGGAGGAAATGACCAAGAACCAGGTGTCTCTCACTTGTCTGGTGAAGGGCTTCTACCCCAGCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAACCTGAGAACAACTACAAGACCACACCTCCAGTGCTGGACAGCGATGGCTCTTTTTTCCTGTACTCCAAGCTGACAGTGGACAAGTCCAGGTGGCAGGAGGGAAATGTGTTCAGCTGCAGCGTGATGCACGAGGCTCTGCACAATCACTATACCCAGAAAAGCCTCTCTCTGAGCCTGGGC(SEQ ID NO:47)。
GAGTCCAAGTATGGCCCCCCTAGCCCTCCTTGTCCTGCTCCCCCCGTGGAGGGGGGCCCTAGCGTGTTCCTGTTCCCTCCCAAGCCCAAGGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTCGTGGCCGTGAGCCAAGAGGACCCCGAGGTGCAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAACGCCAAGACCAAGCCTAGAGAGGAGCAGTTCAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGACTGGCTGAACGGCAAGGAGTACAAGTGCAAGGTGAGCAACAAGGGCCTGCCTAGCAGCATCGAGAAGACCATCAGCAAGGCCAAGGGGCAGCCTAGAGAGCCCCAAGTGTACACCCTGCCCCCTAGCCAAGAGGAGATGACCAAGAACCAAGTGAGCCTGACCTGCCTGGTGAAGGGCTTCTACCCTAGCGACATCGCCGTGGAGTGGGAGAGCAACGGGCAGCCCGAGAACAACTACAAGACCACCCCCCCCGTGCTGGACAGCGACGGCAGCTTCTTCCTGTACAGCAAGCTGACCGTGGACAAGAGCAGATGGCAAGAGGGCAACGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCACAACCACTACACACAGAAGAGCCTGAGCCTGAGCCTGGGC(SEQ ID NO:48)。
GAGAGCAAGTACGGGCCCCCTAGCCCTCCTTGTCCTGCTCCCCCCGTGGAGGGGGGCCCTAGCGTGTTCCTGTTCCCCCCCAAGCCCAAGGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTCGTGGCCGTGAGCCAAGAGGACCCCGAGGTGCAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAACGCCAAGACCAAGCCTAGAGAGGAGCAGTTCAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGACTGGCTGAACGGCAAGGAGTACAAGTGCAAGGTGAGCAACAAGGGCCTGCCTAGCAGCATCGAGAAGACCATCAGCAAGGCCAAGGGGCAGCCTAGAGAGCCCCAAGTGTACACCCTGCCCCCTAGCCAAGAGGAGATGACCAAGAACCAAGTGAGCCTGACCTGCCTGGTGAAGGGCTTCTACCCTAGCGACATCGCCGTGGAGTGGGAGAGCAACGGGCAGCCCGAGAACAACTACAAGACCACCCCCCCCGTGCTGGACAGCGACGGCAGCTTCTTCCTGTACAGCAAGCTGACCGTGGACAAGAGCAGATGGCAAGAGGGCAACGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCACAACCACTACACACAGAAGAGCCTGAGCCTGAGCCTGGGC(SEQ ID NO:49)。
GAGTCCAAGTACGGCCCCCCCAGCCCTCCATGTCCTGCCCCTCCTGTGGAAGGCGGACCTTCTGTTTTCCTGTTCCCACCAAAACCTAAGGACACCCTGATGATCAGCAGAACACCTGAGGTGACCTGCGTGGTCGTGGCCGTAAGCCAGGAGGACCCCGAAGTGCAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAACGCTAAGACCAAGCCTCGGGAAGAGCAGTTTAACAGCACCTACAGAGTGGTGTCTGTGCTGACCGTGCTGCACCAGGACTGGCTGAACGGCAAAGAGTACAAGTGCAAGGTGTCCAACAAGGGACTGCCCAGCAGCATCGAGAAGACGATTAGCAAGGCCAAGGGCCAACCTAGAGAGCCTCAGGTGTACACCCTGCCTCCTTCTCAGGAGGAGATGACCAAAAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTCTATCCCTCTGACATCGCCGTCGAGTGGGAAAGCAATGGCCAGCCTGAAAACAACTACAAGACAACACCCCCAGTGCTTGATAGCGACGGCAGCTTCTTCCTGTACTCCAAGCTGACAGTGGATAAGAGCCGGTGGCAGGAAGGAAACGTGTTCTCATGCAGCGTGATGCACGAGGCTCTGCACAATCACTACACCCAGAAGAGCCTGTCACTGAGCCTGGGC(SEQ ID NO:50)。
所述SEQ ID NO:39、SEQ ID NO:40或SEQ ID NO:41用于编码SEQ ID NO:2。
所述SEQ ID NO:42、SEQ ID NO:43或SEQ ID NO:44用于编码SEQ ID NO:3。
所述SEQ ID NO:45、SEQ ID NO:46或SEQ ID NO:47用于编码SEQ ID NO:4。
所述SEQ ID NO:48、SEQ ID NO:49或SEQ ID NO:50用于编码SEQ ID NO:5。
根据本发明的实施例,所述核苷酸序列包括选自SEQ ID NO:9、SEQ ID NO:10、SEQID NO:11、SEQ ID NO:12、SEQ ID NO:13、SEQ ID NO:14、SEQ ID NO:15、SEQ ID NO:16、SEQID NO:17、SEQ ID NO:18、SEQ ID NO:19、SEQ ID NO:20或者与其中任意一序列具有至少80%-99%同一性的核苷酸序列或者其中任意一序列的至少一部分或其中任一序列的密码子优化序列。
TTCCCCACCATTCCTCTGAGCCGGCTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCACCAGCTGGCCTTCGACACCTACCAAGAGTTCGAGGAAGCCTACATTCCCAAAGAGCAGAAGTACAGCTTCCTGCAGAACCCTCAGACCAGCCTGTGCTTCAGCGAGAGCATCCCCACACCTAGCAACAGAGAGGAAACCCAGCAGAAGTCCAACCTGGAACTGCTGCGGATCAGCCTGCTGCTGATCCAGTCTTGGCTGGAACCCGTGCAGTTCCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCCAGCGACAGCAACGTTTACGACCTGCTGAAGGACCTGGAAGAGGGCATCCAGACACTGATGGGCAGACTGGAAGATGGCAGCCCTAGAACCGGCCAGATCTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAAAACTACGGCCTGCTGTACTGCTTTCGGAAGGACATGGACAAGGTGGAAACCTTCCTGCGGATCGTGCAGTGCAGAAGCGTGGAAGGCTCTTGCGGATTTGGCGGCGGAGGATCTGGCGGAGGTGGAAGCGGAGGCGGTGGATCTGAGTCTAAGTACGGCCCTCCTTGTCCTCCATGTCCAGCTCCAGAAGCTGAAGGCGGCCCTTCCGTGTTCCTGTTTCCTCCAAAGCCTAAGGACACCCTGATGATCAGCAGAACCCCTGAAGTGACCTGCGTGGTGGTGGACGTGTCCCAAGAGGATCCTGAGGTGCAGTTCAATTGGTACGTGGACGGCGTGGAAGTGCACAACGCCAAGACCAAGCCTAGAGAGGAACAGTTCAACAGCACCTATAGAGTGGTGTCCGTGCTGACCGTGCTGCACCAGGATTGGCTGAACGGCAAAGAGTACAAGTGCAAGGTGTCCAACAAGGGCCTGCCTAGCAGCATCGAGAAAACCATCAGCAAGGCCAAGGGCCAGCCAAGGGAACCCCAGGTTTACACACTGCCTCCAAGCCAAGAGGAAATGACCAAGAACCAGGTGTCCCTGACCTGCCTGGTCAAGGGCTTCTACCCTTCCGATATCGCCGTGGAATGGGAGAGCAATGGCCAGCCTGAGAACAACTACAAGACCACACCTCCTGTGCTGGACAGCGACGGCTCATTCTTCCTGTACAGCAAGCTGACAGTGGACAAGAGCCGGTGGCAAGAGGGCAACGTGTTCAGCTGTAGCGTGATGCACGAGGCCCTGCACAACCACTACACCCAGAAGTCTCTGAGCCTGAGCCTGGGA(SEQ ID NO:9)。
TTCCCCACCATTCCTCTGAGCCGGCTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCACCAGCTGGCCTTCGACACCTACCAAGAGTTCGAGGAAGCCTACATTCCCAAAGAGCAGAAGTACAGCTTCCTGCAGAACCCTCAGACCAGCCTGTGCTTCAGCGAGAGCATCCCCACACCTAGCAACAGAGAGGAAACCCAGCAGAAGTCCAACCTGGAACTGCTGCGGATCAGCCTGCTGCTGATCCAGTCTTGGCTGGAACCCGTGCAGTTCCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCCAGCGACAGCAACGTTTACGACCTGCTGAAGGACCTGGAAGAGGGCATCCAGACACTGATGGGCAGACTGGAAGATGGCAGCCCTAGAACCGGCCAGATCTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAAAACTACGGCCTGCTGTACTGCTTTCGGAAGGACATGGACAAGGTGGAAACCTTCCTGCGGATCGTGCAGTGCAGAAGCGTGGAAGGCTCTTGCGGATTTGGCGGCGGAGGATCTGGCGGAGGTGGAAGCGGAGGCGGTGGATCTGAGTCTAAGTACGGCCCTCCTTCTCCACCATGTCCTGCTCCAGAAGCTGAAGGCGGCCCTTCCGTGTTCCTGTTTCCTCCAAAGCCTAAGGACACCCTGATGATCAGCAGAACCCCTGAAGTGACCTGCGTGGTGGTGGACGTGTCCCAAGAGGATCCTGAGGTGCAGTTCAATTGGTACGTGGACGGCGTGGAAGTGCACAACGCCAAGACCAAGCCTAGAGAGGAACAGTTCAACAGCACCTATAGAGTGGTGTCCGTGCTGACCGTGCTGCACCAGGATTGGCTGAACGGCAAAGAGTACAAGTGCAAGGTGTCCAACAAGGGCCTGCCTAGCAGCATCGAGAAAACCATCAGCAAGGCCAAGGGCCAGCCAAGGGAACCCCAGGTTTACACACTGCCTCCAAGCCAAGAGGAAATGACCAAGAACCAGGTGTCCCTGACCTGCCTGGTCAAGGGCTTCTACCCTTCCGATATCGCCGTGGAATGGGAGAGCAATGGCCAGCCTGAGAACAACTACAAGACCACACCTCCTGTGCTGGACAGCGACGGCTCATTCTTCCTGTACAGCAAGCTGACAGTGGACAAGAGCCGGTGGCAAGAGGGCAACGTGTTCAGCTGTAGCGTGATGCACGAGGCCCTGCACAACCACTACACCCAGAAGTCTCTGAGCCTGAGCCTGGGA(SEQ ID NO:10)。
TTCCCCACCATCCCCCTGAGCAGACTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCATCAGCTGGCCTTCGACACCTACCAAGAGTTCGAGGAGGCCTACATCCCCAAGGAGCAGAAGTACAGCTTCCTGCAGAACCCTCAGACAAGCCTGTGCTTCAGCGAGAGCATCCCCACCCCTAGCAACAGAGAGGAGACACAGCAGAAGAGCAACCTGGAGCTGCTGAGAATCAGCCTGCTCCTGATTCAGAGCTGGCTGGAGCCCGTGCAGTTCCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCCTCCGACAGCAACGTGTACGACCTGCTGAAGGACCTGGAGGAGGGCATTCAGACCCTGATGGGCAGACTGGAGGACGGCAGCCCTAGAACCGGGCAGATCTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAGAACTACGGCCTGCTGTACTGCTTCAGAAAGGACATGGACAAGGTGGAGACCTTCCTGAGAATCGTGCAGTGCAGATCCGTGGAAGGGTCCTGCGGGTTCGGCGGGGGGGGCAGCGGCGGGGGCGGCAGCGGGGGCGGGGGCAGCGAGTCCAAGTATGGCCCCCCCTGCCCTCCTTGTCCTGCTCCCCCCGTGGAGGGGGGCCCTAGCGTGTTCCTGTTCCCTCCCAAGCCCAAGGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTCGTGGCCGTGAGCCAAGAGGACCCCGAGGTGCAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAACGCCAAGACCAAGCCTAGAGAGGAGCAGTTCAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGACTGGCTGAACGGCAAGGAGTACAAGTGCAAGGTGAGCAACAAGGGCCTGCCTAGCAGCATCGAGAAGACCATCAGCAAGGCCAAGGGGCAGCCTAGAGAGCCCCAAGTGTACACCCTGCCCCCTAGCCAAGAGGAGATGACCAAGAACCAAGTGAGCCTGACCTGCCTGGTGAAGGGCTTCTACCCTAGCGACATCGCCGTGGAGTGGGAGAGCAACGGGCAGCCCGAGAACAACTACAAGACCACCCCCCCCGTGCTGGACAGCGACGGCAGCTTCTTCCTGTACAGCAAGCTGACCGTGGACAAGAGCAGATGGCAAGAGGGCAACGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCACAACCACTACACACAGAAGAGCCTGAGCCTGAGCCTGGGC(SEQ ID NO:11)。
TTCCCCACCATCCCCCTGAGCAGACTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCATCAGCTGGCCTTCGACACCTACCAAGAGTTCGAGGAGGCCTACATCCCCAAGGAGCAGAAGTACAGCTTCCTGCAGAACCCTCAGACAAGCCTGTGCTTCAGCGAGAGCATCCCCACCCCTAGCAACAGAGAGGAGACACAGCAGAAGAGCAACCTGGAGCTGCTGAGAATCAGCCTGCTCCTGATTCAGAGCTGGCTGGAGCCCGTGCAGTTCCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCCTCCGACAGCAACGTGTACGACCTGCTGAAGGACCTGGAGGAGGGCATTCAGACCCTGATGGGCAGACTGGAGGACGGCAGCCCTAGAACCGGGCAGATCTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAGAACTACGGCCTGCTGTACTGCTTCAGAAAGGACATGGACAAGGTGGAGACCTTCCTGAGAATCGTGCAGTGCAGATCCGTGGAAGGGTCCTGCGGGTTCGGCGGGGGGGGCAGCGGCGGGGGCGGCAGCGGGGGCGGGGGCAGCGAGTCCAAGTATGGCCCCCCTAGCCCTCCTTGTCCTGCTCCCCCCGTGGAGGGGGGCCCTAGCGTGTTCCTGTTCCCTCCCAAGCCCAAGGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTCGTGGCCGTGAGCCAAGAGGACCCCGAGGTGCAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAACGCCAAGACCAAGCCTAGAGAGGAGCAGTTCAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGACTGGCTGAACGGCAAGGAGTACAAGTGCAAGGTGAGCAACAAGGGCCTGCCTAGCAGCATCGAGAAGACCATCAGCAAGGCCAAGGGGCAGCCTAGAGAGCCCCAAGTGTACACCCTGCCCCCTAGCCAAGAGGAGATGACCAAGAACCAAGTGAGCCTGACCTGCCTGGTGAAGGGCTTCTACCCTAGCGACATCGCCGTGGAGTGGGAGAGCAACGGGCAGCCCGAGAACAACTACAAGACCACCCCCCCCGTGCTGGACAGCGACGGCAGCTTCTTCCTGTACAGCAAGCTGACCGTGGACAAGAGCAGATGGCAAGAGGGCAACGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCACAACCACTACACACAGAAGAGCCTGAGCCTGAGCCTGGGC(SEQ ID NO:12)。
TTCCCCACCATTCCTCTGAGCCGGCTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCACCAGCTGGCCTTCGACACCTACCAAGAGTTCGAGGAAGCCTACATTCCCAAAGAGCAGAAGTACAGCTTCCTGCAGAACCCTCAGACCAGCCTGTGCTTCAGCGAGAGCATCCCCACACCTAGCAACAGAGAGGAAACCCAGCAGAAGTCCAACCTGGAACTGCTGCGGATCAGCCTGCTGCTGATCCAGTCTTGGCTGGAACCCGTGCAGTTCCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCCAGCGACAGCAACGTTTACGACCTGCTGAAGGACCTGGAAGAGGGCATCCAGACACTGATGGGCAGACTGGAAGATGGCAGCCCTAGAACCGGCCAGATCTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAAAACTACGGCCTGCTGTACTGCTTTCGGAAGGACATGGACAAGGTGGAAACCTTCCTGCGGATCGTGCAGTGCAGAAGCGTGGAAGGCTCTTGCGGATTTGGCGGCGGAGGATCTGGCGGAGGTGGAAGCGGAGGCGGAGGAAGCGGTGGCGGCGGATCTGAGTCTAAGTACGGACCTCCTTGTCCTCCATGTCCAGCTCCAGAAGCTGAAGGCGGCCCTTCCGTGTTCCTGTTTCCTCCAAAGCCTAAGGACACCCTGATGATCAGCAGAACCCCTGAAGTGACCTGCGTGGTGGTGGACGTGTCCCAAGAGGATCCTGAGGTGCAGTTCAATTGGTACGTGGACGGCGTGGAAGTGCACAACGCCAAGACCAAGCCTAGAGAGGAACAGTTCAACAGCACCTATAGAGTGGTGTCCGTGCTGACCGTGCTGCACCAGGATTGGCTGAACGGCAAAGAGTACAAGTGCAAGGTGTCCAACAAGGGCCTGCCTAGCAGCATCGAGAAAACCATCAGCAAGGCCAAGGGCCAGCCAAGGGAACCCCAGGTTTACACACTGCCTCCAAGCCAAGAGGAAATGACCAAGAACCAGGTGTCCCTGACCTGCCTGGTCAAGGGCTTCTACCCTTCCGATATCGCCGTGGAATGGGAGAGCAATGGCCAGCCTGAGAACAACTACAAGACCACACCTCCTGTGCTGGACAGCGACGGCTCATTCTTCCTGTACAGCAAGCTGACAGTGGACAAGAGCCGGTGGCAAGAGGGCAACGTGTTCAGCTGTAGCGTGATGCACGAGGCCCTGCACAACCACTACACCCAGAAGTCTCTGAGCCTGAGCCTGGGA(SEQ ID NO:13)。
TTCCCCACCATTCCTCTGAGCCGGCTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCACCAGCTGGCCTTCGACACCTACCAAGAGTTCGAGGAAGCCTACATTCCCAAAGAGCAGAAGTACAGCTTCCTGCAGAACCCTCAGACCAGCCTGTGCTTCAGCGAGAGCATCCCCACACCTAGCAACAGAGAGGAAACCCAGCAGAAGTCCAACCTGGAACTGCTGCGGATCAGCCTGCTGCTGATCCAGTCTTGGCTGGAACCCGTGCAGTTCCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCCAGCGACAGCAACGTTTACGACCTGCTGAAGGACCTGGAAGAGGGCATCCAGACACTGATGGGCAGACTGGAAGATGGCAGCCCTAGAACCGGCCAGATCTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAAAACTACGGCCTGCTGTACTGCTTTCGGAAGGACATGGACAAGGTGGAAACCTTCCTGCGGATCGTGCAGTGCAGAAGCGTGGAAGGCTCTTGCGGATTTGGCGGCGGAGGATCTGGCGGAGGTGGAAGCGGAGGCGGAGGAAGCGGTGGCGGCGGATCTGAGTCTAAGTACGGACCTCCTTCTCCACCATGTCCTGCTCCAGAAGCTGAAGGCGGCCCTTCCGTGTTCCTGTTTCCTCCAAAGCCTAAGGACACCCTGATGATCAGCAGAACCCCTGAAGTGACCTGCGTGGTGGTGGACGTGTCCCAAGAGGATCCTGAGGTGCAGTTCAATTGGTACGTGGACGGCGTGGAAGTGCACAACGCCAAGACCAAGCCTAGAGAGGAACAGTTCAACAGCACCTATAGAGTGGTGTCCGTGCTGACCGTGCTGCACCAGGATTGGCTGAACGGCAAAGAGTACAAGTGCAAGGTGTCCAACAAGGGCCTGCCTAGCAGCATCGAGAAAACCATCAGCAAGGCCAAGGGCCAGCCAAGGGAACCCCAGGTTTACACACTGCCTCCAAGCCAAGAGGAAATGACCAAGAACCAGGTGTCCCTGACCTGCCTGGTCAAGGGCTTCTACCCTTCCGATATCGCCGTGGAATGGGAGAGCAATGGCCAGCCTGAGAACAACTACAAGACCACACCTCCTGTGCTGGACAGCGACGGCTCATTCTTCCTGTACAGCAAGCTGACAGTGGACAAGAGCCGGTGGCAAGAGGGCAACGTGTTCAGCTGTAGCGTGATGCACGAGGCCCTGCACAACCACTACACCCAGAAGTCTCTGAGCCTGAGCCTGGGA(SEQ ID NO:14)。
TTCCCCACCATCCCCCTGAGCAGACTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCATCAGCTGGCCTTCGACACCTACCAAGAGTTCGAGGAGGCCTACATCCCCAAGGAGCAGAAGTACAGCTTCCTGCAGAACCCTCAGACAAGCCTGTGCTTCAGCGAGAGCATCCCCACCCCTAGCAACAGAGAGGAGACACAGCAGAAGAGCAACCTGGAGCTGCTGAGAATCAGCCTGCTCCTGATTCAGAGCTGGCTGGAGCCCGTGCAGTTCCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCCTCCGACAGCAACGTGTACGACCTGCTGAAGGACCTGGAGGAGGGCATTCAGACCCTGATGGGCAGACTGGAGGACGGCAGCCCTAGAACCGGGCAGATCTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAGAACTACGGCCTGCTGTACTGCTTCAGAAAGGACATGGACAAGGTGGAGACCTTCCTGAGAATTGTGCAGTGCCGGTCCGTGGAAGGCAGCTGTGGCTTTGGGGGGGGGGGGAGCGGGGGGGGCGGGAGCGGCGGGGGCGGGAGCGGCGGGGGCGGCAGCGAGAGCAAGTACGGGCCCCCCTGCCCTCCTTGTCCTGCTCCCCCCGTGGAGGGGGGCCCTAGCGTGTTCCTGTTCCCCCCCAAGCCCAAGGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTCGTGGCCGTGAGCCAAGAGGACCCCGAGGTGCAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAACGCCAAGACCAAGCCTAGAGAGGAGCAGTTCAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGACTGGCTGAACGGCAAGGAGTACAAGTGCAAGGTGAGCAACAAGGGCCTGCCTAGCAGCATCGAGAAGACCATCAGCAAGGCCAAGGGGCAGCCTAGAGAGCCCCAAGTGTACACCCTGCCCCCTAGCCAAGAGGAGATGACCAAGAACCAAGTGAGCCTGACCTGCCTGGTGAAGGGCTTCTACCCTAGCGACATCGCCGTGGAGTGGGAGAGCAACGGGCAGCCCGAGAACAACTACAAGACCACCCCCCCCGTGCTGGACAGCGACGGCAGCTTCTTCCTGTACAGCAAGCTGACCGTGGACAAGAGCAGATGGCAAGAGGGCAACGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCACAACCACTACACACAGAAGAGCCTGAGCCTGAGCCTGGGC(SEQ ID NO:15)。
TTCCCCACCATCCCCCTGAGCAGACTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCATCAGCTGGCCTTCGACACCTACCAAGAGTTCGAGGAGGCCTACATCCCCAAGGAGCAGAAGTACAGCTTCCTGCAGAACCCTCAGACAAGCCTGTGCTTCAGCGAGAGCATCCCCACCCCTAGCAACAGAGAGGAGACACAGCAGAAGAGCAACCTGGAGCTGCTGAGAATCAGCCTGCTCCTGATTCAGAGCTGGCTGGAGCCCGTGCAGTTCCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCCTCCGACAGCAACGTGTACGACCTGCTGAAGGACCTGGAGGAGGGCATTCAGACCCTGATGGGCAGACTGGAGGACGGCAGCCCTAGAACCGGGCAGATCTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAGAACTACGGCCTGCTGTACTGCTTCAGAAAGGACATGGACAAGGTGGAGACCTTCCTGAGAATTGTGCAGTGCCGGTCCGTGGAAGGCAGCTGTGGCTTTGGGGGGGGGGGGAGCGGGGGGGGCGGGAGCGGCGGGGGCGGGAGCGGCGGGGGCGGCAGCGAGAGCAAGTACGGGCCCCCTAGCCCTCCTTGTCCTGCTCCCCCCGTGGAGGGGGGCCCTAGCGTGTTCCTGTTCCCCCCCAAGCCCAAGGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTCGTGGCCGTGAGCCAAGAGGACCCCGAGGTGCAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAACGCCAAGACCAAGCCTAGAGAGGAGCAGTTCAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGACTGGCTGAACGGCAAGGAGTACAAGTGCAAGGTGAGCAACAAGGGCCTGCCTAGCAGCATCGAGAAGACCATCAGCAAGGCCAAGGGGCAGCCTAGAGAGCCCCAAGTGTACACCCTGCCCCCTAGCCAAGAGGAGATGACCAAGAACCAAGTGAGCCTGACCTGCCTGGTGAAGGGCTTCTACCCTAGCGACATCGCCGTGGAGTGGGAGAGCAACGGGCAGCCCGAGAACAACTACAAGACCACCCCCCCCGTGCTGGACAGCGACGGCAGCTTCTTCCTGTACAGCAAGCTGACCGTGGACAAGAGCAGATGGCAAGAGGGCAACGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCACAACCACTACACACAGAAGAGCCTGAGCCTGAGCCTGGGC(SEQ ID NO:16)。
TTCCCCACCATCCCCCTGAGCAGACTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCATCAGCTGGCCTTCGACACCTACCAAGAGTTCGAGGAGGCCTACATCCCCAAGGAGCAGAAGTACAGCTTCCTGCAGAACCCTCAGACAAGCCTGTGCTTCAGCGAGAGCATCCCCACCCCTAGCAACAGAGAGGAGACACAGCAGAAGAGCAACCTGGAGCTGCTGAGAATCAGCCTGCTCCTGATTCAGAGCTGGCTGGAGCCCGTGCAGTTCCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCCTCCGACAGCAACGTGTACGACCTGCTGAAGGACCTGGAGGAGGGCATTCAGACCCTGATGGGCAGACTGGAGGACGGCAGCCCTAGAACCGGGCAGATCTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAGAACTACGGCCTGCTGTACTGCTTCAGAAAGGACATGGACAAGGTGGAGACCTTCCTGAGAATCGTGCAATGTAGAAGCGTCGAAGGGAGCTGCGGCTTCGGCGGGGGCGGCAGCGGCGGGGGCGGCAGCGGGGGCGGGGGCAGCGGCGGCGGGGGGTCCGGCGGCGGGGGCAGCGAATCCAAATACGGCCCCCCCTGTCCCCCTTGCCCTGCCCCCGAGGCTGAAGGCGGGCCTAGCGTGTTCCTGTTCCCCCCCAAGCCCAAGGACACCCTGATGATCAGCAGAACCCCCGAGGTGACCTGCGTGGTCGTGGACGTGAGCCAAGAGGACCCCGAGGTGCAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAACGCCAAGACCAAGCCTAGAGAGGAGCAGTTCAACAGCACCTACAGAGTGGTGAGCGTGCTGACCGTGCTGCACCAAGACTGGCTGAACGGCAAGGAGTACAAGTGCAAGGTGAGCAACAAGGGCCTGCCTAGCAGCATCGAGAAGACCATCAGCAAGGCCAAGGGGCAGCCTAGAGAGCCCCAAGTGTACACCCTGCCCCCTAGCCAAGAGGAGATGACCAAGAACCAAGTGAGCCTGACCTGCCTGGTGAAGGGCTTCTACCCTAGCGACATCGCCGTGGAGTGGGAGAGCAACGGGCAGCCCGAGAACAACTACAAGACCACCCCCCCCGTGCTGGACAGCGACGGCAGCTTCTTCCTGTACAGCAAGCTGACCGTGGACAAGAGCAGATGGCAAGAGGGCAACGTGTTCAGCTGCAGCGTGATGCACGAGGCCCTGCACAACCACTACACACAGAAGAGCCTGAGCCTGAGCCTGGGC(SEQ ID NO:17)。
TTCCCCACCATTCCTCTGAGCCGGCTGTTCGACAACGCCATGCTGAGAGCCCACAGACTGCACCAGCTGGCCTTCGACACCTACCAAGAGTTCGAGGAAGCCTACATTCCCAAAGAGCAGAAGTACAGCTTCCTGCAGAACCCTCAGACCAGCCTGTGCTTCAGCGAGAGCATCCCCACACCTAGCAACAGAGAGGAAACCCAGCAGAAGTCCAACCTGGAACTGCTGCGGATCAGCCTGCTGCTGATCCAGTCTTGGCTGGAACCCGTGCAGTTCCTGAGAAGCGTGTTCGCCAACAGCCTGGTGTACGGCGCCAGCGACAGCAACGTTTACGACCTGCTGAAGGACCTGGAAGAGGGCATCCAGACACTGATGGGCAGACTGGAAGATGGCAGCCCTAGAACCGGCCAGATCTTCAAGCAGACCTACAGCAAGTTCGACACCAACAGCCACAACGACGACGCCCTGCTGAAAAACTACGGCCTGCTGTACTGCTTTCGGAAGGACATGGACAAGGTGGAAACCTTCCTGCGGATCGTGCAGTGCAGAAGCGTGGAAGGCTCTTGCGGATTTGGCGGCGGAGGATCTGGCGGAGGTGGAAGCGGAGGCGGAGGAAGCGGTGGCGGCGGTAGTGGCGGTGGTGGATCTGAGTCTAAGTACGGCCCTCCTTCTCCACCATGTCCTGCTCCAGAAGCTGAAGGCGGCCCTTCCGTGTTCCTGTTTCCTCCAAAGCCTAAGGACACCCTGATGATCAGCAGAACCCCTGAAGTGACCTGCGTGGTGGTGGACGTGTCCCAAGAGGATCCTGAGGTGCAGTTCAATTGGTACGTGGACGGCGTGGAAGTGCACAACGCCAAGACCAAGCCTAGAGAGGAACAGTTCAACAGCACCTATAGAGTGGTGTCCGTGCTGACCGTGCTGCACCAGGATTGGCTGAACGGCAAAGAGTACAAGTGCAAGGTGTCCAACAAGGGCCTGCCTAGCAGCATCGAGAAAACCATCAGCAAGGCCAAGGGCCAGCCAAGGGAACCCCAGGTTTACACACTGCCTCCAAGCCAAGAGGAAATGACCAAGAACCAGGTGTCCCTGACCTGCCTGGTCAAGGGCTTCTACCCTTCCGATATCGCCGTGGAATGGGAGAGCAATGGCCAGCCTGAGAACAACTACAAGACCACACCTCCTGTGCTGGACAGCGACGGCTCATTCTTCCTGTACAGCAAGCTGACAGTGGACAAGAGCCGGTGGCAAGAGGGCAACGTGTTCAGCTGTAGCGTGATGCACGAGGCCCTGCACAACCACTACACCCAGAAGTCTCTGAGCCTGAGCCTGGGA(SEQ ID NO:18)。
TTCCCCACGATCCCTCTGTCTAGACTGTTCGACAACGCCATGCTGCGGGCCCACAGACTGCACCAGCTGGCCTTCGACACCTACCAGGAGTTCGAGGAGGCCTACATCCCCAAAGAGCAAAAATACAGCTTCCTGCAGAACCCCCAGACATCTTTGTGCTTTAGCGAGAGCATCCCTACCCCTAGCAATAGAGAAGAGACACAGCAGAAGAGCAACCTGGAACTGCTGCGGATCAGCCTGCTGCTGATCCAGAGCTGGCTGGAACCCGTGCAATTTCTGCGCAGCGTCTTCGCCAACAGCCTGGTGTACGGCGCCTCTGATAGCAACGTGTACGACCTGCTGAAAGATCTGGAAGAGGGCATCCAGACCCTGATGGGAAGACTGGAGGACGGCTCTCCAAGAACAGGCCAAATCTTCAAGCAGACCTACAGCAAATTCGATACAAACAGCCACAACGACGACGCCCTGCTGAAGAACTACGGCCTGCTGTACTGCTTCAGAAAGGACATGGACAAGGTGGAAACCTTCCTGCGGATCGTGCAGTGCAGATCTGTGGAAGGCAGCTGTGGCTTCGGCGGCGGAGGCAGCGGCGGAGGCGGCTCTGGCGGCGGCGGCAGCGGCGGCGGAGGAAGCGGAGGAGGCGGTTCTGAGAGCAAGTACGGCCCCCCTTGTCCTCCTTGCCCCGCCCCTCCCGTGGAAGGCGGACCTAGTGTGTTCCTCTTCCCTCCAAAACCTAAGGATACCCTGATGATCAGCCGGACACCTGAGGTTACATGCGTGGTCGTGGCTGTGAGCCAGGAAGATCCTGAGGTGCAGTTCAACTGGTACGTGGACGGCGTGGAAGTGCATAATGCTAAGACCAAGCCTCGGGAAGAGCAGTTTAACTCCACCTATAGAGTGGTGTCCGTTCTGACCGTGCTGCACCAGGACTGGCTGAACGGCAAGGAATACAAGTGCAAGGTGTCCAACAAGGGCCTGCCCAGTTCTATTGAGAAGACAATTAGCAAGGCCAAGGGCCAGCCTAGAGAGCCTCAGGTGTACACCCTGCCTCCCAGCCAGGAGGAAATGACCAAGAACCAGGTGTCTCTCACTTGTCTGGTGAAGGGCTTCTACCCCAGCGACATCGCCGTGGAGTGGGAGTCTAATGGCCAACCTGAGAACAACTACAAGACCACACCTCCAGTGCTGGACAGCGATGGCTCTTTTTTCCTGTACTCCAAGCTGACAGTGGACAAGTCCAGGTGGCAGGAGGGAAATGTGTTCAGCTGCAGCGTGATGCACGAGGCTCTGCACAATCACTATACCCAGAAAAGCCTCTCTCTGAGCCTGGGC(SEQ ID NO:19)。
TTCCCCACCATCCCCCTGTCTAGACTGTTTGATAACGCCATGCTGCGGGCCCACAGACTCCATCAGCTGGCCTTCGACACCTACCAGGAGTTCGAGGAAGCCTACATCCCTAAGGAACAAAAATACTCCTTCCTGCAGAACCCCCAAACAAGCCTGTGCTTCAGCGAGTCCATCCCTACCCCTTCTAACAGAGAGGAAACACAGCAGAAATCTAATCTGGAACTGCTGAGGATCAGCCTGCTGCTGATCCAGAGCTGGCTGGAACCTGTGCAATTTCTGCGGAGCGTGTTTGCCAACAGCCTGGTGTATGGAGCCAGCGACAGCAATGTGTACGACCTGCTGAAAGATCTGGAAGAGGGAATCCAGACCCTGATGGGCAGACTGGAAGATGGCAGCCCTAGAACCGGCCAGATCTTCAAGCAGACCTACAGCAAGTTCGATACAAACAGCCACAACGACGACGCCCTGCTGAAGAACTACGGCCTGCTGTACTGTTTTAGAAAGGACATGGACAAGGTGGAGACATTCCTCCGGATCGTGCAGTGCAGATCTGTGGAGGGCAGCTGCGGCTTCGGCGGCGGCGGCAGCGGCGGTGGCGGTAGCGGCGGAGGCGGGTCCGGAGGCGGCGGCAGCGGCGGAGGCGGATCTGAGTCCAAGTACGGCCCCCCCAGCCCTCCATGTCCTGCCCCTCCTGTGGAAGGCGGACCTTCTGTTTTCCTGTTCCCACCAAAACCTAAGGACACCCTGATGATCAGCAGAACACCTGAGGTGACCTGCGTGGTCGTGGCCGTAAGCCAGGAGGACCCCGAAGTGCAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCACAACGCTAAGACCAAGCCTCGGGAAGAGCAGTTTAACAGCACCTACAGAGTGGTGTCTGTGCTGACCGTGCTGCACCAGGACTGGCTGAACGGCAAAGAGTACAAGTGCAAGGTGTCCAACAAGGGACTGCCCAGCAGCATCGAGAAGACGATTAGCAAGGCCAAGGGCCAACCTAGAGAGCCTCAGGTGTACACCCTGCCTCCTTCTCAGGAGGAGATGACCAAAAACCAGGTGTCCCTGACCTGTCTGGTGAAGGGCTTCTATCCCTCTGACATCGCCGTCGAGTGGGAAAGCAATGGCCAGCCTGAAAACAACTACAAGACAACACCCCCAGTGCTTGATAGCGACGGCAGCTTCTTCCTGTACTCCAAGCTGACAGTGGATAAGAGCCGGTGGCAGGAAGGAAACGTGTTCTCATGCAGCGTGATGCACGAGGCTCTGCACAATCACTACACCCAGAAGAGCCTGTCACTGAGCCTGGGC(SEQ ID NO:20)。
所述SEQ ID NO:9用于编码SEQ ID NO:21。
所述SEQ ID NO:10用于编码SEQ ID NO:22。
所述SEQ ID NO:11用于编码SEQ ID NO:23。
所述SEQ ID NO:12用于编码SEQ ID NO:24。
所述SEQ ID NO:13用于编码SEQ ID NO:25。
所述SEQ ID NO:14用于编码SEQ ID NO:26。
所述SEQ ID NO:15用于编码SEQ ID NO:27。
所述SEQ ID NO:16用于编码SEQ ID NO:28。
所述SEQ ID NO:17用于编码SEQ ID NO:29。
所述SEQ ID NO:18用于编码SEQ ID NO:30。
所述SEQ ID NO:19用于编码SEQ ID NO:31。
所述SEQ ID NO:20用于编码SEQ ID NO:32。
在本发明的第五方面,本发明提出了一种表达载体。根据本发明的实施例,携带第四方面所述的核酸分子。根据本发明实施例的表达载体导入合适的受体细胞后,可在调控系统的介导下,有效实现前面所述的Fc突变体或融合蛋白的表达,以便大量获得所述Fc突变体或融合蛋白。
在本发明的第六方面,本发明提出了一种重组细胞。根据本发明的实施例,所述重组细胞包括:携带第四方面所述的核酸分子;或,表达第一方面所述的Fc突变体或第二方面或第三方面所述的融合蛋白。根据本发明实施例的重组细胞可用于前面所述的Fc突变体或融合蛋白的体外表达和大量获得。
在本发明的第七方面,本发明提出了一种药物组合物。根据本发明的实施例,所述药物组合物包括:第二方面或第三方面所述的融合蛋白。根据本发明实施例的药物组合物可以用于预防和治疗融合蛋白的生物活性分子异常相关疾病。
在本发明的第八方面,本发明提出了一种第一方面所述的融合蛋白的制备方法。根据本发明的实施例,所述方法包括:培养第六方面所述的重组细胞,获得含第一方面或第二方面所述的融合蛋白的培养液;从上述培养液中分离出所述融合蛋白。由此,能够有效地制备第二方面和第三方面所述的融合蛋白。
在本发明的第九方面,本发明提出了一种第二方面或第三方面所述的融合蛋白、第四方面所述的核酸分子、第五方面所述的表达载体、第六方面所述的重组细胞、第七方面所述的药物组合物在制备药物中的用途,所述药物用于治疗或者预防生长激素异常相关疾病。
有益效果
相比现有技术,本发明至少具有包括以下有益技术效果中的一种:
(1)本发明Fc突变体,有利于消除抗体Fc区与FcR结合引起的ADCC效应和减少表达产物的异质性,且有利于抑制Fab-arm exchange的发生,使Fc突变体比原始IgG4分子更稳定。
(2)本发明所提供的融合蛋白与人GHR(人生长激素受体)有很强的结合活性。
(3)本发明所提供的融合蛋白对Nb2-11细胞有很好的增殖作用。
(4)本发明所提供的融合蛋白具有优异的促进报告基因细胞表达荧光素酶活性。
(5)相比人生长激素和其他人生长激素融合蛋白,本发明所提供的融合蛋白具有更长的半衰期,只需要一周给药一次。
(6)本发明所提供的融合蛋白可通过ProteinA亲和层析纯化,后续纯化工艺简单,纯化后所得融合蛋白纯度高。
(7)本发明所提供的额融合蛋白具有良好的体内生物学活性。
本发明的附加方面和优点将在下面的描述中部分给出,部分将从下面的描述中变得明显,或通过本发明的实践了解到。
附图说明
本发明的上述和/或附加的方面和优点从结合下面附图对实施例的描述中将变得明显和容易理解,其中:
图1为本发明实施例3的融合蛋白结构的示意图;
图2为本发明实施例3中不同融合蛋白在不同培养基中的表达活性。
具体实施方式
下面详细描述本发明的实施例。下面描述的实施例是示例性的,仅用于解释本发明,而不能理解为对本发明的限制。
需要说明的是,术语“第一”、“第二”仅用于描述目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括一个或者更多个该特征。进一步地,在本发明的描述中,除非另有说明,“多个”的含义是两个或两个以上。
在本文中,术语“包含”或“包括”为开放式表达,即包括本发明所指明的内容,但并不排除其他方面的内容。
在本文中,术语“任选地”、“任选的”或“任选”通常是指随后所述的事件或状况可以但未必发生,并且该描述包括其中发生该事件或状况的情况,以及其中未发生该事件或状况的情况。
在本文中,术语“变体”或“突变体”通常是指包含一个或多个核苷酸或氨基酸突变的任何天然存在的或工程化的分子。
在本文中,术语“室温”表示环境温度,可以为20℃-30℃;在一些实施例中,为22℃-28℃;在一些实施例中,为24℃-26℃;在一些实施例中,为25℃。
在本文中,所述IgG4 Fc片段的氨基酸编号为根据EU编号系统编号,例如,第228位是指按EU编号系统编号第228位;所述“S228P”是指按EU编号系统编号第228位的丝氨酸被脯氨酸替代;“K447Delete”是指按EU编号系统编号第447位的赖氨酸缺失或不存在。
在本文中,术语“(GGGGS)n”表示n个GGGGS相连接,例如“(GGGGS)3”表示GGGGSGGGGSGGGGS。
在本文中,术语“融合蛋白”通常指由两个或更多个蛋白或多肽融合得到的蛋白。编码所述两个或更多个蛋白或多肽的基因或核酸分子可彼此连接而形成融合基因或融合的核酸分子,该融合基因或融合的核酸分子可编码所述融合蛋白。所述融合基因的翻译产生单一多肽,其具有融合前的所述两个或更多个蛋白或多肽中至少一个、甚至每一个的性质。重组融合蛋白通过用于生物学研究或治疗的重组DNA技术人工创造。重组融合蛋白是通过融合基因的遗传工程创造的蛋白质。本发明涉及重组融合蛋白,并且术语融合蛋白和重组融合蛋白在本文以相同含义使用。本文描述的融合蛋白通常包含至少两个结构域(A和C),并且任选地包含第三组分,介于所述两个结构域之间的接头。重组融合蛋白的生成是本领域已知的,并且通常涉及自编码第一蛋白或多肽的cDNA序列去除终止密码子,然后通过连接或重叠延伸PCR以符合读框的方式附接第二蛋白的cDNA序列。该DNA序列然后会由细胞表达成为单一蛋白质。该蛋白质可以经工程化以包括两种原始蛋白质或多肽的完整序列,或仅仅为其中的一部分。
本发明融合蛋白通常由生物合成的方法制备。根据本发明所述的核苷酸序列,本技术领域人员可方便地用各种已知方法制得本发明的编码核酸。这些方法例如但不限于:PCR,DNA人工合成等,具体的方法可参见J.萨姆布鲁克,《分子克隆实验指南》。作为本发明的一种实施方式,可通过分段合成核苷酸序列再进行重叠延伸PCR的方法来构建本发明的编码核酸序列。
在本文中,术语“同一性”或“相似相”均用于描述相对于参考序列的氨基酸序列或核酸序列时,采用通过常规的方法进行确定两个氨基酸序列或核酸序列之间的相同氨基酸或核苷酸的百分比,例如参见,Ausubel等,编著(1995),Current Protocols in MolecularBiology,第19章(Greene Publishing and Wiley-Interscience,New York);和ALIGN程序(Dayhoff(1978),Atlas of Protein Sequence and Structure 5:Suppl.3(NationalBiomedical Research Foundation,Washington,D.C.)。关于比对序列和测定序列同一性有很多算法,包括,Needleman等(1970)J.Mol.Biol.48:443的同源性比对算法;Smith等(1981)Adv.Appl.Math.2:482的局部同源性算法;Pearson等(1988)Proc.Natl.Acad.Sci.85:2444的相似性搜索方法;Smith-Waterman算法(Meth.Mol.Biol.70:173-187(1997);和BLASTP,BLASTN,和BLASTX算法(参见Altschul等(1990)J.Mol.Biol.215:403-410)。利用这些算法的计算机程序也是可获得的,并且包括但不限于:ALIGN或Megalign(DNASTAR)软件,或者WU-BLAST-2(Altschul等,Meth.Enzym.,266:460-480(1996));或者GAP,BESTFIT,BLAST Altschul等,上文,FASTA,和TFASTA,在Genetics Computing Group(GCG)包,8版,Madison,Wisconsin,USA中可获得;和Intelligenetics,Mountain View,California提供的PC/Gene程序中的CLUSTAL。
在本文中,术语“核苷酸”通常是指可为核糖核苷酸、脱氧核苷酸或任一类型核苷酸的修饰形式,及其组合。
在本文中,术语“至少90%同一性”指与各参考序列至少为90%,可为90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、99.5%、99.9%的同一性。
在本文中,术语“至少80%-99%序列同一性”指与各参考序列至少80%-99%、至少81%-99%、至少82%-99%、至少83%-99%、至少84%-99%、至少85%-99%、至少86%-99%、至少87%-99%、至少88%-99%、至少89%-99%、至少90%-99%、至少91%-99%、至少92%-99%、至少93%-99%、至少94%-99%、至少95%-99%、至少96%-99%、至少97%-99%、至少98%-99%或至少99%的序列同一性。
在本文中,术语“表达载体”通常是指能够插入在合适的宿主中自我复制的核酸分子,其将插入的核酸分子转移到宿主细胞中和/或宿主细胞之间。所述表达载体可包括主要用于将DNA或RNA插入细胞中的载体、主要用于复制DNA或RNA的载体,以及主要用于DNA或RNA的转录和/或翻译的表达的载体。所述表达载体还包括具有多种上述功能的载体。所述表达载体可以是当引入合适的宿主细胞时能够转录并翻译成多肽的多核苷酸。通常,通过培养包含所述表达载体的合适的宿主细胞,所述表达载体可以产生期望的表达产物。
在本文中,术语“重组细胞”通常是指采用基因工程技术或细胞融合技术对宿主细胞的遗传物质进行修饰改造或重组,获得具有稳定遗传的独特性状的细胞。其中,术语“宿主细胞”是指可导入重组表达载体的原核细胞或真核细胞。本文所用术语“转化的”或“转染的”是指通过本领域已知的各种技术将核酸(例如载体)引入细胞。合适的宿主细胞可以用本发明的DNA序列转化或转染,并且可以用于靶蛋白的表达和/或分泌。可用于本发明的合适宿主细胞的例子包括永生化杂交瘤细胞,NS/0骨髓瘤细胞,293细胞,中国仓鼠卵巢(CHO)细胞,HeLa细胞,Cap细胞(人羊水来源的细胞)和CoS细胞。
在本文中,术语“药物组合物”通常是指单位剂量形式,并且可以通过制药领域中熟知的方法的任何一种进行制备。所有的方法包括使活性成分与构成一种或多种附属成分的载体相结合的步骤。通常,通过均匀并充分地使活性化合物与液体载体、细碎固体载体或这两者相结合,制备组合物。
在本文中,术语“药学上可接受的辅料”均可包括任何溶剂、固体赋形剂、稀释剂或其他液体赋形剂等等,适合于特有的目标剂型。除了任何常规的辅料与本发明的化合物不相容的范围,例如所产生的任何不良的生物效应或与药学上可接受的组合物的任何其他组分以有害的方式产生的相互作用,它们的用途也是本发明所考虑的范围。
对本文所提及的其他药学上可接受的辅料或工艺可参考关于此主题的大量文献,具体而言参见Handbook of Pharmaceutical Excipients,第3版,Arthur H.Kibbe编辑,American Pharmaceutical Association,Washington,USA和Pharmaceutical Press,London;以及Lexikon der Hilfsstoffe für Pharmazie、Kosmetik和angrenzendeGebiete,H.P.Fiedler编辑,第4版,编辑Cantor、Aulendorf和早期版本。
在本文中,术语“给药”指将预定量的物质通过某种适合的方式引入病人。本发明的融合蛋白可以通过任何常见的途径被给药,只要它可以到达预期的组织。给药的各种方式是可以预期的,包括腹膜、静脉、肌肉、皮下、皮层、口服、局部、鼻腔、肺部、直肠和涂抹,但是本发明不限于这些已举例的给药方式。优选地,本发明的组合物采用注射方式被给药。
在本文中,术语“生长激素异常相关疾病”通常是指由于生长激素异常引起的疾病,例如生长激素缺乏或异化状态引起的相关疾病,包括但不限于,儿童生长激素缺乏症、特发性体型矮小、成人生长激素缺乏、特纳氏综合征(TurnersSyndrome)、普拉德威利综合征(Prader Willi Syndrome)、子宫内生长迟缓、特发性身材矮小、肾衰竭、化疗治疗和AIDS治疗期间的异化状态。生长激素缺乏可包括先天或获得性缺乏。关于先天缺陷,当垂体没有发展成生长激素分泌紊乱时,生长激素缺陷可能发生。获得性生长激素缺乏可能由于由于难于递送而导致缺氧导致脑组织损伤而发生。生长激素缺乏的其它原因包括由用于治疗脑瘤或出生后结核性脑膜炎的辐射引起的垂体损伤。生长激素缺乏表现出诸如生长迟缓和身材矮小的症状,并且先天生长激素缺乏表现出低葡萄糖症状,从新生儿开始。另外,儿童表现出诸如焦虑增加和活力降低的症状。
在本文中,术语“治疗”是指用于指获得期望的药理学和/或生理学效果。所述效果就完全或部分预防疾病或其症状而言可以是预防性的,和/或就部分或完全治愈疾病和/或疾病导致的不良作用而言可以是治疗性的。本文使用的“治疗”涵盖哺乳动物、特别是人的疾病,包括:(a)在容易患病但是尚未确诊得病的个体中预防疾病或病症发生;(b)抑制疾病,例如阻滞疾病发展;或(c)缓解疾病,例如减轻与疾病相关的症状。本文使用的“治疗”涵盖将药物或化合物给予个体以治疗、治愈、缓解、改善、减轻或抑制个体的疾病的任何用药,包括但不限于将含本文所述化合物的药物给予有需要的个体。
本发明提出了Fc突变体、融合蛋白、核酸分子、表达载体、重组细胞、药物组合物及其用途,下面将分别对其进行详细描述。
Fc突变体
在本发明的一个方面,本发明提出了一种Fc突变体。根据本发明的实施例,包括第一肽段,所述第一肽段相较于野生型IgG4的Fc片段,具有如下位点的突变:第228位、第235位、第409位、第447位和第234位。
发明人经过大量实验发现,上述突变有利于消除抗体Fc区与FcR结合引起的ADCC效应和减少表达产物的异质性,且有利于抑制Fab-arm exchange的发生,使Fc突变体比原始IgG4分子更稳定;此外,采用该Fc突变体制得的融合蛋白具有更高体内外活性。
根据本发明的实施例,所述第一肽段相较于野生型IgG4的Fc片段,进一步包括具有如下位点突变的至少之一:第226位、第233位和第265位。
根据本发明的实施例,所述第一肽段相较于野生型IgG4的Fc片段,具有如下位点的突变:1)第228位、第234位、第235位、第409位和第447位;或,2)第226位、第228位、第234位、第235位、第409位和第447位;或,3)第228位、第233位、第234位、第235位、第265位、第409位和第447位;或,4)第226位、第228位、第233位、第234位、第235位、第265位、第409位和第447位。由此,上述突变的组合可消除抗体Fc区与FcR结合引起的ADCC效应、减少表达产物的异质性和提高Fc突变体的稳定性。
根据本发明的实施例,所述第一肽段相较于野生型IgG4的Fc片段,具有如下位点的突变:S228P、L235E、R409K、K447 Delete和F234A或F234V;以及C226S、E233P和D265A中的至少之一。由此,上述突变可消除抗体Fc区与FcR结合引起的ADCC效应、减少表达产物的异质性和提高Fc突变体的稳定性。
根据本发明的实施例,所述第一肽段相较于野生型IgG4的Fc片段,具有如下位点的突变:1)S228P、F234A、L235E、R409K和K447 Delete;或,2)C226S、S228P、L235E、R409K、K447 Delete和F234A或F234V;或,3)S228P、E233P、L235E、D265A、R409K、K447 Delete和F234A或F234V;或,4)C226S、S228P、E233P、L235E、D265A、R409K、K447 Delete和F234A或F234V。由此,上述突变可进一步消除抗体Fc区与FcR结合引起的ADCC效应、减少表达产物的异质性和提高Fc突变体的稳定性。
根据本发明的实施例,所述第一肽段相较于野生型IgG4的Fc片段,具有如下位点的突变:C226S、S228P、E233P、L235E、D265A、R409K、K447 Delete和F234A或F234V。发明人经过实验发现,采用含有上述第一肽段的Fc突变体与生物活性分子(尤其是生长激素)融合得到的融合蛋白,可进一步提高体内生物学活性,例如,Fc突变体与生长激素融合得到的融合蛋白具有对人生长激素受体结合力强、细胞增殖的活性好和体外活性高等优点。
根据本发明的实施例,所述Fc突变体进一步包括野生型IgG4的铰链区片段,所述野生型IgG4的铰链区片段的C端与所述第一肽段的N端相连。
根据本发明的实施例,所述野生型IgG4的铰链区片段具有SEQ ID NO:52所示的氨基酸序列。
根据本发明的实施例,所述Fc突变体具有如SEQ ID NO:2~5任一项所示的氨基酸序列。发明人发现,采用上述Fc突变体与生物活性分子融合得到的融合蛋白具有较高的体内外生物活性。
融合蛋白
在本发明的又一方面,本发明提出了一种一种融合蛋白。根据本发明的实施例,所述融合蛋白包含:第二肽段,所述第二肽段包括生物活性分子功能区;第三肽段,所述第三肽段包括Fc突变体,所述Fc突变体具有如前限定的Fc突变体,所述第二肽段与所述第三肽段相连。根据本发明实施例的融合蛋白具有更高体内外活性,且后续可通过ProteinA纯化,简化了纯化工艺。
根据本发明的实施例,所述第二肽段包含生长激素、生长激素类似物、生长激素功能区或、生长激素类似物功能区,优选为人生长激素或人生长激素功能区。发明人发现本发明的融合蛋白具有对人生长激素受体结合力强、细胞增殖的活性好和体外活性高等优点。
根据本发明的实施例,所述人生长激素具有如SEQ ID NO:1或与其具有至少90%同一性的氨基酸序列。
根据本发明的实施例,所述第二肽段的C端与所述第三肽段的N端相连。由此,采用上述连接方式制得的融合蛋白,具有更高的体内外活性。
根据本发明的实施例,所述融合蛋白进一步包括连接肽,所述连接肽设置于所述第二肽段和第三肽段之间。由此,采用过上述连接肽可融合的第二肽段和第三肽段,提高最终所得到的融合蛋白的生物活性。
根据本发明的实施例,所述连接肽的N端与所述第二肽段的C端相连,所述连接肽的C端与所述第三肽段的N端相连。由此,采用上述连接方式制得的融合蛋白,具有更高体内外活性。
根据本发明的实施例,所述连接肽的氨基酸序列为(GGGGS)n,其中n为大于或等于1的整数。由此,采用上述连接肽将第二肽段和第三肽段融合得到的融合蛋白,具有更高体内外活性。
根据本发明的实施例,所述连接肽的氨基酸序列为(GGGGS)n,其中n为3、4或5,更优选为5。发明人经过大量实验发现,在制备融合蛋白过程中,连接肽的选择十分重要,相对于n为1-4的整数的连接肽,将生物活性分子与n为5的连接肽以及多种Fc突变体融合得到的融合蛋白具有较高的体内生物学活性。示例性地,本发明实施例中融合蛋白HT-10的活性高于HT-2和HT-6,融合蛋白HT-11的活性高于HT-3和HT-7。
根据本发明的实施例,所述连接肽具有如SEQ ID NO:6~8任一项所示的氨基酸序列。由此,采用过上述连接肽可融合的第二肽段和第三肽段,进一步提高最终所得到的融合蛋白的生物活性。
根据本发明的实施例,所述融合蛋白包括第三肽段以及连接肽,所述第三肽段包括Fc突变体,所述Fc突变体的第一肽段与野生型lgG4的Fc片段相比具有C226S、S228P、F234A、L235E、R409K和K447 Delete位点突变,所述连接肽的序列为(GGGGS)5。根据本发明的实施例,上述融合蛋白具有更优的体内外生物学活性。
根据本发明的实施例,所述融合蛋白包括第三肽段以及连接肽,所述第三肽段包括Fc突变体,所述Fc突变体的第一肽段与野生型lgG4的Fc片段相比具有S228P、E233P、F234V、L235E、D265A、R409K和K447 Delete位点突变,所述连接肽的序列为(GGGGS)5。根据本发明的实施例,上述融合蛋白具有更优的体内外生物学活性。
根据本发明的实施例,所述融合蛋白包括第三肽段以及连接肽,所述第三肽段包括Fc突变体,所述Fc突变体的第一肽段与野生型lgG4的Fc片段相比具有C226S、S228P、E233P、F234V、L235E、D265A、R409K和K447 Delete位点突变,所述连接肽的序列为(GGGGS)n,其中n为3、4或5。根据本发明的实施例,上述融合蛋白具有更优的体内外生物学活性。
根据本发明的实施例,所述连接肽的N端与所述第二肽段的C端相连,所述连接肽的C端与所述第三肽段的N端相连。由此,得到的融合蛋白具有优异的体内外生物学活性。
根据本发明的实施例,所述融合蛋白具有SEQ ID NO:21~32任一项所示的氨基酸序列。发明人经过实验发现,上述融合蛋白对人生长激素受体结合力强、细胞增殖的活性好和体外活性高。
本领域技术人员能够理解的是,前面针对Fc突变体所描述的特征和优点,同样适用于该融合蛋白,在此不再赘述。
核酸分子
在本发明的另一方面,本发明提出了一种核酸分子。根据本发明的实施例,所述核酸分子编码前述的Fc突变体或前述的融合蛋白。利用该核酸分子可以有效地用于表达上述Fc突变体或融合蛋白,尤其是在原核生物或者低等真核生物表达体系中可以有效地表达上述融合蛋白,具有对人生长激素受体结合力强、细胞增殖的活性好和体外活性高等优点。
根据本发明的实施例,所述核酸分子为DNA。
本领域技术人员能够理解的是,前面针对Fc突变体和融合蛋白所描述的特征和优点,同样适用于该核酸分子,在此不再赘述。
表达载体
在本发明的另一方面,本发明提出了一种表达载体。根据本发明的实施例,携带前述的核酸分子。利用该表达载体能够在细胞内有效地表达上述Fc突变体或融合蛋白,尤其是在原核生物或者低等真核生物表达体系中可以有效地表达上述融合蛋白,具有对人生长激素受体结合力强、细胞增殖的活性好和体外活性高等优点。
根据本发明的实施例,所述表达载体为真核表达载体。
根据本发明的实施例,所述表达载体为慢病毒载体。
本领域技术人员能够理解的是,前面针对Fc突变体、融合蛋白和核酸分子所描述的特征和优点,同样适用于该表达载体,在此不再赘述。
重组细胞
在本发明的另一方面,本发明提出了一种重组细胞。根据本发明的实施例,所述重组细胞包括:携带前述的核酸分子;或,表达前述的Fc突变体或前述的融合蛋白。利用该重组细胞可有效地表达上述Fc突变体或融合蛋白,尤其是在原核生物或者低等真核生物表达体系中可以有效地表达上述融合蛋白,具有对人生长激素受体结合力强、细胞增殖的活性好和体外活性高等优点。
根据本发明的实施例,所述重组细胞是通过将前述的表达载体引入至宿主细胞中而获得的。
根据本发明的实施例,所述重组细胞为真核细胞。
根据本发明的实施例,所述重组细胞为哺乳动物细胞。
本领域技术人员能够理解的是,前面针对Fc突变体、融合蛋白、核酸分子和表达载体所描述的特征和优点,同样适用于该重组细胞,在此不再赘述。
药物组合物
在本发明的另一方面,本发明提出了一种药物组合物。根据本发明的实施例,所述药物组合物包括:前述的融合蛋白。根据本发明实施例的药物组合物可以用于预防和治疗生长激素异常相关疾病。
根据本发明的实施例,所述药物组合物进一步包括药学上可接受的辅料。
根据本发明的实施例,所述药物组合物的剂型为注射剂。
根据本发明的实施例,所述药物组合物的给药途径包括皮下注射或静脉注射。
本领域技术人员能够理解的是,前面针对Fc突变体、融合蛋白、核酸分子、表达载体和重组细胞所描述的特征和优点,同样适用于该药物组合物,在此不再赘述。
预防和/或治疗生长激素异常相关疾病的方法
在本发明的另一方面,本发明提出了一种预防和/或治疗生长激素异常相关疾病的方法。根据本发明的实施例,所述方法包含:向受试者施用药学上可接受量的前述的融合蛋白或前述的药物组合物。根据本发明的实施例,该方法可有效预防或治疗生长激素异常相关疾病。
根据本发明的实施例,所述方法的给药途径包括皮下注射或静脉注射。
根据本发明的实施例,所述生长激素异常相关疾病至少包括选自下列之一:儿童生长激素缺乏症、特发性体型矮小、成人生长激素缺乏、特纳氏综合征、普拉德威利综合征、肾衰竭、化疗治疗和AIDS治疗期间的异化状态引起的疾病、子宫内生长迟缓。
本领域技术人员能够理解的是,前面针对Fc突变体、融合蛋白、核酸分子、表达载体、重组细胞和药物组合物所描述的特征和优点,同样适用于该预防和/或治疗生长激素异常相关疾病的方法,在此不再赘述。
用途
在本发明的另一方面,本发明提出了一种前述的融合蛋白、前述的核酸分子、前述的表达载体、前述的重组细胞、前述的药物组合物在制备药物中的用途,所述药物用于治疗或者预防生长激素异常相关疾病。
根据本发明的实施例,所述生长激素异常相关疾病至少包括选自下列之一:儿童生长激素缺乏症、特发性体型矮小、成人生长激素缺乏、特纳氏综合征、普拉德威利综合征、肾衰竭、化疗治疗和AIDS治疗期间的异化状态引起的疾病、子宫内生长迟缓。
本领域技术人员能够理解的是,前面针对Fc突变体、融合蛋白、核酸分子、表达载体、重组细胞和药物组合物所描述的特征和优点,同样适用于该用途,在此不再赘述。
下面将结合实施例对本发明的方案进行解释。本领域技术人员将会理解,下面的实施例仅用于说明本发明,而不应视为限定本发明的范围。实施例中未注明具体技术或条件的,按照本领域内的文献所描述的技术或条件或者按照产品说明书进行。所用试剂或仪器未注明生产厂商者,均为可以通过市购获得的常规产品。
实施例1:制备重组长效人生长激素融合蛋白表达载体
本申请中将人生长激素(SEQ ID NO:1)通过连接肽(SEQ ID NO:6-8)与多种突变后的Fc突变体(SEQ ID NO:2-5)进行连接,构建融合蛋白,Fc突变体是将野生型IgG4(含有Hinge和Fc片段)的Fc片段(SEQ ID NO:51)进行突变获得的。具体的实验操作如下:
将编码人生长激素的核苷酸序列(SEQ ID NO:33)、编码连接肽的核苷酸序列(SEQID NO:53-55)以及编码多种Fc突变体的核苷酸序列(SEQ ID NO:39、42、45和48)进行组合,采用基因全合成及分子克隆技术分别将核苷酸序列SEQ ID NO:9、SEQ ID NO:10、SEQ IDNO:11、SEQ ID NO:12、SEQ ID NO:13、SEQ ID NO:14、SEQ ID NO:15、SEQ ID NO:16、SEQ IDNO:7、SEQ ID NO:18、SEQ ID NO:19和SEQ ID NO:20克隆至表达载体pCDNA3.4(购自ThermoFisher)中,分别得到各核苷酸序列的融合蛋白表达载体。
编码氨基酸序列为SEQ ID NO:6的连接肽的核苷酸序列:
GGCGGGGGGGGCAGCGGCGGGGGCGGCAGCGGGGGCGGGGGCAGC(SEQ ID NO:53)。
编码氨基酸序列为SEQ ID NO:7的连接肽的核苷酸序列:
GGGGGGGGGGGGAGCGGGGGGGGCGGGAGCGGCGGGGGCGGGAGCGGCGGGGGCGGCAGC(SEQ IDNO:54)。
编码氨基酸序列为SEQ ID NO:8的连接肽的核苷酸序列:
GGCGGCGGAGGCAGCGGCGGAGGCGGCTCTGGCGGCGGCGGCAGCGGCGGCGGAGGAAGCGGAGGAGGCGGTTCT(SEQ ID NO:55)。
实施例2:重组长效人生长激素融合蛋白表达
利用Expi CHO-S(Gibco,A29133)作为宿主细胞,将实施例1获得的多个质粒转染宿主细胞,采用化学转染试剂Polyplus-FectoPRO(polyplus,116-010)瞬时表达融合蛋白(即为重组长效人生长激素融合蛋白),对应的氨基酸序列为SEQ ID NO:21-32。实验操作如下:
1、Expi CHO-S细胞瞬染前一天对细胞进行传代,用培养基(CD FortiCHOTM培养基)将细胞密度调整为3×106细胞/mL左右。将细胞培养瓶放回摇床(37℃、8%CO2)继续培养。
2、瞬染当天(每个分子转染1L):取Expi CHO-S细胞液进行细胞计数,用培养基将细胞密度调整为6×106细胞/mL左右。
3、配制转染复合物:取16个无菌细胞培养瓶,8个标记“DNA”,8个标记“FectoPRO”,在“FectoPRO”瓶中加入转染试剂Polyplus-FectoPRO;在8个“DNA”瓶中全部加入60mLOpti-MEM溶液和每个“DNA”瓶各自加入500μg实施例1所得各核苷酸序列的融合蛋白表达载体;混匀,得融合蛋白表达载体稀释液,将各核苷酸序列得融合蛋白表达载体稀释液分别加入“FectoPRO”瓶中混匀,室温孵育10min后加入细胞液中摇匀,将细胞培养瓶放回摇床继续培养转染。
4、转染18-22h后加入适量OPM-CHO ProFeed,并测细胞液生化指标。根据生化指标,补葡糖糖至6g/L。转染4d后开始测Titer,并隔天补料补糖。细胞活率<80%即得CHO细胞发酵液,可收获上清液进行纯化。
实施例3:重组长效人生长激素融合蛋白纯化
将实施例2所得到的CHO细胞发酵液进行二级离心(一级:3000×g、30min;二级:12000×g、20min),收集上清液经0.2μm滤器过滤,待用。
Protein A亲和层析:使用pH7.2的含20mM磷酸盐和150mM氯化钠的水溶液平衡层析柱至少3个柱体积,上样过滤后的澄清滤液,料液保留在层析柱上的时间为5分钟,上完样品后,再使用pH7.2的含20mM磷酸盐和150mM氯化钠的水溶液平衡层析柱至少1个柱体积,按照顺序使用pH 4.5的含50mM HAc的缓冲液,pH4.0的含50mM醋酸(HAc)的缓冲液,和pH 3.5的含50mM HAc的缓冲液,分别洗脱目的蛋白,当出现吸收峰时,收峰范围为50mAU-peak-50mAU,分别得到SEC纯度符合要求(>95.0)的核苷酸序列为SEQ ID NO:9、SEQ ID NO:10、SEQ ID NO:11、SEQ ID NO:12、SEQ ID NO:13、SEQ ID NO:14、SEQ ID NO:15、SEQ ID NO:16、SEQ ID NO:7、SEQ ID NO:18、SEQ ID NO:19、SEQ ID NO:20的融合蛋白,分别命名为HT-1融合蛋白、HT-2融合蛋白、HT-3融合蛋白、HT-4融合蛋白、HT-5融合蛋白、HT-6融合蛋白、HT-7融合蛋白、HT-8融合蛋白、HT-9融合蛋白、HT-10融合蛋白、HT-11融合蛋白、HT-12融合蛋白,具体参见表1。融合蛋白的具体结构如图1所示。
表1:不同融合蛋白的SEC纯度结果
Figure BDA0003597163400000221
此外,发明人还选择不同培养基培养表达HT-9、HT-11和HT-12融合蛋白的细胞,具体培养步骤参见实施例2,区别仅在于培养基和宿主细胞不同,本申请分别选择了Actipro培养基、Vega CHO培养基、AM02培养基、Eden B600s培养基和Max Z培养基,宿主细胞为CHO-K1细胞,结果如图2所示,当细胞培养至第6天(图表中竖线位置)到达活细胞密度峰值,细胞活力开始显著下降(其他培养基细胞生长情况相似,结果选取有代表性的Vega CHO培养基展示)。并且发现,HT-11融合蛋白在所有培养基中均有最高表达;还发现,相较于其他培养基,Max Z、Actipro和Vega CHO可提高融合蛋白的表达。
实施例4:融合蛋白对人GHR结合试验
分别将实施例3所得核苷酸序列为SEQ ID NO:9、SEQ ID NO:10、SEQ ID NO:11、SEQ ID NO:12、SEQ ID NO:13、SEQ ID NO:14、SEQ ID NO:15、SEQ ID NO:16、SEQ ID NO:7、SEQ ID NO:18、SEQ ID NO:19、SEQ ID NO:20的融合蛋白作为重组长效人生长激素融合蛋白,并以Genexine GX-H9为对照组,具体实验操作如下:
将人GHR稀释制得0.5μg/ml包被液,分别以100μL/孔加入酶标板,2~8℃包被12小时以上。弃去包板残液,加入1%BSA-PBST(含1%牛血清白蛋白的磷酸盐吐温缓冲液),每孔300μL,37℃封闭1小时。每孔加入300μL的PBST(磷酸盐吐温缓冲液)洗3次,将重组长效人生长激素融合蛋白和Genexine GX-H9稀释至5μg/ml,再5倍稀释8个梯度浓度,100μL/孔加入酶标板。37℃孵育1小时,每孔加入300μL的PBST,洗涤3次后加入1%BSA-PBST稀释10000倍的Goat anti human IgG Fc-HRP,100μL/孔加样。37℃孵育1小时后,每孔加入300μL的PBST,洗涤3次,拍干。加入TMB显色液,每孔100μL。室温反应5分钟后加2M H2SO4水溶液终止反应,100μL/孔。将中止反应的酶标板置酶标仪上,450nm波长下读取吸光度OD450值,并计算各融合蛋白的对人GHR结合活性,结果表明,SEQ ID NO:9-20的融合蛋白相对Genexine GX-H9的人GHR结合活性均在40%以上。结果表明,本发明所提供的融合蛋白与人GHR有很强的结合活性。
其中,Genexine GX-H9的氨基酸序列为:
FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGFRNTGRGGEEKKKEKEKEEQEERETKTPECPSHTQPLGVFLFPPKPKDTLMISRTPEVTCVVVDVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSRLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLGK(SEQ ID NO:56)。
实施例5:重组长效人生长激素融合蛋白促进Nb2-11细胞增殖的实验
分别将实施例3所得核苷酸序列为SEQ ID NO:9、SEQ ID NO:10、SEQ ID NO:11、SEQ ID NO:12、SEQ ID NO:13、SEQ ID NO:14、SEQ ID NO:15、SEQ ID NO:16、SEQ ID NO:7、SEQ ID NO:18、SEQ ID NO:19、SEQ ID NO:20的融合蛋白作为重组长效人生长激素融合蛋白,并以上述Genexine GX-H9为对照组,具体实验操作如下:
收集细胞并以每毫升1×105个细胞的浓度悬浮在培养基(PRMI 1640培养基,含1%FBS和50μM的β-巯基乙醇)。将每份50μL细胞样品加到96孔细胞培养板的各孔中。用50μL含有0.051ng/mL-3000ng/mL各种浓度重组长效人生长激素融合蛋白和Genexine GX-H9的分析培养基培养上述细胞。在37℃、5%CO2湿润培养箱中培养该细胞板96小时,然后在各孔中添加50μL ellTiter-Glo Luminescent Cell Viability Assay(Promega,G7571)。10分钟后,酶标仪检测化学发光信号。由所得剂量反应曲线的EC50值计算得到重组长效人生长激素融合蛋白的生物学活性,结果表明,SEQ ID NO:9-20的融合蛋白相对GX-H9的人GHR结合活性相近。结果表明,本发明所提供的融合蛋白对Nb2-11细胞有很好的增殖作用。
实施例6:重组长效人生长激素融合蛋白促进报告基因细胞表达荧光素酶的实验
分别将实施例3所得核苷酸序列为SEQ ID NO:9、SEQ ID NO:10、SEQ ID NO:11、SEQ ID NO:12、SEQ ID NO:13、SEQ ID NO:14、SEQ ID NO:15、SEQ ID NO:16、SEQ ID NO:7、SEQ ID NO:18、SEQ ID NO:19、SEQ ID NO:20的融合蛋白作为候选物,并以上述GenexineGX-H9为对照组,具体实验操作如下:
取对数生长期的报告基因细胞(293-GHR/STAT5 cell line),胰酶消化后铺板(costar,3917),4×104cells/well,50μL/well,将96孔白板放在37℃、5%CO2培养箱中孵育过夜。细胞加入梯度稀释的重组长效人生长激素融合蛋白,起始浓度为100nM,5倍梯度稀释,10个梯度浓度,稀释液加入细胞中,添加体积为50μL/well,将96孔白板放在37℃、5%CO2培养箱中孵育6小时。孵育结束后,取出96孔白板和Nano-Glo Luciferase Assay试剂盒(promega,N112B),均平衡至室温,样品孔加入反应底物,添加体积为50μL/well,室温放置10分钟,采用酶标仪(Promega,GM2000)记录luminescence信号值,以蛋白浓度作为X轴,以luminescence信号值为Y轴,用GraphPad Prisim 5作四参数拟合,计算EC50值,结果见表2所示。本发明所提供的融合蛋白促进报告基因细胞表达荧光素酶的活性高,其中,HT-4、HT-8、HT-10、HT-11和HT-12的活性更优。并且,还发现,相较于其他的Fc突变体,SEQ ID NO:5所示的Fc突变体的稳定性较高。
表2:不同融合蛋白促进报告基因细胞表达荧光素酶实验的生物学活性检测结果
Figure BDA0003597163400000241
在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不必须针对的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任一个或多个实施例或示例中以合适的方式结合。此外,在不相互矛盾的情况下,本领域的技术人员可以将本说明书中描述的不同实施例或示例以及不同实施例或示例的特征进行结合和组合。
尽管上面已经示出和描述了本发明的实施例,可以理解的是,上述实施例是示例性的,不能理解为对本发明的限制,本领域的普通技术人员在本发明的范围内可以对上述实施例进行变化、修改、替换和变型。
SEQUENCE LISTING
<110> 深圳科兴药业有限公司
<120> 人生长激素融合蛋白及其制备和用途
<130> PDI220117
<160> 56
<170> PatentIn version 3.3
<210> 1
<211> 191
<212> PRT
<213> Artificial
<220>
<223> 1
<400> 1
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe
180 185 190
<210> 2
<211> 228
<212> PRT
<213> Artificial
<220>
<223> 2
<400> 2
Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys Pro Ala Pro Glu Ala
1 5 10 15
Glu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr
20 25 30
Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val
35 40 45
Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val Asp Gly Val
50 55 60
Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Phe Asn Ser
65 70 75 80
Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu
85 90 95
Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser
100 105 110
Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro
115 120 125
Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr Lys Asn Gln
130 135 140
Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala
145 150 155 160
Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr
165 170 175
Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu
180 185 190
Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe Ser Cys Ser
195 200 205
Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser
210 215 220
Leu Ser Leu Gly
225
<210> 3
<211> 228
<212> PRT
<213> Artificial
<220>
<223> 3
<400> 3
Glu Ser Lys Tyr Gly Pro Pro Ser Pro Pro Cys Pro Ala Pro Glu Ala
1 5 10 15
Glu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr
20 25 30
Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val
35 40 45
Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val Asp Gly Val
50 55 60
Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Phe Asn Ser
65 70 75 80
Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu
85 90 95
Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser
100 105 110
Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro
115 120 125
Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr Lys Asn Gln
130 135 140
Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala
145 150 155 160
Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr
165 170 175
Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu
180 185 190
Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe Ser Cys Ser
195 200 205
Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser
210 215 220
Leu Ser Leu Gly
225
<210> 4
<211> 228
<212> PRT
<213> Artificial
<220>
<223> 4
<400> 4
Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys Pro Ala Pro Pro Val
1 5 10 15
Glu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr
20 25 30
Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Ala Val
35 40 45
Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val Asp Gly Val
50 55 60
Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Phe Asn Ser
65 70 75 80
Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu
85 90 95
Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser
100 105 110
Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro
115 120 125
Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr Lys Asn Gln
130 135 140
Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala
145 150 155 160
Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr
165 170 175
Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu
180 185 190
Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe Ser Cys Ser
195 200 205
Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser
210 215 220
Leu Ser Leu Gly
225
<210> 5
<211> 228
<212> PRT
<213> Artificial
<220>
<223> 5
<400> 5
Glu Ser Lys Tyr Gly Pro Pro Ser Pro Pro Cys Pro Ala Pro Pro Val
1 5 10 15
Glu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr
20 25 30
Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Ala Val
35 40 45
Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val Asp Gly Val
50 55 60
Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Phe Asn Ser
65 70 75 80
Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu
85 90 95
Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser
100 105 110
Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro
115 120 125
Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr Lys Asn Gln
130 135 140
Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala
145 150 155 160
Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr
165 170 175
Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu
180 185 190
Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe Ser Cys Ser
195 200 205
Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser
210 215 220
Leu Ser Leu Gly
225
<210> 6
<211> 15
<212> PRT
<213> Artificial
<220>
<223> 6
<400> 6
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
1 5 10 15
<210> 7
<211> 20
<212> PRT
<213> Artificial
<220>
<223> 7
<400> 7
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
1 5 10 15
Gly Gly Gly Ser
20
<210> 8
<211> 25
<212> PRT
<213> Artificial
<220>
<223> 8
<400> 8
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
1 5 10 15
Gly Gly Gly Ser Gly Gly Gly Gly Ser
20 25
<210> 9
<211> 1302
<212> DNA
<213> Artificial
<220>
<223> 9
<400> 9
ttccccacca ttcctctgag ccggctgttc gacaacgcca tgctgagagc ccacagactg 60
caccagctgg ccttcgacac ctaccaagag ttcgaggaag cctacattcc caaagagcag 120
aagtacagct tcctgcagaa ccctcagacc agcctgtgct tcagcgagag catccccaca 180
cctagcaaca gagaggaaac ccagcagaag tccaacctgg aactgctgcg gatcagcctg 240
ctgctgatcc agtcttggct ggaacccgtg cagttcctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgccagcga cagcaacgtt tacgacctgc tgaaggacct ggaagagggc 360
atccagacac tgatgggcag actggaagat ggcagcccta gaaccggcca gatcttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaaaaactac 480
ggcctgctgt actgctttcg gaaggacatg gacaaggtgg aaaccttcct gcggatcgtg 540
cagtgcagaa gcgtggaagg ctcttgcgga tttggcggcg gaggatctgg cggaggtgga 600
agcggaggcg gtggatctga gtctaagtac ggccctcctt gtcctccatg tccagctcca 660
gaagctgaag gcggcccttc cgtgttcctg tttcctccaa agcctaagga caccctgatg 720
atcagcagaa cccctgaagt gacctgcgtg gtggtggacg tgtcccaaga ggatcctgag 780
gtgcagttca attggtacgt ggacggcgtg gaagtgcaca acgccaagac caagcctaga 840
gaggaacagt tcaacagcac ctatagagtg gtgtccgtgc tgaccgtgct gcaccaggat 900
tggctgaacg gcaaagagta caagtgcaag gtgtccaaca agggcctgcc tagcagcatc 960
gagaaaacca tcagcaaggc caagggccag ccaagggaac cccaggttta cacactgcct 1020
ccaagccaag aggaaatgac caagaaccag gtgtccctga cctgcctggt caagggcttc 1080
tacccttccg atatcgccgt ggaatgggag agcaatggcc agcctgagaa caactacaag 1140
accacacctc ctgtgctgga cagcgacggc tcattcttcc tgtacagcaa gctgacagtg 1200
gacaagagcc ggtggcaaga gggcaacgtg ttcagctgta gcgtgatgca cgaggccctg 1260
cacaaccact acacccagaa gtctctgagc ctgagcctgg ga 1302
<210> 10
<211> 1302
<212> DNA
<213> Artificial
<220>
<223> 10
<400> 10
ttccccacca ttcctctgag ccggctgttc gacaacgcca tgctgagagc ccacagactg 60
caccagctgg ccttcgacac ctaccaagag ttcgaggaag cctacattcc caaagagcag 120
aagtacagct tcctgcagaa ccctcagacc agcctgtgct tcagcgagag catccccaca 180
cctagcaaca gagaggaaac ccagcagaag tccaacctgg aactgctgcg gatcagcctg 240
ctgctgatcc agtcttggct ggaacccgtg cagttcctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgccagcga cagcaacgtt tacgacctgc tgaaggacct ggaagagggc 360
atccagacac tgatgggcag actggaagat ggcagcccta gaaccggcca gatcttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaaaaactac 480
ggcctgctgt actgctttcg gaaggacatg gacaaggtgg aaaccttcct gcggatcgtg 540
cagtgcagaa gcgtggaagg ctcttgcgga tttggcggcg gaggatctgg cggaggtgga 600
agcggaggcg gtggatctga gtctaagtac ggccctcctt ctccaccatg tcctgctcca 660
gaagctgaag gcggcccttc cgtgttcctg tttcctccaa agcctaagga caccctgatg 720
atcagcagaa cccctgaagt gacctgcgtg gtggtggacg tgtcccaaga ggatcctgag 780
gtgcagttca attggtacgt ggacggcgtg gaagtgcaca acgccaagac caagcctaga 840
gaggaacagt tcaacagcac ctatagagtg gtgtccgtgc tgaccgtgct gcaccaggat 900
tggctgaacg gcaaagagta caagtgcaag gtgtccaaca agggcctgcc tagcagcatc 960
gagaaaacca tcagcaaggc caagggccag ccaagggaac cccaggttta cacactgcct 1020
ccaagccaag aggaaatgac caagaaccag gtgtccctga cctgcctggt caagggcttc 1080
tacccttccg atatcgccgt ggaatgggag agcaatggcc agcctgagaa caactacaag 1140
accacacctc ctgtgctgga cagcgacggc tcattcttcc tgtacagcaa gctgacagtg 1200
gacaagagcc ggtggcaaga gggcaacgtg ttcagctgta gcgtgatgca cgaggccctg 1260
cacaaccact acacccagaa gtctctgagc ctgagcctgg ga 1302
<210> 11
<211> 1302
<212> DNA
<213> Artificial
<220>
<223> 11
<400> 11
ttccccacca tccccctgag cagactgttc gacaacgcca tgctgagagc ccacagactg 60
catcagctgg ccttcgacac ctaccaagag ttcgaggagg cctacatccc caaggagcag 120
aagtacagct tcctgcagaa ccctcagaca agcctgtgct tcagcgagag catccccacc 180
cctagcaaca gagaggagac acagcagaag agcaacctgg agctgctgag aatcagcctg 240
ctcctgattc agagctggct ggagcccgtg cagttcctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgcctccga cagcaacgtg tacgacctgc tgaaggacct ggaggagggc 360
attcagaccc tgatgggcag actggaggac ggcagcccta gaaccgggca gatcttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaagaactac 480
ggcctgctgt actgcttcag aaaggacatg gacaaggtgg agaccttcct gagaatcgtg 540
cagtgcagat ccgtggaagg gtcctgcggg ttcggcgggg ggggcagcgg cgggggcggc 600
agcgggggcg ggggcagcga gtccaagtat ggccccccct gccctccttg tcctgctccc 660
cccgtggagg ggggccctag cgtgttcctg ttccctccca agcccaagga caccctgatg 720
atcagcagaa cccccgaggt gacctgcgtg gtcgtggccg tgagccaaga ggaccccgag 780
gtgcagttca actggtacgt ggacggcgtg gaggtgcaca acgccaagac caagcctaga 840
gaggagcagt tcaacagcac ctacagagtg gtgagcgtgc tgaccgtgct gcaccaagac 900
tggctgaacg gcaaggagta caagtgcaag gtgagcaaca agggcctgcc tagcagcatc 960
gagaagacca tcagcaaggc caaggggcag cctagagagc cccaagtgta caccctgccc 1020
cctagccaag aggagatgac caagaaccaa gtgagcctga cctgcctggt gaagggcttc 1080
taccctagcg acatcgccgt ggagtgggag agcaacgggc agcccgagaa caactacaag 1140
accacccccc ccgtgctgga cagcgacggc agcttcttcc tgtacagcaa gctgaccgtg 1200
gacaagagca gatggcaaga gggcaacgtg ttcagctgca gcgtgatgca cgaggccctg 1260
cacaaccact acacacagaa gagcctgagc ctgagcctgg gc 1302
<210> 12
<211> 1302
<212> DNA
<213> Artificial
<220>
<223> 12
<400> 12
ttccccacca tccccctgag cagactgttc gacaacgcca tgctgagagc ccacagactg 60
catcagctgg ccttcgacac ctaccaagag ttcgaggagg cctacatccc caaggagcag 120
aagtacagct tcctgcagaa ccctcagaca agcctgtgct tcagcgagag catccccacc 180
cctagcaaca gagaggagac acagcagaag agcaacctgg agctgctgag aatcagcctg 240
ctcctgattc agagctggct ggagcccgtg cagttcctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgcctccga cagcaacgtg tacgacctgc tgaaggacct ggaggagggc 360
attcagaccc tgatgggcag actggaggac ggcagcccta gaaccgggca gatcttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaagaactac 480
ggcctgctgt actgcttcag aaaggacatg gacaaggtgg agaccttcct gagaatcgtg 540
cagtgcagat ccgtggaagg gtcctgcggg ttcggcgggg ggggcagcgg cgggggcggc 600
agcgggggcg ggggcagcga gtccaagtat ggccccccta gccctccttg tcctgctccc 660
cccgtggagg ggggccctag cgtgttcctg ttccctccca agcccaagga caccctgatg 720
atcagcagaa cccccgaggt gacctgcgtg gtcgtggccg tgagccaaga ggaccccgag 780
gtgcagttca actggtacgt ggacggcgtg gaggtgcaca acgccaagac caagcctaga 840
gaggagcagt tcaacagcac ctacagagtg gtgagcgtgc tgaccgtgct gcaccaagac 900
tggctgaacg gcaaggagta caagtgcaag gtgagcaaca agggcctgcc tagcagcatc 960
gagaagacca tcagcaaggc caaggggcag cctagagagc cccaagtgta caccctgccc 1020
cctagccaag aggagatgac caagaaccaa gtgagcctga cctgcctggt gaagggcttc 1080
taccctagcg acatcgccgt ggagtgggag agcaacgggc agcccgagaa caactacaag 1140
accacccccc ccgtgctgga cagcgacggc agcttcttcc tgtacagcaa gctgaccgtg 1200
gacaagagca gatggcaaga gggcaacgtg ttcagctgca gcgtgatgca cgaggccctg 1260
cacaaccact acacacagaa gagcctgagc ctgagcctgg gc 1302
<210> 13
<211> 1317
<212> DNA
<213> Artificial
<220>
<223> 13
<400> 13
ttccccacca ttcctctgag ccggctgttc gacaacgcca tgctgagagc ccacagactg 60
caccagctgg ccttcgacac ctaccaagag ttcgaggaag cctacattcc caaagagcag 120
aagtacagct tcctgcagaa ccctcagacc agcctgtgct tcagcgagag catccccaca 180
cctagcaaca gagaggaaac ccagcagaag tccaacctgg aactgctgcg gatcagcctg 240
ctgctgatcc agtcttggct ggaacccgtg cagttcctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgccagcga cagcaacgtt tacgacctgc tgaaggacct ggaagagggc 360
atccagacac tgatgggcag actggaagat ggcagcccta gaaccggcca gatcttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaaaaactac 480
ggcctgctgt actgctttcg gaaggacatg gacaaggtgg aaaccttcct gcggatcgtg 540
cagtgcagaa gcgtggaagg ctcttgcgga tttggcggcg gaggatctgg cggaggtgga 600
agcggaggcg gaggaagcgg tggcggcgga tctgagtcta agtacggacc tccttgtcct 660
ccatgtccag ctccagaagc tgaaggcggc ccttccgtgt tcctgtttcc tccaaagcct 720
aaggacaccc tgatgatcag cagaacccct gaagtgacct gcgtggtggt ggacgtgtcc 780
caagaggatc ctgaggtgca gttcaattgg tacgtggacg gcgtggaagt gcacaacgcc 840
aagaccaagc ctagagagga acagttcaac agcacctata gagtggtgtc cgtgctgacc 900
gtgctgcacc aggattggct gaacggcaaa gagtacaagt gcaaggtgtc caacaagggc 960
ctgcctagca gcatcgagaa aaccatcagc aaggccaagg gccagccaag ggaaccccag 1020
gtttacacac tgcctccaag ccaagaggaa atgaccaaga accaggtgtc cctgacctgc 1080
ctggtcaagg gcttctaccc ttccgatatc gccgtggaat gggagagcaa tggccagcct 1140
gagaacaact acaagaccac acctcctgtg ctggacagcg acggctcatt cttcctgtac 1200
agcaagctga cagtggacaa gagccggtgg caagagggca acgtgttcag ctgtagcgtg 1260
atgcacgagg ccctgcacaa ccactacacc cagaagtctc tgagcctgag cctggga 1317
<210> 14
<211> 1317
<212> DNA
<213> Artificial
<220>
<223> 14
<400> 14
ttccccacca ttcctctgag ccggctgttc gacaacgcca tgctgagagc ccacagactg 60
caccagctgg ccttcgacac ctaccaagag ttcgaggaag cctacattcc caaagagcag 120
aagtacagct tcctgcagaa ccctcagacc agcctgtgct tcagcgagag catccccaca 180
cctagcaaca gagaggaaac ccagcagaag tccaacctgg aactgctgcg gatcagcctg 240
ctgctgatcc agtcttggct ggaacccgtg cagttcctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgccagcga cagcaacgtt tacgacctgc tgaaggacct ggaagagggc 360
atccagacac tgatgggcag actggaagat ggcagcccta gaaccggcca gatcttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaaaaactac 480
ggcctgctgt actgctttcg gaaggacatg gacaaggtgg aaaccttcct gcggatcgtg 540
cagtgcagaa gcgtggaagg ctcttgcgga tttggcggcg gaggatctgg cggaggtgga 600
agcggaggcg gaggaagcgg tggcggcgga tctgagtcta agtacggacc tccttctcca 660
ccatgtcctg ctccagaagc tgaaggcggc ccttccgtgt tcctgtttcc tccaaagcct 720
aaggacaccc tgatgatcag cagaacccct gaagtgacct gcgtggtggt ggacgtgtcc 780
caagaggatc ctgaggtgca gttcaattgg tacgtggacg gcgtggaagt gcacaacgcc 840
aagaccaagc ctagagagga acagttcaac agcacctata gagtggtgtc cgtgctgacc 900
gtgctgcacc aggattggct gaacggcaaa gagtacaagt gcaaggtgtc caacaagggc 960
ctgcctagca gcatcgagaa aaccatcagc aaggccaagg gccagccaag ggaaccccag 1020
gtttacacac tgcctccaag ccaagaggaa atgaccaaga accaggtgtc cctgacctgc 1080
ctggtcaagg gcttctaccc ttccgatatc gccgtggaat gggagagcaa tggccagcct 1140
gagaacaact acaagaccac acctcctgtg ctggacagcg acggctcatt cttcctgtac 1200
agcaagctga cagtggacaa gagccggtgg caagagggca acgtgttcag ctgtagcgtg 1260
atgcacgagg ccctgcacaa ccactacacc cagaagtctc tgagcctgag cctggga 1317
<210> 15
<211> 1317
<212> DNA
<213> Artificial
<220>
<223> 15
<400> 15
ttccccacca tccccctgag cagactgttc gacaacgcca tgctgagagc ccacagactg 60
catcagctgg ccttcgacac ctaccaagag ttcgaggagg cctacatccc caaggagcag 120
aagtacagct tcctgcagaa ccctcagaca agcctgtgct tcagcgagag catccccacc 180
cctagcaaca gagaggagac acagcagaag agcaacctgg agctgctgag aatcagcctg 240
ctcctgattc agagctggct ggagcccgtg cagttcctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgcctccga cagcaacgtg tacgacctgc tgaaggacct ggaggagggc 360
attcagaccc tgatgggcag actggaggac ggcagcccta gaaccgggca gatcttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaagaactac 480
ggcctgctgt actgcttcag aaaggacatg gacaaggtgg agaccttcct gagaattgtg 540
cagtgccggt ccgtggaagg cagctgtggc tttggggggg gggggagcgg ggggggcggg 600
agcggcgggg gcgggagcgg cgggggcggc agcgagagca agtacgggcc cccctgccct 660
ccttgtcctg ctccccccgt ggaggggggc cctagcgtgt tcctgttccc ccccaagccc 720
aaggacaccc tgatgatcag cagaaccccc gaggtgacct gcgtggtcgt ggccgtgagc 780
caagaggacc ccgaggtgca gttcaactgg tacgtggacg gcgtggaggt gcacaacgcc 840
aagaccaagc ctagagagga gcagttcaac agcacctaca gagtggtgag cgtgctgacc 900
gtgctgcacc aagactggct gaacggcaag gagtacaagt gcaaggtgag caacaagggc 960
ctgcctagca gcatcgagaa gaccatcagc aaggccaagg ggcagcctag agagccccaa 1020
gtgtacaccc tgccccctag ccaagaggag atgaccaaga accaagtgag cctgacctgc 1080
ctggtgaagg gcttctaccc tagcgacatc gccgtggagt gggagagcaa cgggcagccc 1140
gagaacaact acaagaccac cccccccgtg ctggacagcg acggcagctt cttcctgtac 1200
agcaagctga ccgtggacaa gagcagatgg caagagggca acgtgttcag ctgcagcgtg 1260
atgcacgagg ccctgcacaa ccactacaca cagaagagcc tgagcctgag cctgggc 1317
<210> 16
<211> 1317
<212> DNA
<213> Artificial
<220>
<223> 16
<400> 16
ttccccacca tccccctgag cagactgttc gacaacgcca tgctgagagc ccacagactg 60
catcagctgg ccttcgacac ctaccaagag ttcgaggagg cctacatccc caaggagcag 120
aagtacagct tcctgcagaa ccctcagaca agcctgtgct tcagcgagag catccccacc 180
cctagcaaca gagaggagac acagcagaag agcaacctgg agctgctgag aatcagcctg 240
ctcctgattc agagctggct ggagcccgtg cagttcctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgcctccga cagcaacgtg tacgacctgc tgaaggacct ggaggagggc 360
attcagaccc tgatgggcag actggaggac ggcagcccta gaaccgggca gatcttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaagaactac 480
ggcctgctgt actgcttcag aaaggacatg gacaaggtgg agaccttcct gagaattgtg 540
cagtgccggt ccgtggaagg cagctgtggc tttggggggg gggggagcgg ggggggcggg 600
agcggcgggg gcgggagcgg cgggggcggc agcgagagca agtacgggcc ccctagccct 660
ccttgtcctg ctccccccgt ggaggggggc cctagcgtgt tcctgttccc ccccaagccc 720
aaggacaccc tgatgatcag cagaaccccc gaggtgacct gcgtggtcgt ggccgtgagc 780
caagaggacc ccgaggtgca gttcaactgg tacgtggacg gcgtggaggt gcacaacgcc 840
aagaccaagc ctagagagga gcagttcaac agcacctaca gagtggtgag cgtgctgacc 900
gtgctgcacc aagactggct gaacggcaag gagtacaagt gcaaggtgag caacaagggc 960
ctgcctagca gcatcgagaa gaccatcagc aaggccaagg ggcagcctag agagccccaa 1020
gtgtacaccc tgccccctag ccaagaggag atgaccaaga accaagtgag cctgacctgc 1080
ctggtgaagg gcttctaccc tagcgacatc gccgtggagt gggagagcaa cgggcagccc 1140
gagaacaact acaagaccac cccccccgtg ctggacagcg acggcagctt cttcctgtac 1200
agcaagctga ccgtggacaa gagcagatgg caagagggca acgtgttcag ctgcagcgtg 1260
atgcacgagg ccctgcacaa ccactacaca cagaagagcc tgagcctgag cctgggc 1317
<210> 17
<211> 1332
<212> DNA
<213> Artificial
<220>
<223> 17
<400> 17
ttccccacca tccccctgag cagactgttc gacaacgcca tgctgagagc ccacagactg 60
catcagctgg ccttcgacac ctaccaagag ttcgaggagg cctacatccc caaggagcag 120
aagtacagct tcctgcagaa ccctcagaca agcctgtgct tcagcgagag catccccacc 180
cctagcaaca gagaggagac acagcagaag agcaacctgg agctgctgag aatcagcctg 240
ctcctgattc agagctggct ggagcccgtg cagttcctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgcctccga cagcaacgtg tacgacctgc tgaaggacct ggaggagggc 360
attcagaccc tgatgggcag actggaggac ggcagcccta gaaccgggca gatcttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaagaactac 480
ggcctgctgt actgcttcag aaaggacatg gacaaggtgg agaccttcct gagaatcgtg 540
caatgtagaa gcgtcgaagg gagctgcggc ttcggcgggg gcggcagcgg cgggggcggc 600
agcgggggcg ggggcagcgg cggcgggggg tccggcggcg ggggcagcga atccaaatac 660
ggccccccct gtcccccttg ccctgccccc gaggctgaag gcgggcctag cgtgttcctg 720
ttccccccca agcccaagga caccctgatg atcagcagaa cccccgaggt gacctgcgtg 780
gtcgtggacg tgagccaaga ggaccccgag gtgcagttca actggtacgt ggacggcgtg 840
gaggtgcaca acgccaagac caagcctaga gaggagcagt tcaacagcac ctacagagtg 900
gtgagcgtgc tgaccgtgct gcaccaagac tggctgaacg gcaaggagta caagtgcaag 960
gtgagcaaca agggcctgcc tagcagcatc gagaagacca tcagcaaggc caaggggcag 1020
cctagagagc cccaagtgta caccctgccc cctagccaag aggagatgac caagaaccaa 1080
gtgagcctga cctgcctggt gaagggcttc taccctagcg acatcgccgt ggagtgggag 1140
agcaacgggc agcccgagaa caactacaag accacccccc ccgtgctgga cagcgacggc 1200
agcttcttcc tgtacagcaa gctgaccgtg gacaagagca gatggcaaga gggcaacgtg 1260
ttcagctgca gcgtgatgca cgaggccctg cacaaccact acacacagaa gagcctgagc 1320
ctgagcctgg gc 1332
<210> 18
<211> 1332
<212> DNA
<213> Artificial
<220>
<223> 18
<400> 18
ttccccacca ttcctctgag ccggctgttc gacaacgcca tgctgagagc ccacagactg 60
caccagctgg ccttcgacac ctaccaagag ttcgaggaag cctacattcc caaagagcag 120
aagtacagct tcctgcagaa ccctcagacc agcctgtgct tcagcgagag catccccaca 180
cctagcaaca gagaggaaac ccagcagaag tccaacctgg aactgctgcg gatcagcctg 240
ctgctgatcc agtcttggct ggaacccgtg cagttcctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgccagcga cagcaacgtt tacgacctgc tgaaggacct ggaagagggc 360
atccagacac tgatgggcag actggaagat ggcagcccta gaaccggcca gatcttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaaaaactac 480
ggcctgctgt actgctttcg gaaggacatg gacaaggtgg aaaccttcct gcggatcgtg 540
cagtgcagaa gcgtggaagg ctcttgcgga tttggcggcg gaggatctgg cggaggtgga 600
agcggaggcg gaggaagcgg tggcggcggt agtggcggtg gtggatctga gtctaagtac 660
ggccctcctt ctccaccatg tcctgctcca gaagctgaag gcggcccttc cgtgttcctg 720
tttcctccaa agcctaagga caccctgatg atcagcagaa cccctgaagt gacctgcgtg 780
gtggtggacg tgtcccaaga ggatcctgag gtgcagttca attggtacgt ggacggcgtg 840
gaagtgcaca acgccaagac caagcctaga gaggaacagt tcaacagcac ctatagagtg 900
gtgtccgtgc tgaccgtgct gcaccaggat tggctgaacg gcaaagagta caagtgcaag 960
gtgtccaaca agggcctgcc tagcagcatc gagaaaacca tcagcaaggc caagggccag 1020
ccaagggaac cccaggttta cacactgcct ccaagccaag aggaaatgac caagaaccag 1080
gtgtccctga cctgcctggt caagggcttc tacccttccg atatcgccgt ggaatgggag 1140
agcaatggcc agcctgagaa caactacaag accacacctc ctgtgctgga cagcgacggc 1200
tcattcttcc tgtacagcaa gctgacagtg gacaagagcc ggtggcaaga gggcaacgtg 1260
ttcagctgta gcgtgatgca cgaggccctg cacaaccact acacccagaa gtctctgagc 1320
ctgagcctgg ga 1332
<210> 19
<211> 1332
<212> DNA
<213> Artificial
<220>
<223> 19
<400> 19
ttccccacga tccctctgtc tagactgttc gacaacgcca tgctgcgggc ccacagactg 60
caccagctgg ccttcgacac ctaccaggag ttcgaggagg cctacatccc caaagagcaa 120
aaatacagct tcctgcagaa cccccagaca tctttgtgct ttagcgagag catccctacc 180
cctagcaata gagaagagac acagcagaag agcaacctgg aactgctgcg gatcagcctg 240
ctgctgatcc agagctggct ggaacccgtg caatttctgc gcagcgtctt cgccaacagc 300
ctggtgtacg gcgcctctga tagcaacgtg tacgacctgc tgaaagatct ggaagagggc 360
atccagaccc tgatgggaag actggaggac ggctctccaa gaacaggcca aatcttcaag 420
cagacctaca gcaaattcga tacaaacagc cacaacgacg acgccctgct gaagaactac 480
ggcctgctgt actgcttcag aaaggacatg gacaaggtgg aaaccttcct gcggatcgtg 540
cagtgcagat ctgtggaagg cagctgtggc ttcggcggcg gaggcagcgg cggaggcggc 600
tctggcggcg gcggcagcgg cggcggagga agcggaggag gcggttctga gagcaagtac 660
ggcccccctt gtcctccttg ccccgcccct cccgtggaag gcggacctag tgtgttcctc 720
ttccctccaa aacctaagga taccctgatg atcagccgga cacctgaggt tacatgcgtg 780
gtcgtggctg tgagccagga agatcctgag gtgcagttca actggtacgt ggacggcgtg 840
gaagtgcata atgctaagac caagcctcgg gaagagcagt ttaactccac ctatagagtg 900
gtgtccgttc tgaccgtgct gcaccaggac tggctgaacg gcaaggaata caagtgcaag 960
gtgtccaaca agggcctgcc cagttctatt gagaagacaa ttagcaaggc caagggccag 1020
cctagagagc ctcaggtgta caccctgcct cccagccagg aggaaatgac caagaaccag 1080
gtgtctctca cttgtctggt gaagggcttc taccccagcg acatcgccgt ggagtgggag 1140
tctaatggcc aacctgagaa caactacaag accacacctc cagtgctgga cagcgatggc 1200
tcttttttcc tgtactccaa gctgacagtg gacaagtcca ggtggcagga gggaaatgtg 1260
ttcagctgca gcgtgatgca cgaggctctg cacaatcact atacccagaa aagcctctct 1320
ctgagcctgg gc 1332
<210> 20
<211> 1332
<212> DNA
<213> Artificial
<220>
<223> 20
<400> 20
ttccccacca tccccctgtc tagactgttt gataacgcca tgctgcgggc ccacagactc 60
catcagctgg ccttcgacac ctaccaggag ttcgaggaag cctacatccc taaggaacaa 120
aaatactcct tcctgcagaa cccccaaaca agcctgtgct tcagcgagtc catccctacc 180
ccttctaaca gagaggaaac acagcagaaa tctaatctgg aactgctgag gatcagcctg 240
ctgctgatcc agagctggct ggaacctgtg caatttctgc ggagcgtgtt tgccaacagc 300
ctggtgtatg gagccagcga cagcaatgtg tacgacctgc tgaaagatct ggaagaggga 360
atccagaccc tgatgggcag actggaagat ggcagcccta gaaccggcca gatcttcaag 420
cagacctaca gcaagttcga tacaaacagc cacaacgacg acgccctgct gaagaactac 480
ggcctgctgt actgttttag aaaggacatg gacaaggtgg agacattcct ccggatcgtg 540
cagtgcagat ctgtggaggg cagctgcggc ttcggcggcg gcggcagcgg cggtggcggt 600
agcggcggag gcgggtccgg aggcggcggc agcggcggag gcggatctga gtccaagtac 660
ggccccccca gccctccatg tcctgcccct cctgtggaag gcggaccttc tgttttcctg 720
ttcccaccaa aacctaagga caccctgatg atcagcagaa cacctgaggt gacctgcgtg 780
gtcgtggccg taagccagga ggaccccgaa gtgcagttca actggtacgt ggacggcgtg 840
gaggtgcaca acgctaagac caagcctcgg gaagagcagt ttaacagcac ctacagagtg 900
gtgtctgtgc tgaccgtgct gcaccaggac tggctgaacg gcaaagagta caagtgcaag 960
gtgtccaaca agggactgcc cagcagcatc gagaagacga ttagcaaggc caagggccaa 1020
cctagagagc ctcaggtgta caccctgcct ccttctcagg aggagatgac caaaaaccag 1080
gtgtccctga cctgtctggt gaagggcttc tatccctctg acatcgccgt cgagtgggaa 1140
agcaatggcc agcctgaaaa caactacaag acaacacccc cagtgcttga tagcgacggc 1200
agcttcttcc tgtactccaa gctgacagtg gataagagcc ggtggcagga aggaaacgtg 1260
ttctcatgca gcgtgatgca cgaggctctg cacaatcact acacccagaa gagcctgtca 1320
ctgagcctgg gc 1332
<210> 21
<211> 434
<212> PRT
<213> Artificial
<220>
<223> 21
<400> 21
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Gly
180 185 190
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glu Ser
195 200 205
Lys Tyr Gly Pro Pro Cys Pro Pro Cys Pro Ala Pro Glu Ala Glu Gly
210 215 220
Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met
225 230 235 240
Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser Gln
245 250 255
Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val Asp Gly Val Glu Val
260 265 270
His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Phe Asn Ser Thr Tyr
275 280 285
Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly
290 295 300
Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile
305 310 315 320
Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val
325 330 335
Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr Lys Asn Gln Val Ser
340 345 350
Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu
355 360 365
Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro
370 375 380
Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val
385 390 395 400
Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe Ser Cys Ser Val Met
405 410 415
His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser
420 425 430
Leu Gly
<210> 22
<211> 434
<212> PRT
<213> Artificial
<220>
<223> 22
<400> 22
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Gly
180 185 190
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glu Ser
195 200 205
Lys Tyr Gly Pro Pro Ser Pro Pro Cys Pro Ala Pro Glu Ala Glu Gly
210 215 220
Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met
225 230 235 240
Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser Gln
245 250 255
Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val Asp Gly Val Glu Val
260 265 270
His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Phe Asn Ser Thr Tyr
275 280 285
Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly
290 295 300
Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile
305 310 315 320
Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val
325 330 335
Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr Lys Asn Gln Val Ser
340 345 350
Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu
355 360 365
Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro
370 375 380
Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val
385 390 395 400
Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe Ser Cys Ser Val Met
405 410 415
His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser
420 425 430
Leu Gly
<210> 23
<211> 434
<212> PRT
<213> Artificial
<220>
<223> 23
<400> 23
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Gly
180 185 190
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glu Ser
195 200 205
Lys Tyr Gly Pro Pro Cys Pro Pro Cys Pro Ala Pro Pro Val Glu Gly
210 215 220
Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met
225 230 235 240
Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Ala Val Ser Gln
245 250 255
Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val Asp Gly Val Glu Val
260 265 270
His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Phe Asn Ser Thr Tyr
275 280 285
Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly
290 295 300
Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile
305 310 315 320
Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val
325 330 335
Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr Lys Asn Gln Val Ser
340 345 350
Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu
355 360 365
Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro
370 375 380
Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val
385 390 395 400
Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe Ser Cys Ser Val Met
405 410 415
His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser
420 425 430
Leu Gly
<210> 24
<211> 434
<212> PRT
<213> Artificial
<220>
<223> 24
<400> 24
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Gly
180 185 190
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glu Ser
195 200 205
Lys Tyr Gly Pro Pro Ser Pro Pro Cys Pro Ala Pro Pro Val Glu Gly
210 215 220
Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met
225 230 235 240
Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Ala Val Ser Gln
245 250 255
Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val Asp Gly Val Glu Val
260 265 270
His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Phe Asn Ser Thr Tyr
275 280 285
Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly
290 295 300
Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile
305 310 315 320
Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val
325 330 335
Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr Lys Asn Gln Val Ser
340 345 350
Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu
355 360 365
Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro
370 375 380
Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val
385 390 395 400
Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe Ser Cys Ser Val Met
405 410 415
His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser
420 425 430
Leu Gly
<210> 25
<211> 439
<212> PRT
<213> Artificial
<220>
<223> 25
<400> 25
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Gly
180 185 190
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly
195 200 205
Gly Gly Ser Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys Pro Ala
210 215 220
Pro Glu Ala Glu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro
225 230 235 240
Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val
245 250 255
Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val
260 265 270
Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln
275 280 285
Phe Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln
290 295 300
Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly
305 310 315 320
Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro
325 330 335
Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr
340 345 350
Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser
355 360 365
Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr
370 375 380
Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr
385 390 395 400
Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe
405 410 415
Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys
420 425 430
Ser Leu Ser Leu Ser Leu Gly
435
<210> 26
<211> 439
<212> PRT
<213> Artificial
<220>
<223> 26
<400> 26
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Gly
180 185 190
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly
195 200 205
Gly Gly Ser Glu Ser Lys Tyr Gly Pro Pro Ser Pro Pro Cys Pro Ala
210 215 220
Pro Glu Ala Glu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro
225 230 235 240
Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val
245 250 255
Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val
260 265 270
Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln
275 280 285
Phe Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln
290 295 300
Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly
305 310 315 320
Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro
325 330 335
Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr
340 345 350
Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser
355 360 365
Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr
370 375 380
Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr
385 390 395 400
Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe
405 410 415
Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys
420 425 430
Ser Leu Ser Leu Ser Leu Gly
435
<210> 27
<211> 439
<212> PRT
<213> Artificial
<220>
<223> 27
<400> 27
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Gly
180 185 190
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly
195 200 205
Gly Gly Ser Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys Pro Ala
210 215 220
Pro Pro Val Glu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro
225 230 235 240
Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val
245 250 255
Val Ala Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val
260 265 270
Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln
275 280 285
Phe Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln
290 295 300
Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly
305 310 315 320
Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro
325 330 335
Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr
340 345 350
Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser
355 360 365
Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr
370 375 380
Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr
385 390 395 400
Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe
405 410 415
Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys
420 425 430
Ser Leu Ser Leu Ser Leu Gly
435
<210> 28
<211> 439
<212> PRT
<213> Artificial
<220>
<223> 28
<400> 28
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Gly
180 185 190
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly
195 200 205
Gly Gly Ser Glu Ser Lys Tyr Gly Pro Pro Ser Pro Pro Cys Pro Ala
210 215 220
Pro Pro Val Glu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro
225 230 235 240
Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val
245 250 255
Val Ala Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val
260 265 270
Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln
275 280 285
Phe Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln
290 295 300
Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly
305 310 315 320
Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro
325 330 335
Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr
340 345 350
Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser
355 360 365
Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr
370 375 380
Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr
385 390 395 400
Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe
405 410 415
Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys
420 425 430
Ser Leu Ser Leu Ser Leu Gly
435
<210> 29
<211> 444
<212> PRT
<213> Artificial
<220>
<223> 29
<400> 29
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Gly
180 185 190
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly
195 200 205
Gly Gly Ser Gly Gly Gly Gly Ser Glu Ser Lys Tyr Gly Pro Pro Cys
210 215 220
Pro Pro Cys Pro Ala Pro Glu Ala Glu Gly Gly Pro Ser Val Phe Leu
225 230 235 240
Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu
245 250 255
Val Thr Cys Val Val Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln
260 265 270
Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys
275 280 285
Pro Arg Glu Glu Gln Phe Asn Ser Thr Tyr Arg Val Val Ser Val Leu
290 295 300
Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys
305 310 315 320
Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys
325 330 335
Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser
340 345 350
Gln Glu Glu Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys
355 360 365
Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln
370 375 380
Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly
385 390 395 400
Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln
405 410 415
Glu Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn
420 425 430
His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Leu Gly
435 440
<210> 30
<211> 444
<212> PRT
<213> Artificial
<220>
<223> 30
<400> 30
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Gly
180 185 190
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly
195 200 205
Gly Gly Ser Gly Gly Gly Gly Ser Glu Ser Lys Tyr Gly Pro Pro Ser
210 215 220
Pro Pro Cys Pro Ala Pro Glu Ala Glu Gly Gly Pro Ser Val Phe Leu
225 230 235 240
Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu
245 250 255
Val Thr Cys Val Val Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln
260 265 270
Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys
275 280 285
Pro Arg Glu Glu Gln Phe Asn Ser Thr Tyr Arg Val Val Ser Val Leu
290 295 300
Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys
305 310 315 320
Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys
325 330 335
Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser
340 345 350
Gln Glu Glu Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys
355 360 365
Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln
370 375 380
Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly
385 390 395 400
Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln
405 410 415
Glu Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn
420 425 430
His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Leu Gly
435 440
<210> 31
<211> 444
<212> PRT
<213> Artificial
<220>
<223> 31
<400> 31
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Gly
180 185 190
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly
195 200 205
Gly Gly Ser Gly Gly Gly Gly Ser Glu Ser Lys Tyr Gly Pro Pro Cys
210 215 220
Pro Pro Cys Pro Ala Pro Pro Val Glu Gly Gly Pro Ser Val Phe Leu
225 230 235 240
Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu
245 250 255
Val Thr Cys Val Val Val Ala Val Ser Gln Glu Asp Pro Glu Val Gln
260 265 270
Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys
275 280 285
Pro Arg Glu Glu Gln Phe Asn Ser Thr Tyr Arg Val Val Ser Val Leu
290 295 300
Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys
305 310 315 320
Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys
325 330 335
Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser
340 345 350
Gln Glu Glu Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys
355 360 365
Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln
370 375 380
Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly
385 390 395 400
Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln
405 410 415
Glu Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn
420 425 430
His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Leu Gly
435 440
<210> 32
<211> 444
<212> PRT
<213> Artificial
<220>
<223> 32
<400> 32
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Gly
180 185 190
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly
195 200 205
Gly Gly Ser Gly Gly Gly Gly Ser Glu Ser Lys Tyr Gly Pro Pro Ser
210 215 220
Pro Pro Cys Pro Ala Pro Pro Val Glu Gly Gly Pro Ser Val Phe Leu
225 230 235 240
Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu
245 250 255
Val Thr Cys Val Val Val Ala Val Ser Gln Glu Asp Pro Glu Val Gln
260 265 270
Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys
275 280 285
Pro Arg Glu Glu Gln Phe Asn Ser Thr Tyr Arg Val Val Ser Val Leu
290 295 300
Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys
305 310 315 320
Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys
325 330 335
Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser
340 345 350
Gln Glu Glu Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys
355 360 365
Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln
370 375 380
Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly
385 390 395 400
Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln
405 410 415
Glu Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn
420 425 430
His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Leu Gly
435 440
<210> 33
<211> 573
<212> DNA
<213> Artificial
<220>
<223> 33
<400> 33
ttccccacca ttcctctgag ccggctgttc gacaacgcca tgctgagagc ccacagactg 60
caccagctgg ccttcgacac ctaccaagag ttcgaggaag cctacattcc caaagagcag 120
aagtacagct tcctgcagaa ccctcagacc agcctgtgct tcagcgagag catccccaca 180
cctagcaaca gagaggaaac ccagcagaag tccaacctgg aactgctgcg gatcagcctg 240
ctgctgatcc agtcttggct ggaacccgtg cagttcctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgccagcga cagcaacgtt tacgacctgc tgaaggacct ggaagagggc 360
atccagacac tgatgggcag actggaagat ggcagcccta gaaccggcca gatcttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaaaaactac 480
ggcctgctgt actgctttcg gaaggacatg gacaaggtgg aaaccttcct gcggatcgtg 540
cagtgcagaa gcgtggaagg ctcttgcgga ttt 573
<210> 34
<211> 573
<212> DNA
<213> Artificial
<220>
<223> 34
<400> 34
ttccccacca tccccctgag cagactgttc gacaacgcca tgctgagagc ccacagactg 60
catcagctgg ccttcgacac ctaccaagag ttcgaggagg cctacatccc caaggagcag 120
aagtacagct tcctgcagaa ccctcagaca agcctgtgct tcagcgagag catccccacc 180
cctagcaaca gagaggagac acagcagaag agcaacctgg agctgctgag aatcagcctg 240
ctcctgattc agagctggct ggagcccgtg cagttcctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgcctccga cagcaacgtg tacgacctgc tgaaggacct ggaggagggc 360
attcagaccc tgatgggcag actggaggac ggcagcccta gaaccgggca gatcttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaagaactac 480
ggcctgctgt actgcttcag aaaggacatg gacaaggtgg agaccttcct gagaatcgtg 540
cagtgcagat ccgtggaagg gtcctgcggg ttc 573
<210> 35
<211> 573
<212> DNA
<213> Artificial
<220>
<223> 35
<400> 35
ttccccacca tccccctgag cagactgttc gacaacgcca tgctgagagc ccacagactg 60
catcagctgg ccttcgacac ctaccaagag ttcgaggagg cctacatccc caaggagcag 120
aagtacagct tcctgcagaa ccctcagaca agcctgtgct tcagcgagag catccccacc 180
cctagcaaca gagaggagac acagcagaag agcaacctgg agctgctgag aatcagcctg 240
ctcctgattc agagctggct ggagcccgtg cagttcctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgcctccga cagcaacgtg tacgacctgc tgaaggacct ggaggagggc 360
attcagaccc tgatgggcag actggaggac ggcagcccta gaaccgggca gatcttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaagaactac 480
ggcctgctgt actgcttcag aaaggacatg gacaaggtgg agaccttcct gagaattgtg 540
cagtgccggt ccgtggaagg cagctgtggc ttt 573
<210> 36
<211> 573
<212> DNA
<213> Artificial
<220>
<223> 36
<400> 36
ttccccacca tccccctgag cagactgttc gacaacgcca tgctgagagc ccacagactg 60
catcagctgg ccttcgacac ctaccaagag ttcgaggagg cctacatccc caaggagcag 120
aagtacagct tcctgcagaa ccctcagaca agcctgtgct tcagcgagag catccccacc 180
cctagcaaca gagaggagac acagcagaag agcaacctgg agctgctgag aatcagcctg 240
ctcctgattc agagctggct ggagcccgtg cagttcctga gaagcgtgtt cgccaacagc 300
ctggtgtacg gcgcctccga cagcaacgtg tacgacctgc tgaaggacct ggaggagggc 360
attcagaccc tgatgggcag actggaggac ggcagcccta gaaccgggca gatcttcaag 420
cagacctaca gcaagttcga caccaacagc cacaacgacg acgccctgct gaagaactac 480
ggcctgctgt actgcttcag aaaggacatg gacaaggtgg agaccttcct gagaatcgtg 540
caatgtagaa gcgtcgaagg gagctgcggc ttc 573
<210> 37
<211> 573
<212> DNA
<213> Artificial
<220>
<223> 37
<400> 37
ttccccacca tccccctgtc tagactgttt gataacgcca tgctgcgggc ccacagactc 60
catcagctgg ccttcgacac ctaccaggag ttcgaggaag cctacatccc taaggaacaa 120
aaatactcct tcctgcagaa cccccaaaca agcctgtgct tcagcgagtc catccctacc 180
ccttctaaca gagaggaaac acagcagaaa tctaatctgg aactgctgag gatcagcctg 240
ctgctgatcc agagctggct ggaacctgtg caatttctgc ggagcgtgtt tgccaacagc 300
ctggtgtatg gagccagcga cagcaatgtg tacgacctgc tgaaagatct ggaagaggga 360
atccagaccc tgatgggcag actggaagat ggcagcccta gaaccggcca gatcttcaag 420
cagacctaca gcaagttcga tacaaacagc cacaacgacg acgccctgct gaagaactac 480
ggcctgctgt actgttttag aaaggacatg gacaaggtgg agacattcct ccggatcgtg 540
cagtgcagat ctgtggaggg cagctgcggc ttc 573
<210> 38
<211> 573
<212> DNA
<213> Artificial
<220>
<223> 38
<400> 38
ttccccacga tccctctgtc tagactgttc gacaacgcca tgctgcgggc ccacagactg 60
caccagctgg ccttcgacac ctaccaggag ttcgaggagg cctacatccc caaagagcaa 120
aaatacagct tcctgcagaa cccccagaca tctttgtgct ttagcgagag catccctacc 180
cctagcaata gagaagagac acagcagaag agcaacctgg aactgctgcg gatcagcctg 240
ctgctgatcc agagctggct ggaacccgtg caatttctgc gcagcgtctt cgccaacagc 300
ctggtgtacg gcgcctctga tagcaacgtg tacgacctgc tgaaagatct ggaagagggc 360
atccagaccc tgatgggaag actggaggac ggctctccaa gaacaggcca aatcttcaag 420
cagacctaca gcaaattcga tacaaacagc cacaacgacg acgccctgct gaagaactac 480
ggcctgctgt actgcttcag aaaggacatg gacaaggtgg aaaccttcct gcggatcgtg 540
cagtgcagat ctgtggaagg cagctgtggc ttc 573
<210> 39
<211> 684
<212> DNA
<213> Artificial
<220>
<223> 39
<400> 39
gagtctaagt acggccctcc ttgtcctcca tgtccagctc cagaagctga aggcggccct 60
tccgtgttcc tgtttcctcc aaagcctaag gacaccctga tgatcagcag aacccctgaa 120
gtgacctgcg tggtggtgga cgtgtcccaa gaggatcctg aggtgcagtt caattggtac 180
gtggacggcg tggaagtgca caacgccaag accaagccta gagaggaaca gttcaacagc 240
acctatagag tggtgtccgt gctgaccgtg ctgcaccagg attggctgaa cggcaaagag 300
tacaagtgca aggtgtccaa caagggcctg cctagcagca tcgagaaaac catcagcaag 360
gccaagggcc agccaaggga accccaggtt tacacactgc ctccaagcca agaggaaatg 420
accaagaacc aggtgtccct gacctgcctg gtcaagggct tctacccttc cgatatcgcc 480
gtggaatggg agagcaatgg ccagcctgag aacaactaca agaccacacc tcctgtgctg 540
gacagcgacg gctcattctt cctgtacagc aagctgacag tggacaagag ccggtggcaa 600
gagggcaacg tgttcagctg tagcgtgatg cacgaggccc tgcacaacca ctacacccag 660
aagtctctga gcctgagcct ggga 684
<210> 40
<211> 684
<212> DNA
<213> Artificial
<220>
<223> 40
<400> 40
gagtctaagt acggacctcc ttgtcctcca tgtccagctc cagaagctga aggcggccct 60
tccgtgttcc tgtttcctcc aaagcctaag gacaccctga tgatcagcag aacccctgaa 120
gtgacctgcg tggtggtgga cgtgtcccaa gaggatcctg aggtgcagtt caattggtac 180
gtggacggcg tggaagtgca caacgccaag accaagccta gagaggaaca gttcaacagc 240
acctatagag tggtgtccgt gctgaccgtg ctgcaccagg attggctgaa cggcaaagag 300
tacaagtgca aggtgtccaa caagggcctg cctagcagca tcgagaaaac catcagcaag 360
gccaagggcc agccaaggga accccaggtt tacacactgc ctccaagcca agaggaaatg 420
accaagaacc aggtgtccct gacctgcctg gtcaagggct tctacccttc cgatatcgcc 480
gtggaatggg agagcaatgg ccagcctgag aacaactaca agaccacacc tcctgtgctg 540
gacagcgacg gctcattctt cctgtacagc aagctgacag tggacaagag ccggtggcaa 600
gagggcaacg tgttcagctg tagcgtgatg cacgaggccc tgcacaacca ctacacccag 660
aagtctctga gcctgagcct ggga 684
<210> 41
<211> 684
<212> DNA
<213> Artificial
<220>
<223> 41
<400> 41
gaatccaaat acggcccccc ctgtccccct tgccctgccc ccgaggctga aggcgggcct 60
agcgtgttcc tgttcccccc caagcccaag gacaccctga tgatcagcag aacccccgag 120
gtgacctgcg tggtcgtgga cgtgagccaa gaggaccccg aggtgcagtt caactggtac 180
gtggacggcg tggaggtgca caacgccaag accaagccta gagaggagca gttcaacagc 240
acctacagag tggtgagcgt gctgaccgtg ctgcaccaag actggctgaa cggcaaggag 300
tacaagtgca aggtgagcaa caagggcctg cctagcagca tcgagaagac catcagcaag 360
gccaaggggc agcctagaga gccccaagtg tacaccctgc cccctagcca agaggagatg 420
accaagaacc aagtgagcct gacctgcctg gtgaagggct tctaccctag cgacatcgcc 480
gtggagtggg agagcaacgg gcagcccgag aacaactaca agaccacccc ccccgtgctg 540
gacagcgacg gcagcttctt cctgtacagc aagctgaccg tggacaagag cagatggcaa 600
gagggcaacg tgttcagctg cagcgtgatg cacgaggccc tgcacaacca ctacacacag 660
aagagcctga gcctgagcct gggc 684
<210> 42
<211> 684
<212> DNA
<213> Artificial
<220>
<223> 42
<400> 42
gagtctaagt acggccctcc ttctccacca tgtcctgctc cagaagctga aggcggccct 60
tccgtgttcc tgtttcctcc aaagcctaag gacaccctga tgatcagcag aacccctgaa 120
gtgacctgcg tggtggtgga cgtgtcccaa gaggatcctg aggtgcagtt caattggtac 180
gtggacggcg tggaagtgca caacgccaag accaagccta gagaggaaca gttcaacagc 240
acctatagag tggtgtccgt gctgaccgtg ctgcaccagg attggctgaa cggcaaagag 300
tacaagtgca aggtgtccaa caagggcctg cctagcagca tcgagaaaac catcagcaag 360
gccaagggcc agccaaggga accccaggtt tacacactgc ctccaagcca agaggaaatg 420
accaagaacc aggtgtccct gacctgcctg gtcaagggct tctacccttc cgatatcgcc 480
gtggaatggg agagcaatgg ccagcctgag aacaactaca agaccacacc tcctgtgctg 540
gacagcgacg gctcattctt cctgtacagc aagctgacag tggacaagag ccggtggcaa 600
gagggcaacg tgttcagctg tagcgtgatg cacgaggccc tgcacaacca ctacacccag 660
aagtctctga gcctgagcct ggga 684
<210> 43
<211> 684
<212> DNA
<213> Artificial
<220>
<223> 43
<400> 43
gagtctaagt acggacctcc ttctccacca tgtcctgctc cagaagctga aggcggccct 60
tccgtgttcc tgtttcctcc aaagcctaag gacaccctga tgatcagcag aacccctgaa 120
gtgacctgcg tggtggtgga cgtgtcccaa gaggatcctg aggtgcagtt caattggtac 180
gtggacggcg tggaagtgca caacgccaag accaagccta gagaggaaca gttcaacagc 240
acctatagag tggtgtccgt gctgaccgtg ctgcaccagg attggctgaa cggcaaagag 300
tacaagtgca aggtgtccaa caagggcctg cctagcagca tcgagaaaac catcagcaag 360
gccaagggcc agccaaggga accccaggtt tacacactgc ctccaagcca agaggaaatg 420
accaagaacc aggtgtccct gacctgcctg gtcaagggct tctacccttc cgatatcgcc 480
gtggaatggg agagcaatgg ccagcctgag aacaactaca agaccacacc tcctgtgctg 540
gacagcgacg gctcattctt cctgtacagc aagctgacag tggacaagag ccggtggcaa 600
gagggcaacg tgttcagctg tagcgtgatg cacgaggccc tgcacaacca ctacacccag 660
aagtctctga gcctgagcct ggga 684
<210> 44
<211> 684
<212> DNA
<213> Artificial
<220>
<223> 44
<400> 44
gagtctaagt acggccctcc ttctccacca tgtcctgctc cagaagctga aggcggccct 60
tccgtgttcc tgtttcctcc aaagcctaag gacaccctga tgatcagcag aacccctgaa 120
gtgacctgcg tggtggtgga cgtgtcccaa gaggatcctg aggtgcagtt caattggtac 180
gtggacggcg tggaagtgca caacgccaag accaagccta gagaggaaca gttcaacagc 240
acctatagag tggtgtccgt gctgaccgtg ctgcaccagg attggctgaa cggcaaagag 300
tacaagtgca aggtgtccaa caagggcctg cctagcagca tcgagaaaac catcagcaag 360
gccaagggcc agccaaggga accccaggtt tacacactgc ctccaagcca agaggaaatg 420
accaagaacc aggtgtccct gacctgcctg gtcaagggct tctacccttc cgatatcgcc 480
gtggaatggg agagcaatgg ccagcctgag aacaactaca agaccacacc tcctgtgctg 540
gacagcgacg gctcattctt cctgtacagc aagctgacag tggacaagag ccggtggcaa 600
gagggcaacg tgttcagctg tagcgtgatg cacgaggccc tgcacaacca ctacacccag 660
aagtctctga gcctgagcct ggga 684
<210> 45
<211> 684
<212> DNA
<213> Artificial
<220>
<223> 45
<400> 45
gagtccaagt atggcccccc ctgccctcct tgtcctgctc cccccgtgga ggggggccct 60
agcgtgttcc tgttccctcc caagcccaag gacaccctga tgatcagcag aacccccgag 120
gtgacctgcg tggtcgtggc cgtgagccaa gaggaccccg aggtgcagtt caactggtac 180
gtggacggcg tggaggtgca caacgccaag accaagccta gagaggagca gttcaacagc 240
acctacagag tggtgagcgt gctgaccgtg ctgcaccaag actggctgaa cggcaaggag 300
tacaagtgca aggtgagcaa caagggcctg cctagcagca tcgagaagac catcagcaag 360
gccaaggggc agcctagaga gccccaagtg tacaccctgc cccctagcca agaggagatg 420
accaagaacc aagtgagcct gacctgcctg gtgaagggct tctaccctag cgacatcgcc 480
gtggagtggg agagcaacgg gcagcccgag aacaactaca agaccacccc ccccgtgctg 540
gacagcgacg gcagcttctt cctgtacagc aagctgaccg tggacaagag cagatggcaa 600
gagggcaacg tgttcagctg cagcgtgatg cacgaggccc tgcacaacca ctacacacag 660
aagagcctga gcctgagcct gggc 684
<210> 46
<211> 684
<212> DNA
<213> Artificial
<220>
<223> 46
<400> 46
gagagcaagt acgggccccc ctgccctcct tgtcctgctc cccccgtgga ggggggccct 60
agcgtgttcc tgttcccccc caagcccaag gacaccctga tgatcagcag aacccccgag 120
gtgacctgcg tggtcgtggc cgtgagccaa gaggaccccg aggtgcagtt caactggtac 180
gtggacggcg tggaggtgca caacgccaag accaagccta gagaggagca gttcaacagc 240
acctacagag tggtgagcgt gctgaccgtg ctgcaccaag actggctgaa cggcaaggag 300
tacaagtgca aggtgagcaa caagggcctg cctagcagca tcgagaagac catcagcaag 360
gccaaggggc agcctagaga gccccaagtg tacaccctgc cccctagcca agaggagatg 420
accaagaacc aagtgagcct gacctgcctg gtgaagggct tctaccctag cgacatcgcc 480
gtggagtggg agagcaacgg gcagcccgag aacaactaca agaccacccc ccccgtgctg 540
gacagcgacg gcagcttctt cctgtacagc aagctgaccg tggacaagag cagatggcaa 600
gagggcaacg tgttcagctg cagcgtgatg cacgaggccc tgcacaacca ctacacacag 660
aagagcctga gcctgagcct gggc 684
<210> 47
<211> 684
<212> DNA
<213> Artificial
<220>
<223> 47
<400> 47
gagagcaagt acggcccccc ttgtcctcct tgccccgccc ctcccgtgga aggcggacct 60
agtgtgttcc tcttccctcc aaaacctaag gataccctga tgatcagccg gacacctgag 120
gttacatgcg tggtcgtggc tgtgagccag gaagatcctg aggtgcagtt caactggtac 180
gtggacggcg tggaagtgca taatgctaag accaagcctc gggaagagca gtttaactcc 240
acctatagag tggtgtccgt tctgaccgtg ctgcaccagg actggctgaa cggcaaggaa 300
tacaagtgca aggtgtccaa caagggcctg cccagttcta ttgagaagac aattagcaag 360
gccaagggcc agcctagaga gcctcaggtg tacaccctgc ctcccagcca ggaggaaatg 420
accaagaacc aggtgtctct cacttgtctg gtgaagggct tctaccccag cgacatcgcc 480
gtggagtggg agtctaatgg ccaacctgag aacaactaca agaccacacc tccagtgctg 540
gacagcgatg gctctttttt cctgtactcc aagctgacag tggacaagtc caggtggcag 600
gagggaaatg tgttcagctg cagcgtgatg cacgaggctc tgcacaatca ctatacccag 660
aaaagcctct ctctgagcct gggc 684
<210> 48
<211> 684
<212> DNA
<213> Artificial
<220>
<223> 48
<400> 48
gagtccaagt atggcccccc tagccctcct tgtcctgctc cccccgtgga ggggggccct 60
agcgtgttcc tgttccctcc caagcccaag gacaccctga tgatcagcag aacccccgag 120
gtgacctgcg tggtcgtggc cgtgagccaa gaggaccccg aggtgcagtt caactggtac 180
gtggacggcg tggaggtgca caacgccaag accaagccta gagaggagca gttcaacagc 240
acctacagag tggtgagcgt gctgaccgtg ctgcaccaag actggctgaa cggcaaggag 300
tacaagtgca aggtgagcaa caagggcctg cctagcagca tcgagaagac catcagcaag 360
gccaaggggc agcctagaga gccccaagtg tacaccctgc cccctagcca agaggagatg 420
accaagaacc aagtgagcct gacctgcctg gtgaagggct tctaccctag cgacatcgcc 480
gtggagtggg agagcaacgg gcagcccgag aacaactaca agaccacccc ccccgtgctg 540
gacagcgacg gcagcttctt cctgtacagc aagctgaccg tggacaagag cagatggcaa 600
gagggcaacg tgttcagctg cagcgtgatg cacgaggccc tgcacaacca ctacacacag 660
aagagcctga gcctgagcct gggc 684
<210> 49
<211> 684
<212> DNA
<213> Artificial
<220>
<223> 49
<400> 49
gagagcaagt acgggccccc tagccctcct tgtcctgctc cccccgtgga ggggggccct 60
agcgtgttcc tgttcccccc caagcccaag gacaccctga tgatcagcag aacccccgag 120
gtgacctgcg tggtcgtggc cgtgagccaa gaggaccccg aggtgcagtt caactggtac 180
gtggacggcg tggaggtgca caacgccaag accaagccta gagaggagca gttcaacagc 240
acctacagag tggtgagcgt gctgaccgtg ctgcaccaag actggctgaa cggcaaggag 300
tacaagtgca aggtgagcaa caagggcctg cctagcagca tcgagaagac catcagcaag 360
gccaaggggc agcctagaga gccccaagtg tacaccctgc cccctagcca agaggagatg 420
accaagaacc aagtgagcct gacctgcctg gtgaagggct tctaccctag cgacatcgcc 480
gtggagtggg agagcaacgg gcagcccgag aacaactaca agaccacccc ccccgtgctg 540
gacagcgacg gcagcttctt cctgtacagc aagctgaccg tggacaagag cagatggcaa 600
gagggcaacg tgttcagctg cagcgtgatg cacgaggccc tgcacaacca ctacacacag 660
aagagcctga gcctgagcct gggc 684
<210> 50
<211> 684
<212> DNA
<213> Artificial
<220>
<223> 50
<400> 50
gagtccaagt acggcccccc cagccctcca tgtcctgccc ctcctgtgga aggcggacct 60
tctgttttcc tgttcccacc aaaacctaag gacaccctga tgatcagcag aacacctgag 120
gtgacctgcg tggtcgtggc cgtaagccag gaggaccccg aagtgcagtt caactggtac 180
gtggacggcg tggaggtgca caacgctaag accaagcctc gggaagagca gtttaacagc 240
acctacagag tggtgtctgt gctgaccgtg ctgcaccagg actggctgaa cggcaaagag 300
tacaagtgca aggtgtccaa caagggactg cccagcagca tcgagaagac gattagcaag 360
gccaagggcc aacctagaga gcctcaggtg tacaccctgc ctccttctca ggaggagatg 420
accaaaaacc aggtgtccct gacctgtctg gtgaagggct tctatccctc tgacatcgcc 480
gtcgagtggg aaagcaatgg ccagcctgaa aacaactaca agacaacacc cccagtgctt 540
gatagcgacg gcagcttctt cctgtactcc aagctgacag tggataagag ccggtggcag 600
gaaggaaacg tgttctcatg cagcgtgatg cacgaggctc tgcacaatca ctacacccag 660
aagagcctgt cactgagcct gggc 684
<210> 51
<211> 217
<212> PRT
<213> Artificial
<220>
<223> 51
<400> 51
Ala Pro Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys
1 5 10 15
Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val
20 25 30
Val Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr
35 40 45
Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu
50 55 60
Gln Phe Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His
65 70 75 80
Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys
85 90 95
Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln
100 105 110
Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met
115 120 125
Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro
130 135 140
Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn
145 150 155 160
Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu
165 170 175
Tyr Ser Arg Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val
180 185 190
Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gln
195 200 205
Lys Ser Leu Ser Leu Ser Leu Gly Lys
210 215
<210> 52
<211> 12
<212> PRT
<213> Artificial
<220>
<223> 52
<400> 52
Glu Ser Lys Tyr Gly Pro Pro Cys Pro Ser Cys Pro
1 5 10
<210> 53
<211> 45
<212> DNA
<213> Artificial
<220>
<223> 53
<400> 53
ggcggggggg gcagcggcgg gggcggcagc gggggcgggg gcagc 45
<210> 54
<211> 60
<212> DNA
<213> Artificial
<220>
<223> 54
<400> 54
gggggggggg ggagcggggg gggcgggagc ggcgggggcg ggagcggcgg gggcggcagc 60
<210> 55
<211> 75
<212> DNA
<213> Artificial
<220>
<223> 55
<400> 55
ggcggcggag gcagcggcgg aggcggctct ggcggcggcg gcagcggcgg cggaggaagc 60
ggaggaggcg gttct 75
<210> 56
<211> 436
<212> PRT
<213> Artificial
<220>
<223> 56
<400> 56
Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15
Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30
Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45
Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60
Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80
Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95
Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110
Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125
Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140
Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160
Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175
Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe Arg
180 185 190
Asn Thr Gly Arg Gly Gly Glu Glu Lys Lys Lys Glu Lys Glu Lys Glu
195 200 205
Glu Gln Glu Glu Arg Glu Thr Lys Thr Pro Glu Cys Pro Ser His Thr
210 215 220
Gln Pro Leu Gly Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu
225 230 235 240
Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser
245 250 255
Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val Asp Gly Val Glu
260 265 270
Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Phe Asn Ser Thr
275 280 285
Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn
290 295 300
Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser
305 310 315 320
Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln
325 330 335
Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr Lys Asn Gln Val
340 345 350
Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val
355 360 365
Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro
370 375 380
Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Arg Leu Thr
385 390 395 400
Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe Ser Cys Ser Val
405 410 415
Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu
420 425 430
Ser Leu Gly Lys
435

Claims (10)

1.一种Fc突变体,其特征在于,包括:
第一肽段,所述第一肽段相较于野生型IgG4的Fc片段,具有如下位点的突变:
第228位、第235位、第409位、第447位和第234位;
任选地,所述第一肽段相较于野生型IgG4的Fc片段,进一步包括具有如下位点突变的至少之一:
第226位、第233位和第265位;
任选地,所述第一肽段相较于野生型IgG4的Fc片段,具有如下位点的突变:
1)第228位、第234位、第235位、第409位和第447位;或
2)第226位、第228位、第234位、第235位、第409位和第447位;或
3)第228位、第233位、第234位、第235位、第265位、第409位和第447位;或
4)第226位、第228位、第233位、第234位、第235位、第265位、第409位和第447位;
优选地,所述第一肽段相较于野生型IgG4的Fc片段,具有如下突变:
1)S228P、F234A、L235E、R409K和K447 Delete;或
2)C226S、S228P、L235E、R409K、K447 Delete和F234A或F234V;或
3)S228P、E233P、L235E、D265A、R409K、K447 Delete和F234A或F234V;或
4)C226S、S228P、E233P、L235E、D265A、R409K、K447 Delete和F234A或F234V;
任选地,进一步包括野生型IgG4的铰链区片段,所述野生型IgG4的铰链区片段的C端与所述第一肽段的N端相连;
任选地,所述野生型IgG4的铰链区片段具有SEQ ID NO:52所示的氨基酸序列;
任选地,所述Fc突变体具有如SEQ ID NO:2~5任一项所示的氨基酸序列。
2.一种融合蛋白,其特征在于,包含:
第二肽段,所述第二肽段包括生物活性分子功能区;
第三肽段,所述第三肽段包括Fc突变体,所述Fc突变体具有如权利要求1所限定的Fc突变体,所述第二肽段与所述第三肽段相连。
3.根据权利要求2所述的融合蛋白,其特征在于,所述第二肽段包含生长激素、生长激素类似物、生长激素功能区或生长激素类似物功能区,优选为人生长激素或人生长激素功能区;
任选地,所述人生长激素具有如SEQ ID NO:1或与其具有至少90%同一性的氨基酸序列;
任选地,所述第二肽段的C端与所述第三肽段的N端相连。
4.根据权利要求2所述的融合蛋白,其特征在于,进一步包括连接肽,所述连接肽设置于所述第二肽段和第三肽段之间;
任选地,所述连接肽的N端与所述第二肽段的C端相连,所述连接肽的C端与所述第三肽段的N端相连;
任选地,所述连接肽的氨基酸序列为(GGGGS)n,其中n为大于或等于1的整数,优选为5、6、7、8、9或10,更优选为5;
任选地,所述连接肽具有如SEQ ID NO:6~8任一项所示的氨基酸序列。
5.根据权利要求2~4任一项所述的融合蛋白,其特征在于,所述融合蛋白包括第三肽段以及连接肽,所述第三肽段包括Fc突变体,所述Fc突变体的第一肽段与野生型lgG4的Fc片段相比具有C226S、S228P、F234A、L235E、R409K和K447 Delete位点突变,所述连接肽的序列为(GGGGS)5
任选地,所述融合蛋白包括第三肽段以及连接肽,所述第三肽段包括Fc突变体,所述Fc突变体的第一肽段与野生型lgG4的Fc片段相比具有S228P、E233P、F234V、L235E、D265A、R409K和K447 Delete位点突变,所述连接肽的序列为(GGGGS)5
任选地,所述融合蛋白包括第三肽段以及连接肽,所述第三肽段包括Fc突变体,所述Fc突变体的第一肽段与野生型lgG4的Fc片段相比具有C226S、S228P、E233P、F234V、L235E、D265A、R409K和K447 Delete位点突变,所述连接肽的序列为(GGGGS)n,其中n为3、4或5;
任选地,所述连接肽的N端与所述第二肽段的C端相连,所述连接肽的C端与所述第三肽段的N端相连;
任选地,所述融合蛋白具有SEQ ID NO:21~32任一项所示的氨基酸序列。
6.一种核酸分子,其特征在于,所述核酸分子编码权利要求1所述的Fc突变体或权利要求2~5任一项所述的融合蛋白;
任选地,所述核酸分子为DNA。
7.一种表达载体,其特征在于,携带权利要求6所述的核酸分子;
任选地,所述表达载体为真核表达载体;
优选地,所述表达载体为慢病毒载体。
8.一种重组细胞,其特征在于,所述重组细胞包括:
携带权利要求6所述的核酸分子;或,
表达权利要求1所述的Fc突变体或权利要求2~5任一项所述的融合蛋白。
任选地,所述重组细胞是通过将权利要求7所述的表达载体引入至宿主细胞中而获得的;
任选地,所述重组细胞为真核细胞;
优选地,所述重组细胞为哺乳动物细胞。
9.一种药物组合物,其特征在于,包括:
权利要求2~5任一项所述的融合蛋白;
任选地,所述药物组合物进一步包括药学上可接受的辅料;
任选地,所述药物组合物的剂型为注射剂;
任选地,所述药物组合物的给药途径包括皮下注射或静脉注射。
10.权利要求2~5任一项所述的融合蛋白、权利要求6所述的核酸分子、权利要求7所述的表达载体、权利要求8所述的重组细胞、权利要求9所述的药物组合物在制备药物中的用途,所述药物用于治疗或者预防生长激素异常相关疾病;
任选地,所述生长激素异常相关疾病至少包括选自下列之一:
儿童生长激素缺乏症、特发性体型矮小、成人生长激素缺乏、特纳氏综合征、普拉德 威利综合征、肾衰竭、化疗治疗和AIDS治疗期间的异化状态引起的疾病、子宫内生长迟缓。
CN202210395418.1A 2021-11-26 2022-04-14 人生长激素融合蛋白及其制备和用途 Pending CN115873126A (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2022/100113 WO2023093020A1 (en) 2021-11-26 2022-06-21 Human growth hormone fusion protein and its use thereof

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2021114206419 2021-11-26
CN202111420641 2021-11-26

Publications (1)

Publication Number Publication Date
CN115873126A true CN115873126A (zh) 2023-03-31

Family

ID=85769344

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210395418.1A Pending CN115873126A (zh) 2021-11-26 2022-04-14 人生长激素融合蛋白及其制备和用途

Country Status (2)

Country Link
CN (1) CN115873126A (zh)
WO (1) WO2023093020A1 (zh)

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012132067A1 (ja) * 2011-03-30 2012-10-04 中外製薬株式会社 抗原結合分子の血漿中滞留性と免疫原性を改変する方法
CN105229035A (zh) * 2013-03-11 2016-01-06 诺和诺德保健股份有限公司 生长激素化合物
CA3049689A1 (en) * 2017-02-06 2018-08-09 Dana-Farber Cancer Institute, Inc. Compositions and methods for augmenting antibody mediated receptor signaling
CN108794634A (zh) * 2017-05-03 2018-11-13 上海兴迪金生物技术有限公司 重组的长效人生长激素融合蛋白及其制备和用途
CN110256575B (zh) * 2019-06-11 2020-05-12 长春生物制品研究所有限责任公司 一种长效重组人生长激素融合蛋白及其工程细胞
CA3142513A1 (en) * 2019-06-25 2020-12-30 Gilead Sciences, Inc. Flt3l-fc fusion proteins and methods of use
CN112574315B (zh) * 2019-09-30 2023-04-07 广东菲鹏制药股份有限公司 改变效应功能的Fc变体及其融合蛋白

Also Published As

Publication number Publication date
WO2023093020A1 (en) 2023-06-01

Similar Documents

Publication Publication Date Title
JP4035400B2 (ja) 生体内エリスロポエチン活性が増進した融合蛋白質
AU2013274638A1 (en) Fibroblast growth factor 21 proteins
WO2014031420A1 (en) Homodimeric proteins
TWI237056B (en) Fusion protein having enhanced in vivo activity of erythropoietin
TW201107471A (en) Polypeptides selective for αvβ3 integrin conjugated with a variant of human serum albumin (HSA) and pharmaceutical uses thereof
JP2022512843A (ja) スフィンゴシンキナーゼ1とその融合タンパク質およびその使用
WO2016011935A1 (zh) 人fgfr2c胞外段蛋白质及其编码基因与应用
US20220356455A1 (en) Solubilized apyrases, methods and use
CN106008717B (zh) 一种长效重组glp-1融合蛋白及其制备方法和用途
KR20160113268A (ko) 이기능 융합단백질,이의 제조방법 및 용도
CN115873126A (zh) 人生长激素融合蛋白及其制备和用途
WO2022117044A1 (en) Glp-1/gcg dual receptor agonist polypeptide and fusion protein thereof
CN114874333A (zh) 一种生长激素融合蛋白及其应用
WO2013013626A1 (zh) 重组人神经生长因子缺失突变体及其制备方法和用途
CN113150172B (zh) Glp-1r/gipr双靶点激动剂融合蛋白及其制备方法与应用
US20220144903A1 (en) Recombinant ccn domain proteins and fusion proteins
CN109320597B (zh) 狐亚科激活素a蛋白及其制备与应用
CN112028985A (zh) 基因工程生长因子变异体
CN115873127A (zh) 重组长效人生长激素融合蛋白及其制备方法和用途
KR20180003677A (ko) 인간 성장호르몬 변이 단백질 또는 이의 트랜스페린 융합 단백질을 유효성분으로 포함하는 약학적 조성물
CN114805562B (zh) 抗新型冠状病毒人源化纳米抗体及其应用
US20230091105A1 (en) Chimeric fusions between c4-binding protein c-terminal segment and angiopoietin-1 fibrinogen-like domain as angiopoietin mimetics and tie2 agonists to treat vascular diseases
EP1323736A1 (en) Chimeric human-type vascular endothelial cell growth factor
CA3168534A1 (en) Method of enhancing aqueous humor outflow and reducing intraocular pressure
CN115819611A (zh) 生长激素融合蛋白及其制备方法和用途

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination