CN113736801A - mRNA及包含其的新冠病毒mRNA疫苗 - Google Patents

mRNA及包含其的新冠病毒mRNA疫苗 Download PDF

Info

Publication number
CN113736801A
CN113736801A CN202010470599.0A CN202010470599A CN113736801A CN 113736801 A CN113736801 A CN 113736801A CN 202010470599 A CN202010470599 A CN 202010470599A CN 113736801 A CN113736801 A CN 113736801A
Authority
CN
China
Prior art keywords
mrna
protein
seq
sequence
virus
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010470599.0A
Other languages
English (en)
Other versions
CN113736801B (zh
Inventor
王冰
俞航
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Lanque Biomedical Co ltd
Original Assignee
Shanghai Lanque Biomedical Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Lanque Biomedical Co ltd filed Critical Shanghai Lanque Biomedical Co ltd
Priority to CN202010470599.0A priority Critical patent/CN113736801B/zh
Publication of CN113736801A publication Critical patent/CN113736801A/zh
Application granted granted Critical
Publication of CN113736801B publication Critical patent/CN113736801B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/005Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • A61K39/12Viral antigens
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K47/00Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
    • A61K47/06Organic compounds, e.g. natural or synthetic hydrocarbons, polyolefins, mineral oil, petrolatum or ozokerite
    • A61K47/16Organic compounds, e.g. natural or synthetic hydrocarbons, polyolefins, mineral oil, petrolatum or ozokerite containing nitrogen, e.g. nitro-, nitroso-, azo-compounds, nitriles, cyanates
    • A61K47/18Amines; Amides; Ureas; Quaternary ammonium compounds; Amino acids; Oligopeptides having up to five amino acids
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K47/00Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
    • A61K47/06Organic compounds, e.g. natural or synthetic hydrocarbons, polyolefins, mineral oil, petrolatum or ozokerite
    • A61K47/24Organic compounds, e.g. natural or synthetic hydrocarbons, polyolefins, mineral oil, petrolatum or ozokerite containing atoms other than carbon, hydrogen, oxygen, halogen, nitrogen or sulfur, e.g. cyclomethicone or phospholipids
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K47/00Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
    • A61K47/06Organic compounds, e.g. natural or synthetic hydrocarbons, polyolefins, mineral oil, petrolatum or ozokerite
    • A61K47/28Steroids, e.g. cholesterol, bile acids or glycyrrhetinic acid
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K9/00Medicinal preparations characterised by special physical form
    • A61K9/10Dispersions; Emulsions
    • A61K9/127Liposomes
    • A61K9/1271Non-conventional liposomes, e.g. PEGylated liposomes, liposomes coated with polymers
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P31/00Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
    • A61P31/12Antivirals
    • A61P31/14Antivirals for RNA viruses
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • A61K2039/51Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
    • A61K2039/525Virus
    • A61K2039/5258Virus-like particles
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • A61K2039/51Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
    • A61K2039/53DNA (RNA) vaccination
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2770/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
    • C12N2770/00011Details
    • C12N2770/20011Coronaviridae
    • C12N2770/20022New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2770/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
    • C12N2770/00011Details
    • C12N2770/20011Coronaviridae
    • C12N2770/20023Virus like particles [VLP]
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2770/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
    • C12N2770/00011Details
    • C12N2770/20011Coronaviridae
    • C12N2770/20034Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein

Abstract

本发明提供了mRNA,其包含编码来源于SARS‑CoV‑2病毒的S蛋白、E蛋白、M蛋白和N蛋白中的一种、两种、三种或四种蛋白或其片段的mRNA,编码S蛋白的mRNA的序列如SEQ ID NO.18、SEQ ID NO.19或SEQ ID NO.20所示;编码E蛋白的mRNA的序列如SEQ ID NO.21所示;编码M蛋白的mRNA的序列如SEQ ID NO.22所示;编码N蛋白的mRNA的序列如SEQ ID NO.23所示。还提供了一种包含所述mRNA的脂质体纳米颗粒、一种针对新冠病毒的mRNA疫苗等。本发明的mRNA在细胞水平高效产生病毒蛋白,或由产生的蛋白自组装成病毒样颗粒。将包含本发明的mRNA制备成疫苗时,安全性高、有效性好、不会产生非中和抗体而不会产生抗体依赖增强感染效应。

Description

mRNA及包含其的新冠病毒mRNA疫苗
技术领域
本发明涉及一种mRNA及包含其的新冠病毒mRNA疫苗,本发明还涉及所述mRNA 及包含其的新冠病毒mRNA疫苗、脂质体纳米颗粒、药物组合物和试剂盒等。
背景技术
近年来,基于体外转录(IVT)的信使RNA(mRNA)的治疗正显示出巨大的潜力。 它的原理是将体外制备的mRNA包裹成药物递送到体内组织被细胞内吞,外源mRNA到 达细胞后被核糖体识别并根据其编码信息合成相应的蛋白质。早在1990年Wolff等证明 注射到小鼠体内的mRNA能被翻译蛋白质[7]。1992年Jirikowski等显示注射到下丘脑部 位的vasopressin mRNA缓解了老鼠的尿崩症状[8]。mRNA药物在理论上有诸多优势:与 DNA疗法相比,mRNA不需要入核,无基因组整合的插入突变风险;相对于蛋白质药物, mRNA利用细胞自身的翻译系统,可实现高效和剂量依赖性的活性蛋白表达,解决了一 些蛋白的不可成药性问题。然而mRNA一直受到体外制备、稳定性和递送问题的困扰。 直到近几年IVT(invitro transcription,体外转录)技术配合化学和酶学加帽法、修饰核 苷酸的引入、HPLC纯化技术使得mRNA可以在体外进行大规模制备[9,10]。同时脂质 体和脂质纳米颗粒在递送siRNA上获得成功后被证明也能用于mRNA的包裹和递送[11]。 这些技术的突破使得mRNA成药性大大提高,目前有超过25个mRNA药物包括mRNA 疫苗和蛋白替代的临床研究正在展开[12],市场上第一个mRNA产品的竞争已全面展开。 越来越多的研究人员在关注mRNA药物的应用,而中国在这一领域的研究才刚刚起步。
mRNA药物一个最有潜力的应用是疫苗,包括肿瘤疫苗和感染性疾病疫苗。编码抗原蛋白的mRNA分子经体外合成并形成制剂后即可用于人体免疫,这一过程不涉及活病 毒培养的相关操作,大大缩短了研发时间[13]。mRNA疫苗在近几年不断取得突破性进 展,在2013年的一项研究中,研究者针对H7N9流感病毒设计并制备了mRNA疫苗, 在小鼠实验中取得成效[14]。2015年,针对HIV的mRNA疫苗在非人灵长类动物中产生 体液免疫应答。2017年,寨卡病毒的mRNA疫苗在有效的保护了接受病毒攻击的小鼠[15], 并能降低孕期老鼠感染的风险[16]。除了在动物实验上取得成功以外,mRNA疫苗(如 流感疫苗和寨卡疫苗)已经开始了临床试验,Moderna公司的流感病毒mRNA疫苗I期 临床结果显示其免疫原性或优于传统疫苗[17]。同样是该公司的寨卡病毒疫苗 mRNA-1893去年进入了美国FDA的快速通道。IVT mRNA的技术优势能有效应对病毒 的高突变率,使得新爆发流行病疫苗的快速开发成为可能,有望成为提高新发传染病的 防治效率的突破方向。
传统的预防病毒的疫苗有重组蛋白疫苗、灭活疫苗、减毒活疫苗以及体外重组的病 毒样颗粒(virus-like particle,VLP)。在以往的经验中灭活或减毒疫苗因形态和组成与真 实病毒一样,能产生强烈的免疫反应,成为疫苗的首选。但是他们有不可避免的缺点:灭活或减毒疫苗生产周期较长、有些病毒如诺如病毒不能大规模培养、灭活病毒往往不 能诱导免疫反应而减毒疫苗亦有返祖的风险等。而体外重组病毒样颗粒疫苗是由病毒衣 壳蛋白或包膜蛋白自主包装形成的空衣壳结构,能快速刺激机体产生体液免疫和细胞免 疫应答,病毒样颗粒不含有病毒遗传物质和免疫抑制蛋白,目前安全性最高的一种新型 候选疫苗,已经有多种基于VLP的疫苗产品上市[18]。2002年SARS-CoV和2012年 MERS-CoV爆发后,多种疫苗方案被研究,包括灭活或减毒毒株、基于重组DNA的S 蛋白质以及体外重组的病毒样颗粒[19,20]。其中S蛋白质由于是介导病毒入侵的主要蛋 白质,也是中和抗体的主要靶点,受到疫苗开发的特别关注。动物试验表明这些疫苗均 有保护效果,但是安全性还是最大的担忧。例如基于全长S蛋白抗原的疫苗产生了大量 非中和抗体,这些非中和抗体在抗体依赖增强感染效应(antibody-dependent enhancement, ADE)中扮演了重要角色[21],反而加速了疾病进展,造成疫苗安全性存在重大问题。由 于机体在接受mRNA药物后能根据编码信息合成任何蛋白质,mRNA在疫苗抗原的选择 上极其灵活。但是考虑到病毒样颗粒的优势,目前在临床的病毒mRNA疫苗多以病毒样 颗粒为最终抗原展示形式,如寨卡病毒[15]。
mRNA疫苗虽有诸多优势,但大部分还是理论阶段,需要大量的基础和临床研究。有效的能在体内诱导合成病毒样颗粒的mRNA疫苗应满足两个条件,一是表达效率要高, 产生足够剂量的病毒样颗粒刺激机体产生免疫应答;二是产的病毒样颗粒应与真实病毒 从形态和结构组成上一致,使机体获得免疫能力能应对真实病毒。但是由于冠状病毒自 身的特点,研发也面临着很多挑战。冠状病毒为正链单股RNA病毒,它有一层脂双分子 膜形成包膜(envelope)结构,包膜中插入结构蛋白M(membrane)、E(envelope)、和S(spike)。 其中S棘突蛋白质是冠状病毒最重要的表面蛋白,决定了病毒的宿主范围和特异性。S 蛋白质是宿主中和抗体的重要作用位点,因此成为SARS-CoV和MERS-CoV的疫苗设计 中的关键靶点。冠状病毒还有核蛋白N(nucleoprotein)在内层包裹病毒的基因组。除了结 合基因组,N蛋白质对包膜的形态塑成也有作用,因此也被认为是结构蛋白之一。冠状 病毒一个特点是其形态和大小并不是完全固定的,事实上冠状病毒的直径在80-200nm之 间。因此即使是高分辨率冷冻电镜,也无法使用单颗粒分析来获得整个病毒的原子结构。 冠状病毒包膜内结构蛋白质的比例也不固定,取决于病毒在细胞内组装时各个结构蛋白 质的含量。这和寨卡病毒不同,寨卡病毒虽然也是包膜病毒,但是其形态固定,成刚性 二十面体结构,其结构蛋白单一且拷贝数固定,没有棘突结构。因此同样是合成病毒样 颗粒,冠状病毒的mRNA疫苗设计要比寨卡病毒复杂的多。首先,寨卡病毒的mRNA 疫苗只包含了一条mRNA,编码prM-E融合蛋白,而冠状病毒的mRNA疫苗必须是个组 合(cocktail),包含至少3条mRNA编码不同的结构蛋白质。其次,对冠状病毒包膜结 构的装配目前还存在诸多争议。根据对SARS-CoV的研究,M和E共表达已经足够形成 病毒样颗粒,但是没有棘突结构,共表达S与M和E可以引入S蛋白质,使得VLP带 有棘突。但是尽管形成病毒样颗粒,蛋白质组成比例却和真实病毒存在很大差别。另外N蛋白质虽然在内层主要与病毒基因组相互作用,有研究并表明但是它的存在对病毒样 颗粒的表达和分泌有加强作用。目前已有几种新冠病毒疫苗进入临床试验,均以新冠病 毒S蛋白为主要抗原,安全性和有效性还未经证明,仍有失败的风险。因此,急需继续 研发针对新冠病毒的多种抗原策略的新型冠状病毒疫苗。
发明内容
本发明所要解决的技术问题是为了克服现有技术中没有商品化的新冠病毒疫苗等缺 陷,提供了一种mRNA、一种DNA以及包含其的新冠病毒mRNA疫苗、脂质体纳米颗 粒、由其表达产生的病毒样颗粒、药物组合物和试剂盒。本发明的经过密码子优化后或 者进一步经过核苷酸修饰的组装新冠病毒所需的几种蛋白的mRNA,各自单独在细胞中 都可以高表达。并且经过本发明特定比例组成的mRNA,在细胞水平高效产生病毒蛋白, 或由产生的蛋白自组装成病毒样颗粒,能够实现病毒样颗粒的高表达,且大小和形态结 构均极为接近真实病毒,后续用于临床时可以使机体获得免疫能力能应对真实病毒。包 含本发明mRNA的纳米颗粒,多条mRNA同时被脂质纳米颗粒包装的效率/表达效率高 依旧很高,从而能够产生足够剂量的病毒样颗粒刺激机体产生免疫应答,免疫原性和稳 定性较高。将包含本发明的经过密码子优化后或者经过核苷酸修饰的组装新冠病毒所需 的几种蛋白的mRNA制备成疫苗(例如以表达病毒样颗粒的形式、仅表达S蛋白的疫苗 的形式或以仅表达S蛋白中RBD区域的疫苗的形式)时,安全性高、有效性好、不会产 生非中和抗体从而不会产生抗体依赖增强感染效应。
本领域技术人员公知,冠状病毒的形态和大小并非完全固定,其包膜内结构蛋白质 的比例也不固定,因此同样是合成病毒样颗粒,冠状病毒的mRNA疫苗设计要比现有技术中的其他病毒复杂的多。而本发明人通过大量实验和摸索,意外发现对密码子进行特 定优化后,能够实现病毒样颗粒的完整表达。本发明人在实验中还发现,体外转录生成 mRNA的翻译效率和稳定性受其不同的化学修饰(对每种mRNA使用不同的修饰核苷酸 在细胞内的命运大相径庭),5’和3’非翻译区序列(UTRs),5’加帽方式(使用不同的cap0 或cap1类似物)和3’poly(A)尾的长度影响。本发明人经过大量研究,发现通过选用特定 的核苷化学修饰、特定的UTR序列及特定的优化加帽方式,能够进一步实现mRNA转 染细胞半小时后即开始表达较高水平蛋白,并且表达可持续一周。同时本发明人通过大 量实验,最终发现多种修饰核苷酸通过特定的组合能够进一步取得较好的免疫原性和稳 定性。另外,本发明中S蛋白质是长达1273个氨基酸,属于较大蛋白质,结合5’和3’UTR, 最终mRNA总长超过4000nt。本发明人在实验中发现长链mRNA的合成一直是个挑战, 而通过优化编码蛋白质(例如S蛋白质)的mRNA序列,同时优化UTR序列和修饰核 苷酸,进行蛋白质(例如S蛋白质)的表达筛选,还能够克服超长基因mRNA的制备与 纯化的问题。
为了解决上述技术问题,本发明第一方面提供了mRNA,其包含编码来源于 SARS-CoV-2病毒的S蛋白、E蛋白、M蛋白和N蛋白中的一种、两种、三种或四种蛋 白、其片段、变体或衍生物的mRNA,
其中,编码所述S蛋白的mRNA的序列如SEQ ID NO.18、SEQ ID NO.19或SEQ IDNO.20所示;编码所述E蛋白的mRNA的序列如SEQ ID NO.21所示;编码所述M蛋 白的mRNA的序列如SEQ ID NO.22所示;编码所述N蛋白的mRNA的序列如SEQ ID NO.23所示。
较佳地,所述片段为所述S蛋白的RBD结构域的片段,其mRNA的序列优选如SEQ IDNO.37所示。
较佳地,所述mRNA还包含5’-帽结构,优选为3′-O-Me-m7G(5')ppp(5')G、 m7G(5')ppp(5')(2'OMeA)pG或m7(3'OMeG)(5')ppp(5')(2'OMeA)pG。
本发明中,所述3′-O-Me-m7G(5')ppp(5')G的结构一般如下所示:
Figure BDA0002514168830000051
本发明中,所述m7G(5')ppp(5')(2'OMeA)pG的结构一般如下所示:
Figure BDA0002514168830000061
本发明中,所述m7(3‘OMeG)(5’)ppp(5‘)(2’OMeA)pG的结构一般如下所示:
Figure BDA0002514168830000062
较佳地,所述mRNA序列还包含3’-聚腺苷酸,其序列优选包含约25至约400个腺 苷核苷酸的序列,优选约50至约400个腺苷核苷酸的序列,更优选约50至约300个腺 苷核苷酸的序列,进一步更优选约50至约250个腺苷核苷酸的序列,进一步优选约60 至约250个腺苷核苷酸的序列,最优选120个聚腺苷酸组成的序列。
较佳地,所述mRNA序列还包含5’-UTR,所述5’-UTR的序列优选如SEQ ID NO. 15所示。
较佳地,所述mRNA序列还包含3’-UTR,所述3’-UTR的序列优选来源于提供 稳定的mRNA的基因的3’UTR或来源于其同源物、片段或变体,更优选如SEQ ID NO. 16或SEQ IDNO.17所示。
较佳地,所述mRNA序列还包含多核苷酸修饰,所述多核苷酸优选5-methyl-CTP、pseudo-UTP、N1-Methylpseudo-UTP和5-Methoxy-UTP中的一种或多种。本发明中,所 述的5-methyl-CTP可以购于ApexBio,#B7967。所述的pseudo-UTP可以购于ApexBio, #B7972。所述的N1-Methylpseudo-UTP可以购于ApexBio,#B8049。所述的5-Methoxy-UTP 可以购于ApexBio,#B8061。
更佳地,所述N蛋白的mRNA包括5-methyl-CTP、pseudo-UTP、N1-Methylpseudo-UTP或5-Methoxy-UTP的修饰,或、包括5-methyl-CTP和pseudo-UTP共同的修饰。
更佳地,所述E蛋白的mRNA包括5-methyl-CTP、pseudo-UTP或 N1-Methylpseudo-UTP的修饰。
更佳地,当编码所述S蛋白的mRNA的序列如SEQ ID NO.18所示时,所述S蛋白 的mRNA包括5-methyl-CTP、pseudo-UTP或N1-Methylpseudo-UTP的修饰,或包括 5-methyl-CTP和pseudo-UTP的修饰,优选包括pseudo-UTP或N1-Methylpseudo-UTP的 修饰。
更佳地,当编码所述S蛋白的mRNA的序列如SEQ ID NO.19所示时,所述S蛋白 的mRNA包括pseudo-UTP的修饰,或包括5-methyl-CTP和pseudo-UTP的修饰。
更佳地,当编码所述S蛋白的mRNA的序列如SEQ ID NO.20所示时,所述S蛋白 的mRNA包括pseudo-UTP或N1-Methylpseudo-UTP的修饰。
较佳地,所述mRNA包含编码来源于SARS-CoV-2病毒的S蛋白、E蛋白和M蛋白 的mRNA,所述S蛋白、E蛋白、M蛋白由三条独立的mRNA分别表达得到,表达所述 S蛋白、E蛋白、M蛋白的mRNA的摩尔比优选为1:(2~0.5):(2~0.5),例如为1:1:1。
较佳地,所述mRNA包含编码来源于SARS-CoV-2病毒的M蛋白和E蛋白的mRNA, 所述M蛋白和E蛋白的mRNA优选连接后进行表达,所述连接优选通过编码2A肽段的 mRNA的序列(经过蛋白表达后,所得的2A肽经过“自剪切”,最后能够得到独立的M 蛋白和E蛋白)进行连接。其中,所述2A肽段的氨基酸序列优选如SEQ ID NO.42或 SEQ ID NO.43所示,编码所述2A肽段的DNA序列进一步优选如SEQ ID NO.38或SEQ ID NO.39所示,编码所述2A肽段的mRNA序列进一步优选如SEQ ID NO.40或SEQ ID NO.41所示。更佳地,所述连接后的mRNA的序列优选如SEQ ID NO.35或36所示,其 DNA序列优选如SEQ ID NO.28或29所示。
较佳地,所述mRNA包含编码来源于SARS-CoV-2病毒的所述的S蛋白的mRNA。
较佳地,所述mRNA包含编码来源于SARS-CoV-2病毒的所述的S蛋白的RBD结 构域的mRNA。
较佳地,所述mRNA包含编码来源于SARS-CoV-2病毒的M蛋白、E蛋白和S蛋白 的mRNA,所述M蛋白和E蛋白的mRNA连接后进行表达,所述连接优选通过编码2A 肽段的所述2A肽段的序列进行连接。其中,所述2A肽段的氨基酸序列优选如SEQ ID NO. 42或SEQ IDNO.43所示,编码所述2A肽段的DNA序列进一步优选如SEQ ID NO.38 或SEQ ID NO.39所示,编码所述2A肽段的mRNA序列进一步优选如SEQ ID NO.40或SEQ ID NO.41所示。更佳地,所述连接后的mRNA的序列优选如SEQ ID NO.35或36 所示,其DNA序列优选如SEQ ID NO.28或29所示。进一步更佳地,连接后的mRNA 与所述S蛋白的mRNA的摩尔比优选为1.5:1~3:1,例如为2:1。
较佳地,当所述的mRNA包含编码来源于SARS-CoV-2病毒的S蛋白、E蛋白、M 蛋白和N蛋白中的两种、三种或四种蛋白或其片段的mRNA时,所述mRNA编码的蛋 白自组装成病毒样颗粒。
为了解决上述技术问题,本发明第二方面提供了一种DNA,其包含编码来源于SARS-CoV-2病毒的S蛋白、E蛋白、M蛋白和N蛋白中的至少一种蛋白(例如一种、 两种、三种、四种)或其片段的DNA,
其中,编码所述S蛋白的DNA的序列如SEQ ID NO.3、SEQ ID NO.4或SEQ ID NO. 5所示;编码所述E蛋白的DNA的序列如SEQ ID NO.8所示;编码所述M蛋白的DNA 的序列如SEQID NO.11所示;编码所述N蛋白的DNA的序列如SEQ ID NO.13所示。
较佳地,所述片段为所述S蛋白的RBD结构域的片段,其DNA序列优选如SEQ IDNO.30所示。
为了解决上述技术问题,本发明第三方面提供了组合物,其包含多个或多于一个如 本发明第一方面所述的mRNA或如本发明第二方面所述的DNA。
为了解决上述技术问题,本发明第四方面提供了一种脂质体纳米颗粒,其包含本发 明第一方面所述的mRNA、如本发明第二方面所述的DNA或如本发明第三方面所述的 组合物。
较佳地,所脂质体纳米颗粒还包括阳离子脂质和辅助脂质,所述阳离子脂质优选为 DLin-MC3-DMA或DOTMA,所述辅助脂质优选为DSPC和/或胆固醇。
本发明中,所述DLin-MC3-DMA的结构式一般如下所示:
Figure BDA0002514168830000081
本发明中,所述DOTMA的结构式一般如下所示:
Figure BDA0002514168830000082
本发明中,所述DSPC的结构式一般如下所示:
Figure BDA0002514168830000091
较佳地,所脂质体纳米颗粒为长循环阳离子脂质体纳米颗粒,优选为经PEG或其衍生物修饰的长循环阳离子脂质体纳米颗粒;所述PEG的相对分子质量优选为2000~5000,例如为2000、3000、4000或5000。在本发明某一较佳实施例中,所述脂质体纳米颗粒为 包括DMPE-PEG2000的长循环阳离子脂质体纳米颗粒。
为了解决上述技术问题,本发明第五方面提供了一种病毒样颗粒,其包含由本发明 第一方面所述的mRNA所表达的相应蛋白自组装而成、如本发明第二方面所述的DNA 所表达的相应蛋白自组装而成和/或如本发明第三方面所述的组合物所表达的相应蛋白 自组装而成,优选是将所述的mRNA、所述的DNA和/或所述的组合物转入细胞后表达 相应的蛋白,所述细胞优选为293T和/或293A。
较佳地,所述的病毒样颗粒由编码SARS-CoV-2病毒的S蛋白、E蛋白、M蛋白和 N蛋白中的两种、三种或四种蛋白或其片段的mRNA所表达的蛋白自组装而成,优选在 细胞中表达所述蛋白,所述细胞优选为293T和/或293A。
更佳地,所述的病毒样颗粒由编码SARS-CoV-2病毒的S蛋白、E蛋白和M蛋白的mRNA,所述S蛋白、E蛋白、M蛋白由三条独立的mRNA分别表达的蛋白自组装得到, 表达所述S蛋白、E蛋白、M蛋白的mRNA的摩尔比优选为1:(2~0.5):(2~0.5),例如 为1:1:1。
更佳地,所述的病毒样颗粒由编码SARS-CoV-2病毒的M蛋白和E蛋白的mRNA 所表达的蛋白自组装而成,优选在细胞中表达所述蛋白,所述细胞优选为293T和/或293A。 其中,所述M蛋白和E蛋白的mRNA优选连接后进行表达,所述连接优选通过编码2A 肽段的mRNA的序列进行连接。其中,所述2A肽段的氨基酸序列优选如SEQ ID NO.42 或SEQ ID NO.43所示,编码所述2A肽段的DNA序列进一步优选如SEQ ID NO.38或 SEQ ID NO.39所示,编码所述2A肽段的mRNA序列进一步优选如SEQ ID NO.40或SEQ ID NO.41所示。更佳地,所述连接后的mRNA序的列优选如SEQ ID NO.35或36所示, 其DNA序列优选如SEQ ID NO.28或29所示。
更佳地,所述的病毒样颗粒由编码SARS-CoV-2病毒的M蛋白、E蛋白和S蛋白的mRNA所表达的蛋白自组装而成,优选在细胞中表达所述蛋白,所述细胞优选为293T 和/或293A。其中,所述M蛋白和E蛋白的mRNA连接后进行表达,所述连接优选通过 编码2A肽段的mRNA的序列进行连接。其中,所述2A肽段的氨基酸序列优选如SEQ ID NO.42或SEQ IDNO.43所示,编码所述2A肽段的DNA序列进一步优选如SEQ ID NO. 38或SEQ ID NO.39所示,编码所述2A肽段的RNA序列进一步优选如SEQ ID NO.40 或SEQ ID NO.41所示。更佳地,所述连接后的mRNA的序列优选如SEQ ID NO.35或 36所示,其DNA序列优选如SEQ ID NO.28或29所示。进一步更佳地,连接后的mRNA 与所述S蛋白的mRNA的摩尔比优选为1.5:1~3:1,例如为2:1。
为了解决上述技术问题,本发明第六方面提供了一种针对新冠病毒的mRNA疫苗,其包含如本发明第一方面所述的mRNA、如本发明第二方面所述的DNA、如本发明第三 方面所述的组合物和/或如本发明第四方面所述的脂质体纳米颗粒。
较佳地,所述mRNA疫苗诱导细胞产生病毒样颗粒,以激活免疫系统。
较佳地,所述mRNA疫苗还包括本领域常规使用的佐剂。
为了解决上述技术问题,本发明第七方面提供了一种药物组合物,其包含如本发明 第一方面所述的mRNA、如本发明第二方面所述的DNA、如本发明第三方面所述的组合物、如本发明第四方面所述的脂质体纳米颗粒、如本发明第五方面所述的病毒样颗粒、 和/或如本发明第六方面所述的mRNA疫苗,和任选地药用载体。
为了解决上述技术问题,本发明第八方面提供了一种试剂盒,其包含如本发明第一 方面所述的mRNA、如本发明第二方面所述的DNA、如本发明第三方面所述的组合物、 如本发明第四方面所述的脂质体纳米颗粒、如本发明第五方面所述的病毒样颗粒、如本 发明第六方面所述的mRNA疫苗和/或如本发明第七方面所述的药物组合物。
为了解决上述技术问题,本发明还提供了一种编码2A肽段的mRNA,其序列优选 如SEQ ID NO.40或SEQ ID NO.41所示。
为了解决上述技术问题,本发明还提供了一种编码2A肽段的DNA,其序列如SEQ IDNO.38或SEQ ID NO.39所示。
为了解决上述技术问题,本发明还提供了如本发明第一方面所述的mRNA或如本发明第二方面所述的DNA或如本发明第三方面所述的组合物在制备如本发明第四方面所 述的脂质体纳米颗粒、如本发明第五方面所述的病毒样颗粒、如本发明第六方面所述的 mRNA疫苗、如本发明第七方面所述的药物组合物、和/或如本发明第八方面所述的试剂 盒中的应用。
为了解决上述技术问题,本发明还提供了一种预防和/或治疗新冠病毒感染的方法, 其包括(任选地向有需要的受试者)施用如本发明第一方面所述的mRNA、如本发明第二方面所述的DNA、如本发明第三方面所述的组合物、如本发明第四方面所述的脂质体 纳米颗粒、如本发明第五方面所述的病毒样颗粒、如本发明第六方面所述的mRNA疫苗、 如本发明第七方面所述的药物组合物、和/或如本发明第八方面所述的试剂盒的步骤。
为了解决上述技术问题,本发明还提供了一种如本发明第一方面所述的mRNA、如本发明第二方面所述的DNA、如本发明第三方面所述的组合物、如本发明第四方面所述 的脂质体纳米颗粒、如本发明第五方面所述的病毒样颗粒、如本发明第六方面所述的 mRNA疫苗、如本发明第七方面所述的药物组合物、和/或如本发明第八方面所述的试剂 盒在预防和/或治疗新冠病毒感染中的应用。
本发明中,所示编码2A肽段的序列可以是编码天然病毒的2A肽段的序列,也可以是经过优化的序列(例如T2A和P2A,T2A的mRNA序列可以是如SEQ ID NO.40所示, P2A的mRNA序列可以是如SEQ ID NO.41所示。其对应的DNA序列可以是如SEQ ID NO.38和SEQ IDNO.39所示,经翻译后所得多肽的氨基酸序列可以是如SEQ ID NO.42 和SEQ ID NO.43所示)。这段多肽可高效“自剪切”成前后两个片段,可以使得这段序 列前后两部分的序列独立表达成两个单独的蛋白,达到一段序列上协同表达两个独立蛋 白的目的。
术语解释
本发明中,所述的mRNA又称信使RNA,通常是由DNA的一条链作为模板转录而 来的、携带遗传信息的能指导蛋白质合成的一类单链核糖核酸。以细胞中基因为模板, 依据碱基互补配对原则转录生成mRNA后,mRNA就含有与DNA分子中某些功能片段 相对应的碱基序列,作为蛋白质生物合成的直接模板。
本发明中,所述的mRNA疫苗通常是通过将编码病毒抗原的mRNA直接导入到人 体内,在细胞内表达病毒蛋白抗原,从而激活人体免疫系统,产生针对病毒的中和抗体。
本发明中,所述的抗原(antigen,缩写Ag)一般是指能引起抗体生成的物质,是任何可诱发免疫反应的物质。
本发明中,所述的抗体一般是指机体在抗原物质刺激下,由B细胞分化成的浆细胞所产生的、可与相应抗原发生特异性结合反应的免疫球蛋白。
本发明中,所述的中和抗体一般是指微生物入侵人体后,会刺激产生很多种抗体,但只有部分抗体能迅速识别微生物,并在其入侵人体细胞前将其“抓住”,保护人体不被 感染。这个过程就叫中和作用,发挥作用的抗体就是中和抗体。
本发明中,所述的脂质体纳米颗粒一般是指利用脂质体将药物分子(小分子化合物、 RNA、DNA或蛋白质药物)包装成百纳米大小的复合物,将药物递送到体内,具有增加 药物溶解度、延长药物在体内的滞留时间、增强药物的靶向性及降低毒性等优势。
本发明中,所述的病毒样颗粒(virus-like particles,VLPs)通常是含有某种病毒的一 个或多个结构蛋白的空心颗粒,没有病毒核酸,不能自主复制,在形态上与真正病毒粒 子相同或相似,俗称伪病毒。
本发明中,所述的新冠病毒S蛋白(Spike蛋白)又称棘突或刺突蛋白。S蛋白是冠状病毒最重要的致病靶点蛋白,包含两个亚基,S1和S2。其中,S1主要包含有受体结 合区(RBD结构域),冠状病毒正是通过RBD结构域与细胞表面受体结合来感染细胞。 S蛋白也因此主要承担病毒与宿主细胞膜受体结合及膜融合功能。同时,也是宿主中和 抗体的重要作用位点以及疫苗设计的关键靶点。
在符合本领域常识的基础上,上述各优选条件,可任意组合,即得本发明各较佳实例。
本发明所用试剂和原料均市售可得。
本发明的积极进步效果在于:
(1)本发明的经过密码子优化后或者进一步经过核苷酸修饰的组装新冠病毒所需的 几种蛋白的mRNA,各自单独在细胞中都可以高表达。此外,进一步经过本发明特定比 例组成的mRNA,在细胞水平高效产生病毒蛋白,或由产生的蛋白自组装成病毒样颗粒, 能够实现病毒样颗粒的高表达,且大小和形态结构均极为接近真实病毒,后续用于临床 时可以使机体获得免疫能力能应对真实病毒。
(2)包含本发明mRNA的纳米颗粒,多条mRNA同时被脂质纳米颗粒包装的效率 /表达效率高依旧很高,从而能够产生足够剂量的病毒样颗粒刺激机体产生免疫应答,免 疫原性和稳定性较高。
(3)将包含本发明的经过密码子优化后或者进一步经过核苷酸修饰的组装新冠病毒 所需的几种蛋白的mRNA制备成疫苗(例如以表达病毒样颗粒的形式、仅表达S蛋白的 疫苗的形式或以仅表达S蛋白中RBD区域的疫苗的形式)时,安全性高、有效性好、不 会产生非中和抗体从而不会产生抗体依赖增强感染效应。
附图说明
图1显示了本发明实施例的概述。实施例中用mRNA表达新冠病毒结构蛋白S、M、 E和N以及S蛋白的RBD结构域。用脂质体将mRNA包被成纳米粒(LNP),进行细胞 转染或动物免疫。体外细胞转染的多种mRNA可高表达病毒蛋白,并在合适比例下,自 组装成病毒样颗粒(VLP)。LNP免疫小鼠后,激活小鼠免疫系统,产生抗体。
图2显示了mRNA用脂质体包被转染293A细胞后,Western Blot检测蛋白表达的 结果图。其中,泳道(lane)1为经cap1修饰的mRNA表达的蛋白,泳道2为经 cap1+5mC+pseudoU修饰的mRNA表达的蛋白,泳道3为经cap1+pseudoU修饰的mRNA 表达的蛋白,泳道4为经cap1+5moU修饰的mRNA表达的蛋白,泳道5为经 cap1+N1-m-pseudoU修饰的mRNA表达的蛋白,泳道6为经cap1+5mC修饰的mRNA表 达的蛋白。A为N蛋白的mRNA和NBL mRNA表达蛋白的WB结果图,B为EBL mRNA 和MBL mRNA表达蛋白的WB结果图,C为SGS mRNA、STFmRNA、SBLmRNA表达蛋白的WB结果图,D为SDC50、SDC54、SDC58、SDC60表达蛋白的WB结果图,E 为SGS-RBD结构域表达蛋白的WB结果图,F为MP2AE和MT2AE表达蛋白的WB结 果图。
图3显示了VLP颗粒的电镜照片。
图4显示了mRNA脂质纳米颗粒包装示意图。
图5显示了用ZetaView检测LNP色谱分布图。A的上图是包被了表达S蛋白的SGSmRNA的LNP过滤前的粒径和分布图,下图是同一LNP过滤后的粒径分布图。B的上 图是表达S蛋白的RBD结构域的mRNA的LNP过滤前的粒径和分布图,下图是同一LNP 过滤后的粒径分布图。C的上图是包被了表达M、E和S蛋白mRNA的LNP过滤前的粒 径和分布图,下图是同一LNP过滤后的粒径分布图。
图6显示了经过第一次免疫后一周,用酶联免疫吸附法测试血清中抗体滴度的结果 图。
图7显示小鼠血清中和抗体滴度实验结果。表达S蛋白全长的mRNA(Spike)和 表达病毒样颗粒的mRNA组合(SME),诱导产生的抗体滴度均大于104,单独产生RBD 结构域的mRNA(RBD)诱导产生的中和抗体滴度略高于空白对照(Ctrl)。
具体实施方式
下面通过实施例的方式进一步说明本发明,但并不因此将本发明限制在所述的实施 例范围之中。下列实施例中未注明具体条件的实验方法,按照常规方法和条件,或按照商品说明书选择。
本发明针对新型冠状病毒开发的mRNA疫苗,主要采用(1)表达多种病毒蛋白, 在体内组装成病毒样颗粒;(2)表达S蛋白全长的mRNA;(3)表达S蛋白RBD结构 域这三种方式,如图1所示。
实施例1 mRNA制备
为新型冠状病毒(SARS-CoV-2)4个结构基因S、M、E、N进行了密码子优化,并 为每条基因设计了多条编码序列。每种序列将被克隆到mRNA合成载体上。对每种序列, 制备两种mRNA,一种编码不含标签的野生型蛋白质,一种编码C端带有Flag标签,用 于后期表达验证。具体步骤为:
委托上海生工进行合成携带针对SARS-Cov-2的S蛋白(Spike protein)、M蛋白、E蛋白、N蛋白(氨基酸序列分别如SEQ ID NO.1、SEQ ID NO.9、SEQ ID NO.6和SEQ ID NO.12所示,四种蛋白的天然基因序列分别如SEQ ID NO.2(连接有3’-UTR-2)、SEQ ID NO.10、SEQID NO.7和SEQ ID NO.14(连接有3’-UTR-2)所示)的经过密码子优 化的基因序列,其中:S蛋白基因优化后的序列分别如SEQ ID NO.3(SGS,连接有 3’-UTR-2)、SEQ ID NO.4(SBL或S-benchling,连接有3’-UTR-1)、SEQ ID NO.5(STF, 连接有3’-UTR-1)、SEQ ID NO.24(SDC50,连接有3’-UTR-2)、SEQ ID NO.25(SDC54, 连接有3’-UTR-2)、SEQ ID NO.26(SDC58,连接有3’-UTR-2)、SEQ ID NO.27(SDC60, 连接有3’-UTR-2)所示;M蛋白基因优化后的序列如SEQ ID NO.11(MBL,连接有 3’-UTR-1)所示;E蛋白基因优化后的序列如SEQ IDNO.8(EBL,连接有3’-UTR-1) 所示;N蛋白基因优化后的序列如SEQ ID NO.13(NBL,连接有3’-UTR-2)所示。随 后将上述密码子优化过的基因序列亚克隆到含有T7启动子和5’非编码区(5’UTR,序列 如SEQ ID NO.15所示)、3’非编码区(3’UTR,序列如SEQ ID NO.16(3’-UTR-1)或 SEQ ID NO.17所示(3’-UTR-2))的载体(载体有两种:一种是在pUC19基础上加入 5’-UTR和3’-UTR-1区域;一种是在pUC57基础上加入5’-UTR和3’-UTR-2区域)中。 S、E、N和M蛋白分别在C-末端被HA和Flag标记。扩增载体后,通过限制性内切酶 进行酶切使载体线性化(操作步骤均为本领域常规)。进一步纯化酶切片段,并将其用作 体外转录(IVT)的模板,合成修饰的mRNA,具体为:使用具有1-2μg模板的HyperScribe T7高产率RNA合成试剂盒(ApexBio)和加帽cap0或cap1类似物(购于ApexBio)(每 个修饰的核苷酸7.5mM)进行IVT。将反应在37℃下孵育2-4小时,然后进行DNase (Thermo)处理。使用poly(A)加尾试剂盒(ApexBio)将3′poly(A)尾巴进一步添 加到IVT RNA产品中。通过使用RNAClean andConcentrator试剂盒(ApexBio)纯化 mRNA。其中,所得S蛋白基因优化后的mRNA序列分别如SEQ ID NO.18(SGS mRNA)、 SEQ ID NO.19(SBL mRNA)和SEQ ID NO.20(STF mRNA)、以及SEQ ID NO.31 (SDC50)、SEQ ID NO.32(SDC54)、SEQ ID NO.33(SDC58)、SEQ ID NO.34(SDC60) 所示,所得M蛋白基因优化后的mRNA序列如SEQ ID NO.22(MBL mRNA)所示,所 得E蛋白基因优化后的mRNA序列如SEQ ID NO.21(EBL mRNA)所示,所得N蛋白 基因优化后的mRNA序列如SEQ ID NO.23(NBL mRNA)所示。
实施例2 体外转录中掺入的修饰核苷酸
在实施例1中所述的体外转录合成修饰的mRNA过程中,在反应体系中按照一定比例加入修饰核苷酸,修饰核苷酸随机插入mRNA序列中。本实施例尝试使用的修饰核苷 酸包括5-methyl-CTP(简称5mC,ApexBio,#B7967)、pseudo-UTP(简称pseudoU, ApexBio,#B7972)、N1-Methylpseudo-UTP(简称N1-m-pseudo,ApexBio,#B8049)、 5-Methoxy-UTP(5moU,ApexBio,#B8061);用于mRNA的5’加帽的修饰核苷酸有 3′-O-Me-m7G(5')ppp(5')G(ARCA,cap0,APExBIO公司产品,#B8175)、 m7G(5')ppp(5')(2'OMeA)pG(APExBIO公司产品EZ Cap#B8176,cap1,)和 m7(3'OMeG)(5')ppp(5')(2'OMeA)pG(APExBIO公司产品EZ Cap#B8178,cap1类似物)。
具体实验步骤为:
(1)体外mRNA序列中插入多种上述修饰核苷酸;体外转录过程中随机插入修饰 核苷酸,按照修饰核苷酸与未修饰核苷酸1:5的摩尔比,掺入反应体系,使用APExBIO 公司试剂盒#K1047。按照试剂盒使用说明配置反应体系,37℃反应2-4小时。
(2)转录过程如加入5’加帽核苷酸;则在转录反应体系中同时加入5’ m7(3'OMeG)(5')ppp(5')(2'OMeA)pG、m7G(5')ppp(5')(2'OMeA)pG或 3′-O-Me-m7G(5')ppp(5')G,与GTP的摩尔比为8:1。
(3)3’端加入120个聚腺苷酸的序列;使用poly(A)拖尾试剂盒(APExBIO,#K1053)将3′poly(A)尾巴添加到IVT RNA产品中,按照试剂盒使用说明配置反应体系,37℃ 反应1小时。
(4)DNase消化DNA模板;使用NEB公司DNase I(货号#M0303S)进行DNA 模板消化,37℃反应1小时。
(5)mRNA纯化;用Thermo Fisher RiboPure Kit(#AM1924)纯化转录后,DNA 模板已消化的mRNA,用1mM柠檬酸钠,pH6.4的溶液洗脱。进行琼脂糖胶核酸电泳 检测mRNA,并用NanoDrop测定浓度。
实施例3 mRNA转染细胞
使用lipofectamine 2000(lipo2K,ThermoFisher Scientific#11668019)以质量体积比 1:2(mRNA:lipo2K,1g mRNA+2L lipo2K)的比例将实施例1和2中所得S、M、E、 N的mRNA分别转染到293A细胞中,24hr后用Western Blot检测蛋白表达。所得结果 如图2所示。
图2中,数字代表mRNA序列中插入了不同的修饰核苷酸:1.cap1;2. cap1+5mC+pseudoU;3.cap1+pseudoU;4.cap1+5moU;5.cap1+N1-m-pseudo;6.cap1+5mC。 细胞表达的N蛋白和E蛋白均带有HA序列标签,用抗HA的抗体做western blot检测细 胞中蛋白表达情况,GAPDH蛋白作为阳性对照。其中:
如图2的A所示,N蛋白较小,各序列和修饰优化的mRNA,均可以在细胞内表达 N蛋白,其中cap1+5mC+pseudoU(lane 2)和cap1+5moU(lane 4)两种修饰时的mRNA 对于蛋白表达相对较低。
图2的B中,EBL序列表达很强,用抗HA标签肽的抗体检测,信号很强。MBL序 列接flag标签肽,用抗flag标签肽的抗体进行检测,发现cap1(lane 1)、cap1+pseudoU (lane3)以及cap1+N1-m-pseudo(lane 5)、cap1+5mC(lane 6)四种修饰组合表达较好, cap1+5mC+pseudoU(lane 2)和cap1+5moU(lane 4)两种修饰组合的mRNA,E蛋白表 达量很低。
图2的C和D中,表达S蛋白的序列分别连接HA标签肽或flag标签肽,用抗HA 或flag标签肽的抗体进行检测,蛋白表达差异极大。由图2的C中可以看出,未经优化 的天然S基因序列在293A细胞中几乎不表达,或表达量极低。STF和SBL两种优化序 列比天然S基因序列的蛋白表达略有提高,cap1+pseudoU(lane 3)和cap1+N1-m-pseudo (lane 5)修饰的STF表达的蛋白量相对较高,cap1+pseudo(lane 3)修饰的SBL表达的 蛋白量相对较高。SGS基因优化序列使得蛋白表达大为上升,表达量最好的是添加 cap+pseudoU(lane 3)和cap1+N1-m-pseudoU(lane 5)两种方式修饰的SGS序列,cap1 (lane 1)、cap1+5mC+pseudoU(lane2)、cap1+5mC(lane 6)修饰的SGS序列的表达量 也较高。由图2的D中可以看出,SDC50、SDC54、SDC58、SDC60这些优化后的序列 所表达的蛋白中,杂蛋白很多。
图2的E中,带有HA标签肽的SGS-RBD优化mRNA序列(mRNA序列如SEQ ID NO.37所示,其对应的DNA序列如SEQ ID NO.30所述,均采用pseudoU多核苷酸修 饰,5’加帽的结构为Cap1,3’加入120个聚腺苷酸,连接了序列如SEQ ID NO.15所 示的5’UTR和序列如SEQ IDNO.16或17所示的3’UTR)可以在细胞中高表达S蛋白 RBD结构域。
图2的F中,用一条mRNA序列串联表达两种蛋白M和E(即将M蛋白和E蛋白 的mRNA相连接后再进行表达,中间可使用不同的表达2A肽段的mRNA进行连接(蛋 白表达后,2A肽经过“自剪切”,最后能够得到独立的M和E蛋白。在天然病毒2A序 列基础上,经过优化所得的表达2A肽段的T2A和P2A多肽对应的DNA序列如SEQ ID NO.38和SEQ ID NO.39所示,T2AmRNA序列如SEQ ID NO.40所示,P2A mRNA序 列如SEQ ID NO.41所示,可翻译成多肽(对于序列为SEQ ID NO.42和SEQ ID NO.43)),mRNA相连接后所得MT2AE的mRNA序列如SEQ IDNO.35所示(对应的DNA序列 如SEQ ID NO.28所示),mRNA相连接后所得MP2AE的mRNA序列如SEQ ID NO.36 所示(对应的DNA序列如SEQ ID NO.29所示),采用pseudoU多核苷酸修饰,5’加帽 的结构为Cap1,3’加入120个聚腺苷酸,连接了序列如SEQ ID NO.15所示的5’UTR 和序列如SEQ ID NO.16或17所示的3’UTR),western blot显示位置接近的双带代表了 M和E蛋白。MP2AE优化mRNA序列得到的两个蛋白量更接近。
实施例4 病毒样颗粒的制备和观测
为了产生病毒样颗粒(VLP),将表达的S、M、E蛋白的mRNA(SGS mRNA、MBL mRNA、EBLmRNA组成的SME mRNA,均采用pseudoU多核苷酸修饰,5’加帽的结 构为Cap1,3’加入120个聚腺苷酸,连接了序列如SEQ ID NO.15所示的5’UTR和序 列如SEQ ID NO.16或17所示的3’UTR),按照摩尔比1:0.5:0.5的比例,用lipo2K包被, 共转染到293A细胞中,并在转染后48小时收集上清液。或者采用串联表达M蛋白、E 蛋白的mRNA(即将M蛋白和E蛋白的mRNA相连接后再进行后续步骤,中间可使用 不同的连接肽进行连接,mRNA相连接后的序列如SEQ IDNO.35或36所示),与表达 S蛋白mRNA(SEQ ID NO.3)按照摩尔比2:1的比例,用lipo2K包被,共转染到293A 细胞中,并在转染后48小时收集上清液。
将所收集的上清液使用100kDa截止浓度的Amicon Ultra-15(Millipore)进行浓缩, 然后将其置于合适的溶液中(20mM HEPES,pH 7.4,120mM NaCl)。在4℃,31,000rpm(Beckman超速离心机,转子型号SW32)超速离心90分钟后,立即用5mL注射器提 取30-40%(w/v)之间的蔗糖溶液,该溶液包含病毒样颗粒(VLP)。使用100kDa截止 浓度的AmiconUltra-15离心管进行溶液交换,将含VLP的溶液置换成PBS缓冲液。为 了制备用于负染透射电子显微镜(TEM)的栅格,将5μL VLP溶液在辉光放电碳涂层的 栅格上吸收2分钟。将网格以逐滴滴加方式染色60秒,然后加载到Talos L120C显微镜 (Thermofisher)上以可视化VLP。结果如图3显示,图3的a中显示了由mRNA转录 出的S、E和M蛋白,自组装成的新冠病毒样颗粒电镜照片;图3的b中单个病毒样颗 粒放大照片,并测量表面棘突大小;图3的c为新冠病毒样颗粒的卡通模式图。由图3 可知,电镜下VLP颗粒直径约90nm,表面形成与天然病毒类似的三聚体棘突,棘突大 小约12×13nm,与天然病毒的大小和结构都极为接近。
实施例5 mRNA包被方法
按照之前报道,将实施例2中所得含有修饰核苷酸的mRNA(分别为表达S蛋白RBD结构域的mRNA;可以表达S蛋白的SGS mRNA;表达S、M、E三种蛋白的SME mRNA, 按摩尔比为1:0.5:0.5混合表达;均采用pseudoU多核苷酸修饰,5’加帽的结构为Cap1, 3’加入120个聚腺苷酸,连接了序列如SEQ ID NO.15所示的5’UTR和序列如SEQ ID NO. 16或17所示的3’UTR)利用DLin-MC3-DMA(APExBIO,#A8791),在低pH下可离 子化(阳离子),两种辅助脂质(DSPC和胆固醇)和聚乙二醇化脂质(DMPE-PEG2000) 包被成纳米颗粒(示意图如图4所示)。通过将溶解在超纯水中的mRNA,与pH 3.0的 100μmM柠檬酸盐缓冲液1:1(v/v)混合,制备了mRNA的水溶液。调节四种脂质组 分[可离子化的脂质:胆固醇:DSPC:DMPE-PEG2000]比例(50:10:38.5:1.5),在乙醇(99.5%) 中溶解为脂质溶液。将mRNA和脂质溶液在NanoAssemblr(Precision Nanosystems)微 流体混合系统中以Aq:EtOH=3:1的体积混合比和恒定的12mL/min的总流速混合, 得到包含mRNA的脂质体纳米颗粒(LNP)。
为了表征上述所制得的LNP,在制备后,将25μL的样品级分注入975的10μmM磷 酸盐缓冲液(pH 7.4)中,并用于在ZetaSizer(Malvern Instruments Inc.)上测量强度平 均粒径(Z平均值)。立即将样品馏分转移至Slide-a-lyzer G2透析盒(10000MWCO,ThermoFischer Scientific Inc.)中,并在4℃下针对PBS(pH7.4)透析过夜。PBS缓冲液的体积 为样品体积的650-800倍。收集样品部分,并从该体积中将25μL注入975μL 10μmM 磷酸盐缓冲液(pH 7.4)中,并再次测量粒径(透析后粒径),LNP在透析前后,直径约 100nm,均处于均一稳定的状态,见图5和表1。透析过的样品用来进行小鼠注射免疫。 图5中显示了表达S蛋白的SGS mRNA对应的结果图,以及表达S蛋白RBD结构域的 mRNA和表达S、M、E三种蛋白的对应mRNA的包装结果。这些结果表面,mRNA经 过脂质体包装后,粒径在100-110nm之间,包装效率大于90%。
表1中显示了用ZetaView检测LNP包被后mRNA样品粒径及分布。S-RBD mRNA 可以表达S蛋白RBD结构域,SGS mRNA可以表达S蛋白,SME mRNA表达S、M、E 三种蛋白,可以形成病毒样颗粒。包被后的LNP粒径均在100-110nm之间,符合纳米颗 粒大小预期。稀释后数量在100-300之间,稀释比例适合。在用1xPBS透析并用0.22μM 或0.45μM滤膜过滤后,粒径和数量保持稳定,可用于后续动物实验。
表1
Figure BDA0002514168830000181
实施例6 小鼠免疫实验
将包被好的表达新冠病毒VLP(含有实施例5中所述的表达S蛋白的SGS mRNA,或表达S、M、E三种蛋白的SME mRNA)或RBD(含有实施例5中所述的表达S蛋白RBD结 构域的mRNA)的上述脂质体纳米颗粒,配合免疫佐剂,进行Balb/c小鼠(肌肉(i.m.) 注射,具体信息如下表2所示。在第42天收集血液样品,并且在荧光抗体病毒中和检测中 分析血清,具体如以下实施例7所述。
表2
组别 品系 数量 途径体积 疫苗剂量 接种时间
1 Balb/c 8 i.m.50μl×3 Control PBS D0,致敏;D14,加强免疫;D35,加强免疫
2 Balb/c 8 i.m.50μl×1 mRNA 10μg D0,致敏
3 Balb/c 8 i.m.50μl×2 mRNA 10μg D0,致敏;D14,加强免疫;
4 Balb/c 8 i.m.50μl×3 mRNA 10μg D0,致敏;D14,加强免疫;D35,加强免疫
实施例7 酶联免疫吸附法测定血清中抗体效价
分别用2μg/ml抗原蛋白(溶于PBS),100ng,包被96孔ELISA板,50μl/孔,4 度过夜避光。其中S蛋白抗原购自Sino Biological,货号40589-V08B1;S蛋白RBD结 构域,购自Novoprotein,货号DRA36。PBST(0.05%Tween)洗3次,200μl/孔,每次 倒扣ELISA板并轻击清洗干净。加100μl/孔2%BSA(溶于PBST)封闭,室温孵育1hr。 PBST(0.05%Tween)洗3次,200μl/孔,每次倒扣ELISA板并轻击清洗干净。加小鼠 血清(稀释100倍作为起始浓度,之后5倍梯度稀释,共6个梯度)于PBS中,混合均 匀后取100μl分别加到ELISA板中,室温孵育2hr。将实施例6中的小鼠进行眼周采血 100μl约20μl血清。洗板后,分别加HRP-anti-mouseIgG(1:5000稀释于PBS),100μl/ 孔,室温孵育1hr。洗板后,分别加TMB底物(ThermoFisher,货号34022),50μl/孔, 室温等待5-15min(避光),将呈现蓝色。分别加1M硫酸终止反应,150μl/孔,蓝色将 变黄。酶标仪读取OD450。
所得结果如图6所示,其中可见mRNA表达病毒样颗粒产生的抗体滴度最高,达到107。表达S蛋白的mRNA产生的抗体滴度达到106。表达S蛋白RBD结构域的mRNA, 产生的抗体滴度为104。可见使用本发明的mRNA表达的病毒样颗粒均可有效激活小鼠 免疫系统,促使血清中产生抗体,有效发挥疫苗的作用。
实施例8 中和抗体检测实验
通过病毒中和测定进行病毒中和抗体应答(特异性B细胞免疫应答)的检测。所述测定 的结果称为病毒中和滴度(VNT)。按照WHO标准,如果各自的VNT是至少0.5IU/ml,则认为抗体滴度是保护性的。因此,在第42天从实施例6中所述接种的小鼠中采集血液样品,并且制备血清。这些血清用于荧光抗体滴度中和(FAVN)检测,所述检测使用人源CACO-2 细胞。培养细胞用假病毒颗粒(表达新冠病毒S蛋白,内核为EGFP DNA)进行感染。之 后不久,以连续两倍稀释四次一式四份检测热灭活的血清,检测其以50μl的体积中和假 病毒颗粒的100TCID50(组织培养感染剂量50%)的潜力。因此,将血清稀释液在37℃(在 具有5%CO2的潮湿的培养箱中)用病毒温育1小时,然后加入胰蛋白酶解的CACO-2细胞(4 ×105个细胞/ml;50μl/孔)。将感染的细胞培养物在潮湿的培养箱中在37℃和5%CO2培养 48小时。在室温用80%丙酮固定细胞后,用荧光检测EGFP表达,用量标志细胞的感染情 况。
根据图7的结果可知,本发明的疫苗(以表达病毒样颗粒的形式、仅表达S蛋白的疫苗的形式或者以仅表达S蛋白中RBD区域的疫苗的形式)均可有效激活小鼠免疫系统, 血清中产生抗体,且其安全性高、有效性好。其中表达病毒样颗粒的mRNA组合产生的 中和抗体滴度最高。不会产生非中和抗体从而不会产生抗体依赖增强感染效应。
参考文献
1.Huang C,Wang Y,Li X,Ren L,Zhao J,et al.2020.Lancet
2.Zhu N,Zhang D,Wang W,Li X,Yang B,et al.2020.N Engl J Med
3.de Wit E,van Doremalen N,Falzarano D,Munster VJ.2016.Nat RevMicrobiol 14:523-34
4.Potter CW.2001.J Appl Microbiol 91:572-9
5.Smith W,Andrewes CH,Laidlaw PP.1933.Lancet 2:66-8
6.Barberis I,Myles P,Ault SK,Bragazzi NL,Martini M.2016.J Prev MedHyg 57: E115-E20
7.Wolff JA,Malone RW,Williams P,Chong W,Acsadi G,et al.1990.Science247: 1465-8
8.Jirikowski GF,Sanna PP,Maciejewski-Lenoir D,Bloom FE.1992.Science255:996-8
9.Zangi L,Lui KO,von Gise A,Ma Q,Ebina W,et al.2013.Nat Biotechnol31:898-907
10.Kariko K,Muramatsu H,Ludwig J,Weissman D.2011.Nucleic Acids Res39:e142
11.Reichmuth AM,Oberli MA,Jaklenec A,Langer R,BlankschteinD.2016.Ther Deliv 7: 319-34
12.Sahin U,Kariko K,Tureci O.2014.Nat Rev Drug Discov 13:759-80
13.Pardi N,Hogan MJ,Porter FW,Weissman D.2018.Nat Rev Drug Discov 17:261-79
14.Hekele A,Bertholet S,Archer J,Gibson DG,Palladino G,etal.2013.Emerg Microbes Infect 2:e52
15.Richner JM,Himansu S,Dowd KA,Butler SL,Salazar V,et al.2017.Cell169:176
16.Richner JM,Jagger BW,Shan C,Fontes CR,Dowd KA,et al.2017.Cell 170:273-83 e12
17.Feldman RA,Fuhr R,Smolenov I,Mick Ribeiro A,Panther L,etal.2019.Vaccine 37: 3326-34
18.Chroboczek J,Szurgot I,Szolajska E.2014.Acta Biochim Pol 61:531-9
19.Yong CY,Ong HK,Yeap SK,Ho KL,Tan WS.2019.Front Microbiol 10:1781
20.Baric RS,Sheahan T,Deming D,Donaldson E,Yount B,et al.2006.Adv ExpMed Biol 581:553-60
21.Yip MS,Leung HL,Li PH,Cheung CY,Dutry I,et al.2016.Hong Kong Med J22: 25-31
22.Millet JK,Tang T,Nathan L,Jaimes JA,Hsu HL,et al.2019.J Vis Exp
23.Islam MA,Xu Y,Tao W,Ubellacker JM,Lim M,et al.2018.Nat Biomed Eng2: 850-64
SEQUENCE LISTING
<110> 上海蓝鹊生物医药有限公司
<120> mRNA及包含其的新冠病毒mRNA疫苗
<130> P20011191C
<160> 43
<170> PatentIn version 3.5
<210> 1
<211> 1273
<212> PRT
<213> SARS-COV-2
<400> 1
Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
1 5 10 15
Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
20 25 30
Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
35 40 45
His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
50 55 60
Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
65 70 75 80
Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
85 90 95
Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
100 105 110
Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
115 120 125
Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
130 135 140
Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
145 150 155 160
Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
165 170 175
Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
180 185 190
Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
195 200 205
Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
210 215 220
Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
225 230 235 240
Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
245 250 255
Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
260 265 270
Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
275 280 285
Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
290 295 300
Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val
305 310 315 320
Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
325 330 335
Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
340 345 350
Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
355 360 365
Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
370 375 380
Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
385 390 395 400
Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
405 410 415
Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
420 425 430
Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
435 440 445
Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
450 455 460
Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
465 470 475 480
Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
485 490 495
Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
500 505 510
Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
515 520 525
Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn
530 535 540
Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu
545 550 555 560
Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val
565 570 575
Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
580 585 590
Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val
595 600 605
Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile
610 615 620
His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser
625 630 635 640
Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val
645 650 655
Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala
660 665 670
Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala
675 680 685
Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser
690 695 700
Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile
705 710 715 720
Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val
725 730 735
Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu
740 745 750
Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr
755 760 765
Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln
770 775 780
Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe
785 790 795 800
Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser
805 810 815
Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly
820 825 830
Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp
835 840 845
Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu
850 855 860
Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly
865 870 875 880
Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile
885 890 895
Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr
900 905 910
Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn
915 920 925
Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala
930 935 940
Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn
945 950 955 960
Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val
965 970 975
Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln
980 985 990
Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val
995 1000 1005
Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn
1010 1015 1020
Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys
1025 1030 1035
Arg Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro
1040 1045 1050
Gln Ser Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr Val
1055 1060 1065
Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His
1070 1075 1080
Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val Phe Val Ser Asn
1085 1090 1095
Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro Gln
1100 1105 1110
Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp Val
1115 1120 1125
Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln Pro
1130 1135 1140
Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn
1145 1150 1155
His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn
1160 1165 1170
Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu
1175 1180 1185
Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu
1190 1195 1200
Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile Trp Leu
1205 1210 1215
Gly Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr Ile Met
1220 1225 1230
Leu Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys
1235 1240 1245
Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro
1250 1255 1260
Val Leu Lys Gly Val Lys Leu His Tyr Thr
1265 1270
<210> 2
<211> 3819
<212> DNA
<213> SARS-COV-2
<400> 2
atgtttgttt ttcttgtttt attgccacta gtctctagtc agtgtgttaa tcttacaacc 60
agaactcaat taccccctgc atacactaat tctttcacac gtggtgttta ttaccctgac 120
aaagttttca gatcctcagt tttacattca actcaggact tgttcttacc tttcttttcc 180
aatgttactt ggttccatgc tatacatgtc tctgggacca atggtactaa gaggtttgat 240
aaccctgtcc taccatttaa tgatggtgtt tattttgctt ccactgagaa gtctaacata 300
ataagaggct ggatttttgg tactacttta gattcgaaga cccagtccct acttattgtt 360
aataacgcta ctaatgttgt tattaaagtc tgtgaatttc aattttgtaa tgatccattt 420
ttgggtgttt attaccacaa aaacaacaaa agttggatgg aaagtgagtt cagagtttat 480
tctagtgcga ataattgcac ttttgaatat gtctctcagc cttttcttat ggaccttgaa 540
ggaaaacagg gtaatttcaa aaatcttagg gaatttgtgt ttaagaatat tgatggttat 600
tttaaaatat attctaagca cacgcctatt aatttagtgc gtgatctccc tcagggtttt 660
tcggctttag aaccattggt agatttgcca ataggtatta acatcactag gtttcaaact 720
ttacttgctt tacatagaag ttatttgact cctggtgatt cttcttcagg ttggacagct 780
ggtgctgcag cttattatgt gggttatctt caacctagga cttttctatt aaaatataat 840
gaaaatggaa ccattacaga tgctgtagac tgtgcacttg accctctctc agaaacaaag 900
tgtacgttga aatccttcac tgtagaaaaa ggaatctatc aaacttctaa ctttagagtc 960
caaccaacag aatctattgt tagatttcct aatattacaa acttgtgccc ttttggtgaa 1020
gtttttaacg ccaccagatt tgcatctgtt tatgcttgga acaggaagag aatcagcaac 1080
tgtgttgctg attattctgt cctatataat tccgcatcat tttccacttt taagtgttat 1140
ggagtgtctc ctactaaatt aaatgatctc tgctttacta atgtctatgc agattcattt 1200
gtaattagag gtgatgaagt cagacaaatc gctccagggc aaactggaaa gattgctgat 1260
tataattata aattaccaga tgattttaca ggctgcgtta tagcttggaa ttctaacaat 1320
cttgattcta aggttggtgg taattataat tacctgtata gattgtttag gaagtctaat 1380
ctcaaacctt ttgagagaga tatttcaact gaaatctatc aggccggtag cacaccttgt 1440
aatggtgttg aaggttttaa ttgttacttt cctttacaat catatggttt ccaacccact 1500
aatggtgttg gttaccaacc atacagagta gtagtacttt cttttgaact tctacatgca 1560
ccagcaactg tttgtggacc taaaaagtct actaatttgg ttaaaaacaa atgtgtcaat 1620
ttcaacttca atggtttaac aggcacaggt gttcttactg agtctaacaa aaagtttctg 1680
cctttccaac aatttggcag agacattgct gacactactg atgctgtccg tgatccacag 1740
acacttgaga ttcttgacat tacaccatgt tcttttggtg gtgtcagtgt tataacacca 1800
ggaacaaata cttctaacca ggttgctgtt ctttatcagg atgttaactg cacagaagtc 1860
cctgttgcta ttcatgcaga tcaacttact cctacttggc gtgtttattc tacaggttct 1920
aatgtttttc aaacacgtgc aggctgttta ataggggctg aacatgtcaa caactcatat 1980
gagtgtgaca tacccattgg tgcaggtata tgcgctagtt atcagactca gactaattct 2040
cctcggcggg cacgtagtgt agctagtcaa tccatcattg cctacactat gtcacttggt 2100
gcagaaaatt cagttgctta ctctaataac tctattgcca tacccacaaa ttttactatt 2160
agtgttacca cagaaattct accagtgtct atgaccaaga catcagtaga ttgtacaatg 2220
tacatttgtg gtgattcaac tgaatgcagc aatcttttgt tgcaatatgg cagtttttgt 2280
acacaattaa accgtgcttt aactggaata gctgttgaac aagacaaaaa cacccaagaa 2340
gtttttgcac aagtcaaaca aatttacaaa acaccaccaa ttaaagattt tggtggtttt 2400
aatttttcac aaatattacc agatccatca aaaccaagca agaggtcatt tattgaagat 2460
ctacttttca acaaagtgac acttgcagat gctggcttca tcaaacaata tggtgattgc 2520
cttggtgata ttgctgctag agacctcatt tgtgcacaaa agtttaacgg ccttactgtt 2580
ttgccacctt tgctcacaga tgaaatgatt gctcaataca cttctgcact gttagcgggt 2640
acaatcactt ctggttggac ctttggtgca ggtgctgcat tacaaatacc atttgctatg 2700
caaatggctt ataggtttaa tggtattgga gttacacaga atgttctcta tgagaaccaa 2760
aaattgattg ccaaccaatt taatagtgct attggcaaaa ttcaagactc actttcttcc 2820
acagcaagtg cacttggaaa acttcaagat gtggtcaacc aaaatgcaca agctttaaac 2880
acgcttgtta aacaacttag ctccaatttt ggtgcaattt caagtgtttt aaatgatatc 2940
ctttcacgtc ttgacaaagt tgaggctgaa gtgcaaattg ataggttgat cacaggcaga 3000
cttcaaagtt tgcagacata tgtgactcaa caattaatta gagctgcaga aatcagagct 3060
tctgctaatc ttgctgctac taaaatgtca gagtgtgtac ttggacaatc aaaaagagtt 3120
gatttttgtg gaaagggcta tcatcttatg tccttccctc agtcagcacc tcatggtgta 3180
gtcttcttgc atgtgactta tgtccctgca caagaaaaga acttcacaac tgctcctgcc 3240
atttgtcatg atggaaaagc acactttcct cgtgaaggtg tctttgtttc aaatggcaca 3300
cactggtttg taacacaaag gaatttttat gaaccacaaa tcattactac agacaacaca 3360
tttgtgtctg gtaactgtga tgttgtaata ggaattgtca acaacacagt ttatgatcct 3420
ttgcaacctg aattagactc attcaaggag gagttagata aatattttaa gaatcataca 3480
tcaccagatg ttgatttagg tgacatctct ggcattaatg cttcagttgt aaacattcaa 3540
aaagaaattg accgcctcaa tgaggttgcc aagaatttaa atgaatctct catcgatctc 3600
caagaacttg gaaagtatga gcagtatata aaatggccat ggtacatttg gctaggtttt 3660
atagctggct tgattgccat agtaatggtg acaattatgc tttgctgtat gaccagttgc 3720
tgtagttgtc tcaagggctg ttgttcttgt ggatcctgct gcaaatttga tgaagacgac 3780
tctgagccag tgctcaaagg agtcaaatta cattacaca 3819
<210> 3
<211> 3819
<212> DNA
<213> Artificial Sequence
<220>
<223> S蛋白基因优化后的序列(S-GS)
<400> 3
atgttcgtct tcctggtcct gctgcctctg gtctcctcac agtgcgtcaa tctgacaact 60
cggactcagc tgccacctgc ttatactaat agcttcacca gaggcgtgta ctatcctgac 120
aaggtgttta gaagctccgt gctgcactct acacaggatc tgtttctgcc attctttagc 180
aacgtgacct ggttccacgc catccacgtg agcggcacca atggcacaaa gcggttcgac 240
aatcccgtgc tgccttttaa cgatggcgtg tacttcgcct ctaccgagaa gagcaacatc 300
atcagaggct ggatctttgg caccacactg gactccaaga cacagtctct gctgatcgtg 360
aacaatgcca ccaacgtggt catcaaggtg tgcgagttcc agttttgtaa tgatcccttc 420
ctgggcgtgt actatcacaa gaacaataag agctggatgg agtccgagtt tagagtgtat 480
tctagcgcca acaactgcac atttgagtac gtgagccagc ctttcctgat ggacctggag 540
ggcaagcagg gcaatttcaa gaacctgagg gagttcgtgt ttaagaatat cgacggctac 600
ttcaaaatct actctaagca cacccccatc aacctggtgc gcgacctgcc tcagggcttc 660
agcgccctgg agcccctggt ggatctgcct atcggcatca acatcacccg gtttcagaca 720
ctgctggccc tgcacagaag ctacctgaca cccggcgact cctctagcgg atggaccgcc 780
ggcgctgccg cctactatgt gggctacctc cagccccgga ccttcctgct gaagtacaac 840
gagaatggca ccatcacaga cgcagtggat tgcgccctgg accccctgag cgagacaaag 900
tgtacactga agtcctttac cgtggagaag ggcatctatc agacatccaa tttcagggtg 960
cagccaaccg agtctatcgt gcgctttcct aatatcacaa acctgtgccc atttggcgag 1020
gtgttcaacg caacccgctt cgccagcgtg tacgcctgga ataggaagcg gatcagcaac 1080
tgcgtggccg actatagcgt gctgtacaac tccgcctctt tcagcacctt taagtgctat 1140
ggcgtgtccc ccacaaagct gaatgacctg tgctttacca acgtctacgc cgattctttc 1200
gtgatcaggg gcgacgaggt gcgccagatc gcccccggcc agacaggcaa gatcgcagac 1260
tacaattata agctgccaga cgatttcacc ggctgcgtga tcgcctggaa cagcaacaat 1320
ctggattcca aagtgggcgg caactacaat tatctgtacc ggctgtttag aaagagcaat 1380
ctgaagccct tcgagaggga catctctaca gaaatctacc aggccggcag caccccttgc 1440
aatggcgtgg agggctttaa ctgttatttc ccactccagt cctacggctt ccagcccaca 1500
aacggcgtgg gctatcagcc ttaccgcgtg gtggtgctga gctttgagct gctgcacgcc 1560
ccagcaacag tgtgcggccc caagaagtcc accaatctgg tgaagaacaa gtgcgtgaac 1620
ttcaacttca acggcctgac cggcacaggc gtgctgaccg agtccaacaa gaagttcctg 1680
ccatttcagc agttcggcag ggacatcgca gataccacag acgccgtgcg cgacccacag 1740
accctggaga tcctggacat cacaccctgc tctttcggcg gcgtgagcgt gatcacaccc 1800
ggcaccaata caagcaacca ggtggccgtg ctgtatcagg acgtgaattg taccgaggtg 1860
cccgtggcta tccacgccga tcagctgacc ccaacatggc gggtgtacag caccggctcc 1920
aacgtcttcc agacaagagc cggatgcctg atcggagcag agcacgtgaa caattcctat 1980
gagtgcgaca tcccaatcgg cgccggcatc tgtgcctctt accagaccca gacaaactct 2040
cccagaagag cccggagcgt ggcctcccag tctatcatcg cctataccat gtccctgggc 2100
gccgagaaca gcgtggccta ctctaacaat agcatcgcca tcccaaccaa cttcacaatc 2160
tctgtgacca cagagatcct gcccgtgtcc atgaccaaga catctgtgga ctgcacaatg 2220
tatatctgtg gcgattctac cgagtgcagc aacctgctgc tccagtacgg cagcttttgt 2280
acccagctga atagagccct gacaggcatc gccgtggagc aggataagaa cacacaggag 2340
gtgttcgccc aggtgaagca aatctacaag acccccccta tcaaggactt tggcggcttc 2400
aatttttccc agatcctgcc tgatccatcc aagccttcta agcggagctt tatcgaggac 2460
ctgctgttca acaaggtgac cctggccgat gccggcttca tcaagcagta tggcgattgc 2520
ctgggcgaca tcgcagccag ggacctgatc tgcgcccaga agtttaatgg cctgaccgtg 2580
ctgccacccc tgctgacaga tgagatgatc gcacagtaca caagcgccct gctggccggc 2640
accatcacat ccggatggac cttcggcgca ggagccgccc tccagatccc ctttgccatg 2700
cagatggcct ataggttcaa cggcatcggc gtgacccaga atgtgctgta cgagaaccag 2760
aagctgatcg ccaatcagtt taactccgcc atcggcaaga tccaggacag cctgtcctct 2820
acagccagcg ccctgggcaa gctccaggat gtggtgaatc agaacgccca ggccctgaat 2880
accctggtga agcagctgag cagcaacttc ggcgccatct ctagcgtgct gaatgacatc 2940
ctgagccggc tggacaaggt ggaggcagag gtgcagatcg accggctgat caccggccgg 3000
ctccagagcc tccagaccta tgtgacacag cagctgatca gggccgccga gatcagggcc 3060
agcgccaatc tggcagcaac caagatgtcc gagtgcgtgc tgggccagtc taagagagtg 3120
gacttttgtg gcaagggcta tcacctgatg tccttccctc agtctgcccc acacggcgtg 3180
gtgtttctgc acgtgaccta cgtgcccgcc caggagaaga acttcaccac agcccctgcc 3240
atctgccacg atggcaaggc ccactttcca agggagggcg tgttcgtgtc caacggcacc 3300
cactggtttg tgacacagcg caatttctac gagccccaga tcatcaccac agacaacacc 3360
ttcgtgagcg gcaactgtga cgtggtcatc ggcatcgtga acaataccgt gtatgatcca 3420
ctccagcccg agctggacag ctttaaggag gagctggata agtatttcaa gaatcacacc 3480
tcccctgacg tggatctggg cgacatcagc ggcatcaatg cctccgtggt gaacatccag 3540
aaggagatcg accgcctgaa cgaggtggct aagaatctga acgagagcct gatcgacctc 3600
caggagctgg gcaagtatga gcagtacatc aagtggccct ggtacatctg gctgggcttc 3660
atcgccggcc tgatcgccat cgtgatggtg accatcatgc tgtgctgtat gacatcctgc 3720
tgttcttgcc tgaagggctg ctgtagctgt ggctcctgct gtaagtttga cgaggatgac 3780
tctgaacctg tgctgaaggg cgtgaagctg cattacacc 3819
<210> 4
<211> 3819
<212> DNA
<213> Artificial Sequence
<220>
<223> S蛋白基因优化后的序列 (SBL)
<400> 4
atgttcgttt tcctcgttct gctgcctctt gtcagctctc agtgtgtgaa cctgacaact 60
agaacacaac tacctcccgc ctacacaaac tctttcaccc ggggcgtgta ctacccagac 120
aaagtgttca ggagctctgt gttgcacagc acccaagacc tgtttttgcc attctttagt 180
aatgtgacct ggtttcacgc tatccatgtg tcgggcacca acgggaccaa aagattcgac 240
aaccccgttc tgccgttcaa cgacggcgtg tacttcgcta gcactgagaa gtccaacatt 300
attcgcgggt ggatcttcgg aactaccttg gactccaaaa cacagtctct actcatcgtg 360
aacaacgcga ctaacgtggt gattaaggtg tgtgaatttc agttctgcaa tgatccattt 420
ttaggagtgt actaccacaa aaataataaa tcatggatgg agtctgaatt tcgcgtatac 480
agtagcgcta ataactgtac attcgaatat gttagccaac cctttttgat ggacttagag 540
gggaagcagg gaaattttaa gaatttgcga gaatttgtgt tcaaaaatat cgatgggtat 600
ttcaagatct actccaagca tactcccata aatctggtgc gcgacttacc tcaagggttc 660
agcgcactgg agccactggt agacctgcca atcggcatca acatcacccg attccagacc 720
ctgcttgctc tgcaccgttc atatctgaca ccaggagatt cgtcttccgg atggacagca 780
ggggccgctg cttactatgt tggttatctt cagcctcgga cctttctgct caagtataat 840
gagaatggga ccattaccga cgctgttgat tgtgctctcg atcccctgtc agaaaccaag 900
tgcacactaa aatctttcac agtcgaaaag gggatctacc agacttctaa ctttcgtgta 960
cagcccaccg agagcatcgt caggttccca aatatcacta acctgtgtcc ttttggcgag 1020
gtgttcaacg ctacaagatt tgctagcgtg tacgcctgga acagaaaaag aatatcaaat 1080
tgcgtagccg attacagcgt cttatataac tctgcatcct tctcaacttt caagtgttat 1140
ggagtgagcc cgactaagct gaatgatttg tgctttacaa atgtttatgc cgattcattc 1200
gtgatccggg gcgacgaggt cagacagatc gcccctggcc aaacaggtaa gattgctgat 1260
tacaactaca aattacctga cgattttaca ggatgcgtta tcgcttggaa ctctaacaat 1320
ctcgattcta aggtcggcgg caattacaat tatctttatc gccttttcag gaagtcaaat 1380
cttaagccat tcgagcgaga catcagtacc gagatatacc aggcggggtc caccccgtgt 1440
aacggtgtcg agggtttcaa ctgctacttt ccactgcagt cctatgggtt ccagcccacc 1500
aatggcgtgg gttaccagcc ctaccgagta gtcgtattgt cttttgagct cttgcacgcc 1560
cccgccacgg tgtgcggtcc aaagaaatca actaacttag ttaagaataa atgtgtgaat 1620
tttaacttta acggcctgac agggacagga gtcctgacag aatccaataa gaagttcctt 1680
ccctttcagc agtttggacg cgacatcgca gacaccacag acgccgtgcg tgacccccaa 1740
actctcgaaa ttctcgatat cacaccctgc agttttggcg gggtcagtgt cattacccct 1800
gggaccaata ctagtaacca ggtcgcagtg ctttaccaag atgtcaactg taccgaggtt 1860
cctgtggcta ttcacgcaga ccaactgact ccgacttggc gggtgtatag tacaggctcc 1920
aatgtgtttc agacccgggc aggctgcctg attggggccg agcatgtaaa taactcctac 1980
gagtgcgata tccccatagg tgctggaata tgtgccagtt atcagaccca gacgaactcg 2040
ccaagacgag ctaggtccgt agcctctcag agcataatcg cgtacactat gagcctgggg 2100
gccgaaaatt ccgtggcata tagcaacaac agcattgcta ttcctactaa ctttacaatt 2160
tcagtcacga cggagatcct gccagtctcc atgactaaaa cctccgtgga ctgtacgatg 2220
tacatttgtg gcgattcaac tgaatgctct aacctgctct tacagtacgg ttctttttgt 2280
acccagctga accgggcatt gacgggcatc gcagttgagc aggacaagaa tactcaggag 2340
gtgtttgcgc aagtgaagca aatttataaa actcctccca ttaaggactt tggcggtttc 2400
aacttctcgc agatcctacc tgacccatca aaacctagca agaggtcttt cattgaagac 2460
cttctgttca acaaggtcac actggctgac gccggcttca ttaaacagta cggagattgt 2520
ctaggtgata ttgcagcgcg cgatctgatt tgcgcacaga agtttaacgg cctgacggtc 2580
ttaccccctc tccttaccga cgaaatgatt gcccagtaca ccagcgccct gctcgctggc 2640
acgattacta gcggatggac atttggggcc ggcgctgccc tccagatacc atttgccatg 2700
cagatggcgt ataggtttaa cggcatagga gtaacccaga acgtgctgta cgagaaccaa 2760
aaactgatag ccaatcaatt caatagtgcc ataggaaaga tacaggacag tctcagcagc 2820
accgcgtccg ctctcggaaa gctacaagat gtggtcaacc agaacgcgca ggcattgaat 2880
acactggtga agcagctctc ctcgaatttt ggagcaatca gcagcgtgct gaatgatatc 2940
ctgtctcggc tggacaaggt tgaagccgaa gtccagatcg acaggttaat caccggtcgg 3000
ctgcagagtc tccagacata tgttacccag caactcatca gagctgccga aatacgcgcc 3060
agtgccaatc ttgcagccac taagatgtcc gagtgcgtgt tggggcaaag taaaagggtt 3120
gatttctgtg gaaaaggata tcatcttatg agtttccctc aatccgcccc tcacggagtt 3180
gtcttcctgc atgtgaccta cgtgccagcg caggagaaga acttcacgac cgcccccgcc 3240
atctgccatg atggcaaggc ccattttccc cgcgaaggag tgttcgtatc caatggcacc 3300
cactggttcg tgacgcagag aaatttttat gagccgcaaa ttatcactac cgacaacaca 3360
ttcgtttccg gcaattgcga tgtcgtaatc gggatcgtga ataatacagt ctatgatcct 3420
cttcagccag aactcgattc attcaaagag gagctggata aatatttcaa gaaccacacc 3480
tcccccgatg tggatctggg tgacatatca ggaattaacg caagcgtcgt gaacattcag 3540
aaggaaatcg acaggctcaa tgaagtagca aagaacttga atgagtctct catcgacttg 3600
caggaactcg gcaaatatga gcagtacatt aaatggccgt ggtatatctg gctaggcttt 3660
atcgccggtc tgattgcaat tgtgatggtt actatcatgt tgtgctgcat gacaagttgc 3720
tgttcatgcc ttaaaggctg ctgctcctgc gggtcatgtt gtaaattcga tgaggacgac 3780
tctgagcccg tgctgaaagg ggtgaaactg cactacacg 3819
<210> 5
<211> 3819
<212> DNA
<213> Artificial Sequence
<220>
<223> S蛋白基因优化序列3 (STF)
<400> 5
atgttcgtgt tcctggtgct gctgcctctg gtgtccagcc agtgtgtgaa cctgaccacc 60
agaacacagc tgcctccagc ctacaccaac agctttacca gaggcgtgta ctaccccgac 120
aaggtgttca gatccagcgt gctgcactct acccaggacc tgttcctgcc tttcttcagc 180
aacgtgacct ggttccacgc catccacgtg tccggcacca atggcaccaa gagattcgac 240
aaccccgtgc tgcccttcaa cgacggggtg tactttgcca gcaccgagaa gtccaacatc 300
atcagaggct ggatcttcgg caccacactg gacagcaaga cccagagcct gctgatcgtg 360
aacaacgcca ccaacgtggt catcaaagtg tgcgagttcc agttctgcaa cgaccccttc 420
ctgggcgtct actaccacaa gaacaacaag agctggatgg aaagcgagtt ccgggtgtac 480
agcagcgcca acaactgcac cttcgagtac gtgtcccagc ctttcctgat ggacctggaa 540
ggcaagcagg gcaacttcaa gaacctgcgc gagttcgtgt ttaagaacat cgacggctac 600
ttcaagatct acagcaagca cacccctatc aacctcgtgc gggatctgcc tcagggcttc 660
tctgctctgg aacccctggt ggatctgccc atcggcatca acatcacccg gtttcagaca 720
ctgctggccc tgcacagaag ctacctgaca cctggcgata gcagcagcgg atggacagct 780
ggtgccgccg cttactatgt gggctacctg cagcctagaa ccttcctgct gaagtacaac 840
gagaacggca ccatcaccga cgccgtggat tgtgctctgg atcctctgag cgagacaaag 900
tgcaccctga agtccttcac cgtggaaaag ggcatctacc agaccagcaa cttccgggtg 960
cagcccaccg aatccatcgt gcggttcccc aatatcacca atctgtgccc cttcggcgag 1020
gtgttcaatg ccaccagatt cgcctctgtg tacgcctgga accggaagcg gatcagcaat 1080
tgcgtggccg actactccgt gctgtacaac tccgccagct tcagcacctt caagtgctac 1140
ggcgtgtccc ctaccaagct gaacgacctg tgcttcacaa acgtgtacgc cgacagcttc 1200
gtgatccggg gagatgaagt gcggcagatt gcccctggac agacaggcaa gatcgccgac 1260
tacaactaca agctgcccga cgacttcacc ggctgtgtga ttgcctggaa cagcaacaac 1320
ctggactcca aagtcggcgg caactacaat tacctgtacc ggctgttccg gaagtccaat 1380
ctgaagccct tcgagcggga catctccacc gagatctatc aggccggcag caccccttgt 1440
aacggcgtgg aaggcttcaa ctgctacttc ccactgcagt cctacggctt tcagcccaca 1500
aatggcgtgg gctatcagcc ctacagagtg gtggtgctga gcttcgaact gctgcatgcc 1560
cctgccacag tgtgcggccc taagaaaagc accaatctcg tgaagaacaa atgcgtgaac 1620
ttcaacttca acggcctgac cggcaccggc gtgctgacag agagcaacaa gaagttcctg 1680
ccattccagc agtttggccg ggatatcgcc gataccacag acgccgttag agatccccag 1740
acactggaaa tcctggacat caccccttgc agcttcggcg gagtgtctgt gatcacccct 1800
ggcaccaaca ccagcaatca ggtggcagtg ctgtaccagg acgtgaactg taccgaagtg 1860
cccgtggcca ttcacgccga tcagctgaca cctacatggc gggtgtactc caccggcagc 1920
aatgtgtttc agaccagagc cggctgtctg atcggagccg agcacgtgaa caatagctac 1980
gagtgcgaca tccccatcgg cgctggcatc tgtgccagct accagacaca gacaaacagc 2040
cccagacggg ccagatctgt ggccagccag agcatcattg cctacacaat gtctctgggc 2100
gccgagaaca gcgtggccta ctccaacaac tctatcgcta tccccaccaa cttcaccatc 2160
agcgtgacca cagagatcct gcctgtgtcc atgaccaaga ccagcgtgga ctgcaccatg 2220
tacatctgcg gcgattccac cgagtgctcc aacctgctgc tgcagtacgg cagcttctgc 2280
acccagctga atagagccct gacagggatc gccgtggaac aggacaagaa cacccaagag 2340
gtgttcgccc aagtgaagca gatctacaag acccctccta tcaaggactt cggcggcttc 2400
aatttcagcc agattctgcc cgatcctagc aagcccagca agcggagctt catcgaggac 2460
ctgctgttca acaaagtgac actggccgac gccggcttca tcaagcagta tggcgattgt 2520
ctgggcgaca ttgccgccag ggatctgatt tgcgcccaga agtttaacgg actgacagtg 2580
ctgcctcctc tgctgaccga tgagatgatc gcccagtaca catctgccct gctggccggc 2640
acaatcacaa gcggctggac atttggagct ggcgccgctc tgcagatccc ctttgctatg 2700
cagatggcct accggttcaa cggcatcgga gtgacccaga atgtgctgta cgagaaccag 2760
aagctgatcg ccaaccagtt caacagcgcc atcggcaaga tccaggacag cctgagcagc 2820
acagcaagcg ccctgggaaa gctgcaggac gtggtcaacc agaatgccca ggcactgaac 2880
accctggtca agcagctgtc ctccaacttc ggcgccatca gctctgtgct gaacgatatc 2940
ctgagcagac tggacaaggt ggaagccgag gtgcagatcg acagactgat caccggaagg 3000
ctgcagtccc tgcagaccta cgttacccag cagctgatca gagccgccga gattagagcc 3060
tctgccaatc tggccgccac caagatgtct gagtgtgtgc tgggccagag caagagagtg 3120
gacttttgcg gcaagggcta ccacctgatg agcttccctc agtctgcccc tcacggcgtg 3180
gtgtttctgc acgtgacata cgtgcccgct caagagaaga atttcaccac cgctccagcc 3240
atctgccacg acggcaaagc ccactttcct agagaaggcg tgttcgtgtc caacggcacc 3300
cattggttcg tgacccagcg gaacttctac gagccccaga tcatcaccac cgacaacacc 3360
ttcgtgtctg gcaactgcga cgtcgtgatc ggcattgtga acaataccgt gtacgaccct 3420
ctgcagcccg agctggacag cttcaaagag gaactggata agtactttaa gaaccacaca 3480
agccccgacg tggacctggg cgatatcagc ggaatcaatg ccagcgtcgt gaacatccag 3540
aaagagatcg accggctgaa cgaggtggcc aagaatctga acgagagcct gatcgacctg 3600
caagaactgg ggaagtacga gcagtacatc aagtggccct ggtacatctg gctgggcttt 3660
atcgccggac tgattgccat cgtgatggtc acaatcatgc tgtgttgcat gaccagctgc 3720
tgtagctgcc tgaagggctg ttgtagctgt ggcagctgct gcaagttcga cgaggacgat 3780
tctgagcccg tgctgaaggg cgtgaaactg cactacaca 3819
<210> 6
<211> 75
<212> PRT
<213> SARS-COV-2
<400> 6
Met Tyr Ser Phe Val Ser Glu Glu Thr Gly Thr Leu Ile Val Asn Ser
1 5 10 15
Val Leu Leu Phe Leu Ala Phe Val Val Phe Leu Leu Val Thr Leu Ala
20 25 30
Ile Leu Thr Ala Leu Arg Leu Cys Ala Tyr Cys Cys Asn Ile Val Asn
35 40 45
Val Ser Leu Val Lys Pro Ser Phe Tyr Val Tyr Ser Arg Val Lys Asn
50 55 60
Leu Asn Ser Ser Arg Val Pro Asp Leu Leu Val
65 70 75
<210> 7
<211> 228
<212> DNA
<213> SARS-COV-2
<400> 7
atgtactcat tcgtttcgga agagacaggt acgttaatag ttaatagcgt acttcttttt 60
cttgctttcg tggtattctt gctagttaca ctagccatcc ttactgcgct tcgattgtgt 120
gcgtactgct gcaatattgt taacgtgagt cttgtaaaac cttcttttta cgtttactct 180
cgtgttaaaa atctgaattc ttctagagtt cctgatcttc tggtctaa 228
<210> 8
<211> 225
<212> DNA
<213> Artificial Sequence
<220>
<223> E蛋白基因优化序列(EBL)
<400> 8
atgtacagct ttgtctcaga ggaaaccggc acgctgattg taaacagcgt gttactattc 60
ctcgccttcg ttgtgtttct ccttgttaca ctggcaatac tgactgccct gcggttgtgc 120
gcttactgct gtaatatcgt gaacgtgtct ttggtgaagc ccagtttcta tgtatattcc 180
agagtcaaaa atctcaactc ctctagggtg cctgacctgc ttgtc 225
<210> 9
<211> 222
<212> PRT
<213> SARS-COV-2
<400> 9
Met Ala Asp Ser Asn Gly Thr Ile Thr Val Glu Glu Leu Lys Lys Leu
1 5 10 15
Leu Glu Gln Trp Asn Leu Val Ile Gly Phe Leu Phe Leu Thr Trp Ile
20 25 30
Cys Leu Leu Gln Phe Ala Tyr Ala Asn Arg Asn Arg Phe Leu Tyr Ile
35 40 45
Ile Lys Leu Ile Phe Leu Trp Leu Leu Trp Pro Val Thr Leu Ala Cys
50 55 60
Phe Val Leu Ala Ala Val Tyr Arg Ile Asn Trp Ile Thr Gly Gly Ile
65 70 75 80
Ala Ile Ala Met Ala Cys Leu Val Gly Leu Met Trp Leu Ser Tyr Phe
85 90 95
Ile Ala Ser Phe Arg Leu Phe Ala Arg Thr Arg Ser Met Trp Ser Phe
100 105 110
Asn Pro Glu Thr Asn Ile Leu Leu Asn Val Pro Leu His Gly Thr Ile
115 120 125
Leu Thr Arg Pro Leu Leu Glu Ser Glu Leu Val Ile Gly Ala Val Ile
130 135 140
Leu Arg Gly His Leu Arg Ile Ala Gly His His Leu Gly Arg Cys Asp
145 150 155 160
Ile Lys Asp Leu Pro Lys Glu Ile Thr Val Ala Thr Ser Arg Thr Leu
165 170 175
Ser Tyr Tyr Lys Leu Gly Ala Ser Gln Arg Val Ala Gly Asp Ser Gly
180 185 190
Phe Ala Ala Tyr Ser Arg Tyr Arg Ile Gly Asn Tyr Lys Leu Asn Thr
195 200 205
Asp His Ser Ser Ser Ser Asp Asn Ile Ala Leu Leu Val Gln
210 215 220
<210> 10
<211> 669
<212> DNA
<213> SARS-COV-2
<400> 10
atggcagatt ccaacggtac tattaccgtt gaagagctta aaaagctcct tgaacaatgg 60
aacctagtaa taggtttcct attccttaca tggatttgtc ttctacaatt tgcctatgcc 120
aacaggaata ggtttttgta tataattaag ttaattttcc tctggctgtt atggccagta 180
actttagctt gttttgtgct tgctgctgtt tacagaataa attggatcac cggtggaatt 240
gctatcgcaa tggcttgtct tgtaggcttg atgtggctca gctacttcat tgcttctttc 300
agactgtttg cgcgtacgcg ttccatgtgg tcattcaatc cagaaactaa cattcttctc 360
aacgtgccac tccatggcac tattctgacc agaccgcttc tagaaagtga actcgtaatc 420
ggagctgtga tccttcgtgg acatcttcgt attgctggac accatctagg acgctgtgac 480
atcaaggacc tgcctaaaga aatcactgtt gctacatcac gaacgctttc ttattacaaa 540
ttgggagctt cgcagcgtgt agcaggtgac tcaggttttg ctgcatacag tcgctacagg 600
attggcaact ataaattaaa cacagaccat tccagtagca gtgacaatat tgctttgctt 660
gtacagtaa 669
<210> 11
<211> 669
<212> DNA
<213> Artificial Sequence
<220>
<223> M蛋白基因优化序列MBL
<400> 11
atggcagatt ccaacggtac aattaccgtc gaagagctga aaaagctcct tgagcagtgg 60
aacctggtca tagggttcct attcctgaca tggatttgcc tgctgcaatt tgcctatgcc 120
aacaggaata ggtttttgta tataatcaag ctgattttcc tctggctgtt atggccagtg 180
accctggcct gttttgtgct tgccgctgtt tacagaataa attggatcac cggcggaatc 240
gccatcgcaa tggcttgcct tgtaggcttg atgtggctca gctacttcat tgcttctttc 300
cggctgtttg cgcgaacgcg gtccatgtgg tctttcaatc cggagactaa catactcctc 360
aatgtgcccc tccatggcac tattctgacc agacccctgc tagagagtga actcgtcatc 420
ggagctgtga tcctgcgggg gcacctgaga atcgccggac accacttagg ccgctgtgac 480
atcaaggatc tgcctaaaga aatcactgtt gccacatcac gaaccctttc ttattacaag 540
ttgggggcct cgcagcgtgt ggcaggagac tcaggttttg cggcatacag tcgctacagg 600
attggcaact ataaattaaa cacagaccat tccagcagca gcgataatat tgctttgctt 660
gtgcagtga 669
<210> 12
<211> 419
<212> PRT
<213> SARS-COV-2
<400> 12
Met Ser Asp Asn Gly Pro Gln Asn Gln Arg Asn Ala Pro Arg Ile Thr
1 5 10 15
Phe Gly Gly Pro Ser Asp Ser Thr Gly Ser Asn Gln Asn Gly Glu Arg
20 25 30
Ser Gly Ala Arg Ser Lys Gln Arg Arg Pro Gln Gly Leu Pro Asn Asn
35 40 45
Thr Ala Ser Trp Phe Thr Ala Leu Thr Gln His Gly Lys Glu Asp Leu
50 55 60
Lys Phe Pro Arg Gly Gln Gly Val Pro Ile Asn Thr Asn Ser Ser Pro
65 70 75 80
Asp Asp Gln Ile Gly Tyr Tyr Arg Arg Ala Thr Arg Arg Ile Arg Gly
85 90 95
Gly Asp Gly Lys Met Lys Asp Leu Ser Pro Arg Trp Tyr Phe Tyr Tyr
100 105 110
Leu Gly Thr Gly Pro Glu Ala Gly Leu Pro Tyr Gly Ala Asn Lys Asp
115 120 125
Gly Ile Ile Trp Val Ala Thr Glu Gly Ala Leu Asn Thr Pro Lys Asp
130 135 140
His Ile Gly Thr Arg Asn Pro Ala Asn Asn Ala Ala Ile Val Leu Gln
145 150 155 160
Leu Pro Gln Gly Thr Thr Leu Pro Lys Gly Phe Tyr Ala Glu Gly Ser
165 170 175
Arg Gly Gly Ser Gln Ala Ser Ser Arg Ser Ser Ser Arg Ser Arg Asn
180 185 190
Ser Ser Arg Asn Ser Thr Pro Gly Ser Ser Arg Gly Thr Ser Pro Ala
195 200 205
Arg Met Ala Gly Asn Gly Gly Asp Ala Ala Leu Ala Leu Leu Leu Leu
210 215 220
Asp Arg Leu Asn Gln Leu Glu Ser Lys Met Ser Gly Lys Gly Gln Gln
225 230 235 240
Gln Gln Gly Gln Thr Val Thr Lys Lys Ser Ala Ala Glu Ala Ser Lys
245 250 255
Lys Pro Arg Gln Lys Arg Thr Ala Thr Lys Ala Tyr Asn Val Thr Gln
260 265 270
Ala Phe Gly Arg Arg Gly Pro Glu Gln Thr Gln Gly Asn Phe Gly Asp
275 280 285
Gln Glu Leu Ile Arg Gln Gly Thr Asp Tyr Lys His Trp Pro Gln Ile
290 295 300
Ala Gln Phe Ala Pro Ser Ala Ser Ala Phe Phe Gly Met Ser Arg Ile
305 310 315 320
Gly Met Glu Val Thr Pro Ser Gly Thr Trp Leu Thr Tyr Thr Gly Ala
325 330 335
Ile Lys Leu Asp Asp Lys Asp Pro Asn Phe Lys Asp Gln Val Ile Leu
340 345 350
Leu Asn Lys His Ile Asp Ala Tyr Lys Thr Phe Pro Pro Thr Glu Pro
355 360 365
Lys Lys Asp Lys Lys Lys Lys Ala Asp Glu Thr Gln Ala Leu Pro Gln
370 375 380
Arg Gln Lys Lys Gln Gln Thr Val Thr Leu Leu Pro Ala Ala Asp Leu
385 390 395 400
Asp Asp Phe Ser Lys Gln Leu Gln Gln Ser Met Ser Ser Ala Asp Ser
405 410 415
Thr Gln Ala
<210> 13
<211> 1257
<212> DNA
<213> Artificial Sequence
<220>
<223> N蛋白基因优化序列NBL
<400> 13
atgtcagata acggaccgca gaaccaaagg aacgcccctc ggatcacttt cgggggtcct 60
agcgacagca ctgggtctaa ccaaaatgga gaacgttccg gcgcaagatc caaacagagg 120
aggcctcagg ggcttcctaa caatacagcc tcctggttca cagctctcac acagcatggc 180
aaggaagacc tgaagtttcc tagaggccag ggggttccca tcaatactaa ctcctcccca 240
gacgatcaga ttggttatta tcggcgggct accaggcgga tccggggcgg agacggtaag 300
atgaaggacc tctctccccg ttggtacttt tactacctcg gtacaggccc cgaggctggg 360
cttccgtatg gcgccaataa ggatggaata atttgggtgg ctacggaagg ggccctcaac 420
acaccgaagg atcacattgg cacccgtaat cccgcgaata atgccgccat tgtcctgcag 480
ttgccccagg ggacgacgtt gcccaaaggc ttttacgcag aaggatcgcg cggaggatcc 540
caagcctcca gccgatcaag ctctcgatct cggaactcaa gtcgcaatag cacaccaggg 600
tcttctcgcg ggaccagccc tgcaaggatg gccggaaacg gcggtgatgc tgctttagcg 660
ctgctgctgc tggatagact gaaccaatta gagagtaaaa tgtcaggtaa aggccagcaa 720
cagcaggggc agacagtgac caaaaaaagt gcggccgagg ccagcaagaa accccgccag 780
aaacgaacag ccactaaagc ctacaacgta acccaagcat tcggaaggag aggaccagag 840
cagacccaag gcaattttgg cgatcaagag ctgatccgcc aggggacgga ctataagcat 900
tggccacaga tcgcccagtt cgcacccagt gcttcagcct tcttcggaat gtcgagaatc 960
ggtatggagg tcactccttc tggcacttgg ctgacttata ccggcgcaat aaagctagac 1020
gacaaagacc ctaactttaa ggatcaggtg atcctgctaa ataaacacat tgatgcgtac 1080
aaaacattcc caccaactga gccaaagaag gacaagaaga agaaggcaga tgaaacccag 1140
gctttgcccc agagacagaa aaagcagcag accgtgacct tgctgccagc agccgacctc 1200
gacgattttt caaagcaact tcagcagtcc atgagtagcg ctgacagcac ccaggct 1257
<210> 14
<211> 1257
<212> DNA
<213> SARS-COV-2
<400> 14
atgtctgata atggacccca aaatcagcga aatgcacccc gcattacgtt tggtggaccc 60
tcagattcaa ctggcagtaa ccagaatgga gaacgcagtg gggcgcgatc aaaacaacgt 120
cggccccaag gtttacccaa taatactgcg tcttggttca ccgctctcac tcaacatggc 180
aaggaagacc ttaaattccc tcgaggacaa ggcgttccaa ttaacaccaa tagcagtcca 240
gatgaccaaa ttggctacta ccgaagagct accagacgaa ttcgtggtgg tgacggtaaa 300
atgaaagatc tcagtccaag atggtatttc tactacctag gaactgggcc agaagctgga 360
cttccctatg gtgctaacaa agacggcatc atatgggttg caactgaggg agccttgaat 420
acaccaaaag atcacattgg cacccgcaat cctgctaaca atgctgcaat cgtgctacaa 480
cttcctcaag gaacaacatt gccaaaaggc ttctacgcag aagggagcag aggcggcagt 540
caagcctctt ctcgttcctc atcacgtagt cgcaacagtt caagaaattc aactccaggc 600
agcagtaggg gaacttctcc tgctagaatg gctggcaatg gcggtgatgc tgctcttgct 660
ttgctgctgc ttgacagatt gaaccagctt gagagcaaaa tgtctggtaa aggccaacaa 720
caacaaggcc aaactgtcac taagaaatct gctgctgagg cttctaagaa gcctcggcaa 780
aaacgtactg ccactaaagc atacaatgta acacaagctt tcggcagacg tggtccagaa 840
caaacccaag gaaattttgg ggaccaggaa ctaatcagac aaggaactga ttacaaacat 900
tggccgcaaa ttgcacaatt tgcccccagc gcttcagcgt tcttcggaat gtcgcgcatt 960
ggcatggaag tcacaccttc gggaacgtgg ttgacctaca caggtgccat caaattggat 1020
gacaaagatc caaatttcaa agatcaagtc attttgctga ataagcatat tgacgcatac 1080
aaaacattcc caccaacaga gcctaaaaag gacaaaaaga agaaggctga tgaaactcaa 1140
gccttaccgc agagacagaa gaaacagcaa actgtgactc ttcttcctgc tgcagatttg 1200
gatgatttct ccaaacaatt gcaacaatcc atgagcagtg ctgactcaac tcaggcc 1257
<210> 15
<211> 46
<212> DNA
<213> Artificial Sequence
<220>
<223> 5'UTR
<400> 15
ggaaataaga gagaaaagaa gagtaagaag aaatataaga gccacc 46
<210> 16
<211> 110
<212> DNA
<213> Artificial Sequence
<220>
<223> 3'UTR-1
<400> 16
gctggagcct cggtggccat gcttcttgcc ccttgggcct ccccccagcc cctcctcccc 60
ttcctgcacc cgtacccccg tggtctttga ataaagtctg agtgggcggc 110
<210> 17
<211> 109
<212> DNA
<213> Artificial Sequence
<220>
<223> 3'UTR-2
<400> 17
gcggccgctt aattaagctg ccttctgcgg ggcttgcctt ctggccatgc ccttcttctc 60
tcccttgcac ctgtacctct tggtctttga ataaagcctg agtaggaag 109
<210> 18
<211> 3819
<212> RNA
<213> Artificial Sequence
<220>
<223> S蛋白基因优化后的mRNA序列1(S-GS mRNA)
<400> 18
uacaagcaga aggaccagga cgacggagac cagaggagug ucacgcaguu agacuguuga 60
gccugagucg acgguggacg aauaugauua ucgaaguggu cuccgcacau gauaggacug 120
uuccacaaau cuucgaggca cgacgugaga uguguccuag acaaagacgg uaagaaaucg 180
uugcacugga ccaaggugcg guaggugcac ucgccguggu uaccguguuu cgccaagcug 240
uuagggcacg acggaaaauu gcuaccgcac augaagcgga gauggcucuu cucguuguag 300
uagucuccga ccuagaaacc guggugugac cugagguucu gugucagaga cgacuagcac 360
uuguuacggu gguugcacca guaguuccac acgcucaagg ucaaaacauu acuagggaag 420
gacccgcaca ugauaguguu cuuguuauuc ucgaccuacc ucaggcucaa aucucacaua 480
agaucgcggu uguugacgug uaaacucaug cacucggucg gaaaggacua ccuggaccuc 540
ccguucgucc cguuaaaguu cuuggacucc cucaagcaca aauucuuaua gcugccgaug 600
aaguuuuaga ugagauucgu guggggguag uuggaccacg cgcuggacgg agucccgaag 660
ucgcgggacc ucggggacca ccuagacgga uagccguagu uguagugggc caaagucugu 720
gacgaccggg acgugucuuc gauggacugu gggccgcuga ggagaucgcc uaccuggcgg 780
ccgcgacggc ggaugauaca cccgauggag gucggggccu ggaaggacga cuucauguug 840
cucuuaccgu gguagugucu gcgucaccua acgcgggacc ugggggacuc gcucuguuuc 900
acaugugacu ucaggaaaug gcaccucuuc ccguagauag ucuguagguu aaagucccac 960
gucgguuggc ucagauagca cgcgaaagga uuauaguguu uggacacggg uaaaccgcuc 1020
cacaaguugc guugggcgaa gcggucgcac augcggaccu uauccuucgc cuagucguug 1080
acgcaccggc ugauaucgca cgacauguug aggcggagaa agucguggaa auucacgaua 1140
ccgcacaggg gguguuucga cuuacuggac acgaaauggu ugcagaugcg gcuaagaaag 1200
cacuaguccc cgcugcucca cgcggucuag cgggggccgg ucuguccguu cuagcgucug 1260
auguuaauau ucgacggucu gcuaaagugg ccgacgcacu agcggaccuu gucguuguua 1320
gaccuaaggu uucacccgcc guugauguua auagacaugg ccgacaaauc uuucucguua 1380
gacuucggga agcucucccu guagagaugu cuuuagaugg uccggccguc guggggaacg 1440
uuaccgcacc ucccgaaauu gacaauaaag ggugagguca ggaugccgaa ggucgggugu 1500
uugccgcacc cgauagucgg aauggcgcac caccacgacu cgaaacucga cgacgugcgg 1560
ggucguuguc acacgccggg guucuucagg ugguuagacc acuucuuguu cacgcacuug 1620
aaguugaagu ugccggacug gccguguccg cacgacuggc ucagguuguu cuucaaggac 1680
gguaaagucg ucaagccguc ccuguagcgu cuaugguguc ugcggcacgc gcuggguguc 1740
ugggaccucu aggaccugua gugugggacg agaaagccgc cgcacucgca cuaguguggg 1800
ccgugguuau guucguuggu ccaccggcac gacauagucc ugcacuuaac auggcuccac 1860
gggcaccgau aggugcggcu agucgacugg gguuguaccg cccacauguc guggccgagg 1920
uugcagaagg ucuguucucg gccuacggac uagccucguc ucgugcacuu guuaaggaua 1980
cucacgcugu aggguuagcc gcggccguag acacggagaa uggucugggu cuguuugaga 2040
gggucuucuc gggccucgca ccggaggguc agauaguagc ggauauggua cagggacccg 2100
cggcucuugu cgcaccggau gagauuguua ucguagcggu aggguugguu gaaguguuag 2160
agacacuggu gucucuagga cgggcacagg uacugguucu guagacaccu gacguguuac 2220
auauagacac cgcuaagaug gcucacgucg uuggacgacg aggucaugcc gucgaaaaca 2280
ugggucgacu uaucucggga cuguccguag cggcaccucg uccuauucuu guguguccuc 2340
cacaagcggg uccacuucgu uuagauguuc ugggggggau aguuccugaa accgccgaag 2400
uuaaaaaggg ucuaggacgg acuagguagg uucggaagau ucgccucgaa auagcuccug 2460
gacgacaagu uguuccacug ggaccggcua cggccgaagu aguucgucau accgcuaacg 2520
gacccgcugu agcgucgguc ccuggacuag acgcgggucu ucaaauuacc ggacuggcac 2580
gacggugggg acgacugucu acucuacuag cgugucaugu guucgcggga cgaccggccg 2640
ugguagugua ggccuaccug gaagccgcgu ccucggcggg aggucuaggg gaaacgguac 2700
gucuaccgga uauccaaguu gccguagccg cacugggucu uacacgacau gcucuugguc 2760
uucgacuagc gguuagucaa auugaggcgg uagccguucu agguccuguc ggacaggaga 2820
ugucggucgc gggacccguu cgagguccua caccacuuag ucuugcgggu ccgggacuua 2880
ugggaccacu ucgucgacuc gucguugaag ccgcgguaga gaucgcacga cuuacuguag 2940
gacucggccg accuguucca ccuccgucuc cacgucuagc uggccgacua guggccggcc 3000
gaggucucgg aggucuggau acacuguguc gucgacuagu cccggcggcu cuagucccgg 3060
ucgcgguuag accgucguug guucuacagg cucacgcacg acccggucag auucucucac 3120
cugaaaacac cguucccgau aguggacuac aggaagggag ucagacgggg ugugccgcac 3180
cacaaagacg ugcacuggau gcacgggcgg guccucuucu ugaaguggug ucggggacgg 3240
uagacggugc uaccguuccg ggugaaaggu ucccucccgc acaagcacag guugccgugg 3300
gugaccaaac acugugucgc guuaaagaug cucggggucu aguaguggug ucuguugugg 3360
aagcacucgc cguugacacu gcaccaguag ccguagcacu uguuauggca cauacuaggu 3420
gaggucgggc ucgaccuguc gaaauuccuc cucgaccuau ucauaaaguu cuuagugugg 3480
aggggacugc accuagaccc gcuguagucg ccguaguuac ggaggcacca cuuguagguc 3540
uuccucuagc uggcggacuu gcuccaccga uucuuagacu ugcucucgga cuagcuggag 3600
guccucgacc cguucauacu cgucauguag uucaccggga ccauguagac cgacccgaag 3660
uagcggccgg acuagcggua gcacuaccac ugguaguacg acacgacaua cuguaggacg 3720
acaagaacgg acuucccgac gacaucgaca ccgaggacga cauucaaacu gcuccuacug 3780
agacuuggac acgacuuccc gcacuucgac guaaugugg 3819
<210> 19
<211> 3819
<212> RNA
<213> Artificial Sequence
<220>
<223> S蛋白基因优化后的mRNA序列(SBLmRNA)
<400> 19
uacaagcaaa aggagcaaga cgacggagaa cagucgagag ucacacacuu ggacuguuga 60
ucuuguguug auggagggcg gauguguuug agaaaguggg ccccgcacau gaugggucug 120
uuucacaagu ccucgagaca caacgugucg uggguucugg acaaaaacgg uaagaaauca 180
uuacacugga ccaaagugcg auagguacac agcccguggu ugcccugguu uucuaagcug 240
uuggggcaag acggcaaguu gcugccgcac augaagcgau cgugacucuu cagguuguaa 300
uaagcgccca ccuagaagcc uugauggaac cugagguuuu gugucagaga ugaguagcac 360
uuguugcgcu gauugcacca cuaauuccac acacuuaaag ucaagacguu acuagguaaa 420
aauccucaca ugaugguguu uuuauuauuu aguaccuacc ucagacuuaa agcgcauaug 480
ucaucgcgau uauugacaug uaagcuuaua caaucgguug ggaaaaacua ccugaaucuc 540
cccuucgucc cuuuaaaauu cuuaaacgcu cuuaaacaca aguuuuuaua gcuacccaua 600
aaguucuaga ugagguucgu augaggguau uuagaccacg cgcugaaugg aguucccaag 660
ucgcgugacc ucggugacca ucuggacggu uagccguagu uguagugggc uaaggucugg 720
gacgaacgag acguggcaag uauagacugu gguccucuaa gcagaaggcc uaccugucgu 780
ccccggcgac gaaugauaca accaauagaa gucggagccu ggaaagacga guucauauua 840
cucuuacccu gguaauggcu gcgacaacua acacgagagc uaggggacag ucuuugguuc 900
acgugugauu uuagaaagug ucagcuuuuc cccuagaugg ucugaagauu gaaagcacau 960
gucggguggc ucucguagca guccaagggu uuauagugau uggacacagg aaaaccgcuc 1020
cacaaguugc gauguucuaa acgaucgcac augcggaccu ugucuuuuuc uuauaguuua 1080
acgcaucggc uaaugucgca gaauauauug agacguagga agaguugaaa guucacaaua 1140
ccucacucgg gcugauucga cuuacuaaac acgaaauguu uacaaauacg gcuaaguaag 1200
cacuaggccc cgcugcucca gucugucuag cggggaccgg uuuguccauu cuaacgacua 1260
auguugaugu uuaauggacu gcuaaaaugu ccuacgcaau agcgaaccuu gagauuguua 1320
gagcuaagau uccagccgcc guuaauguua auagaaauag cggaaaaguc cuucaguuua 1380
gaauucggua agcucgcucu guagucaugg cucuauaugg uccgccccag guggggcaca 1440
uugccacagc ucccaaaguu gacgaugaaa ggugacguca ggauacccaa ggucgggugg 1500
uuaccgcacc caauggucgg gauggcucau cagcauaaca gaaaacucga gaacgugcgg 1560
gggcggugcc acacgccagg uuucuuuagu ugauugaauc aauucuuauu uacacacuua 1620
aaauugaaau ugccggacug ucccuguccu caggacuguc uuagguuauu cuucaaggaa 1680
gggaaagucg ucaaaccugc gcuguagcgu cugugguguc ugcggcacgc acuggggguu 1740
ugagagcuuu aagagcuaua gugugggacg ucaaaaccgc cccagucaca guaaugggga 1800
cccugguuau gaucauuggu ccagcgucac gaaaugguuc uacaguugac auggcuccaa 1860
ggacaccgau aagugcgucu gguugacuga ggcugaaccg cccacauauc auguccgagg 1920
uuacacaaag ucugggcccg uccgacggac uaaccccggc ucguacauuu auugaggaug 1980
cucacgcuau agggguaucc acgaccuuau acacggucaa uagucugggu cugcuugagc 2040
gguucugcuc gauccaggca ucggagaguc ucguauuagc gcaugugaua cucggacccc 2100
cggcuuuuaa ggcaccguau aucguuguug ucguaacgau aaggaugauu gaaauguuaa 2160
agucagugcu gccucuagga cggucagagg uacugauuuu ggaggcaccu gacaugcuac 2220
auguaaacac cgcuaaguug acuuacgaga uuggacgaga augucaugcc aagaaaaaca 2280
ugggucgacu uggcccguaa cugcccguag cgucaacucg uccuguucuu augaguccuc 2340
cacaaacgcg uucacuucgu uuaaauauuu ugaggagggu aauuccugaa accgccaaag 2400
uugaagagcg ucuaggaugg acuggguagu uuuggaucgu ucuccagaaa guaacuucug 2460
gaagacaagu uguuccagug ugaccgacug cggccgaagu aauuugucau gccucuaaca 2520
gauccacuau aacgucgcgc gcuagacuaa acgcgugucu ucaaauugcc ggacugccag 2580
aaugggggag aggaauggcu gcuuuacuaa cgggucaugu ggucgcggga cgagcgaccg 2640
ugcuaaugau cgccuaccug uaaaccccgg ccgcgacggg aggucuaugg uaaacgguac 2700
gucuaccgca uauccaaauu gccguauccu cauugggucu ugcacgacau gcucuugguu 2760
uuugacuauc gguuaguuaa guuaucacgg uauccuuucu auguccuguc agagucgucg 2820
uggcgcaggc gagagccuuu cgauguucua caccaguugg ucuugcgcgu ccguaacuua 2880
ugugaccacu ucgucgagag gagcuuaaaa ccucguuagu cgucgcacga cuuacuauag 2940
gacagagccg accuguucca acuucggcuu caggucuagc uguccaauua guggccagcc 3000
gacgucucag aggucuguau acaauggguc guugaguagu cucgacggcu uuaugcgcgg 3060
ucacgguuag aacgucggug auucuacagg cucacgcaca accccguuuc auuuucccaa 3120
cuaaagacac cuuuuccuau aguagaauac ucaaagggag uuaggcgggg agugccucaa 3180
cagaaggacg uacacuggau gcacggucgc guccucuucu ugaagugcug gcgggggcgg 3240
uagacgguac uaccguuccg gguaaaaggg gcgcuuccuc acaagcauag guuaccgugg 3300
gugaccaagc acugcgucuc uuuaaaaaua cucggcguuu aauagugaug gcuguugugu 3360
aagcaaaggc cguuaacgcu acagcauuag cccuagcacu uauuauguca gauacuagga 3420
gaagucgguc uugagcuaag uaaguuucuc cucgaccuau uuauaaaguu cuuggugugg 3480
agggggcuac accuagaccc acuguauagu ccuuaauugc guucgcagca cuuguaaguc 3540
uuccuuuagc uguccgaguu acuucaucgu uucuugaacu uacucagaga guagcugaac 3600
guccuugagc cguuuauacu cgucauguaa uuuaccggca ccauauagac cgauccgaaa 3660
uagcggccag acuaacguua acacuaccaa ugauaguaca acacgacgua cuguucaacg 3720
acaaguacgg aauuuccgac gacgaggacg cccaguacaa cauuuaagcu acuccugcug 3780
agacucgggc acgacuuucc ccacuuugac gugaugugc 3819
<210> 20
<211> 3819
<212> RNA
<213> Artificial Sequence
<220>
<223> S蛋白基因优化mRNA序列3(STF mRNA)
<400> 20
uacaagcaca aggaccacga cgacggagac cacaggucgg ucacacacuu ggacuggugg 60
ucuugugucg acggaggucg gaugugguug ucgaaauggu cuccgcacau gauggggcug 120
uuccacaagu cuaggucgca cgacgugaga uggguccugg acaaggacgg aaagaagucg 180
uugcacugga ccaaggugcg guaggugcac aggccguggu uaccgugguu cucuaagcug 240
uuggggcacg acgggaaguu gcugccccac augaaacggu cguggcucuu cagguuguag 300
uagucuccga ccuagaagcc guggugugac cugucguucu gggucucgga cgacuagcac 360
uuguugcggu gguugcacca guaguuucac acgcucaagg ucaagacguu gcuggggaag 420
gacccgcaga ugaugguguu cuuguuguuc ucgaccuacc uuucgcucaa ggcccacaug 480
ucgucgcggu uguugacgug gaagcucaug cacagggucg gaaaggacua ccuggaccuu 540
ccguucgucc cguugaaguu cuuggacgcg cucaagcaca aauucuugua gcugccgaug 600
aaguucuaga ugucguucgu guggggauag uuggagcacg cccuagacgg agucccgaag 660
agacgagacc uuggggacca ccuagacggg uagccguagu uguagugggc caaagucugu 720
gacgaccggg acgugucuuc gauggacugu ggaccgcuau cgucgucgcc uaccugucga 780
ccacggcggc gaaugauaca cccgauggac gucggaucuu ggaaggacga cuucauguug 840
cucuugccgu gguaguggcu gcggcaccua acacgagacc uaggagacuc gcucuguuuc 900
acgugggacu ucaggaagug gcaccuuuuc ccguagaugg ucuggucguu gaaggcccac 960
gucggguggc uuagguagca cgccaagggg uuauaguggu uagacacggg gaagccgcuc 1020
cacaaguuac gguggucuaa gcggagacac augcggaccu uggccuucgc cuagucguua 1080
acgcaccggc ugaugaggca cgacauguug aggcggucga agucguggaa guucacgaug 1140
ccgcacaggg gaugguucga cuugcuggac acgaaguguu ugcacaugcg gcugucgaag 1200
cacuaggccc cucuacuuca cgccgucuaa cggggaccug ucuguccguu cuagcggcug 1260
auguugaugu ucgacgggcu gcugaagugg ccgacacacu aacggaccuu gucguuguug 1320
gaccugaggu uucagccgcc guugauguua auggacaugg ccgacaaggc cuucagguua 1380
gacuucggga agcucgcccu guagaggugg cucuagauag uccggccguc guggggaaca 1440
uugccgcacc uuccgaaguu gacgaugaag ggugacguca ggaugccgaa agucgggugu 1500
uuaccgcacc cgauagucgg gaugucucac caccacgacu cgaagcuuga cgacguacgg 1560
ggacgguguc acacgccggg auucuuuucg ugguuagagc acuucuuguu uacgcacuug 1620
aaguugaagu ugccggacug gccguggccg cacgacuguc ucucguuguu cuucaaggac 1680
gguaaggucg ucaaaccggc ccuauagcgg cuaugguguc ugcggcaauc ucuagggguc 1740
ugugaccuuu aggaccugua guggggaacg ucgaagccgc cucacagaca cuagugggga 1800
ccgugguugu ggucguuagu ccaccgucac gacauggucc ugcacuugac auggcuucac 1860
gggcaccggu aagugcggcu agucgacugu ggauguaccg cccacaugag guggccgucg 1920
uuacacaaag ucuggucucg gccgacagac uagccucggc ucgugcacuu guuaucgaug 1980
cucacgcugu agggguagcc gcgaccguag acacggucga uggucugugu cuguuugucg 2040
gggucugccc ggucuagaca ccggucgguc ucguaguaac ggauguguua cagagacccg 2100
cggcucuugu cgcaccggau gagguuguug agauagcgau aggggugguu gaagugguag 2160
ucgcacuggu gucucuagga cggacacagg uacugguucu ggucgcaccu gacgugguac 2220
auguagacgc cgcuaaggug gcucacgagg uuggacgacg acgucaugcc gucgaagacg 2280
ugggucgacu uaucucggga cugucccuag cggcaccuug uccuguucuu guggguucuc 2340
cacaagcggg uucacuucgu cuagauguuc uggggaggau aguuccugaa gccgccgaag 2400
uuaaagucgg ucuaagacgg gcuaggaucg uucgggucgu ucgccucgaa guagcuccug 2460
gacgacaagu uguuucacug ugaccggcug cggccgaagu aguucgucau accgcuaaca 2520
gacccgcugu aacggcgguc ccuagacuaa acgcgggucu ucaaauugcc ugacugucac 2580
gacggaggag acgacuggcu acucuacuag cgggucaugu guagacggga cgaccggccg 2640
uguuaguguu cgccgaccug uaaaccucga ccgcggcgag acgucuaggg gaaacgauac 2700
gucuaccgga uggccaaguu gccguagccu cacugggucu uacacgacau gcucuugguc 2760
uucgacuagc gguuggucaa guugucgcgg uagccguucu agguccuguc ggacucgucg 2820
ugucguucgc gggacccuuu cgacguccug caccaguugg ucuuacgggu ccgugacuug 2880
ugggaccagu ucgucgacag gagguugaag ccgcgguagu cgagacacga cuugcuauag 2940
gacucgucug accuguucca ccuucggcuc cacgucuagc ugucugacua guggccuucc 3000
gacgucaggg acgucuggau gcaauggguc gucgacuagu cucggcggcu cuaaucucgg 3060
agacgguuag accggcggug guucuacaga cucacacacg acccggucuc guucucucac 3120
cugaaaacgc cguucccgau gguggacuac ucgaagggag ucagacgggg agugccgcac 3180
cacaaagacg ugcacuguau gcacgggcga guucucuucu uaaaguggug gcgaggucgg 3240
uagacggugc ugccguuucg ggugaaagga ucucuuccgc acaagcacag guugccgugg 3300
guaaccaagc acugggucgc cuugaagaug cucggggucu aguaguggug gcuguugugg 3360
aagcacagac cguugacgcu gcagcacuag ccguaacacu uguuauggca caugcuggga 3420
gacgucgggc ucgaccuguc gaaguuucuc cuugaccuau ucaugaaauu cuuggugugu 3480
ucggggcugc accuggaccc gcuauagucg ccuuaguuac ggucgcagca cuuguagguc 3540
uuucucuagc uggccgacuu gcuccaccgg uucuuagacu ugcucucgga cuagcuggac 3600
guucuugacc ccuucaugcu cgucauguag uucaccggga ccauguagac cgacccgaaa 3660
uagcggccug acuaacggua gcacuaccag uguuaguacg acacaacgua cuggucgacg 3720
acaucgacgg acuucccgac aacaucgaca ccgucgacga cguucaagcu gcuccugcua 3780
agacucgggc acgacuuccc gcacuuugac gugaugugu 3819
<210> 21
<211> 225
<212> RNA
<213> Artificial Sequence
<220>
<223> E蛋白基因优化mRNA序列(EBL mRNA)
<400> 21
uacaugucga aacagagucu ccuuuggccg ugcgacuaac auuugucgca caaugauaag 60
gagcggaagc aacacaaaga ggaacaaugu gaccguuaug acugacggga cgccaacacg 120
cgaaugacga cauuauagca cuugcacaga aaccacuucg ggucaaagau acauauaagg 180
ucucaguuuu uagaguugag gagaucccac ggacuggacg aacag 225
<210> 22
<211> 669
<212> RNA
<213> Artificial Sequence
<220>
<223> M蛋白基因优化mRNA序列(MBL mRNA)
<400> 22
uaccgucuaa gguugccaug uuaauggcag cuucucgacu uuuucgagga acucgucacc 60
uuggaccagu aucccaagga uaaggacugu accuaaacgg acgacguuaa acggauacgg 120
uuguccuuau ccaaaaacau auauuaguuc gacuaaaagg agaccgacaa uaccggucac 180
ugggaccgga caaaacacga acggcgacaa augucuuauu uaaccuagug gccgccuuag 240
cgguagcguu accgaacgga acauccgaac uacaccgagu cgaugaagua acgaagaaag 300
gccgacaaac gcgcuugcgc cagguacacc agaaaguuag gccucugauu guaugaggag 360
uuacacgggg agguaccgug auaagacugg ucuggggacg aucucucacu ugagcaguag 420
ccucgacacu aggacgcccc cguggacucu uagcggccug uggugaaucc ggcgacacug 480
uaguuccuag acggauuucu uuagugacaa cgguguagug cuugggaaag aauaauguuc 540
aacccccgga gcgucgcaca ccguccucug aguccaaaac gccguauguc agcgaugucc 600
uaaccguuga uauuuaauuu gugucuggua aggucgucgu cgcuauuaua acgaaacgaa 660
cacgucacu 669
<210> 23
<211> 1257
<212> RNA
<213> Artificial Sequence
<220>
<223> N蛋白基因优化mRNA序列(NBL mRNA)
<400> 23
uacagucuau ugccuggcgu cuugguuucc uugcggggag ccuagugaaa gcccccagga 60
ucgcugucgu gacccagauu gguuuuaccu cuugcaaggc cgcguucuag guuugucucc 120
uccggagucc ccgaaggauu guuaugucgg aggaccaagu gucgagagug ugucguaccg 180
uuccuucugg acuucaaagg aucuccgguc ccccaagggu aguuaugauu gaggaggggu 240
cugcuagucu aaccaauaau agccgcccga ugguccgccu aggccccgcc ucugccauuc 300
uacuuccugg agagaggggc aaccaugaaa augauggagc cauguccggg gcuccgaccc 360
gaaggcauac cgcgguuauu ccuaccuuau uaaacccacc gaugccuucc ccgggaguug 420
uguggcuucc uaguguaacc gugggcauua gggcgcuuau uacggcggua acaggacguc 480
aacggggucc ccugcugcaa cggguuuccg aaaaugcguc uuccuagcgc gccuccuagg 540
guucggaggu cggcuaguuc gagagcuaga gccuugaguu cagcguuauc gugugguccc 600
agaagagcgc ccuggucggg acguuccuac cggccuuugc cgccacuacg acgaaaucgc 660
gacgacgacg accuaucuga cuugguuaau cucucauuuu acaguccauu uccggucguu 720
gucguccccg ucugucacug guuuuuuuca cgccggcucc ggucguucuu uggggcgguc 780
uuugcuuguc ggugauuucg gauguugcau uggguucgua agccuuccuc uccuggucuc 840
gucuggguuc cguuaaaacc gcuaguucuc gacuaggcgg uccccugccu gauauucgua 900
accggugucu agcgggucaa gcguggguca cgaagucgga agaagccuua cagcucuuag 960
ccauaccucc agugaggaag accgugaacc gacugaauau ggccgcguua uuucgaucug 1020
cuguuucugg gauugaaauu ccuaguccac uaggacgauu uauuugugua acuacgcaug 1080
uuuuguaagg gugguugacu cgguuucuuc cuguucuucu ucuuccgucu acuuuggguc 1140
cgaaacgggg ucucugucuu uuucgucguc uggcacugga acgacggucg ucggcuggag 1200
cugcuaaaaa guuucguuga agucgucagg uacucaucgc gacugucgug gguccga 1257
<210> 24
<211> 3819
<212> DNA
<213> Artificial Sequence
<220>
<223> SDC50
<400> 24
atgttcgtgt ttctggtgct gctgcctctg gtgtcttctc agtgtgtgaa tctgacaaca 60
agaacacagc tgcctcctgc ctacaccaac agctttacaa gaggagtgta ctaccctgac 120
aaggtgttca gaagcagcgt gctgcattct acacaggacc tgtttctgcc tttcttcagc 180
aacgtgacct ggtttcacgc cattcacgtg tctggcacaa atggaaccaa gaggttcgac 240
aatcctgtgc tgcctttcaa cgatggcgtg tactttgcct ctaccgagaa gagcaacatc 300
atcagaggct ggatctttgg caccacactg gatagcaaga cacagtctct gctgatcgtg 360
aacaatgcca ccaacgtggt gatcaaggtg tgtgagttcc agttctgcaa cgaccctttt 420
ctgggcgtgt actaccacaa gaacaacaag agctggatgg agagcgagtt cagagtgtac 480
agctctgcca acaattgcac ctttgagtac gtgagccagc ctttcctgat ggatctggaa 540
ggaaagcagg gcaatttcaa gaacctgcgg gagttcgtgt tcaagaacat cgacggctac 600
ttcaagatct acagcaagca cacccccatc aatctggtga gagatctgcc tcagggattt 660
tctgctctgg aacctctggt ggatctgcct attggcatca acatcaccag attccagaca 720
ctgctggctc tgcacagatc ttacctgaca cctggagatt cttcttctgg atggacagct 780
ggagctgctg cttattacgt gggctatctg cagcctagaa ccttcctgct gaagtacaac 840
gagaatggca ccatcacaga tgctgtggat tgtgctctgg atcctctgtc tgagaccaag 900
tgtacactga agagcttcac agtggagaag ggcatctacc agaccagcaa tttcagagtg 960
cagcctacag agagcatcgt gagattcccc aacatcacca atctgtgccc ttttggagag 1020
gtgttcaatg ccaccagatt tgcctctgtg tacgcctgga acagaaagag gatcagcaac 1080
tgtgtggccg attactctgt gctgtacaac tctgccagct ttagcacctt caagtgctac 1140
ggagtgtctc ctacaaagct gaacgacctg tgtttcacca acgtgtacgc cgatagcttc 1200
gtgattagag gcgatgaagt gagacagatt gctcctggcc agacaggaaa gatcgccgat 1260
tacaactaca agctgcctga tgacttcacc ggctgtgtga ttgcctggaa tagcaataac 1320
ctggacagca aagtgggcgg caactacaac tacctgtaca gactgttcag gaagagcaac 1380
ctgaagccct tcgagagaga catctctacc gagatttatc aggctggaag caccccttgt 1440
aatggcgtgg aaggcttcaa ctgttacttt cctctgcaga gctacggctt tcagcctacc 1500
aatggagtgg gatatcagcc ttatagagtg gtggtgctga gctttgaact gctgcatgct 1560
cctgctacag tgtgtggacc taagaagagc accaacctgg tgaagaacaa gtgcgtgaac 1620
ttcaacttca acggcctgac aggaacagga gtgctgacag agagcaataa gaagttcctg 1680
cccttccagc agtttggcag agacattgcc gatacaacag atgccgtgag agatcctcag 1740
acactggaga tcctggatat cacaccttgt agctttggcg gcgtgtctgt gattacacct 1800
ggaaccaata ccagcaatca ggtggctgtg ctgtaccagg atgtgaattg cacagaagtg 1860
cctgtggcca ttcatgctga tcagctgaca cctacatgga gagtgtacag caccggctct 1920
aatgtgtttc agaccagagc tggatgtctg attggagccg agcacgtgaa taacagctac 1980
gagtgtgaca tccctattgg agccggaatc tgtgcctctt atcagacaca gaccaactct 2040
cctagaagag ccagatctgt ggcctctcag tctatcatcg cctataccat gtctctggga 2100
gctgagaata gcgtggccta tagcaacaac agcattgcca tccctaccaa cttcaccatc 2160
agcgtgacaa cagagattct gcctgtgagc atgaccaaga catctgtgga ctgcaccatg 2220
tacatctgtg gcgattctac cgagtgtagc aatctgctgc tgcagtacgg ctctttttgt 2280
acccagctga atagagccct gacaggaatt gccgtggaac aggacaagaa tacccaggaa 2340
gtgtttgccc aggtgaagca gatctacaag acccctccta tcaaggactt tggcggcttc 2400
aacttctctc agattctgcc tgatcctagc aagcccagca agagaagttt catcgaggat 2460
ctgctgttca acaaggtgac actggccgat gccggattta tcaagcagta tggagattgt 2520
ctgggcgata tcgccgccag agatctgatt tgtgcccaga agtttaatgg actgaccgtg 2580
ctgcctcctc tgctgacaga tgagatgatt gctcagtata catctgccct gctggccgga 2640
acaatcacat ctggatggac atttggagct ggagctgctc tgcagattcc ttttgccatg 2700
cagatggcct acagattcaa tggcatcggc gtgacacaga atgtgctgta cgagaaccag 2760
aagctgattg ccaaccagtt caacagcgcc attggcaaga tccaggattc tctgtcttct 2820
acagcctctg ctctgggaaa actgcaggat gtggtgaatc agaatgccca ggccctgaat 2880
acactggtga agcagctgtc tagcaatttt ggcgccatct ctagcgtgct gaatgacatc 2940
ctgagcagac tggataaagt ggaggccgaa gtgcagatcg atagactgat cacaggcaga 3000
ctgcagtctc tgcagacata tgtgacacag cagctgatta gagctgccga gatcagagct 3060
tctgctaatc tggctgccac aaagatgtct gagtgtgtgc tgggacagtc taagagagtg 3120
gacttctgtg gcaaaggcta tcacctgatg agctttcctc agtctgctcc tcatggagtg 3180
gtgtttctgc atgtgacata tgtgcctgcc caggagaaga acttcacaac agctcctgcc 3240
atttgtcatg atggcaaggc ccactttcct agagaaggag tgttcgtgtc taatggcaca 3300
cactggttcg tgacacagag gaacttctac gagcctcaga tcatcaccac cgataacacc 3360
ttcgtgtctg gcaattgcga tgtggtgatc ggcatcgtga acaataccgt gtatgatcct 3420
ctgcagcctg agctggatag cttcaaggag gagctggaca agtacttcaa gaaccacacc 3480
tctcctgatg tggatctggg cgatatctct ggcatcaatg cctctgtggt gaacatccag 3540
aaggagatcg acagactgaa tgaggtggcc aagaacctga atgagagcct gatcgatctg 3600
caggaactgg gaaagtacga gcagtacatc aagtggcctt ggtacatctg gctgggattt 3660
attgccggac tgattgccat cgtgatggtg accatcatgc tgtgctgtat gaccagctgt 3720
tgtagctgtc tgaaaggctg ctgtagctgt ggcagctgtt gcaagtttga tgaggatgat 3780
tctgagcctg tgctgaaggg cgtgaagctg cactacacc 3819
<210> 25
<211> 3819
<212> DNA
<213> Artificial Sequence
<220>
<223> SDC54
<400> 25
atgttcgtgt tcctggtgct gctgcctctg gtgagctctc agtgtgtgaa tctgaccaca 60
agaacccagc tgcctcctgc ctacaccaac agctttacca gaggagtgta ctaccccgac 120
aaggtgttca gaagcagcgt gctgcatagc acacaggatc tgttcctgcc cttcttcagc 180
aacgtgacct ggtttcacgc catccatgtg tctggcacca atggcaccaa gagattcgac 240
aaccctgtgc tgcctttcaa cgatggcgtg tacttcgcct ctaccgagaa gagcaacatc 300
atcagaggct ggatcttcgg caccacactg gatagcaaga cccagtctct gctgatcgtg 360
aacaacgcca ccaacgtggt gatcaaggtg tgcgagttcc agttctgcaa cgaccccttc 420
ctgggcgtgt actaccacaa gaacaacaag agctggatgg agagcgagtt cagggtgtac 480
agcagcgcca acaattgcac cttcgagtac gtgagccagc ctttcctgat ggatctggag 540
ggaaagcagg gcaacttcaa gaacctgcgg gagttcgtgt tcaagaacat cgacggctac 600
ttcaagatct acagcaagca cacccccatc aacctggtga gagatctgcc tcagggattt 660
tctgctctgg agcctctggt ggatctgcct atcggcatca acatcaccag attccagaca 720
ctgctggccc tgcacagaag ctacctgaca cctggagatt cttcttctgg ctggacagct 780
ggagctgctg cctattacgt gggctatctg cagcccagaa ccttcctgct gaagtacaac 840
gagaacggca ccatcacaga tgccgtggat tgtgccctgg atcctctgtc tgagaccaag 900
tgtaccctga agagcttcac cgtggagaag ggcatctacc agaccagcaa cttcagagtg 960
cagcctaccg agagcatcgt gagattcccc aacatcacca acctgtgccc ttttggcgag 1020
gtgttcaatg ccaccagatt tgccagcgtg tacgcctgga acaggaagag gatcagcaac 1080
tgtgtggccg attacagcgt gctgtacaac tctgccagct tcagcacctt caagtgctac 1140
ggcgtgtctc ctacaaagct gaacgacctg tgcttcacca acgtgtacgc cgacagcttc 1200
gtgattagag gcgatgaggt gagacagatt gctcctggcc agacaggcaa gattgccgac 1260
tacaactaca agctgcctga cgacttcacc ggctgtgtga ttgcctggaa cagcaacaat 1320
ctggacagca aggtgggcgg caactacaac tacctgtaca ggctgttcag gaagagcaac 1380
ctgaagccct tcgagagaga catcagcacc gagatctatc aggctggaag caccccttgt 1440
aatggcgtgg agggcttcaa ctgttacttc cctctgcaga gctacggctt tcagcctacc 1500
aatggagtgg gctatcagcc ttacagagtg gtggtgctga gctttgaact gctgcatgct 1560
cctgctacag tgtgtggccc caagaagagc accaacctgg tgaagaacaa gtgcgtgaac 1620
ttcaacttca acggcctgac cggaacagga gtgctgacag agagcaacaa gaagttcctg 1680
cccttccagc agttcggcag agatatcgcc gataccacag atgccgtgag agatcctcag 1740
acactggaga tcctggacat cacaccttgc agctttggcg gagtgtctgt gatcacacct 1800
ggcaccaata ccagcaatca ggtggctgtg ctgtaccagg acgtgaattg caccgaagtg 1860
cctgtggcca ttcatgctga tcagctgacc cctacatgga gagtgtacag caccggctct 1920
aatgtgttcc agaccagagc cggatgtctg attggagccg agcacgtgaa taacagctac 1980
gagtgcgaca tccctattgg agccggcatc tgtgcctctt atcagaccca gaccaactct 2040
cctagaagag ccagaagcgt ggcctctcag agcatcattg cctacaccat gtctctggga 2100
gccgagaata gcgtggccta cagcaataac agcatcgcca tccccaccaa cttcaccatc 2160
agcgtgacca cagagattct gcctgtgagc atgaccaaga cctctgtgga ctgcaccatg 2220
tacatctgtg gcgactctac cgagtgcagc aatctgctgc tgcagtatgg cagcttttgt 2280
acccagctga acagagccct gacaggcatt gctgtggagc aggataagaa cacccaggag 2340
gtgtttgccc aggtgaagca gatctacaag acccctccca tcaaggactt cggcggcttt 2400
aacttcagcc agatcctgcc tgatcctagc aagcccagca agaggagctt tatcgaggac 2460
ctgctgttca acaaggtgac cctggccgat gctggcttta tcaagcagta cggagattgt 2520
ctgggcgata tcgccgccag agacctgatt tgtgcccaga agttcaatgg actgaccgtg 2580
ctgcctcctc tgctgacaga tgagatgatt gcccagtaca catctgccct gctggctggc 2640
acaatcacat ctggatggac atttggagct ggagctgccc tgcagatccc ttttgccatg 2700
cagatggcct acagattcaa cggcatcggc gtgacccaga atgtgctgta cgagaaccag 2760
aagctgatcg ccaaccagtt caacagcgcc atcggcaaga tccaggattc tctgtctagc 2820
acagcctctg ctctgggaaa gctgcaggat gtggtgaatc agaatgccca ggccctgaat 2880
acactggtga agcagctgag cagcaacttt ggcgccatca gctctgtgct gaatgacatc 2940
ctgagcagac tggacaaggt ggaggctgaa gtgcagatcg acagactgat cacaggcaga 3000
ctgcagtctc tgcagaccta cgtgacacag cagctgatta gagctgccga gatcagagct 3060
tctgccaatc tggctgccac caagatgtct gagtgtgtgc tgggacagag caagagagtg 3120
gacttctgtg gcaaaggcta ccacctgatg agcttccctc agtctgctcc tcatggagtg 3180
gtgtttctgc acgtgaccta tgtgcctgcc caggagaaga acttcaccac agctcctgcc 3240
atttgtcacg atggcaaggc ccactttcct agagaaggcg tgttcgtgag caatggcaca 3300
cactggttcg tgacccagag gaacttctac gagccccaga tcatcaccac cgataacacc 3360
ttcgtgagcg gcaattgcga cgtggtgatc ggcatcgtga acaataccgt gtacgatcct 3420
ctgcagcctg agctggacag cttcaaggag gagctggaca agtacttcaa gaaccacacc 3480
agccctgatg tggatctggg cgacatctct ggcatcaatg ccagcgtggt gaacatccag 3540
aaggagatcg acaggctgaa cgaggtggcc aagaacctga atgagagcct gatcgatctg 3600
caggagctgg gcaagtacga gcagtacatc aagtggcctt ggtacatctg gctgggcttt 3660
atcgccggac tgattgccat cgtgatggtg accatcatgc tgtgctgcat gaccagctgc 3720
tgtagctgtc tgaagggctg ttgtagctgt ggcagctgtt gcaagttcga cgaggatgat 3780
agcgagcctg tgctgaaagg cgtgaagctg cactacacc 3819
<210> 26
<211> 3819
<212> DNA
<213> Artificial Sequence
<220>
<223> SDC58
<400> 26
atgttcgtgt tcctggtgct gctgcccctg gtgagctctc agtgtgtgaa cctgaccacc 60
agaacccagc tgcctcctgc ctacaccaac agcttcacca gaggcgtgta ctaccccgac 120
aaggtgttca gaagcagcgt gctgcacagc acccaggacc tgttcctgcc cttcttcagc 180
aacgtgacct ggttccacgc catccacgtg tctggcacca atggcaccaa gaggttcgac 240
aaccctgtgc tgcccttcaa cgacggcgtg tacttcgcca gcaccgagaa gagcaacatc 300
atcaggggct ggatcttcgg caccaccctg gacagcaaga cccagagcct gctgatcgtg 360
aacaacgcca ccaacgtggt gatcaaggtg tgcgagttcc agttctgcaa cgaccccttc 420
ctgggcgtgt actaccacaa gaacaacaag agctggatgg agagcgagtt ccgggtgtac 480
agcagcgcca acaactgcac cttcgagtac gtgagccagc ccttcctgat ggacctggag 540
ggcaagcagg gcaacttcaa gaacctgcgg gagttcgtgt tcaagaacat cgacggctac 600
ttcaagatct acagcaagca cacccccatc aacctggtga gagacctgcc tcagggcttt 660
tctgccctgg agcctctggt ggacctgcct atcggcatca acatcaccag gttccagacc 720
ctgctggccc tgcacagaag ctacctgaca cctggcgata gctcttctgg ctggacagct 780
ggagctgctg cctattacgt gggctacctg cagcccagga ccttcctgct gaagtacaac 840
gagaacggca ccatcaccga cgccgtggat tgtgccctgg atcctctgag cgagaccaag 900
tgcaccctga agagcttcac cgtggagaag ggcatctacc agaccagcaa cttccgggtg 960
cagcctaccg agagcatcgt gaggttcccc aacatcacca acctgtgccc tttcggcgag 1020
gtgttcaacg ccaccagatt cgcctctgtg tacgcctgga acaggaagcg gatcagcaac 1080
tgcgtggccg actacagcgt gctgtacaac agcgccagct tcagcacctt caagtgctac 1140
ggcgtgagcc ctaccaagct gaacgacctg tgcttcacca acgtgtacgc cgacagcttc 1200
gtgatcagag gcgatgaggt gagacagatc gcccctggac agaccggcaa gatcgccgac 1260
tacaactaca agctgcccga cgacttcacc ggctgtgtga tcgcctggaa cagcaacaac 1320
ctggacagca aggtgggcgg caactacaac tacctgtacc ggctgttccg gaagagcaac 1380
ctgaagccct tcgagaggga catcagcacc gagatctacc aggccggaag cacaccttgc 1440
aatggcgtgg agggcttcaa ctgctacttc cccctgcaga gctacggctt tcagcctacc 1500
aatggcgtgg gctaccagcc ctacagagtg gtggtgctga gctttgaact gctgcatgcc 1560
cctgccacag tgtgtggccc caagaagagc accaacctgg tgaagaacaa gtgcgtgaac 1620
ttcaacttca acggcctgac cggcacaggc gtgctgaccg agagcaacaa gaagttcctg 1680
cccttccagc agttcggcag agacatcgcc gataccaccg atgccgtgag agatcctcag 1740
accctggaga tcctggacat caccccttgc agctttggcg gagtgagcgt gatcacacct 1800
ggcaccaaca ccagcaatca ggtggccgtg ctgtaccagg acgtgaactg cacagaggtg 1860
cctgtggcca ttcatgccga tcagctgacc cctacctgga gagtgtacag caccggcagc 1920
aatgtgttcc agaccagagc cggctgtctg atcggagccg agcacgtgaa caacagctac 1980
gagtgcgaca tccctatcgg agccggcatc tgcgcctctt accagacaca gaccaacagc 2040
cccagaagag ccagaagcgt ggccagccag tctatcatcg cctacaccat gagcctggga 2100
gccgagaaca gcgtggccta cagcaacaac agcatcgcca tccccaccaa cttcaccatc 2160
agcgtgacca ccgagatcct gcccgtgagc atgaccaaga ccagcgtgga ctgcaccatg 2220
tacatctgcg gcgacagcac agagtgcagc aacctgctgc tgcagtacgg cagcttttgc 2280
acccagctga acagagccct gacaggcatt gccgtggagc aggacaagaa cacccaggag 2340
gtgttcgccc aggtgaagca gatctacaag acccccccca tcaaggactt cggcggcttc 2400
aacttcagcc agatcctgcc tgaccctagc aagcccagca agcggagctt catcgaggac 2460
ctgctgttca acaaggtgac cctggccgat gccggcttca tcaagcagta cggcgattgt 2520
ctgggcgata tcgccgccag agacctgatc tgtgcccaga agttcaacgg cctgaccgtg 2580
ctgcctcctc tgctgacaga tgagatgatc gcccagtaca cctctgccct gctggccgga 2640
accatcacat ctggctggac atttggagct ggagccgccc tgcagatccc tttcgccatg 2700
cagatggcct acaggttcaa cggcatcggc gtgacccaga acgtgctgta cgagaaccag 2760
aagctgatcg ccaaccagtt caacagcgcc atcggcaaga tccaggacag cctgtctagc 2820
acagcctctg ctctgggcaa gctgcaggat gtggtgaacc agaatgccca ggccctgaac 2880
accctggtga agcagctgag cagcaatttc ggcgccatca gcagcgtgct gaacgacatc 2940
ctgagcagac tggacaaggt ggaggccgag gtgcagatcg acagactgat caccggcaga 3000
ctgcagagcc tgcagaccta cgtgacacag cagctgatca gagccgccga gatcagagcc 3060
tctgccaatc tggctgccac caagatgagc gagtgtgtgc tgggccagag caagagagtg 3120
gacttctgcg gcaaaggcta ccacctgatg agcttccccc agtctgctcc tcatggcgtg 3180
gtgtttctgc acgtgaccta cgtgcctgcc caggagaaga acttcaccac agcccctgcc 3240
atctgtcacg atggcaaggc ccacttccct agagagggcg tgttcgtgag caatggcacc 3300
cactggttcg tgacccagcg gaacttctac gagccccaga tcatcaccac cgacaacacc 3360
ttcgtgagcg gcaactgcga cgtggtgatc ggcatcgtga acaacaccgt gtacgaccct 3420
ctgcagcccg agctggacag cttcaaggag gagctggaca agtacttcaa gaaccacacc 3480
agccccgacg tggatctggg cgacatcagc ggcatcaacg ccagcgtggt gaacatccag 3540
aaggagatcg accggctgaa cgaggtggcc aagaacctga acgagagcct gatcgacctg 3600
caggagctgg gcaagtacga gcagtacatc aagtggccct ggtacatctg gctgggcttt 3660
atcgccggcc tgatcgccat cgtgatggtg accatcatgc tgtgctgcat gaccagctgc 3720
tgcagctgcc tgaagggctg ttgtagctgt ggcagctgct gcaagttcga cgaggacgat 3780
agcgagcctg tgctgaaggg cgtgaagctg cactacacc 3819
<210> 27
<211> 3819
<212> DNA
<213> Artificial Sequence
<220>
<223> SDC60
<400> 27
atgttcgtgt tcctggtgct gctgcccctg gtgagcagcc agtgtgtgaa cctgaccacc 60
agaacccagc tgcctcccgc ctacaccaac agcttcacca ggggcgtgta ctaccccgac 120
aaggtgttca ggagcagcgt gctgcacagc acccaggacc tgttcctgcc cttcttcagc 180
aacgtgacct ggttccacgc catccacgtg agcggcacca atggcaccaa gcggttcgac 240
aaccctgtgc tgcccttcaa cgacggcgtg tacttcgcca gcaccgagaa gagcaacatc 300
atccggggct ggatcttcgg caccaccctg gacagcaaga cccagagcct gctgatcgtg 360
aacaacgcca ccaacgtggt gatcaaggtg tgcgagttcc agttctgcaa cgaccccttc 420
ctgggcgtgt actaccacaa gaacaacaag agctggatgg agagcgagtt ccgggtgtac 480
agcagcgcca acaactgcac cttcgagtac gtgagccagc ccttcctgat ggacctggag 540
ggcaagcagg gcaacttcaa gaacctgcgg gagttcgtgt tcaagaacat cgacggctac 600
ttcaagatct acagcaagca cacccccatc aacctggtga gggacctgcc tcagggcttt 660
tctgccctgg agcctctggt ggacctgccc atcggcatca acatcaccag gttccagacc 720
ctgctggccc tgcacaggag ctacctgaca cctggcgata gctcttctgg ctggacagcc 780
ggagctgctg cctactacgt gggctacctg cagccccgga ccttcctgct gaagtacaac 840
gagaacggca ccatcaccga cgccgtggat tgcgccctgg atcctctgag cgagaccaag 900
tgcaccctga agagcttcac cgtggagaag ggcatctacc agaccagcaa cttccgggtg 960
cagcccaccg agagcatcgt gaggttcccc aacatcacca acctgtgccc cttcggcgag 1020
gtgttcaacg ccaccagatt cgccagcgtg tacgcctgga accggaagcg gatcagcaac 1080
tgcgtggccg actacagcgt gctgtacaac agcgccagct tcagcacctt caagtgctac 1140
ggcgtgagcc ccaccaagct gaacgacctg tgcttcacca acgtgtacgc cgacagcttc 1200
gtgatcaggg gcgatgaggt gagacagatc gcccctggcc agaccggcaa gatcgccgac 1260
tacaactaca agctgcccga cgacttcacc ggctgcgtga tcgcctggaa cagcaacaac 1320
ctggacagca aggtgggcgg caactacaac tacctgtacc ggctgttccg gaagagcaac 1380
ctgaagccct tcgagcggga catcagcacc gagatctacc aggccggaag caccccttgc 1440
aacggcgtgg agggcttcaa ctgctacttc cccctgcaga gctacggctt ccagcctacc 1500
aatggcgtgg gctaccagcc ctacagggtg gtggtgctga gctttgagct gctgcatgct 1560
cctgccaccg tgtgcggccc caagaagagc accaacctgg tgaagaacaa gtgcgtgaac 1620
ttcaacttca acggcctgac cggcaccggc gtgctgaccg agagcaacaa gaagttcctg 1680
cccttccagc agttcggcag ggacatcgcc gataccaccg atgccgtgag agaccctcag 1740
accctggaga tcctggacat caccccttgc agcttcggcg gagtgagcgt gatcacacct 1800
ggcaccaaca ccagcaacca ggtggccgtg ctgtaccagg acgtgaactg caccgaggtg 1860
cctgtggcca ttcacgccga tcagctgacc cccacctgga gagtgtacag caccggcagc 1920
aacgtgttcc agaccagagc cggctgtctg atcggcgccg agcacgtgaa caacagctac 1980
gagtgcgaca tccccatcgg cgccggcatc tgtgccagct atcagaccca gaccaacagc 2040
cctaggaggg ccagaagcgt ggccagccag tctatcatcg cctacaccat gagcctgggc 2100
gccgagaaca gcgtggccta cagcaacaac agcatcgcca tccccaccaa cttcaccatc 2160
agcgtgacca ccgagatcct gcccgtgagc atgaccaaga ccagcgtgga ctgcaccatg 2220
tacatctgcg gcgacagcac cgagtgcagc aacctgctgc tgcagtacgg cagcttctgc 2280
acccagctga acagagccct gacaggcatc gccgtggagc aggacaagaa cacccaggag 2340
gtgttcgccc aggtgaagca gatctacaag acccccccca tcaaggactt cggcggcttc 2400
aacttcagcc agatcctgcc tgaccccagc aagcccagca agcggagctt catcgaggac 2460
ctgctgttca acaaggtgac cctggccgac gccggcttca tcaagcagta cggcgactgt 2520
ctgggcgaca tcgccgccag agacctgatc tgtgcccaga agttcaacgg cctgaccgtg 2580
ctgccccctc tgctgaccga tgagatgatc gcccagtaca cctctgccct gctggccggc 2640
accatcacat ctggctggac ctttggagct ggagccgccc tgcagatccc tttcgccatg 2700
cagatggcct accggttcaa cggcatcggc gtgacccaga acgtgctgta cgagaaccag 2760
aagctgatcg ccaaccagtt caacagcgcc atcggcaaga tccaggacag cctgagcagc 2820
accgcctctg ctctgggcaa actgcaggac gtggtgaacc agaacgccca ggccctgaac 2880
accctggtga agcagctgag cagcaacttc ggcgccatca gcagcgtgct gaacgacatc 2940
ctgagcaggc tggacaaggt ggaggccgag gtgcagatcg acaggctgat caccggcaga 3000
ctgcagagcc tgcagaccta cgtgacccag cagctgatca gagccgccga gatcagagcc 3060
tctgccaatc tggccgccac caagatgagc gagtgtgtgc tgggccagag caagagggtg 3120
gacttctgcg gcaagggcta ccacctgatg agcttccccc agtctgcccc tcatggcgtg 3180
gtgttcctgc acgtgaccta cgtgcctgcc caggagaaga acttcaccac cgcccctgcc 3240
atctgccacg atggcaaggc ccacttccct agagagggcg tgttcgtgag caacggcacc 3300
cactggttcg tgacccagcg gaacttctac gagccccaga tcatcaccac cgacaacacc 3360
ttcgtgagcg gcaactgcga cgtggtgatc ggcatcgtga acaacaccgt gtacgacccc 3420
ctgcagcccg agctggacag cttcaaggag gagctggaca agtacttcaa gaaccacacc 3480
agccccgacg tggacctggg cgacatcagc ggcatcaacg ccagcgtggt gaacatccag 3540
aaggagatcg accggctgaa cgaggtggcc aagaacctga acgagagcct gatcgacctg 3600
caggagctgg gcaagtacga gcagtacatc aagtggccct ggtacatctg gctgggcttc 3660
atcgccggcc tgatcgccat cgtgatggtg accatcatgc tgtgctgcat gaccagctgc 3720
tgcagctgcc tgaagggctg ctgcagctgt ggcagctgtt gcaagttcga cgaggacgac 3780
agcgagcccg tgctgaaggg cgtgaagctg cactacacc 3819
<210> 28
<211> 957
<212> DNA
<213> Artificial Sequence
<220>
<223> MT2AE
<400> 28
atggccgatt ctaatggcac catcaccgtg gaagagctga agaagctgct cgagcaatgg 60
aacctggtga tcggatttct gttcctgacc tggatctgtc tgttgcagtt cgcctacgcc 120
aaccggaaca gattcctgta catcatcaaa ctgatcttcc tgtggctgct gtggcctgtg 180
accctggcct gcttcgtgct ggccgccgtg taccggatta actggatcac cggaggcatc 240
gctatcgcca tggcatgcct ggtcggactt atgtggctgt cttatttcat cgccagcttc 300
agactgttcg ctagaaccag aagcatgtgg tcctttaacc ctgagacaaa catcctgctg 360
aacgtgcctc tgcacggcac aatcctgaca cggccactgc tggaaagcga gctggtcatc 420
ggcgccgtga tcctgcgggg ccatctgcgc attgccggac accacctggg cagatgcgac 480
atcaaggacc tgcccaagga aatcaccgtg gccaccagca gaacactgtc ctactacaaa 540
ctgggcgcta gtcagagagt ggccggcgac agcggcttcg ccgcttattc tagatacaga 600
atcggcaact acaagctgaa taccgatcac agcagcagca gcgacaacat cgccctgctg 660
gtgcagggca gcggcgaggg cagaggaagc ctgctgacat gtggcgatgt ggaagagaac 720
cccggccctg ccatgtacag ctttgtgtct gaggaaaccg gcaccctgat cgtgaacagc 780
gtgctgctgt ttctggcctt cgtcgtgttc ctgctggtga cactggctat cctgaccgcc 840
ctgaggctgt gcgcctactg ctgcaacatc gtgaatgtat ccctggtgaa gccttccttc 900
tacgtgtaca gccgggtgaa gaaccttaat agctctagag tgcccgacct gctcgtt 957
<210> 29
<211> 960
<212> DNA
<213> Artificial Sequence
<220>
<223> MP2AE
<400> 29
atggccgaca gcaacggcac aatcacagtg gaagagctga agaagctgct ggagcagtgg 60
aacctggtga ttggatttct tttcctcacc tggatctgcc tgctgcagtt cgcctatgcc 120
aaccggaaca gattcctgta catcatcaag ctgatcttcc tgtggctgct gtggcccgtg 180
accctggcct gttttgtgct ggccgccgtg taccggatca actggatcac cggcggaatc 240
gctatcgcca tggcctgcct ggtgggcctg atgtggctga gctacttcat cgcctccttt 300
agactgttcg ccagaaccag aagcatgtgg tccttcaacc ctgagacaaa tatcctgctc 360
aacgtgcccc tgcacggcac catcctgacc cggcctctgc tcgagagcga gctggtgatc 420
ggcgccgtga tcctgagagg ccacctgaga atcgccggac accacctggg cagatgcgac 480
atcaaggacc tgccaaagga aatcaccgtt gctacaagca gaacactgtc ctactacaag 540
ctgggcgctt ctcaaagagt cgccggcgac agcggcttcg ctgcttatag ccgctacagg 600
attggaaatt acaagctgaa caccgatcat tcttctagca gcgacaacat cgccctgctg 660
gtccagggca gcggcgccac aaacttcagc ctgcttaaac aggccggcga tgtggaagag 720
aaccccggcc ctgccatgta cagcttcgtg tccgaggaaa ccggcaccct gatcgtgaac 780
agcgtgctgc tgttccttgc ttttgtggtg ttcctgctgg tcaccctggc catcctgacc 840
gccctgagac tgtgtgccta ctgctgcaac atcgtgaatg tgtctctggt gaagcctagc 900
ttctacgtgt acagccgggt gaaaaacctg aactctagcc gggtgcctga tctgctggtg 960
<210> 30
<211> 798
<212> DNA
<213> Artificial Sequence
<220>
<223> SGS-RBD
<400> 30
atggagacag acacactcct gctatgggta ctgctgctct gggttccagg ttccaccgga 60
gactgcccat ttggcgaggt gttcaacgca acccgcttcg ccagcgtgta cgcctggaat 120
aggaagcgga tcagcaactg cgtggccgac tatagcgtgc tgtacaactc cgcctctttc 180
agcaccttta agtgctatgg cgtgtccccc acaaagctga atgacctgtg ctttaccaac 240
gtctacgccg attctttcgt gatcaggggc gacgaggtgc gccagatcgc ccccggccag 300
acaggcaaga tcgcagacta caattataag ctgccagacg atttcaccgg ctgcgtgatc 360
gcctggaaca gcaacaatct ggattccaaa gtgggcggca actacaatta tctgtaccgg 420
ctgtttagaa agagcaatct gaagcccttc gagagggaca tctctacaga aatctaccag 480
gccggcagca ccccttgcaa tggcgtggag ggctttaact gttatttccc actccagtcc 540
tacggcttcc agcccacaaa cggcgtgggc tatcagcctt accgcgtggt ggtgctgagc 600
tttgagctgc tgcacgccta cccgtacgac gtgccggact acgccaatgc tgtgggccag 660
gacacgcagg aggtcatcgt ggtgccacac tccttgccct ttaaggtggt ggtgatctca 720
gccatcctgg ccctggtggt gctcaccatc atctccctta tcatcctcat catgctttgg 780
cagaagaagc cacgttag 798
<210> 31
<211> 3819
<212> RNA
<213> Artificial Sequence
<220>
<223> SDC-50 mRNA
<400> 31
uacaagcaca aggaccacga cgacggagac cacagaagag ucacacacuu agacuguugg 60
ucuugggucg acggaggacg gauaugguug ucgaaguguu cuccgcacau gaugggacug 120
uuccacaagu ccagaagaca cgacgugaga uggguccuag acaaggacgg aaagaagucg 180
uugcacugga ccaaagugcg guagguacac agaccguggu uaccgugguu cucuaagcug 240
uuaggacacg acggaaaguu gcuaccgcac augaagcgga gauggcucuu cucguuguag 300
uagucuccga ccuagaaacc guguugggac cuaucguucu gggucagaga cgacuagcac 360
uuguuacggu gguugcacca cuaguuccac acgcucaagg ucaagacguu acugggaaag 420
gacccgcaca ugaugguguu cuuguuguuc ucgaccuacc ucucgcucaa gucccacaug 480
ucgagacggu uguuaacgug gaagcucaug cacucggucg gaaaggacua ccuagaccuu 540
ccuuucgucc cguugaaguu cuuggacgcc cucaagcaca aguucuugua gcugccgaug 600
aaguucuaga ugucguucgu guggggguag uuagaccacu cucuagacgg agucccuaaa 660
agacgagacc uuggagacca ccuagacgga uagccguagu uguagugguc uaaggucugu 720
gacgaccgag acgugucuuc gauagacugu ggaccgcuaa gaagaagacc uaccugucga 780
ccucgacgac gaauaaugca cccgauggac gucggaucuu ggaaggacga cuucauguug 840
cucuuaccgu gguaguggcu acgacaccua acacgggacc uaggagacag acucuguuuc 900
acaugggacu ucucgaagug gcaccucuuc ccguagaugg ucuggucguu aaagucucac 960
gucggauggc ucucguagca cucuaagggg uuguaguggu uagacacggg aaaaccgcuc 1020
cacaaguuac gguggucuaa acggucgcac auacggaccu uguccuucuc uuagucguug 1080
acacaccggc ugaugucgca cgacauguua agacggucga aaucguggaa guucacgaug 1140
ccgcacagag gaugguucga cuuacuggac acaaaguggu ugcacaugcg gcugucgaag 1200
cacuagucuc cucuacuuca cucugucuaa cgaggaccgg ucuguccguu cuagcggcua 1260
auguugaugu ucgacggacu acugaagugg ccgacacacu agcggaccuu aucguuguua 1320
gaccugucgu uucacccgcc guugauguug auggacaugu ccgacaaguc cuucucguug 1380
gacuucggga agcucucucu guagagaugg cucuagauag uccgaccuuc guggggaaca 1440
uuaccgcacc uuccgaaguu gacaaugaag ggagacgucu cgaugccgaa agucggaugg 1500
uuaccucacc cuauagucgg aaugucucac caccacgacu cgaaacuuga cgacguacga 1560
ggacgauguc acacaccggg auucuucucg ugguuggacc acuucuuguu cacgcacuug 1620
aaguugaagu ugccggacug gccuuguccu cacgacuguc ucucguuguu cuucaaggac 1680
gggaaggucg ucaaaccguc ucuguaacgg cuaugguguc uacggcacuc ucuaggaguc 1740
ugugaccucu aggaccuaua guguggaacg ucgaaaccgc cucacagaca cuagugugga 1800
ccuugguuau ggucguuagu ccaccgacac gacauggucc ugcacuuaac gugucuucac 1860
ggacaccggu aaguacgacu agucgacugg ggauguaccu cucacauguc guguccgucg 1920
uuacacaaag ucuggucucg gccuacagac uaaccucgac ucgugcacuu guugucgaug 1980
cucacacugu agggauaacc ucggccuuag acacggucga uagucugugu cugguugaga 2040
ggaucuucuc ggucuagaca ccggucgguc agauaguagc ggauauggua cagagacccu 2100
cgacucuuau cgcaccggau gucguuguug ucguagcggu agggaugguu gaagugguag 2160
ucgcacuguu gucucuagga cggacacucg uacugguucu guagacaccu gacgugguac 2220
auguagacac cgcugucgug ucucacaucg uuagacgacg acgucaugcc gucgaaaaca 2280
ugggucgacu uaucucggga cuguccuuaa cggcaccucg uccuauucuu auggguccuc 2340
cacaaacggg uccacuucgu cuagauguuc uggggaggau aguuccugaa gccgccgaag 2400
uugaagucgg ucuaagacgg acuaggaucg uucgggucgu ucucuucaaa guagcuccua 2460
gacgacaagu uguuccacug ggaccggcua cggccuaaau aguucgucau accgcuaaca 2520
gacccgcuau agcggcgguc ucuagacuaa acacgggucu ucaaguuacc ugacuggcac 2580
gacggaggag acgacugucu acucuacuaa cgagucaugu guagacggga cgaccgaccg 2640
uguuagugua gaccuaccug uaaaccucga ccucgacgag acgucuaggg aaaacgguac 2700
gucuaccgga ugucuaaguu gccguagccu cacugggucu uacacgacau gcucuugguc 2760
uucgacuagc gguuggucaa guugucgcgg uaaccguucu agguccuaag agacagaucg 2820
ugucgaagac gagacccguu ugacguccua caccacuuag ucuuacgagu ccgggacuua 2880
ugggaccacu ucgucgacag aucguuaaaa ccgcgguagu cgucgcacga cuuacuguag 2940
gacucgucug accuauuuca ccuccggcuu cacgucuagc ugucugacua guguccuucu 3000
gacgucagag acgucuggau gcacuguguc gucgacuaau cucgacggcu cuaaucucgg 3060
agacgauuag accgacggug guucuacaga cucacacacg acccugucag auucucucac 3120
cugaagacac cguuuccgau gguggacuac ucgaaaggag ucagacgagg aguaccucac 3180
cacaaagacg ugcacuguau acacggacgg guccucuucu ugaaguggug ucgaggacgg 3240
uaaacagugc uaccguuucg ggugaaagga ucucuuccgc acaagcacuc guuaccuugg 3300
gugaccaaac acugggucuc uuugaagaug cucggggucu aguaguggug gcuguuaugg 3360
aagcacagac cguuaacgcu gcaccacuag ccguagcacu uguuauggca cauacuagga 3420
gacgucggac ucgaccuguc gaaguuccuc cucgaccugu ucaugaaguu cuuggugugg 3480
ucgggacuac accuagaccc gcuauagaga ccguaguuac ggagacacca cuuguagguc 3540
uuccucuagc uguccgacuu acuccaccgg uucuuggacu uacucucgga cuagcuagac 3600
guccucgacc cuuucaugcu cgucauguag uucaccggaa ccauguagac cgacccgaaa 3660
uaacggccug acuaacggua gcacuaccac ugguaguacg acacgacgua cuguucgaca 3720
acaucgacag acuucccgac gacaagaaca ccgucgacaa cguucaagcu acuccuacua 3780
ucgcucggac acgacuuucc gcacuucgac gugaugugg 3819
<210> 32
<211> 3819
<212> RNA
<213> Artificial Sequence
<220>
<223> SDC-54 mRNA
<400> 32
uacaagcaca aggaccacga cgacggagac cacucgagag ucacacacuu agacuggugu 60
ucuugggucg acggaggacg gaugugguug ucgaaauggu cuccucacau gauggggcug 120
uuccacaagu cuucgucgca cgacguaucg uguguccuag acaaggacgg gaagaagucg 180
uugcacugga ccaaagugcg guagguacac agaccguggu uaccgugguu cucuaagcug 240
uugggacacg acggaaaguu gcuaccgcac augaagcgga gauggcucuu cucguuguag 300
uagucuccga ccuagaagcc guggugugac cuaucguucu gggucagaga cgacuagcac 360
uuguugcggu gguugcacca cuaguuccac acgcucaagg ucaagacguu gcuggggaag 420
gacccgcaca ugaugguguu cuuguuguuc ucgaccuacc ucucgcucaa gucccacaug 480
ucgucgcggu uguuaacgug gaagcucaug cacucggucg gaaaggacua ccuagaccuc 540
ccuuucgucc cguugaaguu cuuggacgcc cucaagcaca aguucuugua gcugccgaug 600
aaguucuaga ugucguucgu guggggguag uuggaccacu cucuagacgg agucccuaaa 660
agacgagacc ucggagacca ccuagacgga uagccguagu uguagugguc uaaggucugu 720
gacgaccggg acgugucuuc gauggacugu ggaccucuaa gaagaagacc gaccugucga 780
ccucgacgac ggauaaugca cccgauagac gucgggucuu ggaaggacga cuucauguug 840
cucuugccgu gguagugucu acggcaccua acacgggacc uaggagacag acucugguuc 900
acaugggacu ucucgaagug gcaccucuuc ccguagaugg ucuggucguu gaagucucac 960
gucggauggc ucucguagca cucuaagggg uuguaguggu uggacacggg aaaaccgcuc 1020
cacaaguuac gguggucuaa acggucgcac augcggaccu uguccuucuc cuagucguug 1080
acacaccggc uaaugucgca cgacauguug agacggucga agucguggaa guucacgaug 1140
ccgcacagag gauguuucga cuugcuggac acgaaguggu ugcacaugcg gcugucgaag 1200
cacuaaucuc cgcuacucca cucugucuaa cgaggaccgg ucuguccguu cuaacggcug 1260
auguugaugu ucgacggacu gcugaagugg ccgacacacu aacggaccuu gucguuguua 1320
gaccugucgu uccacccgcc guugauguug auggacaugu ccgacaaguc cuucucguug 1380
gacuucggga agcucucucu guagucgugg cucuagauag uccgaccuuc guggggaaca 1440
uuaccgcacc ucccgaaguu gacaaugaag ggagacgucu cgaugccgaa agucggaugg 1500
uuaccucacc cgauagucgg aaugucucac caccacgacu cgaaacuuga cgacguacga 1560
ggacgauguc acacaccggg guucuucucg ugguuggacc acuucuuguu cacgcacuug 1620
aaguugaagu ugccggacug gccuuguccu cacgacuguc ucucguuguu cuucaaggac 1680
gggaaggucg ucaagccguc ucuauagcgg cuaugguguc uacggcacuc ucuaggaguc 1740
ugugaccucu aggaccugua guguggaacg ucgaaaccgc cucacagaca cuagugugga 1800
ccgugguuau ggucguuagu ccaccgacac gacauggucc ugcacuuaac guggcuucac 1860
ggacaccggu aaguacgacu agucgacugg ggauguaccu cucacauguc guggccgaga 1920
uuacacaagg ucuggucucg gccuacagac uaaccucggc ucgugcacuu auugucgaug 1980
cucacgcugu agggauaacc ucggccguag acacggagaa uagucugggu cugguugaga 2040
ggaucuucuc ggucuucgca ccggagaguc ucguaguaac ggauguggua cagagacccu 2100
cggcucuuau cgcaccggau gucguuauug ucguagcggu aggggugguu gaagugguag 2160
ucgcacuggu gucucuaaga cggacacucg uacugguucu ggagacaccu gacgugguac 2220
auguagacac cgcugagaug gcucacgucg uuagacgacg acgucauacc gucgaaaaca 2280
ugggucgacu ugucucggga cuguccguaa cgacaccucg uccuauucuu guggguccuc 2340
cacaaacggg uccacuucgu cuagauguuc uggggagggu aguuccugaa gccgccgaaa 2400
uugaagucgg ucuaggacgg acuaggaucg uucgggucgu ucuccucgaa auagcuccug 2460
gacgacaagu uguuccacug ggaccggcua cgaccgaaau aguucgucau gccucuaaca 2520
gacccgcuau agcggcgguc ucuggacuaa acacgggucu ucaaguuacc ugacuggcac 2580
gacggaggag acgacugucu acucuacuaa cgggucaugu guagacggga cgaccgaccg 2640
uguuagugua gaccuaccug uaaaccucga ccucgacggg acgucuaggg aaaacgguac 2700
gucuaccgga ugucuaaguu gccguagccg cacugggucu uacacgacau gcucuugguc 2760
uucgacuagc gguuggucaa guugucgcgg uagccguucu agguccuaag agacagaucg 2820
ugucggagac gagacccuuu cgacguccua caccacuuag ucuuacgggu ccgggacuua 2880
ugugaccacu ucgucgacuc gucguugaaa ccgcgguagu cgagacacga cuuacuguag 2940
gacucgucug accuguucca ccuccgacuu cacgucuagc ugucugacua guguccgucu 3000
gacgucagag acgucuggau gcacuguguc gucgacuaau cucgacggcu cuagucucga 3060
agacgguuag accgacggug guucuacaga cucacacacg acccugucuc guucucucac 3120
cugaagacac cguuuccgau gguggacuac ucgaagggag ucagacgagg aguaccucac 3180
cacaaagacg ugcacuggau acacggacgg guccucuucu ugaaguggug ucgaggacgg 3240
uaaacagugc uaccguuccg ggugaaagga ucucuuccgc acaagcacuc guuaccgugu 3300
gugaccaagc acugggucuc cuugaagaug cucggggucu aguaguggug gcuauugugg 3360
aagcacucgc cguuaacgcu gcaccacuag ccguagcacu uguuauggca caugcuagga 3420
gacgucggac ucgaccuguc gaaguuccuc cucgaccugu ucaugaaguu cuuggugugg 3480
ucgggacuac accuagaccc gcuguagaga ccguaguuac ggucgcacca cuuguagguc 3540
uuccucuagc uguccgacuu gcuccaccgg uucuuggacu uacucucgga cuagcuagac 3600
guccucgacc cguucaugcu cgucauguag uucaccggaa ccauguagac cgacccgaaa 3660
uagcggccug acuaacggua gcacuaccac ugguaguacg acacgacgua cuggucgacg 3720
acaucgacag acuucccgac aacaucgaca ccgucgacaa cguucaagcu gcuccuacua 3780
ucgcucggac acgacuuucc gcacuucgac gugaugugg 3819
<210> 33
<211> 3819
<212> RNA
<213> Artificial Sequence
<220>
<223> SDC-58 mRNA
<400> 33
uacaagcaca aggaccacga cgacggggac cacucgagag ucacacacuu ggacuggugg 60
ucuugggucg acggaggacg gaugugguug ucgaaguggu cuccgcacau gauggggcug 120
uuccacaagu cuucgucgca cgacgugucg uggguccugg acaaggacgg gaagaagucg 180
uugcacugga ccaaggugcg guaggugcac agaccguggu uaccgugguu cuccaagcug 240
uugggacacg acgggaaguu gcugccgcac augaagcggu cguggcucuu cucguuguag 300
uaguccccga ccuagaagcc guggugggac cugucguucu gggucucgga cgacuagcac 360
uuguugcggu gguugcacca cuaguuccac acgcucaagg ucaagacguu gcuggggaag 420
gacccgcaca ugaugguguu cuuguuguuc ucgaccuacc ucucgcucaa ggcccacaug 480
ucgucgcggu uguugacgug gaagcucaug cacucggucg ggaaggacua ccuggaccuc 540
ccguucgucc cguugaaguu cuuggacgcc cucaagcaca aguucuugua gcugccgaug 600
aaguucuaga ugucguucgu guggggguag uuggaccacu cucuggacgg agucccgaaa 660
agacgggacc ucggagacca ccuggacgga uagccguagu uguagugguc caaggucugg 720
gacgaccggg acgugucuuc gauggacugu ggaccgcuau cgagaagacc gaccugucga 780
ccucgacgac ggauaaugca cccgauggac gucggguccu ggaaggacga cuucauguug 840
cucuugccgu gguaguggcu gcggcaccua acacgggacc uaggagacuc gcucugguuc 900
acgugggacu ucucgaagug gcaccucuuc ccguagaugg ucuggucguu gaaggcccac 960
gucggauggc ucucguagca cuccaagggg uuguaguggu uggacacggg aaagccgcuc 1020
cacaaguugc gguggucuaa gcggagacac augcggaccu uguccuucgc cuagucguug 1080
acgcaccggc ugaugucgca cgacauguug ucgcggucga agucguggaa guucacgaug 1140
ccgcacucgg gaugguucga cuugcuggac acgaaguggu ugcacaugcg gcugucgaag 1200
cacuagucuc cgcuacucca cucugucuag cggggaccug ucuggccguu cuagcggcug 1260
auguugaugu ucgacgggcu gcugaagugg ccgacacacu agcggaccuu gucguuguug 1320
gaccugucgu uccacccgcc guugauguug auggacaugg ccgacaaggc cuucucguug 1380
gacuucggga agcucucccu guagucgugg cucuagaugg uccggccuuc guguggaacg 1440
uuaccgcacc ucccgaaguu gacgaugaag ggggacgucu cgaugccgaa agucggaugg 1500
uuaccgcacc cgauggucgg gaugucucac caccacgacu cgaaacuuga cgacguacgg 1560
ggacgguguc acacaccggg guucuucucg ugguuggacc acuucuuguu cacgcacuug 1620
aaguugaagu ugccggacug gccguguccg cacgacuggc ucucguuguu cuucaaggac 1680
gggaaggucg ucaagccguc ucuguagcgg cuaugguggc uacggcacuc ucuaggaguc 1740
ugggaccucu aggaccugua guggggaacg ucgaaaccgc cucacucgca cuagugugga 1800
ccgugguugu ggucguuagu ccaccggcac gacauggucc ugcacuugac gugucuccac 1860
ggacaccggu aaguacggcu agucgacugg ggauggaccu cucacauguc guggccgucg 1920
uuacacaagg ucuggucucg gccgacagac uagccucggc ucgugcacuu guugucgaug 1980
cucacgcugu agggauagcc ucggccguag acgcggagaa uggucugugu cugguugucg 2040
gggucuucuc ggucuucgca ccggucgguc agauaguagc ggauguggua cucggacccu 2100
cggcucuugu cgcaccggau gucguuguug ucguagcggu aggggugguu gaagugguag 2160
ucgcacuggu ggcucuagga cgggcacucg uacugguucu ggucgcaccu gacgugguac 2220
auguagacgc cgcugucgug ucucacgucg uuggacgacg acgucaugcc gucgaaaacg 2280
ugggucgacu ugucucggga cuguccguaa cggcaccucg uccuguucuu guggguccuc 2340
cacaagcggg uccacuucgu cuagauguuc uggggggggu aguuccugaa gccgccgaag 2400
uugaagucgg ucuaggacgg acugggaucg uucgggucgu ucgccucgaa guagcuccug 2460
gacgacaagu uguuccacug ggaccggcua cggccgaagu aguucgucau gccgcuaaca 2520
gacccgcuau agcggcgguc ucuggacuag acacgggucu ucaaguugcc ggacuggcac 2580
gacggaggag acgacugucu acucuacuag cgggucaugu ggagacggga cgaccggccu 2640
ugguagugua gaccgaccug uaaaccucga ccucggcggg acgucuaggg aaagcgguac 2700
gucuaccgga uguccaaguu gccguagccg cacugggucu ugcacgacau gcucuugguc 2760
uucgacuagc gguuggucaa guugucgcgg uagccguucu agguccuguc ggacagaucg 2820
ugucggagac gagacccguu cgacguccua caccacuugg ucuuacgggu ccgggacuug 2880
ugggaccacu ucgucgacuc gucguuaaag ccgcgguagu cgucgcacga cuugcuguag 2940
gacucgucug accuguucca ccuccggcuc cacgucuagc ugucugacua guggccgucu 3000
gacgucucgg acgucuggau gcacuguguc gucgacuagu cucggcggcu cuagucucgg 3060
agacgguuag accgacggug guucuacucg cucacacacg acccggucuc guucucucac 3120
cugaagacgc cguuuccgau gguggacuac ucgaaggggg ucagacgagg aguaccgcac 3180
cacaaagacg ugcacuggau gcacggacgg guccucuucu ugaaguggug ucggggacgg 3240
uagacagugc uaccguuccg ggugaaggga ucucucccgc acaagcacuc guuaccgugg 3300
gugaccaagc acugggucgc cuugaagaug cucggggucu aguaguggug gcuguugugg 3360
aagcacucgc cguugacgcu gcaccacuag ccguagcacu uguuguggca caugcuggga 3420
gacgucgggc ucgaccuguc gaaguuccuc cucgaccugu ucaugaaguu cuuggugugg 3480
ucggggcugc accuagaccc gcuguagucg ccguaguugc ggucgcacca cuuguagguc 3540
uuccucuagc uggccgacuu gcuccaccgg uucuuggacu ugcucucgga cuagcuggac 3600
guccucgacc cguucaugcu cgucauguag uucaccggga ccauguagac cgacccgaaa 3660
uagcggccgg acuagcggua gcacuaccac ugguaguacg acacgacgua cuggucgacg 3720
acgucgacgg acuucccgac aacaucgaca ccgucgacga cguucaagcu gcuccugcua 3780
ucgcucggac acgacuuccc gcacuucgac gugaugugg 3819
<210> 34
<211> 3819
<212> RNA
<213> Artificial Sequence
<220>
<223> SDC-60 mRNA
<400> 34
uacaagcaca aggaccacga cgacggggac cacucgucgg ucacacacuu ggacuggugg 60
ucuugggucg acggagggcg gaugugguug ucgaaguggu ccccgcacau gauggggcug 120
uuccacaagu ccucgucgca cgacgugucg uggguccugg acaaggacgg gaagaagucg 180
uugcacugga ccaaggugcg guaggugcac ucgccguggu uaccgugguu cgccaagcug 240
uugggacacg acgggaaguu gcugccgcac augaagcggu cguggcucuu cucguuguag 300
uaggccccga ccuagaagcc guggugggac cugucguucu gggucucgga cgacuagcac 360
uuguugcggu gguugcacca cuaguuccac acgcucaagg ucaagacguu gcuggggaag 420
gacccgcaca ugaugguguu cuuguuguuc ucgaccuacc ucucgcucaa ggcccacaug 480
ucgucgcggu uguugacgug gaagcucaug cacucggucg ggaaggacua ccuggaccuc 540
ccguucgucc cguugaaguu cuuggacgcc cucaagcaca aguucuugua gcugccgaug 600
aaguucuaga ugucguucgu guggggguag uuggaccacu cccuggacgg agucccgaaa 660
agacgggacc ucggagacca ccuggacggg uagccguagu uguagugguc caaggucugg 720
gacgaccggg acguguccuc gauggacugu ggaccgcuau cgagaagacc gaccugucgg 780
ccucgacgac ggaugaugca cccgauggac gucggggccu ggaaggacga cuucauguug 840
cucuugccgu gguaguggcu gcggcaccua acgcgggacc uaggagacuc gcucugguuc 900
acgugggacu ucucgaagug gcaccucuuc ccguagaugg ucuggucguu gaaggcccac 960
gucggguggc ucucguagca cuccaagggg uuguaguggu uggacacggg gaagccgcuc 1020
cacaaguugc gguggucuaa gcggucgcac augcggaccu uggccuucgc cuagucguug 1080
acgcaccggc ugaugucgca cgacauguug ucgcggucga agucguggaa guucacgaug 1140
ccgcacucgg ggugguucga cuugcuggac acgaaguggu ugcacaugcg gcugucgaag 1200
cacuaguccc cgcuacucca cucugucuag cggggaccgg ucuggccguu cuagcggcug 1260
auguugaugu ucgacgggcu gcugaagugg ccgacgcacu agcggaccuu gucguuguug 1320
gaccugucgu uccacccgcc guugauguug auggacaugg ccgacaaggc cuucucguug 1380
gacuucggga agcucgcccu guagucgugg cucuagaugg uccggccuuc guggggaacg 1440
uugccgcacc ucccgaaguu gacgaugaag ggggacgucu cgaugccgaa ggucggaugg 1500
uuaccgcacc cgauggucgg gaugucccac caccacgacu cgaaacucga cgacguacga 1560
ggacgguggc acacgccggg guucuucucg ugguuggacc acuucuuguu cacgcacuug 1620
aaguugaagu ugccggacug gccguggccg cacgacuggc ucucguuguu cuucaaggac 1680
gggaaggucg ucaagccguc ccuguagcgg cuaugguggc uacggcacuc ucugggaguc 1740
ugggaccucu aggaccugua guggggaacg ucgaagccgc cucacucgca cuagugugga 1800
ccgugguugu ggucguuggu ccaccggcac gacauggucc ugcacuugac guggcuccac 1860
ggacaccggu aagugcggcu agucgacugg ggguggaccu cucacauguc guggccgucg 1920
uugcacaagg ucuggucucg gccgacagac uagccgcggc ucgugcacuu guugucgaug 1980
cucacgcugu agggguagcc gcggccguag acacggucga uagucugggu cugguugucg 2040
ggauccuccc ggucuucgca ccggucgguc agauaguagc ggauguggua cucggacccg 2100
cggcucuugu cgcaccggau gucguuguug ucguagcggu aggggugguu gaagugguag 2160
ucgcacuggu ggcucuagga cgggcacucg uacugguucu ggucgcaccu gacgugguac 2220
auguagacgc cgcugucgug gcucacgucg uuggacgacg acgucaugcc gucgaagacg 2280
ugggucgacu ugucucggga cuguccguag cggcaccucg uccuguucuu guggguccuc 2340
cacaagcggg uccacuucgu cuagauguuc uggggggggu aguuccugaa gccgccgaag 2400
uugaagucgg ucuaggacgg acuggggucg uucgggucgu ucgccucgaa guagcuccug 2460
gacgacaagu uguuccacug ggaccggcug cggccgaagu aguucgucau gccgcugaca 2520
gacccgcugu agcggcgguc ucuggacuag acacgggucu ucaaguugcc ggacuggcac 2580
gacgggggag acgacuggcu acucuacuag cgggucaugu ggagacggga cgaccggccg 2640
ugguagugua gaccgaccug gaaaccucga ccucggcggg acgucuaggg aaagcgguac 2700
gucuaccgga uggccaaguu gccguagccg cacugggucu ugcacgacau gcucuugguc 2760
uucgacuagc gguuggucaa guugucgcgg uagccguucu agguccuguc ggacucgucg 2820
uggcggagac gagacccguu ugacguccug caccacuugg ucuugcgggu ccgggacuug 2880
ugggaccacu ucgucgacuc gucguugaag ccgcgguagu cgucgcacga cuugcuguag 2940
gacucguccg accuguucca ccuccggcuc cacgucuagc uguccgacua guggccgucu 3000
gacgucucgg acgucuggau gcacuggguc gucgacuagu cucggcggcu cuagucucgg 3060
agacgguuag accggcggug guucuacucg cucacacacg acccggucuc guucucccac 3120
cugaagacgc cguucccgau gguggacuac ucgaaggggg ucagacgggg aguaccgcac 3180
cacaaggacg ugcacuggau gcacggacgg guccucuucu ugaaguggug gcggggacgg 3240
uagacggugc uaccguuccg ggugaaggga ucucucccgc acaagcacuc guugccgugg 3300
gugaccaagc acugggucgc cuugaagaug cucggggucu aguaguggug gcuguugugg 3360
aagcacucgc cguugacgcu gcaccacuag ccguagcacu uguuguggca caugcugggg 3420
gacgucgggc ucgaccuguc gaaguuccuc cucgaccugu ucaugaaguu cuuggugugg 3480
ucggggcugc accuggaccc gcuguagucg ccguaguugc ggucgcacca cuuguagguc 3540
uuccucuagc uggccgacuu gcuccaccgg uucuuggacu ugcucucgga cuagcuggac 3600
guccucgacc cguucaugcu cgucauguag uucaccggga ccauguagac cgacccgaag 3660
uagcggccgg acuagcggua gcacuaccac ugguaguacg acacgacgua cuggucgacg 3720
acgucgacgg acuucccgac gacgucgaca ccgucgacaa cguucaagcu gcuccugcug 3780
ucgcucgggc acgacuuccc gcacuucgac gugaugugg 3819
<210> 35
<211> 957
<212> RNA
<213> Artificial Sequence
<220>
<223> MT2AE mRNA
<400> 35
uaccggcuaa gauuaccgug guaguggcac cuucucgacu ucuucgacga gcucguuacc 60
uuggaccacu agccuaaaga caaggacugg accuagacag acaacgucaa gcggaugcgg 120
uuggccuugu cuaaggacau guaguaguuu gacuagaagg acaccgacga caccggacac 180
ugggaccgga cgaagcacga ccggcggcac auggccuaau ugaccuagug gccuccguag 240
cgauagcggu accguacgga ccagccugaa uacaccgaca gaauaaagua gcggucgaag 300
ucugacaagc gaucuugguc uucguacacc aggaaauugg gacucuguuu guaggacgac 360
uugcacggag acgugccgug uuaggacugu gccggugacg accuuucgcu cgaccaguag 420
ccgcggcacu aggacgcccc gguagacgcg uaacggccug ugguggaccc gucuacgcug 480
uaguuccugg acggguuccu uuaguggcac cgguggucgu cuugugacag gaugauguuu 540
gacccgcgau cagucucuca ccggccgcug ucgccgaagc ggcgaauaag aucuaugucu 600
uagccguuga uguucgacuu auggcuagug ucgucgucgu cgcuguugua gcgggacgac 660
cacgucccgu cgccgcuccc gucuccuucg gacgacugua caccgcuaca ccuucucuug 720
gggccgggac gguacauguc gaaacacaga cuccuuuggc cgugggacua gcacuugucg 780
cacgacgaca aagaccggaa gcagcacaag gacgaccacu gugaccgaua ggacuggcgg 840
gacuccgaca cgcggaugac gacguuguag cacuuacaua gggaccacuu cggaaggaag 900
augcacaugu cggcccacuu cuuggaauua ucgagaucuc acgggcugga cgagcaa 957
<210> 36
<211> 960
<212> RNA
<213> Artificial Sequence
<220>
<223> MP2AE mRNA
<400> 36
uaccggcugu cguugccgug uuagugucac cuucucgacu ucuucgacga ccucgucacc 60
uuggaccacu aaccuaaaga aaaggagugg accuagacgg acgacgucaa gcggauacgg 120
uuggccuugu cuaaggacau guaguaguuc gacuagaagg acaccgacga caccgggcac 180
ugggaccgga caaaacacga ccggcggcac auggccuagu ugaccuagug gccgccuuag 240
cgauagcggu accggacgga ccacccggac uacaccgacu cgaugaagua gcggaggaaa 300
ucugacaagc ggucuugguc uucguacacc aggaaguugg gacucuguuu auaggacgag 360
uugcacgggg acgugccgug guaggacugg gccggagacg agcucucgcu cgaccacuag 420
ccgcggcacu aggacucucc gguggacucu uagcggccug ugguggaccc gucuacgcug 480
uaguuccugg acgguuuccu uuaguggcaa cgauguucgu cuugugacag gaugauguuc 540
gacccgcgaa gaguuucuca gcggccgcug ucgccgaagc gacgaauauc ggcgaugucc 600
uaaccuuuaa uguucgacuu guggcuagua agaagaucgu cgcuguugua gcgggacgac 660
caggucccgu cgccgcggug uuugaagucg gacgaauuug uccggccgcu acaccuucuc 720
uuggggccgg gacgguacau gucgaagcac aggcuccuuu ggccguggga cuagcacuug 780
ucgcacgacg acaaggaacg aaaacaccac aaggacgacc agugggaccg guaggacugg 840
cgggacucug acacacggau gacgacguug uagcacuuac acagagacca cuucggaucg 900
aagaugcaca ugucggccca cuuuuuggac uugagaucgg cccacggacu agacgaccac 960
<210> 37
<211> 798
<212> RNA
<213> Artificial Sequence
<220>
<223> SGS-RBD mRNA
<400> 37
uaccucuguc ugugugagga cgauacccau gacgacgaga cccaaggucc aagguggccu 60
cugacgggua aaccgcucca caaguugcgu ugggcgaagc ggucgcacau gcggaccuua 120
uccuucgccu agucguugac gcaccggcug auaucgcacg acauguugag gcggagaaag 180
ucguggaaau ucacgauacc gcacaggggg uguuucgacu uacuggacac gaaaugguug 240
cagaugcggc uaagaaagca cuaguccccg cugcuccacg cggucuagcg ggggccgguc 300
uguccguucu agcgucugau guuaauauuc gacggucugc uaaaguggcc gacgcacuag 360
cggaccuugu cguuguuaga ccuaagguuu cacccgccgu ugauguuaau agacauggcc 420
gacaaaucuu ucucguuaga cuucgggaag cucucccugu agagaugucu uuagaugguc 480
cggccgucgu ggggaacguu accgcaccuc ccgaaauuga caauaaaggg ugaggucagg 540
augccgaagg ucggguguuu gccgcacccg auagucggaa uggcgcacca ccacgacucg 600
aaacucgacg acgugcggau gggcaugcug cacggccuga ugcgguuacg acacccgguc 660
cugugcgucc uccaguagca ccacggugug aggaacggga aauuccacca ccacuagagu 720
cgguaggacc gggaccacca cgagugguag uagagggaau aguaggagua guacgaaacc 780
gucuucuucg gugcaauc 798
<210> 38
<211> 66
<212> DNA
<213> Artificial Sequence
<220>
<223> T2A DNA
<400> 38
ggcagcggcg agggcagagg aagcctgctg acatgtggcg atgtggaaga gaaccccggc 60
cctgcc 66
<210> 39
<211> 69
<212> DNA
<213> Artificial Sequence
<220>
<223> P2A DNA
<400> 39
ggcagcggcg ccacaaactt cagcctgctt aaacaggccg gcgatgtgga agagaacccc 60
ggccctgcc 69
<210> 40
<211> 66
<212> RNA
<213> Artificial Sequence
<220>
<223> T2A mRNA
<400> 40
ccgucgccgc ucccgucucc uucggacgac uguacaccgc uacaccuucu cuuggggccg 60
ggacgg 66
<210> 41
<211> 69
<212> RNA
<213> Artificial Sequence
<220>
<223> P2A mRNA
<400> 41
ccgucgccgc gguguuugaa gucggacgaa uuuguccggc cgcuacaccu ucucuugggg 60
ccgggacgg 69
<210> 42
<211> 22
<212> PRT
<213> thosea asigna virus 2A
<400> 42
Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu
1 5 10 15
Glu Asn Pro Gly Pro Ala
20
<210> 43
<211> 23
<212> PRT
<213> porcine teschovirus-1 2A
<400> 43
Gly Ser Gly Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val
1 5 10 15
Glu Glu Asn Pro Gly Pro Ala
20

Claims (14)

1.mRNA,其包含编码来源于SARS-CoV-2病毒的S蛋白、E蛋白、M蛋白和N蛋白中的一种、两种、三种或四种蛋白或其片段的mRNA,
其中,编码所述S蛋白的mRNA的序列如SEQ ID NO.18、SEQ ID NO.19或SEQ ID NO.20所示;编码所述E蛋白的mRNA的序列如SEQ ID NO.21所示;编码所述M蛋白的mRNA的序列如SEQID NO.22所示;编码所述N蛋白的mRNA的序列如SEQ ID NO.23所示。
2.如权利要求1所述的mRNA,其特征在于,所述片段为所述S蛋白的RBD结构域的片段,其mRNA的序列优选如SEQ ID NO.37所示。
3.如权利要求1或2所述的mRNA,其特征在于,所述mRNA还包含以下(a)~(e)中的一种或多种:
(a)5’-帽结构,优选为3′-O-Me-m7G(5')ppp(5')G、m7G(5')ppp(5')(2'OMeA)pG或m7(3'OMeG)(5')ppp(5')(2'OMeA)pG;
(b)3’-聚腺苷酸,其序列优选包含约25至约400个腺苷核苷酸的序列,优选约50至约400个腺苷核苷酸的序列,更优选约50至约300个腺苷核苷酸的序列,进一步更优选约50至约250个腺苷核苷酸的序列,进一步优选约60至约250个腺苷核苷酸的序列,最优选为由120个聚腺苷酸组成的序列;
(c)5’-UTR,所述5’-UTR的序列优选如SEQ ID NO.15所示;
(d)3’-UTR,所述3’-UTR的序列优选来源于提供稳定的mRNA的基因的3’UTR,更优选如SEQ ID NO.16或SEQ ID NO.17所示;
(e)多核苷酸修饰,所述多核苷酸优选5-methyl-CTP、pseudo-UTP、N1-Methylpseudo-UTP和5-Methoxy-UTP中的一种或多种;
较佳地:
所述N蛋白的mRNA包括5-methyl-CTP、pseudo-UTP、N1-Methylpseudo-UTP或5-Methoxy-UTP的修饰,或包括5-methyl-CTP和pseudo-UTP的修饰;
和/或,所述E蛋白的mRNA包括5-methyl-CTP、pseudo-UTP或N1-Methylpseudo-UTP的修饰;
和/或,当编码所述S蛋白的mRNA的序列如SEQ ID NO.18所示时,所述S蛋白的mRNA包括5-methyl-CTP、pseudo-UTP或N1-Methylpseudo-UTP的修饰,或包括5-methyl-CTP和pseudo-UTP的修饰,优选包括pseudo-UTP或N1-Methylpseudo-UTP的修饰;或者,当编码所述S蛋白的mRNA的序列如SEQ ID NO.19所示时,所述S蛋白的mRNA包括pseudo-UTP的修饰,或包括5-methyl-CTP和pseudo-UTP的修饰;或者,当编码所述S蛋白的mRNA的序列如SEQ IDNO.20所示时,所述S蛋白的mRNA包括pseudo-UTP或N1-Methylpseudo-UTP的修饰。
4.如权利要求1~3任一项所述的mRNA,其特征在于,所述mRNA包含编码来源于SARS-CoV-2病毒的S蛋白、E蛋白和M蛋白的mRNA,所述S蛋白、E蛋白、M蛋白由三条独立的mRNA分别表达得到,表达所述S蛋白、E蛋白、M蛋白的mRNA的摩尔比优选为1:(2~0.5):(2~0.5),例如为1:1:1;
或者,所述mRNA包含编码来源于SARS-CoV-2病毒的M蛋白和E蛋白的mRNA,所述M蛋白和E蛋白的mRNA优选连接后进行表达,所述连接优选通过编码2A肽段的mRNA的序列进行连接,所述2A肽段的氨基酸序列优选如SEQ ID NO.42或SEQ ID NO.43所示,编码所述2A肽段的DNA序列优选如SEQ ID NO.38或SEQ ID NO.39所示,编码所述2A肽段的mRNA序列优选如SEQID NO.40或SEQ ID NO.41所示,所述连接后的mRNA的序列优选如SEQ ID NO.35或36所示,其DNA序列优选如SEQ ID NO.28或29所示;
或者,所述mRNA包含编码来源于SARS-CoV-2病毒的S蛋白的mRNA;
或者,所述mRNA包含编码来源于SARS-CoV-2病毒的S蛋白的RBD结构域的mRNA;
或者,所述mRNA包含编码来源于SARS-CoV-2病毒的M蛋白、E蛋白和S蛋白的mRNA,所述M蛋白和E蛋白的mRNA连接后进行表达,所述连接优选通过编码2A肽段的mRNA的序列进行连接,所述2A肽段的氨基酸序列优选如SEQ ID NO.42或SEQ ID NO.43所示,编码所述2A肽段的DNA序列优选如SEQ ID NO.38或SEQ ID NO.39所示,编码所述2A肽段的mRNA序列优选如SEQ ID NO.40或SEQ ID NO.41所示,所述连接后的mRNA的序列优选如SEQ ID NO.35或36所示,其DNA序列优选如SEQ ID NO.28或29所示,连接后的mRNA与所述S蛋白的mRNA的摩尔比优选为1.5:1~3:1,例如为2:1。
5.如权利要求1~4任一项所述的mRNA,其特征在于,当所述的mRNA包含编码来源于SARS-CoV-2病毒的S蛋白、E蛋白、M蛋白和N蛋白中的两种、三种或四种蛋白或其片段的mRNA时,所述mRNA编码的蛋白自组装成病毒样颗粒。
6.一种DNA,其包含编码来源于SARS-CoV-2病毒的S蛋白、E蛋白、M蛋白和N蛋白中的至少一种蛋白或其片段的DNA,
其中,编码所述S蛋白的DNA的序列如SEQ ID NO.3、SEQ ID NO.4或SEQ ID NO.5所示;编码所述E蛋白的DNA的序列如SEQ ID NO.8所示;编码所述M蛋白的DNA的序列如SEQ IDNO.11所示;编码所述N蛋白的DNA的序列如SEQ ID NO.13所示;
较佳地,所述片段为所述S蛋白的RBD结构域的片段,其DNA序列优选如SEQ ID NO.30所示。
7.组合物,其包含多个或多于一个如权利要求1~5任一项所述的mRNA和/或如权利要求6所述的DNA。
8.一种脂质体纳米颗粒,其包含如权利要求1~5任一项所述的mRNA、如权利要求6所述的DNA和/或如权利要求7所述的组合物;
较佳地:
所脂质体纳米颗粒还包括阳离子脂质和辅助脂质,所述阳离子脂质优选为DLin-MC3-DMA或DOTMA,所述辅助脂质优选为DSPC和/或胆固醇;
和/或,所脂质体纳米颗粒为长循环阳离子脂质体纳米颗粒,优选为经PEG或其衍生物修饰的长循环阳离子脂质体纳米颗粒;所述PEG的相对分子质量优选为2000~5000,例如为2000、3000、4000或5000;更优选为包括DMPE-PEG2000的长循环阳离子脂质体纳米颗粒。
9.一种病毒样颗粒,其由包含如权利要求1~5任一项所述的mRNA、如权利要求6所述的DNA和/或如权利要求7所述的组合物所表达的蛋白自组装而成,优选在细胞中表达所述蛋白,所述细胞优选为293T和/或293A;
较佳地:
所述的病毒样颗粒由编码SARS-CoV-2病毒的S蛋白、E蛋白、M蛋白和N蛋白中的两种、三种或四种蛋白或其片段的mRNA所表达的蛋白自组装而成,优选在细胞中表达所述蛋白;
更佳地:
所述的病毒样颗粒由编码SARS-CoV-2病毒的S蛋白、E蛋白和M蛋白的mRNA,所述S蛋白、E蛋白、M蛋白由三条独立的mRNA分别表达的蛋白自组装得到,表达所述S蛋白、E蛋白、M蛋白的mRNA的摩尔比优选为1:(2~0.5):(2~0.5),例如为1:1:1;
或者,所述的病毒样颗粒由编码SARS-CoV-2病毒的M蛋白和E蛋白的mRNA所表达的蛋白自组装而成,优选在细胞中表达所述蛋白,所述M蛋白和E蛋白的mRNA优选连接后进行表达,所述连接优选通过编码2A肽段的mRNA的序列进行连接,所述2A肽段的氨基酸序列优选如SEQ ID NO.42或SEQ ID NO.43所示,编码所述2A肽段的DNA序列优选如SEQ ID NO.38或SEQID NO.39所示,编码所述2A肽段的mRNA序列优选如SEQ ID NO.40或SEQ ID NO.41所示,所述连接后的mRNA的序列优选如SEQ ID NO.35或36所示,其DNA序列优选如SEQ ID NO.28或29所示;
或者,所述的病毒样颗粒由编码SARS-CoV-2病毒的M蛋白、E蛋白和S蛋白的mRNA所表达的蛋白自组装而成,优选在细胞中表达所述蛋白,所述M蛋白和E蛋白的mRNA连接后进行表达,所述连接优选通过编码2A肽段的mRNA的序列进行连接,所述2A肽段的氨基酸序列优选如SEQ ID NO.42或SEQ ID NO.43所示,编码所述2A肽段的DNA序列优选如SEQ ID NO.38或SEQ ID NO.39所示,编码所述2A肽段的RNA序列优选如SEQ ID NO.40或SEQ ID NO.41所示,所述连接后的mRNA的序列优选如SEQ ID NO.35或36所示,其DNA序列优选如SEQ ID NO.28或29所示,连接后的mRNA与所述S蛋白的mRNA的摩尔比优选为1.5:1~3:1,例如为2:1。
10.一种针对新冠病毒的mRNA疫苗,其特征在于,其包含如权利要求1~5任一项所述的mRNA、如权利要求6所述的DNA、如权利要求7所述的组合物和/或如权利要求8所述的脂质体纳米颗粒;
较佳地,所述mRNA疫苗诱导细胞产生病毒样颗粒;和/或,所述mRNA疫苗还包括佐剂。
11.一种药物组合物,其包含如权利要求1~5任一项所述的mRNA、如权利要求6所述的DNA、如权利要求7所述的组合物、如权利要求8所述的脂质体纳米颗粒、如权利要求9所述的病毒样颗粒和/或如权利要求10所述的mRNA疫苗,和任选地药用载体。
12.一种试剂盒,其包含如权利要求1~5任一项所述的mRNA、如权利要求6所述的DNA、如权利要求7所述的组合物、如权利要求8所述的脂质体纳米颗粒、如权利要求9所述的病毒样颗粒、如权利要求10所述的mRNA疫苗和/或如权利要求11所述的药物组合物。
13.编码2A肽段的mRNA,其序列如SEQ ID NO.40和/或SEQ ID NO.41所示。
14.编码2A肽段的DNA,其序列如SEQ ID NO.38或SEQ ID NO.39所示。
CN202010470599.0A 2020-05-28 2020-05-28 mRNA及包含其的新冠病毒mRNA疫苗 Active CN113736801B (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010470599.0A CN113736801B (zh) 2020-05-28 2020-05-28 mRNA及包含其的新冠病毒mRNA疫苗

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010470599.0A CN113736801B (zh) 2020-05-28 2020-05-28 mRNA及包含其的新冠病毒mRNA疫苗

Publications (2)

Publication Number Publication Date
CN113736801A true CN113736801A (zh) 2021-12-03
CN113736801B CN113736801B (zh) 2023-12-12

Family

ID=78724309

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010470599.0A Active CN113736801B (zh) 2020-05-28 2020-05-28 mRNA及包含其的新冠病毒mRNA疫苗

Country Status (1)

Country Link
CN (1) CN113736801B (zh)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115998854A (zh) * 2023-01-30 2023-04-25 南京市第二医院 一组流感病毒基因的非编码区在制备mRNA疫苗中的应用
CN116474083A (zh) * 2023-02-20 2023-07-25 上海君拓生物医药科技有限公司 一种VLP-mRNA复合多价病毒疫苗及其制备方法和应用
WO2023169500A1 (zh) * 2022-03-09 2023-09-14 中生复诺健生物科技(上海)有限公司 编码新型冠状病毒S蛋白的mRNA疫苗
CN117004671A (zh) * 2023-09-26 2023-11-07 中国医学科学院医学生物学研究所 一种降低副产物dsRNA产量的体外转录体系
WO2024079361A1 (en) * 2022-10-14 2024-04-18 Neuway Pharma Gmbh A PROTEIN- OR PEPTIDE-BASED CAPSULE (PPC), PREFERABLY A VLP, LOADED WITH A MESSENGER RNA (mRNA) AND A METHOD OF PRODUCTION AND PURIFICATION THEREOF

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113293202A (zh) * 2021-07-02 2021-08-24 广东莱恩医药研究院有限公司 一种定量检测机体中mRNA含量的实时荧光定量PCR试剂盒及检测方法和应用
WO2022068846A1 (zh) * 2020-09-29 2022-04-07 深圳市瑞吉生物科技有限公司 新型冠状病毒mRNA疫苗及其制备方法与应用

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022068846A1 (zh) * 2020-09-29 2022-04-07 深圳市瑞吉生物科技有限公司 新型冠状病毒mRNA疫苗及其制备方法与应用
CN113293202A (zh) * 2021-07-02 2021-08-24 广东莱恩医药研究院有限公司 一种定量检测机体中mRNA含量的实时荧光定量PCR试剂盒及检测方法和应用

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
JIA XIA 等: "Towards an effective mRNA vaccine agasinst 2019-nCov:demonstration of virus-like particles expressed from an modified mRNA cocktail", 中国科学院科技论文预发布平台, pages 1 - 10 *
ZHANG YZ 等: "Wuhan seafood market pneumonia virus isolate Wuhan-Hu-1, complete genome", GENBANK DATABASE, pages 045512 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023169500A1 (zh) * 2022-03-09 2023-09-14 中生复诺健生物科技(上海)有限公司 编码新型冠状病毒S蛋白的mRNA疫苗
WO2024079361A1 (en) * 2022-10-14 2024-04-18 Neuway Pharma Gmbh A PROTEIN- OR PEPTIDE-BASED CAPSULE (PPC), PREFERABLY A VLP, LOADED WITH A MESSENGER RNA (mRNA) AND A METHOD OF PRODUCTION AND PURIFICATION THEREOF
CN115998854A (zh) * 2023-01-30 2023-04-25 南京市第二医院 一组流感病毒基因的非编码区在制备mRNA疫苗中的应用
CN116474083A (zh) * 2023-02-20 2023-07-25 上海君拓生物医药科技有限公司 一种VLP-mRNA复合多价病毒疫苗及其制备方法和应用
CN117004671A (zh) * 2023-09-26 2023-11-07 中国医学科学院医学生物学研究所 一种降低副产物dsRNA产量的体外转录体系

Also Published As

Publication number Publication date
CN113736801B (zh) 2023-12-12

Similar Documents

Publication Publication Date Title
CN113736801B (zh) mRNA及包含其的新冠病毒mRNA疫苗
CN111218458B (zh) 编码SARS-CoV-2病毒抗原的mRNA和疫苗及疫苗的制备方法
CN111217918B (zh) 一种基于2,4-二氧四氢喋啶合酶的新型冠状病毒s蛋白双区域亚单位纳米疫苗
CN112266411B (zh) 一种新型冠状病毒疫苗及其应用
WO2021184415A1 (zh) 一种基于幽门螺旋杆菌铁蛋白的新型冠状病毒s蛋白单区域亚单位纳米疫苗
KR20220140528A (ko) Sars-cov-2 mrna 도메인 백신
CN112575008B (zh) 编码新型冠状病毒的结构蛋白的核酸分子以及新型冠状病毒疫苗
CN111217919A (zh) 一种基于火球菌铁蛋白的新型冠状病毒s蛋白双区域亚单位纳米疫苗
CN113943373B (zh) 一种β冠状病毒多聚体抗原、其制备方法和应用
CN117529335A (zh) 爱泼斯坦-巴尔病毒mrna疫苗
CN117355329A (zh) Vlp肠道病毒疫苗
CN113666990A (zh) 一种诱导广谱抗冠状病毒的t细胞疫苗免疫原及其应用
US20230203137A1 (en) Preparation method of artificial antibody
WO2023092069A1 (en) Sars-cov-2 mrna domain vaccines and methods of use
CN114989308A (zh) 新冠病毒嵌合核酸疫苗及其用途
CN113403329B (zh) 一种用于猫冠状病毒的rna疫苗及其构建方法
CN114213548A (zh) 同时诱导抗多种病毒的免疫应答的方法
WO2022103929A9 (en) Nucleic acids, proteins, and vaccines of sars-cov-2
CN115666633A (zh) CpG-佐剂的SARS-CoV-2病毒疫苗
WO2023098679A1 (zh) 预防突变株的新型冠状病毒mRNA疫苗
CN116426543B (zh) 一种新型冠状病毒疫苗及其制备方法
CN115960252A (zh) 新型冠状病毒免疫原性物质、其制备方法和应用
CN115960180A (zh) 2019-nCoV S蛋白的突变体及其基因工程化的mRNA和疫苗组合物
CN113637695B (zh) 靶向刺激体液免疫和细胞免疫的新冠病毒mRNA疫苗
CN115678906A (zh) 经优化的新冠病毒嵌合核酸疫苗及其用途

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant