CN113046327A - 穿山甲冠状病毒xCoV及其应用和药物抗冠状病毒感染的应用 - Google Patents

穿山甲冠状病毒xCoV及其应用和药物抗冠状病毒感染的应用 Download PDF

Info

Publication number
CN113046327A
CN113046327A CN202110172158.7A CN202110172158A CN113046327A CN 113046327 A CN113046327 A CN 113046327A CN 202110172158 A CN202110172158 A CN 202110172158A CN 113046327 A CN113046327 A CN 113046327A
Authority
CN
China
Prior art keywords
leu
xcov
coronavirus
thr
val
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110172158.7A
Other languages
English (en)
Other versions
CN113046327B (zh
Inventor
童贻刚
范华昊
宋立华
安小平
王立钦
刘文丽
刘振东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Chemical Technology
Original Assignee
Beijing University of Chemical Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Chemical Technology filed Critical Beijing University of Chemical Technology
Priority to CN202210386551.0A priority Critical patent/CN114657149B/zh
Priority to CN202210377234.2A priority patent/CN114908062B/zh
Priority to CN202210375336.0A priority patent/CN114908061B/zh
Publication of CN113046327A publication Critical patent/CN113046327A/zh
Application granted granted Critical
Publication of CN113046327B publication Critical patent/CN113046327B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N7/00Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/70Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving virus or bacteriophage
    • C12Q1/701Specific hybridization probes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2770/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
    • C12N2770/00011Details
    • C12N2770/20011Coronaviridae
    • C12N2770/20021Viruses as such, e.g. new isolates, mutants or their genomic sequences
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2770/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
    • C12N2770/00011Details
    • C12N2770/20011Coronaviridae
    • C12N2770/20034Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/136Screening for pharmacological compounds
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A50/00TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
    • Y02A50/30Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Wood Science & Technology (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Genetics & Genomics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Immunology (AREA)
  • General Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Biotechnology (AREA)
  • Biochemistry (AREA)
  • Virology (AREA)
  • General Health & Medical Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Medicinal Chemistry (AREA)
  • Molecular Biology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
  • Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

本发明提供一种从穿山甲中分离的冠状病毒,命名为穿山甲冠状病毒xCoV,其与SARS‑COV‑2的S蛋白同源性达92.5%,xCoV感染细胞的受体与SARS‑COV‑2一致,均为血管紧张素转化酶2(ACE2)。但xCoV不感染人,因此对人而言是非常安全的,可用于抗SARS‑COV‑2病毒的活性药物、疫苗的筛选,还可用于制备抗SARS‑COV‑2病毒的减毒疫苗或灭活疫苗。基于该穿山甲冠状病毒xCoV,筛选获得了多个具有抗冠状病毒活性的活性化合物,并对其中的千金藤素(千金藤碱)、西拉菌素、盐酸甲氟喹(甲氟喹)进行了EC50、CC50及SI的评价,并通过转录组测序分析研究了千金藤素抑制xCoV的机制。

Description

穿山甲冠状病毒xCoV及其应用和药物抗冠状病毒感染的应用
技术领域
本发明属于药物领域,具体涉及一种穿山甲冠状病毒xCoV及其用于抗SARS-COV-2病毒药物的筛选的用途,包含该穿山甲冠状病毒xCoV的药物筛选模型、药物筛选方法,还涉及基于该xCoV筛选出的活性化合物用于制备治疗SARS-COV-2病毒感染性疾病的药物的用途。
背景技术
新型冠状病毒(被世界卫生组织命名为“SARS-COV-2”,先前命名为2019新型冠状病毒或2019nCoV)属于β属冠状病毒,有包膜,颗粒呈圆形或椭圆形,常为多形性,直径60-140nm。其基因特征与SARS-CoV和MERS-CoV有明显区别。目前研究显示其与蝙蝠SARS样冠状病毒(bat-SL-CoVZC45)同源性达85%以上。SARS-CoV-2的防治迫切需要有效的疫苗和特异性治疗,如何快速筛选能够抑制该病毒复制的药物成为迫在眉睫的问题。同时,由于该病毒的传染性极强,如何安全地进行药物筛选等相关研究、保护研究人员不受感染也成为亟待解决的问题。
发明内容
本发明人从海关查获的死亡的穿山甲中分离并培养的一株新型冠状病毒xCoV,称为穿山甲冠状病毒xCoV(在本发明的上下文中也称为“穿山甲xCoV”或者“xCoV”),其全基因组序列分析结果显示与SARS-COV-2的S蛋白同源性达92.5%,是迄今为止成功分离培养的与SARS-COV-2的S蛋白同源性最高的病毒。进一步实验显示穿山甲xCoV感染细胞的受体与SARS-COV-2一致,均为血管紧张素转化酶2(ACE2)。但是该病毒不感染人,因此,对人而言是非常安全的。
本发明提供一种冠状病毒(也称为“穿山甲冠状病毒xCoV”、“穿山甲xCoV”或者“xCoV”),该冠状病毒毒株xCoV于2020年2月14日保藏在中国微生物菌种保藏管理委员会普通微生物中心(地址:北京市朝阳区北辰西路1号院3号,中国科学院微生物研究所),保藏编号为CGMCC No.19295。
根据本发明的穿山甲冠状病毒毒株xCoV,其全基因组核苷酸序列如序列表中SEQID NO:1所示。
根据本发明的穿山甲冠状病毒毒株xCoV,其S基因的核酸序列如序列表中SEQ IDNO:2所示。
根据本发明的穿山甲冠状病毒毒株xCoV,其S蛋白的氨基酸序列如序列表中SEQID NO:3所示。
根据本发明的穿山甲冠状病毒毒株xCoV,其与SARS-COV-2的S蛋白同源性达92.5%。其中,所述SARS-COV-2的S基因的核酸序列如序列表中SEQ ID NO:4所示,其S蛋白的氨基酸序列如序列表中SEQ ID NO:5所示。
其中,xCoV与SARS-COV-2的序列一致性结果在图1中示出。
本发明还提供所述穿山甲冠状病毒毒株xCoV的应用,其用于抗SARS-COV-2病毒的活性药物的筛选与评价,抗SARS-COV-2病毒的疫苗的筛选与评价,以及用于制备抗SARS-COV-2病毒的减毒疫苗或灭活疫苗,以及用于SARS-COV-2病毒感染的诊断和治疗性抗体的制备。其中所述疫苗中还包括药学上可接受的佐剂。
本发明还提供一种用于筛选和/或评价抗冠状病毒活性药物的药物筛选模型,其包括所述保藏编号为CGMCC No.19295的冠状病毒(也称为“穿山甲冠状病毒xCoV”、“冠状病毒xCoV”、“穿山甲xCoV”或者“xCoV”)。
根据本发明的药物筛选模型,其为采用所述穿山甲冠状病毒xCoV感染的哺乳动物细胞,优选Vero E6细胞(非洲绿猴肾细胞)。
根据本发明的药物筛选模型,其中所述模型优选用于筛选和/或评价有抗SARS-CoV-2病毒活性的药物。
本发明还提供一种筛选和/或评价抗冠状病毒活性药物的方法,其采用上述药物筛选模型进行;优选地,该方法用于筛选和/或评价有抗SARS-CoV-2活性的药物。
根据本发明的筛选和/或评价抗冠状病毒活性药物的方法,其包括步骤(1):向所述药物筛选模型中加入待测试的药物并进行培养。
根据本发明的筛选和/或评价抗冠状病毒活性药物的方法,其在步骤(1)之后还任选地包括以下步骤(2a)或步骤(2b),或同时包含步骤(2a)和步骤(2b):
步骤(2a):在显微镜下观察细胞病变;
步骤(2b):测定细胞和上清中的病毒核酸。
根据本发明的筛选和/或评价抗冠状病毒活性药物的方法,步骤(1)中的培养时间可以为12-90小时,如24-72小时,48-72小时,24小时,48小时或72小时等。
在步骤(2a)中,当观察到存在完整的细胞单层或细胞病变不明显时,表明待测试的药物具有抑制病毒复制的活性。
此外,本发明还提供以下化合物中的任一种、两种或三种用于制备治疗冠状病毒感染性疾病的药物的用途:千金藤素(千金藤碱)、西拉菌素、盐酸甲氟喹和甲氟喹。
根据本发明的用途,所述冠状病毒为SARS-COV-2病毒。
此外,本发明还通过Time-of-Addition试验发现了千金藤素、西拉菌素和盐酸甲氟喹对xCoV病毒生命周期的影响,并通过转录组学分析解释了千金藤素抑制xCoV病毒复制的机理。
有益效果
本发明的穿山甲冠状病毒xCoV与SARS-COV-2的S蛋白同源性高,xCoV感染细胞的受体与SARS-COV-2一致,均为血管紧张素转化酶2(ACE2)。并且该xCoV病毒不感染人,因此对人而言是非常安全,可用于抗SARS-COV-2病毒的药物筛选与评价、疫苗筛选与评价、减毒及灭活疫苗的制备。基于该xCoV病毒筛选抗SARS-COV-2病毒的药物对研发人员非常安全,不必担心被感染。基于该筛选模型筛选出了抗SARS-CoV-2的活性药物千金藤素(千金藤碱)、西拉菌素、盐酸甲氟喹(甲氟喹),并证明了千金藤素、西拉菌素和盐酸甲氟喹在xCoV入胞后发挥抑制作用,还解释了千金藤素抗xCoV机制。千金藤素主要通过干扰细胞应激反应,包括内质网应激/未折叠蛋白反应和热休克因子1(HSF1)介导的热休克反应来逆转受感染细胞中大多数失调的基因和通路,从而发挥抗冠状病毒效果。
附图说明
图1示出了xCoV与其他冠状病毒的序列一致性结果。
图2示出了xCoV全基因组与SARS-CoV-2全基因组的进化树分析。
图3示出了xCoV的S基因与SARS-CoV-2的S基因的进化树分析。
图4示出了siRNA敲降ACE2后ACE2 mRNA的表达情况。
图5示出了siRNA敲降ACE2表达后对xCoV病毒感染的影响。
图6示出了本发明的药物筛选的流程。
图7示出了感染复数为0.01的xCoV感染72小时后,不加药时的Vero E6细胞形态图。
图8示出了不加病毒培养72小时后,不加药时的Vero E6细胞形态图。
图9示出了加入终浓度10μM(微摩尔每升)的千金藤素和感染复数为0.01的xCoV感染72小时后的Vero E6细胞形态图。
图10示出了加入终浓度10μM(微摩尔每升)的西拉菌素和感染复数为0.01的xCoV感染72小时后的Vero E6细胞形态图。
图11示出了加入终浓度10μM(微摩尔每升)的盐酸甲氟喹和感染复数为0.01的xCoV感染72小时后的Vero E6细胞形态图。
图12示出了三种化合物对xCoV的抑制作用,10μM的千金藤素抑制xCoV病毒复制的倍数为15393倍,10μM的西拉菌素抑制xCoV病毒复制的倍数为5053倍,10μM的盐酸甲氟喹抑制xCoV病毒复制的倍数为31倍。
图13示出了千金藤素对xCoV的半数有效浓度(EC50)为0.9851μM、对Vero E6细胞的半数细胞毒性浓度(CC50)为39.32μM,以及选择指数(SI)为39.91。
图14示出了西拉菌素对xCoV的半数有效浓度(EC50)为1.908μM、对Vero E6细胞的半数细胞毒性浓度(CC50)为6.227μM,以及选择指数(SI)为3.290。
图15示出了盐酸甲氟喹对xCoV的半数有效浓度(EC50)为2.728μM、对Vero E6细胞的半数细胞毒性浓度(CC50)为10.08μM,以及选择指数(SI)为3.695。
图16示出了千金藤素对xCoV的Time-of-Addition试验结果。
图17示出了西拉菌素对xCoV的Time-of-Addition试验结果。
图18示出了盐酸甲氟喹对xCoV的Time-of-Addition试验结果。
图19示出了千金藤素抗xCoV复制的转录组分析结果。
具体实施方式
以下结合实施例对本发明做进一步描述。需要说明的是,下述实施例不能作为对本发明保护范围的限制,任何在本发明基础上作出的改进都不违背本发明的精神。本发明所用原料和设备,如无特殊说明,均可商业购买。
一、实验方法
1.细胞培养和病毒培养
非洲绿猴肾细胞系Vero E6是从美国模式培养物集存库(ATCC,1586号)中获得的,在37℃、5%CO2的条件下,在含有10%胎牛血清(FBS;Gibco Invitrogen)的DMEM培养基(Gibco)中培养。
将穿山甲分离株xCoV在Vero E6细胞中繁殖,用噬斑实验测定病毒滴度。所有感染实验均在生物安全2级(BLS2)实验室进行。
上市药物库(产品号L1000,含2080种已经上市的药物)和抗病毒化合物库(产品号L1700,含326抗病毒药物)从上海陶素生化科技有限公司购买。所有药物的初始浓度均为10mM(毫摩尔每升)。
千金藤素(T0131)、西拉菌素(T0124)、盐酸甲氟喹(T0860)从上海陶素生化科技有限公司购买。所有药物的初始浓度均为10mM(毫摩尔每升)。
2.探究ACE2是否为xCoV感染细胞受体
转染前一天,在12孔细胞培养板中每孔接种2×105个Vero E6细胞。第二天,当细胞贴壁好时利用ACE2 siRNA smartpool(苏州吉玛基因)通过RNAiMax转染试剂反式转染的方式沉默ACE2基因表达。将细胞分别与2、10和50nM的siRNAs转染液在37℃条件下孵育48小时。48小时后,将细胞与xCoV在37℃共孵育2小时。用PBS洗去未结合的病毒,加入新鲜培养基继续培养24小时。用PBS洗去未结合的病毒,提取总RNA,采用两步法qRT-PCR测定ACE2mRNA和病毒感染情况。
3.利用SARS-CoV-2高度同源的穿山甲冠状病毒xCoV从上市药物库中筛选潜在的抗新型肺炎病毒药物
96孔细胞板中种上2.5×104个Vero E6细胞,24小时后用MOI=0.01的xCoV感染Vero E6细胞,同时向其中加入终浓度为10μM的各种已知药物(2406种上市药物和III期临床试验药物),在第3天时通过显微镜镜下观察细胞病变,对没有明显细胞病变的培养孔提取细胞和上清中的RNA,用qRT-PCR测定细胞和上清中的病毒复制情况。
4.病毒RNA提取及实时定量RT-PCR(qRT-PCR)
根据制造商的说明,使用AxyPrepTM体液病毒DNA/RNA微型制备试剂盒(Axygen,产品编号AP-MN-BF-VNA-250)和AxyPrepTM多用途总RNA微型制备试剂盒(Axygene,产品编号AP-MN-MS-RNA-250G)采集细胞培养上清液和Vero E6细胞进行RNA提取。用带有gDNA酶的HifairⅡ1链cDNA合成试剂盒(上海翊圣生物科技有限公司,产品编号11121ES60)进行反转录,用Hieff-qPCR-SYBR-Green-Master Mix(上海翊圣生物科技有限公司,货号:11202ES08)或两步Taqman探针检测qRT-PCR系统(Applied-Biosystem)进行qPCR,所用引物的序列信息在表1中示出。经测序确认后,由北京瑞博兴科生物科技有限公司将PCR产物插入T载体,产生标准质粒。标准曲线是通过测定质粒连续稀释(103-109)的拷贝数而产生的。SYBR-Green法的qPCR扩增:95℃5min,40个循环,95℃10s,55℃20s,72℃31s。
Taqman法:50℃2min,95℃10min,40次循环,95℃10s,60℃1min,用GraphPad-Prism 8软件分析图12的数据。
本发明的药物筛选流程如图6所示。
5.EC50与CC50检测及Time-of-Addition试验
使用实验方法3筛选出的3种化合物-千金藤素、西拉菌素、盐酸甲氟喹进行试验,在使用MOI=0.01的xCoV感染Vero E6细胞。
EC50检测:Vero E6细胞接种至24孔细胞培养板,细胞密度达到60%-80%时进行试验;药物稀释至200μM,后按两倍比梯度稀释至0.39μM。Vero E6细胞换液后,药物溶液与病毒悬液1:1稀释后加至细胞。试验药物终浓度分别为100μM、50μM、25μM、12.5μM、6.25μM、3.125μM、1.56μM、0.78μM、0.39μM、0.195μM、0μM。37℃5%CO2培养60-72h,观察CPE,提取细胞核酸进行qPCR检测,用GraphPad-Prism 8软件进行数据分析计算EC50
CC50检测:使用Cell-Titer-Blue法进行CC50的检测。Vero E6细胞接种至96孔细胞培养板,细胞密度达到60%-80%时进行试验。药物两倍比梯度稀释,Vero E6细胞换液后加入稀释后的药物。试验药物终浓度分别为100μM、50μM、25μM、12.5μM、6.25μM、3.125μM、1.56μM、0.78μM、0.39μM、0.195μM、0μM。37℃5%CO2培养48h,每孔加20μl Cell-Titer-Blue,分别在0min、30min、60min、120min检测593nm发光强度,用GraphPad-Prism 8软件进行数据分析计算CC50
SI为CC50除以EC50计算得出。
Time-of-Addition检测:Vero E6细胞接种至24孔细胞培养板,细胞密度达到60%-80%时进行试验。选择试验药物浓度为6.25μM。“全时程”实验方法:加入药物-病毒混合液,37℃5%CO2培养2h后换液,加入药物-病毒混合液;“入胞前”实验方法:加入药物-病毒混合液,37℃5%CO2培养2h后换液,加入纯培养基;“入胞后”实验方法:加入纯培养基,37℃5%CO2培养2h后换液,加入药物-病毒混合液。37℃5%CO2继续培养60-72h,观察CPE,提取细胞核酸进行qPCR检测,用GraphPad-Prism 8软件进行数据分析。
6.千金藤素抗xCoV感染的转录组学分析
千金藤素(CEP)试验浓度为6.25μM,使用MOI=0.01的xCoV感染Vero E6细胞。实验设置四个分组:Vero、Vero+Virus、Vero+CEP、Vero+Virus+CEP。培养72h后,收集细胞样品后使用TRIzol进行RNA提取,使用QIAseq FastSelect-rRNA HMR Kit(Qiagen,产品编号334387)去除rRNA,使用NEBNext UltraTMRNA Library Prep Kit for Illumina(NEB,产品编号E7770L)建立mRNA测序文库,使用Illumina Hiseq 2500sequencing system(安诺优达生物科技有限公司)进行RNA测序(RNA-seq)。
FastQC(http://www.bioinformatics.babraham.ac.uk/projects/fastqc/)工具和FASTX工具包中的fastx_trimmer用于去除低质量数据和接头序列;使用HISAT2(v2.1.0)将修剪后的RNA-seq序列映射到参比绿猴基因组ChlSab1.1(GCA_000409795.2);使用SAMtools(v1.5)删除双末端数据重复序列;使用HTseq对每个不同基因进行计数;使用DESeq2鉴定不同实验组之间的差异表达基因;使用Benjamini-Hochberg法调整P值来计算错误发现率(FDR);FDR q值<0.05和|Log2(倍数变化)|>1的基因被认为是差异表达基因;使用R语言的ggplot2软件包绘制火山图。
Gct格式文件(包括Vero对Vero+Virus,Vero+Virus对Vero+Virus+CEP)用作处理文件。基因集包括(1)热休克因子1(HSF1)介导的热休克反应调节、细胞致热性调节、HSF1依赖性反式激活、HYPOXIA、对病毒的防御反应、对病毒的反应、HIF1靶标、脂肪细胞分化和自噬,可从MSigDB、KEGG和Reactome数据库下载,(2)病毒的上/下调基因,是上述RNA-seq数据中差异表达的基因,其FDR q值<0.05和|Log2(倍数变化)|>1。使用Signal2Noise模式运行GSEA4.0.3(https://www.gsea-msigdb.org/gsea/index.jsp)进行1000个排列计算得出的基因组富集P值,得出归一化富集得分(NES)值和FDR值。可视化热图由GENE-E的R软件包绘制。并从MSigDB、KEGG和Reactome数据库绘制热图,以通过途径模式显示所选基因集。
使用Metascape工具(https://metascape.org)对上所述获得差异表达的基因进行基因本体(GO)分析。P值<0.05的途径作为显着富集的途径,最显著富集的途径使用R包ggplot2创建的气泡图展示,使用Metascape网站中Cytoscape绘制每个重要的富集途径的相互作用网络和蛋白质-蛋白质相互作用(PPI)网络。并使用BioGrid和OmniPath对每个给定的基因列表进行详细地PPI富集分析。
表1研究中使用的引物序列
Figure BDA0002939283190000091
Figure BDA0002939283190000101
二、实验结果
通过全基因组和各个病毒编码基因(核苷酸水平和氨基酸水平)比较分析发现:xCoV与SARS-CoV-2高度同源,与SARS-COV-2的S蛋白同源性达92.5%,是迄今为止成功分离培养的与SARS-COV-2的S蛋白同源性最高的病毒(图1)。不论是全基因组水平(图2)还是病毒吸附进入细胞的关键基因S基因(图3),xCoV与SARS-CoV-2的同源性均远高于SARS病毒。
通过加入不同浓度的siRNA特异性敲降ACE2的表达,发现随着ACE2mRNA表达水平逐渐降低(图4),xCoV感染细胞的能力显著逐渐下降,强烈提示ACE2是xCoV进入细胞的受体(图5)。
96孔细胞培养孔中,每个孔中分别加入终浓度为10μM的各种已知药物(2080多种上市药物和326种抗病毒化合物)中的一种化合物和MOI=0.01的xCoV,处理后的Vero E6细胞在37℃、5%的CO2细胞培养箱中培养72小时。此时加入病毒不加化合物的细胞培养孔和绝大多数加入各种化合物的细胞培养孔中,细胞发生明显的细胞病变效应(图7),同时,没有加入病毒和药物的细胞培养中细胞未见任何细胞病变(图8)。然后,在加入终浓度为10μM的千金藤素(图9)、西拉菌素(图10)和盐酸甲氟喹(图11)的病毒感染细胞培养孔中,未见明显细胞病变。强烈提示千金藤素、西拉菌素、盐酸甲氟喹是潜在的强力抗xCoV感染细胞抑制剂。
进一步通过实时定量PCR技术检测发现,与只加0.1%DMSO的对照组相比(所有化合物均溶于DMSO中,因此加入药物后每个细胞培养孔中的DMSO浓度为0.1%),10微摩尔每升的千金藤素、西拉菌素、盐酸甲氟喹在感染复数0.01的xCoV感染细胞72小时后,分别能抑制病毒复制15393倍、5053倍、31倍(图12)。此实验结果已经重复5次,均可重复。
EC50、CC50与SI的结果显示,千金藤素(图13)、西拉菌素(图14)、盐酸甲氟喹(图15)对xCoV的抑制呈现浓度依赖现象。此外,千金藤素(图16)、西拉菌素(图17)、盐酸甲氟喹(图18)均在xCoV进入细胞后发挥抑制病毒作用。
具体地,图16示出了千金藤素对xCoV的Time-of-Addition试验结果,表明千金藤素是在xCoV进入细胞后发挥抑制作用,而不能抑制xCoV的入胞。图17示出了西拉菌素对xCoV的Time-of-Addition试验结果,表明西拉菌素是在xCoV进入细胞后发挥抑制作用,而不能抑制xCoV的入胞。图18示出了盐酸甲氟喹对xCoV的Time-of-Addition试验结果,表明盐酸甲氟喹是在xCoV进入细胞后发挥抑制作用,而不能抑制xCoV的入胞。
进一步的转录组测序分析提示,千金藤素主要通过干扰细胞应激反应,包括内质网应激/未折叠蛋白反应和HSF1介导的热休克反应来逆转受感染细胞中大多数失调的基因和通路,从而发挥抗冠状病毒效果(图19)。
三、讨论
对SARS-CoV-2病毒进行研究需要高水平的生物防护设施,这与研究的迫切需要相冲突。发明人分离的穿山甲冠状病毒xCoV对人体的致病性较低或无致病性,为研究与其密切相关的SARS-CoV-2提供了一种可供选择的模式,供没有生物安全3级设施的研究人员使用。发明人之所以认为这种分离物对人体的致病性低或无致病性,是因为早在2017年,与穿山甲有密切接触的人群中未发现疑似感染;发明人的穿山甲冠状病毒xCoV分离物是在生物安全二级设施中常规培养的。
在2013年从云南的一只亲鼻蝠采集的样本中发现的一种冠状病毒与SARS-CoV-2密切相关,因此,推测蝙蝠可能也是SARS-CoV-2的宿主。近日,华南农业大学的研究人员宣布发现穿山甲(Manisjavanica)是SARS-CoV-2的中间宿主。同样,在2019年10月,一项针对穿山甲的病毒全基因组研究发现了SARS-CoV相关序列,在出现SARS-CoV-2后,该序列被重新鉴定为SARS-CoV-2相关序列。此外,发明人还从已死亡的走私穿山甲中分离培养了一株SARS-CoV-2相关冠状病毒xCoV。通过全基因组和各个病毒编码基因(核苷酸和氨基酸)比较分析发现xCoV与SARS-CoV-2高度同源,与SARS-COV-2的S蛋白同源性达92.5%,是迄今为止成功分离培养的与SARS-COV-2的S蛋白同源性最高的病毒(图1)。不论是全基因组水平(图2)还是病毒吸附进入细胞的关键基因S基因(图3),xCoV与SARS-CoV-2的同源性均远高于SARS病毒。
在本研究中,发明人在SARS-CoV-2相关冠状病毒即穿山甲冠状病毒xCoV模型中进行了抗冠状病毒活性药物的筛选。基于实验室前期研究结果发现,xCoV感染哺乳动物细胞Vero E6(非洲绿猴肾细胞)以后能使细胞产生非常明显的细胞病变。基于这一特点,发明人在前期使用xCoV感染96孔细胞培养板中的Vero细胞,同时向每个细胞培养孔中加入单个的上市药物(2080多种上市药物和326种抗病毒化合物),进行潜在的抑制病毒复制的活性药物筛选。在第3天时通过显微镜下观察细胞病变,结果发现有3种潜在药物对病毒感染细胞抑制明显(3种药物分别是千金藤素、西拉菌素和盐酸甲氟喹)。由于xCoV与目前的SARS-COV-2高度同源,且xCoV感染细胞的受体与SARS-COV-2一致,如果药物对xCoV感染细胞有抑制作用,则其对SARS-COV-2的感染也有抑制作用。
值得注意的是,王一飞等人2003的专利“千金藤素在制备抗SARS病毒药物中的应用”中提到千金藤素对2003年引起非典的SARS-CoV病毒的半数抑制剂量为8μg/ml(13.186μM),即13.186μM的千金藤素终浓度能抑制50%的病毒感染。而发明人的实验结果显示,使用更低浓度的千金藤素(10μM),抑制xCoV病毒复制的能力达到15393倍。因此,千金藤素对xCoV病毒复制的抑制能力至少比对SARS-CoV病毒复制的抑制能力强30786倍。此外,千金藤素对SARS-CoV的抑制(实际上是低效抑制)并不能说明其能强力有效地抑制xCoV和SARS-CoV-2。事实上,发明人已测试了的上市药物库中包含了很多可以有效抑制SARS-CoV病毒复制的药物(如磷酸奥司他韦),但它们(如磷酸奥司他韦)对xCoV和SARS-CoV-2的复制却几乎没有抑制作用。这其中的主要原因之一是由于SARS-CoV病毒与SARS-CoV-2病毒同源性较远,SARS-CoV病毒和SARS-CoV-2病毒在基因组和氨基酸水平差异较大,造成对SARS-CoV病毒有抑制作用的药物很可能对xCoV和SARS-CoV-2没有任何影响(如磷酸奥司他韦)。而xCoV分离自从海关查获的死亡的穿山甲中,与SARS-CoV-2高度同源,是目前可分离培养的冠状病毒中与SARS-CoV-2同源性最高的病毒。因此,本发明中的对xCoV有强烈抑制作用的千金藤素、西拉菌素和盐酸甲氟喹也能够抑制SARS-CoV-2的病毒复制,极有可能成为新型冠状病毒肺炎的特效药,并建议这些药物可用于SARS-CoV-2患者的临床试验。
发明人认为,作为一种潜在的治疗SARS-CoV-2的药物,千金藤素具有特别重大的的药用价值。该药是从防己科植物头花千金藤、地不容中分离提取的双节基异哇琳生物碱,被批准用于白细胞减少症。它具有多种功能,如抑制抗肿瘤药物的外排转运体ABCC10,通过降低质膜流动性抑制HIV-1的进入,与Hsp90的中心部分结合。重要的是,大剂量的这种药物在动物身上具有低毒性,在人类中没有明显的副作用。此外,有研究表明SARS-CoV-2可以引起外周血单个核细胞的细胞应激反应和自噬途径相关基因的富集,而千金藤素可以有效逆转受感染细胞中大多数失调的基因和途径,尤其是内质网应激/未折叠蛋白应答和HSF1介导的热休克应答,由此发挥抗冠状病毒感染效果。鉴于所观察到的对病毒复制的强烈抑制作用和该药物已确立的抗炎作用,发明人认为,千金藤素是治疗SARS-CoV-2感染的一个有前途的候选药物。
序列表
<110> 北京化工大学
<120> 穿山甲冠状病毒xCoV及其应用和药物抗冠状病毒感染的应用
<130> CPCN21110144
<150> 2020100968681
<151> 2020-02-16
<160> 20
<170> SIPOSequenceListing 1.0
<210> 1
<211> 29837
<212> DNA
<213> xCoV
<400> 1
gctttaactt tttacaaatc ccaggtagca aaaccaacca actctcgatc tcttgtagat 60
ctgttctcta aacgaacttt aaaatctgtg tggctgtcac ttggctgcat gcctagtgca 120
ctcacgcagt ataataataa ttaattactg tcgttgacag gaaacgagta actcgtccgt 180
cttctgcaga ctgcttacgg tttcgtccgt gttgcagtcg atcatcagca tacctaggtt 240
ttgtccgggt gtgaccgaaa ggtaagatgg agagccttgt ccctggtttc aacgagaaaa 300
cacacgtcca actcagtttg cctgttcttc aggttcgcga cgtgctagta cgtggctttg 360
gagactccgt ggaggaggct ctatcagagg cacgtcaaca tcttcttgac ggcacttgtg 420
gcataatcga tgttgaaaag ggagtactcc cccaactcga acagccctat gtgtttgtca 480
aacgttctga tgcccgaact gctcctcacg gccatgtaat ggttgaattg gtggcagaac 540
tcgatggtgt tcagtacggt cgtagcggtg agactcttgg tgttcttgta ccacatgttg 600
gtgaaacacc tgttgcttat cgcaaaattc ttctccgtaa gaatggtaat aagggagccg 660
gtggtcatag tttcggcatc gatctaaagt cttatgactt aggtgacgag cttggcactg 720
atcccattga tgactttcaa gtcaactgga acactaaaca tggcagtggt gtaactcgtg 780
agctcatgcg tgagcttaat gggggcgcat acactcgcta tgtagacaat aacttctgtg 840
gccctgacgg ctaccctctt gagtgcatca aagacttatt agctcgtgct ggaaagtctt 900
cttgctcttt gtccgaacaa ctggacttta ttgacactaa aagaggtgtg tactgctgcc 960
gtgaacatga acatgaaatt gtttggtaca cggaacgctc cgacaagagc tacgaattac 1020
agacaccttt tgaaatcaaa ttggcaaaga aatttgacaa tttcacaggg gaatgtccaa 1080
actttgtctt cccactaaat tctacaatca agaccattca accacgtgtt gaaaagaaaa 1140
agcttgaggg ttttatgggt agaattcgat ctgtctatcc tgttgcatca ccaaatgaat 1200
gcaacccaat gcacctttcg acgcttatga agtgtgaaca ttgtagtgaa acttcatggc 1260
aaactggtga cttccttaaa gccacttgtg aattttgtgg tactgaaaat caagtcaaag 1320
aaggacctac cacttgtggt taccttcctc aaaatgctgt agtaaaaatt ttttgtccag 1380
catgtcataa tccagaaatg ggacctgagc acagtctcgc agaataccat aatgaatctg 1440
gtattaaaac cactcttcgt aaaggtggtc gtaccaaagc atttggagga tgtgtgttct 1500
cttatgtggg ctgtcacaac aagtgtgcct attgggtgcc tcgtgctgct gctaacgtag 1560
gatgtaacca cacaggagtt gtgggagaag gttctgaaag tctcaatgat aaccttcttg 1620
aaatacttac taaggagaaa gtcaacatta acattgttgg tgactttaaa ctgactgaag 1680
agatcgccat aatcttggca tctttttctg catccacgag tgctttcgtg gaaactgtga 1740
agggcttgga ttacaagtct ttcaaacaaa ttgttgaatc ctgtggtaac tttaaagtaa 1800
ccaagggaaa attcaagaag aatgcttgga atattggtga accaaagtcc atactgagcc 1860
ctctgtatgc atttccctca gaagctgctc gtgtggtacg ttccattttt tcacgcactc 1920
ttgaaactgc tcaacactct gtgcgtgtct tgcaaaaggc cgctattaca attctggacg 1980
gaatttcaca gtactcactc agattgattg atgctatgtt gttcacgtct gaacttacaa 2040
cagacagtat cgtagtgatg gcatacgtca caggtggtgt tgtacaaatg actacacaat 2100
ggcttaccaa tatttttggt actgtgtatg aaaaattgaa acctattctt gactggcttg 2160
aagagaagtt caaggaaggg atagagtttc ttaaggacgg ttgggagatt gtaaaattca 2220
tcacaacctg ttcttgtgaa atcattggtg gacagcttgt agcattcacc actgaactta 2280
aagacagtgt gaagaaattt ttcaaactgg ttaacaaatt tcttgctctt tgtgctgatt 2340
ccatcgtcat tggtggtgca aaacttaaag ctttgaattt gggagaaacc tttgtcgcac 2400
actccagagg actctacaaa aagtgtgtga aatccagagg agactctggt ttactcatgc 2460
ctctaaaagc accaaaagaa gttatcttcc ttgatggaga aactttgcct acagaggtac 2520
tttcagaaga agtaatacta aaaactggtg aattacaacc acttgaggaa ccaactgcac 2580
aggcagttga agtaccactc gtaggtacac cagtttgcat taatggatta atgctgcttg 2640
aaattaaaga tactgaaaag tattgtgctc ttgcacctaa catgatggtc actaataata 2700
ccttcactct aaaaggtggt gcaccaacca aagtcacatt tggtgatgac acagtcattg 2760
aagtccaagg ctacaagaat gtgaatatca catttgaatt ggatgaacga gtagataaag 2820
tactcaacga aaagtgctct aactacactg tagaacttgg aaccaacata gatgaattgg 2880
cttgtgttgt agctgaggca gtaataaaga ctttgcaacc tgtttcagaa ttacttacac 2940
cgctaggcat tgacttagac gagtggggtg ttgcaaccta ttacttgttt gacgagtctg 3000
gtgagtatac tttgtcttca cgtatgtatt gttcattcta tcctccagat gaggattatg 3060
aagaagaata cagcgaagag gaacaacctg aacaaccaac tcaatatgag tatggtactg 3120
aatctgatta caaaggtttg cctttggaat ttggtgcatc ttctgtacaa caacaggaag 3180
aacaagaaga agattggtta gaaactgaag ctgaagtggt ggaacaagaa gttacaccaa 3240
ctgagcaaga ggaagagcta ccaatcactg aaattgttcc tgcagtggaa caaactacaa 3300
ttgtagagct agaatgtgat aatttcactg gttatttaaa actcactgat aatgtttcca 3360
ttaaaaatgt ggatattgta agtgaggcta aaaatgtaaa acctacaata gtggttaatg 3420
ctgctaatgt acacctaaaa catggtggtg gtgttgctgg tgctcttaac aaagctacta 3480
acaacgctat gcaaattgag tctgatgact acattgccag aaatggacca ctaaacgtgg 3540
gtggtagttg tcttctaaat ggacacaatt tggctaaaaa ctgccttcat gttgttggtc 3600
ctaatctcaa caagggtgaa gacattcaat tacttaaagt tgcctatgaa aatttcaatc 3660
atcatgaaaa attacttgca ccacttctct cagcaggcat ctttggtgca caaccaatac 3720
agtctttgaa ggtgtgtatt gaaacagtac gcacacaagt ctttttagct gtctttgaca 3780
aggacctcta tgaagaactt gtagctagct ttttagaaat gaaaagtgag actaaagtac 3840
aagatcactt tgacgtcgtt gagactaagg ttgaaattac acctgaagaa tctgcttcaa 3900
gtgagaaacc taccaaggaa gagcctaaaa aggtgaaacc ttgtattgaa gaagttacaa 3960
ctactctaga agaaactaag tttcttacag aaaacttgtt actgtatgca gacattaatg 4020
gtaatctgta tccagattca accagtcttg tggaaaatgt tgatgtcacc ttccttaaaa 4080
aggatgctcc ttatatagta ggtgacataa ttactagtgg taatttaaca accgttgtca 4140
taccaacaaa gaaagcaggt ggtactacag aaatgcttgc aaaggcattg cgtaaagtac 4200
ctactgacca ttatataacc acctaccccg gacagggttg cgttggttat acaattgagg 4260
aagcaaaaac agctcttaag aagagtaaga gtgcttatta tgtattaccc tctataattc 4320
caaataagaa agaagaaatt cttggtactg tttcttggaa cttgcgtgaa atgcttacgc 4380
atgctgaaga aacacgtaaa ttaatgccta tttgcatgga tacaaaggct ataatgtcta 4440
ctgtgcaaag gaagtataag ggtattaaga tacaggaggg agtcgtggac tacggtgtaa 4500
ggttttactt ctatactagt aaaacacctg tagctacact tattgcaact cttaattcat 4560
taggagaaac cttggtcaca atgcctttgg gttatgtgac acatggttta aatttagaag 4620
aggctgctag gtatatgaga tcactcaaag tacccgcaac cgtttctgtt tcttcaccag 4680
atgctgttac agcatataat ggttacctta cttcctcttc aaagactcct gaagagcatt 4740
tcatagaaac catctcacta gctggttcat ataaagactg gtcctattca ggacaggcta 4800
ctgaattagg tattgaattt cttaaaagag gtgacaaagt tgtctaccac acaactagta 4860
aaccaatcac tttccacatg gatggtgagg ttatcaacat tgacagtctt aagacactcc 4920
tagctcttag ggaagttaag accattaagg tgtttaccac agttgacaac attaatcttc 4980
acactcaagt tgtggacatg tctatgactt atggacaaca gtttggtcca acctacttgg 5040
acggagctga cgttacaaag attaaacctc atgcatctca tgacagcaag acattttatg 5100
tgttgcctaa tgatgataca ctacgcagtg aggcttttga gtactatcac acaactgatg 5160
aaagttttct aggtagatac atgtcagcat taaatcatac taagaaatgg aaatttccac 5220
aggttaatgg tttaacatcc attaaatggg cagataacaa ttgttaccta gccacagcct 5280
tattaactct tcaacagata gaattgaagt ttaatccacc agcattgcaa gacgcctact 5340
acagggctag agttggtgat gctgcaaatt tctgtgcact tatacttgct tactgtaata 5400
aaacagtggg tgagctaggt gatgtaagag aaacaatgag tcatttgttt caacatgcca 5460
acttggattc ttgtaaaaga gtcctaaatg tggtgtgtaa aacttgtgga cagcaacaaa 5520
ctaccttaaa gggtgtagag gctgtaatgt atatgggtac actttcttat gaacaactta 5580
agagaggtgt aactgtaccg tgtgtttgtg gaagacaagc tacacagtat ttagtacaac 5640
aagagtcatc ttttgttatg atgtctgcac cacctgctga atataaacta aagcatggta 5700
ctttcttgtg tgctagtgag tatactggta attaccagtg tggtcattat aaacacatca 5760
cttcaaagga aaccttgtat gtcatagatg gtgcattgct cagcaaaacc tcagagtaca 5820
aaggccctgt tacagatgtt ttctataaag aaaacagcta cacaacaacc ataaaaccaa 5880
ttgtttataa actagacggt gttgtgtgta cagaaattga tcctaaattg gatggttact 5940
ataaaaagga taatgcctat tttactgaac agccaattga tttagtgcca actcaacctt 6000
acccaaactc aaactttgac aatttcaagt ttgtttgtga caacaccaaa tttgctgatg 6060
acttaaacca gatgtctggt tataagaagc ctgcttctcg tgagcttaag attacttttt 6120
ttcctgactt gaatggtgat gtagtggcta ttgattataa acattacaca ccttcattca 6180
agaaaggtgc taagttgttg cacaagccta ttgtatggca tgtgaataac acaattaaca 6240
aagcaacgtt taaaccaaat acttggtgct tacgttgtct ttggagtact aaaccagttg 6300
aaacgtcaaa tatttttgat gttctgcaat cagaggacac acagggaatg gaaactcttg 6360
cctgtgagga cactaaactt gtcactgaag aagtagtgga aactcctacc atacagaaag 6420
acatagtaga gtgtgatgtg aaaactaccg aagttgtagg tgacgtcata cttaaaccag 6480
cacaagacgg tgtaaaaata acagaagaag ttggtcatga agatctaatg gctgcttatg 6540
tagacaatac tagtctaaca attaagaaac ccaatgaatt atcagtaatg ttgggtctaa 6600
aaactttaaa aactcacggt ttggctgctg ttaatagtgt cccttgggat actattgtta 6660
cttatgccaa accgtttctt aataaggtaa ctagtgttgc tgcaagtgga gttgcgcgtt 6720
gtttaaaccg catctgtgtt aactatatgc cttatgtttt aactttgttg ctgcaattct 6780
gtacttttac tagaagtact aattctagaa tcaaagcatc tatgccaact actatagcta 6840
aaaatacggt taagagtgtt ggtaagttct gtttagaagc ctcatttaat tatttgaagt 6900
ctcctaattt ttctaaactc ataactatta tagtatggtt tcttttgtta agtgtttgtc 6960
taggttcttt aatctattca agtgctgctt taggtgtttt gatgtctaat ctaggtatgc 7020
cgtcatactg tacaagttac agagatggtt atctaaactc tactaatgtc acaacaacag 7080
cttactgtac gggttctata ccgtgtagtg tctgtcttag tggtatggat tctttagatg 7140
cttatcctgc tctagaaact attcaagtta ccatttcttc ttttaaatgg gatttaactg 7200
cttttggcat tattgcagag tggtgtttgg catatattct ctttactagg ttcttttatg 7260
tactaggttt agccgcaatt atgcaattgt tctttggcta tttcgctgta cattttatta 7320
gtaattcttg gcttatgtgg ctcataatta atcttgtaca aatggcccct atttcagcta 7380
tggttagaat gtatatcttt ttcgcatcat tttattatgt gtggaagagc tatatacatg 7440
ttgttgacgg ttgtacctca tctacttgta tgatgtgtta caaacgtaat agagctacaa 7500
gggttgaatg cacaaccatt gtaaatggtg taagaagatc attttatgtc tatgctaatg 7560
gaggtaaagg attttgtaaa ctacataact ggaattgtgt caattgtgat actttctgtg 7620
caggtagtac ttttattagt gatgaagtcg caagagactt gtccctacaa tttaagagac 7680
ccattaatcc tacagaccag tcatcttatg tagtggatag tgtagctgtg aaaaatggtt 7740
cgctgcacct ctactttgac aaggctggcc agaaaaccta tgaaagacat tctctttctc 7800
actttgtcaa tttagacaac ttgagagcta ataacactaa aggatcaata cccattaatg 7860
tcattgtgtt tgatggtaag tctaagtgtg atgaatcatc agctagagca gcttctgttt 7920
attacagtca gcttatgtgt caacctatat tgttacttga ccaggcgtta gtgtctgatg 7980
ttggtgacag tgcagaagta gctgttaaaa tgtttgatgc ttatgttaat acattctcat 8040
caacttttaa cgtgcctatg gataagttaa aaactctcat tgcaacagct gagactgaac 8100
ttgctaagaa tgtgtcttta gataatgtcc tttcaacatt tatctcagca gctcgtcaag 8160
ggtttgttga ttctgatgtt gatactaagg acgttgtgga atgtctaaaa atttctcatc 8220
aatcagacat tgaagttaca ggtgacagtt gtaataacta tatgctcacc tataacaaag 8280
tggaaaacat gacgcctaga gatctaggtg cttgtattga ttgcagtgca cgtcatatta 8340
atgcacaagt agcaaaaagt cacaacattt ctttgatttg gaacattaaa gatttcatgt 8400
cgctgtctga acaactgcgt aaacaaatac gtagtgctgc taagaagaat aacttgcctt 8460
ttaagttgac atgtgcaact actagacaag ttgttaatgt agtaacaaca aagatagcac 8520
ttaaaggtgg taaatttgtt acaaattggt ttaagtactt gcttaaagcc acattagttt 8580
gtgttgttat agcttgtgtc ttttacttta ttacacctgt acacgtgctt actaagcatg 8640
gtgattttgc agatgaaatc attggttaca aagctattga agatggtgtc acacgtgaca 8700
tttcatctaa cgacaattgc tttgctaata aacacgttgg atttgactca tggtttagtc 8760
aacgtggtgg ttcttatact aatgataaga cttgtccaat tgtggctgcc gtcataactc 8820
gtgatgtagg atttgtagtt cctggtttac caggaacaat tttccgtaca ttaagtggtg 8880
actttttaca tttcttacct agagtgttta gtgctgttgg caatatttgc tacacaccat 8940
ccaaacttat agagtacact gacttcgcaa catcagcctg tgttttagca gctgaatgta 9000
ccatattcaa agatgcagct ggaaagcctg tgccatattg ttatgacact aatgtgctcg 9060
aaggttctgt accttatgaa tcactccgtc cagacacacg ttatgtcttg atggatggtt 9120
ctattataca attccctaac acgtaccttg aaggttctgt tagagttgtt acaacttttg 9180
actctgagta ctgtagacat ggtacttgtg aaaaatctga agctggcatc tgtgtttcca 9240
ctagtggtag atgggtgctt aataatgatt attatagatc attacctggt gtgttttgtg 9300
gtgttgattc tgtaaatctc ttaacaaata tgtttacacc tttgattcaa cctattggtg 9360
ctttagacat atcagcttca attgttgcag gtggtttagt tgctatattt gtaacttgtc 9420
ttgcatacta ttttatgagg ttcaggagag cttttggcga atacagtcat gtagttgcct 9480
ttaatactct cttgtttttg atgtccttta ctgtactctg tcttacgcct gtgtatagtt 9540
tcttaccagg tgtttattca gttttttatt tgtacttgac attttatctt actaatgatg 9600
tttcattttt agctcatgtt caatggatgg tcatgttcac tcctttagta cctttctgga 9660
ttacaattgt ttatgtcatt tgtatatcta ctaagcattg ttactggttc tttagtaatt 9720
accttagacg tagagttgtc tttaatggta cttcctttag cacttttgaa gaagcagctt 9780
tgtgtacatt cttgcttaac aaggaaatgt atcttaaatt gcgtagtgaa actttacttc 9840
cactgacgca atataataga tacttagcgc tttacaacaa gtacaaatac tttagtggag 9900
ccatggacac aactagctac agagaagcag catgctgtca tcttgctaag gctctaaatg 9960
atttcagtaa ctcaggttct gatgtgctct accaaccacc acagacatcc attacatcgg 10020
ctgtccttca aagtggattt agaaaaatgg cttttccatc tggtaaggta gaaggttgta 10080
tggtgcaagt tacttgtgga acaactacac ttaatggtct ttggcttgat gatgtggtct 10140
attgtccaag acatgtgatc tgcacagctg aagatatgct taatccaaat tatgaggatt 10200
tgcttattcg taaatctaac cataattttc tggtacaagc tggtaatgtt catttgagag 10260
ttatcggaca ttctatgcaa aattgtgttc ttaagttgaa agttgacgct gccaacccta 10320
agacaccaaa atataagttt gttcgaattc aacccggaca gactttttca gtattagctt 10380
gttacaatgg ttcaccatca ggtgtttacc agtgtgctat gagacctaat tttactatta 10440
aaggatcatt ccttaatggt tcttgtggta gtgttggttt taacatagac tatgactgtg 10500
tctctttttg ctacatgcat cacatggaac ttccaacagg agtacatgcg ggcacagatt 10560
tagaaggtac cttctacgga ccttttgttg acagacagac agctcaagca gctggtacag 10620
acacagtcat tactataaat gttttggctt ggttgtatgc agctgttatt aatggagata 10680
gatggtttct taacagatac acaactactc ttaatgattt caacttagtt gctatgaagt 10740
tcaactatga acctctcaca caagatcatg ttgatattct aggaccacta tcagctcaaa 10800
ctggtgttgc tgtcttagat atgtgtgctt cattaaagga attgctccaa aatggtatga 10860
acggtcgcac tatcttaggt agtgccatat tagaagatga gtttacacca tttgacgttg 10920
ttagacaatg ttcaggtgta acttttcaaa gtgctattaa aagaactgtc aaaggtactc 10980
accattggtt gttgttaaca atcttgacat ctcttcttgt attggttcaa agtactcaat 11040
ggtctttgtt cttctttgtt tatgaaaatg ccttcttgcc tttcgcttta ggtataattg 11100
ctatgtctgc ttttgctatg atgtttgtta agcataagca tgcattcttg tgtctattcc 11160
tgttaccttc cttagctact gtagcttact ttaatatggt ctacatgcct gctagttggg 11220
tgatgcgtat catgacttgg ttggacatgg ttgataccag cttgtctggt tataaactta 11280
aggactgtat catgtatgca tcagctatta tcttactcat acttatgaca gcaagaactg 11340
tttatgatga tggtgctagg cgtgtatgga cactaatgaa tgttcttaca cttgtttata 11400
aagtctatta tggtaatgct ttagatcaag caatttctat gtgggctctt attatctctg 11460
tcacctctaa ctattcaggc gttgttacaa ccgtcatgtt cttggctaga ggtattgtct 11520
ttatgtgcgt tgagtattgt ccaattctct ttattacagg taacacctta cagtgtataa 11580
tgttggtgta ctgcttttta ggctattttt gtacttgtta ttttggcctc ttttgtttac 11640
tcaatcgtta ctttagactt acccttggtg tttacgatta tctcgtttcc acacaagagt 11700
ttagatatat gaattcacaa ggtcttttac cacctaagaa cagcatagat gccttcaaac 11760
taaatgttaa gcttttaggt attggtggca aaccctgtat caaagtagca actgttcaat 11820
caaagatgtc agatgtgaaa tgtacttctg tagtccttct ctcagtttta caacaactta 11880
gagttgaatc atcttcaaag ttgtgggcac agtgtgtgca attgcacaat gatatacttc 11940
ttgcaaagga caccactgaa gcatttgaaa aaatggtttc attactgtct gtgttgctat 12000
ccatgcaagg tgctgtagac ataaacaaac tctgtgaaga aatgttggac aacagagcaa 12060
cattacaggc tattgcttca gaatttagtt ctttaccatc ctatgctgcc tttgctacag 12120
ctcaagaagc ttatgagcaa gcggtggcta acggtgattc tgaagtggtt cttaaaaagt 12180
taaagaaatc tctgaatgtg gcaaagtctg aatttgaccg tgatgcggcc atgcagcgta 12240
agctagaaaa gatggctgat caagctatga cccaaatgta caaacaggca cggtctgaag 12300
acaagagggc aaaagtcact agtgcaatgc aaactatgct tttcactatg cttagaaaac 12360
ttgataatga tgctctaaac aacattatca ataatgccag agacggttgt gttccactga 12420
acataatccc ccttactact gcagccaaac taatggttgt tgtacctgac tataacacct 12480
ataaaaatac ttgtgaaggt agtactttta cttatgcctc agcactttgg gaaattcaac 12540
aagttgttga tgcagatagc aaaatagtcc aacttagtga aattactatg gacaattctc 12600
ctaatattgc ttggcctctt attgtaacag ctttaagagc caattcagct gtcaaacttc 12660
agaataatga actgagtccc gtagcacttc gacagatgtc atgtgctgca ggtactacac 12720
aaacagcttg taatgaggat aatgcattag cctactataa cacatcaaag ggaggtaggt 12780
ttgttttggc attactatct gatcttcaag atctcaagtg ggccagattt cctaaatctg 12840
atggtactgg caccatttat acagagctgg aaccaccttg taggtttgtt acagacacac 12900
caaaaggacc taaagtaaag tatttgtact tcattaaggg tttgaataat ttgaatagag 12960
gtatggtact gggcagctta gctgctactg tacgtttaca agctggtaat gcaacagaag 13020
tgcctgccaa ctcaactgtt ctttctttct gtgcatttgc tgtagatgca tcaaaagctt 13080
acagagacta cctagcaagt ggaggacaac caataacaaa ttgtgttaag atgttgtgta 13140
cacatacagg tactggtcag gcaataactg taacaccgga agccaatatg gatcaagaat 13200
cctttggtgg tgcttcttgt tgcttgtact gtagatgcca catagatcat cctaacccta 13260
aaggttactg tgagcttaaa ggtaagtatg tacaaatacc taccacttgt gctaatgacc 13320
cagtgggttt tacacttaaa aacacagtct gtaccgtctg cggcatgtgg aaaggttatg 13380
gctgtagttg tgatcaactc cgcgaaccta tgcttcagtc tgctgatgca cagtcgtttt 13440
taaacgggtt tgcggtgtaa gtgcagcccg tcttacaccg tgcggcacag gcacaagcac 13500
tgatgtcgtg tatagggctt ttgacatcta caatgaaaaa gtagctggtt ttgctaagtt 13560
ccttaaaaca aattgttgcc gttttcaaga aaaagacgaa gatggtaacc tgatagattc 13620
ctacttcata gttaagagac atactttctc taactatcaa catgaagaag ctatttataa 13680
cttgcttaaa gattgtccgg ctgttgctgt tcatgatttt ttcaagttta gagtagatgg 13740
tgacatggta ccacacatat cacgtcaacg tctaactaaa tacacaatgg cagacttagt 13800
ctatgcctta cgtcactttg acgaaggtaa ttgtgacact cttaaagaaa tacttgtcac 13860
atacaattgt tgtactgatg actattttaa taagaaggat tggtatgatt ttgtagagaa 13920
tcctgacatt ttacgcgtat atgctaactt aggtgagcgt gtacgtcaag cattattaaa 13980
gactgtacag ttttgcgatg ctatgcgtga tgcaggtatt gtaggtgtac taactctaga 14040
taatcaagat ctcaatggga actggtatga tttcggagat ttcatacaga ctacaccagg 14100
tagtggggtt cctattgttg attcttatta ttcattgcta atgcctattc tcacacttac 14160
gagggcatta gctgctgagt ctcatctaga cgctgatttg acaaaacctt atgtaaaatg 14220
ggatttgtta aaatatgatt tcacggaaga aaggttaaac ctttttaacc gttatttcaa 14280
gtattgggat caaacctacc acccaaattg tgttaactgt ttggatgaca gatgcattct 14340
gcattgcgca aactttaatg tgttattctc tactgttttt ccaccaacaa gttttggtcc 14400
attagtgaga aaaatttttg ttgatggtgt accttttgta gtttcaacag gttaccactt 14460
cagagagcta ggtgttgtac ataatcaaga tgtaaacata catagctcga gacttagttt 14520
taaggaacta ttagtgtatg ctgctgatcc tgctatgcat gcagcttctg gtaatctttt 14580
gctagacaaa cgcactacat gcttttcagt agcagcacta acgaacaatg ttgcttttca 14640
aactgtcaaa ccaggtaatt ttaacaaaga cttttatgac tttgctgtct ctaaaggctt 14700
ctttaaagaa gggagttctg ttgaactcaa acatttcttc tttgcccaag atggtaatgc 14760
tgctattagc gattacgact attatcggta caatttacca actatgtgtg atatccgaca 14820
gctactattt gtagtagaag ttgttgataa atattttgat tgttatgacg gtggttgtat 14880
taatgcaaac caagtcatag taaacaattt agataaatct gccggatttc catttaacaa 14940
atggggaaaa gccagacttt attatgattc tatgagctat gaggatcaag atgcactctt 15000
cgcttatact aagcgtaatg tcatccctac tataacccaa atgaatctta agtatgccat 15060
tagtgctaaa aatagagctc gcaccgttgc aggtgtttct atttgtagta ctatgactaa 15120
tagacagttt catcaaaaac ttttgaaatc catagccgcc acaagaggtg ccactgttgt 15180
catcggaact agtaaattct atggtggctg gaacaatatg ttaaaaactg tttacagtga 15240
tgtagaaaat ccacacctta tgggttggga ttatccaaaa tgtgatagag ccatgcctaa 15300
catgcttagg ataatggctt ctcttgttct tgctcgcaaa catactactt gctgtagttt 15360
gtcacatcgt ttctatagat tagctaacga atgtgcacaa gttttaagtg aaatggtcat 15420
gtgtggcggt tcactatatg tgaaaccagg tggtacatct tcaggagatg ccacaactgc 15480
ttatgctaat agtgtcttca acatttgtca ggctgttact gccaatgtga atgcacttct 15540
atcaactgat ggcaacaaga ttggcgataa gtatattcgc aatcttcaac acagacttta 15600
tgaatgtctc tataggaata gagatgttga tacagacttt gtcaatgaat tttacgctta 15660
cttgcgtaaa catttttcaa tgatgatact ttctgatgat gctgttgttt gctttaatag 15720
cacctacgca tcacagggtc ttgtagctag cataaagaat tttaaatcag ttctttatta 15780
tcaaaataat gtttttatgt ctgaggcaaa atgctggact gagactgacc ttacaaaggg 15840
acctcatgaa ttttgctctc aacacactat gctagttaaa caaggtgatg attatgtgta 15900
cttgccctat cctgatccat cacgcattct aggcgcaggt tgttttgtcg atgacattgt 15960
caagacagat ggtacactaa tgattgaaag atttgtgtca ttggctattg atgcttatcc 16020
acttactaaa catcctaatc aggagtatgc tgatgtcttt catttgtatt tacaatacat 16080
acgaaagtta catgatgaac tcacaggaca catgttagac atgtattctg ttatgcttac 16140
taatgatagt acttcaaggt attgggagcc agagttctat gaagcaatgt acacacctca 16200
tacagtctta caggctgtgg gagcttgtgt tctctgcaat tcacagactt ccttaagatg 16260
tggtgcgtgt atacgtagac ccttcttatg ctgtaaatgt tgttatgacc atgtcatatc 16320
aacatctcat aaattggttt tgtctgttaa tccgtatgtt tgcaatgcca caggttgtga 16380
cgtcacagac gttacacaac tttatttagg aggtatgagc tattattgca aagcacataa 16440
accgcctatt agctttcctc tttgtgctaa tggacaggtt tttggtttgt acaaaaacac 16500
atgtgttggt agcgataatg ttaccgactt taatgctata gctacatgtg attggacaaa 16560
tgctggtgat tacattcttg cgaacacctg cacagaaaga cttaaacttt ttgctgctga 16620
aacacttaaa gcaacagagg agaccttcaa actatcttat ggtattgcca ctgtacgtga 16680
agtactgtca gatagagaat tatatctttc ttgggaagta ggaaaaccta gaccaactct 16740
caatagaaat tatgtgttta ctggttacag agtaactaag aatagtaaaa cacaaattgg 16800
tgaatacact tttgaaaaag gtgattatgg tgatgctgtt gtttaccgtg gtacaacaac 16860
ttataaatta aacgtgggtg actattttgt gttaacatca cacacagtca tgccactgag 16920
tgcaccaaca ttagtgccac aggagcatta tgttaggatt actggcttgt accctacact 16980
caacatttca gatgagtttt ctagcaacgt agctaattac cagaaagttg gtatgcaaaa 17040
atactcaact ttacaaggac caccaggtac tggtaaaagt cattttgcta taggattagc 17100
attgtactat ccttcagcac gcattgttta tacagcatgt tcacatgcag ctgtagatgc 17160
actgtgtgaa aaagcattaa aatatctgcc cattgataaa tgtagcagaa ttataccagc 17220
acgtgctcgt gttgaatgct ttgacaaatt caaagttaat tcgacactag agcaatatgt 17280
gttctgtaca gtgaatgcac taccagaaac aacagctgac attgtggttt ttgatgagat 17340
atcaatggcc actaattatg atttaagtgt tgttaatgct aggttaaggg caaaacacta 17400
tgtatatata ggtgaccctg cacaattgcc agcaccacgc acgttgctca ctaagggtac 17460
tctagaacct gagtacttta attctgtttg cagattaatg aaaactatag gtcctgatat 17520
gtttttaggt acttgtagaa gatgtcctgc tgaaatagtt gacactgtaa gtgctctagt 17580
ttatgataat aaacttagag ctcataaaga taaatcacaa cagtgcttta aaatgtttta 17640
caagggtgtt ataacacatg atgtctcatc tgctattaac agacctcaaa ttggtgtagt 17700
tagagaattt ctaacacgca accctacttg gagaaaggct gttttcatct ctccttataa 17760
ttcacagaat gctgttgctg ccaaaatatt aggtttacca acacaaactg tggattcatc 17820
acagggttct gagtatgact atgtcatatt cacacaaaca actgaaactg cacactcttg 17880
taatgttaac cgctttaatg tggccattac tagagcaaaa attggtatac tttgcataat 17940
gtctgataga gacctttatg acaaattaca atttacaagc cttgaagttc cacgtcgaaa 18000
cgtggcaacc ttacaagctg aaaatgtaac agggcttttt aaggattgta gtaaggttat 18060
tacaggatta caccctacac aagcaccaac ttaccttagt gttgatacaa aattcaagac 18120
tgaaggtttg tgtgtcgaca taccaggaat accaaaagac atgacctata ggagactcat 18180
ctctatgatg ggtttcaaaa tgaattatca agttaatggt taccctaaca tgttcatcac 18240
ccgcgaagaa gccattaaac atgttcgtgc atgggttggt tttgatgtcg aagggtgtca 18300
tgctacaaga gaagctgttg gtactaattt accattacag ctaggctttt caacaggtgt 18360
caatctagta gcagttccta caggctacgt tgatacatct aatgcaacag agttttctag 18420
ggtgagtgct aaaccaccac ctggtgacca atttaaacat cttataccac ttatgtacaa 18480
aggattacct tggaacattg tgcgtataaa gatagttcag atgttaagtg acacacttaa 18540
aaacctttca gacagagtcg tttttgtcct ttgggcacat ggctttgagc tgacatctat 18600
gaaatacttt gtcaaaatag gacctgaacg cacttgttgc ttatgtgaca aacgtgctac 18660
ctgtttttgc acagcatctg atacttatgc gtgttggcat cactcagttg gatttgacta 18720
tgtctacaac cctttcatga ttgatgttca acaatggggt tttactggta accttcaaag 18780
taaccatgac caatactgtc aagtacacgg taatgcacat gttgctagtt gtgatgctat 18840
catgactaga tgtttagcag tccatgaatg ctttgttaag cgtgttgact ggaccattga 18900
atatcctatt ataggtgatg agctgaagat aaatgcagca tgccgtaaag tacaacatat 18960
ggtagtaaag gctgcattac ttgctgataa gtttccagtt cttcatgata ttggtaatcc 19020
aaaagctata aaatgtgtac ctcaagcaga cacagattgg aagttttatg atgctcaacc 19080
ttgtagtgat aaagcttata aaatagagga attattctat tcctatgcta cccattctga 19140
taaattcaag gatggtgttt gtcttttctg gaactgcaac gttgacagat acccagcaaa 19200
tgcaatagtc tgcagatttg acacaagagt tctgtccaat ctaaacttac caggttgtga 19260
tggtggtagt ttgtatgtaa ataaacatgc tttccacaca ccagcttttg acaagagtgc 19320
ttttgtaaat cttaagcaat taccattctt ttactactca gatagccctt gtgagtctca 19380
tggcaaacaa gtggtgtcag acatagatta tgtaccttta aagtctgcaa cgtgtattac 19440
acgttgtaac ttaggtgggg ctgtttgcag acatcatgcg aatgaataca gattgtattt 19500
agacgcctat aatatgatga tttctgctgg ttttagcctt tggatttaca aacaatttga 19560
tacctacaat ctctggaaca cttttacaag actccagagt ttagaaaatg tggctttcaa 19620
tgttattaat aagggacatt tcgatggaca gcaaggtgaa acacctgttt ctatcgttaa 19680
taacactgtc tacacaaaag tagatggtgt tgatgttgaa ttgtttgaga acaaaacaac 19740
actacctgtt aatgtagcgt ttgagctctg ggctaagcgc aatatcaaac ctgttccaga 19800
agtgaaaata ctcaacaatt tgggtgttga cattgctgct aatacggtga tttgggacta 19860
caaaagagaa gcccctgcac atgtttctac aattggagtt tgtactatga ctgacatagc 19920
aaagaaatct actgaaactg catgttcacc actcactatc ttatttgatg gtagagttga 19980
aggacaagtt gacttattca gaaatgcccg taatggtgtt ttaataactg agggtaatgt 20040
aaaaggatta caaccatcag taggtccaaa acaagctagt cttaatggag tcacattaat 20100
tggtgaagca gtgaaaacac agtttaacta ttataagaag gttgatggtg tagtacaaca 20160
actacctgaa acttacttta ctcagagtag aaatttgcaa gaattcaaac ccaggagtca 20220
aatggaaatt gatttcttag aattagctat ggatgagttc attgaacgat ataaactaga 20280
aggctacgct ttcgaacata tcgtttatgg agattttagt catggtcagt taggtggatt 20340
acatctattg attggacttg ctaagcgttc taaggattca ccactagaat tagaggattt 20400
tattcctatg gacagtacag ttaaaaatta ctttattaca gatgcacaaa cagggtcatc 20460
taagtgtgtg tgttctgtta tagatttatt acttgatgat tttgttgaaa taataaaatc 20520
acaggattta tcagtagttt ctaaagtggt taaagtgact attgactatg cagaaattgc 20580
ttttatgctt tggtgtaaag atggccatgt agagacattt tacccaaaat tacaatctag 20640
tcaagcttgg caacctggtg ttgctatgcc gaacctttac aaaatgcaga gaatgctact 20700
tgataaatgt gatcttcaaa attatggtga agcagcaact ctacctaaag gcataatgat 20760
gaatgttgca aaatatactc aactgtgtca atatttaaat actttgactt tagctgtacc 20820
ttataacatg agagtaatac actttggtgc tggttctgat aaaggagttg cacctggtac 20880
agcagttctt agacagtggt tgcctacggg tacactactt gtcgattctg atcttaatga 20940
cttcgtctct gacgctgatt ctactttaat aggtgactgt gcaaccgtac acactgctaa 21000
taaatgggat ctcattatta gtgatatgta cgatcctaaa accaaacatg taacaagaga 21060
aaatgactct aaagaggggt ttttcactta catctgtgga tttatacaac aaaagttagc 21120
ccttggaggt tctgtggcca taaagataac agagcattct tggaatgctg atctttataa 21180
actcatggga cactttgcat ggtggactgc ttttgttact aatgtaaatg cctcttcttc 21240
agaggcattt ttaattggat gtaattatct tggcaaacca cgtgaacaaa tagatggtta 21300
tgtcatgcat gcaaattaca tattctggag gaatactaat ccaattcaat tatcttccta 21360
ttcattattt gacatgagta aatttcctct taaattaaga gggacagctg tcatgtcctt 21420
aaaagaagga caaatcaatg atatgatatt gtctttactt agtaaaggca gacttattat 21480
tagagaaaac aacaaggttg tggtttctag tgatgtttta gttaataact aaacgaacta 21540
tgtttgtttt tctttttgtc ttgcctttgg tttccagtca atgtgtcaat ttgaccacaa 21600
gaactggaat acagccaggt tataccaatt catctactag aggtgtctat tatccagaca 21660
aagtttttag gtcttcaatt ttacatctta cacaagacct tttcttacct ttcttttcta 21720
atgttacttg gtttaacacc ataaattatc aaggaggctt taagaagttt gacaatcctg 21780
ttttaccatt taatgatggt gtttactttg cctccacgga aaagtccaat attatacgcg 21840
gttggatttt tggaacaaca cttgatgcca gaactcaatc tcttctaata gttaacaacg 21900
caaccaatgt tgttatcaaa gtatgtgagt ttcagttttg cactgatcca tttttaggtg 21960
tttactatca taacaacaat aaaacatggg ttgaaaatga gtttagagtt tattcaagtg 22020
ccaacaattg cactttcgaa tacatttctc aaccttttct tatggacctt gaaggaaagc 22080
aaggtaattt taagaacctt agagagtttg tgtttaaaaa tgttgatggt tatttcaaga 22140
tttactctaa acacacacct attgatttag tgcgcgacct ccccagaggt tttgctgcat 22200
tggaaccact ggtggacctc cctataggta ttaatattac cagattccaa acattgcttg 22260
ctttacatag aagttatctt acacctggta agctagaaag tggctggaca actggagctg 22320
ctgcttacta tgtaggttac ctacaacaga ggacttttct cttaagttac aatcaaaatg 22380
gaaccattac agatgctgtt gattgttcac tagaccctct ttcagagaca aagtgcacat 22440
taaaatccct aacagttgaa aaaggaattt accagacttc taacttcaga gttcaaccaa 22500
caatcagtat agttagattt cctaatatta caaacttatg tccatttgga gaagtgttta 22560
acgcatccaa atttgcatca gtttatgctt ggaacaggaa gagaattagc aattgtgttg 22620
ctgattactc tgtactttat aactctacat cattttccac ttttaaatgt tatggagttt 22680
cacctacaaa actcaatgac ctttgcttca ccaatgtgta tgcagactca tttgttgtta 22740
aaggtgacga ggttagacaa atagcacccg gacaaactgg tgttattgct gattataact 22800
ataagctgcc agatgatttt actggttgtg ttattgcttg gaactcagtt aagcaagatg 22860
ctttgactgg tggtaattat ttgtatagat tatttagaaa gtctaagctt aaaccatttg 22920
agagagatat ttccactgaa atataccaag ccggcagcac accctgtaac ggtcaagttg 22980
gtctaaattg ttattatcct cttgaaaggt atggttttca cccaactaca ggtgttaact 23040
accaaccttt tagagtggtt gttttatcat ttgagttact taatggacca gctactgttt 23100
gtggacccaa attgtctaca acactagtta aagacaaatg tgtcaatttc aactttaacg 23160
gtttaactgg cacaggtgtt cttacaacat ctaagaaaca gtttctgcct tttcaacaat 23220
ttggtagaga catctctgac actactgatg ctgtccgtga cccacagaca cttgaaatac 23280
ttgacattac cccttgctct tttggaggag ttagtgtgat aacaccaggt acaaacactt 23340
ctaatcaagt ggctgtactt taccaagatg ttaactgtac tgaagtgcct atggccattc 23400
atgcagaaca acttacacct gcctggcgtg tttactctgc aggagcaaat gtgtttcaaa 23460
caagagcagg ctgtttagta ggtgctgagc atgtcaacaa ttcttatgaa tgtgacattc 23520
cagtcggtgc tggcatatgt gcaagttacc attccatgtc atcattgcgt agtgtcaacc 23580
agcgttcaat cattgcttac actatgtctt taggtgcaga aaattcagtt gcttattcta 23640
ataattcaat tgccatacct actaatttta caataagtgt taccacagaa attctaccag 23700
tgtcaatgac taagacttct gtagattgta ctatgtacat ctgtggagat tcaattgagt 23760
gtagtaattt attgctacaa tatggcagtt tttgcacaca attaaaccgt gctttgactg 23820
ggattgctgt tgaacaagac aaaaacacac aagaagtttt tgcccaggtt aaacaaatct 23880
acaaaacacc acctattaaa gattttggtg gctttaactt ttcacaaata ttgccagatc 23940
catcaaaacc aagcaagagg tcatttattg aggatttact cttcaacaaa gtgacacttg 24000
ctgatgctgg cttcatcaaa caatatggtg attgccttgg tgatattgct gctagagatc 24060
tcatctgtgc acaaaagttc aatggactca cggttctacc gcctttgctc acagatgaaa 24120
tgattgctca atacacttct gcactacttg ctggaacaat cacctcaggt tggacctttg 24180
gtgcaggagc tgctttacaa ataccctttg caatgcaaat ggcttacagg tttaatggca 24240
ttggagtcac tcagaatgtt ctatatgaga atcagaaatt aattgccaat cagttcaaca 24300
gtgctattgg caaaatacag gattcacttt catctacggc tagtgcactt ggtaaacttc 24360
aagacgtcgt aaatcaaaat gcacaggctt taaacacact tgtcaaacaa cttagttcca 24420
attttggagc tatttcgagt gtgcttaatg atattctttc acgtcttgac aaagttgagg 24480
ctgaagtgca aattgatagg ttaatcacag gaagactaca gagtcttcaa acttatgtga 24540
cacaacaatt aatcagagca gcagaaatca gagcttctgc taatcttgct gcaacaaaaa 24600
tgtctgagtg cgtactcgga caatctaaaa gagttgattt ttgtggaaaa ggctaccatt 24660
taatgtcttt ccctcaatca gcaccgcatg gtgttgtttt cttgcatgtt acttatgtac 24720
ctgcacaaga aaagaacttt actactgctc ctgctatttg tcatgaagga aaagcacact 24780
tccctcgtga aggtgtcttc gtttcaaatg gcactcattg gtttattaca caaaggaatt 24840
tttatgaacc tcaaattatt accactgaca acacattcgt ctctggtagc tgtgatgttg 24900
taattggaat agtcaacaac acagtttatg atcctttgca acccgagctt gactcattta 24960
aggaggagtt agacaaatac ttcaaaaatc acacatcacc agatgttgat cttggcgaca 25020
tatctggcat aaatgcttcg gtcgtcaaca tacaaaaaga aattgaccgc ctcaatgagg 25080
ttgccaaaaa tttgaatgaa tcactcattg acctacaaga gcttggaaaa tatgagcaat 25140
acatcaaatg gccttggtac atttggcttg gttttatagc tgggctaatt gctatcatta 25200
tggtcacaat catgctatgt tgtatgacta gttgctgtag ttgcctcaag ggttgttgct 25260
cttgcggttc ctgctgcaaa tttgatgaag acgattcaga acctgttctg aaaggagtca 25320
aattacatta cacataaacg aacttaatgg atttgtttat gagaattttt aatcttggat 25380
ctgtaacatt caaaccagga aaaattgaag atgctactcc ttcagattct attcgcgcta 25440
ctgcaacgat accgatacaa gcctcactcc ctttcggatg gcttattgtt ggcgttgcac 25500
ttcttgctgt ttttcagagc gcttccaaaa taattacact caaaaagagg tggcaatttg 25560
ctctctccaa gggtgttcat tttgcttgca acttgcttct actatttgtt acagtctact 25620
ctcacctttt gcttgttgct gctggccttg aagcccaatt tctctatctt tacgctttag 25680
tttattttct gcaaagtgtt aatgcttgca gaattattat gaggctttgg ctgtgctgga 25740
agtgcagatc caaaaatcca ttactttatg atgccaatta ctttctttgc tggcatacta 25800
attgctatga ctattgtata ccatataata gcataacttc ttcaattgtc attacatcag 25860
gtgatggcac tccaagtcct attacagacc atgactacca aattggtggt tatacggaaa 25920
agtgggaatc tggtgttaaa gactgtgtta cattacatgg ttactttaca tcagaatgct 25980
accagctgta ctctacacaa cttagtacag atactggtgt tgaacatact accttcttca 26040
tttacagtag aattgtggat gaaccagaag accatgttca aattcacaca atcgacggct 26100
catcaggagt tgtaaatcca gcaatggatc ctatctatga tgagccgacg acgactacta 26160
gcgtgccttt gtaagcacaa gctgatgagt acgaacttat gtactcattc gtttcggaag 26220
agacaggtac gttaatagtt aatagcgtac ttctttttct tgctttcgtg gtattcttgc 26280
tagtcacact agccatcctt actgcgcttc gattgtgtgc gtactgctgc aatattgtta 26340
acgtgagttt agttaaacct tctttttacg tctactcacg tgttaaaaat ctgaattctt 26400
ctagagttcc tgatcttctg gtctaaacga actaaatatt ttagtttttc tgtttggaac 26460
tttaatttta gccatgtcag ctaacaacgg tactattacc gttgaagagc ttaaaaagct 26520
cttagaacaa tggaacctag taataggttt cctatttcta acatggattt gtcttttaca 26580
gttcgcctat gctaacagga ataggtttct gtacataatt aagttaattt tcctctggct 26640
actttggcca gtaactttag cttgctttgt gcttgctgct gtttacagaa tcaattggat 26700
taccggtgga atcgcgattg caatgacttg tcttgtgggc ttgatgtggc ttagctactt 26760
cattgcttca ttcaggcttt ttgcgcgtac gcgttccatg tggtccttca atccagaaac 26820
aaacatactg ctgaatgtgc cattgcatgg tacaattttg accagaccac tcctagaaag 26880
tgaacttgtc atcggtgctg tgatcctcag aggacacctt cgcattgctg gacatcatct 26940
aggacgctgt gacatcaagg acctgccaaa agaaatcact gtagctacat cacgaacgct 27000
ttcttattac aaattgggag cttcgcagcg tgtagccggt gactcaggtt ttgctgcata 27060
cagtcgctat cggattggca actacaaact aaacacagac cattccaata gcagtgacaa 27120
tattgctttg cttgtacagt aagtgacaac agatgtttca tctcgttgac tttcaggtta 27180
ctatagcaga gatacttatt attattatga gaactttcaa gatttccatt tggaaccttg 27240
attacatcat taatctcata attaaaaatt tatctaagcc tttaactgaa aataaatatt 27300
ctcagttaga cgaagagcaa ccaatggaga ttgattaaac gaacatgaaa attattcttc 27360
tcttggcatt agttactttt gctacatgcg aacgttacca ctaccaagag tgtgttagag 27420
gtacaactgt actaataaag gaaccttgct cttctggaac ttacgagggc aattcaccat 27480
ttcatcctct tgctgataat aaatttgcac ttgcttgcac aagccaacaa tttgcttttg 27540
cttgccctga cggtactaaa catacctttc agttacgtgc gagatcagtt tcaccaaaac 27600
ttttcatcag acaagaggaa gttcaagaac tttactcacc acttttcctc ataattgctg 27660
cattagtgtt tataacactt tgcttcacac ttaagagaaa gacagaatga gtgaaattac 27720
actaattgac ttctatttgt gctttttagc ctttctgcta ttccttgttt taattatgct 27780
catgatattt tggttttgct ttgacactcc aagatgatga tgagtgttgc caagtctaaa 27840
cgaacatgaa atttcttgtt ttacttggaa tactaacaac agtacacaca ttccatcagg 27900
aatgtagttt acagtcatgt caattcaatt caccttatgt agttgatgat ccatgcccta 27960
tacatttcta ctcgaaatgg tatattaggg tcggtgctag aaaatctgca ccattgattg 28020
aactctgtgt tgatgaagta ggttcaaaaa cacctattaa atacatcgac attggcaact 28080
acactgtttc ttgttcaccg tttactataa actgtcaaga acctaaatta ggtagtctcg 28140
tagttcgttg ttcgttctat gaagactttg ttgattacca tgacattcgt gttgttttag 28200
atttcatcta aacgaacaaa caaaatgtct gataatggac cccaaaatcg tgcaccccgc 28260
attacatttg gtggaccctc agattcgact gacaataacc agaatggaga ccgcagtgga 28320
gcaaggccaa aacaacgaag gccccaggga ttacccaata atactgcgtc ttggttcacc 28380
gctctcactc aacatggtaa ggaagacctt agattccctc gaggacaagg tgttccgatt 28440
aacaccaata gtaccaaaga tgaccaaatt ggctactacc gaagagctac cagacgagtt 28500
cgtggtggtg acggtaaaat gaaagatctc agtccacgat ggtacttcta ttaccttgga 28560
actgggccag aagctggact tccctatggt gctaacaaag aaggcatcat atgggttgca 28620
actgagggag ccttgaatac accaaaagat cacattggca cccgcaatcc aaacaacaat 28680
gctgcaatcg tgctacaact tcctcaagga acagctttgc ctaaaggttt ctacgcagaa 28740
gggagcagag gcggcagtca agcttcttca cgctcttcat cacgtagtcg caatagttcc 28800
agaaactcaa ctccaggcag tagtagggga acttctcctg ctcgaattgc tggcaatggt 28860
ggtgatgctg cccttgcttt gctactgctt gatcggttga atgcacttga gagcaaaatg 28920
tctggtaaag gctcacaaca acagagccaa acagtcacta agaaatctgc tgctgaggct 28980
tccaagaaac ctcgccaaaa acgtactgcc actaaacaat acaatgtcac tcaggcattt 29040
ggcagacgtg gtcctgaaca aacccaagga aattttgggg accaagaatt aatcagacaa 29100
ggaactgagt acaaacattg gccgcaaatt gcacaatttg cacctagcgc ttctgcattc 29160
ttcggaatgt cgcgcattgg catggaagtc acaccttcgg gaacatggct gacttacaca 29220
ggtgccatca agcttgatga caaagatcca agcttcaaag acaacgtcat actgctgaac 29280
aagcacattg acgcatacaa aacattccca ccaacagagc ctaaaaagga caaaaagaaa 29340
aagactgacg aaagccagcc tttaccgcag agacagaaga aacaacaaac tgtgactctt 29400
cttcctgctg cagatttgga tgatttctcc aaacaattgc aacaatccat gagcagtgct 29460
gattcaactc aggcttaaac tcatgcagac cacacaaggc agatgggcta tgtaaacgtt 29520
ttcgcttttc cgtttacgat acatagtcta ctcttgtgca gaatgaattc tcgtagctat 29580
acagcacaag taggtatagt taactttaat ctcacatagc aatctttaat cagtgtgtaa 29640
cattagggag gacttgaaag agccaccaca ttttcaccga ggccacgcgg agtacgatcg 29700
agggtacagt gaataatgct agggagagct gcctatatgg aagagcccta atgtgtaaat 29760
taattttagt agtgctatcc ccatgtgatt ttaatagctt cttaggagaa tgacaaaaaa 29820
aaaaaaaaaa aaaaaaa 29837
<210> 2
<211> 3723
<212> DNA
<213> xCoV
<400> 2
ccaggttata ccaattcatc tactagaggt gtctattatc cagacaaagt ttttaggtct 60
tcaattttac atcttacaca agaccttttc ttacctttct tttctaatgt tacttggttt 120
aacaccataa attatcaagg aggctttaag aagtttgaca atcctgtttt accatttaat 180
gatggtgttt actttgcctc cacggaaaag tccaatatta tacgcggttg gatttttgga 240
acaacacttg atgccagaac tcaatctctt ctaatagtta acaacgcaac caatgttgtt 300
atcaaagtat gtgagtttca gttttgcact gatccatttt taggtgttta ctatcataac 360
aacaataaaa catgggttga aaatgagttt agagtttatt caagtgccaa caattgcact 420
ttcgaataca tttctcaacc ttttcttatg gaccttgaag gaaagcaagg taattttaag 480
aaccttagag agtttgtgtt taaaaatgtt gatggttatt tcaagattta ctctaaacac 540
acacctattg atttagtgcg cgacctcccc agaggttttg ctgcattgga accactggtg 600
gacctcccta taggtattaa tattaccaga ttccaaacat tgcttgcttt acatagaagt 660
tatcttacac ctggtaagct agaaagtggc tggacaactg gagctgctgc ttactatgta 720
ggttacctac aacagaggac ttttctctta agttacaatc aaaatggaac cattacagat 780
gctgttgatt gttcactaga ccctctttca gagacaaagt gcacattaaa atccctaaca 840
gttgaaaaag gaatttacca gacttctaac ttcagagttc aaccaacaat cagtatagtt 900
agatttccta atattacaaa cttatgtcca tttggagaag tgtttaacgc atccaaattt 960
gcatcagttt atgcttggaa caggaagaga attagcaatt gtgttgctga ttactctgta 1020
ctttataact ctacatcatt ttccactttt aaatgttatg gagtttcacc tacaaaactc 1080
aatgaccttt gcttcaccaa tgtgtatgca gactcatttg ttgttaaagg tgacgaggtt 1140
agacaaatag cacccggaca aactggtgtt attgctgatt ataactataa gctgccagat 1200
gattttactg gttgtgttat tgcttggaac tcagttaagc aagatgcttt gactggtggt 1260
aattatttgt atagattatt tagaaagtct aagcttaaac catttgagag agatatttcc 1320
actgaaatat accaagccgg cagcacaccc tgtaacggtc aagttggtct aaattgttat 1380
tatcctcttg aaaggtatgg ttttcaccca actacaggtg ttaactacca accttttaga 1440
gtggttgttt tatcatttga gttacttaat ggaccagcta ctgtttgtgg acccaaattg 1500
tctacaacac tagttaaaga caaatgtgtc aatttcaact ttaacggttt aactggcaca 1560
ggtgttctta caacatctaa gaaacagttt ctgccttttc aacaatttgg tagagacatc 1620
tctgacacta ctgatgctgt ccgtgaccca cagacacttg aaatacttga cattacccct 1680
tgctcttttg gaggagttag tgtgataaca ccaggtacaa acacttctaa tcaagtggct 1740
gtactttacc aagatgttaa ctgtactgaa gtgcctatgg ccattcatgc agaacaactt 1800
acacctgcct ggcgtgttta ctctgcagga gcaaatgtgt ttcaaacaag agcaggctgt 1860
ttagtaggtg ctgagcatgt caacaattct tatgaatgtg acattccagt cggtgctggc 1920
atatgtgcaa gttaccattc catgtcatca ttgcgtagtg tcaaccagcg ttcaatcatt 1980
gcttacacta tgtctttagg tgcagaaaat tcagttgctt attctaataa ttcaattgcc 2040
atacctacta attttacaat aagtgttacc acagaaattc taccagtgtc aatgactaag 2100
acttctgtag attgtactat gtacatctgt ggagattcaa ttgagtgtag taatttattg 2160
ctacaatatg gcagtttttg cacacaatta aaccgtgctt tgactgggat tgctgttgaa 2220
caagacaaaa acacacaaga agtttttgcc caggttaaac aaatctacaa aacaccacct 2280
attaaagatt ttggtggctt taacttttca caaatattgc cagatccatc aaaaccaagc 2340
aagaggtcat ttattgagga tttactcttc aacaaagtga cacttgctga tgctggcttc 2400
atcaaacaat atggtgattg ccttggtgat attgctgcta gagatctcat ctgtgcacaa 2460
aagttcaatg gactcacggt tctaccgcct ttgctcacag atgaaatgat tgctcaatac 2520
acttctgcac tacttgctgg aacaatcacc tcaggttgga cctttggtgc aggagctgct 2580
ttacaaatac cctttgcaat gcaaatggct tacaggttta atggcattgg agtcactcag 2640
aatgttctat atgagaatca gaaattaatt gccaatcagt tcaacagtgc tattggcaaa 2700
atacaggatt cactttcatc tacggctagt gcacttggta aacttcaaga cgtcgtaaat 2760
caaaatgcac aggctttaaa cacacttgtc aaacaactta gttccaattt tggagctatt 2820
tcgagtgtgc ttaatgatat tctttcacgt cttgacaaag ttgaggctga agtgcaaatt 2880
gataggttaa tcacaggaag actacagagt cttcaaactt atgtgacaca acaattaatc 2940
agagcagcag aaatcagagc ttctgctaat cttgctgcaa caaaaatgtc tgagtgcgta 3000
ctcggacaat ctaaaagagt tgatttttgt ggaaaaggct accatttaat gtctttccct 3060
caatcagcac cgcatggtgt tgttttcttg catgttactt atgtacctgc acaagaaaag 3120
aactttacta ctgctcctgc tatttgtcat gaaggaaaag cacacttccc tcgtgaaggt 3180
gtcttcgttt caaatggcac tcattggttt attacacaaa ggaattttta tgaacctcaa 3240
attattacca ctgacaacac attcgtctct ggtagctgtg atgttgtaat tggaatagtc 3300
aacaacacag tttatgatcc tttgcaaccc gagcttgact catttaagga ggagttagac 3360
aaatacttca aaaatcacac atcaccagat gttgatcttg gcgacatatc tggcataaat 3420
gcttcggtcg tcaacataca aaaagaaatt gaccgcctca atgaggttgc caaaaatttg 3480
aatgaatcac tcattgacct acaagagctt ggaaaatatg agcaatacat caaatggcct 3540
tggtacattt ggcttggttt tatagctggg ctaattgcta tcattatggt cacaatcatg 3600
ctatgttgta tgactagttg ctgtagttgc ctcaagggtt gttgctcttg cggttcctgc 3660
tgcaaatttg atgaagacga ttcagaacct gttctgaaag gagtcaaatt acattacaca 3720
taa 3723
<210> 3
<211> 1240
<212> PRT
<213> xCoV
<400> 3
Pro Gly Tyr Thr Asn Ser Ser Thr Arg Gly Val Tyr Tyr Pro Asp Lys
1 5 10 15
Val Phe Arg Ser Ser Ile Leu His Leu Thr Gln Asp Leu Phe Leu Pro
20 25 30
Phe Phe Ser Asn Val Thr Trp Phe Asn Thr Ile Asn Tyr Gln Gly Gly
35 40 45
Phe Lys Lys Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr
50 55 60
Phe Ala Ser Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly
65 70 75 80
Thr Thr Leu Asp Ala Arg Thr Gln Ser Leu Leu Ile Val Asn Asn Ala
85 90 95
Thr Asn Val Val Ile Lys Val Cys Glu Phe Gln Phe Cys Thr Asp Pro
100 105 110
Phe Leu Gly Val Tyr Tyr His Asn Asn Asn Lys Thr Trp Val Glu Asn
115 120 125
Glu Phe Arg Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Ile
130 135 140
Ser Gln Pro Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys
145 150 155 160
Asn Leu Arg Glu Phe Val Phe Lys Asn Val Asp Gly Tyr Phe Lys Ile
165 170 175
Tyr Ser Lys His Thr Pro Ile Asp Leu Val Arg Asp Leu Pro Arg Gly
180 185 190
Phe Ala Ala Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile
195 200 205
Thr Arg Phe Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro
210 215 220
Gly Lys Leu Glu Ser Gly Trp Thr Thr Gly Ala Ala Ala Tyr Tyr Val
225 230 235 240
Gly Tyr Leu Gln Gln Arg Thr Phe Leu Leu Ser Tyr Asn Gln Asn Gly
245 250 255
Thr Ile Thr Asp Ala Val Asp Cys Ser Leu Asp Pro Leu Ser Glu Thr
260 265 270
Lys Cys Thr Leu Lys Ser Leu Thr Val Glu Lys Gly Ile Tyr Gln Thr
275 280 285
Ser Asn Phe Arg Val Gln Pro Thr Ile Ser Ile Val Arg Phe Pro Asn
290 295 300
Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Ser Lys Phe
305 310 315 320
Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala
325 330 335
Asp Tyr Ser Val Leu Tyr Asn Ser Thr Ser Phe Ser Thr Phe Lys Cys
340 345 350
Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val
355 360 365
Tyr Ala Asp Ser Phe Val Val Lys Gly Asp Glu Val Arg Gln Ile Ala
370 375 380
Pro Gly Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp
385 390 395 400
Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Val Lys Gln Asp Ala
405 410 415
Leu Thr Gly Gly Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Lys Leu
420 425 430
Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser
435 440 445
Thr Pro Cys Asn Gly Gln Val Gly Leu Asn Cys Tyr Tyr Pro Leu Glu
450 455 460
Arg Tyr Gly Phe His Pro Thr Thr Gly Val Asn Tyr Gln Pro Phe Arg
465 470 475 480
Val Val Val Leu Ser Phe Glu Leu Leu Asn Gly Pro Ala Thr Val Cys
485 490 495
Gly Pro Lys Leu Ser Thr Thr Leu Val Lys Asp Lys Cys Val Asn Phe
500 505 510
Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Thr Ser Lys Lys
515 520 525
Gln Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ser Asp Thr Thr
530 535 540
Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro
545 550 555 560
Cys Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser
565 570 575
Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro
580 585 590
Met Ala Ile His Ala Glu Gln Leu Thr Pro Ala Trp Arg Val Tyr Ser
595 600 605
Ala Gly Ala Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Val Gly Ala
610 615 620
Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Val Gly Ala Gly
625 630 635 640
Ile Cys Ala Ser Tyr His Ser Met Ser Ser Leu Arg Ser Val Asn Gln
645 650 655
Arg Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser Val
660 665 670
Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile Ser
675 680 685
Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val Asp
690 695 700
Cys Thr Met Tyr Ile Cys Gly Asp Ser Ile Glu Cys Ser Asn Leu Leu
705 710 715 720
Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr Gly
725 730 735
Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln Val
740 745 750
Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe Asn
755 760 765
Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser Phe
770 775 780
Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly Phe
785 790 795 800
Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp Leu
805 810 815
Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu Leu
820 825 830
Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly Thr
835 840 845
Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile Pro
850 855 860
Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr Gln
865 870 875 880
Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn Ser
885 890 895
Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala Leu
900 905 910
Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn Thr
915 920 925
Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val Leu
930 935 940
Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln Ile
945 950 955 960
Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val Thr
965 970 975
Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu Ala
980 985 990
Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val Asp
995 1000 1005
Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser Ala Pro
1010 1015 1020
His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln Glu Lys
1025 1030 1035 1040
Asn Phe Thr Thr Ala Pro Ala Ile Cys His Glu Gly Lys Ala His Phe
1045 1050 1055
Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr His Trp Phe Ile Thr
1060 1065 1070
Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr Phe
1075 1080 1085
Val Ser Gly Ser Cys Asp Val Val Ile Gly Ile Val Asn Asn Thr Val
1090 1095 1100
Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp
1105 1110 1115 1120
Lys Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile
1125 1130 1135
Ser Gly Ile Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg
1140 1145 1150
Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln
1155 1160 1165
Glu Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile Trp
1170 1175 1180
Leu Gly Phe Ile Ala Gly Leu Ile Ala Ile Ile Met Val Thr Ile Met
1185 1190 1195 1200
Leu Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys Ser
1205 1210 1215
Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro Val Leu
1220 1225 1230
Lys Gly Val Lys Leu His Tyr Thr
1235 1240
<210> 4
<211> 3822
<212> RNA
<213> SARS-CoV-2
<400> 4
auguuuguuu uucuuguuuu auugccacua gucucuaguc aguguguuaa ucuuacaacc 60
agaacucaau uacccccugc auacacuaau ucuuucacac gugguguuua uuacccugac 120
aaaguuuuca gauccucagu uuuacauuca acucaggacu uguucuuacc uuucuuuucc 180
aauguuacuu gguuccaugc uauacauguc ucugggacca augguacuaa gagguuugau 240
aacccugucc uaccauuuaa ugaugguguu uauuuugcuu ccacugagaa gucuaacaua 300
auaagaggcu ggauuuuugg uacuacuuua gauucgaaga cccagucccu acuuauuguu 360
aauaacgcua cuaauguugu uauuaaaguc ugugaauuuc aauuuuguaa ugauccauuu 420
uuggguguuu auuaccacaa aaacaacaaa aguuggaugg aaagugaguu cagaguuuau 480
ucuagugcga auaauugcac uuuugaauau gucucucagc cuuuucuuau ggaccuugaa 540
ggaaaacagg guaauuucaa aaaucuuagg gaauuugugu uuaagaauau ugaugguuau 600
uuuaaaauau auucuaagca cacgccuauu aauuuagugc gugaucuccc ucaggguuuu 660
ucggcuuuag aaccauuggu agauuugcca auagguauua acaucacuag guuucaaacu 720
uuacuugcuu uacauagaag uuauuugacu ccuggugauu cuucuucagg uuggacagcu 780
ggugcugcag cuuauuaugu ggguuaucuu caaccuagga cuuuucuauu aaaauauaau 840
gaaaauggaa ccauuacaga ugcuguagac ugugcacuug acccucucuc agaaacaaag 900
uguacguuga aauccuucac uguagaaaaa ggaaucuauc aaacuucuaa cuuuagaguc 960
caaccaacag aaucuauugu uagauuuccu aauauuacaa acuugugccc uuuuggugaa 1020
guuuuuaacg ccaccagauu ugcaucuguu uaugcuugga acaggaagag aaucagcaac 1080
uguguugcug auuauucugu ccuauauaau uccgcaucau uuuccacuuu uaaguguuau 1140
ggagugucuc cuacuaaauu aaaugaucuc ugcuuuacua augucuaugc agauucauuu 1200
guaauuagag gugaugaagu cagacaaauc gcuccagggc aaacuggaaa gauugcugau 1260
uauaauuaua aauuaccaga ugauuuuaca ggcugcguua uagcuuggaa uucuaacaau 1320
cuugauucua agguuggugg uaauuauaau uaccuguaua gauuguuuag gaagucuaau 1380
cucaaaccuu uugagagaga uauuucaacu gaaaucuauc aggccgguag cacaccuugu 1440
aaugguguug aagguuuuaa uuguuacuuu ccuuuacaau cauaugguuu ccaacccacu 1500
aaugguguug guuaccaacc auacagagua guaguacuuu cuuuugaacu ucuacaugca 1560
ccagcaacug uuuguggacc uaaaaagucu acuaauuugg uuaaaaacaa augugucaau 1620
uucaacuuca augguuuaac aggcacaggu guucuuacug agucuaacaa aaaguuucug 1680
ccuuuccaac aauuuggcag agacauugcu gacacuacug augcuguccg ugauccacag 1740
acacuugaga uucuugacau uacaccaugu ucuuuuggug gugucagugu uauaacacca 1800
ggaacaaaua cuucuaacca gguugcuguu cuuuaucagg auguuaacug cacagaaguc 1860
ccuguugcua uucaugcaga ucaacuuacu ccuacuuggc guguuuauuc uacagguucu 1920
aauguuuuuc aaacacgugc aggcuguuua auaggggcug aacaugucaa caacucauau 1980
gagugugaca uacccauugg ugcagguaua ugcgcuaguu aucagacuca gacuaauucu 2040
ccucggcggg cacguagugu agcuagucaa uccaucauug ccuacacuau gucacuuggu 2100
gcagaaaauu caguugcuua cucuaauaac ucuauugcca uacccacaaa uuuuacuauu 2160
aguguuacca cagaaauucu accagugucu augaccaaga caucaguaga uuguacaaug 2220
uacauuugug gugauucaac ugaaugcagc aaucuuuugu ugcaauaugg caguuuuugu 2280
acacaauuaa accgugcuuu aacuggaaua gcuguugaac aagacaaaaa cacccaagaa 2340
guuuuugcac aagucaaaca aauuuacaaa acaccaccaa uuaaagauuu uggugguuuu 2400
aauuuuucac aaauauuacc agauccauca aaaccaagca agaggucauu uauugaagau 2460
cuacuuuuca acaaagugac acuugcagau gcuggcuuca ucaaacaaua uggugauugc 2520
cuuggugaua uugcugcuag agaccucauu ugugcacaaa aguuuaacgg ccuuacuguu 2580
uugccaccuu ugcucacaga ugaaaugauu gcucaauaca cuucugcacu guuagcgggu 2640
acaaucacuu cugguuggac cuuuggugca ggugcugcau uacaaauacc auuugcuaug 2700
caaauggcuu auagguuuaa ugguauugga guuacacaga auguucucua ugagaaccaa 2760
aaauugauug ccaaccaauu uaauagugcu auuggcaaaa uucaagacuc acuuucuucc 2820
acagcaagug cacuuggaaa acuucaagau guggucaacc aaaaugcaca agcuuuaaac 2880
acgcuuguua aacaacuuag cuccaauuuu ggugcaauuu caaguguuuu aaaugauauc 2940
cuuucacguc uugacaaagu ugaggcugaa gugcaaauug auagguugau cacaggcaga 3000
cuucaaaguu ugcagacaua ugugacucaa caauuaauua gagcugcaga aaucagagcu 3060
ucugcuaauc uugcugcuac uaaaauguca gaguguguac uuggacaauc aaaaagaguu 3120
gauuuuugug gaaagggcua ucaucuuaug uccuucccuc agucagcacc ucauggugua 3180
gucuucuugc augugacuua ugucccugca caagaaaaga acuucacaac ugcuccugcc 3240
auuugucaug auggaaaagc acacuuuccu cgugaaggug ucuuuguuuc aaauggcaca 3300
cacugguuug uaacacaaag gaauuuuuau gaaccacaaa ucauuacuac agacaacaca 3360
uuugugucug guaacuguga uguuguaaua ggaauuguca acaacacagu uuaugauccu 3420
uugcaaccug aauuagacuc auucaaggag gaguuagaua aauauuuuaa gaaucauaca 3480
ucaccagaug uugauuuagg ugacaucucu ggcauuaaug cuucaguugu aaacauucaa 3540
aaagaaauug accgccucaa ugagguugcc aagaauuuaa augaaucucu caucgaucuc 3600
caagaacuug gaaaguauga gcaguauaua aaauggccau gguacauuug gcuagguuuu 3660
auagcuggcu ugauugccau aguaauggug acaauuaugc uuugcuguau gaccaguugc 3720
uguaguuguc ucaagggcug uuguucuugu ggauccugcu gcaaauuuga ugaagacgac 3780
ucugagccag ugcucaaagg agucaaauua cauuacacau aa 3822
<210> 5
<211> 1273
<212> PRT
<213> SARS-CoV-2
<220>
<221> misc_feature
<222> (13)..(13)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (151)..(151)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (155)..(155)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (162)..(162)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (247)..(247)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (596)..(596)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (673)..(673)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (686)..(686)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (689)..(689)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (721)..(721)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (758)..(758)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (929)..(929)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (943)..(943)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (975)..(975)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (1003)..(1003)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (1239)..(1239)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (1242)..(1242)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> UNSURE
<222> (13)..(13)
<223> The 'Xaa' at location 13 stands for Gln, Arg, Pro, or Leu.
<220>
<221> UNSURE
<222> (151)..(151)
<223> The 'Xaa' at location 151 stands for Gln, Arg, Pro, or Leu.
<220>
<221> UNSURE
<222> (155)..(155)
<223> The 'Xaa' at location 155 stands for Gln, Arg, Pro, or Leu.
<220>
<221> UNSURE
<222> (162)..(162)
<223> The 'Xaa' at location 162 stands for Gln, Arg, Pro, or Leu.
<220>
<221> UNSURE
<222> (247)..(247)
<223> The 'Xaa' at location 247 stands for Gln, Arg, Pro, or Leu.
<220>
<221> UNSURE
<222> (596)..(596)
<223> The 'Xaa' at location 596 stands for Gln, Arg, Pro, or Leu.
<220>
<221> UNSURE
<222> (673)..(673)
<223> The 'Xaa' at location 673 stands for Gln, Arg, Pro, or Leu.
<220>
<221> UNSURE
<222> (686)..(686)
<223> The 'Xaa' at location 686 stands for Gln, Arg, Pro, or Leu.
<220>
<221> UNSURE
<222> (689)..(689)
<223> The 'Xaa' at location 689 stands for Gln, Arg, Pro, or Leu.
<220>
<221> UNSURE
<222> (721)..(721)
<223> The 'Xaa' at location 721 stands for Gln, Arg, Pro, or Leu.
<220>
<221> UNSURE
<222> (758)..(758)
<223> The 'Xaa' at location 758 stands for Gln, Arg, Pro, or Leu.
<220>
<221> UNSURE
<222> (929)..(929)
<223> The 'Xaa' at location 929 stands for Gln, Arg, Pro, or Leu.
<220>
<221> UNSURE
<222> (943)..(943)
<223> The 'Xaa' at location 943 stands for Gln, Arg, Pro, or Leu.
<220>
<221> UNSURE
<222> (975)..(975)
<223> The 'Xaa' at location 975 stands for Gln, Arg, Pro, or Leu.
<220>
<221> UNSURE
<222> (1003)..(1003)
<223> The 'Xaa' at location 1003 stands for Gln, Arg, Pro, or Leu.
<220>
<221> UNSURE
<222> (1239)..(1239)
<223> The 'Xaa' at location 1239 stands for Gln, Arg, Pro, or Leu.
<220>
<221> UNSURE
<222> (1242)..(1242)
<223> The 'Xaa' at location 1242 stands for Gln, Arg, Pro, or Leu.
<400> 5
Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Xaa Gln Cys Val
1 5 10 15
Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
20 25 30
Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
35 40 45
His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
50 55 60
Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
65 70 75 80
Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
85 90 95
Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
100 105 110
Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
115 120 125
Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
130 135 140
Tyr His Lys Asn Asn Lys Xaa Trp Met Glu Xaa Glu Phe Arg Val Tyr
145 150 155 160
Ser Xaa Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
165 170 175
Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
180 185 190
Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
195 200 205
Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
210 215 220
Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
225 230 235 240
Leu Leu Ala Leu His Arg Xaa Tyr Leu Thr Pro Gly Asp Ser Ser Ser
245 250 255
Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
260 265 270
Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
275 280 285
Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
290 295 300
Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val
305 310 315 320
Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
325 330 335
Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
340 345 350
Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
355 360 365
Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
370 375 380
Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
385 390 395 400
Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
405 410 415
Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
420 425 430
Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
435 440 445
Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
450 455 460
Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
465 470 475 480
Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
485 490 495
Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
500 505 510
Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
515 520 525
Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn
530 535 540
Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu
545 550 555 560
Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val
565 570 575
Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
580 585 590
Gly Gly Val Xaa Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val
595 600 605
Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile
610 615 620
His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser
625 630 635 640
Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val
645 650 655
Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala
660 665 670
Xaa Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Xaa Val Ala
675 680 685
Xaa Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser
690 695 700
Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile
705 710 715 720
Xaa Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val
725 730 735
Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu
740 745 750
Leu Leu Gln Tyr Gly Xaa Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr
755 760 765
Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln
770 775 780
Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe
785 790 795 800
Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser
805 810 815
Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly
820 825 830
Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp
835 840 845
Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu
850 855 860
Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly
865 870 875 880
Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile
885 890 895
Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr
900 905 910
Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn
915 920 925
Xaa Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Xaa Ala
930 935 940
Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn
945 950 955 960
Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Xaa Val
965 970 975
Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln
980 985 990
Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Xaa Leu Gln Thr Tyr Val
995 1000 1005
Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu
1010 1015 1020
Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val
1025 1030 1035 1040
Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser Ala
1045 1050 1055
Pro His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln Glu
1060 1065 1070
Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala His
1075 1080 1085
Phe Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr His Trp Phe Val
1090 1095 1100
Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr
1105 1110 1115 1120
Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile Val Asn Asn Thr
1125 1130 1135
Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu
1140 1145 1150
Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp
1155 1160 1165
Ile Ser Gly Ile Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp
1170 1175 1180
Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu
1185 1190 1195 1200
Gln Glu Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile
1205 1210 1215
Trp Leu Gly Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr Ile
1220 1225 1230
Met Leu Cys Cys Met Thr Xaa Cys Cys Xaa Cys Leu Lys Gly Cys Cys
1235 1240 1245
Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro Val
1250 1255 1260
Leu Lys Gly Val Lys Leu His Tyr Thr
1265 1270
<210> 6
<211> 22
<212> DNA
<213> Artificial
<400> 6
ggtgattgcc ttggtgatat tg 22
<210> 7
<211> 23
<212> DNA
<213> Artificial
<400> 7
gcaagtagtg cagaagtgta ttg 23
<210> 8
<211> 24
<212> DNA
<213> Artificial
<400> 8
tctgtgagca aaggcggtag aacc 24
<210> 9
<211> 24
<212> DNA
<213> Artificial
<400> 9
agtgtgacgt ggacatccgc aaag 24
<210> 10
<211> 24
<212> DNA
<213> Artificial
<400> 10
atccacatct gctggaaggt ggac 24
<210> 11
<211> 21
<212> DNA
<213> Artificial
<400> 11
cgaagccgaa gacctgttct a 21
<210> 12
<211> 20
<212> DNA
<213> Artificial
<400> 12
gggcaagtgt ggactgttcc 20
<210> 13
<211> 21
<212> DNA
<213> Artificial
<400> 13
gaggagacua ugaaguaaat t 21
<210> 14
<211> 21
<212> DNA
<213> Artificial
<400> 14
uuuacuucau agucuccuct t 21
<210> 15
<211> 21
<212> DNA
<213> Artificial
<400> 15
cugggaugca cagagaauat t 21
<210> 16
<211> 21
<212> DNA
<213> Artificial
<400> 16
uauucucugu gcaucccagt t 21
<210> 17
<211> 23
<212> DNA
<213> Artificial
<400> 17
ccacaccuca acuacauggu utt 23
<210> 18
<211> 21
<212> DNA
<213> Artificial
<400> 18
uuuaaaugcu uagguguggt t 21
<210> 19
<211> 21
<212> DNA
<213> Artificial
<400> 19
uucuccgaac gugucacgut t 21
<210> 20
<211> 21
<212> DNA
<213> Artificial
<400> 20
acgugacacg uucggagaat t 21

Claims (10)

1.一种穿山甲冠状病毒xCoV,其保藏于中国微生物菌种保藏管理委员会普通微生物中心,保藏编号为CGMCC No.19295。
2.根据权利要求1的穿山甲冠状病毒xCoV,其全基因组核苷酸序列如序列表中SEQ IDNO:1所示。
3.根据权利要求1的穿山甲冠状病毒xCoV,其S基因的核酸序列如序列表中SEQ ID NO:2所示。
4.根据权利要求1的穿山甲冠状病毒xCoV,其S蛋白的氨基酸序列如序列表中SEQ IDNO:3所示。
5.根据权利要求1的穿山甲冠状病毒xCoV,其与SARS-COV-2的S蛋白同源性达92.5%。
6.权利要求1-5中任一项的穿山甲冠状病毒xCoV的应用,其用于抗SARS-COV-2病毒的活性药物的筛选与评价,抗SARS-COV-2病毒的疫苗的筛选与评价,以及用于制备抗SARS-COV-2病毒的减毒疫苗或灭活疫苗。
7.以下化合物中的任一种、两种或三种用于制备治疗冠状病毒感染性疾病的药物的用途:千金藤素(千金藤碱)、西拉菌素、盐酸甲氟喹和甲氟喹;优选地,所述冠状病毒为SARS-COV-2病毒。
8.一种用于筛选和/或评价抗冠状病毒活性药物的药物筛选模型,其包括权利要求1-5中任一项所述的穿山甲冠状病毒xCoV;
优选地,所述药物筛选模型为采用所述穿山甲冠状病毒xCoV感染的哺乳动物细胞,优选Vero细胞E6(非洲绿猴肾细胞);
优选地,所述药物筛选模型用于筛选和/或评价有抗SARS-CoV-2病毒活性的药物。
9.一种筛选和/或评价抗冠状病毒活性药物的方法,其采用权利要求8中所述的药物筛选模型进行;
优选地,该方法用于筛选和/或评价有抗SARS-CoV-2活性的药物。
10.根据权利要求9所述的筛选和/或评价抗冠状病毒活性药物的方法,其包括步骤(1):向所述药物筛选模型中加入待测试的药物并进行培养;
任选地,在步骤(1)之后还包括以下步骤(2a)或步骤(2b),或同时包含步骤(2a)和步骤(2b):
步骤(2a):在显微镜下观察细胞病变;
步骤(2b):测定细胞和上清中的病毒核酸。
CN202110172158.7A 2020-02-16 2021-02-08 穿山甲冠状病毒xCoV及其应用和药物抗冠状病毒感染的应用 Active CN113046327B (zh)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CN202210386551.0A CN114657149B (zh) 2020-02-16 2021-02-08 西拉菌素在制备治疗冠状病毒感染性疾病的药物的用途
CN202210377234.2A CN114908062B (zh) 2020-02-16 2021-02-08 盐酸甲氟喹或甲氟喹在制备治疗冠状病毒感染性疾病的药物的用途
CN202210375336.0A CN114908061B (zh) 2020-02-16 2021-02-08 穿山甲冠状病毒xCoV及其应用

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2020100968681 2020-02-16
CN202010096868 2020-02-16

Related Child Applications (3)

Application Number Title Priority Date Filing Date
CN202210386551.0A Division CN114657149B (zh) 2020-02-16 2021-02-08 西拉菌素在制备治疗冠状病毒感染性疾病的药物的用途
CN202210377234.2A Division CN114908062B (zh) 2020-02-16 2021-02-08 盐酸甲氟喹或甲氟喹在制备治疗冠状病毒感染性疾病的药物的用途
CN202210375336.0A Division CN114908061B (zh) 2020-02-16 2021-02-08 穿山甲冠状病毒xCoV及其应用

Publications (2)

Publication Number Publication Date
CN113046327A true CN113046327A (zh) 2021-06-29
CN113046327B CN113046327B (zh) 2022-05-10

Family

ID=76508805

Family Applications (4)

Application Number Title Priority Date Filing Date
CN202210375336.0A Active CN114908061B (zh) 2020-02-16 2021-02-08 穿山甲冠状病毒xCoV及其应用
CN202110172158.7A Active CN113046327B (zh) 2020-02-16 2021-02-08 穿山甲冠状病毒xCoV及其应用和药物抗冠状病毒感染的应用
CN202210377234.2A Active CN114908062B (zh) 2020-02-16 2021-02-08 盐酸甲氟喹或甲氟喹在制备治疗冠状病毒感染性疾病的药物的用途
CN202210386551.0A Active CN114657149B (zh) 2020-02-16 2021-02-08 西拉菌素在制备治疗冠状病毒感染性疾病的药物的用途

Family Applications Before (1)

Application Number Title Priority Date Filing Date
CN202210375336.0A Active CN114908061B (zh) 2020-02-16 2021-02-08 穿山甲冠状病毒xCoV及其应用

Family Applications After (2)

Application Number Title Priority Date Filing Date
CN202210377234.2A Active CN114908062B (zh) 2020-02-16 2021-02-08 盐酸甲氟喹或甲氟喹在制备治疗冠状病毒感染性疾病的药物的用途
CN202210386551.0A Active CN114657149B (zh) 2020-02-16 2021-02-08 西拉菌素在制备治疗冠状病毒感染性疾病的药物的用途

Country Status (1)

Country Link
CN (4) CN114908061B (zh)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1566947A (zh) * 2003-06-27 2005-01-19 中国医学科学院实验动物研究所 一种sars相关冠状病毒的灵长类动物模型及其构建方法和用途

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005512998A (ja) * 2001-11-09 2005-05-12 ローレン チャロウス、 抗マラリア治療剤に対する新規の使用
CN1237185C (zh) * 2003-06-04 2006-01-18 中国科学院上海药物研究所 Sars冠状病毒3cl蛋白酶三维结构模型与抗sars药物
CN1450164A (zh) * 2003-06-11 2003-10-22 复旦大学 一种新型冠状病毒株及其医药用途
CN1666602A (zh) * 2004-03-11 2005-09-14 华中科技大学同济医学院附属同济医院 建立sars相关冠状病毒小鼠模型的方法
US7371837B2 (en) * 2004-07-21 2008-05-13 The University Of Hong Kong Human virus causing respiratory tract infection and uses thereof
WO2015157223A1 (en) * 2014-04-07 2015-10-15 University Of Maryland, Baltimore Methods of treating coronavirus infection
GB201918670D0 (en) * 2019-12-17 2020-01-29 Bauer Sabine Catheter device for releasing pharmaceutically active compounds over an extended period

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1566947A (zh) * 2003-06-27 2005-01-19 中国医学科学院实验动物研究所 一种sars相关冠状病毒的灵长类动物模型及其构建方法和用途

Non-Patent Citations (7)

* Cited by examiner, † Cited by third party
Title
HUA-HAO FAN 等: "Repurposing of clinically approved drugs for treatment of coronavirus disease 2019 in a 2019-novel coronavirus-related coronavirus model", 《CHIN MED J (ENGL)》 *
PING LIU 等: "Viral metagenomics revealed sendai virus and coronavirus infection of Malayan Pangolins (Manis javanica)", 《VIRUSES》 *
TOMMY TSAN-YUK LAM 等: "Identifying SARS-CoV-2 related coronaviruses in Malayan pangolins", 《NATURE》 *
刘千勇 等: "新型冠状病毒(2019-nCoV)的靶向药物研究策略", 《药学学报》 *
朱小丽: "新型冠状病毒病(COVID-19)研究进展", 《中国生物工程杂志》 *
郁文亮 等: "新型冠状病毒的生物学特征及其药物研发策略", 《药物生物技术》 *
马怡红 等: "新型冠状病毒的初步认知", 《青海畜牧兽医杂志》 *

Also Published As

Publication number Publication date
CN114657149A (zh) 2022-06-24
CN114657149B (zh) 2023-10-20
CN114908061A (zh) 2022-08-16
CN114908062A (zh) 2022-08-16
CN114908062B (zh) 2023-10-20
CN113046327B (zh) 2022-05-10
CN114908061B (zh) 2023-10-27

Similar Documents

Publication Publication Date Title
Todt et al. Robust hepatitis E virus infection and transcriptional response in human hepatocytes
CN111534643B (zh) 一种检测呼吸道病原的核酸的试剂盒、检测方法及应用
Li et al. African swine fever virus protein MGF-505-7R promotes virulence and pathogenesis by inhibiting JAK1-and JAK2-mediated signaling
Niu et al. Novel goose astrovirus associated gout in Gosling, China
Gladue et al. Interaction between Core protein of classical swine fever virus with cellular IQGAP1 protein appears essential for virulence in swine
KR101927726B1 (ko) 살모넬라균의 비코딩 rna 및 이의 식별과 응용
Li et al. A QP509L/QP383R-deleted African swine fever virus is highly attenuated in swine but does not confer protection against parental virus challenge
WO2012102793A2 (en) Mammalian genes involved in toxicity and infection
CN117413063A (zh) 冠状病毒治疗性干扰颗粒
CN110420331A (zh) Alkbh5抑制物在治疗病毒感染性疾病中的应用
Qin et al. The roles of mandarin fish STING in innate immune defense against infectious spleen and kidney necrosis virus infections
CN112587663B (zh) 长链非编码RNA-lncIVRL在防治甲型流感病毒感染中的应用
CN113046327B (zh) 穿山甲冠状病毒xCoV及其应用和药物抗冠状病毒感染的应用
CN110129288B (zh) Tbk1作为e3泛素连接酶的应用
CN106138030B (zh) 一种肠道病毒71毒株以及芒柄花素或其盐在抑制肠道病毒71中的应用
Wei et al. Litopenaeus vannamei Src64B restricts white spot syndrome virus replication by modulating apoptosis
CN103215267B (zh) 抑制流感病毒相关基因的siRNA及其应用
CN114246847A (zh) 查尔酮类化合物在治疗冠状病毒感染中的应用
Wang et al. The dominant strain of SARS-CoV-2 is a mosaicism
CN114246874B (zh) 鲁斯可皂苷元在预防冠状病毒感染中的应用
CN115305249B (zh) 一种电离辐射响应的lncRNA分子lnc1267及其在调控细胞放射敏感性中的应用
Guzman Host Gene Variants and Sting-Mediated Reactive Oxygen Species Affect Drosophila Melanogaster Survival During Coxiella burnetii Infection
Vong Genomes of Small RNA Viruses: Amendments, Discoveries, and Characterizations
CN114246853A (zh) 异阿魏酸在制备用于防治冠状病毒感染的产品中的应用
Ghazi Mechanistic In silico design of angiotensin-converting enzyme 2 (ACE2) recombinant protein to block the S1 protein pathway of COVID-19 virus. Open J Proteom Genom 5 (1): 001-007

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 40052911

Country of ref document: HK

GR01 Patent grant
GR01 Patent grant