CN115484815A - Improved polynucleotides for expression of RNA-guided nucleases and DNA binding proteins in soybean

Info

Publication number
CN115484815A
Authority
CN
China
Prior art keywords
seq
soybean
sscai
optionally
reference polynucleotide
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202180025334.2A
Other languages
English (en)
Inventor
斯科特·A·贝文
亚当·P·乔伊斯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inari Agricultural Technology Co ltd
Original Assignee
Inari Agricultural Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inari Agricultural Technology Co ltd
Publication of CN115484815A
Legal status: Pending

Classifications

    • C: CHEMISTRY; METALLURGY
    • C12: BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N: MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00: Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09: Recombinant DNA-technology
    • C12N15/63: Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79: Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82: Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8201: Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
    • C12N15/8213: Targeted insertion of genes into the plant genome by homologous recombination
    • A: HUMAN NECESSITIES
    • A01: AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
    • A01H: NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
    • A01H5/00: Angiosperms, i.e. flowering plants, characterised by their plant parts; Angiosperms characterised otherwise than by their botanic taxonomy
    • A01H5/10: Seeds
    • A: HUMAN NECESSITIES
    • A01: AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
    • A01H: NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
    • A01H6/00: Angiosperms, i.e. flowering plants, characterised by their botanic taxonomy
    • A01H6/54: Leguminosae or Fabaceae, e.g. soybean, alfalfa or peanut
    • A01H6/542: Glycine max [soybean]
    • C: CHEMISTRY; METALLURGY
    • C12: BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N: MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00: Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14: Hydrolases (3)
    • C12N9/16: Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/22: Ribonucleases RNAses, DNAses
    • Y: GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02: TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02A: TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A40/00: Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
    • Y02A40/10: Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
    • Y02A40/146: Genetically Modified [GMO] plants, e.g. transgenic plants

Abstract

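Disclosed are methods of obtaining plant cells, plants, and plant parts, including soybean plant cells, plants, and plant parts, that comprise synthetic polynucleotides providing increased expression of encoded RNA-guided endonucleases (RGE), RNA-guided nickases (RGN), and RNA-guided DNA binding proteins. Also provided are soybean plant cells, plants, and plant parts comprising synthetic polynucleotides that provide increased expression of encoded RNA-guided endonucleases (RGE), RNA-guided nickases (RGN), and RNA-guided DNA binding proteins.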

Description

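Improved polynucleotides for expression of RNA-guided nucleases and DNA binding proteins in soybean
Cross-Reference to Related Applications
This international patent application claims the benefit of U.S. Provisional Patent Application No. 63/075,395, filed September 8, 2020; U.S. Provisional Patent Application No. 63/072,585, filed August 31, 2020; and U.S. Provisional Patent Application No. 63/001,806, filed March 30, 2020, each of which is incorporated herein by reference in its entirety.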
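Incorporation of Sequence Listing
The sequence listing contained in the file named "10071WO01", which is 793,473 bytes (as measured in [Figure BDA0003867941360000011]), contains 188 biological sequences, was created on March 16, 2021, is filed electronically via the USPTO EFS system, and is incorporated herein by reference in its entirety.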
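Technical Field
The present disclosure relates generally to methods and compositions comprising synthetic polynucleotides that provide increased expression of encoded RNA-guided endonucleases (RGE), RNA-guided nickases (RGN), and nuclease-deficient RNA-guided DNA binding proteins (ndRGDBP) in soybean cells, plants, and plant parts.
Background
The CRISPR/Cas systems of bacterial acquired immunity against bacteriophages and viruses have been adapted into potent new technologies for genome modification and control of gene expression. Improvements in the expression of CRISPR/Cas system components can provide improved genome modification frequencies and improved control of gene expression.
Genes derived from different species can differ considerably in their average usage of synonymous codons. In plants, dicots generally have coding sequences with lower GC content than monocots. When designing a transgene for optimal expression of an encoded protein of interest, the nucleic acid of the transgene is commonly designed to mimic the codon usage of the intended host.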
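Summary
Methods of modifying an endogenous plant gene in a plant genome (e.g., a soybean gene in a soybean genome) comprise: introducing, into a soybean plant cell comprising a synthetic polynucleotide encoding an RNA-guided endonuclease (RGE) or an RNA-guided nickase (RGN), a guide RNA or a polynucleotide encoding a guide RNA directed to a target editing site in the endogenous soybean gene and, optionally, a donor template DNA molecule having homology to the target editing site, wherein the synthetic polynucleotide: (i) has a GC (guanine and cytosine) content greater than 47%, 48%, or 50%; (ii) has a melting temperature (Tm) greater than 89 or 90 degrees Celsius; (iii) has a soybean codon adaptation index (sCAI) lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the RGE or RGN; or any combination of (i), (ii), and (iii); and selecting a modified plant cell or soybean plant cell, plant or soybean plant, plant part or soybean plant part, plant tissue or soybean tissue, or plant callus or soybean callus comprising the modification of the endogenous plant gene or soybean gene.
Provided are methods of modifying an endogenous soybean gene in a soybean genome, the methods comprising introducing into a soybean plant cell: a synthetic polynucleotide encoding an RNA-guided endonuclease (RGE) (e.g., a Cas12j RGE) or an RNA-guided nickase (RGN), wherein the synthetic polynucleotide has a GC (guanine and cytosine) content greater than 47%, 48%, or 50%, has a melting temperature (Tm) greater than 89 or 90 degrees Celsius, has a soybean codon adaptation index (sCAI) lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the RGE or RGN, or has any combination of said GC content, said Tm, and said lower sCAI; a guide RNA, or a polynucleotide encoding a guide RNA, directed to a target editing site in the endogenous soybean gene; and, optionally, a donor template DNA molecule having homology to the target editing site; and selecting a modified soybean plant cell, soybean plant, soybean plant part, soybean tissue, or soybean callus comprising the modification of the endogenous soybean gene.
Provided are methods of modifying expression of an endogenous soybean gene in a soybean genome, the methods comprising introducing into a soybean plant cell: (i) a synthetic polynucleotide encoding a protein comprising a nuclease-deficient RNA-guided DNA binding protein (ndRGDBP; e.g., a Cas12j ndRGDBP), wherein the synthetic polynucleotide has a GC (guanine and cytosine) content greater than 47%, 48%, or 50%, has a melting temperature (Tm) greater than 89 or 90 degrees Celsius, has a soybean codon adaptation index (sCAI) lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the ndRGDBP, or has any combination of said GC content, Tm, and sCAI; and (ii) a guide RNA, or a polynucleotide encoding a guide RNA, directed to a target binding site in the endogenous soybean gene; and selecting a modified soybean plant cell, soybean plant, soybean plant part, soybean tissue, or soybean callus in which expression of the endogenous soybean gene has been modified.
Soybean plant cells comprise a synthetic polynucleotide encoding a protein comprising an RNA-guided endonuclease (RGE; e.g., a Cas12j RGE), an RNA-guided nickase (RGN), or a nuclease-deficient RNA-guided DNA binding protein (ndRGDBP; e.g., a Cas12j ndRGDBP), wherein the polynucleotide: has a GC (guanine and cytosine) content greater than 47%, 48%, or 50%; has a melting temperature (Tm) greater than 89 or 90 degrees Celsius; has a soybean codon adaptation index (sCAI) lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the RGE; or has any combination of said GC content, Tm, and sCAI. Also provided are soybean plants, soybean plant parts including seeds or pods, and soybean tissues, including meristem tissue, embryonic tissue, and/or callus, comprising the soybean cells.
Also disclosed are methods for obtaining any of the aforementioned or otherwise provided soybean plant cells disclosed herein, the methods comprising: (a) introducing into the soybean plant cell a synthetic polynucleotide encoding a protein comprising the RNA-guided endonuclease (RGE), the RNA-guided nickase (RGN), or the nuclease-deficient RNA-guided DNA binding protein (ndRGDBP), wherein the polynucleotide has a GC (guanine and cytosine) content greater than 47%, 48%, or 50%; has a melting temperature (Tm) greater than 89 or 90 degrees Celsius; has a soybean codon adaptation index (sCAI) lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the RGE; or has any combination of said GC content, Tm, and lower sCAI; and (b) selecting a soybean plant cell comprising the synthetic polynucleotide.
Provided are isolated and recombinant nucleic acids comprising inactivating Cas12j mutations.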
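Brief Description of the Drawings
FIG. 1 depicts the codon usage database table for soybean (Glycine max), from the https website "kazusa.or.jp/codon/" on the World Wide Web.
FIG. 2 shows Western blot detection of nuclease levels expressed in soybean cells from expression vectors containing: the CasSoy_1.1.1 soybean codon-optimized reference polynucleotide (SCORP) with a GC content of about 37.5% (left) or the test Cas Soy1.1.S polynucleotide with a coding sequence having a GC content of about 49.5% (middle). Mock-transfected control soybean cells are on the right.
FIG. 3 shows genome editing efficiency in tomato and soybean protoplasts transformed with dicot-optimized (left) and soybean-optimized (middle) expression vectors. The mock-transfected negative control is on the right.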
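Detailed Description
Definitions
Where used herein, the term "and/or" is to be taken as specific disclosure of each of two or more specified features or components with or without the other. Thus, the term "and/or" as used in a phrase such as "A and/or B" is intended to include "A and B", "A or B", "A" (alone), and "B" (alone). Likewise, the term "and/or" as used in a phrase such as "A, B, and/or C" is intended to encompass each of the following embodiments: A, B, and C; A, B, or C; A or C; A or B; B or C; A and C; A and B; B and C; A (alone); B (alone); and C (alone).
As used herein, the terms "Cas12j" and "CasΦ" are used interchangeably to refer to the same grouping of RNA-directed nucleases.
As used herein, the terms "Cpf1" and "Cas12a" are used interchangeably to refer to the same grouping of RNA-directed nucleases.
As used herein, the terms "Cas12e" and "CasX" are used interchangeably to refer to the same grouping of RNA-directed nucleases.
As used herein, the phrase "donor template DNA molecule" refers to a dsDNA or ssDNA molecule having homology to a target editing site. A donor template DNA molecule can be used to edit a target editing site in a genome by homology-directed repair.
As used herein, "heterologous" means a nucleotide or polypeptide sequence that is not found in the native nucleic acid or protein, respectively. For example, relative to an RGE, RGN, or ndRGDBP polypeptide, a heterologous polypeptide comprises an amino acid sequence from a protein other than the RGE, RGN, or ndRGDBP polypeptide. In some cases, a portion of an RGE, RGN, or ndRGDBP protein from one species is fused to a portion of a Cas protein from a different species. The Cas sequences from each species can therefore be considered heterologous relative to one another. As another example, an RGE, RGN, or ndRGDBP protein (e.g., a dCas protein) can be fused to an active domain from a non-Cas protein (e.g., a histone deacetylase), and the sequence of the active domain can be considered a heterologous polypeptide (heterologous to the Cas protein).
As used herein, the terms "include", "includes", and "including" are to be construed as at least having the features to which they refer while not excluding any additional unspecified features.
As used herein, the terms "correspond", "corresponding", and the like, when used in the context of an amino acid position, mutation, and/or substitution in any given RGE, RGN, or ndRGDBP polypeptide relative to a reference RGE, RGN, or ndRGDBP, all refer to the position, mutation, and/or substitution of the amino acid residue in the given RGE, RGN, or ndRGDBP sequence that has identity or similarity to the amino acid residue in the reference polypeptide sequence when the given RGE, RGN, or ndRGDBP polypeptide is aligned to the reference RGE, RGN, or ndRGDBP polypeptide sequence using a pairwise alignment algorithm (e.g., CLUSTAL O 1.2.4 with default parameters).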
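The terms "polynucleotide" and "nucleic acid", used interchangeably herein, refer to a polymeric form of nucleotides of any length, either ribonucleotides or deoxyribonucleotides. Thus, the terms include, but are not limited to, single-, double-, or multi-stranded DNA or RNA, genomic DNA, cDNA, DNA-RNA hybrids, or a polymer comprising purine and pyrimidine bases or other natural, chemically or biochemically modified, non-natural, or derivatized nucleotide bases. The terms "polynucleotide" and "nucleic acid" should be understood to include, as applicable to the embodiment being described, single-stranded (such as sense or antisense) and double-stranded polynucleotides.
The terms "polypeptide", "peptide", and "protein" are used interchangeably herein and refer to a polymeric form of amino acids of any length, which can include genetically coded and non-genetically coded amino acids, chemically or biochemically modified or derivatized amino acids, and polypeptides having modified peptide backbones. The terms include fusion proteins, including, but not limited to, fusion proteins with a heterologous amino acid sequence; fusions with heterologous and homologous leader sequences, with or without N-terminal methionine residues; immunologically tagged proteins; and the like.
As used herein, the term "naturally occurring" as applied to a nucleic acid, a protein, a cell, or an organism refers to a nucleic acid, cell, protein, or organism that is found in nature.
As used herein, the term "isolated" is meant to describe a polynucleotide, a polypeptide, or a cell that is in an environment different from that in which the polynucleotide, the polypeptide, or the cell naturally occurs. An isolated genetically modified host cell may be present in a mixed population of genetically modified host cells.
As used herein, the term "exogenous nucleic acid" refers to a nucleic acid that is not normally or naturally found in and/or produced by a given bacterium, organism, or cell in nature. As used herein, the term "endogenous nucleic acid" refers to a nucleic acid that is normally found in and/or produced by a given bacterium, organism, or cell in nature. An "endogenous nucleic acid" is also referred to as a "native nucleic acid" or a nucleic acid that is "native" to a given bacterium, organism, or cell.
As used herein, "recombinant" means that a particular nucleic acid (DNA or RNA) is the product of various combinations of cloning, restriction, and/or ligation steps resulting in a construct having a structural coding or non-coding sequence distinguishable from endogenous nucleic acids found in natural systems. Generally, DNA sequences encoding the structural coding sequence can be assembled from cDNA fragments and short oligonucleotide linkers, or from a series of synthetic oligonucleotides, to provide a synthetic nucleic acid that is capable of being expressed from a recombinant transcriptional unit contained in a cell or in a cell-free transcription and translation system. Such sequences can be provided in the form of an open reading frame uninterrupted by internal non-translated sequences, or introns, which are typically present in eukaryotic genes. Genomic DNA comprising the relevant sequences can also be used in the formation of a recombinant gene or transcriptional unit. Sequences of non-translated DNA may be present 5' or 3' of the open reading frame, where such sequences do not interfere with manipulation or expression of the coding regions, and may indeed act to modulate production of a desired product by various mechanisms (see "DNA regulatory sequences", below).
Thus, for example, the term "recombinant" polynucleotide or "recombinant" nucleic acid refers to one that is not naturally occurring, e.g., one made by the artificial combination of two otherwise separated segments of sequence through human intervention. This artificial combination is often accomplished either by chemical synthesis or by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques. This is usually done to replace a codon with a redundant codon encoding the same or a conservative amino acid, while typically introducing or removing a sequence recognition site. Alternatively, nucleic acid segments of desired functions are joined together to generate a desired combination of functions.
Similarly, the term "recombinant" polypeptide refers to a polypeptide that is not naturally occurring, e.g., one made by the artificial combination of two otherwise separated segments of amino acid sequence through human intervention. Thus, for example, a polypeptide that comprises a heterologous amino acid sequence is recombinant.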
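The phrase "soybean codon adaptation index" (sCAI) refers to the codon adaptation index of a given polynucleotide coding sequence calculated from the soybean codon bias table of FIG. 1. In certain embodiments, the sCAI of the subject synthetic polynucleotides and reference polynucleotides can be obtained from the http internet site "genomes.urv.es/CAIcal/" (Puigbo et al., Biology Direct, 3:38) using the soybean codon bias table of FIG. 1. In certain embodiments, the sCAI of the subject synthetic polynucleotides and reference polynucleotides can be calculated according to the following formulas, in which the relative synonymous codon usage values are calculated from the soybean codon bias table of FIG. 1 according to Sharp and Li, 1987, Nucleic Acids Research 15(3):1281-1295.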
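$$\mathrm{CAI} = \frac{\mathrm{CAI}_{\mathrm{obs}}}{\mathrm{CAI}_{\max}}$$
$$\mathrm{CAI}_{\mathrm{obs}} = \left(\prod_{k=1}^{L} \mathrm{RSCU}_{k}\right)^{1/L}$$
$$\mathrm{CAI}_{\max} = \left(\prod_{k=1}^{L} \mathrm{RSCU}_{k\max}\right)^{1/L}$$
where $\mathrm{RSCU}_{k}$ (relative synonymous codon usage) is the RSCU value of the k-th codon in the gene, $\mathrm{RSCU}_{k\max}$ is the maximum RSCU value for the amino acid encoded by the k-th codon in the gene, and $L$ is the number of codons in the gene; and wherein RSCU is calculated according to the following formula:
$$\mathrm{RSCU}_{ij} = \frac{X_{ij}}{\dfrac{1}{n_i}\sum_{j=1}^{n_i} X_{ij}}$$
where $X_{ij}$ is the number of occurrences of the j-th codon for the i-th amino acid, and $n_i$ is the number (from 1 to 6) of alternative codons for the i-th amino acid.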
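The calculation described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the CAIcal implementation: the function names and the two-amino-acid `TOY_COUNTS` table are hypothetical (the values are not taken from FIG. 1), every codon in the table is assumed to have a nonzero count, and every codon of the coding sequence is counted toward L.

```python
import math
from collections import defaultdict

# Hypothetical toy usage table (NOT the FIG. 1 soybean values): codon -> count.
TOY_COUNTS = {"AAA": 30, "AAG": 10, "TTT": 20, "TTC": 20}
TOY_CODON_TO_AA = {"AAA": "K", "AAG": "K", "TTT": "F", "TTC": "F"}

def rscu_table(codon_counts, codon_to_aa):
    """RSCU_ij = X_ij / ((1/n_i) * sum_j X_ij) for each codon in the table."""
    by_aa = defaultdict(list)
    for codon, aa in codon_to_aa.items():
        by_aa[aa].append(codon)
    rscu = {}
    for codons in by_aa.values():
        mean_count = sum(codon_counts[c] for c in codons) / len(codons)
        for c in codons:
            rscu[c] = codon_counts[c] / mean_count
    return rscu

def cai(cds, codon_counts, codon_to_aa):
    """CAI = CAI_obs / CAI_max, each a geometric mean over the L codons."""
    rscu = rscu_table(codon_counts, codon_to_aa)
    max_rscu = {}
    for codon, aa in codon_to_aa.items():
        max_rscu[aa] = max(max_rscu.get(aa, 0.0), rscu[codon])
    codons = [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]
    # Work in log space so long genes do not underflow the product.
    log_obs = sum(math.log(rscu[c]) for c in codons)
    log_max = sum(math.log(max_rscu[codon_to_aa[c]]) for c in codons)
    return math.exp((log_obs - log_max) / len(codons))
```

With the toy table, a sequence built only from the most used codon of each amino acid scores 1.0, and replacing AAA with the rarer AAG lowers the score, which is the direction of change the sCAI comparison against a reference polynucleotide relies on.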
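The phrase "soybean codon-optimized reference polynucleotide", or the acronym "SCORP", refers to a polynucleotide encoding a polypeptide, where the sequence of the reference polynucleotide is generated from the polypeptide sequence with the OPTIMIZER program set forth in Puigbo P. et al. 2007, OPTIMIZER: A web server for optimizing the codon usage of DNA sequences, Nucleic Acids Research 35:W126-W131, and the soybean codon bias table set forth in FIG. 1.
A "construct" or "vector" refers to a recombinant nucleic acid, generally recombinant DNA, that has been generated for the purpose of the expression and/or propagation of one or more specific polynucleotide sequences, or that is to be used in the construction of other recombinant polynucleotide sequences.
The terms "DNA regulatory sequences", "control elements", and "regulatory elements", used interchangeably herein, refer to transcriptional and translational control sequences, such as promoters, enhancers, polyadenylation signals, terminators, protein degradation signals, and the like, that provide for and/or regulate expression of a coding sequence and/or production of an encoded polypeptide in a host cell.
The phrase "target site" or "target editing site" as used herein refers to any or all of the following polynucleotide sequences: (i) a sequence bound by an RGE or RGN complexed with a guide RNA; (ii) a sequence comprising the endonuclease or nickase cleavage site of an RGE or RGN complexed with a guide RNA; and/or (iii) a sequence bound by a donor template DNA molecule having homology to sequences adjacent to the endonuclease cleavage site of an RGE.
The phrase "target DNA binding site" as used herein refers to a polynucleotide sequence bound by an ndRGDBP complexed with a guide RNA.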
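The term "transformation" is used interchangeably herein with "genetic modification" and refers to a permanent or transient genetic change induced in a cell following introduction of new nucleic acid (e.g., DNA exogenous to the cell) into the cell. Genetic change ("modification") can be accomplished either by incorporation of the new nucleic acid into the genome of the host cell, or by transient or stable maintenance of the new nucleic acid as an episomal element. Where the cell is a eukaryotic cell, a permanent genetic change is generally achieved by introduction of new DNA into the genome of the cell. In prokaryotic cells, permanent changes can be introduced into the chromosome or via extrachromosomal elements such as plasmids and expression vectors, which may contain one or more selectable markers to aid in their maintenance in the recombinant host cell. Suitable methods of genetic modification include viral infection, transfection, conjugation, protoplast fusion, electroporation, particle gun technology, calcium phosphate precipitation, direct microinjection, and the like. The choice of method generally depends on the type of cell being transformed and the circumstances under which the transformation takes place (e.g., in vitro, ex vivo, or in vivo). A general discussion of these methods can be found in Ausubel et al., Short Protocols in Molecular Biology, 3rd ed., Wiley & Sons, 1995.
"Operably linked" refers to a juxtaposition wherein the components so described are in a relationship permitting them to function in their intended manner. For instance, a promoter is operably linked to a coding sequence if the promoter affects transcription or expression of the coding sequence. As used herein, the terms "heterologous promoter" and "heterologous control regions" refer to promoters and other control regions that are not normally associated with a particular nucleic acid in nature. For example, a "transcriptional control region heterologous to a coding region" is a transcriptional control region that is not normally associated with the coding region in nature. In other instances, two or more distinct polynucleotide sequences encoding distinct polypeptide components can be operably linked. When two distinct polypeptide components (e.g., an RGE, RGN, or ndRGDBP and a heterologous polypeptide) are operably linked, a fusion polypeptide is produced in which each distinct polypeptide component can perform its intended function. In certain embodiments, such fusion polypeptides can be produced by transcription and translation of operably linked polynucleotides or by translation of operably linked polynucleotides (e.g., where the polynucleotide is an RNA molecule).
As used herein, a "host cell" denotes an in vivo or in vitro eukaryotic cell, a prokaryotic cell, or a cell from a multicellular organism (e.g., a cell line) cultured as a unicellular entity, which eukaryotic or prokaryotic cells can be, or have been, used as recipients for a nucleic acid (e.g., an expression vector), and includes the progeny of the original cell that has been genetically modified by the nucleic acid. It is understood that the progeny of a single cell may not necessarily be completely identical in morphology or in genomic or total DNA complement to the original parent, owing to natural, accidental, or deliberate mutation. A "recombinant host cell" (also referred to as a "genetically modified host cell") is a host cell into which a heterologous nucleic acid, e.g., an expression vector, has been introduced. For example, a subject prokaryotic host cell is a genetically modified prokaryotic host cell (e.g., a bacterium) by virtue of introduction into a suitable prokaryotic host cell of a heterologous nucleic acid, e.g., an exogenous nucleic acid that is foreign to (not normally found in nature in) the prokaryotic host cell, or a recombinant nucleic acid that is not normally found in the prokaryotic host cell; and a subject eukaryotic host cell is a genetically modified eukaryotic host cell by virtue of introduction into a suitable eukaryotic host cell of a heterologous nucleic acid, e.g., an exogenous nucleic acid that is foreign to the eukaryotic host cell, or a recombinant nucleic acid that is not normally found in the eukaryotic host cell.
The term "conservative amino acid substitution" refers to the interchangeability in proteins of amino acid residues having similar side chains. For example, a group of amino acids having aliphatic side chains consists of glycine, alanine, valine, leucine, and isoleucine; a group of amino acids having aliphatic-hydroxyl side chains consists of serine and threonine; a group of amino acids having amide-containing side chains consists of asparagine and glutamine; a group of amino acids having aromatic side chains consists of phenylalanine, tyrosine, and tryptophan; a group of amino acids having basic side chains consists of lysine, arginine, and histidine; and a group of amino acids having sulfur-containing side chains consists of cysteine and methionine. Exemplary conservative amino acid substitution groups are: valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine-valine, and asparagine-glutamine.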
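A polynucleotide or polypeptide has a certain percent "sequence identity" to another polynucleotide or polypeptide, meaning that, when aligned, that percentage of bases or amino acids are the same, and in the same relative position, when comparing the two sequences. Sequence similarity can be determined in a number of different ways. To determine sequence identity, sequences can be aligned using the methods and computer programs, including BLAST, available over the World Wide Web at ncbi.nlm.nih.gov/BLAST. See, e.g., Altschul et al. (1990), J. Mol. Biol. 215:403-10. Another alignment algorithm is FASTA, available in the Genetics Computing Group (GCG) package from Madison, Wisconsin, USA, a wholly owned subsidiary of Oxford Molecular Group, Inc. Other techniques for alignment are described in Methods in Enzymology, vol. 266: Computer Methods for Macromolecular Sequence Analysis (1996), ed. Doolittle, Academic Press, Inc., a division of Harcourt Brace & Co., San Diego, California, USA. Of particular interest are alignment programs that permit gaps in the sequence. Smith-Waterman is one algorithm that permits gaps in sequence alignments. See Meth. Mol. Biol. 70:173-187 (1997). Also, the GAP program, which uses the Needleman and Wunsch alignment method, can be used to align sequences. See J. Mol. Biol. 48:443-453 (1970). CLUSTAL and MUSCLE are other commonly used alignment programs.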
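As a hedged illustration of the percent-identity definition above, the Python sketch below scores two sequences that have already been aligned (for example by one of the programs just named). The function name is hypothetical, and the choice of denominator (all alignment columns except those that are gaps in both sequences) is an assumption: alignment tools differ on this convention, so values may not match a given program's report.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of alignment columns holding the same residue in both sequences.

    Assumption: the denominator is the number of columns in which at least
    one sequence has a residue; other tools use other denominators.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == gap and b == gap)]
    if not columns:
        raise ValueError("alignment has no scorable columns")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)
```

For instance, `percent_identity("ACGT-A", "ACGTTA")` counts five matching columns out of six.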
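As used herein, the terms "treatment", "treating", and the like refer to obtaining a desired trait or a desired pharmacologic and/or physiologic effect. The effect can be to confer a desired trait (e.g., improved yield; resistance to insects, fungi, bacterial pathogens, and/or nematodes; herbicide tolerance; abiotic stress tolerance (e.g., drought, cold, salt, and/or heat tolerance); protein quantity and/or quality; starch quantity and/or quality; lipid quantity and/or quality; secondary metabolite quantity and/or quality; and the like), all in comparison to a control plant that lacks the modification. The effect may be prophylactic in terms of completely or partially preventing a disease or symptom thereof and/or may be therapeutic in terms of a partial or complete cure for a disease and/or an adverse effect attributable to the disease. "Treatment", as used herein, covers any treatment of a disease in a plant or a mammal, e.g., a human, and includes: (a) preventing the disease from occurring in a subject that may be predisposed to the disease but has not yet been diagnosed as having it; (b) inhibiting the disease, e.g., arresting its development; and (c) relieving the disease, e.g., causing regression of the disease.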
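As used herein, "Tm" is the melting temperature of a double-stranded DNA sequence calculated using the following formula:
Tm (°C) = (7.35 × E) + [17.34 × ln(Len)] + [4.96 × ln(Conc)] + [0.89 × ln(DNA)] − 25.42
where Tm = predicted melting temperature; E = DNA strength parameter per base = cumulative DNA strength parameter / DNA sequence length; Len = nucleotide sequence length (number of base pairs); Conc = [Na+] solution concentration (molar) = 0.16 M; and DNA = total nucleotide strand concentration = 0.0001 g/mL, according to the method of Khandelwal G, Bhyravabhotla J (2010) PLoS ONE 5(8): e12433, doi.org/10.1371/journal.pone.0012433.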
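The Tm formula above can be evaluated with a few lines of Python. This is only a sketch of the arithmetic: the function and parameter names are hypothetical, and the per-base DNA strength parameter E must be supplied by the caller, because the strength values from the cited Khandelwal and Bhyravabhotla method are not reproduced in this document.

```python
import math

def predicted_tm(strength_per_base, length_bp, na_molar=0.16, dna_g_per_ml=0.0001):
    """Tm (deg C) = 7.35*E + 17.34*ln(Len) + 4.96*ln(Conc) + 0.89*ln(DNA) - 25.42.

    strength_per_base: E, cumulative DNA strength parameter / sequence length
                       (caller-supplied; not derived here).
    length_bp:         Len, sequence length in base pairs.
    na_molar:          Conc, [Na+] in mol/L (0.16 M in the definition above).
    dna_g_per_ml:      DNA, total nucleotide strand concentration (0.0001 g/mL).
    """
    return (7.35 * strength_per_base
            + 17.34 * math.log(length_bp)
            + 4.96 * math.log(na_molar)
            + 0.89 * math.log(dna_g_per_ml)
            - 25.42)
```

Because Len, Conc, and DNA are fixed when two coding sequences for the same protein are compared under the same conditions, the difference in Tm between a synthetic polynucleotide and its reference reduces to 7.35 times the difference in E.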
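It is to be understood that this disclosure is not limited to the particular embodiments described, as such may, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting, since the scope of the present disclosure will be limited only by the appended claims.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. Although any methods and materials similar or equivalent to those described herein can also be used in the practice or testing of the present disclosure, the preferred methods and materials are now described. All publications mentioned herein are incorporated herein by reference to disclose and describe the methods and/or materials in connection with which the publications are cited.
It must be noted that, as used herein and in the appended claims, the singular forms "a", "an", and "the" include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to "a synthetic polynucleotide" or "a subject synthetic polynucleotide" includes a plurality of such polynucleotides, and reference to "the guide RNA" includes reference to one or more guide RNAs and equivalents thereof known to those skilled in the art, and so forth. It is further noted that the claims may be drafted to exclude any optional element. As such, this statement is intended to serve as antecedent basis for the use of such exclusive terminology as "solely", "only", and the like in connection with the recitation of claim elements, or for the use of a "negative" limitation.
It is appreciated that certain features of the disclosure that are, for clarity, described in the context of separate embodiments may also be provided in combination in a single embodiment. Conversely, various features of the disclosure that are, for brevity, described in the context of a single embodiment may also be provided separately or in any suitable sub-combination. All combinations of the embodiments pertaining to the disclosure are specifically embraced by the present disclosure and are disclosed herein just as if each and every combination were individually and explicitly disclosed. In addition, all sub-combinations of the various embodiments and elements thereof are also specifically embraced by the present disclosure and are disclosed herein just as if each and every such sub-combination were individually and explicitly disclosed herein.
To the extent that any of the foregoing definitions is inconsistent with a definition provided in any patent or non-patent reference incorporated herein by reference, any patent or non-patent reference cited herein, or any patent or non-patent reference found elsewhere, it is understood that the foregoing definition will be used herein.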
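Description
The present disclosure provides soybean plant cells, plants, and plant parts (e.g., seeds, embryos, and/or meristems) comprising synthetic polynucleotides that provide increased expression of encoded RNA-guided endonucleases (RGE), RNA-guided nickases (RGN), and nuclease-deficient RNA-guided DNA binding proteins (ndRGDBP). The present disclosure also provides plant cells, plants, and plant parts (e.g., seeds, embryos, and/or meristems), such as soybean or maize plant cells, plants, and plant parts, comprising synthetic polynucleotides that provide increased expression of encoded RNA-guided endonucleases (RGE) and nuclease-deficient RNA-guided DNA binding proteins (ndRGDBP). Also provided are methods of using the synthetic polynucleotides and plants or plant material (e.g., soybean cells, soybean plants, and soybean plant parts comprising the synthetic polynucleotides) to obtain improved genome modification frequencies and improved control of gene expression in those plants or plant material. Also provided are methods of making plants or plant material (e.g., soybean cells, soybean plants, and soybean plant parts) comprising the synthetic polynucleotides. Also provided are compositions comprising plant cells (e.g., soybean cells) and the synthetic polynucleotides. In certain embodiments, expression of the synthetic polynucleotides encoding an RGE, RGN, ndRGDBP, or a fusion polypeptide comprising them is increased in comparison to expression of a soybean codon-optimized reference polynucleotide (SCORP) encoding the same RGE, RGN, ndRGDBP, or fusion polypeptide. Such increased expression can be reflected in increased accumulation and/or biological activity (e.g., frequency of sequence modification and/or modified expression of an endogenous soybean gene) of the RGE, RGN, ndRGDBP, or fusion polypeptide in soybean cells, soybean plants, and soybean plant parts in comparison to control soybean cells, soybean plants, and soybean plant parts comprising a SCORP encoding the RGE, RGN, ndRGDBP, or fusion polypeptide.
The subject synthetic polynucleotides encoding the RGE, RGN, and ndRGDBP disclosed herein can be distinguished from a soybean codon-optimized reference polynucleotide (SCORP) by one or more features that include an increased GC (guanine and cytosine) content compared to the SCORP, an increased melting temperature (Tm) compared to the SCORP, a soybean codon adaptation index (sCAI) lower than the sCAI of the SCORP, or any combination of such increased GC content, increased Tm, and decreased sCAI. In certain embodiments, the GC content of the subject synthetic polynucleotide is about 46% to about 47%, 48%, 49%, 50%, 51%, 52%, 53%, 54%, 55%, or 56%. In certain embodiments, the GC content is increased by at least about 6%, 7%, 8%, 9%, 10%, 11%, 12%, or 13% compared to the GC content of the SCORP. In certain embodiments, the GC content is increased by at least about 6%, 7%, or 8% to about 9%, 10%, 11%, 12%, 13%, 14%, or 15% compared to the GC content of the SCORP. In certain embodiments, the Tm is increased by at least about 2, 3, 4, 5, or 6 degrees Celsius compared to the Tm of the SCORP. In certain embodiments, the Tm is increased by at least about 2 or 3 to about 4, 5, or 6 degrees Celsius compared to the Tm of the SCORP. In certain embodiments, the sCAI is decreased by at least about 0.01, 0.02, 0.03, 0.04, or 0.05 compared to the sCAI of the SCORP. In certain embodiments, the sCAI is decreased by at least about 0.01 or 0.02 to about 0.03, 0.04, or 0.05 compared to the sCAI of the SCORP.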
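The GC-content comparison described above is simple to compute. The Python sketch below uses hypothetical function names and assumes that an "increase" in GC content is expressed in percentage points (candidate percent minus reference percent); the text does not state whether absolute or relative change is intended, so that reading is an assumption.

```python
def gc_percent(seq):
    """Percent of unambiguous bases (A, C, G, T) that are G or C."""
    seq = seq.upper()
    total = sum(seq.count(base) for base in "ACGT")
    if total == 0:
        raise ValueError("sequence contains no A, C, G, or T")
    return 100.0 * (seq.count("G") + seq.count("C")) / total

def gc_gain(candidate, reference):
    """GC difference in percentage points (assumed convention)."""
    return gc_percent(candidate) - gc_percent(reference)
```

Under this convention, the FIG. 2 example of a reference at about 37.5% GC and a test coding sequence at about 49.5% GC would correspond to a gain of about 12 percentage points.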
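The subject synthetic polynucleotides encoding the RGE, RGN, and ndRGDBP disclosed herein can be operably linked to one or more distinct polynucleotide sequences encoding heterologous polypeptides. In certain embodiments, the subject synthetic polynucleotide is operably linked to a second polynucleotide encoding a nuclear localization signal (NLS), a chloroplast transit peptide (CTP), an epitope tag (ST), a transcription activation domain (TAD), a transcription repression domain (TRD), or a combination thereof. In other embodiments, the subject synthetic polynucleotide (or the subject synthetic polynucleotide further comprising the operably linked second polynucleotide) is operably linked to a third polynucleotide sequence encoding a heterologous polypeptide having an enzymatic activity that modifies target DNA. In certain embodiments (e.g., for soybean), any of the aforementioned distinct, second, or third polynucleotides encoding the distinct polypeptides can also be distinguished from a soybean codon-optimized reference polynucleotide (SCORP) encoding those same distinct polypeptides by one or more features that include an increased GC (guanine and cytosine) content compared to the SCORP, an increased melting temperature (Tm) compared to the SCORP, a soybean codon adaptation index (sCAI) lower than the sCAI of the SCORP, or any combination of such increased GC content, increased Tm, and decreased sCAI. Such distinct, second, or third polynucleotides (comprising an increased GC content compared to the SCORP, an increased Tm compared to the SCORP, or an sCAI lower than the sCAI of the SCORP) can be obtained by "back translation" or "reverse translation" of the distinct polypeptide, i.e., by using the protein sequence and a codon usage table having more GC-rich codons than the soybean codon bias table of FIG. 1 to generate a DNA sequence. Reverse translation or back translation programs that accept a polypeptide sequence and a codon bias table as input and generate a polynucleotide sequence include the World Wide Web internet site "bioinformatics.org/sms2/rev_trans.html" (Stothard P (2000) Biotechniques 28:1102-1104) and the "EMBOSS Backtranseq" function at the World Wide Web site "ebi.ac.uk/Tools/st/emboss_backtranseq/" (Madeira et al., Nucleic Acids Research, June 30, 2019, 47(W1):W636-W641, DOI: 10.1093/nar/gkz268). In certain embodiments, the GC content of the aforementioned distinct, second, or third polynucleotide is about 46% to about 47%, 48%, 49%, 50%, 51%, 52%, 53%, 54%, 55%, or 56%. In certain embodiments, the GC content of the aforementioned distinct, second, or third polynucleotide is increased by at least about 6%, 7%, 8%, 9%, 10%, 11%, 12%, or 13% compared to the GC content of the SCORP. In certain embodiments, the GC content of the aforementioned distinct, second, or third polynucleotide is increased by at least about 6%, 7%, or 8% to about 9%, 10%, 11%, 12%, 13%, 14%, or 15% compared to the GC content of the SCORP. In certain embodiments, the Tm of the aforementioned distinct, second, or third polynucleotide is increased by at least about 2, 3, 4, 5, or 6 degrees Celsius compared to the Tm of the SCORP. In certain embodiments, the Tm of the aforementioned distinct, second, or third polynucleotide is increased by at least about 2 or 3 to about 4, 5, or 6 degrees Celsius compared to the Tm of the SCORP. In certain embodiments, the sCAI of the aforementioned distinct, second, or third polynucleotide is decreased by at least about 0.01, 0.02, 0.03, 0.04, or 0.05 compared to the sCAI of the SCORP. In certain embodiments, the sCAI is decreased by at least about 0.01 or 0.02 to about 0.03, 0.04, or 0.05 compared to the sCAI of the SCORP.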
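To make the back-translation idea concrete, the Python sketch below converts a protein sequence to DNA by choosing one synonymous codon per residue from the standard genetic code. It is a deliberately simplified stand-in for the programs named above, not a reimplementation of them: instead of a real GC-enriched codon usage table, the default weight simply prefers the codon with the most G and C bases, and all names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated with base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def gc_count(codon):
    """Number of G and C bases in a codon (illustrative weight only)."""
    return codon.count("G") + codon.count("C")

def back_translate(protein, weight=gc_count):
    """Pick, per residue, the synonymous codon with the highest weight.

    Ties go to the alphabetically first codon. A usage-table-driven tool
    would instead weight codons by their frequency in that table.
    """
    best = {}
    for codon, aa in sorted(CODON_TO_AA.items()):
        if aa not in best or weight(codon) > weight(best[aa]):
            best[aa] = codon
    return "".join(best[aa] for aa in protein.upper())

def translate(cds):
    """Translate a coding sequence back to protein to check the round trip."""
    return "".join(CODON_TO_AA[cds[i:i + 3]] for i in range(0, len(cds), 3))
```

Passing a different `weight` function, for example one that looks up codon frequencies in a chosen usage table, changes which synonymous codons are selected without changing the encoded protein, which `translate(back_translate(p)) == p` confirms.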
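Non-limiting examples of the subject synthetic polynucleotides for soybean provided herein, and of the corresponding SCORPs encoding certain RGEs, are set forth in Tables 1-12 below (and in the corresponding sequences of the SEQ ID NOs listed in the sequence listing provided herewith). Also provided are synthetic polynucleotides encoding RGE, RGN, or ndRGDBP polypeptides that comprise one, two, three, or more nucleotide insertions, deletions, and/or substitutions in the synthetic polynucleotides set forth in Tables 1-12 below (and in the corresponding sequences of the SEQ ID NOs listed in the sequence listing provided herewith).
Table 1. SpCas9 encoded protein, SCORP, and subject synthetic polynucleotides
Figure BDA0003867941360000141
Figure BDA0003867941360000151
Table 2. SaCas9 encoded protein, SCORP, and subject synthetic polynucleotides
Figure BDA0003867941360000152
Table 3. FnCpf1 encoded protein, SCORP, and subject synthetic polynucleotides
Figure BDA0003867941360000153
Figure BDA0003867941360000161
Table 4. CasJ encoded protein, SCORP, and subject synthetic polynucleotides
Figure BDA0003867941360000162
Figure BDA0003867941360000171
Table 5. AsCpf1 encoded protein, SCORP, and subject synthetic polynucleotides
Figure BDA0003867941360000172
Table 6. Cms1 encoded protein, SCORP, and subject synthetic polynucleotides
Figure BDA0003867941360000173
Figure BDA0003867941360000181
Table 7. LbCpf1 encoded protein, SCORP, and subject synthetic polynucleotides
Figure BDA0003867941360000182
Figure BDA0003867941360000191
Table 8. MAD7 encoded protein, SCORP, and subject synthetic polynucleotides
Figure BDA0003867941360000192
Table 9. CasX encoded protein, SCORP, and subject synthetic polynucleotides
Figure BDA0003867941360000193
Figure BDA0003867941360000201
Table 10. Cas12j-1 encoded protein, SCORP, and subject synthetic polynucleotides
Figure BDA0003867941360000202
Figure BDA0003867941360000211
Table 11. Cas12j-2 encoded protein, SCORP, and subject synthetic polynucleotides
Figure BDA0003867941360000212
Table 12. Cas12j-3 encoded protein, SCORP, and subject synthetic polynucleotides
Figure BDA0003867941360000221
*Tm was calculated according to the method of Khandelwal G, Bhyravabhotla J (2010) PLoS ONE 5(8): e12433, doi.org/10.1371/journal.pone.0012433, where Conc = [Na+] solution concentration (molar) = 0.16 M and DNA = total nucleotide strand concentration = 0.0001 g/mL.
**sCAI was calculated with the program at the http internet site "genomes.urv.es/CAIcal/" (Puigbo et al., Biology Direct, 3:38) using the soybean codon bias table of FIG. 1.
SEQ ID NO: 156-165, 166-175, and 176-185 represent nucleic acid sequences optimized for expression in monocots, e.g., maize-optimized nucleic acid sequences encoding Cas12j-1, Cas12j-2, and Cas12j-3, respectively.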
本主题的合成的多核苷酸和SCORP可以编码RGE、RGN或ndRGDBP多肽(该术语可与术语“RGE、RGN或ndRGDBP蛋白”互换使用),其可以结合和/或修饰(例如,切割、切口、甲基化、去甲基化等)靶核酸和/或与靶核酸相关的多肽(例如,组蛋白尾的甲基化或乙酰化)(例如,在一些情况下,RGE、RGN或ndRGDBP蛋白包括具有活性的融合配偶体,在一些情况下,RGE或RGN提供核酸酶活性)。在一些情况下,RGE蛋白是天然存在的蛋白质(例如,天然存在于原核细胞中)。在其他情况下,RGE、RGN或ndRGDBP蛋白不是天然存在的多肽(例如,RGE、RGN或ndRGDBP蛋白是变体RGE、RGN或ndRGDBP蛋白、嵌合蛋白、RGE、RGN、或ndRGDBP融合多肽等)。
在一些实施例中,由本主题的合成的多核苷酸编码的RGE蛋白和SCORP可以编码天然存在的(野生型)蛋白。天然存在的RGE蛋白的序列的非限制性实例在SEQ ID NO:1、13、25、37、49、61、73、85、97、120、132和144中列出。然而,本主题的合成的多核苷酸和SCORP是非天然存在的(人工)多核苷酸。在某些实施例中,由本主题的合成的多核苷酸编码的RGE蛋白质和SCORP是与天然存在的RGE蛋白质相比包含一个或多个氨基酸残基的插入、缺失和/或取代的非天然存在的多肽。在某些实施例中,与天然存在的RGE蛋白质相比,由本主题的合成的多核苷酸编码的RGN或ndRGDBP蛋白质和SCORP是包含一个或多个氨基酸残基的插入、缺失和/或取代的非天然存在的多肽。
在一些实施例中,合成的多核苷酸在以下任一种的全长上具有至少70%、76%、80%、85%、90%、95%、97%、98%、99%或100%序列同一性:(i)选自由SEQ ID NO:3-12组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:2的大豆密码子优化的参考多核苷酸的sCAI和/或GC(鸟嘌呤和胞嘧啶)含量大于47%或48%;(ii)选自由SEQ ID NO:15-24组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:14的大豆密码子优化的参考多核苷酸的sCAI和/或GC(鸟嘌呤和胞嘧啶)含量大于47%或48%;(iii)选自由SEQ ID NO:27-36组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:26的大豆密码子优化的参考多核苷酸的sCAI和/或GC(鸟嘌呤和胞嘧啶)含量大于47%或48%;(iv)选自由SEQ ID NO:39-48组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:38的大豆密码子优化的参考多核苷酸的sCAI和/或GC(鸟嘌呤和胞嘧啶)含量大于47%或48%;(v)选自由SEQ IDNO:51-60组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:50的大豆密码子优化的参考多核苷酸的sCAI和/或GC(鸟嘌呤和胞嘧啶)含量大于47%或48%;(vi)选自由SEQ ID NO:63-72组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:62的大豆密码子优化的参考多核苷酸的sCAI和/或GC(鸟嘌呤和胞嘧啶)含量大于47%或48%;(vii)选自由SEQ ID NO:75-84组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ IDNO:74的大豆密码子优化的参考多核苷酸的sCAI和/或GC(鸟嘌呤和胞嘧啶)含量大于47%或48%;(viii)选自由SEQ ID NO:87-96组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸并且任选地具有的sCAI低于SEQ ID NO:86的大豆密码子优化的参考多核苷酸的sCAI和/或GC(鸟嘌呤和胞嘧啶)含量大于47%或48%;(ix)选自由SEQ ID NO:99-108组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸并且任选地具有的sCAI低于SEQ ID NO:98的大豆密码子优化的参考多核苷酸的sCAI和/或GC(鸟嘌呤和胞嘧啶)含量大于47%或48%;:(x)选自由SEQ ID NO:122-131组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸并且任选地具有的sCAI低于SEQ ID NO:121的大豆密码子优化的参考多核苷酸的sCAI和/或GC(鸟嘌呤和胞嘧啶)含量大于50%,例如大于55%、56%、57%或58%;(xi)选自由SEQ IDNO:134-143组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸并且任选地具有的sCAI低于SEQ ID NO:133的大豆密码子优化的参考多核苷酸的sCAI和/或GC(鸟嘌呤和胞嘧啶)含量大于50%,例如大于56%、57%、58%、59%或60%;(xii)选自由SEQ ID NO:146-154和155组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸并且任选地具有的sCAI低于SEQ ID NO:145的大豆密码子优化的参考多核苷酸的sCAI和/或GC(鸟嘌呤和胞嘧啶)含量大于45%,例如大于50%、51%、52%、53%或54%。
在一些情况下,由本主题的合成的多核苷酸和SCORP编码的RGE、RGN或ndRGDBP蛋白编码与如SEQ ID NO:1、13、25、37、49、61、73、85、97、120、132或144所示的RGE、RGN或ndRGDBP蛋白序列具有20%或更多序列同一性(例如,30%或更多、40%或更多、50%或更多,60%或更多、70%或更多、80%或更多、85%或更多、90%或更多、95%或更多、97%或更多、98%或更多、99%或更多、或100%序列同一性)的氨基酸序列,其中与SEQ ID NO:1、13、25、37、49、61、73、85、97、120、132或144具有100%序列同一性的蛋白质是RGE。例如,在一些情况下,RGE、RGN或ndRGDBP蛋白编码与如SEQ ID NO:1、13、25、37、49、61、73、85或97所示的RGE、RGN或ndRGDBP蛋白序列具有50%或更多序列同一性(例如,60%或更多、70%或更多、80%或更多、85%或更多、90%或更多、95%或更多、97%或更多、98%或更多、99%或更多、或100%序列同一性)的氨基酸序列,其中与SEQ ID NO:1、13、25、37、49、61、73、85、97、120、132或144具有100%序列同一性的蛋白质是RGE。在一些情况下,RGE、RGN或ndRGDBP蛋白编码与如SEQ ID NO:1、13、25、37、49、61、73、85、97、120、132或144所示的RGE、RGN或ndRGDBP蛋白序列具有80%或更多序列同一性(例如,85%或更多、90%或更多、95%或更多、97%或更多、98%或更多、99%或更多、或100%序列同一性)的氨基酸序列。其中与SEQ ID NO:1、13、25、37、49、61、73、85、97、120、132或144具有100%序列同一性的蛋白质是RGE。在一些情况下,RGE、RGN或ndRGDBP蛋白编码与如SEQ ID NO:1、13、25、37、49、61、73、85、97、120、132或144所示的RGE、RGN或ndRGDBP蛋白序列具有90%或更多序列同一性(例如,95%或更多、97%或更多、98%或更多、99%或更多、99.5%、99.8%、99.9%或100%序列同一性)的氨基酸序列。其中与SEQ ID NO:1、13、25、37、49、61、73、85、97、120、132或144具有100%序列同一性的蛋白质是RGE。在一些情况下,RGE、RGN或ndRGDBP蛋白编码具有如SEQ ID NO:1、13、25、37、49、61、73、85、97、120、132或144所示的RGE、RGN或ndRGDBP蛋白序列的氨基酸序列。在一些情况下,RGE、RGN或ndRGDBP蛋白编码具有如SEQ ID NO:1、13、25、37、49、61、73、85、97、120、132或144所示的RGE、RGN或ndRGDBP蛋白序列的氨基酸序列,除了序列编码降低蛋白质的天然存在的催化活性的氨基酸取代(例如,1、2、3或更多个氨基酸取代)(例如,在如下所述的氨基酸位置)。进一步包含另外的异源肽序列的RGE、RGN或ndRGDBP融合多肽可以进一步包含任何上述RGE、RGN或ndRGDBP蛋白。
RGE蛋白包括3个部分RuvC结构域(RuvC-I、RuvC-II和RuvC-III,在本文中也称为亚结构域),它们与RGE蛋白的一级氨基酸序列不连续,但一旦蛋白质产生并折叠则形成RuvC结构域。在一些情况下,(主题组合物和/或方法的)RGE蛋白包括分裂的RuvC结构域(例如,3个部分RuvC结构域-RuvC-I、RuvC-II和RuvC-III)。
与相应的野生型RGE蛋白的氨基酸序列相比,变体RGE、RGN或ndRGDBP蛋白具有至少一个氨基酸不同的氨基酸序列(例如,具有缺失、插入、取代、融合)。切割双链靶核酸的一条链但不切割另一条链的RGN蛋白在本文中称为“RGN”或“切口酶”(例如,“切口酶CasJ”)。基本上不具有核酸酶活性的Cas蛋白在本文中被称为ndRGDBP或死Cas蛋白(“dCas”)(需要注意的是,在某些实施例中,核酸酶活性可以由可操作地连接到ndRGDBP的异源多肽提供)。对于本文所述的任何RGE、RGN或ndRGDBP变体蛋白(例如,切口酶Cas、dCas、嵌合Cas、Cas融合多肽),RGE、RGN或ndRGDBP变体可包括具有上述相同参数(例如,存在的结构域、同一性百分比等)的RGE、RGN或ndRGDBP蛋白质序列。
在某些实施例中,编码的ndRGDBP获自RGE,例如,相对于天然存在的催化活性RGE序列突变,并与相应的天然发生的序列相比时表现出降低的核酸内切酶活性(例如,表现出90%或更少,80%或更少、70%或更少、60%或更少、50%或更少、40%或更少、30%或更少、20%或更少、10%或更少、5%或更少、或1%或更少核酸内切酶活性)。在一些情况下,编码的ndRGDBP是催化方面的“死”蛋白(基本上没有核酸内切酶活性),可以称为“dCas”。在一些情况下,编码的RGN仅切割双链靶核酸(例如双链靶DNA)的一条链。如本文更详细描述的,在一些情况下,编码的RGE、RGN或ndRGDBP融合(例如,缀合或可操作地连接)到具有目的活性(例如,目的催化活性)的异源多肽以形成融合蛋白(例如嵌合Cas蛋白或Cas融合多肽)。
Cas9 RGE(SEQ ID NO:1)的保守催化残基包括上面鉴定的RuvC亚结构域残基。根据SEQ ID NO:1编号的D10和/或H840是可以突变的残基,例如作为D10A或H840A,以降低Cas9多肽的催化活性并提供ndRGDBP。因此,在一些情况下,当一个或多个上述氨基酸(或任何Cas9蛋白的一个或多个相应氨基酸)发生突变(例如,被丙氨酸取代)时,Cas9蛋白具有降低的活性。在一些情况下,变体Cas9蛋白是催化方面的“死”蛋白(无催化活性),被称为“dCas9”。dCas9蛋白可以与提供活性的融合配偶体融合,并且在一些情况下,dCas9(例如,没有提供催化活性的融合配偶体-但在真核细胞中表达时可以具有NLS的情况下)可以结合靶DNA并可以阻断RNA聚合酶从靶DNA翻译或其他内源DNA结合或加工蛋白质的功能。在一些情况下,仅切割双链靶核酸(例如双链靶DNA)的一条链的切口酶或RGN可以通过突变SEQ IDNO:1的Cas9蛋白的一个或多个残基和/或催化残基来获得。在某些实施例中,Cas9 RGN可以包含残基D10中的突变(例如,D10A)。Cas9融合多肽可以包含上述Cas9 RGE、ndDBP或RGN蛋白和异源多肽中的任何。
FnCpf1 RGE(SEQ ID NO:25)的保守催化残基包括上面鉴定的RuvC亚结构域残基。根据SEQ ID NO:25编号的D917、E1006、E1028、D1255和/或N1257是可以突变的残基,例如D917A、E1006A、E1028A、D1255A和/或N1257A,以降低FnCpf1多肽的催化活性并提供ndRGDBP。因此,在一些情况下,当一个或多个上述氨基酸(或任何FnCpf1蛋白的一个或多个相应氨基酸)发生突变(例如,被丙氨酸取代)时,FnCpf1蛋白具有降低的活性。在一些情况下,变体FnCpf1蛋白是催化方面的“死”蛋白(无催化活性),被称为“dFnCpf1”。dFnCpf1蛋白可以与提供活性的融合配偶体融合,并且在一些情况下,dFnCpf1(例如,没有提供催化活性的融合配偶体-但在真核细胞中表达时可以具有NLS的情况下)可以结合靶DNA并可以阻断RNA聚合酶从靶DNA翻译或其他内源DNA结合或加工蛋白质的功能。在一些情况下,仅切割双链靶核酸(例如双链靶DNA)的一条链的切口酶或RGN可以通过突变SEQ ID NO:25的FnCpf1蛋白的一个或多个残基和/或催化残基来获得。在某些实施例中,FnCpf1 RGN可以包含残基R1226中的突变(例如,R1226A)。FnCpf1融合多肽可以包含上述FnCpf1 RGE、ndDBP或RGN蛋白和异源多肽中的任何。
CasJ RGE(SEQ ID NO:37)的保守催化残基包括上面鉴定的RuvC亚结构域残基。根据SEQ ID NO:37编号的D901、E1128和D1298是可以突变的残基,例如D901A、E1128A或D1298A,以降低CasJ多肽的催化活性并提供ndRGDBP。因此,在一些情况下,当一个或多个上述氨基酸(或任何CasJ蛋白的一个或多个相应氨基酸)发生突变(例如,被丙氨酸取代)时,CasJ蛋白具有降低的活性。在一些情况下,变体CasJ蛋白是催化方面的“死”蛋白(无催化活性),被称为“dCasJ”。dCasJ蛋白可以与提供活性的融合配偶体融合,并且在一些情况下,dCasJ(例如,没有提供催化活性的融合配偶体-但在真核细胞中表达时可以具有NLS的情况下)可以结合靶DNA并可以阻断RNA聚合酶从靶DNA翻译或其他内源DNA结合或加工蛋白质的功能。在一些情况下,仅切割双链靶核酸(例如双链靶DNA)的一条链的切口酶或RGN可以通过突变SEQ ID NO:37的CasJ蛋白的一个或多个残基和/或催化残基来获得。在某些实施例中,CasJ RGN可以包含残基E1128和/或D1298(例如E1128A和/或D1298A)中的突变。CasJ融合多肽可以包含上述CasJ RGE、ndDBP或RGN蛋白和异源多肽中的任何。
LbCpf1 RGE(SEQ ID NO:73)的保守催化残基包括上面鉴定的RuvC亚结构域残基。根据SEQ ID NO:73编号的D832、E925和/或D1148是可以突变的残基,例如D832A、E925A和/或D1148A,以降低LbCpf1多肽的催化活性并提供ndRGDBP。因此,在一些情况下,当一个或多个上述氨基酸(或任何LbCpf1蛋白的一个或多个相应氨基酸)发生突变(例如,被丙氨酸取代)时,LbCpf1蛋白具有降低的活性。在一些情况下,变体LbCpf1蛋白是催化方面的“死”蛋白(无催化活性),被称为“dLbCpf1”。dLbCpf1蛋白可以与提供活性的融合配偶体融合,并且在一些情况下,dLbCpf1(例如,没有提供催化活性的融合配偶体-但在真核细胞中表达时可以具有NLS的情况下)可以结合靶DNA并可以阻断RNA聚合酶从靶DNA翻译或其他内源DNA结合或加工蛋白质的功能。在一些情况下,仅切割双链靶核酸(例如双链靶DNA)的一条链的切口酶或RGN可以通过突变SEQ ID NO:73的LbCpf1蛋白的一个或多个残基和/或催化残基来获得。在某些实施例中,LbCpf1 RGN可以包含残基R1138中的突变(例如,R1138A)。LbCpf1融合多肽可以包含上述LbCpf1 RGE、ndDBP或RGN蛋白和异源多肽中的任何。
Cas12j-1RGE(SEQ ID NO:120)的保守催化残基包括RuvC亚结构域残基。根据SEQID NO:120编号的D371、E579和/或D673是可以突变的残基。C640、C643、C646、C661和/或C664也可以被突变以降低催化活性。示例性突变是D371A、E579A、D673A、C640A、C643A、C646A、C661A、C664A、C640S、C643S、C646S、C661S和C664S,以降低Cas12j-1多肽的催化活性并提供ndRGDBP。因此,在一些情况下,当一个或多个上述氨基酸(或Cas12j-1蛋白的一个或多个相应氨基酸)发生突变(例如,被丙氨酸或丝氨酸取代)时,Cas12j-1蛋白具有降低的活性。在一些情况下,变体Cas12j-1蛋白是催化方面的“死”蛋白(无催化活性),被称为“dCas12j-1”。dCas12j-1蛋白可以与提供活性的融合配偶体融合,并且在一些情况下,dCas12j-1(例如,没有提供催化活性的融合配偶体-但在真核细胞中表达时可以具有NLS的情况下)可以结合靶DNA并可以阻断RNA聚合酶从靶DNA翻译或其他内源DNA结合或加工蛋白质的功能。Cas12j-1融合多肽可以包含上述Cas12j-1RGE或ndDBP蛋白和异源多肽中的任何。
Cas12j-2RGE(SEQ ID NO:132)的保守催化残基包括上面鉴定的RuvC亚结构域残基。根据SEQ ID NO:132编号的D394、E606和/或D697是可以突变的残基。C667、C670、C673、C685和C688也可以被突变以降低催化活性。示例性突变是D394A、E606A、D697A、C667A、C670A、C673A、C685A、C688A、C667S、C670S、C673S、C685S和C688S,以降低Cas12j-2多肽的催化活性并提供ndRGDBP。因此,在一些情况下,当一个或多个上述氨基酸(或Cas12j-2蛋白的一个或多个相应氨基酸)发生突变(例如,被丙氨酸或丝氨酸取代)时,Cas12j-2蛋白具有降低的活性。在一些情况下,变体Cas12j-2蛋白是催化方面的“死”蛋白(无催化活性),被称为“dCas12j-2”。dCas12j-2蛋白可以与提供活性的融合配偶体融合,并且在一些情况下,dCas12j-2(例如,没有提供催化活性的融合配偶体-但在真核细胞中表达时可以具有NLS的情况下)可以结合靶DNA并可以阻断RNA聚合酶从靶DNA翻译或其他内源DNA结合或加工蛋白质的功能。Cas12j-2融合多肽可以包含上述Cas12j-2RGE或ndDBP蛋白和异源多肽中的任何。
Cas12j-3RGE(SEQ ID NO:144)的保守催化残基包括RuvC亚结构域残基。根据SEQID NO:144编号的D413、E618和/或D710是可以突变的残基。C680、C683、C687、C698和C701也可以被突变以降低催化活性。示例性突变是D413A、E618A、D710A、C680A、C683A、C687A、C698A、C701A、C680S、C683S、C687S、C698S和C701S,以降低Cas12j-3多肽的催化活性并提供ndRGDBP。因此,在一些情况下,当一个或多个上述氨基酸(或Cas12j-3蛋白的一个或多个相应氨基酸)发生突变(例如,被丙氨酸或丝氨酸取代)时,Cas12j-3蛋白具有降低的活性。在一些情况下,变体Cas12j-3蛋白是催化方面的“死”蛋白(无催化活性),被称为“dCas12j-3”。dCas12j-3蛋白可以与提供活性的融合配偶体融合,并且在一些情况下,dCas12j-3(例如,没有提供催化活性的融合配偶体-但在真核细胞中表达时可以具有NLS的情况下)可以结合靶DNA并可以阻断RNA聚合酶从靶DNA翻译或其他内源DNA结合或加工蛋白质的功能。Cas12j-3融合多肽可以包含上述Cas12j-3RGE或ndDBP蛋白和异源多肽中的任何。
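The substitution notation used in the preceding paragraphs (e.g., D10A, meaning aspartate at position 10 replaced by alanine, with positions numbered from 1 according to the reference sequence) can be applied programmatically. The Python sketch below is illustrative only: the function name is hypothetical, and the example uses a made-up toy sequence rather than any SEQ ID NO from the sequence listing.

```python
import re

def apply_substitutions(protein, substitutions):
    """Apply substitutions such as "D10A" to a protein sequence.

    Each entry is <wild-type residue><1-based position><new residue>.
    The wild-type residue is checked so that a numbering mismatch with
    the reference sequence raises an error instead of passing silently.
    """
    residues = list(protein)
    for sub in substitutions:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", sub)
        if match is None:
            raise ValueError(f"unrecognized substitution: {sub}")
        wild_type, position, new = match.group(1), int(match.group(2)), match.group(3)
        if not 1 <= position <= len(residues):
            raise ValueError(f"position {position} is outside the sequence")
        if residues[position - 1] != wild_type:
            raise ValueError(
                f"{sub}: expected {wild_type} at position {position}, "
                f"found {residues[position - 1]}")
        residues[position - 1] = new
    return "".join(residues)

# Toy sequence (hypothetical): D is the 10th residue.
TOY_PROTEIN = "M" + "A" * 8 + "D" + "KK"
```

Applying `["D10A"]` to `TOY_PROTEIN` replaces the single aspartate with alanine; applying the same list to a sequence with different numbering fails the wild-type check, which is the reason for aligning a given protein to its reference before transferring positions.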
如上所述,在一些情况下,RGE、RGN或ndRGDBP蛋白(在一些情况下是具有野生型核酸内切酶活性的Cas9、Cas12a、Cas12e或Cas12j蛋白,并且在一些情况下是具有降低的或经修饰的切割活性的变体RGE、RGN或ndRGDBP,例如dCas或切口酶Cas)与具有目的活性(例如,目的催化活性)的异源多肽融合(缀合)以形成融合蛋白(例如,嵌合Cas或Cpf1蛋白或Cas或Cpf1融合多肽)。可融合RGE、RGN或ndRGDBP Cas蛋白的异源多肽在本文中称为“融合配偶体”。在某些实施例中,编码RGE、RGN或ndRGDBP的主题多核苷酸可操作地连接至编码异源多肽或融合配偶体的多核苷酸。在某些实施例中,编码异源多肽或融合配偶体的多核苷酸:(i)具有的GC(鸟嘌呤和胞嘧啶)含量大于47%、48%或50%;(ii)具有的熔化温度(Tm)大于89或90摄氏度;(iii)具有的大豆密码子适应指数(sCAI)低于编码异源多肽的第二大豆密码子优化的参考多核苷酸的sCAI;或(i)、(ii)和(iii)的任何组合。与由主题多核苷酸编码的RGE、RGN或ndRGDBP融合的异源多肽包括具有修饰靶DNA的酶活性的异源多肽、核定位信号(NLS)、叶绿体转运肽(CTP)、表位标签(ST)、转录激活结构域(TAD)、转录阻遏结构域(TRD);或其任何组合。
在一些情况下,融合配偶体(例如异源多肽)在融合到例如由本主题的合成的多核苷酸编码的ndRGDBP或SCORP时可以调节靶DNA的转录(例如抑制转录、增加转录)。例如,在一些情况下,融合配偶体是抑制转录的蛋白质(或蛋白质的结构域)(例如,转录阻遏物,其是通过募集转录抑制蛋白、修饰靶DNA(例如甲基化)、募集DNA修饰剂、调节与靶DNA相关的组蛋白、募集组蛋白修饰剂(例如修饰组蛋白乙酰化和/或甲基化的那些等)而起作用的蛋白质)。在一些情况下,融合配偶体是增加转录的蛋白质(或蛋白质的结构域)(例如,转录激活物,其是通过募集转录激活蛋白、修饰靶DNA(例如去甲基化)、募集DNA修饰剂、调节与靶DNA相关的组蛋白、募集组蛋白修饰剂(例如修饰组蛋白乙酰化和/或甲基化的那些等)而起作用的蛋白质)。
在一些情况下,嵌合RGE、RGN或ndRGDBP蛋白或RGE、RGN或ndRGDBP融合多肽包括具有修饰靶核酸的酶活性(例如,核酸酶活性、甲基转移酶活性、去甲基化酶活性、DNA修复活性、DNA损伤活性、脱氨基活性、歧化酶活性、烷基化活性、脱嘌呤活性、氧化活性、嘧啶二聚体形成活性、整合酶活性、转座酶活性、重组酶活性、聚合酶活性、连接酶活性、解旋酶活性、光解酶活性或糖基化酶活性)的异源多肽。
在一些情况下,嵌合RGE、RGN或ndRGDBP蛋白或RGE、RGN或ndRGDBP融合多肽包括具有修饰与靶核酸相关的多肽(例如组蛋白)的酶活性(例如甲基转移酶活性、去甲基化酶活性、乙酰转移酶活性、去乙酰化酶活性、激酶活性、磷酸酶活性、泛素连接酶活性、去泛素化活性、腺苷酸化活性、去腺苷酸化活性、SUMO化活性、去SUMO化活性、核糖基化活性、去核糖基化活性、肉豆蔻酰化活性或去肉豆蔻酰化活性)的异源多肽。
可用于增加靶内源基因转录的异源多肽的实例可包括转录激活物结构域(TAD),例如玉蜀黍c1VP16、VP64、VP48、VP160、p65亚结构域(例如,来自NFkB)、EDLL的TAD和/或TAL激活结构域(例如,对于在植物中的活性);组蛋白赖氨酸甲基转移酶,例如SET1A、SET1B、MLL1至5、ASH1、SYMD2、NSDl等;组蛋白赖氨酸去甲基化酶,例如JHDM2a/b、UTX、JMJD3等;组蛋白乙酰转移酶,例如GCN5、PCAF、CBP、p300、TAF1、TIP60/PLIP、MOZ/MYST3、MORF/MYST4、SRCl、ACTR、P160、CLOCK等;和DNA去甲基化酶,例如十-十一易位(TET)双加氧酶1(TET1CD)、TET1、DME、DML1、DML2、ROS1等。在某些实施例中,可以使用多个VP64 TAD(Lowder等人,MolPlant.[分子植物学]2018;11(2):245-256)。可用于本文提供的ATF中的强效植物TAD的另一个实例是在AP2/ERF转录因子中发现的EDLL基序(Tiwari等人,Plant J.[植物杂志]2012;70(5):855-65)。可用于本文提供的ATF中的强效植物TAD的又一个实例是杂合VP64-p65-Rta三部分激活物(VPR;SEQ ID NO:109;Chavez等人,Nat Methods.[自然方法]2015;12(4):326-8)。在某些实施例中,上述异源肽可融合至结合靶内源基因的ndRGDBP。在某些实施例中,这样的ndRDBP还可以融合至合适的靶向肽,例如核定位信号(NLS;当靶向核基因时)或叶绿体转运肽(CTP;当靶向质体基因组中的基因时)。
可用于降低转录的异源多肽的实例可包括转录阻遏结构域(TRD),包括Krüppel相关框(KRAB或SKD);KOX1抑制结构域;Mad mSIN3交互结构域(SID);ERF阻遏结构域(ERD;Dong CJ,Liu JY.BMC Plant Biol[BMC植物生物学].2010年3月16日;10:47.)或SRDX抑制结构域(Figueroa P,Browse J.Plant J[植物杂志].2015年3月;81(6):849-60)用于植物中的抑制,等等;组蛋白赖氨酸甲基转移酶,例如Pr-SET7/8、SUV4-20H1、RIZl等;组蛋白赖氨酸去甲基化酶,例如JMJD2A/JHDM3A、JMJD2B、JMJD2C/GASC1、JMJD2D、JARID1A/RBP2、JARIDlB/PLU-1、JARID1C/SMCX、JARID1D/SMCY等;组蛋白赖氨酸去乙酰化酶,例如HDAC1、HDAC2、HDAC3、HDAC8、HDAC4、HDAC5、HDAC7、HDAC9、SIRT1、SIRT2、HDAC11等;DNA甲基化酶,例如Hhal DNA m5c-甲基转移酶(M.Hhal)、DNA甲基转移酶1(DNMT1)、DNA甲基转移酶3a(DNMT3a)、DNA甲基转移酶3b(DNMT3b)、MET1、DRM3(植物)、ZMET2、CMT1、CMT2(植物)等;以及外周招募元件,如Lamin A、Lamin B等。在某些实施例中,上述异源肽可融合至结合靶内源基因的ndRGDBP。在某些实施例中,这样的ndRDBP还可以融合至合适的靶向肽,例如核定位信号(NLS;当靶向核基因时)或叶绿体转运肽(CTP;当靶向质体基因组中的基因时)。
在一些情况下,用于RGE、RGN或ndRGDBP融合多肽中的融合配偶体具有修饰靶核酸(例如,ssRNA、dsRNA、ssDNA、dsDNA)的酶活性。融合配偶体可以提供的酶活性的实例包括但不限于:核酸酶活性(例如由限制酶(例如,Fokl核酸酶)提供的活性)、甲基转移酶活性(例如由甲基转移酶(例如,Hhal DNA m5c-甲基转移酶,M.Hhal)、DNA甲基转移酶1(DNMT1)、DNA甲基转移酶3a(DNMT3a)、DNA甲基转移酶3b(DNMT3b)、MET1、DRM3(植物)、ZMET2、CMT1、CMT2(植物)等)提供的活性);去甲基化酶活性(例如由去甲基化酶(例如,十-十一易位(TET)双加氧酶1(TET1CD)、TET1、DME、DML1、DML2、ROS 1等)提供的活性)、DNA修复活性、DNA损伤活性、脱氨基活性(例如由脱氨酶(例如胞嘧啶脱氨酶例如大鼠APOBEC1)提供的活性)、歧化酶活性、烷基化活性、脱嘌呤活性、氧化活性、嘧啶二聚体形成活性、整合酶活性(例如由整合酶和/或解离酶(例如,Gin转化酶,例如Gin转化酶的高活性突变体GinH106Y;人免疫缺陷病毒1型整合酶(IN);Tn3解离酶;等)提供的活性)、转座酶活性、重组酶活性(例如由重组酶(例如,Gin重组酶的催化结构域)提供的活性)、聚合酶活性、连接酶活性、解旋酶活性、光解酶活性和糖基化酶活性)。
在一些情况下,用于RGE、RGN或ndRGDBP融合多肽中的融合配偶体具有修饰与靶核酸(例如,ssRNA、dsRNA、ssDNA、dsDNA)相关的蛋白质(例如,组蛋白、RNA结合蛋白、DNA结合蛋白等)的酶活性。融合配偶体可以提供的酶活性(修饰与靶核酸相关的蛋白质)的实例包括但不限于:甲基转移酶活性(例如由组蛋白甲基转移酶(HMT)(例如,杂色抑制子3-9同源物1(SUV39H1,也称为KMTIA)、常染色质组蛋白赖氨酸甲基转移酶2(G9A,也称为KMT1C和EHMT2)、SUV39H2、ESET/SETDB 1等、SET1A、SET1B、MLL1至5、ASH1、SYMD2、NSD1、DOT1L、Pr-SET7/8、SUV4-20H1、EZH2、RIZl)提供的活性)、去甲基化酶活性(例如由组蛋白去甲基化酶(例如,赖氨酸去甲基化酶1A(KDM1A,也称为LSD1)、JHDM2a/b、JMJD2A/JHDM3A、JMJD2B、JMJD2C/GASC1、JMJD2D、JARID1A/RBP2、JARIDlB/PLU-1、JARID1C/SMCX、JARID1D/SMCY、UTX、JMJD3等)提供的)、乙酰转移酶活性(例如由组蛋白乙酰化酶转移酶(例如人乙酰转移酶p300的催化核心/片段、GCN5、PCAF、CBP、TAF1、TIP60/PLIP、MOZ/MYST3、MORF/MYST4、HB01/MYST2、HMOF/MYST1、SRC1、ACTR、P160、CLOCK等)提供的)、去乙酰化酶活性(例如由组蛋白去乙酰化酶(例如HDAC1、HDAC2、HDAC3、HDAC8、HDAC4、HDAC5、HDAC7、HDAC9、SIRT1、SIRT2、HDAC11等)提供的)、激酶活性、磷酸酶活性、泛素连接酶活性、去泛素化活性、腺苷酸化活性、去腺苷酸化活性、SUMO化活性、去SUMO化活性、核糖基化活性、去核糖基化活性、肉豆蔻酰化活性和去肉豆蔻酰化活性。
在RGE、RGN或ndRGDBP融合多肽中使用的合适融合配偶体的另一个实例是二氢叶酸还原酶(DHFR)去稳定结构域(例如,以产生化学可控的嵌合RGE、RGN或ndRGDBP蛋白或RGE、RGN或ndRGDBP融合多肽)和叶绿体转运肽。
在一些情况下,RGE、RGN或ndRGDBP融合多肽包含:a)RGE、RGN或ndRGDBP多肽;和b)叶绿体转运肽。因此,例如,CRISPR-RGE、RGN或ndRGDBP复合物可以靶向叶绿体。在一些情况下,这种靶向可以通过存在称为叶绿体转运肽(CTP)或质体转运肽的N末端延伸来实现。如果要在植物质体(例如叶绿体)中使表达的多肽区室化,则来自细菌来源的染色体转基因必须具有编码CTP序列的序列,该CTP序列与编码表达的多肽的序列融合。
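Localization of an exogenous polypeptide to the chloroplast is therefore typically achieved by operably linking a polynucleotide sequence encoding a CTP sequence to the 5' region of the polynucleotide encoding the exogenous polypeptide. The CTP is removed in a processing step during translocation into the plastid. Processing efficiency may, however, be affected by the amino acid sequence of the CTP and by the sequence near the amino terminus of the peptide. Other options that have been described for targeting to the chloroplast are the maize cab-m7 signal sequence (U.S. Patent No. 7,022,896; WO 97/41228), the pea glutathione reductase signal sequence (WO 97/41228), and the CTPs described in US 2009029861.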
本文披露的RGE、RGN或ndRGDBP多肽可进一步包含至少一种质体靶向信号肽、至少一种线粒体靶向信号肽或将RGE、RGN或ndRGDBP多肽靶向质体和线粒体的信号肽。质体、线粒体和双靶向信号肽定位信号是本领域已知的(参见,例如,Nassoury和Morse(2005)Biochim Biophys Acta[生物化学与生物物理学学报]1743:5-19;Kunze和Berger(2015)Front Physiol[生理学前沿]dx.doi.org/10.3389/fphys.2015.00259;Herrmann和Neupert(2003)IUBMB Life[IUBMB生命]55:219-225;Soll(2002)Curr Opin Plant Biol[植物生物学新见]5:529-535;Carrie和Small(2013)Biochim Biophys Acta[生物化学与生物物理学学报]1833:253-259;Carrie等人(2009)FEBS J[欧洲生化学会联合会杂志]276:1187-1195;Silva-Filho(2003)Curr Opin Plant Biol[植物生物学新见]6:589-595;Peeters和Small(2001)Biochim Biophys Acta[生物化学与生物物理学学报]1541:54-63;Murcha等人(2014)JExp Bot[实验植物学杂志]65:6301-6335;Mackenzie(2005)TrendsCell Biol[细胞生物学趋势]15:548-554;Glaser等人(1998)Plant Mol Biol[植物分子生物学]38:311-338)。质体、线粒体或双靶向信号肽可位于RGE、RGN或ndRGDBP多肽的N末端、C末端或内部位置。
在一些情况下,RGE、RGN或ndRGDBP融合多肽可包含:a)RGE、RGN或ndRGDBP多肽;和b)内体逃逸肽(EEP)。在一些情况下,内体逃逸多肽包含SEQ ID NO:110或SEQ ID NO:111的氨基酸序列。
对于在与Cas9、锌指和/或TALE蛋白融合的上下文中(用于位点特异性靶核酸修饰、转录调节和/或靶蛋白修饰,例如,组蛋白修饰)使用的以上融合配偶体中的一些(或更多)的实例参见例如:Nomura等人J Am Chem Soc.[美国化学学会杂志]2007年7月18日;129(28):8676-7;Rivenbark等人,Epigenetics.[表观遗传学]2012年4月;7(4):350-60;Nucleic Acids Res.[核酸研究]2016年7月8日;44(12):5615-28;Gilbert等人,Cell.[细胞]2013年7月18日;154(2):442-51;Kearns等人,Nat Methods.[自然方法]2015年5月;12(5):401-3;Mendenhall等人,Nat Biotechnol.[自然生物技术]2013年12月;31(12):1133-6;Hilton等人,Nat Biotechnol.[自然生物技术]2015年5月;33(5):510-7;Gordley等人,Proc Natl Acad Sci U S A.[美国国家科学院院刊]2009年3月31日;106(13):5053-8;Akopian等人,Proc Natl Acad Sci U S A.[美国国家科学院院刊]2003年7月22日;100(15):8688-91;Tan等人,J Virol.[病毒学杂志]2006年2月;80(4):1939-48;Tan等人,ProcNatl Acad Sci US A[美国国家科学院院刊].2003年10月14日;100(21):11997-2002;Papworth等人,Proc Natl Acad Sci U S A[美国国家科学院院刊].2003年2月18日;100(4):1621-6;Sanjana等人,Nat Protoc[自然实验手册].2012年1月5日;7(l):171-92;Beerli等人,Proc Natl Acad Sci U S A[美国国家科学院院刊].1998年12月8日;95(25):14628-33;Snowden等人,Curr Biol[当代生物学].2002年12月23日;12(24):2159-66;Xu等人,Cell Discov[细胞发现].2016年5月3日;2:16009;Komor等人,Nature[自然].2016年4月20日;533(7603):420-4;Chaikind等人,Nucleic Acids Res[核酸研究].2016年8月11日;Choudhury等人,Oncotarget[肿瘤靶标].2016年6月23日;Du等人,Cold Spring HarbProtoc[冷泉港实验方案].2016年1月4日;Pham等人,Methods Mol Biol[分子生物学方法].2016;1358:43-57;Balboa等人,Stem Cell Reports[干细胞报告].2015年9月8日;5(3):448-59;Hara等人,Sci Rep[科学报告].2015年1月9日;5:11221;Piatek等人,PlantBiotechnol J[植物生物技术杂志].2015年5月;13(4):578-89;Hu等人,Nucleic AcidsRes[核酸研究].2014年4月;42(7):4375-90;Cheng等人,Cell Res[细胞研究].2013年10月;23(10):1163-71;Cheng等人,Cell Res[细胞研究].2013年10月;23(10):l 163-71;和Maeder等人,Nat Methods[自然方法].2013年10月;10(10):977-9。
可用于RGE、RGN或ndRGDBP融合多肽中的其他合适的异源多肽包括但不限于直接和/或间接提供增加的靶核酸转录和/或翻译的多肽(例如,转录激活物或其片段,其是募集转录激活物、小分子/药物反应性转录和/或翻译调节剂、翻译调节蛋白等的蛋白质或其片段)。实现增加或减少转录的异源多肽的非限制性实例包括转录激活结构域和转录阻遏结构域。在一些这样的情况下,嵌合RGE、RGN或ndRGDBP多肽或RGE、RGN或ndRGDBP融合多肽被指导核酸(指导RNA)靶向到靶核酸中的特定位置(即序列)并发挥基因座特异性调节作用,例如阻断RNA聚合酶与启动子的结合(这选择性抑制转录激活物功能),和/或修饰局部染色质状态(例如,当使用修饰靶核酸或修饰与靶核酸相关联的多肽的融合序列时)。在一些情况下,这些变化是短暂的(例如,转录抑制或激活)。在一些情况下,这些变化是可遗传的(例如,当对靶核酸或与靶核酸相关的蛋白质(例如核小体组蛋白)进行表观遗传修饰时)。
当靶向ssRNA靶核酸时使用的异源多肽的非限制性实例包括但不限于:剪接因子(例如,RS结构域);蛋白质翻译组分(例如,翻译起始、延伸和/或释放因子;例如,eIF4G);RNA甲基化酶;RNA编辑酶(例如RNA脱氨酶,例如作用于RNA(ADAR)的腺苷脱氨酶,包括A到I和/或C到U编辑酶);解旋酶;RNA结合蛋白;等。应当理解,异源多肽可以包括整个蛋白质,或者在一些情况下可以包括蛋白质的片段(例如,功能结构域)。
主题嵌合RGE、RGN或ndRGDBP多肽或RGE、RGN或ndRGDBP融合多肽的异源多肽可以是能够与ssRNA相互作用的任何结构域(为了本披露的目的,其包括分子内和/或分子间二级结构(例如,双链RNA双链体,如发夹、茎环等),无论是瞬时的还是不可逆的相互作用,直接或间接的相互作用,包括但不限于选自以下组的效应子结构域:核酸内切酶(例如来自SMG5和SMG6等蛋白质的RNA酶III、CRR22 DYW结构域、Dicer和PIN(PilT N末端)结构域);负责刺激RNA切割的蛋白质和蛋白质结构域(例如CPSF、CstF、CFIm和CFIIm);核酸外切酶(例如XRN-1或核酸外切酶T);去腺苷化酶(例如HNT3);负责无义介导的RNA衰变的蛋白质和蛋白质结构域(例如UPF1、UPF2、UPF3、UPF3b、RNP SI、Y14、DEK、REF2和SRml60);负责稳定RNA的蛋白质和蛋白质结构域(例如PABP);负责阻遏翻译的蛋白质和蛋白质结构域(例如Ago2和Ago4);负责刺激翻译的蛋白质和蛋白质结构域(例如Staufen);负责(例如,能够)调节翻译的蛋白质和蛋白质结构域(例如,翻译因子,例如起始因子、延伸因子、释放因子等,例如,eIF4G);负责RNA的多聚腺苷酸化的蛋白质和蛋白质结构域(例如PAP1、GLD-2和Star-PAP);负责RNA的多尿苷酸化的蛋白质和蛋白质结构域(例如CI Dl和末端尿苷酸转移酶);负责RNA定位的蛋白质和蛋白质结构域(例如来自IMPl、ZBPl、She2p、She3p和Bicaudal-D);负责RNA的核保留的蛋白质和蛋白质结构域(例如Rrp6);负责RNA的核输出的蛋白质和蛋白质结构域(例如TAP、NXF1、THO、TREX、REF和Aly);负责阻遏RNA剪接的蛋白质和蛋白质结构域(例如PTB、Sam68和hnRNP Al);负责刺激RNA剪接的蛋白质和蛋白质结构域(例如富含丝氨酸/精氨酸(SR)的结构域);负责降低转录效率的蛋白质和蛋白质结构域(例如FUS(TLS));以及负责刺激转录的蛋白质和蛋白质结构域(例如CDK7和HIV Tat)。可替代地,效应子结构域可选自包含以下的组:核酸内切酶;能够刺激RNA切割的蛋白质和蛋白质结构域;核酸外切酶;去腺苷化酶;具有无义介导的RNA衰变活性的蛋白质和蛋白质结构域;能够稳定RNA的蛋白质和蛋白质结构域;能够阻遏翻译的蛋白质和蛋白质结构域;能够刺激翻译的蛋白质和蛋白质结构域;能够调节翻译的蛋白质和蛋白质结构域(例如,翻译因子,例如起始因子、延伸因子、释放因子等,例如,eIF4G);能够使RNA多聚腺苷酸化的蛋白质和蛋白质结构域;能够使RNA多尿苷酸化的蛋白质和蛋白质结构域;具有RNA定位活性的蛋白质和蛋白质结构域;能够使RNA在核内保留的蛋白质和蛋白质结构域;具有RNA核输出活性的蛋白质和蛋白质结构域;能够阻遏RNA剪接的蛋白质和蛋白质结构域;能够刺激RNA剪接的蛋白质和蛋白质结构域;能够降低转录效率的蛋白质和蛋白质结构域;以及能够刺激转录的蛋白质和蛋白质结构域。另一种合适的异源多肽是PUF RNA结合结构域,其在WO 2012068627中有更详细的描述,该文献通过引用以其整体并入本文。
可(整体或作为其片段)用作嵌合RGE、RGN或ndRGDBP多肽或RGE、RGN或ndRGDBP融合多肽的异源多肽的一些RNA剪接因子具有模块化结构,其中具有单独的序列特异性RNA结合模块和剪接效应子结构域。例如,富含丝氨酸/精氨酸(SR)的蛋白家族的成员包含与前mRNA中的外显子剪接增强子(ESE)结合的N末端RNA识别基序(RRM)和促进外显子包含的C末端RS结构域。作为另一个实例,hnRNP蛋白hnRNP Al通过其RRM结构域与外显子剪接沉默子(ESS)结合,并通过C末端的富含甘氨酸的结构域抑制外显子包含。一些剪接因子可以通过与两个替代位点之间的调节序列结合来调节剪接位点(ss)的替代使用。例如,ASF/SF2可以识别ESE并促进使用内含子近端位点,而hnRNP Al可以结合ESS并将剪接转向使用内含子远端位点。这些因子的一种应用是产生调节内源基因,特别是疾病相关基因的可变剪接的ESF。例如,Bcl-xpre-mRNA在两个可变5'剪接位点情况下产生两个剪接异构体,以编码功能相反的蛋白质。长剪接异构体Bcl-xL是有效的凋亡抑制剂,在长寿命的有丝分裂后细胞中表达,在许多癌细胞中上调,保护细胞免受凋亡信号的影响。短异构体Bcl-xS是促凋亡异构体,在具有高周转率的细胞(例如,发育中的淋巴细胞)中以高水平表达。两种Bcl-x剪接异构体的比例由位于核心外显子区域或外显子延伸区域(即,在两个可变5'剪接位点之间)的多个cc元件调节。对于更多实例,参见WO 2010075303,其通过引用以其整体并入本文。
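Other suitable fusion partners for an RGE, RGN, or ndRGDBP fusion polypeptide include, but are not limited to, proteins (or fragments thereof) that are boundary elements (e.g., CTCF), proteins and fragments thereof that provide peripheral recruitment (e.g., Lamin A, Lamin B, etc.), and protein docking elements (e.g., FKBP/FRB, Pil1/Aby1, etc.).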
可以用于在编码嵌合RGE、RGN或ndRGDBP多肽或RGE、RGN或ndRGDBP融合多肽的本主题的合成的多核苷酸中适用的各种其他合适的异源多肽(或其片段)的实例但不限于以下申请中描述的那些(这些出版物涉及其他CRISPR核酸内切酶,例如Cas9,但所描述的融合配偶体也可以与RGE、RGN或ndRGDBP一起使用):PCT专利申请:WO 2010075303、WO2012068627和WO 2013155555,例如可以在以下美国专利和专利申请中找到:8,906,616;8,895,308;8,889,418;8,889,356;8,871,445;8,865,406;8,795,965;8,771,945;8,697,359;20140068797;20140170753;20140179006;20140179770;20140186843;20140186919;20140186958;20140189896;20140227787;20140234972;20140242664;20140242699;20140242700;20140242702;20140248702;20140256046;20140273037;20140273226;20140273230;20140273231;20140273232;20140273233;20140273234;20140273235;20140287938;20140295556;20140295557;20140298547;20140304853;20140309487;20140310828;20140310830;20140315985;20140335063;20140335620;20140342456;20140342457;20140342458;20140349400;20140349405;20140356867;20140356956;20140356958;20140356959;20140357523;20140357530;20140364333;和20140377868;所有这些都通过引用以其整体并入本文。
在一些情况下,异源多肽(融合配偶体)或RGE、RGN或ndRGDBP融合多肽提供亚细胞定位,例如,异源多肽包含亚细胞定位序列(例如,用于靶向核的核定位信号(NLS),将融合蛋白保持在核外的序列(例如核输出序列(NES)),将融合蛋白保持在细胞质中的序列,用于靶向线粒体的线粒体定位信号,用于靶向叶绿体的叶绿体定位信号,ER保留信号等)。在一些实施例中,RGE、RGN或ndRGDBP融合多肽不包括NLS,使得蛋白质不靶向细胞核(这可能是有利的,例如,当靶核酸是存在于胞质溶胶中的RNA时)。在一些实施例中,异源多肽可以提供标签或表位标签(例如,异源多肽是可检测的标记)以便于追踪和/或纯化(例如,荧光蛋白,例如,绿色荧光蛋白(GFP),YFP、RFP、CFP、mCherry、tdTomato、mScarlett等;组氨酸标签,例如6XHis标签;血凝素(HA)标签;FLAG标签;Myc标签;等等)。
在一些情况下,RGE、RGN或ndRGDBP可操作地连接到核定位信号(NLS)(例如,在一些情况下,2个或更多个、3个或更多个、4个或更多个、或5个或更多个NLS)。因此,在一些情况下,RGE、RGN或ndRGDBP融合多肽包括一个或多个NLS(例如,2个或更多个、3个或更多个、4个或更多个、或5个或更多个NLS)。在一些情况下,一个或多个NLS(2个或更多个、3个或更多个、4个或更多个、或5个或更多个NLS)位于N末端和/或C末端处或附近(例如,在50个氨基酸内)。在一些情况下,一个或多个NLS(2个或更多个、3个或更多个、4个或更多个、或5个或更多个NLS)位于N末端处或附近(例如,在50个氨基酸内)。在一些情况下,一个或多个NLS(2个或更多个、3个或更多个、4个或更多个、或5个或更多个NLS)位于C末端处或附近(例如,在50个氨基酸内)。在一些情况下,一个或多个NLS(3个或更多个、4个或更多个或5个或更多个NLS)位于N末端和C末端两者处或附近(例如,在50个氨基酸内)。在一些情况下,NLS位于N末端,NLS位于C末端。
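Non-limiting examples of NLSs include NLSs comprising at least 4 consecutive basic amino acids, such as the SV40 large T antigen NLS (PKKKRKV; SEQ ID NO:112), the maize opaque-2 nuclear localization signal (SEQ ID NO:113), and the extended SV40 large T antigen NLS (SEQ ID NO:114). In general, the NLS (or multiple NLSs) is of sufficient strength to drive accumulation of the RGE, RGN, or ndRGDBP protein in a detectable amount in the nucleus of a eukaryotic cell. Accumulation in the nucleus can be detected by any suitable technique. For example, a detectable label can be fused to the RGE, RGN, or ndRGDBP protein so that its location within the cell can be visualized. Nuclei can also be isolated from cells, and their contents then analyzed by any suitable method for detecting protein (e.g., immunohistochemistry, Western blot, or enzyme activity assay). Accumulation in the nucleus can also be determined indirectly.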
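As a rough illustration of the "at least 4 consecutive basic amino acids" criterion, the following sketch scans a peptide for such a run. It assumes that only lysine (K) and arginine (R) are counted as basic residues; the function name `has_basic_run` is hypothetical and is not taken from the disclosure.

```python
import re

# Assumption: "basic" is limited to lysine (K) and arginine (R).
BASIC_RUN = re.compile(r"[KR]{4,}")


def has_basic_run(peptide: str) -> bool:
    """Return True if the peptide contains 4 or more consecutive K/R residues."""
    return BASIC_RUN.search(peptide.upper()) is not None


# SV40 large T antigen NLS as given in the text (SEQ ID NO:112).
SV40_NLS = "PKKKRKV"
print(has_basic_run(SV40_NLS))
```

A check of this kind only flags a candidate motif; whether a given sequence actually drives nuclear accumulation still has to be determined experimentally, as the paragraph above describes.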
在一些情况下,RGE、RGN或ndRGDBP融合多肽包括“蛋白质转导结构域”或PTD(也称为CPP-细胞穿透肽),它是指促进穿过脂质双层、胶束、细胞膜、细胞器膜或囊泡膜的多肽、多核苷酸、碳水化合物或有机或无机化合物。附着在另一个分子(其范围可以从小的极性分子到大的大分子和/或纳米颗粒)上的PTD促进分子穿过膜,例如从细胞外空间到细胞内空间,或从细胞质到细胞器内。在一些实施例中,PTD与多肽的氨基末端共价连接(例如,连接至RGE、RGN或ndRGDBP)以产生融合蛋白。在一些实施例中,PTD共价连接至多肽的羧基末端(例如,连接至野生型RGE、RGN或ndRGDBP以生成融合蛋白,或连接至变体RGE、RGN或ndRGDBP蛋白(例如RGE、RGN或ndRGDBP,切口酶RGE、RGN或ndRGDBP,或嵌合RGE、RGN或ndRGDBP蛋白或RGE、RGN或ndRGDBP融合多肽)以生成融合蛋白)。在一些情况下,PTD在合适的插入位点插入RGE、RGN或ndRGDBP融合多肽内部(即,不在RGE、RGN或ndRGDBP融合多肽的N末端或C末端)。在一些情况下,主题RGE、RGN或ndRGDBP融合多肽包括(缀合至、融合至)一个或多个PTD(例如,两个或更多个、三个或更多个、四个或更多个PTD)。在一些情况下,PTD包括核定位信号(NLS)(例如,在一些情况下,2个或更多个、3个或更多个、4个或更多个、或5个或更多个NLS)。因此,在一些情况下,RGE、RGN或ndRGDBP融合多肽包括一个或多个NLS(例如,2个或更多个、3个或更多个、4个或更多个、或5个或更多个NLS)。在一些实施例中,PTD与核酸(例如,RGE、RGN或ndRGDBP指导核酸,编码RGE、RGN或ndRGDBP指导核酸的多核苷酸,编码RGE、RGN或ndRGDBP融合多肽的多核苷酸,供体模板DNA分子等)共价连接。PTD的实例包括但不限于最小十一肽蛋白转导结构域(对应于HIV-1TAT的包含YGRKKRRQRRR(SEQ ID NO:115)的残基47-57;包含足以直接进入细胞的若干个精氨酸(例如、3、4、5、6、7、8、9、10或10-50个精氨酸)的聚精氨酸序列;VP22结构域(Zender等人(2002)Cancer Gene Ther[癌症基因疗法].9(6):489-96);果蝇触角蛋白转导结构域(Noguchi等人(2003)Diabetes[糖尿病]52(7):1732-1737);截短的人降钙素肽(Trehin等人(2004)Pharm.Research[药物研究]21:1248-1256);聚赖氨酸(Wender等人(2000)Proc.Natl.Acad.Sci.USA[美国科学院院报]97:13003-13008);运输蛋白;示例性PTD包括但不限于3个精氨酸残基至50个精氨酸残基的精氨酸均聚物。在一些实施例中,PTD是可激活的CPP(ACPP)(Aguilera等人(2009)IntegrBiol[整合生物学](Camb)6月;1(5-6):371-381)。ACPP包含通过可切割的接头连接到匹配的聚阴离子(例如,Glu9或“E9”)的聚阳离子CPP(例如,Arg9或“R9”),这将净电荷降低到几乎为零,从而抑制细胞的粘附和摄取。接头断裂后,聚阴离子被释放,局部暴露聚精氨酸及其固有的粘附性,从而“激活”ACPP穿过膜。
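In some embodiments, a subject RGE, RGN, or ndRGDBP protein can be fused to a fusion partner via a linker polypeptide (e.g., one or more linker polypeptides). The linker polypeptide can have any of a variety of amino acid sequences. Proteins can be joined by a spacer peptide, generally of a flexible nature, although other chemical linkages are not excluded. Suitable linkers include polypeptides between 4 and 40 amino acids in length, or between 4 and 25 amino acids in length. These linkers can be produced by using synthetic, linker-encoding oligonucleotides to couple the proteins, or can be encoded by the nucleic acid sequence encoding the fusion protein. Peptide linkers with a degree of flexibility can be used. The linking peptide can have virtually any amino acid sequence, bearing in mind that preferred linkers will have a sequence that results in a generally flexible peptide. Small amino acids, such as glycine and alanine, are useful in creating a flexible peptide. Creating such sequences is routine for those skilled in the art. A variety of different linkers are commercially available and are considered suitable for use.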
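Examples of linker polypeptides include glycine polymers (G)n, glycine-serine polymers (including, for example, (GS)n, (GSGGS)n (SEQ ID NO:116), (GGSGGS)n (SEQ ID NO:117), and (GGGS)n (SEQ ID NO:118), where n is an integer of at least 1), glycine-alanine polymers, and alanine-serine polymers. A person of ordinary skill will recognize that the design of a peptide conjugated to any desired element can include linkers that are wholly or partially flexible, such that the linker can include a flexible linker as well as one or more portions that confer a less flexible structure.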
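The repeat-unit notation above can be made concrete with a small sketch that expands a unit n times and checks the result against the 4 to 40 amino acid length range mentioned for suitable linkers. The helper names `build_linker` and `in_suitable_length_range` are hypothetical, and the list of units covers only the glycine and glycine-serine examples named in the text.

```python
# Repeat units named in the text: (G)n, (GS)n, (GSGGS)n, (GGSGGS)n, (GGGS)n.
LINKER_UNITS = ("G", "GS", "GSGGS", "GGSGGS", "GGGS")


def build_linker(unit: str, n: int) -> str:
    """Expand a repeat unit n times, where n is an integer of at least 1."""
    if unit not in LINKER_UNITS:
        raise ValueError(f"unknown repeat unit: {unit}")
    if n < 1:
        raise ValueError("n must be at least 1")
    return unit * n


def in_suitable_length_range(linker: str, low: int = 4, high: int = 40) -> bool:
    """Check the linker against the 4 to 40 amino acid range given in the text."""
    return low <= len(linker) <= high


linker = build_linker("GGGS", 3)
print(linker, in_suitable_length_range(linker))
```

Passing `high=25` applies the narrower 4 to 25 amino acid range that the text also mentions.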
可以说RGE、RGN或ndRGDBP指导RNA包括两个区段,靶向区段和蛋白质结合区段。RGE、RGN或ndRGDBP指导RNA的靶向区段包括与靶核酸(例如,靶ssRNA、靶ssDNA、双链靶DNA的互补链等)内的特定序列(靶位点)互补(并因此杂交)的核苷酸序列(指导序列)。靶核酸(例如基因组DNA)的位点特异性结合和/或切割可以发生在由RGE、RGN或ndRGDBP指导RNA(RGE、RGN或ndRGDBP指导RNA的指导序列)和靶核酸之间的碱基配对互补性确定的位置处(例如,靶基因座的靶序列)。用于Cas9和Cas12 RGE、RGN和ndDBP的指导RNA的设计在Robb,G.B.Genome editing with CRISPR-Cas:an overview.[使用CRISPR-Cas进行基因组编辑:概述]Current Protocols Essential Laboratory Techniques[当前方案基本实验室技术],19,e36.doi:10.1002/cpet.36;(2019)中进行了阐述。
蛋白质结合区段(或“蛋白质结合序列”)与RGE、RGN或ndRGDBP多肽相互作用(结合)。
在一些情况下,蛋白质结合区段由具有17-20或16-36个核苷酸的短序列组成,例如具有18或19或约24至29个核苷酸的序列。该蛋白质结合区段形成长度为五对残基的双链RNA双链体。5'末端在第一个RNA双链残基的上游有大约3个或9-14个残基。具有4-5个残基的茎结构将双链区分开。参见Pausch等人,Science[科学]369,333-337(2020)。
在一些情况下,主题RGE、RGN或ndRGDBP指导RNA的蛋白质结合区段包括两个互补的核苷酸段,它们彼此杂交以形成双链RNA双链体(dsRNA双链体)。在一些实施例中,本主题的合成的多核苷酸编码蛋白,该蛋白与如SEQ ID NO:37所示的RGE、RGN或ndRGDBP CasJ蛋白序列具有20%或更多序列同一性(例如,30%或更多、40%或更多、50%或更多、60%或更多、70%或更多、80%或更多85%或更多、90%或更多、95%或更多、97%或更多、98%或更多、99%或更多、或100%序列同一性),蛋白质结合区段可以由例如由SEQ ID NO:119的DNA分子编码的RNA构成。在一些实施例中,本主题的合成的多核苷酸编码蛋白,该蛋白与如SEQID NO:120所示的RGE或ndRGDBP Cas12j-1蛋白序列具有20%或更多序列同一性(例如,30%或更多、40%或更多、50%或更多、60%或更多、70%或更多、80%或更多85%或更多、90%或更多、95%或更多、97%或更多、98%或更多、99%或更多、或100%序列同一性),蛋白质结合区段可以由例如由SEQ ID NO:186的DNA分子或其3’片段编码的RNA构成。
在一些实施例中,本主题的合成的多核苷酸编码蛋白,该蛋白与如SEQ ID NO:132所示的RGE或ndRGDBP Cas12j-2蛋白序列具有20%或更多序列同一性(例如,30%或更多、40%或更多、50%或更多、60%或更多、70%或更多、80%或更多85%或更多、90%或更多、95%或更多、97%或更多、98%或更多、99%或更多、或100%序列同一性),蛋白质结合区段可以由例如由SEQ ID NO:187的DNA分子或其3’片段编码的RNA构成。
在一些实施例中,本主题的合成的多核苷酸编码蛋白,该蛋白与如SEQ ID NO:144所示的RGE或ndRGDBP Cas12j-3蛋白序列具有20%或更多序列同一性(例如,30%或更多、40%或更多、50%或更多、60%或更多、70%或更多、80%或更多85%或更多、90%或更多、95%或更多、97%或更多、98%或更多、99%或更多、或100%序列同一性),蛋白质结合区段可以由例如由SEQ ID NO:188的DNA分子或其3’片段编码的RNA构成。
RGE、RGN或ndRGDBP指导RNA和RGE、RGN或ndRGDBP蛋白,例如融合RGE、RGN或ndRGDBP多肽,形成复合物(例如,通过非共价相互作用结合)。RGE、RGN或ndRGDBP指导RNA通过包括靶向区段向复合物提供靶特异性,该靶向区段包括指导序列(与靶核酸序列互补的核苷酸序列)。复合物的RGE、RGN或ndRGDBP蛋白提供位点特异性活性(例如,由RGE、RGN或ndRGDBP蛋白或RGE、RGN或ndRGDBP融合多肽提供的切割活性和/或在嵌合RGE、RGN或ndRGDBP蛋白或RGE、RGN或ndRGDBP融合多肽的情况下由融合配偶体提供的活性)。换言之,RGE、RGN或ndRGDBP蛋白通过其与RGE、RGN或ndRGDBP指导RNA的缔合而被指导至靶核酸序列(例如,靶序列)。
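A "guide sequence", also referred to as the "targeting sequence" of an RGE, RGN, or ndRGDBP guide RNA, can be prepared such that the guide RNA can target an RGE, RGN, or ndRGDBP protein (e.g., a naturally occurring RGE, RGN, or ndRGDBP protein, a fusion RGE, RGN, or ndRGDBP polypeptide (e.g., a chimeric RGE, RGN, or ndRGDBP), etc.) to any desired sequence of any desired target nucleic acid, with the exception that a protospacer adjacent motif (PAM) sequence can be taken into account (e.g., as described herein). In general, the targeting sequence of a guide RNA comprises a sequence of about 18 or 19 to about 21 or 22 nucleotides that corresponds to the sequence immediately adjacent to the 5' end of the PAM (e.g., for Cas9 and similar RNA-directed nucleases), or a sequence of about 20, 21, 22, 23, or 24 nucleotides that corresponds to the sequence immediately adjacent to the 3' end of the PAM (e.g., for Cas12a (i.e., Cpf1) and similar RNA-directed nucleases). Thus, for example, an RGE, RGN, or ndRGDBP guide RNA can have a guide sequence that is complementary to (e.g., can hybridize to) a sequence in a nucleic acid in a eukaryotic cell (e.g., a viral nucleic acid, a eukaryotic nucleic acid (e.g., a eukaryotic chromosome, a chromosomal sequence, a eukaryotic RNA, etc.), etc.).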
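Where the subject synthetic polynucleotide encodes a protein having 20% or more sequence identity (e.g., 30% or more, 40% or more, 50% or more, 60% or more, 70% or more, 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) to the RGE, RGN, or ndRGDBP CasJ protein sequence set forth in SEQ ID NO:37, the PAM of the CasJ RGE, RGN, or ndRGDBP is immediately 5' of the target sequence on the non-complementary strand of the target DNA (the complementary strand hybridizes to the guide sequence of the guide RNA, whereas the non-complementary strand does not hybridize directly to the guide RNA and is the reverse complement of the complementary strand). In some embodiments (e.g., when the aforementioned CasJ protein is used), the PAM consensus sequence of the non-complementary strand is T-rich. Examples of PAM sequences include, but are not limited to, TTN, CTN, TCN, CCN, TTTN, TCTN, TTCN, CTTN, ATTN, TCCN, TTGN, GTTN, CCCN, CCTN, TTAN, TCGN, CTCN, ACTN, GCTN, TCAN, GCCN, and CCGN (where N is defined as any nucleotide).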
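Where the subject synthetic polynucleotide encodes a protein having 20% or more sequence identity (e.g., 30% or more, 40% or more, 50% or more, 60% or more, 70% or more, 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100% sequence identity) to an RGE or ndRGDBP Cas12j protein sequence set forth in SEQ ID NO:120, 132, or 144, the PAM of the Cas12j RGE or ndRGDBP is immediately 5' of the target sequence on the non-complementary strand of the target DNA (the complementary strand hybridizes to the guide sequence of the guide RNA, whereas the non-complementary strand does not hybridize directly to the guide RNA and is the reverse complement of the complementary strand). In some embodiments, e.g., when the Cas12j-1 protein (SEQ ID NO:120) is used, the PAM consensus sequence of the non-complementary strand is 5'-VTTR-3' (where V is A, C, or G, and R is A or G). In some embodiments, e.g., when the Cas12j-2 protein (SEQ ID NO:132) is used, the PAM consensus sequence of the non-complementary strand is 5'-TBN-3' (where B is G, T, or C, and N is A, T, C, or G). In some embodiments, e.g., when the Cas12j-3 protein (SEQ ID NO:144) is used, the PAM consensus sequence of the non-complementary strand is 5'-VTTN-3'.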
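To make the PAM geometry concrete, the sketch below scans the non-complementary strand (written 5' to 3') for a degenerate PAM and reports the sequence immediately 3' of each hit, that is, a target sequence with the PAM immediately on its 5' side. The function names, the example strand, and the default target length of 20 nucleotides are illustrative assumptions rather than values fixed by the disclosure; only the IUPAC codes used in the text (V, R, B, N) are expanded.

```python
import re

# IUPAC nucleotide codes appearing in the PAM consensus sequences in the text.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "V": "[ACG]", "R": "[AG]", "B": "[CGT]", "N": "[ACGT]",
}


def pam_regex(pam: str) -> str:
    """Translate a degenerate PAM such as 'VTTR' into a regular expression."""
    return "".join(IUPAC[base] for base in pam.upper())


def find_protospacers(strand: str, pam: str, length: int = 20):
    """Return (pam_start, pam, target) for every PAM followed by `length` nt.

    `strand` is the non-complementary strand, 5'->3'. A lookahead is used so
    that overlapping PAM sites are all reported.
    """
    pattern = re.compile(rf"(?=({pam_regex(pam)})([ACGT]{{{length}}}))")
    return [(m.start(), m.group(1), m.group(2))
            for m in pattern.finditer(strand.upper())]


# Hypothetical strand: one ATTA site (matching VTTR) followed by 20 nt.
strand = "GGATTA" + "CGTA" * 5 + "AA"
for hit in find_protospacers(strand, "VTTR"):
    print(hit)
```

The same function can be called with "TBN" or "VTTN" for the other Cas12j consensus sequences, or with any of the CasJ PAM examples such as "TTN".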
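In some embodiments, a subject RGE, RGN, or ndRGDBP guide RNA can also be said to include an "activator" and a "targeter" (e.g., an "activator-RNA" and a "targeter-RNA", respectively). When the "activator" and "targeter" are two separate molecules, the guide RNA is referred to herein as a "dual guide RNA", a "dgRNA", a "double-molecule guide RNA", or a "two-molecule guide RNA" (e.g., an "RGE, RGN, or ndRGDBP dual guide RNA"). In some embodiments, the activator and targeter are covalently linked to one another (e.g., via intervening nucleotides), and the guide RNA is referred to herein as a "single guide RNA", an "sgRNA", a "single-molecule guide RNA", or a "one-molecule guide RNA" (e.g., an "RGE, RGN, or ndRGDBP single guide RNA"). Thus, a subject RGE, RGN, or ndRGDBP single guide RNA comprises a targeter (e.g., targeter-RNA) and an activator (e.g., activator-RNA) that are linked to one another (e.g., by intervening nucleotides) and that can hybridize to one another to form the double-stranded RNA duplex (dsRNA duplex) of the protein-binding segment of the guide RNA, resulting in a stem-loop structure. The targeter and the activator therefore each have a duplex-forming segment, where the duplex-forming segment of the targeter and the duplex-forming segment of the activator are complementary to one another and hybridize to one another.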
在一些实施例中,RGE、RGN或ndRGDBP单指导RNA的接头是一段核苷酸。在一些情况下,RGE、RGN或ndRGDBP单指导RNA的靶向物和激活物通过间插核苷酸彼此连接,并且接头可以具有3到20个核苷酸(nt)的长度(例如,3到15、3到12、3到10、3到8、3到6、3到5、3到4、4到20、4到15、4到12、4到10、4到8、4到6或4到5nt)。在一些实施例中,RGE、RGN或ndRGDBP单指导RNA的接头可具有3至100个核苷酸(nt)的长度(例如,3到80、3到50、3到30、3到25、3到20、3到15、3到12、3到10、3到8、3到6、3到5、3到4、4至100、4到80、4到50、4到30、4到25、4到20、4到15、4到12、4到10、4到8、4到6或4到5nt)。在一些实施例中,RGE、RGN或ndRGDBP单指导RNA的接头可具有3至10个核苷酸(nt)的长度(例如,3到9、3到8、3到7、3到6、3到5、3到4、4到10、4到9、4到8、4到7、4到6或4到5nt)。
主题RGE、RGN或ndRGDBP指导RNA的靶向区段包括指导序列(即,靶向序列),其是与靶核酸中的序列(靶位点)互补的核苷酸序列。换言之,RGE、RGN或ndRGDBP指导RNA的靶向区段可以与靶核酸(例如,双链DNA(dsDNA)、单链DNA(ssDNA)、单链RNA(ssRNA)或双链RNA(dsRNA))以序列特异性方式通过杂交(即碱基配对)相互作用。RGE、RGN或ndRGDBP指导RNA的指导序列可以被修饰(例如,通过基因工程)/设计以与靶核酸(例如,真核靶核酸,如基因组DNA)内任何期望的靶序列杂交(例如,同时考虑PAM,例如,当靶向dsDNA靶时)。
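In some embodiments, the percent complementarity between the guide sequence and the target site of the target nucleic acid is 60% or more (e.g., 65% or more, 70% or more, 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100%). In some cases, the percent complementarity between the guide sequence and the target site of the target nucleic acid is 80% or more (e.g., 85% or more, 90% or more, 95% or more, 97% or more, 98% or more, 99% or more, or 100%). In some cases, the percent complementarity between the guide sequence and the target site of the target nucleic acid is 90% or more (e.g., 95% or more, 97% or more, 98% or more, 99% or more, or 100%). In some cases, the percent complementarity between the guide sequence and the target site of the target nucleic acid is 100%.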
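In some cases, the percent complementarity between the guide sequence and the target site of the target nucleic acid is 100% over the seven contiguous 3'-most nucleotides of the target site of the target nucleic acid.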
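The percent complementarity figures above can be illustrated with a minimal sketch. It assumes a gapless, Watson-Crick-only alignment of an RNA guide sequence against a DNA target site of equal length, both written 5' to 3', so that the 5'-most guide nucleotide pairs with the 3'-most target nucleotide. The function names are hypothetical, and G:U wobble pairs are deliberately not counted.

```python
# Watson-Crick partner on the DNA target for each RNA guide nucleotide.
PAIR = {"A": "T", "U": "A", "G": "C", "C": "G"}


def percent_complementarity(guide_rna: str, target_site: str) -> float:
    """Percent of guide positions that base-pair with the target site.

    Both sequences are written 5'->3'; the target is reversed so that
    position i of the guide faces its antiparallel partner.
    """
    if len(guide_rna) != len(target_site):
        raise ValueError("this sketch assumes equal-length, gapless pairing")
    target_3to5 = target_site.upper()[::-1]
    matches = sum(PAIR.get(g) == t
                  for g, t in zip(guide_rna.upper(), target_3to5))
    return 100.0 * matches / len(guide_rna)


def seed_is_perfect(guide_rna: str, target_site: str, seed: int = 7) -> bool:
    """True if the `seed` 3'-most target nucleotides are fully complementary."""
    target_3to5 = target_site.upper()[::-1]
    return all(PAIR.get(g) == t
               for g, t in zip(guide_rna.upper()[:seed], target_3to5[:seed]))


guide = "GUACGUACGU"
print(percent_complementarity(guide, "ACGTACGTAC"))
print(seed_is_perfect(guide, "ACGTACGTAG"))
```

Here a single mismatch at the 5' end of the target site lowers the overall percentage but leaves the seven 3'-most positions intact, whereas a mismatch at the 3' end of the target site breaks the 100% requirement over those seven positions.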
本披露提供了一种或多种核酸,其包含以下中的一种或多种:供体模板DNA分子序列(用于靶基因的同源定向修复),编码RGE、RGN或ndRGDBP多肽或融合多肽等的本主题的合成的多核苷酸序列,RGE、RGN或ndRGDBP指导RNA,和编码RGE、RGN或ndRGDBP指导RNA的核苷酸序列(在双指导RNA形式的情况下可以包括两个单独的核苷酸序列,或者在单指导RNA形式的情况下可以包括单个核苷酸序列)。本披露提供了包含编码由本主题的合成的多核苷酸编码的RGE、RGN或ndRGDBP融合多肽的核苷酸序列的核酸。本披露提供了包含编码RGE、RGN或ndRGDBP多肽的本主题的合成的多核苷酸序列的重组表达载体。本披露提供了包含编码RGE、RGN或ndRGDBP融合多肽的本主题的合成的多核苷酸的重组表达载体。本披露提供了包含编码RGE、RGN或ndRGDBP多肽的本主题的合成的多核苷酸的重组表达载体。本发明提供了重组表达载体,其包含:a)编码RGE、RGN或ndRGDBP融合多肽的本主题的合成的多核苷酸;和b)编码一种或多种RGE、RGN或ndRGDBP指导RNA的核苷酸序列。在一些情况下,编码RGE、RGN或ndRGDBP蛋白的本主题的合成的多核苷酸和/或编码RGE、RGN或ndRGDBP指导RNA的核苷酸序列可操作地连接到在所选细胞类型(例如、原核细胞、真核细胞、植物细胞(包括大豆植物细胞)、动物细胞、哺乳动物细胞、灵长类动物细胞、啮齿动物细胞、人细胞等)中可操作的启动子。
本披露提供了一种或多种重组表达载体,其(在一些情况下在不同的重组表达载体中,并且在一些情况下在相同的重组表达载体中)包括:(i)供体模板DNA分子的核苷酸序列(其中供体模板DNA分子包含与靶核酸(例如,靶基因组)的靶序列具有同源性的核苷酸序列);(ii)编码RGE、RGN或ndRGDBP指导RNA的核苷酸序列,该指导RNA与靶向的基因组的靶基因座的靶序列杂交(例如,单或双指导RNA)(例如,可操作地连接至在靶细胞如真核细胞中可操作的启动子);和(iii)编码RGE、RGN或ndRGDBP蛋白的本主题的合成的多核苷酸(例如,可操作地连接至在靶细胞如真核细胞或大豆细胞中可操作的启动子)。本披露提供了一种或多种重组表达载体,其(在一些情况下在不同的重组表达载体中,并且在一些情况下在相同的重组表达载体中)包括:(i)供体模板DNA分子的核苷酸序列(其中供体模板DNA分子包含与靶核酸(例如,靶基因组)的靶序列具有同源性的核苷酸序列);和(ii)编码RGE、RGN或ndRGDBP指导RNA的核酸,该指导RNA与靶向的基因组的靶基因座的靶序列杂交(例如,单或双指导RNA)(例如,可操作地连接至在靶细胞如真核细胞中可操作的启动子)。本披露提供了一种或多种重组表达载体,其(在一些情况下在不同的重组表达载体中,并且在一些情况下在相同的重组表达载体中)包括:(i)编码RGE、RGN或ndRGDBP指导RNA的本主题的合成的多核苷酸,该指导RNA与靶向的基因组的靶基因座的靶序列杂交(例如,单或双指导RNA)(例如,可操作地连接至在靶细胞如大豆植物细胞中可操作的启动子);和(ii)编码RGE、RGN或ndRGDBP蛋白的本主题的合成的多核苷酸(例如,可操作地连接至在靶细胞如真核细胞中可操作的启动子)。
取决于所利用的宿主/载体系统,可以在表达载体中使用许多合适的转录和翻译控制元件中的任一种,包括组成型和诱导型启动子、转录增强子元件、转录终止子等。
在一些实施例中,编码RGE、RGN或ndRGDBP指导RNA的本主题的合成的多核苷酸可操作地连接至控制元件,例如转录控制元件,例如启动子。在一些实施例中,编码RGE、RGN或ndRGDBP蛋白或RGE、RGN或ndRGDBP融合多肽的本主题的合成的多核苷酸可操作地连接至控制元件,例如转录控制元件,例如启动子。
转录控制元件可以是启动子。在一些情况下,启动子是组成型活性启动子。在一些情况下,启动子是调节型启动子。在一些情况下,启动子是诱导型启动子。在一些情况下,启动子是组织特异性启动子。在一些情况下,启动子是细胞类型特异性启动子。在一些情况下,转录控制元件(例如,启动子)在靶向的细胞类型或靶向的细胞群中起作用。
在一些实施例中,编码RGE、RGN或ndRGDBP指导RNA和/或RGE、RGN或ndRGDBP融合多肽的本主题的合成的多核苷酸可操作地连接至诱导型启动子。在一些实施例中,编码RGE、RGN或ndRGDBP指导RNA和/或RGE、RGN或ndRGDBP融合蛋白的本主题的合成的多核苷酸可操作地连接至组成型启动子。
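Methods of introducing a nucleic acid (e.g., a nucleic acid comprising a donor template DNA molecule sequence, one or more subject synthetic polynucleotides encoding an RGE, RGN, or ndRGDBP protein, and/or an RGE, RGN, or ndRGDBP guide RNA, etc.) into a host cell are known in the art, and any convenient method can be used to introduce a nucleic acid (e.g., an expression construct) into a cell. Suitable methods include, e.g., viral infection, transfection, lipofection, electroporation, calcium phosphate precipitation, polyethyleneimine (PEI)-mediated transfection, DEAE-dextran mediated transfection, liposome-mediated transfection, particle gun technology, direct microinjection, nanoparticle-mediated nucleic acid delivery, and the like.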
将重组表达载体引入细胞可以在任何培养基中以及在任何促进细胞存活的培养条件下进行。将重组表达载体引入靶细胞可以在体内或离体进行。将重组表达载体引入靶细胞可以在体外进行。
在一些实施例中,编码RGE、RGN或ndRGDBP蛋白的本主题的合成的多核苷酸可以作为RNA提供。RNA可以通过直接化学合成提供,也可以在体外从DNA转录(例如,编码RGE、RGN或ndRGDBP蛋白)。一旦合成,可以通过任何众所周知的将核酸引入细胞的技术(例如,显微注射、电穿孔、转染等)将RNA引入细胞中。
Nucleic acids can be provided to cells using well-established transfection techniques; see, e.g., Angel and Yanik (2010) PLoS ONE 5(7):e11756, as well as the commercially available transfection reagents from Qiagen, the Stemfect™ RNA Transfection Kit from Stemgent, and the transfection kits from Mirus Bio LLC. See also Beumer et al. (2008) PNAS 105(50):19821-19826.
载体可以直接提供给靶宿主细胞。换言之,将细胞与包含本主题的合成的多核苷酸的载体(例如,编码RGE、RGN或ndRGDBP蛋白的重组表达载体;等)接触,以使载体被细胞摄取。使细胞与作为质粒的核酸载体接触的方法包括电穿孔、氯化钙转染、显微注射和脂转染,这些在本领域中是众所周知的。对于病毒载体递送,细胞可以与包含主题病毒表达载体(例如双生病毒载体、TMV载体等)的病毒颗粒接触,该主题病毒表达载体包含本主题的合成的多核苷酸。
用于向靶宿主细胞提供编码RGE、RGN或ndRGDBP指导RNA和/或RGE、RGN或ndRGDBP多肽或融合多肽的核酸的载体可以包括用于驱动目的核酸表达即转录激活的合适启动子。换言之,在一些情况下,目的核酸将与启动子可操作地连接。这可以包括普遍作用型启动子,例如病毒启动子(例如,CaMV35S或CaMV19S)、肌动蛋白启动子或诱导型启动子,例如在特定细胞群中具有活性或响应存在四环素等药物具有活性的启动子。通过转录激活,旨在将靶细胞中的转录增加至高于基础水平10倍、100倍、更通常为1000倍。此外,用于向细胞提供编码RGE、RGN或ndRGDBP指导RNA和/或RGE、RGN或ndRGDBP蛋白的核酸的载体可以包括在靶细胞中编码选择标记的核酸序列,因此以鉴定已摄取RGE、RGN或ndRGDBP指导RNA和/或RGE、RGN或ndRGDBP蛋白的细胞。
包含编码RGE、RGN或ndRGDBP多肽或RGE、RGN或ndRGDBP融合多肽的本主题的合成的多核苷酸的核酸在一些情况下是RNA。因此,可以将RGE、RGN或ndRGDBP融合蛋白作为RNA引入细胞。将RNA引入细胞的方法是本领域已知的并且可以包括例如直接注射、转染或用于引入DNA的任何其他方法。
多种化合物、载体系统(例如,细菌植物转化载体系统)和方法中的任何一种都可以用于向靶细胞(例如植物细胞,包括大豆细胞)递送包含本主题的合成的多核苷酸的RGE、RGN或ndRGDBP系统。本文提供的RGE、RGN或ndRGDBP系统包括可以包含以下的系统:(a)编码RGE、RGN或ndRGDBP多肽的本主题的合成的多核苷酸,RGE、RGN或ndRGDBP指导RNA和供体模板DNA分子;(b)包含编码RGE、RGN或ndRGDBP多肽的mRNA的本主题的合成的多核苷酸;和RGE、RGN或ndRGDBP指导RNA;(c)包含编码RGE、RGN或ndRGDBP多肽的mRNA的本主题的合成的多核苷酸,RGE、RGN或ndRGDBP指导RNA和供体模板DNA分子;(d)包含编码RGE、RGN或ndRGDBP融合多肽的mRNA的本主题的合成的多核苷酸;和RGE、RGN或ndRGDBP指导RNA;(e)包含编码RGE、RGN或ndRGDBP融合多肽的mRNA的本主题的合成的多核苷酸,RGE、RGN或ndRGDBP指导RNA和供体模板DNA分子;(f)重组表达载体,其包含编码RGE、RGN或ndRGDBP多肽的本主题的合成的多核苷酸和编码RGE、RGN或ndRGDBP指导RNA的核苷酸序列;(g)重组表达载体,其包含编码RGE、RGN或ndRGDBP多肽的本主题的合成的多核苷酸,编码RGE、RGN或ndRGDBP指导RNA的核苷酸序列,和编码供体模板DNA分子的核苷酸序列;(h)重组表达载体,其包含编码RGE、RGN或ndRGDBP融合多肽的本主题的合成的多核苷酸和编码RGE、RGN或ndRGDBP指导RNA的核苷酸序列;(i)重组表达载体,其包含编码RGE、RGN或ndRGDBP融合多肽的本主题的合成的多核苷酸,编码RGE、RGN或ndRGDBP指导RNA的核苷酸序列,和编码供体模板DNA分子的核苷酸序列;(j)包含编码RGE、RGN或ndRGDBP多肽的本主题的合成的多核苷酸的第一重组表达载体,和包含编码RGE、RGN或ndRGDBP指导RNA的核苷酸序列的第二重组表达载体;(k)包含编码RGE、RGN或ndRGDBP多肽的本主题的合成的多核苷酸的第一重组表达载体,和包含编码RGE、RGN或ndRGDBP指导RNA的核苷酸序列的第二重组表达载体;和供体模板DNA分子;(l)包含编码RGE、RGN或ndRGDBP融合多肽的本主题的合成的多核苷酸的第一重组表达载体,和包含编码RGE、RGN或ndRGDBP指导RNA的核苷酸序列的第二重组表达载体;(m)包含编码RGE、RGN或ndRGDBP融合多肽的本主题的合成的多核苷酸的第一重组表达载体,和包含编码RGE、RGN或ndRGDBP指导RNA的核苷酸序列的第二重组表达载体;和供体模板DNA分子;(n)重组表达载体,其包含编码RGE、RGN或ndRGDBP多肽的本主题的合成的多核苷酸,编码第一RGE、RGN或ndRGDBP指导RNA的核苷酸序列,和编码第二RGE、RGN或ndRGDBP指导RNA的核苷酸序列;或(o)重组表达载体,其包含编码RGE、RGN或ndRGDBP融合多肽的本主题的合成的多核苷酸,编码第一RGE、RGN或ndRGDBP指导RNA的核苷酸序列,和编码第二RGE、RGN或ndRGDBP指导RNA的核苷酸序列;或(a)到(o)之一的一些变体。作为非限制性实例,RGE、RGN或ndRGDBP系统可以与脂质组合。作为另一个非限制性实例,RGE、RGN或ndRGDBP系统可以与颗粒组合,或配制成颗粒。作为另一个非限制性实例,RGE、RGN或ndRGDBP系统可包含在植物细胞中或递送至植物细胞(例如大豆植物细胞)。
将核酸引入宿主细胞的方法是本领域已知的,并且可以使用任何方便的方法将本主题的合成的多核苷酸(例如,表达构建体/载体)或包含它们的RGE、RGN或ndRGDBP系统引入靶细胞(例如,原核细胞、真核细胞、植物细胞(例如大豆植物细胞)、动物细胞、哺乳动物细胞、人细胞等)。合适的方法包括例如病毒感染、转染、缀合、原生质体融合、脂转染、电穿孔、磷酸钙沉淀、聚乙烯亚胺(PEI)介导的转染、DEAE-葡聚糖介导的转染、脂质体介导的转染、粒子枪技术、磷酸钙沉淀、直接微量注射、纳米颗粒介导的核酸递送(参见,例如,Panyam等人Adv Drug Deliv Rev.[药物递送综述进展]2012年9月13日.pii:S0169-409X(12)00283-9.doi:10.1016/j.addr.2012.09.023),等。在植物中,可以使用以下:细菌介导的(例如,农杆菌属物种(Agrobacterium sp.)、根瘤菌属物种(Rhizobium sp.)、中华根瘤菌属物种(Sinorhizobium sp.)、中生根瘤菌属物种(Mesorhizobium sp.)、慢生根瘤菌属物种(Bradyrhizobium sp.)、固氮菌属物种(Azobacter sp.)、叶杆菌属物种(Phyllobacterium sp.))用包含本主题的合成的多核苷酸的核酸对植物(例如,大豆)细胞、原生质体、胚胎、愈伤组织或组织的转染或转化;参见例如Broothaerts等人(2005)Nature[自然],433:629-633。
在一些情况下,RGE、RGN或ndRGDBP多肽或融合多肽或RGE、RGN或ndRGDBP系统可以作为核酸(例如,mRNA、DNA、质粒、表达载体、病毒载体、等)递送,该核酸编码RGE、RGN或ndRGDBP多肽或融合多肽和/或RGE、RGN或ndRGDBP系统的其他组分。可以通过任何方便的方法将RGE、RGN或ndRGDBP多肽或融合多肽引入细胞(提供给细胞);这样的方法是本领域普通技术人员已知的。作为说明性实例,可将编码RGE、RGN或ndRGDBP多肽的本主题的合成的多肽直接注射到细胞中(例如,有或没有RGE、RGN或ndRGDBP指导RNA或编码RGE、RGN或ndRGDBP指导RNA的核酸,以及有或没有供体模板DNA分子)。在一些情况下,RGE、RGN或ndRGDBP融合多肽(例如,与融合配偶体融合的RGE、RGN或ndRGDBP)作为核酸(例如,mRNA、DNA、质粒、表达载体、病毒载体等)提供,该核酸包含编码RGE、RGN或ndRGDBP融合多肽的本主题的合成的多肽。
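In some cases, a nucleic acid (e.g., an RGE, RGN, or ndRGDBP guide RNA; a nucleic acid comprising a subject synthetic polynucleotide encoding an RGE, RGN, or ndRGDBP polypeptide; one or more components of an RGE, RGN, or ndRGDBP system; etc.) is delivered to a cell (e.g., a target host cell, such as a soybean cell) in a particle or in association with a particle. In some cases, an RGE, RGN, or ndRGDBP system is delivered to a cell in a particle or in association with a particle. The terms "particle" and "nanoparticle" are used interchangeably, as appropriate. A particle or lipid envelope can be used to simultaneously deliver the following: a recombinant expression vector comprising a subject synthetic polynucleotide encoding an RGE, RGN, or ndRGDBP polypeptide or fusion polypeptide and/or an RGE, RGN, or ndRGDBP guide RNA; an mRNA comprising a subject synthetic polynucleotide encoding an RGE, RGN, or ndRGDBP polypeptide; and a guide RNA. For example, a subject synthetic polynucleotide encoding an RGE, RGN, or ndRGDBP polypeptide or fusion polypeptide and an RGE, RGN, or ndRGDBP guide RNA can be delivered via a particle, e.g., a delivery particle comprising a lipid or lipidoid and a hydrophilic polymer, e.g., a cationic lipid and a hydrophilic polymer, for instance wherein the cationic lipid comprises 1,2-dioleoyl-3-trimethylammonium-propane (DOTAP) or 1,2-ditetradecanoyl-sn-glycero-3-phosphocholine (DMPC) and/or wherein the hydrophilic polymer comprises ethylene glycol or polyethylene glycol (PEG); and/or wherein the particle further comprises cholesterol (e.g., particles from formulation No. 1 = DOTAP 100, DMPC 0, PEG 0, cholesterol 0; formulation No. 2 = DOTAP 90, DMPC 0, PEG 10, cholesterol 0; formulation No. 3 = DOTAP 90, DMPC 0, PEG 5, cholesterol 5).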
可以使用颗粒或脂质包膜同时递送以下:包含编码RGE、RGN或ndRGDBP多肽或融合多肽的本主题的合成的多核苷酸的mRNA或包含编码RGE、RGN或ndRGDBP多肽或融合多肽的本主题的合成的多核苷酸和/或RGE、RGN或ndRGDBP指导RNA(或核酸,例如编码RGE、RGN或ndRGDBP指导RNA的一种或多种表达载体)的重组表达载体。例如,可以使用具有被磷脂双层壳包裹的聚(β-氨基酯)(PBAE)核的可生物降解核-壳结构纳米颗粒。在一些情况下,使用基于自组装生物粘附聚合物的颗粒/纳米颗粒。
类脂质化合物(例如,如美国专利申请20110293703中所述)也可用于多核苷酸的施用,并可用于递送编码RGE、RGN或ndRGDBP多肽或融合多肽或RGE、RGN或ndRGDBP系统的一种或多种组分的本主题的合成的多核苷酸。一方面,氨基醇类脂质化合物与待递送至细胞或受试者的药剂组合以形成微粒、纳米颗粒、脂质体或胶束。氨基醇类脂质化合物可以与其他氨基醇类脂质化合物、聚合物(合成的或天然的)、表面活性剂、胆固醇、碳水化合物、蛋白质、脂质等组合以形成颗粒。
聚(β-氨基醇)(PBAA)可用于将编码RGE、RGN或ndRGDBP多肽或融合多肽或RGE、RGN或ndRGDBP系统的一种或多种组分的本主题的合成的多核苷酸递送至靶细胞。美国专利公开号20130302401涉及使用组合聚合制备的一类聚(β-氨基醇)(PBAA)。
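Sugar-based particles, e.g., GalNAc, as described in WO 2014118272 (incorporated herein by reference) and in Nair, J K et al., 2014, Journal of the American Chemical Society 136(49), 16958-16961, can be used to deliver a subject synthetic polynucleotide encoding an RGE, RGN, or ndRGDBP polypeptide or fusion polypeptide, or one or more components of an RGE, RGN, or ndRGDBP system, to a target cell.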
在一些情况下,脂质纳米颗粒(LNP)可用于将编码RGE、RGN或ndRGDBP多肽或融合多肽或RGE、RGN或ndRGDBP系统的一种或多种组分的本主题的合成的多核苷酸递送至靶细胞。带负电荷的聚合物(例如RNA)可以在低pH值(例如pH 4)下加载到LNP中,其中可电离的脂质显示正电荷。然而,在生理pH值下,LNP表现出低表面电荷,与较长的循环时间相适应。四种可电离的阳离子脂质已被关注,即l,2-二亚油酰基-3-二甲基铵-丙烷(DLinDAP)、1,2-二亚油酰基氧基-3-N,N-二甲基氨基丙烷(DLinDMA)、1,2-二亚油酰基氧基-酮基-N,N-二甲基-3-氨基丙烷(DLinKDMA)和l,2-二亚油酰基-4-(2-二甲基氨基乙基)-[l,3]-二氧戊环(DLinKC2-DMA)。LNP的制备并在例如Rosin等人(2011)Molecular Therapy[分子疗法]19:1286-2200)中进行了描述。可以使用阳离子脂质l,2-二亚油酰基-3-二甲基铵-丙烷(DLinDAP)、l,2-二亚油酰基氧基-3-N,N-二甲基氨基丙烷(DLinDMA)、l,2-二亚油酰基氧基酮基-N,N-二甲基-3-氨基丙烷(DLinKDMA)和l,2-二亚油酰基-4-(2-二甲基氨基乙基)-[l,3]-二氧戊环(DLinKC2-DM A)、(3-o-[2"-(甲氧基聚乙二醇2000)丁二酰基]-l,2-二肉豆蔻酰基-sn-乙二醇(PEG-S-DMG)和R-3-[(ω-甲氧基-聚(乙二醇)2000)氨基甲酰基]-l,2-二肉豆蔻基氧基丙基-3-胺(PEG-C-DOMG)。核酸(例如,RGE、RGN或ndRGDBP指导RNA;编码RGE、RGN或ndRGDBP多肽或融合多肽或RGE、RGN或ndRGDBP系统的一种或多种组分的本主题的合成的多核苷酸;等)可以封装在含有DlinDAP、DLinDMA、DLinK-DMA和DLinKC2-DMA的LNP(阳离子脂质:DSPC:CHOL:PEGS-DMG或PEG-C-DOMG的摩尔比为40:10:40:10)中。在一些情况下,掺加0.2%SP-DiOC18。
球形核酸(SNATM)构建体和其他纳米颗粒(特别是金纳米颗粒)可用于将包含本主题的合成的多核苷酸或RGE、RGN或ndRGDBP系统的一种或多种组分的核酸递送至靶细胞。参见,例如,Cutler等人,J.Am.Chem.Soc[美国化学学会杂志].2011 133:9254-9257,Hao等人,Small.2011 7:3158-3162,Zhang等人,ACS Nano[ACS纳米].2011 5:6962-6970,Cutler等人,J.Am.Chem.Soc[美国化学学会杂志].2012 134:1376-1391,Young等人,Nano Lett[纳米快报].2012 12:3867-71,Zheng等人,Proc.Natl.Acad.Sci.USA[美国国家科学院院刊].2012 109:11975-80,Mirkin,Nanomedicine[纳米药物]2012 7:635-638Zhang等人,J.Am.Chem.Soc[美国化学学会杂志].2012 134:16488-1691,Weintraub,Nature[自然]2013 495:S14-S16,Choi等人,Proc.Natl.Acad.Sci.USA[美国国家科学院院刊].2013 110(19):7625-7630,Jensen等人,Sci.Transl.Med[科学转化医学].5,209ral52(2013)和Mirkin,等人,Small,10:186-192。
具有包含编码RGE、RGN或ndRGDBP多肽或融合多肽或RGE、RGN或ndRGDBP系统的一种或多种组分的本主题的合成的多核苷酸的RNA的自组装纳米颗粒可以用聚乙烯亚胺(PEI)构建,聚乙烯亚胺被连接在聚乙二醇(PEG)远端的Arg-Gly-Asp(RGD)肽配体PEG化。
通常,“纳米颗粒”是指直径小于1000nm的任何颗粒。在一些情况下,适用于将包含本主题的合成的多核苷酸的核酸递送至靶细胞的纳米颗粒具有500nm或更小的直径,例如25nm至35nm、35nm至50nm、50nm至75nm、75nm至100nm、100nm至150nm、150nm至200nm、200nm至300nm、300nm至400nm或400nm至500nm。在一些情况下,适用于将包含本主题的合成的多核苷酸的核酸递送至靶细胞的纳米颗粒具有25nm至200nm的直径。在一些情况下,适用于将包含本主题的合成的多核苷酸的核酸递送至靶细胞的纳米颗粒具有100nm或更小的直径。在一些情况下,适用于将包含本主题的合成的多核苷酸的核酸递送至靶细胞的纳米颗粒具有35nm至60nm的直径。
适用于将包含本主题的合成的多核苷酸或RGE、RGN或ndRGDBP系统的一种或多种组分的核酸递送至靶细胞的纳米颗粒可以不同形式提供,例如,作为固体纳米颗粒(例如,金属,例如银、金、铁、钛)、非金属、基于脂质的固体、聚合物)、纳米颗粒的悬浮液或其组合。可以制备金属、电介质和半导体纳米颗粒,以及杂混结构(例如,核-壳纳米颗粒)。如果由半导体材料制成的纳米颗粒足够小(通常低于10nm),可以发生电子能级的量子化,它们也可以被标记为量子点。这种纳米级颗粒在生物医学应用中用作药物载剂或显像剂,并且可以适用于本披露中的类似目的。
半固体和软纳米颗粒也适用于将包含本主题的合成的多核苷酸或RGE、RGN或ndRGDBP系统的一种或多种组分的核酸递送至靶细胞。半固体性质的原型纳米颗粒是脂质体。
在一些情况下,脂质体用于将本主题的合成的多核苷酸或RGE、RGN或ndRGDBP系统的一种或多种组分递送至靶细胞。脂质体是球形囊泡结构,这些球形囊泡结构由围绕内部水性隔室的单层或多层的脂质双层和相对不可渗透的外部亲脂性磷脂双层构成。脂质体可以由几种不同类型的脂质制成;然而,磷脂最常用于生成脂质体。虽然当脂质膜与水溶液混合时脂质体的形成是自发的,但也可以通过使用均质器、超声仪或挤出装置以摇动的形式施加力来加速脂质体的形成。可以将几种其他添加剂添加到脂质体中以改变它们的结构和特性。例如,可以将胆固醇或鞘磷脂添加到脂质体混合物中,以帮助稳定脂质体结构并防止脂质体内部货物的泄漏。脂质体配制品可以主要由天然磷脂和脂质(例如1,2-二硬脂酰基-sn-甘油-3-磷脂酰胆碱(DSPC)、鞘磷脂、卵磷脂酰胆碱和单唾液酸神经节苷脂)构成。
稳定的核酸-脂质颗粒(SNALP)可用于将本主题的合成的多核苷酸或RGE、RGN或ndRGDBP系统的一种或多种组分递送至靶细胞。SNALP配制品可包含摩尔百分比为2:40:10:48的脂质3-N-[(甲氧基聚(乙二醇)2000)氨基甲酰基]-1,2-二肉豆蔻酰基氧基-丙基胺(PEG-C-DMA)、l,2-二亚油酰基氧基-N,N-二甲基-3-氨基丙烷(DLinDMA)、l,2-二硬脂酰基-sn-甘油-3-磷脂酰胆碱(DSPC)和胆固醇。可通过将D-Lin-DMA和PEG-C-DMA与双硬脂酰磷脂酰胆碱(DSPC)、胆固醇和siRNA(使用25:1脂质/siRNA比率)和胆固醇/D-Lin-DMA/DSPC/PEG-C-DMA的48/40/10/2摩尔比率进行配制来制备SNALP脂质体。所得SNALP脂质体的大小约为80-100nm。SNALP可以包含合成的胆固醇(西格玛奥德里奇公司(Sigma-Aldrich),圣路易斯,密苏里州,美国)、二棕榈酰基磷脂酰胆碱(Avanti极性脂质公司(Avanti PolarLipids),阿拉巴斯特(Alabaster),阿拉巴马州(Ala),美国)、3-N-[(w-甲氧基聚(乙二醇)2000)氨基甲酰基]-l,2-二肉豆蔻酰基氧基丙基胺和阳离子l,2-二亚油酰基氧基-3-N,N二甲基氨基丙烷。SNALP可包含合成的胆固醇(西格玛奥德里奇公司)、l,2-二硬脂酰基-sn-甘油-3-磷脂酰胆碱(DSPC;Avanti极性脂质公司)、PEG-cDMA和l,2-二亚油酰基氧基-3-(N;N-二甲基)氨基丙烷(DLinDMA)。
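Other cationic lipids, such as the amino lipid 2,2-dilinoleyl-4-dimethylaminoethyl-[1,3]-dioxolane (DLin-KC2-DMA), can be used to deliver a subject synthetic polynucleotide or one or more components of an RGE, RGN, or ndRGDBP system to a target cell. Preformed vesicles with the following lipid composition can be contemplated: amino lipid, distearoylphosphatidylcholine (DSPC), cholesterol, and (R)-2,3-bis(octadecyloxy)propyl-1-(methoxy poly(ethylene glycol)2000)propylcarbamate (PEG-lipid) in a molar ratio of 40/10/40/10, respectively, and an FVII siRNA/total lipid ratio of approximately 0.05 (w/w). To ensure a narrow particle size distribution in the range of 70-90 nm and a low polydispersity index of 0.11 ± 0.04 (n=56), the particles can be extruded up to three times through 80 nm membranes before the guide RNA is added. Particles containing the highly potent amino lipid 16 can be used, in which the molar ratio of the four lipid components 16, DSPC, cholesterol, and PEG-lipid (50/10/38.5/1.5) can be further optimized to enhance in vivo activity.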
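Lipids can be formulated with an RGE, RGN, or ndRGDBP system, or one or more components thereof, or nucleic acids encoding them, to form lipid nanoparticles (LNPs). Suitable lipids include, but are not limited to, DLin-KC2-DMA4, C12-200, and the colipids distearoylphosphatidylcholine, cholesterol, and PEG-DMG, which can be formulated with an RGE, RGN, or ndRGDBP system of the present disclosure, or a component thereof, using a spontaneous vesicle formation procedure. The component molar ratio can be about 50/10/38.5/1.5 (DLin-KC2-DMA or C12-200/distearoylphosphatidylcholine/cholesterol/PEG-DMG).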
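As a worked illustration of the 50/10/38.5/1.5 molar ratio, the sketch below converts the ratio into mole fractions and into per-component amounts for a chosen total lipid quantity. The function names and the 2.0 µmol total are hypothetical example values, not parameters from the disclosure.

```python
def mole_fractions(ratio: dict) -> dict:
    """Normalize a molar ratio (parts per component) to mole fractions."""
    total = sum(ratio.values())
    return {name: parts / total for name, parts in ratio.items()}


def component_amounts(ratio: dict, total_lipid_umol: float) -> dict:
    """Split a total lipid amount (in micromoles) according to the molar ratio."""
    return {name: fraction * total_lipid_umol
            for name, fraction in mole_fractions(ratio).items()}


# Molar ratio stated in the text for the four LNP components.
LNP_RATIO = {
    "DLin-KC2-DMA or C12-200": 50.0,
    "distearoylphosphatidylcholine": 10.0,
    "cholesterol": 38.5,
    "PEG-DMG": 1.5,
}

# Hypothetical batch of 2.0 micromoles total lipid.
for name, umol in component_amounts(LNP_RATIO, 2.0).items():
    print(f"{name}: {umol:.3f} umol")
```

Because the four parts sum to 100, each part is numerically the mole percentage of that component.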
本披露的RGE、RGN或ndRGDBP系统或其组分可以封装在PLGA微球中递送,例如在美国公开申请20130252281和20130245107和20130244279中进一步描述的那些。
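Supercharged proteins can be used to deliver a subject synthetic polynucleotide or one or more components of an RGE, RGN, or ndRGDBP system to a target cell. Supercharged proteins are a class of engineered or naturally occurring proteins with an unusually high positive or negative net theoretical charge. Both supernegatively and superpositively charged proteins exhibit the ability to withstand thermally or chemically induced aggregation. Superpositively charged proteins are also able to penetrate mammalian cells. Associating cargo, such as plasmid DNA, RNA, or other proteins, with these proteins can facilitate the functional delivery of these macromolecules into mammalian cells both in vitro and in vivo.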
细胞穿透肽(CPP)可用于将本主题的合成的多核苷酸或RGE、RGN或ndRGDBP系统的一种或多种组分递送至靶细胞。CPP通常具有的氨基酸组成要么包含高相对丰度的带正电荷氨基酸(例如赖氨酸或精氨酸),要么具有包含极性/带电荷氨基酸和非极性疏水氨基酸的交替模式的序列。
本披露提供了经修饰的细胞(例如,经修饰的植物细胞或经修饰的大豆细胞),其包含编码RGE、RGN或ndRGDBP多肽或融合多肽或RGE、RGN或ndRGDBP系统的一种或多种组分的本主题的合成的多核苷酸。本披露提供了用包含编码本披露的RGE、RGN或ndRGDBP多肽或融合多肽的本主题的合成的多核苷酸的mRNA进行基因修饰的经基因修饰的细胞。本披露提供了用包含编码本披露的RGE、RGN或ndRGDBP多肽或融合多肽的本主题的合成的多核苷酸的重组表达载体进行基因修饰的经基因修饰的细胞。本披露提供了用重组表达载体进行基因修饰的经基因修饰的细胞(例如大豆细胞),该重组表达载体包含:a)编码本披露的RGE、RGN或ndRGDBP多肽或融合多肽的本主题的合成的多核苷酸;和b)编码本披露的RGE、RGN或ndRGDBP指导RNA的核苷酸序列。本披露提供了用重组表达载体进行基因修饰的经基因修饰的细胞,该重组表达载体包含:a)编码RGE、RGN或ndRGDBP融合多肽的本主题的合成的多核苷酸;b)编码RGE、RGN或ndRGDBP指导RNA的核苷酸序列;和c)编码供体模板DNA分子的核苷酸序列。
用作本披露的编码RGE、RGN或ndRGDBPRGE、RGN或ndRGDBP多肽或融合多肽的本主题的合成的多核苷酸和/或RGE、RGN或ndRGDBP指导RNA的受体的细胞可以是多种细胞中的任何,包括例如体外细胞;体内细胞;离体细胞;原代细胞;癌细胞;动物细胞;植物细胞;藻类细胞;真菌细胞;等等。在某些实施例中,细胞是大豆细胞,包括分生组织或胚胎大豆细胞。充当编码RGE、RGN或ndRGDBP多肽或融合多肽的本主题的合成的多核苷酸和/或RGE、RGN或ndRGDBP指导RNA的受体的细胞称为“宿主细胞”或“靶细胞”。
因为使用RGE、RGN或ndRGDBP多肽或融合多肽的方法包括将RGE、RGN或ndRGDBP多肽或融合多肽与靶核酸中的特定区域结合(通过由相关联的RGE、RGN或ndRGDBP指导RNA靶向那里),这些方法在本文中通常称为结合方法(例如,结合靶核酸的方法)。然而,应理解,在一些情况下,虽然结合方法可能仅导致靶核酸的结合,但在其他情况下,该方法可能具有不同的最终结果(例如,该方法可以导致靶核酸的修饰,例如切割/甲基化/等,调节从靶核酸的转录;调节靶核酸的翻译;基因组编辑;调节与靶核酸相关联的蛋白质;分离靶核酸;等等)。
有关获得或设计适用于某些RGE、RGN和ndRGDBP的指导RNA的合适方法的实例参见例如Pausch等人,Science[科学]369,333-337(2020)以及Jinek等人,Science[科学].2012年8月17日;337(6096):816-21;Chylinski等人,RNA Biol[RNA生物学].2013年5月;10(5):726-37;Ma等人,Biomed Res Int[国际生物医学研究].2013;2013:270805;Hou等人,ProcNatl Acad Sci U S A[美国国家科学院院刊].2013年9月24日;110(39):15644-9;Jinek等人,Elife.2013;2:e00471;Pattanayak等人,Nat Biotechnol[自然生物技术].2013年9月;31(9):839-43;Qi等人,Cell[细胞].2013年2月28日;152(5):1173-83;Wang等人,Cell[细胞].2013年5月9日;153(4):910-8;Auer等人,Genome Res[基因组研究].2013年10月31日;Chen等人,Nucleic Acids Res[核酸研究].2013年11月l日;41(20):el9;Cheng等人,CellRes[细胞研究].2013年10月;23(10):1163-71;Cho等人,Genetics[遗传学].2013年11月;195(3):1177-80;DiCarlo等人,Nucleic Acids Res[核酸研究].2013年4月;41(7):4336-43;Dickinson等人,Nat Methods[自然方法].2013年10月;10(10):1028-34;Ebina等人,Sci Rep[科学报道].2013;3:2510;Fujii等人,Nucleic Acids Res[核酸研究].2013年11月l日;41(20):el87;Hu等人,Cell Res[细胞研究].2013年11月;23(l l):1322-5;Jiang等人,Nucleic Acids Res[核酸研究].2013年11月l日;41(20):el88;Larson等人,NatProtoc[自然实验手册].2013年11月;8(l l):2180-96;Mali等人,Nat Methods[自然方法].2013年10月;10(10):957-63;Nakayama等人,Genesis[发生].2013年12月;51(12):835-43;Ran等人,Nat Protoc[自然实验手册].2013年11月;8(l l):2281-308;Ran等人,Cell[细胞].2013年9月12日;154(6):1380-9;Upadhyay等人,G3(Bethesda).2013年12月9日;3(12):2233-8;Walsh等人,Proc Natl Acad Sci U S A[美国国家科学院院刊].2013年9月24日;110(39):15514-5;Xie等人,Mol 
Plant[分子植物学].2013年10月9日;Yang等人,Cell[细胞].2013年9月12日;154(6):1370-9;和美国专利和专利申请:8,906,616;8,895,308;8,889,418;8,889,356;8,871,445;8,865,406;8,795,965;8,771,945;8,697,359;20140068797;20140170753;20140179006;20140179770;20140186843;20140186919;20140186958;20140189896;20140227787;20140234972;20140242664;20140242699;20140242700;20140242702;20140248702;20140256046;20140273037;20140273226;20140273230;20140273231;20140273232;20140273233;20140273234;20140273235;20140287938;20140295556;20140295557;20140298547;20140304853;20140309487;20140310828;20140310830;20140315985;20140335063;20140335620;20140342456;20140342457;20140342458;20140349400;20140349405;20140356867;20140356956;20140356958;20140356959;20140357523;20140357530;20140364333;和20140377868;其中的每一个都通过引用以其整体并入本文。
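In applications in which it is desirable to insert a polynucleotide sequence into the genome where a target sequence is cleaved, a donor template DNA molecule can also be provided to the cell. The donor template DNA molecule can be inserted at the target editing site cleaved by the RGE or RGN protein (e.g., after dsDNA cleavage, after nicking the target DNA, after dual nicking of the target DNA, etc.). The donor template DNA molecule can contain sufficient homology to the genomic sequence at the target site, e.g., 70%, 80%, 85%, 90%, 95%, or 100% homology with the nucleotide sequences flanking the target site (e.g., within about 50 bases or less of the target site, e.g., within about 30 bases, within about 15 bases, within about 10 bases, within about 5 bases, or immediately flanking the target site), to support homology-directed repair between it and the genomic sequence to which it bears homology. Approximately 25, 50, 100, or 200 nucleotides, or more than 200 nucleotides (or any integer value between 10 and 200 nucleotides, or more), of sequence homology between a donor and a genomic sequence can support homology-directed repair. The donor template DNA molecule can be of any length, e.g., 10 nucleotides or more, 50 nucleotides or more, 100 nucleotides or more, 250 nucleotides or more, 500 nucleotides or more, 1000 nucleotides or more, 5000 nucleotides or more, etc.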
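One simple way to picture a donor template with flanking homology is sketched below: the arms are copied from the genomic sequence on either side of the cleavage position and the sequence to be inserted is placed between them. The function name `design_donor`, the 0-based `cut_index` convention, and the default 50-nucleotide arm length are illustrative assumptions; the arms here are 100% identical to the genome, which is only one of the homology levels the text allows.

```python
def design_donor(genome: str, cut_index: int, insert: str,
                 arm_len: int = 50) -> str:
    """Build left arm + insert + right arm around a cleavage position.

    `cut_index` is the 0-based position of the first nucleotide to the
    right of the cut on the given strand (an assumption of this sketch).
    """
    if arm_len < 10:
        raise ValueError("this sketch uses at least 10 nt of homology per arm")
    if cut_index - arm_len < 0 or cut_index + arm_len > len(genome):
        raise ValueError("not enough flanking genomic sequence for the arms")
    left_arm = genome[cut_index - arm_len:cut_index]
    right_arm = genome[cut_index:cut_index + arm_len]
    return left_arm + insert + right_arm


# Hypothetical 60 nt locus cut in the middle, with a 3 nt insertion.
locus = "A" * 30 + "C" * 30
print(design_donor(locus, 30, "GGG", arm_len=25))
```

In practice the arm length, the tolerated sequence differences, and the orientation of the insert would be chosen for the specific locus and nuclease.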
供体模板DNA分子通常与其取代的基因组序列不相同。相反,供体模板DNA分子可以包含至少一个或多个相对于基因组序列的单碱基变化、插入、缺失、倒位或重排,只要存在足够的同源性以支持同源性定向修复(例如,对于基因校正,例如,将致病碱基对转换为非致病碱基对)。在一些实施例中,供体模板DNA分子包含侧接两个同源区的非同源序列,使得靶DNA区和两个侧翼序列之间的同源定向修复导致非同源序列插入靶区。供体模板DNA分子还可以包含载体骨架,该载体骨架含有与目的DNA区域不同源并且不打算插入目的DNA区域的序列。通常,供体模板DNA分子的一个或多个同源区将与需要重组的基因组序列具有至少50%序列同一性。在某些实施例中,存在60%、70%、80%、90%、95%、98%、99%或99.9%序列同一性。根据供体模板DNA分子的长度,可以存在1%和100%序列同一性之间的任何值。
与基因组序列相比,供体模板DNA分子可包含某些序列差异,例如限制性位点、核苷酸多态性、选择标记(例如,药物抗性基因、荧光蛋白、酶等)等,这些差异可用于评估供体序列在切割位点的成功插入,或者在一些情况下可用于其他目的(例如,显示在靶向的基因组基因座处的表达)。在一些情况下,如果位于编码区,这种核苷酸序列差异不会改变氨基酸序列,或会产生沉默的氨基酸变化(例如,不影响蛋白质结构或功能的变化)。可替代地,这些序列差异可包括侧翼重组序列,例如FLP、loxP序列等,其可在稍后时间被激活以去除标记序列。
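In some cases, the donor template DNA molecule is provided to the cell as single-stranded DNA. In some cases, it is provided as double-stranded DNA. It can be introduced into a cell in linear or circular form. If introduced in linear form, the ends of the donor sequence can be protected (e.g., from exonucleolytic degradation) by any convenient method. For example, one or more dideoxynucleotide residues can be added to the 3' terminus of a linear molecule and/or self-complementary oligonucleotides can be ligated to one or both ends. See, e.g., Chang et al. (1987) Proc. Natl. Acad. Sci. USA 84:4959-4963; Nehls et al. (1996) Science 272:886-889. Additional methods for protecting exogenous polynucleotides from degradation include, but are not limited to, addition of terminal amino group(s) and the use of modified internucleotide linkages such as, for example, phosphorothioates, phosphoramidates, and O-methyl ribose or deoxyribose residues. As an alternative to protecting the termini of a linear donor sequence, additional lengths of sequence can be included outside of the regions of homology; these can be degraded without impacting recombination. A donor template DNA molecule can be introduced into a cell as part of a vector molecule having additional sequences such as, for example, replication origins, promoters, and genes encoding antibiotic resistance. Moreover, donor sequences can be introduced as naked nucleic acid, as nucleic acid complexed with an agent such as a liposome or poloxamer, or can be delivered by viruses (e.g., adenovirus, AAV, geminivirus), as described elsewhere herein for nucleic acids encoding an RGE, RGN, or ndRGDBP guide RNA; an RGE, RGN, or ndRGDBP polypeptide; an RGE, RGN, or ndRGDBP fusion polypeptide; and/or a donor template DNA molecule.
As described above, in some cases a nucleic acid (e.g., a recombinant expression vector) comprising a subject synthetic polynucleotide encoding an RGE, RGN, or ndRGDBP polypeptide, or an RGE, RGN, or ndRGDBP fusion polypeptide, is used as a transgene to generate a transgenic plant that produces the RGE, RGN, or ndRGDBP polypeptide, or the RGE, RGN, or ndRGDBP fusion polypeptide. Provided are transgenic plants, plant parts (e.g., seeds), tissues, or transgenic plant cells, and in particular transgenic soybean plants, soybean plant parts (e.g., soybean seeds), soybean tissues, or transgenic soybean plant cells, comprising a subject synthetic polynucleotide encoding an RGE, RGN, or ndRGDBP polypeptide, or an RGE, RGN, or ndRGDBP fusion polypeptide. In some embodiments, the genome of the transgenic plant comprises the subject synthetic polynucleotide. In some embodiments, the transgenic plant is homozygous for the genetic modification. In some embodiments, the transgenic plant is heterozygous for the genetic modification. Methods of using Cas9- or Cas12-based RGEs, RGNs, or ndRGDBPs in plants that are set forth in Schindele et al. (FEBS Letters 592 (2018) 1954-1967) can be adapted for use with the subject synthetic polynucleotides provided herein.
Methods of introducing exogenous nucleic acids into plant cells are well established. Such plant cells are considered “transformed”, as defined above. Suitable methods include viral infection (e.g., with DNA viruses such as geminiviruses), transfection, conjugation, protoplast fusion, electroporation, particle gun technology, calcium phosphate precipitation, direct microinjection, silicon carbide whiskers technology, Agrobacterium-mediated transformation, and the like. The choice of method generally depends on the type of cell being transformed and the circumstances under which the transformation takes place (e.g., in vitro, ex vivo, or in vivo).
Transformation methods based upon the soil bacterium Agrobacterium tumefaciens are particularly useful for introducing an exogenous nucleic acid molecule into a vascular plant. The wild-type form of Agrobacterium contains a Ti (tumor-inducing) plasmid that directs production of tumorigenic crown gall growth on host plants. Transfer of the tumor-inducing T-DNA region of the Ti plasmid to a plant genome requires the Ti plasmid-encoded virulence genes as well as T-DNA borders, which are a set of direct DNA repeats that delineate the region to be transferred. An Agrobacterium-based vector is a modified form of a Ti plasmid in which the tumor-inducing functions are replaced by the nucleic acid sequence of interest to be introduced into the plant host.
Agrobacterium-mediated transformation generally employs cointegrate vectors or binary vector systems, in which the components of the Ti plasmid are divided between a helper vector, which resides permanently in the Agrobacterium host and carries the virulence genes, and a shuttle vector, which contains the gene of interest bounded by T-DNA sequences. A variety of binary vectors are well known in the art and are commercially available, for example, from Clontech (Palo Alto, Calif.). Methods of coculturing Agrobacterium with cultured plant cells or wounded tissue such as leaf tissue, root explants, hypocotyls, stem pieces, or tubers are also well known in the art. See, e.g., Glick and Thompson (eds.), Methods in Plant Molecular Biology and Biotechnology, Boca Raton, Fla.: CRC Press (1993).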
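Microprojectile-mediated transformation can also be used to produce a subject transgenic plant. This method, first described by Klein et al. (Nature 327:70-73 (1987)), relies on microprojectiles such as gold or tungsten that are coated with the desired nucleic acid molecule by precipitation with calcium chloride, spermidine, or polyethylene glycol. The microprojectile particles are accelerated at high speed into an angiosperm tissue using a device such as the BIOLISTIC PD-1000 (Biorad; Hercules Calif.).
A nucleic acid (e.g., a recombinant expression vector) comprising a subject synthetic polynucleotide encoding an RGE, RGN, or ndRGDBP polypeptide, or an RGE, RGN, or ndRGDBP fusion polypeptide, can be introduced into a plant in a manner such that the nucleic acid is able to enter one or more plant cells, e.g., via an in vivo or ex vivo protocol. By “in vivo”, it is meant that the nucleic acid is administered to a living body of a plant, e.g., by infiltration. By “ex vivo”, it is meant that cells or explants are modified outside of the plant, and then such cells or organs are regenerated into a plant.
A number of vectors suitable for stable transformation of plant cells or for the establishment of transgenic plants have been described, including those described in Weissbach and Weissbach (1989) Methods for Plant Molecular Biology, Academic Press, and Gelvin et al. (1990) Plant Molecular Biology Manual, Kluwer Academic Publishers. Specific examples include those derived from a Ti plasmid of Agrobacterium tumefaciens, as well as those disclosed by Herrera-Estrella et al. (1983) Nature 303:209, Bevan (1984) Nucl Acid Res. 12:8711-8721, and Klee (1985) Bio/Technology 3:637-642. Alternatively, non-Ti vectors can be used to transfer the DNA into plants and cells by using free DNA delivery techniques. By using these methods, transgenic plants such as wheat, rice (Christou (1991) Bio/Technology 9:957-9 and 4462), and corn (Gordon-Kamm (1990) Plant Cell 2:603-618) can be produced. An immature embryo can also be a good target tissue for monocots for direct DNA delivery techniques by using the particle gun (Weeks et al. (1993) Plant Physiol 102:1077-1084; Vasil (1993) Bio/Technology 10:667-674; Wan and Lemeaux (1994) Plant Physiol 104:37-48) and for Agrobacterium-mediated DNA transfer (Ishida et al. (1996) Nature Biotech 14:745-750). Methods of transforming soybean are also set forth in U.S. Patent Application Publication Nos. 20150099648, 20140283225, 20140173774, 20090077694, 20090049567, and 20080229447, which are incorporated herein by reference in their entireties.
Methods for introducing DNA into chloroplasts include biolistic bombardment, polyethylene glycol transformation of protoplasts, and microinjection (Daniell et al. Nat. Biotechnol 16:345-348, 1998; Staub et al. Nat. Biotechnol 18:333-338, 2000; O'Neill et al. Plant J. 3:729-738, 1993; Knoblauch et al. Nat. Biotechnol 17:906-909; U.S. Pat. Nos. 5,451,513, 5,545,817, 5,545,818, and 5,576,198; Intl. Application No. WO 95/16783; and Boynton et al., Methods in Enzymology 217:510-536 (1993), Svab et al., Proc. Natl. Acad. Sci. USA 90:913-917 (1993), and McBride et al., Proc. Natl. Acad. Sci. USA 91:7301-7305 (1994)). Any vector suitable for biolistic bombardment, polyethylene glycol transformation of protoplasts, or microinjection is suitable as a targeting vector for chloroplast transformation. Any double-stranded DNA vector can be used as a transformation vector, especially when the method of introduction does not utilize Agrobacterium.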
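Plants that can be genetically modified include grains, forage crops, fruits, vegetables, oil seed crops, palms, forestry plants, and vines. Specific examples of plants that can be modified with the subject synthetic polynucleotides are as follows: maize, banana, peanut, peas, sunflower, tomato, canola, tobacco, wheat, barley, oats, potato, legumes including soybeans, beans, peanuts, peas, and lentils; cotton, carnation, sorghum, lupin, and rice.
The present disclosure provides transformed plant cells, tissues, plants, and products that contain the transformed plant cells (e.g., soybean plant cells). A feature of certain subject transformed cells, tissues, and products is the presence of a subject synthetic polynucleotide integrated into the genome, and production by the plant cells of an RGE, RGN, or ndRGDBP polypeptide, or an RGE, RGN, or ndRGDBP fusion polypeptide.
Recombinant plant cells of the present disclosure (e.g., legume cells, including soybean cells) are useful as populations of recombinant cells, or as a tissue, seed, whole plant, stem, fruit, leaf, root, flower, tuber, grain, animal feed, a field of plants, and the like.
A subject synthetic polynucleotide encoding an RGE, RGN, or ndRGDBP polypeptide or fusion polypeptide can be under the control of (i.e., operably linked to) an unknown promoter (e.g., when the nucleic acid randomly integrates into a host cell genome) or can be under the control of (i.e., operably linked to) a known promoter. Suitable known promoters can be any known promoter and include constitutively active promoters, inducible promoters, spatially restricted and/or temporally restricted promoters, etc.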
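Embodiments
Various embodiments of the plant cells and methods provided herein are included in the following non-limiting list of embodiments.
Embodiment Set 1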
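1. A method of modifying an endogenous soybean gene in a soybean genome, the method comprising:
(a) introducing, into a soybean plant cell comprising a synthetic polynucleotide encoding an RNA-guided endonuclease (RGE) or an RNA-guided nickase (RGN), a guide RNA or a polynucleotide encoding a guide RNA directed to a target editing site in the endogenous soybean gene and, optionally, a donor template DNA molecule having homology to the target editing site, wherein said synthetic polynucleotide:
(i) has a GC (guanine and cytosine) content of greater than 47% or 48%;
(ii) has a melting temperature (Tm) of greater than 89 or 90 degrees Celsius;
(iii) has a soybean codon adaptation index (sCAI) that is lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the RGE or the RGN;
(iv) or any combination of (i), (ii), and (iii); and
(b) selecting a modified soybean plant cell, soybean plant, soybean plant part, soybean tissue, or soybean callus comprising a modification of the endogenous soybean gene.
2. A method of modifying an endogenous soybean gene in a soybean genome, the method comprising:
(a) introducing into a soybean plant cell:
(i) a synthetic polynucleotide encoding an RNA-guided endonuclease (RGE) or an RNA-guided nickase (RGN), wherein said synthetic polynucleotide has a GC (guanine and cytosine) content of greater than 47% or 48%, has a melting temperature (Tm) of greater than 89 or 90 degrees Celsius, has a soybean codon adaptation index (sCAI) that is lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the RGE or the RGN, or any combination of said GC content, said Tm, and said lower sCAI;
(ii) a guide RNA or a polynucleotide encoding a guide RNA, wherein the guide RNA or the polynucleotide encoding the guide RNA is directed to a target editing site in the endogenous soybean gene; and optionally
(iii) a donor template DNA molecule having homology to the target editing site; and
(b) selecting a modified soybean plant cell, soybean plant, soybean plant part, soybean tissue, or soybean callus comprising a modification of the endogenous soybean gene.
3. The method of embodiment 1 or 2, wherein the RGE comprises a type II Cas endonuclease, a Cas9 endonuclease, a type V Cas endonuclease, a Cas12a endonuclease, a Cas12c endonuclease, a CasX endonuclease, or an engineered endonuclease.
4. The method of embodiment 1 or 2, wherein the RGN comprises a type II Cas nickase, a Cas9 nickase, a type V Cas nickase, a Cas12a nickase, a Cas12c nickase, a CasX nickase, or an engineered nickase.
5. The method of embodiment 1 or 2, wherein the RGN comprises a mutation in an HNH or RuvC-like nuclease domain, or optionally wherein said mutation is: (i) a D10A mutation in the Cas9 protein of SEQ ID NO:1; (ii) an R1226A amino acid mutation in the FnCpf1 protein of SEQ ID NO:25; or (iii) an R1138A mutation in the LbCpf1 protein of SEQ ID NO:73.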
6.如实施例1或2所述的方法,其中该合成的多核苷酸在以下中任一个的全长上具有至少76%、80%、85%、90%、95%、97%、98%或99%序列同一性:
(i)选自由SEQ ID NO:3-12组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:2的大豆密码子优化的参考多核苷酸的sCAI;
(ii)选自由SEQ ID NO:15-24组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:14的大豆密码子优化的参考多核苷酸的sCAI;
(iii)选自由SEQ ID NO:27-36组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:26的大豆密码子优化的参考多核苷酸的sCAI;
(iv)选自由SEQ ID NO:39-48组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:38的大豆密码子优化的参考多核苷酸的sCAI;
(v)选自由SEQ ID NO:51-60组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:50的大豆密码子优化的参考多核苷酸的sCAI;
(vi)选自由SEQ ID NO:63-72组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:62的大豆密码子优化的参考多核苷酸的sCAI;
(vii)选自由SEQ ID NO:75-84组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:74的大豆密码子优化的参考多核苷酸的sCAI;
(viii)选自由SEQ ID NO:87-96组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸并且任选地具有的sCAI低于SEQ ID NO:86的大豆密码子优化的参考多核苷酸的sCAI;或
(ix)选自由SEQ ID NO:99-108组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸并且任选地具有的sCAI低于SEQ ID NO:98的大豆密码子优化的参考多核苷酸的sCAI。
7.如实施例1或2所述的方法,其中该合成的多核苷酸具有大于48%的GC含量并且在以下中任一个的全长上具有至少70%,75%、80%、85%、90%、95%、97%、98%或99%序列同一性:
(i)选自由SEQ ID NO:3-12组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:2的大豆密码子优化的参考多核苷酸的sCAI;
(ii)选自由SEQ ID NO:15-24组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:14的大豆密码子优化的参考多核苷酸的sCAI;
(iii)选自由SEQ ID NO:27-36组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:26的大豆密码子优化的参考多核苷酸的sCAI;
(iv)选自由SEQ ID NO:39-48组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸并且任选地具有的sCAI低于SEQ ID NO:38的大豆密码子优化的参考多核苷酸的sCAI;
(v)选自由SEQ ID NO:51-60组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:50的大豆密码子优化的参考多核苷酸的sCAI;
(vi)选自由SEQ ID NO:63-72组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:62的大豆密码子优化的参考多核苷酸的sCAI;
(vii)选自由SEQ ID NO:75-84组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:74的大豆密码子优化的参考多核苷酸的sCAI;
(viii)选自由SEQ ID NO:87-96组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸并且任选地具有的sCAI低于SEQ ID NO:86的大豆密码子优化的参考多核苷酸的sCAI;或
(ix)选自由SEQ ID NO:99-108组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸并且任选地具有的sCAI低于SEQ ID NO:98的大豆密码子优化的参考多核苷酸的sCAI。
8.如实施例1或2所述的方法,其中该合成的多核苷酸具有:
(a)在选自由SEQ ID NO:3-11和12组成的组的至少一个、两个或三个序列上具有超过80%、85%、90%、95%、97%、98%或99%的序列同一性并且任选地具有的sCAI低于SEQ ID NO:2的大豆密码子优化的参考多核苷酸的sCAI;
(b)在选自由SEQ ID NO:3-12组成的组的至少两个或三个序列的全长上具有超过80%、85%、90%或95%序列同一性;并且具有的熔化温度(Tm)大于90摄氏度并且任选地具有的sCAI低于SEQ ID NO:2的大豆密码子优化的参考多核苷酸的sCAI;或
(c)在选自由SEQ ID NO:3-12组成的组的至少一个、两个或三个序列的全长上具有超过80%、85%、90%或95%序列同一性;并且具有的GC含量大于48%并且任选地具有的sCAI低于SEQ ID NO:2的大豆密码子优化的参考多核苷酸的sCAI。
9.如实施例1或2所述的方法,其中该合成的多核苷酸编码RGE并且:
(i)该RGE是与SEQ ID NO:1具有至少95%、96%、97%、98%或99%序列同一性的SpCas9核酸内切酶或其变体,并且编码该SpCas9核酸内切酶或其变体的大豆密码子优化的参考多核苷酸与SEQ ID NO:2具有至少95%、96%、97%、98%或99%序列同一性;
(ii)该RGE是与SEQ ID NO:13具有至少95%、96%、97%、98%或99%序列同一性的SaCas9核酸内切酶或其变体,并且编码该SaCas9核酸内切酶或其变体的大豆密码子优化的参考多核苷酸与SEQ ID NO:14具有至少95%、96%、97%、98%或99%序列同一性;
(iii)该RGE是与SEQ ID NO:25具有至少95%、96%、97%、98%或99%序列同一性的FnCpf1核酸内切酶或其变体,并且编码该FnCpf1核酸内切酶或其变体的大豆密码子优化的参考多核苷酸与SEQ ID NO:26具有至少95%序列同一性;或
(iv)该RGE是与SEQ ID NO:37具有至少95%、96%、97%、98%或99%序列同一性的CasJ核酸内切酶或其变体,并且编码该CasJ核酸内切酶或其变体的大豆密码子优化的参考多核苷酸与SEQ ID NO:38具有至少95%、96%、97%、98%或99%序列同一性。
10.如实施例1或2所述的方法,其中该合成的多核苷酸编码该RGN,具有大于48%的GC含量,并且在以下中任一个的全长上具有至少70%、75%、80%、85%、90%、95%、96%、97%、98%或99%序列同一性:
(i)选自由SEQ ID NO:3-12组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:2的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP;
(ii)选自由SEQ ID NO:15-24组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:14的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN;
(iii)选自由SEQ ID NO:27-36组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:26的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN;
(iv)选自由SEQ ID NO:39-48组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:38的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN;
(v)选自由SEQ ID NO:51-60组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:50的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN;
(vi)选自由SEQ ID NO:63-72组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:62的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN;
(vii)选自由SEQ ID NO:75-84组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:74的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN;
(viii)选自由SEQ ID NO:87-96组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:86的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN;或
(ix)选自由SEQ ID NO:99-108组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:98的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN。
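11. The method of embodiment 1 or 2, wherein the synthetic polynucleotide encodes the RGN and:
(a) has greater than or at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% sequence identity to at least one, two, three, four, five, six, seven, eight, nine, or ten sequences selected from the group consisting of SEQ ID NO:3-11 and 12, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN;
(b) has greater than or at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% sequence identity across the entire length of at least two, three, four, five, six, seven, eight, nine, or ten sequences selected from the group consisting of SEQ ID NO:3-12; has a melting temperature (Tm) of greater than 90 degrees Celsius; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN; or
(c) has greater than or at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% sequence identity across the entire length of at least one, two, three, four, five, six, seven, eight, nine, or ten sequences selected from the group consisting of SEQ ID NO:3-12; has a GC content of greater than 48%; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN.
12. The method of embodiment 1 or 2, wherein:
(i) the RGN is a SpCas9 RGN having at least 95%, 97%, 98%, or 99% sequence identity to SEQ ID NO:1, and the soybean codon-optimized reference polynucleotide encoding the SpCas9 RGN has at least 95%, 97%, 98%, or 99% sequence identity to SEQ ID NO:2;
(ii) the RGN is a SaCas9 RGN having at least 95%, 97%, 98%, or 99% sequence identity to SEQ ID NO:13, and the soybean codon-optimized reference polynucleotide encoding the SaCas9 RGN or a variant thereof has at least 95%, 97%, 98%, or 99% sequence identity to SEQ ID NO:14;
(iii) the RGN is a FnCpf1 RGN having at least 95%, 97%, 98%, or 99% sequence identity to SEQ ID NO:25, and the soybean codon-optimized reference polynucleotide encoding the FnCpf1 RGN or a variant thereof has at least 95%, 97%, 98%, or 99% sequence identity to SEQ ID NO:26; or
(iv) the RGN is a CasJ RGN having at least 95%, 97%, 98%, or 99% sequence identity to SEQ ID NO:37, and the soybean codon-optimized reference polynucleotide encoding the CasJ RGN or a variant thereof has at least 95%, 97%, 98%, or 99% sequence identity to SEQ ID NO:38.
13. The method of any one of embodiments 1-12, wherein the synthetic polynucleotide:
(i) encodes the RGE and provides at least a 5-fold increase in the frequency of modifying the endogenous gene in the nuclear, plastid, or mitochondrial genome of the soybean plant cell in comparison to the frequency of modifying the endogenous gene in a control soybean plant cell having a control polynucleotide comprising the soybean codon-optimized reference polynucleotide encoding the RGE; or
(ii) encodes the RGN and provides at least a 2-fold increase in nicking or nicking-associated modification of an endogenous target sequence in the nuclear, plastid, or mitochondrial genome of the soybean plant cell in comparison to nicking or nicking-associated modification of the endogenous target sequence in a control soybean plant cell comprising a control soybean codon-optimized reference polynucleotide encoding the RGN.
14. The method of any one of embodiments 1-12 or 13, wherein the soybean codon-optimized reference polynucleotide has a GC content that is at least about 8%, 9%, or 10% lower than the GC content of the synthetic polynucleotide, or optionally wherein the soybean codon-optimized reference polynucleotide has a GC content that is at least about 8% to about 12% lower than the GC content of the synthetic polynucleotide.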
15.一种修饰大豆基因组中内源大豆基因的表达的方法,该方法包括:
(a)将针对该内源大豆基因中的靶DNA结合位点的指导RNA或编码指导RNA的多核苷酸引入包含编码ndRGDBP的合成的多核苷酸的大豆植物细胞中,其中所述合成的多核苷酸:
(i)具有的GC(鸟嘌呤和胞嘧啶)含量大于47%或48%;
(ii)具有的熔化温度(Tm)大于89或90摄氏度;
(iii)具有的大豆密码子适应指数(sCAI)低于编码该ndRGDBP的大豆密码子优化的参考多核苷酸的sCAI;
(iv)或i、ii和iii的任何组合;并且
(b)选择其中该内源大豆基因的表达已经被修饰的大豆植物细胞、大豆植物、大豆植物部分、大豆组织或大豆愈伤组织。
16.一种修饰大豆基因组中内源大豆基因的表达的方法,该方法包括:
(a)向大豆植物细胞中引入:
(i)编码包含核酸酶缺陷型受RNA指导的DNA结合蛋白(ndRGDBP)的蛋白质的合成的多核苷酸,其中所述合成的多核苷酸具有的GC(鸟嘌呤和胞嘧啶)含量大于47%或48%,具有的熔化温度(Tm)大于89或90摄氏度,具有的大豆密码子适应指数(sCAI)低于编码该ndRGDBP的大豆密码子优化的参考多核苷酸的sCAI,或所述GC含量、Tm和sCAI的任何组合;并且
(ii)指导RNA或编码指导RNA的多核苷酸,该指导RNA或编码指导RNA的多核苷酸针对该内源大豆基因中的靶结合位点;并且
(b)选择其中该内源大豆基因的表达已经被修饰的经修饰的大豆植物细胞、大豆植物、大豆植物部分、大豆组织或大豆愈伤组织。
17. The method of embodiment 16, wherein the ndRGDBP comprises a type II Cas ndRGDBP, a Cas9 ndRGDBP, a type V Cas ndRGDBP, a Cas12a ndRGDBP, a Cas12c ndRGDBP, a CasX ndRGDBP, or an engineered ndRGDBP.
18.如实施例16所述的方法,其中该ndRGDBP包含HNH或RuvC样核酸酶结构域中的突变,或任选地其中所述突变是:(i)SEQ ID NO:1的Cas9蛋白中的D10A和/或H840A突变;(ii)SEQ ID NO:25的FnCpf1蛋白中的D917A、E1006A、E1028A、D1255A和/或N1257A突变;(iii)SEQ ID NO:37的CasJ蛋白中的D901A、E1128A和/或D1298A突变;或(iv)SEQ ID NO:73的LbCpf1蛋白中的D832A、E925A和/或D1148A突变。
19.如实施例16所述的方法,其中该合成的多核苷酸在以下中任一个的全长上具有至少76%、80%、85%、90%、95%、97%、98%或99%序列同一性:
(i)选自由SEQ ID NO:3-12组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:2的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP;
(ii)选自由SEQ ID NO:15-24组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:14的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP;
(iii)选自由SEQ ID NO:27-36组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:26的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP;
(iv)选自由SEQ ID NO:39-48组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:38的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP;
(v)选自由SEQ ID NO:51-60组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:50的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP;
(vi)选自由SEQ ID NO:63-72组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:62的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP;
(vii)选自由SEQ ID NO:75-84组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:74的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP;
(viii)选自由SEQ ID NO:87-96组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:86的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP;或
(ix)选自由SEQ ID NO:99-108组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:98的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP。
20.如实施例16所述的方法,其中该合成的多核苷酸具有大于48%的GC含量并且在以下中任一个的全长上具有至少70%,75%、80%、85%、90%、95%、97%、98%或99%序列同一性:
(i)选自由SEQ ID NO:3-12组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:2的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP;
(ii)选自由SEQ ID NO:15-24组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:14的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP;
(iii)选自由SEQ ID NO:27-36组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:26的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP;
(iv)选自由SEQ ID NO:39-48组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:38的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP;
(v)选自由SEQ ID NO:51-60组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:50的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP;
(vi)选自由SEQ ID NO:63-72组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:62的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP;
(vii)选自由SEQ ID NO:75-84组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:74的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP;
(viii)选自由SEQ ID NO:87-96组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:86的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP;或
(ix)选自由SEQ ID NO:99-108组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:98的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该ndRGDBP。
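21. The method of embodiment 16, wherein the synthetic polynucleotide:
(a) has greater than or at least 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity to at least one, two, three, four, five, six, seven, eight, nine, or ten sequences selected from the group consisting of SEQ ID NO:3-11 and 12, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(b) has greater than or at least 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity across the entire length of at least two, three, four, five, six, seven, eight, nine, or ten sequences selected from the group consisting of SEQ ID NO:3-12; has a melting temperature (Tm) of greater than 90 degrees Celsius; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP; or
(c) has greater than or at least 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity across the entire length of at least one, two, three, four, five, six, seven, eight, nine, or ten sequences selected from the group consisting of SEQ ID NO:3-12; has a GC content of greater than 48%; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP.
22. The method of embodiment 16, wherein:
(i) the ndRGDBP is a SpCas9 ndRGDBP having at least 95%, 97%, 98%, or 99% sequence identity to SEQ ID NO:1, and the soybean codon-optimized reference polynucleotide encoding the SpCas9 ndRGDBP or a variant thereof has at least 95%, 97%, 98%, or 99% sequence identity to SEQ ID NO:2;
(ii) the ndRGDBP is a SaCas9 ndRGDBP having at least 95%, 97%, 98%, or 99% sequence identity to SEQ ID NO:13, and the soybean codon-optimized reference polynucleotide encoding the SaCas9 ndRGDBP or a variant thereof has at least 95%, 97%, 98%, or 99% sequence identity to SEQ ID NO:14;
(iii) the ndRGDBP is a FnCpf1 ndRGDBP having at least 95%, 97%, 98%, or 99% sequence identity to SEQ ID NO:25, and the soybean codon-optimized reference polynucleotide encoding the FnCpf1 ndRGDBP or a variant thereof has at least 95%, 97%, 98%, or 99% sequence identity to SEQ ID NO:26; or
(iv) the ndRGDBP is a CasJ ndRGDBP having at least 95%, 97%, 98%, or 99% sequence identity to SEQ ID NO:37, and the soybean codon-optimized reference polynucleotide encoding the CasJ ndRGDBP or a variant thereof has at least 95%, 97%, 98%, or 99% sequence identity to SEQ ID NO:38.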
23.如实施例16-22中任一项所述的方法,其中该合成的多核苷酸进一步包含可操作地连接的多核苷酸,该多核苷酸编码修饰该内源大豆基因的表达的效应子结构域。
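24. The method of any one of embodiments 16-23, wherein the synthetic polynucleotide is operably linked to:
(a) a promoter that is operable in a soybean plant cell;
(b) a 5' untranslated (UT) sequence and/or a 3' untranslated (UT) sequence, optionally wherein the 5' UT and/or 3' UT (i) has a GC (guanine and cytosine) content of greater than 47% or 48%; (ii) has a melting temperature (Tm) of greater than 89 or 90 degrees Celsius; or a combination of (i) and (ii);
(c) a polyadenylation sequence;
(d) a second polynucleotide encoding a nuclear localization signal (NLS), a chloroplast transit peptide (CTP), an epitope tag (ET), a transcriptional activation domain (TAD), a transcriptional repression domain (TRD), or a combination thereof; optionally wherein one or more of the second polynucleotide sequences (i) has a GC (guanine and cytosine) content of greater than 47% or 48%; (ii) has a melting temperature (Tm) of greater than 89 or 90 degrees Celsius; (iii) has a soybean codon adaptation index (sCAI) lower than the sCAI of a second soybean codon-optimized reference polynucleotide encoding the NLS, CTP, ET, TAD, or TRD; or any combination of (i), (ii), and (iii); and/or
(e) a third polynucleotide sequence encoding a heterologous polypeptide having an enzymatic activity that modifies target DNA; optionally wherein one or more of the third polynucleotide sequences (i) has a GC (guanine and cytosine) content of greater than 47% or 48%; (ii) has a melting temperature (Tm) of greater than 89 or 90 degrees Celsius; (iii) has a soybean codon adaptation index (sCAI) lower than the sCAI of a third soybean codon-optimized reference polynucleotide encoding the heterologous polypeptide; or any combination of (i), (ii), and (iii).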
25.如实施例24所述的方法,其中由该第三多核苷酸序列编码的异源多肽表现出一种或多种选自以下的酶活性:核酸酶活性、甲基转移酶活性、去甲基化酶活性、DNA修复活性、DNA损伤活性、脱氨基活性、歧化酶活性、烷基化活性、脱嘌呤活性、氧化活性、嘧啶二聚体形成活性、整合酶活性、转座酶活性、重组酶活性、聚合酶活性、连接酶活性、解旋酶活性、光解酶活性和/或糖基化酶活性。
26.如实施例16-25中任一项所述的方法,其中与对照大豆植物细胞中该内源基因的表达相比,该合成的多核苷酸提供在该大豆植物细胞的核、质体或线粒体基因组中该内源基因的表达方面的至少2倍增加,该对照大豆植物细胞包含对照大豆密码子优化的参考多核苷酸,该对照大豆密码子优化的参考多核苷酸编码该ndRGDBP并且(i)具有的GC含量比该多核苷酸的GC含量低至少约8%、9%或10%,或任选地其中编码该ndRGDBP的对照多核苷酸具有的GC含量至少比该合成的多核苷酸的GC含量低约8%至12%。
27.如实施例16-26中任一项所述的方法,其中该合成的多核苷酸包含编码RGE、RGN或ndRGDBP的RNA分子。
28.一种大豆植物细胞,其包含编码蛋白质的合成的多核苷酸,该蛋白质包含RNA指导的核酸内切酶(RGE)、RNA指导的切口酶(RGN)或核酸酶缺陷型受RNA指导的DNA结合蛋白(ndRGDBP),其中所述多核苷酸:
(a)具有的GC(鸟嘌呤和胞嘧啶)含量大于47%或48%;
(b)具有的熔化温度(Tm)大于89或90摄氏度;
(c)具有的大豆密码子适应指数(sCAI)低于编码RGE的大豆密码子优化的参考多核苷酸的sCAI;或
(d)(a)、(b)和/或(c)的任何组合。
29.如实施例28所述的大豆植物细胞,其中该RGE包括II型Cas核酸内切酶、Cas9核酸内切酶、V型Cas核酸内切酶、Cas12a核酸内切酶、Cas12c核酸内切酶、CasX核酸内切酶或工程化的核酸内切酶。
30. The soybean plant cell of embodiment 28, wherein the ndRGDBP comprises a type II Cas ndRGDBP, a Cas9 ndRGDBP, a type V Cas ndRGDBP, a Cas12a ndRGDBP, a Cas12c ndRGDBP, a CasX ndRGDBP, or an engineered ndRGDBP.
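31. The soybean plant cell of embodiment 28, wherein the synthetic polynucleotide encodes an RGE and has at least 76%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity across the entire length of any one of:
(i) at least one, two, three, four, five, six, seven, eight, nine, or ten polynucleotides selected from the group consisting of SEQ ID NO:3-12, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2;
(ii) at least one, two, three, four, five, six, seven, eight, nine, or ten polynucleotides selected from the group consisting of SEQ ID NO:15-24, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:14;
(iii) at least one, two, three, four, five, six, seven, eight, nine, or ten polynucleotides selected from the group consisting of SEQ ID NO:27-36, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:26;
(iv) at least one, two, three, four, five, six, seven, eight, nine, or ten polynucleotides selected from the group consisting of SEQ ID NO:39-48, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:38;
(v) at least one, two, three, four, five, six, seven, eight, nine, or ten polynucleotides selected from the group consisting of SEQ ID NO:51-60, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:50;
(vi) at least one, two, three, four, five, six, seven, eight, nine, or ten polynucleotides selected from the group consisting of SEQ ID NO:63-72, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:62;
(vii) at least one, two, three, four, five, six, seven, eight, nine, or ten polynucleotides selected from the group consisting of SEQ ID NO:75-84, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:74;
(viii) at least one, two, three, four, five, six, seven, eight, nine, or ten polynucleotides selected from the group consisting of SEQ ID NO:87-96, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:86; or
(ix) at least one, two, three, four, five, six, seven, eight, nine, or ten polynucleotides selected from the group consisting of SEQ ID NO:99-108, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:98.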
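32. The soybean plant cell of embodiment 28, wherein the synthetic polynucleotide encodes an RGE, has a GC content of greater than 48%, and has at least 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity across the entire length of any one of:
(i) at least one, two, three, four, five, six, seven, eight, nine, or ten polynucleotides selected from the group consisting of SEQ ID NO:3-12, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2;
(ii) at least one, two, three, four, five, six, seven, eight, nine, or ten polynucleotides selected from the group consisting of SEQ ID NO:15-24, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:14;
(iii) at least one, two, three, four, five, six, seven, eight, nine, or ten polynucleotides selected from the group consisting of SEQ ID NO:27-36, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:26;
(iv) at least one, two, three, four, five, six, seven, eight, nine, or ten polynucleotides selected from the group consisting of SEQ ID NO:39-48, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:38;
(v) at least one, two, three, four, five, six, seven, eight, nine, or ten polynucleotides selected from the group consisting of SEQ ID NO:51-60, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:50;
(vi) at least one, two, three, four, five, six, seven, eight, nine, or ten polynucleotides selected from the group consisting of SEQ ID NO:63-72, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:62;
(vii) at least one, two, three, four, five, six, seven, eight, nine, or ten polynucleotides selected from the group consisting of SEQ ID NO:75-84, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:74;
(viii) at least one, two, three, four, five, six, seven, eight, nine, or ten polynucleotides selected from the group consisting of SEQ ID NO:87-96, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:86; or
(ix) at least one, two, three, four, five, six, seven, eight, nine, or ten polynucleotides selected from the group consisting of SEQ ID NO:99-108, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:98.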
33.如实施例28所述的大豆植物细胞,其中该合成的多核苷酸编码该RGE并且:
(a)在选自由SEQ ID NO:3-11和12组成的组的至少一个、两个或三个序列上具有超过或至少80%、85%、90%、95%、97%、98%或99%序列同一性并且任选地具有的sCAI低于SEQ ID NO:2的大豆密码子优化的参考多核苷酸的sCAI;
(b)在选自由SEQ ID NO:3-12组成的组的至少两个或三个序列的全长上具有超过或至少80%、85%、90%、95%、97%、98%或99%序列同一性;并且具有的熔化温度(Tm)大于90摄氏度并且任选地具有的sCAI低于SEQ ID NO:2的大豆密码子优化的参考多核苷酸的sCAI;或
(c)在选自由SEQ ID NO:3-12组成的组的至少一个、两个或三个序列的全长上具有超过或至少80%、85%、90%、95%、97%、98%或99%序列同一性;并且具有的GC含量大于48%并且任选地具有的sCAI低于SEQ ID NO:2的大豆密码子优化的参考多核苷酸的sCAI。
34.如实施例28所述的大豆植物细胞,其中该合成的多核苷酸编码该RGE并且:
(i)该RGE是与SEQ ID NO:1具有至少95%、97%、98%或99%序列同一性的SpCas9核酸内切酶或其变体,并且编码该SpCas9核酸内切酶或其变体的大豆密码子优化的参考多核苷酸与SEQ ID NO:2具有至少95%、97%、98%或99%序列同一性;
(ii)该RGE是与SEQ ID NO:13具有至少95%、97%、98%或99%序列同一性的SaCas9核酸内切酶或其变体,并且编码该SaCas9核酸内切酶或其变体的大豆密码子优化的参考多核苷酸与SEQ ID NO:14具有至少95%、97%、98%或99%序列同一性;
(iii)该RGE是与SEQ ID NO:25具有至少95%、97%、98%或99%序列同一性的FnCpf1核酸内切酶或其变体,并且编码该FnCpf1核酸内切酶或其变体的大豆密码子优化的参考多核苷酸与SEQ ID NO:26具有至少95%、97%、98%或99%序列同一性;或
(iv)该RGE是与SEQ ID NO:37具有至少95%、97%、98%或99%序列同一性的CasJ核酸内切酶或其变体,并且编码该CasJ核酸内切酶或其变体的大豆密码子优化的参考多核苷酸与SEQ ID NO:38具有至少95%、97%、98%或99%序列同一性。
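35. The soybean plant cell of any one of embodiments 28 to 33 or 34, wherein the synthetic polynucleotide encodes the RGE and provides at least a 5-fold increase in the efficiency of modifying an endogenous gene or locus in the nuclear, plastid, or mitochondrial genome of the soybean plant cell in comparison to the efficiency of modifying the endogenous gene or locus in a control soybean plant cell having a control soybean codon-optimized reference polynucleotide.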
36.如实施例28所述的大豆植物细胞,其中该合成的多核苷酸编码该RGN或该ndRGDBP并且在以下中任一个的全长上具有至少76%、80%、85%、90%、95%、97%、98%或99%序列同一性:
(i)选自由SEQ ID NO:3-12组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:2的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN或该ndRGDBP;
(ii)选自由SEQ ID NO:15-24组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:14的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN或该ndRGDBP;
(iii)选自由SEQ ID NO:27-36组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:26的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN或该ndRGDBP;
(iv)选自由SEQ ID NO:39-48组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:38的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN或该ndRGDBP;
(v)选自由SEQ ID NO:51-60组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:50的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN或该ndRGDBP;
(vi)选自由SEQ ID NO:63-72组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:62的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN或ndRGDBP;
(vii)选自由SEQ ID NO:75-84组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:74的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN或该ndRGDBP;
(viii)选自由SEQ ID NO:87-96组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:86的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN或该ndRGDBP;或
(ix)选自由SEQ ID NO:99-108组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:98的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN或该ndRGDBP。
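37. The soybean plant cell of embodiment 28, wherein the synthetic polynucleotide encodes the RGN or the ndRGDBP, has a GC content of greater than 48%, and has at least 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity across the entire length of any one of: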
(i)选自由SEQ ID NO:3-12组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:2的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN或该ndRGDBP;
(ii)选自由SEQ ID NO:15-24组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:14的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN或该ndRGDBP;
(iii)选自由SEQ ID NO:27-36组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:26的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN或该ndRGDBP;
(iv)选自由SEQ ID NO:39-48组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:38的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN或该ndRGDBP;
(v)选自由SEQ ID NO:51-60组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:50的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN或该ndRGDBP;
(vi)选自由SEQ ID NO:63-72组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:62的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN或该ndRGDBP;
(vii)选自由SEQ ID NO:75-84组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:74的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN或该ndRGDBP;
(viii)选自由SEQ ID NO:87-96组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:86的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN或该ndRGDBP;或
(ix)选自由SEQ ID NO:99-108组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:98的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并编码该RGN或该ndRGDBP。
38. The soybean plant cell of embodiment 28, wherein the synthetic polynucleotide encodes the RGN or the ndRGDBP and:
(a)在选自由SEQ ID NO:3-11和12组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个序列上具有超过或至少80%、85%、90%、95%、97%、98%或99%序列同一性并且任选地具有的sCAI低于SEQ ID NO:2的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并且编码该RGN或该ndRGDBP;
(b)在选自由SEQ ID NO:3-12组成的组的至少两个、三个、四个、五个、六个、七个、八个、九个或十个序列的全长上具有超过或至少80%、85%、90%、95%、97%、98%或99%序列同一性;并且熔化温度(Tm)大于90摄氏度并且任选地具有的sCAI低于SEQ ID NO:2的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并且编码该RGN或该ndRGDBP;或
(c)在选自由SEQ ID NO:3-12组成的组的至少一个、两个、三个、四个、五个、六个、七个、八个、九个或十个序列的全长上具有超过或至少80%、85%、90%、95%、97%、98%或99%序列同一性;并且具有的GC含量大于48%并且任选地具有的sCAI低于SEQ ID NO:2的大豆密码子优化的参考多核苷酸的sCAI,该大豆密码子优化的参考多核苷酸包含一个或多个核苷酸插入、缺失和/或取代并且编码该RGN或该ndRGDBP。
39.如实施例28所述的大豆植物细胞,其中:
(i)该RGN或该ndRGDBP是与SEQ ID NO:1具有至少95%、97%、98%或99%序列同一性的SpCas9 RGN或ndRGDBP,并且编码该SpCas9 RGN或ndRGDBP的大豆密码子优化的参考多核苷酸与SEQ ID NO:2具有至少95%、97%、98%或99%序列同一性;
(ii)该RGN或该ndRGDBP是与SEQ ID NO:13具有至少95%、97%、98%或99%序列同一性的SaCas9 RGN或ndRGDBP,并且编码该SaCas9 RGN或ndRGDBP的大豆密码子优化的参考多核苷酸与SEQ ID NO:14具有至少95%、97%、98%或99%序列同一性;
(iii)该RGN或该ndRGDBP是与SEQ ID NO:25具有至少95%、97%、98%或99%序列同一性的FnCpf1 RGN或ndRGDBP,并且编码该FnCpf1 ndRGDBP或其变体的大豆密码子优化的参考多核苷酸与SEQ ID NO:26具有至少95%、97%、98%或99%序列同一性;或
(iv)该RGN或该ndRGDBP是与SEQ ID NO:37具有至少95%、97%、98%或99%序列同一性的CasJ RGN或ndRGDBP,并且编码该CasJ ndRGDBP或其变体的大豆密码子优化的参考多核苷酸与SEQ ID NO:38具有至少95%、97%、98%或99%序列同一性。
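40. The soybean plant cell of any one of embodiments 28, 30, 36 to 38, or 39, wherein the synthetic polynucleotide:
(i) encodes a protein comprising the ndRGDBP and provides at least a 2-fold increase or decrease in expression of an endogenous gene in the nuclear, plastid, or mitochondrial genome of the soybean plant cell in comparison to expression of the endogenous gene in a control soybean plant cell comprising a control soybean codon-optimized reference polynucleotide that encodes the ndRGDBP; or
(ii) encodes the RGN and provides at least a 2-fold increase in nicking or nicking-associated modification of an endogenous target sequence in the nuclear, plastid, or mitochondrial genome of the soybean plant cell in comparison to nicking or nicking-associated modification of the endogenous target sequence in a control soybean plant cell comprising a control soybean codon-optimized reference polynucleotide encoding the RGN.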
41.如实施例28至39或40中任一项所述的大豆植物细胞,其中该合成的多核苷酸包含编码该RNA指导的核酸内切酶蛋白或RNA指导的DNA结合蛋白的RNA分子。
42.如实施例28至40或41中任一项所述的大豆植物细胞,其中该大豆植物细胞进一步包含指导RNA或编码指导RNA的多核苷酸。
43.如实施例28至41或42中任一项所述的大豆植物细胞,其中,该大豆植物细胞进一步包含与该靶编辑位点具有同源性的供体模板DNA分子。
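44. The soybean plant cell of any one of embodiments 28 to 42 or 43, wherein the synthetic polynucleotide is operably linked to:
(a) a promoter that is operable in a soybean plant cell;
(b) a 5' untranslated (UT) sequence and/or a 3' untranslated (UT) sequence, optionally wherein the 5' UT and/or 3' UT (i) has a GC (guanine and cytosine) content of greater than 47% or 48%; (ii) has a melting temperature (Tm) of greater than 89 or 90 degrees Celsius; or a combination of (i) and (ii);
(c) a polyadenylation sequence; and/or
(d) a second polynucleotide encoding a nuclear localization signal (NLS), a chloroplast transit peptide (CTP), an epitope tag (ET), a transcriptional activation domain (TAD), a transcriptional repression domain (TRD), or a combination thereof; optionally wherein one or more of the second polynucleotide sequences (i) has a GC (guanine and cytosine) content of greater than 47% or 48%; (ii) has a melting temperature (Tm) of greater than 89 or 90 degrees Celsius; (iii) has a soybean codon adaptation index (sCAI) lower than the sCAI of a second soybean codon-optimized reference polynucleotide encoding the NLS, CTP, ET, TAD, or TRD; or any combination of (i), (ii), and (iii); and/or
(e) a third polynucleotide sequence encoding a heterologous polypeptide having an enzymatic activity that modifies target DNA; optionally wherein one or more of the third polynucleotide sequences (i) has a GC (guanine and cytosine) content of greater than 47% or 48%; (ii) has a melting temperature (Tm) of greater than 89 or 90 degrees Celsius; (iii) has a soybean codon adaptation index (sCAI) lower than the sCAI of a third soybean codon-optimized reference polynucleotide encoding the heterologous polypeptide; or any combination of (i), (ii), and (iii).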
45.如实施例44所述的大豆植物细胞,其中由该第三多核苷酸序列编码的异源多肽表现出一种或多种选自以下的酶活性:核酸酶活性、甲基转移酶活性、去甲基化酶活性、DNA修复活性、DNA损伤活性、脱氨基活性、歧化酶活性、烷基化活性、脱嘌呤活性、氧化活性、嘧啶二聚体形成活性、整合酶活性、转座酶活性、重组酶活性、聚合酶活性、连接酶活性、解旋酶活性、光解酶活性和/或糖基化酶活性。
46.如实施例28-44或45中任一项所述的大豆植物细胞,其中该ndRGDBP包含HNH和/或RuvC样核酸酶结构域中的突变,或任选地其中所述突变是:(i)SEQ ID NO:1的Cas9蛋白中的D10A和/或H840A突变;(ii)SEQ ID NO:25的FnCpf1蛋白中的D917A、E1006A、E1028A、D1255A和/或N1257A突变;(iii)SEQ ID NO:37的CasJ蛋白中的D901A、E1128A和/或D1298A突变;或(iv)SEQ ID NO:73的LbCpf1蛋白中的D832A、E925A和/或D1148A突变。
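47. The soybean plant cell of any one of embodiments 28-45 or 46, wherein the RGN comprises a mutation in an HNH or RuvC-like nuclease domain, or optionally wherein said mutation is: (i) a D10A mutation in the Cas9 protein of SEQ ID NO:1; (ii) an R1226A amino acid mutation in the FnCpf1 protein of SEQ ID NO:25; or (iii) an R1138A mutation in the LbCpf1 protein of SEQ ID NO:73.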
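48. A soybean plant, plant part, tissue, or callus comprising the soybean plant cell of any one of embodiments 28 to 47.
49. The soybean plant part of embodiment 48, wherein:
(a) the part is a stem, pod, leaf, bud, root, or seed;
(b) the tissue is callus, meristematic tissue, or embryonic tissue; or
(c) the tissue is embryonic callus.
50. A method of obtaining the soybean plant cell of any one of embodiments 28 to 47, the method comprising:
(a) introducing into the soybean plant cell a synthetic polynucleotide encoding a protein comprising the RNA-guided endonuclease (RGE), the RNA-guided nickase (RGN), or the nuclease-deficient RNA-guided DNA binding protein (ndRGDBP), wherein said polynucleotide has a GC (guanine and cytosine) content of greater than 47% or 48%; has a melting temperature (Tm) of greater than 89 or 90 degrees Celsius; has a soybean codon adaptation index (sCAI) that is lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the RGE; or any combination of said GC content, Tm, and lower sCAI; and
(b) selecting a plant cell comprising the synthetic polynucleotide.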
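The three sequence criteria recited throughout the embodiments above (GC content, Tm, and an sCAI lower than that of a codon-optimized reference) can be sketched as follows. Everything here is illustrative and rests on assumptions that the source does not state. The Tm function uses a common textbook approximation for long duplex DNA, so its values need not correspond to the 89 or 90 degree thresholds in the embodiments. The index follows the usual codon adaptation index definition (geometric mean of each codon's frequency relative to the most-used synonymous codon). The codon table is a made-up placeholder, not soybean codon usage data.

```python
import math


def gc_percent(seq: str) -> float:
    """GC content as a percentage of all bases."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def approx_tm_celsius(seq: str, na_molar: float = 1.0) -> float:
    """Assumed approximation for long duplexes (not the source's method):
    Tm = 81.5 + 16.6*log10([Na+]) + 0.41*(%GC) - 675/N."""
    return 81.5 + 16.6 * math.log10(na_molar) + 0.41 * gc_percent(seq) - 675.0 / len(seq)


def relative_adaptiveness(usage: dict, codon_to_aa: dict) -> dict:
    """w(codon) = usage of codon / usage of the most-used synonymous codon."""
    best = {}
    for codon, freq in usage.items():
        aa = codon_to_aa[codon]
        best[aa] = max(best.get(aa, 0.0), freq)
    return {codon: freq / best[codon_to_aa[codon]] for codon, freq in usage.items()}


def cai(cds: str, weights: dict) -> float:
    """Geometric mean of w over the codons of cds that appear in weights."""
    cds = cds.upper()
    codons = [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]
    logs = [math.log(weights[c]) for c in codons if c in weights]
    if not logs:
        raise ValueError("no scorable codons")
    return math.exp(sum(logs) / len(logs))


# Placeholder usage counts for two amino acids only; NOT real soybean data.
TOY_CODON_TO_AA = {"GCT": "A", "GCC": "A", "GCA": "A", "GCG": "A", "AAA": "K", "AAG": "K"}
TOY_USAGE = {"GCT": 4, "GCC": 2, "GCA": 3, "GCG": 1, "AAA": 2, "AAG": 4}


def compare(synthetic: str, reference: str, weights: dict, gc_threshold: float = 48.0) -> dict:
    """Evaluate the GC and relative-sCAI criteria separately."""
    return {
        "gc_above_threshold": gc_percent(synthetic) > gc_threshold,
        "scai_below_reference": cai(synthetic, weights) < cai(reference, weights),
    }


weights = relative_adaptiveness(TOY_USAGE, TOY_CODON_TO_AA)
reference = "GCTAAGGCT"  # Ala-Lys-Ala using the most-used toy codons
synthetic = "GCCAAGGCG"  # same peptide, GC-richer and less-used toy codons
print(compare(synthetic, reference, weights))
```

Because the embodiments allow any combination of the criteria, the sketch reports each one separately rather than collapsing them into a single pass or fail result.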
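Embodiment Set 2
1. A method of modifying an endogenous plant gene in a plant genome, the method comprising:
(a) introducing, into a plant cell comprising a synthetic polynucleotide encoding an RNA-guided endonuclease (RGE), a guide RNA or a polynucleotide encoding a guide RNA directed to a target editing site in the endogenous plant gene and, optionally, a donor template DNA molecule having homology to the target editing site,
wherein said synthetic polynucleotide has at least 75%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity across the entire length of any one of SEQ ID NO:122-131, 134-143, or 146-185; and
(b) selecting a modified plant cell, plant, plant part, plant tissue, or plant callus comprising a modification of the endogenous plant gene.
2. The method of embodiment 1, wherein the plant is soybean and the synthetic polynucleotide has at least 77%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity across the entire length of any one of SEQ ID NO:122-131, 134-143, or 146-185, and:
(i) has a GC (guanine and cytosine) content of greater than 50%;
(ii) has a melting temperature (Tm) of greater than 90 degrees Celsius;
(iii) has a soybean codon adaptation index (sCAI) that is lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the RGE;
(iv) or any combination of (i), (ii), and (iii).
3. A method of modifying an endogenous soybean genome, the method comprising:
(a) introducing into a soybean plant cell:
(i) a synthetic polynucleotide encoding an RNA-guided endonuclease (RGE), wherein said synthetic polynucleotide encodes a Cas12j nuclease and has a GC (guanine and cytosine) content of greater than 50%, has a melting temperature (Tm) of greater than 90 degrees Celsius, has a soybean codon adaptation index (sCAI) that is lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the RGE, or any combination of said GC content, said Tm, and said lower sCAI;
(ii) a guide RNA or a polynucleotide encoding a guide RNA, wherein the guide RNA or the polynucleotide encoding the guide RNA is directed to a target editing site in an endogenous soybean gene; and optionally
(iii) a donor template DNA molecule having homology to the target editing site; and
(b) selecting a modified soybean plant cell, soybean plant, soybean plant part, soybean tissue, or soybean callus comprising a modification of the endogenous soybean gene.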
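4. The method of embodiment 1 or 3, wherein the RGE is encoded by a polynucleotide having at least 77%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity across the entire length of any one of SEQ ID NO:122-131, 134-143, or 146-185.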
5.如实施例2或3所述的方法,其中该合成的多核苷酸:
(i)在选自由SEQ ID NO:122-131组成的组的至少一个、两个或三个多核苷酸中任一个的全长上具有至少76%、80%、85%、90%、95%、98%或99%序列同一性,并且任选地具有的sCAI低于SEQ ID NO:121的大豆密码子优化的参考多核苷酸的sCAI;
(ii)在选自由SEQ ID NO:134-143组成的组的至少一个、两个或三个多核苷酸中任一个的全长上具有至少76%、80%、85%、90%、95%、98%或99%序列同一性,并且任选地具有的sCAI低于SEQ ID NO:133的大豆密码子优化的参考多核苷酸的sCAI;或
(iii)在选自由SEQ ID NO:146-154和155组成的组的至少一个、两个或三个多核苷酸中任一个的全长上具有至少77%、80%、85%、90%、95%、98%或99%序列同一性,并且任选地具有的sCAI低于SEQ ID NO:145的大豆密码子优化的参考多核苷酸的sCAI。
6.如实施例2或3所述的方法,其中该合成的多核苷酸具有大于50%的GC含量并且在以下中任一个的全长上具有至少70%、80%、85%、90%、95%、98%或99%序列同一性:
(i)选自由SEQ ID NO:122-131组成的组的至少一个、两个或三个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:121的大豆密码子优化的参考多核苷酸的sCAI;
(ii)选自由SEQ ID NO:134-143组成的组的至少一个、两个或三个多核苷酸并且任选地具有的sCAI低于SEQ ID NO:133的大豆密码子优化的参考多核苷酸的sCAI;或
(iii)选自由SEQ ID NO:146-154和155组成的组的至少一个、两个或三个多核苷酸,并且任选地具有的sCAI低于SEQ ID NO:145的大豆密码子优化的参考多核苷酸的sCAI。
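7. The method of embodiment 2 or 3, wherein the synthetic polynucleotide:
(i) has greater than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-131, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(ii) has greater than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity across the entire length of at least two or three sequences selected from the group consisting of SEQ ID NO:122-131; has a melting temperature (Tm) of greater than 90 degrees Celsius; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(iii) has greater than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity across the entire length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-131; has a GC content of greater than 50%; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(iv) has greater than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-143, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133;
(v) has greater than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity across the entire length of at least two or three sequences selected from the group consisting of SEQ ID NO:134-143; has a melting temperature (Tm) of greater than 90 degrees Celsius; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133;
(vi) has greater than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity across the entire length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-143; has a GC content of greater than 50%; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133;
(vii) has greater than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145;
(viii) has greater than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity across the entire length of at least two or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; has a melting temperature (Tm) of greater than 90 degrees Celsius; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145; or
(ix) has greater than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity across the entire length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; has a GC content of greater than 50%; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145.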
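8. The method of any one of embodiments 1-3, wherein:
(i) the RGE is a Cas12j-1 endonuclease, or a variant thereof, having at least 95% sequence identity to SEQ ID NO:120, and the soybean codon-optimized reference polynucleotide encoding the Cas12j-1 endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:121;
(ii) the RGE is a Cas12j-2 endonuclease, or a variant thereof, having at least 95% sequence identity to SEQ ID NO:132, and the soybean codon-optimized reference polynucleotide encoding the Cas12j-2 endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:133; or
(iii) the RGE is a Cas12j-3 endonuclease, or a variant thereof, having at least 95% sequence identity to SEQ ID NO:144, and the soybean codon-optimized reference polynucleotide encoding the Cas12j-3 endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:145.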
9.如实施例2或3所述的方法,其中该大豆密码子优化的参考多核苷酸具有比该合成的多核苷酸的GC含量低至少约8%、9%或10%的GC含量,或任选地其中该大豆密码子优化的参考多核苷酸具有比该合成的多核苷酸的GC含量低至少约8%至约12%的GC含量。
10.如实施例1或3所述的方法,其中该合成的多核苷酸编码RGE并且
(i)该RGE是与SEQ ID NO:120具有至少95%序列同一性的Cas12j-1核酸内切酶或其变体,并且该合成的多核苷酸在选自由SEQ ID NO:156-164和165组成的组的至少一个、两个或三个序列的全长上具有超过80%、85%、90%或95%序列同一性;
(ii)该RGE是与SEQ ID NO:132具有至少95%序列同一性的Cas12j-2核酸内切酶或其变体,并且该合成的多核苷酸在选自由SEQ ID NO:166-174和175组成的组的至少一个、两个或三个序列的全长上具有超过80%、85%、90%或95%序列同一性;或
(iii)该RGE是与SEQ ID NO:144具有至少95%序列同一性的Cas12j-3核酸内切酶或其变体,并且该合成的多核苷酸在选自由SEQ ID NO:176-184和185组成的组的至少一个、两个或三个序列的全长上具有超过80%、85%、90%或95%序列同一性。
11.一种修饰植物基因组中内源基因的表达的方法,该方法包括:
(a)将针对该内源大豆基因中的靶DNA结合位点的指导RNA或编码指导RNA的多核苷酸引入包含编码ndRGDBP的合成的多核苷酸的植物细胞中,其中所述合成的多核苷酸在SEQ ID NO:122-131、134-143或146-185中任一个的全长上具有至少75%、80%、85%、90%、95%、98%或99%序列同一性,并且
(b)选择其中该内源植物基因的表达已被修饰的植物细胞、植物、植物部分、组织或植物愈伤组织。
12.如实施例11所述的方法,其中该ndRGDBP包含至少一个对应于以下的突变:
(i)SEQ ID NO:120的残基D371、E579、D673、C640、C643、C646、C661或C664;
(ii)SEQ ID NO:132的残基D394、E606、D697、C667、C670、C673、C685或C688;或
(iii)SEQ ID NO:144的残基D413、E618、D710、C680、C683、C687、C698或C701。
13. The method of embodiment 12, wherein the ndRGDBP comprises at least one mutation selected from:
(i) D371A, E579A, D673A, C640A, C643A, C646A, C661A, C664A, C640S, C643S, C646S, C661S, or C664S of SEQ ID NO:120;
(ii) D394A, E606A, D697A, C667A, C670A, C673A, C685A, C688A, C667S, C670S, C673S, C685S, or C688S of SEQ ID NO:132; or
(iii) D413A, E618A, D710A, C680A, C683A, C687A, C698A, C701A, C680S, C683S, C687S, C698S, or C701S of SEQ ID NO:144.
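The substitution notation used above (for example D371A: aspartate at position 371 replaced by alanine, 1-based numbering) can be applied to a protein sequence programmatically. The following is a minimal sketch; the helper name and the six-residue toy protein are hypothetical, and SEQ ID NO:120, 132, and 144 themselves are not reproduced here.

```python
import re

# One-letter wild-type residue, 1-based position, one-letter replacement.
_MUTATION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def apply_substitution(protein: str, mutation: str) -> str:
    """Return the protein with a single substitution such as 'D371A' applied.

    Raises ValueError if the notation is malformed, the position is out of
    range, or the wild-type residue does not match the sequence.
    """
    m = _MUTATION.match(mutation)
    if m is None:
        raise ValueError(f"unrecognized mutation: {mutation!r}")
    wild, pos, new = m.group(1), int(m.group(2)), m.group(3)
    if not 1 <= pos <= len(protein):
        raise ValueError(f"position {pos} is outside the sequence")
    if protein[pos - 1] != wild:
        raise ValueError(
            f"expected {wild} at position {pos}, found {protein[pos - 1]}")
    return protein[:pos - 1] + new + protein[pos:]


# Hypothetical toy protein, not a Cas12j sequence.
toy = "MKDLCE"
asp_to_ala = apply_substitution(toy, "D3A")
cys_to_ser = apply_substitution(toy, "C5S")
```

Checking the wild-type residue before substituting guards against applying a mutation list to the wrong reference sequence, which matters here because the three Cas12j proteins use different residue numbering.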
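14. The method of embodiment 11, wherein the synthetic polynucleotide encodes an ndRGDBP and:
(i) the ndRGDBP is a Cas12j-1 endonuclease or variant thereof having at least 95% sequence identity to SEQ ID NO:120, and the synthetic polynucleotide has more than 80%, 85%, 90%, or 95% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:156-164 and 165;
(ii) the ndRGDBP is a Cas12j-2 endonuclease or variant thereof having at least 95% sequence identity to SEQ ID NO:132, and the synthetic polynucleotide has more than 80%, 85%, 90%, or 95% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:166-174 and 175; or
(iii) the ndRGDBP is a Cas12j-3 endonuclease or variant thereof having at least 95% sequence identity to SEQ ID NO:144, and the synthetic polynucleotide has more than 80%, 85%, 90%, or 95% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:176-184 and 185.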
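15. A method of modifying expression of an endogenous soybean gene in a soybean genome, the method comprising:
(a) introducing into a soybean plant cell:
(i) a synthetic polynucleotide encoding a protein comprising a Cas12j nuclease-deficient RNA-guided DNA binding protein (ndRGDBP), wherein said synthetic polynucleotide has a GC (guanine and cytosine) content greater than 50%, has a melting temperature (Tm) greater than 90 degrees Celsius, has a soybean codon adaptation index (sCAI) lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the ndRGDBP, or any combination of said GC content, Tm, and sCAI; and
(ii) a guide RNA, or a polynucleotide encoding a guide RNA, directed to a target binding site in the endogenous soybean gene; and
(b) selecting a modified soybean plant cell, soybean plant, soybean plant part, soybean tissue, or soybean callus in which expression of the endogenous soybean gene has been modified.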
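This passage does not say how the melting temperature is calculated. Purely as an illustration, the sketch below uses a common empirical approximation for long DNA duplexes, Tm = 81.5 + 16.6·log10[Na+] + 0.41·(%GC) − 675/N. Both the choice of formula and the default salt concentration of 1.0 M are assumptions; the result shifts by many degrees with the assumed salt concentration, so whether a sequence clears a 90 degree threshold depends on the conditions defined elsewhere in the specification.

```python
import math


def gc_percent(seq: str) -> float:
    """Return the GC content of a nucleotide sequence as a percentage."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(1 for base in seq if base in "GC") / len(seq)


def tm_long_duplex(seq: str, na_molar: float = 1.0) -> float:
    """Approximate Tm (degrees Celsius) of a long DNA duplex.

    Empirical formula (an assumption, not taken from this document):
        Tm = 81.5 + 16.6*log10([Na+]) + 0.41*(%GC) - 675/N
    where N is the sequence length and [Na+] is in mol/L.
    """
    if na_molar <= 0:
        raise ValueError("salt concentration must be positive")
    n = len(seq)
    return (81.5 + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent(seq) - 675.0 / n)


# Hypothetical 100-nt toy sequences.
balanced = "GCAT" * 25   # 50% GC
gc_rich = "GCGT" * 25    # 75% GC

tm_balanced = tm_long_duplex(balanced)
tm_gc_rich = tm_long_duplex(gc_rich)
```

Because the formula is linear in %GC, a 10-point increase in GC content raises the estimate by about 4.1 degrees at fixed length and salt.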
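16. The method of embodiment 15, wherein the ndRGDBP comprises a mutation in the RuvC-like nuclease domain.
17. The method of embodiment 15, wherein the ndRGDBP comprises a mutation:
(i) selected from the group consisting of D371A, E579A, D673A, C640A, C643A, C646A, C661A, C664A, C640S, C643S, C646S, C661S, and C664S of SEQ ID NO:120;
(ii) selected from the group consisting of D394A, E606A, D697A, C667A, C670A, C673A, C685A, C688A, C667S, C670S, C673S, C685S, and C688S of SEQ ID NO:132; or
(iii) selected from the group consisting of D413A, E618A, D710A, C680A, C683A, C687A, C698A, C701A, C680S, C683S, C687S, C698S, and C701S of SEQ ID NO:144.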
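18. The method of embodiment 15, wherein the synthetic polynucleotide:
(i) has at least 76%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of any one of at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:122-130 and 131, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(ii) has at least 76%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of any one of at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:134-142 and 143, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133; or
(iii) has at least 77%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of any one of at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:146-154 and 155, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145.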
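19. The method of embodiment 15, wherein the synthetic polynucleotide has a GC content greater than 50% and has at least 70%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of any one of:
(i) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:122-130 and 131, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(ii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:134-142 and 143, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP; or
(iii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:146-154 and 155, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP.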
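20. The method of embodiment 15, wherein the synthetic polynucleotide:
(i) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131 and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(ii) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(iii) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(iv) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(v) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(vi) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(vii) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155 and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(viii) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP; or
(ix) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP.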
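21. The method of embodiment 15, wherein:
(i) the ndRGDBP is a Cas12j-1 variant having at least 95% sequence identity to SEQ ID NO:120, and the synthetic polynucleotide has more than 80%, 85%, 90%, or 95% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-131, 156-164, and 165;
(ii) the ndRGDBP is a Cas12j-2 variant having at least 95% sequence identity to SEQ ID NO:132, and the synthetic polynucleotide has more than 80%, 85%, 90%, or 95% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-143, 166-174, and 175; or
(iii) the ndRGDBP is a Cas12j-3 variant having at least 95% sequence identity to SEQ ID NO:144, and the synthetic polynucleotide has more than 80%, 85%, 90%, or 95% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-155, 176-184, and 185.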
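22. The method of embodiment 11 or 15, wherein the synthetic polynucleotide further comprises an operably linked polynucleotide encoding an effector domain that modifies expression of the endogenous soybean gene.
23. The method of embodiment 11 or 15, wherein the synthetic polynucleotide is operably linked to:
(a) a promoter operable in a soybean plant cell;
(b) a 5' untranslated (UT) sequence and/or a 3' untranslated (UT) sequence, optionally wherein the 5' UT and/or 3' UT (i) has a GC (guanine and cytosine) content greater than 50%; (ii) has a melting temperature (Tm) greater than 90 degrees Celsius; or a combination of (i) and (ii);
(c) a polyadenylation sequence;
(d) a second polynucleotide encoding a nuclear localization signal (NLS), a chloroplast transit peptide (CTP), an epitope tag (ET), a transcriptional activation domain (TAD), a transcriptional repression domain (TRD), or a combination thereof; optionally wherein one or more of the second polynucleotide sequences (i) has a GC (guanine and cytosine) content greater than 50%; (ii) has a melting temperature (Tm) greater than 90 degrees Celsius; (iii) has a soybean codon adaptation index (sCAI) lower than the sCAI of a second soybean codon-optimized reference polynucleotide encoding the NLS, CTP, ET, TAD, or TRD; or any combination of (i), (ii), and (iii); and/or
(e) a third polynucleotide sequence encoding a heterologous polypeptide having an enzymatic activity that modifies target DNA; optionally wherein one or more of the third polynucleotide sequences (i) has a GC (guanine and cytosine) content greater than 50%; (ii) has a melting temperature (Tm) greater than 90 degrees Celsius; (iii) has a soybean codon adaptation index (sCAI) lower than the sCAI of a third soybean codon-optimized reference polynucleotide encoding the heterologous polypeptide; or any combination of (i), (ii), and (iii).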
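24. The method of embodiment 23, wherein the heterologous polypeptide encoded by the third polynucleotide sequence exhibits one or more enzymatic activities selected from: nuclease activity, methyltransferase activity, demethylase activity, DNA repair activity, DNA damage activity, deamination activity, dismutase activity, alkylation activity, depurination activity, oxidation activity, pyrimidine dimer forming activity, integrase activity, transposase activity, recombinase activity, polymerase activity, ligase activity, helicase activity, photolyase activity, and/or glycosylase activity.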
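25. The method of embodiment 11 or 15, wherein the synthetic polynucleotide provides at least a 2-fold increase in expression of the endogenous gene in the nuclear, plastid, or mitochondrial genome of the soybean plant cell in comparison to expression of the endogenous gene in a control soybean plant cell comprising a control soybean codon-optimized reference polynucleotide that encodes the ndRGDBP and has a GC content at least about 8%, 9%, or 10% lower than the GC content of the synthetic polynucleotide, or optionally wherein the control polynucleotide encoding the ndRGDBP has a GC content at least about 8% to 12% lower than the GC content of the synthetic polynucleotide.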
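26. The method of embodiment 11 or 15, wherein the synthetic polynucleotide comprises an RNA molecule encoding the ndRGDBP.
27. A plant cell comprising a synthetic polynucleotide encoding a protein comprising an RNA-guided endonuclease (RGE) or a nuclease-deficient RNA-guided DNA binding protein (ndRGDBP), wherein said polynucleotide has at least 75%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of any one of SEQ ID NO:122-131, 134-143, or 146-185, optionally wherein the plant cell is a monocot plant cell, and optionally wherein the monocot plant cell is a maize plant cell.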
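28. A soybean plant cell comprising a synthetic polynucleotide encoding a protein comprising a Cas12j RNA-guided endonuclease (RGE) or a nuclease-deficient Cas12j RNA-guided DNA binding protein (ndRGDBP), wherein said polynucleotide:
(a) has a GC (guanine and cytosine) content greater than 50%;
(b) has a melting temperature (Tm) greater than 90 degrees Celsius;
(c) has a soybean codon adaptation index (sCAI) lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the RGE; or
(d) any combination of (a), (b), and/or (c).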
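Criterion (c) compares codon adaptation indices. As a rough sketch, a CAI can be computed in the classical way: each codon receives a relative adaptiveness weight (its usage frequency divided by that of the most used synonymous codon), and the index is the geometric mean of the weights along the coding sequence. The usage values below are hypothetical placeholders, not the soybean codon usage table referred to in this document, and excluding Met, Trp, and stop codons is a common convention assumed here.

```python
import math
from collections import defaultdict

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, codons enumerated in TCAG order.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def relative_adaptiveness(usage: dict) -> dict:
    """Weight of each codon = its frequency / highest synonymous frequency."""
    best = defaultdict(float)
    for codon, freq in usage.items():
        aa = CODON_TABLE[codon]
        best[aa] = max(best[aa], freq)
    return {codon: freq / best[CODON_TABLE[codon]]
            for codon, freq in usage.items()}


def cai(seq: str, weights: dict) -> float:
    """Geometric mean of codon weights; Met, Trp, stop, and unknown codons
    are skipped (an assumed convention)."""
    seq = seq.upper()
    logs = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        w = weights.get(codon)
        if w is None or CODON_TABLE[codon] in "MW*":
            continue
        if w <= 0:
            raise ValueError(f"non-positive weight for codon {codon}")
        logs.append(math.log(w))
    if not logs:
        raise ValueError("no scorable codons")
    return math.exp(sum(logs) / len(logs))


# Hypothetical usage fractions for two amino acids only (Lys and Asp).
TOY_USAGE = {"AAA": 0.4, "AAG": 0.6, "GAT": 0.6, "GAC": 0.4}
weights = relative_adaptiveness(TOY_USAGE)

reference = "GATAAG"   # uses the preferred codon for each residue
synthetic = "GACAAA"   # same residues (Asp-Lys), non-preferred codons
cai_reference = cai(reference, weights)
cai_synthetic = cai(synthetic, weights)
```

In this toy case the synthetic sequence encodes the same two residues as the reference while scoring a lower index and a higher GC content, which is the combination the embodiment describes.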
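29. The soybean plant cell of embodiment 28, wherein the synthetic polynucleotide encodes a nuclease-deficient RNA-guided DNA binding protein (ndRGDBP) comprising at least one mutation at:
(i) residues D371, E579, D673, C640, C643, C646, C661, and C664 of SEQ ID NO:120;
(ii) residues D394, E606, D697, C667, C670, C673, C685, and C688 of SEQ ID NO:132; or
(iii) residues D413, E618, D710, C680, C683, C687, C698, and C701 of SEQ ID NO:144.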
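30. The soybean plant cell of embodiment 28, wherein the synthetic polynucleotide encodes an RGE and:
(i) has at least 76%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of any one of at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:122-131, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(ii) has at least 76%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of any one of at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:134-143, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133;
(iii) has at least 77%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of any one of at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:146-154 and 155, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145; or
(iv) has at least 75%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of any one of SEQ ID NO:146-185.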
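31. The soybean plant cell of embodiment 28, wherein the synthetic polynucleotide has a GC content greater than 50% and has at least 70%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of any one of:
(i) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:122-130 and 131, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(ii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:134-142 and 143, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133; or
(iii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:146-154 and 155, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145.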
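32. The soybean plant cell of embodiment 28, wherein the synthetic polynucleotide:
(i) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131 and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(ii) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(iii) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(iv) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143 and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133;
(v) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133;
(vi) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133;
(vii) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155 and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145;
(viii) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145; or
(ix) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145.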
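33. The soybean plant cell of embodiment 28, wherein the synthetic polynucleotide encodes the RGE and:
(i) the RGE is a Cas12j-1 endonuclease or variant thereof having at least 95% sequence identity to SEQ ID NO:120, and the soybean codon-optimized reference polynucleotide encoding the Cas12j-1 endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:121;
(ii) the RGE is a Cas12j-2 endonuclease or variant thereof having at least 95% sequence identity to SEQ ID NO:132, and the soybean codon-optimized reference polynucleotide encoding the Cas12j-2 endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:133; or
(iii) the RGE is a Cas12j-3 endonuclease or variant thereof having at least 95% sequence identity to SEQ ID NO:144, and the soybean codon-optimized reference polynucleotide encoding the Cas12j-3 endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:145.
34. The soybean plant cell of any one of embodiments 28 to 33, wherein the synthetic polynucleotide encodes the RGE and provides at least a 2-fold increase in the efficiency of modifying an endogenous gene or locus in the nuclear, plastid, or mitochondrial genome of the soybean plant cell in comparison to the efficiency of modifying the target gene in a control soybean plant cell having a control soybean codon-optimized reference polynucleotide.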
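35. The soybean plant cell of embodiment 28, wherein the synthetic polynucleotide encodes the ndRGDBP and has at least 76%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of any one of:
(i) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(ii) at least two or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(iii) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(iv) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(v) at least two or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(vi) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(vii) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(viii) at least two or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP; or
(ix) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP.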
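36. The soybean plant cell of embodiment 28, wherein the synthetic polynucleotide encodes an RGDBP, has a GC content greater than 50%, and has at least 70%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of any one of:
(i) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(ii) at least two or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(iii) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(iv) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(v) at least two or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(vi) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(vii) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(viii) at least two or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP; or
(ix) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP.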
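37. The soybean plant cell of embodiment 28, wherein the synthetic polynucleotide encodes an RGDBP and:
(i) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131 and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(ii) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(iii) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(iv) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143 and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(v) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143 and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(vi) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(vii) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155 and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(viii) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155 and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP; or
(ix) has more than 80%, 85%, 90%, 95%, 98%, or 99% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155 and has a GC content greater than 48% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP.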
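38. The soybean plant cell of embodiment 29, wherein:
(i) the ndRGDBP is a Cas12j-1 ndRGDBP having at least 95% sequence identity to SEQ ID NO:120, and the soybean codon-optimized reference polynucleotide encoding the Cas12j-1 ndRGDBP has at least 95% sequence identity to SEQ ID NO:121;
(ii) the ndRGDBP is a Cas12j-2 ndRGDBP having at least 95% sequence identity to SEQ ID NO:132, and the soybean codon-optimized reference polynucleotide encoding the Cas12j-2 ndRGDBP has at least 95% sequence identity to SEQ ID NO:133; or
(iii) the ndRGDBP is a Cas12j-3 ndRGDBP having at least 95% sequence identity to SEQ ID NO:144, and the soybean codon-optimized reference polynucleotide encoding the Cas12j-3 ndRGDBP has at least 95% sequence identity to SEQ ID NO:145.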
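39. The soybean plant cell of any one of embodiments 28, 29, 35 to 37, or 38, wherein the synthetic polynucleotide encodes a protein comprising the ndRGDBP and provides at least a 2-fold increase or decrease in expression of an endogenous gene in the nuclear, plastid, or mitochondrial genome of the soybean plant cell in comparison to expression of the endogenous gene in a control soybean plant cell comprising a control soybean codon-optimized reference polynucleotide that encodes the ndRGDBP.
40. The soybean plant cell of any one of embodiments 28 to 33, 34 to 37, or 38, wherein the synthetic polynucleotide comprises an RNA molecule encoding the RNA-guided endonuclease protein or RNA-guided DNA binding protein.
41. The soybean plant cell of any one of embodiments 28 to 33, 35 to 37, or 38, wherein the soybean plant cell further comprises a guide RNA or a polynucleotide encoding a guide RNA.
42. The soybean plant cell of any one of embodiments 28 to 33, 35 to 37, or 38, wherein the soybean plant cell further comprises a donor template DNA molecule having homology to the target editing site.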
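43. The soybean plant cell of any one of embodiments 28 to 33, 35 to 37, or 38, wherein the synthetic polynucleotide is operably linked to:
(a) a promoter operable in a soybean plant cell;
(b) a 5' untranslated (UT) sequence and/or a 3' untranslated (UT) sequence, optionally wherein the 5' UT and/or 3' UT (i) has a GC (guanine and cytosine) content greater than 50%; (ii) has a melting temperature (Tm) greater than 90 degrees Celsius; or a combination of (i) and (ii);
(c) a polyadenylation sequence; and/or
(d) a second polynucleotide encoding a nuclear localization signal (NLS), a chloroplast transit peptide (CTP), an epitope tag (ET), a transcriptional activation domain (TAD), a transcriptional repression domain (TRD), or a combination thereof; optionally wherein one or more of the second polynucleotide sequences (i) has a GC (guanine and cytosine) content greater than 50%; (ii) has a melting temperature (Tm) greater than 90 degrees Celsius; (iii) has a soybean codon adaptation index (sCAI) lower than the sCAI of a second soybean codon-optimized reference polynucleotide encoding the NLS, CTP, ET, TAD, or TRD; or any combination of (i), (ii), and (iii); and/or
(e) a third polynucleotide sequence encoding a heterologous polypeptide having an enzymatic activity that modifies target DNA; optionally wherein one or more of the third polynucleotide sequences (i) has a GC (guanine and cytosine) content greater than 50%; (ii) has a melting temperature (Tm) greater than 90 degrees Celsius; (iii) has a soybean codon adaptation index (sCAI) lower than the sCAI of a third soybean codon-optimized reference polynucleotide encoding the heterologous polypeptide; or any combination of (i), (ii), and (iii).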
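44. The soybean plant cell of embodiment 43, wherein the heterologous polypeptide encoded by the third polynucleotide sequence exhibits one or more enzymatic activities selected from: nuclease activity, methyltransferase activity, demethylase activity, DNA repair activity, DNA damage activity, deamination activity, dismutase activity, alkylation activity, depurination activity, oxidation activity, pyrimidine dimer forming activity, integrase activity, transposase activity, recombinase activity, polymerase activity, ligase activity, helicase activity, photolyase activity, and/or glycosylase activity.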
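45. The soybean plant cell of any one of embodiments 28, 29, 35 to 37, or 38, wherein the ndRGDBP comprises at least one mutation corresponding to:
(i) residue D371, E579, D673, C640, C643, C646, C661, or C664 of SEQ ID NO:120;
(ii) residue D394, E606, D697, C667, C670, C673, C685, or C688 of SEQ ID NO:132; or
(iii) residue D413, E618, D710, C680, C683, C687, C698, or C701 of SEQ ID NO:144.
46. The soybean plant cell of any one of embodiments 28, 29, 35 to 37, or 38, wherein the ndRGDBP comprises:
(i) SEQ ID NO:120 having a mutation selected from the group consisting of C640A, C643A, C646A, C661A, C664A, C640S, C643S, C646S, C661S, and C664S;
(ii) SEQ ID NO:132 having a mutation selected from the group consisting of C667A, C670A, C673A, C685A, C688A, C667S, C670S, C673S, C685S, and C688S; or
(iii) SEQ ID NO:144 having a mutation selected from the group consisting of C680A, C683A, C687A, C698A, C701A, C680S, C683S, C687S, C698S, and C701S.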
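47. A soybean plant, plant part, tissue, or callus comprising the soybean plant cell of any one of embodiments 28 to 33, 35 to 37, or 38.
48. The soybean plant part of embodiment 47, wherein:
(a) the part is a stem, pod, leaf, bud, root, or seed;
(b) the tissue is callus, meristem, or embryonic tissue; or
(c) the tissue is embryogenic callus.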
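49. A method of obtaining the soybean plant cell of any one of embodiments 28 to 33, 35 to 37, or 38, the method comprising:
(a) introducing into the soybean plant cell a synthetic polynucleotide encoding a protein comprising the Cas12j RNA-guided endonuclease (RGE) or the Cas12j nuclease-deficient RNA-guided DNA binding protein (ndRGDBP), wherein said polynucleotide has a GC (guanine and cytosine) content greater than 50%; has a melting temperature (Tm) greater than 90 degrees Celsius; has a soybean codon adaptation index (sCAI) lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the RGE; or any combination of said GC content, Tm, and lower sCAI; and
(b) selecting a plant cell comprising the synthetic polynucleotide.
50. An isolated polynucleotide comprising any one of SEQ ID NO:122-131, 134-143, or 146-185.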
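51. An isolated polynucleotide encoding a Cas12j polypeptide, the isolated polynucleotide comprising a mutation or residue corresponding to:
(a) C640 of SEQ ID NO:120, C667 of SEQ ID NO:132, or C680 of SEQ ID NO:144;
(b) C643 of SEQ ID NO:120, C670 of SEQ ID NO:132, or C683 of SEQ ID NO:144;
(c) C646 of SEQ ID NO:120, C673 of SEQ ID NO:132, or C687 of SEQ ID NO:144;
(d) C661 of SEQ ID NO:120, C685 of SEQ ID NO:132, or C698 of SEQ ID NO:144; or
(e) C664 of SEQ ID NO:120, C688 of SEQ ID NO:132, or C701 of SEQ ID NO:144.
52. An isolated polynucleotide encoding a polypeptide, the isolated polynucleotide comprising:
(a) SEQ ID NO:120 having a mutation selected from the group consisting of C640A, C643A, C646A, C661A, C664A, C640S, C643S, C646S, C661S, and C664S;
(b) SEQ ID NO:132 having a mutation selected from the group consisting of C667A, C670A, C673A, C685A, C688A, C667S, C670S, C673S, C685S, and C688S; or
(c) SEQ ID NO:144 having a mutation selected from the group consisting of C680A, C683A, C687A, C698A, C701A, C680S, C683S, C687S, C698S, and C701S.
53. A recombinant nucleic acid comprising the isolated nucleic acid of any one of embodiments 50, 51, or 52.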
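Examples
The following examples are not intended to limit the scope of the claims.
Example 1 - Construction of Cas synthetic sequences and expression levels in soybean cells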
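Two Cas nuclease expression vectors were synthesized that encode the same RGN polypeptide but use different codons. The soybean codon-optimized reference polynucleotide sequence of Cas Soy 1.1.1, which encodes the RGN, comprises codons specified according to a conventional soybean codon usage table (Figure 1; from the World Wide Web site "kazusa.or.jp/codon/cgi-bin/showcodon.cgi?species=3847") using the OPTIMIZER program (Puigbo P., Guzman E., Romeu A., and Garcia-Vallve S. 2007. OPTIMIZER: a web server for optimizing the codon usage of DNA sequences. Nucleic Acids Research, 35:W126-W131) and has a GC content of about 37.5%. The test Cas Soy 1.1.1S subject synthetic polynucleotide sequence, which encodes the RGN, comprises codons that are not specified according to the conventional soybean codon usage table and has a GC content of about 49.5%. The control reference Cas Soy 1.1.1 and test Cas Soy 1.1.1S coding sequences were inserted into otherwise identical plant expression cassettes.
The expression vectors were transfected into soybean protoplasts under similar conditions. Immunoblot (i.e., "western blot") probing for the expressed Cas polypeptide revealed a higher level of expression from the test Cas Soy 1.1.1S subject synthetic polynucleotide sequence than from the Cas Soy 1.1.1 soybean codon-optimized reference polynucleotide (Figure 2).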
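Example 2 - Performance of Cas expression vectors in tomato and soybean protoplasts
Cas expression vectors (comprising either the soybean codon-optimized reference polynucleotide sequence of Cas Soy 1.1.1 or the subject synthetic polynucleotide sequence of Cas Soy 1.1.1S, and each further comprising an expression cassette for an RNA guide directed to a tomato genomic locus) were transfected into tomato protoplasts. Cas Soy 1.1.1 or Cas Soy 1.1.1S expression vectors (each further comprising an expression cassette for an RNA guide directed to a soybean genomic locus) were also transfected into soybean protoplasts. After treatment, DNA was extracted from the transfected protoplasts and the editing efficiency at the target site was quantified. Cas Soy 1.1.1S increased editing efficiency about 10-fold relative to Cas Soy 1.1.1 (Figure 3).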
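The comparison in Example 2 amounts to a ratio of two editing efficiencies. The text does not describe how efficiency was measured; the sketch below assumes it is the fraction of sequencing reads at the target site that carry an edit, and the read counts are hypothetical values chosen only to illustrate a 10-fold difference.

```python
def editing_efficiency(edited_reads: int, total_reads: int) -> float:
    """Fraction of reads at the target site that carry an edit."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    if not 0 <= edited_reads <= total_reads:
        raise ValueError("edited_reads must be between 0 and total_reads")
    return edited_reads / total_reads


def fold_change(test_eff: float, control_eff: float) -> float:
    """Ratio of the test efficiency to the control efficiency."""
    if control_eff <= 0:
        raise ValueError("control efficiency must be positive")
    return test_eff / control_eff


# Hypothetical read counts, not data from this document.
control_eff = editing_efficiency(50, 10_000)    # reference construct
test_eff = editing_efficiency(500, 10_000)      # synthetic construct
ratio = fold_change(test_eff, control_eff)
```

A ratio is undefined when the control shows no editing at all, which is why `fold_change` rejects a zero control value rather than returning infinity.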
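Example 3 - Construction of Cas synthetic sequences and expression levels in soybean cells
Two Cas nuclease expression vectors were synthesized that encode the same RGE polypeptide of SEQ ID NO:132 but use different codons. The soybean codon-optimized reference polynucleotide sequence of Cas12j-2, which encodes the RGE, comprises codons specified according to a conventional soybean codon usage table (Figure 1; from the World Wide Web site "kazusa.or.jp/codon/cgi-bin/showcodon.cgi?species=3847") using the OPTIMIZER program (Puigbo P., Guzman E., Romeu A., and Garcia-Vallve S. 2007. OPTIMIZER: a web server for optimizing the codon usage of DNA sequences. Nucleic Acids Research, 35:W126-W131) and has a GC content of about 48.7% (SEQ ID NO:133). The test Cas12j-2 subject synthetic polynucleotide sequence, which encodes the RGE, comprises codons that are not specified according to the conventional soybean codon usage table and has a GC content of about 58.4% (SEQ ID NO:137). The control reference and test Cas12j-2 coding sequences were inserted into otherwise identical plant expression cassettes.
The expression vectors were transfected into soybean protoplasts under similar conditions. Immunoblot (i.e., "western blot") probing for the expressed Cas polypeptide revealed a higher level of expression from the test Cas12j-2 subject synthetic polynucleotide sequence than from the reference Cas12j-2 polynucleotide.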
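Example 4 - Increase in editing efficiency
Cas expression vectors (comprising either the soybean codon-optimized reference polynucleotide sequence of Cas12j-2 or the subject synthetic polynucleotide sequence of Cas12j-2, and each further comprising an expression cassette for an RNA guide directed to a soybean genomic locus) were transfected into soybean protoplasts. After treatment, DNA was extracted from the transfected protoplasts and the editing efficiency at the target site was quantified. The expressed subject synthetic polynucleotide showed a higher editing efficiency than the reference polynucleotide.
Although the foregoing disclosure has been described in some detail by way of illustration and example for purposes of clarity of understanding, it will be apparent that certain changes and modifications may be practiced within the scope of the appended claims.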
Sequence Listing
<110> Inari Agriculture, Inc.
Bevan, Scott A.
Joyce, Adam
<120> Improved polynucleotides for expression of RNA-guided nucleases and DNA binding proteins in soybean
<130> P13455WO00
<150> 63/075,395
<151> 2020-09-08
<150> 63/072,585
<151> 2020-08-31
<150> 63/001,806
<151> 2020-03-30
<160> 188
<170> PatentIn version 3.5
<210> 1
<211> 1368
<212> PRT
<213> Streptococcus pyogenes
<400> 1
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro Lys
1100 1105 1110
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Gly
1205 1210 1215
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala
1310 1315 1320
Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr Ser
1325 1330 1335
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
<210> 2
<211> 4104
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 2
atggacaaga aatattctat agggctggac attgggacaa acagtgtcgg gtgggcagtg 60
attaccgatg aatacaaggt accatcaaaa aaatttaagg tcctcggcaa cacagatcgc 120
cactccatta agaaaaattt gattggtgct ctcttgttcg acagtggcga gactgccgaa 180
gctacacgtc tgaagagaac cgccagacgc cgctataccc gtaggaaaaa cagaatctgt 240
tacctccagg aaattttttc taacgagatg gctaaggtgg acgactcatt ctttcacagg 300
ctggaggaat cctttcttgt cgaagaagat aagaagcacg agaggcaccc aatcttcggc 360
aatattgttg atgaggtggc ctaccacgaa aaatatccta ctatctatca cttgagaaag 420
aaactcgtcg actctaccga caaagcagat ctgcgtctca tttatctggc acttgcacac 480
atgatcaagt tccgtgggca tttcttgatt gagggggatt tgaaccctga taactctgat 540
gtagataagc tgtttattca gctggtccag acctacaacc agctgtttga agagaaccct 600
ataaatgctt caggagttga tgctaaggct atcctcagcg cacgcctgtc aaaaagcagg 660
agacttgaga accttatcgc tcagctccca ggggagaaaa aaaatggcct gtttggcaat 720
ctgatcgcac tctcattggg cttgacacct aacttcaaaa gtaatttcga tctcgctgag 780
gacgcaaaac tgcaactgtc taaggatact tacgatgacg acctcgacaa cttgctcgcc 840
cagataggag accagtatgc cgatctcttc ttggcagcca aaaatttgag tgacgcaatc 900
cttttgtctg atattctgcg cgttaatact gagataacta aggctcctct ctctgctagc 960
atgattaaga ggtacgacga acaccatcaa gatcttaccc ttctcaaggc cctcgtgcgc 1020
caacagctgc cagagaaata taaggagatt ttcttcgacc agtcaaagaa cggctacgca 1080
ggctacatag acgggggggc tagccaggag gaattctaca agttcatcaa gcccatcctt 1140
gaaaagatgg atggcacaga ggaactgttg gtcaagctga acagggaaga ccttctcaga 1200
aaacaaagga ctttcgacaa cggaagcata cctcaccaaa tccatttggg ggaattgcat 1260
gctatactta ggcgccaaga agacttctat cccttcttga aggacaatcg tgagaaaata 1320
gagaagatcc ttacattcag aattccatac tacgtcgggc ctctggccag aggaaattcc 1380
cgttttgctt ggatgactcg taaatctgaa gagaccataa caccctggaa tttcgaagaa 1440
gttgtagata agggggcttc tgctcagagt ttcatcgaaa gaatgacaaa tttcgataag 1500
aacttgccta atgagaaagt attgccaaaa cactctcttc tttatgagta ctttaccgtc 1560
tataacgaat tgactaaggt caaatatgtt accgaaggca tgaggaaacc cgcctttttg 1620
tcaggggagc agaagaaagc aatcgtcgat ctcctgttta aaactaatag gaaagttaca 1680
gttaagcagt tgaaggaaga ttatttcaaa aagattgaat gttttgattc tgtggaaatc 1740
tctggagtag aggatcgctt taatgccagc ttggggacat accatgatct gcttaaaatt 1800
atcaaggaca aagattttct ggacaatgaa gaaaatgagg acatcctgga agatatagtt 1860
ctgactttga ccttgtttga ggatagagaa atgatagagg aaagactcaa aacttatgct 1920
catttgttcg acgacaaggt tatgaagcaa ctgaaaaggc gtagatatac cgggtgggga 1980
cgcttgagta gaaaattgat caacggcata agggataaac agagcggcaa gaccattctt 2040
gatttcttga aatcagatgg ctttgccaac cgcaacttca tgcaactgat ccacgacgat 2100
agtcttactt ttaaagagga tatccagaag gcccaagtct cagggcaagg ggacagcctg 2160
cacgaacaca tagccaacct cgctggctct cctgcaatta agaagggaat ccttcagaca 2220
gtaaaggtcg tggacgagct tgtgaaggta atggggcgcc acaagccaga aaacatcgtg 2280
attgaaatgg caagggaaaa ccagaccacc cagaaaggac aaaagaacag tagggagcgc 2340
atgaagcgca tagaagaagg gattaaagag ctcgggtccc aaatcctcaa ggagcatccc 2400
gtagaaaata cacaacttca gaatgagaag ctgtacctgt actacctgca aaacggtaga 2460
gatatgtacg tagatcaaga actcgacatc aaccgcttgt ctgattatga cgtggaccac 2520
atcgtccctc agtctttcct taaggacgac tctattgata acaaggttct gacaaggagc 2580
gacaagaata ggggcaagtc cgacaatgtg ccctctgaag aggtcgtaaa gaagatgaag 2640
aattattgga ggcaattgtt gaacgcaaaa ttgattactc agagaaagtt tgacaatctt 2700
actaaggcag aacgtggagg actgtctgag ctcgacaaag ccgggttcat caagagacaa 2760
ctcgttgaaa caagacagat tacaaagcat gtcgcacaga tattggactc caggatgaat 2820
actaagtatg acgaaaacga caagctgatc agagaagtca aagtgattac tcttaagtct 2880
aagctcgtca gcgactttcg caaagatttt caattttaca aagtaaggga gattaataat 2940
taccaccacg ctcatgacgc ctacttgaat gcagtcgttg gcactgcact gatcaagaag 3000
taccccaaac tcgaatctga attcgtttat ggtgattaca aagtttatga tgtccgtaag 3060
atgatagcaa agtccgagca ggaaatagga aaggctactg ctaaatattt cttctactct 3120
aatattatga atttcttcaa aactgaaata acattggcca acggggagat ccgtaagcgt 3180
cctttgatag agacaaatgg ggaaacagga gagattgtct gggataaggg aagagacttc 3240
gccaccgtca gaaaagttct ctccatgccc caggtgaaca ttgttaaaaa gaccgaggtc 3300
cagacagggg gtttttccaa agaatccata ctgcctaaga gaaactctga caaattgatc 3360
gcaaggaaga aagattggga tccaaagaag tacggaggct tcgattcacc tacagtagct 3420
tactccgtat tggtagtcgc taaggtggag aaaggcaaat ctaagaagct caagtctgta 3480
aaggaactcc ttggaatcac aattatggag agatccagtt ttgagaaaaa cccaatagac 3540
ttcctggaag caaagggata caaggaggta aaaaaagatc tgataatcaa actcccaaag 3600
tattctttgt tcgagttgga gaatggaaga aaacgcatgt tggcttctgc cggtgagctg 3660
cagaagggca atgagcttgc tctcccatcc aagtatgtta atttcctgta tttggcaagc 3720
cattatgaaa agttgaaagg cagccccgag gacaacgagc aaaagcagtt gtttgtagag 3780
caacataaac actacctcga cgagataatt gagcaaatca gcgagttcag taagcgtgtt 3840
atcttggccg acgctaacct tgacaaagtc ttgagcgcat acaacaagca tcgtgacaaa 3900
cctatcagag agcaagctga gaatataata catttgttca cacttactaa tttgggcgct 3960
ccagccgctt ttaaatactt cgacactaca attgaccgta aaagatacac tagcacaaaa 4020
gaggttctgg atgcaactct tatacaccag agcattactg gtttgtatga gactaggatt 4080
gatcttagcc aactcggagg ggac 4104
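Sequence-listing blocks such as the one above store nucleotides in groups of ten followed by a running position count. The sketch below shows one way to strip that layout and translate the result with the standard genetic code; the helper names are illustrative. The two sample lines are the first 60 nucleotides of SEQ ID NO:2 and SEQ ID NO:3 as printed in this listing.

```python
import re

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, codons enumerated in TCAG order.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def parse_listing_block(text: str) -> str:
    """Join the nucleotide groups of a listing block, dropping whitespace
    and the running position counts. Pass sequence lines only, without
    the <2xx> header lines."""
    return "".join(re.findall(r"[acgtun]+", text.lower()))


def translate(dna: str) -> str:
    """Translate complete codons to one-letter amino acids ('X' if unknown)."""
    dna = dna.upper().replace("U", "T")
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, usable, 3))


seq2_line = ("atggacaaga aatattctat agggctggac attgggacaa "
             "acagtgtcgg gtgggcagtg 60")
seq3_line = ("atggacaaga aatactctat cgggctcgat atagggacca "
             "actccgtcgg gtgggccgtc 60")

seq2_start = parse_listing_block(seq2_line)
seq3_start = parse_listing_block(seq3_line)
protein2 = translate(seq2_start)
protein3 = translate(seq3_start)
```

Both fragments translate to the same twenty residues, which correspond to the first twenty residues of SEQ ID NO:1 (Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val Gly Trp Ala Val), even though the two nucleotide sequences use different codons.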
<210> 3
<211> 4104
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 3
atggacaaga aatactctat cgggctcgat atagggacca actccgtcgg gtgggccgtc 60
atcaccgacg agtacaaggt tccctctaag aagttcaagg ttctcgggaa tactgacagg 120
cactcaatca aaaagaacct catcggcgcc ctcctctttg actccgggga gaccgccgag 180
gcaaccaggc tcaagagaac cgccagaagg aggtacacca ggaggaaaaa cagaatctgt 240
tacctccaag agattttctc caatgagatg gctaaggtgg acgacagttt cttccacagg 300
ctcgaggagt catttctcgt ggaggaagac aagaagcacg aaaggcaccc catattcggc 360
aacatcgtcg atgaggtggc ctaccatgag aagtacccca caatttacca cctcaggaag 420
aagctcgttg actcaacaga caaggccgat ctcagattga tatacctcgc ccttgcccac 480
atgataaagt ttaggggcca cttcctcatc gagggggacc tcaaccccga caactccgac 540
gttgacaagc ttttcatcca gcttgtccag acttacaacc agttgttcga ggagaacccc 600
atcaacgcct ctggcgtgga cgccaaggcc atactctccg ccagactctc caagagcagg 660
aggttggaga acctcatcgc tcaactcccc ggggagaaaa agaacggcct ttttggaaac 720
ctcatcgccc tctccctcgg actcaccccc aattttaaga gtaactttga tctcgccgag 780
gatgcaaagc ttcaactctc aaaggacaca tacgacgacg atcttgacaa tctcttggcc 840
caaattggag accagtatgc cgacctcttt ctcgccgcta agaacctctc cgacgccatt 900
ctcctctccg acatcctcag ggttaacacc gagatcacca aggcccccct ctccgccagc 960
atgattaaga ggtacgacga gcaccaccaa gacctcaccc ttctcaaggc acttgtcagg 1020
cagcaactcc ccgagaagta caaggagata ttcttcgatc agtccaagaa cgggtacgcc 1080
gggtatatcg atggcggtgc ctcccaagag gagttctata agtttatcaa gcccatcctc 1140
gagaagatgg acggaaccga ggaacttctc gtcaaattga acagggagga cctcctcaga 1200
aagcagagaa cttttgataa cgggtctatc ccccaccaga tccaccttgg cgaactccac 1260
gccattctca ggaggcagga ggacttctat cctttcctca aagacaacag agagaagatc 1320
gagaagattc ttaccttcag gataccctac tacgtcgggc cactcgccag ggggaacagt 1380
aggttcgcct ggatgaccag aaaaagcgag gagaccatca ccccctggaa ttttgaggag 1440
gtggtcgaca agggagccag cgcccagtca tttatcgaga gaatgactaa cttcgataag 1500
aatctcccca acgagaaagt tttgcccaag cactccttgc tttatgagta tttcactgtg 1560
tacaatgagc tcaccaaggt gaaatatgtg acagaaggca tgcgcaaacc cgcctttctc 1620
agcggtgagc agaagaaagc catcgtggac cttctcttca agaccaacag gaaggtgacc 1680
gtcaagcagc tcaaggagga ctacttcaaa aaaatcgaat gttttgactc cgtcgagatc 1740
agcggcgtcg aagacaggtt caatgcctca ctcggtactt accacgatct cctcaagatc 1800
atcaaggaca aggacttcct cgataatgaa gaaaacgagg acatccttga ggacatcgtg 1860
ctcaccctca ccctcttcga ggacagggaa atgatcgagg aaaggctcaa gacctacgct 1920
catctctttg acgacaaagt catgaaacag ttgaagagga ggaggtatac cggttggggc 1980
agattgagca gaaaactcat aaatggcatc agagataagc aatcaggcaa gactatcctc 2040
gacttcctta agtcagacgg attcgccaat aggaacttta tgcagcttat ccacgatgac 2100
tcactcacct ttaaggagga catccagaag gcccaggtgt ccggtcaagg ggacagcctc 2160
cacgagcata tcgccaatct cgccgggtct cctgccatca agaaggggat cctccagacc 2220
gtgaaggtcg tcgatgagct cgttaaggtc atgggcagac acaaacccga gaacatcgtc 2280
attgagatgg ccagggagaa tcagaccacc cagaagggcc agaaaaactc tagggagagg 2340
atgaaaagga tcgaggaggg tatcaaggag ttgggctccc agatcctcaa ggagcacccc 2400
gtggaaaaca cccagctcca gaacgaaaag ctctatctct attaccttca gaacggaaga 2460
gatatgtacg tcgaccaaga gttagatatt aacaggcttt ctgattatga cgtggatcac 2520
atcgtgcctc agtccttcct caaggacgac agcatcgata ataaggtgct caccaggtca 2580
gacaagaaca ggggcaagtc tgacaacgtt cccagtgaag aggtcgttaa aaagatgaag 2640
aactactgga ggcagttgct caatgccaag ctcatcaccc aaaggaagtt cgacaacctc 2700
accaaggccg agaggggagg gctcagcgag ctcgacaagg ccggatttat caaaaggcag 2760
ctcgttgaga ctagacagat aaccaagcac gtcgcacaga tcctcgattc cagaatgaat 2820
acaaagtacg acgaaaatga caagctcata agggaggtga aggtgattac cctcaagagc 2880
aagttggtgt ccgactttag gaaggacttc cagttctaca aggtcaggga gattaacaac 2940
taccaccatg cacacgacgc ttacctcaac gccgtcgtgg gtaccgccct cattaagaag 3000
taccccaagc ttgaatcaga atttgtctac ggcgactaca aggtctacga cgtcaggaaa 3060
atgattgcca aatccgaaca ggagatcggg aaagccaccg ctaagtactt cttctactcc 3120
aacatcatga atttcttcaa gaccgagatt accctcgcca acggggagat caggaagagg 3180
cccctcatcg agaccaacgg tgaaacaggc gagatcgtct gggacaaggg gagagatttc 3240
gccactgtta ggaaggtgct cagcatgccc caggtcaaca tcgtgaagaa gacagaggtc 3300
cagactggcg gattcagcaa agagtccatc cttcccaaga gaaacagcga caaactcatt 3360
gccaggaaga aggactggga ccctaaaaag tacggcgggt tcgatagccc caccgtggcc 3420
tacagcgttt tggttgtggc caaagtggag aagggcaaaa gcaagaagct caagagcgtg 3480
aaggaactcc tcgggatcac catcatggaa aggtcctcct tcgagaagaa ccccatcgac 3540
ttcctcgaag ccaaggggta caaggaggtg aaaaaggatc tcatcattaa actccccaag 3600
tacagcttgt tcgaactcga aaatgggagg aagagaatgt tggccagcgc aggtgaactc 3660
cagaagggca atgaactcgc ccttcccagc aagtacgtga acttcctcta cctcgcttcc 3720
cactatgaaa aattgaaggg tagcccagag gacaacgagc agaaacagtt gttcgtggaa 3780
caacacaaac actacctcga cgagatcatc gagcagatca gcgagttcag caagagggtc 3840
atactcgccg acgccaacct cgacaaggtc ctcagcgcct acaacaagca cagagacaag 3900
cccatcaggg agcaggccga aaacatcatc catctcttca ccctcaccaa tcttggggca 3960
cccgccgcat ttaaatactt cgacaccact attgacagaa agaggtatac atctaccaag 4020
gaggtgctcg acgccaccct catccaccag tccataaccg gcttgtacga gaccagaatc 4080
gacttgagcc agctcggggg ggac 4104
<210> 4
<211> 4104
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 4
atggacaaga agtatagtat agggctcgac atcggcacca atagtgttgg ctgggccgtg 60
atcaccgatg agtacaaggt gccctccaaa aagtttaagg tgctcgggaa taccgacaga 120
cacagcatca aaaagaattt gatcggggcc ctcctctttg actccgggga gaccgccgag 180
gccaccaggc tcaagaggac tgccaggagg aggtacacca ggaggaaaaa caggatctgc 240
tacctccaag agatcttcag taacgaaatg gccaaggtcg acgactcctt cttccacagg 300
ctcgaggaga gcttccttgt ggaggaagac aaaaagcacg agaggcaccc cattttcggg 360
aatattgtcg acgaggtggc ctatcatgag aagtacccca ccatctacca cctcagaaag 420
aagctcgttg acagtaccga taaagccgac ctcaggctca tctatcttgc cctcgcccac 480
atgataaaat ttagaggaca tttcttgatc gagggtgacc ttaaccctga caactccgac 540
gtcgataagc ttttcatcca gctcgtccag acatataacc agctcttcga ggagaacccc 600
atcaatgcct ctggcgttga tgccaaggcc atactttcag ccaggctctc caagtccagg 660
aggctcgaga atctcatcgc ccagctccct ggagaaaaga agaatgggct ctttgggaac 720
ctcatagccc tttcattggg tctcacaccc aacttcaagt caaactttga cctcgcagaa 780
gacgccaaac tccagctctc caaggacacc tacgatgatg acctcgacaa cctcctcgcc 840
cagattgggg accagtacgc cgacctcttc ctcgcagcta agaacctctc tgacgccatc 900
cttcttagcg acatcttgag agtcaacacc gagatcacca aggcccccct ttcagccagc 960
atgatcaaga gatacgacga acaccaccag gacctcaccc ttctcaaggc cctcgtgagg 1020
cagcagctcc ctgagaagta caaggagata ttcttcgatc agagcaaaaa cgggtatgcc 1080
ggatacatcg acgggggagc ctctcaggag gagttctata agttcatcaa gcctatcctc 1140
gaaaagatgg atggcaccga ggaattgctc gtcaagctca atagggagga cctcctcaga 1200
aaacagagga ccttcgacaa cgggagcatc ccccaccaaa tccatctcgg cgagctccac 1260
gccatactta ggagacaaga ggacttctac ccattcctca aggacaacag ggagaagatc 1320
gagaagatcc tcaccttcag aatcccctac tacgtcggac cacttgccag aggaaacagc 1380
aggttcgcct ggatgactag gaaatccgag gagaccatca ccccctggaa tttcgaggaa 1440
gttgtcgaca agggcgcatc cgcccagagc tttatagaga ggatgactaa cttcgataaa 1500
aatctcccta acgagaaggt cctcccaaaa cacagcctcc tctacgagta tttcaccgtg 1560
tacaacgagc ttaccaaggt caagtacgtc accgagggga tgaggaagcc tgccttcttg 1620
agcggagagc agaagaaggc catcgttgac ctcctcttta agaccaacag aaaggtgacc 1680
gtgaagcaac tcaaagagga ttatttcaag aaaatcgagt gtttcgacag tgttgagatc 1740
agcggcgtgg aagataggtt caatgcctca ctcggtacat accacgatct cctcaaaatc 1800
atcaaggaca aggacttcct cgataacgag gaaaacgagg acatcctcga ggatatcgtt 1860
ctcacattga ccctcttcga ggacagggag atgatcgagg agaggctcaa gacctatgcc 1920
cacctctttg acgataaggt gatgaagcag ctcaagagga ggaggtacac cgggtggggc 1980
aggctcagca gaaagctcat caatgggatc agggacaagc agagcgggaa gaccatcctc 2040
gacttcctca agtccgacgg gtttgccaat aggaacttta tgcagctcat acacgacgac 2100
tccctcacct tcaaggagga catacaaaaa gcccaggtca gcgggcaggg cgacagcttg 2160
cacgagcaca tagcaaatct cgccggttcc cccgccatca agaaagggat actccagact 2220
gtgaaggtcg tcgatgaact cgtcaaggtc atgggaaggc ataagcccga gaatatcgtc 2280
atcgagatgg caagggagaa ccagaccacc cagaaggggc agaaaaatag tagggagagg 2340
atgaagagga tcgaggaggg cataaaggag ctcgggtccc agatcttgaa ggagcaccct 2400
gtggagaaca cacaactcca gaacgaaaag ctctacctct actacctcca aaacggcagg 2460
gacatgtacg tggaccaaga gctcgacata aacaggctct ccgactacga tgttgatcac 2520
atcgttcctc agagcttcct caaagatgac agcatcgaca ataaggtcct taccagaagc 2580
gacaagaata gaggcaagag cgacaacgtg ccctccgaag aggtggttaa aaagatgaaa 2640
aactattgga ggcagctcct taacgctaag cttatcaccc agaggaagtt cgataacctt 2700
accaaagccg aaaggggcgg cttgtccgag ctcgacaagg ccggcttcat caagaggcag 2760
ctcgtcgaaa ccaggcagat caccaagcac gtggctcaaa tcctcgatag caggatgaat 2820
actaagtacg acgagaatga caaactcata agggaggtca aagttatcac actcaagagc 2880
aaactcgtgt ccgacttcag gaaggacttc cagttctaca aggtgagaga aatcaacaac 2940
tatcaccatg cccacgacgc ctacctcaat gccgtggtgg gcaccgccct catcaagaag 3000
taccccaagc tcgagagcga gttcgtttat ggcgactaca aggtttacga cgtcagaaag 3060
atgatcgcca agagtgagca agaaattgga aaggccactg ccaagtactt cttctacagc 3120
aatatcatga acttctttaa gaccgagatc accctcgcaa atggcgagat caggaagagg 3180
cccctcatcg agaccaacgg cgagaccggc gagatcgtct gggataaagg gagggacttc 3240
gctactgtca ggaaagtcct cagtatgccc caagtcaaca tcgtcaaaaa aaccgaagtg 3300
caaaccggag gcttcagcaa ggaatccatc ctccccaaga ggaatagcga caagctcatc 3360
gccaggaaga aagactggga tcccaaaaaa tacggtgggt tcgactcccc cactgtcgca 3420
tacagcgttc tcgtggtcgc caaagtggag aaaggcaaga gcaagaaact taagagcgtc 3480
aaggaactcc tcgggataac catcatggag aggagcagct tcgagaagaa tcccatcgac 3540
ttcctcgaag ccaagggata caaggaggtc aaaaaggacc tcatcatcaa gctccccaag 3600
tatagcctct ttgagttgga gaacggaaga aagaggatgc tcgcatcagc cggggagttg 3660
cagaagggca atgaacttgc cctccccagc aagtacgtca atttcctcta cctcgccagc 3720
cactacgaaa aactcaaagg gtcaccagaa gacaacgagc aaaagcagct cttcgtcgag 3780
cagcacaagc attacctcga cgaaatcata gagcagatct ctgagttctc caaaagggtt 3840
atcttggccg atgctaacct tgacaaggtg ctttcagcct acaacaagca cagggataaa 3900
cccataaggg agcaggcaga gaatattatc catctcttca ccctcaccaa cctcggggcc 3960
cccgcagcct tcaagtactt cgataccaca atcgacagga agaggtacac cagtaccaag 4020
gaggtgcttg atgccaccct catacaccag tccatcaccg gtctctacga gaccaggatc 4080
gatctttctc agctcggggg ggac 4104
<210> 5
<211> 4104
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 5
atggacaaga agtatagcat cggcctcgac atcgggacca actcagtcgg gtgggccgtg 60
atcaccgacg agtacaaggt gccctccaag aagttcaaag tcctcggcaa taccgacaga 120
catagcatca agaagaacct catcggggcc ctcctctttg actccgggga gactgctgag 180
gccaccaggt tgaagaggac cgccaggaga aggtacacca ggagaaagaa caggatctgt 240
taccttcagg agatattttc caacgaaatg gctaaagttg acgattcatt cttccatagg 300
ctcgaggagt cattcctcgt ggaggaagac aagaagcacg agaggcaccc catcttcggg 360
aatatcgtgg atgaggttgc ctaccacgag aagtatccca ctatctacca cctcagaaag 420
aaactcgtcg attccaccga caaagccgat ctcaggctca tctaccttgc cttggctcac 480
atgatcaagt tcaggggcca ctttctcatc gagggggact tgaaccccga caactccgac 540
gttgataaac tcttcatcca gctcgtgcaa acctacaacc agctcttcga ggagaacccc 600
atcaacgctt ccggggttga cgccaaggcc atccttagcg ccaggctctc caagtctaga 660
aggctcgaga acctcatcgc ccagttgccc ggggagaaaa agaatgggtt gttcggcaac 720
ctcatagcct tgagtttggg actcacaccc aacttcaaga gcaacttcga cctcgccgaa 780
gatgccaagt tgcagctctc taaggacacc tacgacgacg acctcgacaa ccttctcgcc 840
cagataggcg accagtacgc cgacctcttc ctcgccgcca agaacctctc tgacgccatc 900
ctcctcagcg acatcctcag ggtcaacacc gagatcacca aggcccctct ttccgccagc 960
atgatcaaga ggtacgacga gcaccatcag gacctcaccc tcctcaaagc cctcgtcaga 1020
caacagctcc ccgagaaata caaggagatt ttcttcgacc aatccaagaa cggatatgcc 1080
ggctatatag acgggggggc ttcccaggag gagttctaca agtttatcaa gcccatcctc 1140
gagaagatgg acggcaccga ggagctcctc gttaagctca atagggagga cctcctcagg 1200
aaacagagga cattcgacaa cggctcaatc ccccatcaga tacatctcgg ggagctccat 1260
gccatcttga gaaggcagga ggacttctac cccttcctca aggacaatag ggaaaagatc 1320
gagaagatcc tcaccttcag gatcccttac tacgtcggcc ccctcgccag ggggaatagc 1380
aggttcgcct ggatgaccag gaagagcgag gagaccatca ccccttggaa cttcgaagag 1440
gtggtggaca aaggcgctag cgcccagagc ttcatcgaga ggatgaccaa ctttgataag 1500
aacctcccta acgaaaaagt cttgcccaag cactccctcc tctacgagta ctttaccgtt 1560
tacaacgagc tcaccaaggt caaatacgtc accgaaggca tgaggaagcc cgccttcctt 1620
tccggcgagc agaagaaggc cattgttgac cttttgttca agaccaacag gaaagtgacc 1680
gtcaagcagc tcaaagagga ttacttcaag aaaatcgaat gcttcgacag cgtcgaaatc 1740
tccggcgtgg aagacaggtt caacgcaagc ttgggcacct accatgacct tctcaagatt 1800
atcaaggata aggactttct cgacaacgag gaaaacgagg atattctcga ggatatagtg 1860
ctcacactca ccctcttcga ggacagggag atgattgagg aaaggctcaa gacctacgca 1920
cacctcttcg acgacaaggt tatgaaacag ctcaaaagga ggaggtacac cggctggggt 1980
aggctttcca ggaagctcat caacggaatc agggacaaac agagcggaaa gaccattctc 2040
gacttcctca agagcgacgg ctttgccaac aggaatttta tgcagctcat ccacgacgac 2100
tcactcacct tcaaggaaga tatccaaaaa gcccaggtct ccggacaggg tgacagcctt 2160
cacgagcaca tcgccaactt ggccgggtct cccgccatca agaagggcat tcttcagact 2220
gtcaaggtgg tggacgagct cgtgaaggtt atgggcagac acaagcccga gaacatcgtt 2280
attgagatgg ccagggagaa tcagaccacc cagaaaggcc agaaaaatag cagggagaga 2340
atgaagagga ttgaagaagg gatcaaagag ctcggctccc agatcctcaa agagcatccc 2400
gtggagaata ctcagcttca aaacgagaag ctctacctct actacctcca gaacggcaga 2460
gatatgtacg tcgaccaaga actcgacatc aacaggctct ccgactacga cgtggaccac 2520
atcgtcccac agtccttcct caaggacgac tccatagaca acaaggtcct caccaggagc 2580
gacaagaaca gggggaaatc tgataatgtg cccagcgagg aggtcgtgaa gaagatgaag 2640
aactactgga ggcaattgtt gaacgccaaa ctcatcaccc aaaggaagtt cgacaatctc 2700
accaaggccg agaggggggg cctcagcgag cttgacaagg ccggcttcat aaagaggcag 2760
ctcgtcgaga ccaggcagat cacaaagcac gtcgctcaaa tcctcgacag cagaatgaac 2820
accaagtacg acgagaacga caagcttatc agagaggtga aggtcatcac tttgaagagc 2880
aagctcgttt ctgacttcag gaaggatttc caattctaca aggttagaga gatcaacaac 2940
taccaccatg cccacgacgc ttacctcaac gccgtcgttg gaaccgccct catcaagaag 3000
tatcctaaac tcgagtccga gtttgtgtat ggagactata aggtctacga cgtcagaaag 3060
atgatcgcca agtccgagca ggagatcgga aaggccaccg caaagtactt tttctactcc 3120
aacataatga acttcttcaa aactgagatc actctcgcca acggggagat caggaagagg 3180
ccactcatcg agaccaacgg tgagaccggc gagatcgtct gggacaaagg gagggatttc 3240
gccactgtca ggaaggttct cagcatgcca caggtcaaca tagtcaagaa gaccgaggtt 3300
cagactgggg gcttctccaa agagagtatc cttcccaaga ggaacagcga caaactcatc 3360
gcaaggaaga aagactggga ccccaagaag tacggcggct tcgacagccc caccgtcgcc 3420
tattccgtcc tcgttgttgc caaggttgag aagggtaaat ccaagaagct caaatccgtt 3480
aaggaactcc tcgggataac catcatggag aggagctcct tcgagaaaaa ccccattgac 3540
tttctcgagg ccaaggggta taaagaagtc aaaaaggacc tcatcatcaa gctcccaaaa 3600
tacagcctct tcgagcttga aaacggtagg aagaggatgc ttgcctccgc cggggagctc 3660
cagaagggga acgagctcgc cctcccctct aagtacgtga acttccttta tttggcttcc 3720
cactacgaga agctcaaagg cagccccgag gacaatgagc aaaagcagct ctttgtcgag 3780
caacacaagc actacctcga tgagatcatt gagcagatct ccgagttctc caaaagggtc 3840
attctcgccg acgcaaacct tgacaaggtg ctctcagcct ataacaagca cagggacaaa 3900
cccatcagag agcaggctga gaatatcatc cacctcttca ccctcaccaa tttgggcgcc 3960
cccgccgcct ttaagtactt cgacactacc atcgacagga agaggtacac aagcaccaaa 4020
gaggtgctcg acgccaccct catacatcag tcaattaccg ggctttacga aactaggatc 4080
gacctcagcc aacttggggg agat 4104
<210> 6
<211> 4104
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 6
atggacaaga agtactctat cgggctcgat atcggtacca acagcgtcgg ttgggccgtt 60
atcaccgacg aatataaggt gcccagtaag aagttcaagg tcctcggtaa cacagacaga 120
cattccatca agaagaatct catcggtgcc ctcctctttg acagcggtga gacagccgag 180
gccacaaggc tcaagaggac cgccaggagg aggtacacca ggagaaagaa cagaatctgc 240
tacctccagg agatcttcag caacgaaatg gccaaggtgg atgattcctt cttccacaga 300
ttggaggaga gctttctcgt ggaggaggat aagaagcacg agaggcatcc tattttcggc 360
aacatcgttg acgaggtcgc ctatcatgag aaatatccca ccatttacca cctcaggaaa 420
aagctcgtgg acagcactga taaggccgac ttgagattga tctacctcgc ccttgcccac 480
atgatcaagt tcagagggca cttcctcatc gagggagacc tcaacccaga caacagcgac 540
gtcgacaagc tcttcatcca gctcgtgcag acttacaacc agctcttcga ggagaacccc 600
atcaacgcca gtggcgtgga cgccaaggcc atcctcagcg ctaggctcag caaatctagg 660
aggttggaga acttgatcgc ccagctcccc ggcgagaaga agaatgggct tttcgggaac 720
ctcattgccc tctccctcgg cctcaccccc aacttcaagt ccaacttcga cctcgccgag 780
gatgccaagc tccaactcag caaggacacc tacgatgacg atctcgacaa cctcctcgcc 840
caaatcgggg accagtacgc agatctcttt ctcgctgcca agaacctctc cgacgccatc 900
ctcttgagcg acatcctcag agtgaacacc gagatcacta aggcccccct ctctgccagc 960
atgatcaaga ggtacgacga gcatcaccag gacctcactc ttctcaaggc cctcgttagg 1020
cagcagctcc ccgagaagta taaggagatc ttctttgacc agagcaagaa cgggtacgcc 1080
ggatacattg acggaggggc ctcccaggaa gaattttaca agttcatcaa gcccatcctc 1140
gagaaaatgg acgggaccga agaactcctc gtgaagctca acagagagga cctcttgagg 1200
aagcaaagga ccttcgataa cggcagcatc ccccatcaga tccacctcgg tgagctccac 1260
gccatactca ggaggcagga ggacttttac ccatttctca aggacaacag ggaaaaaatc 1320
gagaagatcc tcaccttcag gatcccctac tacgttggcc ccctcgccag agggaactcc 1380
aggttcgcct ggatgaccag aaagagtgaa gagaccatca ccccctggaa cttcgaggag 1440
gttgtggaca agggggccag cgcccagagt ttcatcgaga ggatgaccaa ctttgacaag 1500
aacctcccca atgagaaagt cctccccaag cacagcctcc tttacgagta cttcaccgtc 1560
tacaatgagc tcaccaaggt taaatacgtc accgaaggca tgagaaaacc cgccttcctc 1620
tccggcgagc agaagaaggc aatagtggat ctcctcttta aaacaaacag gaaggtgacc 1680
gtcaaacagc tcaaggaaga ctacttcaag aagatagagt gcttcgatag cgttgaaatc 1740
tccggcgtcg aggacaggtt taatgccagc ctcgggactt accacgacct cctcaagatc 1800
atcaaggaca aggatttcct cgacaacgag gagaatgaag atatcctcga ggacatcgtc 1860
ctcaccctta ccctcttcga ggatagagag atgatcgagg agaggctcaa gacttacgcc 1920
cacttgttcg acgataaagt catgaagcaa ctcaaaagga gaaggtacac cggatggggg 1980
aggctcagca ggaagttaat caatggtatc agggacaagc aaagcggtaa aactatcctc 2040
gatttcctca aatccgacgg ctttgccaac aggaacttca tgcagctcat ccacgacgac 2100
agccttacct tcaaagagga catccagaag gctcaagtct ctggccaggg agacagcctc 2160
cacgagcaca tagccaacct cgccgggtct cccgccatca agaagggaat ccttcagacc 2220
gtcaaggtcg ttgatgagct cgtaaaagtg atgggcaggc acaagccaga gaacatcgtc 2280
atagagatgg ccagggagaa ccagaccacc cagaaaggcc agaaaaacag cagagaaaga 2340
atgaagagga tcgaagaagg gataaaggag ctcgggtccc agatactcaa agagcaccca 2400
gttgagaata cccagctcca aaacgagaaa ctctacctct actacctcca gaatgggagg 2460
gacatgtatg tggaccaaga gcttgacatc aataggttgt ccgactacga cgtcgaccac 2520
atcgtccccc agagctttct caaggacgac agtatcgaca acaaggtgct caccaggtcc 2580
gacaagaaca gagggaagag cgacaacgtc ccctccgagg aagtggtgaa aaagatgaag 2640
aactactgga ggcaattgtt gaacgccaag ctcatcaccc agaggaagtt cgataacctc 2700
accaaggcag agaggggggg tctctctgag ctcgacaagg ccggtttcat caaaaggcag 2760
cttgtggaga caaggcaaat caccaagcat gtcgctcaga tcctcgactc caggatgaac 2820
accaagtacg atgaaaatga caagctcatc agggaggtta aggtcatcac cttgaagtct 2880
aaactcgtta gcgacttcag aaaggatttc cagttctaca aggtgaggga gatcaacaat 2940
taccatcacg cacacgacgc ctatctcaac gctgttgtcg gcacagccct cattaagaag 3000
taccccaagc tcgagagcga gttcgtctat ggtgattata aggtttacga cgtcaggaag 3060
atgatcgcca agagcgagca ggagatcgga aaggcaaccg ccaagtactt cttctactcc 3120
aacatcatga acttcttcaa gaccgagata accctcgcca acggcgagat caggaaaagg 3180
cccctcatag aaaccaatgg ggaaacagga gagatcgttt gggacaaggg tagggacttc 3240
gcaaccgtga ggaaggttct tagcatgccc caggttaata tcgttaagaa gacagaggtg 3300
cagaccggag ggttctctaa agagagcatc ctcccaaaaa gaaacagcga taagctcata 3360
gccaggaaaa aggactggga ccccaaaaag tacgggggtt ttgacagtcc caccgttgcc 3420
tattcagttc tcgtggtcgc taaggtcgag aagggcaaga gcaagaagtt gaagagtgtt 3480
aaggaactcc tcggcatcac catcatggag aggagctcct tcgaaaagaa ccccatcgac 3540
ttcctcgaag ccaagggtta caaggaggtt aagaaggatc tcatcatcaa gctccctaag 3600
tacagcctct ttgagctcga aaatgggagg aagaggatgc ttgctagcgc cggagagttg 3660
cagaaaggga acgagctcgc cctcccctcc aagtacgtta acttcctcta cctcgcctca 3720
cactatgaga agcttaaagg atcacccgag gacaatgagc aaaagcagct ctttgtggag 3780
cagcacaagc actatctcga cgagatcata gagcagattt ccgagttctc aaaaagggtt 3840
attctcgccg atgccaacct cgataaggtc ctctccgcct ataacaagca cagggacaag 3900
cccataagag agcaggccga aaacattata catcttttta cccttaccaa cctcggggcc 3960
cccgccgcct tcaagtactt cgataccaca atcgatagga agagatatac ctccactaag 4020
gaagttctcg acgccacatt gatccaccag agcataacag gcctctacga gaccaggatc 4080
gatttgtccc agctcggcgg tgat 4104
<210> 7
<211> 4104
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 7
atggacaaga agtactccat cggactcgac attgggacca actccgtggg ctgggccgtt 60
atcaccgacg aatacaaggt cccctcaaag aaattcaagg ttctcggaaa tactgatagg 120
cactccatca agaagaacct cataggggct ctcctcttcg actctgggga aaccgccgag 180
gctaccaggc tcaagaggac cgccaggagg aggtacacca ggagaaagaa tagaatctgc 240
tatctccagg agatattctc aaacgaaatg gccaaagtcg acgactcttt cttccacagg 300
cttgaggaaa gcttcctcgt tgaggaggac aagaagcacg agagacaccc tatcttcggt 360
aacatcgttg acgaagtggc ataccacgag aagtacccca ccatctacca tctcaggaag 420
aagctcgttg atagcaccga caaggccgac ctcagactca tttatttggc actcgcacac 480
atgatcaaat tcagggggca cttcctcatc gaaggggacc tcaatcccga caatagtgat 540
gtggacaagc tcttcatcca gctcgtccag acttataacc agctcttcga agagaatccc 600
attaacgcca gcggagtgga cgctaaagcc atcctcagcg ccaggctctc caagtccagg 660
aggcttgaga atctcatcgc ccagttgcct ggcgaaaaga aaaacgggct cttcggtaac 720
ctcatcgcac tctccctcgg ccttaccccc aactttaagt ccaactttga cctcgccgag 780
gacgctaagc tccaactcag caaagacacc tacgatgatg acctcgacaa cttgctcgcc 840
cagatcggcg accagtacgc tgatttgttc ttggccgcca agaatctcag cgacgcaatt 900
ctcctctccg acatcttgag ggtcaatacc gagatcacca aagcccccct cagcgccagc 960
atgatcaaaa gatatgacga acaccaccaa gacctcacac ttctcaaggc cctcgttagg 1020
caacagctcc ccgagaagta caaggaaatc ttcttcgacc agagcaagaa tgggtacgca 1080
gggtacatcg acgggggcgc ctctcaagag gagttctaca agtttattaa gcccatcttg 1140
gagaaaatgg atgggaccga agagcttctc gtgaaactca acagggagga ccttctcagg 1200
aagcaaagaa ccttcgacaa tggcagtata ccccatcaaa tccatctcgg agagctccac 1260
gccatcctca ggaggcagga ggacttctat cccttcctta aggacaacag agaaaaaatc 1320
gagaagatct tgaccttcag gatcccctac tacgtcggcc ccctcgccag ggggaactcc 1380
agattcgctt ggatgaccag gaagagcgag gagacaatca ctccatggaa cttcgaagaa 1440
gtggttgaca aaggggcatc cgcccagtcc ttcatagaga ggatgaccaa ctttgacaag 1500
aatctcccca acgagaaggt cctccccaag cactccctcc tctatgagta cttcactgtt 1560
tataacgaac tcaccaaggt taagtacgtc accgagggaa tgagaaagcc cgccttcctc 1620
tccggcgagc aaaagaaggc catcgtggac ctcctcttca aaaccaatag gaaggttacc 1680
gttaagcagc tcaaggagga ttacttcaag aaaattgagt gctttgatag cgtggagatc 1740
agcggggttg aggacaggtt caacgcatca cttggtacat accacgattt gctcaagata 1800
atcaaggata aggactttct cgacaacgag gagaacgagg acattctcga agacatcgtg 1860
ctcaccttga ccttgttcga agatagggag atgatcgagg aaaggctcaa aacctacgct 1920
catctcttcg atgacaaggt tatgaaacag ctcaaaagga ggaggtacac tggttggggc 1980
aggttgagca ggaaactcat caacggaatc agggacaagc aaagcggtaa gacaatcttg 2040
gatttcctta agtccgatgg gtttgctaac aggaacttca tgcagctcat ccacgatgat 2100
agcctcacat tcaaggagga catccagaag gcccaggtca gcgggcaggg ggacagcctc 2160
catgagcaca tcgccaacct cgccggcagt cccgcaatta agaaagggat cctccaaacc 2220
gtcaaggttg tggacgaact cgtgaaggtc atgggaaggc acaagcccga gaacatagtt 2280
atcgagatgg ccagggagaa tcagacaacc caaaaggggc agaagaactc cagggagagg 2340
atgaaaagga ttgaggaggg aatcaaggag ctcgggtccc agatcctcaa agagcaccca 2400
gtggagaaca cccagctcca aaacgagaag ctttatctct actatcttca aaacgggagg 2460
gacatgtacg tcgaccagga gctcgacatc aacagactca gcgactacga cgttgaccat 2520
atcgttcctc agtcattttt gaaggacgac tccatcgaca acaaggtgct caccagaagc 2580
gacaagaata ggggcaagtc tgacaatgtg ccctcagagg aggtcgtcaa gaagatgaag 2640
aactactgga ggcagctcct caacgctaag ctcatcacac agaggaaatt cgacaacctc 2700
accaaggccg aaaggggggg gctctccgag ctcgacaagg ccggcttcat caagaggcag 2760
ctcgttgaga ccaggcagat caccaaacat gtcgcccaga tcctcgacag tagaatgaac 2820
acaaagtacg acgagaacga caagctcatc agagaggtca aggttatcac cctcaaatcc 2880
aagctcgtgt ctgacttcag gaaagacttc cagttctaca aggtgaggga gatcaacaat 2940
tatcaccacg cccatgatgc ctacctcaac gccgtcgttg ggaccgccct catcaagaag 3000
tatcccaaac tcgagagcga attcgtctac ggcgactata aggtctacga cgtgaggaaa 3060
atgatcgcca aatccgagca ggaaatcggt aaggccaccg ccaagtactt cttctactcc 3120
aacatcatga acttcttcaa gaccgagatc accctcgcca acggtgagat caggaagagg 3180
cccctcatcg aaaccaatgg agagacaggc gaaatcgtgt gggataaggg cagggacttt 3240
gccaccgtta ggaaggttct ctcaatgccc caagtcaaca tcgtcaagaa gactgaggtc 3300
cagacagggg gcttcagcaa ggaaagcatc cttcccaaga ggaactccga caagctcatc 3360
gcaaggaaga aggactggga cccaaagaag tacgggggct tcgactcccc caccgttgcc 3420
tattccgtgc tcgtggttgc gaaggttgag aaggggaagt ccaagaagct caaaagcgtg 3480
aaggaactcc tcggcatcac tatcatggag agatctagct tcgagaagaa tcccatcgat 3540
ttccttgaag ctaaggggta caaggaagtc aagaaggacc ttatcattaa gctcccaaag 3600
tactctctct tcgagctcga aaatggcagg aagaggatgc tcgcctccgc cggggagctc 3660
caaaagggga acgaactcgc cctccccagt aaatatgtca acttcctcta tctcgcctcc 3720
cactatgaaa agctcaaggg gagccccgaa gacaacgaac agaagcaact cttcgtggaa 3780
cagcataagc attacttgga cgagatcatc gagcagatct ccgagttttc caagagagtg 3840
atcctcgccg atgccaacct tgacaaggtt ctctcagcct ataacaagca tagggacaaa 3900
cccatcaggg aacaggccga gaatatcatc catctcttca ccctcaccaa cctcggcgcc 3960
ccagccgcat ttaagtactt tgacaccact atcgacagga aaaggtatac cagcaccaaa 4020
gaggtcctcg atgctaccct catccaccaa tccatcaccg gcctctacga gaccaggatc 4080
gacctctccc aactcggggg cgac 4104
<210> 8
<211> 4104
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 8
atggacaaga aatactcaat cggtctcgat atcggcacca atagcgtggg ctgggccgtt 60
atcaccgacg agtacaaggt cccaagcaag aagttcaagg ttctcgggaa caccgacagg 120
cacagcataa agaagaacct cataggggcc ctcctcttcg acagcggcga gaccgccgaa 180
gccaccaggc tcaagaggac cgccaggagg aggtacacta ggagaaagaa caggatctgc 240
tacctccaag aaattttttc caacgaaatg gccaaggtcg acgactcctt cttccatagg 300
ctcgaggaat ccttccttgt tgaggaagac aagaagcatg agaggcatcc catattcggg 360
aacattgtcg atgaggttgc ctaccacgag aaatatccaa ctatctacca cctcagaaag 420
aaactcgtgg acagcaccga caaggccgat cttaggctta tctacctcgc cctcgcccac 480
atgataaagt tcaggggcca cttcttgatc gagggcgacc tcaaccccga caatagcgac 540
gtcgataagc tcttcatcca actcgtccag acctacaacc agctcttcga ggagaatcca 600
atcaacgcct ccggggtgga cgccaaagcc atactcagcg cccgtctctc aaaatccaga 660
aggctcgaga accttattgc ccaactcccc ggtgagaaga agaacggcct ctttggaaac 720
ctcatcgctc tcagcttggg ccttactccc aactttaaga gcaacttcga cctcgccgag 780
gatgccaaac tccaactcag caaggacacc tacgacgacg acctcgacaa cttgctcgcc 840
cagattgggg atcagtacgc cgaccttttc ctcgccgcca aaaaccttag tgacgccatc 900
ctcctctccg acattcttag ggtgaatacc gagatcacca aggcacccct cagcgcctca 960
atgatcaaga gatacgacga gcaccatcag gatctcactc tcctcaaagc cctcgtgagg 1020
caacagctcc cagaaaagta caaggagatt tttttcgacc agagtaagaa cgggtacgcc 1080
gggtacatcg acggcggcgc ttcccaggaa gaattctaca agttcataaa gcccatcctc 1140
gagaagatgg acggcactga ggaattgctc gtgaagctca acagggaaga tctccttagg 1200
aagcagagga cctttgacaa cgggagcatc ccccaccaga tccacctcgg cgagctccac 1260
gcaatcctta ggagacagga ggacttctac cccttcctta aggacaacag ggagaaaatc 1320
gagaaaatac ttaccttcag gatcccctac tacgtcggcc cacttgccag ggggaacagc 1380
aggttcgcct ggatgaccag gaagtcagag gagactatca ccccctggaa tttcgaggag 1440
gtggtcgata agggcgcctc agcccagtcc tttattgaaa ggatgacaaa tttcgataag 1500
aacttgccca acgagaaggt cctccccaaa cactccctcc tctacgagta tttcactgtc 1560
tacaatgagt tgacaaaggt gaaatacgtt accgagggca tgaggaagcc tgcattcctc 1620
agcggtgaac agaagaaggc tatcgttgat cttttgttta agactaatag aaaagttact 1680
gttaagcagc tcaaagagga ctatttcaaa aagatcgaat gcttcgactc cgtggagatc 1740
agcggggtcg aagacaggtt taacgcctcc ctcgggacct accatgacct cctcaagatc 1800
atcaaagaca aggatttcct cgacaacgag gagaacgagg acatcttgga ggatatcgtg 1860
ctcaccctca ccctcttcga ggacagggag atgatcgagg agaggctcaa gacatacgcc 1920
cacctcttcg acgacaaggt catgaagcaa ctcaaaagaa ggaggtacac cgggtggggt 1980
agactcagca ggaagctcat caatggcatc agagataagc agagtggtaa gactatcctc 2040
gatttcctca agagcgacgg cttcgccaat aggaacttca tgcaattgat acacgacgat 2100
agcctcacct tcaaagagga catccagaaa gcccaggttt ccggtcaagg ggacagcctc 2160
cacgagcaca tcgccaacct tgccggcagt cccgcaatta aaaagggcat cctccagacc 2220
gttaaagtcg ttgacgagct tgtcaaggtg atgggcagac acaagccaga aaacatcgtc 2280
atcgagatgg ccagggagaa ccagaccacc cagaagggcc aaaagaactc cagagagagg 2340
atgaaaagga ttgaggaggg tatcaaggag cttggatccc agatcctcaa agagcatcca 2400
gtcgagaaca cacagctcca gaacgagaaa ctctacctct attacctcca gaacggaagg 2460
gacatgtacg tcgatcagga gctcgatatc aacaggctca gcgactacga cgtcgaccat 2520
atcgtccccc agtcctttct caaggacgac agcattgaca acaaggtgct caccagatcc 2580
gacaagaaca gaggcaagtc agataacgtt ccctccgagg aggtggttaa aaagatgaag 2640
aactattgga ggcagctcct caatgcaaag ctcatcaccc agaggaagtt tgacaacctc 2700
accaaggcag aaaggggggg cctctccgag ctcgacaagg ccggattcat caagaggcag 2760
cttgtcgaaa ccaggcagat cacaaaacat gtggcccaaa tcttggattc caggatgaac 2820
acaaagtatg acgagaacga taaattgatc agggaggtga aggtgatcac ccttaagagc 2880
aagctcgtca gtgacttcag gaaggacttc cagttctaca aagtcagaga gatcaacaac 2940
tatcaccatg cccacgatgc ttatcttaac gcagtggttg gcaccgctct tatcaagaaa 3000
taccccaagc tcgagtccga attcgtgtac ggcgactaca aggtttacga cgttaggaag 3060
atgatcgcca aatccgagca agagatcggg aaggccaccg ccaaatactt cttttactcc 3120
aacataatga acttcttcaa gaccgagatc actctcgcca acggggagat caggaagaga 3180
cccctcatcg agaccaacgg agaaaccggg gagatcgtgt gggataaggg cagggacttt 3240
gccaccgtga gaaaagtcct cagcatgccc caggtgaaca tagtcaagaa gaccgaggtg 3300
cagaccgggg gattcagcaa ggagagcatc ttgcccaaga ggaactccga taaactcatc 3360
gccaggaaaa aggactggga ccccaaaaag tacgggggtt tcgactcccc caccgtcgcc 3420
tactccgtcc tcgttgtggc aaaggttgag aaggggaaga gcaagaagct taagagtgtc 3480
aaagaactct tggggatcac catcatggag aggagctcct ttgagaagaa ccccatcgac 3540
ttcctcgagg ccaagggcta caaggaggtg aagaaggatc tcatcatcaa acttccaaag 3600
tactccctct ttgagctcga gaacggaaga aaaaggatgc tcgccagcgc cggggagctc 3660
cagaaaggta atgagctcgc cctcccaagc aagtacgtta acttcctcta tctcgcaagc 3720
cactatgaga agctcaaagg gagtcctgaa gataatgagc aaaaacagct cttcgtcgag 3780
cagcacaagc actacctcga cgagataatc gagcagatct ccgagttctc caagagggtc 3840
atcctcgccg acgctaacct cgacaaggtt ctctccgcct acaataaaca cagggataag 3900
cccatcaggg agcaagccga aaacattata catctcttca cccttacaaa tctcggggcc 3960
cccgccgctt ttaagtattt cgacacaacc atcgacagga aaagatacac atccactaaa 4020
gaagttctcg atgccaccct catacaccag agcatcaccg ggctctatga aacaaggatt 4080
gacctctccc agctcggcgg cgac 4104
<210> 9
<211> 4104
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 9
atggacaaaa aatatagtat cggattggac atcgggacta actccgtggg gtgggctgtt 60
ataaccgacg agtacaaggt gcccagtaag aagttcaaag tcctcggcaa taccgacaga 120
cacagcatta agaagaacct catcggcgct ctccttttcg attccggcga gaccgccgaa 180
gccaccaggc tcaagaggac agcaaggagg agatatacca gaaggaagaa caggatctgc 240
tacttgcagg agatctttag caacgagatg gccaaggtgg acgacagctt cttccacagg 300
ctcgaggaga gctttctcgt ggaggaggat aagaagcacg agaggcaccc catctttggg 360
aacatcgtgg acgaagtcgc ttaccatgag aagtacccca ccatttatca cctcaggaag 420
aagctcgtcg acagcaccga caaagcagac cttaggctca tctacctcgc cctcgcccac 480
atgataaagt tcaggggcca ctttcttatc gagggcgact tgaaccccga caatagtgac 540
gttgacaagc tcttcatcca gctcgttcaa acctacaacc agctcttcga ggagaacccc 600
atcaatgcct ctggtgttga cgccaaggca atcctcagcg ccagactcag caagtccagg 660
aggctcgaga acctcatagc acaactcccc ggcgaaaaga aaaatggcct cttcgggaac 720
ctcatcgcct tgagcctcgg actcaccccc aacttcaaga gcaacttcga cctcgccgag 780
gacgcaaagc ttcaactcag taaggacacc tatgacgacg acctcgacaa cctcctcgcc 840
cagatcgggg accagtacgc cgacctcttc ctcgccgcca aaaacctctc cgacgctatc 900
ttgttgagtg acatcctcag ggttaacacc gagatcacta aggcccccct cagtgccagc 960
atgataaaga ggtatgacga gcaccatcag gatcttaccc tccttaaagc tctcgttcgt 1020
caacaactcc ccgagaagta taaggaaatc ttcttcgacc agtcaaagaa cggctatgcc 1080
gggtatatcg acgggggggc aagccaggaa gagttttaca aattcataaa gcccatcctc 1140
gaaaagatgg acgggaccga ggaactcctc gtcaagctca atagggagga cctcctcagg 1200
aagcaaagga cctttgataa cgggagcatc ccccaccaaa tccacctcgg cgagctccac 1260
gccatcctta ggaggcagga ggacttttac cccttcctca aagacaacag ggagaagata 1320
gagaagatac tcaccttcag gatcccctac tatgtcgggc ccctcgccag agggaattcc 1380
aggttcgctt ggatgaccag gaaatctgaa gaaaccataa caccctggaa ctttgaagag 1440
gtggtcgaca agggggcttc cgcccagtcc ttcatcgaga ggatgaccaa cttcgacaag 1500
aatctcccta atgagaaggt cctccccaag cattcccttc tctatgaata ctttaccgtg 1560
tacaacgagc tcaccaaggt gaagtacgtc accgagggca tgaggaagcc cgcatttctt 1620
agcggcgagc agaagaaggc aatagtggat ctcctcttca agaccaatag gaaggtgacc 1680
gttaagcagc tcaaggagga ttactttaag aaaatcgagt gcttcgattc agtcgaaatc 1740
agcggggtgg aggacaggtt caatgcaagt ctcggtacct atcacgacct cctcaagatc 1800
attaaggaca aggactttct cgacaacgag gagaacgagg acatcttgga ggatatcgtg 1860
ctcaccctca cactctttga ggacagggag atgatcgaag agaggctcaa gacctacgcc 1920
catctctttg acgacaaagt gatgaaacaa ctcaagagga ggaggtacac cggctggggc 1980
aggctctcca ggaaactcat aaacgggatc agagataaac aaagcggaaa gacaatcctc 2040
gatttcctca aatccgatgg tttcgccaac aggaacttca tgcagctcat ccatgacgac 2100
tcactcacct tcaaagagga tatccaaaag gcccaggtca gcggccaggg ggactccctc 2160
catgagcaca tagccaatct cgccggctcc ccagccatca agaagggcat cctccagacc 2220
gtcaaagttg tggacgagtt ggtcaaagtc atgggcagac acaaacccga aaacatcgtc 2280
atcgaaatgg ccagggagaa tcaaaccaca cagaagggtc aaaagaacag cagggagagg 2340
atgaagagaa ttgaagaagg aatcaaagag ctcggaagcc agatcctcaa ggagcaccca 2400
gtcgagaaca cccagctcca aaacgagaag ctctacctct actacctcca gaacggcagg 2460
gacatgtacg tcgaccagga actcgacatc aacaggctct ctgactatga cgtggaccac 2520
attgtgcccc agtccttcct caaggacgac agcatcgaca ataaggtcct cactaggagc 2580
gacaaaaaca gaggtaagtc cgataatgtg ccctccgagg aagtggtgaa gaagatgaag 2640
aattactgga ggcagctttt gaacgccaag ttgataaccc aaagaaagtt cgacaacctc 2700
actaaggctg agaggggtgg actctcagag ttggacaagg ccggattcat caagaggcag 2760
ctcgtggaga ccaggcagat aaccaagcac gtcgctcaaa tcctcgactc caggatgaac 2820
accaagtacg acgagaatga caagctcatc agggaagtga aagttatcac ccttaaaagc 2880
aaactcgtgt ccgacttcag gaaagacttc cagttctata aggttagaga gattaacaac 2940
taccatcacg cccacgacgc ctatttgaac gccgttgtgg gaaccgccct tattaagaaa 3000
taccccaagc tcgaaagtga gttcgtgtac ggcgactaca aagtttatga cgtcagaaaa 3060
atgattgcca agagcgagca agagatcggg aaggctaccg ccaagtactt tttctacagc 3120
aacatcatga acttcttcaa gaccgagatc accctcgcaa acggcgagat caggaagagg 3180
cccctcatcg agaccaacgg cgagaccggg gagatcgtct gggacaaagg tagggacttt 3240
gccaccgtga gaaaggtcct ctccatgccc caggtcaaca tcgtcaagaa gaccgaagtt 3300
cagaccgggg ggttttcaaa ggaatctatc ctccccaaaa gaaacagcga caagctcatc 3360
gccagaaaga aggactggga ccccaagaag tacggcggct ttgactcccc caccgtcgcc 3420
tacagcgttc tcgtggtcgc caaggttgaa aaggggaaaa gcaaaaaact caagtccgtg 3480
aaggagcttc tcggtattac tatcatggag aggtcttcat tcgagaagaa tcccatcgat 3540
ttcctcgagg caaagggtta caaagaggtc aaaaaggatc tcattatcaa gcttcctaaa 3600
tactcattgt tcgaactcga gaatggcagg aagagaatgc ttgcctccgc cggcgagttg 3660
cagaagggca acgaactcgc actccccagc aagtacgtca acttcctcta cctcgccagc 3720
cattacgaaa aacttaaagg ctcccccgag gacaacgagc agaagcagct tttcgttgag 3780
cagcacaaac attacctcga cgagatcatc gagcagatct ccgagttcag caagagggtt 3840
atcctcgctg acgccaacct cgacaaggtc ctttccgcct acaacaagca cagagataag 3900
cccatcaggg agcaggccga gaacatcatc cacctcttca cccttacaaa cctcggcgcc 3960
cctgccgcct ttaaatactt cgacacaaca atcgacagga agagatacac cagcactaag 4020
gaagtgctcg acgccactct catccaccaa tcaatcaccg ggctctacga gaccagaatc 4080
gacctcagcc agttgggtgg cgac 4104
<210> 10
<211> 4104
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 10
atggacaaga aatacagcat tggtctcgac atcggtacca actccgtggg ctgggcagtc 60
atcaccgacg agtataaggt gcccagcaaa aagttcaagg tgttggggaa cactgacagg 120
cactccatca aaaagaacct cattggcgca ctcctttttg acagcggtga gaccgcagaa 180
gccaccaggt tgaagaggac cgccagaagg agatacacaa ggagaaagaa cagaatctgc 240
tatctccagg aaatcttctc caatgagatg gccaaagtcg acgattcctt cttccacagg 300
ctcgaggagt cattcctcgt tgaggaggac aagaagcacg agaggcaccc catcttcggg 360
aacatagtgg atgaagtggc ctaccacgag aagtacccca ccatctacca cctcaggaag 420
aagctcgttg acagcaccga caaagccgac ctcaggctca tctaccttgc cctcgcccac 480
atgatcaaat ttaggggtca tttcctcatt gagggcgacc tcaaccccga caacagcgat 540
gtggacaagc tcttcatcca gcttgtgcag acctacaacc agctcttcga ggagaacccc 600
ataaacgcct caggcgttga cgcaaaggcc attctctctg ccagattgtc caagtccaga 660
aggctcgaga acctcatcgc ccagctcccc ggagagaaga agaacgggct cttcggaaat 720
ctcatcgccc tctccctcgg actcacaccc aacttcaaat ccaactttga cctcgccgag 780
gatgcaaagc tccaactctc caaggacacc tacgacgacg acctcgacaa cctcctcgcc 840
caaatcgggg accagtacgc cgacctcttc cttgctgcca aaaacctcag cgacgccatc 900
ctcctcagcg atatactcag agtcaatacc gaaatcacaa aggcccccct ctccgcctcc 960
atgatcaaga ggtacgacga acaccaccag gatctcacat tgctcaaggc cttggtcaga 1020
cagcagctcc ccgagaaata caaagaaatc ttcttcgacc agtccaagaa cggctacgcc 1080
ggttacatcg acggaggtgc ctcccaagag gagttctata agttcatcaa gcccatcttg 1140
gagaaaatgg acgggaccga ggagctcctc gtgaagctta atagggagga tctcttgaga 1200
aagcagagga cattcgataa cggcagcatc ccccaccaga tccacctcgg agagctccac 1260
gccatcttga ggagacagga agatttctat ccctttctca aagataacag ggagaaaatc 1320
gagaagatcc tcaccttcag aattccctac tacgttggac ccctcgccag aggcaattca 1380
aggttcgcct ggatgacaag gaagagcgag gagaccatca ccccatggaa ctttgaagaa 1440
gtcgtcgata aaggggccag cgctcaaagc ttcatcgaga ggatgaccaa cttcgacaag 1500
aacttgccca acgagaaggt cctccccaag cactcccttc tctacgaata cttcactgtc 1560
tataacgaac tcactaaagt caagtacgtt accgagggca tgaggaaacc cgccttcctt 1620
tccggggagc agaagaaggc catagttgac ctccttttca agaccaacag gaaggtgacc 1680
gttaaacagc tcaaagagga ctacttcaag aagatagagt gcttcgactc tgtggagatt 1740
tccggggttg aggataggtt caacgccagt ctcgggacct accatgacct cctcaagatc 1800
atcaaggaca aggacttcct cgacaacgag gagaacgaag acatcttgga ggatatcgtc 1860
ctcaccctta ccctcttcga ggacagggag atgatcgaag agagactcaa gacttacgca 1920
cacttgttcg acgacaaagt catgaaacaa ctcaaaagga gaaggtacac cgggtggggc 1980
aggctctcta ggaagttgat caacggtatt agagacaaac agagcgggaa gaccatcctt 2040
gacttcctta aaagtgacgg gtttgccaat aggaacttca tgcagctcat ccacgacgac 2100
agcctcacct tcaaggaaga catccagaag gctcaagtgt ccggccaggg ggattcactc 2160
catgaacata tcgccaacct cgccggcagc cccgctatca agaaagggat actccagacc 2220
gtcaaggtcg ttgacgagct cgtgaaagtt atgggcagac acaagccaga gaacatcgtg 2280
atcgagatgg ccagggaaaa ccagacaacc cagaaaggtc agaagaactc cagggagagg 2340
atgaaaagga tcgaggaggg catcaaagag cttgggtccc aaatactcaa agagcacccc 2400
gtcgagaaca cccaactcca aaacgagaag ctctacctct attaccttca gaatggcagg 2460
gacatgtacg tggaccaaga gttggatatc aacaggctca gcgactacga cgtggaccac 2520
atcgtgccac agagcttcct taaggacgac tccatcgata acaaggttct cactagaagc 2580
gacaagaaca gaggcaagag tgacaatgtc ccttcggaag aggtcgttaa gaagatgaag 2640
aactactgga ggcagctcct caacgcaaag cttatcaccc aaaggaagtt cgataatctc 2700
accaaagccg agagaggggg gttgagcgaa ctcgacaagg ccggcttcat aaaaagacag 2760
ctcgtcgaga ctaggcaaat caccaaacac gttgcacaaa tcctcgatag taggatgaat 2820
accaaatatg acgagaacga taagctcatc agagaggtga aggtgatcac cctcaagagc 2880
aaactcgtgt ccgatttcag gaaggacttc cagttctaca aagtgagaga gataaacaac 2940
tatcaccatg cacatgacgc ttaccttaac gccgttgttg gcaccgccct catcaaaaag 3000
taccctaagt tggaatccga atttgtttac ggggactaca aggtttatga cgtcaggaag 3060
atgatcgcaa aatctgagca ggagattggc aaggccaccg caaagtactt tttctactcc 3120
aatatcatga atttcttcaa aacagagatt accctcgcca acggggagat tagaaagaga 3180
cccctcatcg aaaccaacgg ggagaccggg gagatcgtct gggacaaggg gagagacttc 3240
gccaccgtta ggaaagtgct ctctatgccc caagtgaaca tcgtgaagaa gacagaggtt 3300
cagactgggg gctttagcaa ggagagcatc ctcccaaaga ggaacagcga caagttgatc 3360
gccagaaaga aagactggga tcccaagaag tacggcgggt tcgacagccc aaccgtcgcc 3420
tacagcgtcc tcgtcgtcgc aaaagtggag aagggcaaat ccaagaagct caagtccgtg 3480
aaagagcttc tcgggatcac cataatggaa aggagctcct tcgagaagaa ccccatcgac 3540
ttccttgagg ccaagggcta caaagaggtg aagaaagact tgatcatcaa gctccccaag 3600
tactccctct tcgagctcga gaatggcagg aagaggatgc tcgcctcagc cggcgagctc 3660
caaaagggca acgagcttgc ccttcccagc aagtacgtta actttctcta cctcgccagc 3720
cactacgaga agctcaaggg gtcccccgag gacaacgagc aaaagcagct cttcgtggaa 3780
cagcacaagc actatctcga tgagatcatt gagcagatca gtgagtttag caagagggtc 3840
atccttgcag acgccaactt ggacaaggtg ctctccgcct ataacaagca cagagacaag 3900
cctatcaggg agcaggccga gaacatcatc catctcttca ccttgaccaa cttgggcgcc 3960
ccagccgcct tcaagtactt tgacaccacc atcgacagga agaggtacac atccaccaaa 4020
gaggttctcg acgccacact catccaccag tccataaccg ggctctacga gaccaggatc 4080
gatctctccc aactcggggg ggac 4104
<210> 11
<211> 4104
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 11
atggacaaaa aatacagcat cgggctcgac attggaacta attccgttgg gtgggccgtg 60
atcaccgatg agtataaggt gccatctaag aaattcaagg tcctcgggaa taccgacagg 120
cacagcataa agaagaactt gatcggggcc ctcctcttcg acagcgggga aaccgccgaa 180
gccacaaggc ttaaaaggac agccaggagg aggtacacca ggaggaaaaa taggatatgc 240
tacctccaag agatatttag caatgagatg gccaaagtcg atgacagctt cttccatagg 300
ctcgaggaat ccttcctcgt cgaggaggac aaaaagcacg agaggcaccc catcttcggg 360
aacatagttg acgaagtcgc ataccacgag aaatacccca ctatatatca cctcaggaaa 420
aagctcgtgg actccaccga taaggcagac ctcaggctca tctatctcgc cttggcccac 480
atgatcaaat tcaggggtca cttcctcatc gagggcgacc ttaaccccga caacagcgac 540
gtggacaagc tcttcatcca gctcgtccaa acctacaacc agttgtttga ggagaaccct 600
atcaacgcca gcggcgtgga tgccaaggcc atcctctccg ccaggctttc caagagcagg 660
aggttggaaa acctcatcgc ccaattgccc ggcgagaaaa agaacggcct cttcggcaat 720
ctcatcgccc tttcccttgg cctcacaccc aatttcaagt ctaacttcga tctcgctgag 780
gatgccaagc tccagcttag caaggacacc tatgacgacg acctcgacaa tttgctcgca 840
cagatagggg accaatatgc cgatttgttc ctcgccgcaa agaacctcag cgatgccatc 900
ctcctcagcg atatcctcag ggtcaacacc gaaataacca aagccccctt gtccgcttcc 960
atgatcaaga ggtatgacga gcaccaccag gatctcacct tgctcaaggc cctcgtgagg 1020
cagcagcttc ccgagaagta taaagagatc tttttcgacc agtccaagaa tgggtacgct 1080
ggatacatcg acgggggggc ttcccaggag gagttttaca agttcatcaa gcccatcctc 1140
gagaagatgg acggcaccga ggagctcctc gtcaaactca acagggagga tctcctcagg 1200
aagcaaagga cctttgataa cgggagcatc ccccaccaaa ttcatctcgg ggagctccac 1260
gccatcctca gaaggcaaga ggacttttat cccttcctca aggacaacag ggagaaaatc 1320
gaaaagattc tcacctttag gatcccctac tacgtcgggc ccctcgccag gggtaactca 1380
agatttgcct ggatgaccag gaaaagcgag gagactatca ccccctggaa cttcgaagag 1440
gtcgttgata aaggcgccag cgcccagtcc ttcatcgaaa ggatgaccaa ctttgacaag 1500
aatctcccca acgaaaaagt ccttcctaag cactcactct tgtacgagta cttcaccgtc 1560
tacaatgaac tcaccaaggt taaatacgtg accgagggaa tgagaaagcc cgccttcctc 1620
agcggggaac agaagaaggc catcgttgac ctcctcttca agacaaacag gaaagttacc 1680
gttaagcagc tcaaggagga ctactttaag aagattgaat gcttcgatag cgttgagatc 1740
tccggcgtcg aggacagatt caacgccagt ctcggaacct accatgacct cctcaagatc 1800
atcaaggaca aagacttctt ggataacgag gagaacgagg atatccttga ggacatcgtg 1860
ctcaccctta cccttttcga agacagggag atgatcgagg aaaggctcaa gacctacgcc 1920
cacctcttcg acgacaaggt catgaagcaa ctcaagagga gaaggtacac cggttggggc 1980
aggctctcca ggaagcttat caatggtatt agggacaagc agagcggaaa aactatcctc 2040
gacttcctta agagtgatgg atttgctaat aggaatttca tgcagctcat acacgacgac 2100
agcctcacct tcaaggaaga tatccaaaag gcccaggttt caggccaggg agatagcctc 2160
catgaacaca ttgctaatct cgcagggagt ccagccatca agaaagggat cttgcagacc 2220
gtgaaggttg tggatgagct cgtgaaggtc atggggaggc ataagcccga gaacatcgtt 2280
atcgagatgg ccagggagaa ccaaaccaca cagaaggggc aaaagaacag cagagagagg 2340
atgaaaagaa ttgaagaagg aatcaaggaa ctcgggagtc agatcctcaa agagcaccct 2400
gtggagaaca cccagcttca gaacgagaag ctctacctct actacttgca gaacggaagg 2460
gacatgtacg tcgaccagga gctcgacatc aacaggctca gtgattacga cgtcgaccat 2520
atcgtcccac aatccttcct taaggacgac agcatagaca acaaagtgct tacaaggagc 2580
gataagaaca gaggtaagag tgacaacgtc ccatccgagg aggtcgtcaa gaagatgaag 2640
aactactgga ggcaacttct caacgctaag cttatcacac agaggaagtt cgataacctt 2700
accaaggccg agaggggggg actttccgaa cttgacaagg ccgggttcat caagagacag 2760
ctcgtcgaga ccagacaaat caccaaacac gtcgcccaga ttctcgatag caggatgaat 2820
accaagtacg acgagaacga caaactcatc agggaggtga aggtgataac cctcaagagc 2880
aagctcgtct cagactttag gaaggacttc cagttttaca aggtcaggga aataaacaac 2940
taccatcatg cccacgacgc ctatctcaac gccgtggtcg gcaccgccct cataaagaag 3000
taccctaaac tcgagagtga gttcgtctac ggcgactaca aagtctatga tgtcaggaaa 3060
atgattgcaa agtccgaaca ggagattgga aaggcaaccg ccaagtactt tttctactca 3120
aacattatga acttcttcaa gaccgagata accctcgcaa acggcgaaat caggaagagg 3180
ccccttatcg agaccaacgg cgagactggt gaaatcgttt gggataaggg cagggacttc 3240
gccaccgtca ggaaggtgct ctccatgccc caggtgaaca tcgtgaagaa gacagaggtc 3300
caaactggcg gcttctccaa ggaatccatc ttgccaaaaa ggaactctga caagctcatc 3360
gccaggaaaa aggactggga ccccaagaag tacgggggct tcgactcccc cactgtcgcc 3420
tactccgtcc tcgtggttgc taaggtggag aaggggaaga gtaagaagct caaaagcgtc 3480
aaggagctcc tcggcatcac tatcatggag aggtcaagtt tcgagaagaa ccccatcgat 3540
ttcttggagg ccaaggggta caaagaggtc aagaaggacc tcatcatcaa gctccccaag 3600
tactctctct tcgagcttga gaacgggagg aagaggatgc tcgcctccgc cggggagttg 3660
cagaagggaa acgagctcgc cctccccagc aagtatgtga acttcttgta cctcgcctca 3720
cactacgaga agctcaaagg atcccccgag gataacgagc agaagcagct cttcgttgaa 3780
cagcataagc actacttgga cgagatcatt gaacaaatct ctgagttctc caaaagggtt 3840
atactcgccg acgccaacct cgataaagtg cttagcgcct acaataagca cagggataaa 3900
cccatcaggg aacaggccga gaacataatc cacctcttca ccctcaccaa ccttggggct 3960
ccagcagcct tcaagtattt cgataccacc atcgatagga agagatatac ctcaaccaag 4020
gaagtcctcg acgccactct tatccaccag agtatcaccg gcttgtacga gaccaggatt 4080
gatctctccc agctcggcgg tgac 4104
<210> 12
<211> 4104
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 12
atggacaaga aatacagcat tggcctcgac attggaacca actcagtggg ctgggccgtg 60
atcaccgacg agtacaaggt cccctccaag aaatttaaag ttctcggtaa caccgacagg 120
cactccatta agaagaacct tattggcgcc ttgctcttcg actctggcga aaccgccgag 180
gccacaaggc tcaaaagaac agcaaggagg aggtacacca ggaggaagaa taggatctgc 240
tacctccaag aaatcttcag taacgagatg gctaaggtcg acgattcctt ctttcacaga 300
ctcgaggagt ccttccttgt ggaggaagac aagaagcatg aaaggcaccc catcttcggt 360
aacatagtcg acgaggtggc ctaccacgag aagtacccca caatctacca tctcagaaaa 420
aagttggtcg actccaccga caaggccgat ctcaggctta tctacctcgc cttggctcac 480
atgatcaagt ttaggggcca tttcctcatc gagggcgacc tcaatcccga taacagcgac 540
gtcgacaagc tctttatcca gctcgttcag acatacaatc agttgttcga ggaaaacccc 600
atcaacgcca gcggcgtcga tgccaaggca atcctttccg caaggctctc taagagcaga 660
aggctcgaaa atctcatcgc ccaacttccc ggcgagaaga agaatgggct ctttggtaac 720
ttgatcgccc tctcactcgg gctcacaccc aacttcaaga gcaatttcga tctcgcagaa 780
gacgctaagc tccagctctc caaggatacc tacgatgatg acctcgacaa cctccttgcc 840
cagattgggg accagtacgc cgacctcttt cttgccgcca agaacctcag cgacgccata 900
ctcctctccg acatcctcag ggttaatacc gaaataacca aggcccccct cagtgcctcc 960
atgatcaaga gatacgacga gcaccaccaa gacctcacac tcctcaaagc cctcgtcagg 1020
cagcagctcc ccgagaagta caaagagatc tttttcgatc agtccaagaa tggctatgcc 1080
ggctacatcg atggcggggc ctctcaggag gagttctaca aattcatcaa acccatcctc 1140
gagaagatgg acggaaccga ggagctcctc gtcaagctca acagggagga cctcctcagg 1200
aagcagagga cgttcgacaa cgggagcatc ccccaccaga ttcatctcgg cgagctccac 1260
gctatactca ggaggcagga agatttctac cccttcctca aggacaacag ggagaagatc 1320
gaaaagatcc tcaccttcag gattccttac tacgttgggc ccctcgccag aggcaacagc 1380
aggttcgcat ggatgaccag gaaatctgag gagaccatca ctccctggaa ctttgaggag 1440
gttgtcgaca agggggcctc cgctcagagc ttcatcgaga ggatgaccaa ctttgataag 1500
aatctcccta acgagaaggt cctccctaaa cattccctct tgtacgaata cttcaccgtc 1560
tacaacgagc tcaccaaagt caagtatgtc actgagggga tgagaaagcc agccttcctc 1620
agtggggaac agaaaaaagc catcgtcgac ctcctcttca aaaccaacag aaaggttacc 1680
gtgaaacagc tcaaggagga ctatttcaaa aaaatagaat gtttcgattc cgttgagatc 1740
tcaggcgttg aggacaggtt caacgccagc cttgggacct atcacgacct ccttaagatc 1800
atcaaggata aggattttct cgacaatgag gagaacgagg atatcctcga ggacatcgtc 1860
ctcaccttga ccctcttcga ggacagggag atgatcgagg agaggctcaa gacctacgct 1920
cacctcttcg acgacaaggt tatgaagcag ctcaagagga ggaggtacac cggctggggc 1980
aggctttcaa gaaaacttat caacgggata agggacaagc agtccggcaa gaccatcctc 2040
gatttcctca agtctgacgg cttcgccaac aggaacttca tgcagctcat ccatgacgac 2100
tccctcacct tcaaggaaga catccagaaa gctcaggtct caggacaggg cgacagcctc 2160
cacgagcaca tcgccaacct cgccggatcc cccgccatta agaagggaat attgcaaacc 2220
gttaaagttg tggacgagct cgttaaggtt atggggaggc ataaacccga gaacatcgtg 2280
atcgagatgg ccagggagaa ccagacaacc cagaagggac aaaagaatag cagagaaaga 2340
atgaagagga tcgaggaggg aatcaaggag ttgggttccc agatactcaa ggagcacccc 2400
gtggagaaca cccagttgca gaacgaaaag ctctacctct actatcttca gaacggcagg 2460
gacatgtacg tggatcagga gctcgatatc aacagactct ccgactatga cgtggaccac 2520
atcgttccac agagctttct taaagacgac agcatcgata acaaagttct caccaggagc 2580
gataagaaca ggggcaaatc cgacaatgtt cccagcgaag aagtcgtgaa aaagatgaag 2640
aattactgga gacagcttct caatgccaaa ctcatcaccc agagaaagtt cgacaacctc 2700
accaaggccg agaggggtgg gctcagcgag ctcgacaagg ccggctttat aaagagacag 2760
ctcgtggaga ccaggcagat cactaagcat gtggcccaga tcctcgactc caggatgaat 2820
accaaatacg acgagaatga caagctcatc agggaggtca aagtgatcac cctcaagtcc 2880
aagctcgtgt ccgacttcag gaaagatttt caattttaca aagtgagaga gatcaataac 2940
tatcaccacg cccacgatgc ctacctcaat gccgtggtgg gaacagccct tatcaagaaa 3000
taccctaagc tcgaatccga gttcgtgtac ggagactata aggtctacga cgtgagaaaa 3060
atgatcgcaa agtccgagca ggagatcggc aaggccaccg caaagtactt tttctacagc 3120
aacatcatga acttcttcaa gaccgagatc accctcgcca acggcgagat caggaagagg 3180
cctctcatcg agaccaatgg ggagaccggg gagattgttt gggacaaggg gagggacttt 3240
gccactgtga ggaaggtttt gagtatgcct caggtgaaca tcgtcaagaa gacagaggtt 3300
cagacaggcg ggttttccaa ggagagtatt ttgcccaaaa ggaactccga caagctcata 3360
gccaggaaga aagactggga ccccaaaaag tacgggggct ttgacagccc cactgtggcc 3420
tatagcgtcc ttgtcgtggc caaggtcgag aaggggaagt ccaagaagct caagagtgtg 3480
aaggagctcc tcggcatcac catcatggaa aggtccagct tcgagaagaa ccccatagat 3540
ttcctcgagg ccaaggggta caaggaggtg aagaaggacc tcatcatcaa gctccccaag 3600
tacagcctct tcgagctcga aaacggtagg aagaggatgc tcgcctctgc cggagagctc 3660
cagaagggca acgagctcgc cctccccagc aagtacgtga acttcctcta cctcgccagc 3720
cactacgaga agctcaaggg gagccccgaa gacaacgagc agaagcagct ctttgtggag 3780
caacacaaac actacctcga cgaaattatc gagcagatct ccgagtttag caaaagagtc 3840
atcctcgctg acgccaacct tgacaaggtg ctctctgcct ataacaagca cagggacaag 3900
cctatcaggg agcaggccga gaacatcata catcttttca ccttgaccaa cctcggagcc 3960
cctgccgcct tcaagtactt cgacaccaca atcgatagga agaggtacac cagcactaag 4020
gaggttttgg acgccaccct cattcaccaa agcatcaccg gactctacga gaccaggatc 4080
gaccttagcc aactcggagg cgac 4104
<210> 13
<211> 1053
<212> PRT
<213> Staphylococcus aureus
<400> 13
Met Lys Arg Asn Tyr Ile Leu Gly Leu Asp Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Glu Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Asn Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Asn Asp Leu Leu Asn Arg Ile
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro Arg Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 14
<211> 3159
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 14
atgaagagga actatatact aggcctcgac ataggtatca ccagcgtggg ctacggcatc 60
atagattatg agactaggga cgtgatagat gctggtgtta ggctattcaa ggaggcgaat 120
gtcgagaaca atgaaggacg tagatcaaag cgcggcgcac gccgcctgaa acgcagaagg 180
cgacacagga ttcagagggt gaagaaactg ttgttcgact acaatttgct tactgaccat 240
tcagagcttt ccggtatcaa tccttacgag gctcgtgtga aaggcctcag ccaaaaattg 300
tccgaagaag aattttccgc tgcacttctg cacctggcaa aacgacgtgg tgtccataac 360
gtcaatgaag ttgaagagga taccgggaat gagttgagta caaaagaaca aattagtaga 420
aatagtaaag ctttagagga aaaatatgtc gctgagctcc aattagaaag attaaagaag 480
gatggtgagg ttaggggttc cataaatcgt tttaagacaa gtgattacgt caaagaggca 540
aagcaacttc ttaaagtaca gaaggcatat catcaattgg atcaatcctt tatcgataca 600
tacatcgatc ttcttgagac tcgtagaact tactatgaag gcccaggcga gggttctcca 660
tttggctgga aggatataaa agagtggtat gaaatgctga tgggtcattg tacctacttt 720
ccagaggaac taaggtctgt taagtatgct tataacgccg atctttataa cgccctcaat 780
gacctgaata acctagttat tactagggac gaaaatgaaa agttggaata ttatgaaaag 840
ttccagatta ttgagaacgt tttcaaacaa aagaagaaac cgaccctaaa acagatagct 900
aaggaaattt tagtcaacga agaggatatt aagggatacc gtgttacaag cactggcaaa 960
cctgaattta ccaatctcaa agtttatcat gatattaaag acattaccgc ccggaaagag 1020
ataattgaaa acgctgaact tttagaccag attgcgaaga ttttgactat atatcaatcc 1080
tcagaggata tccaggagga gttaacgaat ctcaactcag agcttaccca agaagagata 1140
gagcagataa gtaacttgaa gggatacact gggactcata atctctcgct gaaggcaata 1200
aatttgatcc ttgatgaatt atggcatacc aatgataatc aaatagctat tttcaacagg 1260
ctaaagttgg tgcctaagaa ggttgatttg agccaacaaa aggagatccc aacgacttta 1320
gtcgatgatt tcattctgag cccagtagtg aagcggtcat tcattcaatc aatcaaggtt 1380
ataaatgcca ttattaagaa gtacggctta cctaacgata ttataatcga actcgccaga 1440
gaaaaaaata gcaaggatgc ccagaaaatg atcaacgaaa tgcaaaagag aaatcggcag 1500
actaatgaga gaatcgagga aatcatcagg accacaggta aggagaatgc aaaatacctg 1560
attgaaaaga tcaaattgca tgacatgcaa gagggaaagt gcttgtattc cctcgaagca 1620
atccccctcg aggatcttct taacaaccct ttcaactatg aggttgatca cattattcct 1680
aggtctgtgt cattcgataa tagcttcaat aacaaagtgc ttgtgaagca ggaagagaac 1740
tcaaaaaaag ggaacagaac ccccttccag tacctcagtt cctctgattc caaaatttct 1800
tatgaaactt ttaaaaagca cattttgaac ctagcaaagg gtaagggaag aatttctaag 1860
acaaagaaag agtatctttt agaggagcgg gatattaata ggtttagtgt gcaaaaagat 1920
ttcataaaca gaaaccttgt tgacaccaga tatgccaccc ggggtcttat gaatttactt 1980
agatcgtact tccgggtgaa caatttggac gttaaggtga agtcaatcaa tgggggcttc 2040
acctcatttc taaggcggaa atggaagttt aaaaaagaac gtaataaggg gtacaagcat 2100
cacgcagaag atgcactcat tattgcaaac gcggatttta ttttcaaaga gtggaagaaa 2160
ctcgataaag caaaaaaagt tatggaaaac caaatgttcg aggaaaaaca agctgaatct 2220
atgcctgaaa tagagacgga gcaagaatac aaggagattt tcatcactcc tcatcagatt 2280
aagcatataa aggacttcaa ggattataag tatagccatc gcgtggacaa aaagcctaat 2340
agagagctta tcaatgatac tctttattca acccgtaagg atgataaagg taataccttg 2400
attgtcaata atctcaatgg tctgtacgat aaagacaatg acaaacttaa gaaactcatt 2460
aataaatctc cagaaaagct gcttatgtat caccacgacc cgcaaacata tcaaaagttg 2520
aagctgataa tggagcaata tggagatgag aagaaccctc tctataaata ttatgaggag 2580
accggaaact atcttacaaa atactccaag aaggataacg gaccggttat aaaaaagatt 2640
aaatactacg gtaataaact taacgctcat ctcgacataa ctgatgatta tcccaattca 2700
cgcaacaagg tggtaaagtt gtccctcaaa ccatacaggt tcgacgttta ccttgataac 2760
ggggtataca agttcgttac cgttaagaac cttgacgtca ttaagaaaga gaactactat 2820
gaggttaata gtaagtgcta tgaagaagct aaaaaactaa aaaaaatatc caaccaggca 2880
gaatttatcg catcatttta caataacgat cttataaaga ttaacggcga actctacagg 2940
gtgattggtg tgaataatga tcttctgaat cgaattgagg tgaacatgat tgatatcact 3000
tacagggagt atcttgagaa catgaatgat aagcgccccc caaggatcat taaaactata 3060
gcatcaaaga cccaatcgat aaagaaatac agtacagata tattggggaa tctatacgag 3120
gttaaatcta agaaacaccc ccaaattatc aaaaaaggt 3159
<210> 15
<211> 3159
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 15
atgaaaagga actatatcct cggcctcgac ataggtatta ccagcgtcgg ctacggcatt 60
atcgactacg agacaagaga tgtcatcgac gccggggtga ggctctttaa ggaggccaac 120
gttgaaaaca atgaaggcag gagatctaag agaggagcca ggagacttaa gagaaggagg 180
aggcacagga tccagagggt caaaaagctc ctcttcgatt acaacttgct taccgatcat 240
agcgagctct ccggcatcaa cccctacgag gccagggtga agggcctcag tcagaaactc 300
tcagaggagg agttcagcgc cgcccttctc cacttggcca agaggagggg cgtgcacaac 360
gtgaatgagg tcgaggaaga caccgggaac gagctctcta ccaaggagca gatcagcagg 420
aacagtaagg cccttgagga gaagtacgtc gccgaactcc agctcgagag gcttaagaag 480
gacggcgagg tgagaggctc catcaacagg ttcaagacca gcgactacgt taaggaagcc 540
aaacaactcc tcaaggttca aaaggcctat caccaactcg atcagagctt catagacact 600
tacatcgacc tcctcgagac caggaggacc tactacgagg ggcccggcga ggggtccccc 660
tttgggtgga aggacatcaa agagtggtac gagatgctca tggggcactg cacttacttc 720
cccgaagagt tgaggtccgt caagtatgcc tataatgccg acctctacaa cgcactcaat 780
gacttgaata acctcgtgat cacaagagat gagaacgaga aactcgaata ctacgagaag 840
ttccaaatca ttgagaatgt cttcaaacaa aagaagaagc ccaccctcaa gcagatcgcc 900
aaggaaatct tggtcaacga agaggacatc aaaggttaca gggttacaag caccggcaag 960
cccgagttca ctaacctcaa ggtgtaccat gacatcaagg acatcaccgc caggaaagaa 1020
atcatcgaaa acgccgagct cctcgatcag atcgccaaaa tactcaccat ctaccagtcc 1080
agcgaggaca ttcaggagga gctcaccaac ctcaattctg agttgaccca agaggagatt 1140
gaacagatct ccaacctcaa gggctacact ggaacccata atctcagctt gaaggctatc 1200
aacctcatcc tcgatgagct ctggcacact aatgacaacc agatcgccat cttcaacagg 1260
ctcaaactcg tccccaagaa agtcgacctt agccagcaga aagagatccc caccaccctc 1320
gtcgacgact ttatcctttc ccccgtcgtt aagaggtcct tcatccagag tatcaaagtc 1380
atcaacgcca tcataaaaaa atacggactc cccaacgaca tcatcatcga actcgccagg 1440
gagaagaaca gcaaggacgc ccagaagatg attaatgaga tgcagaagag gaacaggcaa 1500
accaatgaaa gaattgaaga gattatcaga accaccggca aagagaacgc aaagtacctt 1560
atcgagaaaa tcaaactcca cgacatgcag gagggcaagt gcctctattc tctcgaagcc 1620
atccccctcg aagacctcct caacaacccc ttcaactacg aagtcgatca catcataccc 1680
aggagcgtta gcttcgacaa cagcttcaat aacaaagtgc tcgttaagca ggaggagaac 1740
agcaaaaagg gaaataggac cccctttcag tacttgtcaa gtagcgattc caagatcagc 1800
tacgaaacct tcaaaaagca catcctcaac cttgctaagg ggaaggggag gatcagcaag 1860
accaaaaagg agtacctcct cgaagagagg gacatcaaca ggttcagcgt ccagaaagac 1920
ttcatcaata gaaatctcgt tgacaccagg tacgccacta ggggcctcat gaatctcctc 1980
aggagctact tcagggtgaa taacctcgac gtgaaagtga agtccataaa cggcgggttc 2040
acctccttcc tcaggaggaa atggaagttc aagaaggaga ggaacaaggg gtataagcac 2100
cacgccgagg acgccttgat cattgccaat gctgacttca tctttaagga atggaagaaa 2160
ttggacaagg ccaagaaggt gatggagaac cagatgtttg aggagaagca ggccgagagc 2220
atgcccgaga tcgagaccga gcaggagtat aaggagatct tcatcacccc ccatcagatc 2280
aagcacatca aggacttcaa ggactataag tactcccata gagtcgacaa gaaacctaac 2340
agggagctca tcaacgatac cctctatagc accaggaagg atgacaaggg aaacacactc 2400
atcgtcaaca acctcaacgg cctctacgat aaagacaacg acaagctcaa gaagctcata 2460
aacaaaagcc ccgagaagtt gctcatgtac caccacgacc cccaaactta ccagaagctc 2520
aagctcatca tggagcagta cggcgacgag aagaacccat tgtataaata ctacgaagag 2580
accgggaact acctcaccaa gtactctaag aaggacaacg gccctgttat taagaagatc 2640
aaatactatg gcaacaagct caacgcccac ctcgacatca ccgacgacta tcccaattcc 2700
agaaataagg tcgtcaagct ctccttgaag ccttacaggt tcgacgttta cctcgacaac 2760
ggcgtttaca agttcgtgac cgtcaagaac ctcgacgtca tcaaaaagga aaactattat 2820
gaggtcaaca gcaagtgcta cgaagaggcc aagaaactca agaagataag caaccaggcc 2880
gaatttatcg cctccttcta caacaatgac ctcatcaaga tcaatggcga actttacagg 2940
gtcattggtg ttaacaacga tctcctcaac aggatcgagg tcaacatgat cgacatcacc 3000
tacagggaat atttggaaaa catgaacgac aaaaggcccc ccaggatcat aaagaccatc 3060
gccagcaaga cacagagcat caagaagtat tccacagata tcctcggtaa cctctacgag 3120
gttaagagca agaagcaccc ccagatcatc aaaaaaggg 3159
<210> 16
<211> 3159
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 16
atgaaaagga attacatcct cggtctcgac atcggcatca cctctgtcgg gtacggcatc 60
atcgattatg agaccaggga tgttatcgac gccggcgtga ggctcttcaa agaggccaac 120
gtcgaaaata acgagggtag aaggagcaag aggggcgcca gaaggcttaa gaggagaagg 180
aggcacagga tccagagagt caagaagttg ctttttgatt ataatctcct caccgaccat 240
tcagagctct ctgggatcaa tccctatgag gccagggtga agggcttgag ccagaagctc 300
agcgaggaag agttctctgc cgccctcctc cacctcgcta agaggagggg ggtgcacaat 360
gtcaacgaag ttgaggagga taccggtaac gagctctcca ccaaggaaca gatttccagg 420
aactcaaagg cccttgagga gaaatacgtt gccgagttgc agctcgagag gctcaagaag 480
gatggggagg tcagaggttc catcaatagg tttaagacta gcgactacgt taaggaggcc 540
aagcagctcc tcaaagtcca gaaggcctac caccagctcg accagagttt catcgatacc 600
tacatcgatt tgttggagac cagaaggacc tactatgagg gccccgggga gggcagcccc 660
tttggatgga aggacatcaa ggaatggtac gagatgctca tgggccactg cacctacttt 720
cccgaagaat tgaggtctgt gaagtacgcc tacaacgccg acctttataa cgccctcaac 780
gatttgaaca acctcgtgat caccagggac gagaacgaga agctcgagta ctacgagaaa 840
ttccagatca tcgagaacgt gtttaagcaa aagaagaagc ctacccttaa gcagatcgcc 900
aaggagatac ttgtcaacga ggaagatatt aagggataca gagttaccag cactggcaag 960
cccgagttca ccaacttgaa ggtgtaccac gatatcaagg atatcactgc caggaaagag 1020
ataatcgaga acgccgagct cctcgaccag atagcaaaga tcctcaccat ctatcagagc 1080
agcgaggata tccaggagga gctcaccaac ctcaactccg agctcactca ggaggagatc 1140
gagcagatct ccaacctcaa aggctacacc gggacccaca acttgtccct caaggccatc 1200
aatcttatcc tcgatgagct ctggcacacc aacgacaacc aaatcgccat cttcaacagg 1260
ctcaagctcg tccccaagaa ggtggacctt tcccaacaga aggagatccc caccaccttg 1320
gtcgatgact tcatccttag ccctgtcgtc aagaggagct ttatccaatc catcaaggtt 1380
atcaacgcca taataaagaa gtacggtctc cccaacgaca tcatcataga gctcgcaaga 1440
gagaagaaca gcaaggacgc acaaaagatg atcaacgaaa tgcagaaaag gaacaggcag 1500
actaatgaaa ggatcgagga gatcatcagg accaccggaa aggagaacgc caagtacctc 1560
atcgagaaga tcaaactcca cgacatgcaa gaaggtaaat gcctctacag cctcgaggcc 1620
atccccctcg aggacctcct taacaacccc ttcaattacg aagtcgacca catcatccca 1680
aggtccgtta gcttcgacaa cagtttcaac aacaaagtgc tcgttaagca ggaggagaac 1740
agcaagaagg gcaacaggac acccttccaa tacctctcat cctcagattc caaaatcagc 1800
tacgaaactt tcaagaaaca catcctcaac ctcgccaaag ggaagggaag gatcagcaag 1860
accaagaagg agtacctcct cgaagagagg gacatcaaca gattctccgt tcagaaggac 1920
ttcatcaata ggaacttggt tgacaccaga tacgccacca gggggcttat gaaccttctc 1980
aggtcctatt ttagggtgaa caacctcgac gtcaaagtca agagcatcaa cggtgggttc 2040
accagcttcc tcaggaggaa gtggaagttt aagaaagaga ggaacaaggg gtacaaacac 2100
cacgccgagg acgccctcat tatcgccaac gccgacttca tcttcaaaga gtggaagaag 2160
ctcgacaagg ccaagaaagt tatggagaat cagatgttcg aagagaaaca ggcagagagc 2220
atgcctgaga tcgagaccga gcaggaatac aaggagattt tcatcacccc ccaccagatt 2280
aagcacatta aggacttcaa ggattacaag tatagtcaca gggtggacaa gaaacccaac 2340
agggagctca tcaatgacac tctctacagc accaggaagg acgacaaagg caacaccctt 2400
atcgtcaaca atctcaacgg gctctacgac aaggataacg acaagttgaa gaagctcatt 2460
aacaaatcac ccgagaagct cctcatgtac catcacgacc ctcagaccta ccagaagctc 2520
aagctcatca tggagcagta cggtgacgag aaaaaccctc tctacaagta ctatgaggag 2580
actgggaact acctcaccaa gtacagcaag aaggacaacg gccctgtcat taagaagatc 2640
aagtactatg gcaacaagtt gaatgcccac ctcgacatca ccgatgatta ccccaactca 2700
agaaacaaag tggtcaaact cagcctcaaa ccatataggt ttgacgttta cctcgataat 2760
ggtgtttaca agtttgttac cgtgaagaac ctcgacgtga tcaagaagga gaattattat 2820
gaggttaatt ctaaatgcta tgaggaagcc aagaaactca agaagatcag caaccaggcc 2880
gagttcatcg ccagcttcta caacaatgac ctcatcaaaa tcaacggtga actctacagg 2940
gtgatcggcg tgaataatga cctcctcaac aggatcgagg tcaacatgat cgacatcacc 3000
tacagggagt acttggaaaa catgaacgac aagagacctc caagaatcat aaagaccatc 3060
gccagtaaga cccaatccat caagaagtac agcaccgaca tccttgggaa cctctatgaa 3120
gtcaagtcca agaagcatcc ccagatcatt aagaaggga 3159
<210> 17
<211> 3159
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 17
atgaagagga actatattct cggactcgac atcgggatta cctccgtggg gtacggcatc 60
atcgattacg agaccagaga cgtgatcgat gccggggtga gactcttcaa agaagccaac 120
gtggagaaca atgaggggag gagaagcaaa aggggtgcca ggaggttgaa gagaaggagg 180
agacatagga tccaaagagt gaagaaattg ctttttgatt acaatctcct caccgaccac 240
tccgagctct ccgggataaa cccctatgaa gccagggtca agggcctctc tcagaagctt 300
agcgaagagg agttcagtgc cgccctcctc cacttggcta aaaggagggg ggtgcacaat 360
gttaacgagg tcgaggagga caccggcaac gagctctcta ccaaggagca gatctccagg 420
aactctaagg ccttggaaga gaagtacgtg gctgaactcc agctcgaaag gctcaagaag 480
gatggggagg tcagaggctc cattaacagg ttcaaaacct ccgactacgt caaggaggcc 540
aaacagctcc tcaaggtgca gaaggcctac caccagctcg atcagagctt catagacacc 600
tacatcgacc tcctcgaaac caggaggaca tactacgagg gtcccggcga gggttccccc 660
ttcgggtgga aggatatcaa ggagtggtac gagatgctca tggggcactg cacctacttc 720
cccgaggagc tcaggtctgt taagtatgcc tacaacgccg acctctacaa cgcactcaac 780
gaccttaaca atttggtcat cactagggac gagaacgaga agctcgaata ctacgagaag 840
ttccagatca tcgagaatgt tttcaaacag aagaagaaac ccaccttgaa acagatcgct 900
aaagagatac tcgttaacga ggaggacatc aaagggtata gggtcaccag caccgggaag 960
cccgagttta ccaatctcaa agtttaccac gatatcaaag atatcaccgc caggaaggaa 1020
atcatcgaga acgccgaact tcttgaccag atagccaaga ttctcaccat ctatcagtcc 1080
tcagaggata tccaagagga gctcaccaac cttaactccg agcttaccca agaggagatc 1140
gaacagatca gtaacttgaa gggctatacc gggacccaca acctttcact caaggccatc 1200
aacctcatcc tcgacgagct ctggcatact aacgacaacc agatcgccat cttcaacagg 1260
ctcaagctcg tgcccaagaa ggtggacctc tcccaacaga aggagatacc caccaccctc 1320
gttgatgact ttatcctcag ccccgtcgtc aagaggtctt ttatccagag catcaaggtc 1380
atcaacgcca taatcaagaa gtacggcctc cccaacgaca tcatcatcga gctcgctagg 1440
gagaagaata gcaaggacgc ccagaagatg ataaacgaga tgcagaagag gaacaggcag 1500
actaacgaga ggattgagga aataatcagg accactggga aagaaaacgc caagtatttg 1560
atcgaaaaga tcaagctcca tgacatgcag gagggcaagt gcttgtactc cctcgaggcc 1620
atccccctcg aggatctcct taacaacccc ttcaactacg aggtggatca catcatcccc 1680
aggtccgttt ccttcgacaa ctctttcaac aacaaagtcc ttgtgaagca ggaggaaaac 1740
tccaagaagg ggaacagaac acccttccag tatctctcat cctcagattc aaagatctcc 1800
tacgagacct ttaagaagca catcctcaac ctcgccaagg ggaagggcag gataagcaag 1860
accaagaaag agtatctctt ggaggaaagg gacataaata ggtttagcgt gcagaaggac 1920
tttatcaaca ggaatctcgt tgacaccagg tacgccacca gagggctcat gaatctcctc 1980
aggtcctact tcagggtgaa caatctcgat gtgaaggtca agtccatcaa cgggggcttc 2040
acctccttcc tcagaaggaa atggaagttc aagaaagaaa ggaacaaggg atacaagcat 2100
catgccgagg acgcccttat catcgccaac gctgacttta tattcaagga gtggaagaag 2160
ctcgataaag ccaagaaggt gatggagaac cagatgttcg aggagaagca ggccgagtcc 2220
atgcccgaga tcgagaccga gcaggaatat aaagagatct ttatcacccc ccaccagatc 2280
aagcacataa aggacttcaa agattacaag tactcccaca gagtggacaa gaagcccaac 2340
agggagctca tcaacgacac cctctactcc accaggaagg acgataaggg aaacaccttg 2400
attgtcaata acctcaacgg cctctacgac aaggacaatg ataagctcaa gaagctcatc 2460
aacaagagcc ccgagaagct tctcatgtac caccacgacc cccagaccta tcagaagctc 2520
aagctcatca tggagcagta cggcgacgag aagaaccccc tctacaagta ctacgaggag 2580
accggaaact atctcaccaa gtacagtaag aaggacaacg ggcccgtcat caagaagatt 2640
aaatactatg gaaacaagct caacgcccac cttgacatca ccgacgatta ccccaattcc 2700
aggaacaagg tcgtgaagct ctccttgaaa ccctataggt tcgacgtcta tttggacaac 2760
ggtgtgtaca agttcgttac cgttaaaaac ctcgacgtga tcaagaagga aaactactat 2820
gaggttaaca gcaagtgcta tgaggaggcc aagaagctca agaagattag caaccaggca 2880
gagttcatcg cttcctttta caacaacgat ctcatcaaga tcaacggtga gctctatagg 2940
gtcatcggcg tgaataatga cctcctcaac aggattgaag ttaatatgat cgacatcacc 3000
tacagggagt accttgagaa catgaacgat aagagacccc ccagaatcat aaaaaccatc 3060
gcctccaaga cccagagcat aaagaagtat tcaaccgata tcctcggcaa cctctatgag 3120
gtgaagtcca aaaagcaccc ccagatcatt aaaaagggc 3159
<210> 18
<211> 3159
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 18
atgaagagga actacattct cgggctcgac atcgggatca cctcagtcgg gtacgggatt 60
atcgactacg agaccagaga tgtgatcgac gccggcgtca ggctcttcaa ggaagcaaac 120
gttgagaaca acgaggggag gagatccaag agaggagcca ggcgcctcaa aagaaggaga 180
aggcacagga tccagagggt gaagaaactc ctctttgact acaacctcct caccgaccat 240
agcgagctct ctggaatcaa cccctacgag gccagggtga aaggcctctc tcagaagttg 300
tcagaggaag agttcagcgc cgccttgttg cacctcgcaa agaggagggg cgtgcataac 360
gtcaatgaag ttgaagagga taccggcaac gagctctcaa ccaaggagca aataagcaga 420
aatagtaaag cacttgagga gaagtacgtg gctgaactcc aactcgagag gctcaaaaag 480
gacggcgagg tcaggggtag cattaatagg tttaaaacct ccgattacgt taaggaagca 540
aagcaattgc ttaaggtcca aaaggcctac caccagctcg atcaaagctt catcgacacc 600
tacatagacc tccttgagac caggagaaca tactacgaag gccccggcga ggggtccccc 660
ttcggttgga aggacattaa ggagtggtac gagatgctca tgggacattg cacctacttc 720
cccgaagaac tcaggagcgt gaagtacgcc tacaacgccg acctctacaa cgcccttaac 780
gatttgaaca atctcgtcat cactagggac gagaatgaaa agctcgagta ctacgaaaag 840
ttccagatca tcgagaatgt cttcaagcag aagaagaagc ccaccttgaa gcagatcgcc 900
aaggagatcc tcgtgaatga ggaggacatc aaggggtaca gggtgaccag caccggtaaa 960
ccagagttca ccaacttgaa ggtctaccac gatatcaagg acatcaccgc caggaaagaa 1020
atcatcgaaa atgccgaact ccttgaccag atcgccaaaa tcttgacaat ctaccagagc 1080
agcgaggaca tccaagagga actcaccaac cttaatagcg agttgaccca ggaggagatc 1140
gaacagatct ccaaccttaa aggctacacc ggcacccata acctctcctt gaaggccatc 1200
aacctcatcc tcgacgagct ttggcacacc aatgacaacc aaatcgccat cttcaacagg 1260
cttaagctcg ttcccaagaa ggttgacctc agccagcaaa aggagatacc caccactctc 1320
gtcgatgatt tcatcctctc tcccgttgtg aagaggtcct tcatccagtc cattaaggtg 1380
atcaacgcca tcattaaaaa gtacgggctc ccaaatgaca taattatcga gcttgcaagg 1440
gagaagaatt ccaaggacgc ccagaagatg atcaacgaaa tgcagaagag gaataggcaa 1500
acaaacgaga ggatcgagga gatcatcaga acaaccggga aagagaacgc caagtacctc 1560
atcgaaaaaa tcaagctcca cgacatgcag gagggcaaat gtctctacag tctcgaggcc 1620
atacccctcg aggacctcct caacaaccca tttaattatg aagtggacca catcatcccc 1680
aggtccgtgt ccttcgataa ctccttcaac aacaaggtgc tcgtcaagca agaagaaaat 1740
agcaagaagg gtaacaggac ccccttccaa tacctctcaa gctccgattc taagatcagc 1800
tatgagacct tcaagaaaca catccttaac ctcgccaagg ggaaaggaag gataagcaaa 1860
actaaaaagg agtatctcct cgaagagagg gatatcaaca ggtttagtgt gcagaaagac 1920
tttatcaata gaaacctcgt ggacaccagg tacgccacca gaggtctcat gaacctcctc 1980
aggagctact tcagggtgaa caatctcgac gttaaggtta agtctatcaa cgggggcttt 2040
accagcttcc tcaggaggaa gtggaagttt aagaaggaaa gaaacaaggg ttataagcac 2100
cacgccgagg acgccctcat catagccaac gctgacttca tcttcaagga atggaagaag 2160
ctcgacaagg ccaaaaaggt gatggagaac cagatgtttg aggaaaagca agccgagagt 2220
atgcccgaga ttgagacaga gcaagagtat aaggagattt tcatcacccc ccaccagatc 2280
aagcatatca aagacttcaa ggactacaag tacagccata gggtggacaa gaagcctaac 2340
agggagctca tcaatgacac cttgtattct accaggaaag atgataaggg gaacaccttg 2400
atcgttaaca acctcaacgg cctctacgac aaggataacg acaagctcaa gaagctcatc 2460
aacaaaagtc ccgaaaagct cttgatgtac catcacgacc cacagaccta ccagaagttg 2520
aagctcatca tggagcagta cggggacgaa aagaaccccc tttacaagta ctacgaggaa 2580
accgggaact acctcaccaa gtactctaag aaagacaatg gccccgtcat caagaagatc 2640
aagtattacg ggaataagct taatgcccac cttgatatca ccgacgacta ccccaactcc 2700
agaaacaagg ttgttaaact ctccctcaaa ccatatagat ttgacgtcta cctcgacaac 2760
ggagtctaca agttcgtcac cgtgaaaaac cttgatgtca tcaaaaagga gaactattat 2820
gaggtgaata gcaaatgcta tgaagaggcc aagaagttga agaagatctc taatcaagct 2880
gagttcatcg ccagcttcta taacaatgac ctcattaaaa tcaacggcga gctttacaga 2940
gtgatcggag tgaacaacga cctcctcaac aggatcgagg ttaacatgat cgacataacc 3000
tacagggagt acctcgagaa catgaacgac aagaggccac caaggatcat caaaaccatc 3060
gccagtaaga cccagagcat taaaaaatac agcacagaca tcctcggcaa tttgtacgaa 3120
gtgaagagca agaagcaccc ccagatcatc aagaagggg 3159
<210> 19
<211> 3159
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 19
atgaagagga actatatcct cggactcgac attgggatca ctagcgtcgg gtacgggatc 60
atagactacg agactagaga tgtgatcgac gccggcgtca ggcttttcaa agaggccaac 120
gttgaaaaca acgaagggag aaggtccaag agaggcgcca ggaggctcaa gaggaggagg 180
aggcatagga tccagagagt gaagaagttg ctcttcgatt acaatctcct caccgatcac 240
tccgagctca gcggcatcaa tccctacgag gccagggtga agggattgag ccagaagctc 300
tccgaggagg agttctccgc agcactcctt cacctcgcca aaagaagggg agtccataac 360
gttaacgaag tggaggagga caccggcaat gagctcagca ccaaagagca gatcagcaga 420
aactccaagg cacttgagga gaagtatgtt gccgagttgc agttggagag gctcaagaag 480
gacggcgagg tgagggggag catcaacagg ttcaaaacct ccgattacgt taaagaggcc 540
aaacagctcc ttaaggtcca gaaagcctac catcagttgg accaaagctt catagacacc 600
tacattgacc tcctcgagac caggaggacc tactacgagg gtcctggaga ggggagccca 660
ttcgggtgga aggacatcaa ggagtggtac gagatgctca tgggccactg tacctacttc 720
cccgaagagc tcaggagcgt taagtacgcc tacaacgccg atctctacaa cgccctcaat 780
gacctcaaca acctcgttat caccagggac gagaacgaga aactcgaata ctacgaaaaa 840
tttcagataa tcgaaaatgt cttcaagcag aagaagaaac ccacactcaa gcagatcgcc 900
aaggaaatcc tcgttaacga agaggacatt aagggttata gggtgaccag caccggtaag 960
cctgagttca ccaacctcaa agtgtaccac gatataaagg acataacagc taggaaggag 1020
ataatcgaga acgccgagct tctcgaccaa atcgcaaaaa tccttaccat ctaccagtcc 1080
tccgaggata tccaggaaga actcacaaac cttaactccg aactcaccca ggaagagatc 1140
gagcagatct ccaacctcaa agggtatacc gggacccaca atctcagctt gaaggctatt 1200
aacctcattc tcgacgaact ctggcatacc aatgacaatc agatagccat ctttaatagg 1260
ctcaaacttg tccccaagaa agtcgacctc tcccagcaga aggagatccc caccaccctc 1320
gtggacgatt tcatcctttc cccagtcgtc aagagaagtt tcatccagag catcaaagtc 1380
atcaacgcca taatcaagaa atacggcctc cccaatgaca tcataattga gcttgcaaga 1440
gagaagaact ccaaggatgc acagaagatg atcaatgaga tgcaaaagag gaataggcaa 1500
acaaatgaga ggatcgagga gatcatcagg acaacaggca aggaaaacgc caaatacctc 1560
atcgagaaga taaagctcca cgatatgcag gaaggaaagt gcttgtacag cctcgaggcc 1620
ataccccttg aagacctcct caacaacccc ttcaactacg aggtcgacca tatcatcccc 1680
agatctgtca gcttcgacaa ctccttcaac aacaaggtgc tcgttaagca agaggagaac 1740
tcaaagaagg ggaacaggac ccccttccag tacctctcca gcagcgattc caaaatctcc 1800
tacgaaacct tcaagaaaca tattctcaac ctcgccaaag gcaagggaag gatatccaag 1860
accaagaagg agtacctcct cgaggagaga gatatcaaca ggttctccgt ccagaaagac 1920
tttatcaaca ggaatctcgt ggataccagg tacgccacca ggggccttat gaacctcctc 1980
agatcctact tcagggtcaa caacttggac gtcaaggtca agagcatcaa cggcgggttt 2040
accagcttct tgaggaggaa atggaagttt aagaaggaga ggaacaaggg gtacaagcac 2100
cacgccgaag acgccctcat tatcgccaac gccgacttca tcttcaaaga atggaagaag 2160
ctcgacaagg ccaagaaagt catggaaaat cagatgttcg aggaaaaaca ggccgagagc 2220
atgcccgaga tcgagaccga gcaggagtac aaggagatct ttatcacccc ccatcagatc 2280
aagcacatca aggacttcaa ggactacaag tactcccaca gggttgataa gaagcccaat 2340
agagaattga tcaatgacac actctacagc acaaggaaag acgacaaggg taacaccctc 2400
atcgtcaata acctcaatgg cctctatgac aaagataacg ataagttgaa gaagctcatc 2460
aacaagtccc ccgagaaact cctcatgtac caccacgacc ctcagactta tcaaaaactt 2520
aaattgatca tggaacaata tggtgacgag aagaaccccc tctataaata ttacgaggag 2580
accgggaact acctcaccaa gtatagcaag aaggacaatg gacccgtcat caagaagatc 2640
aagtactatg ggaacaagct caacgcccac ctcgacatca ccgatgacta ccctaacagt 2700
cgaaacaagg ttgtcaagct ctcccttaag ccatacaggt ttgacgtgta cctcgacaac 2760
ggtgtttaca agttcgtgac tgttaagaac ctcgacgtca ttaagaagga gaattattac 2820
gaggtgaaca gcaaatgcta cgaggaagcc aaaaaactca agaagatctc caaccaggcc 2880
gagttcattg ccagcttcta caataatgac ctcatcaaga taaacgggga gctctacagg 2940
gtgatcggtg tcaataacga cctcctcaac aggattgagg tgaacatgat cgacatcacc 3000
tatagagagt acttggagaa catgaacgat aagaggcccc ctaggatcat aaagaccatc 3060
gccagcaaga cccagagtat taagaaatac agcaccgaca tcctcgggaa tctctacgag 3120
gtcaagagca aaaagcaccc ccagatcatt aagaagggg 3159
<210> 20
<211> 3159
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 20
atgaaaagaa actatatact cggcctcgac atcggcatca cctctgttgg gtacgggatc 60
atcgactacg agaccaggga cgtcatcgac gcaggagtca ggcttttcaa ggaagccaac 120
gttgaaaaca acgagggaag gaggagcaag aggggggcca ggagactcaa aaggaggagg 180
aggcacagga tccagagggt caagaagttg ctcttcgatt acaacctcct cacagaccac 240
agtgaactct ctgggatcaa tccttacgag gcaagggtta aggggctcag ccaaaaactc 300
tccgaggagg agtttagcgc cgccctcctc cacctcgcca agaggagggg cgtccacaac 360
gtgaacgagg ttgaggagga caccggcaac gagctctcca ccaaggagca gatttccaga 420
aacagcaagg cactcgagga gaagtatgtg gccgaattgc agttggagag gctcaagaag 480
gacggggagg tgaggggctc catcaacagg ttcaagacca gcgactacgt gaaagaggca 540
aagcaactcc tcaaagttca gaaggcctac caccaacttg accagagctt catcgatacc 600
tatatcgacc tcctcgagac aaggaggaca tactatgaag gccctggcga ggggagccct 660
tttggctgga aagacatcaa agagtggtat gagatgctca tgggccactg cacctacttc 720
cccgaagaac tcaggtccgt gaagtacgcc tacaatgctg acctttacaa cgccttgaac 780
gacctcaata acctcgtcat cactagggac gagaacgaga aactcgagta ttacgagaaa 840
ttccagatca tcgagaacgt gtttaagcag aagaagaagc ccacccttaa acagatcgcc 900
aaggaaatcc tcgttaatga ggaggacatc aagggctata gggtgacctc caccggaaag 960
cccgaattca ctaacctcaa agtgtaccac gatatcaagg acatcaccgc cagaaaagag 1020
ataatcgaga atgccgagct cctcgatcaa atcgccaaaa tactcaccat ataccagtcc 1080
tccgaggaca tccaagagga gctcaccaac ctcaactccg aactcaccca ggaagaaata 1140
gagcagatct caaaccttaa ggggtatacc ggtacccaca acttgagctt gaaggccatc 1200
aacctcatct tggatgaact ctggcataca aacgacaacc agatagccat cttcaacagg 1260
ttgaagctcg tgcccaagaa ggtcgacctc agccaacaga aggagatacc cacaaccctc 1320
gtggacgact tcatcctctc ccccgtcgtg aagaggtcct tcatccagag tatcaaggtg 1380
atcaacgcta tcatcaagaa gtatggcttg cctaatgaca tcatcatcga gttggccagg 1440
gagaagaata gcaaggacgc ccagaaaatg atcaacgaaa tgcaaaagag aaacaggcag 1500
accaacgaga gaatagagga gatcatcagg actactggta aagagaacgc caagtatctc 1560
atcgagaaga tcaagctcca tgacatgcag gaggggaagt gcctctactc cctcgaggcc 1620
atccccctcg aggacctcct taacaacccc ttcaactacg aggtcgatca catcattcct 1680
aggtccgtga gcttcgacaa cagtttcaat aataaggtgc tcgtgaagca agaggagaat 1740
tccaagaagg ggaatagaac tcctttccaa tatctcagta gctccgactc caagatctcc 1800
tacgagactt tcaaaaagca catcctcaac ctcgccaaag gcaagggcag gatttccaaa 1860
accaagaagg agtatctcct cgaggaaaga gacattaaca ggttcagcgt ccagaaggac 1920
tttataaaca gaaacctcgt cgacaccagg tatgccacta ggggccttat gaatctcctc 1980
agatcctact tcagagtgaa caaccttgat gttaaagtta agtccatcaa tgggggattc 2040
accagcttcc tcagaaggaa atggaagttc aagaaggaaa ggaacaaggg gtacaaacac 2100
cacgccgaag atgcactcat catagccaac gccgatttta tattcaaaga gtggaagaag 2160
ctcgacaagg ctaagaaagt catggagaac cagatgttcg aggagaagca ggccgagtcc 2220
atgcccgaga tcgaaaccga gcaggagtac aaagaaatat tcatcacccc ccaccaaatc 2280
aagcacatca aggactttaa ggactataag tactcccata gggtcgataa aaagccaaac 2340
agggagctca tcaacgacac cctctatagt accagaaaag acgacaaggg aaacaccctc 2400
atcgtgaaca acctcaacgg actctacgac aaggataacg ataagctcaa gaaactcatt 2460
aacaagtccc ccgaaaaact cctcatgtac caccacgacc cccaaaccta ccagaagctc 2520
aaattgatca tggaacagta tggggacgag aagaaccctt tgtacaaata ctatgaggag 2580
accggtaact atcttaccaa gtacagcaag aaggacaacg gccccgtcat caagaagatc 2640
aagtactacg ggaacaagtt gaacgcacac ctcgatatca ccgatgatta ccccaactcc 2700
agaaataagg tggttaagct cagcttgaag ccctacagat tcgacgtcta cctcgataac 2760
ggggtctaca agttcgtcac cgtgaagaat ctcgacgtca tcaagaagga gaactactac 2820
gaggttaact caaagtgcta tgaggaggcc aagaagctca agaagatctc caaccaggcc 2880
gagttcatcg cttccttcta caataatgac ctcatcaaga tcaatgggga gctttacagg 2940
gtcatcggcg ttaacaatga cctcctcaac aggatcgagg tgaacatgat cgacatcacc 3000
tacagggaat acctcgagaa catgaacgac aagagacctc ctaggataat caagacaata 3060
gcctcaaaga cccagagcat caagaagtac tccaccgaca tcctcgggaa cctctacgag 3120
gtgaagagta agaagcaccc ccagatcatc aagaagggc 3159
<210> 21
<211> 3159
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 21
atgaagagaa actatatcct cgggctcgac atcgggatca catccgtggg ctatggtatc 60
atcgactacg agaccaggga cgttatagat gccggagtga ggctttttaa agaggccaac 120
gttgagaaca acgagggaag aaggagcaaa aggggggcca gaaggctcaa gaggaggagg 180
agacacagga tacagagggt gaagaagctc ctcttcgatt acaacctcct cactgatcac 240
tccgagctct ccggaatcaa tccctacgag gccagggtta aaggactcag ccagaagttg 300
tccgaggagg agtttagcgc agcactcttg cacctcgcca agaggagagg agtccacaac 360
gttaatgagg ttgaggagga cactggcaac gaactctcca ccaaggagca gatctcaaga 420
aacagcaagg ccctcgagga gaagtacgtt gccgagctcc agttggagag actcaagaag 480
gatggggagg tcaggggctc cataaacagg ttcaagacca gcgactatgt caaggaggct 540
aagcaactcc tcaaggtgca aaaggcctac caccagctcg accagagctt catcgacacc 600
tatatcgacc tcctcgagac cagaagaacc tattacgagg ggcctggcga gggctccccc 660
tttggatgga aggacatcaa ggagtggtac gaaatgctta tgggccactg cacatatttc 720
cccgaagaac tcagatccgt caagtacgcc tataacgccg atctctacaa cgccctcaac 780
gacctcaaca acctcgtgat caccagggac gaaaatgaga aactcgaata ctacgagaag 840
ttccagatca tcgagaatgt ttttaagcag aaaaaaaagc ccaccctcaa gcagatcgcc 900
aaggagatcc ttgtcaatga agaggacatt aagggctaca gggtgaccag caccggtaaa 960
cccgaattta ccaacctcaa ggtttaccac gacatcaagg acataaccgc cagaaaagaa 1020
atcatcgaga acgccgagct tttggaccaa atcgccaaga tcctcactat ctatcaaagc 1080
agcgaggaca tccaagagga gctcaccaac ctcaacagcg agctcactca ggaggaaatc 1140
gagcagataa gcaacctcaa ggggtacact ggaacccaca atctcagtct caaggccatc 1200
aacctcatcc tcgacgagct ctggcacacc aacgacaacc agatcgccat ttttaatagg 1260
cttaagttgg tgcccaagaa agttgacctc tcccaacaga aggagatccc aaccaccctc 1320
gttgacgatt tcatcctctc ccccgtcgtt aaaaggagct ttatccagag catcaaggtc 1380
atcaatgcca ttatcaaaaa gtacgggctt cccaacgata tcatcatcga gctcgccagg 1440
gaaaagaata gcaaagacgc ccaaaaaatg atcaacgaga tgcagaaaag aaataggcag 1500
accaacgaga ggatcgagga gatcatcagg accaccggaa aggagaacgc caagtacctc 1560
atcgaaaaaa tcaagctcca tgacatgcag gagggcaagt gtctctattc actcgaggcc 1620
atccccttgg aggacctcct caacaatccc tttaactacg aagtcgacca catcatcccc 1680
aggtcagtgt ccttcgacaa ctccttcaac aacaaggtgc tcgttaagca ggaggagaac 1740
agcaagaaag gcaacaggac accattccaa taccttagtt cttccgattc caagatctcc 1800
tacgaaacct ttaaaaagca tatcctcaac ctcgccaagg ggaagggtag gatctctaaa 1860
accaagaagg agtacctcct cgaggaaagg gacatcaaca gattctctgt gcagaaggac 1920
tttatcaaca ggaatctcgt ggacaccagg tacgcaacca ggggcctcat gaacctcctc 1980
aggtcctatt tcagggtgaa caatctcgac gtcaaggtta agagcatcaa cggggggttt 2040
acctcattcc ttaggaggaa atggaaattc aagaaggaaa ggaataaggg gtacaagcac 2100
cacgctgagg acgccctcat catcgctaat gccgacttca tcttcaaaga gtggaaaaag 2160
ctcgacaaag ctaagaaggt catggaaaat caaatgtttg aggagaagca ggccgagtct 2220
atgcccgaga tcgagacaga gcaggagtac aaggagatct ttattacccc ccaccagatc 2280
aagcacatca aggatttcaa agattacaag tattcccaca gagtggacaa aaaacccaac 2340
agggagctca tcaacgacac actctactcc accaggaagg atgacaaggg aaacaccctc 2400
atcgttaaca accttaacgg cctttacgat aaagacaacg ataagctcaa gaagctcatc 2460
aacaagtccc ccgagaagct cctcatgtac caccacgacc cccagactta ccagaaactc 2520
aagctcatca tggagcagta cggcgacgaa aagaatcccc tctataagta ctatgaggag 2580
actggaaatt acctcaccaa gtactccaaa aaggacaacg ggcctgtgat caagaaaatt 2640
aagtactatg gcaacaagct caacgcccac ctcgacatta ccgacgacta ccccaacagc 2700
aggaacaaag tggtcaagct ctctctcaag ccttacaggt ttgatgtcta cctcgacaac 2760
ggcgtgtaca agttcgtgac cgtgaaaaac ctcgacgtta tcaagaagga gaactactac 2820
gaggtgaatt ccaagtgtta cgaagaagcc aaaaagctca agaagatctc caatcaggcc 2880
gagttcatcg caagcttcta caacaatgat ctcatcaaga taaacggaga actctacaga 2940
gttatcggcg tcaacaatga tctccttaac agaattgagg tcaatatgat cgacatcacc 3000
tacagggaat accttgagaa tatgaacgac aagaggcccc caaggatcat aaagaccatc 3060
gccagtaaaa cccagtccat taaaaagtac tcaaccgaca tcctcggcaa cttgtacgag 3120
gtgaaatcta agaagcaccc ccagatcatc aagaagggc 3159
<210> 22
<211> 3159
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 22
atgaagagga attatatcct cggactcgat atcggcatca cctctgtggg gtacggaatc 60
atagactacg aaacaagaga tgtcatcgac gccggggtca ggcttttcaa agaagccaac 120
gttgagaaca acgaggggag gaggagcaaa aggggggcta ggaggctcaa gaggaggagg 180
aggcacagga ttcaaagggt gaagaagttg ctcttcgact acaacctcct cactgaccac 240
tccgagttgt ccggcatcaa tccttacgag gccagagtga aaggccttag ccaaaagctc 300
tccgaagagg agttcagcgc tgccttgctc caccttgcca agaggagagg ggtccacaac 360
gttaacgagg ttgaagagga taccgggaac gaactcagca ccaaggagca gatctccagg 420
aacagcaagg ccctcgaaga gaagtacgtg gccgagctcc aactcgagag gctcaagaag 480
gacggggagg ttagagggag catcaatagg tttaagacca gcgactacgt gaaggaggcc 540
aaacagttgc tcaaggtgca gaaggcctac catcaactcg accaaagctt tatagacacc 600
tacatcgatc tcctcgagac aagaaggaca tactacgagg gccccggaga ggggtcccct 660
ttcggctgga aggacatcaa agagtggtac gagatgctca tgggccactg cacctacttc 720
cccgaggagc tcaggtcagt caagtatgcc tacaacgctg acttgtacaa tgccctcaac 780
gaccttaaca atctcgtcat caccagggac gaaaatgaga agcttgagta ttacgagaaa 840
ttccagatta tcgagaatgt tttcaagcaa aagaagaaac ccaccctcaa gcagatcgcc 900
aaggagatac ttgttaatga ggaagacatc aagggctata gggtgaccag taccggtaaa 960
ccagaattca ctaatttgaa agtttaccac gacatcaagg acatcacagc caggaaagag 1020
attatcgaga atgccgagct cctcgatcag atcgcaaaga tcttgacaat atatcagtcc 1080
agtgaggaca tccaggaaga gctcaccaac ctcaacagcg agctcaccca agaggaaatc 1140
gaacagatca gcaacctcaa gggctacacc ggaacccata acctctcact caaagccatt 1200
aacctcatcc tcgacgagct ctggcatacc aatgataacc agatagccat attcaacagg 1260
cttaaactcg tccccaagaa ggtggacctc agccaacaaa aggagatccc cactaccctt 1320
gtcgacgact tcatcctcag tcccgtggtt aaaagatcct tcatccagtc catcaaggtg 1380
atcaacgcta tcataaagaa gtatgggctc cccaatgaca tcatcatcga acttgccagg 1440
gagaagaaca gtaaggacgc ccagaagatg atcaacgaaa tgcagaaaag gaacaggcag 1500
accaacgaga gaatcgaaga gatcatcagg accaccggga aggaaaatgc caagtatctc 1560
atcgagaaga tcaagctcca cgacatgcaa gaaggcaagt gcctctatag cctcgaggct 1620
atcccactcg aggatctcct caacaacccc ttcaactatg aagtcgacca tatcatcccc 1680
aggtccgtct cctttgacaa cagcttcaat aacaaggtgc tcgtgaaaca ggaggaaaat 1740
agcaagaagg gtaacaggac ccccttccag tacttgtcct cctccgactc caagatcagc 1800
tacgaaacct tcaagaagca catccttaac ctcgctaagg gaaaaggcag aatcagcaag 1860
accaagaagg agtatctcct cgaggagagg gacatcaaca ggttcagcgt ccagaaagac 1920
ttcatcaaca ggaacctcgt cgacaccagg tatgcaacta gggggttgat gaacctcctc 1980
aggagctact tcagagtcaa caacctcgac gttaaggtca agagtataaa tggtggcttc 2040
accagtttcc tcaggaggaa gtggaagttt aagaaggaga ggaacaaagg atacaagcac 2100
cacgccgagg atgccctcat tatcgcaaac gccgacttta tcttcaagga gtggaagaaa 2160
ctcgataagg ccaaaaaagt tatggagaac cagatgttcg aggagaagca ggccgaaagt 2220
atgcctgaga tagagaccga gcaggaatat aaagagattt tcatcactcc tcaccagata 2280
aagcatatca aggacttcaa ggactacaag tattcccaca gagttgacaa gaagcccaac 2340
agggagctta tcaacgacac cctctactcc accaggaagg acgacaaggg aaacacactc 2400
atcgttaata acttgaatgg actctacgac aaggacaatg acaaactcaa gaagctcatc 2460
aataagagcc ccgagaagct ccttatgtac caccatgacc cacagacata tcaaaagttg 2520
aagctcatca tggagcagta cggggacgaa aagaaccctc tctacaagta ctacgaggag 2580
actgggaact acctcaccaa gtattccaag aaagataatg ggcccgtcat caagaaaata 2640
aagtactacg gtaacaagct caatgcccac ctcgacatca ccgatgacta ccctaacagc 2700
aggaacaaag tcgttaagct ctcccttaaa ccctatagat tcgatgtcta cctcgacaat 2760
ggagtgtaca aatttgtcac cgtgaagaac ctcgatgtta tcaagaaaga aaactactat 2820
gaggtgaaca gcaagtgcta tgaggaggcc aagaagctta aaaagatcag caaccaggcc 2880
gaattcatcg cctcctttta caataacgac ctcatcaaga tcaacggaga gctctacaga 2940
gtgataggtg tgaataatga cctcctcaac agaatcgagg tcaacatgat cgacattacc 3000
tacagggagt acctcgaaaa tatgaacgac aaaaggcccc ccaggatcat caagaccatc 3060
gcatccaaga cccagtccat caaaaaatac tcaaccgaca tccttgggaa cctctacgag 3120
gtcaagagca agaaacatcc ccagatcatc aagaagggg 3159
<210> 23
<211> 3159
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 23
atgaagagga actatatcct cggccttgat atcggcataa cctccgtggg gtatggcatc 60
atagactacg agaccagaga cgtcatcgac gccggcgtta gactctttaa ggaggccaac 120
gtcgagaaca acgaggggag aaggagcaag aggggggcca ggaggctcaa gagaagaaga 180
aggcacagga tccagagggt gaagaagttg ctcttcgact ataatctcct caccgaccac 240
agcgagctca gcgggatcaa cccctacgag gccagggtca agggtttgtc ccagaagctc 300
agcgaagagg agttttctgc tgctctcctc cacctcgcca agaggagagg cgtccacaac 360
gtcaacgaag tggaggagga caccggcaac gagctttcca ccaaggagca gattagcaga 420
aacagtaaag ctcttgagga aaagtacgtc gccgagctcc agctcgagag gttgaagaag 480
gacggcgaag tgagaggttc cataaataga tttaagacct ccgactacgt taaagaggcc 540
aagcagcttt tgaaagtgca gaaggcctac caccaactcg accagtcatt catcgacaca 600
tacatcgacc ttctcgagac caggcgaacc tattacgaag gccccggcga gggcagcccc 660
tttggatgga aggacatcaa ggagtggtac gagatgctca tggggcactg tacttatttt 720
cccgaagagc tcaggtcagt gaagtacgcc tataacgccg acctctacaa cgccttgaac 780
gacctcaaca acctcgtcat aaccagggat gagaatgaaa agctcgagta ttacgaaaag 840
ttccagatca ttgaaaacgt gttcaaacaa aagaagaagc ccaccctcaa gcaaatcgcc 900
aaagagatcc tcgtgaacga agaggatatc aaggggtaca gggtgacctc caccggaaag 960
ccagagttca ccaaccttaa ggtttaccac gatatcaagg acataaccgc aagaaaagaa 1020
atcatcgaaa atgcagagct ccttgaccag atcgccaaga tcttgaccat ctaccaaagc 1080
tccgaggaca tccaagagga gctcaccaac ttgaactcag aactcaccca agaggagata 1140
gaacaaatca gcaacctcaa aggctatacc ggtacccaca acctcagcct caaggccatc 1200
aacctcatcc tcgacgaact ctggcatacc aacgataacc agatcgcaat ctttaatagg 1260
ctcaagctcg ttcccaagaa ggtcgatctc agccagcaaa aagagatccc cactaccctc 1320
gttgatgact tcatcctctc ccctgtggtc aagaggagct ttatccagtc aatcaaggtg 1380
atcaatgcca tcatcaagaa gtacgggttg cccaacgata tcatcataga gctcgcaagg 1440
gagaagaact ccaaggacgc tcagaagatg atcaatgaga tgcagaagag gaacaggcag 1500
accaacgagc gaatcgagga gatcataagg accaccggca aggagaacgc taagtacctc 1560
atcgaaaaga taaaactcca cgacatgcag gaaggtaagt gcttgtacag tttggaggcc 1620
atacccctcg aagacctcct caacaacccc ttcaactacg aggtggacca tatcatcccc 1680
aggagcgtca gctttgacaa cagcttcaac aataaggtct tggtgaagca ggaggaaaac 1740
tccaaaaagg gcaacagaac accatttcag tacctctcta gctccgacag caagatttcc 1800
tacgagactt ttaagaagca catcctcaac ctcgctaaag ggaaagggag aatcagcaaa 1860
accaaaaagg agtatttgct cgaggagaga gatatcaaca gattcagcgt tcagaaagac 1920
ttcatcaaca ggaacttggt tgacaccagg tatgccacaa ggggcttgat gaacctcctc 1980
aggagctatt tcagagtgaa caatttggac gtgaaggtca agagcattaa cggggggttc 2040
acctctttcc tcaggagaaa gtggaagttc aaaaaggaga gaaacaaggg gtacaagcac 2100
cacgccgagg acgccctcat aattgccaac gcagatttca tcttcaagga gtggaaaaag 2160
ctcgataagg caaagaaggt catggaaaac caaatgtttg aggagaagca ggccgagtct 2220
atgcccgaaa tcgagactga acaggagtac aaggaaatct tcataacccc acaccagatc 2280
aagcacatca aggacttcaa ggattacaag tactcccata gggtggacaa gaagcccaat 2340
agagagctta tcaatgatac cctctactcc actaggaagg acgacaaggg caacacactt 2400
atcgttaaca acctcaacgg gttgtatgac aaagacaatg acaagttgaa gaagttgatc 2460
aacaaaagcc ctgaaaagct cctcatgtac caccacgacc ctcaaaccta ccaaaagctc 2520
aagctcataa tggaacagta cggggacgag aagaaccccc tctataagta ctacgaagag 2580
accgggaatt acctcaccaa gtattccaag aaggacaatg gccctgttat caagaagatc 2640
aagtactacg gaaacaagct caacgcccac ctcgacatca ctgatgacta cccaaacagc 2700
agaaacaagg ttgtgaagct ctccctcaaa ccctacaggt tcgacgtcta cctcgataac 2760
ggcgtttata agtttgttac cgtcaaaaac ctcgacgtta tcaagaaaga gaactactat 2820
gaggttaata gcaagtgcta cgaggaggcc aagaagctta aaaagatctc caatcaggcc 2880
gagtttatcg caagctttta taataacgac ctcatcaaga tcaacggcga actctacagg 2940
gtcatcgggg tgaacaacga tctcctcaac agaatcgaag tgaatatgat agatattaca 3000
tacagggagt atttggagaa catgaacgat aagaggccac ccagaatcat aaagaccatc 3060
gccagcaaga cccagagcat caagaaatac agtaccgaca tcttggggaa cctctatgag 3120
gtgaaaagca agaagcaccc ccagatcatc aaaaaggga 3159
<210> 24
<211> 3159
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 24
atgaaaagga actatatcct cgggctcgac atcgggatca ccagcgttgg ctacggcatc 60
attgattatg aaacaaggga cgtgatcgat gctggggtta ggctcttcaa ggaagctaac 120
gtcgagaata atgaggggag gaggtccaag aggggcgcca ggaggctcaa gaggaggagg 180
aggcacagga tccaaagagt taagaagttg ctcttcgact acaacctcct caccgaccat 240
agcgagctca gcggcattaa cccctacgaa gccagagtga aggggctcag ccagaagctc 300
agcgaggaag agttctctgc cgcactcctc catcttgcaa agaggagagg cgtccacaac 360
gtcaatgaag ttgaagagga cactggaaat gagctctcca ccaaggagca gatcagtagg 420
aacagcaagg cccttgagga gaagtatgtg gccgagcttc agctcgaaag gcttaaaaaa 480
gacggagagg tgaggggctc catcaacagg tttaagacca gcgactacgt gaaggaggca 540
aagcaactcc tcaaggtgca gaaagcttac catcagctcg accaatcctt catcgacacc 600
tacatcgacc tcctcgagac caggagaact tactacgaag ggcctgggga ggggagcccc 660
ttcgggtgga aggacatcaa ggagtggtac gaaatgctca tggggcactg tacctacttc 720
cccgaggagc tcaggtctgt caaatacgcc tacaacgccg acctctacaa tgcccttaac 780
gatcttaaca acctcgtcat caccagggac gaaaacgaga aactcgaata ttacgagaag 840
ttccaaatca tcgaaaacgt ctttaaacag aagaagaagc ccaccctcaa gcagattgcc 900
aaagaaatcc tcgttaacga agaggacatc aaaggttaca gggtgacctc cactgggaag 960
cccgagttca ccaatctcaa ggtgtaccac gacatcaagg atattacagc aagaaaggag 1020
atcatcgaga atgccgagct cctcgaccag atcgctaaga tcctcaccat ctaccagtct 1080
tccgaagata tccaagagga acttaccaac ctcaacagcg agctcaccca agaggagatc 1140
gagcagatca gcaacttgaa gggttacacc ggcacccaca acctctctct caaggccata 1200
aacttgatcc tcgacgagtt gtggcacacc aacgacaatc agatcgctat cttcaatagg 1260
ctcaagctcg tccccaaaaa ggtcgacctc tcccagcaga aagagatccc caccactttg 1320
gttgacgact tcatcctttc ccccgtggtc aaaaggagct tcatccagag catcaaggtc 1380
atcaatgcaa tcattaagaa atacgggttg cccaatgaca tcatcataga gctcgccagg 1440
gagaagaact ccaaggacgc ccaaaaaatg atcaacgaga tgcagaagag gaacaggcag 1500
accaatgaga gaatcgaaga aatcattagg accaccggca aggagaacgc caagtacctc 1560
atcgaaaaga tcaagctcca cgacatgcaa gagggcaaat gcctctacag cctcgaggcc 1620
atcccccttg aggatctctt gaacaaccca ttcaattatg aggtcgacca catcattccc 1680
aggagcgtca gcttcgacaa ctccttcaac aacaaggtgt tggtgaaaca agaggagaac 1740
agcaagaagg gtaacagaac ccctttccag tacttgagca gcagcgatag caaaatctct 1800
tacgaaacct tcaaaaagca catcctcaac ctcgccaagg gcaaaggcag gatttccaag 1860
accaagaagg agtacttgct cgaggaaagg gacatcaaca ggttctccgt tcagaaggac 1920
ttcatcaaca ggaacctcgt ggacacaagg tacgccacca ggggcctcat gaacctcttg 1980
aggagctact tcagggtcaa taacctcgac gtcaaggtga aatctatcaa cggggggttc 2040
acctcattcc tcaggaggaa gtggaagttc aagaaggaga gaaacaaggg atacaaacac 2100
cacgccgagg acgccctcat catcgccaat gccgatttca ttttcaaaga gtggaagaag 2160
ctcgacaagg caaagaaggt catggaaaac cagatgtttg aggagaagca ggccgaatcc 2220
atgcccgaga tagagaccga acaggagtac aaggagatct tcatcacccc ccaccagatc 2280
aagcacatca aggacttcaa agactacaag tacagccaca gagtggacaa gaagcccaac 2340
agggagctca tcaatgacac tctctacagc actaggaagg acgataaggg gaacaccctc 2400
atcgtcaaca acctcaatgg tttgtatgat aaggacaacg acaaactcaa gaaactcatc 2460
aataagagcc ccgaaaaact cctcatgtat caccacgacc cccagaccta ccagaagttg 2520
aagctcatca tggagcagta cggggacgag aaaaatccct tgtacaagta ttacgaggag 2580
accgggaact acctcaccaa gtactccaaa aaggacaacg gtcccgttat caagaagatc 2640
aagtactacg ggaacaagct caatgcacat ctcgacataa ccgacgacta ccccaactcc 2700
aggaacaaag tggtcaagtt gtccttgaag ccctacaggt tcgatgtgta cttggataac 2760
ggcgtgtaca aattcgttac cgtgaagaac ttggacgtga tcaagaagga gaactactac 2820
gaggttaact ccaagtgcta tgaggaggct aaaaagctca agaaaatcag caaccaggcc 2880
gagttcatcg ccagctttta caacaacgac ttgatcaaga tcaacggcga gctctacagg 2940
gttataggcg tgaacaatga cttgctcaac aggatcgagg tcaacatgat cgacatcact 3000
tacagggagt acctcgagaa tatgaacgac aagaggccac ctaggatcat caaaaccatc 3060
gctagcaaga cccagagcat caagaagtac agcaccgaca tcttgggtaa tctttacgag 3120
gttaagagta aaaagcatcc ccagatcatc aaaaaggga 3159
<210> 25
<211> 1300
<212> PRT
<213> Francisella tularensis subsp. novicida
<400> 25
Met Ser Ile Tyr Gln Glu Phe Val Asn Lys Tyr Ser Leu Ser Lys Thr
1 5 10 15
Leu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Glu Asn Ile Lys
20 25 30
Ala Arg Gly Leu Ile Leu Asp Asp Glu Lys Arg Ala Lys Asp Tyr Lys
35 40 45
Lys Ala Lys Gln Ile Ile Asp Lys Tyr His Gln Phe Phe Ile Glu Glu
50 55 60
Ile Leu Ser Ser Val Cys Ile Ser Glu Asp Leu Leu Gln Asn Tyr Ser
65 70 75 80
Asp Val Tyr Phe Lys Leu Lys Lys Ser Asp Asp Asp Asn Leu Gln Lys
85 90 95
Asp Phe Lys Ser Ala Lys Asp Thr Ile Lys Lys Gln Ile Ser Glu Tyr
100 105 110
Ile Lys Asp Ser Glu Lys Phe Lys Asn Leu Phe Asn Gln Asn Leu Ile
115 120 125
Asp Ala Lys Lys Gly Gln Glu Ser Asp Leu Ile Leu Trp Leu Lys Gln
130 135 140
Ser Lys Asp Asn Gly Ile Glu Leu Phe Lys Ala Asn Ser Asp Ile Thr
145 150 155 160
Asp Ile Asp Glu Ala Leu Glu Ile Ile Lys Ser Phe Lys Gly Trp Thr
165 170 175
Thr Tyr Phe Lys Gly Phe His Glu Asn Arg Lys Asn Val Tyr Ser Ser
180 185 190
Asn Asp Ile Pro Thr Ser Ile Ile Tyr Arg Ile Val Asp Asp Asn Leu
195 200 205
Pro Lys Phe Leu Glu Asn Lys Ala Lys Tyr Glu Ser Leu Lys Asp Lys
210 215 220
Ala Pro Glu Ala Ile Asn Tyr Glu Gln Ile Lys Lys Asp Leu Ala Glu
225 230 235 240
Glu Leu Thr Phe Asp Ile Asp Tyr Lys Thr Ser Glu Val Asn Gln Arg
245 250 255
Val Phe Ser Leu Asp Glu Val Phe Glu Ile Ala Asn Phe Asn Asn Tyr
260 265 270
Leu Asn Gln Ser Gly Ile Thr Lys Phe Asn Thr Ile Ile Gly Gly Lys
275 280 285
Phe Val Asn Gly Glu Asn Thr Lys Arg Lys Gly Ile Asn Glu Tyr Ile
290 295 300
Asn Leu Tyr Ser Gln Gln Ile Asn Asp Lys Thr Leu Lys Lys Tyr Lys
305 310 315 320
Met Ser Val Leu Phe Lys Gln Ile Leu Ser Asp Thr Glu Ser Lys Ser
325 330 335
Phe Val Ile Asp Lys Leu Glu Asp Asp Ser Asp Val Val Thr Thr Met
340 345 350
Gln Ser Phe Tyr Glu Gln Ile Ala Ala Phe Lys Thr Val Glu Glu Lys
355 360 365
Ser Ile Lys Glu Thr Leu Ser Leu Leu Phe Asp Asp Leu Lys Ala Gln
370 375 380
Lys Leu Asp Leu Ser Lys Ile Tyr Phe Lys Asn Asp Lys Ser Leu Thr
385 390 395 400
Asp Leu Ser Gln Gln Val Phe Asp Asp Tyr Ser Val Ile Gly Thr Ala
405 410 415
Val Leu Glu Tyr Ile Thr Gln Gln Ile Ala Pro Lys Asn Leu Asp Asn
420 425 430
Pro Ser Lys Lys Glu Gln Glu Leu Ile Ala Lys Lys Thr Glu Lys Ala
435 440 445
Lys Tyr Leu Ser Leu Glu Thr Ile Lys Leu Ala Leu Glu Glu Phe Asn
450 455 460
Lys His Arg Asp Ile Asp Lys Gln Cys Arg Phe Glu Glu Ile Leu Ala
465 470 475 480
Asn Phe Ala Ala Ile Pro Met Ile Phe Asp Glu Ile Ala Gln Asn Lys
485 490 495
Asp Asn Leu Ala Gln Ile Ser Ile Lys Tyr Gln Asn Gln Gly Lys Lys
500 505 510
Asp Leu Leu Gln Ala Ser Ala Glu Asp Asp Val Lys Ala Ile Lys Asp
515 520 525
Leu Leu Asp Gln Thr Asn Asn Leu Leu His Lys Leu Lys Ile Phe His
530 535 540
Ile Ser Gln Ser Glu Asp Lys Ala Asn Ile Leu Asp Lys Asp Glu His
545 550 555 560
Phe Tyr Leu Val Phe Glu Glu Cys Tyr Phe Glu Leu Ala Asn Ile Val
565 570 575
Pro Leu Tyr Asn Lys Ile Arg Asn Tyr Ile Thr Gln Lys Pro Tyr Ser
580 585 590
Asp Glu Lys Phe Lys Leu Asn Phe Glu Asn Ser Thr Leu Ala Asn Gly
595 600 605
Trp Asp Lys Asn Lys Glu Pro Asp Asn Thr Ala Ile Leu Phe Ile Lys
610 615 620
Asp Asp Lys Tyr Tyr Leu Gly Val Met Asn Lys Lys Asn Asn Lys Ile
625 630 635 640
Phe Asp Asp Lys Ala Ile Lys Glu Asn Lys Gly Glu Gly Tyr Lys Lys
645 650 655
Ile Val Tyr Lys Leu Leu Pro Gly Ala Asn Lys Met Leu Pro Lys Val
660 665 670
Phe Phe Ser Ala Lys Ser Ile Lys Phe Tyr Asn Pro Ser Glu Asp Ile
675 680 685
Leu Arg Ile Arg Asn His Ser Thr His Thr Lys Asn Gly Ser Pro Gln
690 695 700
Lys Gly Tyr Glu Lys Phe Glu Phe Asn Ile Glu Asp Cys Arg Lys Phe
705 710 715 720
Ile Asp Phe Tyr Lys Gln Ser Ile Ser Lys His Pro Glu Trp Lys Asp
725 730 735
Phe Gly Phe Arg Phe Ser Asp Thr Gln Arg Tyr Asn Ser Ile Asp Glu
740 745 750
Phe Tyr Arg Glu Val Glu Asn Gln Gly Tyr Lys Leu Thr Phe Glu Asn
755 760 765
Ile Ser Glu Ser Tyr Ile Asp Ser Val Val Asn Gln Gly Lys Leu Tyr
770 775 780
Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ser Ala Tyr Ser Lys Gly Arg
785 790 795 800
Pro Asn Leu His Thr Leu Tyr Trp Lys Ala Leu Phe Asp Glu Arg Asn
805 810 815
Leu Gln Asp Val Val Tyr Lys Leu Asn Gly Glu Ala Glu Leu Phe Tyr
820 825 830
Arg Lys Gln Ser Ile Pro Lys Lys Ile Thr His Pro Ala Lys Glu Ala
835 840 845
Ile Ala Asn Lys Asn Lys Asp Asn Pro Lys Lys Glu Ser Val Phe Glu
850 855 860
Tyr Asp Leu Ile Lys Asp Lys Arg Phe Thr Glu Asp Lys Phe Phe Phe
865 870 875 880
His Cys Pro Ile Thr Ile Asn Phe Lys Ser Ser Gly Ala Asn Lys Phe
885 890 895
Asn Asp Glu Ile Asn Leu Leu Leu Lys Glu Lys Ala Asn Asp Val His
900 905 910
Ile Leu Ser Ile Asp Arg Gly Glu Arg His Leu Ala Tyr Tyr Thr Leu
915 920 925
Val Asp Gly Lys Gly Asn Ile Ile Lys Gln Asp Thr Phe Asn Ile Ile
930 935 940
Gly Asn Asp Arg Met Lys Thr Asn Tyr His Asp Lys Leu Ala Ala Ile
945 950 955 960
Glu Lys Asp Arg Asp Ser Ala Arg Lys Asp Trp Lys Lys Ile Asn Asn
965 970 975
Ile Lys Glu Met Lys Glu Gly Tyr Leu Ser Gln Val Val His Glu Ile
980 985 990
Ala Lys Leu Val Ile Glu Tyr Asn Ala Ile Val Val Phe Glu Asp Leu
995 1000 1005
Asn Phe Gly Phe Lys Arg Gly Arg Phe Lys Val Glu Lys Gln Val
1010 1015 1020
Tyr Gln Lys Leu Glu Lys Met Leu Ile Glu Lys Leu Asn Tyr Leu
1025 1030 1035
Val Phe Lys Asp Asn Glu Phe Asp Lys Thr Gly Gly Val Leu Arg
1040 1045 1050
Ala Tyr Gln Leu Thr Ala Pro Phe Glu Thr Phe Lys Lys Met Gly
1055 1060 1065
Lys Gln Thr Gly Ile Ile Tyr Tyr Val Pro Ala Gly Phe Thr Ser
1070 1075 1080
Lys Ile Cys Pro Val Thr Gly Phe Val Asn Gln Leu Tyr Pro Lys
1085 1090 1095
Tyr Glu Ser Val Ser Lys Ser Gln Glu Phe Phe Ser Lys Phe Asp
1100 1105 1110
Lys Ile Cys Tyr Asn Leu Asp Lys Gly Tyr Phe Glu Phe Ser Phe
1115 1120 1125
Asp Tyr Lys Asn Phe Gly Asp Lys Ala Ala Lys Gly Lys Trp Thr
1130 1135 1140
Ile Ala Ser Phe Gly Ser Arg Leu Ile Asn Phe Arg Asn Ser Asp
1145 1150 1155
Lys Asn His Asn Trp Asp Thr Arg Glu Val Tyr Pro Thr Lys Glu
1160 1165 1170
Leu Glu Lys Leu Leu Lys Asp Tyr Ser Ile Glu Tyr Gly His Gly
1175 1180 1185
Glu Cys Ile Lys Ala Ala Ile Cys Gly Glu Ser Asp Lys Lys Phe
1190 1195 1200
Phe Ala Lys Leu Thr Ser Val Leu Asn Thr Ile Leu Gln Met Arg
1205 1210 1215
Asn Ser Lys Thr Gly Thr Glu Leu Asp Tyr Leu Ile Ser Pro Val
1220 1225 1230
Ala Asp Val Asn Gly Asn Phe Phe Asp Ser Arg Gln Ala Pro Lys
1235 1240 1245
Asn Met Pro Gln Asp Ala Asp Ala Asn Gly Ala Tyr His Ile Gly
1250 1255 1260
Leu Lys Gly Leu Met Leu Leu Gly Arg Ile Lys Asn Asn Gln Glu
1265 1270 1275
Gly Lys Lys Leu Asn Leu Val Ile Lys Asn Glu Glu Tyr Phe Glu
1280 1285 1290
Phe Val Gln Asn Arg Asn Asn
1295 1300
<210> 26
<211> 3900
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 26
atgagcatct atcaggaatt tgtgaataag tattctctat caaagaccct gcgtttcgaa 60
ctcattcctc aaggcaaaac cttggaaaac attaaggcca gagggcttat cttggatgac 120
gagaaaaggg ccaaagatta caagaaggca aaacaaataa ttgacaaata tcatcaattc 180
ttcatcgaag aaattttgag tagtgtttgt atcagcgaag atcttcttca gaattattcg 240
gatgtctatt tcaagctcaa aaaaagcgac gacgataatt tgcagaagga tttcaaaagt 300
gcaaaggaca ctattaaaaa gcaaatttct gaatatatca aagattcaga gaagttcaag 360
aaccttttta accagaatct tattgacgct aagaaggggc aagagtccga tcttatacta 420
tggctaaaac aaagcaagga taacggtatt gaactcttca aagcaaattc cgatatcact 480
gatattgacg aagcacttga gattatcaaa tcttttaaag gttggactac atacttcaag 540
gggttccacg aaaatcgaaa gaatgtctat tcaagtaacg atattcccac aagtataatt 600
tataggatcg ttgatgacaa tttacccaag ttccttgaga ataaggcaaa gtacgaatcc 660
ttgaaggata aggctcctga agcaataaat tatgagcaaa taaaaaagga cctcgccgaa 720
gagcttacct ttgacattga ttacaaaacc agcgaagtca atcaaagagt tttttcattg 780
gacgaggttt tcgagattgc caattttaac aattacttga accagtctgg cattacaaag 840
ttcaacacaa ttatcggggg aaagtttgtt aatggtgaga atactaaaag gaagggcata 900
aatgagtata taaaccttta ttcccaacag ataaatgata agactttgaa gaagtataaa 960
atgtcagttc tcttcaaaca gattctgtcc gataccgaat cgaaatcttt tgttattgat 1020
aaattggagg acgacagcga cgtcgtgact acaatgcaga gcttttatga acaaatagct 1080
gcttttaaga ccgtagagga gaagagcatt aaagaaactc tgtcgttgct gttcgacgac 1140
ctaaaagctc aaaaacttga cctttcaaag atctacttca agaatgataa gtctctgacc 1200
gatttgtcac aacaagtttt tgatgactat agtgtaattg ggactgccgt tctagagtac 1260
attacccagc agattgctcc taagaatttg gataatccct ctaaaaagga acaggagtta 1320
atagctaaga aaactgagaa ggccaaatac cttagtttgg agacaattaa attagcctta 1380
gaagagttta acaagcaccg tgatattgac aaacaatgta gattcgaaga aattctggca 1440
aattttgccg ctattcctat gatttttgac gaaattgcac aaaacaagga taatctggca 1500
cagataagca taaagtacca aaaccagggt aagaaggatt tattgcaggc tagtgcagaa 1560
gacgatgtga aggcaataaa agacttattg gatcagacta ataacctgct gcataagcta 1620
aaaatcttcc acatcagcca aagcgaggat aaagctaata ttctagacaa agatgagcat 1680
ttttatttgg tcttcgaaga atgctatttt gaacttgcaa atattgttcc tctttataac 1740
aagatccgta actacataac acaaaaacct tattccgatg aaaagtttaa gttgaacttc 1800
gagaattcta ctttggccaa cggctgggac aaaaacaaag aacccgataa tactgctatt 1860
ttgttcatta aggacgataa gtattacctt ggggttatga ataagaaaaa caataagatc 1920
ttcgatgata aagcaatcaa ggagaacaag ggagagggtt ataagaaaat cgtctataaa 1980
ttgcttcctg gggcaaacaa gatgctacct aaggtgttct tcagcgcaaa gtccattaag 2040
ttttataacc cttccgagga tatattaagg attcgcaatc attctacgca tacgaaaaat 2100
ggtagcccac aaaagggata cgaaaaattt gaatttaata ttgaagactg caggaagttc 2160
atcgactttt acaaacaaag tatctctaaa caccccgagt ggaaggactt tggctttcgc 2220
ttctcagata ctcagagata taactcaata gatgaattct atcgtgaagt agagaaccag 2280
gggtacaaac taactttcga gaatattagc gagagctaca tagattcagt agttaatcaa 2340
ggtaagttgt atctgtttca aatctataat aaagacttct cagcatactc taagggtcga 2400
cccaacttgc ataccttgta ctggaaggcc ttgttcgatg aaaggaacct ccaagatgta 2460
gtttacaagc tgaatggtga ggccgagctg ttctatagaa agcaatctat tccgaaaaag 2520
atcacacatc ccgctaaaga agcaattgct aataaaaaca aggataatcc taaaaaggag 2580
tctgtttttg agtatgattt aattaaggat aaaaggttta ccgaggataa gtttttcttc 2640
cattgtccca taactatcaa ttttaagagt tctggtgcga ataagtttaa cgacgaaatt 2700
aatcttttgc ttaaggagaa ggcaaacgat gtgcacatct tgagtatcga caggggggag 2760
agacatctcg catattacac attagttgat ggcaagggaa acattatcaa acaggatact 2820
tttaatatta taggtaacga tcgtatgaag acaaactatc acgataagtt ggctgcaatc 2880
gaaaaagata gggactctgc ccgcaaggac tggaagaaaa ttaacaatat taaagaaatg 2940
aaggaagggt atctgtctca ggttgttcac gagatcgcga agcttgtaat agaatacaac 3000
gctattgtgg ttttcgaaga tctgaacttc ggtttcaaac gcggaaggtt taaggttgag 3060
aagcaggttt atcaaaagtt ggaaaaaatg ttgattgaaa aattgaacta tcttgtgttt 3120
aaagataacg aatttgataa aaccggtggc gttctccgtg catatcaact cactgcacca 3180
tttgaaacat ttaaaaaaat gggaaagcag acagggatta tctattatgt tccagcaggc 3240
tttacttcta aaatatgtcc cgtcaccggc tttgtgaacc aattgtaccc taaatatgag 3300
tccgtttcaa aaagtcagga gttcttttca aaatttgata aaatttgtta taacttagat 3360
aaaggttact tcgaattttc gttcgattat aaaaactttg gagataaagc cgcgaaggga 3420
aaatggacta tagcctcatt tgggtcaaga ttaataaact tcaggaattc tgataagaac 3480
cataactggg acaccaggga ggtgtaccca acaaaagagc tggaaaagct tcttaaggat 3540
tactcaatag agtatggaca cggcgaatgt ataaaggctg ccatttgcgg cgaatcggat 3600
aagaaatttt tcgctaaatt gacaagcgtg ctaaacacta tccttcaaat gagaaactca 3660
aagacaggta ctgaacttga ctaccttatc agtcctgtgg cggatgttaa tggtaatttt 3720
ttcgactctc gtcaggcacc aaaaaatatg ccccaagatg cagatgcaaa tggtgcctat 3780
cacattggac tcaaaggatt aatgctgttg ggaaggatta agaataacca agagggaaag 3840
aaattgaacc tcgtgataaa gaacgaggaa tatttcgaat tcgtccagaa cagaaataac 3900
<210> 27
<211> 3900
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 27
atgtccatat atcaggagtt cgtgaacaag tacagtctta gcaagacctt gaggtttgag 60
cttatccccc agggaaagac cctcgagaac atcaaggcca ggggcctcat cctcgacgac 120
gagaagaggg ccaaggacta caagaaggcc aagcagatta tcgacaagta ccatcaattt 180
ttcatcgagg agatactctc cagtgtttgc ataagcgagg acctcctcca gaattactcc 240
gacgtttatt tcaagctcaa gaagagcgac gacgacaatc tccagaaaga cttcaagtcc 300
gccaaggaca ccatcaagaa gcaaatcagc gagtatatca aggattccga gaagttcaag 360
aacctcttca atcaaaactt gatcgatgcc aagaagggcc aggaaagcga cctcattctc 420
tggctcaagc agtccaagga taacgggatt gagctcttca aggccaacag cgacatcact 480
gacattgacg aagccctcga gatcattaag tcatttaagg gttggaccac ctacttcaag 540
ggcttccatg agaacaggaa aaacgtgtac tcatccaacg acatccccac ctccatcatc 600
tacaggatcg tggacgacaa tctccccaaa ttcctcgaga ataaagccaa atatgagagc 660
ttgaaggaca aagcccccga ggccattaac tacgagcaga tcaagaagga tttggccgaa 720
gaacttacct ttgacatcga ctacaagacc tccgaagtga accaaagggt cttcagcctc 780
gatgaagtgt ttgaaatcgc caatttcaat aactacctca accaaagtgg gatcacaaag 840
ttcaatacca tcatcggagg caagttcgtt aacggcgaaa ataccaagag gaagggaatc 900
aatgagtaca tcaacctcta ctcacaacag attaacgata agaccctcaa aaagtacaaa 960
atgagcgtcc tctttaagca aatcctcagc gacaccgaga gcaagtcatt cgtgattgac 1020
aagttggagg acgactccga cgtggtgaca actatgcagt ccttctacga gcagatcgcc 1080
gcctttaaaa ccgtggagga gaagagcatc aaagagaccc tcagcttgct ctttgacgac 1140
ttgaaggccc agaaactcga cctttctaag atctatttca agaatgacaa gtccctcacc 1200
gacctcagcc agcaagtgtt cgacgactac tcagttatcg ggacagcagt cctcgaatac 1260
attacacagc agatcgcccc aaagaacctc gataacccct ccaagaagga gcaggagctc 1320
atcgccaaga aaaccgagaa ggctaagtat ctcagtttgg agactatcaa gctcgccctc 1380
gaggagttca acaagcacag agacatagac aagcaatgta ggtttgagga gatcttggcc 1440
aacttcgccg ccatccccat gatattcgac gagatcgccc agaacaaaga caacctcgcc 1500
cagatcagca tcaagtatca gaatcagggg aagaaggatc tccttcaggc ctcagccgaa 1560
gatgacgtca aggccatcaa ggatctcctc gaccaaacaa acaacctcct ccacaaactc 1620
aaaattttcc atatcagcca gtccgaagac aaggccaaca tcttggacaa ggacgagcat 1680
ttctacctcg tctttgaaga atgctacttc gagctcgcca atatcgtgcc actctacaat 1740
aaaatcagga actacatcac ccagaaacca tacagtgacg agaagttcaa acttaatttc 1800
gagaactcca ccctcgctaa cgggtgggat aagaacaagg agcccgacaa caccgcaatt 1860
cttttcatca aggacgacaa gtactacctc ggcgtcatga acaaaaagaa caacaaaatc 1920
ttcgatgata aagccatcaa ggagaacaag ggcgaggggt ataagaagat agtgtataag 1980
cttctccccg gcgccaacaa gatgctccca aaagtctttt ttagcgccaa gtccataaag 2040
ttctacaacc ccagtgagga tatcctcagg atcaggaacc actccactca cacaaagaac 2100
gggagccccc aaaaagggta cgagaaattc gaattcaaca tcgaagactg cagaaagttc 2160
atcgatttct acaagcagtc catctccaaa caccccgaat ggaaggattt cgggttcagg 2220
ttctcggaca cccaaaggta caacagcatc gacgaattct acagggaggt tgagaaccag 2280
ggttacaaat tgacattcga aaacatcagt gagagctata tcgattccgt ggtgaaccag 2340
gggaagctct acctctttca aatatataat aaggatttca gtgcctacag caaggggagg 2400
ccaaacttgc atacccttta ctggaaagca ctcttcgacg agaggaacct ccaggacgtg 2460
gtttataagt taaacggcga ggccgagctc ttctacagga aacagagtat ccccaaaaag 2520
atcacacacc ccgccaaaga ggccatagcc aacaagaaca aggacaaccc caagaaagaa 2580
tccgttttcg agtatgacct catcaaggac aagaggttca ccgaggacaa attcttcttt 2640
cactgcccta tcaccattaa ctttaagtcc tccggcgcca acaagttcaa cgacgagatc 2700
aacctcttgc ttaaggagaa ggctaatgac gttcacatac tcagcatcga caggggggag 2760
aggcatctcg cctactacac cctcgtggac ggcaaaggca acatcatcaa gcaagacacc 2820
ttcaacatta tcgggaacga caggatgaag accaactacc acgacaagct cgccgctatc 2880
gagaaagaca gggacagcgc cagaaaagac tggaagaaaa taaacaatat caaggagatg 2940
aaagaagggt acctcagcca agtcgtccat gagatcgcca agctcgtgat cgagtataac 3000
gctatagttg tcttcgagga cctcaacttc ggcttcaaaa gaggcaggtt caaagtggaa 3060
aagcaggtgt accagaaact cgaaaagatg ctcatcgaga agttgaacta tctcgttttc 3120
aaggataatg agttcgacaa aaccgggggc gtcctcaggg cctaccagct caccgcaccc 3180
ttcgagactt tcaaaaaaat gggcaagcag accggtatca tttactatgt ccccgccggt 3240
ttcacttcta agatctgccc cgtgacaggc tttgtgaacc agctttatcc caagtacgaa 3300
tcagtttcca agtcccagga gttcttctct aagtttgata agatatgcta caacctcgac 3360
aaggggtatt tcgaattcag cttcgattat aagaacttcg gagacaaggc cgccaaggga 3420
aagtggacca tcgccagctt tggttccagg ctcataaact tcaggaacag cgacaagaac 3480
cacaactggg ataccaggga ggtttaccct accaaagaac tcgagaagct cctcaaggac 3540
tactccatcg agtacggtca tggggagtgc atcaaggctg ccatctgcgg cgagagcgat 3600
aagaagttct tcgccaaact caccagcgtt ttgaacacca tcctccagat gaggaatagc 3660
aagaccggca ccgagctcga ctacctcatc tcacccgtcg ccgacgtgaa cggcaatttc 3720
ttcgattcta ggcaggcacc taagaacatg ccccaggacg cagacgccaa cggcgcctac 3780
cacatcgggc ttaaggggct catgctcttg ggcagaatca aaaacaacca agaggggaaa 3840
aaactcaact tggtgatcaa gaacgaggaa tacttcgagt tcgtccagaa caggaacaac 3900
<210> 28
<211> 3900
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 28
atgagcatat accaagagtt cgtgaacaag tacagcctct ctaagaccct taggtttgag 60
ctcatccccc aggggaaaac cctcgagaac atcaaggcca ggggcctcat cctcgacgat 120
gagaagaggg ccaaggacta caagaaagcc aagcagatca tcgacaagta tcaccagttc 180
ttcatcgaag agatcctctc ctccgtgtgc atcagtgagg acttgctcca aaattactcc 240
gatgtgtact tcaagctcaa gaagtcagat gatgacaacc ttcagaagga ctttaagagt 300
gccaaagata ccatcaaaaa acagatcagc gagtacatca aggactccga gaagttcaag 360
aacttgttca accagaacct cattgacgcc aagaaggggc aggagtccga tctcatcctc 420
tggctcaagc agagcaaaga caacgggatc gagctcttca aggctaactc agacataacc 480
gacatcgatg aggccctcga gatcatcaaa tcctttaagg ggtggaccac ttattttaag 540
gggtttcacg agaacaggaa gaatgtctat agcagcaacg acatccccac cagcatcatc 600
tataggatcg tggatgataa tcttcccaaa tttttggaaa acaaggccaa gtacgaatct 660
ctcaaggaca aggcccccga ggccattaac tacgaacaga tcaagaagga cctcgccgag 720
gaacttacct tcgacattga ttacaagacc tccgaggtga accagagggt gttcagcctc 780
gacgaggtgt ttgagatcgc aaacttcaat aactacctca accaaagcgg tatcaccaag 840
ttcaacacca tcattggggg gaagttcgtc aacggtgaga acactaaaag gaaagggatc 900
aacgaatata tcaatctcta ctctcagcag atcaatgaca agaccctcaa aaagtacaag 960
atgagcgttc tcttcaaaca gatcctctcc gataccgagt ccaagagttt cgtgatcgat 1020
aagctcgaag acgacagtga tgtggtgacc accatgcaga gcttttacga acagatcgcc 1080
gctttcaaaa ccgtggagga aaaatccatc aaagagaccc tttccctcct cttcgacgac 1140
ctcaaggccc aaaagcttga cctcagcaag atttacttca aaaacgacaa gagcctcacc 1200
gacctcagcc agcaggtttt cgacgactac tccgttattg gcaccgccgt gttggagtac 1260
atcacccagc agatcgcccc aaaaaacctc gacaacccct ccaagaagga gcaggaactc 1320
atcgccaaga agacagaaaa ggccaaatac ctttcacttg agacaatcaa gctcgccctc 1380
gaggagttca acaagcacag ggatatcgac aagcaatgta ggttcgaaga gatcctcgca 1440
aacttcgccg ccatccccat gatctttgac gaaatcgcac agaacaagga caatctcgcc 1500
cagatcagca tcaagtacca gaaccagggc aaaaaggatc tcttgcaggc tagcgctgag 1560
gacgatgtta aggccatcaa agacctcttg gatcagacca acaaccttct ccacaagctc 1620
aagatcttcc acatctccca gtccgaagac aaggccaaca tcttggacaa ggatgagcat 1680
ttttacctcg tgttcgaaga atgctacttc gagctcgcta acatcgtccc cttgtacaac 1740
aagatcagaa actatattac ccagaagccc tactccgacg aaaagtttaa gctcaacttt 1800
gagaattcaa cattggcaaa cggctgggat aagaacaagg agccagacaa taccgccatc 1860
ctctttatca aggatgataa gtactacctc ggagtcatga ataagaagaa taacaagata 1920
ttcgacgaca aagccatcaa ggagaacaag ggggagggct acaagaagat cgtgtataag 1980
ctcctccccg gcgccaacaa gatgttgccc aaggtgttct tcagcgctaa aagcatcaag 2040
ttttataacc ccagtgaaga tattctcagg ataaggaatc acagcacaca caccaagaac 2100
gggagccccc agaaagggta cgaaaagttc gagttcaata ttgaggactg taggaagttt 2160
atagacttct ataagcagag catctccaag caccccgagt ggaaagactt tggattcagg 2220
ttctccgaca cccagaggta taactcaatc gacgagttct acagggaggt tgagaatcaa 2280
ggttacaagc tcacctttga gaacatctcc gagtcctaca tcgactctgt cgtgaaccag 2340
gggaagctct atctcttcca gatctacaac aaggacttca gcgcatacag caagggtagg 2400
cctaatctcc acacccttta ctggaaagcc ctcttcgacg agaggaacct tcaggacgtg 2460
gtttataaac tcaacggtga ggccgagctc ttttacagga agcagtccat ccccaaaaag 2520
attactcacc cagccaagga ggccatcgcc aataagaaca aggacaaccc caagaaggaa 2580
tccgtctttg agtatgacct cataaaagat aagaggttta ccgaggacaa gttcttcttt 2640
cactgtccca tcaccataaa cttcaagtct agcggggcca acaagttcaa cgacgagata 2700
aacctccttc tcaaggagaa ggcaaatgac gtgcacattc tctccatcga caggggcgag 2760
aggcatttgg cctattacac cctcgtggat gggaagggca acattattaa gcaagacacc 2820
ttcaacatca taggaaacga caggatgaag accaactatc acgacaaatt ggccgctata 2880
gagaaggaca gagactcagc caggaaggat tggaagaaaa taaacaacat caaggagatg 2940
aaggagggct acttgagcca ggttgtccac gaaatagcca agttggtcat tgaatacaac 3000
gccatagttg tcttcgagga tctcaacttt ggctttaaga gggggaggtt caaggtggag 3060
aagcaggtgt accagaagct cgagaaaatg ctcatcgaaa agctcaatta cttggtgttc 3120
aaggataatg agtttgacaa gaccgggggc gtcctcagag cctaccagct caccgcccca 3180
ttcgagacct tcaagaaaat ggggaagcag accgggatca tctactacgt gcccgccggg 3240
ttcacctcaa agatatgccc cgtgacaggc ttcgtcaacc aattgtaccc caagtacgag 3300
tccgtctcca agtcccagga gttcttcagc aagtttgaca aaatttgcta caacctcgac 3360
aagggctact tcgagttctc cttcgattac aagaattttg gggacaaagc cgccaaaggg 3420
aagtggacaa tagcaagctt cgggtcaagg ctcatcaatt ttagaaactc agacaaaaac 3480
cacaactggg acactaggga ggtgtatcca accaaggagc tcgagaagct cctcaaggac 3540
tacagtatcg agtacggaca cggcgaatgc atcaaggccg ccatctgcgg cgagtcagac 3600
aagaaattct tcgccaaact cactagcgtc ctcaacacaa tcctccagat gaggaactcc 3660
aaaaccggga ctgagctcga ctacctcatc agcccagtcg ccgacgttaa cggcaatttc 3720
ttcgattcca ggcaggcccc caagaacatg ccccaggacg ccgacgccaa tggggcctat 3780
catatcgggc tcaaagggct catgttgctc ggcaggatca agaacaatca ggagggaaag 3840
aagttgaacc tcgttatcaa aaacgaggag tacttcgagt tcgttcagaa taggaacaat 3900
<210> 29
<211> 3900
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 29
atgtcaatct accaagaatt cgttaacaaa tactccctca gcaaaaccct tagattcgag 60
ctcatccccc agggcaaaac actcgagaac attaaggcta gggggcttat cctcgacgac 120
gaaaagaggg ccaaggacta caagaaagcc aagcagataa tcgacaaata ccaccagttc 180
ttcatcgagg agatcctctc ctccgtttgc atctccgagg acctccttca gaactacagt 240
gatgtgtact ttaagctcaa aaagtccgac gacgacaacc tccagaagga tttcaagtcc 300
gccaaggaca ccatcaagaa gcagatatcc gagtacatta aggacagcga gaaattcaag 360
aacctcttca atcagaacct catcgacgca aaaaaaggac aggagagcga cctcatcctt 420
tggctcaaac agtccaaaga caacggtatc gagctcttca aggcaaactc tgacatcacc 480
gacatcgacg aagccttgga gatcataaag agcttcaagg ggtggaccac ttactttaaa 540
gggttccacg agaacaggaa gaacgtgtac tcctcaaatg acatccccac cagcatcatc 600
tacagaatcg tggacgacaa tctgcccaag tttttggaaa ataaggccaa atatgaaagc 660
ctcaaagaca aggcacccga ggccataaac tacgagcaga tcaagaagga cctcgccgaa 720
gaactcacct tcgacattga ctacaagact agcgaggtta atcagagagt ctttagcctt 780
gacgaggttt ttgagatcgc caacttcaat aactacctca atcaaagcgg cataaccaag 840
ttcaatacca tcatcggcgg caagttcgtt aacggtgaga acaccaagag gaagggcatc 900
aatgaataca ttaatctcta ctcccagcaa atcaacgaca agactctcaa aaagtacaag 960
atgtccgtcc tttttaagca gatactcagc gataccgagt ctaagagctt cgtcattgac 1020
aagctcgagg atgacagcga cgtggttacc accatgcaga gcttctacga gcagatcgcc 1080
gccttcaaga ctgtggaaga gaagagcatt aaggagaccc tcagtctcct cttcgacgac 1140
ctcaaggcac agaagctcga tctcagcaag atatatttca aaaatgacaa gtccctcacc 1200
gacctcagtc aacaagtttt cgacgactac agcgtcatcg ggaccgccgt cttggagtac 1260
attacccagc agatcgcccc caaaaacctc gacaacccta gcaagaaaga acaagagctc 1320
atagccaaga agaccgaaaa ggccaagtac ctctccctcg aaaccattaa gttggccctc 1380
gaggagttta acaagcacag ggacatcgac aagcagtgta gatttgagga aatacttgcc 1440
aacttcgccg ccatccccat gatctttgac gagatcgctc agaataagga caacctcgca 1500
cagatctcga taaagtacca gaaccagggt aaaaaagact tgttgcaggc ctctgccgag 1560
gacgacgtta aggccatcaa agatttgctc gaccagacta acaacctcct tcacaaactc 1620
aagatcttcc acatctctca gagtgaagac aaggccaaca tactcgacaa agacgagcac 1680
ttctacctcg tgttcgaaga atgctacttc gagctcgcca acatcgttcc cctctacaac 1740
aagatcagga actacattac ccagaagccc tactcagatg aaaaattcaa gctcaacttc 1800
gagaactcca ccctcgccaa cggctgggat aaaaacaagg agcccgacaa caccgccatc 1860
ctcttcataa aggacgataa atattacttg ggagttatga acaagaaaaa caacaagatc 1920
ttcgacgata aggctataaa ggagaacaaa ggggaggggt acaaaaagat agtttacaag 1980
ctcctccccg gtgccaacaa gatgctcccc aaggtgttct tctccgccaa gagtataaag 2040
ttctacaacc ccagcgagga catcctcagg ataaggaatc actccaccca caccaagaat 2100
ggcagccccc agaaggggta tgaaaagttc gaattcaaca tcgaggactg caggaagttc 2160
atcgacttct acaaacagag catcagtaag cacccagagt ggaaggattt cgggtttagg 2220
ttctccgata cccagaggta caactccatc gacgagttct atagagaggt tgagaaccag 2280
gggtacaagc tcacctttga gaacatcagc gagtcctata tcgactccgt cgttaaccag 2340
gggaaactct acttgttcca aatttacaac aaggatttct ccgcctactc caagggtagg 2400
cccaaccttc acaccctcta ctggaaggcc ctcttcgatg agagaaacct ccaggacgtt 2460
gtgtataagc tcaatgggga ggccgaactc ttctacagaa agcaatccat ccccaaaaag 2520
ataacccatc ccgctaagga ggctatcgcc aataagaaca aggacaatcc caagaaggag 2580
tccgtgttcg agtacgatct cataaaggac aaaaggttca ccgaggacaa gttcttcttc 2640
cattgtccaa ttaccatcaa ttttaaaagc tcaggagcca acaaatttaa cgacgaaatc 2700
aacctcttgc tcaaggagaa ggccaatgac gttcacatct tgtccatcga caggggcgag 2760
agacacttgg catactatac tctcgttgat ggcaagggca atatcattaa acaagacacc 2820
ttcaacatca tcgggaacga cagaatgaaa accaactacc acgacaagct cgccgccata 2880
gagaaggaca gagactccgc caggaaggac tggaaaaaga tcaacaacat caaggagatg 2940
aaggaggggt acctttccca ggttgttcac gagatcgcca agctcgtgat tgagtacaac 3000
gccatcgtgg tgttcgagga cctcaacttt ggctttaaga ggggcaggtt caaagtggag 3060
aagcaggttt atcagaaact cgagaagatg ctcatcgaga aactcaacta cttggtcttc 3120
aaggacaacg agttcgacaa gactggtggg gtcttgaggg cctatcaact cacagcaccc 3180
tttgagacct tcaagaagat ggggaagcag accggcatca tctattatgt gcctgccggg 3240
ttcaccagca agatctgccc cgtgaccggc ttcgttaacc agctctatcc caagtatgag 3300
tccgtcagca agtctcaaga gttcttttcc aagttcgaca aaatctgcta taacctcgac 3360
aaaggttact tcgagttcag cttcgactac aagaacttcg gcgacaaggc cgctaagggg 3420
aaatggacca ttgcctcttt cggcagcagg ctcataaact tcaggaacag cgacaaaaac 3480
cacaactggg ataccagaga ggtttatccc acaaaggagc tcgagaaact cctcaaggac 3540
tactctatcg agtacggcca cggcgaatgc atcaaggctg ccatttgcgg agagtctgac 3600
aagaaattct tcgccaagct cacctccgtc ttgaacacca tcctccagat gagaaatagc 3660
aagaccggga ctgaactcga ctacctcatc tcccctgtgg ccgacgtcaa cggcaatttc 3720
ttcgattcaa ggcaggcacc caagaatatg ccccaggatg ccgacgccaa tggggcctac 3780
cacatcggcc tcaaggggtt gatgttgctt gggaggatca agaacaacca ggaaggcaag 3840
aagctcaacc tcgtgatcaa aaatgaagag tacttcgaat ttgtccaaaa taggaacaat 3900
<210> 30
<211> 3900
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 30
atgtccatct accaggaatt cgttaataaa tacagcctca gcaagaccct caggttcgag 60
ctcatccccc agggcaagac cctcgaaaat atcaaggcca ggggactcat cctcgacgac 120
gagaaaaggg ctaaagatta caagaaggcc aagcagataa ttgacaagta ccaccagttc 180
ttcatcgagg agatcctcag ttccgtctgt atcagtgagg atctcctcca gaactactct 240
gacgtctact tcaagctcaa aaagagcgac gatgacaatc tccagaagga cttcaagagc 300
gccaaggaca caataaagaa acaaatcagc gaatatatca aggacagcga gaagtttaag 360
aaccttttca accagaacct cattgacgcc aagaagggcc aagaatctga cctcatcttg 420
tggctcaaac agtctaagga caacggaatc gagttgttca aagccaactc cgacatcact 480
gacatcgacg aggccctcga aatcattaaa tccttcaagg gatggaccac ctacttcaag 540
ggcttccatg agaacaggaa gaacgtttac agcagcaacg atatacctac cagcatcata 600
tataggatcg tggacgacaa cctccccaag ttcctcgaga acaaggctaa gtacgagagc 660
cttaaggaca aagcccccga agccatcaac tacgagcaga tcaagaagga cttggccgag 720
gagctcactt tcgacattga ctataagacc agcgaagtca accaaagggt gttttcactt 780
gacgaggtgt ttgagatcgc caattttaac aactacctca accagagtgg gatcaccaag 840
tttaacacca tcataggggg caagttcgtg aatggggaga acaccaagag gaaggggatc 900
aacgaataca tcaacttgta cagtcagcag attaacgaca agactctcaa gaagtataag 960
atgtccgtcc tttttaaaca gattctctcc gatacagaga gcaagtcatt cgtcattgac 1020
aaacttgagg acgactctga cgtcgtcacc actatgcagt ccttctatga gcagattgca 1080
gcctttaaga ccgttgagga gaagagcatc aaagagacct tgtccctcct tttcgatgac 1140
ctcaaagccc agaagctcga cctctccaag atatacttca agaatgacaa gtccctcaca 1200
gacttgagcc agcaagtgtt cgacgactac agcgtcatcg gcaccgccgt gctcgagtac 1260
atcacccaac agatcgcccc caagaacctc gacaacccca gtaagaagga gcaggagctc 1320
atcgccaaga agacagaaaa ggccaaatat ctcagtctcg agaccatcaa gctcgcactc 1380
gaagagttca acaagcacag ggacatcgac aagcagtgca ggttcgagga gatcctcgcc 1440
aacttcgctg ctatccccat gatcttcgat gagatagccc agaacaaaga caatctcgct 1500
cagatctcta tcaagtacca gaaccaagga aagaaggacc tcctccaggc cagcgccgag 1560
gacgacgtca aggccatcaa ggacctcctc gaccagacca acaatcttct ccacaaactc 1620
aagatcttcc acatctccca atccgaagac aaggccaaca tactcgacaa ggacgaacac 1680
ttctacctcg tttttgagga gtgctacttc gagctcgcca acatcgttcc tctctacaat 1740
aaaattagga actacatcac ccagaaaccc tatagcgatg agaagtttaa attgaacttc 1800
gagaattcca cactcgccaa tgggtgggac aagaacaagg agcccgacaa taccgccatc 1860
ctcttcatta aggacgacaa gtattacctt ggagttatga ataaaaaaaa caataagatt 1920
ttcgacgaca aggccatcaa ggagaacaaa ggtgaaggct ataagaagat agtgtacaag 1980
ttgctcccag gcgcaaacaa gatgcttccc aaggtgttct tttccgccaa gtctatcaag 2040
ttttacaatc cctcagaaga catcctcagg atcaggaatc attccaccca taccaagaat 2100
gggagccctc agaagggcta cgagaagttc gagttcaata tcgaggactg caggaagttc 2160
attgatttct acaaacagtc aatctccaag caccccgaat ggaaagattt cggcttcaga 2220
tttagcgaca ctcagaggta caattccatc gacgagttct acagggaggt tgaaaatcag 2280
ggatacaagc tcaccttcga gaacatttcc gagtcttaca tcgatagcgt ggtcaatcag 2340
ggtaaacttt acctcttcca gatctacaac aaggacttct ccgcctactc caaaggcaga 2400
cccaacctcc acaccctcta ctggaaagca ctttttgacg agaggaacct tcaggacgtt 2460
gtttacaagc tcaacgggga ggctgagctc ttctacagga agcaatccat accaaagaaa 2520
atcacacatc ccgccaagga agccatcgcc aacaagaata aggataatcc caaaaaagaa 2580
tccgtcttcg aatatgacct cataaaggat aaaaggttta ccgaagacaa attctttttc 2640
cattgcccca tcaccatcaa cttcaagtca tccggggcaa acaagttcaa cgacgagatt 2700
aacctcctcc tcaaagagaa ggccaatgac gtgcatatct tgtccatcga caggggggag 2760
agacacctcg cctattacac cctcgtggat gggaagggta acatcatcaa acaggacacc 2820
ttcaatatca tcggcaacga caggatgaaa accaactacc acgacaagtt ggctgccatc 2880
gaaaaggaca gagacagtgc caggaaggac tggaagaaga tcaacaatat aaaggagatg 2940
aaggagggtt atctcagcca ggtggttcac gagattgcca agcttgtcat cgagtataac 3000
gccatcgtcg ttttcgagga cctcaatttc gggttcaaga ggggtaggtt caaggttgag 3060
aagcaggttt atcagaagct cgagaagatg ctcattgaga agcttaatta cctcgtgttc 3120
aaggacaacg agttcgacaa gaccggcggc gtcctcaggg cctaccagct caccgccccc 3180
ttcgagacat tcaagaaaat gggtaagcag accggcatta tatactacgt ccccgctggg 3240
ttcaccagca agatctgccc tgttaccggc tttgtcaatc agctttaccc taagtatgag 3300
tccgtgagca agagccagga atttttttcc aagttcgaca agatctgtta taacctcgac 3360
aagggctact tcgagttcag cttcgactac aaaaatttcg gtgacaaggc cgccaagggg 3420
aagtggacca tcgcaagctt cggttctagg ctcatcaatt tcaggaacag cgacaagaac 3480
cacaactggg atactagaga ggtgtacccc accaaggagc tcgagaagct cctcaaggac 3540
tatagcatcg agtatggtca cggtgagtgc atcaaagccg ccatctgcgg cgagtccgac 3600
aaaaagtttt ttgccaaatt gacctccgtt ctcaacacca tcctccagat gaggaacagc 3660
aaaaccggga ccgagttgga ttacctcatc agccccgttg cagatgtgaa cggcaacttc 3720
tttgacagca ggcaggcccc caaaaatatg cctcaggatg ctgacgcaaa tggtgcctac 3780
cacatcgggc tcaaggggct catgctcctc gggagaatca agaataacca ggagggcaag 3840
aagctcaacc tcgtcatcaa gaacgaggag tactttgaat ttgtgcagaa caggaataac 3900
<210> 31
<211> 3900
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 31
atgagtatct atcaggagtt cgtgaataaa tacagcctca gcaaaactct caggttcgag 60
ctcatacctc aaggcaagac cctcgagaat atcaaggcca ggggcctcat cctcgatgat 120
gagaagaggg ccaaggacta caaaaaggcc aagcagatca tagacaaata ccatcagttc 180
tttatcgagg aaatactctc cagcgtttgc atcagcgaag accttctcca gaactactct 240
gatgtgtact tcaagctcaa gaagagcgac gatgataacc tccaaaaaga cttcaagagt 300
gccaaggaca ctatcaagaa gcagatttca gagtacatca aagatagcga gaaattcaag 360
aacctcttca atcaaaacct catcgatgcc aagaaaggcc aggaatccga cctcattctc 420
tggctcaagc agagcaaaga caacggtatt gaactcttca aggctaacag cgacatcacc 480
gacatcgacg aagccttgga gataataaag agtttcaaag gctggacaac ctacttcaag 540
ggcttccacg aaaacaggaa gaatgtttac tctagcaatg atatacccac ctccatcatt 600
tacagaatcg ttgacgacaa tctccccaag ttcctcgaga acaaggctaa gtatgaatcc 660
ctcaaggaca aggcccccga agccatcaac tacgagcaga taaagaaaga tttggctgag 720
gagctcacct ttgacatcga ctacaagacc tctgaagtga accagagagt gttcagcctc 780
gacgaagttt ttgaaatcgc caacttcaac aactatctca accagtccgg tatcaccaag 840
tttaacacaa tcatcggagg taagttcgtg aacggggaga acacaaagag gaaaggcatt 900
aacgagtata tcaacctcta cagccagcag atcaatgata agacactcaa aaagtacaag 960
atgagcgtcc tctttaaaca gatactcagc gacaccgaat ctaaatcctt cgtcattgac 1020
aagctcgagg acgactccga cgtcgtcacc acaatgcaat ccttctacga gcagatcgcc 1080
gccttcaaga ccgtcgagga gaagtccatc aaggagaccc tcagcctcct ctttgatgac 1140
ctcaaagccc aaaagctcga cctctccaag atctacttca agaatgacaa gagcctcacc 1200
gacctctccc aacaagtctt cgacgactat tccgtcatcg gcaccgccgt cctcgagtac 1260
atcacccaac aaatcgcccc taagaacctc gacaatccca gcaaaaagga acaagaactc 1320
atcgccaaga agaccgaaaa ggccaaatac ctcagccttg aaactatcaa gcttgccctc 1380
gaagagttca acaagcacag ggacatcgac aagcagtgca gatttgaaga gatcctcgcc 1440
aattttgccg ccattcccat gatcttcgac gaaatcgccc agaataagga caacttggct 1500
cagatcagta tcaagtacca aaatcagggc aagaaggacc tcctccaggc cagtgctgag 1560
gatgacgtga aggccatcaa ggatctcctc gatcagacca acaacttgct ccataaactc 1620
aagatcttcc atatcagcca gagcgaggac aaggccaata tccttgataa ggacgagcat 1680
ttctacctcg ttttcgagga gtgctacttc gagcttgcta atatcgtccc cctctacaac 1740
aagatcagaa actacatcac ccagaagccc tattccgacg agaagttcaa gctcaacttc 1800
gaaaactcaa ctctcgccaa cggttgggat aagaataaag agcccgacaa caccgccatc 1860
ctcttcataa aagacgacaa atattatttg ggcgttatga acaagaagaa caacaagatc 1920
tttgacgata aagccatcaa ggagaataag ggcgagggct acaaaaagat cgtttataag 1980
ctcctcccag gtgccaataa gatgcttccc aaggtttttt tctccgccaa gtccatcaag 2040
ttctataacc cctccgagga catcctcagg ataaggaatc actctaccca taccaaaaac 2100
ggctcacccc agaaagggta cgagaagttc gaattcaaca tcgaggactg taggaagttc 2160
atcgacttct acaagcagtc catcagtaag caccctgagt ggaaggattt cgggttcagg 2220
ttcagcgata cccagaggta taacagtatc gatgagttct acagggaggt ggaaaaccag 2280
ggatacaagc tcacatttga gaacatatcc gaaagctaca tcgactccgt ggtgaaccag 2340
gggaagctct atctcttcca gatctataat aaggacttct ctgcttactc taaggggaga 2400
cccaatctcc acaccctcta ctggaaggcc ctcttcgacg aaaggaacct ccaagacgtt 2460
gtctacaagc tcaacgggga ggccgagctt ttctacagaa agcagtcaat ccccaagaag 2520
attacccacc ctgccaagga ggccatagcc aacaagaata aggacaaccc caagaaggaa 2580
agcgttttcg aatacgatct cattaaggac aagagattca ccgaagacaa gttctttttc 2640
cactgcccca tcaccatcaa ttttaagagc tctggtgcca acaaattcaa cgacgagatc 2700
aatctcctcc tcaaggagaa ggccaatgac gtgcacatcc tcagcatcga cagaggggag 2760
aggcacctcg cctactatac cctcgtggat gggaaaggaa acatcatcaa gcaggatact 2820
tttaacatca tcgggaacga taggatgaag accaactacc atgataagct cgccgcaatc 2880
gagaaggaca gagactccgc cagaaaggac tggaagaaga taaataacat caaagagatg 2940
aaagagggct acttgtccca agtcgtgcac gagatcgcca agctcgttat cgagtacaac 3000
gccatcgtcg tgttcgagga tctcaacttc gggttcaaga gaggcaggtt caaggttgag 3060
aagcaggtct atcagaagct tgagaagatg ctcattgaaa agctcaacta cttggtcttt 3120
aaggataacg agtttgataa aaccggaggc gttctcaggg cctaccagct caccgccccc 3180
ttcgagacct tcaagaaaat ggggaagcag accggtatca tctactatgt gcccgccggc 3240
ttcacctcca agatctgccc agttaccggc ttcgttaacc agctctatcc caagtacgaa 3300
tctgtgtcca agtcacagga gttcttcagt aagtttgaca aaatctgcta caacctcgat 3360
aagggttact tcgagttcag cttcgactac aagaacttcg gggacaaggc cgccaagggg 3420
aagtggacaa tcgccagttt cgggtccagg ctcataaact ttaggaactc cgataaaaac 3480
cacaactggg acaccaggga agtctacccc accaaagagt tggagaagct cctcaaggac 3540
tatagtatcg agtacggcca cggtgagtgc attaaggccg ccatctgtgg agagtccgac 3600
aaaaagtttt tcgccaagct caccagcgtt ttgaacacca tactccagat gaggaacagc 3660
aaaaccggta ccgaactcga ctacctcatt agccccgtgg ccgatgtcaa tgggaacttt 3720
ttcgacagca ggcaggcacc caagaacatg ccccaggacg ccgacgccaa cggagcctac 3780
cacatcggcc tcaaggggct catgctcctt ggtagaatca aaaacaacca ggagggcaag 3840
aagctcaacc tcgtcatcaa aaacgaagag tacttcgagt ttgtccagaa caggaacaat 3900
<210> 32
<211> 3900
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 32
atgtctatct accaggagtt cgtcaacaag tactccctca gcaagaccct caggtttgag 60
cttatcccac aaggcaagac cctcgagaac atcaaggcca ggggtcttat cctcgacgac 120
gaaaaaaggg ccaaggacta caagaaggcc aagcagatca tcgacaagta ccaccagttc 180
tttatcgagg agatactcag cagcgtgtgc atcagcgagg accttctcca gaactactcc 240
gacgtttact ttaagctcaa gaagtctgac gacgataacc tccagaagga cttcaagtcc 300
gccaaggata ccatcaagaa gcagatcagc gagtatatta aggatagcga gaagttcaag 360
aatcttttca accagaacct catcgatgcc aaaaaggggc aggaaagtga cctcatactc 420
tggctcaaac agagcaaaga caacgggatc gagctcttca aagccaacag cgacatcaca 480
gacatcgacg aggctctcga aatcatcaaa agttttaagg gatggaccac ctacttcaag 540
ggtttccacg agaataggaa gaatgtgtac tcttcaaatg acattcccac ctccatcatc 600
tataggatcg tggatgataa cctccccaag ttcctcgaga ataaggccaa gtatgagtcc 660
ctcaaggaca aggcccccga ggccataaac tacgaacaga tcaagaagga tctcgccgag 720
gaattgacat tcgatatcga ctacaagact agcgaggtga accagagggt gttcagcctc 780
gacgaagtgt tcgaaatcgc caatttcaac aactatctca accagagtgg gataaccaaa 840
tttaatacaa taataggcgg caagttcgtt aatggcgaga acaccaagag aaagggcatc 900
aatgagtata tcaatctcta tagccagcag ataaacgaca agaccctcaa aaaatacaag 960
atgtccgtcc tctttaagca gatcctctcc gacaccgaat caaagtcctt tgtcatcgac 1020
aagttggagg acgactccga cgttgtcacc actatgcagt cattctacga acagattgcc 1080
gccttcaaaa ccgtggagga aaaaagcatc aaagagaccc tcagcctctt gttcgacgac 1140
ctcaaggccc aaaaactcga ccttagcaaa atctacttca agaatgacaa gtccctcacc 1200
gacctcagcc agcaagtgtt tgatgactac agtgtgattg ggacagccgt cctcgagtac 1260
ataactcagc agatagctcc caaaaacttg gacaacccct ccaagaagga acaggagctc 1320
atagctaaga aaactgagaa agccaagtac ttgtcactcg agacaatcaa actcgccctc 1380
gaggaattca acaagcacag ggacatcgac aagcagtgta ggttcgaaga gatcctcgca 1440
aacttcgccg caatccccat gatctttgat gaaatcgctc agaacaagga caatctcgct 1500
cagatcagca ttaaatacca aaaccagggg aagaaggact tgctccaagc cagcgccgag 1560
gatgacgtta aagccatcaa ggatctcctc gatcagacca acaacctcct ccacaagctc 1620
aagatatttc acattagcca gagtgaggac aaagccaaca tcctcgataa agacgaacac 1680
ttctatctcg tgttcgagga gtgttatttc gagctcgcaa atatcgttcc cctctacaac 1740
aagatcagga actatatcac ccaaaagcca tacagcgatg agaagtttaa gctcaacttt 1800
gagaatagca cattggccaa cgggtgggac aagaacaagg aacccgacaa cactgccatc 1860
ctcttcatca aagacgacaa atactacctc ggggtgatga acaagaagaa caacaagatc 1920
tttgatgata aggccatcaa ggagaacaaa ggcgagggct acaagaaaat cgtgtacaaa 1980
ctcctccccg gcgccaacaa gatgctcccc aaagttttct tctccgccaa gtccatcaag 2040
ttttataacc ccagcgagga catccttagg atcagaaacc attcaaccca caccaagaac 2100
ggaagccccc agaaagggta cgagaaattc gagttcaaca tcgaggactg caggaagttt 2160
atcgattttt acaaacagtc cataagcaag caccccgagt ggaaggactt cggttttaga 2220
ttctcagaca cacagagata caacagcata gacgaattct acagggaagt cgaaaatcag 2280
gggtacaagt tgacctttga gaacatcagc gagagctata tagacagcgt tgttaaccag 2340
gggaagctct acctcttcca gatatacaac aaagacttct ctgcctactc caaaggcagg 2400
cccaatctcc atacattgta ctggaaagcc ctctttgatg agaggaacct ccaggacgtc 2460
gtgtataagc tcaatggcga agccgagctc ttctatagga aacaatcaat ccctaagaag 2520
atcacccacc ccgccaaaga ggctattgcc aacaagaaca aagataaccc caagaaggag 2580
agcgtctttg agtacgactt gatcaaggac aagaggttta ccgaggacaa gtttttcttc 2640
cactgcccca tcactatcaa cttcaaatct tccggtgcca acaagtttaa cgacgagatt 2700
aacctcctcc tcaaagagaa agccaacgac gtccacatcc tcagtataga caggggcgag 2760
aggcacctcg cctattacac cctcgtcgac gggaagggca acatcatcaa acaggatacc 2820
ttcaacatca tcggaaacga caggatgaag accaactacc atgataagct cgccgccatc 2880
gaaaaagaca gggacagcgc aaggaaggat tggaagaaga tcaataatat caaggagatg 2940
aaggaggggt acctctctca ggtggtgcac gaaattgcta agctcgtgat cgagtacaat 3000
gccattgtcg ttttcgagga cttgaatttc ggcttcaaga gaggcaggtt taaggtcgag 3060
aagcaggttt atcagaagct cgagaaaatg ttgatcgaga agctcaacta cctcgtcttc 3120
aaggacaatg agttcgacaa gactggaggg gttctcagag cctaccagct caccgccccc 3180
tttgagacct tcaagaagat ggggaagcaa accggcataa tctattacgt tcccgcagga 3240
ttcacttcta agatctgccc cgtgacaggc ttcgttaatc agttgtaccc aaagtacgag 3300
agcgtgtcca agtcacagga gttcttttcc aaattcgaca agatctgcta caacctcgac 3360
aaaggatact tcgaattcag cttcgactac aagaatttcg gggacaaggc cgctaagggc 3420
aagtggacca ttgccagttt cgggtccagg ctcatcaatt ttaggaattc cgacaaaaac 3480
cacaattggg acacgaggga ggtctaccct accaaagagc tcgaaaagct cctcaaagac 3540
tacagtatcg agtacggcca tggtgaatgc atcaaggccg ccatctgcgg ggagagcgac 3600
aagaagttct tcgccaagtt gacctccgtc ctcaatacca tcctccagat gaggaacagc 3660
aaaaccggca ccgagctcga ctaccttatc agccctgtgg ctgacgttaa cgggaacttt 3720
ttcgactcca ggcaggcacc caagaacatg ccacaagacg ctgacgccaa cggcgcctac 3780
cacatcgggc tcaagggctt gatgcttttg ggcagaatca agaacaacca ggaggggaaa 3840
aagctcaacc tcgtgataaa gaacgaggag tatttcgagt tcgttcagaa caggaacaac 3900
<210> 33
<211> 3900
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 33
atgtccatct atcaggagtt tgtgaacaaa tatagcctca gcaaaacctt gaggttcgaa 60
ctcatacccc aaggcaaaac cctcgagaac ataaaggcta gagggctcat cctcgacgat 120
gagaagaggg ccaaggatta caagaaagct aagcagatta tagacaagta tcatcaattc 180
tttattgagg aaatcctctc ttccgtctgc atctccgagg atctccttca gaattactcc 240
gacgtctact tcaagctcaa gaagagcgac gacgacaacc tccagaaaga cttcaagtct 300
gccaaggaca ccatcaagaa gcaaattagc gagtacatca aggacagcga gaagttcaag 360
aacctcttta atcaaaatct catcgatgcc aagaagggcc aagagtctga tcttatcttg 420
tggctcaagc aaagcaaaga caacgggatt gagctcttca aggccaacag tgacatcacc 480
gacatcgacg aggccttgga gatcatcaag tcctttaagg ggtggaccac ttactttaag 540
gggttccacg agaacaggaa gaacgtctat tccagcaacg acatccccac cagcatcatt 600
tacaggattg ttgacgataa cctcccaaag ttcctcgaaa acaaagctaa gtatgagtcc 660
ctcaaggata aggcccccga ggccatcaac tatgaacaga tcaagaagga cctcgccgag 720
gagctcacct tcgacatcga ctacaagacc agcgaggtga accagagggt cttttcactc 780
gacgaggtgt ttgagatcgc caactttaat aactacctca atcagtccgg gattaccaag 840
ttcaatacca tcattggggg caagtttgtg aatggggaga acactaagag aaagggtatt 900
aacgagtaca tcaatttgta cagccaacaa atcaacgaca agaccctcaa gaaatacaag 960
atgagcgtcc tcttcaagca gatcctttcc gacaccgagt caaagtcctt cgtcatcgac 1020
aagctcgaag atgacagcga cgttgttacc accatgcaga gcttctatga acagatcgcc 1080
gccttcaaga ccgttgagga gaagagcatc aaggagaccc tctcactcct ctttgatgac 1140
ctcaaggccc aaaagctcga cctctccaaa atttacttca agaatgacaa gagcctcacc 1200
gacctcagcc agcaggtctt cgacgactac tccgtcatcg gcaccgccgt gctcgagtat 1260
atcacccaac agatcgctcc caaaaacctc gacaatccca gcaagaaaga acaggaactc 1320
atcgccaaga agaccgagaa ggccaagtac ctctccctcg agacaatcaa gctcgccctc 1380
gaggagttca ataaacacag agacatcgac aagcagtgca ggtttgagga gatcctcgcc 1440
aatttcgccg ccatccccat gatcttcgac gagatagccc aaaacaagga caatctcgcc 1500
cagatcagca tcaagtacca gaaccaaggc aagaaggatt tgctccaggc ctccgccgag 1560
gatgacgtga aggctatcaa agatttgttg gatcagacca acaacctcct ccacaagctc 1620
aaaatcttcc acatcagcca atccgaagac aaggccaaca tcctcgacaa ggacgagcac 1680
ttctaccttg ttttcgaaga atgctacttc gagttggcca acatcgtccc cctctacaac 1740
aagatcagga actacatcac ccaaaagccc tactccgacg aaaagttcaa actcaacttt 1800
gagaactcta ctcttgccaa cgggtgggat aagaataagg aacctgacaa caccgccata 1860
ctcttcatca aggacgacaa atactacctc ggcgttatga acaaaaagaa caacaaaatc 1920
ttcgacgaca aggccatcaa agagaacaag ggagaaggtt acaagaagat tgtgtacaag 1980
ctccttccag gggccaataa aatgctcccc aaggtgtttt ttagcgccaa gagcatcaag 2040
ttctacaacc cctcagagga catcctcagg atcaggaacc acagcaccca cactaagaac 2100
ggcagtcccc agaagggtta cgagaaattc gaattcaaca tcgaggattg taggaagttc 2160
attgatttct ataagcaaag catctccaaa caccccgagt ggaaagactt tggcttcaga 2220
ttcagcgaca cccagagata caactcaatc gacgagttct acagggaagt cgagaaccag 2280
ggctataaat tgacctttga gaacatctcc gagtcctaca tcgacagcgt cgtcaaccag 2340
gggaagctct atctctttca gatctacaac aaggacttca gcgcatatag caagggaaga 2400
cccaatctcc ataccctcta ctggaaggcc ctcttcgacg agaggaacct ccaggacgtg 2460
gtctacaagc tcaatgggga ggccgaattg ttctacagaa agcagtcaat ccccaagaag 2520
attacccacc cagccaaaga ggcaatagcc aacaagaaca aggacaatcc caagaaggaa 2580
tcagtgttcg agtacgactt gataaaagac aagagattca ctgaagataa gttctttttc 2640
cactgcccca tcaccatcaa cttcaagtcc agcggcgcca acaaattcaa tgatgagatc 2700
aacctcctcc tcaaggagaa ggctaacgac gtccacatct tgagcatcga cagaggcgag 2760
aggcaccttg cctactacac tctcgtcgac gggaaaggca atatcattaa gcaggatact 2820
ttcaacatca tcggaaacga caggatgaag actaattacc atgacaagct cgccgccata 2880
gaaaaggata gggactccgc caggaaagac tggaagaaga tcaataacat caaggagatg 2940
aaggaggggt atctttccca ggtcgttcat gagatcgcca agctcgtgat agaatataat 3000
gccatcgtcg tcttcgagga tctcaacttt ggtttcaaaa gggggagatt taaggtggaa 3060
aagcaggtct accagaaact cgagaaaatg ctcatcgaga agctcaatta tctcgtgttc 3120
aaggacaacg agttcgacaa gaccggcggg gtcctcaggg catatcagct taccgccccc 3180
tttgagacct tcaagaagat ggggaagcag accggtatca tctactacgt tcccgcaggg 3240
tttaccagca aaatctgtcc cgtgaccggc tttgtgaacc aactctaccc aaaatatgaa 3300
tcagtgtcca agtcccagga attcttctcc aaattcgaca aaatatgtta taacctcgac 3360
aaagggtact tcgaatttag cttcgattac aaaaacttcg gcgacaaagc cgccaagggg 3420
aagtggacca tcgccagctt cgggtcaaga ctcatcaact tcagaaactc cgataagaac 3480
cataactggg acaccaggga ggtctacccc accaaggagt tggagaagct cctcaaggac 3540
tatagcatcg agtacggtca cggggaatgc atcaaggccg ccatctgcgg cgaatccgat 3600
aagaagttct tcgccaagct cacctccgtc ctcaacacca tcctccagat gagaaacagc 3660
aagaccggca ctgagctcga ctacctcata agccccgtcg ccgacgtcaa cggaaacttt 3720
ttcgactcaa gacaggcccc caaaaacatg ccacaggacg ccgacgccaa cggcgcctac 3780
cacatcgggt tgaaaggcct catgctcctt gggaggataa agaacaatca agagggcaaa 3840
aagcttaacc tcgtcatcaa gaacgaggag tactttgaat tcgttcagaa caggaataac 3900
<210> 34
<211> 3900
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 34
atgtccatct atcaggagtt cgtgaataag tatagcttgt ccaaaaccct taggtttgag 60
ctcatcccac aggggaaaac cctcgagaat atcaaggcca gggggctcat actcgacgac 120
gagaagaggg ccaaagacta caagaaggcc aagcagataa tcgataagta ccaccaattc 180
ttcatagaag agatcttgag ttccgtctgc atctccgagg accttttgca gaactactcc 240
gacgtttact tcaaactcaa gaagagcgac gacgacaacc tccaaaagga cttcaaatcc 300
gctaaggaca ctatcaagaa acaaatctcc gaatatatca aggacagcga gaaattcaag 360
aacttgttta accagaacct tattgacgcc aaaaaggggc aggagagcga cctcattctc 420
tggctcaaac agagcaagga caacggcatc gagctcttca aggccaactc cgacataacc 480
gacattgatg aggccctcga gatcattaag tcctttaagg gctggaccac ctatttcaag 540
ggcttccacg agaacaggaa gaatgtttac agcagcaacg acattcccac ctccatcatc 600
tacaggatcg tcgacgataa cctccccaaa ttcctcgaaa acaaggctaa atacgaatcc 660
cttaaggaca aggcccctga agcaatcaac tacgaacaaa tcaagaaaga cctcgccgag 720
gagctcacct tcgacatcga ctacaagaca agcgaggtga accagagggt gttcagtttg 780
gatgaggttt ttgagatcgc caatttcaac aactacttga accagagcgg catcaccaag 840
ttcaatacca tcatcggggg caagttcgtg aacggcgaga acaccaaaag gaagggtatc 900
aacgaataca tcaacctcta cagtcaacag atcaacgaca agaccctcaa gaagtataag 960
atgagcgttc tcttcaagca gatcctcagc gacactgagt ctaagtcctt cgttatagac 1020
aagcttgagg acgacagcga tgtcgtgacc accatgcaat ccttctacga acagatcgcc 1080
gccttcaaga ccgtggagga gaagtcaatc aaggaaaccc tcagcctctt gttcgatgat 1140
ctcaaggctc aaaagctcga cctctcaaaa atctatttca agaacgacaa gtccctcacc 1200
gacctctcac agcaagtttt cgatgactac tccgtgatag gcaccgccgt tctcgagtac 1260
atcacccagc agatcgcccc caaaaacctc gacaacccct ctaagaagga acaggagttg 1320
atcgcaaaga agaccgagaa agccaagtac ctctccttgg agaccatcaa gttggccctc 1380
gaagagttca acaagcatag ggacatcgac aagcagtgca ggttcgagga aatactcgcc 1440
aacttcgccg caatccccat gatcttcgac gagatcgccc agaacaaaga caacctcgcc 1500
caaatcagta tcaagtacca aaaccagggg aaaaaagacc tcttgcaggc ctcagccgag 1560
gacgacgtta aggctatcaa ggaccttctt gaccagacca acaatctcct ccataaactc 1620
aaaatcttcc acatctcaca gtccgaagat aaagccaaca tcctcgacaa ggacgagcac 1680
ttctacctcg tctttgagga gtgctacttc gaactcgcca acatcgtccc tctctacaat 1740
aaaattagga attatatcac acagaaaccc tactctgacg aaaagttcaa gctcaacttt 1800
gagaatagta ccctcgccaa tgggtgggat aagaacaaag agcccgacaa taccgccatc 1860
ctctttatca aggacgacaa gtactatctc ggggtcatga acaagaagaa caataagata 1920
ttcgacgata aggccatcaa agagaacaag ggagaggggt acaaaaaaat cgtgtacaag 1980
cttctccctg gggccaataa gatgcttccc aaagtcttct tctccgctaa gtccatcaag 2040
ttctataacc ccagcgagga catccttagg atcaggaacc actccaccca caccaaaaac 2100
ggctcccccc aaaagggcta tgagaagttc gagttcaaca tcgaggactg caggaaattc 2160
atagacttct acaagcagtc catctccaaa caccccgagt ggaaggactt cgggttcagg 2220
ttcagcgaca cccaaaggta caactccatc gacgaatttt atagggaagt ggagaaccaa 2280
gggtacaagt tgacctttga gaatatcagc gagtcctaca tcgacagcgt tgtcaaccag 2340
gggaagctct atctcttcca aatctacaac aaggatttct ccgcctactc caaaggaaga 2400
cccaatctcc acaccctcta ttggaaggcc ctcttcgacg aaaggaactt gcaggacgtc 2460
gtttacaagc tcaatgggga ggccgaactc ttctacagaa agcaatcaat ccccaaaaag 2520
ataacccacc ccgctaaaga agctatcgcc aacaagaaca aagacaaccc caaaaaagaa 2580
tccgtgttcg aatacgacct catcaaagac aagaggttca ctgaagacaa attcttcttc 2640
cactgcccca tcaccatcaa cttcaagagc agcggggcca acaaattcaa cgacgaaatc 2700
aacctcctcc tcaaagagaa agccaacgac gtgcacattc tcagcatcga taggggtgag 2760
aggcacctcg cctactacac cctcgttgat ggcaagggga acatcataaa acaagatacc 2820
ttcaatatca tagggaatga caggatgaag accaactacc acgataagct tgcagccatc 2880
gagaaggaca gggattccgc aagaaaggat tggaagaaaa tcaacaacat caaagagatg 2940
aaggagggtt acctctccca ggtggttcac gaaatagcca agctcgtgat cgagtacaac 3000
gccatagtgg tgttcgagga cctcaacttt ggctttaaga gggggaggtt caaggttgag 3060
aagcaagtct accagaagct cgagaaaatg ctcatagaaa agctcaacta tctcgtcttc 3120
aaggacaacg agtttgacaa aaccggtggt gtcctcagag catatcagct caccgctccc 3180
ttcgagacct tcaagaagat gggaaaacag accgggatca tatactacgt gcccgccggg 3240
ttcacctcta agatctgtcc cgttaccggc ttcgtcaatc agctttaccc caagtatgag 3300
tccgtctcca aatcccagga gttcttttcc aagttcgata agatttgcta caacttggac 3360
aaggggtact tcgagttctc cttcgactac aagaacttcg gggacaaagc cgccaaaggg 3420
aaatggacca tcgcctcctt tggcagtaga ctcatcaact tcaggaactc cgacaagaac 3480
cacaactggg ataccaggga ggtctacccc actaaggaac ttgagaagct cctcaaagac 3540
tacagcatcg agtacgggca cggggagtgc atcaaggccg ccatctgcgg tgaaagcgac 3600
aagaagtttt tcgccaagct caccagtgtg ctcaacacca tattgcaaat gaggaattct 3660
aaaaccggga cagagctcga ctatctcatc agtcccgttg ccgacgtcaa cggcaacttc 3720
tttgactcca ggcaggcccc caagaacatg ccacaggacg ccgacgccaa tggcgcctac 3780
cacatcgggc tcaaggggct catgcttctc ggcaggatta agaacaacca agagggcaag 3840
aagttgaacc tcgtgatcaa aaacgaggag tactttgagt tcgtgcagaa taggaacaac 3900
<210> 35
<211> 3900
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 35
atgagcatct atcaggagtt tgttaacaag tattccctca gtaagacact caggttcgag 60
ctcatcccac aaggcaagac actcgagaac atcaaggcca ggggactcat tctcgacgac 120
gagaagagag caaaggacta caagaaagct aaacagataa tcgataaata ccatcagttt 180
ttcatcgagg agatcttgag cagcgtgtgc atctccgagg accttctcca gaactatagc 240
gatgtctact ttaagctcaa aaagagcgac gacgacaatt tgcagaagga tttcaagagc 300
gcaaaggata ccataaagaa gcagatcagt gaatacatta aggacagcga gaagttcaag 360
aacctcttca accagaatct catcgacgcc aagaaggggc aggagagcga cttgatcctc 420
tggctcaaac agtccaagga caatggtatc gagctcttca aggccaactc cgatattacc 480
gacatcgatg aagccctcga gataatcaaa agtttcaagg gatggaccac ttacttcaaa 540
gggttccacg agaacagaaa gaacgtttac tcttcaaacg atatccccac aagtatcatc 600
tacaggatcg tggacgacaa cctccccaag ttcctcgaga acaaggccaa gtacgagtca 660
ctcaaagata aagcccccga ggctatcaac tacgagcaaa taaagaagga cctcgctgaa 720
gagctcacct tcgacatcga ttacaagacc agcgaggtca accagagggt cttcagtctc 780
gatgaggtgt tcgagatcgc aaactttaat aactacctca atcagagtgg catcacaaag 840
ttcaacacca tcatcggtgg gaaattcgtt aacggggaga acaccaagag aaagggaatc 900
aacgaatata taaacctcta cagccaacag ataaatgata agaccctcaa gaagtacaag 960
atgagcgtcc tctttaagca gatcctttct gacaccgaga gcaagagttt tgttatcgac 1020
aagcttgagg acgacagtga cgtcgtcact accatgcaga gcttttacga gcagatcgcc 1080
gccttcaaga ctgtggagga gaagtctatc aaagagactc tctccctcct tttcgacgac 1140
ctcaaggccc agaaactcga tctctctaag atctacttca agaacgataa gagtttgacc 1200
gacctctccc agcaggtgtt cgatgattac agcgtgatcg ggaccgccgt cctcgagtac 1260
atcacacagc agatcgcccc caagaacctc gacaacccaa gcaagaagga gcaagagctc 1320
attgccaaga agaccgagaa agctaagtac ctcagcctcg agactatcaa actcgctctc 1380
gaggagttta ataaacatag agacatcgat aaacagtgca ggttcgagga gatcctcgcc 1440
aactttgccg ccatacccat gatatttgat gagatcgccc aaaacaagga taaccttgcc 1500
cagatcagca tcaagtacca gaaccagggc aaaaaggacc tcttgcaggc ctccgccgag 1560
gacgatgtta aggccatcaa ggacctcctc gaccagacca acaacctcct ccataagctc 1620
aaaatctttc acatctccca gtccgaggat aaggccaaca tcctcgacaa ggacgagcac 1680
ttctacttgg tcttcgagga gtgctacttc gagctcgcta acatcgtgcc actctataac 1740
aagatcagga actacatcac ccaaaagccc tactctgacg agaagttcaa gctcaacttc 1800
gagaacagca ccctcgccaa cgggtgggac aaaaacaaag aacccgacaa cactgccatc 1860
ctcttcatca aagacgacaa atactatctc ggcgtgatga ataaaaaaaa caacaagata 1920
tttgatgaca aagccatcaa ggagaacaag ggcgaaggtt acaagaaaat cgtctacaag 1980
ctcctccccg gcgccaacaa gatgctcccc aaggtcttct ttagtgcaaa gtcaatcaag 2040
ttttataacc cctccgaaga catcctcagg attaggaacc actccaccca caccaagaac 2100
ggcagccctc aaaaagggta cgagaaattc gagttcaaca tcgaggactg cagaaaattc 2160
atagacttct acaagcaaag catcagcaag caccccgaat ggaaagattt cggctttagg 2220
ttcagcgaca cccaaagata caactccatc gacgaattct acagggaggt cgagaaccag 2280
gggtacaagc tcaccttcga aaacatctcc gagtcataca tcgactctgt ggttaaccag 2340
ggaaaattgt acctcttcca gatttataac aaggacttca gcgcctactc caaggggaga 2400
ccaaatctcc acaccctcta ctggaaggcc ttgtttgacg aaaggaactt gcaggacgtt 2460
gtttacaaac tcaacgggga ggctgagctc ttttacagga agcagtccat ccccaagaaa 2520
atcacccatc ccgccaagga ggccatcgcc aataagaata aagacaaccc taaaaaggag 2580
agcgtgttcg agtacgacct catcaaagac aagaggttca ccgaggacaa gttcttcttc 2640
cactgcccca tcaccatcaa cttcaagagc agcggggcca acaagttcaa cgatgagata 2700
aacttgctcc ttaaggagaa ggccaacgac gttcacattc tctccataga caggggggag 2760
aggcaccttg cttactacac ccttgtcgac ggtaagggca atattataaa gcaagacacc 2820
ttcaacatca tcggcaacga caggatgaag actaactacc atgacaagct cgccgccatc 2880
gagaaggata gggactccgc aaggaaggac tggaagaaga tcaacaacat caaagagatg 2940
aaggagggct acctcagcca ggtcgttcat gagatcgcca agctcgttat cgagtataac 3000
gccatcgtcg tctttgagga cctcaatttc ggattcaaga gggggagatt caaggtggag 3060
aagcaggtct accagaagct cgagaagatg ctcattgaaa agctcaatta cctcgtgttc 3120
aaggacaacg agttcgacaa gacaggtggc gtgctcaggg cctaccaatt gaccgccccc 3180
tttgaaacct ttaagaagat gggaaagcag accgggatca tatattatgt ccccgccggc 3240
ttcacaagca agatttgccc cgttaccgga ttcgtcaacc aactctaccc aaaatatgag 3300
agtgtcagta agtcacagga gttctttagc aaattcgaca aaatctgcta taatctcgac 3360
aagggctact tcgagttcag cttcgactat aagaatttcg gggacaaagc cgccaagggc 3420
aagtggacca tcgccagctt tggcagcagg ctcatcaact tcaggaactc cgacaagaac 3480
cacaactggg acaccaggga ggtgtacccc accaaggagc tcgaaaaact ccttaaggac 3540
tattccatag agtacgggca cggggagtgc atcaaggccg ccatctgcgg cgagtccgat 3600
aaaaagttct tcgccaagct cacctctgtc cttaatacca ttctccaaat gagaaattcc 3660
aagaccggga ccgagctcga ctacctcatc agcccagtcg ccgacgtgaa tgggaacttc 3720
ttcgactcaa ggcaggcccc caagaacatg cctcaggacg ccgacgccaa cggcgcctat 3780
cacataggcc tcaagggtct catgctcctc gggaggatca agaacaacca ggagggcaag 3840
aagctcaacc tcgttatcaa gaacgaggag tacttcgagt tcgttcagaa taggaacaac 3900
<210> 36
<211> 3900
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 36
atgtctattt accaggagtt cgtgaacaag tacagcctca gcaagaccct cagattcgag 60
ctcatccccc aggggaaaac cctcgagaat attaaggcca gagggcttat tctcgacgac 120
gagaagaggg ccaaagacta taagaaggca aagcagatca tcgacaagta ccaccagttt 180
ttcatcgagg agatcctcag cagtgtgtgc atcagcgaag atctcctcca gaactactct 240
gacgtctact tcaagctcaa aaaaagcgac gacgacaact tgcaaaagga cttcaaatcc 300
gccaaggata ctatcaagaa gcaaatcagc gaatatatca aggactccga aaagttcaag 360
aatctcttta atcaaaatct catcgacgcc aaaaagggcc aagagtctga cttgatcctc 420
tggctcaagc agtccaaaga caatgggatc gagctcttca aagcaaactc tgacattact 480
gacattgacg aggccctcga gatcatcaag tcattcaagg ggtggaccac atacttcaag 540
gggttccacg agaacaggaa gaacgtgtac agctccaacg acatacccac cagcatcata 600
tacagaatcg ttgacgataa cctccccaag ttccttgaaa ataaggccaa gtatgagtcc 660
ttgaaggaca aagctcccga ggccataaac tacgagcaga taaagaagga tcttgcagag 720
gagctcacct tcgacattga ctacaagacc agcgaggtta accagagagt cttctcactc 780
gacgaggtgt tcgagatcgc caacttcaac aattatttga accagtcagg gataaccaaa 840
ttcaacacca ttatcggtgg caaattcgtg aacggggaga acaccaagag gaaaggtatc 900
aacgagtaca tcaatcttta ttctcaacag atcaacgaca agacactcaa gaaatacaag 960
atgagcgtcc tctttaagca gatactcagc gacaccgaaa gcaaatcctt tgtcatcgat 1020
aagctcgaag acgattccga cgtggtcacc accatgcaaa gcttctacga gcagatcgcc 1080
gctttcaaaa ccgtcgaaga gaagagtatc aaggagactc tcagcctttt gttcgacgac 1140
ttgaaggccc agaaacttga cctctcaaag atctacttca agaacgataa gtctctcact 1200
gacctctccc agcaggtctt cgacgattac agcgtcattg gaaccgctgt gttggagtac 1260
attacccagc agattgcccc caagaacctc gacaaccctt ctaaaaagga gcaagaactc 1320
atagccaaga agaccgagaa ggccaagtac ctctccctcg agacaatcaa gctcgccttg 1380
gaagagttca ataagcacag ggatatcgac aagcagtgta ggttcgagga gatcctcgct 1440
aactttgccg ccatacccat gatctttgac gagatcgccc agaacaaaga caatctcgcc 1500
cagatatcca taaagtacca gaatcagggt aaaaaggatc tcctccaagc cagcgccgag 1560
gacgacgtta aggcaatcaa ggacctcctt gaccagacca ataacctcct ccacaagctc 1620
aagatctttc acattagcca gagcgaggat aaggcaaata tcctcgataa ggacgagcac 1680
ttctacctcg tgttcgaaga gtgctacttc gaactcgcca atatcgtccc cttgtacaac 1740
aaaatcagaa actacataac ccagaagcca tattccgacg agaaatttaa gctcaacttc 1800
gagaactcca cactcgccaa cgggtgggac aaaaacaaag agcctgataa taccgccatc 1860
ctcttcatca aggacgacaa gtattacctc ggagtgatga acaagaagaa caacaaaatc 1920
ttcgatgaca aggccatcaa ggagaataag ggcgagggct ataaaaagat cgtgtacaag 1980
ctcctccccg gggccaataa gatgctcccc aaggtgttct tctccgccaa gagtataaag 2040
ttctacaacc ccagtgaaga catcttgaga atcaggaacc acagcacaca cactaagaac 2100
ggaagccccc agaagggcta cgaaaaattc gagttcaaca tcgaggactg caggaaattc 2160
atcgacttct ataagcaatc catatccaag caccccgagt ggaaggactt tggcttcaga 2220
ttcagtgaca cccaaaggta taacagtatt gacgaattct acagggaggt cgagaaccag 2280
ggctacaagc tcaccttcga gaatatatcc gagagctaca tcgactccgt ggtcaatcag 2340
gggaagctct atctcttcca gatctacaac aaggacttct ctgcctactc aaaggggagg 2400
cccaacctcc acaccctcta ctggaaggca ttgtttgacg agagaaatct ccaggacgtt 2460
gtctacaagc tcaacgggga agccgagctc ttctacagga agcaatcaat cccaaaaaaa 2520
atcactcatc ccgccaaaga ggccatcgcc aacaaaaaca aagacaaccc caagaaagag 2580
agcgtctttg agtacgacct catcaaagac aagagattca ccgaggataa gttcttcttc 2640
cactgcccca tcactatcaa cttcaagtca tctggcgcta ataaattcaa cgacgagatc 2700
aacttgctcc ttaaggagaa ggccaacgac gtgcatatcc tcagcatcga caggggggaa 2760
aggcatctcg catactacac cctcgttgac ggcaagggca atattataaa gcaggatacc 2820
tttaacatca tcggcaacga cagaatgaag accaactacc acgacaagct cgcagccatc 2880
gagaaggaca gggactcagc caggaaggac tggaagaaga tcaacaatat caaagaaatg 2940
aaggaggggt acctctccca ggtggttcat gagatcgcaa agttggtgat cgagtataac 3000
gccatcgtcg tcttcgagga cttgaatttc ggcttcaaaa ggggcaggtt caaagtcgaa 3060
aagcaggtct atcagaagct cgagaaaatg ctcatcgaga agcttaatta cctcgtcttc 3120
aaagacaacg agttcgacaa gaccggtggc gtcctcaggg cctaccaact taccgccccc 3180
ttcgaaacct ttaagaagat ggggaaacag accggcatca tttactacgt gcccgccggt 3240
ttcacttcaa agatctgtcc tgttaccgga ttcgtcaacc agctctaccc taagtatgaa 3300
agcgtgtcca agagccagga gttcttttcc aagttcgaca agatatgtta caaccttgac 3360
aaggggtact ttgagttctc attcgactac aagaacttcg gggacaaagc cgcaaagggt 3420
aaatggacaa tcgccagctt cggaagcagg ctcatcaact tcaggaacag tgacaagaac 3480
cacaattggg acacaaggga agtgtatcct accaaggagc tcgagaaact cctcaaggac 3540
tactctatcg aatacggcca tggcgaatgc atcaaggccg ccatctgcgg ggagtccgat 3600
aagaagttct tcgccaagct cacctccgtt ctcaacacta tcctccagat gaggaacagc 3660
aagaccggca ccgagttgga ctacctcatc tctcccgtcg ccgacgttaa tggaaacttc 3720
ttcgatagca ggcaggcccc caaaaacatg ccacaagacg ccgacgccaa cggtgcttat 3780
cacattgggt tgaagggact tatgttgctc gggaggatca agaacaatca ggaaggcaag 3840
aaactcaacc tcgtgatcaa gaatgaggag tacttcgagt ttgtgcagaa cagaaacaac 3900
<210> 37
<211> 1320
<212> PRT
<213> Unknown
<220>
<223> Bacteria
<400> 37
Met Gln Gln Tyr Gln Val Ser Lys Thr Val Arg Phe Gly Leu Thr Leu
1 5 10 15
Lys Asn Ser Glu Lys Lys His Ala Thr His Leu Leu Leu Lys Asp Leu
20 25 30
Val Asn Val Ser Glu Glu Arg Ile Lys Asn Glu Ile Thr Lys Asp Asp
35 40 45
Lys Asn Gln Ser Glu Leu Ser Phe Phe Asn Glu Val Ile Glu Thr Leu
50 55 60
Asp Leu Met Asp Lys Tyr Ile Lys Asp Trp Glu Asn Cys Phe Tyr Arg
65 70 75 80
Thr Asp Gln Ile Gln Leu Thr Lys Glu Tyr Tyr Lys Val Ile Ala Lys
85 90 95
Lys Ala Cys Phe Asp Trp Phe Trp Thr Asn Asp Arg Gly Met Lys Phe
100 105 110
Pro Thr Ser Ser Ile Ile Ser Phe Asn Ser Leu Lys Ser Ser Asp Lys
115 120 125
Ser Lys Thr Ser Asp Asn Leu Asp Arg Lys Lys Lys Ile Leu Asp Tyr
130 135 140
Trp Lys Gly Asn Ile Phe Lys Thr Gln Lys Ala Ile Lys Asp Val Leu
145 150 155 160
Asp Ile Thr Glu Asp Ile Gln Lys Ala Ile Glu Glu Lys Lys Ser His
165 170 175
Arg Glu Ile Asn Arg Val Asn His Arg Lys Met Gly Ile His Leu Ile
180 185 190
His Leu Ile Asn Asp Thr Leu Val Pro Leu Cys Asn Gly Ser Ile Phe
195 200 205
Phe Gly Asn Ile Ser Lys Leu Asp Phe Cys Glu Ser Glu Asn Glu Lys
210 215 220
Leu Ile Asp Phe Ala Ser Thr Glu Lys Gln Asp Glu Arg Lys Phe Leu
225 230 235 240
Leu Ser Lys Ile Asn Glu Ile Lys Gln Tyr Phe Glu Asp Asn Gly Gly
245 250 255
Asn Val Pro Phe Ala Arg Ala Thr Leu Asn Arg His Thr Ala Asn Gln
260 265 270
Lys Pro Asp Arg Tyr Asn Glu Glu Ile Lys Lys Leu Val Asn Glu Leu
275 280 285
Gly Val Asn Ser Leu Val Arg Ser Leu Lys Ser Lys Thr Ile Glu Glu
290 295 300
Ile Lys Thr His Phe Glu Phe Glu Asn Lys Asn Lys Ile Asn Glu Leu
305 310 315 320
Lys Asn Ser Phe Val Leu Ser Ile Val Glu Lys Ile Gln Leu Phe Lys
325 330 335
Tyr Lys Thr Ile Pro Ala Ser Val Arg Phe Leu Leu Ala Asp Tyr Phe
340 345 350
Glu Glu Gln Lys Leu Ser Thr Lys Glu Glu Ala Leu Thr Ile Phe Glu
355 360 365
Glu Ile Gly Lys Pro Gln Asn Ile Gly Phe Asp Tyr Ile Gln Leu Lys
370 375 380
Glu Lys Asp Asn Phe Thr Leu Lys Lys Tyr Pro Leu Lys Gln Ala Phe
385 390 395 400
Asp Tyr Ala Trp Glu Asn Leu Ala Arg Leu Asp Gln Asn Pro Lys Ala
405 410 415
Asn Gln Phe Ser Val Asp Glu Cys Lys Arg Phe Phe Lys Glu Val Phe
420 425 430
Ser Met Glu Met Asp Asn Ile Asn Phe Lys Thr Tyr Ala Leu Leu Leu
435 440 445
Ala Leu Lys Glu Lys Thr Thr Ala Phe Asp Lys Lys Gly Glu Gly Ala
450 455 460
Ala Lys Asn Lys Ser Glu Ile Ile Glu Gln Ile Lys Gly Val Phe Glu
465 470 475 480
Glu Leu Asp Gln Pro Phe Lys Ile Ile Ala Asn Thr Leu Arg Glu Glu
485 490 495
Val Ile Lys Lys Glu Asp Glu Leu Asn Val Leu Lys Arg Gln Tyr Arg
500 505 510
Glu Thr Asp Arg Lys Ile Lys Thr Leu Gln Asn Glu Ile Lys Lys Ile
515 520 525
Lys Asn Gln Ile Lys Asn Leu Glu Asn Ser Lys Lys Tyr Ser Phe Pro
530 535 540
Glu Ile Ile Lys Trp Ile Asp Leu Thr Glu Gln Glu Gln Leu Leu Asp
545 550 555 560
Lys Asn Lys Gln Ala Lys Ser Asn Tyr Gln Lys Ala Lys Gly Asp Leu
565 570 575
Gly Leu Ile Arg Gly Ser Gln Lys Thr Ser Ile Asn Asp Tyr Phe Tyr
580 585 590
Leu Thr Asp Lys Val Tyr Arg Lys Leu Ala Gln Asp Phe Gly Lys Lys
595 600 605
Met Ala Asp Leu Arg Glu Lys Leu Leu Asp Lys Asn Asp Val Asn Lys
610 615 620
Ile Lys Tyr Leu Ser Tyr Ile Val Lys Asp Asn Gln Gly Tyr Gln Tyr
625 630 635 640
Thr Leu Leu Lys Pro Leu Glu Asp Lys Asn Ala Glu Ile Ile Glu Leu
645 650 655
Lys Ser Glu Pro Asn Gly Asp Leu Lys Leu Phe Glu Ile Lys Ser Leu
660 665 670
Thr Ser Lys Thr Leu Asn Lys Phe Ile Lys Asn Lys Gly Ala Tyr Lys
675 680 685
Glu Phe His Ser Ala Glu Phe Glu His Lys Lys Ile Lys Glu Asp Trp
690 695 700
Lys Asn Tyr Lys Tyr Asn Ser Asp Phe Ile Val Lys Leu Lys Lys Cys
705 710 715 720
Leu Ser His Ser Asp Met Ala Asn Thr Gln Asn Trp Lys Ala Phe Gly
725 730 735
Trp Asp Leu Asp Lys Cys Lys Ser Tyr Glu Thr Ile Glu Lys Glu Ile
740 745 750
Asp Gln Lys Ser Tyr Gln Leu Val Glu Ile Lys Leu Ser Lys Thr Thr
755 760 765
Ile Glu Lys Trp Val Lys Glu Asn Asn Tyr Leu Leu Leu Pro Ile Val
770 775 780
Asn Gln Asp Ile Thr Ala Glu Lys Leu Lys Val Asn Thr Asn Gln Phe
785 790 795 800
Thr Lys Asp Trp Gln His Ile Phe Glu Lys Asn Pro Asn His Arg Leu
805 810 815
His Pro Glu Phe Asn Ile Ala Tyr Arg Gln Pro Thr Lys Asp Tyr Ala
820 825 830
Lys Glu Gly Glu Lys Arg Tyr Ser Arg Phe Gln Leu Thr Gly Gln Phe
835 840 845
Met Tyr Glu Tyr Ile Pro Gln Asp Ala Asn Tyr Ile Ser Arg Lys Glu
850 855 860
Gln Ile Thr Leu Phe Asn Asp Lys Glu Glu Gln Lys Ile Gln Val Glu
865 870 875 880
Thr Phe Asn Asn Gln Ile Ala Lys Ile Leu Asn Ala Glu Asp Phe Tyr
885 890 895
Val Ile Gly Ile Asp Arg Gly Ile Thr Gln Leu Ala Thr Leu Cys Val
900 905 910
Leu Asn Lys Asn Gly Val Ile Gln Gly Gly Phe Glu Ile Phe Thr Arg
915 920 925
Glu Phe Asp Tyr Thr Asn Lys Gln Trp Lys His Thr Lys Leu Lys Glu
930 935 940
Asn Arg Asn Ile Leu Asp Ile Ser Asn Leu Lys Val Glu Thr Thr Val
945 950 955 960
Asn Gly Glu Lys Val Leu Val Asp Leu Ser Glu Val Lys Thr Tyr Leu
965 970 975
Arg Asp Glu Asn Gly Glu Pro Met Lys Asn Glu Lys Gly Val Ile Leu
980 985 990
Thr Lys Asp Asn Leu Gln Lys Ile Lys Leu Lys Gln Leu Ala Tyr Asp
995 1000 1005
Arg Lys Leu Gln Tyr Lys Met Gln His Glu Pro Glu Leu Val Leu
1010 1015 1020
Ser Phe Leu Asp Arg Leu Glu Asn Lys Glu Gln Ile Pro Asn Leu
1025 1030 1035
Leu Ala Ser Thr Lys Leu Ile Ser Ala Tyr Lys Glu Gly Thr Ala
1040 1045 1050
Tyr Ala Asp Ile Asp Ile Glu Gln Phe Trp Asn Ile Leu Gln Thr
1055 1060 1065
Phe Gln Thr Ile Val Asp Lys Phe Gly Gly Ile Glu Asn Ala Lys
1070 1075 1080
Lys Thr Met Glu Phe Arg Gln Tyr Thr Glu Leu Asp Ala Ser Phe
1085 1090 1095
Asp Leu Lys Asn Gly Val Val Ala Asn Met Val Gly Val Val Lys
1100 1105 1110
Phe Ile Met Glu Lys Tyr Asn Tyr Lys Thr Phe Ile Ala Leu Glu
1115 1120 1125
Asp Leu Thr Phe Ala Phe Gly Gln Ser Ile Asp Gly Ile Asn Gly
1130 1135 1140
Glu Arg Leu Arg Ser Thr Lys Glu Asp Lys Glu Val Asp Phe Lys
1145 1150 1155
Glu Gln Glu Asn Ser Thr Leu Ala Gly Leu Gly Thr Tyr His Phe
1160 1165 1170
Phe Glu Met Gln Leu Leu Lys Lys Leu Ser Lys Thr Gln Ile Gly
1175 1180 1185
Asn Glu Ile Lys His Phe Val Pro Ala Phe Arg Ser Thr Glu Asn
1190 1195 1200
Tyr Glu Lys Ile Val Arg Lys Asp Lys Asn Val Lys Ala Lys Ile
1205 1210 1215
Val Ser Tyr Pro Phe Gly Ile Val Ser Phe Val Asn Pro Arg Asn
1220 1225 1230
Thr Ser Ile Ser Cys Pro Asn Cys Lys Asn Ala Asn Lys Ser Asn
1235 1240 1245
Arg Ile Lys Lys Glu Asn Asp Arg Ile Leu Cys Lys His Asn Ile
1250 1255 1260
Glu Lys Thr Lys Gly Asn Cys Gly Phe Asp Thr Ala Asn Phe Asp
1265 1270 1275
Glu Asn Lys Leu Arg Ala Glu Asn Lys Gly Lys Asn Phe Lys Tyr
1280 1285 1290
Ile Ser Ser Gly Asp Ala Asn Ala Ala Tyr Asn Ile Ala Val Lys
1295 1300 1305
Leu Leu Glu Asp Lys Ile Phe Glu Ile Asn Lys Lys
1310 1315 1320
<210> 38
<211> 3960
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 38
atgcagcaat atcaggtgtc taagacagtg cgtttcgggc ttaccttaaa aaactcagaa 60
aaaaaacatg ctacccactt acttttgaag gatctcgtta atgtttctga agaaagaata 120
aaaaacgaga ttactaaaga tgataagaac cagtctgagc tgtctttttt caatgaggtg 180
atagaaacct tggatcttat ggataagtac attaaggact gggagaactg cttttatagg 240
acagaccaaa tccagttgac aaaggagtac tataaagtga tagcaaaaaa ggcttgcttt 300
gattggtttt ggacaaatga tagaggtatg aagttcccaa catcctccat aattagcttt 360
aattccctaa agtcctcaga caagtcaaag acttcagata atttggatag aaagaagaag 420
atccttgatt attggaaggg aaacattttt aaaactcaga aagctattaa ggatgttttg 480
gatattacag aggacataca aaaagcaata gaggaaaaaa aatcacatcg tgaaattaat 540
agagtaaatc ataggaagat gggcatccac ctcattcact tgattaatga tactcttgtg 600
cctttgtgta atggaagtat cttcttcggg aacatctcaa aactggattt ctgtgagtcc 660
gaaaatgaaa agctcattga tttcgccagc acagagaaac aagacgagag gaagttcctg 720
ctttcaaaaa taaacgaaat taagcagtac ttcgaggata atggagggaa cgttccattt 780
gctagagcaa ccctgaaccg acacactgca aaccaaaaac ctgatcgtta taacgaggaa 840
atcaagaagc tggtgaacga gcttggagtg aacagccttg tgcgatccct taagtctaag 900
accatagaag agattaagac acattttgag ttcgagaata agaacaagat taacgaattg 960
aaaaattctt tcgttctttc tattgttgaa aagatccaat tgtttaagta caaaactata 1020
ccagcatcag ttagatttct gctagcagat tactttgagg agcaaaagct ttctactaag 1080
gaggaagctt taactatttt cgaggagatc ggaaagcccc aaaacatagg cttcgactat 1140
attcagctta aggagaaaga taacttcaca cttaaaaagt atcccttaaa gcaagcattc 1200
gactacgctt gggagaactt agctagacta gatcaaaatc cgaaagctaa tcagttctct 1260
gttgatgaat gtaagaggtt tttcaaggaa gttttctcga tggagatgga taacataaac 1320
ttcaaaacct atgctctctt actcgcttta aaggagaaga ctacagcttt tgacaaaaag 1380
ggggaaggcg ccgcaaaaaa taaatctgag attatcgaac agatcaaggg cgtgtttgag 1440
gagttggatc aaccctttaa gatcattgcc aatactctaa gggaggaggt tataaaaaaa 1500
gaggatgagt taaatgtact caaacgacaa tatcgtgaaa ccgataggaa gatcaaaaca 1560
ttgcagaacg aaatcaagaa gataaagaac caaattaaaa acttggaaaa cagcaagaag 1620
tattcgttcc ccgaaattat taagtggatc gacttgaccg agcaagaaca actactggat 1680
aagaacaaac aggcaaagag taattatcaa aaagccaagg gtgacttggg tttgattcgc 1740
gggagccaaa aaacatccat taatgattat ttttacttga ccgataaagt ttatcggaag 1800
cttgcccaag atttcgggaa gaagatggca gatttaagag aaaaattgct cgacaagaac 1860
gatgtaaaca aaattaagta tttgtcttac atcgtaaagg acaatcaagg ataccagtac 1920
accctgctaa aacctttgga agacaaaaac gcagaaatta ttgagcttaa atctgagccc 1980
aatggtgatc ttaagttgtt cgaaataaag tctcttacct ctaagacatt gaacaagttt 2040
atcaaaaata agggcgctta taaggaattt cattctgccg aattcgaaca taaaaagatt 2100
aaggaagatt ggaaaaacta caaatacaat tcggatttta ttgttaagtt aaagaagtgt 2160
ctgtcacatt ccgacatggc aaacactcag aattggaagg ccttcgggtg ggatttggat 2220
aagtgcaaat cttatgaaac aattgaaaaa gaaattgatc aaaagagtta tcaactcgtc 2280
gagatcaaac tctctaagac aaccattgaa aagtgggtga aggaaaataa ttacttgctt 2340
cttcccatcg ttaaccaaga tattaccgca gagaagctta aggttaacac aaaccagttt 2400
actaaggatt ggcaacatat ttttgagaaa aaccctaacc acagacttca cccagagttt 2460
aacatcgcat accggcaacc cactaaagat tatgctaaag aaggtgaaaa acgctactct 2520
cggttccaac tgactggtca atttatgtat gaatacatcc ctcaagacgc caattatatc 2580
tcacgcaagg agcaaattac actttttaac gataaggaag agcagaaaat tcaagtcgaa 2640
actttcaaca accaaattgc aaagattcta aatgcagagg atttttatgt aattggaata 2700
gatcgtggaa tcactcaatt agctaccctt tgtgtgctta acaagaacgg agttattcag 2760
ggtgggttcg aaattttcac tagggagttc gactatacga acaaacagtg gaaacataca 2820
aagttgaagg aaaaccgtaa catccttgat atctcaaatt tgaaggtaga aacgaccgtt 2880
aacggcgaaa aggttctcgt ggatctaagt gaagtaaaga catacctgag ggacgagaat 2940
ggtgaaccaa tgaagaatga aaagggcgtg atattgacca aagataacct gcagaagatc 3000
aagttaaaac agctagctta cgatcgcaaa ctacaataca agatgcaaca tgaacctgag 3060
ttggtgctat cgtttctgga tcgtctcgaa aacaaagaac aaataccgaa ccttcttgca 3120
tcaaccaaat tgatttccgc ctataaggaa ggaactgctt acgcagatat tgacatcgaa 3180
caattttgga atattttgca aacgttccag acaattgttg acaagtttgg cgggattgaa 3240
aatgcaaaaa agaccatgga attccgtcag tatactgaac ttgatgccag ttttgatctt 3300
aaaaatggag ttgttgctaa tatggttggc gtcgtaaaat tcattatgga gaagtataac 3360
tataagactt tcatagcctt agaggacctt acttttgcat tcggtcagtc tattgatggc 3420
atcaacggtg agagacttcg atccactaag gaagacaaag aagttgattt caaagagcag 3480
gagaatagca ccttagcggg tttgggtacc taccactttt tcgaaatgca actcctaaaa 3540
aagttaagca agacccaaat aggcaacgag attaaacact ttgtgcctgc ttttcgatcc 3600
accgagaatt acgagaagat tgtgcgcaag gataagaacg ttaaagccaa aattgtgagc 3660
tacccttttg ggatcgttag cttcgttaat cctaggaata cttccataag ttgtccaaac 3720
tgtaagaatg ctaacaagag taataggatc aaaaaggaga atgacagaat tctctgtaag 3780
cacaatattg aaaagacaaa gggcaattgt ggtttcgata ccgcaaattt tgatgaaaat 3840
aaacttcgtg ctgagaacaa gggcaagaac ttcaaatata tttcaagtgg cgatgctaac 3900
gcagcttata acatagctgt taaactcctt gaagacaaaa ttttcgaaat taataaaaag 3960
<210> 39
<211> 3960
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 39
atgcagcaat atcaggttag caaaactgtg agattcggct tgacactcaa gaactccgaa 60
aagaagcatg ctactcacct cctcctcaag gacctcgtca acgttagcga ggagagaatc 120
aagaacgaga taaccaagga cgataagaac caatcagagc ttagtttctt caatgaggtg 180
atagagaccc tcgacctcat ggataagtat atcaaagact gggaaaactg tttctacagg 240
accgatcaga tccagctcac caaggagtac tacaaggtca tcgccaagaa ggcctgcttc 300
gactggtttt ggaccaacga tagggggatg aaattcccca ccagctccat catcagcttc 360
aacagcctca agtccagtga caagtccaag accagcgata accttgacag aaagaagaaa 420
atcctcgact actggaaggg taacatcttc aaaacccaga aggccatcaa agatgttctt 480
gacatcactg aggacatcca gaaggccatc gaggagaaaa aatcccacag ggagatcaac 540
agagtcaatc acaggaagat ggggatccat ttgatccacc ttatcaacga caccctcgtg 600
cccctctgca acggatccat cttcttcggc aacatcagca agcttgactt ctgcgagtcc 660
gagaacgaga agctcattga ctttgcctcc acagagaagc aggacgagag gaagttcctc 720
ctctccaaaa tcaacgagat caagcagtat tttgaagaca acggcgggaa cgttcccttt 780
gccagggcca ccctcaacag gcacaccgcc aaccagaagc ctgacaggta caatgaagaa 840
atcaaaaagc ttgtgaacga gctcggggtg aactccctcg ttaggtctct caagtccaag 900
accatcgaag agatcaagac ccacttcgag tttgagaaca agaacaagat caacgagctc 960
aagaactcct tcgtcctctc aatagtcgag aagatccaac tcttcaagta caagaccatc 1020
cccgccagtg ttaggtttct cttggccgac tacttcgaag aacagaagct ctccaccaaa 1080
gaggaggccc tcaccatatt tgaggaaatc gggaagcctc agaacatagg attcgactac 1140
attcagctta aagaaaagga caatttcacc ctcaagaaat accctctcaa gcaagcattc 1200
gactacgcct gggagaacct cgccagactc gaccaaaacc ccaaagccaa ccagttctcc 1260
gtggacgagt gcaagaggtt ctttaaggaa gtgttctcta tggagatgga caatattaac 1320
ttcaagactt atgcccttct cctcgctctc aaggagaaga ccacagcatt cgataaaaag 1380
ggcgagggcg ccgctaagaa caagagcgag atcatcgagc agatcaaggg tgtcttcgaa 1440
gaattggacc agcccttcaa gattatcgcc aacaccctca gggaagaggt tataaaaaag 1500
gaggatgagc tcaacgttct caagaggcag tacagggaga cagataggaa gatcaaaaca 1560
ctccagaacg agatcaagaa gatcaagaac caaatcaaga accttgagaa ctccaagaag 1620
tactcatttc ccgagatcat caaatggatc gatctcaccg agcaagagca gctcctcgac 1680
aagaacaagc aggccaagtc caactaccag aaggccaagg gtgatctcgg cctcatcagg 1740
gggtcccaaa aaacatccat caatgactat ttttacttga ccgataaggt ctacaggaag 1800
ctcgcccaag acttcggcaa gaagatggct gacctcaggg agaagcttct cgacaagaac 1860
gacgtcaata agatcaaata tctcagctac atcgttaagg acaatcaggg ataccagtac 1920
acactcttga aaccactcga ggacaaaaac gccgagatca tcgagctcaa gtctgaaccc 1980
aatggcgacc tcaagctctt cgaaatcaaa agtctcacca gtaagaccct caacaaattc 2040
atcaagaaca agggtgccta taaggagttt catagcgccg agttcgaaca taagaaaatc 2100
aaagaggact ggaagaatta caagtacaat tctgatttca tcgtgaagct caagaagtgc 2160
ttgagccact ccgatatggc caacactcag aactggaagg ccttcgggtg ggacctcgat 2220
aagtgcaagt cctacgaaac aatcgaaaag gagattgatc aaaagtccta tcagctcgtt 2280
gagatcaagc tctcaaagac caccatcgag aaatgggtga aggagaacaa ctacctcctc 2340
ctccccatcg tgaaccaaga tatcacagcc gagaagttga aggtgaacac caatcagttc 2400
actaaggact ggcagcacat cttcgaaaag aaccccaacc acaggctcca cccagagttc 2460
aacatcgcct acaggcaacc cactaaagac tacgccaagg aaggcgaaaa aaggtactcc 2520
aggttccaac tcacagggca gttcatgtac gagtacatac ctcaggacgc taactacatc 2580
agcaggaagg aacagatcac cctctttaat gataaggagg aacaaaagat tcaggtggag 2640
acattcaaca accagatagc caaaatcctc aacgccgagg acttctacgt gatcggcatc 2700
gacagaggca ttacccagct cgccactctc tgcgtgctca acaagaacgg agtcatccag 2760
ggggggtttg agatcttcac aagggaattt gattacacca acaagcagtg gaaacatacc 2820
aagctcaagg aaaataggaa catcctcgac atctcaaacc ttaaggtcga aacaaccgtc 2880
aatggggaga aggttctcgt cgacctcagt gaggtcaaaa cctacctcag ggatgaaaac 2940
ggggagccca tgaagaacga gaagggggtc atactcacca aggacaacct ccagaaaatc 3000
aaactcaagc agctcgccta cgacaggaag ctccagtata agatgcagca tgaacccgag 3060
ctcgtcctca gcttcctcga caggctcgag aacaaggagc aaatccctaa cctcctcgcc 3120
agcaccaagc tcatctccgc ctacaaggaa ggaactgcct atgcagatat cgacattgag 3180
cagttctgga atatcctcca aaccttccag accatcgtgg acaagttcgg gggtattgag 3240
aacgccaaga agacaatgga gtttaggcaa tacaccgagc tcgacgcttc attcgacctc 3300
aagaatgggg ttgtggccaa catggttgga gttgtcaaat tcatcatgga gaagtacaac 3360
tacaagacct tcatagccct cgaggacctc acctttgcct tcggtcagag tatcgatggg 3420
ataaacggcg aaaggctcag gagtaccaag gaggacaagg aggtcgactt caaggagcag 3480
gagaacagca ccctcgccgg gttgggcaca taccattttt ttgagatgca actcctcaag 3540
aagctctcta agacccagat cggcaatgag atcaaacact tcgtccccgc ctttaggtcc 3600
actgagaact acgagaaaat cgtgaggaaa gataaaaacg ttaaggctaa gatcgtctct 3660
tatccctttg ggatcgtctc attcgtgaac cccaggaata cctccataag ctgcccaaac 3720
tgcaaaaacg ccaacaagag taataggata aagaaggaaa acgataggat actctgcaag 3780
cataacatcg aaaagaccaa aggcaactgt ggtttcgata ctgccaactt cgatgagaac 3840
aagttgaggg cagagaacaa gggcaagaat tttaagtaca tctcatccgg cgacgccaac 3900
gccgcataca acatagcagt caagctcctc gaggacaaga tcttcgaaat caacaagaag 3960
<210> 40
<211> 3960
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 40
atgcagcagt accaagtctc caaaaccgtg agattcgggc tcactctcaa gaatagcgag 60
aagaagcatg ccacccatct cctcctcaag gaccttgtta atgtttccga ggagagaatc 120
aaaaacgaga taacaaagga cgacaagaac cagtcagagc tcagcttctt caacgaggtg 180
atcgagacac tcgatttgat ggacaagtac atcaaggact gggagaactg cttctatagg 240
accgaccaga ttcagctcac aaaagaatat tacaaggtta tcgccaaaaa ggcatgtttc 300
gactggtttt ggaccaacga cagaggcatg aagttcccca ccagcagcat aatctccttt 360
aactctctca agagctccga taagtccaaa accagcgaca accttgatag gaagaaaaag 420
attctcgact actggaaggg caacattttc aagacacaga aggccatcaa ggacgtgctc 480
gacattaccg aggacatcca gaaggcaatc gaggaaaaga aatcacatag ggagataaac 540
agagtgaatc acagaaaaat gggaatccac ttgatccacc tcatcaacga cacacttgtg 600
cccctctgca acgggtccat ttttttcggt aacatcagca agctcgactt ctgcgagagt 660
gagaacgaga aactcatcga cttcgccagc accgagaagc aagatgaaag aaagttcctt 720
ctcagcaaga tcaacgaaat taaacagtac ttcgaggaca atggcggcaa cgtgcctttc 780
gccagggcca ctctcaatag gcacaccgcc aaccagaagc cagacaggta caacgaagag 840
atcaagaagc tcgttaacga attgggagtc aactccctcg tcagaagcct caagtcaaag 900
actatcgagg agatcaagac ccatttcgag ttcgagaaca agaataagat taacgagctc 960
aagaatagct tcgtcctctc catcgtcgag aagatccaac tcttcaaata caagaccatc 1020
cccgccagcg tcaggttcct cctcgccgac tatttcgagg agcaaaagct ctccaccaaa 1080
gaagaagccc ttacaatttt cgaggaaata ggcaagcctc agaacatcgg gttcgactac 1140
atccagctta aggagaagga caatttcacc ctcaaaaagt accccctcaa gcaggccttc 1200
gactacgcct gggagaatct cgccaggctc gaccaaaatc ccaaggccaa ccagttctca 1260
gtggatgagt gcaagagatt cttcaaggaa gtcttcagca tggagatgga caacatcaac 1320
ttcaagacct acgcccttct cttggctttg aaggagaaaa ccaccgcctt tgacaaaaaa 1380
ggcgaggggg ctgccaaaaa caagagcgaa atcatcgagc aaattaaggg cgtcttcgag 1440
gagctcgacc agcccttcaa aataatcgcc aacaccttga gggaggaagt catcaagaag 1500
gaggacgagc tcaacgtgct caagagacag tacagggaga cagatagaaa aatcaagacc 1560
ctccagaacg agatcaagaa gattaagaac cagatcaaaa acttggagaa cagcaagaag 1620
tactcctttc cagaaatcat aaagtggata gacctcaccg agcaggagca gctcctcgat 1680
aagaacaagc aggccaagtc caactaccag aaagctaagg gcgatttggg cctcatcagg 1740
gggtcacaga agacctccat aaacgactac ttttacctta ccgacaaggt gtacagaaag 1800
cttgcccagg acttcggcaa aaagatggcc gacctcaggg agaagctcct cgacaaaaat 1860
gatgtcaata aaatcaagta cctctcctac atagtcaaag acaaccaggg gtaccagtat 1920
actctcctca aacccctcga ggataaaaac gccgaaataa tcgagctcaa gtccgaaccc 1980
aatggcgacc tcaagctctt tgaaatcaag tccctcacaa gcaagacctt gaataagttt 2040
atcaagaaca agggagccta taaggagttc cacagcgccg agtttgaaca caaaaagatc 2100
aaggaggact ggaagaacta caagtacaac agcgacttca tcgttaagct caagaagtgc 2160
ctcagccact ccgacatggc caacacccag aactggaagg ccttcgggtg ggacctcgac 2220
aagtgcaagt cctacgagac catcgagaag gaaatcgacc agaagagtta ccagcttgtt 2280
gagattaagt tgagcaagac aaccatcgag aagtgggtca aggaaaataa ttacctcctc 2340
ctccccatag tcaaccagga tatcaccgcc gagaagctca aagtcaacac caaccagttt 2400
accaaagatt ggcagcacat cttcgaaaag aacccaaacc acaggctcca ccccgagttc 2460
aatatcgcct acaggcaacc cactaaggat tacgctaaag agggcgagaa gaggtacagc 2520
aggttccagc tcactggcca gtttatgtac gagtatatcc cccaggacgc caactacatt 2580
tccaggaaag agcaaatcac cctcttcaac gacaaggagg aacagaagat ccaggtcgag 2640
acatttaaca accagatcgc aaaaatcctc aatgctgaag atttctatgt catcgggatc 2700
gacaggggga tcacccagct cgctaccctc tgcgttctca ataagaacgg agtgatccag 2760
gggggcttcg agatctttac cagggagttt gactatacca ataagcaatg gaaacatacc 2820
aagctcaagg agaacagaaa tatcctcgac atcagcaact tgaaggttga aaccactgtg 2880
aacggcgaga aggtactcgt cgacttgagt gaggtgaaaa catacctcag ggacgaaaac 2940
ggtgagccca tgaagaatga gaagggggtg atcctcacca aggacaatct ccagaagatc 3000
aagttgaagc agttggccta cgacaggaag ctccaataca aaatgcagca tgagcccgag 3060
ctcgtgctct ccttcctcga taggctcgag aacaaggagc agatccccaa tctcctcgcc 3120
tccaccaaac tcatctctgc ctacaaggag ggcaccgcct acgccgacat cgacatcgag 3180
cagttttgga acatcctcca aacctttcag accatcgtcg acaaatttgg gggcatcgag 3240
aacgctaaga agacaatgga gtttagacaa tataccgagc tcgacgcctc ctttgacctc 3300
aaaaacggcg tcgtcgccaa catggtgggc gtggttaagt tcataatgga aaagtacaac 3360
tacaagactt tcatcgcact cgaggatctc acctttgcat ttggacaaag catcgacggc 3420
atcaatggag agaggctcag aagcacaaag gaggacaagg aggtcgactt caaggagcag 3480
gaaaactcaa ctctcgctgg cctcggcacc taccacttct tcgagatgca gctcctcaag 3540
aagctctcca aaacccagat cggaaacgag attaagcact tcgtccctgc cttcaggtct 3600
accgagaact acgagaagat cgtcaggaag gacaaaaatg tcaaagccaa aatcgtgtcc 3660
tacccattcg ggatcgtcag cttcgttaac cccaggaaca cctccatcag ctgccccaat 3720
tgcaaaaatg ccaacaagtc taacagaata aagaaggaga atgataggat cttgtgcaag 3780
cacaacattg aaaaaaccaa ggggaactgt ggttttgaca ccgcaaattt cgacgagaac 3840
aagcttagag ccgagaacaa aggtaagaac ttcaagtaca tcagcagcgg ggatgccaac 3900
gccgcataca acatcgccgt gaagcttctc gaggataaaa tcttcgagat caacaagaaa 3960
<210> 41
<211> 3960
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 41
atgcagcagt accaggttag caagacagtc aggttcggac ttaccctcaa gaactccgag 60
aagaaacacg ccactcacct cctcctcaaa gaccttgtta acgtgagcga ggagaggatc 120
aaaaacgaga tcacaaagga cgacaaaaac caaagcgagc tctccttctt caacgaggtt 180
atcgagacct tggaccttat ggataagtat atcaaggact gggagaactg cttctacagg 240
accgaccaga tacagctcac caaggagtat tacaaagtga ttgccaagaa ggcctgcttt 300
gactggtttt ggaccaacga cagagggatg aagtttccta cttcttccat catcagcttc 360
aacagcttga agagctccga caaatctaag accagcgaca acctcgacag aaagaagaag 420
atactcgact actggaaggg caacatcttc aaaacccaga aagccatcaa ggacgtgctc 480
gatataaccg aggatatcca aaaggcaatc gaggagaaga agagccacag ggaaatcaac 540
agggtcaacc acaggaaaat gggtatccac ctcatacacc tcatcaacga taccttggtt 600
cccttgtgca acgggagtat cttcttcggg aacatctcca agctcgactt ctgcgagagc 660
gaaaacgaga agctcattga cttcgccagc accgagaagc aggacgagag gaagtttctc 720
ctttccaaga tcaatgaaat caagcagtac ttcgaggaca acggcgggaa cgttcccttc 780
gccagggcaa ccttgaacag acacactgct aaccagaagc ccgacaggta caatgaggaa 840
atcaaaaagt tggtcaacga gctcggcgtc aacagcttgg tgaggtctct caagagcaag 900
accatcgagg agatcaagac ccacttcgaa ttcgagaata aaaacaagat caacgagctc 960
aaaaatagct ttgtgctttc catcgtcgaa aagattcagc tctttaagta taagaccatc 1020
cccgcatcag tgagattcct cctcgccgac tacttcgaag agcagaagct ctctaccaag 1080
gaagaagctc tcaccatttt tgaggagatc ggcaagcccc agaatatcgg cttcgactac 1140
atccagctca aggaaaaaga caactttacc ctcaagaaat accctctcaa gcaagccttt 1200
gactacgcct gggagaacct cgctagactc gaccagaatc ccaaagccaa ccagttcagt 1260
gtcgacgagt gcaagagatt cttcaaagag gtgttcagta tggaaatgga caatatcaac 1320
ttcaaaacct acgccctctt gctcgccctc aaagagaaga ccaccgcttt tgacaagaag 1380
ggagagggcg ccgccaagaa caaaagcgag atcatcgagc agatcaaagg ggtcttcgaa 1440
gaactcgatc agccattcaa gattatcgcc aacacactca gggaagaggt gatcaaaaag 1500
gaggacgagc ttaacgttct caagaggcag tatagggaga ccgataggaa gatcaagacc 1560
ttgcaaaatg agatcaaaaa gatcaagaac cagatcaaga acctcgaaaa cagcaaaaaa 1620
tactctttcc ccgaaattat aaaatggatc gatctcaccg aacaggaaca gctcctcgac 1680
aaaaacaagc aagccaaatc taactaccaa aaggccaaag gcgatctcgg cctcatcagg 1740
ggaagtcaga agaccagcat caatgactac ttctacctca ccgacaaggt ctacaggaag 1800
ctcgcccaag atttcggtaa gaaaatggca gacctcaggg agaaacttct cgacaagaat 1860
gacgtgaaca agatcaagta tctctcctac atcgttaaag acaaccaggg gtaccagtac 1920
acccttctca aaccactcga ggacaaaaac gccgaaatca tcgagctcaa gtccgagccc 1980
aatggtgacc tcaagctctt tgagatcaag agtcttactt ccaaaactct caacaagttt 2040
atcaagaata agggggccta caaggagttt cattcagccg agttcgagca caagaagatc 2100
aaggaagact ggaagaatta taaatacaac tccgatttca tcgttaagct caagaagtgc 2160
ttgagtcact ccgatatggc caacacccag aactggaaag ccttcggctg ggacctcgac 2220
aagtgcaaga gctatgagac catcgagaag gagatcgacc agaagagtta tcagttggtc 2280
gagatcaagc tcagtaagac cactatcgag aagtgggtta aggagaacaa ttatctcctc 2340
ttgcccatcg tcaaccagga tatcactgcc gaaaagctca aggtgaatac caaccaattt 2400
actaaggatt ggcagcacat cttcgagaag aacccaaacc acaggctcca tcccgaattt 2460
aacatcgcct acaggcagcc caccaaagac tacgccaaag agggcgagaa gaggtactca 2520
agattccagc ttaccgggca gttcatgtac gagtacatac cccaggacgc caattatatc 2580
agcaggaagg agcagatcac cctctttaat gacaaggagg agcagaagat ccaagtggag 2640
actttcaaca accagatcgc caaaatcctc aacgcagaag acttctacgt tattggcatt 2700
gataggggca ttacacaact cgccaccctt tgtgttctca acaaaaatgg ggtgatccag 2760
ggcggctttg agatattcac cagggaattc gactacacca acaagcagtg gaagcacacc 2820
aagctcaagg agaatagaaa catcctcgat atctccaatc tcaaagttga gaccacagtg 2880
aatggggaaa aggtcctcgt cgacctcagc gaggtcaaga cctacctcag ggacgagaac 2940
ggggagccta tgaaaaatga aaagggagtt atcctcacta aggacaacct ccaaaagatc 3000
aaactcaaac agctcgccta tgataggaaa ctccagtaca agatgcagca cgagcccgag 3060
ctcgtgctta gcttcctcga taggcttgag aacaaggagc agatcccaaa cctcttggcc 3120
agcaccaagc tcatcagcgc ctacaaggaa gggaccgcct acgcagacat cgatatcgaa 3180
cagttctgga atatcttgca gacctttcag accatagtgg acaagtttgg cggtatcgaa 3240
aatgctaaga agacaatgga gttcaggcag tacactgagc ttgatgcttc ctttgacctc 3300
aagaacggcg tcgttgccaa catggtgggg gtggtgaagt ttatcatgga gaaatacaac 3360
tacaagacat ttatagcact cgaagacctc accttcgcct tcggacagtc catcgacggc 3420
attaatgggg aaaggctcag aagcactaag gaggacaagg aggttgactt caaggagcag 3480
gaaaattcaa ccctcgcagg cctcggcacc taccacttct tcgagatgca actcttgaaa 3540
aagctctcta aaacccaaat cgggaacgaa atcaagcact ttgtgcccgc ttttaggtcc 3600
accgagaact acgagaagat cgtcaggaag gacaagaacg tgaaggccaa gatcgtgtcc 3660
taccccttcg ggatcgtctc tttcgtcaac cccagaaaca catcaatcag ctgtcccaac 3720
tgcaagaacg caaacaagag caacaggatc aagaaagaga atgacaggat cttgtgcaaa 3780
cacaacatcg agaagaccaa gggcaactgc ggcttcgaca cagccaactt cgacgaaaat 3840
aagctcagag ctgagaataa aggaaaaaac ttcaagtata tcagcagcgg cgacgctaat 3900
gccgcctaca acatcgccgt caagctcctc gaggacaaaa ttttcgaaat caacaagaag 3960
<210> 42
<211> 3960
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 42
atgcagcagt accaagtttc caagaccgtc agattcggct tgaccctcaa gaacagcgag 60
aagaagcacg ccacccacct cctcctcaaa gatctcgtta acgtttccga ggagaggatc 120
aagaacgaaa ttacaaagga cgacaagaac cagagcgagc tctctttttt taacgaggtc 180
atagagaccc tcgacctcat ggacaagtac atcaaagact gggaaaactg tttttacagg 240
accgaccaaa tccagttgac caaggagtac tataaggtga tagccaaaaa ggcctgcttt 300
gactggttct ggaccaacga taggggaatg aaattcccta cctcatcaat catatccttt 360
aactccctca agtcctccga caagtccaag actagcgaca acctcgacag gaagaagaag 420
atcctcgact attggaaggg gaacatcttc aagactcaaa aagccatcaa ggacgtcctc 480
gacatcaccg aggacatcca gaaggccata gaagagaaga agagccacag agagatcaac 540
agggtcaacc acaggaaaat gggtatccac ctcatccacc ttataaacga caccttggtc 600
cccctttgca atggtagcat attctttggc aacatcagca aactcgactt ctgcgagagc 660
gagaatgaaa agcttattga ctttgccagc accgagaaac aggacgaaag aaagttcctc 720
ctctccaaaa tcaacgagat aaagcaatat ttcgaggaca acggcggaaa cgtccccttc 780
gccagggcca ccctcaacag gcacaccgcc aatcaaaagc ccgacaggta caacgaggag 840
atcaaaaagt tggtgaacga gctcggggtc aactctctcg ttaggtccct caaatccaag 900
accatcgagg agatcaagac ccattttgag tttgagaaca agaataagat caacgagctt 960
aaaaattcct tcgtcctctc catcgtggag aagattcagc tcttcaagta taagactatc 1020
ccagcttccg tcagatttct cctcgcagac tactttgaag agcagaagct ctccaccaaa 1080
gaggaggccc tcaccatctt cgaagagatc ggcaagccac agaacattgg gttcgactac 1140
atccaactca aggaaaagga caacttcacc ctcaagaagt atcccctcaa gcaagccttt 1200
gactatgctt gggagaacct cgccaggctc gatcaaaacc ctaaggccaa ccagtttagt 1260
gtggacgagt gtaagagatt cttcaaagag gtctttagca tggaaatgga caacattaac 1320
ttcaagacat atgccctcct cttggccctc aaggagaaga ccacagcctt cgacaagaag 1380
ggcgaaggcg cagccaaaaa caagtccgag atcatcgagc agatcaaagg ggtttttgaa 1440
gagctcgacc agcccttcaa aatcatcgcc aacaccctca gggaggaggt gattaagaag 1500
gaggacgagc tcaatgttct caagaggcag tacagggaga ccgacaggaa gatcaagacc 1560
ctccagaatg agatcaagaa gataaagaac cagatcaaga acctcgagaa ctccaaaaag 1620
tatagctttc cagagataat caagtggatc gacctcaccg agcaagagca gctcctcgat 1680
aaaaacaagc aggccaagag caactaccag aaggcaaagg gggacctcgg gctcattagg 1740
gggagccaga agacctccat caacgactat ttctacctca ccgataaggt gtacaggaag 1800
ctcgctcagg acttcggaaa aaagatggcc gaccttaggg agaagctcct cgacaaaaac 1860
gacgttaaca agatcaagta cctctcttat atcgtgaaag acaatcaggg gtaccaatac 1920
accctcctca agcccttgga ggacaagaac gccgagatca tcgagctcaa gtccgagcct 1980
aacggcgacc tcaagctctt tgagatcaaa agtctcacct ccaaaacact taataagttc 2040
attaaaaaca aaggggccta taaggagttc cacagcgcag agtttgagca caaaaagatc 2100
aaggaggact ggaagaacta caagtacaac agcgacttca tagtgaagct taaaaagtgc 2160
ctctcccaca gcgacatggc caacacccag aactggaaag cattcgggtg ggacctcgac 2220
aagtgcaaaa gctacgagac aattgaaaag gagatcgacc aaaagtccta ccagctcgtt 2280
gagatcaagt tgagcaaaac caccatcgaa aagtgggtta aggagaataa ctatcttctc 2340
ctcccaatcg tcaaccaaga catcaccgca gagaaactca aagttaacac caaccaattc 2400
actaaggatt ggcagcacat ctttgagaag aaccccaacc acaggctcca ccccgagttc 2460
aatatcgcat acaggcaacc caccaaagac tacgccaagg agggcgagaa aaggtatagc 2520
agattccagc tcaccggcca gttcatgtac gagtacatac cccaggacgc aaattatatt 2580
agcaggaagg aacagatcac tctcttcaat gacaaggaag aacagaagat ccaagtcgaa 2640
acctttaaca accagatcgc aaagatcttg aacgccgagg acttctatgt tatcgggatc 2700
gacaggggca ttactcagct cgccaccctc tgcgtcctta acaaaaatgg cgttattcag 2760
ggcgggttcg agatcttcac aagggagttc gactacacca acaagcagtg gaaacacaca 2820
aaattgaagg agaacaggaa cattctcgac ataagcaacc ttaaggtgga aaccacagtg 2880
aacggggaaa aggttctcgt cgacctcagc gaagttaaga cctacctcag ggacgagaat 2940
ggcgagccaa tgaagaacga gaagggagtc atcctcacca aggacaacct ccagaagatc 3000
aagctcaagc agctcgccta tgacaggaag ctccagtaca agatgcaaca cgagcctgaa 3060
ctcgtcttga gcttcttgga taggctcgaa aacaaagagc agatccccaa cctcctcgca 3120
tccaccaagc tcatctccgc ctacaaagag gggaccgcct atgctgacat tgacatcgag 3180
cagttttgga acatcttgca gaccttccag accatcgtgg acaagttcgg cggcattgaa 3240
aatgctaaga agactatgga gtttagacag tacaccgagc tcgacgccag cttcgacttg 3300
aaaaacgggg ttgtggctaa catggttggc gtggtcaagt tcataatgga gaagtacaac 3360
tacaagacat ttatcgccct cgaagacctc actttcgcct tcggacaaag catagacggt 3420
atcaacggcg agaggcttag gtccaccaag gaggacaagg aggtggactt caaagagcag 3480
gaaaactcca ccctcgccgg gctcggaacc taccacttct tcgagatgca gctcctcaag 3540
aagctctcta agacacagat aggcaacgag attaagcact ttgtccctgc atttaggtca 3600
actgagaact acgagaagat cgtcaggaag gataagaatg ttaaagctaa gattgtcagc 3660
taccccttcg gtatcgtgtc cttcgttaac cccagaaaca ccagcatctc ttgtcccaac 3720
tgcaagaacg caaataagag caacaggatc aagaaggaga atgacaggat tctctgcaaa 3780
cacaacatcg agaagaccaa aggcaactgc ggcttcgaca ctgccaactt cgacgagaac 3840
aagctcaggg ccgagaacaa agggaagaac ttcaagtata tcagctcagg cgacgccaac 3900
gccgcctata acatcgccgt caaactcctt gaggacaaga tctttgaaat caacaagaag 3960
<210> 43
<211> 3960
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 43
atgcagcagt atcaggtgtc caaaaccgtc aggtttgggc tcaccctcaa gaactctgag 60
aagaagcatg ctacccacct cctcttgaaa gacctcgtta atgtgagcga agagaggatc 120
aaaaacgaga tcaccaaaga cgacaagaac caatccgagc tctccttttt taacgaagtt 180
atagagaccc tcgacctcat ggacaagtac atcaaggatt gggaaaattg cttctacagg 240
acagaccaga tacaactcac caaggagtac tacaaggtta tcgccaaaaa ggcttgtttc 300
gactggttct ggaccaacga cagggggatg aagttcccca ccagcagcat catttctttc 360
aactcattga agagctccga caagtcaaag acctctgaca acctcgacag aaagaagaag 420
attctcgact actggaaggg caacatcttc aagacccaga aggccatcaa ggatgttctc 480
gacatcaccg aggatatcca aaaagccatc gaagagaaga agtcccacag ggagatcaac 540
agagttaacc ataggaagat gggtatccac ctcatccatc tcataaatga cactttggtg 600
cccctttgca acggttctat cttcttcggt aacatatcta aattggactt ctgcgaatcc 660
gagaacgaga agctcatcga cttcgccagt actgagaagc aggacgagag gaagttcctc 720
ctcagcaaga tcaacgagat caagcagtac ttcgaggaca acggtgggaa cgtccccttc 780
gccagggcta cattgaacag gcacaccgcc aaccaaaaac cagacagata caacgaggag 840
atcaaaaagc tcgtcaacga gctcggcgtg aactcccttg tgaggtccct caaaagcaag 900
accatcgagg agatcaagac ccactttgag ttcgagaaca aaaacaagat caacgagttg 960
aagaactcct tcgtcctctc catcgtggaa aagatacagc ttttcaagta caaaaccata 1020
cccgcctccg tgaggttcct cctcgccgac tacttcgagg agcagaagct ctcaaccaaa 1080
gaggaggctt tgacaatctt cgaggagatc ggcaaacccc agaatatcgg ctttgattat 1140
atccagctca aagaaaagga taactttacc ctcaagaagt accccctcaa acaggccttc 1200
gactacgcct gggagaactt ggccaggctt gatcagaacc ccaaggccaa ccagttcagt 1260
gtggacgagt gcaagagatt cttcaaggaa gtgttcagca tggagatgga taacatcaac 1320
ttcaaaacct acgccctcct cttggccctc aaggagaaga caaccgcatt tgacaagaag 1380
ggcgagggcg ccgctaaaaa taaaagcgag atcatcgagc agatcaaggg ggtcttcgag 1440
gagctcgatc agcccttcaa gatcatagct aacaccctca gggaggaggt gatcaagaag 1500
gaggatgagc tcaacgtcct caagaggcaa tacagggaga cagacaggaa gatcaaaacc 1560
ctccagaacg agatcaagaa gatcaaaaac caaatcaaga acctcgagaa cagcaagaag 1620
tactccttcc ctgaaattat caagtggata gacctcaccg aacaggagca gctcctcgac 1680
aagaacaagc aggcaaagtc caactaccag aaggccaagg gcgaccttgg cctcattaga 1740
gggtcccaga agaccagcat caacgactac ttctacctca cagacaaagt gtacagaaag 1800
ctcgcccagg actttgggaa gaagatggcc gacctcagag agaaactcct cgacaaaaac 1860
gacgtcaaca agatcaagta cctttcatac atcgtgaagg acaaccaggg ctaccagtac 1920
acccttctta agccactcga ggacaagaac gctgaaatca tcgagctcaa gtccgagcca 1980
aatggagacc tcaagctttt cgagatcaag agcctcacca gcaagacctt gaataaattc 2040
attaaaaata agggagccta caaggagttc cacagcgccg agttcgagca caaaaagatt 2100
aaggaggact ggaaaaacta caagtacaac tccgacttca tcgttaaact caaaaaatgc 2160
ctcagccact ccgacatggc caacactcag aactggaagg ccttcggctg ggacctcgac 2220
aaatgcaaga gctatgagac catagagaag gagatcgacc agaagagtta ccagcttgtc 2280
gagatcaagc tttccaagac caccatcgag aagtgggtga aggagaacaa ctatctcctc 2340
cttcccattg ttaatcaaga catcaccgcc gaaaagctca aggttaacac caaccaattc 2400
accaaggact ggcagcacat ctttgaaaag aaccccaacc acaggttgca ccccgagttc 2460
aacatcgcct acaggcagcc caccaaggac tacgcaaagg agggcgagaa gaggtacagt 2520
agattccagt tgactggcca attcatgtac gagtacatcc cccaagatgc caactacatc 2580
agcaggaagg aacagattac cctcttcaac gacaaggagg agcaaaagat ccaagtcgag 2640
acattcaaca atcagatcgc caagatcctc aacgccgaag atttctatgt tatcggaata 2700
gacaggggga tcacccagct ggccaccctc tgcgttctca acaagaacgg tgtgatccag 2760
ggcggtttcg agattttcac aagagagttc gattacacca acaagcaatg gaagcacaca 2820
aaattgaaag agaacaggaa tatactcgac atctccaacc tcaaagtgga gactaccgtc 2880
aacggggaga aggtgttggt cgacctctcc gaggtgaaga cctaccttag agacgaaaat 2940
ggcgagccca tgaagaacga gaagggggtc atccttacca aggacaacct ccagaaaatc 3000
aaactcaagc agctcgccta cgacaggaag ctccagtaca agatgcagca tgaacccgag 3060
ctcgtgctct ccttcctcga cagactcgag aacaaggagc agatccccaa cctcctcgct 3120
agcaccaagc tcatcagcgc ttacaaggag ggcaccgcct acgccgatat cgacatcgaa 3180
cagttctgga atatcctcca gaccttccag accatcgttg acaaattcgg gggcatcgag 3240
aacgccaaga agaccatgga gttcaggcaa tacaccgagc ttgacgccag ctttgacctc 3300
aaaaacggcg tcgtcgccaa catggtcggt gtcgtcaagt tcattatgga gaagtacaac 3360
tacaagactt tcatcgccct cgaggacctc accttcgctt ttggccagtc cattgacggg 3420
attaatggcg agaggcttag gtccacaaag gaagacaaag aagtcgactt caaggagcag 3480
gagaactcca cactcgccgg actcggcact tatcactttt tcgaaatgca gttgctcaag 3540
aagctctcca aaactcagat agggaacgag atcaagcact tcgtgccagc attcaggtcc 3600
accgagaatt acgagaaaat cgtcaggaaa gacaaaaatg ttaaagccaa gattgtgagc 3660
taccccttcg gtatcgtcag cttcgtgaac cctagaaata ccagcatatc ctgtcccaac 3720
tgtaagaacg ccaacaagtc caacaggatc aagaaagaga acgacaggat cctctgtaag 3780
cacaatatcg agaagaccaa ggggaactgt ggcttcgaca cagccaattt cgacgagaac 3840
aagctcaggg ccgagaataa gggcaagaac ttcaaataca tcagctccgg ggacgccaac 3900
gccgcataca acatagccgt caagctcctc gaagacaaaa tatttgagat taacaagaag 3960
<210> 44
<211> 3960
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 44
atgcaacagt accaggtgtc caaaaccgtc aggtttggcc tcaccctcaa gaacagcgag 60
aaaaaacacg ccacccactt gctcttgaaa gaccttgtca acgtttccga ggaaaggatc 120
aagaacgaga tcaccaaaga cgacaagaac cagagcgagc tcagcttctt caacgaggtt 180
atcgagaccc tcgacctcat ggacaagtac atcaaagact gggagaattg cttttacagg 240
accgaccaga tccagctcac caaggagtac tacaaggtca tagccaagaa agcctgcttc 300
gactggttct ggaccaacga cagggggatg aagtttccca ccagctccat aataagcttc 360
aactccctca agtccagcga caagtccaaa acctccgata atctcgacag gaaaaaaaag 420
attctcgact actggaaggg aaacatattc aagacccaga aagccatcaa ggatgtgctc 480
gacattaccg aggatatcca gaaggccata gaggaaaaga aaagccacag agagatcaat 540
agggtgaacc acaggaaaat ggggatccac cttatccacc tcattaacga cacactcgtt 600
cccctctgca acggatccat cttcttcggt aacatcagca agctcgactt ctgcgagagc 660
gaaaacgaga agctcatcga cttcgccagc accgagaagc aagacgagag aaagtttctc 720
ctcagcaaga tcaacgagat taagcaatac ttcgaagaca atgggggcaa cgttcccttt 780
gccagggcca ccctcaacag acacaccgcc aaccagaaac ccgataggta taatgaggag 840
atcaaaaagc tcgttaacga gctcggggtc aactccttgg ttagaagctt gaagagcaag 900
accatcgaag agattaagac ccacttcgag ttcgagaaca agaacaagat caacgagctc 960
aagaacagct tcgtgttgag cattgtcgag aagatacagc tcttcaagta caaaaccatc 1020
cccgcaagcg tcagattcct cttggcagac tacttcgaag agcagaagct ctccacaaaa 1080
gaggaggctc tcaccatctt cgaagagata ggaaagcccc agaacattgg gttcgattac 1140
atccagctca aggagaaaga caacttcacc ctcaaaaagt acccactcaa gcaggctttc 1200
gactacgcct gggagaacct cgccaggctc gaccaaaacc ctaaggccaa ccagttcagc 1260
gttgacgagt gcaagaggtt ctttaaggag gtgttctcca tggaaatgga taacattaac 1320
ttcaagacct acgccttgct cctcgccctt aaggagaaga ccaccgcctt cgacaagaag 1380
ggcgaaggcg ccgccaagaa caagtccgag atcatcgagc agatcaaagg tgtttttgag 1440
gagctcgacc aacccttcaa gatcattgcc aataccttga gggaggaggt catcaagaag 1500
gaggatgagc tcaatgttct caaaaggcag tacagggaga ccgacaggaa aatcaaaacc 1560
cttcagaacg aaataaagaa gatcaagaac cagatcaaga acctcgagaa cagcaagaag 1620
tacagcttcc cagagatcat caagtggatc gacctcactg aacaagagca gctcttggac 1680
aagaacaagc aagccaagtc taactatcag aaggccaagg gagacctcgg gcttatcagg 1740
gggtctcaga agacctccat aaacgattat ttctacctca ccgacaaagt gtacaggaag 1800
ctcgctcagg atttcggtaa aaagatggcc gatttgaggg agaaactcct cgacaaaaac 1860
gacgttaaca agatcaaata tcttagctac atcgtcaagg acaaccaggg gtaccaatac 1920
actctcctca agcccttgga agacaagaat gccgagatca tcgagctcaa gagcgagccc 1980
aacggtgacc tcaagctttt cgagataaag tccttgacct ccaagaccct caacaagttc 2040
atcaagaaca aaggtgctta caaggagttc cacagcgccg aattcgagca caagaaaatc 2100
aaagaggact ggaagaatta caaatacaac agtgacttca tcgtcaagct caagaagtgc 2160
ttgtcccaca gcgacatggc caatacccag aactggaagg cattcgggtg ggacttggac 2220
aagtgcaaaa gttacgagac catcgagaaa gaaatagacc agaagtccta ccagcttgtg 2280
gagatcaaac tcagtaagac caccattgag aagtgggtta aggagaacaa ttatttgctc 2340
ttgcccatag tcaaccagga catcacagcc gaaaagctca aggtcaatac caaccagttc 2400
accaaggact ggcagcacat cttcgagaaa aaccccaacc atagactcca ccccgagttc 2460
aacatcgcct acagacaacc aaccaaggac tacgccaagg agggggaaaa aagatactca 2520
aggttccaac tcacaggtca attcatgtac gagtatatcc cccaggacgc caactacatc 2580
agccggaagg agcagatcac cctcttcaac gataaggagg aacagaagat ccaggttgag 2640
accttcaata accagattgc caaaatactc aacgccgagg acttttatgt catcgggata 2700
gacaggggga tcacacagct tgctaccctc tgcgttctta acaagaatgg ggtcatccag 2760
ggtgggttcg agatcttcac tagggaattc gactacacca acaaacagtg gaagcacacc 2820
aagctcaagg agaacaggaa tatcttggac atctctaatt tgaaggtcga gacaaccgtg 2880
aacggtgaga aagtcctcgt cgatctcagc gaggtgaaga cctacctcag agacgagaat 2940
ggcgagccca tgaagaatga gaagggggtt atcctcacta aggacaactt gcaaaagatt 3000
aaactcaagc agctcgccta cgacagaaag ctccagtaca aaatgcagca cgaacccgag 3060
ctcgttctct ccttcctcga caggctcgag aacaaggagc agatccccaa cttgctcgct 3120
agcaccaagc tcatcagcgc ctacaaggag ggtaccgcct acgccgatat cgacattgaa 3180
cagttctgga acatcctcca gaccttccaa accatcgtcg acaaattcgg agggatcgag 3240
aatgccaaga agaccatgga gttcaggcag tataccgagc tcgacgcttc tttcgacctc 3300
aaaaacggag tggtggccaa catggtcggc gtggtcaaat tcatcatgga gaagtacaac 3360
tataagacct tcatcgccct cgaagacctc actttcgcct tcggccaatc cattgacgga 3420
atcaacgggg agaggctcag atctactaag gaggacaagg aagttgactt taaagagcag 3480
gagaattcaa cactcgcagg gttggggacc taccactttt tcgagatgca actcctcaaa 3540
aagttgtcca aaacccagat cggcaacgag ataaaacact tcgttcccgc ctttaggtcc 3600
accgagaact acgagaaaat agtcaggaag gacaagaacg tgaaggcaaa gatcgtttcc 3660
tacccctttg gcatcgtcag cttcgtcaac cccaggaaca cctctatcag ctgtcccaac 3720
tgcaaaaatg ccaacaagtc caacaggatc aagaaggaga acgatagaat tctctgtaag 3780
cacaacatag agaagactaa ggggaattgt ggattcgata ccgctaattt cgacgagaac 3840
aagcttaggg ccgagaacaa ggggaagaac ttcaagtata tcagctctgg cgacgccaac 3900
gccgcctaca acatcgccgt taagctcctc gaagacaaga ttttcgagat taacaagaag 3960
<210> 45
<211> 3960
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 45
atgcagcagt atcaggtttc aaagaccgtc aggttcggac tcaccctcaa aaatagcgag 60
aaaaagcacg ccacccactt gctcctcaag gatctcgtga acgtctcaga ggaaaggatc 120
aagaatgaaa taaccaagga cgacaagaac cagtccgagc tcagcttctt taacgaggtg 180
atcgagaccc tcgatctcat ggataagtac atcaaggatt gggagaactg cttttacagg 240
accgaccaga tccagctcac caaggagtac tacaaggtta ttgctaaaaa ggcttgcttc 300
gactggttct ggactaacga cagggggatg aagttcccaa cctctagcat catcagtttc 360
aattccctca aatcctctga caagtccaag acctccgaca acttggacag gaagaagaaa 420
atcctcgact actggaaggg caacatcttc aaaacccaaa aagccatcaa agacgtgctc 480
gacatcaccg aagacatcca gaaagccatc gaagaaaaaa agagccacag ggaaatcaac 540
agagtcaatc acaggaagat ggggatccat cttatccacc tcatcaatga tacccttgtc 600
cccctctgca acggctcaat attcttcggc aacatcagca agcttgactt ctgcgaatcg 660
gagaacgaaa agctcatcga ctttgccagc accgagaaac aagatgagag gaaattcctc 720
ctctccaaaa tcaatgagat caagcagtac ttcgaggata atggcggtaa cgtgcccttc 780
gccagggcca ccctcaacag acacaccgcc aaccagaagc ctgataggta caacgaagaa 840
atcaaaaagc tcgttaacga gctcggcgtt aactccctcg tcaggagcct caaatccaag 900
accatcgagg agatcaagac tcacttcgaa tttgagaata aaaacaaaat caacgaactc 960
aagaacagct tcgttctctc cattgttgag aagatccaat tgtttaagta caaaaccatt 1020
cccgcctctg tcaggttcct cttggccgac tactttgagg agcagaagct ctccaccaag 1080
gaggaggccc ttaccatctt cgaagagatc ggcaagcctc agaacatcgg tttcgactac 1140
atccagctca aggagaagga caatttcacc ctcaagaagt acccccttaa acaggccttt 1200
gattacgcct gggagaatct cgcaaggctc gaccaaaacc ccaaggccaa ccagttctcc 1260
gttgacgagt gcaagaggtt ttttaaagag gtttttagca tggagatgga caacatcaac 1320
tttaagacct acgccctcct cctcgccctt aaggaaaaga ccacagcctt tgacaagaaa 1380
ggcgagggcg ccgccaagaa taaatccgaa atcatcgagc agataaaagg ggtgttcgag 1440
gaactcgatc agccctttaa gatcattgcc aacaccctca gggaggaagt gataaagaag 1500
gaggatgaac tcaacgttct caaaagacag tatagagaga ccgacaggaa gatcaaaacc 1560
ctccagaacg agatcaagaa aatcaagaac cagatcaaga acctcgagaa ctccaagaaa 1620
tactcttttc ccgaaatcat caagtggatc gaccttaccg agcaagaaca attgctcgac 1680
aagaacaaac aggcaaagag taactatcag aaggctaagg gtgacctcgg gcttatcaga 1740
gggtcccaga agacctccat caacgactac ttctacctca ccgacaaggt ctataggaag 1800
ctcgcccagg atttcggcaa gaagatggcc gaccttaggg agaagctttt ggataaaaac 1860
gacgtcaata agatcaagta cctctcttac attgtgaaag acaaccaggg gtaccaatat 1920
actctcctca agcccctcga ggacaaaaac gcagagatca ttgaactcaa gtccgagccc 1980
aacggggacc tcaagctctt cgaaatcaag tccctcacct caaagactct caataagttc 2040
atcaagaaca agggcgccta caaggagttt cattccgctg agtttgagca taagaagatc 2100
aaggaggact ggaagaacta caagtacaac tcagatttca tcgttaagct caagaaatgc 2160
ctctcccata gcgacatggc caatacccaa aactggaaag ccttcggatg ggacctcgac 2220
aagtgcaaaa gctacgagac cattgagaag gagatcgacc aaaagagcta ccagttggtg 2280
gagatcaagc tcagcaagac caccatcgag aaatgggtga aggagaacaa ctacctcctc 2340
ctccccatcg ttaaccaaga tataaccgct gagaagctca aggttaacac caaccagttc 2400
accaaggatt ggcaacatat cttcgagaag aatcccaatc atagactcca tcctgagttc 2460
aatatcgcct acaggcagcc cacaaaagac tacgccaagg agggcgagaa gaggtactcc 2520
agattccagc tcactggcca gttcatgtac gaatacatcc cccaggatgc caactatatc 2580
tccaggaagg aacagatcac tctctttaac gataaagagg agcagaagat ccaggttgaa 2640
acctttaaca accagattgc caagatcctc aatgctgagg acttttacgt catcggcatc 2700
gacaggggaa tcacccagct cgccaccctc tgcgttctca ataagaacgg cgttatccag 2760
ggggggttcg agatctttac cagggagttc gattacacca acaagcagtg gaagcacacc 2820
aagttgaagg agaataggaa tatcctcgac atcagtaact tgaaggtcga aaccaccgtc 2880
aatggcgaga aggtgctcgt tgacctcagc gaagttaaga cttacctcag ggacgagaat 2940
ggagagccca tgaagaatga gaagggggtg atcctcacaa aagacaacct ccagaaaatc 3000
aaactcaagc aactcgccta tgacagaaag cttcagtaca agatgcaaca cgaaccagag 3060
ctcgttctca gctttctcga cagacttgag aataaggaac agatccccaa cctcctcgcc 3120
agcaccaagc tcatcagcgc ctacaaggag ggcaccgcct acgccgacat cgatatagag 3180
cagttttgga acatcctcca gacctttcag acaatcgtcg acaagttcgg cggtatcgag 3240
aacgccaaaa agaccatgga gttcaggcaa tataccgagt tggatgcaag tttcgatctc 3300
aaaaatggcg tcgtcgccaa catggtgggg gtcgtgaagt tcattatgga gaagtacaac 3360
tacaaaacct tcattgcact cgaggacttg accttcgcct tcgggcagag catcgacgga 3420
ataaacggcg agaggcttag gtccaccaag gaggataagg aggttgactt caaagagcag 3480
gaaaactcca ccctcgctgg cctcggcaca taccatttct ttgagatgca gctcctcaag 3540
aaactctcca aaactcagat agggaacgag atcaagcact tcgtgcccgc ctttaggtct 3600
accgagaatt atgagaagat cgtgaggaaa gacaaaaacg tgaaagccaa aattgttagt 3660
tacccctttg gcatcgtttc cttcgtcaac cccaggaaca ccagtatttc ctgcccaaac 3720
tgcaagaacg ccaacaagag caacaggatt aagaaagaga acgacaggat cctctgcaag 3780
cacaatatcg agaagacaaa gggtaactgt ggcttcgaca ccgccaactt tgatgagaac 3840
aagctcaggg ctgagaacaa gggcaaaaat tttaaatata tctccagcgg agacgcaaac 3900
gcagcctaca acatagccgt caaactcctc gaggacaaga tcttcgagat caacaagaag 3960
<210> 46
<211> 3960
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 46
atgcagcagt accaggtgag caagaccgtg aggttcggtc tcaccctcaa gaacagcgag 60
aagaaacacg ccacccacct cctcctcaag gacctcgtca acgtgagcga ggagaggatc 120
aagaacgaga tcaccaaaga cgataagaat cagagcgaac tcagcttctt caacgaagtc 180
attgaaaccc tcgacctcat ggataaatac atcaaggatt gggagaattg tttctatagg 240
accgaccaaa tacaacttac caaggagtac tacaaggtga tcgccaagaa ggcctgcttc 300
gactggtttt ggaccaacga taggggcatg aagttcccca ccagctccat catcagcttc 360
aacagcctca agagtagtga caagagcaag accagcgata acttggacag aaagaagaag 420
atcctcgatt actggaaagg taatatcttc aagactcaga aagccatcaa ggacgttctc 480
gacatcaccg aggacatcca gaaggcaatt gaagagaaga agtcacacag ggagatcaac 540
agggtcaacc ataggaaaat gggcatccac ctcattcacc ttatcaacga caccctcgtg 600
cccctctgca acgggagcat ctttttcggc aacataagca agttggattt ctgcgagagc 660
gaaaatgaga agctcatcga ctttgcaagc accgagaagc aagacgagag aaagttcctc 720
ctctccaaga tcaacgagat caaacaatat ttcgaagata acgggggcaa cgttcccttc 780
gcaagagcaa ccctcaacag gcacaccgct aaccaaaagc ccgacaggta caacgaagaa 840
atcaaaaagc tcgtgaacga gctcggggtt aatagcctcg ttagaagcct caagtccaaa 900
acaatcgaag aaatcaagac tcactttgaa ttcgagaata aaaacaaaat taacgagttg 960
aagaacagct tcgttttgtc catcgtggag aagatccagc tcttcaagta caaaaccatc 1020
cccgcttccg tcagattcct ccttgctgac tatttcgaag aacagaaatt gtccactaag 1080
gaggaggccc tcaccatctt cgaggagatc ggcaaacccc agaacatcgg ttttgactat 1140
atccaattga aggaaaagga caatttcacc cttaagaagt accccctcaa gcaggcattc 1200
gattacgcct gggaaaatct cgccagactc gatcagaacc ccaaggccaa ccagttctcc 1260
gtcgacgagt gcaagagatt ctttaaggag gtctttagca tggagatgga taatataaac 1320
ttcaaaacct acgccctctt gctcgccctc aaagagaaaa caaccgcctt tgacaagaaa 1380
ggcgagggtg ccgccaagaa caagtctgag attatcgagc agatcaaggg ggtcttcgaa 1440
gagctcgacc aacccttcaa gattatcgcc aataccctta gagaagaggt tatcaagaaa 1500
gaagacgagc tcaacgtgct caaaagacag tatagggaga cagacaggaa aatcaagacc 1560
ctccagaacg agatcaagaa gatcaagaac cagataaaga acctcgaaaa ttcaaagaaa 1620
tattcctttc ccgagattat aaagtggatc gacctcacag agcaggagca gctcctcgac 1680
aagaataagc aggccaaatc taactaccag aaggccaagg gggacttggg tcttatcagg 1740
gggagtcaaa agaccagcat caacgactac ttctacctca ccgacaaggt gtacagaaaa 1800
ctcgcccagg actttggtaa gaaaatggcc gacctcaggg aaaagctttt ggacaaaaac 1860
gatgtcaaca agattaagta cctctcttac atcgtgaagg acaaccaggg gtatcagtat 1920
accctcctta aaccactcga ggacaagaac gccgagataa tcgagctcaa gagcgagccc 1980
aatggcgacc tcaaactctt cgaaattaag tccctcacct ccaagacctt gaacaaattc 2040
atcaagaaca agggggccta caaggagttt cattccgccg agtttgagca caagaagatc 2100
aaagaggatt ggaagaacta caagtacaat agcgatttca tcgtcaaact caagaagtgc 2160
ctcagccatt cagacatggc caatacacaa aactggaagg cattcggctg ggacctcgat 2220
aagtgcaagt cctatgagac catcgagaaa gaaatcgatc aaaagtcata ccagctcgtc 2280
gagatcaagc tctccaaaac aacaatagag aaatgggtca aggaaaacaa ttacctcttg 2340
ctccccatcg tgaaccaaga catcaccgct gagaagctca aggtgaatac caaccaattt 2400
accaaagact ggcagcacat cttcgagaag aaccccaacc acaggctcca ccccgagttc 2460
aacatcgcct acagacagcc caccaaggac tacgcaaagg aaggggaaaa aagatacagt 2520
agattccagc tcaccggcca gtttatgtat gagtacatcc cccaagacgc caactacatc 2580
agcagaaagg agcagattac cctctttaac gacaaggagg agcaaaagat ccaggtcgaa 2640
accttcaaca accaaatagc taagatcctc aatgccgagg acttctatgt cataggaatc 2700
gataggggta tcacccaact cgcaaccctc tgtgttctca acaagaacgg ggtgatccaa 2760
gggggattcg agatattcac cagggagttc gattacacta acaaacagtg gaagcacaca 2820
aaactcaaag agaacaggaa tatcctcgac atctccaatc ttaaggttga aaccaccgtg 2880
aacggggaga aggttctcgt tgacctcagc gaggtcaaga cctatttgag agacgagaac 2940
ggggagccta tgaagaatga aaagggggtg atccttacca aggacaacct tcagaagatc 3000
aagctcaaac agctcgccta cgacaggaag ctccagtata agatgcaaca cgagccagag 3060
ttggtcctct cattcctcga cagattggag aacaaggaac agatccccaa ccttctcgcc 3120
agcaccaagt tgatctccgc ttataaggag gggaccgcct acgccgacat tgatatcgaa 3180
caattttgga atatactcca aacctttcag actatagttg acaaattcgg tggcatcgag 3240
aatgccaaga agacaatgga atttaggcag tacactgagc tcgatgcaag ctttgacttg 3300
aagaacggtg ttgtcgctaa catggtcggg gtcgtcaaat tcatcatgga gaagtataac 3360
tataagacct ttatcgcatt ggaggatctc acctttgcct tcggccagag tatcgacggg 3420
atcaatggcg agaggctcag aagtacaaag gaagataagg aggttgactt caaagagcag 3480
gagaattcta ccctcgccgg ccttggtacc taccacttct tcgagatgca gctcctcaag 3540
aagctcagca agacccaaat tggcaatgag atcaaacact tcgttcccgc cttcaggagc 3600
accgagaact atgagaaaat tgtgaggaag gacaagaatg tcaaggccaa gatcgttagt 3660
tatcccttcg ggatcgtgtc cttcgtcaac ccaaggaaca ccagcatctc atgcccaaat 3720
tgcaagaacg ccaacaaatc aaataggatc aagaaggaga acgacaggat cctctgcaag 3780
cacaacattg aaaaaaccaa ggggaactgc gggttcgaca ccgctaactt cgacgagaat 3840
aaactcaggg cagaaaataa gggcaagaat ttcaagtaca tcagctctgg agatgccaac 3900
gccgcctaca acattgccgt taagctcctt gaggacaaga tcttcgagat caacaaaaaa 3960
<210> 47
<211> 3960
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 47
atgcagcagt atcaggtgtc aaaaactgtg aggttcgggc tcaccttgaa gaacagcgag 60
aagaagcacg ccacccacct tctcctcaag gacctcgtga acgtgagcga ggaaagaatc 120
aagaatgaga taaccaagga cgacaagaac caatctgaac tctcattctt caacgaggtc 180
atcgagaccc tcgacctcat ggacaagtac attaaggact gggagaactg cttctatagg 240
accgatcaga tccagctcac caaggagtat tataaggtga tcgccaagaa ggcttgcttc 300
gactggttct ggaccaacga taggggcatg aagttcccca ccagctctat tattagtttc 360
aatagtttga aaagcagcga caagagcaag acctccgaca accttgatag aaagaagaag 420
atcctcgact actggaaggg gaacatattc aagactcaaa aggctatcaa agacgtcctc 480
gatatcaccg aggacatcca gaaggccatc gaggagaaga agtcccatag ggaaatcaac 540
agggttaacc acaggaagat ggggatccat ctcatccacc tcataaatga taccctcgtg 600
ccactctgta acggctccat ctttttcgga aacatcagca agttggattt ctgcgagtcc 660
gagaacgaga agcttatcga cttcgcctct acagaaaagc aagacgaaag gaagttcctc 720
ttgagcaaaa tcaatgagat aaaacagtac ttcgaggaca acggcgggaa tgtcccattc 780
gccagggcca ccctcaacag gcacaccgcc aatcagaagc ccgacaggta caacgaagag 840
atcaagaagc tcgttaacga gttgggcgtc aacagccttg tgagatccct caagtccaaa 900
accatcgagg aaatcaagac ccacttcgag ttcgagaata aaaataaaat caacgaactc 960
aagaacagct ttgtcctctc catcgtcgag aagatacagc tctttaagta taagaccatc 1020
cccgcctcag tgaggttcct cctcgccgac tactttgagg agcagaagct cagtaccaag 1080
gaagaagccc tcaccatctt tgaggaaatc ggcaagcccc agaacattgg ctttgactac 1140
atccagctca aggagaaaga caatttcacc ctcaagaagt accccctcaa gcaggccttc 1200
gactacgcct gggagaacct cgctagattg gaccaaaacc ccaaggccaa ccaattctcc 1260
gtggacgagt gcaagaggtt cttcaaggag gttttctcaa tggagatgga caatatcaac 1320
tttaagacct acgccctcct cctcgccttg aaagagaaga ccaccgcctt cgataagaag 1380
ggggagggcg ccgccaaaaa caagtccgag atcatcgagc aaatcaaggg cgtcttcgaa 1440
gagctcgacc agcctttcaa aatcattgcc aacaccctca gggaggaagt gatcaaaaaa 1500
gaggacgagc tcaacgttct caagaggcag tacagggaga ccgacaggaa gatcaaaacc 1560
ctccagaatg agatcaaaaa gatcaagaac cagataaaaa acctcgagaa ctccaagaaa 1620
tattcctttc ccgagataat taagtggatt gaccttaccg agcaggagca gctcctcgat 1680
aagaacaaac aggccaaatc caactaccag aaggccaaag gggatctcgg gcttatcagg 1740
ggcagccaga agaccagcat caacgactat ttttacctca ccgacaaggt ctacaggaag 1800
ctcgcccagg actttggcaa gaagatggcc gatctcaggg agaagctcct cgacaaaaac 1860
gacgttaaca agatcaagta cctctcctac atcgtgaaag acaaccaagg ctaccaatat 1920
accctcctca agcctctcga agacaagaac gcagagatca tcgagttgaa atcagagccc 1980
aatggagact tgaagctctt cgagatcaag tctctcacct ctaagaccct caacaagttc 2040
ataaagaata agggggccta caaggaattc cactccgctg agttcgagca taagaagatc 2100
aaagaggatt ggaagaacta caagtacaac agcgacttca tcgtcaaact caagaaatgc 2160
ctcagtcatt ctgatatggc caacactcaa aactggaagg cattcggctg ggacctcgac 2220
aagtgcaaga gctacgagac tatagaaaaa gagatcgacc agaaatctta tcagctcgtt 2280
gagatcaagc tctccaagac caccatcgag aaatgggtca aggaaaacaa ctacctcctc 2340
ctccccatcg tcaatcagga catcaccgcc gagaagctta aagtcaacac caaccaattt 2400
accaaagact ggcagcacat cttcgagaag aatcccaacc acaggctcca ccctgagttc 2460
aacatcgcat ataggcaacc cacaaaggac tatgctaagg agggagaaaa aaggtactcc 2520
aggtttcagc tcacaggcca gttcatgtac gagtatattc cccaagacgc taactacatc 2580
tccaggaagg aacagatcac cctctttaac gacaaggagg agcagaagat ccaagttgag 2640
accttcaata accagatcgc caagatcttg aacgccgagg acttctacgt gattgggatt 2700
gacaggggga tcacacagct cgccacactc tgcgtcctca acaaaaatgg tgtcattcaa 2760
ggggggttcg agatcttcac cagagagttc gattacacca ataagcaatg gaagcacaca 2820
aagcttaagg agaacaggaa catccttgac attagcaacc tcaaggtgga gaccacagtg 2880
aacggtgaga aggtcctcgt cgatctttcc gaagtcaaga cctatcttag agacgagaat 2940
ggggaaccta tgaagaacga aaaaggggtg atactcacta aggacaacct ccagaagata 3000
aagctcaagc aacttgccta tgacaggaaa ctccagtaca agatgcaaca tgagcccgag 3060
ctcgtcctca gtttcctcga caggctcgag aacaaggaac agatccccaa cctcctcgct 3120
agcaccaagc tcataagcgc ctacaaggag gggaccgctt acgccgacat cgacatcgag 3180
caattctgga acatcctcca gaccttccag accatcgtgg acaagttcgg gggcatcgag 3240
aatgccaaga aaaccatgga attcagacag tacaccgaac ttgacgcttc cttcgacctc 3300
aaaaatggcg ttgtggccaa catggtgggg gtcgtgaagt tcatcatgga gaaatacaac 3360
tacaagacct tcatcgccct cgaggacctc acctttgcct tcggtcagtc cattgatggg 3420
ataaacgggg agaggttgag aagcaccaag gaagacaagg aggtcgactt taaggagcag 3480
gagaattcta cattggccgg ccttggcacc tatcattttt ttgagatgca actcctcaag 3540
aaactctcca aaacccagat cggaaatgag atcaagcact tcgtgcccgc ctttagaagc 3600
accgagaatt acgagaagat cgtcagaaag gataagaacg tcaaggcaaa aatcgtctcc 3660
taccccttcg gaatcgtttc cttcgtcaac cccaggaaca ccagcatttc ctgtcccaac 3720
tgcaagaacg ccaataagtc aaacaggatt aagaaggaaa acgataggat tctctgcaag 3780
cacaacatcg agaagactaa gggaaattgc gggtttgaca ccgccaactt tgatgagaac 3840
aaactcagag ccgagaataa aggcaagaac ttcaagtaca tctccagcgg cgatgccaac 3900
gccgcctaca acattgccgt taaactcctc gaggacaaaa tctttgagat aaacaaaaag 3960
<210> 48
<211> 3960
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 48
atgcagcaat accaggtttc caagaccgtg agatttggcc tcacactcaa aaattctgag 60
aagaagcacg ctacccacct tcttctcaag gatctcgtga acgtttccga ggagaggatc 120
aaaaacgaga tcaccaaaga cgacaagaac caatccgagc tctccttctt caacgaggtt 180
atcgagacac tcgatctcat ggacaagtat atcaaggact gggaaaattg cttctacagg 240
accgaccaga tccagctcac taaggaatac tataaggtga tagccaagaa ggcctgcttc 300
gattggtttt ggaccaatga taggggtatg aaattcccca cctccagcat catcagcttc 360
aatagcctca agagcagcga caagagcaag accagcgaca acctcgatag gaaaaagaag 420
atactcgact actggaaggg caacattttc aagactcaaa aggccattaa ggatgtgctt 480
gatatcacag aagacataca aaaagctatc gaggagaaaa aatcccacag ggagatcaac 540
agagtgaacc acaggaagat gggaatccac ctcatccacc tcatcaatga cacacttgtg 600
cccctctgca acgggtctat tttcttcggc aacatatcca agctcgactt ttgcgaaagc 660
gaaaacgaga agctcatcga ctttgccagc accgaaaagc aggacgagag gaagttcctc 720
ctttccaaaa tcaacgagat aaagcaatac ttcgaggaca acgggggcaa cgtgcccttt 780
gccagggcca ccctcaatag gcacactgcc aaccaaaagc ccgacaggta caacgaggag 840
atcaagaagc tcgttaacga gctcggagtc aactccctcg tgaggtcact caaaagcaag 900
acaatcgaag agatcaagac ccactttgaa tttgagaaca agaataagat caacgaactc 960
aaaaacagtt tcgttctctc catagtcgaa aagatccagc tcttcaagta caagaccatc 1020
cctgccagtg tcaggttcct tctcgccgac tacttcgagg agcagaaact cagcactaag 1080
gaagaggccc tcaccatctt cgaagaaatc ggtaagcccc agaacatagg ctttgactac 1140
atccaactca aggagaaaga caacttcacc ctcaaaaagt atcccctcaa gcaggccttc 1200
gactatgcct gggagaacct cgccaggctc gatcaaaacc ccaaggccaa ccagttcagc 1260
gttgacgagt gcaaaaggtt cttcaaagag gtttttagca tggaaatgga caatataaac 1320
ttcaaaacct acgccctcct cctcgccctc aaggaaaaga ccaccgcatt cgacaagaaa 1380
ggagagggcg ccgccaagaa taagagtgaa attattgaac agatcaaggg agttttcgag 1440
gaactcgacc aacctttcaa gattatcgcc aataccctca gggaggaagt gatcaagaag 1500
gaggacgagc tcaacgtcct caaaaggcag tacagggaga cagacaggaa gatcaagacc 1560
ctccaaaacg agatcaagaa gatcaagaac cagattaaga acctcgagaa ctcaaagaag 1620
tactccttcc ccgagatcat aaaatggatc gacctcaccg agcaggagca gcttctcgac 1680
aagaacaagc aggccaagag caattatcag aaggccaaag gcgacctcgg gctcatcagg 1740
ggaagtcaga agacctccat caacgactac ttctacctca ccgacaaggt gtacaggaaa 1800
ctcgcccagg acttcgggaa gaaaatggcc gacttgagag agaagctcct cgacaagaat 1860
gacgtgaata agattaaata tttgagctac atcgtgaaag acaaccaggg atatcagtat 1920
accctcttga agcccctcga ggacaagaac gctgagatca tcgagctcaa gtccgagcca 1980
aatggtgacc ttaagctttt cgagatcaag tcacttacaa gtaaaaccct taacaaattt 2040
atcaagaata agggcgccta caaggagttc cacagcgccg agtttgagca caagaagatc 2100
aaagaggact ggaagaacta caaatacaac tccgatttca tcgtgaagct caaaaaatgc 2160
ctcagccaca gcgacatggc taacacccag aactggaaag ctttcgggtg ggacttggat 2220
aagtgcaaat cctacgagac catcgaaaag gagatcgacc agaaaagcta ccagctagtg 2280
gaaatcaagc ttagtaagac caccatagaa aagtgggtca aggagaataa ctatctcctc 2340
ctccccatag ttaaccaaga catcaccgcc gagaagctca aggtgaacac caatcaattc 2400
accaaggatt ggcagcacat tttcgagaag aaccccaacc acaggctcca tcccgagttc 2460
aacatcgcct acagacaacc cactaaagac tacgccaagg agggtgagaa gagatacagc 2520
aggtttcagc ttactggtca gtttatgtat gagtacatcc cccaggacgc caactacatc 2580
tcaaggaaag agcagattac cctctttaac gacaaggagg agcaaaagat ccaggtggag 2640
accttcaaca accagatcgc taaaatcctc aacgccgagg acttttatgt tatcggcatc 2700
gataggggta ttacccaact cgctaccctc tgcgttctca acaagaacgg tgtcatccag 2760
ggcgggtttg agatcttcac tagggagttc gactatacca acaaacagtg gaaacacacc 2820
aagctcaagg aaaatagaaa catcctcgac atcagcaacc tcaaggtgga gacaaccgtg 2880
aacggcgaga aggtgctcgt cgacctcagt gaagtgaaga cctacttgag ggacgaaaac 2940
ggcgagccca tgaagaacga gaaaggcgtg atcctcacaa aggacaacct ccagaagatc 3000
aagctcaagc agctcgccta cgacaggaag cttcagtaca agatgcagca cgagccagaa 3060
ctcgtgctct cctttctcga taggctcgag aacaaagagc agatccccaa tctcctcgcc 3120
tccaccaaac tcatatccgc ctacaaggag gggacagctt acgctgacat cgatatagaa 3180
caattctgga acatcctcca gaccttccag accatcgtgg acaagttcgg ggggatcgag 3240
aatgccaaga aaaccatgga gtttaggcag tataccgaac ttgacgccag cttcgacttg 3300
aagaatgggg tcgtggccaa tatggttggg gtcgtcaagt tcatcatgga aaaatataac 3360
tacaaaacct tcatcgccct cgaagacctt accttcgcct tcggccagtc catcgatggc 3420
atcaacggcg agagactccg atccacaaaa gaagacaagg aggtggactt caaggagcaa 3480
gagaattcca ctcttgccgg cctcgggacc taccactttt tcgagatgca gctcctcaag 3540
aagctctcca agacccaaat cgggaacgag ataaagcact ttgtccctgc cttcaggtca 3600
accgagaact acgagaaaat cgtcaggaaa gacaaaaacg tcaaggccaa gatcgtttcc 3660
taccccttcg gcatcgttag tttcgttaac cccaggaaca ccagcatctc ctgccccaac 3720
tgcaagaacg ccaataagag caacaggatc aaaaaagaaa acgacaggat cctttgtaaa 3780
cataatatcg agaagaccaa ggggaactgc gggtttgaca ccgccaactt cgatgagaac 3840
aagctcaggg cagagaataa agggaagaat ttcaagtaca tctccagcgg ggacgccaac 3900
gccgcctaca atattgccgt gaagctcttg gaggacaaga tcttcgaaat caataagaag 3960
<210> 49
<211> 1307
<212> PRT
<213> Unknown
<220>
<223> Acidaminococcus sp.
<400> 49
Met Thr Gln Phe Glu Gly Phe Thr Asn Leu Tyr Gln Val Ser Lys Thr
1 5 10 15
Leu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Lys His Ile Gln
20 25 30
Glu Gln Gly Phe Ile Glu Glu Asp Lys Ala Arg Asn Asp His Tyr Lys
35 40 45
Glu Leu Lys Pro Ile Ile Asp Arg Ile Tyr Lys Thr Tyr Ala Asp Gln
50 55 60
Cys Leu Gln Leu Val Gln Leu Asp Trp Glu Asn Leu Ser Ala Ala Ile
65 70 75 80
Asp Ser Tyr Arg Lys Glu Lys Thr Glu Glu Thr Arg Asn Ala Leu Ile
85 90 95
Glu Glu Gln Ala Thr Tyr Arg Asn Ala Ile His Asp Tyr Phe Ile Gly
100 105 110
Arg Thr Asp Asn Leu Thr Asp Ala Ile Asn Lys Arg His Ala Glu Ile
115 120 125
Tyr Lys Gly Leu Phe Lys Ala Glu Leu Phe Asn Gly Lys Val Leu Lys
130 135 140
Gln Leu Gly Thr Val Thr Thr Thr Glu His Glu Asn Ala Leu Leu Arg
145 150 155 160
Ser Phe Asp Lys Phe Thr Thr Tyr Phe Ser Gly Phe Tyr Glu Asn Arg
165 170 175
Lys Asn Val Phe Ser Ala Glu Asp Ile Ser Thr Ala Ile Pro His Arg
180 185 190
Ile Val Gln Asp Asn Phe Pro Lys Phe Lys Glu Asn Cys His Ile Phe
195 200 205
Thr Arg Leu Ile Thr Ala Val Pro Ser Leu Arg Glu His Phe Glu Asn
210 215 220
Val Lys Lys Ala Ile Gly Ile Phe Val Ser Thr Ser Ile Glu Glu Val
225 230 235 240
Phe Ser Phe Pro Phe Tyr Asn Gln Leu Leu Thr Gln Thr Gln Ile Asp
245 250 255
Leu Tyr Asn Gln Leu Leu Gly Gly Ile Ser Arg Glu Ala Gly Thr Glu
260 265 270
Lys Ile Lys Gly Leu Asn Glu Val Leu Asn Leu Ala Ile Gln Lys Asn
275 280 285
Asp Glu Thr Ala His Ile Ile Ala Ser Leu Pro His Arg Phe Ile Pro
290 295 300
Leu Phe Lys Gln Ile Leu Ser Asp Arg Asn Thr Leu Ser Phe Ile Leu
305 310 315 320
Glu Glu Phe Lys Ser Asp Glu Glu Val Ile Gln Ser Phe Cys Lys Tyr
325 330 335
Lys Thr Leu Leu Arg Asn Glu Asn Val Leu Glu Thr Ala Glu Ala Leu
340 345 350
Phe Asn Glu Leu Asn Ser Ile Asp Leu Thr His Ile Phe Ile Ser His
355 360 365
Lys Lys Leu Glu Thr Ile Ser Ser Ala Leu Cys Asp His Trp Asp Thr
370 375 380
Leu Arg Asn Ala Leu Tyr Glu Arg Arg Ile Ser Glu Leu Thr Gly Lys
385 390 395 400
Ile Thr Lys Ser Ala Lys Glu Lys Val Gln Arg Ser Leu Lys His Glu
405 410 415
Asp Ile Asn Leu Gln Glu Ile Ile Ser Ala Ala Gly Lys Glu Leu Ser
420 425 430
Glu Ala Phe Lys Gln Lys Thr Ser Glu Ile Leu Ser His Ala His Ala
435 440 445
Ala Leu Asp Gln Pro Leu Pro Thr Thr Leu Lys Lys Gln Glu Glu Lys
450 455 460
Glu Ile Leu Lys Ser Gln Leu Asp Ser Leu Leu Gly Leu Tyr His Leu
465 470 475 480
Leu Asp Trp Phe Ala Val Asp Glu Ser Asn Glu Val Asp Pro Glu Phe
485 490 495
Ser Ala Arg Leu Thr Gly Ile Lys Leu Glu Met Glu Pro Ser Leu Ser
500 505 510
Phe Tyr Asn Lys Ala Arg Asn Tyr Ala Thr Lys Lys Pro Tyr Ser Val
515 520 525
Glu Lys Phe Lys Leu Asn Phe Gln Met Pro Thr Leu Ala Ser Gly Trp
530 535 540
Asp Val Asn Lys Glu Lys Asn Asn Gly Ala Ile Leu Phe Val Lys Asn
545 550 555 560
Gly Leu Tyr Tyr Leu Gly Ile Met Pro Lys Gln Lys Gly Arg Tyr Lys
565 570 575
Ala Leu Ser Phe Glu Pro Thr Glu Lys Thr Ser Glu Gly Phe Asp Lys
580 585 590
Met Tyr Tyr Asp Tyr Phe Pro Asp Ala Ala Lys Met Ile Pro Lys Cys
595 600 605
Ser Thr Gln Leu Lys Ala Val Thr Ala His Phe Gln Thr His Thr Thr
610 615 620
Pro Ile Leu Leu Ser Asn Asn Phe Ile Glu Pro Leu Glu Ile Thr Lys
625 630 635 640
Glu Ile Tyr Asp Leu Asn Asn Pro Glu Lys Glu Pro Lys Lys Phe Gln
645 650 655
Thr Ala Tyr Ala Lys Lys Thr Gly Asp Gln Lys Gly Tyr Arg Glu Ala
660 665 670
Leu Cys Lys Trp Ile Asp Phe Thr Arg Asp Phe Leu Ser Lys Tyr Thr
675 680 685
Lys Thr Thr Ser Ile Asp Leu Ser Ser Leu Arg Pro Ser Ser Gln Tyr
690 695 700
Lys Asp Leu Gly Glu Tyr Tyr Ala Glu Leu Asn Pro Leu Leu Tyr His
705 710 715 720
Ile Ser Phe Gln Arg Ile Ala Glu Lys Glu Ile Met Asp Ala Val Glu
725 730 735
Thr Gly Lys Leu Tyr Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ala Lys
740 745 750
Gly His His Gly Lys Pro Asn Leu His Thr Leu Tyr Trp Thr Gly Leu
755 760 765
Phe Ser Pro Glu Asn Leu Ala Lys Thr Ser Ile Lys Leu Asn Gly Gln
770 775 780
Ala Glu Leu Phe Tyr Arg Pro Lys Ser Arg Met Lys Arg Met Ala His
785 790 795 800
Arg Leu Gly Glu Lys Met Leu Asn Lys Lys Leu Lys Asp Gln Lys Thr
805 810 815
Pro Ile Pro Asp Thr Leu Tyr Gln Glu Leu Tyr Asp Tyr Val Asn His
820 825 830
Arg Leu Ser His Asp Leu Ser Asp Glu Ala Arg Ala Leu Leu Pro Asn
835 840 845
Val Ile Thr Lys Glu Val Ser His Glu Ile Ile Lys Asp Arg Arg Phe
850 855 860
Thr Ser Asp Lys Phe Phe Phe His Val Pro Ile Thr Leu Asn Tyr Gln
865 870 875 880
Ala Ala Asn Ser Pro Ser Lys Phe Asn Gln Arg Val Asn Ala Tyr Leu
885 890 895
Lys Glu His Pro Glu Thr Pro Ile Ile Gly Ile Asp Arg Gly Glu Arg
900 905 910
Asn Leu Ile Tyr Ile Thr Val Ile Asp Ser Thr Gly Lys Ile Leu Glu
915 920 925
Gln Arg Ser Leu Asn Thr Ile Gln Gln Phe Asp Tyr Gln Lys Lys Leu
930 935 940
Asp Asn Arg Glu Lys Glu Arg Val Ala Ala Arg Gln Ala Trp Ser Val
945 950 955 960
Val Gly Thr Ile Lys Asp Leu Lys Gln Gly Tyr Leu Ser Gln Val Ile
965 970 975
His Glu Ile Val Asp Leu Met Ile His Tyr Gln Ala Val Val Val Leu
980 985 990
Glu Asn Leu Asn Phe Gly Phe Lys Ser Lys Arg Thr Gly Ile Ala Glu
995 1000 1005
Lys Ala Val Tyr Gln Gln Phe Glu Lys Met Leu Ile Asp Lys Leu
1010 1015 1020
Asn Cys Leu Val Leu Lys Asp Tyr Pro Ala Glu Lys Val Gly Gly
1025 1030 1035
Val Leu Asn Pro Tyr Gln Leu Thr Asp Gln Phe Thr Ser Phe Ala
1040 1045 1050
Lys Met Gly Thr Gln Ser Gly Phe Leu Phe Tyr Val Pro Ala Pro
1055 1060 1065
Tyr Thr Ser Lys Ile Asp Pro Leu Thr Gly Phe Val Asp Pro Phe
1070 1075 1080
Val Trp Lys Thr Ile Lys Asn His Glu Ser Arg Lys His Phe Leu
1085 1090 1095
Glu Gly Phe Asp Phe Leu His Tyr Asp Val Lys Thr Gly Asp Phe
1100 1105 1110
Ile Leu His Phe Lys Met Asn Arg Asn Leu Ser Phe Gln Arg Gly
1115 1120 1125
Leu Pro Gly Phe Met Pro Ala Trp Asp Ile Val Phe Glu Lys Asn
1130 1135 1140
Glu Thr Gln Phe Asp Ala Lys Gly Thr Pro Phe Ile Ala Gly Lys
1145 1150 1155
Arg Ile Val Pro Val Ile Glu Asn His Arg Phe Thr Gly Arg Tyr
1160 1165 1170
Arg Asp Leu Tyr Pro Ala Asn Glu Leu Ile Ala Leu Leu Glu Glu
1175 1180 1185
Lys Gly Ile Val Phe Arg Asp Gly Ser Asn Ile Leu Pro Lys Leu
1190 1195 1200
Leu Glu Asn Asp Asp Ser His Ala Ile Asp Thr Met Val Ala Leu
1205 1210 1215
Ile Arg Ser Val Leu Gln Met Arg Asn Ser Asn Ala Ala Thr Gly
1220 1225 1230
Glu Asp Tyr Ile Asn Ser Pro Val Arg Asp Leu Asn Gly Val Cys
1235 1240 1245
Phe Asp Ser Arg Phe Gln Asn Pro Glu Trp Pro Met Asp Ala Asp
1250 1255 1260
Ala Asn Gly Ala Tyr His Ile Ala Leu Lys Gly Gln Leu Leu Leu
1265 1270 1275
Asn His Leu Lys Glu Ser Lys Asp Leu Lys Leu Gln Asn Gly Ile
1280 1285 1290
Ser Asn Gln Asp Trp Leu Ala Tyr Ile Gln Glu Leu Arg Asn
1295 1300 1305
<210> 50
<211> 3921
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 50
atgacacagt ttgaaggatt tacaaatctt taccaggtgt caaaaacttt aaggtttgaa 60
ttaattcccc agggaaaaac tctgaaacac atacaagagc agggttttat tgaagaggac 120
aaggcacgaa acgatcacta caaagaatta aagcccatta ttgatcgtat ctataaaacc 180
tatgctgacc agtgcctgca attggttcaa ctagattggg aaaacctgag tgccgctatt 240
gattcttaca ggaaagagaa aacagaagaa accagaaatg ctcttataga agagcaagct 300
acataccgga acgcgattca tgactatttc atagggagga ccgacaatct tacggatgca 360
attaacaagc gtcacgctga aatatacaaa ggtctcttca aggctgagct cttcaacggt 420
aaggtcttga aacagttagg aaccgtgaca actacggaac acgaaaatgc attgttgaga 480
tcctttgata aatttactac ttacttcagt gggttttatg aaaacaggaa aaatgttttt 540
tccgccgagg acatttctac ggcgattcct catagaatcg tacaggacaa cttccccaag 600
tttaaagaga actgtcacat ctttactagg cttattaccg ctgttccatc attaagggag 660
cattttgaaa acgtgaagaa agcaatcgga atttttgttt ccacatccat tgaagaggtt 720
ttttcttttc cattttataa ccagcttctt acccagacac agattgatct ttacaatcaa 780
cttcttggtg gaataagcag ggaggctgga actgagaaga ttaaaggtct caacgaagtt 840
ttgaacctcg caattcaaaa aaacgacgaa actgctcata ttatagcgtc cctcccgcat 900
cgattcattc ctttgtttaa acaaattctt tctgatcgta acactctttc ttttattcta 960
gaggaattta agtcagacga ggaggtaata caaagcttct gtaagtataa gactttgcta 1020
agaaatgaaa acgttttgga gactgcagaa gctcttttta atgaattaaa ttccatagac 1080
ttgactcata ttttcatctc tcataagaaa ttagaaacca ttagtagtgc cctctgtgat 1140
cactgggata ccttgcgtaa tgctctttat gagagaagaa tttcggaact gacaggtaag 1200
ataactaagt cagccaagga aaaggttcaa aggtccttga agcacgagga tattaacctg 1260
caggagatca tttctgctgc tgggaaggaa cttagcgagg cttttaagca gaagacatcc 1320
gagatcctct cccatgccca cgcagcactt gatcaacctt tgcctactac tcttaaaaaa 1380
caggaggaaa aagaaatctt gaaatctcaa cttgatagtt tacttgggct ttatcacttg 1440
ctcgattggt tcgctgtgga tgagtccaat gaggtggacc ctgaattctc ggccaggctg 1500
acaggcatta agcttgagat ggagccatct ttgtcatttt acaataaggc ccggaattac 1560
gccactaaga aaccttattc cgtggagaaa ttcaaactta acttccagat gcctacattg 1620
gctagcggat gggacgttaa taaagaaaaa aataacggag caattctatt cgtaaagaac 1680
gggctctact atctcggaat tatgccaaaa caaaaggggc gctataaggc tcttagtttc 1740
gagccaaccg aaaagacatc cgagggcttt gataaaatgt actacgatta tttcccagat 1800
gcagctaaaa tgattccaaa atgctctact caattaaagg ccgtcacagc tcactttcag 1860
actcatacaa cgcctattct tctctctaat aacttcatcg aaccactgga gattaccaag 1920
gaaatctatg atttaaacaa ccccgaaaaa gaacccaaaa agtttcagac tgcgtatgct 1980
aaaaaaactg gagatcagaa gggttaccgc gaagctctct gtaaatggat agattttact 2040
cgtgatttcc ttagcaaata tactaaaacc acctccatag atttgtcctc cctaagacct 2100
agcagtcaat acaaagatct tggggagtac tatgcagagc taaatccact gttgtaccat 2160
atctcttttc aaagaatcgc agagaaagaa attatggatg ccgttgaaac tggaaagcta 2220
tatttattcc agatatacaa taaggacttt gcgaagggtc atcatgggaa accaaacctt 2280
catactctat attggacagg tctcttttcc cccgaaaatt tagctaaaac atcgattaag 2340
ctcaatggcc aggccgaact gttctatagg ccgaaaagta ggatgaaaag aatggctcat 2400
aggctcggag aaaagatgct taacaagaag cttaaagacc agaaaactcc cattccagac 2460
acactctacc aggagttgta tgattacgtt aatcacaggt taagccacga cctttctgat 2520
gaggcaagag cccttctccc aaacgtgatc accaaggaag tctcccacga gattattaag 2580
gatagacgat ttacctccga caaattcttt ttccacgtgc ctattaccct gaactatcag 2640
gctgcaaatt cacctagtaa attcaatcaa cgagtaaacg cctacttgaa ggagcatccc 2700
gaaactccta taattggtat tgatcgtgga gaacgaaatc ttatatatat tactgttatt 2760
gattcaactg gtaaaatact tgagcagagg tcattaaata ccatacaaca gtttgattac 2820
cagaagaaat tagataacag agaaaaagag agggtcgctg caagacaagc atggtccgtg 2880
gttgggacaa ttaaagacct taaacaaggg tacctatcac aggtgatcca cgaaattgta 2940
gatctgatga ttcactacca ggccgtggtt gttttagaaa acttaaattt cggctttaag 3000
tccaaaagaa cagggatcgc cgaaaaggca gtttatcaac aatttgaaaa aatgctcatt 3060
gataagttaa attgcttggt gctcaaggat tatccggctg aaaaggttgg aggcgtactt 3120
aatccatacc aactgacgga ccagtttacc agcttcgcta agatgggtac tcagagtgga 3180
tttttgttct atgttcccgc accgtacact tcaaagatag atcccctaac gggtttcgtt 3240
gatccattcg tgtggaagac gataaagaac catgagagcc gcaaacattt cctagaagga 3300
tttgacttcc tgcactatga cgtaaaaacc ggggacttta tcctccactt taagatgaat 3360
cggaatctgt cctttcaaag aggtctgcca ggatttatgc ccgcttggga tatagtgttc 3420
gaaaagaacg aaactcagtt tgacgctaag gggacacctt tcatagctgg gaagcgaata 3480
gttcctgtga tcgaaaatca ccgtttcacc ggtcgctata gggaccttta ccctgctaac 3540
gagctgatcg cgctccttga agagaaagga atcgttttca gagatggtag taacattctt 3600
cctaaactac ttgaaaacga cgactcacac gccattgaca ctatggtcgc actcattagg 3660
tctgtgttgc agatgcgcaa ctctaacgcg gcaactggtg aggactatat taatagccca 3720
gttagagatt taaacggcgt atgttttgac tctcgcttcc aaaatcctga gtggcccatg 3780
gacgctgacg ctaacggcgc ttatcacatt gcattgaagg gacaattgtt actaaaccat 3840
ctcaaggaat ccaaagatct taagttgcaa aatggtattt ctaatcaaga ctggttggcc 3900
tatatacagg agcttaggaa t 3921
<210> 51
<211> 3921
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 51
atgacccagt tcgagggatt taccaatctc tatcaggtta gcaaaaccct caggttcgag 60
ctcatccctc agggcaagac actcaagcat atccaggagc agggtttcat cgaggaggac 120
aaggccagga atgaccacta caaggagctc aagccaatca tcgacaggat ttacaagacc 180
tacgccgacc agtgcctcca gctcgtgcag ctcgactggg agaacctctc cgccgccatc 240
gacagctaca gaaaggagaa gaccgaggaa accagaaacg ccctcatcga agagcaagct 300
acatacagga atgcaatcca tgactacttt atcgggagaa ccgacaacct caccgatgca 360
ataaacaaaa ggcacgctga gatctacaaa ggcctcttca aggccgagct cttcaacggt 420
aaggtgctca agcagctcgg caccgtcacc accaccgagc acgagaacgc cctcctcagg 480
agcttcgaca agttcactac ctatttcagc ggcttctacg agaacaggaa gaacgtcttc 540
tccgccgaag acatctctac cgccatccct cacaggattg tgcaagacaa cttccctaag 600
tttaaggaaa actgtcacat cttcaccagg ctcatcaccg ccgttcccag cctcagggag 660
cacttcgaaa acgtgaagaa ggccatcggg atcttcgtgt ccacctccat cgaggaggtg 720
ttctccttcc ccttttacaa tcagctcctc acccagaccc agatcgacct ctacaaccag 780
ctcctcggtg gcatcagcag ggaggccggc accgagaaga tcaaggggtt gaacgaggtg 840
cttaacctcg ccatccagaa aaacgacgag accgcacata tcatcgcctc tctcccccac 900
aggttcatac ccctcttcaa acagatcctc tcagacagga acacactcag ctttatcttg 960
gaggaattta agtccgacga ggaggtcata cagagcttct gcaaatataa gaccctcttg 1020
aggaatgaga acgttctcga gaccgccgag gcactcttca acgaactcaa cagcatcgat 1080
ctcactcaca tctttatatc acacaaaaaa ctcgagacca tctcatccgc attgtgtgac 1140
cactgggata ccttgagaaa cgccttgtac gagaggagga tctccgagct caccgggaag 1200
ataaccaagt ccgccaagga gaaggttcag aggtccctca agcacgagga cataaacctc 1260
caggagatca tctccgccgc cggaaaggaa ttgtccgagg ccttcaagca gaaaacttca 1320
gagatcctca gccacgccca cgccgctctc gatcagcccc tccccacaac cctcaagaag 1380
caagaggaaa aggaaatctt gaagagccaa ttggactccc tcctcggctt gtaccacctt 1440
cttgactggt tcgctgtcga cgagtccaat gaagtcgacc ccgaattcag cgctaggctc 1500
actggcataa agctcgagat ggagccttcc ctctccttct acaataaggc aaggaactat 1560
gctaccaaga agccctacag cgtcgagaaa ttcaagctca actttcagat gcccaccctc 1620
gcttctggct gggacgtcaa caaggagaaa aacaacggcg ccatcttgtt cgtgaagaac 1680
ggtctttact acctcggtat catgccaaag caaaagggga ggtacaaggc cctctccttc 1740
gagcccacag aaaagacctc cgagggcttc gacaaaatgt actacgacta cttccccgac 1800
gccgccaaga tgattcccaa gtgtagcacc cagctcaagg ccgtgaccgc ccacttccag 1860
acccatacca cccccatatt gttgtccaac aacttcatag agcccttgga aatcaccaaa 1920
gaaatatacg acctcaacaa ccccgagaag gaacccaaga agttccagac cgcctacgcc 1980
aaaaagaccg gggaccagaa gggatacaga gaggccttgt gcaagtggat tgacttcacc 2040
agggatttct tgtcaaagta taccaagaca accagcatcg acctcagcag tctcagaccc 2100
tctagccaat ataaggacct cggcgagtac tacgccgagc tcaaccccct cctctaccac 2160
atcagcttcc aaaggatcgc cgagaaggaa attatggacg ctgtggagac cggcaagctc 2220
tatctcttcc aaatctacaa caaagatttc gctaagggcc accacgggaa gcccaacctt 2280
cacaccctct actggaccgg tctcttcagc cccgaaaacc tcgccaagac cagtatcaag 2340
ctcaacggcc aggccgagct cttctatagg cccaagagca ggatgaagag gatggcccac 2400
aggctcggtg agaagatgct caacaagaaa ctcaaagacc agaagacccc catcccagac 2460
accctctatc aagagctcta cgactacgtc aaccacaggc tcagtcacga tctctccgat 2520
gaggccagag cccttcttcc caacgtcatc accaaggagg tgagccacga gatcatcaag 2580
gacaggagat tcaccagcga caagttcttt ttccacgtcc ccatcaccct caactaccaa 2640
gccgccaata gccccagcaa gttcaaccag agggtgaacg catacctcaa ggagcacccc 2700
gagaccccta tcatcggcat cgacagaggc gaaaggaacc tcatctacat cactgtgatc 2760
gacagcaccg gtaagatcct cgagcagagg agcctcaaca ccatccaaca gttcgattac 2820
caaaagaagc ttgacaatag ggagaaagag agggtcgccg ccaggcaggc ctggtccgtt 2880
gtgggtacca taaaggatct taaacagggc taccttagcc aggtcatcca cgaaatcgtg 2940
gacttgatga tacactacca ggccgtcgtt gtgctcgaaa acctcaactt cggattcaag 3000
agcaagagga ccggtatagc cgagaaagct gtctaccaac agttcgagaa gatgctcatc 3060
gacaagctca actgcctcgt gcttaaggac taccccgccg aaaaggttgg cggtgtgctt 3120
aatccctacc aactcaccga tcagtttacc tctttcgcca agatggggac ccaatctggt 3180
ttcctcttct atgtgcccgc cccatacact tctaagatcg acccccttac cgggttcgtg 3240
gacccattcg tgtggaaaac cattaagaac catgagtcca gaaagcattt cctcgagggc 3300
ttcgattttc tccactacga tgtcaagact ggcgacttca tcctccactt taagatgaac 3360
aggaatctca gcttccaaag gggtctcccc ggattcatgc ccgcctggga catcgtcttt 3420
gagaagaacg agacccagtt cgacgccaag gggaccccct tcatcgcagg gaaaagaatc 3480
gttcccgtga tagaaaacca caggttcacc gggaggtaca gggacctcta tcccgccaat 3540
gagctcatcg ccctcctcga agagaagggt atagtgttca gggacgggtc caacatcctc 3600
cccaagctcc tcgaaaatga cgactcacac gccatcgaca ctatggttgc cctcatcagg 3660
agtgtcctcc aaatgaggaa ctccaacgct gccaccgggg aggactatat caatagtccc 3720
gtgagagacc tcaatggggt ctgcttcgac tcaagattcc agaatcccga gtggccaatg 3780
gatgccgacg caaacggcgc ttaccacatc gctcttaagg gacagctcct cttgaaccat 3840
ttgaaagaat ccaaagatct caagctccag aacggcatct ccaaccaaga ctggctcgcc 3900
tatatccagg agctcaggaa c 3921
<210> 52
<211> 3921
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 52
atgacccagt tcgaaggctt caccaacctc taccaggtta gcaagaccct cagattcgag 60
ctcattcctc aggggaaaac tttgaaacat atccaagagc aggggtttat agaagaggac 120
aaggccagaa acgatcacta caaggagctt aagcccatca tcgacagaat ctacaagact 180
tacgctgatc agtgcctcca gctcgtgcaa ctcgactggg agaacctctc agcagccatc 240
gattcctata ggaaggagaa gaccgaagag accaggaacg ccctcatcga ggagcaggcc 300
acctacagga acgccatcca cgattacttt atcggcagaa ccgacaactt gaccgacgct 360
attaataaga ggcacgccga gatctataag gggctcttca aggccgagct cttcaacggc 420
aaggttctca aacagctcgg caccgtcacc accaccgagc acgagaacgc tctcttgaga 480
agcttcgaca agtttacaac ctatttctcc ggcttctacg agaatagaaa aaatgtgttc 540
agcgccgaag acatcagcac cgccatcccc cacagaattg tgcaggacaa cttccccaag 600
ttcaaggaga actgccacat cttcactagg ctcataaccg ccgtgccctc cttgagggag 660
cactttgaaa acgttaagaa ggcaattggc atcttcgtgt caacctcgat cgaggaggtc 720
ttcagctttc ccttctacaa ccagctcctc acccagaccc agatagacct ctacaaccaa 780
ctcctcggtg ggatctccag ggaagccggg accgagaaga tcaaggggct caacgaagtc 840
cttaacctcg ccatccagaa aaacgacgag acagcccaca ttatcgcctc cctcccccac 900
aggtttatcc ccctcttcaa gcagattctc agcgatagaa acaccctctc ctttatcctc 960
gaggagttta agagcgacga agaagttata cagtcatttt gtaagtacaa gaccctcctc 1020
aggaatgaga acgtgctcga gaccgccgaa gcacttttca atgagctcaa cagcatagac 1080
ctcacccata tatttatcag ccacaaaaag ctcgagacca tcagctccgc cttgtgcgac 1140
cactgggata ccctcaggaa cgcactctac gagaggagga tcagtgagtt gaccggcaaa 1200
atcaccaagt ccgcaaaaga gaaagtccag agaagcctca aacacgagga cataaacctc 1260
caggagatca tctccgccgc cggcaaagag ctctctgagg cattcaagca aaagaccagt 1320
gagatccttt cccacgccca tgccgctctc gaccagcccc tccccaccac cctcaagaag 1380
caagaggaga aggagatcct caagagccaa ttggacagcc tcctcgggct ctatcacctc 1440
ttggactggt tcgccgttga tgagagcaac gaggttgacc ccgagttctc cgccagactc 1500
accggcatca aactcgagat ggaaccctcc ctcagcttct acaataaagc caggaactac 1560
gccaccaaga agccctactc cgtggagaag ttcaagctca acttccaaat gcccaccctc 1620
gcatccggct gggacgtcaa caaggagaag aacaatgggg ctatcctctt cgtgaaaaac 1680
gggctctact accttgggat catgcccaaa cagaagggaa gatataaagc cctctcattc 1740
gaacctaccg agaaaacctc cgaggggttt gacaagatgt attacgatta ctttcccgac 1800
gccgctaaga tgatcccaaa atgcagcacc cagttgaagg ctgtgacagc ccatttccag 1860
acccacacca ccccaatcct cctctccaac aacttcattg aacccctcga aatcaccaag 1920
gagatctatg atctcaataa tcccgaaaag gagcccaaaa agttccaaac cgcctacgca 1980
aagaaaaccg gcgaccagaa gggctacagg gaggccctct gcaaatggat cgattttacc 2040
agggactttc tctccaagta cacaaagacc acaagcattg acttgtcatc cctcaggccc 2100
agctcccagt acaaggacct cggcgagtac tatgctgagc tcaaccccct tctctaccac 2160
atcagctttc agaggatcgc tgagaaggaa attatggacg ccgttgagac cgggaagctt 2220
tacctcttcc agatctataa caaggacttc gccaagggtc accatggcaa gcctaacctc 2280
catactctct actggacagg gcttttcagc cccgagaacc tcgccaagac atccatcaag 2340
ctcaacggtc aggccgaatt gttctatagg cccaagtcta ggatgaagag gatggcccat 2400
aggctcgggg agaaaatgct caacaaaaag ctcaaggacc agaagacccc catccccgac 2460
accttgtatc aggagctcta cgactacgtg aatcacaggc tcagccacga cctcagcgac 2520
gaggccaggg ccttgctccc caacgttatc accaaggagg tgagccatga gatcataaag 2580
gacaggaggt tcacctccga caagttcttt ttccacgtcc caatcaccct caattaccaa 2640
gccgccaact caccttcaaa gttcaaccag agggtgaacg cctaccttaa ggagcacccc 2700
gagacaccta tcatcgggat cgacaggggg gagaggaacc tcatctacat aaccgtgatc 2760
gactctactg ggaagatcct tgagcaaagg tccctcaaca ccatccagca attcgattat 2820
caaaagaagc tcgacaacag ggagaaggag agggttgccg ccaggcaggc ttggagcgtt 2880
gtgggcacaa tcaaggactt gaagcagggc tacctctccc aggtgatcca cgagatagtt 2940
gacctcatga tacattacca ggctgttgtt gttctcgaga atctcaactt cggcttcaag 3000
agcaagagga cagggatcgc cgaaaaagcc gtctaccagc agttcgagaa gatgctcatc 3060
gacaaactta attgcctcgt tctcaaagac tatcccgcag agaaggtggg aggcgtgctc 3120
aacccctatc agctcaccga ccagtttacc tccttcgcca agatggggac ccaatcaggc 3180
ttcctcttct acgtgcccgc cccctacacc tctaaaatcg atcccctcac cgggtttgtc 3240
gatcccttcg tctggaagac catcaagaac cacgagtcca ggaagcattt ccttgagggc 3300
ttcgacttcc tccactacga cgtcaaaacc ggggatttca tcctccactt taagatgaat 3360
aggaacctct ctttccagag gggcctccca gggttcatgc ccgcctggga catcgtgttc 3420
gagaagaacg agacccagtt cgacgccaag ggaaccccct tcatcgccgg caagaggatt 3480
gtccccgtca ttgagaacca caggttcacc ggcagatata gagacctcta ccccgccaat 3540
gaactcatcg ccctcctcga ggaaaagggc atcgtcttca gggacgggtc caacatcctc 3600
cccaagttgc tcgagaacga cgattcccac gccatcgaca ccatggtggc tctcatcagg 3660
agcgtgctcc agatgaggaa cagcaacgcc gccactggtg aggactacat caatagccct 3720
gtcagagacc tcaacggagt ttgctttgac agcaggttcc aaaaccccga gtggcccatg 3780
gatgccgacg ccaatggtgc ctaccatatc gccctcaagg gccaactcct ccttaatcac 3840
ttgaaggaga gcaaggatct caagctccag aatgggatca gcaatcaaga ctggctcgca 3900
tacatccagg agctcaggaa t 3921
<210> 53
<211> 3921
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 53
atgactcagt tcgaggggtt caccaacctt taccaagtct caaagaccct taggttcgag 60
ctcatccctc agggaaagac tcttaagcac atccaggagc agggcttcat cgaagaggat 120
aaagccagga acgaccatta taaggagctc aaaccaatca tcgacaggat ctacaagacc 180
tacgccgacc agtgcttgca actcgttcag ctcgactggg agaatctctc cgccgcaatc 240
gacagctaca ggaaagaaaa gaccgaggaa acaaggaacg ccttgatcga ggaacaggcc 300
acctacagaa acgccatcca tgactacttt ataggcagaa ccgacaatct caccgacgcc 360
atcaacaaga ggcatgccga gatatacaag gggcttttca aggcagagct ctttaatggg 420
aaggttttga agcagctcgg caccgtcacc actaccgagc acgagaatgc cttgttgagg 480
tcattcgaca aattcaccac ttacttctcc gggttctatg agaataggaa gaacgtcttt 540
agcgccgaag acatcagcac agccatccca cataggatcg tccaggacaa tttccccaag 600
ttcaaagaga actgccacat cttcaccagg ctcattaccg cagtgccaag tctcagggag 660
cacttcgaga acgttaaaaa ggctatcggc atcttcgtct ccacatccat cgaggaagtg 720
ttcagctttc ccttctacaa ccagctcctc acccagaccc agatagacct ctacaaccag 780
ttgcttggag gaatcagtag ggaggctggt accgagaaga taaagggact caacgaggtc 840
ctcaacttgg ctattcagaa gaacgacgag actgcacata tcatcgcctc cctcccccat 900
aggtttatcc ccttgtttaa gcagattctc tcagacagga acacactctc attcatcctc 960
gaggagttta agtctgacga ggaagttatc cagagcttct gcaaatacaa gactcttttg 1020
aggaacgaga acgttctcga gaccgccgag gcactcttca atgagctcaa ctccatagat 1080
ctcacccaca tatttatctc ccataagaag cttgagacca tttcctccgc tttgtgtgac 1140
cactgggaca cccttaggaa cgctctctac gaaaggagaa taagcgaact caccggcaag 1200
atcaccaaga gcgccaagga aaaggtgcaa aggtctctta agcatgagga tatcaacctc 1260
caggagatca ttagtgccgc cggcaaggag ttgtccgagg cctttaaaca aaaaaccagc 1320
gagatcctct cacacgcaca cgccgctctc gaccagcccc ttcctaccac ccttaagaaa 1380
caggaggaga aggagatcct caagtcacag ttggactctc tcctcggtct ctatcatctc 1440
ctcgactggt tcgctgtgga cgagagcaac gaggtcgacc ctgagttctc cgccagactc 1500
accggcatca agctcgaaat ggagccctcc ctcagctttt ataataaagc caggaactac 1560
gccaccaaaa agccctacag cgtggagaaa ttcaagctca actttcaaat gcctaccctc 1620
gcctccggct gggatgtcaa caaggagaaa aacaacgggg ccatactttt cgtgaaaaat 1680
gggctctact atttggggat catgcccaaa cagaagggca ggtataaggc cctctccttt 1740
gagcccactg agaagacaag tgaaggattc gataagatgt actacgacta cttccccgat 1800
gccgccaaaa tgatacccaa atgcagcacc cagctcaaag cagtgaccgc ccacttccag 1860
acacatacca cccccatcct cttgagcaac aatttcattg aacccttgga gatcaccaaa 1920
gagatctacg atctcaacaa ccccgagaag gagcccaaga agttccagac cgcctacgcc 1980
aagaagacag gggatcaaaa gggctacagg gaggccctct gtaagtggat cgacttcaca 2040
agagatttcc tcagcaagta taccaagacc actagcatcg atctctcctc cctcaggcca 2100
tcatcacagt acaaggatct cggggagtat tacgccgagc tcaatcctct cctctaccac 2160
ataagcttcc agaggatcgc cgagaaggag ataatggatg ccgtggagac cgggaagctc 2220
tacctctttc agatctacaa caaagacttc gctaaggggc accacggcaa gcccaacctc 2280
cacaccctct actggaccgg cctcttctcc cccgagaacc tcgccaaaac ctccatcaag 2340
ttgaacggcc aggccgagtt gttctacagg cccaagagca ggatgaagag gatggcccat 2400
aggttgggag agaagatgct caataagaag ttgaaggacc agaaaacccc catccccgac 2460
acactctatc aggaacttta tgactacgtc aaccacaggt tgagccacga cctctccgac 2520
gaggccagag cactcctccc caacgtgatc accaaggagg tctcccacga gatcatcaag 2580
gacaggaggt ttaccagcga caaatttttc ttccacgtcc ctatcaccct caactaccag 2640
gccgccaatt cccccagtaa attcaatcaa agagtgaacg cctatctcaa ggagcacccc 2700
gagaccccca taatcggcat cgacaggggc gagaggaacc tcatctacat caccgttatc 2760
gattctaccg gtaagatcct cgaacagagg agcctcaata ccatccagca gttcgactat 2820
cagaaaaagc tcgacaacag ggagaaagag agggttgccg ctagacaggc ctggagcgtc 2880
gtcgggacca tcaaggatct caagcagggt tacctctccc aagtgattca cgagatcgtc 2940
gaccttatga ttcactacca ggccgtggtc gttctcgaaa atcttaactt cgggttcaaa 3000
agcaagagga ctgggatcgc tgagaaagcc gtgtaccagc agttcgaaaa gatgctcatc 3060
gacaaactca actgcctcgt cctcaaggac taccccgccg agaaggtcgg gggggtgctc 3120
aacccctatc aactcaccga tcagttcacc agtttcgcca aaatggggac acagtccgga 3180
ttcctctttt acgtccccgc cccctacacc tctaagatag acccactcac cggcttcgtc 3240
gacccctttg tgtggaaaac cattaagaac cacgaaagca ggaagcattt cctcgaaggt 3300
ttcgatttcc tccactacga cgtcaaaact ggagacttca tccttcactt caagatgaac 3360
aggaaccttt ccttccagag ggggcttccc gggtttatgc ccgcatggga catcgtcttc 3420
gagaaaaatg agacccaatt cgacgcaaag ggcacccctt tcatcgccgg caagagaatc 3480
gtccccgtca tcgagaacca caggttcacc ggcaggtaca gggacttgta ccccgccaac 3540
gagctcattg ccctcctcga agagaagggc atcgttttca gggacgggtc caacatcctc 3600
cccaagctcc tcgaaaacga cgactcacac gccatcgaca ccatggttgc tctcatcagg 3660
agcgtgctcc agatgaggaa ttccaacgcc gccaccggtg aggactatat caactctccc 3720
gttagggacc tcaacggagt gtgcttcgac agcaggttcc agaatcccga gtggcccatg 3780
gacgccgacg ccaatggcgc ctaccacatc gccctcaaag ggcagttgct cctcaatcat 3840
ctcaaggagt ccaaggacct caaactccag aacggcatca gcaaccagga ttggcttgcc 3900
tatatccagg agttgaggaa c 3921
<210> 54
<211> 3921
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 54
atgacccagt ttgagggttt caccaacctc taccaagttt ccaagactct caggttcgaa 60
ctcatccccc agggaaaaac actcaagcac attcaggagc aggggttcat cgaggaggat 120
aaggcaagaa acgaccatta caaggagctc aagcctatta ttgacaggat ctacaagact 180
tacgctgacc aatgcctcca gctcgtgcaa ctcgactggg aaaacctctc cgccgcaatc 240
gatagttaca gaaaggagaa gactgaggag actaggaacg ccctcatcga ggaacaggcc 300
acctatagga acgccatcca cgactacttc atcgggagaa ccgacaacct caccgacgcc 360
atcaacaaga gacacgccga gatatacaag ggcctcttca aggccgaact tttcaacggg 420
aaagttctca aacagctcgg gaccgttacc acaaccgagc acgagaacgc cctcctcagg 480
agcttcgaca agttcaccac atacttctcc ggattttacg aaaatagaaa gaatgtcttc 540
agcgccgagg acatttccac cgccatccct cacagaatcg ttcaagacaa ctttcccaaa 600
ttcaaagaaa actgccacat cttcaccaga ctcatcaccg ccgttccctc tctcagggag 660
cactttgaga acgtcaagaa ggccatcggt atcttcgttt ctaccagtat cgaggaggtt 720
ttctccttcc cattctacaa ccagctcttg acccagaccc agatcgacct ctataatcag 780
ctcctcggcg gcatatctag agaggccgga accgaaaaga tcaaagggct caacgaggtc 840
ctcaacctcg ccatccagaa aaacgacgaa accgctcata tcatcgcatc cctcccccac 900
agattcattc ccttgtttaa gcagatcttg tccgacagaa acactctctc ctttatcttg 960
gaggaattta agtccgacga ggaggttatc cagtccttct gcaagtacaa gaccctcctt 1020
aggaacgaga acgtcctcga gaccgccgag gcccttttca acgagctcaa cagcatcgac 1080
ctcactcaca tcttcatcag ccacaagaaa cttgagacca tctcaagcgc cctctgcgat 1140
cactgggaca ccttgaggaa tgccctctac gaaaggagga taagcgagct caccggcaag 1200
atcaccaagt ccgccaagga aaaggttcag aggagcctca agcacgagga catcaacctc 1260
caggagatta tcagtgccgc aggcaaagaa cttagcgagg catttaagca aaagacctcc 1320
gagattctca gccacgccca cgccgctctt gaccagcccc tccccaccac ccttaagaag 1380
caggaggaaa aggagatcct caaaagtcaa ctcgactccc tcctcggctt gtatcacttg 1440
ctcgattggt ttgccgttga cgagtccaat gaagtcgatc ccgagttcag cgccaggctt 1500
accggcatca agctcgaaat ggagcctagt ctctccttct acaacaaggc cagaaattac 1560
gccaccaaga agccctactc cgttgagaaa ttcaaactca atttccagat gcccactctc 1620
gcttcagggt gggacgttaa caaagagaag aacaacggag ccattctttt cgttaagaac 1680
ggcctctact acttgggcat catgcccaag cagaagggga ggtacaaggc cctcagcttc 1740
gagcccaccg agaagacctc cgaggggttc gataaaatgt actacgacta ttttccagat 1800
gccgccaaga tgatccccaa atgcagtacc caactcaaag ccgtgaccgc tcatttccag 1860
acccacacaa cccccatcct cctcagcaac aacttcatcg agcccctcga aattaccaag 1920
gagatctacg acttgaacaa ccccgagaag gaacccaaga agttccagac cgcctacgcc 1980
aaaaaaaccg gcgaccagaa agggtacagg gaggccctct gtaagtggat cgactttacc 2040
agggattttt tgtctaagta caccaaaacc acatccatcg acctcagctc cctcaggccc 2100
tctagccaat acaaagacct cggtgaatac tacgcagaac tcaaccccct cctctaccac 2160
attagcttcc agaggatcgc tgagaaggag atcatggacg ccgtggagac cgggaagctc 2220
tacctcttcc agatctacaa taaggacttc gccaagggcc accacgggaa gccaaatctc 2280
cacaccctct attggaccgg gctcttcagc cccgaaaatc ttgcaaagac aagcatcaag 2340
ctcaacggcc aagccgaact cttctacagg cccaagtcca ggatgaagag gatggcccac 2400
aggctcggcg aaaagatgct caacaagaag ttgaaagacc aaaagacccc catccccgat 2460
accttgtacc aggagttgta cgattacgtc aaccataggc tctcccacga tctctccgac 2520
gaggccaggg cactcctccc caacgttatt accaaagaag tctcacacga gatcatcaag 2580
gacagaaggt ttacatccga caaattcttc ttccacgttc ccatcacctt gaattaccaa 2640
gctgctaatt cccccagcaa gttcaaccaa agggtgaatg cctatttgaa ggagcatccc 2700
gagaccccca ttataggcat cgacaggggg gaaaggaacc ttatctacat taccgttatc 2760
gactccaccg gaaagatcct cgagcagagg tccttgaaca ctattcagca gttcgactac 2820
cagaagaagc tcgacaatag ggagaaagag agggtcgccg ccagacaggc ctggagcgtc 2880
gtcggtacca tcaaagacct caagcagggc tacctgagcc aggtcatcca cgagatcgtt 2940
gacctcatga tccactacca ggccgtcgtc gttcttgaga acctcaactt tggttttaag 3000
tccaagagga ctgggatcgc cgagaaggca gtgtatcagc agttcgaaaa gatgctcata 3060
gacaagttga actgtctcgt tctcaaggac taccccgctg agaaggtggg aggcgtcctt 3120
aacccctacc aactcaccga tcagttcacc tccttcgcca agatgggaac ccaaagcggc 3180
ttcctctttt acgtcccagc tccctacacc tccaagatcg acccactcac cggtttcgtc 3240
gacccttttg tttggaagac catcaagaac cacgagtcca ggaagcactt cctcgagggg 3300
ttcgacttcc tccattacga tgtgaagacc ggcgacttca tcctccattt caagatgaat 3360
agaaacctct cattccaaag agggctcccc gggttcatgc ccgcttggga cattgttttc 3420
gagaagaacg agacccaatt cgatgccaag ggcacacctt ttatcgccgg caagagaata 3480
gttcccgtca tcgagaacca cagattcaca gggaggtaca gagacctcta ccccgctaat 3540
gagctcattg ccctcctcga ggagaaaggg atcgtcttca gggacggcag caacatcctc 3600
cccaaactcc tcgaaaacga cgactcacac gccattgata ccatggtcgc actcatcaga 3660
tccgttctcc agatgagaaa ttccaacgcc gccaccggag aggactacat aaacagcccc 3720
gtcagggacc tcaacggcgt gtgcttcgac agcaggttcc aaaaccctga gtggcccatg 3780
gacgctgacg ctaacggagc ctaccacatc gcccttaagg ggcagctcct cctcaaccac 3840
ctcaaagaga gtaaagatct caagttgcaa aacggaatca gcaaccagga ctggctcgcc 3900
tacatccagg agctcagaaa t 3921
<210> 55
<211> 3921
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 55
atgacccagt ttgagggctt taccaacttg tatcaggtta gcaaaaccct taggttcgaa 60
ctcatccctc aaggtaagac cttgaaacac attcaggaac aggggttcat cgaagaagac 120
aaagccagga atgaccacta caaagagctc aaacccatta tcgacagaat ctataaaact 180
tacgccgacc agtgtctcca gctcgtgcag ttggactggg agaatctcag cgccgccatc 240
gacagctata gaaaagagaa aaccgaggag acaaggaacg ccctcatcga ggagcaggcc 300
acttacagga acgccatcca tgactacttc atcggcagga ccgacaacct caccgacgcc 360
atcaataaga ggcatgccga gatctacaag ggcctcttta aggccgagct cttcaacggc 420
aaagtcctca agcagctcgg gaccgtgacc accaccgagc acgagaacgc ccttcttaga 480
tccttcgaca agttcaccac ctatttcagt ggcttctatg agaacagaaa gaatgttttc 540
tccgccgaag acatcagcac cgccattccc cacagaatag tgcaggacaa tttccccaag 600
ttcaaggaga actgccacat cttcactagg ttgatcaccg ccgtcccctc tctcagagaa 660
cattttgaga atgttaagaa ggccatcggc attttcgtta gtaccagcat cgaagaggtc 720
ttcagtttcc ccttctacaa tcagttgctc actcaaaccc agatcgacct ttacaaccag 780
ctcctcgggg gtatctccag ggaggccggc accgagaaaa tcaaggggct caacgaggtg 840
cttaacctcg ccatccaaaa aaacgacgaa accgcccaca tcatcgccag cctcccccat 900
aggttcatcc ccttgttcaa gcagatcctt tccgacagaa acactctctc ctttatcctc 960
gaggagttta aaagcgacga ggaggtgatc cagtccttct gcaagtacaa aacacttctc 1020
agaaacgaga acgtgctcga gaccgcagag gcccttttta acgagctcaa cagcatcgac 1080
ctcacccaca ttttcatctc ccataaaaag ctcgagacca ttagctccgc cctctgcgac 1140
cactgggaca cactcagaaa cgccctgtac gaaaggagga tcagtgagtt gactgggaag 1200
atcaccaaga gcgccaagga aaaagtgcag aggtccctca agcacgagga catcaacctc 1260
caggaaatca tcagcgccgc cggtaaggag ctcagtgagg ctttcaaaca gaagacctcc 1320
gagatactct cacacgccca cgctgcactc gaccagccat tgcccaccac cctcaagaag 1380
caagaagaga aggagatctt gaagtcccag ctcgattccc tcctcgggct ctaccacctc 1440
ctcgactggt tcgccgttga cgagagcaat gaagttgacc ccgagttctc cgctaggctc 1500
accggcatca agctcgagat ggagccctcc ctctccttct acaacaaggc cagaaactac 1560
gccacaaaga agccctactc cgtggagaag tttaaactca acttccagat gcccaccctc 1620
gcctccggct gggatgttaa caaggaaaaa aacaacggcg ccatcctctt tgtgaagaac 1680
ggactctact acctcgggat catgcccaaa cagaaaggca ggtacaaggc actctcattt 1740
gaacccaccg aaaagacaag cgaggggttt gataagatgt attacgatta cttccccgac 1800
gccgcaaaga tgatccccaa gtgctctact cagttgaaag ccgtcaccgc ccattttcaa 1860
acccacacca cccccatcct cctttccaac aacttcatcg agcctttgga gatcaccaag 1920
gagatctacg acttgaataa ccccgaaaag gagcccaaaa agtttcagac tgcatacgcc 1980
aaaaagaccg gcgaccagaa gggatatagg gaagccctct gcaagtggat agacttcact 2040
agagacttcc tcagtaagta caccaagacc acttcaatcg acctctcctc tctcaggcct 2100
agctcccagt acaaagacct cggtgagtat tatgcagagc tcaatcctct cctttaccat 2160
ataagcttcc agaggatcgc cgagaaggag atcatggacg ccgtcgagac agggaagctc 2220
tacctcttcc agatctataa caaagatttt gctaaagggc accacggcaa acctaacctc 2280
cacacactct attggacagg gcttttcagc cctgagaacc tcgccaagac cagtatcaaa 2340
ctcaacggcc aggccgagct cttctacaga cccaaaagca ggatgaaaag gatggcccac 2400
aggctcggag agaagatgtt gaataagaag ctcaaggacc agaagacacc catccccgac 2460
accctctatc aggagctcta cgactacgtc aatcacaggc tctctcatga cttgtccgac 2520
gaggccaggg cccttctccc caacgtcatc accaaggagg tctcccatga gattatcaag 2580
gacaggaggt tcacctcaga caagttcttc ttccacgtcc ctattaccct caactaccag 2640
gccgccaaca gccccagcaa gttcaatcag agggtcaacg cctacctcaa ggaacatcca 2700
gagaccccca tcatcggcat tgacagaggt gagaggaacc tcatctatat caccgtgatc 2760
gactccactg gcaagatcct cgagcagagg tccctcaaca ccatccagca attcgactat 2820
cagaagaagc tcgacaacag ggaaaaagag agggtggccg ccagacaggc ctggagcgtc 2880
gtggggacca tcaaggacct caagcaggga tacctttccc aggtgatcca cgagatcgtc 2940
gacctcatga tccactatca ggccgttgtc gtgcttgaga acctcaactt cggatttaag 3000
tccaagagga ccggcatcgc cgagaaagct gtgtaccagc agttcgaaaa aatgctcatc 3060
gacaagctca actgccttgt cctcaaggac taccccgccg agaaggtggg gggcgttctc 3120
aacccttacc agttgactga ccaatttacc tcattcgcca agatggggac tcagtccggg 3180
ttcttgttct acgtccccgc cccctacaca tccaagattg accccttgac tgggtttgtt 3240
gaccccttcg tctggaagac catcaagaac cacgaatcaa ggaaacactt cctcgagggt 3300
ttcgacttcc tccactatga cgtcaagacc ggggacttca tcctccattt caagatgaat 3360
agaaatctta gctttcagag gggtctccca ggtttcatgc ctgcctggga tatagtcttc 3420
gagaagaatg agacccagtt cgatgccaaa ggcaccccct tcatcgccgg gaagaggatc 3480
gtgcccgtca tcgagaacca caggttcact gggaggtaca gggaccttta ccctgccaat 3540
gaactcatcg ccctcctcga agagaaggga atcgtgttta gggatgggtc caacatcctt 3600
cccaaactcc tcgaaaacga cgacagccac gccatcgata ccatggtcgc cctcatcaga 3660
agcgttctcc aaatgaggaa ctccaacgcc gccaccgggg aagactacat caactccccc 3720
gttagggatc ttaatggcgt gtgtttcgac tccaggttcc aaaatcccga gtggcccatg 3780
gacgccgatg ccaacggggc ctaccacatc gccctcaagg ggcagctcct tctcaaccat 3840
ttgaaagaga gcaaagacct caaattgcag aacgggattt caaatcagga ctggcttgcc 3900
tatatccaag agctcaggaa t 3921
<210> 56
<211> 3921
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 56
atgacccagt ttgaagggtt caccaacctc taccaggtga gcaagaccct cagattcgag 60
ttgatcccac aggggaaaac tctcaagcac atccaagagc agggcttcat cgaggaagac 120
aaagccagga acgaccatta caaggagctc aagcccatca tcgacaggat ctacaagacc 180
tacgcagacc agtgcctcca gctcgtccag ttggactggg agaacctcag cgcagccata 240
gactcctata gaaaagagaa gaccgaggag actaggaacg ccttgatcga ggagcaggcc 300
acctacagga atgccataca cgattacttc ataggcagaa ccgacaatct caccgacgcc 360
atcaataaga ggcacgccga aatatataag gggctcttca aagccgaact tttcaatggt 420
aaagttctca agcagttggg cacagtgacc accaccgagc acgagaacgc cctcctcagg 480
tccttcgata aattcaccac ctatttctcc ggtttctacg agaacaggaa gaacgtcttt 540
agcgccgaag atatttcaac tgccattcca cacaggatcg tgcaggacaa cttccccaaa 600
tttaaggaga actgccacat attcacaagg ttgatcaccg ctgtccccag tctcagagag 660
cacttcgaaa atgtgaagaa ggccatcggc atcttcgttt ccacttccat tgaagaagtg 720
ttctccttcc ccttctataa tcagctcttg acccagacac agatcgacct ctacaaccag 780
ctcctcggcg ggatctccag agaagccggc accgagaaga tcaaagggct caatgaggtg 840
ctcaacttgg ccatccagaa aaacgacgag accgcccaca taatcgcctc cctcccccac 900
aggttcatcc ccctcttcaa gcagatactc tccgacagga acaccctctc tttcatactc 960
gaagagttca agagcgacga ggaggtgatc cagtcattct gtaagtacaa gaccctcctc 1020
aggaacgaga acgtgctcga gaccgccgag gccctcttca acgagcttaa cagcatcgac 1080
ctcacccaca tcttcatctc acacaagaag cttgagacca tcagcagtgc cctctgcgat 1140
cactgggaca cattgaggaa tgccctctat gagagaagga tctccgagct caccggcaaa 1200
attaccaaga gcgccaagga gaaggtgcag aggtccctca aacacgagga catcaacctc 1260
caggaaatca tcagcgccgc tggtaaagaa ctctccgagg catttaaaca aaagacctcc 1320
gagatcctct cccatgccca cgccgccctc gatcagcccc tccctaccac acttaagaaa 1380
caggaagaaa aggagatcct caaaagccaa ctcgattctc tcctcggtct ctaccacctc 1440
ctcgactggt tcgccgtgga cgagagcaac gaggttgacc ccgaattcag cgctaggctc 1500
accgggatca agctcgagat ggagccttcc ctctcatttt ataacaaggc caggaattac 1560
gccaccaaga agccttattc agtggaaaaa ttcaagttga atttccagat gccaaccctc 1620
gcctccgggt gggacgtgaa taaagagaaa aacaacggag ccatcctctt cgtcaagaac 1680
ggcctctatt acttggggat catgcccaaa cagaagggca ggtacaaggc cctcagcttt 1740
gagcccactg aaaaaacaag tgagggcttc gacaagatgt actacgacta cttccccgat 1800
gccgccaaga tgatccccaa atgcagcacc caacttaaag ccgttaccgc ccactttcag 1860
acccacacca cccccatcct cctcagcaac aacttcatag agccccttga gatcaccaag 1920
gaaatctacg atctcaacaa cccagagaag gagcccaaaa agttccagac cgcctatgcc 1980
aagaagaccg gggaccagaa aggctacaga gaggcactct gcaaatggat cgactttacc 2040
agggactttt tgagtaaata caccaagacc acctccatcg acctcagctc cctcaggccc 2100
agtagtcaat ataaggatct cggcgagtac tatgccgagc tcaaccccct cctctatcac 2160
atcagctttc aaaggatcgc cgagaaggag attatggatg ctgtggaaac cggtaagttg 2220
tacctcttcc agatctacaa caaagacttc gcaaagggac atcacgggaa gcccaacctc 2280
cacaccctct attggaccgg ccttttcagc cccgagaatc tcgccaagac ctctataaaa 2340
ctcaacgggc aggccgaact cttctacaga cccaagtcta ggatgaagag gatggcccat 2400
aggctcggcg agaagatgct caacaagaag ctcaaggatc agaaaactcc catccccgat 2460
accctctacc aggaactcta cgactacgtg aaccacagac tcagccacga cctctccgat 2520
gaggcaaggg cactcctccc taacgtcatc accaaggagg tgagtcatga aatcatcaag 2580
gacagaaggt ttacatccga caagttcttc ttccatgtcc ccatcaccct caattaccag 2640
gccgccaaca gcccctccaa gttcaaccag agggtcaacg cctacctcaa agagcacccc 2700
gagaccccca taatagggat cgacagaggc gaaaggaatc tcatctacat taccgtcatc 2760
gacagcacag gcaagatcct cgagcaaagg tccctcaaca caatccagca gttcgattac 2820
cagaaaaagc tcgataacag ggaaaaggaa agggtggctg ccaggcaggc ctggtccgtt 2880
gtgggaacca tcaaggacct taaacagggc tatctctcac aggtcatcca cgagatcgtc 2940
gatcttatga ttcactatca ggccgtcgtt gtgctcgaga acctcaattt cggtttcaag 3000
tccaagagga ccggcatcgc cgagaaggca gtctaccaac agttcgagaa gatgctcatc 3060
gacaaactca actgcctcgt tctcaaggac taccccgccg agaaggttgg aggggtgttg 3120
aacccctacc aactcaccga ccagttcacc tcatttgcca aaatgggcac ccaaagcggt 3180
ttcctcttct atgtccccgc tccctacact agcaagatcg accccttgac aggcttcgtc 3240
gacccattcg tctggaagac catcaagaac cacgaatcca ggaaacactt tcttgagggg 3300
tttgattttc tccactacga cgtgaaaact ggcgatttta ttctccattt caagatgaac 3360
agaaacctct ccttccagag gggcctcccc gggttcatgc cagcctggga cattgttttc 3420
gaaaagaacg aaacccaatt cgacgccaaa gggacaccct tcatcgccgg gaaaaggatc 3480
gtccccgtca tcgaaaacca taggttcacc ggcagataca gggatctcta tcccgccaac 3540
gagctcatcg cactcttgga ggagaagggg atcgtgttca gggacggatc caacatcctc 3600
ccaaagctcc tcgagaacga cgactctcac gccatcgata ccatggttgc cctcatcagg 3660
agcgttctcc aaatgagaaa ctctaacgca gccaccggag aggactatat caacagcccc 3720
gtgagggact tgaatggcgt ttgcttcgac agcaggtttc agaaccccga gtggcccatg 3780
gatgccgatg ccaacggcgc atatcacata gccctcaagg gtcagctcct cttgaaccac 3840
ttgaaagaat ctaaggacct caagctccag aatggtatct ccaaccagga ctggctcgcc 3900
tacatccagg aactcagaaa c 3921
<210> 57
<211> 3921
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 57
atgactcagt tcgagggatt caccaacctc taccaagtct ccaagaccct caggttcgag 60
ctcatccccc aggggaagac acttaagcac atccaggagc aaggattcat cgaggaggac 120
aaggccagga acgaccacta taaggagctc aagcccatca tcgataggat ctacaagact 180
tacgcagacc agtgcctcca gctcgtgcag ctcgactggg aaaacctcag cgcagccatc 240
gacagctaca gaaaggagaa gaccgaggag accagaaacg ccctcatcga ggagcaagcc 300
acctacagga acgcaataca tgactacttc atcggaagga ccgacaatct cacagacgcc 360
atcaacaaga ggcacgccga gatctacaag gggctcttca aggccgagct cttcaacggc 420
aaggtgttga agcagctcgg cactgtgacc accaccgaac acgagaatgc cctcctcagg 480
agtttcgata agtttaccac atactttagt ggtttctacg aaaacagaaa gaacgtcttc 540
tctgctgaag acataagcac cgccatcccc cacagaatcg tgcaggataa cttccccaag 600
ttcaaggaga actgtcatat cttcaccagg ctcatcactg ccgtgccttc ccttagggag 660
catttcgaaa atgtcaagaa ggccataggt atcttcgtga gcaccagcat tgaagaggtg 720
ttttccttcc ccttctacaa ccagctcctc acccagaccc agatcgatct ctataaccag 780
ctcttggggg gcatttccag agaagccggt acagagaaga tcaagggcct caatgaggtt 840
ctcaacctcg ctatccagaa aaatgacgag accgcacata taatcgcctc cctcccccac 900
aggttcatac ccctcttcaa acagatcctc tcagacagga ataccctcag cttcatcctc 960
gaggagttca agtccgacga ggaggtgatc cagtccttct gcaaatacaa gaccctcctc 1020
aggaatgaga acgtccttga gaccgccgag gccctcttta atgagctcaa cagcatcgac 1080
ttgacccaca tcttcatctc tcacaaaaag ctcgaaacaa tcagctccgc cctctgcgac 1140
cattgggaca ccttgaggaa cgccctctac gaaaggagaa tctccgagct caccggcaag 1200
ataacaaaga gcgccaagga aaaagtgcag aggagcctca agcatgagga tatcaacctc 1260
caggagatca tctccgccgc cggtaaggag ctcagcgagg ccttcaagca gaagacttcc 1320
gagatcttga gccatgccca cgccgctttg gaccagcccc tccctacaac cctcaagaag 1380
caagaggaga aggagattct caagagccag ctcgactcat tgctcggact ctaccacctc 1440
ctcgactggt ttgccgtgga cgagtccaac gaggtcgacc ccgaattctc cgccaggttg 1500
accggcataa aactcgagat ggagccctca ctcagctttt acaataaggc caggaactac 1560
gcaacaaaga agccctactc cgtcgaaaag ttcaagctca acttccagat gcccaccctc 1620
gcatccgggt gggatgtgaa caaggagaag aacaacgggg ccatcctctt tgtcaagaat 1680
gggctctact acctcgggat catgcccaag cagaaaggta gatataaagc cctcagcttc 1740
gagcccacag agaagacctc cgaaggattc gacaagatgt attacgacta cttccccgat 1800
gccgccaaaa tgatccccaa gtgcagcacc caactcaagg cagtgaccgc ccatttccag 1860
acccacacta cccccatcct cttgagcaac aattttatcg agcccttgga gatcaccaag 1920
gagatctatg acctcaacaa tccagagaag gagcccaaga agttccaaac cgcctatgcc 1980
aagaagaccg gcgaccaaaa gggctacagg gaggccctct gcaagtggat cgattttacc 2040
agggacttcc tcagcaagta caccaaaact acctctatcg accttagcag cctcagaccc 2100
tcaagccagt ataaggacct cggcgagtac tacgccgagc tcaatccctt gttgtaccat 2160
atcagctttc agaggatcgc cgagaaggag atcatggacg ccgtcgagac cgggaagctt 2220
tacttgttcc aaatctacaa caaggacttc gccaagggac accacggcaa gcctaacctc 2280
cacaccctct actggaccgg gctcttttcc cccgagaact tggccaagac ctctatcaag 2340
ctcaacggcc aggccgagct cttctataga cccaagagca ggatgaagag gatggcacac 2400
agactcgggg agaagatgct caacaagaag ctcaaggatc agaagacccc catccccgac 2460
accctctacc aagaactcta cgattacgtc aaccacagac tctctcacga tctctctgac 2520
gaagcaagag ctctcctccc caacgtgatt actaaggagg tctcacacga gattataaag 2580
gacaggagat tcacctccga caagttcttc ttccacgtcc ccatcaccct caactaccag 2640
gctgccaaca gcccatccaa gttcaaccag agggttaacg cctacctcaa ggaacatcct 2700
gagaccccca tcatcggcat cgacaggggg gagaggaact tgatttacat caccgtcatc 2760
gacagtaccg gtaaaatcct cgagcagagg agcctcaaca ccatccagca gttcgactat 2820
caaaagaagc tcgacaacag agagaaagag agagtggctg ctaggcaagc ctggagcgtt 2880
gtgggaacaa ttaaagacct caagcagggc tacctttccc aagtgatcca cgaaatcgtg 2940
gacctcatga tccactatca ggccgtcgtc gtccttgaaa acctcaattt tggtttcaaa 3000
agtaagagga ctggtatcgc cgagaaggcc gtctaccaac agttcgaaaa aatgctcatc 3060
gacaaactca actgcttggt gctcaaagac taccccgccg aaaaggtggg aggcgtcctc 3120
aacccatacc agctcaccga ccaattcact agctttgcta agatgggaac ccagtccggg 3180
tttttgttct acgttcccgc cccctacaca agcaagatag accccctcac tgggtttgtg 3240
gaccccttcg tgtggaagac cataaaaaac cacgagagca gaaaacattt tctcgaggga 3300
ttcgactttc tccactacga tgtcaaaacc ggtgacttta tactccactt caagatgaac 3360
agaaatctct cctttcagag aggcctccct gggttcatgc ccgcctggga catcgtcttc 3420
gagaagaacg agacccagtt tgacgctaag ggcaccccct tcatcgccgg caagaggatc 3480
gtgcccgtca tcgaaaacca caggttcacc ggcaggtata gggatctcta ccccgccaac 3540
gagctcatcg ccctccttga agaaaagggt atcgtcttca gagacgggag caatatcctc 3600
ccaaagttgt tggagaacga cgatagccac gccatcgaca ccatggtcgc ccttatcagg 3660
tccgtcctcc agatgaggaa ctctaacgcc gccaccggag aggactacat aaactcaccc 3720
gttagagacc tcaacggcgt ttgtttcgac tccaggtttc aaaatcccga gtggcccatg 3780
gatgccgacg ccaacggggc ttaccacatc gccttgaagg ggcagcttct tctcaaccac 3840
ctcaaggaga gcaaggacct caagctccaa aacgggatca gcaaccaaga ctggttggcc 3900
tatatccagg agctcagaaa t 3921
<210> 58
<211> 3921
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 58
atgacccagt tcgagggctt caccaacctc tatcaggtct ctaagacact cagatttgaa 60
ctcatccccc aggggaaaac cctcaaacac atccaggagc agggctttat agaggaggac 120
aaggcaagga atgaccacta taaggagctt aagcccatca ttgataggat ctacaaaacc 180
tacgccgatc agtgccttca gctcgttcag ctcgactggg agaacctcag cgcagccatc 240
gattcctaca ggaaggaaaa aaccgaggag accaggaacg cactcatcga agagcaggcc 300
acctacagaa acgccatcca tgattatttc atcggcagga ccgacaacct caccgacgcc 360
atcaataaga ggcatgctga gatctacaag ggccttttca aggctgagct cttcaacggg 420
aaggtgctta agcagcttgg caccgtcacc accaccgagc acgaaaatgc attgcttagg 480
tcattcgaca agttcacaac ctattttagt ggtttctatg aaaataggaa gaacgtgttc 540
tccgccgagg acatcagtac cgccatcccc cacagaattg tgcaagacaa cttccccaaa 600
ttcaaggaga actgccacat ctttaccaga ctcatcaccg ccgtgcccag cttgagggag 660
cacttcgaga acgtcaagaa agccatcggg atctttgtca gcacctccat cgaggaggtg 720
ttctccttcc ccttctataa ccagctcctc actcagaccc agatcgacct ctacaatcag 780
ctcctcggcg gaatctccag ggaggcaggt accgagaaga taaaagggct taatgaggtc 840
ctcaacctcg ccatccagaa aaatgacgaa accgcccaca tcatcgcctc cctcccccac 900
agatttatcc ccttgttcaa gcaaatcctc agcgacagga acaccctcag cttcattctt 960
gaggagttca agagcgatga agaagttatc cagtccttct gcaagtacaa aacactgctc 1020
aggaacgaaa atgtgctcga aaccgccgaa gccctcttca acgagctcaa cagcattgac 1080
ctcacccaca tcttcatcag ccacaaaaag cttgagacca tcagctccgc tctgtgcgac 1140
cattgggaca ctctcaggaa cgccctttac gaaaggagga taagcgagct taccggcaag 1200
atcaccaaat ccgccaaaga gaaggttcag agatccctca agcacgagga catcaacctc 1260
caagagatta tcagtgccgc cgggaaggag ctttctgaag ccttcaagca gaagaccagc 1320
gagatccttt cccacgccca tgctgccctc gaccagcctc tccccaccac cctcaagaag 1380
caggaggaga aggaaattct caagagccag ctcgacagtc tcctcggcct ctaccacctt 1440
ctcgactggt tcgccgtcga cgagtccaat gaagtcgacc ccgaatttag cgccaggctc 1500
accgggatta agctcgagat ggaaccaagt cttagcttct ataacaaggc caggaattac 1560
gccaccaaga agccttactc agtggagaag ttcaagctta acttccagat gcccacactt 1620
gcctccgggt gggacgtgaa caaggaaaag aacaacggag caatactctt cgtcaagaac 1680
gggttgtact atctcggcat tatgcccaag cagaaaggca ggtacaaggc tctctccttt 1740
gagcctaccg agaaaacctc agagggtttc gacaaaatgt actacgacta cttccctgac 1800
gccgccaaga tgatccccaa gtgctccacc cagctcaagg ccgtcaccgc tcatttccaa 1860
actcacacca ctcccatcct ccttagcaac aactttatcg agcccctcga gatcaccaaa 1920
gagatctacg acctcaacaa cccagagaag gaacccaaga agttccaaac cgcctacgcc 1980
aagaaaaccg gggaccagaa gggctatagg gaggctctct gcaagtggat agacttcacc 2040
agagactttc tctccaagta cacaaagacc acctccatcg acctctccag cctcaggccc 2100
tcctcccagt acaaggacct tggggagtac tacgcagagc tcaacccact cctctaccat 2160
atttcatttc agaggattgc cgagaaggag atcatggacg ccgtcgagac cgggaaactc 2220
tacctcttcc agatatataa caaggacttc gccaaggggc accacgggaa gcccaacctc 2280
cacaccctct actggaccgg gctctttagc ccagagaatc tcgccaaaac cagcatcaag 2340
ctcaatggcc aggccgagct tttctatagg ccaaagagca ggatgaagag gatggcccac 2400
aggcttggcg aaaagatgct taacaagaag ctcaaggatc agaaaacccc catccctgac 2460
accttgtacc aagagctcta cgactacgtg aaccatagac tctcccacga tctcagtgac 2520
gaggccaggg ccctcctccc taacgttatt actaaggaag tctcacatga gatcatcaaa 2580
gacagaaggt tcaccagcga taagtttttc tttcacgtcc ccataacact caactaccag 2640
gctgccaact ccccaagcaa gttcaatcag agggtcaacg cctacctcaa ggagcaccct 2700
gagaccccca tcataggaat cgacaggggg gagaggaacc tcatatacat caccgtgatc 2760
gacagcaccg ggaaaattct cgaacaaaga tccctcaaca caatacagca gtttgactat 2820
cagaagaagt tggataacag ggagaaggag agggtggccg ccaggcaggc ctggagcgtg 2880
gtcgggacca tcaaggactt gaagcagggg tacctctccc aggtcatcca tgagatcgtt 2940
gacctcatga tccactacca ggcagtggtc gtgctcgaaa atctcaattt cggtttcaaa 3000
agcaagagga ccgggatcgc cgagaaggct gtttaccagc aattcgaaaa gatgctcata 3060
gacaagctta actgcttggt gctcaaagac taccccgctg agaaggtggg aggcgtgctc 3120
aacccatacc aattgaccga tcaatttacc agcttcgcca aaatggggac ccaatctggc 3180
ttcctcttct acgtccctgc cccatacacc agcaagatcg atccactcac tggtttcgtt 3240
gatcccttcg tttggaagac catcaagaac catgagtcca ggaagcactt cctcgagggt 3300
tttgacttcc tccactacga cgtgaagacc ggtgatttca tccttcactt caagatgaat 3360
aggaacctca gctttcagag ggggctcccc gggtttatgc ccgcctggga catcgtcttc 3420
gagaagaacg agacccagtt tgacgccaag gggacaccct tcatcgctgg caagaggatc 3480
gtcccagtta tcgagaacca taggtttacc gggaggtaca gagacctcta ccccgctaac 3540
gaactcatcg ctttgctcga agagaaaggt atcgtgttca gggatgggag taatatactc 3600
cccaagctcc tcgagaacga cgactcccac gcaatcgata ccatggtcgc cctcatcagg 3660
tccgtccttc agatgagaaa tagcaacgca gccactggag aagactacat caactccccc 3720
gtgagagact tgaacggcgt ctgcttcgat tccagattcc agaaccccga gtggcccatg 3780
gacgccgacg ctaacggggc ctaccacatc gccctcaagg gccagcttct cctcaaccac 3840
ctcaaggaat ccaaggacct taagctccag aacggcatta gcaaccagga ctggttggca 3900
tacatccaag agctcaggaa c 3921
<210> 59
<211> 3921
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 59
atgacccagt tcgaagggtt caccaacctc tatcaggtca gcaagaccct cagattcgag 60
ctcatccccc agggtaaaac cctcaagcac atccaggaac aagggtttat cgaggaagac 120
aaggccagga atgatcatta caaggagctc aagcccatta tcgatagaat ctacaagacc 180
tacgctgatc aatgtctcca gttggtgcag ctcgactggg aaaacctctc tgccgccatc 240
gattcctaca gaaaggaaaa gaccgaggag accaggaatg ccctcatcga agagcaagca 300
acctacagaa acgcaatcca cgattacttc atcggcagga ccgacaacct cactgatgct 360
ataaacaaga ggcacgccga aatatacaag gggcttttca aagcagagct cttcaacggc 420
aaggtgctca aacaactcgg caccgtcacc accaccgaac atgaaaacgc cctcctcaga 480
agcttcgaca aattcaccac ctacttcagc ggtttttacg agaataggaa aaacgttttc 540
tccgccgagg acatcagcac cgctatccct cacagaatcg tgcaagacaa cttccccaag 600
ttcaaggaga actgtcacat cttcaccagg ctcattaccg ccgtccccag tctcagggag 660
cacttcgaaa acgtcaagaa agccataggg atcttcgtta gcacctccat cgaagaggtc 720
ttcagtttcc ccttttacaa ccagctcctc acccagaccc agatcgactt gtacaatcaa 780
ctcctcggag ggatcagcag agaggctggg accgagaaga ttaagggact caatgaggtc 840
ctcaacctcg ccatccagaa gaacgacgag accgcccaca tcatcgcctc cctcccccac 900
aggttcatcc ccctcttcaa gcagatactc tccgacagga ataccctcag tttcatcctt 960
gaggagttca agagcgatga ggaagttata cagagcttct gcaaatacaa gaccctcctt 1020
aggaatgaga acgttttgga gaccgctgag gcactcttta atgagctcaa ctccatcgac 1080
cttacccaca tcttcatctc tcataagaag ctcgagacta tatcttccgc tctctgcgat 1140
cactgggaca ccctcagaaa cgccctctat gagaggagga tctccgaatt gaccggcaag 1200
atcaccaagt ccgccaagga gaaggtccaa aggagcctca agcatgagga catcaatctc 1260
caggaaatca tatcagccgc cggaaaagag ttgtccgaag cattcaagca gaagacctcc 1320
gaaatactca gccacgctca cgctgccctc gatcaaccac tccccaccac cctcaagaag 1380
caggaagaga aggaaatcct caagtcccag ttggatagcc tgttgggctt gtaccacctc 1440
ctcgactggt tcgcagttga cgagtccaac gaggttgacc ccgaattcag cgccaggctc 1500
accgggatca agctcgaaat ggagccctct ctctccttct acaacaaggc caggaactat 1560
gccaccaaga aaccctacag cgtggaaaag ttcaagctca acttccaaat gcccacactc 1620
gcctcaggtt gggatgtgaa caaggaaaaa aacaacggcg caatcctctt cgtcaagaac 1680
gggctctact atcttgggat catgcccaaa cagaaaggga ggtataaggc cctctccttc 1740
gagcctaccg agaaaacctc cgagggtttt gacaagatgt actacgacta ctttcctgac 1800
gccgccaaga tgatccccaa gtgcagcacc cagctcaagg ccgtcaccgc ccacttccag 1860
acacacacca cccccatcct cctctccaac aacttcatcg aacccttgga gatcaccaag 1920
gagatatacg atctcaacaa cccagaaaag gagcctaaga agttccagac cgcctacgcc 1980
aagaaaacag gggatcagaa agggtacagg gaagccctct gtaagtggat cgactttacc 2040
agggacttcc tctccaagta cacaaagaca acctccattg atctttccag cctcagaccc 2100
agctcccagt acaaagacct cggggagtac tacgctgagt tgaaccccct cttgtaccat 2160
attagcttcc aaaggatcgc cgagaaggag atcatggacg ccgtcgagac cggtaagctc 2220
tacctcttcc agatttataa caaggatttc gccaaagggc accacgggaa gcccaacctt 2280
cataccctct actggaccgg gctcttttcc cccgaaaacc tcgccaagac ctccataaag 2340
ctcaacgggc aggccgaact tttctacagg cctaaatcaa ggatgaaaag aatggcccac 2400
agattgggcg agaagatgct caacaagaag ctcaaggacc aaaagactcc cataccagat 2460
accttgtacc aagaactcta tgattacgtc aaccacaggc tctctcatga cctcagcgac 2520
gaggctcgag cccttctccc caacgtgatc accaaggagg tttcacacga gatcatcaag 2580
gacaggagat tcacctccga taagtttttc ttccatgtgc ccatcactct caattaccag 2640
gccgccaact cccccagcaa gttcaaccag agggtgaacg cctacctcaa agagcacccc 2700
gagaccccca tcatcgggat cgatagaggg gaaaggaatc tcatctacat taccgttatc 2760
gacagtaccg gcaagatact cgagcagaga agtcttaaca caatccaaca gttcgattac 2820
cagaagaagc tcgacaacag agagaaggaa agagtcgccg ccagacaggc ctggtccgtg 2880
gtcgggacta tcaaggactt gaagcagggc tacctcagtc aggttattca cgaaatcgtc 2940
gatctcatga tccactacca ggccgtcgtt gtgctcgaga atctcaactt cgggttcaaa 3000
agcaagagga ccgggatcgc cgagaaggcc gtgtaccagc agttcgagaa gatgttgatc 3060
gataaactca attgtttggt gctcaaggac tatcccgctg agaaagtggg tggagttctc 3120
aacccctacc agctcaccga ccagttcacc agtttcgcca aaatggggac tcagagcggg 3180
ttcctcttct acgtccccgc cccctatacc tccaagatcg accccctcac cggcttcgtt 3240
gacccttttg tttggaaaac tatcaaaaac catgagagca ggaagcactt cctcgagggg 3300
tttgatttcc tccactacga tgttaagacc ggcgatttca tcctccactt caagatgaac 3360
aggaatctct ccttccagag gggccttccc ggcttcatgc ctgcctggga catagtcttc 3420
gagaagaacg aaacccaatt cgacgccaag ggcacaccat tcatcgccgg caagaggatc 3480
gtgcctgtga tcgaaaacca caggttcacc ggcagataca gagatctcta tcctgccaac 3540
gagctcatcg cactcctcga ggagaagggc attgtcttca gggacggaag taatatcctt 3600
ccaaagcttc tcgagaacga cgactcccac gccatcgaca ccatggttgc tctcatcagg 3660
tccgtgctcc agatgaggaa ttccaacgcc gccaccggcg aggattacat aaacagcccc 3720
gtcagggacc tcaacggggt ttgcttcgac tccaggttcc aaaaccccga gtggcccatg 3780
gacgctgacg ctaacggcgc ataccacatc gcactcaaag gccagctcct cttgaaccat 3840
ctcaaggagt caaaagacct caagctccag aacgggatca gcaaccagga ttggttggcc 3900
tacatacagg agctcaggaa c 3921
<210> 60
<211> 3921
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 60
atgacacaat tcgagggctt cacaaacctc taccaggtca gcaaaaccct caggttcgag 60
ctcatccctc aggggaagac cctcaaacac atccaggagc aggggttcat agaggaggac 120
aaggctagga atgaccatta caaggagctt aagccaatca tcgacaggat ctataaaacc 180
tatgccgacc aatgcctcca gctcgtgcag cttgactggg aaaacctctc cgccgccatc 240
gacagctata ggaaggagaa aaccgaggaa accagaaacg cccttatcga agaacaagca 300
acctacagga acgccatcca tgactacttc atagggagga cagacaatct caccgacgcc 360
atcaacaaga ggcacgccga gatctacaaa ggccttttca aggcagagtt gtttaacggg 420
aaggtcctca agcagttggg tacagtcacc accaccgaac acgaaaacgc cttgttgagg 480
tccttcgaca agttcaccac ctacttcagc ggcttctatg aaaacaggaa aaacgtcttc 540
tctgccgagg atatctcaac cgccatcccc cataggattg tgcaagacaa cttccccaag 600
tttaaagaga actgtcacat cttcaccaga ctcatcactg ctgtgcccag tctcagggaa 660
cacttcgaga atgtcaaaaa ggccatcggc atcttcgtct ctacaagcat cgaggaggtt 720
ttcagcttcc ctttctacaa ccagttgctc acccaaaccc agattgatct ctacaatcaa 780
ctcctcgggg gcatcagcag ggaggccggg accgagaaga tcaagggcct taacgaggtc 840
ctcaacctcg ccatccagaa gaacgatgag accgcccata tcattgccag cctccctcac 900
aggttcatcc ccctttttaa gcagatcctc agcgatagga ataccctctc attcatcctc 960
gaggaattca agtccgacga ggaagtcatt cagtccttct gtaagtataa gaccctcctc 1020
aggaatgaaa atgttctcga gaccgccgag gctctcttca acgaactcaa ctccatcgat 1080
cttacccaca tattcataag ccacaaaaag ctcgagacca taagcagcgc cctctgcgac 1140
cactgggata ctctcagaaa cgccctctac gagaggagga tctccgagct taccggtaag 1200
atcaccaagt ccgccaagga gaaggttcag aggagcctca agcacgagga tatcaacctc 1260
caggaaataa tctcagccgc cgggaaggag ttgagtgagg ctttcaaaca gaagacaagc 1320
gaaatcctca gccacgccca cgcagccctc gaccaaccac tccccaccac cctcaaaaag 1380
caagaggaaa aggagatact caaaagccag cttgattcat tgctcggcct ctaccacctc 1440
ctcgactggt tcgccgtgga cgaatccaac gaggtcgacc ccgaatttag cgctaggctc 1500
accggaatca aactcgagat ggagcccagc ctcagcttct acaacaaggc caggaactac 1560
gccaccaaga agccttactc cgtggagaag ttcaaactca acttccagat gccaaccctc 1620
gcctccggtt gggacgtgaa caaagagaag aacaatggtg ctatcttgtt cgtgaagaat 1680
gggctttatt accttggaat catgcccaaa cagaaaggaa ggtacaaggc cctctcattc 1740
gaacccaccg aaaagacttc cgaggggttc gacaagatgt attacgacta cttccctgat 1800
gccgcaaaga tgatacccaa gtgctccact caactcaagg ccgtcaccgc ccacttccag 1860
acccacacca cccccatctt gctctccaat aacttcatag aacctctcga aatcaccaag 1920
gagatctacg atcttaataa cccagagaag gagcccaaaa agttccaaac cgcatacgcc 1980
aagaagaccg gggaccagaa ggggtacagg gaagccttgt gcaagtggat agactttacc 2040
agggacttct tgtccaagta taccaagacc accagtatcg atctcagctc tctcaggccc 2100
agcagccagt acaaggattt gggggagtac tacgccgagt tgaaccccct cctctaccac 2160
atctcattcc agaggatcgc cgagaaggaa atcatggatg cagtcgagac tggcaagctc 2220
tacctcttcc agatctataa caaggacttc gccaagggcc accacgggaa gccaaacctc 2280
catacactct attggaccgg gttgttttcc cccgagaacc ttgccaagac ctccatcaag 2340
ttgaacgggc aggccgagct cttctacaga cccaagagca ggatgaagag aatggcacac 2400
aggctcggtg agaagatgct caacaagaag ctcaaggacc agaaaactcc catccccgac 2460
accctctacc aggagttgta tgactacgtt aaccacaggc tctcccatga cctctccgac 2520
gaagctagag ccctccttcc caacgtcatc accaaggagg tctcccacga gatcatcaaa 2580
gacaggaggt ttaccagcga caagttcttc tttcacgtcc ctataaccct taactaccag 2640
gctgctaact ccccctccaa atttaaccag agggtcaacg cttaccttaa ggaacacccc 2700
gagacaccca ttattggaat cgacagaggc gaaaggaacc tcatctatat taccgtcatc 2760
gacagcaccg gcaagatcct cgagcagagg agcctcaaca ccatccagca gttcgactat 2820
cagaagaagt tggacaatag agagaaagag agggtcgccg ccaggcaagc ctggagcgtc 2880
gtgggaacca tcaaggacct caagcaggga tatctctccc aggtcatcca cgagatagtg 2940
gatctcatga tccattacca agccgtggtg gttctcgaga acctcaactt cggcttcaag 3000
agcaagagga ccggcatcgc cgaaaaggcc gtttaccaac agtttgagaa gatgttgatc 3060
gacaaattga actgcctcgt ccttaaagat taccctgccg agaaggtggg tggggtttta 3120
aacccctacc agctcaccga ccagttcacc tcctttgcca agatgggcac ccagagcggc 3180
ttcctcttct acgttccagc cccctacacc tccaagatcg atcccctcac cgggttcgtg 3240
gaccccttcg tctggaagac catcaagaac cacgaaagca gaaagcattt cctcgaaggg 3300
ttcgactttc tccactacga tgtgaagacc ggggatttca tcctccactt caagatgaat 3360
aggaacctta gcttccagag ggggttgcct ggatttatgc ccgcctggga catcgtgttt 3420
gagaagaacg agactcagtt cgacgccaag gggactccct tcatcgccgg caagaggatc 3480
gtgccagtaa tcgagaatca tagattcacc ggcaggtata gggaccttta ccccgccaac 3540
gagctcatcg ccctcctcga ggagaagggc atcgttttta gagacgggtc caacatcctt 3600
cccaaactcc tcgagaacga cgattctcac gcaatcgata ccatggtcgc cctcatcagg 3660
agcgtgctcc agatgagaaa ctccaacgcc gccactggcg aggactatat taactcccct 3720
gttagggacc ttaacggcgt gtgtttcgat agcagattcc agaatccaga gtggcccatg 3780
gacgccgatg ctaatggggc ctatcatatc gctcttaagg gacagctcct cttgaaccac 3840
cttaaggagt ccaaggactt gaaattgcag aatgggatat ccaaccagga ctggctcgcc 3900
tacattcagg agctcaggaa t 3921
<210> 61
<211> 1064
<212> PRT
<213> Unknown
<220>
<223> Smithella sp.
<400> 61
Met Glu Lys Tyr Lys Ile Thr Lys Thr Ile Arg Phe Lys Leu Leu Pro
1 5 10 15
Asp Lys Ile Gln Asp Ile Ser Arg Gln Val Ala Val Leu Gln Asn Ser
20 25 30
Thr Asn Ala Glu Lys Lys Asn Asn Leu Leu Arg Leu Val Gln Arg Gly
35 40 45
Gln Glu Leu Pro Lys Leu Leu Asn Glu Tyr Ile Arg Tyr Ser Asp Asn
50 55 60
His Lys Leu Lys Ser Asn Val Thr Val His Phe Arg Trp Leu Arg Leu
65 70 75 80
Phe Thr Lys Asp Leu Phe Tyr Asn Trp Lys Lys Asp Asn Thr Glu Lys
85 90 95
Lys Ile Lys Ile Ser Asp Val Val Tyr Leu Ser His Val Phe Glu Ala
100 105 110
Phe Leu Lys Glu Trp Glu Ser Thr Ile Glu Arg Val Asn Ala Asp Cys
115 120 125
Asn Lys Pro Glu Glu Ser Lys Thr Arg Asp Ala Glu Ile Ala Leu Ser
130 135 140
Ile Arg Lys Leu Gly Ile Lys His Gln Leu Pro Phe Ile Lys Gly Phe
145 150 155 160
Val Asp Asn Ser Asn Asp Lys Asn Ser Glu Asp Thr Lys Ser Lys Leu
165 170 175
Thr Ala Leu Leu Ser Glu Phe Glu Ala Val Leu Lys Ile Cys Glu Gln
180 185 190
Asn Tyr Leu Pro Ser Gln Ser Ser Gly Ile Ala Ile Ala Lys Ala Ser
195 200 205
Phe Asn Tyr Tyr Thr Ile Asn Lys Lys Gln Lys Asp Phe Glu Ala Glu
210 215 220
Ile Val Ala Leu Lys Lys Gln Leu His Ala Arg Tyr Gly Asn Lys Lys
225 230 235 240
Tyr Asp Gln Leu Leu Arg Glu Leu Asn Leu Ile Pro Leu Lys Glu Leu
245 250 255
Pro Leu Lys Glu Leu Pro Leu Ile Glu Phe Tyr Ser Glu Ile Lys Lys
260 265 270
Arg Lys Ser Thr Lys Lys Ser Glu Phe Leu Glu Ala Val Ser Asn Gly
275 280 285
Leu Val Phe Asp Asp Leu Lys Ser Lys Phe Pro Leu Phe Gln Thr Glu
290 295 300
Ser Asn Lys Tyr Asp Glu Tyr Leu Lys Leu Ser Asn Lys Ile Thr Gln
305 310 315 320
Lys Ser Thr Ala Lys Ser Leu Leu Ser Lys Asp Ser Pro Glu Ala Gln
325 330 335
Lys Leu Gln Thr Glu Ile Thr Lys Leu Lys Lys Asn Arg Gly Glu Tyr
340 345 350
Phe Lys Lys Ala Phe Gly Lys Tyr Val Gln Leu Cys Glu Leu Tyr Lys
355 360 365
Glu Ile Ala Gly Lys Arg Gly Lys Leu Lys Gly Gln Ile Lys Gly Ile
370 375 380
Glu Asn Glu Arg Ile Asp Ser Gln Arg Leu Gln Tyr Trp Ala Leu Val
385 390 395 400
Leu Glu Asp Asn Leu Lys His Ser Leu Ile Leu Ile Pro Lys Glu Lys
405 410 415
Thr Asn Glu Leu Tyr Arg Lys Val Trp Gly Ala Lys Asp Asp Gly Ala
420 425 430
Ser Ser Ser Ser Ser Ser Thr Leu Tyr Tyr Phe Glu Ser Met Thr Tyr
435 440 445
Arg Ala Leu Arg Lys Leu Cys Phe Gly Ile Asn Gly Asn Thr Phe Leu
450 455 460
Pro Glu Ile Gln Lys Glu Leu Pro Gln Tyr Asn Gln Lys Glu Phe Gly
465 470 475 480
Glu Phe Cys Phe His Lys Ser Asn Asp Asp Lys Glu Ile Asp Glu Pro
485 490 495
Lys Leu Ile Ser Phe Tyr Gln Ser Val Leu Lys Thr Asp Phe Val Lys
500 505 510
Asn Thr Leu Ala Leu Pro Gln Ser Val Phe Asn Glu Val Ala Ile Gln
515 520 525
Ser Phe Glu Thr Arg Gln Asp Phe Gln Ile Ala Leu Glu Lys Cys Cys
530 535 540
Tyr Ala Lys Lys Gln Ile Ile Ser Glu Ser Leu Lys Lys Glu Ile Leu
545 550 555 560
Glu Asn Tyr Asn Thr Gln Ile Phe Lys Ile Thr Ser Leu Asp Leu Gln
565 570 575
Arg Ser Glu Gln Lys Asn Leu Lys Gly His Thr Arg Ile Trp Asn Arg
580 585 590
Phe Trp Thr Lys Gln Asn Glu Glu Ile Asn Tyr Asn Leu Arg Leu Asn
595 600 605
Pro Glu Ile Ala Ile Val Trp Arg Lys Ala Lys Lys Thr Arg Ile Glu
610 615 620
Lys Tyr Gly Glu Arg Ser Val Leu Tyr Glu Pro Glu Lys Arg Asn Arg
625 630 635 640
Tyr Leu His Glu Gln Tyr Thr Leu Cys Thr Thr Val Thr Asp Asn Ala
645 650 655
Leu Asn Asn Glu Ile Thr Phe Ala Phe Glu Asp Thr Lys Lys Lys Gly
660 665 670
Thr Glu Ile Val Lys Tyr Asn Glu Lys Ile Asn Gln Thr Leu Lys Lys
675 680 685
Glu Phe Asn Lys Asn Gln Leu Trp Phe Tyr Gly Ile Asp Ala Gly Glu
690 695 700
Ile Glu Leu Ala Thr Leu Ala Leu Met Asn Lys Asp Lys Glu Pro Gln
705 710 715 720
Leu Phe Thr Val Tyr Glu Leu Lys Lys Leu Asp Phe Phe Lys His Gly
725 730 735
Tyr Ile Tyr Asn Lys Glu Arg Glu Leu Val Ile Arg Glu Lys Pro Tyr
740 745 750
Lys Ala Ile Gln Asn Leu Ser Tyr Phe Leu Asn Glu Glu Leu Tyr Glu
755 760 765
Lys Thr Phe Arg Asp Gly Lys Phe Asn Glu Thr Tyr Asn Glu Leu Phe
770 775 780
Lys Glu Lys His Val Ser Ala Ile Asp Leu Thr Thr Ala Lys Val Ile
785 790 795 800
Asn Gly Lys Ile Ile Leu Asn Gly Asp Met Ile Thr Phe Leu Asn Leu
805 810 815
Arg Ile Leu His Ala Gln Arg Lys Ile Tyr Glu Glu Leu Ile Glu Asn
820 825 830
Pro His Ala Glu Leu Lys Glu Lys Asp Tyr Lys Leu Tyr Phe Glu Ile
835 840 845
Glu Gly Lys Asp Lys Asp Ile Tyr Ile Ser Arg Leu Asp Phe Glu Tyr
850 855 860
Ile Lys Pro Tyr Gln Glu Ile Ser Asn Tyr Leu Phe Ala Tyr Phe Ala
865 870 875 880
Ser Gln Gln Ile Asn Glu Ala Arg Glu Glu Glu Gln Ile Asn Gln Thr
885 890 895
Lys Arg Ala Leu Ala Gly Asn Met Ile Gly Val Ile Tyr Tyr Leu Tyr
900 905 910
Gln Lys Tyr Arg Gly Ile Ile Ser Ile Glu Asp Leu Lys Gln Thr Lys
915 920 925
Val Glu Ser Asp Arg Asn Lys Phe Glu Gly Asn Ile Glu Arg Pro Leu
930 935 940
Glu Trp Ala Leu Tyr Arg Lys Phe Gln Gln Glu Gly Tyr Val Pro Pro
945 950 955 960
Ile Ser Glu Leu Ile Lys Leu Arg Glu Leu Glu Lys Phe Pro Leu Lys
965 970 975
Asp Val Lys Gln Pro Lys Tyr Glu Asn Ile Gln Gln Phe Gly Ile Ile
980 985 990
Lys Phe Val Ser Pro Glu Glu Thr Ser Thr Thr Cys Pro Lys Cys Leu
995 1000 1005
Arg Arg Phe Lys Asp Tyr Asp Lys Asn Lys Gln Glu Gly Phe Cys
1010 1015 1020
Lys Cys Gln Cys Gly Phe Asp Thr Arg Asn Asp Leu Lys Gly Phe
1025 1030 1035
Glu Gly Leu Asn Asp Pro Asp Lys Val Ala Ala Phe Asn Ile Ala
1040 1045 1050
Lys Arg Gly Phe Glu Asp Leu Gln Lys Tyr Lys
1055 1060
<210> 62
<211> 3192
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 62
atggaaaagt ataagatcac aaaaacaatc cgattcaagc tacttcctga caaaatacag 60
gatatttctc ggcaggtggc cgtgttgcaa aattcaacga acgcagaaaa aaagaacaat 120
ttgttgaggc tcgttcaaag ggggcaagag ctcccaaagc tcttgaatga atacattagg 180
tattccgata atcacaaact caaaagtaac gttacggtgc atttcaggtg gcttagatta 240
ttcactaagg acttatttta taattggaaa aaagacaaca ccgaaaagaa gataaagatt 300
tccgatgttg tttatctgag ccacgttttt gaagctttcc tgaaggagtg ggagtccaca 360
attgagaggg tgaacgcaga ttgcaacaaa ccggaagaat ccaaaacacg cgatgcagaa 420
atcgcactct caatcagaaa actcggcatc aaacatcaat tgccttttat taagggattc 480
gtggacaact ctaacgataa gaattcagag gatactaaat ctaagctaac cgcacttctg 540
tccgagtttg aagctgtgct taaaatttgt gagcagaact acttaccctc tcagtcatcg 600
ggtattgcaa tcgctaaagc cagtttcaac tactatacaa ttaacaagaa gcaaaaggat 660
tttgaggctg agatagttgc tttaaagaag cagttgcacg ccaggtatgg aaataaaaag 720
tatgaccagc tactgagaga gctaaacctc attccactta aagaattgcc tcttaaagaa 780
cttccgctta ttgaatttta cagcgaaatc aagaagagaa agtccaccaa aaagtccgag 840
ttccttgaag ctgtctcaaa tgggttggtc tttgatgatc ttaagagtaa gttcccattg 900
tttcaaacag agagcaacaa atacgacgag taccttaaac tatcaaacaa aatcacccaa 960
aaatccactg caaagagcct tctctcaaaa gacagcccag aggcacagaa attacaaact 1020
gagattacaa agctgaaaaa aaatagggga gagtatttca agaaggcatt cggcaagtat 1080
gtgcaattgt gtgagctgta taaagagatc gcaggaaagc ggggcaagct taaggggcaa 1140
attaagggaa tagaaaacga gcgcatagat tcacagaggc tccaatactg ggctttggtc 1200
ctggaggata acttgaagca tagtttaatt ttgattccta aggaaaagac taacgagttg 1260
tatcgaaagg tttggggtgc aaaggatgac ggtgccagtt cctccagcag tagtacccta 1320
tattacttcg aatccatgac gtacagggca ctacgtaaac tctgttttgg gattaacgga 1380
aatacatttc tccctgaaat ccagaaagaa ctacctcagt acaaccaaaa ggagttcggc 1440
gaattctgct ttcataagtc caatgacgac aaagagatcg atgagccaaa gttgatctcg 1500
ttctaccagt cagttctcaa gacagatttc gttaagaaca cactagctct ccctcaatca 1560
gttttcaatg aggtagctat ccaatcattt gagacgaggc aggattttca aattgctctt 1620
gagaaatgtt gttatgcaaa gaagcaaatt atttctgaga gcctgaaaaa agagattctt 1680
gaaaattata acacgcagat ttttaaaatt acgtctcttg accttcagag gagcgagcag 1740
aagaatctga aggggcacac aaggatctgg aatagatttt ggacaaaaca gaacgaagag 1800
ataaattaca atcttcgttt aaatcctgag attgctatcg tgtggagaaa ggctaagaag 1860
actaggattg aaaagtacgg agaaagatct gtgctttatg agccagagaa gaggaataga 1920
tatttacacg aacaatatac actgtgtacc accgtgactg ataatgcttt gaacaatgag 1980
ataacttttg ctttcgagga tactaagaaa aagggtaccg agattgttaa gtataatgaa 2040
aagatcaatc agactttgaa gaaagaattt aataagaatc aattgtggtt ttacggaatt 2100
gacgccggtg aaattgagct tgctacactg gcccttatga acaaagacaa ggaacctcaa 2160
ttgtttacgg tgtacgaact gaagaagctt gacttcttca aacacggtta catttataac 2220
aaagagcgtg agttggttat tcgcgaaaag ccttacaagg ctatccaaaa tctttcttat 2280
ttcctgaacg aggaacttta cgagaagact ttcagggacg gaaagtttaa cgaaacctat 2340
aacgagttat ttaaggagaa gcatgtgagt gccatagact tgaccacagc caaggtaatt 2400
aacggtaaaa ttatattgaa cggagatatg attacgttct tgaatttgag gattcttcac 2460
gcgcagagaa agatttatga agagcttatt gaaaatcctc atgccgaact taaggagaag 2520
gactataagc tttattttga gattgaaggt aaggataaag atatatatat ctccaggttg 2580
gatttcgaat acattaagcc gtaccaggag atcagtaatt atttgtttgc ctactttgca 2640
tctcaacaga ttaacgaggc cagagaagaa gagcagatta atcaaactaa gagagctcta 2700
gccggaaaca tgataggagt gatttattat ttataccaga agtacagagg aattattagc 2760
atcgaggacc tcaagcaaac gaaggtagag tcagatagaa ataagtttga gggcaacatt 2820
gagagaccac ttgagtgggc actttacagg aaatttcagc aagaaggata tgttcctcca 2880
attagtgagc ttataaagct aagagagctt gagaagtttc cactgaagga tgtgaaacaa 2940
ccaaaatacg aaaatatcca gcaattcggt attatcaaat ttgtttctcc agaagagaca 3000
tcaaccacct gcccgaaatg tttgcgccga tttaaggact acgataaaaa taaacaggaa 3060
ggattctgca agtgtcaatg cggttttgat acgagaaacg atttgaaggg attcgaggga 3120
ctgaatgacc cagacaaagt ggctgccttt aatattgcta aaaggggatt tgaggacttg 3180
caaaagtaca aa 3192
<210> 63
<211> 3192
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 63
atggagaagt acaagattac caagacaatt aggttcaaac tcctccccga taagattcag 60
gatatctcca gacaagtcgc cgtcctccaa aacagcacaa atgccgagaa gaagaacaac 120
ctccttagac tcgtgcagag ggggcaggag ctccccaaac ttctcaatga gtatatcagg 180
tattctgaca atcataagct caagagcaac gtgaccgtcc acttcaggtg gctcaggctt 240
ttcaccaagg acctcttcta caactggaag aaggataaca ccgagaagaa gattaagatc 300
tccgacgttg tgtacctctc ccacgtcttc gaggcctttc tcaaagagtg ggagagtacc 360
atagagaggg ttaacgccga ctgcaacaag cctgaagagt ccaaaactag ggacgccgaa 420
atcgccctca gcatcaggaa gcttggcata aagcatcagc tccccttcat caaaggcttc 480
gtggacaact ccaacgacaa aaactccgag gacaccaaat ccaagctcac agccttgctt 540
agcgagttcg aggccgtgct caagatctgt gagcagaact acctcccctc ccagagctcc 600
gggatcgcca tcgccaaagc ctccttcaac tactacacca ttaacaagaa gcaaaaggac 660
ttcgaagccg agatcgttgc cctcaagaaa caactccacg ctaggtacgg caataaaaag 720
tacgaccaac ttctcaggga actcaacctc atcccactga aggagctccc ccttaaggag 780
cttccactca tcgagttcta ttccgagatc aagaagagga aatctaccaa gaaatccgag 840
tttctcgagg ctgtctccaa cggcctcgtc ttcgacgacc ttaaatccaa atttcccttg 900
ttccagaccg agagcaacaa atacgatgag tacttgaaat tgtccaacaa gataacccag 960
aagagcaccg ccaagtccct tctctccaag gacagccctg aggcccagaa gctccaaacc 1020
gagatcacaa aactcaaaaa gaataggggg gagtacttca agaaagcctt tggcaagtac 1080
gtgcagctct gcgaactcta caaggagatc gccggcaaga ggggcaaact caaaggccag 1140
ataaagggga tcgagaatga gcggatcgat agccagaggc ttcagtactg ggctctcgtc 1200
ctcgaggaca atctcaagca tagcttgatc ctcattccca aggagaagac caatgaattg 1260
tataggaagg tctggggtgc aaaggatgat ggcgcatcca gctctagtag ctccaccctc 1320
tactatttcg agtcaatgac ctatagggcc cttaggaagc tctgcttcgg gatcaatggc 1380
aatacctttc ttcccgaaat ccagaaggag ctcccacagt acaaccaaaa ggagtttggt 1440
gagttctgct tccacaagag caacgacgat aaggagatcg acgagcctaa gctcatttcc 1500
ttctaccaat cagtcctcaa gactgacttc gttaagaaca cccttgcctt gccccagtcc 1560
gtcttcaacg aagtggccat ccagagcttc gagaccagac aggatttcca gatcgccttg 1620
gagaagtgct gctatgccaa gaaacagatc atctccgagt ccctcaagaa ggagatcctc 1680
gaaaactaca acacccaaat ctttaagatc actagcctcg acctccagag gtccgaacag 1740
aagaacctca aggggcacac caggatctgg aacaggttct ggaccaagca gaacgaggag 1800
atcaactaca acctcagact caatcccgag atcgcaatcg tctggagaaa ggccaagaaa 1860
accagaatcg agaagtacgg ggagagatcc gtcctctacg aacccgagaa aaggaacagg 1920
tatctccacg agcagtacac cctctgcacc accgttactg acaatgcact caacaacgaa 1980
atcacctttg ccttcgagga caccaaaaag aagggcaccg aaatcgtcaa gtataacgag 2040
aagatcaacc agactctcaa gaaggagttc aataagaacc aactctggtt ctatgggatc 2100
gacgccggcg agattgagct cgcaacactc gccctcatga ataaggacaa ggagccccag 2160
ctcttcaccg tctacgagct caagaagctc gacttcttca agcacggcta catctacaac 2220
aaagagagag agctcgtgat tagggagaag ccatataaag ccatccagaa tctttcctat 2280
ttcctcaacg aagaattgta cgagaagaca ttcagggacg ggaagttcaa cgagacctac 2340
aatgagctct ttaaagaaaa gcacgtttcc gccatagact tgactaccgc caaggttatc 2400
aacgggaaga tcatcttgaa cggggacatg atcacctttc tcaacctcag aatcctccac 2460
gcacagagga agatctacga ggagttgatc gaaaaccccc acgccgaact caaggagaag 2520
gactataagc tctatttcga aatcgagggg aaggacaaag acatctacat cagcaggctc 2580
gactttgaat acatcaaacc ataccaggag atcagcaatt acctcttcgc ctacttcgcc 2640
agccaacaga taaatgaggc cagggaggag gaacagatca accagaccaa gagggccctc 2700
gccggaaaca tgatcggcgt gatctattac ctctaccaga agtacagggg gatcatctcc 2760
atcgaggatc ttaaacagac caaggtcgaa tccgacagga acaaattcga agggaacatc 2820
gagaggcctc tcgagtgggc cctttacaga aagttccagc aggagggtta cgttcccccc 2880
atcagcgagt tgatcaagct gagggagctc gaaaaattcc ccctcaaaga cgtgaaacag 2940
ccaaagtacg agaatattca gcagtttggc atcatcaagt tcgtctctcc cgaggaaacc 3000
tccaccacct gccccaagtg cctcaggaga ttcaaggact acgacaagaa caagcaggag 3060
gggttttgca aatgccagtg tggattcgat acaaggaacg atctcaaggg attcgagggc 3120
ctcaacgacc ccgacaaggt cgccgccttc aatatcgcaa agagggggtt cgaggacttg 3180
caaaagtaca ag 3192
<210> 64
<211> 3192
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 64
atggagaagt acaagatcac caagactatc agattcaagc ttctccccga caagatccag 60
gacatttcaa ggcaagttgc cgtcctccag aactccacaa acgccgagaa gaagaacaac 120
ttgctcaggc tcgtgcagag aggacaggag ttgcccaagc tccttaacga gtacatcagg 180
tactcagaca atcacaagtt gaagagcaac gttacagttc acttcaggtg gctcaggttg 240
ttcaccaagg atttgttcta taactggaag aaggacaaca cagaaaaaaa gatcaagatc 300
agcgacgtcg tgtacctctc acacgtcttc gaggccttcc tcaaggagtg ggaatccacc 360
atcgagaggg ttaacgccga ttgcaacaag cccgaggagt ccaaaactag agacgcagag 420
atcgcacttt ctatcagaaa gctcggcatc aaacaccagc tcccattcat caaagggttc 480
gttgataata gtaacgataa gaacagcgag gacaccaagt caaaattgac cgccctcctc 540
agtgagttcg aggccgtgct caagatatgc gagcaaaact acctccccag tcagtcttcc 600
ggtatcgcca tcgccaaggc ctcctttaac tattacacca taaacaagaa gcagaaggac 660
ttcgaggccg aaatcgttgc cctcaagaag caactccacg ccagatacgg taacaaaaag 720
tacgatcaac tcttgaggga gctcaacctt atcccattga aggaactccc actcaaggag 780
cttcccctca tagaattcta ctccgagatc aagaagagga agtccaccaa gaagagtgag 840
tttctcgagg ccgtcagcaa cggcctcgtt ttcgacgacc tcaagagcaa gtttcctctc 900
ttccagacag agtccaataa gtacgatgag taccttaagc tctccaacaa gattacccag 960
aaatcgaccg caaaatccct cctctccaag gactctcccg aggcccagaa gctccagacc 1020
gagataacaa agcttaagaa gaacagggga gagtacttta agaaggcctt cggcaaatat 1080
gttcagcttt gcgagctcta taaggagatc gctggcaaga gaggtaagct caagggccaa 1140
atcaagggga tcgaaaatga gagaatcgac tcccagagac tccaatactg ggcccttgtg 1200
ctcgaggaca acctcaagca tagcttgatc ctcatcccca aggagaagac caacgaactc 1260
tatagaaaag tctggggcgc caaggacgac ggcgccagca gctcttccag ctccactttg 1320
tactatttcg agagcatgac ctacagggcc ctcagaaagc tctgcttcgg catcaacggg 1380
aatacattcc tccccgagat tcagaaggaa ctccctcaat ataaccagaa ggagttcgga 1440
gagttctgct tccataagtc caacgatgac aaagaaatcg acgaacccaa gcttataagt 1500
ttctaccaga gtgtgttgaa gaccgacttc gtgaagaata ccctcgccct ccctcagtct 1560
gtgttcaacg aggtcgccat tcagagcttc gaaactaggc aggacttcca gatcgccctc 1620
gaaaagtgtt gttacgcaaa gaagcaaatc atcagcgaga gcctcaagaa agagatcctc 1680
gagaattaca atacacagat cttcaagatt acctcactcg acctccaaag gtcagagcag 1740
aagaacctta agggccacac caggatctgg aataggttct ggaccaagca gaacgaggag 1800
atcaactaca atctcagact taatcccgag atcgccatcg tgtggaggaa ggcaaagaag 1860
accaggatcg agaagtacgg cgagaggagc gtcctttatg agcccgaaaa gaggaacagg 1920
tacctccacg agcagtacac cctctgcacc accgtgaccg acaacgccct caataatgaa 1980
atcaccttcg cattcgagga taccaagaag aagggaaccg agatcgttaa gtacaacgag 2040
aaaatcaacc agaccctcaa aaaggagttc aacaaaaatc aactttggtt ctacgggatc 2100
gacgccgggg aaatcgagct cgccacactc gccttgatga acaaggacaa ggaaccccag 2160
ctctttaccg tctacgagct caagaagctc gacttcttca agcacggtta catctacaat 2220
aaggagagag aactcgttat cagggagaaa ccctacaagg ccatccagaa cttgtcctac 2280
ttcctcaacg aggaactcta cgagaagacc ttcagggacg ggaagttcaa tgagacctac 2340
aacgagctct ttaaggaaaa acacgtctcc gctatcgatc tcaccaccgc caaggtgatc 2400
aacggcaaga tcattctcaa cggtgatatg atcacattcc tcaatctcag gatccttcac 2460
gcccagcgta agatctacga ggaacttatc gagaaccccc atgccgagct caaagaaaag 2520
gattacaagc tttacttcga gatcgagggg aaggacaaag acatctacat ctccaggctc 2580
gactttgagt acatcaagcc ctaccaggag atcagcaact acttgttcgc ctacttcgcc 2640
tcacagcaga tcaacgaagc aagggaggag gagcaaatca accagaccaa aagggccctc 2700
gcaggtaaca tgataggggt catctactac ttgtaccaga aatacagagg catcatcagt 2760
atcgaggacc tcaagcagac taaggtggag agcgacagga acaagttcga aggcaatatc 2820
gagagacccc tcgagtgggc cctctacaga aaattccagc aggagggcta cgtgcccccc 2880
atcagcgaat tgatcaagct cagggagctc gagaagtttc ccctcaagga tgtgaagcag 2940
cccaagtacg agaacatcca acaattcggc atcatcaaat tcgtgtcacc cgaggagacc 3000
tccaccacct gccccaagtg cctcaggagg ttcaaggact acgacaaaaa taagcaagag 3060
gggttctgca aatgtcaatg cgggttcgac accaggaacg acctaaaggg cttcgagggt 3120
ttgaatgacc ccgacaaggt cgcagcattc aacatcgcca agaggggctt cgaggacctc 3180
cagaagtata ag 3192
<210> 65
<211> 3192
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 65
atggagaagt acaagataac taagaccatc aggttcaagc tcctccccga caagatccag 60
gacatctcaa gacaagtggc cgtgttgcag aactccacta atgccgagaa aaagaacaac 120
ttgctcaggt tggttcaaag gggccaggag ctccccaagc ttctcaacga gtacatcagg 180
tactccgata accacaagct caagagcaac gttaccgtcc acttcaggtg gctcaggctc 240
ttcaccaaag acctctttta taactggaag aaggacaaca ccgagaagaa gataaagatc 300
agcgatgttg tttacttgag tcacgtcttc gaagccttcc ttaaggagtg ggaatctacc 360
atcgagaggg tgaatgccga ttgcaataaa cctgaagagt ccaaaaccag ggacgctgag 420
atcgccctct ctatcaggaa gctcgggatc aagcaccaac tcccctttat caaggggttc 480
gtcgataaca gcaacgataa gaacagcgaa gacactaaat ccaaactcac cgccctcctc 540
tccgagttcg aggccgtgct caagatttgc gagcagaact accttcccag ccaatccagc 600
ggcatcgcca tcgccaaggc ttccttcaac tactacacca tcaataagaa gcagaaggac 660
ttcgaagccg aaatcgtcgc actcaagaag caactccatg caagatacgg gaataagaag 720
tacgaccaac tccttaggga gctcaacctc atccccctca aggagctccc tctcaaggag 780
ttgcctctca tcgaattcta cagcgagatc aagaaaagga agtctactaa gaagagcgag 840
ttcctcgaag ccgtgagcaa cgggctcgtg ttcgacgacc tcaagtctaa gttcccactc 900
ttccagactg agagcaataa gtacgacgag tacctcaagc tcagcaacaa gatcacccaa 960
aagtctaccg ccaagagtct cctttccaag gatagccccg aagctcagaa gctccagacc 1020
gagatcacta agctcaagaa gaacaggggc gagtatttca agaaggcctt tggcaagtac 1080
gtgcagctct gcgagctcta caaagagatc gccggcaaga gggggaaact caaggggcaa 1140
ataaagggca tcgaaaatga gaggatcgat agtcaaaggc tccagtactg ggccctcgtt 1200
ttggaagaca acctcaagca ctcactcatc ctcatcccca aggagaagac taacgaactc 1260
tacaggaagg tttggggcgc caaggacgac ggggccagct cctcaagctc ctccaccctc 1320
tattatttcg agagcatgac ctacagggcc ctcaggaagc tctgcttcgg catcaacggg 1380
aacaccttcc tccccgaaat ccagaaggag ctcccccagt ataaccagaa ggaattcggg 1440
gagttctgtt ttcacaagag caatgacgat aaagagatcg acgaaccaaa actcatcagc 1500
ttttaccaga gcgttttgaa aacagacttc gtgaaaaaca ctcttgccct cccacagtct 1560
gtttttaacg aggtggccat ccaaagtttt gagaccaggc aggacttcca gatcgcactc 1620
gaaaagtgct gctacgctaa gaagcagatc atctctgagt cactcaagaa ggagatcctt 1680
gaaaactaca acacccaaat ctttaagatc accagcctcg acctccagag gtccgagcag 1740
aagaacctca aagggcatac caggatttgg aacagatttt ggacaaagca gaacgaggag 1800
atcaactata atttgaggct caaccccgag attgccattg tctggagaaa ggccaagaag 1860
accaggatcg agaaatacgg cgagaggtcc gttctttacg agcccgagaa gagaaatagg 1920
tacctccacg agcagtacac actctgcacc accgttactg ataacgctct caacaacgag 1980
ataaccttcg ccttcgaaga tacaaagaag aaggggaccg agatcgtgaa gtacaacgaa 2040
aaaatcaatc aaacactcaa gaaggagttc aacaagaacc aactctggtt ctatgggatc 2100
gacgccggag agatcgaact tgccaccctc gccctcatga ataaagataa ggagccacag 2160
ctcttcaccg tttacgagtt gaagaagttg gacttcttca agcatgggta tatctacaac 2220
aaggagaggg agctcgtgat aagagagaag ccctataaag ccatccagaa tctctcctat 2280
ttcctcaacg aggaactcta cgagaagaca tttagggacg ggaagttcaa cgagacttat 2340
aatgagcttt ttaaggagaa gcatgtgagc gccattgatc tcaccaccgc caaagtgatc 2400
aatgggaaga tcatcctcaa cggcgacatg atcaccttcc tcaacctcag aatcctccac 2460
gcacaaagga aaatatacga ggagctcatc gagaacccac acgccgagct taaggagaag 2520
gattacaagc tttattttga aatcgagggt aaagacaaag acatctacat cagcaggctt 2580
gatttcgaat atatcaagcc ctaccaggag attagtaact acctctttgc ctacttcgcc 2640
agccagcaaa taaacgaggc aagagaggag gagcagatca accagaccaa gagagctctc 2700
gccggtaaca tgatcggggt gatttactat ctctaccaga aatacagggg gatcatctcc 2760
atcgaggacc tcaaacagac taaggtggaa tccgacagga ataagttcga agggaacatc 2820
gagaggcccc tcgagtgggc tctctacagg aagtttcaac aggagggtta cgtgcctccc 2880
atatccgaac ttatcaagtt aagggagctc gaaaagttcc cccttaagga cgttaagcag 2940
cccaagtatg agaacatcca gcagttcggg atcatcaaat tcgtcagccc cgaggagact 3000
tccaccacct gccctaagtg tctcagaagg ttcaaggatt acgacaaaaa taagcaggag 3060
ggcttctgca agtgccagtg tggcttcgac accagaaacg acttgaaagg gttcgagggg 3120
cttaatgacc ccgacaaggt cgccgccttc aacatcgcca agaggggctt cgaggacctc 3180
caaaagtaca aa 3192
<210> 66
<211> 3192
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 66
atggagaaat acaaaatcac caagaccatc aggtttaagc ttctcccaga caaaatccaa 60
gacatatcaa gacaggttgc cgtgttgcag aacagcacca acgccgagaa gaagaacaac 120
ctcctcagat tggtgcagag gggccaagaa ctccccaaac tcctcaacga gtacatcagg 180
tactcagaca accacaaact caagagtaat gtcacagtcc acttcaggtg gctcaggctc 240
tttaccaagg acctcttcta taactggaag aaggacaaca ccgagaagaa aatcaaaatc 300
agcgacgtgg tctacctcag ccatgtcttt gaggccttcc tcaaggagtg ggaaagcacc 360
atagagagag ttaacgccga ctgcaacaag cccgaggagt ccaagactag ggacgccgag 420
atcgctctta gcatcagaaa actcggcatt aaacatcagc tccccttcat aaaaggattc 480
gtggataact ccaacgacaa gaacagcgag gacaccaagt ccaagctcac cgccttgctc 540
agcgaattcg aagccgttct caagatctgc gagcagaact atctccccag tcagagtagt 600
ggaatcgcca tcgccaaggc ctccttcaac tactatacca tcaacaaaaa gcagaaggac 660
ttcgaagcag agatcgtcgc cctcaagaaa cagctccacg ccagatacgg gaacaagaaa 720
tacgaccagc tcctcaggga gttgaacttg attcccctta aagaactccc cctcaaggag 780
ttgcctctca tcgagttcta ctccgagatc aagaagagga agagtactaa gaaatccgag 840
tttcttgagg ccgttagtaa cggactcgtc ttcgacgact tgaagtccaa attccccctc 900
ttccagaccg agtccaacaa gtacgacgag tacttgaagc tctccaataa gatcacccag 960
aagtctaccg ccaaaagcct cctttcaaag gattcccccg aggcccagaa actccagact 1020
gagatcacta agctcaagaa gaacaggggc gagtacttca agaaggcctt cggcaagtat 1080
gtgcagctct gcgagctcta caaggagatc gccggcaaga gggggaaact caaggggcag 1140
atcaagggca tcgagaatga gagaatcgac tcccaaaggc tccagtactg ggccttggtt 1200
ttggaggaca acttgaagca ctccttgatc ctcatcccta aagaaaagac caacgagctc 1260
tacaggaaag tttggggggc caaagacgac ggcgcctcct catccagtag cagcaccctc 1320
tactacttcg agagcatgac ctatagggca ctcaggaagt tgtgtttcgg gatcaacggt 1380
aacacattcc ttcccgagat ccagaaggag cttccccagt acaatcagaa ggagttcggg 1440
gagttctgct tccacaagag caacgacgac aaggagatcg acgagcctaa gcttatctca 1500
ttctaccaga gtgttctcaa aacagatttc gtgaagaaca ccctcgctct cccccagagc 1560
gtcttcaacg aagtggccat tcagtccttc gagaccagac aggacttcca gatcgccttg 1620
gagaaatgct gctacgccaa gaaacaaatc atatcagagt ccctcaagaa agagatcctt 1680
gaaaattaca acacacaaat cttcaagata accagcttgg acctccagag gagtgagcag 1740
aagaacctca aggggcacac cagaatctgg aacaggttct ggacaaagca gaacgaggaa 1800
atcaattaca acctcaggct taaccccgag atcgctatcg tgtggaggaa ggccaagaaa 1860
acaaggatcg agaaatacgg cgagaggagc gtgctctatg agcccgagaa gaggaatagg 1920
tacctccacg aacaatacac cctctgcacc accgtgaccg acaacgccct taataatgag 1980
attaccttcg ccttcgagga taccaagaaa aaagggaccg agatcgtgaa gtacaacgag 2040
aagatcaacc agacccttaa gaaggagttc aacaagaacc agctctggtt ctacgggata 2100
gatgccgggg agattgagct cgccaccctc gccctcatga acaaggacaa ggagccccag 2160
ctcttcaccg tgtatgagct taagaagctt gacttcttca agcatggcta tatctacaac 2220
aaggaaaggg agctcgtgat cagggagaaa ccatataagg ccatccagaa tctcagttac 2280
ttcctcaacg aggagctcta tgagaagacc ttcagggatg gcaagttcaa cgagacctat 2340
aacgagctct ttaaagagaa gcacgtgtcc gccatcgacc tcaccaccgc caaggtgatc 2400
aacgggaaga taatcctcaa cggagacatg atcacctttc tcaaccttag gatcctccac 2460
gcccagagga agatctacga ggaactcatc gagaaccctc acgccgagtt gaaggagaag 2520
gattataagc tttattttga gatcgagggg aaagacaaag atatctacat cagcagactt 2580
gatttcgagt atattaagcc ctatcaggag atcagcaact acttgtttgc ctacttcgct 2640
tcccaacaga tcaatgaggc cagggaggag gagcagatca accagaccaa gagggccctc 2700
gccggtaaca tgatcggggt gatctattac ctctaccaga agtacagggg gattatctcc 2760
atcgaggacc tcaaacagac caaggttgag tccgacagga acaagttcga gggcaacatc 2820
gagagaccac tcgagtgggc cctctacaga aagtttcagc aagaggggta cgtccccccc 2880
ataagcgagc ttatcaagct cagggagctc gagaagtttc cactcaagga cgtcaagcag 2940
cccaaatacg agaacatcca gcagtttggg atcattaagt tcgttagccc cgaggaaacc 3000
agcacaacct gccccaaatg cctcaggaga tttaaagact acgacaagaa caagcaggag 3060
ggattctgca agtgccagtg tggtttcgac accaggaatg acctcaaagg gttcgagggc 3120
ctcaacgacc ccgacaaggt tgccgccttc aatatagcca agaggggctt cgaggacctc 3180
cagaagtata aa 3192
<210> 67
<211> 3192
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 67
atggagaagt acaagatcac aaaaaccatt aggtttaagc tccttcccga caaaattcag 60
gacatctcca gacaggttgc agtcctccag aactctacca acgcagagaa gaagaacaat 120
ctcctcagac tcgtccagag gggacaggag ctccccaagc tcctcaatga gtacatcagg 180
tactccgaca accacaagct caagtccaac gtgactgttc atttcaggtg gttgaggctc 240
ttcaccaagg acctcttcta taactggaag aaggacaaca ccgagaagaa gatcaagatc 300
agcgacgtcg tgtacctctc ccatgtcttt gaggccttcc ttaaggagtg ggagagcaca 360
atcgagagag ttaacgctga ctgtaacaag cccgaggaat ccaagaccag ggatgccgaa 420
atcgccttga gcataaggaa actcggcatc aagcaccaac tccccttcat aaaggggttt 480
gttgacaatt ccaatgacaa gaactctgag gacaccaagt ctaaactcac cgcccttctc 540
agcgagttcg aagccgtgct caagatctgc gagcagaact atctcccctc tcagtcctct 600
ggcatcgcca tagccaaggc ctccttcaac tactacacca tcaacaagaa acagaaggac 660
ttcgaggcag aaatcgtggc cctcaagaaa cagctccacg ccaggtacgg taacaaaaag 720
tacgaccaac tcctcaggga gctcaacctc atcccactca aggaactccc tctcaaagag 780
ttgcccctta tcgagttcta ctccgagatc aaaaaaagaa aaagtactaa gaaatctgag 840
tttctcgaag ccgtgtccaa tgggcttgtt ttcgacgacc tcaagtccaa gttccccttg 900
tttcagactg aaagcaacaa atacgacgaa tacctcaagt tgtctaacaa gatcacccag 960
aagagcaccg ctaaaagcct tctctccaag gactcccccg aggcccaaaa actccagacc 1020
gagatcacca agctcaagaa gaacaggggc gagtatttca aaaaagcatt cggcaagtac 1080
gttcagttgt gcgaactcta caaagagatt gcaggcaaga gggggaagct caaggggcag 1140
ataaaaggaa tcgagaacga gagaattgat tcccaaaggc tccagtactg ggcacttgtc 1200
ttggaagaca acctcaagca ctccttgatc ctcataccca aggagaagac caacgaactc 1260
tacaggaagg tgtggggtgc caaagatgac ggcgctagca gcagttcctc ctccaccctc 1320
tactacttcg agagcatgac ctacagggcc ctcaggaagt tgtgcttcgg cattaacggc 1380
aacacattcc tccccgagat acaaaaagag ctcccccagt acaaccagaa agagtttggg 1440
gagttctgct tccacaagtc caatgacgac aaggagatcg acgagcccaa gctcatcagc 1500
ttctaccaaa gcgtcctcaa gaccgacttt gtcaaaaaca ccctcgcctt gccacagagc 1560
gtgttcaacg aggttgccat acagagtttc gagaccagac aggacttcca gatcgcactc 1620
gagaagtgct gttacgccaa gaagcaaata atcagcgaat cccttaagaa ggaaatactt 1680
gagaactaca acacccagat cttcaaaatc acctccctcg acctccagag gtcagaacag 1740
aagaacctca agggccatac aaggatctgg aataggtttt ggaccaaaca gaatgaggag 1800
atcaattata atctcaggct caaccctgag atcgccatcg tctggagaaa ggccaagaag 1860
actagaatcg agaagtacgg cgagaggtcc gttctctacg aacccgaaaa gaggaacagg 1920
taccttcacg agcagtacac cttgtgcacc accgtgaccg ataatgccct caacaacgaa 1980
atcacctttg cattcgagga taccaaaaag aaggggaccg agattgtcaa gtacaacgaa 2040
aagattaatc agacccttaa gaaggagttt aataagaacc aactctggtt ctacggcatc 2100
gatgccggcg agatcgagct tgccacactc gccctcatga acaaggacaa ggagccccag 2160
cttttcactg tttatgagct caagaagctc gatttcttca agcacgggta catctacaac 2220
aaggaaaggg agctcgtgat cagggagaag ccctacaagg ccatacagaa cctctcctac 2280
ttcctcaacg aagagttgta cgagaagacc ttcagagatg ggaaattcaa cgaaacctac 2340
aacgagctct tcaaagagaa gcacgtgtcc gccatcgacc tcaccaccgc caaggttatc 2400
aacggaaaga tcatccttaa tggtgatatg atcacattcc tcaatctcag aatcctccat 2460
gcccagagga agatctacga ggaacttatc gagaatcccc acgccgagct caaggagaaa 2520
gactacaagc tctattttga gatcgagggg aaagacaaag acatctatat ctccagactc 2580
gacttcgagt atatcaaacc ataccaggag atctctaact acttgtttgc ctactttgcc 2640
agccagcaga tcaatgaagc cagggaggaa gaacaaatta accagaccaa aagggcactc 2700
gcaggcaaca tgatcggggt gatctactat ttgtaccaga agtatagggg aataatcagc 2760
attgaggatt tgaagcagac caaggtggag tccgacagaa ataagttcga gggcaatatc 2820
gagagaccct tggagtgggc cctttatagg aaattccagc aagaggggta cgttcccccc 2880
atctccgagc tcatcaagct cagggagctc gaaaagttcc ccctcaagga cgttaagcag 2940
cctaagtatg agaacatcca acaattcggc atcatcaagt ttgtctcccc cgaagagact 3000
tctaccacct gccccaagtg tcttaggagg tttaaggact atgacaagaa caagcaagag 3060
ggtttctgca agtgccagtg cgggttcgat accaggaacg atctcaaggg gttcgaaggc 3120
ctcaatgacc ccgataaggt ggcagccttc aacatcgcca agagaggatt cgaggatctc 3180
cagaagtaca aa 3192
<210> 68
<211> 3192
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 68
atggaaaaat acaagatcac caagaccatc aggttcaaac ttctccccga taagatacaa 60
gacatttcaa ggcaagtcgc cgttctccag aacagtacca acgccgaaaa gaagaacaac 120
ctcttgaggc tcgtccaaag gggccaggag ctcccaaagc tcctcaacga gtacattagg 180
tatagcgata accacaagct caagtcaaac gtgaccgtcc acttcaggtg gttgagactc 240
ttcaccaagg acctcttcta caactggaag aaggacaaca ccgaaaagaa gattaagata 300
tcagacgtgg tgtatttgag ccatgttttc gaggcattcc tcaaggagtg ggagagcaca 360
atcgagagag tgaacgccga ctgcaacaaa cccgaggaga gcaagaccag ggacgccgaa 420
atcgccctca gcatcaggaa gctcggcatc aagcaccagc tccccttcat caagggtttc 480
gtcgacaaca gcaatgacaa gaatagcgag gataccaagt ccaaattgac cgccctcctc 540
tccgaattcg aggctgtctt gaaaatatgt gagcagaact atctcccctc ccagagctcc 600
ggcattgcca tcgccaaggc aagcttcaac tactacacca tcaacaagaa gcaaaaagac 660
tttgaggccg agatagtggc cctcaaaaaa cagctccatg caaggtacgg caacaagaag 720
tatgaccagc tcctcagaga gctcaacctc atccccctca aggaactccc cctcaaggag 780
ctccccctca tcgaattcta ctcagagatc aaaaagagga agtccaccaa gaaaagcgag 840
ttcctcgagg ccgtctccaa cgggctcgtg ttcgatgacc tcaagtccaa gttccccctc 900
ttccagaccg agagcaataa gtacgatgag tacctcaaac tctccaacaa aatcactcag 960
aaaagtacag ctaagagtct tctctccaag gactcccccg aggcacagaa gctccagacc 1020
gagatcacca agctcaagaa gaacagaggg gagtatttca agaaagcctt cggcaaatac 1080
gttcagctct gcgaactcta caaggagata gctgggaaaa ggggcaaact caagggccag 1140
atcaagggga ttgagaatga gaggatcgat tcccagaggc tccaatattg ggctctcgtg 1200
ctcgaggaca atcttaagca ctcactcatc ttgatcccca aggagaagac aaacgagctc 1260
tacaggaagg tctggggggc caaagatgat ggtgcaagca gctccagcag ttccaccctc 1320
tactacttcg aatccatgac atatagggcc ctcagaaaac tctgcttcgg cattaacggc 1380
aatacattcc tccccgagat ccagaaggag ctcccccagt acaaccagaa ggagttcggc 1440
gagttttgct tccacaaatc caacgacgat aaagagatag acgagcccaa gctcataagc 1500
ttctaccagt ccgtgctcaa gaccgatttc gtgaagaaca ccctcgccct cccccaatct 1560
gtctttaacg aggtggccat acagagcttc gagaccaggc aggacttcca gatcgccctc 1620
gagaaatgtt gctacgctaa gaagcagatc atcagtgaaa gcctcaagaa ggagatcctc 1680
gagaactata atacccagat ctttaaaatc acctccctcg acctccagag gtcagagcaa 1740
aagaatctca agggtcacac caggatctgg aacaggttct ggactaaaca aaacgaggag 1800
atcaattaca acctcaggtt gaaccccgag atcgccattg tctggagaaa agccaagaaa 1860
accaggattg agaagtatgg ggagaggtct gtcctctatg agcctgaaaa aagaaacagg 1920
tatctccatg aacagtacac cctctgtacc accgtcaccg acaacgccct caacaacgag 1980
atcaccttcg cattcgagga caccaagaag aagggaaccg agatcgtgaa atacaatgag 2040
aaaatcaacc agaccctcaa gaaagagttt aacaaaaacc aactctggtt ctacgggata 2100
gacgccggcg agatcgaact tgccaccctc gccctcatga acaaggataa ggagccccag 2160
ctctttactg tttatgagct caagaagctc gacttcttca agcatgggta catctacaac 2220
aaggaaaggg agctcgtgat cagggaaaag ccctacaaag ccatccaaaa tcttagctat 2280
ttccttaacg aggaattgta cgagaagacc ttcagggacg gcaagttcaa tgagacctac 2340
aacgagctct tcaaagagaa gcacgtttcc gccatcgatc tcacaaccgc caaggttatc 2400
aacgggaaga tcattctcaa tggcgacatg atcacattcc tcaatctcag gatcctccac 2460
gcccagagga agatctacga ggagttgatc gagaaccccc acgccgagct caaggagaag 2520
gactataagc tttacttcga aatagagggc aaagataaag atatctacat ctccagactc 2580
gacttcgagt atatcaagcc ctaccaggag atctccaact acctctttgc ctacttcgcg 2640
agccaacaaa ttaatgaggc cagagaggag gagcagatca accagaccaa aagggccttg 2700
gccggcaata tgatcggagt tatctactat ctctaccaga agtacagggg catcatcagc 2760
atcgaggacc tcaagcaaac caaggtcgag agtgatagga acaaattcga gggcaacatc 2820
gagaggcctc tcgagtgggc cctctacagg aagttccagc aggaggggta cgtcccccct 2880
ataagcgaac tcatcaagct cagggagctc gaaaaatttc ccctcaagga cgttaagcag 2940
cccaaatacg aaaacatcca gcagttcggg atcatcaagt tcgtgtcccc tgaagagacc 3000
agcacaacat gtcccaagtg cctcaggagg ttcaaggact acgacaagaa taagcaggag 3060
ggcttttgca agtgccagtg tggtttcgat accaggaacg acctcaaagg atttgagggc 3120
ctcaacgatc ccgataaggt cgccgccttc aacatcgcca agaggggctt tgaggacctc 3180
cagaagtata ag 3192
<210> 69
<211> 3192
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 69
atggagaagt acaaaatcac caagactata aggtttaagc tcctccccga caagatccag 60
gacatcagca gacaagtggc cgtgctccag aatagcacca acgccgagaa gaagaataac 120
ctcctcaggc tcgttcaaag ggggcaggag ctcccaaagc tcctcaacga gtacatcagg 180
tacagcgata accacaagtt gaagtccaac gttaccgttc actttagatg gctcaggctc 240
ttcaccaagg acctcttcta caattggaaa aaggacaaca ccgagaaaaa gatcaagatc 300
agcgacgtgg tgtacctcag ccatgtcttc gaggccttcc tcaaggagtg ggagagcact 360
atcgaaaggg tcaacgccga ctgtaataaa ccagaggaga gcaagactag ggacgccgaa 420
attgctctct ccatcaggaa gctcggtatc aagcaccagc tccctttcat caagggattc 480
gtcgacaact ccaacgataa gaacagtgag gacaccaaga gtaagcttac cgcactcctc 540
agtgagttcg aggccgttct caagatctgc gagcaaaatt acctcccctc acagtcttct 600
ggcatcgcca tcgccaaggc atcattcaac tactacacca tcaacaagaa gcaaaaagac 660
ttcgaagccg agatcgtggc cttgaagaaa caactccacg ccaggtacgg caacaagaag 720
tacgaccagt tgctcaggga gcttaatctc atccccttga aggagctccc cctcaaggag 780
ttgcccctca ttgagttcta ctccgagatt aagaagagga agtccacaaa gaaatccgag 840
tttctcgaag ccgtcagcaa cgggctcgtc ttcgacgatc tcaagtcaaa gttcccactc 900
ttccagactg agtccaacaa gtacgacgag tacctcaagt tgtctaacaa gataacccag 960
aagtccacag ccaagtcatt gctctccaaa gattccccag aagcccagaa gctccagacc 1020
gaaataacaa agctcaaaaa gaacaggggc gaatacttca aaaaagcctt cggcaagtac 1080
gttcaattgt gtgagctcta caaggaaatc gctggcaaaa gaggaaaatt gaagggtcaa 1140
ataaaaggaa tcgaaaacga aaggatcgat agtcagagac tccaatactg ggcccttgtg 1200
ctcgaggaca atctcaagca ctccctcatc ctcataccca aggagaagac caacgagctc 1260
tacagaaagg tgtggggcgc taaagatgac ggggctagta gctccagctc cagtaccctc 1320
tattactttg agagcatgac ctacagagct ctcaggaaac tctgcttcgg catcaatgga 1380
aacactttcc tcccagagat ccagaaggag ctcccccagt acaatcaaaa ggagttcgga 1440
gagttctgct tccacaagag caacgacgat aaggaaatcg acgagcccaa gctcatcagt 1500
ttctatcaga gcgtgttgaa gacagacttt gtcaagaaca ccctcgcact cccccagagc 1560
gtcttcaacg aggtggccat acaatccttc gagaccaggc aggactttca gattgctctc 1620
gaaaagtgct gttatgctaa gaagcagatc atcagcgagt ccctcaagaa ggagatcctc 1680
gagaactaca atacacagat cttcaagatc accagcctcg acttgcagag atctgagcag 1740
aagaacctta aggggcacac caggatatgg aacaggtttt ggacaaaaca gaatgaggag 1800
atcaattaca atttgaggct taatcccgag atcgctatcg tgtggaggaa ggccaagaag 1860
accaggatcg agaagtatgg agagagatca gttctctacg agcccgagaa gaggaacaga 1920
tacctccacg agcagtatac cctctgcact accgtgaccg acaatgccct caacaatgag 1980
atcacctttg catttgaaga caccaaaaag aaggggaccg aaatcgtgaa gtacaacgag 2040
aagatcaacc agacactcaa gaaggagttt aacaaaaacc aattgtggtt ttacggcatc 2100
gacgccgggg agatcgagct cgccaccttg gccctcatga acaaggacaa ggagccacag 2160
ctcttcaccg tgtacgagct taagaaactc gatttcttca aacacggtta catctacaat 2220
aaggagagag aactcgtcat cagggagaag ccctacaagg ccattcagaa cctctcctat 2280
ttcctcaacg aggagctcta cgagaagacc ttcagggacg ggaagttcaa cgagacctac 2340
aatgagctct tcaaggagaa gcacgtttcc gctatcgacc tcaccaccgc caaggtgatc 2400
aacgggaaga tcatcttgaa tggtgacatg atcaccttcc ttaatctcag aatcctccat 2460
gcccagagaa agatctacga ggagctcatc gaaaaccccc acgccgaact caaggaaaag 2520
gactacaagc tctactttga aatagaggga aaagacaaag atatctacat ctccaggctc 2580
gacttcgaat acatcaagcc ctaccaagag atcagcaatt acctctttgc ctacttcgca 2640
agccaacaga taaatgaggc cagggaagag gaacagatca accagacaaa gagggccctc 2700
gccgggaaca tgatcggtgt gatatactac ctctatcaga agtatagggg catcatctcc 2760
atcgaggact tgaaacagac aaaagtggaa tctgatagga acaaatttga ggggaatatc 2820
gagaggcccc ttgagtgggc cctctacagg aaatttcagc aggaagggta cgttccacca 2880
atcagcgaac ttatcaaact cagggagctc gaaaagtttc ccctcaagga cgtgaaacag 2940
cccaagtacg agaacattca acaatttggc ataatcaaat ttgtcagccc cgaggagacc 3000
tccaccacct gccccaagtg cctcagaagg ttcaaggatt acgataagaa caagcaggag 3060
ggcttctgta agtgccagtg cggcttcgat accaggaacg acctcaaggg gttcgagggg 3120
ctcaatgacc ccgacaaggt cgccgccttc aatatcgcca aaagaggctt cgaggacctc 3180
cagaagtaca ag 3192
<210> 70
<211> 3192
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 70
atggaaaagt acaagatcac caagaccatc aggttcaagt tgctccccga caaaatccaa 60
gacatctcca ggcaagttgc cgtgctccaa aactcaacca acgccgaaaa gaagaacaac 120
ctccttaggc tcgttcagag gggtcaggaa ctccccaagc tcctcaacga gtatatcagg 180
tacagcgata accacaagct caagtccaac gtcacagtgc acttcagatg gctcaggttg 240
ttcaccaaag atctttttta taactggaag aaggacaaca ccgagaagaa aattaagatc 300
agcgacgtcg tgtatctcag ccacgttttt gaggccttcc tcaaggagtg ggagagcacc 360
atcgagagag tgaatgccga ctgcaacaaa cccgaggagt ccaaaaccag agacgccgag 420
atcgccctca gcatcaggaa gctcggcatc aagcatcaac ttccctttat caagggcttt 480
gttgacaact ctaacgacaa gaacagcgag gacaccaaga gcaagctcac tgccctcctc 540
tccgagttcg aagccgttct caaaatttgc gagcagaatt acctccccag tcagtcctcc 600
gggatcgcta tcgcaaaggc cagctttaac tactacacaa tcaacaagaa gcagaaggac 660
ttcgaggctg agatcgtcgc cctcaagaag cagctccacg ccaggtacgg caataagaag 720
tacgaccagc tcctcaggga acttaacctc atacctctca aagagctccc cctcaaggag 780
ctccccctca tagagttcta ctccgagatc aagaagagga agagcaccaa aaaaagtgag 840
ttcctcgagg ctgtctccaa cggtttggtg ttcgacgact tgaagtccaa gttccccctt 900
tttcagaccg aatctaacaa gtacgacgaa tacctcaagc tctccaataa gatcacccaa 960
aagagtaccg caaagagcct tctcagcaag gacagccctg aagctcaaaa gctccagacc 1020
gagatcacca agctcaagaa aaacaggggc gagtacttca agaaagcctt tggcaagtac 1080
gttcagctct gtgagctcta caaggagatc gccgggaaaa gggggaaact caagggtcag 1140
atcaagggca tagaaaacga gaggatagac tcccaaaggc ttcaatactg ggccctcgtc 1200
ctcgaggaca acctcaaaca ctccttgatc ttgatcccca aggagaagac caacgagctc 1260
tatagaaagg tttggggggc caaggacgac ggggccagct cttctagctc cagtacactc 1320
tattatttcg agtccatgac ttatagggca ctcaggaaac tctgcttcgg catcaacggg 1380
aacaccttcc tccccgagat tcagaaggaa ttgcctcaat acaaccagaa ggagttcggc 1440
gagttttgct tccacaagtc caacgatgac aaggagatcg acgaacccaa gcttatctcc 1500
ttttaccagt ccgtcctcaa gacagacttt gttaaaaata ctctcgccct cccacagagc 1560
gtgttcaacg aggttgccat ccagagcttc gagacaaggc aggatttcca gatcgccctt 1620
gagaagtgtt gttacgctaa gaaacagatc attagcgaga gtctcaagaa agagatcctc 1680
gagaactaca acacacagat tttcaagatc acctccctcg atctccagag atccgagcaa 1740
aagaatctca aggggcacac caggatctgg aacaggttct ggaccaagca aaacgaagag 1800
atcaactaca atctcaggct caatcccgag attgctatag tgtggaggaa agcaaagaaa 1860
accaggatcg agaaatacgg ggagagatct gttctctacg aacccgagaa gaggaatagg 1920
tacttgcacg agcagtacac tttgtgtacc actgtgaccg acaatgccct caacaacgag 1980
ataacattcg catttgagga caccaaaaag aaagggactg aaatcgtgaa gtacaacgag 2040
aagattaacc agaccctcaa gaaggagttc aataagaacc agctctggtt ctacgggatc 2100
gacgcaggcg agatagagct tgccaccctt gcactcatga acaaggataa ggagccccag 2160
ctcttcacag tgtacgagtt gaagaagctc gattttttca agcacgggta catctacaac 2220
aaggaaaggg aactcgtcat cagggagaag ccatacaaag ccatccagaa tctctcctat 2280
tttctcaatg aagagctcta tgagaagacc ttcagagacg gcaagtttaa cgagacctac 2340
aatgagctct tcaaggagaa gcacgtttcc gccatcgacc tcaccacagc caaggtgatc 2400
aacggtaaaa tcatcctcaa cggcgatatg atcaccttcc tcaacctcag gatcttgcac 2460
gcacagagga agatctacga ggaactcatc gaaaaccccc atgcagaact caaggagaag 2520
gactacaagc tctatttcga gatcgagggg aaagacaagg acatctatat cagtaggctc 2580
gatttcgagt acatcaagcc ttatcaagag atcagtaact acctctttgc ctacttcgcc 2640
tcccagcaaa tcaacgaggc cagggaggaa gagcagatca accagactaa aagggccctt 2700
gctggtaaca tgatcggggt tatatactac ctttaccaga aatacagggg tattatctcc 2760
atcgaggatc tcaagcagac caaggtcgag tctgacagaa acaaattcga gggcaacata 2820
gaaaggcccc tcgaatgggc tctttacagg aagttccagc aggagggata cgtccccccc 2880
atcagtgagc tcatcaaact cagagagttg gagaagtttc cccttaaaga tgtcaagcag 2940
cccaaatacg agaacatcca acaattcggg atcatcaagt tcgtgagccc cgaggagacc 3000
tctaccacct gtcctaagtg ccttaggagg tttaaggact atgacaagaa caagcaggag 3060
ggtttctgca agtgtcagtg cgggttcgac acaaggaacg acctcaaggg ctttgagggc 3120
ctcaacgacc ccgacaaagt tgccgccttc aacatcgcca agaggggctt cgaggatttg 3180
cagaaataca ag 3192
<210> 71
<211> 3192
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 71
atggagaagt acaagatcac taaaacaatt aggttcaaac ttctccccga taagatccag 60
gacatctcca ggcaggtcgc cgtgctccaa aatagcacca acgctgaaaa gaaaaataac 120
cttctcaggc tcgttcaaag aggccaggag ctccccaagc ttctcaacga gtacatcagg 180
tacagtgaca atcataagct caagagcaac gtgactgttc atttcagatg gctcaggttg 240
tttaccaagg acctctttta caattggaaa aaggataaca ccgagaagaa aatcaagata 300
tctgacgtgg tctacctcag tcatgttttt gaagccttcc tcaaagaatg ggagtccact 360
attgagagag tgaatgccga ctgcaataag cccgaagaat ccaagaccag ggacgccgag 420
attgccctct caatcaggaa gctcgggatc aagcaccagt tgcccttcat caaaggcttc 480
gtcgacaaca gtaacgataa gaattctgaa gacactaaga gcaagttgac cgccctcctc 540
agcgagtttg aggccgtgct caaaatatgc gagcagaatt acttgccctc ccagagctcc 600
ggcatcgcca tcgccaaagc cagctttaat tattatacca tcaacaaaaa acaaaaggat 660
ttcgaggccg agatcgtcgc cctcaagaag caactccacg ccaggtatgg aaacaaaaag 720
tacgatcagc tcctcaggga gctcaacttg atccccctca aggagcttcc attgaaggag 780
ctccccctta tcgagttcta cagcgagatc aagaagagga agagcaccaa aaagagcgag 840
ttcctcgagg ccgtttccaa cgggcttgtc ttcgacgatt tgaagagtaa gtttcccctt 900
ttccagaccg aatccaacaa gtacgacgaa tatctcaagc tcagcaacaa gatcacacaa 960
aagagcaccg ctaagtccct cctctccaag gacagccccg aggctcaaaa gctccagacc 1020
gagatcacca agctcaagaa gaataggggc gaatacttca agaaggcctt tggcaagtac 1080
gttcagctct gcgagctcta caaggagatc gcaggcaaga ggggaaaact caagggccaa 1140
attaagggga tagagaacga gaggatcgat agccagaggc tccagtattg ggccctcgtg 1200
ctcgaggata acctcaagca cagcctcatc ctcatcccta aggagaagac caacgagctc 1260
tacaggaagg tgtggggtgc taaggacgac ggcgcttcct catccagttc cagcaccctc 1320
tactactttg agtccatgac atacagggcc cttaggaagc tctgctttgg gatcaacggc 1380
aatactttcc tccccgagat tcagaaggag ctcccacagt acaaccagaa agagtttggc 1440
gagttctgct tccataaaag caatgacgac aaggagatcg acgagcctaa gctcatctct 1500
ttctatcaga gtgtcctcaa gaccgacttc gtgaagaaca ccctcgctct ccctcaatcc 1560
gtcttcaacg aagtcgccat tcagtccttc gagactaggc aggacttcca gattgccttg 1620
gagaaatgct gctacgcaaa gaagcaaatc atcagtgagt ctctcaagaa ggagatcctc 1680
gagaactaca acacccagat tttcaagatc acatccctcg acctccaaag gtcagagcag 1740
aagaacttga agggacacac caggatctgg aatagattct ggaccaagca gaacgaagaa 1800
ataaattaca acctcagact caatcccgag atcgcaatcg tctggaggaa ggccaagaag 1860
accaggatcg aaaagtacgg agagcgaagc gtcctctacg agcccgagaa gaggaacagg 1920
tatttgcacg aacagtacac cctctgtact accgtcacag acaacgccct caacaatgag 1980
atcaccttcg ctttcgaaga caccaagaaa aagggcaccg aaatcgtgaa atacaacgaa 2040
aagatcaacc agaccctcaa gaaggagttc aacaagaacc agttgtggtt ctacggcatc 2100
gacgcaggcg agattgagct cgctaccctc gccctcatga acaaggacaa agagccccag 2160
ctcttcaccg tgtatgagct caagaagctc gactttttca aacatggata catatacaac 2220
aaggagaggg aactcgttat cagggagaaa ccctataagg ccatccagaa tctttcctac 2280
ttccttaacg aagagctcta cgaaaagacc ttcagagacg gaaagttcaa cgaaacctac 2340
aacgagctct ttaaggaaaa acacgtcagt gccattgact tgaccactgc aaaagttatc 2400
aacggtaaga tcatcctcaa cggcgacatg atcaccttcc tcaacctcag aattctccac 2460
gcccagagga agatatacga ggagctcatc gagaatcccc acgccgaact caaggagaaa 2520
gattacaagc tctacttcga aatagaagga aaggacaagg atatttacat ctcaagattg 2580
gatttcgagt acatcaagcc ctaccaagag atctccaact acctgttcgc ctatttcgcc 2640
agccaacaaa tcaatgaggc cagggaggag gagcagataa accagaccaa gagagctctc 2700
gctgggaata tgatcggtgt catctattac ctttaccaga agtatagagg gatcatcagt 2760
atagaggacc tcaagcagac caaagtggag agcgacagaa ataagttcga agggaatatc 2820
gagaggcccc tcgagtgggc cctctacagg aaattccagc aggagggcta tgttccccca 2880
atctcagagc tcatcaagct cagggaactc gagaagttcc ctctcaagga cgttaaacaa 2940
cccaagtacg agaacatcca gcaattcggg ataatcaagt tcgttagccc agaggagacc 3000
agcaccactt gccccaagtg cctcaggagg ttcaaggact acgacaagaa caaacaggag 3060
ggtttctgta agtgtcagtg tggcttcgac accaggaacg acctcaaggg ttttgagggg 3120
ctcaacgacc ccgacaaggt tgccgcattc aacatagcca agaggggctt tgaggacttg 3180
cagaagtata ag 3192
<210> 72
<211> 3192
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 72
atggagaagt acaagatcac caagaccatt agatttaagc tcctcccaga caagatccag 60
gacatcagca ggcaggttgc cgtgctccag aactctacaa atgcagagaa gaaaaataac 120
cttctcaggc tcgtgcaaag gggtcaggag ctccctaagc tcctcaacga gtacatcaga 180
tattcagata atcacaaact caagtccaac gtgaccgtcc acttcagatg gctcagactc 240
tttaccaagg acctcttcta taactggaag aaggacaaca ccgagaagaa gatcaagatt 300
tccgacgtcg tctatctctc acacgtgttc gaggcattcc tcaaggagtg ggagtccacc 360
atcgagagag tgaacgccga ctgcaacaag cccgaggaga gcaagaccag ggacgccgaa 420
atcgcattgt ctataaggaa actcggtatc aaacaccaat tgcccttcat caaagggttc 480
gtcgacaact ccaacgacaa gaacagcgag gacaccaaga gcaagctcac cgccctcctc 540
agcgagtttg aggccgtcct caagatctgc gagcaaaact atcttcccag ccaatccagc 600
gggattgcta ttgctaaggc ctcattcaat tattacacca ttaacaagaa gcagaaagat 660
tttgaggccg aaattgtcgc cctcaagaag cagctccacg ccaggtacgg taacaaaaag 720
tacgaccaac tcctcaggga gctcaacctc attcccctca aggagctccc cctcaaggag 780
ctcccactca tcgagttcta cagcgagatt aagaagagga agtccaccaa aaagagcgag 840
ttcctcgagg ccgtttcaaa cgggctcgtt tttgacgatc tcaaaagcaa gttccccctt 900
tttcagaccg agtccaacaa atacgacgag tatctcaagt tgagtaacaa gatcacccaa 960
aagtccactg ccaagtcctt gctctccaag gactcccctg aggcccagaa actccaaacc 1020
gagatcacca aactcaaaaa gaacaggggc gaatacttca aaaaggcctt cggcaaatac 1080
gtgcagctct gcgagcttta caaggagatc gccggaaaga gggggaagct caaggggcag 1140
atcaagggca tcgaaaacga gaggattgac agccagaggc tccagtactg ggccctcgtc 1200
cttgaggata acctcaagca ctccctcatc ctcatcccca aggagaagac aaacgaactg 1260
tacaggaagg tctggggcgc caaagatgac ggagccagca gctcttccag cagcacactc 1320
tactacttcg aatccatgac ttatagggcc ttgagaaaac tctgcttcgg tatcaacggt 1380
aacacctttc tcccagagat acagaaagag ttgccccagt acaaccagaa ggagttcggg 1440
gaattctgct tccacaagag caacgacgac aaggagatcg acgagcccaa actcatctcc 1500
ttttaccaga gcgttctcaa gaccgacttc gtcaagaata ccttggccct ccctcagtct 1560
gtctttaatg aggttgccat tcagagcttc gagactaggc aggacttcca gatcgccctc 1620
gagaaatgct gctacgcaaa gaaacagatc atttcagagt ctctcaagaa ggagatactc 1680
gagaactaca atactcagat attcaagatc acctccctcg atctccagag aagcgagcag 1740
aaaaatctca agggtcacac taggatctgg aataggttct ggaccaagca gaacgaagag 1800
ataaactaca acctcaggct caacccagag atcgctatcg tttggaggaa ggccaagaag 1860
accaggatcg aaaagtacgg ggagaggagc gtgctctatg agcccgagaa gaggaacagg 1920
tacttgcacg agcagtatac tctctgcacc accgtcaccg acaacgccct caacaatgaa 1980
atcaccttcg ccttcgagga tacaaagaag aaggggaccg agattgtgaa atataacgag 2040
aagatcaacc agaccctcaa gaaggagttt aacaaaaacc agctctggtt ttacggcata 2100
gacgccggag aaatcgagct cgccaccctc gccctcatga ataaggacaa ggagccccaa 2160
ctcttcaccg tgtatgagct taagaagctc gacttcttca agcacggcta catctacaac 2220
aaggaaaggg agcttgtgat cagggagaaa ccctacaagg caatccagaa cctcagctat 2280
ttccttaacg aggaactcta cgagaagacc tttagggacg ggaagttcaa cgagacctac 2340
aacgagctct tcaaagaaaa gcacgtgagc gccatcgacc ttaccaccgc aaaagtgatc 2400
aacggtaaga tcattttgaa cggggacatg attaccttcc tcaatctcag aatactccac 2460
gcccagagga agatttacga ggaactcatc gagaaccccc acgccgagct caaggagaag 2520
gattacaagc tctacttcga aatcgagggg aaggataagg acatctacat cagtaggctc 2580
gattttgaat acattaagcc ctaccaggag atcagcaact accttttcgc atacttcgcc 2640
tcccagcaga tcaacgaggc cagggaggaa gagcaaatca accaaaccaa gagggccttg 2700
gccgggaaca tgatcggggt gatctactac ctctaccaaa agtatagggg gatcatcagc 2760
atcgaggacc tcaagcagac taaggtcgag agtgatagaa acaagttcga gggcaacatc 2820
gagagaccac tcgagtgggc cctttacaga aagtttcaac aggaggggta cgttcccccc 2880
atctccgagc tcatcaagct cagggaactc gagaagttcc ccctcaagga cgtcaagcag 2940
cccaagtatg agaatataca gcagtttggg atcatcaaat tcgtgagtcc cgaggagacc 3000
tccaccacct gtcctaaatg cctcaggagg ttcaaggatt acgacaagaa caagcaagag 3060
gggttctgca aatgccagtg cggattcgat accaggaacg accttaaggg gttcgagggc 3120
ctcaacgatc ccgataaggt tgccgccttt aacatcgcca agaggggctt tgaggacctc 3180
cagaaataca ag 3192
<210> 73
<211> 1228
<212> PRT
<213> Unknown
<220>
<223> Lachnospiraceae bacterium
<400> 73
Met Ser Lys Leu Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr
1 5 10 15
Leu Arg Phe Lys Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp
20 25 30
Asn Lys Arg Leu Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys
35 40 45
Gly Val Lys Lys Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp
50 55 60
Val Leu His Ser Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu
65 70 75 80
Phe Arg Lys Lys Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn
85 90 95
Leu Glu Ile Asn Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn
100 105 110
Glu Gly Tyr Lys Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu
115 120 125
Pro Glu Phe Leu Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe
130 135 140
Asn Gly Phe Thr Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn
145 150 155 160
Met Phe Ser Glu Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys Ile
165 170 175
Asn Glu Asn Leu Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu Lys
180 185 190
Val Asp Ala Ile Phe Asp Lys His Glu Val Gln Glu Ile Lys Glu Lys
195 200 205
Ile Leu Asn Ser Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu Phe
210 215 220
Phe Asn Phe Val Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala Ile
225 230 235 240
Ile Gly Gly Phe Val Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu Asn
245 250 255
Glu Tyr Ile Asn Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro Lys
260 265 270
Phe Lys Pro Leu Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu Ser
275 280 285
Phe Tyr Gly Glu Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val Phe
290 295 300
Arg Asn Thr Leu Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys Lys
305 310 315 320
Leu Glu Lys Leu Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly Ile
325 330 335
Phe Val Lys Asn Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile Phe
340 345 350
Gly Glu Trp Asn Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp Asp
355 360 365
Ile His Leu Lys Lys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp Asp
370 375 380
Arg Arg Lys Ser Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln Leu
385 390 395 400
Gln Glu Tyr Ala Asp Ala Asp Leu Ser Val Val Glu Lys Leu Lys Glu
405 410 415
Ile Ile Ile Gln Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser Ser
420 425 430
Glu Lys Leu Phe Asp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys Lys
435 440 445
Asn Asp Ala Val Val Ala Ile Met Lys Asp Leu Leu Asp Ser Val Lys
450 455 460
Ser Phe Glu Asn Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu Thr
465 470 475 480
Asn Arg Asp Glu Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp Ile
485 490 495
Leu Leu Lys Val Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val Thr
500 505 510
Gln Lys Pro Tyr Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn Pro
515 520 525
Gln Phe Met Gly Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr Arg Ala
530 535 540
Thr Ile Leu Arg Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp Lys
545 550 555 560
Lys Tyr Ala Lys Cys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn Gly
565 570 575
Asn Tyr Glu Lys Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys Met
580 585 590
Leu Pro Lys Val Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn Pro
595 600 605
Ser Glu Asp Ile Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly
610 615 620
Asp Met Phe Asn Leu Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys
625 630 635 640
Asp Ser Ile Ser Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn
645 650 655
Phe Ser Glu Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu
660 665 670
Val Glu Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys
675 680 685
Glu Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile
690 695 700
Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu His
705 710 715 720
Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln Ile
725 730 735
Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu Lys
740 745 750
Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile Ala Asn Lys
755 760 765
Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val Tyr
770 775 780
Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro Ile
785 790 795 800
Ala Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu Val
805 810 815
Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile Asp
820 825 830
Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly Lys Gly
835 840 845
Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn Phe Asn
850 855 860
Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp Lys Lys Glu
865 870 875 880
Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn Ile
885 890 895
Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile Cys
900 905 910
Glu Leu Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu Asn
915 920 925
Ser Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr Gln
930 935 940
Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val Asp Lys
945 950 955 960
Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr Gln Ile
965 970 975
Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln Asn Gly Phe
980 985 990
Ile Phe Tyr Ile Pro Ala Trp Leu Thr Ser Lys Ile Asp Pro Ser Thr
995 1000 1005
Gly Phe Val Asn Leu Leu Lys Thr Lys Tyr Thr Ser Ile Ala Asp
1010 1015 1020
Ser Lys Lys Phe Ile Ser Ser Phe Asp Arg Ile Met Tyr Val Pro
1025 1030 1035
Glu Glu Asp Leu Phe Glu Phe Ala Leu Asp Tyr Lys Asn Phe Ser
1040 1045 1050
Arg Thr Asp Ala Asp Tyr Ile Lys Lys Trp Lys Leu Tyr Ser Tyr
1055 1060 1065
Gly Asn Arg Ile Arg Ile Phe Arg Asn Pro Lys Lys Asn Asn Val
1070 1075 1080
Phe Asp Trp Glu Glu Val Cys Leu Thr Ser Ala Tyr Lys Glu Leu
1085 1090 1095
Phe Asn Lys Tyr Gly Ile Asn Tyr Gln Gln Gly Asp Ile Arg Ala
1100 1105 1110
Leu Leu Cys Glu Gln Ser Asp Lys Ala Phe Tyr Ser Ser Phe Met
1115 1120 1125
Ala Leu Met Ser Leu Met Leu Gln Met Arg Asn Ser Ile Thr Gly
1130 1135 1140
Arg Thr Asp Val Asp Phe Leu Ile Ser Pro Val Lys Asn Ser Asp
1145 1150 1155
Gly Ile Phe Tyr Asp Ser Arg Asn Tyr Glu Ala Gln Glu Asn Ala
1160 1165 1170
Ile Leu Pro Lys Asn Ala Asp Ala Asn Gly Ala Tyr Asn Ile Ala
1175 1180 1185
Arg Lys Val Leu Trp Ala Ile Gly Gln Phe Lys Lys Ala Glu Asp
1190 1195 1200
Glu Lys Leu Asp Lys Val Lys Ile Ala Ile Ser Asn Lys Glu Trp
1205 1210 1215
Leu Glu Tyr Ala Gln Thr Ser Val Lys His
1220 1225
<210> 74
<211> 3684
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 74
atgtcaaagt tggaaaagtt caccaattgc tattcacttt cgaaaacttt gaggttcaaa 60
gctatacccg tcggtaagac gcaagaaaac attgataata agagactact ggtcgaagat 120
gaaaaacgtg cagaagatta caaaggtgta aagaaacttc tggacagata ctacctcagc 180
ttcataaatg atgtgctcca tagtattaaa ttgaagaact tgaataacta tattagtctt 240
ttccgaaaga aaactagaac agaaaaggag aataaagaat tggagaatct ggagattaat 300
ctgcgcaagg aaattgcaaa agcattcaag ggtaacgaag ggtacaagtc actcttcaaa 360
aaagacatta tagagacgat tctaccagaa ttccttgatg ataaagacga gattgccctc 420
gttaactcct ttaacggttt tactacagca tttactggct tcttcgataa cagggaaaat 480
atgttcagcg aagaagctaa atctacttca atagcctttc gttgcataaa tgagaacctg 540
acccgctata taagcaacat ggatatattt gagaaggttg acgctatttt cgataagcat 600
gaggttcagg agatcaagga gaagattctc aacagcgact acgacgtgga ggattttttc 660
gaaggcgagt tctttaactt cgttttaacc caagagggaa tagacgttta taatgcaatc 720
attggaggat tcgttaccga gagcggagaa aagataaaag gtcttaacga atatatcaat 780
ctatacaacc agaagacaaa gcagaaactc ccaaaattta aaccacttta caagcaagtc 840
ctctctgatc gtgaaagtct ttctttttat ggcgaaggtt ataccagtga cgaagaggtg 900
ttagaagtgt ttagaaacac attgaataaa aactctgaaa ttttttccag catcaagaaa 960
ctggagaaac tgtttaagaa ctttgatgag tactctagtg cgggcatttt cgttaagaat 1020
ggtcccgcta tttcaaccat ctcaaaagat atcttcgggg agtggaacgt catccgtgac 1080
aaatggaacg ctgagtatga tgacattcat ctgaagaaga aagcagttgt aactgagaag 1140
tatgaagacg ataggagaaa atcatttaaa aaaatcggtt ctttctcttt ggaacagctc 1200
caagagtatg cggacgcgga tctctccgta gttgagaagt taaaggagat cataattcaa 1260
aaagtcgacg aaatttataa ggtgtatggc tcaagtgaaa agctatttga cgcagacttc 1320
gtcctggaga aaagccttaa aaagaatgat gcagtcgtgg ctataatgaa ggatctgttg 1380
gactcagtga agtctttcga gaactacatt aaggcttttt ttggagaagg aaaggagacg 1440
aacagagatg agagctttta cggagacttc gtgcttgcct acgatatcct tctgaaagtt 1500
gatcatattt acgatgcaat acgtaattac gttacccaaa agccttatag caaagacaag 1560
tttaagttat attttcaaaa tccgcaattt atgggcggat gggacaagga taaggaaaca 1620
gattatcgcg caacaatttt gagatatggt tccaagtact atctggctat catggataag 1680
aaatatgcta agtgcctgca gaaaatcgac aaagatgatg ttaatgggaa ttatgaaaag 1740
attaactata aattattgcc cggacctaac aaaatgttgc ctaaggtttt tttttcaaag 1800
aaatggatgg cctactataa tccttccgag gacatccaaa aaatttataa gaacggtaca 1860
tttaaaaaag gagatatgtt caacttaaat gattgtcaca agctaattga tttttttaag 1920
gactccatct ctagataccc aaaatggagt aacgcctatg attttaactt cagcgaaact 1980
gaaaagtata aggatattgc tggattttac cgggaagtgg aggaacaggg atacaaggta 2040
agtttcgaaa gtgcgtctaa aaaggaagtc gacaaattgg ttgaggaagg caaactatac 2100
atgtttcaga tctataacaa agacttcagc gataagtcac atggcacccc aaaccttcac 2160
actatgtact ttaagctgct ttttgacgaa aacaaccacg gccaaatcag attgtcaggc 2220
ggggcagagc tttttatgcg tagggcttca ctcaaaaagg aggagcttgt cgttcatccc 2280
gccaattctc ctatagctaa caaaaatccc gataatccta agaagaccac gacacttagc 2340
tatgatgttt ataaggacaa gagattttct gaggatcaat atgagttgca tatccccatt 2400
gctataaata aatgtcctaa gaatatattt aaaattaata cagaagttag agtgctcctt 2460
aaacacgatg ataacccata cgtgatcggt attgacaggg gagagagaaa ccttctgtac 2520
atcgttgttg ttgacggcaa aggcaatatt gtggaacagt atagtctcaa cgaaatcatc 2580
aacaacttta acggtatacg tattaagaca gactatcata gccttttaga taagaaagag 2640
aaggaacgtt tcgaagcgag acaaaattgg acaagcatcg agaatataaa ggaactgaaa 2700
gctgggtata tcagtcaggt cgttcataag atatgtgaat tagttgaaaa gtacgatgcc 2760
gtaattgctt tggaggactt aaactccggg ttcaagaact ctcgtgtcaa ggtagagaaa 2820
caagtatacc agaagtttga gaagatgctt attgataagc ttaactacat ggtagataag 2880
aagagcaatc cttgtgctac aggtggggca cttaaaggtt atcagataac caacaagttt 2940
gaaagtttca aatctatgag tacccaaaat gggttcattt tttacattcc cgcatggctt 3000
acctcgaaga tcgacccaag tactggattt gtaaatttgt tgaaaactaa gtatacaagt 3060
attgcagatt ctaagaaatt tatttcttca ttcgaccgca ttatgtatgt tcctgaagag 3120
gatttattcg agtttgctct ggactataag aacttctcca ggacagatgc tgattatatt 3180
aagaaatgga aactttattc ttacggaaac cggattagaa ttttcaggaa cccaaagaaa 3240
aataacgtat tcgattggga ggaggtgtgt ttgacctctg cctacaagga gctatttaat 3300
aaatacggca ttaactatca acaaggagac atcagggcct tgctttgtga gcaaagcgat 3360
aaagctttct attcgtcatt catggcctta atgagtctca tgctacagat gaggaattct 3420
ataacaggtc gcacagatgt tgattttttg atatcacccg tgaagaattc agacggaata 3480
ttctacgact cacgtaacta tgaggcccaa gagaatgcaa tacttccaaa aaacgctgac 3540
gctaatggcg cctataatat cgcaaggaag gtgctctggg ccattggtca gtttaagaaa 3600
gccgaagatg aaaaactgga taaggtgaag attgcaatct cgaataagga gtggctcgag 3660
tatgctcaaa ccagtgtaaa gcac 3684
<210> 75
<211> 3684
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 75
atgtccaagc tcgagaagtt taccaattgc tatagcctca gcaaaaccct cagattcaag 60
gcaatccccg ttggcaagac ccaagaaaac atcgataaca agaggctcct cgtggaggac 120
gagaagaggg ccgaggacta caagggggtg aagaagctcc tcgacaggta ctacctcagc 180
ttcatcaacg acgtgctcca ctccatcaag ctcaagaatc tcaataacta catctccctt 240
ttcaggaaga agaccaggac cgagaaggag aacaaggagc tcgaaaacct cgaaatcaat 300
ctcagaaagg agatcgccaa agccttcaag ggcaacgagg gttacaagag cctcttcaag 360
aaagacatca ttgagaccat actccccgaa ttcctcgacg acaaggatga gatcgccctc 420
gttaacagtt tcaacggctt caccaccgct ttcaccggtt ttttcgacaa cagggagaac 480
atgtttagtg aggaggcaaa gagtaccagc atcgccttca gatgtatcaa tgagaacctc 540
accaggtaca tcagcaacat ggatatattc gagaaggtgg acgccatctt cgacaagcat 600
gaggtgcagg agataaagga gaaaatcctc aactccgact acgacgttga ggatttcttc 660
gagggcgagt tcttcaactt cgttctcacc caagagggca tagacgtcta caacgccatc 720
ataggcgggt tcgtgaccga gtcaggggag aaaatcaagg ggcttaacga gtacattaac 780
ctttacaatc agaaaacaaa gcaaaagctc cccaagttca aacccctcta caagcaggtc 840
ctctccgaca gggagtccct ctccttctac ggtgaggggt ataccagcga tgaagaggtc 900
ctcgaggtgt tcaggaacac ccttaataag aacagcgaga tcttcagcag cataaagaag 960
ctcgagaaat tgttcaagaa ctttgacgag tactcctccg ccggcatctt cgtcaagaac 1020
ggtcccgcca tttctactat cagcaaagac atattcggcg agtggaacgt catcagggac 1080
aaatggaatg ccgaatacga tgacatacac ctcaagaaga aggctgtcgt taccgagaaa 1140
tacgaagacg acagaaggaa gagttttaaa aagatcggta gtttctccct tgagcagttg 1200
caggaatatg ctgatgctga ccttagcgtg gtcgaaaaac tcaaggagat catcatacag 1260
aaggtcgacg agatttacaa ggtctatggc tccagtgaga agctcttcga cgccgacttc 1320
gttttggaga agagccttaa gaagaacgac gccgttgttg ccatcatgaa ggacctcctc 1380
gatagcgtta agtccttcga aaactatata aaggcctttt tcggagaggg taaagagacc 1440
aacagggacg aaagcttcta cggagacttc gtcctcgcct acgacatcct cctcaaagtc 1500
gaccacatct acgacgccat caggaactac gttacccaaa agccctacag taaggacaag 1560
ttcaagttgt atttccagaa cccccagttc atgggagggt gggacaaaga taaagagacc 1620
gactacagag ccaccatcct caggtatggt agcaagtact acctcgccat catggataag 1680
aagtacgcta agtgcctcca gaagatcgac aaagatgacg ttaacgggaa ctatgagaag 1740
atcaactata agctcctccc cggtcccaac aagatgttgc ccaaggtctt cttcagcaag 1800
aagtggatgg cctattacaa tcccagcgag gacatccaga agatctacaa gaatgggacc 1860
ttcaagaagg gggacatgtt caacctcaat gactgccata agctcatcga ctttttcaag 1920
gatagcattt caaggtaccc caagtggtcc aacgcctacg acttcaactt cagcgaaacc 1980
gagaagtaca aagacatcgc cgggttctac agggaagtcg aggagcaggg ctacaaggtt 2040
tctttcgaga gtgcatctaa gaaggaggtt gacaagctcg tcgaggaggg caagctctat 2100
atgttccaaa tatacaacaa ggacttctcc gacaagagcc acggcacacc caatcttcat 2160
accatgtatt tcaaactcct cttcgacgaa aacaaccacg ggcagattag gctcagcggt 2220
ggcgccgagc tttttatgag aagggcatcc ctcaaaaagg aggagctcgt ggtccatccc 2280
gccaactccc ccatcgctaa caagaacccc gacaatccaa agaaaactac caccctctcc 2340
tacgacgtct acaaggacaa gagattcagc gaggaccagt acgagctcca catccccatc 2400
gccatcaaca agtgtcccaa gaacatcttc aaaatcaaca ccgaggttag ggttttgctc 2460
aagcacgacg ataaccccta cgtcattgga atcgacagag gggaaaggaa cctcctctat 2520
attgttgtcg tggacggcaa gggcaacatt gtggaacagt actcccttaa cgagatcatc 2580
aacaatttta acgggatcag gatcaagacc gattatcaca gtctcctcga caagaaagaa 2640
aaggagaggt ttgaggccag gcagaattgg acaagcattg aaaacataaa agaacttaaa 2700
gccggctata tcagccaagt ggtgcacaag atctgcgagt tggtggagaa gtacgacgcc 2760
gtcatcgccc tcgaggacct caacagcggc tttaagaact ccagagtgaa ggtggagaaa 2820
caggtttacc agaagttcga gaaaatgctc atagacaagc tcaattacat ggtcgacaag 2880
aagtccaacc cctgcgccac cggtggggcc ctcaagggtt atcagatcac taacaagttc 2940
gagtccttca agtccatgag tacccagaac gggtttatct tctacattcc cgcttggctc 3000
acatcgaaga tagacccctc cacaggtttc gttaacttgc ttaaaactaa atacacctcc 3060
atcgccgaca gcaagaaatt tatatcctcc ttcgatagga tcatgtacgt ccccgaggag 3120
gacctcttcg aattcgccct tgactataag aacttctcta ggaccgacgc agactacatc 3180
aaaaagtgga agctctacag ttacggaaat aggatcagga tcttcaggaa ccccaagaaa 3240
aacaacgtgt tcgactggga ggaggtctgt ctcaccagcg catacaagga gttgttcaac 3300
aaatacggca tcaattacca gcagggggat ataagagcac tcctctgcga gcaatccgac 3360
aaagcattct acagctcttt catggccctc atgtccctta tgcttcagat gaggaactcc 3420
atcaccggca ggaccgacgt tgacttcctc atctctcccg tgaagaactc tgacgggatc 3480
ttctacgaca gcagaaacta tgaggcccag gagaatgcca tactcccaaa gaacgccgac 3540
gccaatgggg cctataacat tgctagaaag gtcctctggg ccatcggaca gttcaaaaag 3600
gctgaggacg agaagctcga caaggtcaag atcgccatct ccaataagga atggctcgaa 3660
tacgcccaga ccagcgtcaa gcac 3684
<210> 76
<211> 3684
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 76
atgtccaagc tcgaaaaatt cacaaactgc tactccttga gcaaaaccct caggttcaag 60
gccattcccg ttgggaagac ccaggaaaat atcgataaca agaggctctt ggtggaggat 120
gagaaaaggg ccgaggacta caagggtgtg aaaaagctcc tcgacaggta ctacctcagc 180
tttattaacg acgttctcca ctccatcaag cttaaaaacc tcaataacta catcagcctc 240
tttagaaaga aaaccaggac agagaaggag aataaggagc tcgagaacct cgagataaat 300
ctcagaaagg agatcgccaa ggctttcaag ggcaacgagg gctataagag tttgttcaag 360
aaggatatca tcgaaactat cctccccgaa ttcctcgacg ataaagatga gatcgctctt 420
gtcaactctt tcaacggttt tactactgct tttaccggtt tcttcgataa cagggagaac 480
atgttcagcg aagaggctaa atccacctcc atcgctttca ggtgcataaa cgaaaatctc 540
accagataca tcagtaacat ggatatcttt gagaaggtgg acgccatctt cgacaagcat 600
gaggttcaag aaatcaaaga aaaaatcctt aactccgact atgacgtgga ggattttttc 660
gagggcgagt tcttcaactt cgttctcacc caggagggga tcgatgtcta caatgccatc 720
atcggtggct tcgttactga gagcggcgag aaaatcaaag ggctcaacga gtacatcaat 780
ctctataatc agaagaccaa gcaaaagctc cctaagttca agcccctcta caaacaggtg 840
ctcagcgaca gggagtccct ctccttctac ggagagggat acacctccga cgaagaggtt 900
ctcgaggtgt ttaggaacac actcaacaag aactccgaga tcttctcctc cataaagaag 960
ctcgagaaat tgttcaagaa tttcgacgag tattcctccg ctggaatctt cgtcaagaat 1020
ggccctgcca tcagcaccat tagcaaggac atctttggcg agtggaatgt gatcagagac 1080
aaatggaacg ccgaatacga cgacatccac ctcaaaaaga aagccgttgt gacagagaag 1140
tacgaagacg ataggaggaa gtccttcaag aagattggaa gcttcagctt ggagcagctc 1200
caagagtacg ccgatgccga cctcagcgtc gtcgagaagc tcaaggagat cataatccag 1260
aaggttgacg agatctacaa ggtgtacggg agcagcgaga agctcttcga cgccgacttt 1320
gttctcgaga agtccctcaa aaagaacgac gccgtggtcg caatcatgaa ggacttgctc 1380
gacagcgtca agagtttcga gaattacatc aaggccttct ttggcgaggg aaaggagact 1440
aacagggacg agagcttcta cggggatttc gtcctcgcct atgacatctt gttgaaggtg 1500
gaccacatct acgacgccat caggaattac gtcactcaga aaccctacag caaagacaag 1560
ttcaagctct acttccaaaa cccacaattc atgggcggct gggataagga taaggagacc 1620
gactataggg ccaccatact caggtatggg tccaagtact accttgccat catggacaag 1680
aagtacgcaa agtgccttca aaagatcgac aaagacgatg tgaatggaaa ctacgagaag 1740
atcaactaca agctcctccc cgggcccaat aaaatgctcc ctaaggtgtt tttcagcaaa 1800
aagtggatgg cttactacaa cccctccgag gacatccaga agatctacaa gaacggcact 1860
ttcaagaagg gcgacatgtt taaccttaac gactgtcaca agttgatcga cttcttcaag 1920
gattcaatca gtaggtaccc caagtggagc aacgcctacg acttcaactt cagcgagaca 1980
gagaagtaca aggacattgc tggtttctac agggaggtcg aagagcaagg ttacaaggta 2040
tcattcgaaa gcgccagcaa gaaggaggtg gacaaactcg tcgaggaggg caagctctat 2100
atgttccaaa tctataacaa ggatttctcc gacaagagcc acgggacccc caacctccac 2160
actatgtatt tcaagctcct cttcgacgag aataatcacg gtcagatcag gctcagtggg 2220
ggtgccgagc tctttatgag aagggcctcc ctcaagaagg aggagctcgt cgttcacccc 2280
gccaatagcc ctatcgccaa caagaacccc gacaacccca aaaaaaccac caccttgtcc 2340
tacgacgtgt acaaagacaa gaggttcagc gaggatcagt atgaattgca catccccatc 2400
gcaatcaaca agtgccccaa gaatatcttc aaaatcaaca ccgaggtcag ggtcctcctt 2460
aaacatgacg ataaccccta tgtcatcggg atcgatagag gtgagaggaa cctcctctat 2520
atcgttgtgg tggacggcaa aggtaacatt gttgagcagt acagcctcaa cgagatcatc 2580
aacaacttta acggaatcag gatcaaaacc gactatcact ccctccttga caaaaaagag 2640
aaagagaggt tcgaggccag acagaactgg accagcatcg aaaacatcaa ggagcttaag 2700
gccgggtata ttagccaagt tgttcataag atttgcgagc tcgtggagaa gtacgacgcc 2760
gtgatcgcct tggaggacct caattccggt ttcaagaact ccagggtcaa agtggagaaa 2820
caagtgtacc agaagtttga gaagatgctc atcgacaaac tcaactacat ggtcgacaaa 2880
aagagcaacc cctgcgcaac cgggggagcc ttgaagggct accagatcac caataagttc 2940
gaatccttca aaagcatgag cacccagaac ggattcatat tttacatccc agcctggctc 3000
accagtaaaa tcgatccctc caccgggttc gtgaatttgc ttaaaaccaa gtacaccagt 3060
atcgccgatt ccaagaaatt catcagctca ttcgatagaa tcatgtatgt ccccgaggag 3120
gacctcttcg agttcgccct tgactacaag aacttctcca gaaccgatgc cgattatatc 3180
aagaagtgga agttgtatag ctacgggaac aggattagga tcttcaggaa ccccaagaag 3240
aacaacgtct tcgactggga ggaggtgtgc ctcacctccg cctataagga gcttttcaac 3300
aaatacggga tcaactacca gcagggcgac atcagggccc tcctttgcga gcaaagcgat 3360
aaggccttct acagcagctt catggctctc atgtccctca tgctccagat gaggaatagc 3420
atcaccggca gaaccgacgt cgacttcctc atttcccccg tgaagaatag cgacggcatc 3480
ttctacgact ccaggaacta cgaggcccag gagaacgcca tcctccccaa gaacgccgac 3540
gccaacggtg cttacaacat cgctaggaag gtgctctggg ccataggtca attcaagaaa 3600
gccgaggacg aaaagctcga caaggtgaaa atcgccatct ccaacaaaga gtggcttgag 3660
tacgcccaga cttccgtcaa acac 3684
<210> 77
<211> 3684
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 77
atgtccaaac tcgagaagtt caccaattgt tactccctca gcaagacctt gaggttcaag 60
gccatccctg tgggtaaaac ccaggaaaac atcgacaaca agagactcct cgtcgaggac 120
gagaagagag ccgaggacta taagggcgtc aaaaagctct tggacaggta ctatctctcc 180
ttcatcaacg acgtgctcca ttccattaaa cttaagaacc ttaacaacta tatctccctc 240
ttcagaaaaa agaccagaac cgagaaagag aacaaggagc tcgaaaacct tgagataaac 300
ctcagaaagg agatcgccaa ggccttcaaa gggaacgagg ggtacaagag cctctttaaa 360
aaggatatta tcgagaccat tctccccgaa tttctcgatg acaaggacga aatcgccctc 420
gtgaactcct ttaacgggtt caccacagca ttcaccggtt tcttcgacaa cagggagaat 480
atgttctctg aggaggcaaa gtccaccagc atagccttta gatgcatcaa cgagaacctc 540
accaggtaca tctccaacat ggacatcttc gagaaggtgg atgctatttt cgataagcac 600
gaggtccagg agatcaagga gaagatcctc aatagcgact acgatgttga ggattttttc 660
gagggtgagt tctttaactt cgtgctcacc caggagggta tcgacgtcta taacgcaatc 720
ataggtgggt tcgtgacaga aagcggtgaa aagatcaaag gcctcaatga gtatattaac 780
ctctacaatc agaaaaccaa gcaaaagctc ccaaagttca agcccctcta caagcaagtg 840
ctcagcgaca gggagtccct cagcttctac ggagagggct acaccagtga cgaggaggtt 900
cttgaggtgt tcaggaatac cctcaacaag aacagcgaaa ttttctcttc cattaagaag 960
ctcgagaagc tctttaagaa cttcgacgag tacagctccg ccggcatctt cgtgaagaac 1020
gggcctgcca tatctaccat atccaaggac atcttcggcg agtggaacgt tatcagagac 1080
aagtggaacg ccgagtacga tgacatccac ctcaagaaga aggccgtcgt caccgagaag 1140
tatgaagatg acaggagaaa gagtttcaag aaaatcgggt cattctcact cgagcagctc 1200
caggagtatg cagacgccga cctcagcgtc gtcgagaagc tcaaagagat cataatccag 1260
aaggtcgacg agatttacaa agtctacggg tcctccgaga agcttttcga tgccgacttc 1320
gtgcttgaga aaagtctcaa aaaaaacgac gccgtggtgg ccatcatgaa ggacctcctt 1380
gacagcgtca agtccttcga gaactatatc aaggcattct ttggcgaagg taaagagaca 1440
aacagagacg agagtttcta cggtgatttc gttctcgcct acgatatact cctcaaggtc 1500
gatcacatct acgacgccat caggaactac gtcacccaga aaccctacag caaagacaag 1560
tttaagctct acttccagaa cccccagttc atgggcggct gggacaaaga caaggagacc 1620
gattataggg ctaccatcct cagatacggt tccaaatact acctcgccat catggacaag 1680
aaatatgcca agtgcctcca gaagatagat aaagatgacg ttaatggaaa ctacgagaag 1740
atcaactaca agctcctccc cggccccaac aagatgctcc ccaaagtttt tttcagcaaa 1800
aagtggatgg cctactataa ccccagcgag gacatccaga aaatatacaa gaacgggacc 1860
ttcaagaaag gtgacatgtt taacctaaac gactgccaca agcttatcga ttttttcaag 1920
gacagcatca gcaggtaccc caagtggtcc aacgcctatg acttcaactt cagcgagaca 1980
gagaaataca aggacattgc cgggttttat agggaggtgg aagagcaggg atacaaagtg 2040
tctttcgaat ccgcctcaaa gaaggaagtc gacaagctcg ttgaagaagg caagctttat 2100
atgttccaga tctataataa ggacttcagt gacaagtccc acggcacccc aaatcttcat 2160
accatgtact tcaagttgct tttcgatgag aataatcacg ggcagatcag actctctggg 2220
ggggccgagc tcttcatgag aagggcctcc ctcaaaaagg aggagctcgt ggttcacccc 2280
gccaacagcc ccatcgcaaa taagaacccc gacaacccca agaagaccac caccctctcc 2340
tacgacgttt acaaggataa gaggttctcc gaagaccagt atgagctcca tatcccaatc 2400
gctatcaaca agtgccccaa gaacatcttc aagatcaaca ctgaggttag agtgctcctc 2460
aagcacgacg acaaccctta tgtgataggg atcgacaggg gcgagaggaa cctcttgtac 2520
atcgtggttg ttgacgggaa aggtaacatc gtcgagcaat attccctcaa tgaaatcatt 2580
aataatttta acggcatcag gataaaaaca gactaccaca gcctcctcga caagaaggaa 2640
aaggagaggt tcgaggccag gcagaactgg acctctatcg agaacatcaa ggagctcaag 2700
gccggctata tctctcaggt tgtgcacaaa atctgcgagc ttgtcgaaaa gtacgacgcc 2760
gttatcgccc tcgaggacct taacagcgga ttcaagaact ccagggtgaa ggtggagaag 2820
caggtctacc agaagttcga gaagatgctc attgacaagc tcaactatat ggtcgacaag 2880
aaatccaatc cttgcgccac aggcggagcc ctcaaggggt atcagatcac caacaagttc 2940
gaaagcttta agagcatgtc aacccaaaac gggtttatat tttatatccc agcctggctc 3000
accagcaaga tcgacccatc caccggattc gttaacctcc tcaaaaccaa gtacaccagc 3060
atcgccgatt ccaagaaatt catctcctca ttcgatagaa tcatgtatgt gccagaggaa 3120
gacctcttcg agtttgctct cgactacaag aacttctcta ggaccgacgc cgactacatc 3180
aagaagtgga agttgtacag ctacggcaac aggatcagga tcttcaggaa ccccaagaag 3240
aataacgtct ttgattggga ggaggtgtgt ctcacctccg cctacaagga gctctttaat 3300
aagtatggca tcaactacca gcagggggat atcagggccc tcctctgcga gcagagcgat 3360
aaggccttct acagctcctt tatggccctc atgagcctca tgctccagat gaggaatagc 3420
atcaccggaa ggaccgacgt tgacttcctc atctctcccg ttaagaactc cgacggcatc 3480
ttttacgact ccaggaacta cgaggcccaa gagaacgcca ttctccccaa gaacgccgat 3540
gccaatggcg cctacaacat cgccaggaaa gtgctctggg ccatcggcca gttcaaaaag 3600
gctgaggacg agaagctcga caaagttaag atcgctataa gcaacaagga gtggcttgag 3660
tatgcccaga cctctgttaa gcac 3684
<210> 78
<211> 3684
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 78
atgtccaagc tcgaaaaatt tacaaattgc tacagcctca gcaagaccct caggtttaag 60
gctatccccg ttggcaagac ccaggaaaac atcgacaaca agagactcct tgttgaggac 120
gagaaaagag ctgaagatta caagggggtg aaaaagcttc tcgacaggta ctacttgtca 180
tttatcaacg acgtcctcca cagcatcaag ctcaagaatt tgaacaatta tattagcctc 240
ttcaggaaga agaccaggac cgagaaagag aacaaggagc tcgagaacct tgaaattaac 300
cttaggaaag agatagccaa ggctttcaag gggaacgagg ggtacaagtc cctcttcaag 360
aaggacatca tcgaaaccat ccttcccgag ttcttggacg ataaggacga gattgccctt 420
gtcaactcat tcaacgggtt caccaccgcc tttactgggt tcttcgacaa cagggaaaat 480
atgttttccg aagaagccaa gagcacctcc atagctttca ggtgcatcaa cgagaacctt 540
accaggtata tcagtaacat ggacatcttc gaaaaggtcg atgctatctt cgacaagcac 600
gaagtccagg aaattaagga gaagattctt aactccgatt acgacgtgga ggacttcttc 660
gagggagagt tcttcaactt cgtcctcact caggagggga tcgacgtgta caacgccatc 720
atcgggggct ttgtgacaga gtctggggag aagattaagg gcctcaacga atacatcaac 780
ctctataatc aaaagaccaa gcaaaagttg cccaagttca aacctctcta caagcaggtg 840
ctttcagaca gagagagcct cagcttctac ggcgagggtt acacctccga tgaggaggtg 900
ctcgaagtct tcaggaacac tctcaacaag aatagcgaga tattctccag tatcaaaaag 960
ctcgagaagc tcttcaagaa ctttgacgag tactctagcg ccggcatatt cgtcaaaaac 1020
ggccccgcta tctccaccat cagcaaggac atctttggtg agtggaacgt gatccgggac 1080
aaatggaacg ccgaatacga cgatatccac ctcaagaaaa aggccgtcgt caccgaaaag 1140
tatgaggacg acaggaggaa gtcctttaag aagattggtt ccttctcctt ggagcagctt 1200
caagagtacg ccgacgccga cctcagcgtc gtcgaaaagc ttaaggagat catcatacag 1260
aaagtggacg aaatatacaa ggtttacggg tcctctgaga agctctttga tgcagacttc 1320
gtccttgaga agtccctcaa gaagaacgac gccgttgtcg ccatcatgaa ggacctcctc 1380
gacagcgtga aatcatttga aaactacata aaggcctttt ttggggaggg caaggagacc 1440
aacagagacg aatccttcta cggcgacttc gtgctcgcct acgatatcct ccttaaagtc 1500
gatcacattt atgacgcaat cagaaactat gtcacccaaa aaccctacag caaggacaaa 1560
ttcaagttgt actttcaaaa cccccagttc atgggagggt gggacaagga taaggagacc 1620
gactataggg ccaccatctt gagatacggg tccaagtatt accttgccat catggacaag 1680
aaatacgcca aatgtcttca gaaaatcgac aaggacgacg tgaacgggaa ctacgaaaaa 1740
atcaactaca agctcctccc cggccccaac aagatgctcc ccaaggtctt tttcagtaag 1800
aagtggatgg cctattacaa cccctccgag gacatccaga aaatatacaa gaacggtacc 1860
tttaagaagg gcgatatgtt caatttgaac gactgccaca agctcattga tttcttcaag 1920
gattccatca gcaggtatcc caaatggtca aacgcctacg atttcaattt ctccgagacc 1980
gagaaataca aggacatcgc cgggttctat agggaggtcg aggagcaggg gtataaggtc 2040
agttttgagt cagccagcaa gaaagaggtg gacaagttgg ttgaagaggg taagctttac 2100
atgttccaaa tctacaacaa ggacttcagc gacaagtccc acgggacccc caacctccac 2160
accatgtact tcaagctcct cttcgacgaa aacaatcacg gtcagatcag gctctccgga 2220
ggcgccgagc tcttcatgag gagggcctcc cttaagaagg aagaactcgt ggttcacccc 2280
gccaacagcc ccatcgccaa caagaacccc gataacccca aaaaaaccac aacactttcc 2340
tatgacgttt acaaggacaa aaggttctct gaggaccagt acgagctcca catccctatc 2400
gccatcaata agtgtcccaa gaatatcttt aagatcaaca ccgaggtgag ggtgctcctc 2460
aagcacgacg acaatcccta cgtgatcggc atcgacaggg gagagaggaa cctcctctat 2520
atcgtcgtcg tcgacggcaa ggggaatatc gttgaacaat atagcctcaa cgaaatcatc 2580
aacaacttta acggcatcag gatcaagacc gactaccaca gcctcctcga taagaaggaa 2640
aaggagagat tcgaggctag gcaaaattgg acctccatcg agaacatcaa ggagctcaaa 2700
gccggttaca tcagccaagt ggtgcataag atttgcgaac ttgtggagaa gtatgacgcc 2760
gtgatcgccc ttgaagacct caacagcggg ttcaaaaact ccagggtcaa ggtcgagaag 2820
caggtctacc aaaagttcga gaagatgctc atcgacaagc tcaactatat ggtggacaag 2880
aagagcaatc cctgcgccac agggggggcc ctcaaggggt accagatcac caacaagttt 2940
gaaagcttca aaagcatgtc cacccagaac ggctttatct tctacatccc tgcctggctc 3000
acctccaaga tcgaccccag cacaggcttc gttaacctcc tcaagactaa gtatacctcc 3060
atcgccgaca gcaagaagtt tatcagctca ttcgacagga tcatgtacgt cccagaggag 3120
gacctcttcg agttcgccct cgactacaag aattttagca ggaccgacgc tgactatata 3180
aagaagtgga aactctacag ttacggcaat aggatcagga tcttcaggaa ccccaagaaa 3240
aacaacgttt tcgactggga ggaggtctgc ctcaccagcg cctacaagga actctttaac 3300
aagtacggca taaactacca gcagggcgac atcagggccc tcttgtgcga acagtccgac 3360
aaggccttct actcctcctt catggccctc atgtccctca tgcttcaaat gagaaattcc 3420
atcaccggta ggaccgatgt cgacttcctc atcagccctg ttaaaaacag tgacgggatc 3480
ttctacgact ccaggaacta cgaggcacag gaaaacgcca tccttcccaa gaacgcagac 3540
gctaacggtg cctacaatat cgctaggaaa gtcctctggg ctatcggaca gttcaagaag 3600
gccgaggacg aaaaactcga taaggtcaaa atagccatct ctaacaagga atggctcgaa 3660
tacgcccaga ctagtgtgaa gcac 3684
<210> 79
<211> 3684
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 79
atgagtaagc ttgagaagtt caccaactgt tacagccttt caaagaccct caggttcaaa 60
gccatccccg ttggcaaaac ccaggaaaac atcgacaaca aaaggcttct cgttgaggac 120
gaaaagaggg ccgaggatta taagggcgtc aagaagttgc tcgacaggta ctacttgtct 180
ttcatcaacg acgttctcca cagcatcaag ctcaagaatc tcaacaacta tatctccctc 240
ttcaggaaaa aaactagaac cgagaaggag aacaaggagc tcgagaacct tgagatcaac 300
ttgagaaagg aaatagctaa ggcattcaag ggaaacgaag ggtacaaaag cctttttaag 360
aaggatatca tcgagaccat tctccccgag ttcctcgacg acaaggacga gatcgccctc 420
gttaattcct tcaacgggtt caccactgcc tttacagggt tcttcgacaa cagggagaac 480
atgttctctg aggaggccaa gtccacatct atcgccttta ggtgcattaa cgagaacctc 540
accaggtaca tttccaatat ggacatcttc gagaaggtgg acgccatctt cgacaagcac 600
gaggtgcaag agatcaagga gaagatactc aactccgact acgatgtgga ggatttcttc 660
gagggggagt tttttaactt cgttctcacc caagagggga ttgatgtcta caatgccatc 720
ataggggggt tcgtcacaga gagcggtgag aagatcaagg gtctcaacga gtatatcaac 780
ctttataacc aaaagacaaa gcaaaaactc cccaaattca agcccttgta caagcaggtc 840
ctctccgata gggagagtct ctccttctac ggggaggggt acacatccga cgaggaagtc 900
cttgaagtct tcaggaacac tttgaacaag aactccgaaa tcttcagctc cattaaaaag 960
ctcgagaagc tcttcaagaa cttcgacgag tattctagcg ccggtatctt cgttaagaac 1020
gggcctgcca tcagcaccat ctccaaggac atatttgggg agtggaatgt tattagggat 1080
aagtggaatg ccgagtatga cgacatccat ctcaagaaaa aggccgtggt taccgaaaag 1140
tatgaagacg acaggaggaa gagcttcaag aaaatcggat ccttctccct cgagcagctc 1200
caggagtatg ccgacgccga cctcagcgtg gttgaaaagc tcaaggagat catcatccag 1260
aaggtcgacg aaatctacaa ggtctacggc tcctcagaga agcttttcga tgccgacttt 1320
gtcctcgaaa agtcacttaa gaagaatgac gccgtcgttg ccatcatgaa agatctcctc 1380
gactcagtca agagcttcga gaattacatc aaagcatttt tcggcgaggg aaaggagacc 1440
aacagggacg agtccttcta cggcgatttt gtcctcgcct acgatatcct cctcaaggtg 1500
gaccacatct acgatgctat aaggaactac gtcactcaga agccatactc aaaggacaag 1560
ttcaagcttt attttcagaa cccccaattt atgggcgggt gggacaagga taaggagacc 1620
gattacaggg ccaccatcct cagatatggg agcaagtatt acttggccat catggacaag 1680
aaatatgcaa agtgtctcca gaagatcgac aaggacgacg tcaacggtaa ctacgaaaaa 1740
atcaactata agctcttgcc aggtccaaac aagatgctcc ccaaggtttt tttctcaaag 1800
aaatggatgg cctactacaa cccctccgag gacatccaga agatctacaa gaatggcacc 1860
tttaaaaagg gtgacatgtt taaccttaac gattgccata aactcatcga cttcttcaag 1920
gactccatca gcaggtaccc caagtggagc aacgcctacg acttcaattt ctcagagacc 1980
gaaaaatata aggacatcgc tggcttctac agagaagtcg aggagcaggg gtataaggtg 2040
agcttcgaaa gtgcttccaa gaaggaggtg gacaaactcg tggaggaagg taagctctac 2100
atgttccaga tctataacaa ggactttagc gacaagagcc acggcacccc aaatctccac 2160
acaatgtact tcaagttgct cttcgacgaa aataaccacg ggcaaatcag gctcagcggg 2220
ggggccgagc tcttcatgag gagggccagc ctcaagaagg aggaactcgt cgtgcacccc 2280
gccaatagcc ccattgccaa caagaacccc gacaacccca agaagaccac caccctctct 2340
tatgacgtct acaaggacaa aaggttctct gaggaccaat acgagttgca cattccaatc 2400
gccatcaaca agtgtcccaa gaacatcttc aagatcaaca cagaggtgag ggtcctcctc 2460
aagcacgacg acaaccccta cgtcatcgga atcgacagag gagagaggaa tctcctctac 2520
atcgtggtgg tcgacggtaa ggggaacata gtcgagcagt actcactcaa cgaaatcatt 2580
aacaacttca acggcatcag aatcaaaacc gattaccaca gcctcctcga caagaaggag 2640
aaggagaggt tcgaagccag gcagaattgg acaagcatcg agaacatcaa ggaactcaaa 2700
gccggttata ttagccaggt cgtgcataag atctgcgagc tcgttgagaa gtacgacgcc 2760
gtcatcgccc tcgaggattt gaactccggg ttcaagaact caagggtgaa ggtggagaaa 2820
caggtgtacc agaagttcga gaagatgctc atcgataagc tcaactatat ggtggacaag 2880
aagagcaatc catgcgccac cggcggagcc ctcaagggct accagataac caataagttt 2940
gaatccttca agagcatgag cacccaaaat ggttttatct tttacattcc cgcctggctc 3000
accagcaaga tcgatcccag caccgggttc gtcaatctcc tcaagaccaa gtacacctct 3060
atcgccgata gtaagaaatt catcagctcc ttcgatagga tcatgtacgt ccccgaggaa 3120
gacctctttg agttcgcact cgactacaaa aacttctcaa ggacagatgc cgactacatc 3180
aagaagtgga agctttattc atacgggaac aggatcagga tcttcagaaa ccccaagaag 3240
aacaacgtgt tcgactggga agaagtctgc ctcacttccg cctacaagga actcttcaac 3300
aaatacggca tcaactatca gcaaggcgat atcagggcct tgctctgcga gcaaagcgat 3360
aaggcctttt acagcagctt catggctctc atgtccctca tgctccagat gaggaactcc 3420
atcactggca ggaccgatgt cgactttctc atctcccccg tgaagaacag cgacggcatc 3480
ttttatgata gcaggaacta cgaggcccaa gagaatgcca tcttgcccaa gaacgccgac 3540
gctaatggtg cctataacat agccaggaag gtgttgtggg ccatcggaca gtttaagaag 3600
gccgaggatg agaaattgga taaggtgaag atagccatca gcaacaagga gtggcttgag 3660
tacgcacaga ccagtgttaa gcat 3684
<210> 80
<211> 3684
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 80
atgagcaagc tcgagaagtt taccaactgc tacagcctct ccaaaaccct cagattcaag 60
gccatacccg ttggaaagac acaggagaac atcgacaaca agagactcct cgtcgaggac 120
gagaagaggg ctgaagacta caagggagtc aaaaagcttc tcgacaggta ttacctcagc 180
ttcataaatg acgtcctcca cagcatcaag ctcaagaatc tcaacaatta catctcactc 240
tttaggaaga agaccaggac tgagaaagag aacaaggagc tcgagaactt ggagatcaac 300
ctcagaaaag agatcgccaa ggccttcaag ggcaacgaag gctacaagtc cctctttaag 360
aaggacatca tcgagaccat cctccccgag ttcttggacg acaaggacga gatcgcactc 420
gtgaattcct tcaacggctt taccactgcc tttaccggtt tcttcgacaa tagggaaaac 480
atgttcagcg aggaagccaa gtcaacaagt atcgccttca ggtgcatcaa cgagaatctc 540
actaggtata ttagcaatat ggacatcttc gagaaggtcg atgccatctt cgataaacac 600
gaggtccaag agatcaagga gaagattctc aactccgatt acgatgtgga ggacttcttc 660
gagggggagt ttttcaattt cgtcctcacc caggagggga tcgatgtgta caacgccatc 720
atcgggggct tcgtcaccga gtccggagag aagatcaagg gcctcaacga gtacatcaac 780
ctctataacc agaagaccaa acaaaagctc cccaagttta agccactcta caagcaggtt 840
ctctccgaca gggagagttt gagcttctac ggggaaggct acacctccga cgaggaagtt 900
ctcgaggtct ttaggaacac cttgaacaaa aattccgaga tcttcagcag catcaagaag 960
ttggagaagc tctttaagaa cttcgacgag tactcatccg caggaatatt cgttaagaat 1020
ggccctgcca tctccactat ctccaaggat atcttcgggg agtggaacgt tatcagggac 1080
aagtggaacg ccgagtacga cgacatacac ctcaagaaga aggcagtggt gaccgaaaag 1140
tacgaggacg acaggaggaa gtcatttaag aagatcgggt catttagcct cgagcaactc 1200
caggagtacg ccgacgccga tctctccgtc gttgagaagc tcaaggagat tatcattcaa 1260
aaggtcgacg agatctacaa ggtttatggg agttccgaaa agctcttcga cgccgacttt 1320
gtgctcgaga agtccttgaa gaagaacgac gccgtcgtcg ccattatgaa ggacctcctt 1380
gatagcgtca agtccttcga gaattacata aaagccttct ttggggaggg caaggagaca 1440
aacagggacg agagcttcta cggggatttt gttcttgctt acgacatctt gctcaaggtc 1500
gaccacatct acgacgccat caggaactac gtgacccaaa aaccttatag caaggacaag 1560
tttaagctct atttccagaa cccccaattc atgggcggat gggacaaaga caaggagacc 1620
gattacaggg ccaccatact taggtacggc tccaagtatt atttggccat catggacaag 1680
aagtacgcca aatgcttgca aaagatcgac aaggacgacg tgaacggcaa ctatgagaag 1740
atcaactaca agttgctccc cgggcctaac aagatgctcc ccaaggtttt cttctccaaa 1800
aagtggatgg cctactacaa ccccagtgag gacatccaaa agatatacaa gaacggtacc 1860
ttcaagaaag gcgatatgtt caacctcaat gactgtcaca agctcatcga ttttttcaag 1920
gacagcatct caagataccc caagtggagc aacgcttacg acttcaactt ctctgagaca 1980
gaaaagtata aggacatcgc cggcttctac agggaagttg aggagcaagg ctacaaggtc 2040
tcctttgagt ccgccagcaa gaaagaggtg gataaactcg tcgaggaggg gaaactttac 2100
atgttccaga tctataataa ggacttctca gacaagagcc atggcacccc taacctccac 2160
accatgtact tcaaactcct ctttgacgag aacaaccatg gccagatcag gttgagcggt 2220
ggagccgagc tcttcatgag gagggccagc ctcaagaagg aggaactcgt ggtgcacccc 2280
gccaactccc ccatagccaa taagaacccc gacaatccca agaagaccac caccctctct 2340
tacgacgtct acaaggacaa aaggttttct gaggaccagt acgagctcca tatccctatc 2400
gccatcaaca agtgccccaa gaatatcttc aaaatcaaca ccgaggtgag agtgcttctc 2460
aagcacgatg acaaccccta tgtgataggc atcgataggg gcgagaggaa cctcttgtat 2520
atagtcgtcg tcgacggcaa ggggaatatc gttgaacagt atagcctcaa cgagatcatc 2580
aataatttca atggcatcag gatcaagaca gactaccatt cccttctcga caagaaggag 2640
aaagaaaggt tcgaagccag gcagaattgg acctccattg agaacatcaa ggaacttaag 2700
gccggttaca tctcccaagt cgtgcacaaa atctgcgaac tcgtcgagaa gtacgatgcc 2760
gttatcgcat tggaggacct caactcaggg tttaagaact ccagggtgaa agtcgagaag 2820
caggtgtacc agaagttcga gaaaatgttg atcgacaagc tcaactacat ggtggacaaa 2880
aagagcaacc cctgcgccac cggtggtgca ctcaaggggt accaaatcac aaacaagttt 2940
gaatccttca agagcatgtc cacccagaac gggttcatct tctacatccc cgcttggctc 3000
acttccaaga tagacccctc aaccgggttt gtgaacctcc tcaaaaccaa gtacacctcc 3060
atagccgact ccaagaagtt catttcaagc ttcgacagaa ttatgtacgt gcctgaggag 3120
gatctcttcg agtttgccct cgactacaag aacttcagta ggaccgacgc cgattacatc 3180
aagaagtgga aactctacag ctacggcaac agaatcagaa tcttcagaaa tcctaagaaa 3240
aacaacgttt tcgactggga ggaggtctgc ctcacctccg catacaagga gctcttcaac 3300
aagtacggaa tcaattacca gcagggggac atcagggcac tcctctgcga gcagtccgac 3360
aaggctttct acagctcctt catggccctc atgtccctca tgctccagat gaggaattcg 3420
atcaccggga ggaccgacgt cgatttcctc atctcccccg tcaagaacag cgacgggatc 3480
ttctacgaca gcaggaatta cgaggcccag gagaatgcca tcctccccaa aaacgccgat 3540
gccaacggcg cctacaatat agccaggaag gttctctggg caataggcca attcaagaag 3600
gccgaggacg aaaaactcga caaagtgaag atcgccatat caaataagga gtggctcgag 3660
tacgcccaga ccagcgttaa gcac 3684
<210> 81
<211> 3684
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 81
atgtctaagc tcgagaagtt cacaaactgc tactcactca gcaaaacctt gaggttcaaa 60
gccatcccag tgggcaagac ccaggaaaac atcgataaca agaggctcct tgtcgaggat 120
gaaaaaaggg ccgaggacta caagggcgtc aagaagctcc tcgacaggta ctaccttagt 180
ttcatcaacg acgtcctcca cagtatcaag ctcaaaaacc ttaacaacta catctccctt 240
ttcagaaaaa agaccaggac cgaaaaggag aacaaggagc tcgaaaattt ggaaatcaat 300
ctcaggaagg aaatcgccaa ggccttcaag ggaaacgagg ggtacaagtc cctcttcaag 360
aaagacatca tagagaccat tctccccgag tttcttgatg acaaagacga gatcgctctc 420
gtgaactctt tcaatggctt caccactgcc ttcaccggct tctttgacaa cagagagaac 480
atgttctccg aagaggccaa gagcacaagc attgccttca gatgtatcaa cgagaacctc 540
acaaggtaca tcagtaacat ggacattttt gagaaggtcg atgccatctt cgataaacac 600
gaagttcaag agatcaagga gaagattctc aacagcgact acgacgtcga ggattttttc 660
gaaggtgagt tctttaactt cgtcctcact caggagggca tcgacgtcta taacgccatc 720
atcggtggct tcgtcaccga gtctggagaa aagatcaaag ggcttaacga gtatatcaat 780
ctttacaacc agaagaccaa gcagaaactc cccaaattca agcccttgta caagcaagtg 840
ctcagcgaca gggagagcct ttccttctac ggcgaaggct ataccagcga cgaggaggtg 900
ctcgaggtgt ttagaaacac cctcaacaag aactccgaga tattctcaag catcaagaag 960
ctcgagaagc tctttaagaa cttcgatgag tacagcagtg ccggtatctt cgttaaaaac 1020
ggacccgcca tctccaccat ttctaaggac atcttcggag agtggaacgt gatcagggat 1080
aagtggaacg ccgaatacga tgacatccac ttgaagaaga aggccgtcgt taccgagaag 1140
tacgaggacg acaggaggaa aagcttcaag aagatcggca gcttctcact cgagcaactc 1200
caggaatacg ccgatgccga cctcagcgtt gtggagaagc tcaaagagat aatcatccag 1260
aaggtcgacg aaatttataa ggtgtacggg tcatccgaga aactctttga cgccgacttc 1320
gtcctcgaga agtctctcaa gaagaacgac gcagtcgttg ccatcatgaa ggatctcctt 1380
gactccgtca agtcctttga gaattacatc aaggctttct ttggagaggg caaggagacc 1440
aacagggacg agtcctttta cggcgacttc gtccttgcct acgacattct cctcaaagtc 1500
gatcacatct acgacgcaat caggaactac gtcacccaga aaccttactc caaggacaag 1560
ttcaaactct atttccagaa cccccagttc atgggtggct gggataaaga caaggagacc 1620
gactataggg ccaccatcct caggtacggt tccaaatact accttgccat catggacaag 1680
aagtatgcca agtgcctcca gaaaatcgac aaggacgacg ttaatggtaa ttacgagaag 1740
atcaattaca agctcttgcc cggtcccaat aagatgcttc ccaaggtgtt tttcagcaag 1800
aaatggatgg cttactataa tcccagtgag gatatccaaa agatctacaa gaacgggacc 1860
ttcaaaaaag gagacatgtt caacctcaat gattgccata aactcatcga tttcttcaag 1920
gacagcataa gcaggtatcc caagtggtcc aacgcctacg acttcaactt cagcgaaaca 1980
gagaagtaca aggacattgc cggattctat agagaggtgg aagaacaggg ctacaaggtg 2040
agcttcgagt ccgcctccaa gaaggaggtg gacaagctcg ttgaggaggg caagctttac 2100
atgttccaga tctacaacaa ggatttttct gacaaaagcc acggaactcc caacctccac 2160
accatgtact tcaagctcct cttcgatgag aacaaccacg gtcagatcag gcttagcggg 2220
ggcgccgagt tgtttatgag aagggccagc ctcaaaaagg aggagctcgt ggtgcacccc 2280
gccaactcac ccatcgccaa caaaaacccc gacaacccca agaagaccac caccctctct 2340
tacgacgttt acaaagacaa gaggttcagc gaggatcaat atgagctcca tatcccaatc 2400
gccataaata agtgcccaaa gaacatcttt aaaatcaaca ccgaagttag ggtgctcctc 2460
aagcacgatg acaaccccta cgtcatcggc attgacaggg gcgagagaaa tctcttgtac 2520
atagttgttg tggatggcaa gggcaatatc gttgagcagt attccctcaa cgagatcata 2580
aacaacttca acggcatcag gatcaagaca gactaccact ccctccttga caagaaggag 2640
aaagagaggt ttgaggccag gcaaaactgg acttccattg agaacatcaa ggagctcaag 2700
gccggttaca tctcccaagt ggtgcacaaa atctgcgagc tcgtggagaa gtacgacgcc 2760
gtcatagccc tcgaggacct caactctgga ttcaagaaca gcagagttaa ggtggaaaag 2820
caagtctatc agaagtttga gaaaatgctc atcgacaaac tcaactacat ggttgacaaa 2880
aagagcaacc cctgcgcaac cgggggggcc ttgaagggtt accagatcac caacaagttc 2940
gagtctttca aaagcatgag cacccagaat gggttcattt tctacatccc cgcctggctc 3000
accagtaaga tcgacccctc caccggcttt gtgaacttgc tcaagacaaa atacaccagc 3060
atcgccgaca gtaagaaatt catatcaagc ttcgatagga tcatgtacgt gcccgaggaa 3120
gacctcttcg agttcgccct cgactacaaa aacttcagca gaaccgatgc cgactacata 3180
aagaagtgga aactctattc atacggaaat agaataagga tcttcaggaa ccccaagaaa 3240
aacaacgttt tcgactggga ggaggtctgc ctcaccagcg cttacaaaga actctttaac 3300
aaatatggga tcaactatca gcagggggac ataagggcct tgctttgcga gcagtcagat 3360
aaggccttct actccagctt catggccctc atgagcctca tgctccagat gagaaactcc 3420
atcaccggga gaaccgatgt ggatttcctt atctcccccg tgaagaacag tgatggaatc 3480
ttctatgaca gcagaaacta cgaggcacaa gagaacgcca tcctcccaaa gaacgctgac 3540
gccaacgggg cctataacat tgccaggaag gttctctggg ccatcgggca gtttaagaaa 3600
gccgaagatg agaagctcga caaggtcaag atcgccatct ccaataaaga atggctcgag 3660
tacgctcaga cctccgtgaa gcac 3684
<210> 82
<211> 3684
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 82
atgtctaagc tcgagaagtt caccaattgc tattccctta gtaagacctt gaggttcaag 60
gctatcccag tcggcaagac ccaggagaac atagataaca aaaggctcct cgtggaggac 120
gagaagaggg ccgaggacta caagggagtc aagaagcttc tcgacaggta ctatctctcc 180
tttatcaacg atgtgctcca ctccatcaag ctcaagaacc tcaataacta catctctctc 240
tttaggaaga agaccaggac agagaaggag aacaaggaac tcgagaacct cgagataaac 300
cttaggaagg agatcgccaa agcattcaaa gggaacgaag gctacaagag cctctttaag 360
aaagacatca ttgaaacaat cctccccgag ttcctcgacg ataaggatga gatcgctttg 420
gtgaactcct ttaacggttt taccaccgcc ttcaccggct tcttcgacaa cagggagaat 480
atgttctccg aggaagccaa aagcacctct atcgcattca gatgtatcaa cgaaaacctc 540
accaggtaca tcagcaacat ggacatattt gagaaggttg acgccatctt tgataagcat 600
gaagtgcaag aaataaagga gaagatcctc aatagcgact acgatgtcga agatttcttt 660
gaaggcgagt tcttcaactt cgtgctcacc caagagggca tcgatgttta caacgccata 720
atcggaggat tcgtcaccga atccggggag aagattaagg ggctcaacga gtacatcaac 780
ctctacaacc agaaaaccaa gcagaaactc cccaagttca agcccctcta caagcaggtg 840
ctcagcgaca gagagtcctt gagcttctac ggggaaggct acaccagtga cgaagaggta 900
cttgaagttt tcaggaacac cctcaacaag aatagcgaga tcttttccag catcaaaaag 960
ctcgagaaac tcttcaagaa cttcgatgag tactcctccg ccggtatctt cgtgaagaac 1020
ggaccagcca tcagcaccat cagcaaagac atcttcggag agtggaatgt gatcagggac 1080
aagtggaacg ccgagtacga cgacatccat ctcaagaaaa aggccgtggt caccgagaag 1140
tacgaggacg acaggaggaa gagcttcaag aagatcggga gttttagcct cgagcagctt 1200
caggagtacg ccgacgctga cctttccgtc gtggaaaaac ttaaggagat aataatccag 1260
aaggttgatg agatctacaa agtttacggg tcctcagaga agctctttga cgccgacttc 1320
gtgcttgaga agtccctcaa gaagaacgac gctgtggtgg ccatcatgaa ggacctcctc 1380
gacagtgtta agtccttcga gaactacatc aaggccttct tcggcgaggg caaggagacc 1440
aacagagacg agtccttcta tggtgacttc gtgctcgcat acgacatcct tctcaaagtg 1500
gaccatatct acgacgccat caggaactac gttacccaga agccctactc caaggacaaa 1560
tttaagcttt atttccagaa tccccaattc atgggaggat gggacaagga caaggagacc 1620
gactacaggg caaccattct caggtacggg agcaagtact atctcgccat catggataag 1680
aagtacgcca aatgtctcca gaagatcgac aaggacgatg ttaacggcaa ttacgaaaag 1740
atcaattaca agttgctccc tgggcccaat aagatgctcc ccaaagtgtt cttctccaag 1800
aagtggatgg catactacaa cccttccgag gacatccaga agatttacaa gaatggcacc 1860
ttcaaaaagg gggacatgtt caatctcaat gattgccata aactcattga cttttttaaa 1920
gacagcatct ccagataccc caaatggagc aacgcctacg acttcaactt ctccgagacc 1980
gaaaagtaca aggacatcgc cgggttctac agggaagttg aggagcaggg ctacaaggtc 2040
tccttcgagt ctgcctccaa gaaggaagtt gacaagttgg tggaggaagg gaaactctac 2100
atgttccaga tctacaataa ggacttctcc gacaagtccc acggcacccc caaccttcac 2160
accatgtact tcaagcttct ctttgatgag aacaaccatg gtcaaatcag attgagcggc 2220
ggagccgagc tcttcatgag gagagctagc ctcaagaaag aggagctcgt cgttcacccc 2280
gccaactccc ccatcgccaa caagaacccc gacaatccca agaagaccac caccctctca 2340
tatgacgtgt acaaggacaa gaggttcagc gaagaccagt acgaactcca catccccata 2400
gccatcaata agtgtcccaa gaacatcttc aagataaaca ccgaggtgag ggtcctcctc 2460
aaacacgatg ataatcctta cgtcatcggc atcgacaggg gggagaggaa ccttctctac 2520
atcgttgtcg tcgacgggaa ggggaatatc gttgagcagt attccctcaa tgagataata 2580
aataacttca acggcattag gataaaaacc gactaccaca gcctcctcga caagaaagaa 2640
aaggagaggt tcgaggccag acagaactgg accagcatcg agaacatcaa ggagctcaag 2700
gccggctaca tttcccaggt cgttcacaag atttgcgagc tcgtcgaaaa atacgacgcc 2760
gtgatcgccc tcgaggacct caactcaggg ttcaaaaaca gcagggttaa ggtggaaaag 2820
caagtttacc agaagtttga aaagatgctc atcgacaaac tcaactacat ggtggacaag 2880
aagagcaacc cctgcgccac cggtggcgcc ctcaagggat accagattac caacaagttc 2940
gaatccttca aatctatgag cacacaaaat ggcttcatat tctacatccc cgcctggctc 3000
acctcaaaaa tcgaccccag caccggtttc gtgaatctcc tcaagacaaa gtacacctct 3060
atcgcagact ccaagaaatt tataagcagc ttcgatagaa tcatgtatgt ccccgaggag 3120
gatctcttcg aatttgctct cgattacaag aacttcagca ggaccgacgc cgattatatc 3180
aagaagtgga agctctactc ctacggcaac agaatcagga tcttcaggaa ccccaagaag 3240
aacaatgtct tcgactggga ggaggtgtgc cttacctccg catacaagga gctcttcaac 3300
aagtacggca ttaactacca gcagggcgac atcagggccc tcctctgcga gcaatccgac 3360
aaggccttct actcctcctt catggctctc atgagcttga tgctccagat gaggaattcc 3420
attaccggca ggactgacgt cgatttcctc atatcacccg tcaagaatag cgatgggatc 3480
ttctacgact caaggaatta cgaggcccag gagaacgcca ttctcccaaa gaacgctgac 3540
gccaacggcg cctacaacat cgctaggaaa gtgctctggg ccatcggtca gttcaagaaa 3600
gccgaggacg agaagctcga caaggttaag atcgccatca gtaacaagga gtggctcgaa 3660
tatgcccaga ccagcgttaa gcac 3684
<210> 83
<211> 3684
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 83
atgtccaagc tcgaaaagtt caccaactgc tactccctct ccaaaaccct taggtttaag 60
gccatcccag tgggaaagac ccaggagaac atcgataata aaaggctcct cgtggaggac 120
gagaaaaggg ctgaggacta caagggggtg aagaaattgc tcgacaggta ttatttgagc 180
ttcatcaacg atgttcttca cagcatcaag ctcaagaatc tcaacaacta catctccctt 240
ttcaggaaga agaccaggac cgagaaggag aacaaggagc ttgagaacct cgagatcaac 300
ctcagaaaag agatcgccaa ggccttcaag ggcaacgagg ggtacaagtc cttgtttaag 360
aaggacatca tcgagaccat ccttcccgag ttcctcgacg acaaagatga gatcgccctc 420
gttaactcat tcaacggttt tactactgcc ttcaccgggt tttttgacaa tagggagaac 480
atgttctctg aggaggccaa gagtacctcc atcgcattca gatgcattaa tgagaatctt 540
actagataca tatccaatat ggacatattc gagaaggtgg acgccatctt tgacaagcac 600
gaggtccagg agatcaagga gaagattctt aactctgatt acgacgtcga ggacttcttc 660
gagggagagt tcttcaattt tgtgctcact caggagggca tcgacgtgta taacgccatt 720
atagggggtt ttgtcacaga gtctggagaa aagatcaagg gcctcaacga atacatcaac 780
ctctataacc aaaagaccaa acagaaactt cccaaattta agcccctcta caagcaggtc 840
ctctccgaca gggaatccct ttccttttac ggggagggct acacctccga cgaagaggtg 900
ctcgaggtgt tcaggaatac cttgaacaag aacagcgaga ttttcagctc cataaaaaag 960
ctcgaaaagc tcttcaagaa ctttgacgag tactcctccg ccggaatctt cgtcaagaac 1020
gggccagcca tctccaccat cagtaaagac attttcgggg agtggaacgt tatcagggac 1080
aagtggaacg ccgaatacga cgacatccac ctcaagaaga aagccgtggt caccgagaag 1140
tacgaggacg acaggagaaa gtcttttaag aagatcggga gcttcagcct tgagcagctc 1200
caagagtacg ccgacgccga cctcagcgtg gtcgaaaagc ttaaggaaat catcatccag 1260
aaggttgacg agatctataa ggtgtacggc agcagcgaga agctcttcga cgctgatttt 1320
gtgctcgaga aaagcctcaa gaagaacgac gccgttgtgg ccatcatgaa ggatctcctc 1380
gactcagtca aaagcttcga gaactacatc aaggcatttt tcggcgaggg gaaggagacc 1440
aacagagacg aaagctttta cggggacttt gtcctcgcct acgacatcct ccttaaggtc 1500
gaccacatct acgacgccat cagaaactac gtcacccaga agccatactc caaggataag 1560
ttcaagctct actttcaaaa tccccaattc atggggggct gggacaagga caaggaaaca 1620
gactacaggg ccaccatcct caggtatggg tctaagtact acctcgccat catggataag 1680
aagtacgcca aatgtcttca gaagatcgac aaggacgacg ttaacgggaa ctacgagaag 1740
atcaactaca agctcctccc tggcccaaat aagatgctcc ccaaggtctt cttctctaaa 1800
aagtggatgg cctactacaa cccctccgaa gacatccaga aaatctacaa gaacgggacc 1860
ttcaagaaag gggacatgtt caaccttaac gactgccata aactcatcga cttttttaag 1920
gacagtatct ccaggtaccc caagtggagt aacgcctatg acttcaattt cagcgagaca 1980
gagaaataca aggatatagc cgggttttac agggaggtgg aggagcaggg ctacaaagtc 2040
agcttcgaat ccgcaagcaa gaaggaggtg gacaaactcg tggaggaagg gaagttgtat 2100
atgttccaga tttacaacaa ggacttcagc gataagagcc acgggacccc taaccttcac 2160
accatgtact tcaagctcct cttcgacgag aacaatcacg gccaaatcag gctcagcggc 2220
ggagctgagc tcttcatgag gagggcctct ctcaaaaagg aggagcttgt tgtccaccct 2280
gccaactctc ccatcgccaa caagaatccc gacaacccca aaaagaccac caccttgagc 2340
tacgatgtgt acaaggacaa gaggttttcc gaggaccagt acgagctcca tatccctatc 2400
gctatcaaca agtgccccaa gaacatcttc aagatcaaca ccgaggtcag ggtccttctc 2460
aagcatgacg acaatcccta cgtcatagga atcgatagag gcgagaggaa cctcctctac 2520
atcgtggtgg tggacggaaa aggcaacatc gtcgagcagt actcactcaa cgagatcatc 2580
aacaacttca acggaatcag gatcaagacc gattaccaca gcctcctcga caagaaggaa 2640
aaagaaaggt tcgaggccag acaaaactgg accagcatcg agaacatcaa ggagcttaaa 2700
gccggctata tctctcaggt cgtccacaag atctgcgagc tcgttgagaa gtatgacgcc 2760
gtgattgctt tggaggacct caactccggt ttcaagaaca gtagagtcaa ggtcgagaag 2820
caggtttacc agaagttcga gaagatgctt attgacaaac ttaactacat ggtcgacaag 2880
aagagcaacc cctgcgcaac cgggggagcc ctcaagggtt accagatcac aaacaagttc 2940
gagagcttca agagcatgag cacccagaat ggcttcatct tttatattcc cgcctggctc 3000
accagtaaaa tcgacccctc tacaggtttc gtcaacctct tgaagactaa atataccagc 3060
atcgcagaca gcaaaaagtt tatcagcagc tttgatagga tcatgtacgt gcccgaggaa 3120
gacttgttcg aatttgccct cgattacaag aatttcagca ggaccgatgc agactacatc 3180
aagaagtgga aactctatag ttacggcaat aggatcagaa tcttcagaaa cccaaagaaa 3240
aacaacgtct tcgactggga ggaggtctgc ctcacatctg cctataagga gctcttcaat 3300
aagtacggaa tcaattatca gcagggcgac ataagggccc tcctctgcga gcagagcgac 3360
aaagccttct attctagctt catggccctc atgtccctta tgctccagat gaggaatagc 3420
atcactggga ggaccgacgt ggactttctc atctcccctg tcaagaattc agacgggatt 3480
ttctacgatt ccaggaatta cgaggctcag gaaaatgcca tccttcctaa aaacgcagat 3540
gccaacggcg cctacaacat cgccagaaag gtgctttggg ctatcggtca attcaagaaa 3600
gccgaggacg agaagctcga caaggtcaag atcgctatca gcaacaagga gtggctcgag 3660
tatgcccaaa ccagtgtcaa gcac 3684
<210> 84
<211> 3684
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 84
atgagcaagc ttgagaagtt taccaattgc tactctctca gcaaaaccct caggttcaag 60
gccatccccg tcgggaagac ccaggagaac attgacaaca agagactctt ggttgaagat 120
gaaaaaaggg ccgaggacta taagggggtg aagaaactcc tcgacaggta ctaccttagc 180
tttatcaacg acgtgctcca cagcataaaa ttgaaaaatc tcaacaatta catcagcctc 240
ttcaggaaga agaccaggac cgaaaaagag aacaaagagc tcgagaacct cgagatcaac 300
ctcaggaagg aaatagcaaa agccttcaaa ggcaatgagg gctataagag cctcttcaag 360
aaggacatta tcgagaccat cctccccgag ttcctcgacg acaaggacga gatcgccctc 420
gtgaacagct tcaacggttt caccaccgcc ttcaccggct tctttgacaa cagggagaat 480
atgttcagcg aggaggccaa gtctacatca atcgccttta ggtgcataaa cgagaacctc 540
accaggtaca tcagcaacat ggacatcttc gagaaggtgg acgccatctt cgataagcac 600
gaggtgcagg agatcaagga gaagatcctc aacagcgact acgacgtcga agactttttc 660
gaaggggaat ttttcaactt cgttctgacc caggagggca tcgacgtgta caacgccatt 720
atcgggggtt tcgtgaccga gtctggcgag aagatcaagg ggctcaatga gtacatcaac 780
ctttataacc agaagaccaa gcaaaagctc cccaagttta aaccccttta caagcaggtg 840
ttgagcgata gggagagcct ctccttctat ggggagggct atactagcga cgaggaagtt 900
ctcgaagtgt ttaggaacac ccttaacaag aattctgaga tcttcagctc catcaagaag 960
ctcgagaagt tgttcaagaa tttcgatgaa tactcctccg ccggcatctt tgtcaagaat 1020
ggccccgcca tctccacaat ctccaaggac atctttggtg agtggaacgt tattagggac 1080
aaatggaacg ccgagtacga tgacatccac ctcaagaaga aggctgtggt gaccgaaaag 1140
tacgaggacg acagaaggaa gtccttcaag aagatcggct cctttagcct cgaacagctt 1200
caggaatacg ccgatgccga cctctccgtc gttgaaaaac tcaaagagat tattatccag 1260
aaggttgacg agatctacaa ggtttacggg tcctcagaga agctttttga cgccgatttt 1320
gttctcgaga agagcctcaa gaagaacgat gccgttgtcg ccatcatgaa agatctcctc 1380
gatagcgtca agagcttcga aaactacatc aaggccttct ttggcgaggg gaaagaaacc 1440
aacagggacg agtcatttta tggggacttc gtcctcgcct acgacatcct tctcaaagtt 1500
gaccatatct atgatgccat caggaactac gttacccaga agccctacag caaggacaaa 1560
ttcaaactct acttccagaa cccccaattt atggggggct gggacaagga caaggaaacc 1620
gactacaggg ccactatcct taggtacggg agcaaatatt atctcgcaat catggataag 1680
aagtacgcca agtgtctcca aaagatcgac aaagacgacg tgaatgggaa ttacgagaag 1740
atcaattaca agctcctccc cggtccaaac aagatgctcc ccaaggtgtt tttctcaaag 1800
aagtggatgg cctactacaa cccctccgag gatatccaaa agatctacaa aaacggcact 1860
tttaagaagg gggatatgtt caatctcaac gactgccaca agctcatcga ctttttcaag 1920
gacagcatta gcaggtaccc caagtggagc aacgcatatg atttcaattt tagcgagacc 1980
gaaaagtaca aggacatcgc tggcttctac agggaggtcg aagagcaggg gtacaaggtc 2040
tctttcgagt ccgcctccaa gaaggaggtg gacaaattgg tggaggaggg gaaactttac 2100
atgtttcaga tctacaacaa ggatttctcc gacaagtccc atggcacccc caacctccac 2160
accatgtact tcaagcttct cttcgacgag aataatcacg ggcaaatcag gctttccggc 2220
ggcgcagagc tcttcatgag gagagcctcc ctcaagaaag aggagctcgt tgtgcacccc 2280
gccaactccc ccatcgccaa caagaaccca gataatccca agaagaccac caccctcagc 2340
tacgacgtct acaaggacaa aaggttctca gaggaccagt atgagctcca catccctatc 2400
gccatcaaca agtgccccaa aaatatcttt aagataaaca ccgaggtgag agtgttgttg 2460
aaacacgacg ataaccccta cgtcattgga atcgacaggg gggagaggaa cctcctctac 2520
atagtcgtgg tcgacggcaa aggtaacatc gttgaacaat actccctcaa tgagatcatt 2580
aacaacttca atgggatcag gataaagacc gactatcaca gcctccttga caagaaggag 2640
aaagagaggt ttgaagctag acaaaactgg acttccatag agaacatcaa agagctcaag 2700
gccggctaca tctcccaagt ggtgcacaaa atctgtgagc tcgtcgagaa gtacgacgcc 2760
gtcatcgccc tcgaggacct caactccggg ttcaagaaca gtagagtcaa ggttgagaag 2820
caagtctacc agaagtttga gaagatgctt atcgataagc ttaactacat ggttgataag 2880
aaatctaacc cctgcgctac tggcggcgcc ttgaaggggt accaaatcac caacaagttc 2940
gagagcttca agagcatgtc cactcagaac ggattcatct tctacatacc cgcctggctc 3000
acttccaaaa tagaccccag caccggtttc gttaacctct tgaagaccaa gtatacctca 3060
atcgcagaca gtaagaagtt catttccagc ttcgatagaa taatgtacgt ccccgaggag 3120
gacttgttcg agtttgcttt ggactataag aacttctcaa ggacagacgc cgactacatc 3180
aaaaagtgga agctctacag ctatgggaac aggatcagga tatttaggaa ccccaagaaa 3240
aacaacgtct ttgattggga agaggtgtgt ttgaccagcg catacaagga gcttttcaac 3300
aagtacggga tcaactatca gcagggcgac atcagggccc tcctttgcga acaatccgac 3360
aaggcctttt attccagttt catggccctc atgagtttga tgttgcagat gaggaactcc 3420
atcactggta ggaccgacgt ggacttcttg atcagccccg tgaaaaactc cgacggaatc 3480
ttctacgaca gcagaaacta tgaggcccag gagaatgcaa ttctccccaa gaacgccgac 3540
gccaatgggg catataacat cgccaggaag gtgctttggg ccataggcca atttaaaaag 3600
gcagaagacg agaaacttga caaggtgaag atcgccataa gcaataaaga gtggctcgaa 3660
tacgctcaga cctcagtcaa acac 3684
<210> 85
<211> 1263
<212> PRT
<213> Unknown
<220>
<223> Bacterial
<400> 85
Met Asn Asn Gly Thr Asn Asn Phe Gln Asn Phe Ile Gly Ile Ser Ser
1 5 10 15
Leu Gln Lys Thr Leu Arg Asn Ala Leu Ile Pro Thr Glu Thr Thr Gln
20 25 30
Gln Phe Ile Val Lys Asn Gly Ile Ile Lys Glu Asp Glu Leu Arg Gly
35 40 45
Glu Asn Arg Gln Ile Leu Lys Asp Ile Met Asp Asp Tyr Tyr Arg Gly
50 55 60
Phe Ile Ser Glu Thr Leu Ser Ser Ile Asp Asp Ile Asp Trp Thr Ser
65 70 75 80
Leu Phe Glu Lys Met Glu Ile Gln Leu Lys Asn Gly Asp Asn Lys Asp
85 90 95
Thr Leu Ile Lys Glu Gln Thr Glu Tyr Arg Lys Ala Ile His Lys Lys
100 105 110
Phe Ala Asn Asp Asp Arg Phe Lys Asn Met Phe Ser Ala Lys Leu Ile
115 120 125
Ser Asp Ile Leu Pro Glu Phe Val Ile His Asn Asn Asn Tyr Ser Ala
130 135 140
Ser Glu Lys Glu Glu Lys Thr Gln Val Ile Lys Leu Phe Ser Arg Phe
145 150 155 160
Ala Thr Ser Phe Lys Asp Tyr Phe Lys Asn Arg Ala Asn Cys Phe Ser
165 170 175
Ala Asp Asp Ile Ser Ser Ser Ser Cys His Arg Ile Val Asn Asp Asn
180 185 190
Ala Glu Ile Phe Phe Ser Asn Ala Leu Val Tyr Arg Arg Ile Val Lys
195 200 205
Ser Leu Ser Asn Asp Asp Ile Asn Lys Ile Ser Gly Asp Met Lys Asp
210 215 220
Ser Leu Lys Glu Met Ser Leu Glu Glu Ile Tyr Ser Tyr Glu Lys Tyr
225 230 235 240
Gly Glu Phe Ile Thr Gln Glu Gly Ile Ser Phe Tyr Asn Asp Ile Cys
245 250 255
Gly Lys Val Asn Ser Phe Met Asn Leu Tyr Cys Gln Lys Asn Lys Glu
260 265 270
Asn Lys Asn Leu Tyr Lys Leu Gln Lys Leu His Lys Gln Ile Leu Cys
275 280 285
Ile Ala Asp Thr Ser Tyr Glu Val Pro Tyr Lys Phe Glu Ser Asp Glu
290 295 300
Glu Val Tyr Gln Ser Val Asn Gly Phe Leu Asp Asn Ile Ser Ser Lys
305 310 315 320
His Ile Val Glu Arg Leu Arg Lys Ile Gly Asp Asn Tyr Asn Gly Tyr
325 330 335
Asn Leu Asp Lys Ile Tyr Ile Val Ser Lys Phe Tyr Glu Ser Val Ser
340 345 350
Gln Lys Thr Tyr Arg Asp Trp Glu Thr Ile Asn Thr Ala Leu Glu Ile
355 360 365
His Tyr Asn Asn Ile Leu Pro Gly Asn Gly Lys Ser Lys Ala Asp Lys
370 375 380
Val Lys Lys Ala Val Lys Asn Asp Leu Gln Lys Ser Ile Thr Glu Ile
385 390 395 400
Asn Glu Leu Val Ser Asn Tyr Lys Leu Cys Ser Asp Asp Asn Ile Lys
405 410 415
Ala Glu Thr Tyr Ile His Glu Ile Ser His Ile Leu Asn Asn Phe Glu
420 425 430
Ala Gln Glu Leu Lys Tyr Asn Pro Glu Ile His Leu Val Glu Ser Glu
435 440 445
Leu Lys Ala Ser Glu Leu Lys Asn Val Leu Asp Val Ile Met Asn Ala
450 455 460
Phe His Trp Cys Ser Val Phe Met Thr Glu Glu Leu Val Asp Lys Asp
465 470 475 480
Asn Asn Phe Tyr Ala Glu Leu Glu Glu Ile Tyr Asp Glu Ile Tyr Pro
485 490 495
Val Ile Ser Leu Tyr Asn Leu Val Arg Asn Tyr Val Thr Gln Lys Pro
500 505 510
Tyr Ser Thr Lys Lys Ile Lys Leu Asn Phe Gly Ile Pro Thr Leu Ala
515 520 525
Asp Gly Trp Ser Lys Ser Lys Glu Tyr Ser Asn Asn Ala Ile Ile Leu
530 535 540
Met Arg Asp Asn Leu Tyr Tyr Leu Gly Ile Phe Asn Ala Lys Asn Lys
545 550 555 560
Pro Asp Lys Lys Ile Ile Glu Gly Asn Thr Ser Glu Asn Lys Gly Asp
565 570 575
Tyr Lys Lys Met Ile Tyr Asn Leu Leu Pro Gly Pro Asn Lys Met Ile
580 585 590
Pro Lys Val Phe Leu Ser Ser Lys Thr Gly Val Glu Thr Tyr Lys Pro
595 600 605
Ser Ala Tyr Ile Leu Glu Gly Tyr Lys Gln Asn Lys His Ile Lys Ser
610 615 620
Ser Lys Asp Phe Asp Ile Thr Phe Cys His Asp Leu Ile Asp Tyr Phe
625 630 635 640
Lys Asn Cys Ile Ala Ile His Pro Glu Trp Lys Asn Phe Gly Phe Asp
645 650 655
Phe Ser Asp Thr Ser Thr Tyr Glu Asp Ile Ser Gly Phe Tyr Arg Glu
660 665 670
Val Glu Leu Gln Gly Tyr Lys Ile Asp Trp Thr Tyr Ile Ser Glu Lys
675 680 685
Asp Ile Asp Leu Leu Gln Glu Lys Gly Gln Leu Tyr Leu Phe Gln Ile
690 695 700
Tyr Asn Lys Asp Phe Ser Lys Lys Ser Thr Gly Asn Asp Asn Leu His
705 710 715 720
Thr Met Tyr Leu Lys Asn Leu Phe Ser Glu Glu Asn Leu Lys Asp Ile
725 730 735
Val Leu Lys Leu Asn Gly Glu Ala Glu Ile Phe Phe Arg Lys Ser Ser
740 745 750
Ile Lys Asn Pro Ile Ile His Lys Lys Gly Ser Ile Leu Val Asn Arg
755 760 765
Thr Tyr Glu Ala Glu Glu Lys Asp Gln Phe Gly Asn Ile Gln Ile Val
770 775 780
Arg Lys Asn Ile Pro Glu Asn Ile Tyr Gln Glu Leu Tyr Lys Tyr Phe
785 790 795 800
Asn Asp Lys Ser Asp Lys Glu Leu Ser Asp Glu Ala Ala Lys Leu Lys
805 810 815
Asn Val Val Gly His His Glu Ala Ala Thr Asn Ile Val Lys Asp Tyr
820 825 830
Arg Tyr Thr Tyr Asp Lys Tyr Phe Leu His Met Pro Ile Thr Ile Asn
835 840 845
Phe Lys Ala Asn Lys Thr Gly Phe Ile Asn Asp Arg Ile Leu Gln Tyr
850 855 860
Ile Ala Lys Glu Lys Asp Leu His Val Ile Gly Ile Asp Arg Gly Glu
865 870 875 880
Arg Asn Leu Ile Tyr Val Ser Val Ile Asp Thr Cys Gly Asn Ile Val
885 890 895
Glu Gln Lys Ser Phe Asn Ile Val Asn Gly Tyr Asp Tyr Gln Ile Lys
900 905 910
Leu Lys Gln Gln Glu Gly Ala Arg Gln Ile Ala Arg Lys Glu Trp Lys
915 920 925
Glu Ile Gly Lys Ile Lys Glu Ile Lys Glu Gly Tyr Leu Ser Leu Val
930 935 940
Ile His Glu Ile Ser Lys Met Val Ile Lys Tyr Asn Ala Ile Ile Ala
945 950 955 960
Met Glu Asp Leu Ser Tyr Gly Phe Lys Lys Gly Arg Phe Lys Val Glu
965 970 975
Arg Gln Val Tyr Gln Lys Phe Glu Thr Met Leu Ile Asn Lys Leu Asn
980 985 990
Tyr Leu Val Phe Lys Asp Ile Ser Ile Thr Glu Asn Gly Gly Leu Leu
995 1000 1005
Lys Gly Tyr Gln Leu Thr Tyr Ile Pro Asp Lys Leu Lys Asn Val
1010 1015 1020
Gly His Gln Cys Gly Cys Ile Phe Tyr Val Pro Ala Ala Tyr Thr
1025 1030 1035
Ser Lys Ile Asp Pro Thr Thr Gly Phe Val Asn Ile Phe Lys Phe
1040 1045 1050
Lys Asp Leu Thr Val Asp Ala Lys Arg Glu Phe Ile Lys Lys Phe
1055 1060 1065
Asp Ser Ile Arg Tyr Asp Ser Glu Lys Asn Leu Phe Cys Phe Thr
1070 1075 1080
Phe Asp Tyr Asn Asn Phe Ile Thr Gln Asn Thr Val Met Ser Lys
1085 1090 1095
Ser Ser Trp Ser Val Tyr Thr Tyr Gly Val Arg Ile Lys Arg Arg
1100 1105 1110
Phe Val Asn Gly Arg Phe Ser Asn Glu Ser Asp Thr Ile Asp Ile
1115 1120 1125
Thr Lys Asp Met Glu Lys Thr Leu Glu Met Thr Asp Ile Asn Trp
1130 1135 1140
Arg Asp Gly His Asp Leu Arg Gln Asp Ile Ile Asp Tyr Glu Ile
1145 1150 1155
Val Gln His Ile Phe Glu Ile Phe Arg Leu Thr Val Gln Met Arg
1160 1165 1170
Asn Ser Leu Ser Glu Leu Glu Asp Arg Asp Tyr Asp Arg Leu Ile
1175 1180 1185
Ser Pro Val Leu Asn Glu Asn Asn Ile Phe Tyr Asp Ser Ala Lys
1190 1195 1200
Ala Gly Asp Ala Leu Pro Lys Asp Ala Asp Ala Asn Gly Ala Tyr
1205 1210 1215
Cys Ile Ala Leu Lys Gly Leu Tyr Glu Ile Lys Gln Ile Thr Glu
1220 1225 1230
Asn Trp Lys Glu Asp Gly Lys Phe Ser Arg Asp Lys Leu Lys Ile
1235 1240 1245
Ser Asn Lys Asp Trp Phe Asp Phe Ile Gln Asn Lys Arg Tyr Leu
1250 1255 1260
<210> 86
<211> 3789
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 86
atgaacaacg gaacaaataa cttccaaaac ttcattggca tcagctcact ccagaagacc 60
ttgagaaatg cacttattcc tacagagact acacaacaat ttatcgtgaa gaatggaatc 120
attaaggagg acgaactccg cggagaaaac cgccaaattc ttaaagacat aatggatgat 180
tactatagag gatttatttc cgagacactg tcaagcattg atgatatcga ttggacttca 240
ctttttgaga agatggaaat tcagctgaag aatggcgata acaaagatac tcttattaag 300
gaacaaacag agtatcgcaa agcaatacac aaaaagtttg ctaatgacga tagattcaag 360
aatatgttct cagcgaagct gatttccgat attctccccg agtttgtgat tcacaacaat 420
aattactcgg cttctgagaa agaggaaaaa actcaggtta taaagttatt ctcacgcttt 480
gcaactagct ttaaagatta ttttaaaaac agagccaact gtttcagtgc ggatgatatt 540
tcctcatcat cctgtcaccg gattgttaac gataatgcgg aaatcttttt cagcaacgct 600
ctagtttaca gacgcattgt caaatcgctg tcgaacgatg atattaacaa gatcagcggc 660
gatatgaagg atagtttgaa ggagatgagc ctcgaggaaa tttattctta cgagaaatat 720
ggggaattta ttactcaaga gggtattagt ttttacaatg acatatgcgg taaagtcaat 780
tcctttatga atttgtattg ccagaaaaac aaggaaaata aaaatttgta caaattgcag 840
aagctccata agcagattct ctgcatcgct gacacgtctt acgaagttcc ttacaaattc 900
gagtctgacg aggaagttta tcagtccgtt aacggtttct tggataacat cagttctaaa 960
cacattgttg aaaggcttcg gaaaattgga gacaattaca atgggtacaa cttagacaag 1020
atttatattg tttcgaaatt ctatgagagc gtctcacaaa agacataccg tgattgggaa 1080
accattaaca ctgcacttga aatccattac aataacattc ttcctggaaa tggaaagtct 1140
aaggccgaca aggtgaaaaa ggctgtcaaa aatgatctac agaaaagtat taccgaaata 1200
aatgagctcg ttagcaacta taaactttgt tcagatgata atataaaggc cgaaacctat 1260
atacacgaaa tcagtcacat tttgaacaac tttgaagctc aggaactgaa gtataatcct 1320
gaaatccatc tagtggaatc tgaacttaag gcatccgagc taaaaaatgt ccttgacgtt 1380
ataatgaatg cgttccactg gtgtagcgtt tttatgacgg aggagctagt ggataaagat 1440
aacaacttct acgcagagtt agaagaaatc tacgatgaga tatatcctgt cattagtttg 1500
tataatcttg tcaggaatta tgtgacacag aaaccatact ctactaaaaa aattaagctt 1560
aactttggaa ttccgacact cgctgacggc tggtccaagt ctaaggagta ctctaataat 1620
gcaatcatat tgatgaggga caacttatat tatctgggca tatttaatgc caaaaacaag 1680
ccagataaga aaattatcga ggggaacacg tctgaaaaca agggagatta caaaaaaatg 1740
atttataacc tcttgcctgg gccaaataag atgataccaa aagtttttct aagttctaag 1800
acaggcgtgg aaacttataa gccttcagcg tatatacttg aaggatacaa acagaataag 1860
catatcaagt ctagcaagga tttcgacatc actttttgcc atgacttgat cgattatttt 1920
aaaaactgta tagcaataca tcctgagtgg aagaacttcg gattcgattt ttccgatact 1980
agcacctatg aggacatttc tggtttctac agagaagttg agctgcaggg ctataaaata 2040
gattggacat atatcagtga gaaagacatt gatctactcc aggagaaagg gcagctctac 2100
ttgtttcaga tctacaacaa ggatttttca aaaaagtcaa ctggaaatga caatttgcac 2160
accatgtact tgaaaaatct ttttagcgaa gaaaatctta aggacattgt attaaaactg 2220
aacggagagg ctgagatttt ttttaggaag agctccatta agaacccaat cattcacaag 2280
aaagggagta tcctggtcaa cagaacctat gaagctgagg aaaaagatca attcggaaat 2340
attcagatcg tccgcaaaaa tatacctgag aacatctacc aagaactata caaatatttt 2400
aatgacaagt ccgataagga gctttccgat gaggccgcta agctgaaaaa cgtggttggt 2460
caccacgagg cggccactaa cattgttaaa gactaccgtt atacttatga caagtacttt 2520
ttgcatatgc ctattacaat caacttcaag gcaaacaaga ccggctttat caatgacagg 2580
attcttcagt acatcgcaaa ggagaaggat ctacacgtga tcgggataga tagaggagag 2640
agaaacttga tctatgtgag tgtgattgat acatgtggga atattgtcga acagaaatca 2700
ttcaacattg tgaatggcta cgattaccaa attaagctga agcagcaaga gggagcaaga 2760
caaatcgctc gcaaggaatg gaaggagatc ggaaagatca aggagataaa agaaggctat 2820
ctcagcctcg taatccacga gatttcaaag atggtgatta agtacaatgc aatcattgcc 2880
atggaggatc tctcgtacgg attcaaaaaa ggtaggttca aggttgaacg acaagtctac 2940
cagaaatttg agacgatgtt aatcaacaaa ttgaactacc tcgtatttaa ggatattagt 3000
attactgaaa atggtgggct tttgaagggt tatcagttga cctacattcc agacaagctc 3060
aagaatgtgg gacaccaatg tggctgcatt ttttacgtgc ccgcagcata tacttctaaa 3120
attgatccta ccacaggatt cgtaaatatc tttaaattca aagatcttac cgttgacgct 3180
aagagggagt ttattaagaa attcgactct attcgctatg atagtgaaaa gaatctcttt 3240
tgcttcacat ttgattacaa caacttcata acccaaaaca ctgtgatgag caagagcagt 3300
tggtctgttt acacctacgg agttcgtatt aagaggcgat ttgtcaatgg tcggttctct 3360
aatgaatcag acacaattga tatcactaaa gatatggaaa agactcttga aatgacagat 3420
attaactgga gagatggaca cgatttaagg caagatatta tagattatga aattgtgcaa 3480
catatttttg aaatatttag gttgacggta caaatgcgca actcattgtc agaacttgag 3540
gatagagact atgatcgttt gatatcccct gttctcaatg aaaataatat cttctacgat 3600
tcagctaagg caggggacgc gcttcctaaa gatgctgatg cgaatggagc ttactgtatt 3660
gcgctaaaag gcttgtatga gataaagcaa attaccgaaa actggaaaga ggatggaaaa 3720
tttagtcgag ataagctcaa aatatcaaac aaagattggt ttgacttcat tcagaacaaa 3780
aggtacctc 3789
<210> 87
<211> 3789
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 87
atgaacaacg ggaccaacaa cttccagaat ttcatcggca tctcctctct ccagaagacc 60
ctcaggaacg ccctcattcc aacagagacc acacaacagt tcatcgtcaa gaacgggatc 120
ataaaggagg acgagcttag gggagagaac aggcaaattc ttaaggacat catggacgat 180
tactacaggg gcttcatcag tgagaccctc agctccatag acgacatcga ctggacctcc 240
ctctttgaga agatggagat ccagttgaag aacggggaca acaaggatac actcatcaag 300
gagcaaaccg agtacagaaa ggccatccat aagaagtttg ccaacgacga caggttcaag 360
aacatgttca gcgccaaatt gatcagcgac attctccccg agttcgttat ccacaataat 420
aactacagcg cctccgagaa agaggagaag acccaggtga tcaagctctt ttccaggttc 480
gcaaccagtt tcaaggacta cttcaagaac agggctaact gcttcagcgc cgacgacatc 540
tcctcctcat cctgccacag gatagtcaat gacaatgccg agatcttctt ctccaatgcc 600
ctcgtttaca gaaggattgt gaaaagcctc tccaatgacg acatcaacaa gatctccggt 660
gacatgaaag acagtctcaa ggaaatgtca ctcgaggaga tttacagtta tgagaagtac 720
ggtgaattta tcactcagga ggggatctcc ttctataacg atatttgcgg taaggttaac 780
agcttcatga acctctactg tcagaaaaac aaggagaata agaacttgta caagttacag 840
aagctccaca aacaaatctt gtgtatagcc gataccagct acgaagtgcc ctataagttc 900
gaatccgacg aagaggtgta tcagtccgtt aatggttttc tcgataacat ctcctccaaa 960
cacatcgtgg agaggttgag aaagatcggg gacaactata atggctacaa cctcgataaa 1020
atatacatcg ttagcaagtt ctatgagagc gtttcccaaa aaacctacag agactgggag 1080
accatcaaca ccgcccttga aatccactat aacaacatct tgcccggcaa cggtaagtcc 1140
aaggccgaca aggtgaagaa ggccgtcaag aatgatttgc agaagagcat caccgagata 1200
aatgaactcg tctccaacta taagctctgc tccgatgaca acatcaaagc cgagacatac 1260
attcatgaga tttcccacat ccttaacaat tttgaggccc aggagctcaa gtacaacccc 1320
gagatccacc tcgtcgaaag cgagctcaag gccagcgagc ttaagaacgt gctcgatgtg 1380
ataatgaacg ccttccattg gtgctccgtc ttcatgactg aggagctcgt cgacaaggat 1440
aataacttct acgccgagct cgaggagatc tacgacgaga tctaccccgt catctccctc 1500
tataacctcg ttaggaacta cgtcactcaa aagccctatt ccaccaagaa gatcaaattg 1560
aactttggca tacccacact cgccgacggc tggagtaagt ccaaggagta ttcaaataat 1620
gccatcatcc tcatgaggga taacctctac tacttgggca tctttaacgc caagaataaa 1680
cctgacaaga agatcatcga ggggaatacc tccgagaaca agggagatta taagaagatg 1740
atttacaatc tcctccctgg gcccaacaaa atgataccca aggtgtttct ctcatccaag 1800
accggcgtcg agacatataa gcctagcgcc tatatacttg aaggctacaa acagaataag 1860
cacatcaagt cttccaagga cttcgatatc acattctgcc acgatctcat cgactacttc 1920
aaaaactgca tcgcaattca tcctgagtgg aagaattttg ggtttgattt ctccgatacc 1980
agcacttacg aggacatatc cggcttctac agggaagtcg agctccaggg atacaagata 2040
gactggacat acatttctga gaaggacata gaccttttgc aggagaaggg gcagctctac 2100
ctcttccaga tttataacaa ggacttcagt aagaaaagca ccgggaacga caacctccat 2160
accatgtacc tcaagaacct tttcagtgaa gaaaacctca aggacatagt gctcaagctt 2220
aatggcgagg ctgaaatctt cttcaggaag tcttcaatca agaatcccat catccataag 2280
aagggcagta tccttgtgaa taggacctac gaagccgagg agaaggacca gttcggcaac 2340
attcagatag tcaggaaaaa tatccccgag aacatctacc aggaactcta taagtacttc 2400
aacgacaaga gcgacaaaga gctcagcgac gaggccgcca agctcaagaa cgtcgtcggg 2460
caccatgagg ccgccaccaa tatcgttaag gactacaggt acacttatga caagtatttc 2520
ctccacatgc ctatcacaat taatttcaag gcaaacaaga ccgggtttat caacgacagg 2580
atcctccagt acatcgccaa ggagaaagac ttgcatgtta tcggcattga caggggcgaa 2640
aggaatctca tctacgttag cgtgatagac acctgcggca atatcgtcga gcagaagtct 2700
ttcaacattg tcaacggcta cgattaccag atcaagctca agcaacagga gggggccagg 2760
cagatcgcca ggaaggagtg gaaggaaatc gggaagatca aggagatcaa ggaggggtat 2820
ctctctctcg tcatccacga aatctcaaag atggttatca agtataacgc catcatcgct 2880
atggaagact tgtcctacgg gttcaagaag ggcaggttca aggttgagag gcaagtgtat 2940
cagaagtttg agacaatgct catcaacaaa ctcaactatc tcgtgttcaa ggacatctcc 3000
atcactgaga acggtgggct cctcaaaggg taccagctca cctacatccc tgataagctc 3060
aaaaatgtcg gccaccaatg cggctgcatc ttctacgtcc ccgccgccta cacatctaag 3120
atcgacccca ccaccgggtt tgtgaacatc ttcaagttca aggacctcac agtggacgcc 3180
aaaagagaat tcatcaagaa atttgactcc atcaggtacg atagcgagaa aaacttgttc 3240
tgcttcacat ttgactataa caactttata acccagaata cagtgatgag caaaagctcc 3300
tggagcgtgt acacatacgg cgtgaggatc aagagaaggt tcgttaacgg gaggttctcc 3360
aacgagtccg acaccatcga cattaccaag gacatggaaa aaaccttgga gatgactgac 3420
atcaactgga gagacgggca cgacctcagg caggacatca tcgattacga gatcgtgcag 3480
cacattttcg aaatctttag attgacagtt cagatgagaa actccctctc cgagctcgag 3540
gacagagatt acgacaggct cataagcccc gtgctcaacg agaacaacat cttttacgac 3600
tccgccaagg ccggggacgc acttcccaag gacgctgacg ccaacggggc ctactgcatc 3660
gcactcaaag gcctctatga gataaaacag atcaccgaaa actggaagga ggacggtaag 3720
ttcagcaggg acaagctcaa aatatctaat aaagactggt tcgatttcat ccaaaacaag 3780
aggtacttg 3789
<210> 88
<211> 3789
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 88
atgaataatg ggactaacaa cttccagaac ttcatcggta ttagcagtct ccagaagacc 60
ctcaggaacg ccctcattcc caccgagacc acccagcagt ttatcgtgaa gaacggcatc 120
ataaaggagg acgagctcag gggcgaaaat aggcagattc tcaaggacat catggatgat 180
tactacaggg gattcatctc cgaaactctc tcctctatag atgatatcga ctggaccagt 240
ctcttcgaaa agatggaaat ccaactcaag aacggcgaca acaaggatac acttatcaaa 300
gagcagactg agtacagaaa agctatccac aagaagttcg ccaacgatga caggtttaag 360
aacatgttct ccgctaagct catttctgac attctccctg aatttgtcat ccacaacaat 420
aattacagtg cctctgagaa ggaggagaaa acccaagtga ttaagctctt cagcaggttc 480
gccacatcat tcaaagacta cttcaagaat agggccaact gcttctccgc cgacgatatc 540
agctcctcaa gttgccacag gatcgtgaat gataacgccg agatcttctt cagcaacgcc 600
ctcgtctaca ggaggatcgt caaatccctc tcaaatgatg acatcaacaa gatcagcggg 660
gacatgaagg acagtctcaa ggagatgagc cttgaggaga tctactccta tgagaagtac 720
ggggagttca taacccaaga ggggatctcc ttctacaacg acatctgcgg aaaagtgaac 780
agctttatga acctctactg ccagaaaaac aaggagaaca agaacctcta caagctccag 840
aagctccaca aacagatctt gtgcatcgca gatacctctt acgaggtgcc ctacaagttc 900
gagtccgacg aagaggtcta ccaaagcgtt aacgggtttc tcgacaacat cagctctaag 960
cacatcgtgg aaagactcag gaagattggt gataactaca acgggtataa tctcgataaa 1020
atctatatag tgtccaagtt ctatgagagc gtgagccaga agacctacag ggactgggag 1080
accatcaaca cagccctcga gatccactat aacaacatcc tccccgggaa cgggaagtcc 1140
aaggccgata aggttaagaa agccgtcaag aacgacctcc agaagagcat aaccgagatc 1200
aatgagctcg tctccaacta caagctctgc tctgatgaca atatcaaggc tgagacctac 1260
atccacgaga tcagtcacat cctcaataac ttcgaagccc aggagctcaa gtacaaccca 1320
gaaatccacc tcgtcgagtc cgagctcaag gcatctgagc tcaagaacgt cttggacgtg 1380
attatgaacg ccttccattg gtgcagtgtc ttcatgaccg aggagttggt ggacaaggac 1440
aataacttct acgccgagct cgaggagatc tatgacgaga tctatcccgt tatcagtctc 1500
tacaacctcg ttaggaacta tgtgacccag aagccctact ccacaaagaa gatcaagctc 1560
aacttcggga tccccacttt ggccgatggg tggagcaaga gcaaggagta ctccaacaat 1620
gcaattatcc tcatgaggga caatctctac tacctcggga tctttaacgc caaaaataag 1680
cccgacaaga agatcatcga ggggaacacc tccgaaaaca agggggacta caagaagatg 1740
atctataatc tcctccccgg tcccaacaag atgattccca aggtgtttct cagttccaag 1800
accggcgttg agacctacaa accctctgca tacatccttg agggctacaa gcagaataaa 1860
cacatcaaat cctctaaaga cttcgacatc accttctgcc acgacttgat cgactacttc 1920
aagaactgca ttgccatcca ccccgagtgg aagaacttcg gtttcgactt ttccgatacc 1980
agcacctacg aggacatcag cggtttttac agggaagttg agcttcaggg ctacaagatc 2040
gactggacct acatctccga gaaggacatc gacctcttgc aggagaaggg gcagctctat 2100
ctcttccaaa tctataacaa ggactttagc aagaagtcca ccgggaacga caacttgcac 2160
accatgtacc tcaaaaacct cttttccgag gaaaacctca aggacatcgt gttgaagctc 2220
aatggcgagg ccgagatctt cttcaggaag tcttcaatca agaaccccat catccacaag 2280
aaggggtcaa tccttgttaa tagaacttac gaagctgaag agaaggacca atttgggaac 2340
atccagatcg tcaggaagaa tatccctgag aacatctacc aggaactcta caaatacttc 2400
aacgacaaga gcgacaagga gctctccgac gaggccgcca agctcaagaa tgttgtgggc 2460
caccacgaag ctgccaccaa catcgtcaaa gactacaggt acacttatga caagtacttt 2520
ctccacatgc ccattaccat caatttcaaa gccaacaaaa ccggcttcat caacgacagg 2580
attctccagt acatcgccaa agagaaggac ctccatgtta tcgggatcga ccggggagag 2640
agaaacctca tctacgttag cgtcatagac acctgtggga acatagtcga acaaaaaagc 2700
ttcaacatcg ttaacggata cgattaccag atcaagctca agcaacagga gggtgcaagg 2760
cagatcgcca ggaaggagtg gaaggagatc gggaagatta aggagatcaa ggagggctac 2820
ctctcccttg tgatccacga gatatccaaa atggtgataa agtataacgc aattatcgcc 2880
atggaggacc tcagctatgg gttcaaaaaa ggcaggttca aggtggagag gcaagtgtac 2940
cagaagtttg aaaccatgtt gatcaacaag ctcaactacc tcgtgttcaa ggacatcagc 3000
atcaccgaga atggtggcct cctcaagggc taccagctca cctacatccc cgataaactt 3060
aagaacgtgg gccaccagtg cgggtgcata ttctatgtcc ccgctgccta caccagtaaa 3120
atcgacccaa ccaccgggtt cgttaacatc tttaagttca aggacctcac cgttgacgca 3180
aaaagggagt tcatcaaaaa attcgacagc atcagatacg acagcgaaaa gaatcttttc 3240
tgcttcacct ttgactacaa caatttcatc actcagaaca ctgtcatgtc taagagcagc 3300
tggtcagtgt acacctacgg agtgaggatc aagagaagat tcgtcaacgg gagattcagt 3360
aatgagagtg acaccattga cataacaaaa gacatggaga agacactcga gatgaccgat 3420
atcaattgga gggatgggca tgacctcagg caggacatca tagattacga aatcgtccag 3480
cacatcttcg agattttcag actcacagtg cagatgagga atagtctcag cgagctcgag 3540
gacagggact acgacaggtt gatctccccc gtgctcaacg agaataatat cttctacgac 3600
tctgccaagg ccggcgacgc cctccccaaa gacgcagacg caaacggggc ctactgcatc 3660
gcactcaagg ggctctacga gatcaagcag atcaccgaga attggaagga ggacgggaag 3720
ttctccaggg acaaacttaa gatcagtaac aaggactggt tcgacttcat ccaaaataag 3780
aggtacttg 3789
<210> 89
<211> 3789
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 89
atgaacaacg gaaccaacaa cttccaaaat ttcatcggta tctcctccct tcagaagaca 60
ctcaggaacg cccttatccc caccgaaaca acacagcagt tcatcgtgaa gaatggaatc 120
atcaaggaag acgagctcag gggggagaat aggcagattc tcaaggatat tatggacgat 180
tactacaggg gcttcatcag cgaaaccctc agcagtatcg acgacatcga ctggactagc 240
ctcttcgaga agatggagat ccagctcaag aacggtgaca ataaagacac cttgatcaag 300
gagcaaaccg aatataggaa ggccatccac aagaaatttg ccaacgacga cagattcaag 360
aacatgttct ccgccaaact catctccgat atcctccccg aattcgtcat ccacaataat 420
aactactccg ccagcgagaa ggaggagaag acccaggtca tcaaactctt cagcaggttc 480
gccacatcct ttaaggacta ctttaagaac agggccaact gcttctccgc tgacgacata 540
agctccagct cctgccacag gattgtcaat gacaacgcag aaatcttctt ctccaatgcc 600
ctcgtctaca ggaggatcgt gaagtccctc agcaatgacg acatcaacaa gatctccggg 660
gacatgaaag actcactcaa ggaaatgagt ctcgaggaga tctacagtta cgagaagtat 720
ggcgaattta taacccagga aggtattagt ttctacaacg acatctgcgg gaaagtcaac 780
agcttcatga acctctattg tcagaagaac aaggagaaca aaaacctcta caagctccag 840
aagctccaca agcagatcct ctgtatcgct gacaccagct acgaggtccc ctataagttc 900
gagtccgatg aggaggtgta tcagagcgtg aatggctttc tcgacaacat atccagcaag 960
cacatcgtcg agaggctcag gaagatcggc gacaattaca acgggtacaa cctcgacaag 1020
atctacatcg tgtcaaagtt ttacgagagc gtgtcacaaa aaacctacag ggactgggaa 1080
acaatcaata ccgccctcga gatccactac aacaatatcc tccccggcaa cggtaagagc 1140
aaggccgaca aagtgaaaaa ggccgtgaaa aacgacctcc agaagagcat caccgagatc 1200
aacgagctcg tctccaatta caagctttgc tccgatgata acatcaaggc cgaaacctac 1260
atccacgaaa tttcccacat cctcaataac tttgaggccc aggagctcaa gtacaatccc 1320
gaaatccacc tcgtcgagtc cgaactcaag gcctccgaac tcaaaaacgt cctcgacgtc 1380
atcatgaacg cctttcactg gtgctccgtt tttatgaccg aggagttggt ggacaaagac 1440
aacaacttct acgccgaact tgaggaaatc tatgacgaga tctatcccgt catcagcctc 1500
tacaatctcg tgagaaacta cgtgacccag aagccctact caaccaaaaa gattaagctc 1560
aacttcggca ttcccaccct cgccgacggc tggagcaagt ccaaggagta ctccaacaat 1620
gccataatct tgatgagaga taatttgtac tacctcggga tcttcaacgc caagaacaag 1680
cccgacaaga aaattataga gggcaacacc tccgagaaca agggggatta caagaagatg 1740
atctataacc tccttcccgg tccaaacaag atgatcccca aggtgttcct ctccagcaag 1800
actggtgtgg agacctacaa gccctcagcc tatatcctcg aggggtacaa acagaacaaa 1860
cacatcaaga gctctaagga cttcgacatt accttctgtc atgacctcat cgactatttt 1920
aagaactgca tcgctatcca ccccgaatgg aagaacttcg gtttcgactt cagcgacacc 1980
agcacctacg aggacataag cgggttctat agagaggtcg agctccaggg atataagatc 2040
gattggacct acataagtga aaaggacatt gatttgctcc aggagaaggg ccagctctat 2100
ctcttccaga tttataataa ggacttctct aagaagagca ccggcaacga taacctccac 2160
accatgtacc tcaagaacct cttctccgag gagaacctca aggatatcgt gctcaagctc 2220
aacggggagg ccgagatttt cttcaggaag agcagcatca aaaaccccat catccacaag 2280
aaggggagca ttctcgtgaa caggacctac gaggccgagg agaaggacca gttcggtaac 2340
atccagatag tgaggaaaaa catccccgag aacatctatc aagagctcta caaatatttc 2400
aacgacaaat ctgacaagga gctctccgac gaggccgcca agcttaagaa cgttgttggg 2460
caccacgagg ccgccaccaa tatcgtcaaa gactacagat atacatacga caagtacttc 2520
ctccacatgc caattaccat caatttcaag gccaacaaga ccggcttcat caacgacaga 2580
atcctccaat acatagctaa ggagaaggac ttgcatgtca tcggtatcga caggggggaa 2640
aggaacctca tctacgtcag cgttatagac acctgcggaa acatcgtcga gcagaagtcc 2700
ttcaatatcg tcaacggcta cgactaccag ataaagctca agcagcagga aggggcaaga 2760
cagatcgcca gaaaggagtg gaaagagatc ggcaagatca aggagatcaa agaggggtac 2820
ctcagccttg tgatccatga gatcagcaag atggtcatca agtacaacgc tataatcgcc 2880
atggaggacc tctcctatgg gttcaagaag ggcagattta aggtggagag gcaggtctac 2940
caaaaattcg agaccatgct cataaacaag ctcaattatt tggttttcaa agacatcagc 3000
attacagaga atggcgggct cctcaaaggg tatcagctta cctacattcc cgacaagctc 3060
aagaacgtcg gccaccagtg cgggtgcatc ttctacgtcc ccgccgccta cacctccaaa 3120
atcgacccca ccaccggctt cgtgaacatc tttaagttca aggacctcac cgtggatgcc 3180
aaaagggagt tcatcaagaa attcgacagt atcagatatg actcagaaaa gaacctcttc 3240
tgcttcactt ttgactataa caacttcata acccagaaca ccgttatgag caagagtagc 3300
tggtccgtgt atacctacgg cgtcagaatc aagaggagat ttgtgaacgg taggttcagc 3360
aacgagagtg acaccatcga catcaccaag gatatggaga agaccctcga aatgaccgac 3420
atcaattgga gggatggaca cgacctcagg caagacatca tcgattacga gatagttcag 3480
cacatcttcg aaatatttag gttgaccgtt cagatgagga attccctcag cgagctcgag 3540
gatagagact acgacagact tatcagcccc gttctcaacg agaacaacat tttctacgac 3600
tcagcaaagg ccggcgacgc cctccccaag gacgccgacg ccaatggcgc ctattgcatc 3660
gccctcaagg gtctctacga gatcaagcag atcaccgaaa attggaagga agacggcaag 3720
ttcagcagag acaagctcaa aatatcaaac aaggattggt tcgatttcat acagaacaag 3780
cgatacctc 3789
<210> 90
<211> 3789
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 90
atgaacaatg ggactaacaa cttccagaac ttcatcggca tctcatccct ccagaaaacc 60
ctcaggaacg ccctcatccc caccgagaca acccagcaat ttatcgtcaa aaacggtatc 120
atcaaagaag acgaactcag aggtgagaac aggcagatcc tcaaggacat tatggacgac 180
tattacagag gcttcatctc agagaccctc agtagtatcg acgacatcga ctggacctct 240
ttgtttgaga agatggagat tcagctcaag aatggagaca ataaggacac tctcatcaag 300
gagcaaaccg agtacagaaa agccatacac aagaaatttg ccaacgacga caggttcaag 360
aacatgttca gcgcaaagct cattagcgac atactccctg agtttgtgat ccataacaac 420
aactactccg cttccgagaa ggaggaaaag acccaggtta taaagctttt cagcagattc 480
gccacctctt tcaaggatta ctttaaaaat agggccaact gcttttccgc cgatgatatc 540
agttccagca gctgtcatag gatcgtgaat gataacgccg agatcttctt cagcaatgcc 600
ctcgtctaca ggaggatcgt gaagagcttg tccaacgacg acatcaacaa gatctccggg 660
gacatgaagg acagcctcaa ggaaatgagc ctcgaggaga tctactctta tgagaagtac 720
ggcgagttca tcacccagga aggcatatca ttttacaacg acatttgcgg caaggtgaac 780
agtttcatga acctctactg tcagaaaaac aaggagaata aaaacctcta caagctccag 840
aaattgcaca aacagattct ctgtatcgcc gataccagct acgaggtccc ctacaaattc 900
gagagcgatg aggaggtcta ccagtccgtt aacggctttc tcgacaacat ctcctctaaa 960
cacattgtcg agaggttgag aaagatcggc gacaactata acggctataa tctcgacaag 1020
atctacatcg tgagtaagtt ctacgagagc gttagtcaga agacctacag ggactgggag 1080
accatcaaca cagctctcga aatccactat aacaatatcc tccccgggaa cggtaagagc 1140
aaggcagaca aggtgaagaa ggccgtgaag aacgaccttc agaagtccat caccgagatc 1200
aacgagttgg tgagcaatta caagctctgc tccgacgaca acatcaaagc agagacctac 1260
atccacgaaa tcagccacat cttgaacaac ttcgaggccc aggagttgaa atacaacccc 1320
gaaatccacc tcgttgagag tgaactcaaa gccagtgagc tcaagaacgt cctcgacgtt 1380
attatgaacg ccttccactg gtgctctgtt ttcatgaccg aggagctcgt cgataaagat 1440
aacaacttct acgccgagct cgaagagata tacgatgaga tataccccgt catctccctc 1500
tacaatctcg ttaggaatta tgtcacccaa aagccctact ctaccaagaa gattaagctc 1560
aacttcggaa tccccaccct cgccgacggg tggagcaaga gcaaggagta ctctaacaat 1620
gccatcatcc ttatgaggga caatctctac tacctcggta tcttcaatgc caagaacaag 1680
cccgacaaga aaatcatcga gggcaacacc tccgagaata aaggggacta caagaagatg 1740
atctataatc tcctccccgg gcccaacaag atgatcccca aagtctttct ctcctcaaaa 1800
accggggtgg aaacttacaa gccctccgcc tacatcctcg agggctacaa gcaaaacaaa 1860
cacatcaagt cctccaagga cttcgacata acattctgcc acgacctgat cgactacttc 1920
aagaattgca tcgccataca cccagaatgg aaaaacttcg ggtttgactt cagcgacaca 1980
agcacctacg aggatattag tggcttctat agagaagttg agctccaggg atacaagatc 2040
gactggacct acatcagcga gaaggacatt gacttgctcc aggagaaggg gcaactttac 2100
ctcttccaga tctataataa ggacttctcc aaaaagtcca ccggcaatga caacctccac 2160
acaatgtacc tcaagaacct tttcagtgag gagaacctca aagacatcgt cctcaaattg 2220
aacggcgagg ccgaaatctt cttcaggaag tccagcatca agaatcccat catccataag 2280
aaggggagca tcttggttaa caggacctat gaggccgagg agaaggacca gtttggcaac 2340
atccagatag tgaggaaaaa catccccgag aacatttacc aggagctcta caagtatttc 2400
aacgacaaat ccgacaagga attgagtgac gaggctgcca agctcaagaa cgttgtgggg 2460
caccatgagg ccgccaccaa tatagtcaaa gactataggt acacctacga caagtacttc 2520
ctccacatgc ccattaccat caacttcaaa gctaacaaga ccgggttcat aaacgacagg 2580
attctccagt acattgccaa agagaaggac ctccacgtca tcggcatcga caggggagag 2640
aggaacttga tctacgtgag cgtcatcgac acatgcggga acatcgtgga acagaagtct 2700
tttaacatcg tcaacgggta cgactaccag atcaagctca agcagcagga gggggccagg 2760
cagatcgcta gaaaggagtg gaaggagatc gggaagatca aggaaattaa ggaggggtat 2820
ttgtccctcg tgattcacga gatctccaag atggtcatca aatacaatgc catcatcgcc 2880
atggaggacc tcagctacgg atttaagaag ggcaggttca aagtggaaag gcaagtctac 2940
caaaagttcg agaccatgct catcaacaag ctcaattatc ttgtatttaa agatatcagc 3000
atcaccgaga acggaggcct cttgaaaggc taccaactca cctacatccc cgacaagttg 3060
aagaatgttg ggcaccagtg cggctgcatc ttctacgtgc ccgccgccta cacatcaaag 3120
atcgacccaa ctaccgggtt cgtgaacatt ttcaagttta aggatctcac cgtggacgcc 3180
aaaagggagt tcatcaagaa attcgacagc atcaggtatg acagcgagaa gaatctcttt 3240
tgttttacct tcgactacaa caacttcatc acccagaaca ccgtcatgtc caagtccagc 3300
tggagcgtct acacctatgg tgtgaggatc aagaggaggt ttgttaacgg gaggttttct 3360
aacgagtcag acaccatcga tataaccaaa gacatggaga aaacacttga gatgaccgac 3420
atcaactgga gagacggcca cgaccttagg caggacatca tcgattacga gatcgtccaa 3480
cacattttcg aaatcttcag gctcaccgtt caaatgagga actccctcag cgagcttgag 3540
gacagggact atgacaggct catcagtccc gttctcaacg agaataacat cttctacgat 3600
agcgcaaagg ccggcgacgc cttgcccaag gacgctgatg ccaatggcgc ctactgtatc 3660
gccttgaaag gcctctacga gatcaagcag atcaccgaga actggaaaga ggacggtaag 3720
ttcagcaggg acaaactcaa aatcagtaat aaggactggt ttgattttat ccaaaacaag 3780
aggtacttg 3789
<210> 91
<211> 3789
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 91
atgaataacg ggaccaataa ttttcaaaac ttcatcggca tttccagcct ccagaagacc 60
ctcaggaacg cactcatacc caccgagacc acccagcagt tcatcgtcaa aaacggcatc 120
atcaaggagg acgagttgag gggcgagaac aggcaaatac tcaaagacat catggatgac 180
tactacaggg gcttcatctc agagaccctc agttctatcg acgacatcga ctggacatct 240
ctcttcgaaa agatggagat tcagcttaaa aatggggaca acaaggacac cctcatcaag 300
gaacagaccg agtacaggaa ggcaattcac aaaaagttcg ccaacgatga caggttcaag 360
aacatgttct ccgccaagct catcagcgat atcctcccag agttcgtcat tcacaataat 420
aactatagcg cctccgagaa ggaagaaaag acacaggtta tcaaactctt cagcaggttc 480
gctacctcat ttaaggacta cttcaagaac agggccaact gcttttccgc tgacgatatc 540
agctcctcct cttgccatag gatcgttaac gacaatgccg agatcttttt ctcaaacgcc 600
ttggtttata ggaggatcgt gaagagcttg tctaacgatg atatcaacaa gatctctggc 660
gacatgaagg actcactcaa ggagatgtct ttggaggaga tctattctta cgagaagtat 720
ggggagttca taactcagga gggcatcagc ttctacaacg acatctgtgg caaggtcaac 780
tccttcatga acctctactg ccaaaagaac aaggagaaca agaacctcta taaactccag 840
aagctccaca agcaaatcct ctgcattgca gacacctcat acgaagttcc ctacaagttc 900
gagagtgatg aggaggtgta ccagagcgtt aacgggttcc tcgataacat cagctccaag 960
cacattgtgg aaaggttgag gaagatagga gacaactata atggctacaa cctcgacaag 1020
atctacatcg tgtctaagtt ctacgaaagc gtgagccaga agacatacag ggattgggag 1080
accattaaca ccgctctcga gatacattac aacaacatcc tccccggcaa cggcaagtca 1140
aaggccgaca aggttaaaaa ggctgtcaag aacgatctcc agaagtccat caccgagata 1200
aacgagctcg tgagcaacta caaactctgc agcgacgaca acatcaaagc cgagacctac 1260
atccatgaga tttcccacat ccttaataac tttgaggccc aagagctcaa gtataaccct 1320
gagatccacc tcgtcgagag cgagcttaag gcaagcgaac tcaagaacgt gctcgacgtt 1380
atcatgaacg cttttcattg gtgctccgtt ttcatgaccg aggagctcgt tgacaaagac 1440
aacaattttt acgccgaact cgaggagatc tacgacgaga tctaccccgt catctccctc 1500
tacaacttgg tgagaaacta cgtcacccag aagccctaca gcacaaagaa gatcaagctc 1560
aacttcggca tcccaaccct cgccgacggc tggtcaaaaa gtaaggagta ctccaacaac 1620
gccatcatac tcatgaggga caacctctac tatctcggta tcttcaacgc taagaacaaa 1680
ccagacaaga aaatcatcga ggggaacacc tccgaaaaca agggggatta caagaagatg 1740
atctataacc tcctcccagg cccaaacaag atgatcccca aggtcttcct cagctccaag 1800
accggagttg agacatacaa gccctcagcc tatatcctcg agggttacaa gcagaacaaa 1860
catatcaaga gcagcaaaga tttcgatatc accttctgcc acgacctcat cgactatttt 1920
aagaactgca tcgccataca tcccgagtgg aagaacttcg gttttgactt ttccgacacc 1980
tcaacttacg aggacatctc cggcttctac agggaggtcg agcttcaggg atataagatc 2040
gactggactt acatctccga aaaggacatc gacttgctcc aggagaaggg acagctctat 2100
ttgttccaga tctacaacaa ggactttagc aagaagagca ccggtaatga caatctccat 2160
accatgtatc ttaagaacct cttttccgag gagaacctca aagacatcgt gctcaaattg 2220
aacggcgagg ccgagatctt cttcagaaag agcagcatca agaatcccat catccacaag 2280
aagggttcca tcttggttaa taggacatat gaggccgagg agaaggacca gtttgggaac 2340
attcagatcg ttaggaagaa catcccagag aacatctacc aggaactcta caaatacttc 2400
aacgacaaat ccgacaagga gttgagcgac gaggctgcca aacttaagaa cgttgtcggc 2460
caccatgaag ccgccaccaa tatcgtcaag gactataggt acacctacga taagtacttt 2520
ctccacatgc ccatcaccat taatttcaag gccaataaga ccggctttat caacgacagg 2580
attttgcagt acatcgcaaa ggagaaggac ctccacgtga taggcatcga caggggggag 2640
aggaacctca tctacgtttc agtcatcgat acctgcggca acattgtcga gcagaaatca 2700
ttcaacatcg tgaacgggta cgactaccag atcaaactca agcagcagga gggggccagg 2760
caaattgcca ggaaggagtg gaaggagatt ggcaagataa aggagatcaa ggaggggtac 2820
ctctcccttg tgatccacga aattagtaaa atggtgatta aatataatgc catcatagcc 2880
atggaagacc ttagctatgg gttcaagaag ggaaggttca aggtcgagag gcaggtctac 2940
cagaagttcg agaccatgtt gatcaataag ctcaactatc tcgtctttaa agacatcagc 3000
atcaccgaga atgggggtct ccttaagggc tatcagctca cctacatccc cgacaagctc 3060
aagaatgttg ggcaccagtg cggctgcatc ttctacgtcc ccgccgccta cacttcaaag 3120
atcgacccca caaccgggtt cgttaacatc ttcaagttca aggatctcac cgtggacgca 3180
aagagggagt tcatcaagaa gttcgatagc atcaggtacg actccgagaa aaacctcttc 3240
tgcttcactt ttgactataa caattttatc actcagaaca ccgtcatgtc aaagtcatca 3300
tggagcgtgt acacttatgg agttaggatc aaaagaaggt ttgttaatgg caggttctca 3360
aacgagagcg acactatcga catcacaaaa gatatggaga agaccctcga aatgactgac 3420
attaattgga gggacggcca cgacctcagg caagacatca ttgactacga gatcgtccag 3480
cacatcttcg aaatcttcag actcaccgtt cagatgagaa actcactcag cgagcttgaa 3540
gacagagatt acgacaggct catctccccc gtgctcaacg agaacaatat cttctacgac 3600
tctgccaaag ccggcgacgc cttgcccaag gatgcagatg ccaacggcgc ttactgtatc 3660
gccctcaagg gcctctacga gatcaagcag atcacagaga actggaagga ggacgggaag 3720
ttctccaggg ataagcttaa gatcagcaac aaggactggt tcgactttat ccaaaacaag 3780
aggtatctc 3789
<210> 92
<211> 3789
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 92
atgaacaacg gcaccaacaa ctttcagaac ttcataggga ttagttcact tcaaaaaacc 60
ctcaggaatg ccctcatacc cactgaaaca acccagcagt tcattgttaa gaacggcatt 120
atcaaagagg atgagctcag gggagagaac agacagatac ttaaagacat tatggacgac 180
tactacagag gcttcattag tgaaactctc tcctccatcg atgacatcga ctggacttcc 240
ctctttgaga agatggaaat ccagctcaag aacggggata acaaagacac cctcatcaag 300
gagcagaccg agtataggaa ggccatccac aaaaagttcg caaatgacga caggttcaag 360
aacatgttct cagccaagct tatctccgat atcctcccag agttcgttat ccacaacaac 420
aactactccg ccagcgaaaa ggaggaaaag acccaggtta tcaagctctt ctctaggttc 480
gctacatcct ttaaggacta tttcaagaac agggctaact gcttctccgc cgacgacatc 540
tcatcctcca gttgccatag gatcgttaac gacaatgccg agatcttctt ctccaatgcc 600
ttggtctaca ggaggatcgt gaagtctctc tccaacgacg acattaacaa gatcagcgga 660
gacatgaagg attcactcaa ggagatgagc ctcgaggaga tctactccta tgagaagtat 720
ggcgaattca tcacccagga ggggatctcc ttctacaacg atatctgtgg caaagtcaac 780
agcttcatga acttgtactg ccaaaagaat aaggagaata aaaacctcta taagctccag 840
aaattgcaca agcagatcct ctgcatcgcc gacaccagct acgaagtgcc ctacaagttt 900
gagtccgacg aggaggtcta ccagtccgtg aacggatttc tcgacaacat ctccagcaaa 960
cacatcgttg agaggttgag aaaaatcggt gataattaca acggttacaa cctcgacaag 1020
atctacatcg tctccaagtt ctacgagagt gtgtcccaga agacctacag ggactgggag 1080
accatcaaca ccgccctcga aatacattac aacaacatac tccctggcaa cgggaagtct 1140
aaggccgaca aggtcaagaa ggccgtgaaa aacgacttgc agaagagcat aaccgagatt 1200
aacgagctcg tctctaacta caagctctgt tccgacgaca acatcaaagc cgagacctac 1260
atacacgaga tatctcacat cctcaacaac tttgaggccc aggagctcaa atacaatccc 1320
gaaatccacc tcgttgagag cgagctcaaa gccagtgagc tcaagaacgt cctcgacgtt 1380
atcatgaacg ccttccattg gtgcagcgtt ttcatgaccg aggagctcgt ggacaaggac 1440
aacaacttct acgccgagct cgaggagatc tatgacgaaa tctaccccgt catctccctc 1500
tacaacctcg tcaggaacta cgtcacccaa aagccctaca gcaccaagaa aatcaaactc 1560
aattttggca tccccactct cgctgacggc tggagcaagt ccaaggagta cagcaacaac 1620
gctattatcc tcatgagaga taacctctat tacctcggca tctttaacgc caaaaacaag 1680
cccgacaaga agatcataga gggcaacacc tccgagaata agggagacta taagaagatg 1740
atatacaacc tccttcccgg ccccaacaaa atgatcccca aggtcttcct ctccagcaag 1800
accggcgttg agacctacaa gccctccgcc tatatcctcg agggatacaa gcagaataag 1860
cacattaaga gtagcaaaga tttcgacatt actttctgcc atgatctcat cgactacttt 1920
aaaaactgca tcgccatcca cccagagtgg aagaactttg gcttcgactt cagtgacacc 1980
agtacctacg aggacatctc cggattctat agggaggtgg agctccaggg atacaaaatc 2040
gactggacct acataagcga gaaggacatc gacttgttgc aggagaaggg ccagctctac 2100
ctctttcaaa tctacaacaa ggacttcagc aagaagtcca caggcaacga caacctccat 2160
accatgtatc tcaagaatct cttctccgag gaaaacctca aagatatcgt gctcaagctc 2220
aacggggaag cagagatctt cttcaggaaa agcagcatca agaaccccat aatccacaag 2280
aaaggatcca tcctcgtcaa caggacctac gaagccgagg aaaaggatca gttcgggaac 2340
attcagatcg tcagaaagaa catccccgaa aacatctacc aggaacttta caagtacttc 2400
aatgacaaaa gcgacaagga actcagcgac gaggccgcta agctcaagaa cgtcgtcggg 2460
caccatgagg cagcaaccaa tatcgtcaag gactacaggt acacctacga caagtatttc 2520
ttgcacatgc ccatcaccat caactttaag gccaacaaaa ccggctttat taacgacagg 2580
atcttgcagt acatagctaa agagaaggat ctccacgtca tcggaataga cagaggggag 2640
aggaatctca tttacgtgtc tgtcatcgac acatgcggca acatcgtcga gcagaagtcc 2700
ttcaacatcg tcaacggtta cgattatcag atcaaattga aacaacagga gggcgccaga 2760
cagattgcca ggaaggaatg gaaggagatc ggaaaaatta aagagatcaa agaggggtac 2820
ctcagcctcg tgatccacga gatcagtaag atggttatca agtacaatgc catcatcgca 2880
atggaagatc tctcctacgg cttcaaaaag gggaggttca aagtcgagag gcaggtttat 2940
caaaagttcg agaccatgct catcaacaaa ctcaactacc tcgtcttcaa agacatctcc 3000
atcacagaaa atggtggcct cctcaaagga taccaactca cctacatccc cgacaaactc 3060
aagaacgtgg gtcaccaatg cgggtgcata ttttacgtgc ctgctgctta caccagcaag 3120
atagacccaa ccaccgggtt cgttaatatt tttaagttca aggacctcac cgtggatgcc 3180
aagagagaat tcatcaaaaa gttcgactcc atcaggtatg acagcgagaa gaaccttttc 3240
tgctttacct tcgactataa taacttcatc acccagaaca cagtgatgtc taagtccagc 3300
tggagcgtct acacctacgg ggtcagaatc aagaggaggt ttgtcaacgg caggttcagc 3360
aacgagtccg ataccatcga catcaccaag gacatggaga agaccctcga aatgaccgat 3420
atcaactgga gagacggaca cgacctcaga caggatataa tcgattacga gatcgttcag 3480
cacatcttcg aaatcttcag gctcaccgtc cagatgagga actccctctc cgagctcgag 3540
gacagggact acgacaggct tatcagcccc gtgcttaatg aaaacaacat tttctacgac 3600
agcgccaagg ccggtgacgc ccttcccaag gacgccgacg ccaacggcgc ctactgtatc 3660
gcactcaaag gtctctacga gatcaagcag atcaccgaga actggaagga ggacgggaag 3720
tttagcaggg ataaattgaa gatcagcaac aaagactggt ttgactttat ccaaaataag 3780
aggtacctc 3789
<210> 93
<211> 3789
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 93
atgaacaacg gaaccaacaa ctttcagaat ttcataggca tctcctccct ccagaagacc 60
ctcaggaacg ccctcatccc caccgagacc acccagcaat tcattgtgaa aaacgggatc 120
atcaaggagg atgagctcag aggcgagaac aggcagatcc ttaaggatat catggacgac 180
tactataggg ggttcatatc cgagaccctc tcctccatcg acgacatcga ctggaccagc 240
ctcttcgaga agatggagat acagctcaaa aacggcgaca acaaagatac cctcatcaaa 300
gagcagaccg agtacaggaa ggctatccat aaaaaattcg ccaacgacga caggttcaag 360
aatatgttct ccgcaaaact catcagcgat atcctccccg agtttgtgat ccacaacaac 420
aattactcag cctccgagaa ggaggaaaag actcaagtga taaagctctt ctctagattc 480
gccacctcct tcaaggacta tttcaaaaat agggccaact gtttctccgc cgacgatatc 540
agctctagca gctgccacag gatcgtcaac gacaacgctg agatcttctt ttccaacgcc 600
ctcgtctaca ggaggatcgt gaagagcctc agcaacgacg acatcaacaa gatcagtggg 660
gatatgaagg acagcctcaa ggaaatgagc ctcgaggaaa tctatagcta cgagaagtac 720
ggggagttca tcacccagga gggaatcagc ttctacaatg atatctgcgg gaaggttaat 780
tccttcatga acttgtactg ccagaagaac aaggagaata agaatcttta caagctccag 840
aagcttcaca agcaaatcct ctgtatcgcc gatacttcct acgaggtccc ctataaattc 900
gagagcgacg aggaggtgta tcaatccgtg aatgggtttc tcgataatat cagttccaaa 960
catatcgtcg agaggctcag gaaaataggg gacaactaca acgggtataa tttggacaaa 1020
atatacatcg tgagcaagtt ttatgagtca gtcagccaga agacttatag ggactgggag 1080
accatcaaca cagccctcga aatccactac aataacatcc tccccggcaa cggcaagtca 1140
aaggcagata aggtgaaaaa agccgtgaag aacgacctcc aaaaaagcat caccgagatc 1200
aacgagttgg tctcaaacta caagctctgc agcgacgaca atatcaaagc cgagacctat 1260
attcatgaga tttcccacat cctcaacaac ttcgaggccc aggagctcaa gtacaacccc 1320
gagatccacc tcgtcgagag cgagctcaaa gcttcagaac tcaagaacgt tcttgatgtt 1380
atcatgaacg cctttcactg gtgttcagtc ttcatgacag aagagctcgt ggacaaggac 1440
aacaacttct acgcagagct cgaagagatc tatgatgaga tctaccccgt catctcattg 1500
tacaatctcg tcaggaacta tgttacccag aagccatact ccaccaagaa gattaagttg 1560
aactttggca ttcctaccct cgccgacggt tggtccaaat caaaggagta ctctaacaac 1620
gccatcatcc tcatgaggga caacctctac tatctcggaa tcttcaacgc caagaataag 1680
cccgacaaga agatcatcga aggcaacaca agcgaaaata agggggacta taagaagatg 1740
atctacaacc tcctccccgg gcccaacaaa atgatcccta aggtcttcct tagcagcaag 1800
accggagttg agacctacaa accatccgcc tacatcctcg agggctacaa gcaaaacaaa 1860
cacatcaagt ccagcaagga ctttgatatc acattctgcc atgacctcat agactacttt 1920
aagaactgta tcgccatcca tcccgagtgg aagaacttcg gtttcgactt cagcgacacc 1980
tccacatatg aagatatctc tggtttctac agagaggtcg aactccaggg ctacaaaatc 2040
gattggacct atatctccga aaaggatata gacctcctcc aggagaaggg gcagctctac 2100
ctttttcaga tctacaataa ggacttctcc aagaagagca ccggtaatga caatctccat 2160
actatgtact tgaagaacct cttctcagag gagaacttga aagacatcgt tctcaagctc 2220
aacggcgagg ccgagatatt cttcagaaag tcttccatta agaatcccat catccacaag 2280
aaagggagca tcctcgtgaa caggacctac gaggccgagg agaaagacca gttcgggaac 2340
atccagatcg tcagaaagaa catccctgag aacatctacc aagagctcta taagtacttc 2400
aacgataaat ccgacaaaga gctctctgac gaagccgcca agttgaagaa cgtcgtcggg 2460
caccacgaag ccgctaccaa catcgtcaaa gactacagat atacctatga caagtacttc 2520
ctccacatgc ctatcaccat aaactttaag gccaataaga ccggattcat caacgacagg 2580
atcctccaat acatcgccaa ggaaaaggac ctccatgtta tcggcataga tagaggagaa 2640
aggaatctca tctatgtgtc cgtcatcgac acctgcggaa acattgttga gcagaaatcc 2700
tttaacatcg ttaacgggta tgactaccag attaagctca agcaacagga gggcgccagg 2760
cagatagcaa ggaaggagtg gaaggagatc gggaagataa aggagatcaa agagggatac 2820
ctcagccttg tcatccacga gatcagcaag atggttatca agtacaacgc cataatcgcc 2880
atggaagacc tttcatacgg gttcaagaag ggcaggttca aggtggagag gcaggtgtac 2940
caaaagttcg agaccatgct catcaacaag ctcaactacc tcgttttcaa agatatcagc 3000
atcaccgaaa acggcggtct tcttaagggt taccagctca cctatatccc cgacaagttg 3060
aagaatgttg gccaccaatg tgggtgtatc ttctacgtcc ccgccgccta caccagcaaa 3120
atagacccca caaccgggtt cgtgaatatt ttcaaattca aggacctcac agtcgacgcc 3180
aaaagagaat tcataaagaa gttcgactcc atcaggtacg attccgaaaa gaacctcttt 3240
tgcttcacct ttgactacaa caactttatc acccaaaaca ccgtgatgtc caaatccagc 3300
tggtctgttt atacctacgg cgttaggatc aagagaaggt tcgtcaatgg gagattctct 3360
aacgagtccg acaccattga catcaccaag gatatggaga aaaccctcga gatgaccgat 3420
attaactgga gggacggcca cgacctcaga caggacatca tcgactacga gatcgtccag 3480
cacatcttcg aaatcttcag actcaccgtt cagatgagga actctctctc cgagctcgag 3540
gacagggact acgacaggct catctcccca gtcctcaacg agaacaacat attttatgac 3600
tccgccaagg ctggtgacgc ccttcccaag gatgccgacg ccaacggggc ttattgcatc 3660
gctctcaagg gcctctatga aataaagcag atcaccgaga actggaagga ggacggtaag 3720
ttctctaggg acaagctcaa aatcagcaat aaagactggt tcgacttcat ccagaacaag 3780
aggtacctc 3789
<210> 94
<211> 3789
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 94
atgaacaacg ggaccaataa cttccaaaat ttcatcggta ttagcagttt gcaaaaaacc 60
ttgaggaacg ccctcatacc caccgaaaca acccagcagt tcatcgtgaa gaacggcata 120
atcaaggaag acgaactcag aggcgaaaac aggcaaatcc tcaaggacat catggatgac 180
tactacaggg gcttcatttc cgagaccctc tcctccatcg acgacatcga ctggacctca 240
ctctttgaga agatggagat ccagctcaag aatggcgata acaaggacac cttgataaag 300
gagcagaccg aatacaggaa ggctatccat aagaagtttg ccaacgacga caggtttaag 360
aacatgttta gcgctaagct catcagcgac attctccccg agttcgtcat ccataacaac 420
aattactccg cctccgagaa ggaggagaaa acccaggtca tcaagctctt ctccaggttc 480
gccaccagct tcaaggacta tttcaagaac agagccaact gttttagcgc cgatgacatc 540
tcctcctcca gctgccatag gatcgtcaac gataatgccg agatcttttt cagcaacgcc 600
ctcgtttaca ggagaatcgt gaagagcctt tccaatgacg acataaataa gatcagcggc 660
gacatgaagg acagccttaa ggaaatgagc ctcgaggaga tctacagcta tgaaaagtac 720
ggcgaattta tcacccagga gggaatcagt ttctacaacg atatctgcgg caaagttaac 780
agcttcatga acctctactg ccagaagaac aaggagaaca agaacctcta caaactccag 840
aagctccaca aacaaatttt gtgcatcgct gacaccagct acgaggttcc ttacaagttc 900
gagtccgatg aggaggtcta ccaaagcgtg aatggattcc tcgacaacat aagtagcaaa 960
cacatcgtcg agaggttgag gaagatcggc gacaactaca atggatacaa cctcgacaag 1020
atatacatcg tctcaaagtt ctacgagagc gtctctcaga agacctacag ggattgggag 1080
accatcaaca ccgcccttga gatccactac aacaacatac tccccggcaa cggcaagtcc 1140
aaagccgata aggttaagaa ggctgtgaag aacgacctcc agaagtccat taccgagatc 1200
aacgaactcg tgtccaacta caagttgtgc agcgatgaca atatcaaggc cgaaacatac 1260
atccacgaaa tctcccatat actcaacaac ttcgaggccc aggagctcaa atataacccc 1320
gagatccacc tcgtcgaatc cgagcttaaa gccagcgagc tcaagaacgt gctcgatgtc 1380
attatgaacg ccttccactg gtgctccgtt ttcatgaccg aagagctcgt cgataaggac 1440
aacaacttct acgccgagct cgaagagatc tacgacgaga tttaccccgt tatctcactt 1500
tacaacctcg tcaggaacta cgtgacccag aaaccttaca gtacaaaaaa gatcaagttg 1560
aattttggga tccccacact cgccgacggg tggagcaagt ccaaagagta ctccaacaac 1620
gccatcatcc tcatgaggga caacctttat tatctcggca tcttcaacgc caagaacaag 1680
cccgacaaaa agatcatcga ggggaacaca tccgaaaaca agggggatta caagaaaatg 1740
atctataacc tcctccccgg ccccaataaa atgatcccca aggtttttct ctcaagcaaa 1800
accggcgtcg aaacctataa gcccagcgcc tacatcctcg agggctacaa gcagaacaag 1860
cacataaagt cttccaagga tttcgacata accttctgtc acgacctcat agactatttc 1920
aaaaactgta tcgccatcca ccctgagtgg aaaaatttcg gctttgactt ctccgacacc 1980
tccacttacg aggacatctc cgggttttac agggaggtcg agctccaggg ctacaagatc 2040
gactggacct atattagtga aaaggacata gacctcttgc aggagaaggg gcaactctat 2100
ctcttccaga tctacaacaa ggactttagc aagaagagca cagggaacga caacctccac 2160
accatgtacc tcaagaacct cttcagtgag gagaatctca aggacatcgt tcttaaactc 2220
aatggggagg ccgagatctt ctttaggaaa agttccatta agaaccccat catccacaag 2280
aagggctcca tattggtcaa caggacctac gaggccgaag agaaggacca attcgggaac 2340
attcagatcg ttaggaagaa catccccgag aacatctacc aggagctcta caagtatttt 2400
aacgacaaga gtgacaaaga gctctccgat gaggccgcca aactcaaaaa cgtggtcggt 2460
caccacgagg ccgccactaa tatcgtgaag gattacaggt acacatacga taaatacttt 2520
ctccacatgc ccattaccat caatttcaag gctaacaaga ccggcttcat caacgacagg 2580
atactccagt acatcgcaaa agagaaagac ctccacgtca tcggcatcga cagaggggaa 2640
aggaacctta tctacgtcag cgttatagac acctgcggca acatcgttga gcagaagagc 2700
ttcaacattg tcaatggcta tgactatcag ataaagctca aacagcagga gggggccagg 2760
caaatcgcca ggaaggagtg gaaggagatc ggcaagatca aagaaatcaa ggagggctac 2820
ctctccctcg ttatccacga gatctccaaa atggtcatca agtacaatgc catcatcgca 2880
atggaggacc tcagctacgg cttcaagaag gggaggttca aggttgaaag gcaggtgtac 2940
cagaaatttg agacaatgct cattaacaag ttgaactatc tcgtttttaa ggacatcagc 3000
attaccgaga acggggggct cctcaagggg tatcagttga cttacatacc tgacaaactc 3060
aaaaatgtgg gccaccaatg cggctgcatc ttctacgttc cagccgccta cacctccaaa 3120
atcgatccaa ccaccggatt tgtcaatatc ttcaagttta aggacctcac agttgacgcc 3180
aagagggagt ttatcaagaa gttcgattcc atcaggtacg atagtgagaa gaacctcttc 3240
tgcttcacct ttgactataa taactttatc acacaaaata ccgttatgtc caaaagctcc 3300
tggtccgtgt acacctatgg cgttaggatt aaaaggaggt tcgtgaatgg cagattctca 3360
aacgagagtg acaccatcga tataactaaa gatatggaaa agacactcga aatgaccgat 3420
atcaactgga gggacgggca tgatctcagg caggacataa tcgactacga aatcgtccaa 3480
catatcttcg agatcttcag gctcaccgtt cagatgagga acagcctcag tgagctcgag 3540
gacagggact atgacaggct catcagcccc gttcttaacg agaacaatat cttttacgac 3600
tccgccaagg ctggagacgc cttgcccaag gacgccgacg ccaacggcgc ctactgcatc 3660
gccctcaagg ggctctacga gatcaaacag attaccgaga actggaagga ggacggcaag 3720
ttctctaggg acaagctcaa gatctccaat aaagactggt tcgacttcat ccagaacaag 3780
aggtacttg 3789
<210> 95
<211> 3789
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 95
atgaacaacg ggaccaacaa cttccagaac ttcatcggaa tcagctccct ccagaagacc 60
ctcaggaacg cactcatccc taccgaaacc actcagcagt tcatcgttaa aaatggcatc 120
atcaaagagg atgagcttag aggcgaaaat aggcagattc ttaaggacat catggacgac 180
tattacaggg gtttcatttc cgaaactctc tccagtatcg acgacatcga ttggacctcc 240
ctcttcgaaa agatggaaat ccagctcaag aatggtgaca acaaagacac cctcatcaag 300
gaacagaccg agtacaggaa ggctatccac aagaagtttg ccaacgatga cagattcaag 360
aatatgttca gtgccaagtt gatcagcgac atcctccccg agttcgtcat ccacaacaac 420
aattactctg cttccgagaa agaggagaag acccaggtca taaagctctt ctcaaggttc 480
gctacatcct tcaaggacta tttcaaaaac agggctaact gctttagcgc cgacgacatc 540
agcagctctt cctgccacag gatcgtgaat gacaatgccg aaatcttctt cagcaacgcc 600
ctcgtctaca ggaggatagt caagtccctc agcaacgacg acatcaacaa gatcagcggc 660
gacatgaagg acagcttgaa ggagatgtct ctcgaggaga tctacagcta cgagaagtac 720
ggggagttta tcacccagga agggatttcc ttttacaacg acatttgcgg gaaggttaac 780
agctttatga atctctactg ccagaaaaac aaggagaaca agaaccttta taagttgcaa 840
aaacttcaca agcagatcct ctgcatcgca gatacctcct atgaggttcc ctataagttc 900
gagagcgacg aagaggtcta ccagtccgtc aacggcttcc tcgataacat cagcagcaag 960
cacatagttg agagactcag gaagatcggc gataattaca acggctacaa tctcgacaag 1020
atttatatcg tgagcaagtt ttatgaatcc gtgagccaaa aaacctacag ggactgggaa 1080
accattaata ctgctctcga gatccactat aacaatatcc tcccagggaa cggtaagtca 1140
aaggccgaca aggtgaagaa ggccgtcaag aacgatctcc agaagagcat cactgagatc 1200
aatgagctcg tctccaacta taagctctgc tccgacgaca acatcaaggc cgaaacctac 1260
atccacgaga tctcccatat cctcaacaat ttcgaggccc aagagctcaa gtacaacccc 1320
gagatccact tggtggagtc cgaactcaag gccagtgagc tcaaaaacgt gctcgacgtt 1380
atcatgaacg catttcactg gtgctccgtg ttcatgaccg aagagctcgt ggacaaagac 1440
aacaacttct atgccgagct cgaggagatc tacgacgaga tctatcccgt gatctccctc 1500
tacaatctcg tgaggaacta cgtcacccag aaaccttaca gtaccaaaaa aattaagctc 1560
aacttcggca tccccaccct cgccgatggt tggagcaagt ccaaggagta ctccaataac 1620
gctatcatcc tcatgaggga caatctctac tacctcggga tcttcaacgc caagaataag 1680
cccgacaaga aaatcatcga gggaaacaca tccgagaaca agggcgatta caagaaaatg 1740
atatataacc tcctccccgg acctaacaag atgattccca aggttttcct cagcagcaag 1800
accggggtgg agacctataa acccagtgcc tacatcctcg agggctataa gcaaaataag 1860
cacattaagt ccagcaagga cttcgacatc accttctgtc acgacctcat cgactacttc 1920
aagaattgca tcgccatcca ccccgagtgg aaaaatttcg gtttcgactt cagcgacacc 1980
tctacctacg aagacatctc cgggttctac agagaggtcg agctccaggg ctacaagatc 2040
gactggacct acatcagcga gaaggacatc gacctcctcc aagagaaggg acagctctac 2100
ctcttccaaa tatacaacaa ggacttctct aaaaagagca ccggcaacga caacctccat 2160
accatgtact tgaaaaacct cttctcagag gagaacctca aagacatcgt gcttaaactt 2220
aacggcgaag ccgaaatctt cttcagaaag agctccatca agaaccccat tatccacaag 2280
aagggatcaa tcctcgtcaa caggacctac gaggccgagg agaaggacca attcggcaat 2340
atccaaatcg tcaggaagaa catccccgaa aacatctacc aggaactcta taaatatttc 2400
aatgacaagt ccgacaagga gctctcagat gaggccgcca aattgaagaa tgtcgtgggc 2460
caccatgagg ccgcaactaa cattgttaaa gactataggt acacctatga taagtacttc 2520
ctccatatgc ccattaccat caattttaaa gccaacaaaa ccggtttcat caatgatagg 2580
attctccagt acatcgctaa ggagaaggac ctccacgtca taggtatcga caggggcgag 2640
aggaacctta tctacgtttc cgtcatcgac acctgcggca atattgtgga gcagaagagt 2700
tttaatatcg tcaatggcta cgactaccag atcaaactta agcagcaaga gggcgccagg 2760
cagattgcta gaaaggaatg gaaggaaatc ggcaagatca aggagattaa ggaggggtac 2820
ttgagcctcg tcattcacga gatcagcaag atggtcataa agtacaacgc catcatagcc 2880
atggaagatc tcagttatgg gttcaagaag gggaggttca aggttgagag gcaggtttac 2940
cagaagttcg agacaatgct catcaacaag ctcaactacc tcgtctttaa ggacatctcc 3000
attaccgaaa acggtggcct cctcaagggt taccaactca cctacatccc cgataagctt 3060
aagaacgtgg gccaccagtg cggctgtatc ttctatgttc ccgccgccta caccagtaag 3120
atcgacccaa caaccggctt cgtgaacatc ttcaaattca aggacctcac cgtcgacgcc 3180
aagagagagt tcatcaagaa gttcgactcc atcagatacg actccgagaa gaacctcttc 3240
tgcttcacct tcgactacaa caatttcatc acccaaaaca ccgtcatgag caagagctcc 3300
tggtcagtgt acacttacgg ggttaggatc aagaggaggt tcgtgaacgg caggttcagc 3360
aacgagagcg acaccatcga catcaccaag gatatggaga agaccctcga gatgaccgat 3420
atcaattgga gggacggtca tgacctcagg caggacatta tcgactacga gatcgtccag 3480
cacatcttcg aaatctttag gctcaccgtg cagatgagaa actccctctc cgagctcgag 3540
gacagggact acgacaggct catctctccc gtgctcaacg agaacaacat attctacgat 3600
tccgccaagg ccggggatgc cctccccaag gacgccgacg ccaacggagc atattgtata 3660
gccctcaagg gtctctacga gatcaagcag atcacagaaa actggaaaga ggacggcaag 3720
ttctccaggg acaagctcaa gataagcaac aaggactggt tcgacttcat acaaaacaag 3780
aggtacctc 3789
<210> 96
<211> 3789
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 96
atgaacaacg ggactaacaa cttccagaat ttcatcggga taagctccct ccagaaaact 60
ctcagaaacg ccctcatacc cactgagacc acccaacaat tcatcgttaa aaacggaata 120
atcaaagagg acgagttgag gggcgagaac aggcaaatcc tcaaggacat aatggacgac 180
tattacaggg gcttcatctc cgagaccctc agctcaatag atgatataga ctggacctct 240
ctcttcgaaa aaatggagat ccaattgaag aacggtgaca acaaagacac cctcatcaag 300
gagcagaccg agtacaggaa ggcaattcac aaaaagtttg ccaacgacga tagattcaaa 360
aacatgtttt cagccaagtt gatctccgac atcctccctg agtttgtgat ccacaacaac 420
aactacagcg cctctgaaaa ggaagagaag acccaggtga tcaagctttt ctcaaggttt 480
gccaccagct ttaaggatta cttcaagaac agggctaact gtttctcagc cgacgacatc 540
tccagctcat cttgccacag gatcgtcaat gacaacgccg agatattttt cagcaacgct 600
ctcgtgtata gaagaatcgt gaagtccctc tctaacgatg acatcaacaa gatcagcggt 660
gacatgaagg acagtctcaa agagatgtca ctcgaagaga tttacagtta cgagaaatat 720
ggcgaattca tcacccagga gggcatcagc ttctacaatg atatctgcgg gaaggtcaac 780
agcttcatga atctctactg ccagaagaac aaggagaata agaacctcta caaacttcaa 840
aagctccaca agcagatcct ctgcattgca gacacctcct acgaagtgcc ctacaaattt 900
gagagtgatg aggaggtcta ccaatcagtt aatggcttcc tcgataacat cagttccaag 960
cacatcgtcg agaggctcag gaaaatcggt gataattaca atggttataa cctcgataaa 1020
atatacatcg tgagcaagtt ctacgaatcc gttagccaga agacctacag ggactgggag 1080
accatcaaca ccgccctcga aatccactac aacaacatcc tccctggaaa cggcaagtcc 1140
aaggctgata aggtcaagaa ggccgttaag aacgacctcc aaaagagcat cacagaaatc 1200
aatgagctcg tgagtaacta taagttgtgt tccgacgata acatcaaggc cgaaacctac 1260
atccacgaga tctctcacat cctcaacaac ttcgaggctc aggagctcaa gtacaacccc 1320
gagattcacc tcgtcgagag cgagctcaag gcatcagagc tcaagaacgt cttggacgtg 1380
atcatgaacg ccttccactg gtgtagtgtg ttcatgacag aggagctcgt cgacaaagac 1440
aacaatttct acgcagagtt ggaggagatc tacgacgaga tatacccagt gatctctctc 1500
tataacctcg ttaggaatta cgtgacgcag aagccctact ccaccaagaa aatcaaactc 1560
aacttcggca tccccaccct cgctgacggg tggagtaagt ccaaggagta ctccaacaac 1620
gccattattc tcatgaggga caatctctac tatctcggga tcttcaacgc caagaataaa 1680
cccgacaaga aaatcatcga gggtaacacc tccgagaaca agggggacta caaaaagatg 1740
atctataacc tcctccctgg ccctaacaag atgatcccca aggtgttcct cagcagcaag 1800
accggcgtgg agacctacaa gcccagcgcc tacatcctcg agggatataa gcagaataag 1860
catatcaaaa gcagcaagga cttcgacatc accttctgcc atgacctcat cgattacttc 1920
aagaattgca ttgccatcca tccagaatgg aagaatttcg gctttgactt tagcgacacc 1980
tccacctacg aggacatctc tggcttttac agggaggtgg agctccaggg atacaagatc 2040
gattggacct acattagcga aaaggacatc gatctcctgc aggagaaggg gcagctctat 2100
cttttccaga tttacaacaa ggacttcagc aaaaagagca ccggcaacga caacttgcat 2160
accatgtacc tcaagaacct cttcagcgag gagaatctca aggatatcgt gctcaaactt 2220
aacggcgagg ccgaaatttt tttcaggaaa agctccatca agaaccccat aatccacaag 2280
aaggggagca tccttgtgaa tagaacctac gaggccgagg aaaaggacca attcggcaac 2340
atacaaatcg tcaggaagaa catccccgag aacatctatc aggagctcta taagtacttc 2400
aacgacaagt ctgacaaaga gctcagcgat gaggccgcca aattgaaaaa cgtcgtgggc 2460
caccatgagg cagccaccaa catcgtgaag gactacaggt acacctatga caaatatttc 2520
ctccacatgc ccatcaccat caactttaag gccaataaga ccgggttcat caacgacagg 2580
atattgcagt atatagctaa ggagaaagac ctccacgtta taggcataga cagaggcgag 2640
aggaacctca tttacgtgtc tgtcatagac acctgcggga acattgtcga gcagaagagc 2700
ttcaacatcg tgaacgggta cgactaccaa attaagctca agcagcaaga gggcgccagg 2760
cagatcgcca ggaaggagtg gaaggagatc ggcaagatca aggagataaa ggaaggctac 2820
ctttccctcg tcattcatga gatctccaaa atggtcatca agtacaatgc cataatcgcc 2880
atggaggacc tcagctatgg cttcaaaaag ggcagattta aagtggaaag acaggtgtac 2940
cagaagttcg agacaatgct catcaacaag ctcaattacc tcgtgtttaa ggatataagt 3000
attaccgaga atggcggact cctcaaaggg taccagctca cctacatccc cgataaactc 3060
aagaacgtgg ggcaccagtg cgggtgcata ttctacgttc ccgccgccta cactagcaag 3120
attgacccca ccaccgggtt tgttaatatc ttcaagttta aggacctcac cgtcgacgcc 3180
aagagagaat tcatcaaaaa gttcgattcc atcagatatg acagcgagaa gaatctcttt 3240
tgcttcacct tcgactataa taacttcatc actcagaaca ccgttatgtc caagagcagt 3300
tggagcgtgt acacatacgg ggtgaggatc aagaggaggt tcgtcaacgg gaggttctcc 3360
aacgagagcg ataccataga cattaccaaa gacatggaga agaccctcga gatgaccgac 3420
atcaactgga gggacggtca cgacctcagg caggacatta tcgattacga gatcgtgcaa 3480
cacatcttcg agatctttag gctcaccgtt caaatgagaa acagcctcag cgagctcgag 3540
gacagggact acgacaggct catcagcccc gttctcaacg aaaacaacat cttttacgac 3600
tccgccaagg ccggggacgc tctccccaag gatgccgacg ccaatggcgc atactgcatc 3660
gccctcaaag ggctctacga aatcaaacaa atcaccgaga attggaagga ggacgggaaa 3720
ttctctaggg acaagctcaa gatctccaac aaagactggt tcgatttcat ccagaacaag 3780
aggtaccta 3789
<210> 97
<211> 986
<212> PRT
<213> Unknown
<220>
<223> deltaproteobacterium
<400> 97
Met Glu Lys Arg Ile Asn Lys Ile Arg Lys Lys Leu Ser Ala Asp Asn
1 5 10 15
Ala Thr Lys Pro Val Ser Arg Ser Gly Pro Met Lys Thr Leu Leu Val
20 25 30
Arg Val Met Thr Asp Asp Leu Lys Lys Arg Leu Glu Lys Arg Arg Lys
35 40 45
Lys Pro Glu Val Met Pro Gln Val Ile Ser Asn Asn Ala Ala Asn Asn
50 55 60
Leu Arg Met Leu Leu Asp Asp Tyr Thr Lys Met Lys Glu Ala Ile Leu
65 70 75 80
Gln Val Tyr Trp Gln Glu Phe Lys Asp Asp His Val Gly Leu Met Cys
85 90 95
Lys Phe Ala Gln Pro Ala Ser Lys Lys Ile Asp Gln Asn Lys Leu Lys
100 105 110
Pro Glu Met Asp Glu Lys Gly Asn Leu Thr Thr Ala Gly Phe Ala Cys
115 120 125
Ser Gln Cys Gly Gln Pro Leu Phe Val Tyr Lys Leu Glu Gln Val Ser
130 135 140
Glu Lys Gly Lys Ala Tyr Thr Asn Tyr Phe Gly Arg Cys Asn Val Ala
145 150 155 160
Glu His Glu Lys Leu Ile Leu Leu Ala Gln Leu Lys Pro Glu Lys Asp
165 170 175
Ser Asp Glu Ala Val Thr Tyr Ser Leu Gly Lys Phe Gly Gln Arg Ala
180 185 190
Leu Asp Phe Tyr Ser Ile His Val Thr Lys Glu Ser Thr His Pro Val
195 200 205
Lys Pro Leu Ala Gln Ile Ala Gly Asn Arg Tyr Ala Ser Gly Pro Val
210 215 220
Gly Lys Ala Leu Ser Asp Ala Cys Met Gly Thr Ile Ala Ser Phe Leu
225 230 235 240
Ser Lys Tyr Gln Asp Ile Ile Ile Glu His Gln Lys Val Val Lys Gly
245 250 255
Asn Gln Lys Arg Leu Glu Ser Leu Arg Glu Leu Ala Gly Lys Glu Asn
260 265 270
Leu Glu Tyr Pro Ser Val Thr Leu Pro Pro Gln Pro His Thr Lys Glu
275 280 285
Gly Val Asp Ala Tyr Asn Glu Val Ile Ala Arg Val Arg Met Trp Val
290 295 300
Asn Leu Asn Leu Trp Gln Lys Leu Lys Leu Ser Arg Asp Asp Ala Lys
305 310 315 320
Pro Leu Leu Arg Leu Lys Gly Phe Pro Ser Phe Pro Val Val Glu Arg
325 330 335
Arg Glu Asn Glu Val Asp Trp Trp Asn Thr Ile Asn Glu Val Lys Lys
340 345 350
Leu Ile Asp Ala Lys Arg Asp Met Gly Arg Val Phe Trp Ser Gly Val
355 360 365
Thr Ala Glu Lys Arg Asn Thr Ile Leu Glu Gly Tyr Asn Tyr Leu Pro
370 375 380
Asn Glu Asn Asp His Lys Lys Arg Glu Gly Ser Leu Glu Asn Pro Lys
385 390 395 400
Lys Pro Ala Lys Arg Gln Phe Gly Asp Leu Leu Leu Tyr Leu Glu Lys
405 410 415
Lys Tyr Ala Gly Asp Trp Gly Lys Val Phe Asp Glu Ala Trp Glu Arg
420 425 430
Ile Asp Lys Lys Ile Ala Gly Leu Thr Ser His Ile Glu Arg Glu Glu
435 440 445
Ala Arg Asn Ala Glu Asp Ala Gln Ser Lys Ala Val Leu Thr Asp Trp
450 455 460
Leu Arg Ala Lys Ala Ser Phe Val Leu Glu Arg Leu Lys Glu Met Asp
465 470 475 480
Glu Lys Glu Phe Tyr Ala Cys Glu Ile Gln Leu Gln Lys Trp Tyr Gly
485 490 495
Asp Leu Arg Gly Asn Pro Phe Ala Val Glu Ala Glu Asn Arg Val Val
500 505 510
Asp Ile Ser Gly Phe Ser Ile Gly Ser Asp Gly His Ser Ile Gln Tyr
515 520 525
Arg Asn Leu Leu Ala Trp Lys Tyr Leu Glu Asn Gly Lys Arg Glu Phe
530 535 540
Tyr Leu Leu Met Asn Tyr Gly Lys Lys Gly Arg Ile Arg Phe Thr Asp
545 550 555 560
Gly Thr Asp Ile Lys Lys Ser Gly Lys Trp Gln Gly Leu Leu Tyr Gly
565 570 575
Gly Gly Lys Ala Lys Val Ile Asp Leu Thr Phe Asp Pro Asp Asp Glu
580 585 590
Gln Leu Ile Ile Leu Pro Leu Ala Phe Gly Thr Arg Gln Gly Arg Glu
595 600 605
Phe Ile Trp Asn Asp Leu Leu Ser Leu Glu Thr Gly Leu Ile Lys Leu
610 615 620
Ala Asn Gly Arg Val Ile Glu Lys Thr Ile Tyr Asn Lys Lys Ile Gly
625 630 635 640
Arg Asp Glu Pro Ala Leu Phe Val Ala Leu Thr Phe Glu Arg Arg Glu
645 650 655
Val Val Asp Pro Ser Asn Ile Lys Pro Val Asn Leu Ile Gly Val Asp
660 665 670
Arg Gly Glu Asn Ile Pro Ala Val Ile Ala Leu Thr Asp Pro Glu Gly
675 680 685
Cys Pro Leu Pro Glu Phe Lys Asp Ser Ser Gly Gly Pro Thr Asp Ile
690 695 700
Leu Arg Ile Gly Glu Gly Tyr Lys Glu Lys Gln Arg Ala Ile Gln Ala
705 710 715 720
Ala Lys Glu Val Glu Gln Arg Arg Ala Gly Gly Tyr Ser Arg Lys Phe
725 730 735
Ala Ser Lys Ser Arg Asn Leu Ala Asp Asp Met Val Arg Asn Ser Ala
740 745 750
Arg Asp Leu Phe Tyr His Ala Val Thr His Asp Ala Val Leu Val Phe
755 760 765
Glu Asn Leu Ser Arg Gly Phe Gly Arg Gln Gly Lys Arg Thr Phe Met
770 775 780
Thr Glu Arg Gln Tyr Thr Lys Met Glu Asp Trp Leu Thr Ala Lys Leu
785 790 795 800
Ala Tyr Glu Gly Leu Thr Ser Lys Thr Tyr Leu Ser Lys Thr Leu Ala
805 810 815
Gln Tyr Thr Ser Lys Thr Cys Ser Asn Cys Gly Phe Thr Ile Thr Thr
820 825 830
Ala Asp Tyr Asp Gly Met Leu Val Arg Leu Lys Lys Thr Ser Asp Gly
835 840 845
Trp Ala Thr Thr Leu Asn Asn Lys Glu Leu Lys Ala Glu Gly Gln Ile
850 855 860
Thr Tyr Tyr Asn Arg Tyr Lys Arg Gln Thr Val Glu Lys Glu Leu Ser
865 870 875 880
Ala Glu Leu Asp Arg Leu Ser Glu Glu Ser Gly Asn Asn Asp Ile Ser
885 890 895
Lys Trp Thr Lys Gly Arg Arg Asp Glu Ala Leu Phe Leu Leu Lys Lys
900 905 910
Arg Phe Ser His Arg Pro Val Gln Glu Gln Phe Val Cys Leu Asp Cys
915 920 925
Gly His Glu Val His Ala Asp Glu Gln Ala Ala Leu Asn Ile Ala Arg
930 935 940
Ser Trp Leu Phe Leu Asn Ser Asn Ser Thr Glu Phe Lys Ser Tyr Lys
945 950 955 960
Ser Gly Lys Gln Pro Phe Val Gly Ala Trp Gln Ala Phe Tyr Lys Arg
965 970 975
Arg Leu Lys Glu Val Trp Lys Pro Asn Ala
980 985
<210> 98
<211> 2958
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 98
atggagaaga ggattaacaa gattaggaag aaattatccg ctgataacgc aaccaaacca 60
gtttcccgaa gcggcccaat gaagactctg ctcgttagag tgatgacaga cgatcttaag 120
aaaagactcg agaagcgtag aaagaagccg gaggttatgc cccaggtgat ttctaataac 180
gcagcaaaca atcttcgaat gttgttggac gattatacta aaatgaagga agccatcctt 240
caggtgtact ggcaggaatt taaagatgac cacgtgggtc ttatgtgcaa attcgcgcag 300
cccgcaagca agaagattga ccagaataaa ctcaagccag agatggacga aaagggcaat 360
ttgaccactg cagggttcgc ttgttctcaa tgtggacagc cgttgttcgt ttataagctc 420
gaacaggtga gtgagaaagg aaaggcgtat accaattact ttgggagatg taatgtggct 480
gagcatgaga agttaattct tctcgctcaa ttgaagcctg agaaagacag tgatgaggct 540
gtgacatact ctttgggtaa atttgggcaa cgggcattag atttttattc catccacgtg 600
actaaagaat caacccaccc agtgaagcct ctagctcaaa tcgctgggaa caggtacgcc 660
tcaggcccag taggaaaagc gctgtcagac gcatgtatgg gcactatcgc atccttcttg 720
agtaagtatc aggatattat catagagcac cagaaagtcg tgaagggtaa tcagaagaga 780
ttagaaagtc tcagagaatt agcgggtaaa gagaatttag aatacccatc agttacattg 840
ccaccgcagc cacatactaa ggagggcgtg gatgcctata acgaggtaat cgcaagggtt 900
cggatgtggg ttaacctaaa tttatggcaa aaacttaaac tgagtaggga cgatgctaag 960
cccttactcc gattgaaggg gtttccatct tttcctgtgg tagaacgccg cgagaatgag 1020
gtcgattggt ggaatacaat aaacgaggta aagaagctga ttgatgcaaa gcgcgatatg 1080
ggtcgagtgt tctggtctgg ggtgacggcc gagaagcgca ataccatatt agagggttac 1140
aactatttgc caaacgaaaa tgatcacaaa aaacgtgagg gttccttgga gaatcccaaa 1200
aagcctgcca agcgtcaatt cggggatttg ttgttgtatc tagagaaaaa atatgcagga 1260
gactggggaa aagtcttcga cgaggcctgg gaacggatcg acaaaaaaat agcagggctt 1320
acttcacata ttgaaaggga agaagctaga aacgcggagg acgctcaatc aaaggcagtg 1380
cttaccgatt ggctcagagc aaaggcatca ttcgttttag aacgattgaa ggaaatggac 1440
gagaaggaat tttacgcttg cgaaattcaa ttacaaaagt ggtacggtga cctccgtggt 1500
aacccctttg ctgtggaggc agagaacagg gttgtagata tctctggatt ttctattggt 1560
agtgatggtc acagtattca gtataggaat ttactagcat ggaaatacct tgagaacggc 1620
aagagagagt tctacttact aatgaattac ggcaagaaag gcaggattcg ctttaccgat 1680
ggaactgata ttaaaaagag tggcaaatgg caagggcttc tatatggagg gggtaaggct 1740
aaagtgattg atttaacctt tgatcctgac gacgaacaac taattattct gcctctagcg 1800
tttggaactc gccaaggaag agaatttatc tggaacgact tgttgtcctt agagaccgga 1860
ctcatcaagc ttgcaaacgg cagagtaata gaaaagacaa tatataacaa aaagattggg 1920
agagatgaac cggctctctt cgttgcatta acattcgaga ggcgggaggt ggtggatcca 1980
tctaacataa agccggtaaa cttaattggc gtggatcgtg gtgaaaatat tccagctgtc 2040
atcgcattga cagacccaga gggttgccca ctgcctgaat tcaaagactc ttcaggtgga 2100
cccacagata ttctccgaat aggggagggt tacaaggaga agcagcgtgc tattcaagct 2160
gctaaagagg ttgagcagag gagggccggg ggttactccc gtaaattcgc ctctaaatct 2220
cgaaacttgg ccgacgatat ggttcggaat tctgctagag atctatttta ccatgctgtt 2280
actcacgatg cagtcttggt gttcgagaat ttgtccaggg gtttcggtag acaaggaaag 2340
agaacattta tgaccgaaag acaatatacg aagatggaag actggctcac agctaagttg 2400
gcatatgagg gactgacatc caaaacttac ctatcgaaga cccttgcgca atatacgtcc 2460
aagacttgct ctaactgcgg atttactatt acgacggctg actatgatgg gatgcttgtg 2520
agattaaaaa agacctcgga tggttgggcc acaacattga acaataaaga gttgaaagct 2580
gagggccaaa taacttacta taataggtac aaacgccaaa cagtggaaaa ggagttgtcc 2640
gcagagttag acaggctttc tgaagaatcg gggaacaatg acatttcgaa gtggactaaa 2700
gggcgtcgtg acgaggcatt attcttgctt aagaaaagat tctcccatag accagtgcag 2760
gagcagtttg tgtgcctgga ttgcggacac gaggttcacg cagatgagca agccgcattg 2820
aacattgcca ggtcgtggct ttttctgaac tctaatagca ccgaattcaa gtcatataag 2880
tcggggaaac aaccctttgt aggggcatgg caagcttttt ataagagaag gcttaaggag 2940
gtatggaagc ccaatgca 2958
<210> 99
<211> 2958
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 99
atggagaaga gaatcaataa gatcaggaag aagctctccg ccgacaatgc caccaagcct 60
gtgtccagat caggccccat gaaaaccctc ctcgtgaggg tgatgaccga cgaccttaag 120
aagagactcg agaagagaag gaagaagccc gaagtgatgc cccaggtcat ttcaaacaac 180
gccgccaaca atctcaggat gctcctcgac gactacacca agatgaagga agccatcctc 240
caggtctact ggcaggagtt caaggacgac cacgtggggt tgatgtgcaa gttcgcccag 300
cccgcttcca aaaaaatcga ccaaaacaag ctcaaacctg agatggacga gaagggcaac 360
ctcacaaccg ccggtttcgc ctgctcccag tgcggacagc cactcttcgt ttacaagttg 420
gagcaggtta gcgagaaggg caaggcttac accaactact tcggcaggtg caatgtcgcc 480
gagcatgaaa agcttatcct tctcgcccaa ctcaagccag agaaagacag cgacgaagca 540
gttacttact ctctcggtaa gtttggtcaa agggccctcg atttttacag catccatgtc 600
accaaggagt ccactcatcc cgtgaagcct ctcgcccaaa tcgccgggaa caggtatgcc 660
agcgggcccg tcgggaaggc actcagcgat gcctgcatgg gcacaatcgc cagcttcctc 720
agcaagtacc aggatatcat catcgaacac cagaaggtcg tgaaaggcaa ccaaaaaagg 780
cttgagagcc tcagggagct cgccggcaag gagaatttgg aatatcccag cgtgactctc 840
cccccccagc cacacaccaa ggagggggtc gacgcttaca atgaggttat agccagggtc 900
agaatgtggg ttaacctcaa cctctggcag aaactcaaac tcagtaggga cgacgccaaa 960
cccctcctta ggctcaaagg gttccccagc tttccagtcg tggaaagaag ggagaacgag 1020
gttgactggt ggaatacaat caacgaggtg aagaaactca tcgacgccaa gagggatatg 1080
ggcagggtct tctggagcgg ggtcaccgcc gaaaagagaa acaccatcct cgaagggtat 1140
aactaccttc ccaatgagaa cgatcacaag aagagagagg gcagcctcga aaaccccaag 1200
aagccagcca agaggcaatt cggggatctc ctcttgtacc tcgagaagaa gtacgctgga 1260
gactggggga aggttttcga cgaagcctgg gagaggatcg acaaaaaaat tgccgggctt 1320
acctcccata ttgagaggga agaggctaga aacgccgagg acgcacagtc taaggccgtc 1380
ctcaccgact ggcttagggc caaagcctct ttcgtcctcg agaggctcaa ggagatggac 1440
gagaaggagt tctacgcctg cgagatccaa ctccaaaagt ggtacggaga cctcaggggt 1500
aaccccttcg ccgtggaggc cgaaaatagg gtcgtggata tctccggatt cagcataggt 1560
agcgacggtc actccatcca gtatagaaac ttgttggcct ggaagtactt ggagaacggt 1620
aagagggagt tctaccttct catgaactac ggcaagaagg gcaggatcag gtttaccgat 1680
gggaccgata tcaagaagtc cggaaagtgg caggggctcc tctacggcgg aggtaaggcc 1740
aaggttatcg atctcacctt cgacccagac gacgagcaac tcatcatcct cccccttgcc 1800
tttgggacaa ggcagggaag agagttcatc tggaatgacc tcctttccct cgagaccggc 1860
cttatcaagc tcgccaacgg tagggtcatc gagaagacca tctacaacaa aaagatcggg 1920
agggacgagc ccgccttgtt cgttgcactc acctttgaga ggagggaagt ggtcgaccct 1980
agcaacatca agcctgtgaa tcttatcgga gtggacaggg gggagaacat acccgccgtg 2040
atagctttga ccgaccccga gggttgtccc cttcccgagt tcaaggactc atcagggggg 2100
cctaccgaca tcctcaggat cggggagggc tacaaggaga agcagagggc catccaggcc 2160
gccaaagagg ttgagcagag gagagccggg gggtacagca ggaaattcgc cagcaaatcc 2220
aggaacctcg ccgacgacat ggttaggaac agcgctaggg atctcttcta ccatgccgtt 2280
actcacgacg ccgtgctcgt tttcgaaaac ctctccaggg ggttcggtag acagggcaag 2340
agaactttca tgaccgaaag gcaatatacc aagatggagg actggctcac cgccaaactc 2400
gcctacgagg ggctcacttc taagacctac ctcagcaaga ccctcgccca gtatacctca 2460
aagacttgct ccaactgcgg gtttacaatt accaccgctg attacgacgg catgctcgtc 2520
aggcttaaaa agaccagcga cgggtgggcc accaccctca ataacaagga gctcaaggcc 2580
gagggccaga taacctacta caacaggtac aaaaggcaga ccgtcgaaaa agagttgtcc 2640
gctgagctcg acaggctctc cgaggagtcc ggcaacaacg acatcagcaa gtggaccaag 2700
ggaaggagag atgaggccct cttcttgctc aaaaaaaggt tcagccacag gcccgtccag 2760
gagcagtttg tgtgcctcga ctgcggccac gaggtgcacg ccgatgaaca agccgccttg 2820
aatatagcca ggagctggct tttcctcaac tccaattcaa ccgaattcaa gagctacaaa 2880
agcgggaaac aacccttcgt tggcgcatgg caggctttct acaagaggag gttgaaggag 2940
gtgtggaagc ccaacgcc 2958
<210> 100
<211> 2958
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 100
atggagaaga ggattaacaa gatcaggaag aagctctccg ccgacaacgc taccaagccc 60
gtttccaggt caggtcccat gaaaaccctc ctcgtgagag ttatgaccga tgacctcaaa 120
aagaggctcg aaaagaggag gaagaagccc gaggtgatgc cccaggtgat aagtaacaat 180
gctgccaata atctcaggat gctcctcgac gactacacca agatgaagga agcaatcctc 240
caggtttact ggcaggaatt caaggacgac cacgtgggcc tcatgtgcaa gttcgcccag 300
cccgcctcca agaagatcga ccaaaacaag ctcaaaccag agatggacga gaagggcaac 360
ctcaccaccg caggcttcgc ctgcagccag tgcggccagc cactcttcgt ctacaagctc 420
gaacaggtca gcgaaaaggg gaaggcctat accaattact tcggtaggtg caacgtcgcc 480
gagcacgaga agctcatcct tctcgcccaa ctcaagcccg agaaggactc cgacgaggcc 540
gtcacctaca gcttggggaa attcgggcag agggccctcg atttctattc aatccacgtt 600
accaaggagt ccacccatcc cgtgaagcct ctcgcccaaa tcgccggcaa taggtacgca 660
tccggccccg tcggaaaggc cttgtccgac gcttgtatgg ggaccattgc aagcttcctc 720
agcaaatacc aggacatcat catcgagcac cagaaggttg tcaaagggaa tcagaagagg 780
ctcgagagct tgagggagct tgccggtaag gagaacttgg aataccccag cgttacactc 840
cccccccagc ctcacacaaa ggagggggtc gacgcctata acgaggtcat cgccagagtc 900
aggatgtggg tcaatcttaa cctttggcag aagcttaaac tctccaggga cgacgccaaa 960
cccctcctca gactcaaggg gtttcccagc ttccccgtgg ttgaaagaag ggagaacgag 1020
gtggactggt ggaatacaat caacgaggtc aagaagctca tagatgctaa gagggatatg 1080
gggagggtct tctggtccgg agttaccgcc gagaaaagga acaccatcct cgagggttat 1140
aactacctcc ccaacgaaaa tgaccacaag aaaagggaag gaagcctcga gaaccctaag 1200
aagcccgcca agaggcaatt cggggacctc ctcctttacc tcgagaaaaa gtacgccgga 1260
gactggggta aggtttttga cgaagcctgg gagaggatcg acaagaagat cgccggactc 1320
acctcccaca tcgaaaggga ggaggccagg aacgctgagg acgcacaatc caaggccgtt 1380
ctcaccgatt ggcttagggc caaggccagc tttgtccttg agagacttaa ggaaatggac 1440
gagaaggagt tttacgcctg tgaaatccag ttgcaaaagt ggtacggtga tctcaggggc 1500
aaccctttcg cagtggaagc cgaaaataga gtggtcgaca tcagtggttt cagcattggc 1560
tccgatggac attccattca atacaggaat ctcctcgcct ggaagtacct cgagaacggt 1620
aagagggagt tctacttgct catgaactac ggcaaaaagg ggagaattag gttcaccgac 1680
ggcactgaca tcaagaaatc aggaaagtgg caggggttgc tctatggggg tggaaaggca 1740
aaagttattg acttgacctt cgaccccgat gacgaacaac tcatcatcct ccccctcgcc 1800
ttcgggacaa ggcagggcag ggagttcatc tggaacgacc tcctcagcct tgagaccggg 1860
ctcatcaagc tcgccaacgg cagagtgatc gagaagacca tctataacaa gaagatcggc 1920
agagatgagc ccgctctctt cgttgccctc actttcgaga gaagggaggt ggtcgacccc 1980
tccaacataa aacccgtgaa cctcatcggt gtggacagag gggagaatat ccccgccgtt 2040
atcgccctca ccgaccccga gggctgcccc ctccccgagt tcaaggacag cagcggcggc 2100
ccaactgata ttctcaggat cggggagggg tataaggaga agcagagggc catccaagcc 2160
gccaaggagg tggagcaaag gagggctggg gggtatagca ggaaattcgc ctcaaagtcc 2220
aggaacctcg ccgatgatat ggtgaggaac tcagctaggg acctctttta ccacgccgtg 2280
acccacgacg ctgttctcgt gttcgagaac ctcagcagag gttttggtag gcaggggaaa 2340
agaaccttta tgacagagag gcagtatacc aagatggagg attggttgac agctaagctc 2400
gcctatgagg gccttacttc aaagacctac ctcagcaaga ccctcgccca gtatacctca 2460
aagacctgca gcaactgtgg gttcaccatt accaccgccg actacgacgg gatgctcgtc 2520
aggcttaaga agaccagcga cggttgggca accaccctca acaacaaaga gctcaaggcc 2580
gaagggcaga tcacctacta caacaggtat aagaggcaga ccgtcgaaaa agaattgagc 2640
gccgagctcg acagactctc tgaggagagc ggcaacaacg acattagcaa gtggaccaag 2700
gggagaagag atgaggccct tttcctcctc aagaaaaggt tcagccatag gcccgtgcag 2760
gagcagttcg tttgcctcga ctgtggtcat gaggtgcatg cagacgagca ggccgctctc 2820
aacatcgcca ggtcctggct ctttcttaat agcaactcta ccgagttcaa gagctacaag 2880
agcggcaagc aacccttcgt gggggcctgg caggcctttt acaagaggag gttgaaggag 2940
gtttggaagc ccaacgcc 2958
<210> 101
<211> 2958
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 101
atggaaaaga ggatcaataa gataaggaag aaactctccg ccgacaatgc caccaagccc 60
gttagcaggt ccggtcccat gaagaccctc cttgtcaggg ttatgaccga tgatctcaag 120
aaaaggctcg agaaaaggag gaagaagccc gaggttatgc cccaggtcat ctccaacaat 180
gccgcaaaca acttgagaat gcttctcgac gactacacca agatgaagga ggccatactc 240
caagtgtact ggcaggagtt caaggatgac cacgtgggcc tcatgtgcaa gttcgcccag 300
cccgcctcca aaaagatcga ccaaaacaag ctcaagcccg agatggacga gaaggggaac 360
ctcaccaccg ccgggttcgc ttgcagccag tgtggacagc ccctcttcgt ctataagctc 420
gagcaagtct ccgaaaaggg caaggcctac accaactact tcggaagatg caacgtcgct 480
gaacacgaga agctcatcct cctcgcccag ctcaaacccg aaaaggacag cgacgaggcc 540
gtcacctata gcttggggaa gttcggccaa agggccctcg atttttactc catccacgtg 600
accaaggaaa gcacccaccc cgtcaaaccc ctcgcccaga tcgccggcaa caggtacgcc 660
tcaggccccg ttggcaaagc cctcagcgac gcctgcatgg ggaccatcgc ctccttcctc 720
tccaaatatc aggacatcat aatcgaacac caaaaggttg tgaaggggaa ccagaagagg 780
ctcgaaagcc tcagggagct cgccgggaaa gagaacctcg agtacccctc agtgaccctc 840
cccccccagc cccacacaaa ggagggggtg gatgcctata acgaagttat cgccagagtc 900
aggatgtggg tcaacctcaa cctctggcag aagctcaaac tcagcaggga cgatgccaag 960
ccactcctca gactcaaggg tttcccctcc ttccccgtgg ttgagaggag ggaaaacgaa 1020
gtcgactggt ggaacaccat caacgaggtg aaaaagctca tcgacgccaa gagggacatg 1080
gggagagtct tctggtccgg cgtgaccgcc gagaagagga acaccatcct cgaggggtat 1140
aactacctcc ccaatgagaa cgatcataaa aagagggaag gtagcctcga aaatcctaag 1200
aagccagcca aaaggcagtt cggcgacctt ctcctttatc tcgaaaaaaa gtacgccggg 1260
gactggggga aagtgttcga cgaggcctgg gagaggatcg acaagaaaat cgccggcctc 1320
actagccaca ttgaaaggga ggaggccagg aacgccgagg acgcccagtc aaaggccgtt 1380
ctcaccgact ggttgagggc aaaggcctcc tttgtgctcg agaggctcaa ggagatggac 1440
gagaaggagt tctatgcctg cgagattcag ctccagaagt ggtacggaga cctcaggggc 1500
aacccatttg ccgtggaagc cgagaacagg gtcgtcgaca tcagcggctt ctccataggc 1560
tccgacgggc actccatcca gtacaggaat ttgctcgctt ggaagtacct cgagaacggg 1620
aagagagaat tctacctcct catgaactac ggtaagaaag ggaggatcag gttcaccgac 1680
gggaccgaca ttaagaagag cggtaaatgg cagggcctct tgtacggtgg ggggaaggcc 1740
aaggtgattg acctcacctt tgaccccgac gacgagcagc tcatcatctt gcccctcgcc 1800
ttcggcacca ggcaggggag ggagttcatc tggaacgatc tcctcagcct tgagaccgga 1860
ctcatcaagc ttgccaatgg aagggtcatt gagaaaacca tatacaacaa gaagatcggg 1920
agagatgaac ccgccctctt tgttgcactc accttcgaaa ggagggaggt cgttgacccc 1980
tccaatataa agccagtgaa cctcatcggg gtcgacaggg gggagaacat ccccgcagtc 2040
atcgccctca ccgatcccga gggctgcccc cttcccgagt ttaaggacag ctcaggcggc 2100
cccaccgaca tcctcaggat cggcgagggc tacaaggaga agcagagggc catccaagct 2160
gccaaggagg ttgagcagag gagggccggg gggtatagta ggaagtttgc cagcaagagc 2220
aggaatctcg ccgacgacat ggtcagaaac tccgccagag acctcttcta tcatgctgtg 2280
acccacgatg ccgtcctcgt cttcgagaac ctctccaggg gtttcggcag gcaggggaaa 2340
aggacattca tgaccgagag gcagtacacc aagatggagg attggctcac agcaaagctc 2400
gcctatgagg gccttaccag caaaacttac ctttctaaga ccctcgctca atacacttca 2460
aaaacatgca gcaattgtgg gttcactatc accaccgccg attatgacgg gatgttggtg 2520
agacttaaga agactagcga cggctgggcc accaccctca acaacaagga gctcaaggcc 2580
gaggggcaga taacctacta caacaggtac aagaggcaaa ccgttgaaaa ggagctcagc 2640
gccgagctcg acaggctctc agaggagagc ggcaacaacg acatcagcaa gtggaccaag 2700
ggcaggaggg acgaggccct cttcctcctc aagaagaggt tctcacatag gccagtgcaa 2760
gagcagttcg tgtgtctcga ctgcggccac gaggtgcacg ccgacgagca ggccgccctc 2820
aacatcgcca ggtcttggct cttcctcaat agcaactcaa ccgagttcaa gagttataag 2880
tccggaaagc agccctttgt gggggcctgg caagccttct acaagaggag attgaaggag 2940
gtctggaagc ccaacgcc 2958
<210> 102
<211> 2958
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 102
atggagaaaa ggatcaacaa gatcagaaag aagctcagcg cagacaacgc tactaagccc 60
gtttctagga gcgggcccat gaagaccctc ttggtcaggg ttatgaccga cgacctcaag 120
aagaggcttg agaagaggag gaagaagccc gaggtcatgc ctcaggtgat cagtaacaac 180
gctgccaata atctcaggat gctcctcgat gactacacca agatgaaaga ggctatcctc 240
caggtctact ggcaggaatt caaggacgac cacgtggggc tcatgtgcaa gtttgcccag 300
cccgctagta agaagatcga tcagaacaag ctcaaaccag agatggatga gaaggggaac 360
ctcaccaccg ccggatttgc ctgctcccag tgtggccagc ccctcttcgt gtacaaactc 420
gaacaggtct ctgagaaagg caaggcttat actaactact tcggaaggtg caacgtggcc 480
gagcacgaga aactcatcct cctcgcccag ctcaagcccg agaaggactc cgacgaagcc 540
gtgacctact cactcgggaa gtttgggcag agggctctcg atttttacag cattcatgtg 600
actaaagaat cgacccaccc cgtgaagccc ctcgcccaga tcgctggcaa caggtatgcc 660
agcggccccg tcggcaaggc ccttagcgac gcttgcatgg gcaccatcgc cagctttctc 720
tccaagtacc aggacatcat tattgagcat cagaaagtcg tgaaggggaa ccagaagagg 780
cttgagtcct tgagggagct cgctgggaag gagaaccttg agtaccccag cgtgaccctc 840
cctccacagc cccacaccaa agagggagtg gatgcctaca acgaggttat cgccagagtg 900
aggatgtggg ttaacctcaa cctctggcaa aagctcaaac tcagcaggga cgatgccaaa 960
ccactcctca ggctcaaggg gtttccctct tttcccgttg ttgagaggag ggaaaacgaa 1020
gttgactggt ggaacaccat caacgaggtg aagaagctca tcgacgccaa gagagacatg 1080
ggcagggttt tctggagcgg ggtcaccgcc gagaagagga ataccatcct cgagggatac 1140
aattatttgc ccaatgagaa cgaccacaag aagagggagg gatccctcga aaatcccaag 1200
aagcccgcca aaagacaatt cggggacttg ctcctctatc tcgagaagaa gtatgccggc 1260
gactggggca aagtcttcga cgaggcctgg gagaggatcg acaagaagat cgccgggctc 1320
acctctcaca tcgagaggga ggaagccaga aacgccgaag acgcccagtc aaaggcagtt 1380
ctcaccgact ggctcagggc caaagcctcc ttcgtgcttg agaggctcaa agaaatggac 1440
gagaaggaat tctacgcctg cgaaatccag cttcagaagt ggtacggtga cctcagaggt 1500
aatccattcg ccgtcgaggc agagaacagg gtcgttgaca tctcagggtt ctccattggc 1560
tccgacgggc actccatcca gtacagaaac ctcctcgcct ggaagtacct cgagaacgga 1620
aagagggagt tctatttgct catgaactac ggtaagaagg ggaggatcag attcaccgac 1680
ggtaccgaca tcaagaaatc aggaaagtgg caagggctcc tctacggcgg gggcaaggcc 1740
aaggtcatag accttacatt tgaccccgac gacgagcagc tcataatcct cccccttgct 1800
tttggcacaa ggcagggaag agaattcatc tggaacgacc tcctcagcct cgagactggg 1860
ctcatcaaac tcgctaacgg gagggtcatc gagaagacca tctacaacaa aaaaattggg 1920
agggacgagc cagccctttt cgtcgcactc accttcgaga gaagggaggt tgtggatccc 1980
agcaacatca agcctgtgaa cttgatcggc gtggatagag gcgaaaacat ccccgccgtc 2040
atcgctctca ccgaccccga gggctgcccc ctccccgagt tcaaggactc ctccggcggc 2100
cccaccgaca tactcaggat cggcgaaggg tacaaggaaa agcagagggc catccaggca 2160
gccaaggaag tggaacaaag gagggccggg gggtattcca gaaaattcgc ctccaaaagc 2220
agaaatctcg ccgacgacat ggtcagaaac tccgccaggg acctcttcta ccacgccgtg 2280
acccacgatg ccgtgcttgt gttcgagaac cttagcaggg ggttcgggag acagggtaag 2340
aggaccttca tgaccgagag gcaatacacc aaaatggagg attggctcac cgccaagctc 2400
gcctacgagg gtctcaccag caagacttac ttgagcaaaa cactcgcaca gtacactagc 2460
aaaacctgca gtaactgcgg gttcaccatc accaccgccg actacgacgg gatgctcgtg 2520
aggctcaaga agacctctga cgggtgggca accacactta ataacaagga gctcaaggct 2580
gagggccaga tcacctacta caacaggtat aagaggcaga ccgtggagaa ggagctctcc 2640
gccgaacttg acaggctctc cgaggaaagc gggaataacg acatctccaa atggaccaag 2700
ggcaggaggg acgaggccct ctttttgctc aagaagaggt tctcccacag gcccgtgcag 2760
gagcagttcg tgtgtctcga ttgcggtcac gaggtccatg ccgatgagca ggccgcactc 2820
aatatcgcca ggagctggct ctttctcaac agtaattcca ccgaattcaa atcctacaaa 2880
agcgggaagc aacccttcgt tggtgcctgg caggcattct acaagagaag actcaaagag 2940
gtttggaagc ccaacgcc 2958
<210> 103
<211> 2958
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 103
atggagaaaa ggatcaacaa aatcaggaag aagcttagcg ccgacaacgc aaccaagcca 60
gtgtccaggt ccggtcccat gaagaccctc ctcgtgaggg tgatgacaga cgacttgaag 120
aagagacttg aaaagaggag gaagaaacca gaggttatgc cccaggtcat aagtaacaac 180
gctgccaaca acctcagaat gctcctcgac gactatacca agatgaagga ggccatcctc 240
caggtgtact ggcaggaatt caaggacgat catgtgggac tcatgtgcaa gttcgcccag 300
cccgcctcca aaaagatcga ccagaacaag ctcaaacccg aaatggacga aaagggcaac 360
ctcaccaccg ccgggttcgc ctgcagccag tgtggccagc ccctcttcgt ttataagctc 420
gagcaagtgt cagaaaaggg caaagcctac acaaactact tcggcaggtg caacgtcgcc 480
gagcacgaga aacttatcct ccttgcccag ctcaagcccg agaaggactc cgacgaggca 540
gtgacataca gcctcgggaa gtttggccag agggctctcg acttttatag catacacgtc 600
accaaagagt ctacccaccc cgtgaaaccc ctcgcccaga tcgctgggaa caggtacgcc 660
tctggccccg ttgggaaggc cctcagtgac gcctgtatgg gtactatcgc aagctttttg 720
tccaaatatc aggatatcat catcgaacac cagaaagttg ttaaagggaa ccagaagagg 780
ctcgagtccc tcagggagct cgccggcaag gagaacctcg aatacccctc cgtcaccctc 840
cctcctcagc ctcataccaa ggagggcgtc gatgcttata acgaggtgat tgccagggtt 900
aggatgtggg tcaacctcaa cttgtggcag aagttgaagt tgtccaggga cgatgccaag 960
cccctcttga ggctcaaggg gttcccctca ttccccgttg tggagaggag ggagaacgag 1020
gttgactggt ggaacaccat aaatgaggtg aaaaagctca ttgacgccaa gagagacatg 1080
ggaagggtct tctggtccgg agtcaccgcc gagaagagga ataccatcct cgagggctac 1140
aactacttgc ctaacgagaa tgaccacaag aagagagagg gcagcctcga gaatccaaag 1200
aagcccgcca agaggcagtt cggggacctc ctcttgtacc tcgaaaaaaa gtacgccggc 1260
gactggggca aggtttttga tgaggcctgg gaaaggatcg ataagaagat cgctggcctc 1320
actagccaca tcgagagaga ggaagccagg aacgccgagg acgcccaaag caaggccgtt 1380
ctcaccgatt ggttgagggc caaagccagt tttgtcctcg aaaggctcaa agagatggac 1440
gagaaagaat tctacgcctg cgagatacag ctccagaagt ggtatggcga cctcaggggg 1500
aaccccttcg ccgtggaggc cgagaacagg gttgtcgaca tcagcgggtt ctcaatcggg 1560
agcgacggcc attccatcca gtacaggaat ctcctcgcct ggaagtacct cgagaatggg 1620
aagagggaat tttacttgct catgaactat ggcaagaagg gaaggataag atttaccgac 1680
ggtaccgaca tcaagaagtc cggaaaatgg caaggcctcc tctacggcgg cggtaaggca 1740
aaggttatcg acctcacctt cgacccagac gacgaacaac tcatcattct tcctctcgcc 1800
ttcgggacca gacaagggag ggagttcatc tggaatgacc tcttgtccct cgagaccgga 1860
ctcatcaagc tcgccaacgg gagggtcatc gagaagacca tctataacaa aaagatcggg 1920
agagacgaac ccgccctctt cgttgccctc accttcgaaa ggagggaagt ggtcgacccc 1980
agcaacatca aacccgtcaa cctcatcggt gtcgacaggg gggaaaacat acctgccgtt 2040
atcgccctca ccgaccccga aggctgcccc ctccccgaat ttaaggactc aagcggtggg 2100
cccaccgata tcctcaggat aggcgagggc tacaaagaga agcagagagc aatacaggcc 2160
gccaaagagg tggagcagag aagggctggc ggatacagca ggaagtttgc ttccaagtcc 2220
aggaacctcg ccgacgacat ggttaggaat agcgccagag acctcttcta ccatgccgtg 2280
acccacgacg ccgtcttggt cttcgagaac ctctctaggg gattcgggag acaggggaag 2340
aggaccttta tgaccgagag gcagtatacc aagatggagg attggctcac tgcaaagctc 2400
gcctacgagg gtttgacaag caaaacctac ctcagtaaga cactcgccca gtatacctcc 2460
aagacatgct caaattgcgg atttaccatt accaccgccg attacgacgg tatgctcgtg 2520
agactcaaga agaccagcga tggatgggcc accactctca acaacaagga gcttaaggcc 2580
gaaggccaga tcacctacta taacagatac aagaggcaaa ccgtggagaa ggaactcagc 2640
gccgagctcg acagactctc tgaggagagc ggaaacaacg atatctctaa gtggaccaag 2700
ggcagaagag acgaggcact cttcttgctc aagaagaggt tttcccacag gcccgtccag 2760
gaacaattcg tctgcttgga ctgtgggcac gaggttcacg ccgacgaaca ggcagccctc 2820
aacatagcca ggtcctggtt gttcttgaac tccaatagca ccgagtttaa gtcctataag 2880
agtggtaagc agcccttcgt tggggcctgg caggcattct ataaaaggag gctcaaggag 2940
gtgtggaaac ccaacgcc 2958
<210> 104
<211> 2958
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 104
atggagaaga gaatcaacaa gatcagaaag aagctcagcg cagacaacgc caccaagccc 60
gtcagtagat ccggacccat gaagaccctc cttgtgaggg tgatgactga tgacctcaag 120
aaaagactcg agaaaaggag aaagaagcct gaggtgatgc cacaagtcat ctccaacaac 180
gccgccaata acctcagaat gctcctcgac gattacacta aaatgaaaga agccatcctc 240
caggtctatt ggcaggaatt taaggacgac cacgtgggac tcatgtgcaa gttcgcccag 300
cccgcctcca agaagataga ccagaacaaa ttgaagcccg aaatggacga gaagggcaac 360
cttaccaccg ccggttttgc atgtagtcaa tgcggccagc cactcttcgt gtacaaactc 420
gagcaagtct ctgagaaggg gaaggcctac accaactact ttggcaggtg taacgtcgcc 480
gagcatgaga agcttatcct cttggctcag ctcaagcccg aaaaggactc tgacgaggcc 540
gtgacttatt cccttggaaa gttcgggcag agagccctcg atttctacag catccacgtg 600
acaaaggaga gcacccatcc cgtcaagccc ctcgcccaga tcgccgggaa caggtatgcc 660
agcggccctg tcggcaaggc cctttccgac gcctgtatgg ggaccatcgc atccttcctc 720
agcaagtacc aagacattat catcgagcac caaaaggttg ttaagggcaa ccagaagaga 780
ctcgagtccc tcagggaatt ggccggcaag gagaatctcg agtaccccag tgttacattg 840
ccacctcagc ctcatacaaa ggaaggcgtc gacgcctata acgaagtgat cgctagggtt 900
aggatgtggg tcaatctcaa cctctggcag aagctcaagc ttagcagaga tgacgccaag 960
cccctcctca ggcttaaggg cttccctagc ttcccagttg ttgagaggag ggaaaatgaa 1020
gtggactggt ggaataccat caatgaggtt aagaagctca tcgatgccaa gagggacatg 1080
ggcagggtct tctggagcgg ggttactgcc gagaagagga ataccatcct cgaggggtac 1140
aactacttgc caaatgaaaa cgaccacaag aagagggaag gttccctcga aaaccccaaa 1200
aagccagcta aaagacagtt cggggatctc ctcctctacc tcgagaaaaa gtacgctggc 1260
gactggggta aggtcttcga cgaggcttgg gagaggatcg acaagaagat cgccggactc 1320
accagccaca tcgagaggga ggaggccagg aatgccgagg acgcccaaag taaggccgtg 1380
ctcaccgact ggttgagagc caaggccagc tttgttctcg aaaggttgaa ggagatggat 1440
gagaaggagt tctacgcctg cgaaatccaa ctccagaagt ggtacggcga tctcagagga 1500
aaccccttcg ccgttgaggc cgagaacagg gtcgtggaca tttccggctt ctcaatcggg 1560
agtgacggcc actcaatcca gtacagaaac ctcctcgcct ggaagtacct cgagaacggg 1620
aagagggagt tttacctcct catgaattat ggcaagaagg ggaggattag gtttaccgac 1680
ggaactgaca tcaaaaaaag cggtaagtgg cagggcctcc tttacggcgg gggtaaggcc 1740
aaagtcatcg acttgacctt cgaccccgac gacgagcagc tcatcatcct ccccctcgcc 1800
ttcggaacaa gacagggcag ggagttcatc tggaacgacc ttttgagctt ggagaccggg 1860
cttatcaaac tcgccaacgg cagagttatt gagaaaacca tctacaacaa gaagatcggc 1920
agagacgagc ccgctctctt cgttgccttg acattcgaga gaagggaggt ggtcgaccct 1980
agtaacatca aacctgtcaa cttgattggc gtcgatagag gtgaaaatat ccccgccgtc 2040
atcgctttga ccgatcccga gggttgcccc ttgcccgaat ttaaggacag ttccggcggg 2100
cccaccgaca tactcaggat tggagagggc tacaaagaga aacaaagggc catccaggcc 2160
gcaaaggagg ttgagcagag gagggctggg ggctactcaa ggaagtttgc cagcaagagc 2220
agaaacctcg ccgacgatat ggttagaaac agtgccaggg acctcttcta tcacgccgtt 2280
acccacgacg ccgtgctcgt ctttgagaat ctctcaaggg gattcggaag gcaagggaag 2340
aggaccttta tgaccgagag gcagtacaca aaaatggagg attggcttac cgccaaactt 2400
gcctacgagg gactcacctc caaaacctac ttgtcaaaga ctctcgccca gtacaccagc 2460
aagacctgct ccaattgcgg cttcaccatc actaccgccg actacgatgg catgctcgtt 2520
aggctcaaga aaacctcaga cgggtgggcc accacactca acaataaaga gctcaaggca 2580
gaaggtcaaa tcacctacta caacaggtac aagaggcaaa ctgtggagaa ggaactcagc 2640
gccgaactcg acaggctcag cgaagaaagc ggcaacaacg acatcagtaa gtggacaaaa 2700
ggcaggaggg acgaggccct tttcctcctc aagaagaggt tttcccacag gcccgtgcaa 2760
gagcagttcg tctgcctcga ctgcgggcac gaggttcatg ccgacgagca ggccgccctc 2820
aacatcgcta ggtcttggct ctttttgaac tccaacagta ccgagttcaa aagctacaag 2880
agcgggaagc agcccttcgt tggggcctgg caggcttttt acaagagaag gctcaaagag 2940
gtgtggaagc ccaatgcc 2958
<210> 105
<211> 2958
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 105
atggagaaga gaataaacaa gataaggaaa aaactctccg ccgacaacgc caccaagcct 60
gtctccaggt ccgggcctat gaaaaccctc cttgtgaggg tcatgaccga tgacctcaag 120
aagaggcttg aaaaaaggag gaagaaaccc gaggtcatgc cccaggtgat cagcaacaac 180
gccgcaaaca acctcagaat gctcctcgat gactacacca agatgaagga agccatcctc 240
caggtctact ggcaggagtt caaagacgac cacgtcgggc tcatgtgcaa attcgcacaa 300
cccgccagca agaagataga tcaaaacaag ctcaaaccag agatggacga aaaagggaac 360
ctcaccaccg ccggctttgc ctgcagtcag tgcggtcaac ccctctttgt ttacaaactc 420
gaacaggtga gcgagaaagg gaaagcctac acaaattact ttggaagatg caacgtcgct 480
gagcacgaaa agttgatcct cctcgcccag ttgaagcccg agaaggattc tgatgaggcc 540
gtcacctatt ccctcgggaa attcggccag agggccctcg atttctacag catccacgtt 600
accaaggaga gcacccaccc cgtgaagccc ctcgcccaga tcgccggcaa taggtatgcc 660
tccggcccag tgggcaaggc cctctccgac gcttgtatgg gaaccatcgc ctcctttctt 720
agcaaatacc aggacattat catcgagcac cagaaagtgg tgaagggcaa ccagaaaaga 780
ctcgagtccc tcagggagct cgccgggaag gagaacttgg agtaccccag cgtgaccctc 840
cccccccagc cccacaccaa ggaaggggtg gacgcctaca acgaagttat cgccagagtc 900
aggatgtggg tgaacctcaa cctctggcag aagttgaagc tctcgaggga tgacgccaag 960
cccctcctca ggctcaaggg atttcccagc ttccccgtcg tggaaaggag ggagaacgaa 1020
gtggactggt ggaacaccat taatgaagtg aagaagctca tcgacgcaaa gagggatatg 1080
ggcagggtct tctggtccgg agtcacagct gagaagagga acaccatcct cgaggggtac 1140
aactacttgc ccaacgaaaa cgaccacaag aaaagggagg gcagtttgga gaatcccaag 1200
aaacctgcca agaggcagtt cggagacctc ctcctctacc tcgagaagaa gtacgccggg 1260
gattggggga aggttttcga cgaggcctgg gagaggattg acaagaagat cgccggcttg 1320
accagccaca tcgaaagaga ggaggcaagg aacgccgaag acgcccaaag caaggctgtc 1380
ctcaccgatt ggctcagggc caaggccagc ttcgtcctcg agaggctcaa ggagatggac 1440
gagaaagaat tctacgcctg cgagattcag ttgcagaagt ggtatgggga cctcagggga 1500
aaccccttcg ccgtggaggc cgagaacagg gtggtcgata tcagcgggtt cagcatcgga 1560
tcagacggtc actcaattca gtacaggaat ctcttggcct ggaagtatct cgagaacggc 1620
aagagggagt tctatctcct tatgaactac ggcaagaagg gcaggatcag gttcaccgac 1680
gggaccgaca taaaaaagag cggcaagtgg cagggcctcc tctacggagg tgggaaggcc 1740
aaagtcatcg atctcacctt cgacccagac gacgagcagc tcatcatcct ccccttggcc 1800
ttcgggacaa ggcaaggaag ggagtttatc tggaacgatc ttctcagcct cgagaccggc 1860
ctcatcaagc ttgccaacgg gagagtgatc gagaaaacca tctacaacaa gaaaatagga 1920
agggacgagc ccgcactctt cgtcgctttg accttcgaga ggagggaggt tgtggaccct 1980
tccaacatta agcccgtgaa cctcataggc gtcgacagag gcgagaatat ccccgccgtg 2040
atcgccctca ccgatcccga gggctgtccc ctccccgagt tcaaggactc ctctgggggc 2100
cccaccgaca tcctcaggat cggcgaaggc tacaaggaga agcaaagggc catccaggca 2160
gcaaaggagg tcgaacaaag aagggccggg gggtactcca ggaagttcgc ctccaagtcc 2220
aggaacttgg ccgacgacat ggtgaggaac agcgccagag acctcttcta ccacgccgtc 2280
acccacgatg cagtcctcgt cttcgagaac ctctccagag ggttcgggag acaggggaag 2340
agaaccttta tgaccgagag gcagtacacc aagatggaag actggctcac tgcaaaactc 2400
gcctacgagg gcctcacctc caaaacatac ctctccaaaa cactcgccca gtatacctcc 2460
aagacctgct caaactgtgg gttcacaatc accaccgctg actacgacgg gatgttggtg 2520
aggctcaaga agacatccga cgggtgggcc accaccctca acaataaaga acttaaggca 2580
gagggccaaa tcacctacta caacaggtac aaaaggcaga ccgtggagaa agaattgtcc 2640
gccgagttgg acagactttc cgaagagtcc ggcaacaacg acatttccaa gtggaccaag 2700
gggaggaggg atgaggcact cttcttgctc aaaaagaggt ttagtcatag acccgtccag 2760
gagcagttcg tctgcctcga ctgcgggcac gaagtccacg ccgacgagca ggccgccctc 2820
aatatcgcta gaagctggct tttcctcaac tcaaatagca ccgagttcaa atcctacaag 2880
agcggaaagc aacccttcgt tggcgcctgg caggccttct acaagcgaag actcaaggag 2940
gtctggaagc ccaacgct 2958
<210> 106
<211> 2958
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 106
atggagaaga ggattaataa gatcaggaag aagctttccg ccgacaatgc caccaagcct 60
gtttccagaa gcggtcccat gaaaaccctt ctcgtccgcg tcatgactga cgacctcaaa 120
aagaggctcg agaagaggag gaagaagcca gaggtgatgc cccaggtcat cagtaacaac 180
gccgcaaata acctcaggat gttgttggac gactacacca agatgaagga ggccatcctc 240
caggtctact ggcaggaatt caaggacgac cacgtcggcc tcatgtgcaa gtttgctcag 300
cccgcctcta aaaaaatcga ccagaacaag ctcaaacccg aaatggatga gaaggggaac 360
cttaccaccg ccggcttcgc ctgcagccag tgtggacaac ctctctttgt ctataagctc 420
gagcaagtgt ccgagaaggg caaggcctac accaactact ttggcaggtg caacgttgcc 480
gagcatgaaa agcttatcct cctcgcacaa cttaagcccg agaaggactc cgacgaggcc 540
gtcacctaca gcctcggaaa atttggccag agggccctcg acttctactc aatccacgtc 600
actaaggaga gtacccatcc cgtgaagccc ctcgctcaga tcgccgggaa caggtatgcc 660
tccgggcccg tggggaaggc cttgagtgac gcctgcatgg gtaccatcgc tagcttcctc 720
agcaagtacc aggacatcat catagagcac cagaaggtcg ttaagggcaa ccagaagagg 780
ctcgagtctc tcagggagct cgccggcaaa gagaatttgg agtaccccag cgttaccctt 840
cccccccagc cccataccaa ggagggtgtg gacgcctaca acgaggttat cgccagggtg 900
aggatgtggg tgaacctcaa cctctggcaa aagctcaagc tcagcaggga tgatgccaaa 960
cccctcctca gattgaaggg cttcccatcc ttccccgtcg tcgagaggag agaaaacgag 1020
gtcgactggt ggaatactat taatgaagtc aaaaagctca tcgatgccaa gagggatatg 1080
ggaagggtgt tttggtccgg cgtcaccgcc gagaagagga acaccatcct cgagggatac 1140
aactacctcc ccaacgaaaa cgaccacaaa aaaagagagg ggtcccttga aaaccccaag 1200
aagcccgcca agaggcagtt tggggatctc ctcctctacc tcgagaagaa gtacgctggc 1260
gactggggaa aggtgttcga cgaagcctgg gagaggatcg acaaaaagat tgctggactt 1320
acctcccaca ttgagaggga ggaggctagg aacgcagaag acgcccaatc caaggccgtg 1380
ctcaccgact ggttgagggc aaaagcctca ttcgttctcg agaggctcaa agaaatggat 1440
gaaaaggaat tttacgcctg cgaaatccag ttgcagaagt ggtacgggga tctcaggggg 1500
aaccccttcg ccgtggaagc agaaaacaga gttgtcgaca tcagcggctt cagcatcggg 1560
agtgacggac acagcatcca gtacaggaac ctcctcgcct ggaaatacct cgaaaacggc 1620
aagagggagt tctacctcct catgaactat ggcaagaagg ggaggatcag gttcaccgac 1680
gggactgaca tcaagaaatc cggaaaatgg cagggactcc tctacggggg gggcaaggca 1740
aaggtgatcg atctcacctt cgaccccgac gacgagcagc tcatcatcct ccccttggcc 1800
tttgggacca ggcaagggag ggagtttatc tggaacgacc tcctctctct cgaaactggg 1860
ctcatcaagc tcgccaacgg cagagttatc gaaaagacca tctataacaa aaagatcggt 1920
agggacgagc ccgcactttt cgtggccctc acctttgaga ggagggaggt ggttgaccct 1980
tccaacatca agccagtcaa cttgataggc gttgataggg gggaaaacat tccagccgtc 2040
atagcactca ccgaccctga ggggtgtccc ctccccgagt ttaaagactc atctgggggc 2100
cccactgaca tactcaggat cggggagggg tataaggaga agcagagggc catccaggcc 2160
gccaaagaag tggagcagag gagggccggc ggttactcaa ggaagttcgc ctccaaaagc 2220
agaaacctcg ctgacgacat ggtcagaaac tccgcaaggg acctcttcta ccatgccgtt 2280
acccacgacg ccgtcctcgt cttcgagaac ctcagcaggg gcttcgggag acagggcaag 2340
agaaccttta tgacagagag gcagtatacc aagatggaag actggctcac cgccaaactc 2400
gcatacgagg ggcttaccag caagacctac ctctccaaga cattggccca gtacacaagc 2460
aaaacctgct ccaattgcgg attcaccatc accaccgccg actacgacgg catgctcgtc 2520
agactcaaga agacaagcga cgggtgggcc accaccctca acaacaagga acttaaagcc 2580
gagggacaaa tcacctacta caacaggtat aagaggcaaa ccgttgagaa agagctcagt 2640
gccgagttgg acaggctctc tgaggagtcc gggaacaacg acatctccaa gtggaccaaa 2700
ggcaggaggg acgaagctct tttcctcctc aagaagaggt tctcccacag gcccgtccag 2760
gagcaatttg tctgtctcga ctgcgggcac gaggttcacg ccgacgagca ggccgccctc 2820
aacatcgcca ggagttggct ctttctcaac tcaaactcca ccgagttcaa gagctataag 2880
tccgggaagc agccctttgt tggcgcctgg caagctttct acaagagaag gctcaaagag 2940
gtctggaaac ccaacgct 2958
<210> 107
<211> 2958
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 107
atggagaaaa ggatcaacaa gatcaggaag aagctcagcg ccgacaacgc caccaagcca 60
gtttccaggt caggccctat gaagactctc ctcgtcaggg tgatgactga cgacctcaag 120
aaaagactcg aaaagagaag gaagaaaccc gaggttatgc cccaggtcat aagcaacaac 180
gccgccaaca atctcagaat gcttctcgat gattacacca agatgaaaga ggccatcctc 240
caggtttact ggcaagagtt taaggatgat cacgtggggc ttatgtgtaa gtttgcccaa 300
cccgcctcca agaagattga ccaaaacaaa ctcaagcccg agatggatga gaagggtaac 360
ctcacaaccg ccggcttcgc ctgcagccag tgcgggcagc ccctcttcgt gtacaagctc 420
gagcaggtca gcgagaaggg caaagcctac acaaactact tcggcaggtg caacgttgcc 480
gagcacgaga aactcatact cctcgcccag ctcaagcccg agaaggacag tgacgaggca 540
gtgacctact ccctcggcaa gttcgggcag agggccctcg acttctacag catccacgtg 600
accaaggaaa gcacccaccc tgttaagccc cttgcccaga ttgccgggaa caggtacgct 660
tctgggcccg tcgggaaagc cctcagcgac gcctgcatgg ggaccatcgc cagcttcctc 720
tcaaagtacc aggacatcat catagagcac caaaaggtcg tgaaaggcaa ccagaagagg 780
ctcgagagtc tcagagagct cgccggcaaa gagaacctcg agtatccctc tgtcaccctc 840
ccaccccaac cccacaccaa ggagggggtc gacgcctaca acgaggtgat cgccagggtt 900
aggatgtggg tgaacctcaa cctctggcag aagcttaagc tctcaaggga cgatgccaag 960
ccccttctta ggctcaaagg gttcccctcc ttccccgttg tcgaaaggag ggagaacgag 1020
gtcgactggt ggaacaccat caatgaggtt aagaaactca tcgacgcaaa gagggacatg 1080
gggagagtct tctggagcgg agttaccgcc gagaagagaa acaccatcct cgaagggtat 1140
aattaccttc ccaacgagaa cgatcacaag aaaagagaag ggagcctcga gaaccccaag 1200
aagcccgcaa agaggcagtt cggcgacttg ctcctctacc tcgaaaagaa gtacgccggg 1260
gactggggga aggtcttcga cgaggcctgg gagaggatcg acaagaagat cgccgggttg 1320
acctcccaca tcgagaggga agaggccagg aacgctgagg acgcccaaag caaggcagtg 1380
ctcaccgact ggctcagagc caaggcctca ttcgtcctcg aaagacttaa agagatggac 1440
gaaaaagagt tctacgcctg cgagattcag ctccagaaat ggtacgggga tctcagaggg 1500
aatcccttcg ccgtggaggc agaaaacaga gtggttgata tcagcgggtt tagcatcggc 1560
agcgacgggc atagcatcca gtataggaat ttgctcgcct ggaagtacct cgagaacggc 1620
aagagggagt tctatctcct catgaattac ggcaaaaagg gcagaatcag atttaccgac 1680
ggcaccgaca taaagaagag cgggaagtgg caagggctcc tctatggggg gggaaaagcc 1740
aaggtcatcg atctcacctt cgaccccgac gacgagcagc tcatcatact ccccctcgct 1800
ttcgggacca ggcaggggag agagttcatc tggaacgact tgctctcact cgagaccggg 1860
cttataaagc tcgccaacgg cagggtgatc gagaagacta tctacaacaa gaagatcggg 1920
agagacgagc ccgctctctt cgtcgctctc acctttgaga ggagggaggt cgttgacccc 1980
tccaacataa agccagtgaa ccttatcggc gtggacagag gcgagaatat cccagccgtg 2040
atcgcactca ctgaccccga ggggtgcccc ctcccagagt tcaaggactc ctccggtggg 2100
cccacagaca tcctcaggat cggtgaaggc tacaaggaaa agcagagggc catccaggct 2160
gccaaagaag tcgagcagag gagggccggc gggtattcca ggaagttcgc ctccaagagc 2220
aggaacctcg cagatgacat ggtcaggaac agcgccaggg acctctttta tcatgcagtg 2280
acccacgacg ccgtgctcgt cttcgagaac ctctcaagag ggttcggcag gcaagggaag 2340
aggaccttca tgaccgagag gcagtacacc aaaatggagg attggcttac cgccaagctc 2400
gcttatgagg gcttgaccag caagacctac ctcagtaaga ccctcgccca atacactagc 2460
aaaacctgct caaactgcgg gttcaccatc actaccgccg actatgacgg tatgctcgtc 2520
aggcttaaga agacctccga cggttgggcc acaaccttga ataataagga gctcaaagcc 2580
gagggccaga taacctacta caacaggtat aagaggcaga ccgtcgaaaa ggagttgtcc 2640
gccgaactcg acaggttgtc cgaggaatcc gggaacaacg acatcagcaa gtggaccaag 2700
ggcaggaggg acgaggccct ctttcttctc aagaagaggt tcagccatag gccagtgcag 2760
gagcagttcg tgtgccttga ctgcggccac gaagtccacg cagacgaaca agccgctctc 2820
aatatcgcca ggagctggct ctttttgaac tccaacagca ccgagttcaa gtcctacaag 2880
tccgggaaac agccattcgt tggtgcctgg caggccttct acaaaagaag actcaaagaa 2940
gtctggaagc ccaacgcc 2958
<210> 108
<211> 2958
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 108
atggaaaaga ggatcaacaa gattaggaag aaactctctg ccgacaacgc caccaagccc 60
gtgtcccgtt ctgggcccat gaaaaccctc ctcgtcaggg ttatgaccga cgacctcaag 120
aaaaggcttg agaaaagaag gaagaagcct gaggtgatgc cccaggttat ctccaacaac 180
gccgcaaata atttgaggat gctcttggac gactatacca agatgaaaga ggctatactc 240
caagtctatt ggcaggagtt caaagatgac cacgtggggc tcatgtgcaa gttcgcccag 300
cccgccagca agaaaatcga ccagaacaag ctcaagcccg agatggacga gaagggcaac 360
ctcaccaccg ccggtttcgc ctgctcccag tgtgggcagc ccctcttcgt ttacaaactc 420
gagcaggtga gcgagaaggg caaagcctac accaactact tcggcagatg caacgtcgct 480
gagcacgaga agctcatcct cctcgcccaa ctcaagccag agaaggactc cgatgaggcc 540
gtgacctaca gcctcgggaa gtttggacag agagcccttg atttctacag catccacgtg 600
accaaagagt ccacccaccc agtgaagccc ctcgcccaaa tcgccggaaa caggtacgcc 660
tctgggcctg tcgggaaggc tctctccgac gcctgcatgg gcacaatcgc cagcttcttg 720
agcaagtacc aggacatcat catcgaacac cagaaagtgg tgaaggggaa ccagaaaagg 780
ctcgagtccc tcagagaatt ggccggtaag gagaacctcg agtaccccag cgtgaccctc 840
cccccccaac cccataccaa agagggtgtc gatgcatata acgaggtcat cgccagggtc 900
agaatgtggg tcaacctcaa tctctggcag aagttgaaac tcagtaggga tgacgctaaa 960
cccctcttga ggctcaaggg gtttcccagc ttccccgttg tggagaggag ggagaatgag 1020
gtcgactggt ggaacaccat caacgaggtg aagaagttga tcgacgccaa gagggacatg 1080
gggagggtgt tctggtccgg cgtgaccgcc gagaaaagga acaccatctt ggagggttac 1140
aactacctcc ccaacgagaa tgaccacaaa aagagggagg ggtccctcga aaaccccaaa 1200
aaacccgcca agaggcagtt cggggacctc cttctctatt tggaaaagaa gtacgccgga 1260
gactggggga aagtctttga tgaggcctgg gagaggattg acaagaagat cgcagggttg 1320
accagccaca tcgagaggga agaggccagg aacgccgaag acgcccagtc caaggccgtt 1380
ctcactgact ggttgagggc caaggcctct ttcgtgctcg agaggctcaa ggagatggac 1440
gaaaaggaat tttatgcctg cgagatccaa ctccagaaat ggtacggcga tctcaggggc 1500
aacccatttg ccgtcgaggc cgaaaatagg gtggttgaca tatccgggtt ctctatcggc 1560
tccgacgggc actccatcca gtacaggaac ctcctcgcct ggaagtacct cgaaaacgga 1620
aagagggagt tttacttgct catgaattat ggcaagaagg gcaggattag gtttaccgac 1680
ggaaccgaca taaaaaagag cggcaagtgg caaggcctcc tctacggggg gggcaaagcc 1740
aaggtcatcg acctcacctt cgaccccgac gacgagcagc tcatcatact tcccctcgcc 1800
ttcggcacca gacaggggag ggaattcatt tggaacgacc tcttgtccct cgagaccggc 1860
ctcatcaaac tcgctaatgg cagagtgatc gaaaagacta tctacaacaa gaagatcggg 1920
agggacgagc ctgccctctt cgtggctctt acattcgaga gaagggaggt tgtggatccc 1980
tccaatatca agcccgtcaa cctcataggc gtcgacaggg gggagaatat ccccgccgtc 2040
atagccctca ccgaccccga ggggtgtccc ctccccgagt tcaaagactc ctccggcggc 2100
cccaccgaca tactcaggat aggggagggg tacaaggaga aacagagagc catacaggca 2160
gccaaggagg tcgagcagag gagggcaggc ggttacagca ggaagttcgc ctccaaatcc 2220
agaaacctcg ccgacgatat ggtcagaaac tccgcaaggg atctctttta ccacgctgtg 2280
acccatgacg ccgtgctcgt gttcgaaaac ctctccaggg gattcggcag gcaaggtaaa 2340
aggaccttca tgaccgagag gcagtacacc aaaatggagg actggctcac tgccaaattg 2400
gcctacgagg gtctcaccag taaaacatac ttgtccaaaa ctctcgccca gtatacctcc 2460
aagacatgct ccaattgcgg gtttaccata accactgccg actatgatgg catgttggtc 2520
aggcttaaga agacctccga cgggtgggcc actaccctca ataataagga gttgaaggcc 2580
gaaggccaga tcacctacta caataggtac aagaggcaaa ctgttgagaa ggaactctca 2640
gccgaactcg acagactctc cgaggagtcc ggcaacaacg acatctccaa gtggaccaaa 2700
gggagaaggg acgaagccct tttccttctt aagaaaagat tttcacacag gcccgtccag 2760
gagcagttcg tctgcttgga ctgtggccat gaagtgcacg ccgacgagca ggctgccctc 2820
aatatcgcca ggagctggct cttcctcaat agcaacagca ccgagtttaa gagttacaag 2880
agcggcaagc aacccttcgt cggggcctgg caggctttct ataagaggag acttaaggag 2940
gtttggaagc caaatgcc 2958
<210> 109
<211> 531
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 109
Glu Ala Ser Gly Ser Gly Arg Ala Asp Ala Leu Asp Asp Phe Asp Leu
1 5 10 15
Asp Met Leu Gly Ser Asp Ala Leu Asp Asp Phe Asp Leu Asp Met Leu
20 25 30
Gly Ser Asp Ala Leu Asp Asp Phe Asp Leu Asp Met Leu Gly Ser Asp
35 40 45
Ala Leu Asp Asp Phe Asp Leu Asp Met Leu Ile Asn Ser Arg Ser Ser
50 55 60
Gly Ser Pro Lys Lys Lys Arg Lys Val Gly Ser Gln Tyr Leu Pro Asp
65 70 75 80
Thr Asp Asp Arg His Arg Ile Glu Glu Lys Arg Lys Arg Thr Tyr Glu
85 90 95
Thr Phe Lys Ser Ile Met Lys Lys Ser Pro Phe Ser Gly Pro Thr Asp
100 105 110
Pro Arg Pro Pro Pro Arg Arg Ile Ala Val Pro Ser Arg Ser Ser Ala
115 120 125
Ser Val Pro Lys Pro Ala Pro Gln Pro Tyr Pro Phe Thr Ser Ser Leu
130 135 140
Ser Thr Ile Asn Tyr Asp Glu Phe Pro Thr Met Val Phe Pro Ser Gly
145 150 155 160
Gln Ile Ser Gln Ala Ser Ala Leu Ala Pro Ala Pro Pro Gln Val Leu
165 170 175
Pro Gln Ala Pro Ala Pro Ala Pro Ala Pro Ala Met Val Ser Ala Leu
180 185 190
Ala Gln Ala Pro Ala Pro Val Pro Val Leu Ala Pro Gly Pro Pro Gln
195 200 205
Ala Val Ala Pro Pro Ala Pro Lys Pro Thr Gln Ala Gly Glu Gly Thr
210 215 220
Leu Ser Glu Ala Leu Leu Gln Leu Gln Phe Asp Asp Glu Asp Leu Gly
225 230 235 240
Ala Leu Leu Gly Asn Ser Thr Asp Pro Ala Val Phe Thr Asp Leu Ala
245 250 255
Ser Val Asp Asn Ser Glu Phe Gln Gln Leu Leu Asn Gln Gly Ile Pro
260 265 270
Val Ala Pro His Thr Thr Glu Pro Met Leu Met Glu Tyr Pro Glu Ala
275 280 285
Ile Thr Arg Leu Val Thr Gly Ala Gln Arg Pro Pro Asp Pro Ala Pro
290 295 300
Ala Pro Leu Gly Ala Pro Gly Leu Pro Asn Gly Leu Leu Ser Gly Asp
305 310 315 320
Glu Asp Phe Ser Ser Ile Ala Asp Met Asp Phe Ser Ala Leu Leu Gly
325 330 335
Ser Gly Ser Gly Ser Arg Asp Ser Arg Glu Gly Met Phe Leu Pro Lys
340 345 350
Pro Glu Ala Gly Ser Ala Ile Ser Asp Val Phe Glu Gly Arg Glu Val
355 360 365
Cys Gln Pro Lys Arg Ile Arg Pro Phe His Pro Pro Gly Ser Pro Trp
370 375 380
Ala Asn Arg Pro Leu Pro Ala Ser Leu Ala Pro Thr Pro Thr Gly Pro
385 390 395 400
Val His Glu Pro Val Gly Ser Leu Thr Pro Ala Pro Val Pro Gln Pro
405 410 415
Leu Asp Pro Ala Pro Ala Val Thr Pro Glu Ala Ser His Leu Leu Glu
420 425 430
Asp Pro Asp Glu Glu Thr Ser Gln Ala Val Lys Ala Leu Arg Glu Met
435 440 445
Ala Asp Thr Val Ile Pro Gln Lys Glu Glu Ala Ala Ile Cys Gly Gln
450 455 460
Met Asp Leu Ser His Pro Pro Pro Arg Gly His Leu Asp Glu Leu Thr
465 470 475 480
Thr Thr Leu Glu Ser Met Thr Glu Asp Leu Asn Leu Asp Ser Pro Leu
485 490 495
Thr Pro Glu Leu Asn Glu Ile Leu Asp Thr Phe Leu Asn Asp Glu Cys
500 505 510
Leu Leu His Ala Met His Ile Ser Thr Gly Leu Ser Ile Phe Asp Thr
515 520 525
Ser Leu Phe
530
<210> 110
<211> 20
<212> PRT
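<213> Artificial
<220>
<223> Synthetic
<220>
<221> VARIANT
<222> (4)..(4)
<223> Xaa is lysine, histidine, or arginine
<220>
<221> VARIANT
<222> (8)..(8)
<223> Xaa is lysine, histidine, or arginine
<220>
<221> VARIANT
<222> (11)..(11)
<223> Xaa is lysine, histidine, or arginine
<220>
<221> VARIANT
<222> (15)..(15)
<223> Xaa is lysine, histidine, or arginine
<220>
<221> VARIANT
<222> (19)..(19)
<223> Xaa is lysine, histidine, or arginine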
<400> 110
Gly Leu Phe Xaa Ala Leu Leu Xaa Leu Leu Xaa Ser Leu Trp Xaa Leu
1 5 10 15
Leu Leu Xaa Ala
20
<210> 111
<211> 20
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 111
Gly Leu Phe His Ala Leu Leu His Leu Leu His Ser Leu Trp His Leu
1 5 10 15
Leu Leu His Ala
20
<210> 112
<211> 7
<212> PRT
<213> Simian virus 40
<400> 112
Pro Lys Lys Lys Arg Lys Val
1 5
<210> 113
<211> 23
<212> PRT
<213> Zea mays
<400> 113
Arg Lys Arg Lys Glu Ser Asn Arg Glu Ser Ala Arg Arg Ser Arg Arg
1 5 10 15
Ser Arg Tyr Arg Lys Lys Val
20
<210> 114
<211> 14
<212> PRT
<213> Simian virus 40
<400> 114
Ala Ser Pro Lys Lys Lys Arg Lys Val Glu Ala Ser Gly Ser
1 5 10
<210> 115
<211> 11
<212> PRT
<213> Human immunodeficiency virus type 1
<400> 115
Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg
1 5 10
<210> 116
<211> 5
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 116
Gly Ser Gly Gly Ser
1 5
<210> 117
<211> 6
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 117
Gly Gly Ser Gly Gly Ser
1 5
<210> 118
<211> 4
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 118
Gly Gly Gly Ser
1
<210> 119
<211> 18
<212> DNA
<213> Unknown
<220>
<223> Bacteria
<400> 119
atttctacta ttgtagat 18
<210> 120
<211> 707
<212> PRT
<213> Unknown
<220>
<223> Bacteriophage
<400> 120
Met Ala Asp Thr Pro Thr Leu Phe Thr Gln Phe Leu Arg His His Leu
1 5 10 15
Pro Gly Gln Arg Phe Arg Lys Asp Ile Leu Lys Gln Ala Gly Arg Ile
20 25 30
Leu Ala Asn Lys Gly Glu Asp Ala Thr Ile Ala Phe Leu Arg Gly Lys
35 40 45
Ser Glu Glu Ser Pro Pro Asp Phe Gln Pro Pro Val Lys Cys Pro Ile
50 55 60
Ile Ala Cys Ser Arg Pro Leu Thr Glu Trp Pro Ile Tyr Gln Ala Ser
65 70 75 80
Val Ala Ile Gln Gly Tyr Val Tyr Gly Gln Ser Leu Ala Glu Phe Glu
85 90 95
Ala Ser Asp Pro Gly Cys Ser Lys Asp Gly Leu Leu Gly Trp Phe Asp
100 105 110
Lys Thr Gly Val Cys Thr Asp Tyr Phe Ser Val Gln Gly Leu Asn Leu
115 120 125
Ile Phe Gln Asn Ala Arg Lys Arg Tyr Ile Gly Val Gln Thr Lys Val
130 135 140
Thr Asn Arg Asn Glu Lys Arg His Lys Lys Leu Lys Arg Ile Asn Ala
145 150 155 160
Lys Arg Ile Ala Glu Gly Leu Pro Glu Leu Thr Ser Asp Glu Pro Glu
165 170 175
Ser Ala Leu Asp Glu Thr Gly His Leu Ile Asp Pro Pro Gly Leu Asn
180 185 190
Thr Asn Ile Tyr Cys Tyr Gln Gln Val Ser Pro Lys Pro Leu Ala Leu
195 200 205
Ser Glu Val Asn Gln Leu Pro Thr Ala Tyr Ala Gly Tyr Ser Thr Ser
210 215 220
Gly Asp Asp Pro Ile Gln Pro Met Val Thr Lys Asp Arg Leu Ser Ile
225 230 235 240
Ser Lys Gly Gln Pro Gly Tyr Ile Pro Glu His Gln Arg Ala Leu Leu
245 250 255
Ser Gln Lys Lys His Arg Arg Met Arg Gly Tyr Gly Leu Lys Ala Arg
260 265 270
Ala Leu Leu Val Ile Val Arg Ile Gln Asp Asp Trp Ala Val Ile Asp
275 280 285
Leu Arg Ser Leu Leu Arg Asn Ala Tyr Trp Arg Arg Ile Val Gln Thr
290 295 300
Lys Glu Pro Ser Thr Ile Thr Lys Leu Leu Lys Leu Val Thr Gly Asp
305 310 315 320
Pro Val Leu Asp Ala Thr Arg Met Val Ala Thr Phe Thr Tyr Lys Pro
325 330 335
Gly Ile Val Gln Val Arg Ser Ala Lys Cys Leu Lys Asn Lys Gln Gly
340 345 350
Ser Lys Leu Phe Ser Glu Arg Tyr Leu Asn Glu Thr Val Ser Val Thr
355 360 365
Ser Ile Asp Leu Gly Ser Asn Asn Leu Val Ala Val Ala Thr Tyr Arg
370 375 380
Leu Val Asn Gly Asn Thr Pro Glu Leu Leu Gln Arg Phe Thr Leu Pro
385 390 395 400
Ser His Leu Val Lys Asp Phe Glu Arg Tyr Lys Gln Ala His Asp Thr
405 410 415
Leu Glu Asp Ser Ile Gln Lys Thr Ala Val Ala Ser Leu Pro Gln Gly
420 425 430
Gln Gln Thr Glu Ile Arg Met Trp Ser Met Tyr Gly Phe Arg Glu Ala
435 440 445
Gln Glu Arg Val Cys Gln Glu Leu Gly Leu Ala Asp Gly Ser Ile Pro
450 455 460
Trp Asn Val Met Thr Ala Thr Ser Thr Ile Leu Thr Asp Leu Phe Leu
465 470 475 480
Ala Arg Gly Gly Asp Pro Lys Lys Cys Met Phe Thr Ser Glu Pro Lys
485 490 495
Lys Lys Lys Asn Ser Lys Gln Val Leu Tyr Lys Ile Arg Asp Arg Ala
500 505 510
Trp Ala Lys Met Tyr Arg Thr Leu Leu Ser Lys Glu Thr Arg Glu Ala
515 520 525
Trp Asn Lys Ala Leu Trp Gly Leu Lys Arg Gly Ser Pro Asp Tyr Ala
530 535 540
Arg Leu Ser Lys Arg Lys Glu Glu Leu Ala Arg Arg Cys Val Asn Tyr
545 550 555 560
Thr Ile Ser Thr Ala Glu Lys Arg Ala Gln Cys Gly Arg Thr Ile Val
565 570 575
Ala Leu Glu Asp Leu Asn Ile Gly Phe Phe His Gly Arg Gly Lys Gln
580 585 590
Glu Pro Gly Trp Val Gly Leu Phe Thr Arg Lys Lys Glu Asn Arg Trp
595 600 605
Leu Met Gln Ala Leu His Lys Ala Phe Leu Glu Leu Ala His His Arg
610 615 620
Gly Tyr His Val Ile Glu Val Asn Pro Ala Tyr Thr Ser Gln Thr Cys
625 630 635 640
Pro Val Cys Arg His Cys Asp Pro Asp Asn Arg Asp Gln His Asn Arg
645 650 655
Glu Ala Phe His Cys Ile Gly Cys Gly Phe Arg Gly Asn Ala Asp Leu
660 665 670
Asp Val Ala Thr His Asn Ile Ala Met Val Ala Ile Thr Gly Glu Ser
675 680 685
Leu Lys Arg Ala Arg Gly Ser Val Ala Ser Lys Thr Pro Gln Pro Leu
690 695 700
Ala Ala Glu
705
<210> 121
<211> 2121
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 121
atggcagata cgcccactct ttttacacag tttctcaggc atcacttacc tggtcaacgg 60
ttccgtaagg acatcttgaa gcaagctgga cgtattcttg ccaacaaggg agaggatgct 120
accatcgcat ttcttagggg taagtctgaa gaaagtcccc cagattttca acctccagtt 180
aaatgcccta tcattgcttg cagcagacca ttgactgaat ggcctatata ccaggcaagt 240
gtggcaatcc aaggatacgt ctacggccaa agtcttgctg agtttgaagc aagtgatccc 300
ggatgttcaa aggatgggct attgggctgg ttcgataaga ccggagtttg tactgactac 360
ttttcagttc aaggtttgaa cttaatattc cagaatgctc gtaaaagata cattggagtc 420
cagaccaagg ttacgaatag aaatgaaaaa cgccacaaga aattaaagag aatcaacgcc 480
aagagaatag cagagggctt gccagagcta acttcagatg agccagaatc cgctctagac 540
gaaacgggac atttgatcga cccacctggg ttgaacacca atatctattg ttaccagcag 600
gtcagcccca agccattggc tctgtcagaa gtgaaccaat tgccgactgc atacgcgggt 660
tactctacgt ccggtgatga tcccattcaa cccatggtta cgaaggaccg cttaagtatc 720
tctaagggac aaccaggata tatccccgaa caccaaagag cactcctctc tcaaaaaaag 780
cacagacgca tgagagggta cgggttaaag gcccgcgctt tacttgtgat cgtgcgaatt 840
caagatgatt gggcagttat tgaccttaga tccttgttga gaaacgcata ctggagaaga 900
attgttcaga ctaaagaacc ttcgacgatt acaaaacttt taaagttagt tacaggggac 960
cccgttctcg atgcaacacg gatggttgct acattcactt acaagcctgg aatagtgcag 1020
gttagaagtg caaagtgtct caagaataag caaggaagta aattgttctc tgagaggtat 1080
ctcaacgaga cagtttcggt tacttcaatt gacctcggat cgaataattt agttgctgtt 1140
gcaacatata gattggttaa tggtaacaca ccagaacttc ttcaaaggtt taccttacca 1200
tcacacttgg ttaaggattt cgagagatat aagcaagcgc acgacacttt ggaggatagc 1260
atccagaaaa ctgctgtagc gtccttaccg cagggacagc aaacagaaat aagaatgtgg 1320
tcgatgtacg gattcagaga ggcacaagaa agggtgtgcc aagagttggg gctagccgac 1380
ggatcaatcc catggaatgt gatgacggca acttcgacca ttttgactga tctgttcctc 1440
gcgagaggcg gggaccctaa aaagtgtatg ttcacatccg aacctaaaaa aaaaaagaat 1500
agcaaacaag tcttgtataa gatccgcgat cgggcttggg caaaaatgta tcgtacatta 1560
ctctccaagg agacccgcga agcatggaat aaagccttgt gggggcttaa gagaggtagt 1620
cctgattacg cccgcttatc aaaacgtaaa gaagagctcg ctaggcgttg tgtgaactac 1680
acgatttcta ctgctgaaaa gagggcccag tgcggacgta caattgttgc actcgaggac 1740
ttgaacatcg ggttcttcca cggtagaggg aagcaagagc ctggatgggt tggtttgttc 1800
acaaggaaaa aggagaatcg ctggttgatg caggctcttc acaaagcttt tttggagctc 1860
gcgcatcata ggggctacca tgtcattgag gttaaccccg cttacacgag tcagacttgc 1920
cccgtttgtc gtcactgtga tccagataac cgggatcaac ataataggga agcatttcat 1980
tgtattggtt gcggattcag aggcaacgca gatttggacg ttgctaccca caacattgcg 2040
atggtggcta taactggaga gtcattgaaa agggccagag ggtctgttgc atcaaagacc 2100
cctcagccat tagcggctga g 2121
<210> 122
<211> 2121
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 122
atggccgaca cccctaccct cttcactcag ttcctcaggc accacttgcc agggcagagg 60
tttaggaagg acatcctcaa acaggccggg aggattctcg ccaacaaggg tgaggacgcc 120
accatcgcct tcctcagggg caagtccgag gagtcccccc cagactttca gcccccagtg 180
aagtgcccaa tcatcgcctg cagcaggccc ttgactgagt ggcccatcta ccaggccagc 240
gtcgccatcc agggatacgt gtacgggcag agcctcgccg agttcgaggc cagcgaccct 300
ggatgctcta aggacggcct cctcggttgg ttcgacaaaa ccggcgtgtg caccgactac 360
ttttccgtgc aagggctcaa ccttatcttc caaaacgcca gaaagaggta cattggagtc 420
cagactaaag tgaccaacag gaacgagaaa aggcacaaaa agctcaaaag gatcaatgcc 480
aagaggatcg ccgagggcct ccccgagctc acctccgatg agcctgagtc cgccctcgac 540
gagactgggc acctcatcga ccctcctggt ctcaacacca acatctattg ctaccagcaa 600
gtgtctccca agccactcgc cttgagcgaa gtgaaccaac tccccacagc ttacgcaggc 660
tacagcacat ccggtgacga tcccatccag cccatggtca ccaaggacag gctcagtatc 720
agcaagggtc agcccgggta catccccgag caccagaggg ccctcctcag ccagaaaaaa 780
cacaggagaa tgagaggcta cggactcaaa gctagggccc ttctcgtgat cgtgagaata 840
caggacgact gggccgtcat cgacctcagg agcctcctca ggaacgccta ctggagaagg 900
atcgttcaga caaaggagcc ctccaccatc accaaacttc tcaaattggt caccggggac 960
cccgtcctcg acgccaccag gatggtggcc acattcacct ataagcctgg gattgtccag 1020
gtgaggagtg ccaagtgcct caagaacaag caggggagca aactcttcag cgagagatac 1080
ttgaacgaaa ccgtttccgt gacctccatc gatttgggga gcaacaacct cgtggccgtt 1140
gccacataca gactcgttaa tggaaacacc cccgagttgc tccagagatt caccttgcct 1200
tcacatttgg ttaaggattt tgagaggtac aagcaagccc atgacaccct cgaggatagc 1260
atccagaaga ccgccgttgc ctccctcccc caggggcagc agactgagat cagaatgtgg 1320
agcatgtacg gctttagaga agcccaggag agggtttgcc aggagctcgg cctcgccgac 1380
ggcagcatcc cctggaacgt tatgactgcc accagcacca tactcaccga cctcttcctc 1440
gccagaggcg gcgatcccaa gaagtgcatg tttacatccg agcccaagaa gaagaagaac 1500
agcaagcagg tcctctacaa gatcagggac agggcttggg ccaagatgta cagaaccctc 1560
ctcagcaagg agaccaggga ggcctggaat aaagctctct ggggcctcaa gagggggagc 1620
cccgattacg ccaggctcag taagaggaag gaggagttgg caaggaggtg cgtgaattac 1680
actattagca ctgccgagaa gagagcccag tgtggtagga ctatcgtcgc cttggaagac 1740
ctcaacatcg gcttctttca tggcagaggc aaacaagagc ctggttgggt cgggctcttt 1800
accaggaaga aagagaacag gtggctcatg caggctctcc acaaggcctt cctcgaactc 1860
gctcatcaca ggggctacca cgtcatcgaa gtgaatcccg cttacaccag ccaaacctgc 1920
ccagtttgca ggcactgtga tcccgacaat agggaccagc acaacaggga ggcttttcat 1980
tgtatcggct gcgggttcag aggaaacgcc gatctcgacg tcgcaacaca caatatcgcc 2040
atggttgcca tcacaggaga atccctcaaa agggcaaggg ggagcgtggc ctctaagact 2100
ccccagcccc ttgctgccga a 2121
<210> 123
<211> 2121
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 123
atggccgaca ccccaactct cttcactcag ttcctcagac accacttgcc cggacagagg 60
ttcaggaaag acatcctcaa gcaggccggc aggattctcg ccaacaaggg ggaggatgcc 120
accatcgcct tcctcagggg caaaagcgaa gagagccctc ccgacttcca accccccgtg 180
aagtgcccca tcatcgcctg ctccagaccc ctcaccgagt ggcccatcta ccaagccagc 240
gtcgccatcc aggggtatgt gtatggccaa agcctcgctg agtttgaggc atctgatccc 300
ggctgcagca aagatggcct cttgggctgg ttcgacaaaa ccggcgtttg caccgattat 360
ttcagcgtgc agggcttgaa tcttattttc cagaacgcca gaaagagata cataggtgtg 420
cagactaagg ttaccaacag aaatgaaaaa aggcacaaga agctcaaaag gattaacgcc 480
aagaggatcg ccgagggcct ccccgaactc acctccgacg aacccgaaag cgccctcgat 540
gagaccggac acttgatcga cccccccggg ctcaatacca acatctactg ctaccaacaa 600
gtctccccca agcccctcgc actcagcgaa gtcaaccagt tgcccacagc ctacgccggt 660
tactccactt ccggggacga cccaatccag ccaatggtta ccaaggacag gctctctatc 720
agtaaggggc agcccggata cataccagag caccagagag ctctcttgag ccaaaagaag 780
cataggagga tgagaggcta cgggctcaag gccagagcct tgctcgtgat cgttaggatc 840
caggacgact gggccgttat cgacctcaga tcactcctta gaaacgccta ctggaggagg 900
atcgtccaaa ccaaggagcc ctccaccatc accaagcttc tcaagttggt gactggggat 960
cccgttctcg acgccaccag gatggttgca accttcacct acaagcccgg cattgtccaa 1020
gtcaggagcg ccaaatgcct caagaacaag caagggagca agctcttttc cgagaggtac 1080
ctcaatgaga ccgtgtccgt gacttcaatc gatttggggt ctaacaacct cgtggccgtc 1140
gccacctaca gactcgtgaa tgggaacacc cccgagctcc ttcaaaggtt caccctccct 1200
tcccaccttg tgaaggactt cgagaggtac aaacaagccc acgataccct tgaggacagc 1260
attcagaaga ctgctgtcgc cagccttccc cagggccagc agaccgagat aagaatgtgg 1320
agcatgtacg ggttcagaga ggcacaggag agggtctgcc aagagctcgg actcgccgac 1380
ggtagcatcc cctggaacgt tatgaccgct acctccacca tccttacaga cctcttcctc 1440
gccaggggcg gggaccctaa aaagtgcatg tttacctccg aacccaagaa gaagaagaac 1500
tccaaacagg tgctctataa gatcagggac agggcctggg ctaaaatgta tagaaccctc 1560
ctctccaagg agaccaggga ggcctggaac aaggccctct ggggtctcaa gaggggatcc 1620
cccgattacg ccagactctc aaagagaaag gaagagctcg ccagaaggtg tgtgaactac 1680
acaattagca ccgccgagaa aagggcccag tgtgggagga caatcgtggc cctcgaagac 1740
ctcaatatcg gcttcttcca cgggagaggc aaacaggagc ccggctgggt gggcctcttt 1800
accaggaaaa aggagaacag gtggctcatg caggccctcc acaaggcctt tctcgagctt 1860
gcccaccata ggggctacca cgtgatcgaa gttaaccccg cctataccag ccagacatgc 1920
ccagtctgca ggcactgcga tcctgacaac agagaccagc acaacagaga ggcttttcac 1980
tgtatcgggt gcgggttcag ggggaacgcc gacctcgacg tggcaaccca caatatcgct 2040
atggtcgcca tcacagggga gagcctcaag agagctaggg gtagcgttgc ttccaagacc 2100
ccccaacccc tcgctgccga a 2121
<210> 124
<211> 2121
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 124
atggccgaca cccccacact cttcacccag ttcctcaggc atcaccttcc cgggcaaagg 60
ttcaggaagg acatcctcaa acaagccggg aggatcctcg ccaacaaggg cgaggacgcc 120
accatcgcct tcctcagggg gaagtccgaa gagtccccgc ccgacttcca gccccccgtc 180
aaatgcccca taatagcctg ctccaggcct ctcactgagt ggccaatcta ccaggcctcc 240
gtggccatcc agggatacgt ctacgggcag agcctcgccg agttcgaagc ctccgacccc 300
ggctgcagca aggacggatt gctcggctgg ttcgacaaaa ccggcgtctg taccgactac 360
ttctcagtgc agggcctcaa ccttatcttc cagaacgcca ggaagaggta tattggcgtc 420
cagaccaagg tcaccaacag gaacgagaag aggcacaaga agttgaagag aatcaacgcc 480
aaaaggattg ctgagggtct ccccgaactc acctcagatg agccagagtc cgcactcgac 540
gagaccggtc acctcatcga cccacctggg ctcaacacca acatctactg ctaccaacag 600
gtgagcccta agcccttggc cctctctgaa gtgaaccagt tgcccactgc ctacgcaggc 660
tattccacct caggagacga tcctatccaa cccatggtga ccaaggatag gctctcaatc 720
tcaaagggcc agcccggtta catccccgag caccagagag ccctcctctc ccaaaagaag 780
cacaggagga tgagaggcta cgggctcaag gccagagccc tcctcgtgat agtgaggatc 840
caggacgact gggccgtgat cgacttgagg agcctcctca ggaacgcata ctggaggagg 900
attgttcaga ccaaggagcc cagcaccatc accaagttgc ttaagctcgt taccggagat 960
cccgtcctcg acgccactag gatggttgcc acctttacct ataagcccgg catcgtccag 1020
gtcaggtcag ccaagtgcct caagaataag cagggctcca agcttttctc cgaaaggtac 1080
ctcaatgaga ccgtctccgt gacctccatt gatctcggga gcaataactt ggtcgccgtt 1140
gccacctaca ggctcgtcaa cggtaacacc cccgaattgc tccaaaggtt caccctccca 1200
agccatctcg tcaaggactt tgagaggtac aagcaggcac atgataccct cgaggactcc 1260
atccagaaaa ccgccgtcgc ctccctcccc cagggccagc agaccgaaat caggatgtgg 1320
tcaatgtacg gctttaggga agcccaagag agggtctgtc aagagctcgg gctcgccgac 1380
gggagtattc cctggaacgt catgaccgcc acctccacca tcctcaccga cctcttcctt 1440
gccagagggg gagaccccaa gaaatgcatg tttacctccg aacctaagaa gaagaaaaac 1500
tccaagcagg tcctctataa gatcagggac agggcctggg ccaagatgta tagaactctt 1560
ctctccaagg agaccagaga ggcttggaac aaggccctct ggggccttaa aagagggtct 1620
cccgactatg caaggttgag caagagaaaa gaggagcttg caaggaggtg cgtcaactac 1680
accatctcca cagcagagaa aagagcccaa tgtggcagga ccatcgtcgc ccttgaagac 1740
cttaatatcg gctttttcca cggtaggggg aagcaggagc ctggctgggt gggcctcttc 1800
accaggaaga aagaaaacag gtggcttatg caagccttgc acaaggcctt cctcgaactc 1860
gcccaccata gagggtacca cgtcatcgag gttaatcccg cctacaccag ccagacctgc 1920
cccgtctgta ggcactgcga cccagacaat agggaccaac acaacagaga ggccttccat 1980
tgtatcggtt gcggcttcag aggcaacgcc gatcttgacg tcgctaccca caatatcgcc 2040
atggtcgcca tcaccgggga gagcctcaaa agggccaggg ggagcgttgc ctctaagacc 2100
ccccagcccc ttgcagccga g 2121
<210> 125
<211> 2121
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 125
atggccgaca cccccaccct gttcacccag tttcttaggc atcacctccc cggccagaga 60
ttcaggaagg acatcctgaa acaggcaggt aggatcctgg ccaacaaggg agaagatgct 120
accatcgcct ttctgagggg gaagtccgaa gagagtcccc ctgactttca gcctcccgtg 180
aagtgcccta taatcgcctg ttccagaccc ctgaccgaat ggccaatcta ccaggcctca 240
gttgccatcc aagggtacgt gtacggccaa tcactggcag aatttgaagc atccgacccc 300
gggtgttcca aagacggcct cctgggctgg ttcgacaaga ccggggtgtg caccgactac 360
ttctccgtcc agggcctcaa cctcatcttt caaaacgcca ggaagaggta catcggggtt 420
cagaccaagg tgaccaacag gaatgaaaag aggcacaaaa agttgaagag gatcaacgcc 480
aagaggatag cagagggcct gcccgaactc acctccgacg agcccgagtc cgccctggat 540
gaaaccgggc acctgatcga cccacccggg ctgaacacca atatctattg ctatcaacag 600
gtgtctccca agcccctcgc cctctcagag gttaaccagc tgcccactgc ctacgccggc 660
tacagtacct ctggagatga ccctattcaa cccatggtta ccaaagacag actgtccatt 720
tccaagggcc aacccggcta catccccgaa catcagagag ccctcctttc ccagaagaaa 780
catagaagga tgaggggcta tggcctcaag gccagggcac ttttggtgat cgtgaggatc 840
caggacgatt gggccgtgat cgacctgagg tccctgctta ggaatgcata ctggagaagg 900
atagttcaga ccaaggagcc ctccaccatc accaagctgc ttaagctggt gacaggggac 960
cctgtcctcg acgcaactag aatggtggcc acattcacct acaagcccgg aatcgttcag 1020
gtgagaagtg ccaagtgcct caagaacaag cagggatcta agcttttctc cgagaggtac 1080
ctgaatgaga ctgtgtccgt tacctccatc gacctgggtt ccaataatct cgtggccgtg 1140
gcaacttaca ggctggtgaa cggcaacacc cccgaattgc tgcagaggtt cactttgccc 1200
tcacacctgg tgaaagactt cgaaaggtac aaacaagccc acgatacatt ggaggacagt 1260
atccagaaga ccgcagtggc ctccttgccc cagggccaac agaccgagat caggatgtgg 1320
tccatgtacg ggttcaggga ggcccaggaa agagtgtgcc aggagctggg tcttgccgac 1380
gggtcaatcc cctggaacgt catgaccgca acctccacca tcttgactga tctttttctg 1440
gctagagggg gcgaccccaa aaagtgcatg ttcacctccg agccaaaaaa gaaaaagaat 1500
tctaaacagg tgctgtacaa gatcagggac agagcctggg ccaaaatgta cagaactctc 1560
ctgtcaaagg aaacaaggga agcatggaat aaggccctgt ggggcctgaa gagggggtcc 1620
cccgactatg ccaggctctc caagaggaag gaggaactgg ccagaaggtg cgtcaactac 1680
accatctcca ccgccgaaaa aagggctcag tgcgggagga ctatcgtggc cctggaggac 1740
ttgaacatcg gctttttcca tggaagaggc aagcaggagc cagggtgggt tggcctcttt 1800
accaggaaaa aggagaatag gtggctgatg caggcactgc acaaggcctt cctcgaactc 1860
gcccaccata gggggtatca cgttatcgag gttaaccccg cttacacctc ccaaacctgc 1920
cccgtgtgca ggcactgcga ccctgataat agggaccagc acaacaggga ggcctttcac 1980
tgcatcggct gcggctttag ggggaatgcc gacttggacg tcgccaccca caacatcgcc 2040
atggttgcca tcacagggga gtccttgaaa agggccaggg gctccgtcgc ttccaaaaca 2100
ccccagcccc ttgccgccga g 2121
<210> 126
<211> 2121
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 126
atggccgaca ctcccacctt gttcacccag tttctcagac accatctccc cggccaaagg 60
tttaggaaag acatccttaa gcaggctggc aggatcctcg ccaacaaagg tgaggatgcc 120
accatcgcat ttctcagagg gaaatccgaa gagtcacctc ccgacttcca gccccccgtg 180
aaatgtccca tcatcgcctg tagcagaccc ttgaccgagt ggcccatcta tcaggcctcc 240
gtcgccatac agggctacgt ttacggccag agtctcgccg aattcgaggc ctccgacccc 300
ggctgcagta aggacggtct cctcggctgg ttcgacaaga ccggcgtctg caccgactac 360
tttagcgtcc agggtctcaa tctcatattc cagaacgcaa ggaagagata catcggagtg 420
cagaccaagg tgacaaacag gaatgaaaag agacataaga agctcaagag aattaatgct 480
aagaggatcg ccgaggggct ccccgagctc acctccgacg agcccgagag tgccctcgac 540
gagaccgggc acttgatcga tcctcccggc ttgaatacca acatctactg ctatcaacag 600
gtcagtccca aacccctcgc cttgtcagaa gtgaaccagc tccccacagc ctacgcaggc 660
tacagcacca gcggagacga ccctattcag cccatggtga caaaggacag gctctcaatc 720
tccaaaggcc agcccggata catccccgag caccagaggg ctcttctcag ccagaagaaa 780
cacaggagga tgaggggtta cggcctcaag gccagagcct tgctcgtcat cgtgaggatc 840
caggacgact gggccgtgat cgacctcagg agtttgctca gaaacgccta ttggaggaga 900
atagtgcaaa ccaaggaacc ctccacaata accaagctcc tcaagctcgt taccggcgac 960
cccgttctcg atgccaccag gatggtggcc accttcactt acaagcccgg aatcgtgcag 1020
gtcaggagcg ccaagtgcct caagaataaa cagggctcca agctcttctc cgagaggtat 1080
ctcaatgaga ccgtcagcgt tacctccatt gacctcggga gcaacaacct cgtggccgtc 1140
gctacatata ggctcgtcaa cggcaacacc cccgagctcc tccaaagatt cacccttcct 1200
tcccacctcg ttaaggattt cgagagatac aagcaagcac acgacacctt ggaagattca 1260
atccagaaga ccgccgtcgc ctccctcccc cagggccagc aaaccgagat taggatgtgg 1320
agcatgtatg ggttcaggga ggcccaggag agggtctgcc aggagctcgg tctcgccgac 1380
ggcagcatac cctggaatgt catgacagcc actagtacca tccttactga cctcttcctc 1440
gccagaggcg gcgaccctaa gaagtgtatg ttcaccagcg agcctaagaa gaagaagaac 1500
agcaagcaag ttctttacaa aatcagagac agggcctggg ccaagatgta caggaccctc 1560
ctcagcaagg agaccaggga ggcttggaat aaggccttgt ggggccttaa aaggggctcc 1620
cccgactacg ccaggctctc caaaaggaag gaggagctcg ccagaagatg cgtcaactac 1680
accatcagca ccgccgaaaa gagggcccaa tgcggtagga ccatcgttgc actcgaagac 1740
ctcaacatcg gctttttcca cggcaggggg aaacaagagc ctgggtgggt tgggctcttc 1800
accaggaaga aggagaacag gtggcttatg caggccctcc acaaagcctt cctcgagttg 1860
gcccaccata gggggtacca tgttatcgag gtgaaccccg cctacaccag ccagacctgt 1920
cccgtgtgca ggcactgtga tcccgacaac agggaccagc acaacaggga ggccttccat 1980
tgcatcggct gcgggttcag agggaacgcc gatctcgacg ttgctaccca taatatagcc 2040
atggtcgcca tcaccggcga aagcctcaag agggctaggg gcagtgtggc cagcaagaca 2100
ccccagccct tggcagccga g 2121
<210> 127
<211> 2121
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 127
atggccgaca cccctaccct tttcacccag ttcctcaggc accacctccc cggccagagg 60
ttcaggaagg acatccttaa gcaggccgga agaatcctcg caaacaaggg tgaggacgcc 120
actatcgcat tcctcagggg caagtctgag gagagccctc ccgacttcca gcctcccgtc 180
aagtgcccta tcatcgcctg tagcaggccc cttacagaat ggcccatata ccaagccagc 240
gtcgcaatcc aggggtatgt ctacggccag tccctcgccg agtttgaagc ctccgacccc 300
ggctgttcaa aggatggcct cctcggctgg ttcgacaaaa ccggagtgtg cacagactat 360
ttcagcgtgc aggggttgaa cctcatcttc caaaacgcca ggaaaaggta catcggagtc 420
cagaccaagg tgacaaacag gaacgagaag aggcacaaga agctcaagag gatcaatgct 480
aagaggatcg ccgaagggct ccctgagctc acctccgacg agcctgagag cgccctcgac 540
gagaccggcc atcttatcga cccacccggt ttgaacacca acatctactg ctatcagcag 600
gtctctccca agccactcgc cctctccgag gtcaaccaac tccctaccgc ctacgccggc 660
tactccacat ccggtgatga ccccatccag cccatggtca ccaaagacag gctctccatc 720
agcaaaggac aacctggata catccccgag caccagaggg ccctcctcag tcagaagaag 780
cataggagga tgaggggata tggcctcaag gcaagggccc tcctcgtgat cgttaggatc 840
caggacgact gggcagtgat cgacctcagg tccctcctca ggaatgccta ctggaggaga 900
attgtccaaa ccaaagaacc ctccaccatc accaaactcc tcaagctcgt caccggcgac 960
cccgtcctcg acgccaccag aatggttgcc acctttacct acaagcccgg tatcgtccag 1020
gtcaggtccg ccaaatgcct caagaacaag cagggatcca aactcttcag cgagagatat 1080
ctcaacgaga ccgtgtccgt tacctccatc gatctcggga gtaataatct cgtggccgtc 1140
gcaacctata ggctcgtgaa cgggaacaca cccgaactcc ttcagaggtt caccctcccc 1200
tcccacctcg tcaaggactt cgagaggtat aaacaggccc acgataccct tgaggatagc 1260
atccagaaga ccgccgtggc cagtctccca caggggcagc aaaccgaaat taggatgtgg 1320
agcatgtacg gcttcaggga ggcccaggag agggtctgcc aggaattggg tctcgccgac 1380
ggctccatcc cctggaacgt tatgactgca accagcacca tcttgaccga cctcttcctc 1440
gcaagaggtg gggaccctaa gaagtgcatg tttactagcg agcccaagaa gaagaagaac 1500
tccaagcagg ttctctacaa gatcagggat agggcctggg ccaagatgta caggaccttg 1560
ctctccaaag agaccaggga ggcctggaac aaggctctct ggggcctcaa aaggggcagc 1620
cccgactacg ccaggctctc caagagaaag gaggaactcg ccaggaggtg cgttaactac 1680
acaatcagca ccgccgagaa gagggcccag tgcgggagga caatcgtcgc cctcgaggac 1740
ctcaatatcg gtttcttcca cggaaggggc aagcaggagc ctgggtgggt gggcctcttc 1800
accaggaaaa aggagaacag atggctcatg caggcactcc ataaggcctt cctcgaattg 1860
gcccaccaca ggggctatca cgtcatcgaa gtgaacccag catacaccag tcagacctgt 1920
cccgtctgca gacactgcga ccccgacaac agggatcagc acaacaggga agccttccac 1980
tgcatcggat gcggcttcag gggtaatgcc gatctcgatg ttgccaccca taacattgcc 2040
atggtcgcca ttaccgggga gtccctcaaa agagcaaggg ggagtgttgc aagcaaaact 2100
ccccaacccc tcgccgccga g 2121
<210> 128
<211> 2121
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 128
atggccgaca cccccacctt gttcactcag ttcctcaggc atcacttgcc cgggcagagg 60
ttcagaaagg acatcctcaa gcaggccgga aggatcctcg ccaacaaggg cgaggatgcc 120
accatcgctt tcctcagggg gaaaagcgag gagagtcccc ccgacttcca gccccccgtg 180
aagtgcccca tcatcgcctg cagcagaccc ttgaccgagt ggcccatcta ccaggccagt 240
gtcgccatcc agggctatgt gtacgggcag agcctcgccg agtttgaagc ctccgacccc 300
gggtgcagca aggacggtct cctcgggtgg ttcgacaaga ccggggtgtg caccgattac 360
ttctccgtgc aggggctcaa cttgatcttc cagaatgcca gaaagaggta tatcggtgtt 420
cagaccaagg tcaccaatag gaacgagaag aggcacaaga agctcaagag aatcaacgcc 480
aagaggatcg ccgaaggtct ccccgagctc accagcgacg agcccgagag cgccttggac 540
gaaaccgggc acttgatcga cccccccggg ctcaacacaa atatatactg ttaccaacag 600
gtttcaccca agcccctcgc cctttccgag gtcaaccagc tccctaccgc ctacgccggc 660
tactcaactt ccggagatga tcccatccag cccatggtga ccaaggatag gttgtccatt 720
agcaagggcc agcccggcta tatacccgag caccaaaggg ccctcctctc acagaagaag 780
cacagaagga tgagggggta cggcctcaag gccagggccc tccttgtcat tgtcaggatc 840
caagacgact gggccgtcat cgacctcaga agtctcctca ggaacgccta ctggaggagg 900
atcgttcaga ctaaggagcc cagcacaata accaaactcc tcaagctcgt gaccggcgac 960
cccgttctcg acgcaaccag aatggtggca accttcacat acaagcccgg catcgtccag 1020
gttaggtcag ccaagtgtct caagaacaag caaggaagca agttgttctc tgagaggtac 1080
cttaatgaga ctgtgtccgt gacctccata gacctcgggt ccaacaacct cgtggccgtg 1140
gccacataca ggctcgtcaa cgggaacacc cccgagctcc ttcagaggtt caccctcccc 1200
agtcacctcg tcaaggactt cgagaggtac aagcaggccc acgataccct cgaggactct 1260
atccagaaga ccgccgtggc ctccctcccc cagggccaac aaaccgagat aaggatgtgg 1320
agcatgtacg gtttcaggga agcccaggaa agggtgtgcc aagagctcgg cctcgccgac 1380
gggagcatcc cctggaacgt catgaccgcc acctccacca tcctcaccga cctcttcttg 1440
gcaagagggg gggaccccaa gaagtgcatg tttaccagcg agcccaagaa gaagaagaat 1500
tccaagcagg ttctctacaa gattagggac agagcctggg ctaaaatgta caggactctt 1560
ctctccaagg agaccaggga ggcctggaac aaggccctct ggggattgaa aagaggtagc 1620
cccgattatg ccagactctc aaagaggaag gaagagctcg ccagaaggtg tgtcaactac 1680
accatcagca cagccgagaa aagggcccaa tgcggtagaa ctatagtcgc cctcgaagac 1740
ctcaacatcg ggttcttcca cggtaggggc aagcaggagc ccggctgggt tggcttgttc 1800
accagaaaga aggaaaacag atggctcatg caagcactcc ataaggcctt cctcgagctc 1860
gcccatcaca gagggtatca cgtgatcgag gtcaatccag cttacactag ccagacctgc 1920
cccgtgtgca ggcactgtga ccccgacaac agggatcaac acaataggga agctttccac 1980
tgcatagggt gcgggttcag ggggaacgcc gacctcgatg tggccactca taacatcgcc 2040
atggtcgcca tcaccgggga gtccctcaaa agagccagag ggagtgttgc ctccaagacc 2100
ccccagcctc tcgccgccga a 2121
<210> 129
<211> 2121
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 129
atggccgaca ccccaaccct cttcacacag ttccttaggc accacctccc cggccagagg 60
ttcaggaagg acatcctcaa gcaggcaggt aggatcctcg ccaataaggg ggaggacgcc 120
actatagcct tcctcagagg caagtccgag gaaagccccc ccgacttcca gccccccgtc 180
aagtgcccca tcattgcctg tagcagaccc ctcaccgagt ggcctatcta ccaggctagc 240
gtcgccatcc agggttacgt ctacggccag agcctcgccg agttcgaggc ctccgacccc 300
gggtgctcaa aagatggact cctcgggtgg ttcgataaga ccggggtgtg caccgactac 360
ttttctgtgc agggcctcaa cctcatcttc cagaacgcca ggaaaaggta tatcggcgtg 420
caaaccaaag tgacaaatag gaacgagaaa aggcacaaaa agcttaagag gatcaatgcc 480
aaaagaattg ccgagggcct ccccgagctc accagcgacg agcccgagag cgccttggac 540
gagaccgggc acctcattga cccccccggc ctcaacacca acatctattg ctaccagcag 600
gtgtccccca agccccttgc cctcagcgag gttaaccagt tgcccaccgc ttacgctggg 660
tactcaacca gtggtgacga cccaatacag cccatggtta ccaaggatag actctccatc 720
tccaagggcc agcccggcta catcccagag catcagagag ccctccttag tcagaaaaag 780
cacaggagaa tgagggggta tggcctcaaa gccagagccc tcctcgtgat agttaggatc 840
caggacgact gggccgttat agacctcagg tccctcctca ggaacgccta ctggagaaga 900
atcgtccaaa ccaaggagcc cagcaccatc accaagctcc tcaagctcgt gaccggggat 960
cccgttctcg acgccaccag gatggtcgca accttcacct acaaacccgg gatcgttcag 1020
gttaggagcg ccaagtgcct caaaaataag caaggaagta agctcttctc agaaaggtac 1080
ttgaatgaga ccgtgagcgt cacctccatc gatctcgggt ctaacaacct cgtggctgtc 1140
gccacctata ggctcgtcaa tggcaacacc cccgagctcc tccaaaggtt caccctccct 1200
tcccatctcg ttaaggactt tgagaggtat aagcaggccc acgacacctt ggaggacagc 1260
atccagaaga ccgccgtggc ctccctcccc caggggcagc agacagagat cagaatgtgg 1320
tccatgtacg gcttcagaga ggcccaagag agggtgtgcc aagagctcgg gctcgccgac 1380
ggctctatcc cctggaacgt gatgaccgcc acttcaacca tcttgaccga cctcttcctc 1440
gcacggggag gggatcccaa gaaatgtatg ttcacctccg agcccaaaaa gaaaaagaat 1500
tccaagcagg tgctttacaa gatcagggac agggcctggg caaagatgta taggaccctc 1560
ctctcaaagg aaaccaggga ggcttggaat aaggccctct ggggcctcaa gaggggatcc 1620
cccgactacg ccaggctcag caaaaggaaa gaggagctcg ccaggaggtg cgtgaactac 1680
acaatttcca ccgctgagaa gagggcccaa tgcggcagaa ccatcgttgc ccttgaagac 1740
ctcaacatcg ggttcttcca cggcagggga aagcaggagc caggttgggt cgggctcttc 1800
acaaggaaaa aagaaaacag atggctcatg caagcccttc acaaggcctt cctcgagctc 1860
gcacaccaca ggggatatca cgtgatcgaa gtgaaccccg cttacacctc ccagacctgc 1920
cctgtctgca gacactgtga ccccgacaac agagaccagc ataataggga agccttccac 1980
tgcatcggct gtgggtttag gggtaacgct gacctcgacg tcgctactca caacatcgcc 2040
atggtggcca tcacagggga gtctttgaag agggccagag gcagcgtggc cagtaagact 2100
ccccagccac ttgccgcaga g 2121
<210> 130
<211> 2121
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 130
atggcagaca cacccacact ttttacccag tttctcaggc accacttgcc cggacagaga 60
tttaggaagg acatcctcaa gcaagccggc aggatcttgg ccaacaaggg ggaggacgcc 120
actatcgcat tcctcagggg gaaatccgag gaatcacccc ctgacttcca gcccccagtc 180
aaatgtccaa tcattgcctg tagcaggcct ttgactgaat ggcccatata ccaggcatct 240
gtggccatcc aaggctacgt gtatggccag agcctcgctg agttcgaagc tagtgaccct 300
ggatgctcta aggacgggct cctcggctgg ttcgacaaga ctggtgtgtg cacagattac 360
ttcagcgttc agggactcaa tctcatcttc cagaacgcca ggaagaggta catcggggtg 420
cagaccaagg tgaccaacag gaacgagaag aggcacaaga agctcaagag gatcaatgcc 480
aagaggatcg ctgagggtct ccctgagctc acctccgacg agccagagtc cgccctcgat 540
gagaccggac acttgattga cccccccggg ctcaatacca acatctattg ctaccaacaa 600
gtgagcccca agcccctcgc cctctccgaa gtgaaccagc tccccaccgc ctacgcaggc 660
tactccacca gcggggacga ccccattcag cctatggtga ccaaggatag gttgagcatt 720
tctaaggggc agccaggcta catccctgag caccagaggg cccttctctc ccagaagaaa 780
cacaggagga tgagggggta cgggttgaag gcaagagcac tcctcgtgat tgttagaatt 840
caggacgact gggccgtcat cgacctcaga agcctcctca gaaacgccta ttggaggaga 900
atcgtccaaa ccaaggagcc cagcaccatc actaaacttc tcaagctcgt taccggtgac 960
cccgtgctcg atgcaaccag gatggtggcc accttcacct acaagcccgg gatcgtccag 1020
gtgaggagcg caaaatgtct caaaaacaag caggggtcca agctcttttc tgagagatac 1080
ctcaacgaga ccgtgtccgt gacctcaatc gatctcggga gcaataacct cgtcgccgtt 1140
gcaacctaca gactcgtgaa cgggaacacc cccgagctct tgcagaggtt cacccttccc 1200
agtcacttgg tcaaagattt tgagaggtac aaacaggccc atgacaccct cgaggattca 1260
atccagaaga ccgcagtcgc ctccctcccc cagggacagc aaaccgagat caggatgtgg 1320
agcatgtacg ggttcaggga agcccaggag agggtgtgtc aagagcttgg gctcgccgac 1380
ggctccatcc catggaatgt gatgaccgcc acctcaacca tcctcaccga cctctttctc 1440
gccagggggg gtgaccccaa gaagtgcatg tttacttccg agccaaagaa gaagaaaaac 1500
tccaagcagg tgttgtataa aataagggac agggcctggg ccaagatgta caggaccctc 1560
ctctccaagg agaccaggga ggcctggaac aaggccctct ggggcctcaa aaggggcagc 1620
cccgactacg ccagactctc caagaggaag gaagagctcg cccggaggtg cgtcaactat 1680
accattagca ctgccgagaa gagggcccag tgcggcagga ccatcgtggc cttggaggat 1740
ctcaacatcg gcttcttcca cgggaggggc aaacaggagc ccgggtgggt gggcttgttt 1800
accaggaaga aagaaaatag gtggcttatg caggccctcc acaaggcctt cctcgagctc 1860
gcacaccaca ggggctatca cgttatcgag gtcaaccccg cctataccag ccagacctgc 1920
ccagtctgca ggcactgcga ccccgacaac agggaccagc acaataggga ggctttccat 1980
tgcatcgggt gcggatttag gggcaacgcc gacctcgatg tggctacaca caatatcgcc 2040
atggttgcta tcaccggcga gagtctcaaa agggccaggg gatcagtcgc ctctaagacc 2100
ccccagcccc tcgccgccga g 2121
<210> 131
<211> 2121
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 131
atggccgaca cccccaccct tttcacacag tttttgaggc accacctccc cggacagagg 60
ttcaggaagg acattctcaa gcaagccggg aggatcctcg ccaacaaagg ggaggacgcc 120
accatcgcct tcttgagagg taagagcgag gagtcccccc ccgatttcca accccccgtg 180
aagtgtccca tcatcgcctg ctccaggccc ctcaccgagt ggcccatcta ccaggcctct 240
gtcgctatcc agggatatgt ttacgggcag agtcttgcag agttcgaggc cagcgacccc 300
ggatgtagta aggacggcct cctcggttgg ttcgacaaga ctggcgtctg caccgactat 360
tttagcgttc aggggctcaa tctcatcttc cagaacgcca gaaagaggta catcggcgtc 420
caaactaagg tgactaatag gaacgagaag aggcacaaga agctcaaaag aatcaacgcc 480
aagagaatcg ctgagggcct ccccgagctc accagcgacg agcccgagtc cgcactcgac 540
gagaccggcc atctcatcga cccccccggg ctcaacacaa acatctactg ctatcaacaa 600
gtctccccca aacccctcgc cttgtccgag gtgaaccaac ttcccacagc atacgctggc 660
tacagcacct ccggcgacga ccccatccag ccaatggtga ccaaagacag actctcaatc 720
agcaagggcc agcccggata catccctgag caccagaggg ccctcctctc ccagaagaag 780
cacaggagga tgaggggtta tggactcaag gctagggccc tcctcgtgat tgtgaggatc 840
caggacgact gggcagttat tgatctcagg agcctcctta ggaatgccta ctggagaagg 900
attgttcaaa ccaaggagcc tagcaccatc accaagctcc tcaagctcgt gaccggggat 960
cccgtcctcg acgccaccag aatggtggcc accttcacct acaaacctgg cattgtgcag 1020
gtcaggtctg ccaaatgctt gaagaacaag caggggtcca aactcttcag cgagaggtac 1080
ctcaacgaga ccgtttccgt cacctcaatc gatttggggt ccaacaacct cgtcgccgtc 1140
gccacctata ggctcgtgaa tggcaacacc ccagaattgc ttcagaggtt cacactccct 1200
tcccaccttg ttaaagactt tgaaagatac aagcaagccc acgacacact cgaggacagc 1260
atccaaaaga ccgccgtcgc ctctctcccc caaggacaac agaccgagat aaggatgtgg 1320
agcatgtatg ggttcaggga ggcccaggaa agggtttgcc aggagctcgg tttggcagac 1380
ggctccattc cctggaacgt gatgaccgcc acctccacca tcctcacaga cttgtttctc 1440
gccaggggag gagaccctaa gaagtgtatg ttcacctcag agcctaagaa aaagaaaaac 1500
tccaagcagg tcctctacaa gatcagggac agggcctggg ccaagatgta taggactctc 1560
ctctccaagg agaccaggga ggcatggaac aaggccctct ggggccttaa gaggggatcc 1620
cccgattatg ccaggctcag caagaggaag gaggagctcg ccaggagatg cgttaattat 1680
actatatcta ccgccgaaaa gagggcccaa tgcgggagga ccatcgtggc cctcgaggac 1740
ctcaatatcg ggttctttca cggcaggggt aagcaggaac ccgggtgggt gggccttttc 1800
accaggaaga aggaaaacag gtggctcatg caagccctcc acaaggcctt cctcgagttg 1860
gcccaccaca ggggatacca cgtcatcgag gttaatcccg cctacacctc tcagacctgc 1920
cccgtgtgca gacactgcga tcccgacaac agggatcagc acaacaggga agcctttcac 1980
tgcatcgggt gtgggttcag gggtaacgcc gacctcgatg tcgcaaccca caatatcgcc 2040
atggtggcta tcaccgggga gagcctcaag agggccagag gaagcgttgc ctcaaaaaca 2100
ccccagcccc tcgccgccga a 2121
<210> 132
<211> 757
<212> PRT
<213> Unknown
<220>
<223> Bacteriophage
<400> 132
Met Pro Lys Pro Ala Val Glu Ser Glu Phe Ser Lys Val Leu Lys Lys
1 5 10 15
His Phe Pro Gly Glu Arg Phe Arg Ser Ser Tyr Met Lys Arg Gly Gly
20 25 30
Lys Ile Leu Ala Ala Gln Gly Glu Glu Ala Val Val Ala Tyr Leu Gln
35 40 45
Gly Lys Ser Glu Glu Glu Pro Pro Asn Phe Gln Pro Pro Ala Lys Cys
50 55 60
His Val Val Thr Lys Ser Arg Asp Phe Ala Glu Trp Pro Ile Met Lys
65 70 75 80
Ala Ser Glu Ala Ile Gln Arg Tyr Ile Tyr Ala Leu Ser Thr Thr Glu
85 90 95
Arg Ala Ala Cys Lys Pro Gly Lys Ser Ser Glu Ser His Ala Ala Trp
100 105 110
Phe Ala Ala Thr Gly Val Ser Asn His Gly Tyr Ser His Val Gln Gly
115 120 125
Leu Asn Leu Ile Phe Asp His Thr Leu Gly Arg Tyr Asp Gly Val Leu
130 135 140
Lys Lys Val Gln Leu Arg Asn Glu Lys Ala Arg Ala Arg Leu Glu Ser
145 150 155 160
Ile Asn Ala Ser Arg Ala Asp Glu Gly Leu Pro Glu Ile Lys Ala Glu
165 170 175
Glu Glu Glu Val Ala Thr Asn Glu Thr Gly His Leu Leu Gln Pro Pro
180 185 190
Gly Ile Asn Pro Ser Phe Tyr Val Tyr Gln Thr Ile Ser Pro Gln Ala
195 200 205
Tyr Arg Pro Arg Asp Glu Ile Val Leu Pro Pro Glu Tyr Ala Gly Tyr
210 215 220
Val Arg Asp Pro Asn Ala Pro Ile Pro Leu Gly Val Val Arg Asn Arg
225 230 235 240
Cys Asp Ile Gln Lys Gly Cys Pro Gly Tyr Ile Pro Glu Trp Gln Arg
245 250 255
Glu Ala Gly Thr Ala Ile Ser Pro Lys Thr Gly Lys Ala Val Thr Val
260 265 270
Pro Gly Leu Ser Pro Lys Lys Asn Lys Arg Met Arg Arg Tyr Trp Arg
275 280 285
Ser Glu Lys Glu Lys Ala Gln Asp Ala Leu Leu Val Thr Val Arg Ile
290 295 300
Gly Thr Asp Trp Val Val Ile Asp Val Arg Gly Leu Leu Arg Asn Ala
305 310 315 320
Arg Trp Arg Thr Ile Ala Pro Lys Asp Ile Ser Leu Asn Ala Leu Leu
325 330 335
Asp Leu Phe Thr Gly Asp Pro Val Ile Asp Val Arg Arg Asn Ile Val
340 345 350
Thr Phe Thr Tyr Thr Leu Asp Ala Cys Gly Thr Tyr Ala Arg Lys Trp
355 360 365
Thr Leu Lys Gly Lys Gln Thr Lys Ala Thr Leu Asp Lys Leu Thr Ala
370 375 380
Thr Gln Thr Val Ala Leu Val Ala Ile Asp Leu Gly Gln Thr Asn Pro
385 390 395 400
Ile Ser Ala Gly Ile Ser Arg Val Thr Gln Glu Asn Gly Ala Leu Gln
405 410 415
Cys Glu Pro Leu Asp Arg Phe Thr Leu Pro Asp Asp Leu Leu Lys Asp
420 425 430
Ile Ser Ala Tyr Arg Ile Ala Trp Asp Arg Asn Glu Glu Glu Leu Arg
435 440 445
Ala Arg Ser Val Glu Ala Leu Pro Glu Ala Gln Gln Ala Glu Val Arg
450 455 460
Ala Leu Asp Gly Val Ser Lys Glu Thr Ala Arg Thr Gln Leu Cys Ala
465 470 475 480
Asp Phe Gly Leu Asp Pro Lys Arg Leu Pro Trp Asp Lys Met Ser Ser
485 490 495
Asn Thr Thr Phe Ile Ser Glu Ala Leu Leu Ser Asn Ser Val Ser Arg
500 505 510
Asp Gln Val Phe Phe Thr Pro Ala Pro Lys Lys Gly Ala Lys Lys Lys
515 520 525
Ala Pro Val Glu Val Met Arg Lys Asp Arg Thr Trp Ala Arg Ala Tyr
530 535 540
Lys Pro Arg Leu Ser Val Glu Ala Gln Lys Leu Lys Asn Glu Ala Leu
545 550 555 560
Trp Ala Leu Lys Arg Thr Ser Pro Glu Tyr Leu Lys Leu Ser Arg Arg
565 570 575
Lys Glu Glu Leu Cys Arg Arg Ser Ile Asn Tyr Val Ile Glu Lys Thr
580 585 590
Arg Arg Arg Thr Gln Cys Gln Ile Val Ile Pro Val Ile Glu Asp Leu
595 600 605
Asn Val Arg Phe Phe His Gly Ser Gly Lys Arg Leu Pro Gly Trp Asp
610 615 620
Asn Phe Phe Thr Ala Lys Lys Glu Asn Arg Trp Phe Ile Gln Gly Leu
625 630 635 640
His Lys Ala Phe Ser Asp Leu Arg Thr His Arg Ser Phe Tyr Val Phe
645 650 655
Glu Val Arg Pro Glu Arg Thr Ser Ile Thr Cys Pro Lys Cys Gly His
660 665 670
Cys Glu Val Gly Asn Arg Asp Gly Glu Ala Phe Gln Cys Leu Ser Cys
675 680 685
Gly Lys Thr Cys Asn Ala Asp Leu Asp Val Ala Thr His Asn Leu Thr
690 695 700
Gln Val Ala Leu Thr Gly Lys Thr Met Pro Lys Arg Glu Glu Pro Arg
705 710 715 720
Asp Ala Gln Gly Thr Ala Pro Ala Arg Lys Thr Lys Lys Ala Ser Lys
725 730 735
Ser Lys Ala Pro Pro Ala Glu Arg Glu Asp Gln Thr Pro Ala Gln Glu
740 745 750
Pro Ser Gln Thr Ser
755
<210> 133
<211> 2271
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 133
atgcctaagc ccgctgtgga gtctgaattc agcaaggttc tgaaaaaaca ctttcctggt 60
gaacgtttta gatccagtta catgaagcgc ggtggaaaaa tcttggcagc ccagggggag 120
gaagcagtag tcgcatacct acaaggtaag tccgaggagg agccgcctaa tttccaacca 180
ccagccaagt gtcatgtggt gacaaaaagt agagacttcg ctgaatggcc tattatgaaa 240
gcaagcgagg caatccagag gtatatttat gcgttgagca ccaccgaacg cgctgcatgc 300
aaacctggca agtcctcgga gtcccacgcc gcctggtttg ctgctacggg ggtttctaat 360
cacgggtatt cacatgtaca aggactgaat ttgattttcg atcacacttt gggaagatac 420
gatggtgtcc ttaaaaaggt ccaactgaga aatgagaagg ccagggcacg ccttgagtct 480
ataaacgctt cacgcgcgga tgaaggactg cctgagataa aggctgaaga agaggaagtt 540
gctaccaacg agacgggaca cttgttgcaa cctccgggga tcaatccttc attctatgtt 600
taccaaacga tatctcctca agcttacaga ccacgtgacg aaatagtgtt accccctgaa 660
tatgccggtt atgttagaga tccaaacgca ccaatccctc taggagttgt tagaaacagg 720
tgtgatattc aaaagggatg tccaggttat ataccagagt ggcagagaga ggctggcact 780
gcgattagtc ctaaaactgg caaagcggtg acggttcctg ggttgagtcc aaaaaagaac 840
aagcgcatgc gccgctattg gagaagtgaa aaggagaaag cccaggatgc attgcttgtg 900
actgtcagga ttggcactga ttgggtcgta attgatgtga gaggtctgtt gcgtaacgcg 960
cgatggagaa caattgctcc taaggatatc tcgctcaatg ccctgctcga cttgttcact 1020
ggagatcccg tcatagacgt caggaggaat atagttacat tcacatacac actggacgcc 1080
tgcggtactt acgcacgtaa gtggactctt aaagggaagc aaaccaaagc tactttggac 1140
aaattgacgg caacacagac agtcgcattg gtggccatcg atttgggcca aacaaatcct 1200
atttctgctg gaatctcgag ggtgacacaa gagaacggtg ccctccagtg tgaaccctta 1260
gaccggttca ctcttccaga cgatcttctg aaggacatta gtgcttacag aattgcttgg 1320
gataggaacg aagaagagtt gagggctaga agtgtggagg ctctgcccga agcacagcag 1380
gccgaagtca gagctctcga tggagtgtca aaggaaaccg cgagaacaca gctctgcgca 1440
gattttggcc ttgaccccaa gaggttgccc tgggacaaga tgtcctccaa cacaactttc 1500
attagcgagg cactgttgtc aaacagcgtt tcccgcgatc aggtattttt tactcctgcc 1560
ccaaagaaag gagcaaagaa gaaggctcct gttgaggtta tgcgcaagga caggacctgg 1620
gcgagggctt acaagccaag gctaagcgtg gaagcacaga agcttaagaa tgaagcgctt 1680
tgggctctga agaggacttc accagagtac cttaagctct caagacgaaa agaagaattg 1740
tgtcggagga gcataaatta tgttatcgaa aagactcgga gaagaactca gtgtcagatc 1800
gttattcccg tgatagagga tttgaatgtt cgatttttcc atggctcagg caagagattg 1860
cctggttggg ataatttttt cactgctaag aaggaaaacc gttggttcat acaaggtttg 1920
cataaggcct tctcagactt gcgtacccac cgctcgttct acgtttttga ggtgagacca 1980
gaacgtactt caattacttg tcctaaatgc ggccattgcg aggttggaaa cagagatggt 2040
gaagcattcc aatgtctgtc gtgcgggaaa acctgtaatg cagaccttga cgttgcaacg 2100
cacaatctta cacaggtagc tcttactggg aagaccatgc ctaagaggga agaaccacgc 2160
gatgctcaag gaactgctcc tgcaagaaag acaaaaaaag cttctaagtc aaaggctcca 2220
cctgcagaaa gggaagacca gacccctgcc caggagccat ctcagacatc a 2271
<210> 134
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 134
atgcctaagc ccgccgttga gtctgaattt tccaaggttc tcaagaagca ttttcccgga 60
gagaggttta gaagcagcta catgaagaga ggggggaaaa tcctcgcagc ccagggggag 120
gaagctgtgg tcgcttacct ccaaggcaag agcgaagaag agccccccaa cttccagccc 180
ccagccaaat gtcatgtggt tacaaagtct agggactttg ccgaatggcc aataatgaag 240
gccagcgagg ccatccagag gtacatctat gccctcagca ccaccgagag agccgcctgc 300
aagcccggaa agtcaagcga gtcacacgct gcctggttcg ccgccaccgg cgtttcaaac 360
cacgggtatt ctcatgtcca gggccttaac cttatcttcg accacaccct cggcaggtat 420
gacggagtcc tcaagaaagt tcaactcagg aacgagaagg ctagggccag gctcgagagc 480
atcaatgcct ctagggcaga tgaaggcctc cccgagatca aggccgagga ggaggaggtg 540
gccaccaacg agaccggtca cctcctccaa cctcccggca tcaatccaag cttctatgtg 600
taccagacca tcagccccca ggcctatagg cccagggacg agatcgtcct cccccccgag 660
tacgccggat acgtcaggga ccctaatgcc ccaatccctc tcggtgtggt gaggaatagg 720
tgcgacatcc aaaaggggtg ccccggctac atcccagaat ggcagaggga ggccggaacc 780
gccatctccc ccaagaccgg taaggccgtc accgtccccg ggctcagtcc caagaagaac 840
aagaggatga ggagatactg gaggagcgag aaggaaaagg cccaggatgc actcctcgtt 900
accgtcagga tcggcaccga ctgggtggtc atcgacgtga gaggcctcct caggaacgcc 960
agatggagga ccatcgcccc caaggatatc agcctcaacg ccctcctcga tctcttcact 1020
ggagaccccg tcatagacgt taggagaaac atcgtcacct tcacctacac cctcgacgcc 1080
tgcggaacct acgctagaaa gtggaccttg aaagggaaac agaccaaggc cacccttgac 1140
aaactcactg ccacccagac tgttgccctc gtggccatcg acctcggcca gaccaacccc 1200
atctcagccg gcatctcaag ggtgacacag gagaatggcg ccctccagtg cgagcctctc 1260
gacagattca cactccccga cgacttgctc aaggacatca gcgcatacag gatcgcctgg 1320
gacaggaacg aagaagaact tagggcaaga tccgtcgagg ccctccctga agcacagcaa 1380
gccgaggtga gggccctcga cggggtgagc aaggagaccg ccaggaccca gctctgtgcc 1440
gatttcgggc tcgatcccaa gagacttcct tgggacaaga tgagcagtaa caccaccttc 1500
atttccgaag ccctcctcag caacagcgtt agcagagacc aggtcttctt cactcccgcc 1560
cccaagaaag gcgccaagaa aaaagccccc gtggaggtca tgaggaagga caggacctgg 1620
gcaagagcct acaagcccag gctctcagtt gaagcccaga aattgaaaaa cgaagccctc 1680
tgggctctca agaggacctc ccccgaatac cttaaattga gcaggaggaa ggaggagctc 1740
tgcaggaggt ccataaacta cgtgatcgaa aaaactagaa ggaggacaca atgtcagatc 1800
gttatccccg ttatcgaaga tctcaacgtg aggttcttcc acggctcagg gaagaggctc 1860
ccaggatggg ataacttctt caccgccaag aaggagaata ggtggttcat tcagggattg 1920
cacaaggcat tcagtgatct caggactcac agaagctttt atgtgttcga ggtgaggccc 1980
gagaggacct ccatcacctg ccccaagtgc gggcattgcg aggtcggcaa cagggacggg 2040
gaggcattcc agtgcctctc ctgcggcaaa acttgcaacg ccgacctcga cgtggcaaca 2100
cataacctca cccaagtcgc tctcacaggg aagaccatgc ccaagaggga ggaacccaga 2160
gacgcccagg gtaccgcacc cgccaggaag accaagaagg ccagcaaaag caaggcaccc 2220
cccgccgaga gggaggatca gacccccgct caggagccct cacagacctc c 2271
<210> 135
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 135
atgcccaagc ccgcagtcga gagcgagttt tcaaaggttc tcaaaaagca ctttcccggc 60
gagaggttca gatccagcta tatgaaaaga ggggggaaaa tcctcgccgc ccagggcgaa 120
gaggccgttg tggcctactt gcagggcaag agcgaagaag aaccccccaa cttccagcct 180
cccgccaagt gccacgtcgt gacaaagagc agagacttcg ccgagtggcc catcatgaag 240
gcttccgagg ccatccagag gtacatctac gccctctcca ccacagagag ggccgcctgc 300
aagcccggca agagcagcga gtcccacgcc gcctggtttg ctgccaccgg cgttagcaat 360
catgggtact cccacgtcca ggggcttaat ctcattttcg accacaccct cgggaggtac 420
gacggtgtgc tcaaaaaggt tcagctcagg aacgagaagg caagggctag gctcgaatca 480
atcaacgcca gtagagctga tgagggtcta cccgagatta aagccgaaga ggaggaggtc 540
gccaccaatg agactggtca cctcctccag ccacccggta tcaatcccag tttctacgtg 600
taccaaacca tctccccaca ggcttacagg cccagagatg agatcgtcct cccacccgag 660
tacgccgggt acgttagaga ccccaacgcc cccatccccc tcggtgtggt gaggaacagg 720
tgcgatatcc agaaggggtg ccccgggtac atccccgaat ggcagaggga ggccgggacc 780
gccatcagcc ccaaaaccgg caaggccgtt actgtgcccg gactctctcc caaaaagaac 840
aagagaatga ggaggtactg gaggagcgaa aaggagaagg cccaggacgc cttgctcgtc 900
actgtcagaa tcgggaccga ctgggtggtg atcgacgtga ggggcctcct caggaacgcc 960
aggtggagga caatcgcccc caaggacatc tccctcaatg ccctcttgga cctcttcacc 1020
ggcgatccag ttatcgatgt caggaggaac atcgtgacct tcacatatac cctcgacgcc 1080
tgtggtacct acgccagaaa gtggaccctc aaggggaaac agaccaaggc taccctcgac 1140
aagctcaccg ccacccagac cgtggccttg gtcgcaatcg acctcggaca gaccaacccc 1200
atcagcgccg ggatctctag ggtgacccaa gagaacgggg ccttgcagtg tgagcccctc 1260
gacaggttca ccctccccga cgaccttctt aaggatatta gcgcttacag aatcgcctgg 1320
gacaggaacg aggaggagct cagggccaga agcgtcgagg ccctccccga agcccagcag 1380
gccgaagtca gggccctcga cggcgtgagc aaggagaccg ccaggaccca actttgtgct 1440
gacttcggcc ttgaccccaa gaggctcccc tgggacaaaa tgtcatccaa caccaccttc 1500
atctctgagg ccctcctcag caatagcgtg tccagagacc aggtcttttt cacccccgcc 1560
cccaagaagg gagccaagaa gaaggccccc gtggaggtta tgaggaaaga caggacctgg 1620
gccagggcct acaagccaag gctcagcgtc gaggcccaaa aactcaagaa cgaggctctc 1680
tgggcactca aaagaacctc ccccgagtac cttaagctca gcaggaggaa ggaggagctc 1740
tgtaggaggt ctattaacta cgtgatcgag aagaccagga ggaggacaca gtgccagatc 1800
gttatccccg tgatcgagga tctcaacgtg aggttcttcc acgggagcgg gaagaggctc 1860
cccggctggg acaacttctt caccgccaaa aaagagaaca ggtggttcat acaagggctc 1920
cataaggcct ttagcgacct caggacccac aggagctttt acgtcttcga ggtgaggccc 1980
gagaggacca gcattacttg tcctaagtgc gggcactgtg aggtcgggaa cagggatggg 2040
gaagcattcc agtgcttgtc ctgcgggaag acctgcaatg ccgacttgga cgtcgccacc 2100
cacaacctca cccaagtggc cctcaccgga aagaccatgc ccaagaggga ggaacctagg 2160
gacgcccaag ggaccgcccc cgctagaaag acaaagaagg cctccaaatc caaggcaccc 2220
cccgccgaga gggaagacca aaccccagcc caagagccct cccagaccag c 2271
<210> 136
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 136
atgcccaaac ctgccgtgga gagcgagttt agcaaggttc tcaagaagca cttcccaggt 60
gagaggttca ggagctcata catgaagagg ggggggaaga tactcgcagc ccagggggag 120
gaagccgtgg ttgcctacct ccaggggaag tcagaagagg agcctcccaa ttttcagccc 180
cccgccaagt gccacgtggt gacaaagagt agggacttcg ccgagtggcc catcatgaaa 240
gccagtgagg ctattcagag gtatatctac gcactctcta ccaccgagag ggccgcctgt 300
aagcccggga agagctcaga gtcccatgcc gcctggttcg ccgccacagg ggtttctaac 360
cacgggtata gccacgttca aggtctcaac ttgatttttg accacaccct cggtagatac 420
gacggagttc ttaaaaaggt tcaactcaga aacgagaaag ccagggccag gctcgagagt 480
atcaacgcct ccagggccga tgaaggcctc cccgagatca aggcagagga ggaagaagtc 540
gccaccaacg agaccggtca tcttctccag ccccctggga tcaaccccag cttctacgtc 600
taccaaacta tcagccctca ggcctatagg cccagggacg agatcgtgct cccccctgag 660
tacgccggat acgtcaggga ccccaacgct cccatccccc tcggggttgt gaggaacagg 720
tgcgacatcc agaaggggtg ccccgggtac atccccgagt ggcagaggga ggccgggaca 780
gccatctccc caaagaccgg caaagccgtg acagttccag gcctctcacc taaaaagaac 840
aagaggatga ggaggtactg gagatcagag aaagaaaagg cccaagatgc cctcctcgtc 900
accgttagga ttggcactga ctgggttgtc atcgacgtga ggggcctcct caggaatgcc 960
agatggagaa ctatcgcccc caaagacatc tccctcaacg ccctcttgga cctcttcacc 1020
ggggaccccg tgatcgacgt gaggaggaac atcgtgacct tcacctacac cttggacgcc 1080
tgcgggactt acgccaggaa gtggaccctc aaaggcaagc agactaaggc cacccttgac 1140
aagctcaccg ccacccagac cgttgccctc gtcgcaatcg accttgggca gaccaacccc 1200
atcagcgcag ggatctctag ggtgacccag gagaatggcg cccttcagtg cgagcccttg 1260
gacagattca ccctccccga cgacctcctc aaggacatct cagcctacag gatagcctgg 1320
gacaggaacg aggaggagct cagagccaga tccgtcgaag ccctccccga ggcccagcaa 1380
gccgaggtga gggccctcga cggtgtctcc aaggaaaccg ccaggaccca gctctgcgcc 1440
gacttcggcc tcgaccccaa aagactcccc tgggacaaga tgtccagcaa caccactttc 1500
atttccgagg ccttgctcag caacagcgtt agcagagacc aagtgttctt cacccccgct 1560
cccaaaaagg gagccaagaa gaaagccccc gtcgaggtta tgaggaaaga cagaacctgg 1620
gccagggcct acaagcccag gctcagcgtc gaggcacaga aactcaagaa tgaagccctc 1680
tgggctctca agagaacaag ccccgagtac ttgaaactca gcagaaggaa ggaggaactc 1740
tgcaggaggt ccatcaatta cgtgatcgag aagactagga ggaggaccca gtgccagata 1800
gttatccctg ttatcgaaga ccttaacgtg aggttctttc acggcagcgg gaagaggctt 1860
cccgggtggg acaacttttt caccgccaaa aaggaaaaca ggtggtttat ccagggcctc 1920
cacaaggcct tctcagacct caggacccac aggtccttct acgttttcga ggtgagaccc 1980
gagaggacca gcatcacctg ccccaagtgc gggcactgcg aggtcggcaa tagggacggg 2040
gaggctttcc aatgcttgag ttgcgggaag acctgcaatg ccgacctcga cgttgccact 2100
cacaacttga cccaagtggc tctcaccggg aaaaccatgc ccaagaggga agaacccaga 2160
gacgctcagg ggaccgctcc cgccagaaaa accaagaagg cctcaaagtc caaggctccc 2220
cccgccgaaa gggaggacca aacccccgct caggagcctt cccagacatc c 2271
<210> 137
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 137
atgcccaagc ccgccgtcga gtccgagttc tcaaaggtcc tcaagaagca cttccccggg 60
gagagattca ggagctctta catgaagagg gggggtaaaa tcctcgcagc ccagggcgag 120
gaggccgtgg tcgcctactt gcaggggaag tcagaggagg aaccccccaa ttttcagccc 180
cccgccaagt gccacgtcgt gacaaagtcc agggactttg ctgaatggcc catcatgaaa 240
gcctccgagg ctatccagag gtacatatac gccctctcca ccaccgagag ggccgcctgt 300
aagcccggga agagcagcga gtcccatgcc gcctggttcg ctgccaccgg ggtttccaac 360
cacggctact cccacgttca gggactcaat ctcatctttg accacacact cggcagatac 420
gacggcgttc tcaagaaggt ccagctcagg aatgagaagg ccagggccag gctcgagtcc 480
atcaacgcaa gcagagccga cgagggactc cccgagatca aggccgagga ggaggaagtt 540
gccaccaacg agaccgggca cctcctccaa ccacctggga taaacccatc cttctacgtg 600
taccaaacca tcagccccca ggcttacagg cccagagatg agatcgtgtt gccccccgag 660
tatgccggat acgttaggga ccccaatgcc cctatccccc tcggcgtcgt caggaacaga 720
tgcgacatcc agaaaggatg ccccggctac atccccgaat ggcagagaga ggccggcacc 780
gcaatctccc ccaagaccgg caaagccgtg accgttcccg gcctcagccc aaagaaaaac 840
aaaaggatga gaagatactg gaggagtgag aaggagaaag ctcaagatgc cctccttgtc 900
actgtcagga tcgggaccga ctgggtcgtg attgacgtga ggggactcct taggaacgcc 960
aggtggagaa ccatcgcccc caaggacata tccctcaacg ccctcttgga cctcttcacc 1020
ggtgacccag tgatcgacgt cagaaggaat atcgtgacct ttacctacac cttggacgcc 1080
tgcgggacat atgccaggaa gtggaccttg aaggggaagc agaccaaggc tactctcgac 1140
aaactcaccg ccacccagac cgttgccctc gtcgccatag acctcggcca gaccaacccc 1200
atcagcgccg gcatctctag agtaactcag gaaaatgggg ccctccagtg tgagcccctc 1260
gacagattca ccctccccga cgatctcttg aaggacattt ccgcctacag gatcgcctgg 1320
gacaggaacg aggaggagct cagggccaga tcagtggaag cacttcccga ggcccagcag 1380
gcagaagtga gggccctcga cggggtgagc aaagagacag ccaggaccca gctttgtgcc 1440
gatttcggac tcgaccccaa gagactcccc tgggacaaga tgtccagcaa tacaaccttc 1500
atcagcgagg cccttctctc taacagcgtc agtagagacc aagtcttctt cactcccgcc 1560
cccaagaagg gagccaagaa gaaggcccca gtggaggtca tgaggaagga taggacctgg 1620
gccagagctt acaagcccag gctttccgtg gaggcccaga agttgaaaaa tgaggcactc 1680
tgggccctca agagaaccag ccccgagtac ttgaagctct ccaggaggaa ggaggaactc 1740
tgcaggaggt ccatcaacta cgtcatagag aagaccagaa gaaggaccca gtgtcagatc 1800
gtcatccctg tcatcgagga tcttaacgtc agattcttcc acgggtccgg taagaggttg 1860
cctggctggg acaacttctt caccgcaaag aaggagaaca ggtggtttat ccaaggcctc 1920
cacaaggcct ttagcgatct cagaacccac agaagctttt acgtgttcga ggtcaggccc 1980
gagagaacca gcatcacctg tccaaagtgc gggcactgcg aagttgggaa tagggacggg 2040
gaggctttcc agtgcctctc ctgcgggaaa acttgcaacg ccgacttgga cgtcgcaaca 2100
cataacctca cccaggtggc cctcaccgga aagaccatgc ccaagagaga ggagcctagg 2160
gacgcccaag gtactgcccc cgccagaaaa accaaaaagg ccagcaagtc caaggccccc 2220
cccgccgaaa gggaggacca gacccccgct caggaaccca gccaaacctc c 2271
<210> 138
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 138
atgcccaagc ccgccgtcga atcagagttc agcaaagttc ttaaaaaaca ctttcccggt 60
gaaaggttca gaagttcata catgaagagg gggggtaaga tcctcgccgc acagggcgag 120
gaggcagtcg tcgcttacct ccagggaaaa tccgaggagg agccccccaa ctttcaacca 180
cccgctaaat gtcacgtcgt gaccaagtcc agggactttg cagaatggcc catcatgaaa 240
gccagcgagg ccatccagag gtatatctac gctttgagta ctaccgaaag ggcagcctgc 300
aaacctggga aaagcagcga gtcacacgcc gcctggttcg ccgcaaccgg agtcagcaac 360
cacgggtact cccacgtgca gggcctcaac ctcattttcg atcacaccct cggaaggtat 420
gacggggttc tcaaaaaagt gcagctcagg aacgaaaaag ccagggcaag gttggagtcc 480
atcaacgcca gtagggccga cgagggcctc cccgagatca aggcagagga agaggaagtc 540
gccaccaacg agacaggcca ccttctccaa ccccccggga taaacccaag tttttacgtg 600
taccagacca tctcacccca agcctacagg cccagggatg agatcgttct ccctcccgag 660
tatgccgggt atgtaaggga tcccaacgcc ccaatccccc tcggcgtggt gaggaacagg 720
tgcgacatcc agaagggatg ccccggatat atccccgagt ggcagaggga ggccgggacc 780
gccatcagtc ccaaaacagg caaagccgtg accgtccccg ggctcagccc caaaaagaac 840
aagaggatga ggagatattg gagaagcgag aaggagaagg cccaagacgc cctcctcgtg 900
accgtcagaa tcggcaccga ctgggtggtt atcgatgtta ggggactcct caggaacgcc 960
aggtggagga ccatcgcccc caaggacatc tccctcaacg ccctcttaga cttgttcacc 1020
ggcgaccccg ttattgacgt caggaggaac attgttacct tcacttatac cctcgacgct 1080
tgcgggacct acgccaggaa gtggaccctt aagggaaagc agaccaaggc caccctcgac 1140
aagctcacag ccacccagac cgttgccctc gttgccattg acctcggcca gaccaacccc 1200
atcagcgccg gtatcagcag ggttacccaa gagaacgggg cccttcaatg cgagcctctc 1260
gacagattca cccttcccga cgacctcctc aaagacatct ccgcctatag gatcgcctgg 1320
gacagaaacg aggaggagct cagggccagg agcgtggagg ccctccctga ggcccagcag 1380
gccgaggtta gggccctcga tggagtctcc aaagagaccg ccaggaccca gctttgcgcc 1440
gacttcgggc tcgacccaaa gaggctccct tgggacaaga tgtcaagcaa caccaccttc 1500
atcagcgagg ccttgctctc aaacagtgtc tccagggacc aagtgttttt taccccagcc 1560
cccaagaagg gcgctaagaa gaaagctccc gtcgaggtca tgagaaagga caggacctgg 1620
gctagggcct acaagcccag gctcagcgtc gaggcccaga agctcaagaa cgaggccctc 1680
tgggccctca agaggacctc ccctgagtac cttaagctca gcagaaggaa ggaggagctc 1740
tgcaggagga gcattaacta cgtgattgaa aagaccagga ggaggaccca gtgccagata 1800
gtgattcccg tcatcgagga cctcaatgtt aggttcttcc acggcagcgg aaagaggctc 1860
ccagggtggg acaacttctt caccgccaaa aaggagaaca gatggtttat ccaggggctc 1920
cataaggctt tcagcgacct cagaacccac aggtccttct acgtcttcga agtgaggccc 1980
gaaaggacct ccatcacctg cccaaagtgc gggcactgcg aggtcggcaa tagagacggc 2040
gaagccttcc agtgtctcag ctgcgggaag acctgcaacg ccgatctcga cgtcgccaca 2100
cataacctca cccaggtcgc cctcaccggg aagaccatgc ccaagaggga ggaacccagg 2160
gacgcccagg gcacagcccc cgccagaaag accaaaaagg ccagcaagtc caaagctcca 2220
cctgcagaga gggaagacca gactcccgcc caggagccct cccagaccag t 2271
<210> 139
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 139
atgcccaaac ccgccgttga gtctgagttc tccaaggttc tgaaaaagca cttccccggt 60
gagaggttca ggagttctta catgaaaaga gggggcaaga tacttgcagc ccagggggaa 120
gaggccgttg tcgcctacct ccagggcaaa tctgaggaag agcctccaaa cttccagcca 180
cccgccaagt gtcacgtggt caccaagtcc agagacttcg ccgagtggcc catcatgaag 240
gcctccgaag ccatacaaag gtacatctac gctctctcca ctaccgagag ggcagcctgc 300
aagccaggaa agtcctctga gagtcacgca gcctggtttg cagccaccgg agtgtccaac 360
cacggctact cccacgtgca gggactgaac ctgatctttg accacacact gggtaggtac 420
gatggtgtcc tgaagaaagt ccagttgagg aacgagaagg ccagggccag gctggagtcc 480
atcaacgcct ccagagctga tgaggggctg cccgaaatca aggccgaaga ggaagaggtg 540
gccaccaatg agaccggcca ccttctgcag ccacccggaa tcaacccctc cttttacgtt 600
taccagacca tctctcccca ggcctataga cccagagatg agatagtcct gccccctgag 660
tacgcagggt acgtcagaga ccccaacgcc cccattcccc tcggcgtcgt gaggaatagg 720
tgtgacatcc agaaggggtg tccaggctac atccccgaat ggcagaggga ggccggaacc 780
gctatctctc ccaaaactgg gaaagccgtg accgtgccag gcctctcccc caagaaaaac 840
aagaggatga gaaggtattg gagaagtgaa aaggagaagg cccaagacgc ccttctggtt 900
accgttagga tcggaaccga ctgggttgtg atcgacgtca gggggctctt gaggaacgcc 960
aggtggagga ccatcgcccc caaggacata tccctcaacg ccctcctgga cctgttcacc 1020
ggtgatcccg tgatcgacgt gaggagaaac atcgtgacat tcacatatac ccttgacgcc 1080
tgcggcacct acgcaagaaa gtggactctg aaggggaaac agaccaaggc cacattggat 1140
aagctgaccg ccacacagac agttgccctt gtggccatcg accttggcca gactaacccc 1200
atcagtgcag gaatcagtag ggtgacccag gagaatggcg ccctgcaatg tgagcccctc 1260
gatagattca ccctccccga tgaccttctg aaagacatct ccgcctatag gatcgcctgg 1320
gacagaaacg aagaggaact gagggccagg tccgtggagg ccctgccaga agcccaacag 1380
gccgaagtga gggccctgga cggcgtgtcc aaggagaccg ccaggaccca gctttgcgca 1440
gatttcgggc tcgaccccaa gaggctgccc tgggacaaaa tgtcctctaa tacaaccttc 1500
atctcagagg ccctcctgtc caactccgtt tccagggacc aggttttttt cactcctgcc 1560
cccaaaaagg gggccaagaa aaaggccccc gtcgaggtga tgaggaaaga caggacctgg 1620
gctagagcct ataagcccag gctctcagtg gaagcccaaa agctgaagaa cgaggccctc 1680
tgggccctga agagaacctc ccccgaatac ctcaagttgt ccagaaggaa ggaagagctt 1740
tgtagaaggt ccattaatta cgttatcgag aaaacaagga gaaggaccca atgccagatt 1800
gtcatccccg tcatcgagga tcttaacgtt agatttttcc acgggtccgg taagaggttg 1860
cccgggtggg acaatttctt taccgccaaa aaggagaata ggtggttcat ccaaggattg 1920
cacaaggcct tttccgactt gaggacccac aggtcctttt acgttttcga agtcaggccc 1980
gagaggacct ccattacctg ccccaaatgc ggacattgcg aggtcgggaa tagggacggg 2040
gaagcattcc agtgcttgag ttgcgggaag acctgtaacg ccgaccttga cgtcgccacc 2100
cacaacctca ctcaggtggc cctgacaggc aagactatgc ctaagaggga agagcccagg 2160
gacgcccagg gaaccgcccc tgccaggaaa accaaaaagg cctccaagtc caaggctcca 2220
cccgccgaga gggaagacca gactcctgcc caggaaccct cccagacctc c 2271
<210> 140
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 140
atgcccaagc ccgccgtcga gagcgagttc agcaaggtcc tcaagaagca cttccccgga 60
gagaggttta ggtccagcta catgaagaga ggcgggaaga tccttgccgc ccagggcgag 120
gaagccgtcg tcgcctacct ccaggggaag agtgaggagg agccccctaa cttccagcca 180
cccgccaagt gccatgtggt caccaagtcc agagacttcg ccgaatggcc catcatgaag 240
gcctcagaag caatccagag atacatctac gccctctcca ctaccgagag ggccgcctgt 300
aagcccggga agagtagcga gagccatgcc gcctggttcg ccgccaccgg agtctccaac 360
catggatact cccacgttca gggcctcaac ctcatctttg accacaccct cggaaggtac 420
gacggtgttc tcaagaaggt ccagctcagg aacgaaaaag ccagggccag gttggaatcc 480
atcaacgcta gcagggccga cgagggcctc cccgagataa aagccgagga ggaagaggtc 540
gccaccaacg agaccgggca cctcctccag ccacccggta taaaccctag cttctacgtt 600
tatcagacta tcagccccca agcctatagg cccagggacg agatcgtgct cccccccgag 660
tacgccggat acgtcaggga ccccaacgcc cccatcccac tcggggtggt caggaacagg 720
tgcgacatcc agaaggggtg cccaggatac atccccgagt ggcaaaggga agccgggaca 780
gccatcagcc ccaagaccgg aaaagccgtc accgttcctg ggttgtcccc caagaagaac 840
aagaggatga ggaggtactg gaggtcagag aaggagaagg cccaagacgc actcctcgtc 900
accgttagga tcgggaccga ctgggtggtt atcgatgtga gaggcctttt gagaaacgcc 960
aggtggagaa ctatcgcccc caaggatatt tccctcaatg cactccttga cctcttcacc 1020
ggggaccccg tcattgacgt cagaaggaac atcgtcacct tcacctacac tttggacgct 1080
tgtggtacct acgctagaaa gtggaccctc aagggcaagc agaccaaggc caccttggac 1140
aagctcactg ccacccagac cgtcgccctc gtggccatcg atctcggaca gaccaacccc 1200
atcagcgccg gtatctcaag ggtcacccag gaaaacgggg ctctccagtg cgagccactt 1260
gacaggttca ccctccccga cgatttgctc aaagacatca gcgcctacag gatcgcctgg 1320
gacagaaacg aagaggagct cagggccagg tccgtcgagg ctctccccga ggcccagcag 1380
gctgaggtta gagccctcga tggggtgtcc aaggaaaccg ccagaaccca gctctgcgcc 1440
gacttcgggc tcgaccccaa gaggttgccc tgggataaga tgtcctccaa caccactttc 1500
atctctgagg ccctcctcag caacagtgtg agcagggacc aggttttctt tacccccgcc 1560
cccaagaagg gggccaagaa gaaggctcca gttgaggtga tgaggaagga caggacttgg 1620
gccagggcat ataagcccag gctcagcgtt gaagcacaaa agctcaagaa tgaggccctt 1680
tgggccctca agaggacctc ccccgagtac ctcaagctca gcagaaggaa ggaggagctc 1740
tgtaggaggt ccatcaacta cgtcatcgaa aagactagga ggaggaccca atgtcagatc 1800
gtcatccccg tgatcgagga cctcaacgtc agattcttcc acggtagtgg caagagactc 1860
cccgggtggg acaatttttt caccgccaaa aaagaaaata ggtggttcat ccagggcttg 1920
cataaggcat tctccgacct caggacccat agatccttct acgtgttcga ggttaggcct 1980
gagaggactt caatcacatg ccccaagtgc ggacactgcg aggttgggaa cagagacggg 2040
gaagccttcc agtgcctcag ctgcggtaaa acctgcaacg ccgatctcga cgtcgccacc 2100
cataacctca cccaggtggc tctcactggc aagactatgc ccaagaggga ggagcccagg 2160
gacgcccaag gcactgcccc cgccaggaag accaagaagg cctctaagag caaagccccc 2220
cccgccgaga gagaggacca gacccccgcc caggagccca gccaaacctc c 2271
<210> 141
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 141
atgcccaagc cagctgtcga gagcgagttc tcaaaggttc ttaaaaagca ctttcctgga 60
gagagattca ggtctagcta catgaagaga ggcggcaaga tactcgccgc ccagggtgaa 120
gaagccgtgg tcgcctacct ccagggtaag tccgaggagg aaccccccaa cttccagccc 180
ccagccaaat gtcacgtcgt caccaaaagt agggactttg ccgagtggcc catcatgaag 240
gcctccgaag ccatccagag gtacatctat gctctcagca ccaccgagag ggcagcttgc 300
aagcccggta agagcagcga gtcccacgcc gcctggttcg ccgccaccgg cgtcagcaac 360
cacgggtact cccatgtcca aggactcaat cttatcttcg atcacaccct cggcaggtac 420
gacggggtcc tcaagaaggt ccagctcaga aacgagaaag ccagggctag gctcgagtcc 480
atcaacgcat caagagccga tgaggggttg cccgagataa aggccgagga ggaggaggtc 540
gccacaaacg agaccggcca tctcttgcag ccccccggaa tcaacccctc cttctacgtg 600
taccaaacca tctcccccca ggcctacagg ccaagggacg agatcgtgtt gccccccgaa 660
tatgctgggt acgtgaggga tcccaacgcc cccatcccct tgggggtggt gaggaacagg 720
tgcgacatcc aaaaaggctg ccccggatac atccccgagt ggcaaaggga ggccgggacc 780
gccatcagcc ccaagaccgg gaaggcagtc accgtgcccg ggttgtctcc caaaaagaac 840
aagaggatga ggagatattg gaggtccgag aaagaaaagg cccaggatgc actcctcgtg 900
accgtcagga tcgggacaga ctgggtggtt attgatgtga gagggctcct taggaacgcc 960
aggtggagga ccattgcccc caaagacatc tctctcaacg cactcctcga cctttttacc 1020
ggcgatccag ttatcgacgt gaggagaaac atagtgacct tcacctacac actcgatgca 1080
tgcggtacct atgctagaaa gtggaccctc aagggaaagc agaccaaagc caccctcgac 1140
aagctcaccg ccacacaaac cgtggccctc gtggccatcg acctcgggca gaccaaccca 1200
atctctgccg gaatctccag ggtgacccaa gagaacggag cactccagtg cgagcccctc 1260
gacagattta ccctcccaga tgatcttctc aaagacatca gcgcctacag gatcgcttgg 1320
gacaggaacg aggaggagct cagggccagg agcgttgagg ccctccccga ggcacagcag 1380
gccgaggtga gggccctcga cggcgtcagc aaggagaccg ccaggaccca attgtgcgcc 1440
gacttcggcc tcgacccaaa gagactcccc tgggacaaga tgagctccaa taccaccttc 1500
atctctgagg ccctcttgtc taactccgtg agcagagacc aggtcttctt caccccagca 1560
cccaagaagg gagccaagaa gaaggcaccc gtcgaagtca tgaggaagga caggacctgg 1620
gccagggcat acaagcccag actcagcgtt gaggcccaaa agcttaagaa cgaagccctc 1680
tgggccctca agagaacctc cccagaatac ctcaagctca gcagaaggaa ggaggagctc 1740
tgcagaagat caatcaacta tgttatcgag aagaccagaa ggaggaccca atgccaaatc 1800
gtcattcctg tgattgagga cctcaacgtg aggtttttcc atggctccgg caagagactc 1860
ccggggtggg ataacttctt taccgccaag aaggaaaaca ggtggttcat acagggtctc 1920
cataaagcct tctctgattt gaggacccac agaagctttt acgtgttcga ggtgagaccc 1980
gagaggacca gcatcacctg ccccaagtgc gggcactgcg aggtcgggaa cagggatggc 2040
gaggccttcc agtgcctcag ttgcggcaag acctgcaacg ccgacctcga tgttgctacc 2100
cacaatctca cccaggtcgc ccttaccggg aagaccatgc ccaagaggga ggaacccagg 2160
gacgcccaag gtaccgcccc cgccaggaag accaaaaagg cctccaagag caaagccccc 2220
cctgccgaaa gagaggacca aacccccgcc caggagccca gccagaccag t 2271
<210> 142
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 142
atgcccaaac ccgccgtcga aagcgagttc tcaaaggtcc tcaaaaagca cttccccgga 60
gagaggttta ggagcagcta catgaaaagg ggcggcaaaa tcctcgccgc ccagggagag 120
gaggctgtcg tggcatacct ccaggggaag agtgaggagg agccccctaa cttccagccc 180
cccgcaaaat gtcacgtcgt gaccaaatcc agggacttcg ccgaatggcc tatcatgaag 240
gccagcgaag ccatccagag gtatatctac gccctttcaa ccaccgagag ggcagcttgc 300
aaaccaggta agtcctccga gtcccacgcc gcctggttcg ccgccaccgg tgtctccaac 360
catgggtact cacacgtgca ggggctcaac cttatcttcg accataccct cgggaggtac 420
gacggcgtct tgaagaaagt ccagctcagg aacgagaagg ccagagccag gctcgagtcc 480
ataaacgcat ccagagctga cgaggggctc cccgagatca aggccgaaga ggaggaggtt 540
gccaccaacg agactgggca cctcctccag ccacccggga taaacccctc tttctacgtg 600
taccaaacta tctcccccca ggcctatagg cccagagacg agatcgtgct cccacccgaa 660
tacgccgggt acgtgaggga tcccaatgcc cccatccccc tcggagttgt gagaaacaga 720
tgcgacatcc agaaggggtg ccccgggtac attcccgaat ggcaaaggga agccggcacc 780
gccatatccc ccaagaccgg caaggctgtc accgtgcctg ggctctcccc caagaaaaac 840
aagagaatga gaaggtattg gagatccgag aaggaaaaag cacaggacgc cctccttgtg 900
accgtcagga tcggcaccga ctgggttgtt atcgatgtca ggggcctcct cagaaacgcc 960
aggtggagga ccattgcccc caaggacatc agcctcaacg ccctcctcga tctctttacc 1020
ggggaccccg ttatcgacgt caggaggaac attgtgacat tcacctatac cctcgacgcc 1080
tgtgggactt acgccagaaa atggaccctc aaggggaagc aaaccaaggc caccctcgac 1140
aagttgaccg caacacaaac agtcgccctc gtggccatcg acctcggtca gacaaacccc 1200
atctccgccg gcatctccag ggtgacccag gagaacgggg ccctccagtg cgagcctctc 1260
gataggttta ccctcccaga cgacctcctc aaggacatca gcgcctacag gatcgcctgg 1320
gataggaatg aggaagagct cagggccagg tccgtggagg ccctccccga ggcccaacag 1380
gcagaggtga gggccctcga cggggtgtcc aaggagaccg ccaggactca gctctgtgcc 1440
gacttcggct tggatccaaa gaggctcccc tgggacaaga tgagcagcaa caccaccttt 1500
atctccgagg ccctcctctc caactccgtc tcaagggacc aggtgttttt cacccccgcc 1560
cccaagaagg gtgctaagaa aaaggccccc gtggaggtta tgaggaaaga cagaacttgg 1620
gccagggcct acaagcccag gttgtccgtt gaggcccaaa agctcaagaa cgaggccttg 1680
tgggccctca agagaacttc acccgaatac cttaagctta gtaggaggaa ggaggagttg 1740
tgtaggaggt ccatcaacta cgtgattgag aaaaccagga ggaggactca gtgtcagatc 1800
gtgatccccg tgatcgagga cttgaacgtt agattctttc acgggagcgg taagaggttg 1860
cccgggtggg acaatttctt caccgcaaag aaagagaata ggtggttcat ccaaggcctc 1920
cacaaggctt tcagcgacct tagaacacat agaagcttct acgtctttga agttaggcct 1980
gagaggacca gtatcacctg ccccaagtgc ggccactgcg aggtggggaa cagagacggg 2040
gaagcattcc aatgcctttc ctgcggcaag acctgcaacg ccgacctcga cgtcgctact 2100
cacaacctca cccaggtggc cctcaccggc aagactatgc ccaaaaggga ggagcccagg 2160
gatgctcagg gtaccgcccc cgccaggaag accaagaagg catccaagag caaggctccc 2220
ccagccgaga gggaagacca gacccctgcc caggagccct cccagacctc c 2271
<210> 143
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 143
atgccaaaac ccgccgtcga atccgagttc agcaaggtcc tcaagaagca tttccccggg 60
gagaggttca ggtcctccta tatgaagagg ggaggcaaga tcctcgctgc tcagggcgag 120
gaagccgttg tggcctactt gcaggggaag agcgaggaag agccccccaa cttccagccc 180
cctgccaagt gccacgttgt gacaaagtcc agggacttcg ccgagtggcc catcatgaaa 240
gccagcgagg ccatccagag gtatatctac gccctcagca ctactgagag agctgcctgc 300
aagcccggga agtctagcga atcccacgcc gcctggtttg ccgccaccgg cgtgagtaac 360
cacggttaca gccatgtgca gggcctcaac cttatctttg atcacaccct cgggaggtac 420
gatggcgttc ttaagaaggt gcagctcaga aatgagaagg caagggccag gcttgagagc 480
attaacgcct ccagggccga cgaggggctt cccgagatca aggctgagga ggaggaggtt 540
gcaaccaacg agaccggcca cctcctccaa ccacccggga tcaatccctc cttctacgtt 600
taccagacaa tatcccctca ggcctacaga cccagggacg aaattgtcct ccccccagaa 660
tacgcagggt acgtgaggga ccctaacgcc cccatccccc ttggcgtcgt taggaacagg 720
tgtgacatcc aaaaggggtg ccccggctac atccccgagt ggcaaagaga ggccggtacc 780
gccatctctc caaagaccgg caaggccgtc accgtccccg gcctctcccc caaaaagaac 840
aagaggatga ggagatactg gaggtcagaa aaggagaagg cccaagacgc cctcctcgtg 900
actgtgagaa tcggcaccga ctgggtcgtt attgatgtga gggggcttct caggaacgcc 960
aggtggagga ccatcgcccc aaaagacatc agcctcaatg ctctcctcga cttgttcacc 1020
ggcgaccccg tcatcgatgt caggaggaat attgtgacct tcacatacac cctcgacgcc 1080
tgcgggacct acgcacgtaa gtggactctc aagggcaagc agaccaaggc cactctcgac 1140
aaactcaccg ccacccagac tgtggccctc gtggccatcg accttgggca gaccaatccc 1200
ataagcgccg ggatcagcag agtcacccag gagaatgggg ccctccaatg cgagcccctc 1260
gacaggttca ccctccccga cgatctcctt aaggacattt ctgcctacag gatcgcctgg 1320
gacaggaacg aagaagaact cagggccaga tccgtcgagg cactccccga ggcccagcag 1380
gccgaggtca gggccttgga cggcgtgtca aaagaaaccg ccaggactca gctctgcgcc 1440
gactttgggc tcgaccccaa aaggttgccc tgggacaaga tgtccagcaa caccaccttc 1500
atctccgagg ctttgctctc caattctgtg agcagggacc aggtgttctt tacacccgcc 1560
cccaagaagg gtgccaagaa gaaggcccct gtcgaggtca tgaggaagga caggacctgg 1620
gccagagcct acaagcccag gttgtccgtg gaggctcaga agttgaaaaa cgaggccctc 1680
tgggcactca agaggacaag cccagagtac ctcaagctct ccaggaggaa ggaagagctc 1740
tgcaggaggt ccatcaacta cgttatcgag aaaacaagga ggaggaccca gtgccagatc 1800
gtgattcccg tcatcgagga cctcaacgtc aggttcttcc acgggagcgg caagaggctt 1860
cccgggtggg acaacttctt caccgctaag aaggagaaca ggtggttcat ccagggcctt 1920
cacaaggcct tctccgatct caggacccac aggtccttct atgtgttcga ggtcaggccc 1980
gaaaggacct ccataacctg ccccaagtgc gggcactgcg aggtgggtaa cagggacggt 2040
gaggcctttc agtgcttgag ttgcggcaaa acctgcaacg ccgacctcga cgtcgctacc 2100
cataacctca cccaagttgc cctcaccgga aagaccatgc ccaaaagaga ggagccaagg 2160
gacgcccagg gcaccgcccc cgccagaaag accaagaagg cctctaagag caaggccccc 2220
cccgccgaga gggaggacca gacccccgcc caggaaccca gccagacatc a 2271
<210> 144
<211> 766
<212> PRT
<213> Unknown
<220>
<223> Bacteriophage
<400> 144
Met Glu Lys Glu Ile Thr Glu Leu Thr Lys Ile Arg Arg Glu Phe Pro
1 5 10 15
Asn Lys Lys Phe Ser Ser Thr Asp Met Lys Lys Ala Gly Lys Leu Leu
20 25 30
Lys Ala Glu Gly Pro Asp Ala Val Arg Asp Phe Leu Asn Ser Cys Gln
35 40 45
Glu Ile Ile Gly Asp Phe Lys Pro Pro Val Lys Thr Asn Ile Val Ser
50 55 60
Ile Ser Arg Pro Phe Glu Glu Trp Pro Val Ser Met Val Gly Arg Ala
65 70 75 80
Ile Gln Glu Tyr Tyr Phe Ser Leu Thr Lys Glu Glu Leu Glu Ser Val
85 90 95
His Pro Gly Thr Ser Ser Glu Asp His Lys Ser Phe Phe Asn Ile Thr
100 105 110
Gly Leu Ser Asn Tyr Asn Tyr Thr Ser Val Gln Gly Leu Asn Leu Ile
115 120 125
Phe Lys Asn Ala Lys Ala Ile Tyr Asp Gly Thr Leu Val Lys Ala Asn
130 135 140
Asn Lys Asn Lys Lys Leu Glu Lys Lys Phe Asn Glu Ile Asn His Lys
145 150 155 160
Arg Ser Leu Glu Gly Leu Pro Ile Ile Thr Pro Asp Phe Glu Glu Pro
165 170 175
Phe Asp Glu Asn Gly His Leu Asn Asn Pro Pro Gly Ile Asn Arg Asn
180 185 190
Ile Tyr Gly Tyr Gln Gly Cys Ala Ala Lys Val Phe Val Pro Ser Lys
195 200 205
His Lys Met Val Ser Leu Pro Lys Glu Tyr Glu Gly Tyr Asn Arg Asp
210 215 220
Pro Asn Leu Ser Leu Ala Gly Phe Arg Asn Arg Leu Glu Ile Pro Glu
225 230 235 240
Gly Glu Pro Gly His Val Pro Trp Phe Gln Arg Met Asp Ile Pro Glu
245 250 255
Gly Gln Ile Gly His Val Asn Lys Ile Gln Arg Phe Asn Phe Val His
260 265 270
Gly Lys Asn Ser Gly Lys Val Lys Phe Ser Asp Lys Thr Gly Arg Val
275 280 285
Lys Arg Tyr His His Ser Lys Tyr Lys Asp Ala Thr Lys Pro Tyr Lys
290 295 300
Phe Leu Glu Glu Ser Lys Lys Val Ser Ala Leu Asp Ser Ile Leu Ala
305 310 315 320
Ile Ile Thr Ile Gly Asp Asp Trp Val Val Phe Asp Ile Arg Gly Leu
325 330 335
Tyr Arg Asn Val Phe Tyr Arg Glu Leu Ala Gln Lys Gly Leu Thr Ala
340 345 350
Val Gln Leu Leu Asp Leu Phe Thr Gly Asp Pro Val Ile Asp Pro Lys
355 360 365
Lys Gly Val Val Thr Phe Ser Tyr Lys Glu Gly Val Val Pro Val Phe
370 375 380
Ser Gln Lys Ile Val Pro Arg Phe Lys Ser Arg Asp Thr Leu Glu Lys
385 390 395 400
Leu Thr Ser Gln Gly Pro Val Ala Leu Leu Ser Val Asp Leu Gly Gln
405 410 415
Asn Glu Pro Val Ala Ala Arg Val Cys Ser Leu Lys Asn Ile Asn Asp
420 425 430
Lys Ile Thr Leu Asp Asn Ser Cys Arg Ile Ser Phe Leu Asp Asp Tyr
435 440 445
Lys Lys Gln Ile Lys Asp Tyr Arg Asp Ser Leu Asp Glu Leu Glu Ile
450 455 460
Lys Ile Arg Leu Glu Ala Ile Asn Ser Leu Glu Thr Asn Gln Gln Val
465 470 475 480
Glu Ile Arg Asp Leu Asp Val Phe Ser Ala Asp Arg Ala Lys Ala Asn
485 490 495
Thr Val Asp Met Phe Asp Ile Asp Pro Asn Leu Ile Ser Trp Asp Ser
500 505 510
Met Ser Asp Ala Arg Val Ser Thr Gln Ile Ser Asp Leu Tyr Leu Lys
515 520 525
Asn Gly Gly Asp Glu Ser Arg Val Tyr Phe Glu Ile Asn Asn Lys Arg
530 535 540
Ile Lys Arg Ser Asp Tyr Asn Ile Ser Gln Leu Val Arg Pro Lys Leu
545 550 555 560
Ser Asp Ser Thr Arg Lys Asn Leu Asn Asp Ser Ile Trp Lys Leu Lys
565 570 575
Arg Thr Ser Glu Glu Tyr Leu Lys Leu Ser Lys Arg Lys Leu Glu Leu
580 585 590
Ser Arg Ala Val Val Asn Tyr Thr Ile Arg Gln Ser Lys Leu Leu Ser
595 600 605
Gly Ile Asn Asp Ile Val Ile Ile Leu Glu Asp Leu Asp Val Lys Lys
610 615 620
Lys Phe Asn Gly Arg Gly Ile Arg Asp Ile Gly Trp Asp Asn Phe Phe
625 630 635 640
Ser Ser Arg Lys Glu Asn Arg Trp Phe Ile Pro Ala Phe His Lys Ala
645 650 655
Phe Ser Glu Leu Ser Ser Asn Arg Gly Leu Cys Val Ile Glu Val Asn
660 665 670
Pro Ala Trp Thr Ser Ala Thr Cys Pro Asp Cys Gly Phe Cys Ser Lys
675 680 685
Glu Asn Arg Asp Gly Ile Asn Phe Thr Cys Arg Lys Cys Gly Val Ser
690 695 700
Tyr His Ala Asp Ile Asp Val Ala Thr Leu Asn Ile Ala Arg Val Ala
705 710 715 720
Val Leu Gly Lys Pro Met Ser Gly Pro Ala Asp Arg Glu Arg Leu Gly
725 730 735
Asp Thr Lys Lys Pro Arg Val Ala Arg Ser Arg Lys Thr Met Lys Arg
740 745 750
Lys Asp Ile Ser Asn Ser Thr Val Glu Ala Met Val Thr Ala
755 760 765
<210> 145
<211> 2298
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 145
atggaaaagg agatcactga gttaaccaag attagacgag aattccctaa taagaaattc 60
agctcaaccg atatgaagaa ggccggaaaa ttgttaaagg cagagggtcc agatgctgtc 120
agagattttt taaattcgtg tcaggagatt attggagact tcaagcctcc cgtgaagact 180
aacatcgttt ccatctcgcg gcccttcgag gagtggccag tgtcgatggt agggagggct 240
attcaagagt attacttctc tttgactaag gaagaacttg aatcagtgca tccgggcacg 300
tcatctgagg atcataaatc attctttaat ataactggac tctccaatta taactacacc 360
tccgttcagg gacttaatct catctttaaa aatgcaaaag ctatttatga tggaactttg 420
gttaaggcca acaacaagaa taagaaatta gaaaagaaat ttaatgagat caatcataag 480
cgctcactcg aaggactacc aataatcaca cctgacttcg aggagccctt cgacgaaaac 540
gggcacctca ataatcctcc aggaattaac agaaatattt atggatatca gggctgcgcc 600
gctaaagttt ttgtgccatc aaaacacaag atggtatcgc tgccaaagga gtacgaaggt 660
tataacagag acccaaatct gagtctcgca ggtttcagga atcgtcttga aatcccggag 720
ggagaaccag gtcatgtccc atggttccaa cgaatggaca tacctgaggg gcaaattgga 780
cacgtaaaca agattcagcg atttaatttt gttcacggca agaattcagg taaggtgaaa 840
ttcagcgaca agacagggcg cgtgaagaga tatcaccata gcaagtataa ggacgcaact 900
aaaccttata aattcttgga ggaatctaag aaggtgtcgg ctctcgatag catattagcg 960
atcattacaa tcggcgatga ttgggttgtg tttgatatac gaggactcta tagaaacgtc 1020
ttttatagag agcttgctca aaaaggtttg actgccgtcc aattgctgga tcttttcacc 1080
ggtgatcccg ttattgaccc aaagaaggga gtcgtaactt tctcctataa agagggggtt 1140
gtgcctgtct tttcccagaa aatcgttcct cgttttaaat cacgagacac cttagagaag 1200
ctcacgtcac aggggcctgt tgcgcttttg agcgttgatc ttggacagaa tgagcccgtc 1260
gcagccaggg tgtgttctct taagaatatt aacgacaaga ttaccctaga taacagttgc 1320
cgtatatctt ttcttgatga ttacaaaaag caaataaaag actaccgtga ttccctcgat 1380
gaattggaaa ttaaaatccg ccttgaagcg attaatagtt tggagactaa ccagcaggtt 1440
gagattaggg atctggacgt attttctgct gatcgggcca aagcaaacac agtggatatg 1500
ttcgatatcg acccaaatct aatctcctgg gacagcatga gtgatgccag agtttcaacc 1560
caaataagtg acctgtatct taagaatgga ggggatgaat caagggtgta cttcgagatt 1620
aataataaaa ggattaagag atcagactat aacattagcc agttggtgcg cccaaagctt 1680
tcagatagta caaggaagaa cctaaatgat tctatctgga aactcaagcg tacttctgag 1740
gaatatctta aactatcaaa gcgtaaactt gagctcagca gagctgtggt taactacacc 1800
attagacaaa gtaagttact ttctggtatc aatgatatcg ttatcatcct ggaagatttg 1860
gacgttaaga agaagtttaa cggacgtggt attcgcgaca tcgggtggga caatttcttc 1920
tctagccgaa aggaaaaccg ctggtttatc cctgctttcc ataaggcttt ttccgaactt 1980
tcatccaaca gagggctttg tgtgatagaa gtaaaccctg catggaccag tgcaacctgc 2040
ccagactgcg gcttctgcag taaagaaaac agagacggta ttaattttac atgcagaaag 2100
tgcggtgtca gttatcatgc tgacattgac gtggcaaccc taaatattgc aagggtagca 2160
gtcttgggaa agcctatgtc aggtcccgct gatcgcgagc gtctgggaga tactaagaaa 2220
ccaagagttg ctcggtctag gaaaacaatg aagcggaagg atataagcaa ttcaactgtg 2280
gaggccatgg tcacagcg 2298
<210> 146
<211> 2298
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 146
atggagaagg aaataaccga actcacaaaa atcaggaggg aattccccaa caaaaagttc 60
tcctccaccg acatgaagaa ggccggcaag ttgctcaagg ccgagggccc tgacgccgtg 120
agggatttcc tcaattcctg ccaagagatc atcggagact ttaaaccccc cgtcaagact 180
aacatcgtca gcataagcag gccattcgag gagtggccag tctccatggt gggcagagcc 240
atccaggagt actacttcag cctcaccaag gaggagctcg agtccgtgca ccccgggact 300
agcagtgaag accataagag cttcttcaac attaccgggc tctccaacta caattacacc 360
tccgttcagg gtttgaacct catcttcaag aacgccaaag ccatctatga cggcactctc 420
gttaaagcca acaacaagaa caagaagctc gaaaagaagt tcaatgagat taaccacaag 480
aggtccctcg agggcctccc catcatcacc cccgacttcg aggagccctt cgatgaaaat 540
gggcacttga acaacccccc cggtattaac aggaacatct atgggtatca gggctgcgcc 600
gccaaagtct tcgtgccatc caagcataag atggttagcc tcccaaagga gtacgagggg 660
tataacaggg accccaactt gtccctcgcc gggttcagga acagactcga gatcccagag 720
ggagagcctg gccacgtgcc ttggttccag aggatggata tacccgaggg acagattggc 780
cacgtgaaca agatccagag gttcaatttc gtccacggga agaattccgg aaaagtgaaa 840
ttcagcgata aaaccgggag ggtcaagagg taccaccaca gtaagtacaa ggatgcaacc 900
aagccctaca agtttctcga ggagagtaag aaagtctccg ctctcgacag catcctcgcc 960
atcattacaa ttggggacga ctgggtggtc ttcgacatca gagggctcta caggaacgtc 1020
ttttacaggg agctcgccca gaaggggctc accgccgtgc agctcctcga cctcttcacc 1080
ggggaccccg tgatcgaccc caaaaaagga gtcgttacct tctcctataa ggagggagtg 1140
gtccccgttt ttagccagaa gatcgttccc aggtttaagt ccagggacac cctcgagaag 1200
cttaccagcc agggccccgt cgcccttttg tctgtcgacc ttggacaaaa cgagcccgtg 1260
gccgcaagag tctgttccct caagaacatc aacgataaga tcacccttga taattcatgt 1320
agaatcagct tcctcgatga ctacaaaaag cagatcaagg actatagaga ctccctcgac 1380
gagctcgaga tcaagattag gctcgaagcc atcaacagcc ttgagaccaa ccaacaagtc 1440
gaaatcagag atttggacgt tttctcagca gatagagcca aggccaacac cgtcgacatg 1500
ttcgacatcg acccaaacct catctcctgg gactccatgt ctgacgccag ggtctccaca 1560
cagatctccg atctttactt gaaaaatggc ggtgacgaat ccagagttta ctttgagatc 1620
aataacaaga ggatcaaaag gagcgactac aacatatccc agctcgtcag gccaaagctc 1680
tccgacagca ccaggaagaa cctcaacgat tccatctgga aactcaagag aaccagcgag 1740
gagtacctca agctctcaaa gagaaagctc gaactcagca gggccgtcgt caactacacc 1800
atcaggcagt ccaagctctt gtccgggata aacgacatcg tcatcatatt ggaggatctc 1860
gacgttaaaa agaagttcaa cggcagaggc atcagggaca tcggttggga caatttcttc 1920
tctagtagga aggaaaacag gtggttcatc cccgccttcc acaaagcctt tagcgagctc 1980
agctccaata ggggcctttg cgttatcgaa gtcaaccccg cctggacatc tgccacttgc 2040
ccagactgtg gcttctgtag caaggaaaac agagacggga tcaactttac ctgtaggaag 2100
tgcggggtct cctaccatgc cgacatcgac gttgccactc ttaacatcgc cagggtcgcc 2160
gtcctcggca agcccatgtc cggccccgcc gatagggaga ggttgggcga caccaagaaa 2220
cccagggtgg ccagaagcag gaagaccatg aagaggaaag acatcagcaa ctctaccgtg 2280
gaggccatgg ttaccgcc 2298
<210> 147
<211> 2298
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 147
atggagaagg agatcaccga gctcacaaaa atcaggaggg agttccccaa caagaagttc 60
tcaagcactg acatgaagaa ggccggcaag ctcctcaagg ccgagggccc cgacgccgtc 120
agggacttcc tcaactcctg tcaggaaatc ataggagact tcaagccccc cgtgaaaacc 180
aacatcgttt ccatcagcag gcccttcgaa gagtggcccg tcagtatggt cgggagagcc 240
atccaagaat actatttcag cctcaccaag gaagaactcg agtccgtcca tcccggaacc 300
tcctctgagg accacaaaag cttctttaat atcaccggcc tcagcaatta caactacacc 360
agcgtgcagg gcttgaatct catcttcaaa aacgccaaag ccatctacga cggtactctc 420
gttaaggcca acaacaaaaa caagaagttg gagaagaagt tcaacgaaat caaccacaag 480
aggtccctcg agggcctccc tataatcacc cccgacttcg aggaaccttt cgacgagaac 540
gggcacctca acaacccccc cggcattaat aggaacatct acggctacca gggctgcgct 600
gccaaagttt tcgtgcccag caagcacaag atggtgagcc tccccaagga atacgaaggt 660
tacaacaggg accctaactt gtcccttgcc ggcttcagga acaggctcga gatccccgag 720
ggggaacccg gacacgtgcc ctggtttcag aggatggaca tcccagaggg ccagatcggc 780
cacgtcaata aaatacagag gttcaatttt gtccacggca agaacagcgg gaaggtcaag 840
ttttccgaca agaccggcag ggttaaaagg taccaccaca gcaagtacaa ggacgctacc 900
aagccctaca agtttctcga ggagagcaaa aaggtgagtg ccctcgactc aatcctcgcc 960
atcatcacca tcggcgacga ttgggtggtt tttgacatca ggggtctcta taggaacgtg 1020
ttctataggg agctcgccca gaaggggctc accgctgtgc agctcttgga cctcttcact 1080
ggggaccccg tcattgaccc caagaaaggg gttgtgacat tctcctacaa agaaggcgtg 1140
gtgcccgtgt tcagccagaa gattgtcccc agattcaaat ccagggatac cttggagaag 1200
ctcaccagcc agggccctgt ggcccttttg tccgtggacc tcggccagaa cgaacccgtc 1260
gccgccaggg tctgttcact caagaacatc aacgataaga tcaccctcga caactcctgc 1320
aggatcagtt tccttgacga ttataagaag cagatcaagg attacaggga ctccctcgat 1380
gaattggaga tcaagatcag gttggaggcc atcaacagcc tcgagaccaa tcagcaagtt 1440
gaaatcaggg acctcgacgt gttctcagct gacagggcta aggctaacac tgtcgacatg 1500
ttcgacatcg accccaacct catatcctgg gattccatga gcgacgcaag agtgtccacc 1560
cagatctccg acctctacct caagaacggc ggagacgagt ccagggtcta tttcgagatc 1620
aacaacaaga ggatcaagag gtccgactac aacatcagcc aacttgtcag gcccaaattg 1680
tctgactcta ccaggaagaa cctcaacgac tccatctgga aactcaagag gacctccgaa 1740
gagtacctca aactcagcaa gaggaagctc gagctctcca gggccgttgt gaattacacc 1800
atcaggcagt ccaagctcct ttccggtatc aacgacatcg tgatcatcct cgaagacctc 1860
gacgtgaaga agaagtttaa tggcagaggt attagggaca tcgggtggga caatttcttc 1920
agctccagaa aggaaaacag gtggttcatc cccgcctttc acaaggcctt tagcgaactc 1980
agctccaaca gagggctctg cgtcatagag gtgaacccag cctggacctc agccacttgc 2040
cccgattgtg gcttttgctc taaagagaac agggacggca tcaacttcac ctgcaggaaa 2100
tgtggcgtgt cctatcacgc cgacatcgac gttgccaccc tcaatatcgc cagagtggct 2160
gtcctcggaa agcccatgtc cgggcccgct gatagggaga gacttgggga caccaagaag 2220
ccaagggtcg ccaggtccag gaaaaccatg aaaaggaagg acatctccaa ttccactgtg 2280
gaggccatgg tgaccgct 2298
<210> 148
<211> 2298
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 148
atggagaagg aaatcaccga gctcactaaa attaggaggg agttccccaa caaaaagttt 60
tcttccaccg atatgaagaa ggccggtaag ctcctcaagg ccgagggccc agacgccgtg 120
agggacttcc ttaactcctg ccaggagatc attggcgact tcaagccccc cgttaaaacc 180
aacatagtgt ctatctccag gcccttcgaa gagtggcccg tgagcatggt tggaagggcc 240
attcaggagt actatttctc actcaccaag gaggagctcg agagcgttca ccctgggacc 300
tcctccgaag accataaaag ctttttcaac atcaccggat tgtccaacta caactatacc 360
tccgtccagg gtctcaatct tatctttaag aatgccaaag ccatttatga cgggaccctc 420
gtgaaggcca acaacaaaaa taagaagctt gagaagaagt ttaacgagat taaccacaaa 480
aggtctctcg agggcctccc catcatcacc cccgacttcg aggagccatt cgacgagaat 540
gggcacctca acaacccacc cgggatcaat aggaacatct acggctacca ggggtgcgcc 600
gccaaggtgt ttgtccccag caagcacaag atggtttccc tccccaaaga gtacgagggg 660
tataacaggg atcctaatct ctccttggct ggattcagga acaggctcga gatcccagaa 720
ggagagcccg gtcacgttcc ttggtttcag aggatggaca tccccgaggg ccagatcggc 780
cacgtcaata agatccagag gttcaacttt gtgcacggta agaactccgg taaggttaaa 840
ttttcagata agaccggcag agtgaagagg tatcaccatt ccaagtacaa agacgccacc 900
aagccttata agttcctcga agaaagcaaa aaagtgagcg ccctcgacag catcctcgct 960
atcatcacca taggggacga ctgggtcgtt ttcgatatca gggggcttta cagaaatgtt 1020
ttctacagag aactcgccca gaaaggcctc accgccgtcc agttgctcga cctcttcacc 1080
ggggaccccg ttatcgaccc caagaagggc gtggtcacct tctcatacaa ggaaggagtg 1140
gttcccgttt tctcccaaaa gatcgtgccc agattcaaga gcagggacac ccttgagaag 1200
ctcaccagcc agggccccgt ggcccttctt tcagtcgacc tcgggcaaaa cgagcccgtc 1260
gctgctaggg tgtgctccct caagaacatc aacgataaga tcaccctcga caattcctgc 1320
aggatctcct ttctcgatga ctacaagaag cagataaagg actacaggga ctccctcgac 1380
gagctcgaaa ttaagatcag gttggaggct attaatagcc tcgagaccaa ccagcaagtt 1440
gaaatcagag acctcgacgt tttctccgcc gacagggcca aggccaacac cgtcgatatg 1500
ttcgatatcg accccaacct catcagctgg gacagcatgt ctgacgcaag ggtcagcaca 1560
cagatcagtg atctctactt gaagaacggt ggcgatgaaa gcagggtcta ctttgaaatc 1620
aacaataaaa ggatcaagag gagcgattat aacatcagcc agctcgtcag gcccaaactc 1680
agcgacagca ccagaaaaaa tctcaacgac agcatctgga agctcaagag gactagtgag 1740
gaatacctca aactcagtaa gaggaaactc gaactcagca gggccgtggt gaactacacc 1800
attagacagt caaagctctt gtctggcatc aacgacattg tcatcatttt ggaggacctc 1860
gacgtgaaaa agaagttcaa cggaaggggg ataagggaca tcggttggga caatttcttt 1920
agctccagga aggagaacag gtggttcatc ccagccttcc ataaagcttt ctcagagttg 1980
tcctcaaaca gaggtctctg cgtgatcgag gttaaccccg catggacctc cgctacatgc 2040
cccgactgcg gcttttgctc caaggaaaac agggacggca tcaacttcac atgcagaaag 2100
tgcggcgtgt cctaccatgc cgacatcgac gtcgctaccc tcaatatcgc cagggtcgcc 2160
gtccttggga agcccatgag cggcccagct gacagggaga ggcttgggga tacaaagaag 2220
cccagggtcg ccaggagccg gaagaccatg aagaggaagg acatttctaa ctctaccgtg 2280
gaagccatgg tcaccgcc 2298
<210> 149
<211> 2298
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 149
atggagaaag agatcactga gcttactaaa attagaaggg agtttcctaa taagaaattt 60
agttctaccg atatgaagaa ggcagggaaa ctgctcaagg cagagggccc cgacgccgtg 120
agggacttcc ttaactcctg ccaagaaatt atcggggact ttaagccacc cgtcaaaacc 180
aacattgttt ccatctctag gcctttcgaa gagtggcctg tttccatggt ggggagggcc 240
atccaggagt attacttctc cctcaccaag gaagagctgg agtccgtcca ccccgggacc 300
tcatccgagg accataaatc cttctttaac attaccgggc tctccaacta caattacacc 360
tccgtccaag gcctcaacct gatatttaag aacgccaagg ccatttacga cgggaccctc 420
gtgaaagcca ataacaaaaa taagaagctt gaaaaaaaat ttaacgagat taaccacaaa 480
agatccctgg aggggcttcc cataatcact ccagacttcg aagagccctt tgatgagaac 540
ggccacctga ataacccacc cgggattaat aggaacatct acgggtacca gggatgcgct 600
gccaaggttt ttgtcccttc caaacataaa atggtctccc tccctaaaga gtacgagggc 660
tacaacagag accccaacct gagtctggcc ggcttcagga acaggctgga gatccccgag 720
ggtgaacccg gtcacgtgcc ctggtttcaa aggatggaca tccccgaggg acagataggg 780
cacgtgaata aaatacaaag gttcaacttc gtgcacggaa aaaactccgg caaggttaag 840
ttctccgata aaaccggcag ggtcaagagg taccaccatt ccaaatataa ggacgccacc 900
aaaccctaca agttcctgga agagtccaag aaagtgtctg ctctcgattc cattctcgct 960
ataatcacca ttggcgatga ctgggtggtc ttcgacatca gaggattgta taggaacgtc 1020
ttttataggg agcttgcaca aaaaggactg accgccgtcc aactcctgga ccttttcacc 1080
ggggatccag tgatcgatcc aaaaaaggga gtggtcacct tctcctacaa agagggggtc 1140
gtgcccgtgt tctcccagaa gatcgttccc aggtttaagt ccagggatac cttggaaaag 1200
ctgacctccc aaggtcccgt cgccctgctc tccgtggatc tggggcagaa cgagcccgtt 1260
gctgccaggg tctgctccct caagaacatc aacgacaaga tcactcttga caactcctgt 1320
aggatctcct tcctggacga ttataaaaag cagattaaag actacagaga ctccctggac 1380
gagctggaaa tcaaaataag gctcgaagcc atcaactccc ttgagacaaa ccagcaagtg 1440
gagatcaggg atctcgatgt gttctccgcc gacagggcca aggccaacac cgtcgacatg 1500
ttcgacatcg atcccaacct gatctcatgg gactccatgt ccgacgcaag ggtgtcaacc 1560
caaatctccg acctctattt gaagaacggt ggcgatgaga gtagagtgta ttttgagatc 1620
aataataaga ggattaaaag gtccgactat aacatctcac agctcgttag gcccaagctg 1680
tccgactcca ctaggaagaa cctgaacgac tccatctgga agctcaagag gacctccgaa 1740
gagtatctca agctctccaa aaggaagctc gaactgtcaa gggccgtggt caattacaca 1800
atcagacaat ccaagctgct ctccgggatc aacgacatcg tgataatcct ggaggacctc 1860
gacgttaaga aaaagttcaa cggcaggggc atcagagata tcggctggga taatttcttt 1920
agttccagga aagagaatag atggttcatc cccgccttcc acaaggcctt ttccgaactc 1980
tcatccaata gggggctgtg cgttatcgag gtcaatcccg cctggacctc cgcaacctgc 2040
ccagactgcg gcttttgttc caaagaaaac agggatggaa ttaacttcac ttgcagaaag 2100
tgcggcgtgt cctaccacgc cgacatcgac gtcgccacac tcaacatcgc aagggtggca 2160
gttctgggga agcccatgtc cggtcccgcc gatagggaaa ggctcggcga tactaagaaa 2220
cccagggtgg caaggtccag aaagaccatg aagaggaagg atatttccaa ttcaaccgtc 2280
gaggccatgg tcaccgcc 2298
<210> 150
<211> 2298
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 150
atggagaaag agatcaccga gcttaccaag atcaggaggg agtttcccaa caagaagttc 60
agcagtaccg atatgaagaa ggccgggaag ctcctcaagg ccgaaggccc agacgccgtg 120
agagacttcc tcaattcatg ccaggagatc atcggggact tcaagccccc cgtcaagacc 180
aacatcgtgt ctatcagcag gcccttcgaa gagtggccag tgtccatggt cgggagggcc 240
atccaggaat actacttcag tttgaccaaa gaggagctcg aaagcgttca ccccgggacc 300
tcctccgagg accacaaaag ttttttcaac atcaccgggc tctccaacta caactacaca 360
agcgtgcagg gcctcaacct catcttcaaa aacgccaagg ccatctacga cggcaccctc 420
gtcaaagcca acaacaagaa caaaaagctc gagaagaaat tcaacgaaat taatcacaag 480
aggagcttgg agggcctccc aataataacc cccgacttcg aggagccttt cgacgaaaac 540
gggcatctca ataacccccc cgggatcaat aggaatatct acgggtatca ggggtgtgcc 600
gccaaggtgt tcgttcccag caaacataag atggtgtccc tccctaagga gtacgagggg 660
tacaacaggg atcccaacct cagccttgcc ggcttcagga acagactcga gatcccagaa 720
ggggagccag ggcacgtgcc atggtttcag aggatggaca ttcccgaggg ccaaatcgga 780
cacgttaaca aaatccagag attcaacttt gtccacggaa agaactccgg caaggttaag 840
ttttccgaca agaccgggag agtgaagagg taccaccact ccaagtataa ggacgccact 900
aagccctaca agtttcttga ggaaagtaag aaggtgtccg ccttggactc cattctcgct 960
atcatcacca tcggggatga ctgggtggtt tttgacatta ggggcttgta caggaacgtc 1020
ttctacaggg agctcgccca gaagggactc accgccgtcc agctccttga cctctttacc 1080
ggggaccccg tgattgaccc aaaaaaaggc gtcgtcacct tctcctacaa ggagggcgtg 1140
gtgcctgttt tctcccaaaa gatcgtgccc aggttcaaga gtagggatac ccttgaaaag 1200
ctcaccagcc agggacctgt tgccctcctc agcgtggacc tcggccaaaa cgagccagtc 1260
gccgccaggg tctgctccct caaaaacatc aacgacaaaa tcaccctcga caactcatgc 1320
aggatctcct tcctcgacga ctacaagaag caaatcaagg attacaggga ctccctcgac 1380
gagctcgaga tcaaaatcag gctcgaggct ataaactcct tggaaactaa ccaacaggtc 1440
gagataaggg acttggacgt gttcagcgcc gatagggcta aggccaatac agtggacatg 1500
ttcgacattg atcccaacct catttcctgg gactccatgt ccgatgctag ggtgtccacc 1560
cagatctccg acttgtacct caagaacggc ggggacgaga gcagggtcta ttttgagatc 1620
aacaataaga ggatcaagag gagcgactac aacatatccc agctcgtcag gcccaagctc 1680
agcgattcca ccagaaagaa tctcaatgac agcatctgga agctcaagag gaccagcgaa 1740
gagtacctta agctctccaa gagaaaactc gagctcagta gggccgtggt caactatacc 1800
atcagacaat ccaagctcct ttcaggcatc aacgacatcg tgatcatcct tgaagacctc 1860
gacgtgaaaa aaaagttcaa cgggaggggt atcagggata tcgggtggga caatttcttt 1920
tcctccagga aagaaaacag gtggttcatc cccgccttcc acaaggcctt ctccgagctc 1980
tcctccaaca ggggtctctg cgtgatcgag gtgaatcccg cctggaccag cgccacttgc 2040
cccgactgtg ggttctgctc caaagagaat agagacggca tcaacttcac ctgcagaaaa 2100
tgcggggtct catatcacgc cgatatcgac gtggccactc ttaatatcgc cagggtggcc 2160
gtgctcggta aacccatgtc agggcccgcc gacagggaga ggttggggga caccaagaaa 2220
cccagggtgg ccaggagcag gaagaccatg aagagaaaag acatcagcaa cagcaccgtg 2280
gaggccatgg tgacagcc 2298
<210> 151
<211> 2298
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 151
atggagaagg agataaccga actcaccaag atcaggaggg agttcccaaa taagaaattc 60
agctcaaccg acatgaagaa agctggcaag ctcctcaagg ccgaaggacc tgatgctgtt 120
agagacttcc tcaactcctg ccaggagatc ataggcgatt ttaagccccc cgtcaagact 180
aacatcgtgt ctataagcag gccttttgag gaatggcccg tgagcatggt cggcagagcc 240
atacaggagt attatttcag cctcacaaag gaggagttgg agtccgttca cccaggtacc 300
agctccgagg accataaatc cttcttcaac atcaccggcc tttccaacta caattatacc 360
agcgtgcagg ggcttaatct catctttaaa aacgccaagg ccatctacga cgggacactc 420
gtcaaagcta acaacaagaa taagaagttg gagaagaagt ttaacgagat caaccacaag 480
aggtccttgg aggggctccc tatcatcact cccgacttcg aggagccctt cgacgagaac 540
gggcacctca acaacccccc tgggatcaac agaaacatct acggatacca ggggtgcgcc 600
gccaaagtct ttgtccccag caaacacaaa atggtcagcc tccccaagga gtacgagggc 660
tacaacagag atcccaacct ctccctcgcc gggtttagga acagactcga gatccccgag 720
ggggagcccg ggcatgtccc ctggtttcag agaatggata ttcccgaggg gcaaatcggt 780
cacgtcaata aaatccaaag gtttaacttt gtccacggga aaaactctgg caaggtgaag 840
ttcagcgata agactgggag ggtgaagagg taccaccaca gcaagtataa ggacgccaca 900
aagccctaca agttcctcga ggagagcaaa aaggtctctg ctctcgatag catcctcgcc 960
atcatcacca tcggcgatga ctgggtcgtg ttcgacatca gaggcctcta caggaacgtg 1020
ttctacaggg agctcgccca gaaggggctc acagccgttc agctcctcga cctttttacc 1080
ggggaccccg tcatcgaccc caagaagggc gttgtcactt tctcctacaa agagggcgtg 1140
gtccccgtct tctcccaaaa gattgttccc aggtttaaaa gtagggacac attggaaaag 1200
ctcacctccc agggccccgt cgccctcctc tccgttgatc tcgggcagaa cgagcccgtg 1260
gcagccaggg tttgctccct caaaaacata aatgacaaaa tcaccctcga taactcctgc 1320
aggatcagct tcttggacga ctacaaaaag cagataaagg actatagaga cagcctcgac 1380
gagctcgaaa taaaaatcag gttggaagcc atcaactctc tcgagactaa ccagcaggtg 1440
gagatcaggg accttgacgt gtttagcgcc gacagggcca aggccaacac tgtggacatg 1500
ttcgacattg acccaaacct catcagctgg gacagcatga gcgacgccag ggtttccacc 1560
cagatctccg acctctacct caagaacggc ggggatgaga gcagggtgta cttcgagatc 1620
aacaacaaga gaataaagag gagcgactac aatatatccc agctcgtcag gcccaaactc 1680
agcgacagca ccaggaagaa tttgaacgac tcaatatgga agctcaagag gacctcagaa 1740
gagtacttga aattgtccaa gaggaaactc gagctcagta gggccgttgt gaactacacc 1800
attaggcaga gcaagctcct cagcgggatc aatgacatag ttatcatact cgaggacctc 1860
gatgtcaaga agaagttcaa cggcagaggc attagggata tcggctggga taatttcttc 1920
tcctccagga aagagaatag atggttcatc ccagccttcc acaaagcctt cagcgagctc 1980
tccagcaaca gaggattgtg cgtgattgag gtcaaccctg cttggacctc tgccacatgc 2040
ccagactgcg gattctgcag caaggagaac agagacggga taaattttac atgcaggaaa 2100
tgtggagtct cctaccacgc agacatcgac gttgccacac tcaatatcgc aagagttgcc 2160
gtgcttggga agcccatgag cggccccgcc gatagggaaa ggctggggga cacaaagaaa 2220
ccaagggtcg caaggagcag gaagaccatg aaaaggaagg atatcagcaa ctctacagtc 2280
gaggccatgg ttaccgcc 2298
<210> 152
<211> 2298
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 152
atggaaaaag aaattactga gctcacaaaa atcagaaggg aattccccaa caaaaagttt 60
agcagcaccg acatgaaaaa ggccggtaag ctcctcaagg ccgaggggcc cgacgccgtg 120
agggactttc tcaacagttg ccaggagatc attggagatt tcaagccccc cgttaagacc 180
aacatcgttt ccatatctag acccttcgag gagtggcccg tctctatggt tgggagggcc 240
attcaagagt attacttctc cctcaccaaa gaagagcttg aatctgttca tcccgggacc 300
agttccgagg accacaagag cttcttcaac atcaccgggt tgagcaatta caactacacc 360
tccgtgcagg gacttaacct catcttcaag aatgctaagg ctatctacga tggaaccctc 420
gtcaaagcca ataataagaa caagaagctt gagaaaaaat tcaacgagat taaccacaag 480
aggagcctcg aggggctccc aattatcacc cccgatttcg aggagccctt cgacgagaac 540
gggcacctca ataacccccc cggaatcaat agaaatatat acggatacca gggatgtgcc 600
gcaaaggttt tcgtccccag caaacacaag atggtgtccc tccccaagga gtacgagggg 660
tacaaccgag accccaacct ctcccttgcc ggttttagaa ataggctcga gatccccgag 720
ggggagcccg gtcacgttcc atggttccag agaatggaca tccctgaagg gcagatcggt 780
cacgtcaata aaatccagag gttcaatttc gttcacggga agaactcagg caaggtgaag 840
tttagcgaca aaaccggaag agtgaaaagg taccaccact ccaagtacaa ggacgccaca 900
aagccataca agttcctcga agagagcaaa aaggtgagcg ccctcgatag catacttgcc 960
atcatcacca tcggcgacga ctgggttgtc ttcgacatca gggggctcta caggaacgtt 1020
ttctacagag agctcgcaca aaaggggctt accgctgtcc agctcctcga cctcttcacc 1080
ggcgaccccg tgatcgatcc caagaagggc gttgtgacct tcagctacaa agagggggtt 1140
gtgcccgttt ttagtcagaa gattgtcccc aggttcaaaa gcagggacac actcgagaag 1200
ctcaccagcc aaggtcccgt cgccttgctc tccgtggacc tcggccagaa cgagcccgtg 1260
gccgcaaggg tgtgctctct taagaacatc aacgacaaga tcaccctcga caacagctgc 1320
aggatttcat tcctcgacga ctacaagaaa cagattaaag actacaggga cagcctcgac 1380
gagcttgaga tcaagattag gctcgaggcc atcaacagct tggagaccaa ccagcaagtt 1440
gagataagag acctcgacgt gttcagcgcc gacagggcca aagccaatac cgtcgatatg 1500
ttcgacatcg accccaacct catctcctgg gactctatgt cagacgccag ggtgtcaact 1560
caaatatccg acttgtacct caagaacggg ggcgacgagt ctagagttta ctttgagatc 1620
aataacaaga ggattaagag gtcagactac aacatctccc agctcgtcag gcccaagctt 1680
tccgattcaa ccagaaagaa cctcaacgac tccatctgga agctcaagag gacctccgaa 1740
gaatacctca agctctccaa gaggaagctc gagctctcca gagccgtcgt caattatact 1800
attaggcaga gtaagctcct tagcggcatc aatgacattg tgatcatcct tgaggatctc 1860
gacgtgaaga aaaagttcaa cggcagaggg atcagagaca ttggatggga caactttttc 1920
tcctccagga aagagaatag gtggttcatc cccgccttcc acaaggcctt cagcgagctc 1980
agtagcaaca gggggctctg cgtcatcgaa gtcaaccccg cctggacctc tgccacttgc 2040
cccgactgcg ggttctgcag caaagagaac agggacggta tcaacttcac ctgcagaaag 2100
tgcggagtga gctaccacgc cgacatcgac gttgcaacat tgaacatcgc cagggttgcc 2160
gtgcttggaa agcccatgtc agggcccgcc gacagggaga gactcggaga caccaagaag 2220
cccagagttg ctagaagcag aaagaccatg aaaaggaagg acatctcaaa ctccaccgtt 2280
gaggccatgg ttacagcc 2298
<210> 153
<211> 2298
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 153
atggagaagg agatcactga gctcaccaag atcaggagag agttccccaa caagaaattc 60
agcagcaccg acatgaagaa agcaggcaaa cttttgaagg ctgagggtcc cgacgcagtc 120
agagacttcc ttaacagttg ccaggaaatc atcggtgact tcaagccccc cgtcaagact 180
aacatcgttt ccatcagcag acccttcgaa gaatggcccg tgtcaatggt cgggagggcc 240
atccaggagt attacttcag cctcaccaaa gaggaactcg agtccgtcca ccccggcacc 300
tcatccgagg atcacaagtc cttcttcaac attacaggtc tcagcaacta caactacact 360
agcgtccagg gactcaactt gatctttaag aacgccaagg ccatctatga cgggaccctc 420
gtgaaggcca acaacaagaa caagaagctc gaaaaaaagt tcaacgagat aaaccacaaa 480
aggagtctcg agggcctccc catcatcacc cccgactttg aagagccctt cgatgagaat 540
ggacacttga ataacccccc cgggatcaac aggaacatat acggttacca ggggtgtgcc 600
gccaaggtgt tcgtccccag caagcacaag atggttagcc tccccaagga atatgaggga 660
tacaataggg atcccaatct ctccctcgcc ggttttagaa acaggctcga gatcccagaa 720
ggggagcccg ggcacgtgcc ctggttccag aggatggata tccccgaggg acaaataggc 780
cacgttaaca agatacagag gttcaacttt gtccatggca agaactccgg gaaagttaag 840
ttcagcgaca aaaccggcag ggtgaagagg tatcaccaca gcaagtacaa ggacgcaacc 900
aagccctaca agttcctcga ggaaagcaag aaggtgagcg ccctcgacag tatcctcgcc 960
atcattacca ttggggacga ctgggttgtg tttgacatca gaggcctcta cagaaacgtc 1020
ttctatagag agctcgccca gaaaggcctc actgccgtcc agctcctcga tctcttcact 1080
ggcgaccctg tcatcgaccc aaagaaggga gtggtcacct tcagctataa ggagggcgtg 1140
gttcccgtgt tctcccagaa gatcgttccc aggttcaagt ccagggatac cctcgagaaa 1200
ctcacttcac aggggcccgt tgcccttctc tctgttgatc tcgggcagaa tgagcccgtc 1260
gccgccaggg tctgttccct caagaatata aacgacaaga tcaccctcga caacagctgt 1320
agaatctcct ttctcgatga ctacaagaag caaattaagg actacaggga ctctctcgat 1380
gagctcgaga taaagatcag gctcgaagcc atcaactccc tcgagaccaa ccagcaggtc 1440
gagatcaggg acctcgatgt tttctccgcc gatagagcaa aggccaacac cgtggatatg 1500
ttcgacatcg atcccaacct catctcctgg gactccatgt cagacgccag ggtgtccaca 1560
cagatctccg acctctacct caaaaacggt ggcgacgagt ccagggttta cttcgagatc 1620
aacaataaga gaatcaagag gtccgattat aacatctcac agctcgtcag gccaaagctc 1680
tctgacagta ccaggaagaa ccttaatgac agcatctgga agttgaaaag gaccagcgag 1740
gagtacctca aactctccaa gagaaagctc gagctcagca gggctgtcgt taactacacc 1800
atcaggcaaa gcaagttgtt gagtgggatc aacgacatcg tgatcattct tgaagatctc 1860
gatgttaaga agaagttcaa cgggaggggg attagagaca ttgggtggga taatttcttt 1920
agctccagga aagaaaatag gtggttcatc cccgccttcc acaaagcctt ctcagaactc 1980
tcatctaaca ggggcctctg tgtcatcgaa gtcaaccccg catggacctc tgcaacctgc 2040
cccgactgcg gtttctgttc caaggagaac agggatggca tcaacttcac ctgtagaaag 2100
tgcggcgtca gctaccatgc cgacattgac gtggccaccc tcaacatagc cagggtggcc 2160
gttctcggca agccaatgtc cggtcccgcc gatagggaga ggctcggtga caccaagaag 2220
cccagagttg ccaggagcag aaagaccatg aagaggaagg acatctccaa tagcaccgtc 2280
gaggccatgg tgacagcc 2298
<210> 154
<211> 2298
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 154
atggagaaag agatcaccga gctcacaaag attagaaggg agtttcccaa caagaaattc 60
agctccaccg acatgaagaa ggcaggtaag ctcctcaagg ccgaaggacc cgacgcagtt 120
agggattttc tcaactcctg ccaagagatc attggggatt tcaaaccccc agtgaagact 180
aacattgtga gcatctctag gcctttcgag gagtggcccg tctctatggt ggggagagcc 240
atccaagaat actacttctc cctcaccaag gaggagttgg agagcgtgca ccccgggacc 300
agttctgagg accacaagtc tttcttcaat atcaccggct tgagcaatta caattacacc 360
tccgtccagg gccttaatct catcttcaag aatgctaagg ctatttatga cggtaccctc 420
gtcaaggcca acaacaagaa caagaagctc gagaagaaat tcaatgagat aaatcataag 480
aggtccctcg agggcctccc catcatcact ccagacttcg aggagccctt cgacgagaac 540
ggtcacctta acaacccccc cggcatcaac aggaatatct atggctacca ggggtgtgcc 600
gctaaagtct tcgtgccttc caaacacaag atggtgtccc tcccaaagga atacgagggc 660
tacaacaggg accccaactt gtccctcgcc gggttcagaa atagactcga gatccccgag 720
ggggagcccg gccacgtccc ctggtttcag agaatggata tcccagaggg ccaaattggt 780
catgtgaaca agatacaaag gtttaacttc gttcacggca agaacagcgg caaagtcaag 840
ttctccgaca agaccgggag ggttaagagg tatcatcaca gcaagtataa agacgccacc 900
aagccctaca agtttctcga ggagtcaaag aaggtcagcg ccttggactc aatcctcgcc 960
attatcacca tcggggacga ctgggtggtt ttcgatatca gaggactcta caggaacgtg 1020
ttctatagag agctcgccca gaaggggctc acagccgtcc agcttctcga cctcttcacc 1080
ggggatcccg tcatcgaccc caagaagggg gtcgttacat tctcctacaa ggagggcgtg 1140
gtgcccgtct ttagccagaa aattgtcccc agattcaagt ccagggacac cctcgagaaa 1200
ttgacatccc agggacccgt ggccctcctt tccgtggacc tcgggcaaaa cgaacctgtg 1260
gccgccaggg tgtgctccct caagaacatc aacgacaaaa tcaccctcga caactcctgc 1320
aggatcagtt ttcttgacga ttacaagaaa cagatcaagg attatagaga cagcttggat 1380
gaattggaga tcaaaatcag attggaagcc atcaacagcc tcgaaactaa ccagcaggtc 1440
gagatcagag acctcgatgt tttctctgcc gacagagcca aggccaacac cgtcgacatg 1500
ttcgatatcg accccaacct catcagctgg gattcaatgt ccgacgccag ggtctccacc 1560
cagatcagcg acctctacct taaaaacgga ggagatgagt ctagggtcta ctttgaaatc 1620
aacaataaaa ggatcaagag gagcgattac aacatctctc aactcgtgag acccaagctc 1680
tccgacagta ccaggaaaaa ccttaacgac agcatctgga agctcaagag gacctccgag 1740
gaatacttga aactcagtaa gaggaagctc gagctctcca gggccgtggt taactacacc 1800
ataaggcaga gcaagttgtt gtctggcatc aacgacatag tcatcatcct tgaggacctc 1860
gacgttaaaa agaagttcaa cggtaggggc atcagggata taggttggga taatttcttc 1920
tcctctagga aggagaacag atggttcatc cccgccttcc acaaggcctt ctccgagctc 1980
agctcaaata ggggcctttg cgtcatcgaa gtgaaccctg cctggaccag cgccacctgc 2040
cccgactgcg gcttctgcag caaggagaac agggacggca tcaacttcac ctgtaggaaa 2100
tgtggcgtgt cctaccatgc agacatcgac gttgccacct tgaacatcgc aagggtggcc 2160
gtgcttggca aacccatgtc tggaccagcc gacagagaga ggctcggcga caccaagaaa 2220
cccagggtgg ccaggagcag aaagaccatg aagaggaaag acatcagcaa ctcaaccgtg 2280
gaggccatgg tcaccgcc 2298
<210> 155
<211> 2298
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 155
atggagaagg aaatcaccga gctcaccaag attaggaggg agttccctaa caagaagttc 60
agcagcaccg acatgaagaa ggccggcaag ctcctcaagg ctgagggccc cgacgccgtt 120
agggactttc tcaacagctg ccaggagata atcggcgact tcaagccccc cgtcaagacc 180
aacatcgttt caatctctag gccctttgag gagtggcccg tttcaatggt gggaagggcc 240
atccaagagt attacttctc cctcaccaaa gaagagcttg aaagcgtcca ccccggcacc 300
agttccgagg atcataagtc ctttttcaac atcaccgggc tcagcaacta caattacacc 360
tctgtccagg gcctcaacct catattcaag aatgccaaag ctatctacga cgggaccctc 420
gtgaaggcaa acaacaagaa taaaaagctc gagaagaaat tcaacgagat caaccacaaa 480
aggagcctcg aggggttgcc catcataacc ccagacttcg aggagccttt cgacgaaaac 540
ggtcacctca ataacccccc cgggattaac agaaacatct acggatacca ggggtgcgcc 600
gccaaggtct tcgtccccag taagcataag atggttagcc tccccaaaga gtatgaaggg 660
tacaacaggg atcctaatct cagcttggcc ggcttcagga acaggctcga gatacccgag 720
ggggaacccg gccacgtgcc ctggtttcag aggatggata tccccgaagg acaaataggg 780
cacgtcaaca agatccagag gttcaacttt gtgcacggca agaacagtgg caaggtgaag 840
ttcagcgaca aaactgggag agtgaagaga taccatcaca gcaagtacaa ggatgccact 900
aagccttaca agttccttga agagtcaaag aaagtctccg cactcgacag catcctcgcc 960
atcatcacca tcggcgatga ttgggttgtc ttcgacatca gggggctcta cagaaacgtt 1020
ttctacaggg agctcgccca gaaagggttg accgccgtgc aattgcttga tctttttacc 1080
ggcgatccag tgatcgaccc caaaaagggc gtcgtgacct tctcctacaa agagggcgtg 1140
gttcctgtgt tctcccagaa gatcgtgcct aggttcaaaa gcagagacac cctcgagaag 1200
ctcacatccc aggggcccgt ggcactcctc agcgtcgacc tcggccagaa cgagcccgtc 1260
gccgccaggg tctgttcctt gaagaacatc aacgacaaga tcaccctcga caactcatgc 1320
aggatctcct tcttggacga ttacaaaaag cagatcaagg actacaggga cagcctcgac 1380
gagctcgaga tcaagatcag gctggaggcc atcaacagct tggagaccaa ccagcaggtg 1440
gagatcaggg acctcgacgt cttctccgca gacagggcaa aagccaacac cgtcgacatg 1500
ttcgacatcg accccaacct catctcttgg gatagcatgt ccgacgccag ggttagtaca 1560
caaattagcg acctctatct caagaacggt ggcgatgagt ccagggtgta cttcgaaata 1620
aacaataaga ggatcaagag atctgactac aacatcagcc agctcgttag gcctaagctc 1680
tccgatagta ccaggaaaaa cctcaatgac tcaatctgga agctcaagag gactagcgag 1740
gagtacctta agctcagcaa gaggaagctc gagctcagca gggccgtcgt caactacacc 1800
atcaggcaga gtaagcttct cagcggcatc aacgacattg tcatcatcct cgaggatttg 1860
gacgtgaaga aaaagttcaa cggtagaggc atcagggaca tagggtggga taacttcttc 1920
agcagtagga aggaaaacag atggttcatc cccgcatttc acaaggcctt ctccgagctc 1980
agctccaata ggggcttgtg cgtcatcgaa gtgaatcccg cttggacaag cgcaacctgt 2040
cccgactgtg gcttctgtag caaagagaac agagatggga tcaacttcac ctgcaggaag 2100
tgcggtgtgt catatcacgc cgacatcgac gttgcaacac tcaatatcgc cagagtcgcc 2160
gtcctcggta agccaatgtc cggccccgcc gatagggaga gactcgggga caccaagaag 2220
cccagggtcg ccagaagcag gaagaccatg aagaggaaag acatcagcaa tagcaccgtt 2280
gaggccatgg ttaccgcc 2298
<210> 156
<211> 2121
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 156
atggcggata cgcccactct gttcacacag tttcttcgtc atcacttacc tggccagagg 60
tttcgcaaag atatccttaa gcaggccggc cgcatcttgg ccaataaggg agaagatgcc 120
accatcgcct tcctccgcgg aaagagcgaa gagtctcccc cggattttca accgcccgta 180
aagtgcccga tcattgcttg cagtcgcccg ctgacagagt ggcccatata ccaggcatca 240
gtggctatcc aagggtacgt ttatgggcag agtcttgctg agttcgaagc gtcggaccct 300
ggctgcagta aggatggtct gttagggtgg ttcgacaaga ctggtgtgtg taccgactat 360
ttctcggtcc aaggtttaaa tctgatcttt cagaatgcta gaaaacgcta cattggcgtg 420
cagacgaagg tcactaaccg taatgagaag cgccacaaga agctgaagcg gattaacgct 480
aaacggattg ctgaaggcct ccctgagctg acatctgacg agcccgagtc tgccctggac 540
gagacgggac acctcatcga cccgccaggt ctgaacacaa atatatactg ttaccagcag 600
gttagtccga agcctttggc attgtcggag gttaaccagc tgccgacggc ctatgcaggg 660
tattcgacca gcggcgatga cccgatccag ccaatggtca ccaaggaccg gctctcgatt 720
tctaagggtc agccagggta catacctgaa caccaaaggg ccttgctcag tcagaagaag 780
cacaggcgga tgcgcggcta cggactgaaa gcacgcgcgc tactggtcat cgtccgcatc 840
caggatgact gggccgttat agacctgcgc tcactcttga ggaacgccta ttggaggcgc 900
atcgtacaga ctaaggaacc aagcaccata accaaactgt tgaagttggt taccggtgat 960
ccagtactgg acgccacccg tatggtggct acttttacat acaagccggg aatagtgcaa 1020
gtgcgctccg caaagtgcct gaagaataag cagggttcca agctgttctc tgagcggtac 1080
ctgaacgaga cagtgtctgt aacctctatc gacctgggca gcaataacct ggttgccgtt 1140
gcgacctatc gcctggttaa tgggaatacc ccagagttgc tccagaggtt cactctccca 1200
tcgcatctcg ttaaagactt tgagaggtac aaacaggcgc atgacacgct cgaggactca 1260
atccaaaaga cagcggttgc ctcccttcct caggggcaac agactgagat tcgcatgtgg 1320
agcatgtacg ggttccgaga ggcccaagag agggtgtgcc aagagctcgg cctggcggac 1380
ggtagcatcc cctggaacgt gatgacggct actagtacga tcctcacgga cctttttctt 1440
gcacgcggtg gcgaccccaa gaagtgcatg ttcacctcag aaccgaagaa gaagaagaac 1500
agtaagcaag tactctacaa gatccgtgac agagcatggg cgaaaatgta taggaccctc 1560
ctcagcaaag agacgaggga ggcctggaat aaggcgttgt ggggcctaaa gaggggtagt 1620
ccagactacg cccggctcag taagcgaaag gaggagctgg ccagaagatg tgttaattac 1680
accatttcaa cagctgagaa aagagcgcaa tgcggccgta cgatcgtggc cctcgaggac 1740
ctcaacatcg gctttttcca cggcagaggg aagcaagaac ctggatgggt tgggctattc 1800
actcgcaaga aggaaaatag gtggctcatg caggcgttgc ataaagcctt cctggagttg 1860
gcgcaccata ggggatacca cgtgatcgag gtgaacccgg cttatacgtc ccaaacctgc 1920
ccagtgtgca gacattgcga tccagacaac agggaccaac ataacaggga ggcattccac 1980
tgcatcggct gtggattccg cgggaacgcg gaccttgacg ttgcgacgca caacatcgca 2040
atggtcgcca taaccggtga gtccctgaag agggcccgtg gctcggtggc gagcaagaca 2100
ccacaaccgc tcgccgcgga a 2121
<210> 157
<211> 2121
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 157
atggcagaca ccccaacgct gttcacgcag ttcctccgac atcatctgcc aggccagcgc 60
ttccggaaag acatactcaa gcaagcaggc cggatcctgg ccaataaggg tgaagacgct 120
acaatcgcct ttctgcgggg gaagtctgag gagtcgcctc cggatttcca accaccggtt 180
aagtgcccaa taatcgcatg cagtagaccc ctgaccgaat ggcccatata ccaagcctct 240
gtggcgatcc agggatacgt gtatggtcag tctttggctg agttcgaggc ttcagacccg 300
gggtgttcta aggacggcct gctcggctgg ttcgataaaa ctggcgtgtg cactgactac 360
ttcagcgttc aaggcttaaa cctcatcttt cagaatgcac ggaagcgcta cattggggtc 420
caaaccaagg tcacaaatcg taacgaaaag cggcataaga agttgaagag aataaacgct 480
aaaaggatcg cggagggatt gcctgagctg acctccgatg aaccagagtc cgcactcgac 540
gagacgggcc acctcatcga ccctccaggc cttaacacca acatctactg ctaccagcaa 600
gtatcgccaa agccgttggc gttgagtgag gtaaatcaat tgccgacggc atacgcgggc 660
tattctacgt cgggagacga tcctattcag cccatggtga cgaaggaccg tctgagcatc 720
tccaaggggc agccaggata catcccagag caccagcggg ccctcctttc tcagaagaag 780
cataggcgca tgaggggtta cgggctgaag gcacgtgctc tcctagtgat tgttcgcatc 840
caggacgact gggccgttat tgatctgaga tcgctgctca ggaatgccta ctggcgcagg 900
atcgtgcaga cgaaagagcc atcaaccatc accaagctcc tgaagctagt gactggggac 960
ccagtactag atgccacacg gatggtggca acttttacct acaaaccggg catcgttcag 1020
gtccggagcg cgaaatgcct caaaaacaag cagggctcta agctattctc cgaaaggtac 1080
ctgaacgaaa ccgtgtccgt tacctctatt gatctcggat ccaataattt ggtggctgtt 1140
gccacgtacc ggctcgtcaa cggaaacacg ccggagctgc tgcagaggtt tacgttgccc 1200
tctcacctgg tgaaggattt cgagcgttat aagcaggctc acgacaccct ggaggattcc 1260
atccaaaaga cagctgtcgc ctccttgcca cagggccaac agactgaaat ccgtatgtgg 1320
tctatgtacg gtttccgcga ggctcaagaa cgggtgtgtc aagagttggg ccttgccgac 1380
gggtctatac cttggaacgt tatgactgct acctctacta tccttaccga cctgttcttg 1440
gcgcgtggtg gtgatccgaa aaagtgcatg ttcacgagtg agcctaagaa gaaaaagaac 1500
tccaaacagg tcctgtataa gatacgtgac agggcttggg ctaaaatgta ccgcactctg 1560
ctgtctaagg agacgaggga agcatggaac aaggcccttt ggggattgaa gagaggtagc 1620
cctgattacg cacgcctgtc caagcgtaag gaggagctcg cacgacgctg cgtcaattac 1680
accatcagta cagctgagaa gagggctcag tgcggtagga ccatcgtggc cctggaggat 1740
cttaacattg gattcttcca tggaaggggt aaacaagagc cggggtgggt cggactcttc 1800
acccgcaaga aggagaaccg ctggctcatg caagcattgc ataaggcctt cctcgaacta 1860
gcacaccacc gaggctacca tgtgatcgag gtcaaccctg cgtatacatc ccagacctgc 1920
ccggtctgca ggcactgcga tcccgacaac cgtgatcagc ataaccgcga ggcatttcat 1980
tgcattggtt gcggcttccg cggcaacgcc gatctcgacg tggccacaca caacattgct 2040
atggttgcca tcactggaga gagcctcaag cgcgcgaggg gctcagttgc atcgaagaca 2100
cctcagcctt tagcggcaga g 2121
<210> 158
<211> 2121
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 158
atggcggaca ccccgaccct cttcacacag ttcctcagac accacttgcc tggccagcga 60
ttccggaagg acatccttaa gcaggccggg agaatcctag caaacaaggg ggaagatgcc 120
acgatagctt tcctcagagg aaagtctgag gagtcaccgc cagatttcca accgcctgtc 180
aagtgcccga tcatcgcctg ctccagacct ctgacagaat ggcctatcta ccaggcttca 240
gtcgctatac agggatatgt gtacgggcaa tctctcgcgg agtttgaagc ctctgaccct 300
ggttgcagta aagatgggct gctggggtgg ttcgataaga ccggagtctg cacagattat 360
ttctcagtgc agggcctgaa cctgattttc cagaatgcta gaaagcgata cattggcgtc 420
cagacaaagg tgactaaccg caacgaaaag agacacaaga agcttaagag aattaatgca 480
aagaggatcg cggagggttt gcctgaactc acctctgatg aaccggaatc ggcgctcgac 540
gagaccggcc acctgatcga cccacctggg ttgaacacaa acatctattg ttaccaacaa 600
gttagtccca aaccgttggc cctctctgag gttaaccaac tccctacggc ctacgccggc 660
tactcgactt ccggcgacga cccaatccag ccgatggtga ctaaggaccg cttatctatc 720
agcaaagggc agccagggta catcccagaa catcaaaggg ctctcctttc gcaaaagaag 780
catcggcgga tgagaggcta cgggcttaag gcccgcgcct tgttggttat tgtgaggatc 840
caagacgact gggcggtaat cgacctacgc tcgctgctgc gtaacgccta ttggcgccgg 900
atcgtacaga ctaaggagcc cagcacaatt acaaagctgc tcaagctggt taccggcgac 960
cccgtgctag atgcgacccg gatggtcgcg acatttacct acaagcctgg catcgtgcag 1020
gtacggagcg ccaaatgcct taaaaataag caaggctcga agctcttctc tgaaaggtac 1080
ctgaacgaga ccgtgtctgt aacttcgatc gatttgggta gcaacaatct cgtggccgtg 1140
gcaacctatc gcttagtaaa cggtaacacc ccagagctgc ttcagaggtt taccttaccg 1200
tctcatctgg ttaaggactt tgagagatac aaacaagctc atgacactct ggaagattcc 1260
atccagaaga cagctgtggc atcactccca caaggccagc aaacagagat acgtatgtgg 1320
agtatgtacg gattccgcga ggcccaggaa agagtctgcc aggaattagg cctagccgac 1380
ggcagtatcc cttggaatgt gatgactgcg accagcacca ttctcacgga cctctttctg 1440
gcaaggggtg gcgatcctaa gaagtgcatg ttcacctcgg aacccaagaa gaaaaagaac 1500
tctaagcaag tcttgtacaa gatccgggat cgggcttggg caaagatgta caggaccctg 1560
ctgtctaaag agacccggga agcatggaac aaagcgctct ggggcctcaa gcggggctct 1620
ccggactatg ctcggctctc gaagcgaaaa gaagaattgg ccagacggtg cgtaaattac 1680
actatctcca cagctgagaa aagagctcag tgcgggcgca caattgtggc cctagaagac 1740
ctaaacatag ggttcttcca cggccgcgga aagcaagaac ccgggtgggt gggtttgttc 1800
acaaggaaga aggagaacag atggctcatg caggctttgc acaaggcgtt cctcgagttg 1860
gctcatcatc gaggatatca cgtcatcgaa gttaacccgg catacacatc acagacatgc 1920
cccgtttgcc ggcactgcga tccagataac agagatcaac acaacaggga agcatttcac 1980
tgcattggct gcgggttccg cggcaacgcg gatttggacg tggcaacaca caatattgcg 2040
atggtggcca taacagggga gagcctcaag cgggccaggg ggagcgtggc ctccaaaacc 2100
cctcaaccct tagctgctga g 2121
<210> 159
<211> 2121
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 159
atggccgaca ccccgaccct cttcacccag ttcctccgcc accatctgcc gggccagcgc 60
ttccgcaagg acatcctgaa gcaggccggc aggatcctcg ccaacaaggg cgaggacgcc 120
accatcgcct tcctccgcgg caagagcgag gagtccccgc cggacttcca gccgccggtg 180
aagtgcccga tcatcgcctg cagcaggccg ctcaccgagt ggccgattta ccaggcctcc 240
gtggccatcc agggctacgt gtacggccag tccctggccg agttcgaggc ctccgacccg 300
ggctgctcca aggacggcct cctgggctgg ttcgataaaa ccggcgtgtg caccgactac 360
ttctccgtgc agggcctgaa cctgatcttc cagaacgcca ggaagaggta catcggcgtg 420
cagaccaagg tgaccaacag gaatgaaaag aggcacaaga agctgaagcg catcaacgcc 480
aagaggatcg ccgagggcct cccggaactc accagcgacg agccggagag cgccctggac 540
gagaccggcc acctgatcga cccgccgggc ctgaacacca acatctactg ctaccaacag 600
gtgagcccga agccgctggc cctgagcgag gtgaaccagc tgccgaccgc ctacgccggc 660
tactccacct ccggcgacga cccgatccag ccgatggtga ccaaggaccg cctgagcatc 720
tccaagggcc agccgggcta catcccggag caccagcgcg ccctgctctc ccagaagaag 780
caccgcagga tgcgcggcta cggcctgaag gcccgcgccc tcctggtgat cgtgaggatc 840
caggacgact gggccgtgat cgacctccgc agcctgctca ggaacgccta ctggcgcagg 900
atcgtgcaga ccaaggagcc gagcaccatc accaagctcc tgaagctcgt gaccggcgac 960
ccggtgctgg acgccacccg catggtggcc accttcacct acaagccggg catcgtccag 1020
gtgaggagcg ccaagtgcct caagaacaag cagggctcca agctgttcag cgagaggtac 1080
ctcaacgaga ccgtgtccgt gacctccatc gacctcggca gcaacaacct cgtggccgtg 1140
gccacctacc gcctcgtgaa cggcaacacc ccggagctgc tccagcgctt caccctcccg 1200
tcccacctcg tgaaggactt cgagaggtac aagcaggccc acgacaccct ggaggactcc 1260
atccagaaga ccgccgtggc ctccctcccg cagggccagc agaccgagat caggatgtgg 1320
agcatgtacg gcttccgcga ggcccaggag agggtgtgcc aggagctggg cctcgccgac 1380
ggctccatcc cgtggaacgt gatgaccgcc acctccacca tcctgaccga cctgttcctc 1440
gcccgcggcg gcgacccgaa gaagtgcatg ttcaccagcg agccgaagaa gaagaagaac 1500
tccaagcagg tcctgtataa aatacgcgac cgcgcctggg ccaagatgta ccgcaccctc 1560
ctgagcaagg agaccaggga ggcctggaac aaggccctgt ggggcctgaa gaggggcagc 1620
ccggactacg cccgcctctc caagaggaag gaggagctgg ccaggcgctg cgtgaactac 1680
accatcagca ccgccgagaa gagggcccag tgcggccgca ccatcgtggc cctggaggac 1740
ctgaacatcg gcttcttcca cggcaggggc aagcaggagc cgggctgggt gggcctgttc 1800
accaggaaga aggaaaatag gtggctcatg caggccctgc acaaggcctt cctggagctg 1860
gcccaccaca ggggctacca cgtgatcgag gtgaacccgg cctacaccag ccagacctgc 1920
ccggtgtgcc gccactgcga cccggacaac agggaccagc acaacaggga ggccttccac 1980
tgcatcggct gcggcttcag gggcaacgcc gacctcgacg tggccaccca caacatcgcc 2040
atggtggcca tcaccggcga gtccctcaag cgcgcccgcg gcagcgtggc ctccaagacc 2100
ccgcagccgc tggccgccga g 2121
<210> 160
<211> 2121
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 160
atggctgaca cgcctacgct gttcactcaa tttttgcggc atcatttgcc aggccaacgg 60
ttccggaagg acatcttaaa acaggcgggt aggatccttg caaataaggg cgaagacgca 120
accatagcat tcctgagggg taagtccgaa gaaagcccac cggacttcca gccgcctgtg 180
aagtgtccta ttatcgcctg ctctagaccc ctaactgaat ggccgattta tcaagcgtcg 240
gtcgctatcc aggggtacgt ctacggccag agtcttgccg agttcgaagc ctctgatcct 300
ggctgctcca aggatggttt gctcggctgg tttgacaaaa cgggcgtgtg tacggactac 360
ttctcggtgc agggcttgaa tctgattttt cagaacgctc gtaagcgata tataggggtc 420
cagactaagg tgacgaatag gaatgaaaag cggcacaaga aactcaagcg aatcaacgca 480
aagcggatcg cggagggctt gccggaattg actagcgatg aaccggaatc agcacttgac 540
gagacgggcc acttaatcga cccccccggg ttgaacacca acatctactg ctaccagcag 600
gttagcccga agcctctcgc actctctgag gtcaaccagc tacctacagc ctacgctgga 660
tactccacat ctggggacga tccaatccag ccaatggtga ccaaagacag gctatccata 720
tcgaagggcc aacccggata tatccccgaa caccagcgag ccctgttgag ccagaagaag 780
caccgcagga tgagaggtta tggcctaaag gcccgcgcgc tgctcgtcat cgtacgtata 840
caggacgact gggcggtgat cgacctccgc agccttctca ggaacgctta ctggcggaga 900
atagtgcaga cgaaggagcc ttccactatt actaagctac tgaaactcgt aaccggtgat 960
ccagttctcg atgctactcg catggtggca acattcacat acaagccagg gatcgtgcag 1020
gtgcgtagcg caaagtgctt gaagaataag cagggatcaa agctgttctc agagcggtac 1080
ctcaatgaaa ccgtcagtgt gacctcgatc gatctcggat ccaataactt agtcgctgtc 1140
gcgacgtatc gccttgtgaa cgggaacacc cctgaacttc ttcagaggtt tacccttcca 1200
agccacttgg tcaaggattt tgagagatat aaacaggctc atgacaccct ggaggattca 1260
atacagaaga ctgcagtcgc tagtctcccc caaggccagc agacggagat caggatgtgg 1320
tccatgtatg gcttcaggga ggcccaagaa agggtttgcc aggaattggg tcttgcggac 1380
ggctccatac cctggaacgt tatgacggcc acgagcacta ttcttacaga cttgtttttg 1440
gctcgcggcg gcgacccaaa gaagtgcatg ttcaccagtg agccgaagaa gaagaagaat 1500
tcaaagcaag tcctgtacaa aatccgggat agggcatggg ctaaaatgta caggacccta 1560
ctcagcaaag aaacccgtga ggcatggaat aaggccttat gggggctgaa acgcggcagt 1620
cccgattatg cgcgcctctc caagcgaaag gaggagctcg cgcgtaggtg cgtcaattac 1680
acaatttcaa cggcggagaa gcgtgcccag tgcgggcgta ctatagtcgc cttggaagac 1740
ctgaacatcg gtttcttcca cggccgcggc aagcaagagc cggggtgggt tggccttttc 1800
acccgtaaga aggaaaacag gtggctgatg caggccctgc ataaggcctt tctagagctc 1860
gcgcatcacc gcgggtacca cgttatcgag gtgaatccgg catacacttc ccagacatgt 1920
cccgtctgcc gccactgtga cccggacaat cgcgaccagc acaatcggga agcgttccac 1980
tgcatcggtt gcggtttcag aggcaatgcc gacctcgacg tggcaaccca taatatcgct 2040
atggtggcta ttaccggcga gtcgttgaaa cgggcgaggg gctcagtggc ctctaagacc 2100
ccacagccac ttgcagccga g 2121
<210> 161
<211> 2121
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 161
atggctgaca caccgacttt gtttactcag ttcctgaggc accacctgcc gggtcaaaga 60
ttccggaaag atatcctcaa gcaggcaggt cgaattttgg cgaacaaggg ggaagatgcg 120
acgatcgcgt tccttagagg caaaagcgag gagtcccctc cggatttcca gcctcccgtt 180
aaatgcccta ttatcgcttg ttcacgtccc cttacggagt ggccaattta ccaggcttcg 240
gtggcgatac aggggtatgt gtatggccaa tcccttgcag agttcgaagc aagcgaccca 300
ggttgctcaa aggatggttt actgggttgg ttcgacaaaa ccggagtttg caccgactac 360
ttcagcgtgc aaggcctcaa cctgatcttt cagaacgctc gcaagcgtta tatcggggtc 420
cagaccaagg tgacaaacag gaacgagaag cggcacaaaa agctaaagag gattaacgcc 480
aaacgtatcg ccgaaggtct tcccgagctt acaagcgacg agccggaatc cgcattggat 540
gagacaggcc atctgattga cccgcccggt ttaaacacta acatatactg ctatcagcaa 600
gtctcgccta aacctctggc cctctcagag gtaaaccagt tgccaacggc ctatgcaggg 660
tatagtacct caggcgatga ccctatccag cctatggtca ccaaagaccg tctcagcatc 720
agtaagggtc agccaggata cataccggaa caccaaaggg ctctgctttc gcagaagaaa 780
caccggcgca tgaggggcta cgggctaaaa gcccgggcac tcttggtcat tgtgcgcatt 840
caagatgact gggctgttat tgacctgaga agccttcttc gtaatgctta ttggcgtagg 900
atcgtacaga ccaaagagcc atcaaccata acaaagctgt tgaagttagt cacaggggat 960
cccgtgcttg acgcaaccag aatggtggcg actttcacct acaagccagg tatcgtgcaa 1020
gtcagaagtg ctaagtgtct taagaacaaa caaggtagta agctattcag cgagcggtac 1080
ttgaacgaga ccgtgtccgt tacgagtata gatttgggct ctaataattt ggtcgctgtg 1140
gcaacctacc gcctcgtgaa cggcaatacc cccgaactgc tccagagatt cacgctccca 1200
agccacctag tcaaggactt cgagcggtac aagcaagctc acgacacact tgaagattca 1260
atccagaaga ctgctgtcgc gagccttccc cagggtcaac aaaccgaaat tcgtatgtgg 1320
agcatgtacg gtttccgcga ggcgcaggag cgcgtgtgcc aggaactcgg cttagccgac 1380
ggcagcatcc catggaatgt gatgactgcg acgtctacaa tcttaactga cctgttcctc 1440
gcaaggggcg gtgacccaaa gaaatgcatg ttcactagcg aaccgaagaa aaagaaaaat 1500
tccaagcaag tgctatacaa gatccgtgac cgtgcgtggg ccaagatgta ccggaccctc 1560
ctctctaagg aaacccgcga agcatggaat aaggctttgt gggggctgaa gaggggcagt 1620
ccagattacg ctaggctctc gaagaggaaa gaagagcttg cgagaagatg tgtgaattac 1680
actattagca ccgccgagaa acgtgcacag tgcggtagga ccatcgtggc gctggaggac 1740
ttgaacattg gcttcttcca cggcaggggc aagcaggagc ccggttgggt tggcctgttc 1800
acaaggaaga aggaaaatcg ctggcttatg caggcccttc acaaggcgtt cctagagcta 1860
gcgcaccacc gcggttacca tgttatagaa gttaacccgg cgtatacttc ccagacttgc 1920
ccggtgtgcc gccactgcga tccagataac cgcgatcaac acaatcgaga ggccttccac 1980
tgcatcggct gcggcttccg agggaacgcc gatctggatg ttgctacgca caatatcgca 2040
atggtggcaa ttacggggga gtcacttaaa cgcgcgagag gctccgtcgc gtcgaaaacc 2100
cctcaacccc ttgccgccga a 2121
<210> 162
<211> 2121
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 162
atggcggata cgcccaccct tttcacacag ttcctccgcc accaccttcc agggcagcgg 60
ttccgcaaag atattctgaa acaggctggt aggattttgg ctaataaggg cgaggatgct 120
actatagcat tcctgcgtgg aaagagcgaa gagagcccac cggacttcca gccgcccgtt 180
aaatgcccca ttatcgcctg cagcagacct ttaacagagt ggcctatcta ccaggcgtcg 240
gtcgcgatcc aggggtacgt ctacggccag agcctcgccg aattcgaggc cagtgaccct 300
ggatgcagca aggatggtct tctgggttgg ttcgataaaa ctggtgtgtg cacagactat 360
ttctcggtgc agggactcaa ccttattttt caaaacgcac gcaagaggta tataggggtt 420
caaaccaagg tcacaaaccg caatgaaaag cgccacaaga aactgaagag aatcaacgcg 480
aagaggatag ctgagggctt gcctgaattg acctcagacg agccggaatc ggctcttgac 540
gagacaggac acctcatcga ccccccgggc ttgaatacca atatctattg ctaccaacag 600
gtgagtccta agccactggc cttgagcgag gtgaaccagc tgcccactgc ctacgccggt 660
tactcaacct ctggcgatga tccgatccag ccaatggtca caaaggaccg gctcagcatc 720
tccaagggtc agccaggata cataccggag caccagcgag ccctgctgtc acagaagaaa 780
caccgcagga tgagggggta cggccttaag gcacgggccc tgcttgttat cgtgaggatc 840
caggacgatt gggccgtgat tgacctgagg tctcttttaa ggaatgcgta ctggcgtagg 900
attgtgcaga caaaggagcc aagcacaatc actaagctcc ttaagctggt tactggcgac 960
ccggtattgg acgccaccag aatggtggca acgtttacct acaagccggg tatcgtacag 1020
gttaggtccg ccaaatgtct taaaaataag cagggttcga aactgttctc cgagcgttat 1080
cttaacgaga cagtctccgt caccagcatt gatctggggt caaacaacct agtggctgtg 1140
gcaacgtacc gcttggtgaa tggcaacact ccggagttgc ttcagcggtt cactttaccc 1200
tcccatcttg tcaaggactt tgagcgatac aagcaagccc atgataccct tgaggacagc 1260
atccagaaga cggcggtcgc ctccctcccc caggggcagc agaccgaaat ccggatgtgg 1320
agcatgtacg gcttccgcga ggcacaggag cgggtctgtc aagaactggg cctcgctgat 1380
gggagcattc catggaacgt gatgacggca accagcacaa ttttgactga cctgttcttg 1440
gcccgtggcg gcgatccaaa gaagtgtatg tttacttcag aaccgaaaaa gaagaagaac 1500
tccaagcagg tgttgtacaa gatccgcgac agagcatggg ctaaaatgta ccggacattg 1560
ctgagtaaag agacgcgtga ggcctggaat aaagcgttgt ggggacttaa gcgggggtcc 1620
cctgactacg caagattatc aaagcggaaa gaggagctgg cgaggaggtg cgtaaattac 1680
acgatctcca cggcagaaaa aagagcacag tgcggacgga cgattgttgc gctggaggat 1740
ctgaatatcg ggtttttcca cggtcgcggg aagcaagaac cggggtgggt gggtctcttc 1800
acgcggaaaa aggaaaaccg gtggcttatg caggctctcc ataaggcgtt cctcgagctt 1860
gcccaccata gaggctatca cgtaattgaa gttaatccgg cctacactag ccaaacctgt 1920
cccgtgtgtc gtcactgtga cccggacaac cgcgatcaac ataataggga ggcatttcac 1980
tgtatcggtt gcggattcag gggcaatgct gacctcgacg tggccacgca taatatcgct 2040
atggtggcca ttaccggcga aagcttgaag agagccaggg ggtctgtcgc gtctaaaacg 2100
ccccagcccc ttgcagcgga a 2121
<210> 163
<211> 2121
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 163
atggctgaca cacctacgct attcacccag tttctgcgcc accatctccc aggtcagaga 60
tttcgcaagg acatccttaa gcaagccggt cggatcttag ccaataaagg cgaggatgcg 120
acgattgcat tcctgcgtgg caagtccgag gaatcccctc ctgacttcca gccgccggtg 180
aagtgcccca taatcgcatg ttctcggcct ctgaccgaat ggcccattta ccaggccagc 240
gtggccattc agggctatgt gtacgggcag agtctggcgg aattcgaggc aagcgaccct 300
gggtgctcaa aagatggatt gctgggctgg ttcgataaaa caggcgtatg cactgactac 360
ttctccgtcc agggccttaa tctgattttc caaaacgcta gaaaacgcta cattggtgtg 420
cagacaaagg tgaccaaccg gaatgagaag aggcataaaa agctcaagag aatcaatgcg 480
aagcggatcg ccgaagggct gcccgaactg accagcgacg agccggagtc agccttggat 540
gagactggcc acctcataga ccctccaggt ctgaacacaa acatttactg ctatcaacag 600
gtgtcaccaa agcccctcgc cctaagcgag gtgaatcagt taccgactgc gtacgccggc 660
tactccacct ccggtgacga cccaatccaa cccatggtaa ctaaagacag actctcaatc 720
tctaaagggc agcccggata cattcccgaa caccaaagag ctctcctgag tcagaagaag 780
cacaggcgta tgaggggtta cggcctcaag gctcgggcat tacttgtgat cgttcgcata 840
caggatgatt gggctgtcat agacctgcgc tcccttttgc gcaacgcgta ctggcgaagg 900
atcgtccaga ccaaggagcc ctccacgatc actaaactcc ttaagctagt gactggcgac 960
cccgtgctgg acgctacccg gatggtagcg acctttacct acaagccggg catagttcaa 1020
gtccgctcag cgaagtgcct aaaaaacaaa cagggctcta agctgtttag cgagaggtac 1080
cttaatgaga cagtttccgt aacctctatc gacctcggtt ccaacaactt ggttgcggtt 1140
gccacctacc ggctggtaaa cggaaacacc cctgaactac tgcagagatt cacccttcct 1200
tcccatctcg tcaaggattt cgagcgctac aagcaggcgc atgatacgct ggaagacagt 1260
atccagaaga ccgcagtggc gtcacttcct cagggccaac aaacagagat tcggatgtgg 1320
tctatgtacg gtttcagaga agcgcaggag cgggtctgcc aggagttggg cctcgcagac 1380
ggatccatac cgtggaacgt catgactgca acctctacca tcttgaccga tttgttttta 1440
gcgcggggcg gcgacccgaa gaaatgcatg ttcacttcag agcctaagaa aaagaagaat 1500
tccaagcagg tactctacaa gatcagggac agggcatggg ctaaaatgta taggacactc 1560
ttgtccaagg agacgaggga ggcttggaat aaggcactgt ggggccttaa gcgcggttct 1620
cctgactacg cacgtttatc caaaaggaag gaggagttag ccagaaggtg tgttaactac 1680
accatttcca ccgcggagaa gcgcgcgcag tgcgggagaa ccatcgtggc cctggaggat 1740
ctcaacattg gattcttcca cggaaggggc aaacaagagc caggctgggt tggtttattc 1800
acgcggaaga aggagaaccg ctggctcatg caagctctac ataaggcctt tcttgaactg 1860
gcgcaccatc gggggtacca tgtgatagaa gtgaacccag catatacctc ccagacctgc 1920
ccggtctgcc gacactgtga cccagacaac agagatcagc acaaccgcga ggcttttcac 1980
tgtattggct gcggattccg cgggaatgcc gacctggacg ttgccaccca caatattgct 2040
atggtggcca tcacaggcga gtcactcaaa cgcgcacgcg gctcggttgc cagcaagacc 2100
ccccagccac ttgccgcgga g 2121
<210> 164
<211> 2121
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 164
atggccgaca cacccacatt gttcacgcag ttcctaaggc accacttacc aggccaacgc 60
tttcgcaaag acatcctgaa gcaggccggc agaattctcg ccaacaaggg tgaagatgcg 120
acaatcgcat tcctcagggg caaatccgag gagtcaccgc cagattttca gccaccggtg 180
aagtgcccta tcattgcgtg ttcccgtccc ttgacagaat ggcctatcta ccaggcgtct 240
gtcgctatcc aaggatacgt ctatggccaa agcctcgcgg aattcgaagc gtctgatcct 300
ggctgcagta aggatgggct tctgggatgg ttcgacaaga ccggggtttg taccgattac 360
ttctccgttc agggcctgaa cttgattttc caaaacgcca ggaaacggta catcggcgtc 420
cagaccaagg tgacaaacag gaacgagaaa cggcataaga agctgaagcg gatcaacgca 480
aagaggattg cggagggcct tccagagcta acttccgatg agcctgagag cgctctcgac 540
gagacgggtc acctcataga ccctcccggc ctcaacacta acatctattg ttaccagcag 600
gtaagcccta agccacttgc cctctcagag gtcaaccagc ttccaaccgc ctatgccgga 660
tacagcacct cgggagatga cccgatacag ccaatggtca caaaagatag gctctcgatc 720
tcaaagggcc aacctgggta catcccggaa catcaacgtg ccctgctgtc gcagaagaag 780
caccgccgca tgaggggcta cggattgaaa gctcgggcgc ttctggtgat tgtacgcatt 840
caggatgatt gggcggttat cgacctcagg tctctcttgc gcaacgcgta ctggagaagg 900
atcgtgcaga ccaaagagcc ttccactatc accaagctgc taaaactggt gacaggcgac 960
ccagtcttag acgcaacgag aatggtcgcc accttcactt acaagcctgg cattgtccag 1020
gtgagaagcg caaagtgtct caaaaacaag cagggctcca agcttttctc ggagcggtat 1080
ttgaacgaga ctgtgagtgt caccagcatc gacctaggct cgaataacct ggtggcggtg 1140
gcaacctaca ggttagtaaa cggaaacacc ccggagcttc tccagcgctt tacgctcccc 1200
tcacatctgg tgaaggattt cgagcgctat aagcaggccc acgacacact cgaggatagc 1260
atccaaaaaa ctgccgtggc tagcctacca caaggtcagc aaacggagat ccgcatgtgg 1320
tccatgtacg ggtttcgtga ggcacaggag cgggtctgcc aggaacttgg actcgctgat 1380
ggttccatcc cgtggaatgt tatgaccgcc actagtacaa tcctcactga cctgttcctt 1440
gcacgaggcg gcgatccgaa gaagtgcatg tttaccagcg agcccaaaaa gaagaagaat 1500
agcaagcaag tactctacaa gatccgggac agagcttggg ctaagatgta tcgcacgctc 1560
ctgagcaagg agacccgcga ggcctggaac aaggccctct ggggtttgaa gaggggcagc 1620
cccgattatg ctcgcctatc taagaggaag gaggagctgg cgagacgatg tgtcaactac 1680
actatctcca cggccgagaa gagagctcag tgcggcagaa caatcgtggc cctcgaggat 1740
ctcaatattg gctttttcca cggtcgcggc aagcaggagc ccggttgggt gggtctgttc 1800
acacgcaaga aagagaaccg ctggctcatg caggcgctgc acaaagcgtt tctcgagctt 1860
gcgcaccaca ggggttacca cgtgattgag gtgaacccgg cttacacatc tcaaacgtgc 1920
ccggtctgtc ggcactgtga tccggacaat cgcgaccaac acaaccgcga ggcatttcac 1980
tgcataggat gtggtttccg tggtaacgcg gacctagatg tggctacgca taacatcgcg 2040
atggtggcaa taacgggcga aagcttaaag agggcccgcg gctcggtcgc ctcgaagacc 2100
ccacaaccgc tggcggctga g 2121
<210> 165
<211> 2121
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 165
atggctgata cgcccacgct cttcacacag ttcttgcgtc accatctgcc tgggcagcgc 60
tttcgcaaag acatcctcaa gcaggctggc aggattcttg cgaataaggg agaggatgcc 120
actattgcct ttctgagggg taaaagtgag gaaagcccac ccgactttca gcctcctgtt 180
aagtgcccta tcatcgcctg ttcaaggcca ctgacggagt ggcccatcta tcaagcgtct 240
gtggctattc aaggctacgt ctatgggcag tcacttgccg agttcgaagc gtccgacccg 300
gggtgcagta aggacgggct cttgggctgg tttgacaaaa ccggtgtatg taccgactac 360
ttctccgttc aagggttaaa cctgattttt cagaatgccc gaaagcggta cattggagta 420
cagacgaagg taactaaccg caatgagaaa cgccacaaga aacttaagcg catcaacgcc 480
aagcgtatcg ccgagggcct gccggagctc accagcgatg agccagagtc cgcgctggat 540
gagacgggcc acttgatcga cccccctggc ctgaacacca acatttactg ctatcagcag 600
gtctctccaa agccgttggc ccttagcgaa gtaaaccaac tgcccactgc ctatgccggc 660
tactccacca gtggggacga cccaatccaa cctatggtca caaaagatag gctctcaatc 720
tcgaaaggcc agccgggata cattccggaa caccaacggg ccctgctttc tcagaaaaag 780
caccgccgta tgaggggtta tggactcaag gcccgcgcac tgctggtcat cgtgaggatc 840
caggacgact gggcggtgat cgatcttagg agcttattga ggaacgctta ctggaggcgg 900
atcgtccaga ctaaagagcc atctaccatc acaaagttgc tcaagttagt cacaggtgac 960
ccagtgctgg acgcaacaag aatggtggcc acttttacat acaaaccggg aatcgtgcaa 1020
gttagaagtg ccaaatgcct taagaacaag cagggttcca agctgttctc cgagcggtac 1080
ctgaacgaaa cagtctccgt aactagcatc gacttgggct ccaacaacct ggtggctgtc 1140
gctacatatc gcttggttaa tggcaacacc cccgagttgc ttcaacggtt cacactacca 1200
tcccatctag tcaaggattt cgaaaggtat aagcaggcgc acgatactct ggaagactcc 1260
atccagaaga cggctgtggc ctcactccct caagggcaac agaccgagat ccgaatgtgg 1320
tcgatgtacg gctttcggga ggcgcaggaa cgcgtctgcc aggagctggg gctagccgac 1380
gggtccatcc cttggaacgt gatgaccgcc accagcacaa tccttacaga tctgttcctg 1440
gcgagaggcg gcgacccgaa gaagtgcatg tttaccagcg agcccaagaa gaaaaagaac 1500
tcaaagcaag ttctctacaa gatccgcgat agagcttggg ctaagatgta caggaccctc 1560
ctctcaaagg agacgcgcga agcttggaac aaggcgctct gggggttaaa gaggggatct 1620
cctgactatg ccagactgtc taagagaaag gaagaactgg cgcggaggtg tgtgaattac 1680
accatatcca cggcggaaaa gagagcgcag tgcggtcgta cgatcgtggc actcgaagat 1740
ttgaatatcg gtttttttca cggcaggggg aagcaagagc cgggttgggt tgggctcttt 1800
acacgcaaaa aggaaaaccg ctggcttatg caagccctgc acaaggcgtt tcttgaactg 1860
gctcaccatc gtgggtatca cgttattgag gttaacccag cctatacgtc tcagacatgt 1920
cctgtgtgca ggcactgcga cccagataac cgggatcagc acaacaggga ggccttccat 1980
tgcattggat gcggctttcg aggcaacgct gacctagacg tcgccacaca taatattgcc 2040
atggtggcga ttaccggcga atcactcaaa cgagcccgcg gttctgtggc ttcaaagacg 2100
ccacaaccac tcgcagctga g 2121
<210> 166
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 166
atgcccaagc ccgccgtgga gtcagagttc agtaaggtcc tgaagaagca cttccccgga 60
gagcggtttc ggtcatcgta catgaaacgg ggtggcaaga tactggctgc ccagggagag 120
gaagccgtgg tagcttactt gcagggcaag tccgaggagg agcccccaaa ttttcagccc 180
ccggccaagt gccacgtggt gactaagtcg cgggactttg ctgagtggcc aatcatgaag 240
gcttccgagg ctatacagag atatatttat gccctgtcga ccacagaacg tgccgcttgc 300
aaaccaggaa aaagctccga gtcccacgcc gcttggttcg cggccaccgg agtttcaaac 360
catgggtaca gccatgtcca gggtctgaat ctgattttcg accataccct gggaaggtac 420
gacggtgtgc tcaagaaagt gcagcttcgg aacgagaagg cccgcgccag gttggagagt 480
attaatgcga gcagagctga cgaggggctc cctgagatca aggccgagga agaggaagtc 540
gcgacaaacg aaactgggca cctgttgcag ccgcccggta tcaatccgag cttctatgtg 600
tatcagacaa tctcccctca agcctaccga ccgcgagatg agatcgtcct ccctcccgag 660
tatgctgggt atgtgcgcga tccaaatgcc cccatcccac tgggtgttgt gcgcaaccgg 720
tgcgatatac agaagggctg tcccggctac ataccagagt ggcagcgcga agctggaact 780
gccatatcac ctaagaccgg taaggcggta actgtaccgg gtctcagccc aaagaaaaat 840
aaacgtatga gaaggtactg gaggagtgag aaggaaaagg cccaagatgc gcttctggtg 900
acggttcgca taggcacaga ctgggtcgtg atagacgtga gaggcctcct ccgcaacgct 960
agatggcgca ctatcgcccc caaggacatc tccctgaacg ccctcctgga tcttttcaca 1020
ggtgaccctg tgattgacgt cagacggaac atcgtcacct ttacctatac actggacgcc 1080
tgcggaactt acgctaggaa gtggactctc aagggtaagc aaacgaaagc caccctcgat 1140
aagctcaccg caacccagac agttgcattg gtggctatag acttaggtca gacgaatcca 1200
atttccgcag gtatttcgcg cgtgacgcag gagaatggtg ctctgcagtg cgagccgctc 1260
gaccggttca cactcccaga cgatctctta aaggacattt cagcatacag gatagcttgg 1320
gacaggaatg aggaggagct cagggcccgc tccgtggagg cgctccccga ggctcagcag 1380
gctgaagtcc gagcattgga cggcgtgtct aaggagacgg cccgcacgca actctgcgcc 1440
gactttggcc tggacccgaa acgcctgcca tgggataaga tgtcatcaaa caccacgttc 1500
atctcagaag ccctgctttc caatagcgtt tcgagggatc aggtgttctt cacgcccgcg 1560
cccaagaagg gcgcgaagaa gaaggcacca gtcgaagtta tgcggaagga caggacgtgg 1620
gcgcgggcat acaagcccag gctttctgtt gaagcccaga agctgaagaa tgaggcgctg 1680
tgggctttaa agagaacctc ccccgagtat ttaaagctgt cgcgccgcaa ggaagagctg 1740
tgccggcgga gcatcaatta tgtgatagaa aagactcgaa gacgcacgca atgccaaatc 1800
gtgatcccgg tcatagaaga cctgaacgtt agattcttcc atggctccgg caagaggctc 1860
ccgggatggg ataacttctt tacagccaag aaggaaaacc gctggttcat ccagggattg 1920
cacaaggcgt tttcggacct ccgcactcac agatcttttt atgtcttcga agtgcgccca 1980
gaacggacgt ccataacctg tccaaagtgc ggccactgtg aggtggggaa ccgcgatggt 2040
gaggcgtttc agtgtctatc atgtggcaag acatgcaatg ccgacttgga cgttgcaacg 2100
cacaatctta cgcaggtggc actcactgga aagacgatgc cgaagcggga ggaaccgagg 2160
gacgcccagg gtactgcccc tgctcggaag acgaaaaagg cttccaagtc taaggctccc 2220
ccagccgaac gcgaagatca gacacccgcc caggaaccga gccagacgtc a 2271
<210> 167
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 167
atgcccaagc ccgccgtcga gtcggaattc tcgaaggtcc tgaagaagca tttccccggt 60
gaaaggtttc gcagctctta catgaagcgc ggtggtaaga tcctcgcggc tcagggagaa 120
gaagccgtag tagcttatct ccaagggaag tccgaagagg agcccccaaa cttccagccg 180
cctgctaaat gccacgtggt gacaaaatca agggatttcg ccgagtggcc aatcatgaag 240
gcttccgagg cgatacaacg atacatttac gctttgtcaa ccactgaaag ggcggcgtgt 300
aaaccaggca aaagctctga gtcacacgcg gcttggtttg cggccactgg ggtgtccaac 360
catggctata gtcacgttca gggactgaac ctgatttttg atcatacgct tggtcgatac 420
gacggcgtct taaagaaagt tcaattacga aatgaaaagg ccagggcgcg gctggagtcc 480
attaatgcct cccgggcgga tgagggacta cccgagatta aggcagagga agaggaggtt 540
gctactaatg aaaccggcca cctgttgcag ccccccggaa tcaacccctc gttctacgtc 600
taccagacga tctccccaca agcctaccgc cccagagatg agatcgtgtt gccaccggaa 660
tatgcgggct acgtgcgcga tcccaacgcg ccaatccccc taggcgtggt ccggaatagg 720
tgcgacatcc aaaagggttg tcccggatac atcccagagt ggcagcggga agccgggact 780
gccatctcgc cgaaaacggg caaggccgtg accgttcccg ggctcagtcc gaagaaaaac 840
aaaagaatga ggagatactg gaggtccgaa aaggagaagg cacaggacgc cttgctcgtg 900
acggtgcgca ttggaaccga ctgggtggtt atcgatgtga gagggctcct ccgcaatgca 960
cgctggcgaa ctatagcgcc taaggatatc tccttgaacg cccttcttga cctttttacg 1020
ggcgatccgg ttatagacgt acgaaggaac attgtcacct tcacctatac ccttgacgcc 1080
tgcgggacat acgcccgtaa gtggaccctg aaaggaaaac aaacgaaagc gaccctcgac 1140
aagctcacgg cgacgcaaac ggtcgccctc gtggccatcg acctggggca gacaaatccg 1200
atctcagcag gcatctcacg ggtcactcaa gaaaacggcg cgcttcagtg tgaaccacta 1260
gaccggttta ccctgccgga tgacctcctc aaagacataa gcgcctaccg catagcgtgg 1320
gacaggaacg aggaggaact gagggcccgg tcagtcgagg ccctcccgga ggcgcagcag 1380
gccgaggtgc gcgccctgga cggggtgtca aaagagaccg ctcgcacgca actgtgcgcg 1440
gattttggtc ttgacccaaa gcgcctccca tgggacaaga tgagcagcaa cacaacattc 1500
attagcgaag ccttgctctc aaactccgtt tccagagatc aagttttttt cacccccgcc 1560
cctaaaaagg gcgcgaagaa aaaggcaccg gtggaggtga tgcgcaagga cagaacctgg 1620
gcgcgggcct ataagccacg cctcagcgta gaggcgcaga agcttaagaa cgaggcgctc 1680
tgggcgctta agaggacatc ccccgaatac ttgaaacttt cgcgaaggaa agaggagttg 1740
tgtcgccgca gcattaacta cgtgattgag aaaacacgcc ggaggactca atgtcagatc 1800
gtgatcccag ttattgaaga tcttaacgtg cggttcttcc acgggtctgg taagaggttg 1860
ccgggttggg ataacttctt tacggcgaag aaggagaata ggtggtttat acaaggccta 1920
cacaaggcat tctcagacct aagaactcac aggtctttct acgtcttcga agtacgacct 1980
gaaagaacgt ccatcacgtg cccaaagtgc ggccactgcg aagttggcaa cagggacgga 2040
gaggcattcc aatgtttgag ttgtggaaaa acctgcaacg ccgatctcga tgtggctacg 2100
cacaacctca cccaggtcgc attgaccggc aagacaatgc cgaagcgaga agagccaagg 2160
gacgctcagg gaaccgctcc cgccaggaag actaagaagg ccagcaagtc taaggcaccc 2220
cctgcagaac gagaggatca gaccccggcc caagagccct ctcaaacatc t 2271
<210> 168
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 168
atgccgaagc cggctgtcga gagtgaattc agtaaggttt tgaagaagca ttttcctggc 60
gaaagattcc ggagctcgta catgaagaga ggtggcaaga tacttgcggc tcaaggcgaa 120
gaggccgtgg tggcctacct ccagggtaag agcgaagaag agccacccaa cttccaaccc 180
cctgccaagt gccatgttgt aaccaagtct agagacttcg cagagtggcc cattatgaag 240
gcgtccgagg ctattcagcg atacatctac gcactcagca ccaccgaacg agcggcctgc 300
aaacctggaa agtcgtccga atctcacgca gcctggttcg cggccacggg ggtgtccaat 360
cacgggtact cccatgtcca aggattgaac cttattttcg accacaccct tggacggtac 420
gacggggttc ttaagaaggt tcagctccga aatgagaaag ctcgcgcccg actcgaatca 480
attaacgcaa gccgagctga tgaggggctg ccagagatca aggcagagga ggaggaggtg 540
gctacaaacg agacaggtca cttgctccaa ccaccgggca ttaacccgtc cttctacgtt 600
taccaaacaa tcagccctca ggcctaccga cctcgtgatg aaatcgtgtt gcccccggag 660
tacgccggct acgtacgcga cccgaatgcc cctataccac taggcgtcgt gcgcaacagg 720
tgcgacatac agaagggctg cccggggtat atccccgagt ggcaacgcga agctggtacc 780
gccatctccc ctaaaaccgg caaggcagtt acggtcccgg ggctgagtcc caaaaagaat 840
aagcgcatgc gccggtactg gagaagcgag aaagagaagg cccaggatgc attactggtc 900
actgtgagaa ttggaaccga ctgggtggtc attgatgtga ggggccttct aaggaacgca 960
agatggcgca caatcgcgcc aaaagacatc tcactgaacg ctctgttgga cctatttacc 1020
ggtgacccag tgatcgacgt tcgtaggaat atcgtcactt tcacttacac actagacgcc 1080
tgcggtacct acgctcgaaa gtggactctt aagggcaagc aaactaaagc tactttggat 1140
aaactgaccg ccacgcagac ggtcgccctc gtggccatcg accttggcca gaccaatccg 1200
atcagcgctg gcatttcccg ggtcacccag gaaaacggcg cgctgcaatg cgagccgctc 1260
gataggttta cacttccaga tgatctgtta aaggacatat ctgcgtaccg cattgcttgg 1320
gaccggaatg aagaagaact cagagcgcga tccgttgaag ccctacccga ggcacagcag 1380
gcggaggtta gagccctgga cggggttagc aaggagaccg cgcgcacaca gctgtgtgcg 1440
gatttcggtc tcgacccgaa gaggctccct tgggataaga tgagctctaa caccacgttc 1500
atttccgagg cattgctgtc caattcggtc tctagggatc aagtcttctt cactccagcc 1560
cccaaaaagg gggcaaagaa gaaagctccg gtggaggtta tgaggaagga tcgcacctgg 1620
gcacgggcat acaagccgag actaagcgtc gaagcccaga agttgaaaaa cgaagcccta 1680
tgggctttga aacggacttc cccagaatac ctcaagcttt ccagacgtaa agaagagttg 1740
tgcaggcgca gcataaacta tgtcatcgag aaaacccggc gccgcactca atgccagatc 1800
gtcattccag tgatcgaaga cctcaacgtc agattcttcc acgggtccgg caagagactg 1860
cccggatggg acaacttttt caccgcgaag aaggagaacc gctggttcat ccaggggctg 1920
cataaggcat tctcagacct gaggacacac cggagctttt acgtgtttga ggtgcggccg 1980
gaacgaactt ccatcacctg ccctaaatgc gggcactgtg aggtcggcaa tcgcgatggg 2040
gaggcgttcc aatgcctctc ctgtggcaag acttgcaatg ccgacctgga cgtcgctaca 2100
cacaacctta cgcaggtcgc cctgacagga aagaccatgc caaagaggga agagccccga 2160
gacgcccaag gtacagcgcc ggctcgcaag acgaaaaagg cttctaagtc aaaggctcct 2220
cctgcggaga gagaggatca gaccccggcg caggagccta gccagacgag t 2271
<210> 169
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 169
atgccgaagc cggccgtgga gagcgagttc agcaaggtgc tgaagaagca cttcccgggc 60
gagaggttcc gctccagcta catgaagagg ggcggcaaga tcctcgccgc ccagggcgag 120
gaggccgtgg tggcctacct ccagggcaag agcgaggagg agccgccgaa cttccagccg 180
ccggccaagt gccacgtggt gaccaagtcc agggacttcg ccgagtggcc gatcatgaag 240
gcctccgagg ccatccagag gtacatctac gccctctcca ccaccgagag ggccgcctgc 300
aagccgggca agtccagcga gtcccacgcc gcctggttcg ccgccaccgg cgtgagcaac 360
cacggctact cccacgtgca gggcctgaac ctcatcttcg accacaccct cggccgctac 420
gacggcgtgc tgaagaaggt gcagctgagg aacgagaagg ccagggcccg cctggaatca 480
attaatgcca gcagggccga cgagggcctc ccggaaatta aggccgagga ggaggaggtg 540
gccaccaacg agaccggcca tctgctccag ccgccgggaa tcaacccgtc cttctacgtg 600
taccagacca tcagcccgca ggcctaccgc ccgcgcgacg agatcgtgct gccgccggag 660
tacgccggct acgtgaggga cccgaacgcc ccgatcccgc tcggcgtggt gaggaacagg 720
tgcgacatcc agaagggctg cccgggatac atcccggagt ggcagcgcga ggccggcacc 780
gccatctccc cgaagaccgg caaggccgtg accgtgccgg gcctcagccc gaagaaaaat 840
aagaggatga ggcgctactg gaggtccgag aaggagaagg cccaggacgc cctgctcgtg 900
accgtgcgca tcggcaccga ctgggtggtg atcgacgtga ggggcctcct gaggaacgcc 960
cgctggcgca ccatcgcccc gaaggacatc agcctcaacg ccctcctgga cctcttcacc 1020
ggcgacccgg tgatcgacgt gaggcgcaac atcgtgacct tcacctacac cctcgacgcc 1080
tgcggcacct acgcccgcaa gtggaccctc aagggcaagc agaccaaggc caccctggac 1140
aagctcaccg ccacccagac cgtggccctg gtggccatcg acctgggcca gaccaacccg 1200
atctccgccg gcatctcccg cgtgacccag gagaacggcg ccctgcagtg cgagccgctc 1260
gaccgcttca ccctgccgga cgacctcctg aaggacatct ccgcctacag gatcgcctgg 1320
gacaggaacg aggaggaact ccgcgccagg agcgtggagg ccctgccgga ggcccagcag 1380
gccgaggtgc gcgccctgga cggcgtgtcc aaggagaccg cccgcaccca gctctgcgcc 1440
gacttcggcc tggacccgaa gaggctcccg tgggacaaga tgtcctccaa caccaccttc 1500
atcagcgaag cattgctgag caactccgtg tccagggacc aggtgttctt caccccggcc 1560
ccgaagaagg gcgccaagaa gaaggccccg gtggaggtga tgcgcaagga ccgcacctgg 1620
gccagggcct acaagccgag gctgtccgtg gaggcccaga agctcaagaa cgaggccctc 1680
tgggccctca agcgcaccag cccggagtac ctcaagctga gcaggcgcaa ggaggagctg 1740
tgccgcagga gcataaacta cgtgatcgag aagaccaggc gcaggaccca gtgccagatc 1800
gtgatcccgg tgatcgagga cctgaacgtg aggttcttcc acggctccgg caagcgcctc 1860
ccgggctggg acaacttctt caccgccaag aaggagaacc gctggttcat ccagggcctc 1920
cacaaggcct tcagcgacct gcgcacccac cgcagcttct acgtgttcga ggtgcgcccg 1980
gagaggacca gcatcacctg cccgaagtgc ggccactgcg aggtgggcaa ccgcgacggc 2040
gaggccttcc agtgcctgtc ctgcggcaag acctgcaacg ccgacctcga cgtggccacc 2100
cacaacctga cccaggtggc cctgaccggc aagaccatgc cgaagaggga ggagccgagg 2160
gacgcccagg gcaccgcccc ggccaggaag accaagaagg ccagcaagtc caaggccccg 2220
ccggccgaga gggaggacca gaccccggcc caggagccgt cccagaccag c 2271
<210> 170
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 170
atgcccaagc cagctgtgga aagcgagttt agcaaagtgc ttaaaaagca ctttccgggc 60
gaacgattcc gctcgagcta tatgaaacgc ggcggtaaga tcctagcagc ccagggcgaa 120
gaggccgttg tagcgtacct tcaaggcaaa agcgaggagg agccgccgaa ctttcagccg 180
cccgccaagt gccacgtagt cactaagtct cgcgatttcg ccgagtggcc gattatgaag 240
gcttcagaag ccattcagcg atacatttat gccctttcga ccactgagag ggccgcctgc 300
aagccgggca aatcttcaga gagccacgcg gcctggttcg ccgcgaccgg cgtctccaac 360
cacggataca gccatgtgca aggtctcaac cttatattcg accatacact tggcagatac 420
gacggtgtcc tgaagaaagt ccaacttaga aacgagaagg caagagcccg gttagagagc 480
ataaacgcgt caagagctga tgaaggcctg ccggagataa aagcagagga agaagaggtc 540
gcaacaaatg aaaccggtca tctgttgcag ccaccgggca tcaaccctag tttctatgtt 600
tatcagacta tttctccgca ggcttacagg ccgcgggatg aaatcgtatt gccgcccgaa 660
tacgctggtt atgtgcggga tcccaatgcc ccaatcccac tcggagtagt cagaaaccgc 720
tgcgacatcc agaagggctg cccaggctac ataccggagt ggcaacggga ggccgggaca 780
gctatctctc caaaaacagg taaggccgtg acagtgccag gcctctctcc gaagaagaat 840
aagcggatgc gccggtattg gcgcagcgag aaggagaagg cgcaggacgc actactggtg 900
acagtaagaa ttggtactga ttgggtggtg attgatgtcc gcggactgct ccggaatgcc 960
cgctggcgca caatcgcacc taaggatata tccttgaacg ctcttctcga tctcttcact 1020
ggcgacccag tgatcgatgt gcgcagaaac atcgttacat tcacatacac cctagacgcg 1080
tgcggcacgt atgcccgcaa gtggacactt aaggggaagc aaacaaaggc taccctggac 1140
aaactcacgg cgacccagac cgtggccctt gtagcgatcg acttggggca gacgaatccc 1200
attagcgctg gtataagtag ggttactcag gaaaatggcg ctctgcagtg cgaacctctg 1260
gacaggttca cgcttccgga cgatctattg aaggacattt cggcttatcg catagcctgg 1320
gaccgcaacg aggaggagct tcgtgcacgg agcgttgaag cattaccaga ggcgcagcaa 1380
gcggaagttc gtgcattgga tggtgtgtcc aaggagacag cgcgaactca actatgtgcg 1440
gatttcggtc tcgaccctaa acgtctgccg tgggacaaga tgagctcgaa tactacattt 1500
atctctgagg ctctgttgtc taatagcgtc tcccgcgatc aagtgttctt cacgccggcc 1560
ccaaaaaaag gagccaaaaa gaaggccccg gtggaggtga tgagaaagga caggacttgg 1620
gctcgagcgt ataagccacg tttgtccgtg gaggctcaga agctgaaaaa cgaagcgttg 1680
tgggcactga agcgcaccag cccggaatat ctaaagctct cgagacgcaa agaggaactc 1740
tgcaggcggt cgatcaatta cgtcattgag aagacccgtc gacgcaccca gtgtcaaatt 1800
gttataccag taatcgaaga cctcaacgtg aggttcttcc atgggtccgg caagagactg 1860
cccggctggg ataacttctt caccgccaag aaggagaacc gttggtttat tcagggactg 1920
cacaaggcct tctctgatct gagaactcac cggtccttct acgtgttcga ggtcaggccc 1980
gaacggacga gcatcacttg tccgaagtgc ggacactgcg aagtcggcaa ccgcgatgga 2040
gaggcattcc agtgtctctc ctgtgggaaa acatgcaacg ccgacctcga cgtggctacc 2100
cacaacctta ctcaagtggc gctcacgggt aaaacgatgc ctaagcgtga ggagccccgg 2160
gatgcgcagg ggaccgctcc cgctcgtaag accaagaagg cgagcaagtc aaaggctccc 2220
ccggccgaac gggaggacca gactccggcg caggagccgt cccagacctc c 2271
<210> 171
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 171
atgcccaagc cagcggttga gtcggaattc tctaaggtcc tcaagaagca ctttcctgga 60
gagcgattca ggtcttcata catgaaacgt ggaggcaaga ttctggctgc tcagggggaa 120
gaagctgtgg tggcatacct ccagggcaag tccgaggaag agcctcccaa tttccagcca 180
ccggctaagt gtcatgtggt caccaagtcc agagactttg ccgagtggcc gatcatgaag 240
gcttcggagg ctatccaacg gtacatttat gccctgtcca ctactgagag ggcagcgtgc 300
aagccaggaa aatcttcaga gagccacgcc gcctggttcg ccgctaccgg cgtctccaac 360
cacggctaca gtcacgtgca aggacttaat cttattttcg accatacact cggtcggtat 420
gacggtgtgc tcaagaaagt gcaactccgc aacgaaaagg ccagggccag gctggaatcc 480
attaacgcct cgcgggccga tgagggactg cctgaaataa aagcagagga agaggaggta 540
gcgactaacg agaccggcca tcttcttcag ccacccggga taaatccctc attctatgtt 600
tatcaaacca tttcccctca ggcctaccgt ccgagagatg agatcgtgct gccacctgaa 660
tacgcaggct acgtaagaga tcctaatgcc cccattccac tgggcgtggt gaggaataga 720
tgtgacatcc aaaagggatg tcccggctat atccctgagt ggcagcgcga ggcgggcacg 780
gccatctctc cgaagaccgg aaaggccgtg actgtaccgg gcctgtcccc taagaagaat 840
aaacgtatga ggcgttattg gaggtccgag aaggagaagg cacaggacgc tttgcttgta 900
actgtccgca taggaactga ttgggttgtg atagatgtgc gcgggctgct caggaacgca 960
cggtggcgca cgatagcacc aaaagacata tcgctgaatg cgctcttaga cctgttcacg 1020
ggcgacccgg ttattgacgt gcggaggaat attgtcacct tcacttacac tctggatgcg 1080
tgcggtacat atgcgcgaaa atggactctg aagggcaagc agaccaaggc cactcttgat 1140
aagctcacag ctacgcagac cgttgcactg gttgccatag acctcggaca aacaaacccc 1200
atctccgcag gtatatccag agtcacccag gagaacggcg ccctgcagtg tgagccactg 1260
gatcggttta ctcttccaga cgaccttctt aaggacatct cagcataccg gatcgcgtgg 1320
gataggaacg aggaggagtt gcgggctaga tcagtcgaag cactccccga agctcagcag 1380
gccgaggtta gggcgctgga cggcgtttca aaggagaccg cccgcacaca actctgtgcc 1440
gatttcgggc ttgatcccaa gcggctccca tgggacaaaa tgagctctaa caccaccttc 1500
atatctgagg ctctcctaag caactctgtt tcccgcgacc aagtcttctt cactcccgcg 1560
ccaaaaaagg gcgctaagaa gaaggctcct gtggaagtta tgcgcaagga ccgtacctgg 1620
gctagggctt acaagcccag gctctcagtt gaagctcaaa aactcaagaa cgaagctttg 1680
tgggcgctta agaggacctc cccagagtac ctcaagctct caagacggaa ggaggagctc 1740
tgccgccgga gtattaatta tgtcattgaa aagacaagga gacgcacgca atgccaaatc 1800
gtcatccccg tgattgaaga tttgaacgtt cggttctttc acggctcggg taagaggctt 1860
ccgggctggg acaacttttt cacggcgaaa aaggaaaacc gttggttcat tcagggtctc 1920
cacaaggcct tctcggacct cagaacacat cgttcattct atgtcttcga ggttcggccc 1980
gaacgtacca gcattacatg ccctaaatgc ggccactgcg aggtaggcaa ccgtgacggc 2040
gaggcgttcc aatgcttgtc gtgcggcaaa acctgtaatg ctgatcttga cgtggccaca 2100
cacaatctta cccaggtggc actaacggga aagaccatgc cgaagcggga ggaaccgcgc 2160
gatgcccagg gcaccgcccc ggctcggaag acaaagaagg cgagcaagag caaggcgccg 2220
cctgcggaaa gagaggacca aacacctgct caggagccat cccaaacgag c 2271
<210> 172
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 172
atgccgaagc ccgcagtgga gagtgagttc tcgaaagtgc tcaaaaagca cttcccagga 60
gaacgattcc gctcctcata catgaagagg ggcggcaaga tcctggccgc ccagggtgag 120
gaggcagtgg tcgcttatct tcagggtaaa tcagaagagg agccaccaaa ctttcaaccc 180
cctgccaagt gccatgtggt caccaaatcc cgagactttg ccgagtggcc catcatgaag 240
gcctcggagg cgatacaacg gtacatctac gctctatcca cgaccgaacg ggcagcgtgc 300
aaacctggta aatcatctga aagccacgcc gcctggttcg ctgcaacggg cgtctcgaac 360
cacggctact ctcacgttca ggggctgaat ctcatattcg accatacgct gggcagatat 420
gacggtgttc tcaaaaaggt ccagttgagg aacgaaaagg ctcgcgcgcg cctcgagtct 480
atcaacgcgt cccgcgcgga cgagggccta cccgagatca aggcagagga agaagaggtc 540
gcgacgaatg agacgggcca cttgttacag cctccgggga ttaacccgag cttctacgtc 600
taccaaacta tttcaccgca agcataccgg cctagggatg aaattgtcct gccgccagag 660
tatgcaggct acgtgcggga cccaaacgcg ccgattcctt tgggcgtagt gagaaatagg 720
tgtgatatcc agaaggggtg ccccggctat attccagagt ggcagagaga ggcaggcacg 780
gccatttctc caaagaccgg taaggctgtc actgttccag gactatcccc aaagaagaat 840
aagaggatgc ggagatactg gcgtagcgag aaggagaagg ctcaggatgc actgctagtg 900
accgtgagga taggcacaga ttgggttgtt attgacgtga ggggcctatt gcggaacgct 960
cgttggcgga cgatcgcccc taaggatata tctctcaacg ctctccttga cctgtttacg 1020
ggggatccag tgattgacgt acgccgcaat attgttacgt tcacttacac tcttgatgcc 1080
tgtggcacgt acgcccggaa atggaccctc aaaggcaagc agaccaaggc cacgcttgac 1140
aaactcacag ctacccagac ggtcgcttta gtggctattg acttgggaca aaccaaccca 1200
ataagcgctg gaatctcaag agtgacccag gagaacggcg ctctgcagtg cgagccgctg 1260
gacagattta ctcttcccga cgacctccta aaagacatct ccgcctatag gatagcttgg 1320
gaccgaaacg aagaggagct gcgcgccagg tccgtcgagg cccttccgga ggctcaacaa 1380
gcggaggtgc gggcgctgga cggagtgagt aaggagactg cgcgtaccca gctgtgcgct 1440
gacttcggcc ttgatccgaa aaggctgccc tgggacaaga tgtccagtaa cacgaccttc 1500
atctcagagg cgcttctctc aaattcggtc tcccgagatc aggtgttctt cacgcccgcc 1560
cccaagaaag gtgccaaaaa gaaagccccc gttgaggtga tgaggaaaga tcgcacttgg 1620
gcccgcgctt acaaaccgcg cctctcggtg gaggcgcaaa agttaaagaa tgaggcgctt 1680
tgggcgctta agcgaacttc tccagagtat cttaagctct cacggcggaa agaggagctg 1740
tgcagaagga gtatcaacta tgtcatcgaa aagacgcgtc gtaggaccca atgccaaatc 1800
gttatccccg tcatcgagga tctgaacgtg aggtttttcc acgggtctgg caaacggctt 1860
cccggctggg ataatttttt caccgctaag aaagaaaatc gatggttcat ccaggggttg 1920
cacaaggcct tctcagatct caggacccac cgcagctttt acgtcttcga ggttcgccct 1980
gagcgcacct ctatcacttg ccctaagtgc ggtcactgcg aggttgggaa ccgggacggc 2040
gaggcttttc agtgcctctc atgcggcaag acctgcaacg cggacctaga tgtcgcaacg 2100
cataacctaa cccaggtggc ccttaccggc aagacgatgc cgaagaggga agaacccagg 2160
gacgcgcaag ggacggcacc agcgcgtaaa acgaaaaaag catcgaagtc aaaagcaccg 2220
cccgccgagc gggaagatca gacccccgcg caagaaccgt cacaaacatc c 2271
<210> 173
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 173
atgccaaaac ctgccgtcga atctgagttc tcaaaagttt tgaaaaagca cttcccaggg 60
gaaaggtttc gttcttcgta catgaagaga ggaggaaaaa tactggcagc ccagggagag 120
gaggctgtcg tagcctactt gcaaggcaag tctgaggagg agccaccaaa tttccaaccg 180
cctgcgaagt gccacgtcgt tactaaaagc cgggactttg cggaatggcc cattatgaag 240
gcgagtgagg ccattcaaag gtatatctac gctctgagta caaccgagcg tgccgcgtgt 300
aaacccggga aatcttctga gtcgcatgcc gcctggttcg cggcaactgg ggtctccaac 360
catggctact ctcatgtgca gggcctgaac ctcattttcg atcatacgct cgggcgatac 420
gatggagttt tgaagaaggt ccagttacgg aatgagaaag cccgggctcg ccttgagtct 480
ataaacgctt cgcgcgccga tgagggcctg ccagagatca aggcagaaga agaagaggtc 540
gcaacaaacg agactggcca ccttctccaa ccccccggca tcaatccaag cttctatgtc 600
tatcagacga tctcaccaca ggcataccgg cctcgcgatg agatagtctt gccaccagag 660
tacgctggtt atgtacgcga cccaaacgct ccgatcccgc tgggcgtcgt ccgcaatcgt 720
tgcgatattc agaagggttg tccgggatac atcccagaat ggcagagaga ggccggcacc 780
gcaatttccc caaagacagg gaaagcggtg accgtgcctg gcctctctcc aaagaaaaat 840
aagagaatga ggcggtactg gcggtcagag aaggaaaagg ctcaggacgc gctcctggtg 900
accgttcgca tcggcacgga ctgggtggtg atcgacgtca gaggcctgct ccgtaacgcg 960
aggtggcgta caatcgcccc caaagacatt agccttaatg ctctgctgga cctctttaca 1020
ggtgaccccg tcattgacgt caggaggaac atcgtaactt tcacttacac cctcgacgct 1080
tgcggaacct acgcgcgcaa gtggaccctg aagggaaagc agactaaggc caccctcgat 1140
aaacttacgg ccactcaaac tgtcgccctg gtcgctatcg acttgggcca aacgaatcca 1200
ataagcgccg gcatctcccg cgttacgcag gagaatggcg ccctccagtg tgagccgctg 1260
gaccgattca ctcttcctga cgatttgtta aaagatatat ccgcctacag aatagcgtgg 1320
gaccgcaacg aagaggaact cagggcccga tcggttgagg ccttgcctga ggcgcagcag 1380
gcggaggtga gagcgcttga tggcgtttct aaggagaccg cccgtacgca gctctgcgcg 1440
gatttcgggc tggaccctaa gcggctgccc tgggacaaga tgtcaagtaa tacaactttc 1500
ataagcgaag cccttctttc caactctgtg tcgcgtgatc aggtgttttt cacgccggca 1560
cccaaaaagg gtgctaagaa gaaggcccct gtggaagtga tgcggaagga tagaacttgg 1620
gcaagagcct acaagccccg tttatccgtg gaggcccaga agctcaagaa cgaggctttg 1680
tgggcgctca agaggacgag ccctgagtac ctcaaactca gccgcaggaa agaagagctc 1740
tgtagaagga gcatcaacta cgtcattgag aagacaagaa ggcgcaccca gtgccagatt 1800
gtgattccag tcatagagga cttgaacgtc agattctttc acgggtcggg taagaggttg 1860
cccggctggg ataatttctt cacagcgaaa aaggagaatc ggtggttcat tcagggtttg 1920
cataaggctt tttccgatct acgcacacac cgaagcttct atgtgttcga ggtccggcca 1980
gagcggacct caattacatg ccccaaatgc ggccattgcg aggtcggaaa ccgcgatggt 2040
gaggcctttc agtgcttgtc ctgcggcaag acttgcaatg ccgaccttga cgtggcaact 2100
cacaatctca ctcaggtggc gcttactgga aagacaatgc caaaaagaga ggagcctcgt 2160
gacgctcaag gaaccgcccc tgcccgaaag accaaaaagg cgtctaagag caaggcaccg 2220
ccagcggaga gggaggacca gactccagcc caagagccat ctcagacgtc g 2271
<210> 174
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 174
atgcccaagc cagcagttga gagtgaattc tcaaaggtcc tcaagaagca ttttccgggg 60
gagaggttcc gcagctccta catgaagagg ggcgggaaaa tccttgcggc acagggcgag 120
gaggctgtag tggcctattt gcagggcaag tcagaagagg agccaccaaa tttccagcct 180
cccgccaagt gccacgtcgt caccaagtct cgtgattttg cagaatggcc gattatgaag 240
gcttcggagg cgatccaacg gtacatctat gcattgtcaa ccacagaaag agctgcatgc 300
aaaccaggta agtcctccga gtcccacgct gcctggttcg ctgccaccgg cgttagcaat 360
catggctaca gtcacgttca gggacttaac ctcatctttg atcacacact cgggaggtac 420
gatggcgtac tgaagaaagt tcagttaagg aatgaaaagg caagggccag actggaaagc 480
atcaacgcca gtagggccga tgagggtttg cccgaaatca aagctgaaga agaagaagtt 540
gctactaacg agacgggcca tctccttcaa cccccgggaa taaacccatc cttctatgtc 600
taccagacca tctcaccaca ggcataccgg ccaagggacg aaatcgtcct cccaccggag 660
tacgccggct acgtcaggga cccaaacgct ccgatcccac tcggagttgt gcgcaacaga 720
tgcgatatcc agaaggggtg cccgggctac atccctgaat ggcagcggga ggccgggacc 780
gccataagcc ctaagacagg taaagcggtg acggtacctg gcctgtcacc caagaagaac 840
aagaggatga gacgttactg gaggtccgaa aaggagaaag cccaggatgc tctgctcgtg 900
acagtccgga tcgggaccga ttgggtcgtc attgacgtcc gaggcctctt gagaaatgcc 960
aggtggcgta cgattgcgcc taaagacatc tcccttaatg cgttattgga tttgttcaca 1020
ggcgacccag tgattgatgt ccgcaggaac atcgtgacct tcacctacac cctagacgct 1080
tgcgggacct acgcacggaa gtggacactt aagggcaagc aaaccaaagc tacacttgac 1140
aaactgacag caacccaaac tgtggcgctg gttgccatag acctgggcca aacgaacccg 1200
atttctgctg gtatttcccg tgtcacgcag gaaaatggtg cgctgcagtg cgaacccctg 1260
gaccgattta cactgcctga tgatctgcta aaggacatct ctgcgtaccg aattgcttgg 1320
gaccgcaacg aggaagagtt gcgtgcccgg agcgtcgagg ctttaccgga agcccagcaa 1380
gcggaggtta gagcgttaga cggcgtttca aaggagacgg ctcgcacgca gctttgcgcg 1440
gacttcggtc tcgaccctaa gcgcctcccc tgggataaga tgtctagcaa taccacgttc 1500
atatcggagg cactattatc gaattccgta tctcgtgacc aagtgttctt cactccagcg 1560
cccaagaagg gcgcgaagaa aaaagcacct gtagaggtga tgcgcaagga caggacctgg 1620
gcccgggcgt ataagccacg gttgagtgtt gaagctcaga agctcaagaa cgaggctctt 1680
tgggcgctga agcgcaccag tccggaatat ctgaagctaa gtaggcggaa ggaggagttg 1740
tgcagacgct caattaacta tgtgatcgag aagactcgca ggcgaaccca atgtcagatc 1800
gtcattcccg tgatagaaga tcttaacgtt cgatttttcc acggcagtgg caagcgacta 1860
cccggctggg acaatttttt taccgctaag aaggaaaacc gctggttcat ccagggatta 1920
cacaaagctt tctcggactt gaggacacac agaagcttct atgtctttga agtaagaccc 1980
gaaagaacaa gtataacgtg tcccaagtgc ggccattgcg aggtgggcaa tagggacggg 2040
gaggctttcc aatgcctctc atgcggtaaa acttgtaacg cggacctgga cgttgctacc 2100
cacaacctga cacaagtggc cctcacagga aagactatgc ccaagcgaga ggagcctagg 2160
gatgcccaag gtaccgcgcc ggctaggaaa acaaagaagg catcgaagtc caaggccccg 2220
ccagcagaac gggaagacca gacgccagcg caggagccca gccagaccag c 2271
<210> 175
<211> 2271
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 175
atgccaaagc cagccgtaga aagcgaattc tccaaggttt tgaagaagca tttcccggga 60
gagagattta gatcaagtta catgaagcgg ggtgggaaaa tactcgccgc ccagggcgag 120
gaggcagtag tcgcatatct tcaggggaaa tcagaggagg agccccccaa ctttcagccg 180
ccggcgaagt gccacgttgt caccaagagc cgtgatttcg cagagtggcc aataatgaag 240
gctagtgaag cgatccagcg ctacatctat gcactcagca ccaccgagag agctgcttgc 300
aagcctggca agagttccga gtcccatgct gcgtggttcg ctgctaccgg cgtcagcaat 360
cacggctact cccacgtcca aggcctcaat ctgatcttcg accatactct cgggcggtac 420
gatggcgtgc tgaagaaagt tcagctccgc aatgagaagg cccgcgcaag actcgaatcc 480
atcaatgcat ctagagcgga tgaagggctc cctgagataa aagcggagga agaagaggtg 540
gccacaaacg aaacgggtca cctactccaa ccgcccggca tcaatccgtc cttctatgtt 600
tatcagacta tttcaccgca ggcatatagg ccccgagacg aaatcgtgct tcctcccgag 660
tatgctggtt acgtgcggga ccctaacgcg cccattcccc tcggcgtggt caggaatcgg 720
tgcgacatac agaagggctg ccctggctac attccggagt ggcagagaga ggctggaaca 780
gcgattagtc ctaagactgg gaaggctgtt accgtgccgg gtctaagccc caaaaaaaac 840
aagcgtatgc ggcgctattg gaggtctgag aaggaaaaag ctcaagacgc cttgcttgtc 900
accgtgcgca ttgggacgga ttgggtggtt attgacgtca ggggcctcct gcggaatgcg 960
cgatggagga ccatagctcc gaaagatatt tcgctcaacg cgctgctgga cctgtttacg 1020
ggtgatccag tcatagacgt gaggcgcaat atcgtaactt tcacgtatac gctggatgct 1080
tgtgggacat atgcacggaa gtggaccctg aagggcaagc aaacgaaggc gacgctagat 1140
aaactcacag ccacccagac cgtcgccctc gtcgcaatcg atctaggtca gaccaaccct 1200
atttccgcag gtatcagtcg tgttacacag gaaaacggcg cgcttcagtg cgagccgttg 1260
gatcgcttca cactcccaga cgatctgctc aaggatatct ctgcttatag gattgcctgg 1320
gaccggaatg aagaggagct ccgtgctcga tccgtcgagg cgctgcctga ggcacaacag 1380
gctgaggtca gggcactaga tggcgtgtcc aaggagacag ccaggactca gctctgtgcg 1440
gactttggcc tcgatcccaa gaggctgccg tgggataaga tgtcctccaa tactacattt 1500
atctcggaag ctctgctgtc caactcggtc agtagagacc aagttttttt tacgccggcc 1560
ccgaagaagg gcgccaagaa gaaggcgccc gtggaagtta tgcggaagga caggacatgg 1620
gcccgcgctt ataagccccg cctctccgtg gaggctcaga agctcaagaa cgaagccctg 1680
tgggctttga agcgcacatc acctgagtat ctaaagctca gcaggaggaa agaagagctc 1740
tgtcgcagat ctatcaatta cgtaatcgag aagacgaggc ggcggactca atgccaaatc 1800
gttataccag taatagagga tttgaacgtg agattcttcc atggatcagg caaacgtctg 1860
ccgggctggg acaacttttt tactgctaag aaggaaaatc gatggtttat acaaggcctc 1920
cacaaggctt tctctgactt gcggacccat cgctccttct acgttttcga agttaggccc 1980
gagagaacaa gtattacctg cccgaagtgt ggccattgcg aggttggtaa ccgcgacggg 2040
gaggcgtttc agtgcctttc gtgcggcaag acctgcaacg cggacttaga tgtagcgaca 2100
cacaacttga cacaagttgc actaaccggt aagacgatgc caaagaggga ggagccgcgg 2160
gacgcgcagg gaacagctcc agcgagaaag acaaagaaag cgagcaaatc taaggcgcca 2220
ccggccgaga gagaagatca gactccagct caggaacctt ctcagacgtc c 2271
<210> 176
<211> 2298
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 176
atggagaagg agatcactga actgacaaag attcggcggg aattcccgaa taagaaattc 60
agtagcaccg acatgaagaa ggctggaaag ttactcaagg cagaaggtcc ggatgctgtc 120
agggatttcc tcaactcatg tcaggagatc atcggcgatt tcaaaccacc tgtcaagacg 180
aatatcgtca gcatctccag gcccttcgag gagtggcctg tgagtatggt cgggcgcgct 240
atccaggagt actatttctc tctcactaag gaggagctcg agtccgtgca ccctggtacc 300
tcttctgagg atcacaagag ctttttcaat attacagggc tgtccaatta caattacaca 360
tctgttcaag gcctaaatct tatcttcaag aatgccaagg cgatctacga tgggaccctc 420
gttaaggcca acaacaagaa taagaagctg gagaagaagt tcaatgagat aaaccataaa 480
cgctccctgg agggtctccc tatcatcacc ccggactttg aagagccatt cgacgagaac 540
ggccatctta ataacccacc aggcattaac aggaatatat acggttatca aggttgcgca 600
gcaaaggtgt tcgtaccctc caagcacaag atggttagct tacctaagga gtacgaagga 660
tacaaccgtg atccaaatct tagcttggca ggcttccgta atcgactaga gatcccggag 720
ggcgagccag gtcacgtacc atggtttcag cgtatggaca ttccggaagg tcaaattggc 780
catgtcaaca aaatccaaag gttcaacttc gtgcatggca agaactccgg taaagtaaag 840
ttctcggata agacaggtag ggtgaagcgt taccatcaca gcaagtataa agatgcgacg 900
aaaccgtata agtttctaga ggagagcaag aaggtgagcg cgcttgacag cattcttgct 960
attatcacaa tcggcgatga ttgggtcgtg tttgatatcc gggggctgta tagaaatgtt 1020
ttctacaggg agctagctca aaagggccta acggcggttc agctccttga cctcttcacg 1080
ggcgaccccg tgatcgatcc aaagaaggga gtggtgacat tctcatacaa ggagggggtt 1140
gtcccagtgt tctctcagaa gatcgttccc cggttcaaaa gccgcgatac tttagagaag 1200
ctgacctcgc agggtcctgt ggcgctgcta agcgtcgatc tggggcagaa tgagcctgtc 1260
gcggccaggg tgtgctcttt gaagaatatt aatgacaaga taaccctaga taattcatgc 1320
aggatctcct ttttggatga ctacaagaag cagatcaagg actacagaga cagtttggac 1380
gagctcgaga tcaagattcg tctcgaggcc atcaatagcc tcgagactaa ccaacaggta 1440
gaaatccgcg atttagacgt gttctcagct gaccgcgcca aggccaacac agtagatatg 1500
tttgacatcg accctaactt aatttcttgg gactctatga gcgacgctcg ggtcagtacg 1560
cagatctctg atctgtacct gaagaatggc ggcgatgaga gtcgcgttta ctttgaaatc 1620
aacaacaaga ggatcaagag gagcgactac aacatttctc agcttgtgcg acccaaactc 1680
tcggacagca cccgcaaaaa cctgaacgac tcgatctgga agcttaagag gacttcggag 1740
gagtacctga agctttccaa gagaaagctg gaattgtcaa gggccgtggt taattacacc 1800
atccgccaga gcaagttact ctcgggtatt aatgatatcg ttataatact cgaggactta 1860
gacgtgaaaa aaaagttcaa cggaagaggc attcgcgaca taggctggga caactttttt 1920
tcttcacgta aggagaatag gtggttcatc cccgcctttc ataaggcgtt ttccgagctg 1980
agctcgaata gaggcctttg tgtcattgag gtcaatcctg cctggacatc agcgacgtgt 2040
cccgattgcg gcttctgttc caaggagaac cgcgatggca tcaatttcac atgccgtaag 2100
tgcggcgtca gctaccacgc agatatcgac gtcgcgacgt tgaacatcgc cagggtggcg 2160
gtacttggta aaccaatgag cggcccagcg gacagagaaa gactcgggga taccaaaaag 2220
ccgcgggtag cgagatctcg caagacgatg aagcggaagg acatcagcaa ttccaccgtg 2280
gaagcgatgg tcacagct 2298
<210> 177
<211> 2298
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 177
atggagaagg agatcaccga gttgactaag attcgccgag aatttcctaa caagaagttc 60
tcgagcaccg atatgaagaa ggctggaaag ttgttgaagg ccgaggggcc ggatgctgtg 120
agggattttc tcaattcctg ccaggagatt atcggcgatt tcaagccgcc cgtcaagact 180
aacatagttt ccatttcgcg cccttttgaa gagtggcccg tgtcaatggt cgggcgcgct 240
attcaggagt attactttag ccttacaaag gaggagctcg agagcgtgca tcctgggact 300
tccagtgaag accacaagtc cttcttcaac attaccggcc tatcaaacta caattacacc 360
agcgttcagg gcctcaacct tatcttcaaa aacgcgaagg ctatctacga cgggaccttg 420
gtcaaggcca ataataaaaa caagaagctt gaaaagaagt ttaacgagat caatcacaag 480
cgctcgctcg aaggattacc aatcataacg ccagattttg aagagccctt tgacgaaaac 540
ggccatctga acaatccgcc cggtataaat agaaatatct atggttacca aggatgtgct 600
gcaaaggtct tcgtgccatc taaacacaaa atggtgtccc tccccaagga atacgagggt 660
tacaaccgcg acccaaactt gtcgttggct ggattcagga atcggctaga gatcccagaa 720
ggagagcctg gtcacgtccc gtggtttcag cgcatggaca tcccggaggg gcaaattggt 780
catgtgaaca agatccagag attcaatttc gtgcacggaa agaactcggg caaggtgaag 840
ttttctgaca aaacaggacg ggtgaagcgg taccaccatt cgaagtataa agacgccact 900
aagccttata agtttctgga agagtcgaag aaagtgtctg cgttagacag tatcctcgcc 960
attattacca tcggggatga ctgggttgtg tttgacattc gcggccttta taggaacgtg 1020
ttctacagag agctggcaca gaagggctta acggccgtcc agctgcttga cctttttacg 1080
ggcgacccgg tcatcgatcc aaagaaaggc gtagtgacgt tttcgtacaa ggaaggcgtc 1140
gtccctgtgt tctcgcagaa gatcgtgcct cgctttaagt ctcgagatac cctggagaag 1200
ttgacctcac aaggccctgt ggcccttttg agtgtggatc ttggtcaaaa cgagccagtg 1260
gcggcgaggg tctgctccct caagaatatt aacgataaga tcaccctgga caactcatgc 1320
cggatttcat tcctcgacga ctacaagaag caaatcaaag actaccgtga ttccctcgat 1380
gagctcgaga taaagataag gcttgaagcc attaactcac tggagacgaa ccagcaagtt 1440
gaaatcagag atcttgacgt cttttctgcg gaccgggcca aggccaatac ggtcgacatg 1500
ttcgacattg atcccaactt gatatcgtgg gacagtatga gtgatgctcg agtatcaacc 1560
cagataagcg acctgtactt aaagaacggc ggggacgaaa gcagagtata cttcgagatc 1620
aacaacaaac gcattaaacg gtcagactac aacatcagcc agcttgtgcg ccccaagttg 1680
agcgatagta cgcgcaagaa cctgaatgat tccatctgga agttaaagag gacctctgag 1740
gagtatttga agctgtcaaa acgcaagcta gagctctctc gcgccgtcgt aaattatacg 1800
attagacagt cgaaactatt gagcggcatt aacgacattg taattattct ggaggaccta 1860
gatgtgaaaa aaaaattcaa tggaaggggc atcagggata ttggctggga caactttttt 1920
agctcgagga aagaaaatcg ctggttcatc ccggccttcc acaaggcctt cagtgaactc 1980
tccagtaacc gggggctgtg cgtgatcgag gtgaatcccg cgtggaccag cgccacatgt 2040
cctgattgcg gattctgttc caaagagaac agagatggta ttaactttac ttgcaggaag 2100
tgtggggtga gttaccatgc tgacatcgac gtggcgaccc tcaacatcgc aagggtggcc 2160
gttcttggaa agcccatgtc aggcccggcc gatcgtgaaa ggctgggtga cacgaagaag 2220
ccgcgggtcg ctcgtagcag aaagacaatg aaaaggaagg acatttctaa ttcgaccgtg 2280
gaggccatgg ttaccgca 2298
<210> 178
<211> 2298
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 178
atggagaagg agataaccga gctcaccaag attcgtaggg agttccctaa caagaagttt 60
agcagcacag acatgaagaa agcgggaaag ctcctgaagg ctgaggggcc agatgctgtt 120
cgagactttc ttaactcgtg ccaagagatt atcggcgatt tcaagcctcc cgttaaaact 180
aacatcgtgt ccatatcgcg cccattcgag gagtggccgg tgtccatggt cggaagagcg 240
atacaggaat attacttttc acttaccaag gaggagttgg aatccgttca ccccggcacc 300
agttccgaag accataagtc tttctttaac attactggac tctccaacta caactacact 360
tccgttcagg gccttaatct cattttcaaa aacgctaagg caatctacga tgggacattg 420
gtcaaggcca ataacaagaa caaaaagctc gaaaagaagt ttaatgaaat taaccataag 480
aggtcgctgg aaggcctccc aattatcacc cccgactttg aggaaccgtt cgacgaaaac 540
ggccatctca ataacccgcc tggaatcaat cggaacatat atgggtatca gggctgcgcc 600
gcgaaggtct ttgtgccctc taaacataaa atggtatccc tcccaaagga gtacgagggt 660
tataataggg atcccaactt gtcccttgca ggatttcgga accggttgga aatccctgag 720
ggcgaacctg ggcacgtgcc gtggttccag cgtatggaca taccagaggg ccagattggg 780
catgtgaata agatccagag gtttaatttt gtccacggta agaacagtgg caaggtgaag 840
ttcagcgata aaacaggacg tgtcaagcgc tatcaccact ccaagtacaa ggacgccact 900
aagccttaca agttccttga ggagtcgaaa aaggtctctg cactcgactc aatcctggcg 960
atcattacaa tcggcgatga ttgggtggta ttcgatatcc gcggtctcta ccggaatgtt 1020
ttctaccggg agctagcgca gaaaggcctc acggctgtac agctattaga cctgttcacc 1080
ggcgacccgg ttatagaccc aaaaaagggg gtcgttacat tcagctacaa ggagggcgtc 1140
gtgccagtct tctctcaaaa gatagtgccg cgattcaagt ctcgggacac gctggagaag 1200
ctgacttcac aggggcccgt ggcactgctc agcgtggatc tcggccaaaa cgagcctgtg 1260
gcagccaggg tctgctcact gaagaatatc aacgataaga tcacacttga caacagctgc 1320
cgcatatcat ttctcgacga ttataagaag cagatcaagg actaccgaga ttccctggac 1380
gagctggaga ttaaaatcag actggaagca atcaattcgt tagagactaa ccagcaagtc 1440
gagataaggg atctggacgt attcagcgcc gacagagcta aagcgaatac cgtagatatg 1500
tttgacatcg acccgaatct gatctcctgg gattcaatga gcgacgcgcg cgtcagtact 1560
cagattagtg atctctacct taagaatggc ggcgatgaat cccgcgtgta ctttgagata 1620
aataacaagc gcatcaaaag gagcgattac aatattagcc agcttgtccg cccaaagctc 1680
agcgacagca ctcgcaaaaa tctgaatgac agcatttgga aacttaagcg caccagtgag 1740
gagtacctga agctctctaa gaggaagctg gaactcagcc gcgcagttgt taactatact 1800
atcaggcaat cgaaactcct ctccgggatt aacgacattg tgatcatcct ggaagatctc 1860
gacgtaaaga agaagttcaa cgggcgcggg attagagaca tcggctggga taactttttc 1920
tcttccagga aagaaaacag atggttcata cccgcattcc ataaggcctt ctccgaattg 1980
tcttcaaacc gtgggctgtg tgttatagaa gtgaacccag cgtggacatc cgcaacctgc 2040
ccagactgcg gcttttgctc gaaggaaaac agggatggca tcaattttac atgtcgcaag 2100
tgcggcgtct cttaccacgc agatattgat gttgcgacgc taaatattgc ccgggtggca 2160
gtcctgggaa agcccatgag cgggcctgct gatagggaga ggctgggcga tacaaagaag 2220
ccgagggttg cgcggtccag gaaaacaatg aagcgcaagg acataagcaa cagtacggtg 2280
gaggctatgg tcacggcc 2298
<210> 179
<211> 2298
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 179
atggagaagg agatcaccga gctgaccaag atcaggcgcg agttcccgaa caagaagttc 60
agctccaccg acatgaagaa ggccggcaag ctgctcaagg ccgagggccc ggacgccgtg 120
cgcgacttcc tcaactcctg ccaggagatc atcggcgact tcaagccgcc ggtgaagacc 180
aacatcgtgt ccatctcccg cccgttcgag gagtggccgg tgtccatggt gggcagggcc 240
atccaggagt actacttctc cctgaccaag gaggagctgg agtccgtgca cccgggcacc 300
tccagcgagg accataaatc cttcttcaac atcaccggcc tctccaacta caactacacc 360
agcgtgcagg gcctgaacct catcttcaag aacgccaagg ccatttacga cggcaccctc 420
gtgaaggcca ataataagaa caagaagctg gagaagaaat ttaatgaaat caaccacaag 480
cgctccctgg agggcctccc gatcatcacc ccggacttcg aggagccgtt cgacgagaac 540
ggccacctga acaacccgcc gggcatcaac aggaacatct acggctacca gggctgcgcc 600
gccaaggtgt tcgtgccgag caagcacaag atggtgtccc tcccgaagga gtacgagggc 660
tacaacaggg acccgaacct gagcctggcc ggcttcagga accgcctgga gatcccggag 720
ggcgagccgg gccacgtgcc gtggttccag aggatggaca tcccggaggg ccagatcggc 780
cacgtgaaca agatccagcg cttcaacttc gtgcacggca agaactccgg caaggtgaag 840
ttcagcgaca agaccggccg cgtgaagagg taccaccaca gcaagtacaa ggacgccacc 900
aagccgtaca agttcctgga ggagtccaag aaggtgtccg ccctcgactc catcctcgcc 960
atcatcacca tcggcgacga ctgggtggtg ttcgacatca ggggcctcta ccgcaacgtg 1020
ttctaccgcg agctggccca gaagggcctc accgccgtgc agctgctcga cctgttcacc 1080
ggcgacccgg tgatcgaccc gaagaagggc gtggtgacct tcagctacaa ggagggcgtg 1140
gtgccggtgt tctcccagaa gatcgtgccg aggttcaaga gccgcgacac cctggagaag 1200
ctcacctccc agggcccggt ggccctgctc tccgtggacc tcggccagaa cgagccggtg 1260
gccgccaggg tgtgcagcct gaagaacata aacgacaaga tcaccctgga caactcctgc 1320
cgcatctcct tcctggacga ctacaagaag cagatcaagg actaccgcga ctccctggac 1380
gagcttgaga ttaaaatacg cctggaggcc atcaactccc tggagaccaa ccagcaggtc 1440
gagatccgcg acctggacgt gttctccgcc gacagggcca aggccaacac cgtggacatg 1500
ttcgacatcg acccgaacct catcagctgg gactccatgt ccgacgccag ggtgtccacc 1560
cagatctccg acctctacct gaagaacggc ggcgacgaga gcagggtgta cttcgaaatc 1620
aacaataaac gcattaaaag gtccgactac aacatctccc agctggtgag gccgaagctc 1680
agcgactcca cccgcaagaa cctgaacgac agcatctgga agctcaagag gacctccgag 1740
gagtacctca agctctccaa gcgcaagctt gagctgagca gggccgtggt gaactacacc 1800
atccgccaga gcaagctcct gtccggcatt aatgacatcg tgatcatcct ggaggacctg 1860
gacgtgaaga agaagttcaa cggccgcggc atccgcgaca tcggctggga caacttcttc 1920
agctcccgca aggagaacag gtggttcatc ccggccttcc acaaggcctt ctccgagctg 1980
tccagcaaca ggggcctgtg cgtgatcgag gtgaacccgg cctggacctc cgccacctgc 2040
ccggactgcg gcttctgctc caaggagaac cgcgacggca ttaatttcac gtgccgcaag 2100
tgcggcgtgt cctaccacgc cgacatcgac gtggccaccc tcaacatcgc cagggtggcc 2160
gtgctgggca agccgatgtc cggcccggcc gaccgcgaga ggctcggcga caccaagaag 2220
ccgcgcgtgg ccaggagccg caagaccatg aagcgcaagg acatctccaa cagcaccgtg 2280
gaggccatgg tgaccgcc 2298
<210> 180
<211> 2298
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 180
atggagaagg agatcacgga actgaccaag attcgccggg agtttccgaa taagaagttc 60
agtagcaccg atatgaagaa ggctggcaag ctcctcaagg ctgaaggtcc ggatgctgtt 120
cgagatttcc tgaactcgtg ccaggaaatc atcggtgatt ttaagccgcc tgtgaagacc 180
aacatcgtgt cgatctctcg gcccttcgag gagtggccag ttagtatggt gggccgcgcg 240
atacaggagt attatttctc tttgaccaag gaggaattgg aaagtgtcca tcccggaacg 300
agctctgagg accataagtc tttcttcaat ataacgggtc tgtctaatta taattacacc 360
agcgtgcagg gtttgaatct catcttcaaa aatgccaagg cgatatacga tggtaccctt 420
gtcaaagcta acaataagaa caagaagtta gagaagaagt tcaatgaaat aaaccacaag 480
aggagcctgg aagggctgcc gattataact ccagatttcg aggagccctt tgatgagaac 540
ggccacctga ataatccccc cggtattaac agaaatatct acggctacca gggttgcgcc 600
gccaaagtgt tcgtgccctc caagcacaag atggtttcgc ttcccaagga gtacgagggc 660
tacaacagag atcctaacct ctcgttggcc ggtttccgga accgccttga aattcctgag 720
ggcgagcctg ggcacgtccc atggtttcag aggatggaca tcccggaagg ccagatcggc 780
cacgtgaaca aaatccaacg gttcaacttc gtacacggca agaactccgg taaagtcaaa 840
ttctccgaca agacgggaag agtcaaacgc taccaccaca gtaaatacaa ggacgctacc 900
aaaccataca agtttcttga ggaatcaaaa aaggtttcgg ccctcgactc tattctggct 960
atcatcacta tcggcgatga ctgggtagtg ttcgacatcc gcggcctgta ccgtaatgtt 1020
ttttaccgcg agctggcaca aaaggggctg actgccgtcc aactgctgga tctatttacg 1080
ggagacccag tcatcgatcc taagaaaggc gtagtcacgt tctcctacaa ggagggggtc 1140
gtcccggtgt tctcccagaa gatcgttccg cgcttcaaat ctcgcgacac tcttgagaaa 1200
ctcacttcac agggaccagt cgctttgtta tcagtcgatt tgggccaaaa cgaaccagtg 1260
gcggctagag tttgtagctt gaaaaatatc aatgacaaga tcactctcga taacagttgc 1320
cggattagct tcctcgatga ctataagaag cagatcaagg actaccggga cagcctagac 1380
gagctggaga ttaagatcag gttagaagcc atcaacagtc tcgaaactaa ccagcaagtc 1440
gaaattaggg acctggacgt cttctcagcg gatcgtgcga aggccaacac cgttgacatg 1500
ttcgatattg atccaaatct tatctcttgg gactccatgt cggatgcacg cgtctctaca 1560
caaatcagtg atttgtacct gaagaacggc ggcgacgagt ccagagtcta cttcgagatc 1620
aacaacaagc gcattaagag gtccgattat aacatctccc agctggtcag accaaaactc 1680
tccgattcta cgagaaagaa tcttaacgac agcatttgga agctcaagag aacctcggag 1740
gagtacctca agctttccaa gaggaaactt gagctttcgc gggctgtcgt taattacacg 1800
atacgccaat ccaaactact ctcgggtatt aacgacatcg ttatcattct cgaggacctc 1860
gatgtcaaga agaagttcaa tgggcgtggc atccgtgaca tcggctggga caacttcttc 1920
tctagccgca aggaaaacag gtggttcatc ccggcgtttc acaaggcgtt cagtgagctc 1980
tcctccaacc gtggtctgtg cgtaatcgag gttaaccccg cctggacctc cgccacgtgc 2040
cctgactgcg gtttttgcag caaggaaaac cgtgacggga tcaactttac ctgcagaaaa 2100
tgcggggtgt cctaccatgc agacatcgac gtagcaactc tcaacatcgc gagagtggct 2160
gtcctgggaa agccaatgtc agggccggct gacagagaaa gattaggcga cacgaaaaag 2220
cccagggttg cccgcagcag gaaaaccatg aaaaggaagg acatttccaa tagtactgtg 2280
gaggctatgg ttacggca 2298
<210> 181
<211> 2298
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 181
atggagaagg agattacaga gttgactaag ataaggagag agttcccgaa caagaagttc 60
tcatcgaccg acatgaaaaa ggcgggcaag ctcctgaagg cggagggccc agacgccgtg 120
cgggacttct tgaactcctg ccaggagatc attggtgatt tcaagccgcc agtgaagacg 180
aatattgtat cgatttctcg tccgttcgag gagtggcctg tttcaatggt aggccgagct 240
attcaggagt attacttttc tctcacgaag gaagaactgg agagcgttca cccggggaca 300
tcatccgaag accacaagtc atttttcaac atcacgggcc tgtccaatta caattacacg 360
agcgtgcaag ggctcaacct gatcttcaag aatgccaaag ccatctacga cggcacgcta 420
gttaaagcaa acaacaagaa caagaagttg gagaagaagt tcaacgagat caaccacaag 480
cgatccctgg agggcctccc catcatcact cctgactttg aggaaccttt cgacgaaaac 540
gggcacctta acaacccgcc tggcatcaat aggaatatct acggctacca agggtgtgcg 600
gcgaaagtat ttgtgccgtc aaaacacaag atggtctccc tgccgaagga gtatgaaggc 660
tataatcggg acccgaatct ctctctggca gggttcagga accgcctgga aatccctgag 720
ggtgagcctg gccacgttcc atggtttcag aggatggata tccctgaggg ccagataggc 780
catgtgaata agattcaacg cttcaacttt gttcatggga agaactcggg caaagttaag 840
ttctctgata agaccggcag ggtgaagaga tatcatcact caaagtacaa ggacgctaca 900
aaaccctaca aattcctcga ggagtcgaag aaggtaagcg cactcgatag catccttgcg 960
attatcacca tcggggacga ttgggtggtt tttgacatcc gtgggctgta cagaaacgtg 1020
ttctaccggg agcttgctca gaaaggcttg actgcagtgc agttgctcga tctttttact 1080
ggcgaccctg tgatcgatcc aaagaagggc gtggtcactt ttagttacaa ggagggcgtg 1140
gtgccagtct tttctcagaa gatcgtgcct cggtttaagt cgagagacac actcgagaag 1200
ctgacctcgc aaggacccgt cgctctgctg agcgtagacc ttgggcagaa cgagcctgtc 1260
gcagcacggg tttgttccct gaagaacatc aatgacaaaa taacactgga taatagctgt 1320
cgtataagtt tccttgacga ttataagaag cagattaaag attatcggga ttctctcgac 1380
gagctggaga taaagattag gctggaggcg attaactcgc tggaaacgaa ccagcaggtg 1440
gagatcaggg atctcgatgt cttctccgcg gaccgtgcaa aggcgaatac agtggatatg 1500
ttcgacatcg atcccaacct gatatcttgg gacagcatgt cggatgcgag agtaagtacg 1560
cagatcagcg atttatacct caagaacggg ggggacgagt cccgtgtgta ctttgagatt 1620
aacaacaagc gcattaaacg cagcgactac aacatttcac agcttgtacg accgaagctg 1680
tcggatagca ctcggaagaa cctcaatgac tcaatttgga agcttaagcg gacctcagaa 1740
gagtatttga aactgtcaaa gcgcaaactg gaactgagcc gtgcggtcgt taactacact 1800
atccggcaaa gtaagctcct ttcagggatc aacgacatag tgatcatcct tgaggacctc 1860
gacgttaaga agaaatttaa tggccgtggt attcgggaca tcggctggga caactttttc 1920
tccagtcgca aagagaacag gtggttcatc cctgccttcc acaaagcttt ctcagagctg 1980
tcgtcgaatc gcgggctctg cgtgattgag gtcaatccag catggacttc cgctacgtgc 2040
cctgattgtg ggttctgctc taaggagaac cgcgacggta tcaactttac gtgcaggaaa 2100
tgcggcgtct cttaccacgc cgatatagat gttgcgactc ttaacatcgc aagggttgcc 2160
gtccttggga agccaatgtc aggacctgcc gacagagagc gtctgggcga cactaagaag 2220
cctagagtcg ccaggtcccg taagaccatg aaaagaaagg atatttctaa ctcgacagtt 2280
gaggccatgg taaccgca 2298
<210> 182
<211> 2298
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 182
atggaaaagg agataacgga gcttactaag atccgccgcg agttcccaaa caagaagttc 60
agcagtaccg acatgaagaa agcagggaag ctcctgaagg ctgaggggcc ggatgctgtt 120
agagacttcc tgaacagctg ccaggagatt attggggact tcaagccgcc ggtgaagacg 180
aacatagtgt cgatatcacg accgttcgag gaatggcccg tgtccatggt aggaagggcc 240
atacaagaat actacttctc actgacaaaa gaggagctgg aaagcgtgca ccccgggacc 300
agctctgagg atcacaagag ttttttcaac atcactggat tgagcaacta taactatacg 360
tcagtgcaag gcttgaattt gatcttcaaa aacgccaaag ccatatacga tggcactctc 420
gttaaggcga acaacaagaa caagaaactg gagaagaagt tcaacgagat aaatcacaag 480
cgctcactgg aagggctacc gattattacg ccggacttcg aggagccatt cgacgagaac 540
gggcatttga acaatcctcc cggtattaac agaaatattt acggctatca aggctgtgcc 600
gccaaagttt ttgtgccaag caaacacaag atggtcagtc tgccgaagga gtatgaaggt 660
tacaaccggg accctaattt atcactcgcc gggttccgca ataggttaga gatccctgag 720
ggtgagcccg ggcacgttcc gtggtttcag aggatggaca tcccggaggg ccagattggg 780
catgttaaca agattcagcg gttcaacttc gtgcacggca agaactccgg gaaagtgaaa 840
ttcagtgaca agactggacg cgtcaagcgc tatcaccaca gcaagtacaa ggatgccacg 900
aagccgtaca agttcctgga ggagtctaag aaggtttccg cactcgattc catcctcgca 960
attataacta tcggagatga ttgggtcgtc tttgacatca gagggcttta cagaaatgtg 1020
ttttacaggg agctcgccca gaagggtctc accgctgttc agctgctgga cctgtttact 1080
ggcgatccag tgatcgaccc taagaaaggt gttgtcacgt tcagctataa agagggggtc 1140
gtgcccgtct ttagccaaaa gatcgttccg cggttcaaat cccgggacac actcgaaaag 1200
ctcacctccc agggtcctgt ggccctacta agtgtagacc ttggacagaa cgagcctgtg 1260
gcagcgcgcg tgtgcagtct aaagaatata aacgacaaaa taaccttgga taattcatgc 1320
aggatttcct tcctcgatga ctacaagaag cagatcaagg actatcggga cagccttgac 1380
gagctggaga tcaagatcag gctggaggcc atcaacagtc tggaaactaa ccagcaagtc 1440
gagatcaggg atctcgatgt gttttctgct gatagggcga aagcaaacac cgtagacatg 1500
tttgacatcg acccaaacct gatctcgtgg gattccatga gcgacgcccg cgtcagtacc 1560
cagatatctg atttgtactt aaagaacggc ggagacgagt ctcgcgttta ctttgagatc 1620
aataacaaga ggatcaagcg aagtgattat aatatctctc aactggttag gccgaagctt 1680
tcagacagca cgcggaaaaa ccttaatgac tcgatctgga aactcaagcg tacgtccgaa 1740
gagtatctga aactcagcaa acgaaagctg gagttgtcac gggcggtggt gaactatacg 1800
attaggcagt ccaaattgct atctgggata aacgacattg tcataatctt ggaagacctg 1860
gatgtgaaga agaaattcaa cgggagagga atcagggaca ttggttggga taacttcttc 1920
agttcccgca aggagaatcg gtggttcatc ccagcatttc acaaagcctt ctcggaattg 1980
tcaagcaacc gtggtttatg cgtcatagag gtgaacccag cttggacttc agcgacgtgc 2040
cccgattgcg gcttctgcag caaagagaac cgagacggga tcaacttcac ctgccggaaa 2100
tgcggcgtgt cgtaccatgc ggacattgac gtggcaacgc tcaatatcgc ccgtgtggca 2160
gtgcttggca agccaatgag tggtccggct gatagggagc gtctcgggga taccaagaag 2220
ccgagggttg cgcgctcccg gaagactatg aagcgcaagg acatatccaa tagtaccgtt 2280
gaggcgatgg tcacggca 2298
<210> 183
<211> 2298
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 183
atggaaaaag aaatcacgga actcacaaag atccgtaggg agttccctaa taaaaaattc 60
tcgtccacag acatgaagaa ggccggtaag cttctcaagg ctgagggccc tgacgcggtg 120
cgcgacttct tgaattcctg ccaggagatc atcggtgact ttaaacctcc agtgaagacg 180
aacatagtct ctatttctcg ccctttcgaa gaatggccag tgtcaatggt cgggcgcgcg 240
attcaggagt attacttttc tctcacgaag gaagagctag agtcagtgca cccgggtaca 300
agctctgagg atcacaaatc cttttttaac attaccggct tatccaacta taactacact 360
tccgtacagg ggttgaacct gatcttcaag aatgcaaagg ctatttacga tggcacgctc 420
gtgaaggcga ataacaaaaa taaaaagctc gaaaagaaat tcaacgaaat caaccacaag 480
cgatccttag aaggcttgcc gatcatcaca cctgacttcg aggagccatt tgacgagaac 540
ggtcacctca ataatccacc cggaattaat cgtaacatct atgggtacca aggatgtgcg 600
gccaaggtct tcgtcccgtc aaagcacaaa atggtgtcgc tccctaagga atatgagggc 660
tacaacaggg acccgaacct ctctcttgca gggttccgta atagacttga aattccagag 720
ggtgagccgg gacatgtgcc atggtttcaa cggatggaca ttcccgaggg gcagatcggc 780
cacgtgaaca agatacagag gtttaacttc gttcacggta agaactccgg caaagtcaag 840
tttagcgata aaactggcag agttaagcgg tatcaccatt ccaagtataa ggatgccacc 900
aagccctata agtttctgga agagagcaag aaggtttccg cacttgattc cattctcgca 960
atcatcacca tcggggacga ctgggttgtt tttgacatcc ggggactgta caggaacgtc 1020
ttttatcggg agctggcgca gaaagggctt acggccgtgc aattgctgga tttattcact 1080
ggcgatccgg tgatcgaccc aaaaaagggc gtcgtgacct tttcctacaa ggaaggcgtg 1140
gtccccgttt ttagtcaaaa gatcgtccct aggttcaaaa gccgagacac attggagaaa 1200
ctcaccagcc agggccctgt cgccttgctg tctgtcgatc ttggacaaaa cgaaccggtg 1260
gctgcgcgag tctgcagcct taagaacatt aacgataaga ttaccttgga taactcatgt 1320
cggatcagtt tcctggacga ttataagaaa caaatcaagg actatagaga ttccttagat 1380
gagctcgaga tcaagatccg gctggaggct attaactcgc tggagaccaa tcagcaggtg 1440
gaaatccggg acttggacgt attcagcgcc gaccgggcga aggcgaacac agtcgacatg 1500
ttcgacatag atcccaacct tatcagctgg gacagcatgt ctgacgcacg ggtctctacc 1560
cagatcagcg acctctacct caaaaatggg ggtgacgagt ctagagtgta ctttgagatc 1620
aacaataagc ggattaaacg ctcggactac aatatcagcc aattggtcag gccaaagtta 1680
tccgactcga caagaaagaa cctcaatgat tccatttgga agctgaagcg tacttctgag 1740
gagtatctga agttgtctaa gaggaagctt gagctgagcc gggcagtagt gaattatact 1800
atccgccaga gcaagcttct tagcggcatt aacgacatcg tcatcatcct cgaagatttg 1860
gacgtcaaaa agaagttcaa tggacgcggt atcagagata ttggctggga caactttttc 1920
agcagcagga aggaaaatag gtggttcatt ccggcgtttc acaaggcttt ctcagagctg 1980
tcgtcaaaca ggggattatg tgttatcgag gtgaaccctg catggaccag cgcgacttgc 2040
ccggattgcg gtttctgctc caaggaaaat cgggacggca ttaactttac ttgccgcaag 2100
tgtggcgtct cataccacgc ggacattgat gtcgctactc tcaatattgc ccgggtggca 2160
gtattgggca aacctatgtc cggccccgca gaccgggagc ggttagggga cacgaagaaa 2220
ccgcgcgtcg ctagatcacg taaaactatg aagagaaagg acatatctaa ctccaccgtt 2280
gaggccatgg tgacagca 2298
<210> 184
<211> 2298
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 184
atggagaagg agatcaccga gttaaccaag atccggagag agtttccaaa taaaaaattt 60
tctagcaccg atatgaagaa ggcaggtaag cttctgaagg cggagggacc cgacgcagtg 120
cgtgatttcc tgaactcgtg ccaagagatc attggggatt tcaaacctcc ggtgaagact 180
aatatagttt ctatcagtag accttttgag gagtggccag tctctatggt gggccgggca 240
atccaggagt attacttctc ccttacaaag gaagagctgg aaagcgtgca tccagggacc 300
agcagtgagg accataagag cttctttaac atcacgggtc ttagcaacta taattacaca 360
agcgttcagg ggctcaatct aatcttcaag aacgcaaaag ctatttatga cggtaccctg 420
gtcaaggcaa acaataagaa taaaaagctg gagaagaaat ttaatgagat caatcacaaa 480
cggtcactgg agggactgcc gatcataacc cccgattttg aagagccctt cgatgagaac 540
ggtcatctta ataacccacc agggatcaac cgcaatatat atgggtacca ggggtgtgcg 600
gctaaggtct ttgtgccctc gaaacataag atggtctccc tcccaaagga atatgagggg 660
tacaaccgtg acccgaacct tagcctggca gggttccgca atcgcttaga gatccccgag 720
ggagagcccg gtcacgtgcc gtggtttcag aggatggaca ttcccgaggg tcagatcggg 780
catgtcaaca agatccaaag attcaacttc gtgcacggca agaactccgg gaaggtcaag 840
ttttccgaca agaccgggcg tgtcaagcgg taccatcact ccaagtacaa ggatgccacc 900
aaaccgtata agttcctaga ggagtccaaa aaggtttccg ctctggactc tatccttgcg 960
attatcacga ttggggacga ctgggtggtg ttcgacataa gaggcctgta ccgtaacgta 1020
ttttataggg agcttgcaca aaagggcctc accgcagttc aactgctgga cctttttacc 1080
ggcgatcctg ttatcgaccc gaaaaagggc gtggtgacct tctcttacaa agagggtgtc 1140
gtgcccgttt tctcacagaa gatagtccct cgcttcaaat cacgggacac gctcgagaag 1200
ctgacatctc aaggtcccgt ggccctcctg agcgtcgatc tgggtcagaa cgagccggtc 1260
gcggctaggg tttgctcact gaagaacatc aacgacaaga taacactcga taacagctgc 1320
cggatctcct tcctagatga ctacaagaag cagattaagg actatcgtga ttctctcgat 1380
gagttggaga ttaaaatcag actcgaagcc attaactccc tcgagacgaa ccaacaggtt 1440
gagatccgcg acctcgacgt attcagtgcg gaccgggcta aggcaaatac cgtcgacatg 1500
ttcgacattg accccaacct gatttcttgg gatagcatgt cggacgcccg cgtgtctacg 1560
cagatttccg atctgtacct taaaaatggg ggagatgaga gtcgtgttta cttcgagatc 1620
aataacaagc gcattaagcg gtcagactac aatatctcgc agctggtcag acctaagctg 1680
tctgatagca cgaggaagaa tctcaacgac agcatttgga agttgaagcg aacaagtgaa 1740
gaatacttaa agctaagcaa acgcaagctg gaactgtctc gcgcggtggt taactacacg 1800
atccgacaat cgaagcttct gtccgggatc aacgatatcg tcataattct tgaagacttg 1860
gacgtgaaaa aaaagttcaa tggtaggggc attcgcgaca ttgggtggga taacttcttt 1920
tcctcccgca aggagaatcg ctggtttatc cctgccttcc ataaggcctt ttccgagctg 1980
agtagtaatc gggggttgtg tgttatagaa gtcaatcccg catggactag cgccacttgt 2040
cctgattgcg gattctgctc caaggaaaac cgggacggca tcaactttac gtgccggaag 2100
tgcggggttt cttaccatgc tgacatcgat gtggcgacct tgaacatagc tagagtagct 2160
gtcttgggta agccgatgtc cgggcctgcg gacagggaga gactgggcga cacgaagaag 2220
cctcgagtcg caagaagtag aaagaccatg aagaggaagg acatatccaa ttctactgtc 2280
gaggcaatgg tgaccgct 2298
<210> 185
<211> 2298
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 185
atggagaaag aaataaccga gctcaccaag atcaggcgag agtttcccaa taagaagttc 60
tccagcaccg acatgaaaaa ggctggtaag cttctcaaag ccgagggtcc ggatgcagtc 120
agagattttt tgaactcctg ccaggagata ataggtgact ttaagccacc agtcaagaca 180
aacattgttt ctatcagccg cccctttgag gagtggccgg tgagcatggt gggcagggcg 240
atccaggagt attatttcag cctcactaag gaagagctgg aaagcgttca tcctggcacc 300
agctccgaag accacaaaag tttcttcaac atcacgggtc tctccaatta taattacacc 360
agcgtgcaag gattgaatct gatttttaag aacgctaagg cgatttacga cggaaccctc 420
gttaaagcca ataacaagaa caagaagttg gaaaagaagt ttaacgaaat caatcataag 480
aggagcctgg agggcctgcc gatcatcacc cctgacttcg aggagccctt tgacgaaaat 540
ggacatttga acaacccacc aggtataaac aggaacatct acggttacca ggggtgcgct 600
gccaaagtgt tcgtaccatc caagcataag atggtgtcgc ttcccaaaga gtatgaagga 660
tataatcgag acccaaacct ttcccttgct ggtttccgca accggctgga gatcccagag 720
ggtgagccag gacatgtgcc gtggtttcaa agaatggata taccggaggg ccaaattgga 780
catgtcaata agattcagcg cttcaatttc gtccatggga aaaatagcgg gaaggtgaag 840
ttttcagata agaccgggcg agtgaaacgg taccaccaca gtaagtataa ggatgcgaca 900
aagccatata aattcttgga agagagcaaa aaggtttcgg ccctcgattc aatactggca 960
atcatcacta tcggtgatga ctgggtcgtg tttgacattc gcgggctata taggaatgtt 1020
ttctaccgcg agcttgcgca gaaggggctc accgccgtgc agctgctgga cctctttacg 1080
ggtgatcccg tcatcgatcc taagaaaggc gtggttacgt ttagttataa agaaggggtg 1140
gtgcccgtct tcagtcaaaa aattgtgcca aggttcaagt cacgtgatac attggaaaag 1200
cttacgtctc aaggcccggt agccttgctc tccgtggacc taggccagaa cgagccagtc 1260
gctgcacgcg tgtgttcact taagaacatt aacgacaaaa tcactctcga caacagctgc 1320
cgcatttcat ttctggacga ctacaagaaa caaatcaagg actacaggga ctcgctggat 1380
gagctcgaaa ttaaaatccg gctggaagcc atcaactcac tggagaccaa ccaacaggtt 1440
gagattagag atctagacgt cttctcggcg gaccgagcca aagccaacac agtggatatg 1500
tttgacattg acccaaacct gatcagttgg gactctatgt ccgatgcgcg cgtatcaaca 1560
cagatctctg acctctacct caaaaatgga ggtgacgagt cccgcgttta ctttgagatc 1620
aataataagc gcatcaagcg ttcggactac aatattagcc agctggtaag gcccaagctt 1680
tccgacagca cacgtaaaaa cctgaatgat tcgatttgga aactcaagcg cacctcggag 1740
gaatacctaa aactctcgaa gcgaaaactc gagctttccc gggcggtcgt aaactacacg 1800
attaggcaaa gtaaactcct ctccggtatt aacgatattg taattatcct tgaggacctt 1860
gatgtaaaga agaaattcaa cggacgaggc atccgggata tcggctggga taacttcttt 1920
tcttcgcgca aagaaaaccg gtggtttatc ccggccttcc ataaggcttt cagcgagctt 1980
tcttccaacc gagggttgtg cgtgatagaa gttaacccgg cctggacgag cgcaacctgc 2040
ccggattgtg ggttctgtag caaggagaac agggatggga tcaacttcac gtgtaggaag 2100
tgcggggtgt cctaccacgc cgatatcgac gttgccacgc tcaatatcgc ccgcgtggcc 2160
gtgctgggca agcccatgtc gggccctgcc gatagagaga ggctggggga cactaagaaa 2220
ccgcgtgtcg cacgaagccg caagacgatg aagaggaagg atatctccaa ttccaccgtc 2280
gaggcaatgg ttaccgcg 2298
<210> 186
<211> 36
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 186
ggagagatct caaacgattg ctcgattagt cgagac 36
<210> 187
<211> 36
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 187
gtcggaacgc tcaacgattg cccctcacga ggggac 36
<210> 188
<211> 36
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 188
accaaaacga ctattgattg cccagtacgc tgggac 36
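Each sequence row in the listing above holds up to six space-separated blocks of ten residues, followed by a running residue count. The following is a minimal sketch, not part of the patent, of how such rows could be joined back into a contiguous sequence and checked against the <211> length field. The function name parse_sequence_rows is a hypothetical helper chosen for illustration; the input is the single row of SEQ ID NO:186 as printed above.

```python
def parse_sequence_rows(rows):
    """Join sequence-listing rows, dropping the trailing running count on each row."""
    parts = []
    for row in rows:
        tokens = row.split()
        # The last token of a row is the cumulative residue count, not sequence data.
        if tokens and tokens[-1].isdigit():
            tokens = tokens[:-1]
        parts.append("".join(tokens))
    return "".join(parts)


# SEQ ID NO:186 as printed in the listing (declared length <211> 36).
seq_186 = parse_sequence_rows(["ggagagatct caaacgattg ctcgattagt cgagac 36"])
declared_length = 36
length_matches = len(seq_186) == declared_length
```

Comparing the joined length with the <211> field is a quick way to catch rows that were lost or merged during text extraction.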

Claims (54)

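1. A method of modifying an endogenous soybean gene in a soybean genome, the method comprising:
(a) introducing, into a soybean plant cell comprising a synthetic polynucleotide encoding an RNA-guided endonuclease (RGE) or an RNA-guided nickase (RGN), a guide RNA or a polynucleotide encoding a guide RNA directed to a target editing site in the endogenous soybean gene and, optionally, a donor template DNA molecule having homology to the target editing site, wherein the synthetic polynucleotide:
(i) has a GC (guanine and cytosine) content greater than 47% or 48%;
(ii) has a melting temperature (Tm) greater than 89 or 90 degrees Celsius;
(iii) has a soybean codon adaptation index (sCAI) lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the RGE or the RGN;
(iv) or any combination of i, ii, and iii; and
(b) selecting a modified soybean plant cell, soybean plant, soybean plant part, soybean tissue, or soybean callus comprising the modification of the endogenous soybean gene.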
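The GC-content criterion in claim 1(i) can be illustrated with a short sketch. The code below is not taken from the patent; it simply counts g and c residues and compares the percentage with the 47% and 48% thresholds named in the claim. The function name gc_percent is hypothetical, and the 36-nt SEQ ID NO:186 is used as input only because it is short enough to check by hand; it is not one of the nuclease-coding polynucleotides the claim is directed to.

```python
def gc_percent(sequence):
    """Return the percentage of residues that are guanine or cytosine."""
    seq = sequence.lower()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "gc")
    return 100.0 * gc / len(seq)


# SEQ ID NO:186 from the sequence listing, used purely as a small worked input.
example = "ggagagatctcaaacgattgctcgattagtcgagac"
gc = gc_percent(example)   # 17 of 36 residues are g or c
above_47 = gc > 47.0
above_48 = gc > 48.0
```

For this input 17 of 36 residues are g or c, so the sequence would pass a 47% threshold but not a 48% threshold.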
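2. A method of modifying an endogenous soybean gene in a soybean genome, the method comprising:
(a) introducing into a soybean plant cell:
(i) a synthetic polynucleotide encoding an RNA-guided endonuclease (RGE) or an RNA-guided nickase (RGN), wherein the synthetic polynucleotide has a GC (guanine and cytosine) content greater than 47%, 48%, or 50%, has a melting temperature (Tm) greater than 89 or 90 degrees Celsius, has a soybean codon adaptation index (sCAI) lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the RGE or the RGN, or any combination of said GC content, said Tm, and said lower sCAI;
(ii) a guide RNA or a polynucleotide encoding a guide RNA, the guide RNA or polynucleotide encoding a guide RNA being directed to a target editing site in the endogenous soybean gene; and optionally
(iii) a donor template DNA molecule having homology to the target editing site; and
(b) selecting a modified soybean plant cell, soybean plant, soybean plant part, soybean tissue, or soybean callus comprising the modification of the endogenous soybean gene.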
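The sCAI comparison recited in claims 1 and 2 can be sketched as follows. This excerpt does not state how the patent computes sCAI, so the code assumes the conventional codon adaptation index: the geometric mean of per-codon relative-adaptiveness weights. The weight table is a hypothetical toy table, not real soybean codon-usage data, and the names codon_adaptation_index, toy_weights, reference, and candidate are illustrative only.

```python
import math


def codon_adaptation_index(cds, weights):
    """Geometric mean of relative-adaptiveness weights over the codons of a coding sequence."""
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3].lower() for i in range(0, usable, 3)]
    # Codons missing from the table are skipped rather than scored.
    logs = [math.log(weights[c]) for c in codons if c in weights]
    if not logs:
        raise ValueError("no scorable codons")
    return math.exp(sum(logs) / len(logs))


# Hypothetical weights (NOT real soybean values): gag/gaa encode Glu, aag/aaa encode Lys.
toy_weights = {"gag": 1.0, "gaa": 0.5, "aag": 1.0, "aaa": 0.25}

reference = "gagaaggagaag"   # Glu-Lys-Glu-Lys using only top-weighted codons
candidate = "gaaaaagagaag"   # same peptide, two codons swapped for lower-weighted synonyms

cai_reference = codon_adaptation_index(reference, toy_weights)
cai_candidate = codon_adaptation_index(candidate, toy_weights)
candidate_is_lower = cai_candidate < cai_reference
```

Both toy sequences encode the same peptide; only the synonymous codon choice differs, which is the situation the claims describe when a synthetic polynucleotide has a lower sCAI than the codon-optimized reference.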
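3. The method of claim 1 or 2, wherein the RGE comprises a Type II Cas endonuclease, a Cas9 endonuclease, a Type V Cas endonuclease, a Cas12a endonuclease, a Cas12c endonuclease, a CasX endonuclease, a Cas12j endonuclease, or an engineered endonuclease.
4. The method of claim 1 or 2, wherein the RGN comprises a Type II Cas nickase, a Cas9 nickase, a Type V Cas nickase, a Cas12a nickase, a Cas12c nickase, a CasX nickase, or an engineered nickase.
5. The method of claim 1 or 2, wherein the RGN comprises a mutation in an HNH or RuvC-like nuclease domain, or optionally wherein the mutation is: (i) a D10A mutation in the Cas9 protein of SEQ ID NO:1; (ii) an R1226A amino acid mutation in the FnCpf1 protein of SEQ ID NO:25; or (iii) an R1138A mutation in the LbCpf1 protein of SEQ ID NO:73.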
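6. The method of claim 1 or 2, wherein the synthetic polynucleotide has at least 76% sequence identity over the full length of any one of:
(i) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:3-12, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2;
(ii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:15-24, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:14;
(iii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:27-36, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:26;
(iv) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:39-48, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:38;
(v) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:51-60, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:50;
(vi) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:63-72, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:62;
(vii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:75-84, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:74;
(viii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:87-96, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:86;
(ix) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:99-108, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:98;
(x) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:122-130 and 131, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(xi) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:134-142 and 143, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133;
(xii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:146-154 and 155, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145; or
(xiii) at least 77% sequence identity over the full length of any one of SEQ ID NO:122-131, 134-143, or 146-154.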
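The "sequence identity over the full length" language in claim 6 can be illustrated with a simplified sketch. The code below is not the patent's method: it assumes two sequences of equal length compared position by position without gaps, which is only a rough stand-in for an alignment-based identity calculation. The function name percent_identity is hypothetical; the two inputs are the first ten residues of SEQ ID NO:181 and SEQ ID NO:182 as printed in the sequence listing.

```python
def percent_identity(seq_a, seq_b):
    """Ungapped, position-by-position percent identity for equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("this sketch only handles equal-length sequences")
    if not seq_a:
        raise ValueError("empty sequence")
    matches = sum(a == b for a, b in zip(seq_a.lower(), seq_b.lower()))
    return 100.0 * matches / len(seq_a)


# First ten residues of SEQ ID NO:181 and SEQ ID NO:182 from the listing.
identity = percent_identity("atggagaagg", "atggaaaagg")
meets_76_percent = identity >= 76.0
```

The two ten-residue fragments differ at one position, so they are 90% identical under this simplified measure. Sequences of different length, or with insertions and deletions, would need a proper pairwise alignment before identity is counted.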
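7. The method of claim 1 or 2, wherein the synthetic polynucleotide has a GC content greater than 48% or 50% and has at least 70% sequence identity over the full length of any one of:
(i) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:3-12, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2;
(ii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:15-24, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:14;
(iii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:27-36, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:26;
(iv) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:39-48, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:38;
(v) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:51-60, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:50;
(vi) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:63-72, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:62;
(vii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:75-84, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:74;
(viii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:87-96, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:86;
(ix) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:99-108, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:98;
(x) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:122-130 and 131, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(xi) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:134-142 and 143, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133; or
(xii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:146-154 and 155, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145.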
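8. The method of claim 1 or 2, wherein the synthetic polynucleotide:
(a) has more than 80% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:3-11 and 12 and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2;
(b) has more than 80% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:3-12; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2; or
(c) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:3-12; and has a GC content greater than 48% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2;
(d) has more than 80% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-131 and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(e) has more than 80% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:122-131; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(f) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:122-131; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(g) has more than 80% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-143 and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133;
(h) has more than 80% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:134-143; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133;
(i) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:134-143; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133;
(j) has more than 80% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155 and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145;
(k) has more than 80% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145; or
(l) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145.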
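9. The method of claim 1 or 2, wherein the synthetic polynucleotide encodes an RGE and:
(i) the RGE is an SpCas9 endonuclease having at least 95% sequence identity to SEQ ID NO:1 or a variant thereof, and the soybean codon-optimized reference polynucleotide encoding the SpCas9 endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:2;
(ii) the RGE is an SaCas9 endonuclease having at least 95% sequence identity to SEQ ID NO:13 or a variant thereof, and the soybean codon-optimized reference polynucleotide encoding the SaCas9 endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:14;
(iii) the RGE is an FnCpf1 endonuclease having at least 95% sequence identity to SEQ ID NO:25 or a variant thereof, and the soybean codon-optimized reference polynucleotide encoding the FnCpf1 endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:26;
(iv) the RGE is a CasJ endonuclease having at least 95% sequence identity to SEQ ID NO:37 or a variant thereof, and the soybean codon-optimized reference polynucleotide encoding the CasJ endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:38;
(v) the RGE is a Cas12j-1 endonuclease having at least 95% sequence identity to SEQ ID NO:120 or a variant thereof, and the soybean codon-optimized reference polynucleotide encoding the Cas12j-1 endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:121;
(vi) the RGE is a Cas12j-2 endonuclease having at least 95% sequence identity to SEQ ID NO:132 or a variant thereof, and the soybean codon-optimized reference polynucleotide encoding the Cas12j-2 endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:133; or
(vii) the RGE is a Cas12j-3 endonuclease having at least 95% sequence identity to SEQ ID NO:144 or a variant thereof, and the soybean codon-optimized reference polynucleotide encoding the Cas12j-3 endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:145.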
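10. The method of claim 1 or 2, wherein the synthetic polynucleotide encodes the RGN, has a GC content greater than 48%, and has at least 70% sequence identity over the full length of any one of:
(i) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:3-12, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(ii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:15-24, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:14, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN;
(iii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:27-36, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:26, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN;
(iv) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:39-48, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:38, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN;
(v) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:51-60, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:50, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN;
(vi) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:63-72, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:62, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN;
(vii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:75-84, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:74, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN;
(viii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:87-96, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:86, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN; or
(ix) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:99-108, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:98, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN.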
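11. The method of claim 1 or 2, wherein the synthetic polynucleotide encodes the RGN and:
(a) has more than 80% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:3-11 and 12 and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN;
(b) has more than 80% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:3-12; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN; or
(c) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:3-12; and has a GC content greater than 48% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN.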
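12. The method of claim 1 or 2, wherein:
(i) the synthetic polynucleotide is an SpCas9 RGN having at least 95% sequence identity to SEQ ID NO:1, and the soybean codon-optimized reference polynucleotide encoding the SpCas9 RGN has at least 95% sequence identity to SEQ ID NO:2;
(ii) the RGN is an SaCas9 RGN having at least 95% sequence identity to SEQ ID NO:13, and the soybean codon-optimized reference polynucleotide encoding the SaCas9 ndRGDBP or variant thereof has at least 95% sequence identity to SEQ ID NO:14;
(iii) the RGN is an FnCpf1 RGN having at least 95% sequence identity to SEQ ID NO:25, and the soybean codon-optimized reference polynucleotide encoding the FnCpf1 RGN or variant thereof has at least 95% sequence identity to SEQ ID NO:26; or
(iv) the RGN is a CasJ RGN having at least 95% sequence identity to SEQ ID NO:37, and the soybean codon-optimized reference polynucleotide encoding the CasJ RGN or variant thereof has at least 95% sequence identity to SEQ ID NO:38.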
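13. The method of claim 1 or 2, wherein the synthetic polynucleotide:
(i) encodes the RGE and provides at least a 5-fold increase in the frequency of modification of the endogenous gene in the nuclear, plastid, or mitochondrial genome of the soybean plant cell in comparison to the frequency of modification of the endogenous gene in a control soybean plant cell having a control polynucleotide comprising the soybean codon-optimized reference polynucleotide encoding the RGE; or,
(ii) encodes the RGN and provides at least a 2-fold increase in nicking or nicking-related modification of the endogenous target sequence in the nuclear, plastid, or mitochondrial genome of the soybean plant cell in comparison to nicking or nicking-related modification of the endogenous target sequence in a control soybean plant cell comprising a control soybean codon-optimized reference polynucleotide encoding the RGN.
14. The method of claim 1 or 2, wherein the soybean codon-optimized reference polynucleotide has a GC content that is at least about 8%, 9%, or 10% lower than the GC content of the synthetic polynucleotide, or optionally wherein the soybean codon-optimized reference polynucleotide has a GC content that is at least about 8% to about 12% lower than the GC content of the synthetic polynucleotide.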
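15. A method of modifying expression of an endogenous soybean gene in a soybean genome, the method comprising:
(a) introducing a guide RNA or a polynucleotide encoding a guide RNA directed to a target DNA binding site in the endogenous soybean gene into a soybean plant cell comprising a synthetic polynucleotide encoding the ndRGDBP, wherein the synthetic polynucleotide:
(i) has a GC (guanine and cytosine) content greater than 47%, 48%, or 50%;
(ii) has a melting temperature (Tm) greater than 89 or 90 degrees Celsius;
(iii) has a soybean codon adaptation index (sCAI) lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the ndRGDBP;
(iv) or any combination of i, ii, and iii; and
(b) selecting a soybean plant cell, soybean plant, soybean plant part, soybean tissue, or soybean callus in which expression of the endogenous soybean gene has been modified.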
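16. A method of modifying expression of an endogenous soybean gene in a soybean genome, the method comprising:
(a) introducing into a soybean plant cell:
(i) a synthetic polynucleotide encoding a protein comprising a nuclease-deficient RNA-guided DNA binding protein (ndRGDBP), wherein the synthetic polynucleotide has a GC (guanine and cytosine) content greater than 47%, 48%, or 50%, has a melting temperature (Tm) greater than 89 or 90 degrees Celsius, has a soybean codon adaptation index (sCAI) lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the ndRGDBP, or any combination of said GC content, Tm, and sCAI; and
(ii) a guide RNA or a polynucleotide encoding a guide RNA, the guide RNA or polynucleotide encoding a guide RNA being directed to a target binding site in the endogenous soybean gene; and
(b) selecting a modified soybean plant cell, soybean plant, soybean plant part, soybean tissue, or soybean callus in which expression of the endogenous soybean gene has been modified.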
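17. The method of claim 16, wherein the ndRGDBP comprises a Type II Cas ndRGDBP, a Cas9 ndRGDBP, a Type V Cas ndRGDBP, a Cas12a ndRGDBP, a Cas12c ndRGDBP, a CasX ndRGDBP, a Cas12j ndRGDBP, or an engineered ndRGDBP.
18. The method of claim 16, wherein the ndRGDBP comprises a mutation in an HNH or RuvC-like nuclease domain, or optionally wherein the mutation is: (i) a D10A and/or H840A mutation in the Cas9 protein of SEQ ID NO:1; (ii) a D917A, E1006A, E1028A, D1255A, and/or N1257A mutation in the FnCpf1 protein of SEQ ID NO:25; (iii) a D901A, E1128A, and/or D1298A mutation in the CasJ protein of SEQ ID NO:37; (iv) a D832A, E925A, and/or D1148A mutation in the LbCpf1 protein of SEQ ID NO:73; (v) selected from the group consisting of D371A, E579A, D673A, C640A, C643A, C646A, C661A, C664A, C640S, C643S, C646S, C661S, and C664S of SEQ ID NO:120; (vi) selected from the group consisting of D394A, E606A, D697A, C667A, C670A, C673A, C685A, C688A, C667S, C670S, C673S, C685S, and C688S of SEQ ID NO:132; or (vii) selected from the group consisting of D413A, E618A, D710A, C680A, C683A, C687A, C698A, C701A, C680S, C683S, C687S, C698S, and C701S of SEQ ID NO:144.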
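The substitutions listed in claim 18 use the usual shorthand of original residue, 1-based position, and replacement residue (for example, D10A replaces an aspartate at position 10 with alanine). The following minimal sketch, which is not part of the patent, shows how such a label could be parsed and applied to a protein string. The function name apply_point_mutation is hypothetical, and toy_protein is a made-up 12-residue peptide used only for illustration; it is not presented as any sequence from the listing.

```python
import re


def apply_point_mutation(protein, mutation):
    """Apply a substitution written as <original><1-based position><replacement>, e.g. 'D10A'."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if match is None:
        raise ValueError(f"unrecognized mutation label: {mutation}")
    original, replacement = match.group(1), match.group(3)
    position = int(match.group(2))
    if not 1 <= position <= len(protein):
        raise ValueError(f"position {position} is outside the protein")
    index = position - 1
    # Refuse to mutate if the protein does not carry the expected residue there.
    if protein[index] != original:
        raise ValueError(f"residue {position} is {protein[index]}, not {original}")
    return protein[:index] + replacement + protein[index + 1:]


# Hypothetical peptide with an aspartate (D) at position 10.
toy_protein = "MDKKYSIGLDIG"
mutant = apply_point_mutation(toy_protein, "D10A")
```

Checking the original residue before substituting guards against applying a label to the wrong reference sequence or numbering scheme.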
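19. The method of claim 16, wherein the synthetic polynucleotide has at least 76% sequence identity over the full length of any one of:
(i) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:3-12, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(ii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:15-24, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:14, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(iii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:27-36, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:26, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(iv) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:39-48, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:38, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(v) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:51-60, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:50, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(vi) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:63-72, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:62, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(vii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:75-84, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:74, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(viii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:87-96, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:86, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(ix) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:99-108, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:98, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(x) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:122-130 and 131, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(xi) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:134-142 and 143, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133; or
(xii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:146-154 and 155, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145.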
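20. The method of claim 16, wherein the synthetic polynucleotide has a GC content greater than 48% or 50% and has at least 70% sequence identity over the full length of any one of:
(i) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:3-12, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(ii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:15-24, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:14, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(iii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:27-36, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:26, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(iv) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:39-48, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:38, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(v) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:51-60, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:50, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(vi) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:63-72, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:62, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(vii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:75-84, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:74, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(viii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:87-96, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:86, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(ix) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:99-108, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:98, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(x) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:122-130 and 131, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(xi) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:134-142 and 143, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP; or
(xii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:146-154 and 155, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP.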
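21. The method of claim 16, wherein the synthetic polynucleotide:
(a) has more than 80% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:3-11 and 12 and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(b) has more than 80% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:3-12; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(c) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:3-12; and has a GC content greater than 48% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(d) has more than 80% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131 and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(e) has more than 80% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, which comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(f) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(g) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(h) has more than 80% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(i) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(j) has more than 80% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155 and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(k) has more than 80% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and has a melting temperature (Tm) greater than 90 degrees Celsius and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP; or
(l) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and has a GC content greater than 50% and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, which soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP.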
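22. The method of claim 16, wherein:
(i) the ndRGDBP is an SpCas9 ndRGDBP having at least 95% sequence identity to SEQ ID NO:1, and the soybean codon-optimized reference polynucleotide encoding the SpCas9 ndRGDBP or variant thereof has at least 95% sequence identity to SEQ ID NO:2;
(ii) the ndRGDBP is an SaCas9 ndRGDBP having at least 95% sequence identity to SEQ ID NO:13, and the soybean codon-optimized reference polynucleotide encoding the SaCas9 ndRGDBP or variant thereof has at least 95% sequence identity to SEQ ID NO:14;
(iii) the ndRGDBP is an FnCpf1 ndRGDBP having at least 95% sequence identity to SEQ ID NO:25, and the soybean codon-optimized reference polynucleotide encoding the FnCpf1 ndRGDBP or variant thereof has at least 95% sequence identity to SEQ ID NO:26;
(iv) the ndRGDBP is a CasJ ndRGDBP having at least 95% sequence identity to SEQ ID NO:37, and the soybean codon-optimized reference polynucleotide encoding the CasJ ndRGDBP or variant thereof has at least 95% sequence identity to SEQ ID NO:38;
(v) the ndRGDBP is a Cas12j-1 variant having at least 95% sequence identity to SEQ ID NO:120, and the synthetic polynucleotide has more than 80% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131;
(vi) the ndRGDBP is a Cas12j-2 variant having at least 95% sequence identity to SEQ ID NO:132, and the synthetic polynucleotide has more than 80% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-143; or
(vii) the ndRGDBP is a Cas12j-3 variant having at least 95% sequence identity to SEQ ID NO:144, and the synthetic polynucleotide has more than 80% sequence identity over the full length of at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155.
23. The method of claim 16, wherein the synthetic polynucleotide further comprises an operably linked polynucleotide encoding an effector domain that modifies expression of the endogenous soybean gene.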
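24. The method of claim 16, wherein the synthetic polynucleotide is operably linked to:
(a) a promoter operable in a soybean plant cell;
(b) a 5' untranslated (UT) sequence and/or a 3' untranslated (UT) sequence, optionally wherein the 5' UT and/or 3' UT optionally (i) has a GC (guanine and cytosine) content greater than 47% or 48%; (ii) has a melting temperature (Tm) greater than 89 or 90 degrees Celsius; or a combination of (i) and (ii);
(c) a polyadenylation sequence;
(d) a second polynucleotide encoding a nuclear localization signal (NLS), a chloroplast transit peptide (CTP), an epitope tag (ET), a transcription activation domain (TAD), a transcription repression domain (TRD); or a combination thereof; optionally wherein one or more of the second polynucleotide sequences (i) has a GC (guanine and cytosine) content greater than 47% or 48%; (ii) has a melting temperature (Tm) greater than 89 or 90 degrees Celsius; (iii) has a soybean codon adaptation index (sCAI) lower than the sCAI of a second soybean codon-optimized reference polynucleotide encoding the NLS, CTP, ET, TAD, or TRD; or any combination of (i), (ii), and (iii); and/or
(e) a third polynucleotide sequence encoding a heterologous polypeptide having an enzymatic activity that modifies target DNA; optionally wherein one or more of the third polynucleotide sequences (i) has a GC (guanine and cytosine) content greater than 47% or 48%; (ii) has a melting temperature (Tm) greater than 89 or 90 degrees Celsius; (iii) has a soybean codon adaptation index (sCAI) lower than the sCAI of a third soybean codon-optimized reference polynucleotide encoding the heterologous polypeptide; or any combination of (i), (ii), and (iii).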
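25. The method of claim 24, wherein the heterologous polypeptide encoded by the third polynucleotide sequence exhibits one or more enzymatic activities selected from: nuclease activity, methyltransferase activity, demethylase activity, DNA repair activity, DNA damage activity, deamination activity, dismutase activity, alkylation activity, depurination activity, oxidation activity, pyrimidine dimer forming activity, integrase activity, transposase activity, recombinase activity, polymerase activity, ligase activity, helicase activity, photolyase activity, and/or glycosylase activity.
26. The method of claim 16, wherein the synthetic polynucleotide provides at least a 2-fold increase in expression of the endogenous gene in the nuclear, plastid, or mitochondrial genome of the soybean plant cell in comparison to expression of the endogenous gene in a control soybean plant cell comprising a control soybean codon-optimized reference polynucleotide that encodes the ndRGDBP and (i) has a GC content that is at least about 8%, 9%, or 10% lower than the GC content of the polynucleotide, or optionally wherein the control polynucleotide encoding the ndRGDBP has a GC content that is at least about 8% to 12% lower than the GC content of the synthetic polynucleotide.
27. The method of claim 16, wherein the synthetic polynucleotide comprises an RNA molecule encoding an RGE, an RGN, or an ndRGDBP.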
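28. A soybean plant cell comprising a synthetic polynucleotide encoding a protein, the protein comprising an RNA-guided endonuclease (RGE), an RNA-guided nickase (RGN), or a nuclease-deficient RNA-guided DNA binding protein (ndRGDBP), wherein the polynucleotide:
(a) has a GC (guanine and cytosine) content greater than 47% or 48%;
(b) has a melting temperature (Tm) greater than 89 or 90 degrees Celsius;
(c) has a soybean codon adaptation index (sCAI) lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the RGE; or
(d) any combination of (a), (b), and/or (c).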
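The three properties recited in claim 28 can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the claims shown here do not say which Tm formula applies, so a common salt-adjusted estimate for long duplex DNA is used, and the codon weights passed to `cai` are hypothetical placeholders rather than real soybean codon-usage values. The function names (`gc_content`, `tm_long_dna`, `cai`, `check_thresholds`) are likewise invented for this example.

```python
import math


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = seq.upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def tm_long_dna(seq: str, na_molar: float = 0.05) -> float:
    """Estimate Tm in degrees Celsius with a salt-adjusted formula for long DNA.

    ASSUMPTION: the formula and the 50 mM Na+ default are illustrative only.
    """
    return 81.5 + 16.6 * math.log10(na_molar) + 0.41 * gc_content(seq) - 675.0 / len(seq)


def cai(seq: str, weights: dict[str, float]) -> float:
    """Codon adaptation index: geometric mean of per-codon relative adaptiveness.

    `weights` maps codons to values in (0, 1]; codons missing from it are skipped.
    """
    seq = seq.upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    logs = [math.log(weights[c]) for c in codons if c in weights]
    if not logs:
        raise ValueError("no scorable codons in sequence")
    return math.exp(sum(logs) / len(logs))


def check_thresholds(seq: str, reference: str, weights: dict[str, float],
                     gc_min: float = 47.0, tm_min: float = 89.0) -> dict[str, bool]:
    """Report which of properties (a), (b), and (c) a candidate sequence satisfies."""
    return {
        "a_gc": gc_content(seq) > gc_min,
        "b_tm": tm_long_dna(seq) > tm_min,
        "c_scai": cai(seq, weights) < cai(reference, weights),
    }
```

Because item (d) allows any combination, a caller would treat a candidate as within the claimed ranges when at least one of the three returned flags is true.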
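29. The soybean plant cell of claim 28, wherein the RGE comprises a Type II Cas endonuclease, a Cas9 endonuclease, a Type V Cas endonuclease, a Cas12a endonuclease, a Cas12c endonuclease, a CasX endonuclease, a Cas12j endonuclease, or an engineered endonuclease.
30. The soybean plant cell of claim 28, wherein the ndRGDBP comprises a Type II Cas ndRGDBP, a Cas9 ndRGDBP, a Type V Cas ndRGDBP, a Cas12a ndRGDBP, a Cas12c ndRGDBP, a CasX ndRGDBP, a Cas12j ndRGDBP, or an engineered ndRGDBP.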
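31. The soybean plant cell of claim 28, wherein the synthetic polynucleotide encodes an RGE and has at least 76% sequence identity over the full length of any one of:
(i) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:3-12, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2;
(ii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:15-24, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:14;
(iii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:27-36, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:26;
(iv) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:39-48, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:38;
(v) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:51-60, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:50;
(vi) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:63-72, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:62; at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:75-84, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:74;
(vii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:87-96, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:86;
(viii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:99-108, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:98;
(ix) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:122-130 and 131, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(x) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:134-142 and 143, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133; at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:146-154 and 155, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145; or
(xi) at least 77% sequence identity over the full length of any one of SEQ ID NO:122-131, 134-143, or 146-154.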
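The phrase "sequence identity over the full length" can be made concrete with a small Python sketch of one common convention: align the two sequences globally, then divide the number of identical aligned positions by the alignment length. This is only an assumption about how identity might be computed; the claims shown here do not fix an alignment algorithm or scoring scheme, and the scores used below (match +1, mismatch -1, gap -1) and the function name `global_identity` are hypothetical choices.

```python
def global_identity(a: str, b: str, match: int = 1,
                    mismatch: int = -1, gap: int = -1) -> float:
    """Percent identity from a Needleman-Wunsch global alignment of a and b."""
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back one optimal path, counting identical columns and total columns.
    i, j, matches, length = n, m, 0, 0
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            matches += a[i - 1] == b[j - 1]
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            i -= 1
        else:
            j -= 1
        length += 1
    return 100.0 * matches / length
```

The table grows with the product of the two sequence lengths, so for coding sequences several kilobases long a dedicated alignment tool would be the practical choice; the sketch is meant only to show what a threshold such as "at least 76%" is compared against.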
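32. The soybean plant cell of claim 28, wherein the synthetic polynucleotide encodes an RGE, has a GC content greater than 48% or 50%, and has at least 70% sequence identity over the full length of any one of:
(i) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:3-12, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2;
(ii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:15-24, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:14;
(iii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:27-36, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:26;
(iv) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:39-48, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:38;
(v) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:51-60, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:50;
(vi) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:63-72, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:62; at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:75-84, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:74;
(vii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:87-96, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:86;
(viii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:99-108, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:98;
(ix) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:122-130 and 131, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(x) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:134-142 and 143, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133; or
(xi) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:146-154 and 155, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145.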
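33. The soybean plant cell of claim 28, wherein the synthetic polynucleotide encodes the RGE and:
(a) has more than 80% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:3-11 and 12, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2;
(b) has more than 80% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:3-12; and has a melting temperature (Tm) greater than 90 degrees Celsius, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2;
(c) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:3-12; and has a GC content greater than 48%, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2;
(d) has more than 80% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(e) has more than 80% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a melting temperature (Tm) greater than 90 degrees Celsius, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(f) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a GC content greater than 50%, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121;
(g) has more than 80% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133;
(h) has more than 80% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and has a melting temperature (Tm) greater than 90 degrees Celsius, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133;
(i) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and has a GC content greater than 50%, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133;
(j) has more than 80% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145;
(k) has more than 80% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and has a melting temperature (Tm) greater than 90 degrees Celsius, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145; or
(l) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and has a GC content greater than 50%, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145.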
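34. The soybean plant cell of claim 28, wherein the synthetic polynucleotide encodes the RGE and:
(i) the RGE is an SpCas9 endonuclease or variant thereof having at least 95% sequence identity to SEQ ID NO:1, and the soybean codon-optimized reference polynucleotide encoding the SpCas9 endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:2;
(ii) the RGE is an SaCas9 endonuclease or variant thereof having at least 95% sequence identity to SEQ ID NO:13, and the soybean codon-optimized reference polynucleotide encoding the SaCas9 endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:14;
(iii) the RGE is an FnCpf1 endonuclease or variant thereof having at least 95% sequence identity to SEQ ID NO:25, and the soybean codon-optimized reference polynucleotide encoding the FnCpf1 endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:26;
(iv) the RGE is a CasJ endonuclease or variant thereof having at least 95% sequence identity to SEQ ID NO:37, and the soybean codon-optimized reference polynucleotide encoding the CasJ endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:38;
(v) the RGE is a Cas12j-1 endonuclease or variant thereof having at least 95% sequence identity to SEQ ID NO:120, and the soybean codon-optimized reference polynucleotide encoding the Cas12j-1 endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:121;
(vi) the RGE is a Cas12j-2 endonuclease or variant thereof having at least 95% sequence identity to SEQ ID NO:132, and the soybean codon-optimized reference polynucleotide encoding the Cas12j-2 endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:133; or
(vii) the RGE is a Cas12j-3 endonuclease or variant thereof having at least 95% sequence identity to SEQ ID NO:144, and the soybean codon-optimized reference polynucleotide encoding the Cas12j-3 endonuclease or variant thereof has at least 95% sequence identity to SEQ ID NO:145.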
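35. The soybean plant cell of any one of claims 28 to 33 or 34, wherein the synthetic polynucleotide encodes the RGE and provides at least a 5-fold increase in the efficiency of modifying an endogenous gene or locus in the nuclear, plastid, or mitochondrial genome of the soybean plant cell compared to the efficiency of modifying the target gene in a control soybean plant cell, the control soybean plant cell having a control soybean codon-optimized reference polynucleotide.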
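A fold increase of this kind is a simple ratio, as the following hypothetical Python sketch shows. It assumes that modification efficiency is expressed as the fraction of sequencing reads carrying an edit at the target site; that measurement choice, the function names, and the example numbers are illustrative assumptions and are not taken from the claims.

```python
def editing_efficiency(edited_reads: int, total_reads: int) -> float:
    """Fraction of reads at the target site that carry a modification."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return edited_reads / total_reads


def fold_increase(test_eff: float, control_eff: float) -> float:
    """Ratio of the test efficiency to the control efficiency."""
    if control_eff <= 0:
        raise ValueError("control efficiency must be positive")
    return test_eff / control_eff


def meets_fold_threshold(test_eff: float, control_eff: float,
                         threshold: float = 5.0) -> bool:
    """True when the test construct reaches the stated fold increase."""
    return fold_increase(test_eff, control_eff) >= threshold
```

For example, with made-up counts of 300 edited reads out of 1,000 for the synthetic polynucleotide and 50 out of 1,000 for the control, the ratio is about 6, which would satisfy an "at least 5-fold" condition.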
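36. The soybean plant cell of claim 28, wherein the synthetic polynucleotide encodes the RGN or the ndRGDBP and has at least 76% sequence identity over the full length of any one of:
(i) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:3-12, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(ii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:15-24, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:14, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(iii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:27-36, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:26, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(iv) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:39-48, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:38, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(v) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:51-60, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:50, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(vi) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:63-72, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:62, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(vii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:75-84, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:74, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(viii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:87-96, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:86, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(ix) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:99-108, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:98, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(x) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(xi) at least two or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a melting temperature (Tm) greater than 90 degrees Celsius, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(xii) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a GC content greater than 50%, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(xiii) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(xiv) at least two or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and has a melting temperature (Tm) greater than 90 degrees Celsius, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(xv) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and has a GC content greater than 50%, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(xvi) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(xvii) at least two or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and has a melting temperature (Tm) greater than 90 degrees Celsius, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP; or
(xviii) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and has a GC content greater than 50%, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP.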
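37. The soybean plant cell of claim 28, wherein the synthetic polynucleotide encodes the RGN or the ndRGDBP, has a GC content greater than 48% or 50%, and has at least 70% sequence identity over the full length of any one of:
(i) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:3-12, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(ii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:15-24, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:14, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(iii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:27-36, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:26, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(iv) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:39-48, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:38, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(v) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:51-60, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:50, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(vi) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:63-72, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:62, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(vii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:75-84, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:74, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(viii) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:87-96, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:86, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(ix) at least one, two, or three polynucleotides selected from the group consisting of SEQ ID NO:99-108, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:98, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(x) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(xi) at least two or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a melting temperature (Tm) greater than 90 degrees Celsius, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(xii) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a GC content greater than 50%, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(xiii) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(xiv) at least two or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and has a melting temperature (Tm) greater than 90 degrees Celsius, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(xv) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and has a GC content greater than 50%, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(xvi) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(xvii) at least two or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and has a melting temperature (Tm) greater than 90 degrees Celsius, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP; or
(xviii) at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155; and has a GC content greater than 50%, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP.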
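38. The soybean plant cell of claim 28, wherein the synthetic polynucleotide encodes the RGN or the ndRGDBP and:
(a) has more than 80% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:3-11 and 12, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(b) has more than 80% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:3-12; and has a melting temperature (Tm) greater than 90 degrees Celsius, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(c) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:3-12; and has a GC content greater than 48%, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:2, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the RGN or the ndRGDBP;
(d) has more than 80% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(e) has more than 80% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a melting temperature (Tm) greater than 90 degrees Celsius, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(f) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:122-130 and 131; and has a GC content greater than 50%, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:121, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(g) has more than 80% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(h) has more than 80% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143, and has a melting temperature (Tm) greater than 90 degrees Celsius, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(i) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:134-142 and 143; and has a GC content greater than 50%, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:133, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(j) has more than 80% sequence identity to at least one, two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP;
(k) has more than 80% sequence identity over the full length of at least two or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155, and has a melting temperature (Tm) greater than 90 degrees Celsius, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP; or
(l) has more than 80% sequence identity over the full length of at least one, at least two, or three sequences selected from the group consisting of SEQ ID NO:146-154 and 155, and has a GC content greater than 48%, and optionally has an sCAI lower than the sCAI of the soybean codon-optimized reference polynucleotide of SEQ ID NO:145, wherein the soybean codon-optimized reference polynucleotide comprises one or more nucleotide insertions, deletions, and/or substitutions and encodes the ndRGDBP.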
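39. The soybean plant cell of claim 28, wherein:
(i) the RGN or the ndRGDBP is an SpCas9 RGN or ndRGDBP having at least 95% sequence identity to SEQ ID NO:1, and the soybean codon-optimized reference polynucleotide encoding the SpCas9 RGN or ndRGDBP has at least 95% sequence identity to SEQ ID NO:2;
(ii) the RGN or the ndRGDBP is an SaCas9 RGN or ndRGDBP having at least 95% sequence identity to SEQ ID NO:13, and the soybean codon-optimized reference polynucleotide encoding the SaCas9 RGN or ndRGDBP has at least 95% sequence identity to SEQ ID NO:14;
(iii) the RGN or the ndRGDBP is an FnCpf1 RGN or ndRGDBP having at least 95% sequence identity to SEQ ID NO:25, and the soybean codon-optimized reference polynucleotide encoding the FnCpf1 ndRGDBP or variant thereof has at least 95% sequence identity to SEQ ID NO:26;
(iv) the RGN or the ndRGDBP is a CasJ RGN or ndRGDBP having at least 95% sequence identity to SEQ ID NO:37, and the soybean codon-optimized reference polynucleotide encoding the CasJ ndRGDBP or variant thereof has at least 95% sequence identity to SEQ ID NO:38;
(v) the ndRGDBP is a Cas12j-1 ndRGDBP having at least 95% sequence identity to SEQ ID NO:120, and the soybean codon-optimized reference polynucleotide encoding the Cas12j-1 ndRGDBP has at least 95% sequence identity to SEQ ID NO:121;
(vi) the ndRGDBP is a Cas12j-2 ndRGDBP having at least 95% sequence identity to SEQ ID NO:132, and the soybean codon-optimized reference polynucleotide encoding the Cas12j-2 ndRGDBP has at least 95% sequence identity to SEQ ID NO:133; or
(vii) the ndRGDBP is a Cas12j-3 ndRGDBP having at least 95% sequence identity to SEQ ID NO:144, and the soybean codon-optimized reference polynucleotide encoding the Cas12j-3 ndRGDBP has at least 95% sequence identity to SEQ ID NO:145.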
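40. The soybean plant cell of any one of claims 28, 30, 36 to 38, or 39, wherein the synthetic polynucleotide:
(iii) encodes a protein comprising the ndRGDBP and provides at least a 2-fold increase or decrease in expression of an endogenous gene in the nuclear, plastid, or mitochondrial genome of the soybean plant cell compared to expression of the endogenous gene in a control soybean plant cell, the control soybean plant cell comprising a control soybean codon-optimized reference polynucleotide that encodes the ndRGDBP; or
(iv) encodes the RGN and provides at least a 2-fold increase in nicking or nick-associated modification of an endogenous target sequence in the nuclear, plastid, or mitochondrial genome of the soybean plant cell compared to nicking or nick-associated modification of the endogenous target sequence in a control soybean plant cell, the control soybean plant cell comprising a control soybean codon-optimized reference polynucleotide encoding the RGN.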
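41. The soybean plant cell of any one of claims 28 to 34, 36 to 38, or 39, wherein the synthetic polynucleotide comprises an RNA molecule encoding the RNA-guided endonuclease protein or RNA-guided DNA binding protein.
42. The soybean plant cell of any one of claims 28 to 34, 36 to 38, or 39, wherein the soybean plant cell further comprises a guide RNA or a polynucleotide encoding a guide RNA.
43. The soybean plant cell of any one of claims 28 to 34, 36 to 38, or 39, wherein the soybean plant cell further comprises a donor template DNA molecule having homology to the target editing site.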
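44. The soybean plant cell of any one of claims 28 to 34, 36 to 38, or 39, wherein the synthetic polynucleotide is operably linked to:
(a) a promoter operable in a soybean plant cell;
(b) a 5' untranslated (UT) sequence and/or a 3' untranslated (UT) sequence, optionally wherein the 5' UT and/or 3' UT (i) has a GC (guanine and cytosine) content greater than 47% or 48%; (ii) has a melting temperature (Tm) greater than 89 or 90 degrees Celsius; or a combination of (i) and (ii);
(c) a polyadenylation sequence; and/or
(d) a second polynucleotide encoding a nuclear localization signal (NLS), a chloroplast transit peptide (CTP), an epitope tag (ET), a transcription activation domain (TAD), a transcription repression domain (TRD), or a combination thereof; optionally wherein one or more of the second polynucleotide sequences (i) has a GC (guanine and cytosine) content greater than 47% or 48%; (ii) has a melting temperature (Tm) greater than 89 or 90 degrees Celsius; (iii) has a soybean codon adaptation index (sCAI) lower than the sCAI of a second soybean codon-optimized reference polynucleotide encoding the NLS, CTP, ET, TAD, or TRD; or any combination of (i), (ii), and (iii); and/or
(e) a third polynucleotide sequence encoding a heterologous polypeptide having an enzymatic activity that modifies target DNA; optionally wherein one or more of the third polynucleotide sequences (i) has a GC (guanine and cytosine) content greater than 47% or 48%; (ii) has a melting temperature (Tm) greater than 89 or 90 degrees Celsius; (iii) has a soybean codon adaptation index (sCAI) lower than the sCAI of a third soybean codon-optimized reference polynucleotide encoding the heterologous polypeptide; or any combination of (i), (ii), and (iii).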
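45. The soybean plant cell of claim 44, wherein the heterologous polypeptide encoded by the third polynucleotide sequence exhibits one or more enzymatic activities selected from: nuclease activity, methyltransferase activity, demethylase activity, DNA repair activity, DNA damage activity, deamination activity, dismutase activity, alkylation activity, depurination activity, oxidation activity, pyrimidine dimer forming activity, integrase activity, transposase activity, recombinase activity, polymerase activity, ligase activity, helicase activity, photolyase activity, and/or glycosylase activity.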
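46. The soybean plant cell of any one of claims 28, 30, 36 to 38, or 39, wherein the ndRGDBP comprises a mutation in the HNH and/or RuvC-like nuclease domain, or optionally wherein the mutation is: (i) a D10A and/or H840A mutation in the Cas9 protein of SEQ ID NO:1; (ii) a D917A, E1006A, E1028A, D1255A, and/or N1257A mutation in the FnCpf1 protein of SEQ ID NO:25; (iii) a D901A, E1128A, and/or D1298A mutation in the CasJ protein of SEQ ID NO:37; or (iv) a D832A, E925A, and/or D1148A mutation in the LbCpf1 protein of SEQ ID NO:73.
47. The soybean plant cell of any one of claims 28, 30, 36 to 38, or 39, wherein the RGN comprises a mutation in the HNH or RuvC-like nuclease domain, or optionally wherein the mutation is: (i) a D10A mutation in the Cas9 protein of SEQ ID NO:1; (ii) an R1226A amino acid mutation in the FnCpf1 protein of SEQ ID NO:25; (iii) an R1138A mutation in the LbCpf1 protein of SEQ ID NO:73; (iv) at residue D371, E579, D673, C640, C643, C646, C661, or C664 of SEQ ID NO:120; (v) at residue D394, E606, D697, C667, C670, C673, C685, or C688 of SEQ ID NO:132; or (vi) at residue D413, E618, D710, C680, C683, C687, C698, or C701 of SEQ ID NO:144.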
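48. A soybean plant, plant part, tissue, or callus comprising the soybean plant cell of any one of claims 28 to 34, 36 to 38, or 39.
49. The soybean plant part of claim 48, wherein:
(a) the part is a stem, pod, leaf, bud, root, or seed;
(b) the tissue is callus, meristem, or embryonic tissue; or
(c) the tissue is embryogenic callus.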
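50. A method of obtaining the soybean plant cell of any one of claims 28 to 34, 36 to 38, or 39, the method comprising:
(a) introducing into the soybean plant cell a synthetic polynucleotide encoding a protein comprising the RNA-guided endonuclease (RGE), the RNA-guided nickase (RGN), or the nuclease-deficient RNA-guided DNA binding protein (ndRGDBP), wherein the polynucleotide has a GC (guanine and cytosine) content greater than 47% or 48%; has a melting temperature (Tm) greater than 89 or 90 degrees Celsius; has a soybean codon adaptation index (sCAI) lower than the sCAI of a soybean codon-optimized reference polynucleotide encoding the RGE; or has any combination of said GC content, Tm, and/or lower sCAI; and
(b) selecting a plant cell comprising the synthetic polynucleotide.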
51. An isolated polynucleotide comprising any one of SEQ ID NO:122-131, 134-143, or 146-185.
52. An isolated polynucleotide encoding a Cas12j polypeptide, the isolated polynucleotide comprising a mutation or residue corresponding to:
(a) C640 of SEQ ID NO:120, C667 of SEQ ID NO:132, or C680 of SEQ ID NO:144;
(b) C643 of SEQ ID NO:120, C670 of SEQ ID NO:132, or C683 of SEQ ID NO:144;
(c) C646 of SEQ ID NO:120, C673 of SEQ ID NO:132, or C687 of SEQ ID NO:144;
(d) C661 of SEQ ID NO:120, C685 of SEQ ID NO:132, or C698 of SEQ ID NO:144; or
(e) C664 of SEQ ID NO:120, C688 of SEQ ID NO:132, or C701 of SEQ ID NO:144.
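53. An isolated polynucleotide encoding a polypeptide, the isolated polynucleotide comprising:
(a) SEQ ID NO:120 having a mutation selected from the group consisting of C640A, C643A, C646A, C661A, C664A, C640S, C643S, C646S, C661S, and C664S;
(b) SEQ ID NO:132 having a mutation selected from the group consisting of C667A, C670A, C673A, C685A, C688A, C667S, C670S, C673S, C685S, and C688S; or
(c) SEQ ID NO:144 having a mutation selected from the group consisting of C680A, C683A, C687A, C698A, C701A, C680S, C683S, C687S, C698S, and C701S.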
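Substitution names such as C640A follow the usual pattern of wild-type residue, 1-based position, and replacement residue. The Python sketch below shows how such a name could be parsed and applied to a protein sequence, with a check that the expected wild-type residue is actually present. The function name `apply_substitution` and the five-residue toy sequence are hypothetical; the actual sequences of SEQ ID NO:120, 132, and 144 are not reproduced here.

```python
import re


def apply_substitution(protein: str, mutation: str) -> str:
    """Apply a point substitution written as <wild type><position><new>, e.g. C640A."""
    parsed = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if parsed is None:
        raise ValueError(f"unrecognized mutation name: {mutation!r}")
    wild_type, position, new = parsed.group(1), int(parsed.group(2)), parsed.group(3)
    if not 1 <= position <= len(protein):
        raise ValueError(f"position {position} is outside the sequence")
    # Positions in mutation names are 1-based; Python indexing is 0-based.
    if protein[position - 1] != wild_type:
        raise ValueError(
            f"expected {wild_type} at position {position}, found {protein[position - 1]}"
        )
    return protein[:position - 1] + new + protein[position:]


# Toy example: replace the cysteine at position 3 with alanine or serine.
toy_protein = "MKCDE"
alanine_variant = apply_substitution(toy_protein, "C3A")
serine_variant = apply_substitution(toy_protein, "C3S")
```

Checking the wild-type residue before substituting guards against numbering that has drifted, for instance when a tag or signal peptide has been added upstream of the reference sequence.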
54. A recombinant nucleic acid comprising the isolated nucleic acid of any one of claims 51, 52, or 53.
CN202180025334.2A 2020-03-30 2021-03-29 Improved polynucleotides for expression of RNA-guided nucleases and DNA binding proteins in soybean Pending CN115484815A (zh)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
US202063001806P 2020-03-30 2020-03-30
US63/001,806 2020-03-30
US202063072585P 2020-08-31 2020-08-31
US63/072,585 2020-08-31
US202063075395P 2020-09-08 2020-09-08
US63/075,395 2020-09-08
PCT/US2021/024681 WO2021202397A2 (en) 2020-03-30 2021-03-29 Improved polynucleotides for expression of rna-guided nucleases and dna binding proteins in soybean

Publications (1)

Publication Number Publication Date
CN115484815A true CN115484815A (zh) 2022-12-16

Family

ID=77930145

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202180025334.2A Pending CN115484815A (zh) 2020-03-30 2021-03-29 Improved polynucleotides for expression of RNA-guided nucleases and DNA binding proteins in soybean

Country Status (7)

Country Link
US (1) US20230175001A1 (zh)
EP (1) EP4125338A4 (zh)
CN (1) CN115484815A (zh)
AU (1) AU2021246436A1 (zh)
BR (1) BR112022019960A2 (zh)
CA (1) CA3170846A1 (zh)
WO (1) WO2021202397A2 (zh)

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6072050A (en) * 1996-06-11 2000-06-06 Pioneer Hi-Bred International, Inc. Synthetic promoters
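CN105916987A (zh) * 2013-08-22 2016-08-31 E. I. du Pont de Nemours and Company Plant genome modification using guide RNA/Cas endonuclease systems and methods of use
CN108138155A (zh) * 2015-10-20 2018-06-08 Pioneer Hi-Bred International, Inc. Restoring function to a non-functional gene product via guided Cas systems and methods of use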
US20190264218A1 (en) * 2016-11-04 2019-08-29 Flagship Pioneering Innovations V, Inc. Novel Plant Cells, Plants, and Seeds
CN106834341A (zh) * 2016-12-30 2017-06-13 China Agricultural University Site-directed gene mutagenesis vector, and construction method and use thereof
US20190352655A1 (en) * 2017-01-28 2019-11-21 Inari Agriculture, Inc. Novel plant cells, plants, and seeds
US20190144852A1 (en) * 2017-11-13 2019-05-16 The Board Of Trustees Of The University Of Illinois Combinatorial Metabolic Engineering Using a CRISPR System
CN115279898A (zh) * 2019-10-23 2022-11-01 Pairwise Plants Services, Inc. Compositions and methods for RNA-templated editing in plants

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
JEAN-MICHEL MICHNO et al.: "CRISPR/Cas mutagenesis of soybean and Medicago truncatula using a new web-tool and a modified Cas9 enzyme", GM CROPS & FOOD, vol. 6, pages 243 - 252, XP055863270, DOI: 10.1080/21645698.2015.1106063 *
CAI YUPENG: "CRISPR/Cas9-mediated targeted genome editing in soybean", China Master's Theses Full-text Database (Electronic Journal), Agricultural Science and Technology, no. 2, pages 047 - 741 *

Also Published As

Publication number Publication date
AU2021246436A1 (en) 2022-10-20
EP4125338A4 (en) 2024-05-01
US20230175001A1 (en) 2023-06-08
WO2021202397A3 (en) 2021-11-11
EP4125338A2 (en) 2023-02-08
BR112022019960A2 (pt) 2022-12-13
CA3170846A1 (en) 2021-10-07
WO2021202397A2 (en) 2021-10-07

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination