CN112423878B - Nanopore assemblies and uses thereof - Google Patents

Nanopore assemblies and uses thereof

Publication number
CN112423878B
Authority
CN
China
Prior art keywords
asn
ser
leu
val
asp
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201980020065.3A
Other languages
English (en)
Other versions
CN112423878A (zh)
Inventor
F·哈克
王少英
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Oxford Nanopore Technology Public Co ltd
Original Assignee
Oxford Nanopore Technology Public Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Oxford Nanopore Technology Public Co ltd
Publication of CN112423878A
Application granted
Publication of CN112423878B

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B01PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
    • B01JCHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
    • B01J20/00Solid sorbent compositions or filter aid compositions; Sorbents for chromatography; Processes for preparing, regenerating or reactivating thereof
    • B01J20/28Solid sorbent compositions or filter aid compositions; Sorbents for chromatography; Processes for preparing, regenerating or reactivating thereof characterised by their form or physical properties
    • B01J20/28054Solid sorbent compositions or filter aid compositions; Sorbents for chromatography; Processes for preparing, regenerating or reactivating thereof characterised by their form or physical properties characterised by their surface properties or porosity
    • B01J20/28078Pore diameter
    • B01J20/2808Pore diameter being less than 2 nm, i.e. micropores or nanopores
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
    • G01N33/6803General methods of protein analysis not limited to specific proteins or families of proteins
    • G01N33/6818Sequencing of polypeptides
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B01PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
    • B01JCHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
    • B01J20/00Solid sorbent compositions or filter aid compositions; Sorbents for chromatography; Processes for preparing, regenerating or reactivating thereof
    • B01J20/30Processes for preparing, regenerating, or reactivating
    • B01J20/32Impregnating or coating ; Solid sorbent compositions obtained from processes involving impregnating or coating
    • B01J20/3202Impregnating or coating ; Solid sorbent compositions obtained from processes involving impregnating or coating characterised by the carrier, support or substrate used for impregnation or coating
    • B01J20/3204Inorganic carriers, supports or substrates
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B01PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
    • B01JCHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
    • B01J20/00Solid sorbent compositions or filter aid compositions; Sorbents for chromatography; Processes for preparing, regenerating or reactivating thereof
    • B01J20/30Processes for preparing, regenerating, or reactivating
    • B01J20/32Impregnating or coating ; Solid sorbent compositions obtained from processes involving impregnating or coating
    • B01J20/3202Impregnating or coating ; Solid sorbent compositions obtained from processes involving impregnating or coating characterised by the carrier, support or substrate used for impregnation or coating
    • B01J20/3206Organic carriers, supports or substrates
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B01PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
    • B01JCHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
    • B01J20/00Solid sorbent compositions or filter aid compositions; Sorbents for chromatography; Processes for preparing, regenerating or reactivating thereof
    • B01J20/30Processes for preparing, regenerating, or reactivating
    • B01J20/32Impregnating or coating ; Solid sorbent compositions obtained from processes involving impregnating or coating
    • B01J20/3231Impregnating or coating ; Solid sorbent compositions obtained from processes involving impregnating or coating characterised by the coating or impregnating layer
    • B01J20/3242Layers with a functional group, e.g. an affinity material, a ligand, a reactant or a complexing group
    • B01J20/3268Macromolecular compounds
    • B01J20/3272Polymers obtained by reactions otherwise than involving only carbon to carbon unsaturated bonds
    • B01J20/3274Proteins, nucleic acids, polysaccharides, antibodies or antigens
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6816Hybridisation assays characterised by the detection means
    • C12Q1/6825Nucleic acid detection involving sensors
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/53Immunoassay; Biospecific binding assay; Materials therefor
    • G01N33/543Immunoassay; Biospecific binding assay; Materials therefor with an insoluble carrier for immobilising immunochemicals
    • G01N33/54366Apparatus specially adapted for solid-phase testing
    • G01N33/54373Apparatus specially adapted for solid-phase testing involving physiochemical end-point determination, e.g. wave-guides, FETS, gratings
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/53Immunoassay; Biospecific binding assay; Materials therefor
    • G01N33/573Immunoassay; Biospecific binding assay; Materials therefor for enzymes or isoenzymes
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B82NANOTECHNOLOGY
    • B82YSPECIFIC USES OR APPLICATIONS OF NANOSTRUCTURES; MEASUREMENT OR ANALYSIS OF NANOSTRUCTURES; MANUFACTURE OR TREATMENT OF NANOSTRUCTURES
    • B82Y15/00Nanotechnology for interacting, sensing or actuating, e.g. quantum dots as markers in protein assays or molecular motors
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N2333/00Assays involving biological materials from specific organisms or of a specific nature
    • G01N2333/90Enzymes; Proenzymes
    • G01N2333/914Hydrolases (3)
    • G01N2333/948Hydrolases (3) acting on peptide bonds (3.4)
    • G01N2333/95Proteinases, i.e. endopeptidases (3.4.21-3.4.99)
    • G01N2333/964Proteinases, i.e. endopeptidases (3.4.21-3.4.99) derived from animal tissue
    • G01N2333/96425Proteinases, i.e. endopeptidases (3.4.21-3.4.99) derived from animal tissue from mammals
    • G01N2333/96427Proteinases, i.e. endopeptidases (3.4.21-3.4.99) derived from animal tissue from mammals in general
    • G01N2333/9643Proteinases, i.e. endopeptidases (3.4.21-3.4.99) derived from animal tissue from mammals in general with EC number
    • G01N2333/96433Serine endopeptidases (3.4.21)
    • G01N2333/96441Serine endopeptidases (3.4.21) with definite EC number
    • G01N2333/96455Kallikrein (3.4.21.34; 3.4.21.35)

Landscapes

  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Immunology (AREA)
  • Molecular Biology (AREA)
  • Analytical Chemistry (AREA)
  • Organic Chemistry (AREA)
  • Hematology (AREA)
  • Urology & Nephrology (AREA)
  • Biomedical Technology (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Microbiology (AREA)
  • Biotechnology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Cell Biology (AREA)
  • Medicinal Chemistry (AREA)
  • Food Science & Technology (AREA)
  • General Physics & Mathematics (AREA)
  • Pathology (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Wood Science & Technology (AREA)
  • Biophysics (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Genetics & Genomics (AREA)
  • Inorganic Chemistry (AREA)
  • Nanotechnology (AREA)
  • Peptides Or Proteins (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Investigating Or Analyzing Materials By The Use Of Electric Means (AREA)
  • Apparatus Associated With Microorganisms And Enzymes (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

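The present disclosure provides a nanopore system, assembled from non-membrane proteins, for detecting an analyte. Methods, kits, and detection devices employing the disclosed nanopore system are also disclosed. The nanopore system has a wide range of applications, including single-molecule detection, DNA/RNA/peptide sequencing, sensing of chemicals, biological agents, and polymers, and disease diagnosis.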

Description

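Nanopore assemblies and uses thereof
Cross-reference to related applications
This application claims priority under 35 U.S.C. §119(e) to U.S. Provisional Patent Application No. 62/629,604, filed February 12, 2018, the disclosure of which is incorporated herein by reference in its entirety.
Technical field
The present disclosure relates generally to nanopores, and more particularly to systems and methods for detecting analytes using nanopores assembled from non-membrane proteins.
Background
Biological nanopores are protein channels embedded in a substrate, typically a lipid membrane. A wide variety of protein complexes in nature form elegant channel-like structures, some of which have been explored as nanopores, as exemplified by α-hemolysin, MspA, aerolysin, FluA, Omp F/G, CsgG, ClyA, and PA63. For example, the biological protein nanopore α-hemolysin (αHL) from Staphylococcus aureus has been used for single-molecule detection.
More recently, researchers have employed biological, solid-state, DNA origami, and hybrid nanopores in single-molecule analysis. Compared with their synthetic counterparts, biological nanopores have advantages, chiefly because they can be reproducibly fabricated and modified with an atomic-level precision that artificial nanopores cannot yet match. However, existing uses of biological nanopores also have drawbacks. For example, they are not versatile in offering the different shapes, sizes, and hydrophilic/hydrophobic properties needed to detect different analytes with high sensitivity and specificity.
Accordingly, there remains a strong need for a robust nanopore system suitable for detecting analytes with different properties.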
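Summary
The present disclosure meets this need in the art by providing a nanopore assembly for detecting an analyte. The nanopore assembly includes a channel formed by a plurality of subunits. Each of the subunits comprises a non-membrane protein capable of forming a protein channel. In some embodiments, each of the subunits comprises a polypeptide having a polypeptide sequence at least 75% identical to a polypeptide sequence selected from SEQ ID NOs: 1-35. In some embodiments, the polypeptide comprises a polypeptide sequence at least 75% identical to SEQ ID NOs: 4-12.
In some embodiments, the polypeptide may comprise at least one residue substituted with cysteine. In some embodiments, the polypeptide is derived from a phi29 portal protein or tail protein. In some embodiments, the phi29 tail protein may comprise one or more of the E595C, K321C, and K358C substitutions. In some embodiments, the phi29 tail protein may comprise one or more of the K134I, D138L, D139L, D158L, E163V, E309V, D311V, K321V, K356A, K358A, D377A, D381V, N388L, R524I, R539A, and E595V substitutions.
In one aspect, the nanopore assembly further includes a probe for detecting the analyte. The probe is operably linked to at least one subunit. The probe may be one of a chemical, a carbohydrate, an aptamer, a nucleic acid, a peptide, a protein, an antibody, and a receptor. In some embodiments, the probe comprises a sequence at least 75% identical to a sequence selected from SEQ ID NOs: 36-79. In some embodiments, the probe is an anti-PSA antibody. The probe may be operably linked to at least one of the subunits via a covalent bond. The covalent bond includes a disulfide bond, an ester bond, or a sulfhydryl bond. In some embodiments, the probe is operably linked at a position near the entrance of the channel or at a position inside the channel.
The analyte may be one of a nucleic acid, an amino acid, a peptide, a protein, a polymer, and a chemical molecule. In some embodiments, the analyte is one of PSA, CEA, AFP, VCAM, MiR-155, MiR-22, MiR-7, MiR-92a, MiR-122, MiR-192, MiR-223, MiR-26a, MiR-27a, and MiR-802.
According to some embodiments of the nanopore assembly, the channel is embedded in a polymersome. In some embodiments, the channel is inserted in a membrane. The membrane may include a polymer membrane or a lipid membrane. The polymer membrane may include an alternating copolymer, a periodic copolymer, a block copolymer, a diblock copolymer, a triblock copolymer, a terpolymer, or a combination thereof. In some embodiments, the polymer membrane comprises PMOXA-PDMS-PMOXA. In some embodiments, the nanopore assembly may further include cholesterol and/or porphyrin.
In another aspect, the present disclosure also provides an apparatus for detecting an analyte. The apparatus includes the nanopore assembly described above and, optionally, a support for the nanopore assembly. The apparatus may further include an electrode to which the nanopore assembly is tethered.
In another aspect, the present disclosure also provides a kit that includes the nanopore assembly described above and, optionally, instructions for using the nanopore assembly.
In another aspect, the present disclosure provides a method of detecting an analyte. The method includes: (1) contacting a sample containing the analyte with the nanopore assembly; (2) applying a current across the channel of the nanopore assembly; (3) determining the current through the channel at one or more time intervals; and (4) comparing the current measured at the one or more time intervals with a reference current, wherein a change in current relative to the reference current indicates the presence of the analyte in the sample. The analyte may be any of a nucleic acid, an amino acid, a peptide, a protein, a polymer, and a chemical molecule. In some embodiments, the reference current is measured with a sample that does not contain the analyte. In some embodiments, the nanopore assembly is placed on a support.
The foregoing summary is not intended to define every aspect of the present disclosure, and additional aspects are described in other sections, such as the detailed description below. The entire document is intended to be related as a unified disclosure, and it should be understood that all combinations of the features described herein are contemplated, even if a combination of features is not found together in the same sentence, paragraph, or section of this document. Other features and advantages of the invention will become apparent from the following detailed description. It should be understood, however, that the detailed description and the specific examples, while indicating specific embodiments of the disclosure, are given by way of illustration only, because various changes and modifications within the spirit and scope of the disclosure will become apparent to those skilled in the art from this detailed description.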
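Brief description of the drawings
FIGS. 1A and 1B are schematic diagrams showing the different functional layers of a typical non-membrane protein (shown as a truncated cone structure) used as a nanopore embedded in a polymer membrane. FIG. 1A shows three distinct domains that are important for membrane anchoring. FIG. 1B shows two regions that are particularly important for conjugation of functional modules for single-molecule sensing.
FIG. 2 shows an example of mutagenesis in the membrane-anchoring layer for direct membrane insertion, illustrated with the phi29 gp9ΔLoop tail protein. Structures of the phi29 gp9ΔLoop tail protein channel are shown before and after a series of hydrophobic mutations made in the middle layer (boxed) to increase its capacity for direct membrane insertion. Exemplary mutation sites include: K134I, D138L, D139L, D158L, E163V, E309V, D311V, K321V, K356A, K358A, D377A, D381V, N388L, R524I, R539A, and E595V. After expression and purification, the mutant protein inserts spontaneously into a polymer membrane. Representative positively charged residues (e.g., R and K) and representative negatively charged residues (e.g., E, Q, D, and N) are indicated.
FIG. 3 shows an example of bacteriophage protein expression and purification, illustrated with the phi29 gp9 tail protein. A Coomassie blue-stained SDS-PAGE gel shows expression of the phi29 gp-9 tail protein channel, which has a molecular weight of 70.33 kDa.
FIG. 4 is an exemplary Coomassie blue-stained SDS-PAGE gel showing that, after purification and assembly, most of the gp-9 tail protein channel is retained in a 100 kDa column, indicating that the channel is assembled from its monomeric units of about 70 kDa.
FIG. 5 is an example of target site selection on the surface of a non-membrane protein, illustrated with the phi29 gp9 tail protein. The phi29 gp9 tail protein structure shows three possible residues for mutagenesis to cysteine, for conjugation of a hydrophobic membrane-anchoring module.
FIG. 6 shows an example of a non-membrane protein channel inserted into a polymer membrane, illustrated with the phi29 gp9ΔLoop tail protein. Single-channel recording data show the phi29 gp9ΔLoop channel inserting directly into a polymer membrane of composition PMOXA6-PDMS65-PMOXA6. Conducting buffer: 1 M NaCl, 5 mM Tris, pH 7.6. Applied voltage: 75 mV.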
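FIGS. 7A and 7B show the structures of the bacteriophage P22 gp1 portal protein: full length (FIG. 7A) and barrel-deletion mutant (FIG. 7B).
FIG. 8 shows an example of target site selection for membrane anchoring in a non-membrane protein, illustrated with the P22 gp1 portal protein. Representative conjugation sites for incorporating a membrane-anchoring domain (e.g., the accessible cysteine residue C283) are shown on the structure of the bacteriophage P22 gp1 portal protein.
FIG. 9 shows an example of a non-membrane protein pore functionalized with a fused peptide probe, illustrated with the P22 gp1 portal protein fused to VCAM1. A Coomassie blue-stained gel shows the purified bacteriophage P22 gp1 portal protein, full length and barrel-deletion mutant, fused at the C-terminus to a VCAM1 probe.
FIG. 10 shows an example of a non-membrane protein pore functionalized with a fused peptide probe, illustrated with the P22 gp1 portal protein fused to PSA. A Coomassie blue-stained gel shows the purified bacteriophage P22 gp1 portal protein barrel-deletion mutant fused at the C-terminus to a PSA probe.
FIG. 11 shows an example of a non-membrane protein pore functionalized with a fused peptide probe for single-molecule sensing, illustrated with the P22 gp1 portal protein fused to a PSA probe for detecting PSA. Single-channel recording data show the P22 gp1 protein channel carrying the fused PSA probe inserting directly into a polymer membrane of composition PMOXA6-PDMS35-PMOXA6. In the presence of PSA (10 ng/uL), the probe binds PSA and produces characteristic current blockade events. Conducting buffer: 1 M KCl, 5 mM Tris, pH 7.6. Applied voltage: 75 mV.
FIG. 12 shows an example of target site selection for membrane anchoring in a non-membrane protein, illustrated with the T4 gp20 portal protein. Representative conjugation sites for incorporating a membrane-anchoring domain (e.g., via the accessible cysteine residues C217, 245, 246) are indicated on the structure of the bacteriophage T4 gp20 portal protein.
FIG. 13 shows an example of labeling a non-membrane protein channel with cholesterol-PEG-maleimide for membrane anchoring, illustrated with the T4 gp20 portal protein.
FIG. 14 shows an example of a non-membrane protein channel inserted in a polymer membrane, illustrated with the cholesterol-bearing T4 gp20 portal protein. Stepwise direct insertion of the protein is observed under an applied potential. Conducting buffer: 1 M KCl, 5 mM Tris, pH 7.6. Applied voltage: 100 mV. Membrane: PMOXA6-PDMS35-PMOXA6.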
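FIG. 15 shows an example of conjugating a probe to a non-membrane protein channel via click chemistry, illustrated with the phi29 gp10 portal protein. A thiol-miR-21 probe is labeled with TCO (trans-cyclooctene), and the TCO-miR-21 probe is then conjugated to the methyltetrazine-labeled protein. An SDS-PAGE gel verifies binding of the target miR-21 to the miR-21 probe conjugated to the protein.
FIG. 16 shows an example of analyte (e.g., miRNA) detection using a probe-functionalized non-membrane protein pore, illustrated with the T4 gp20 portal protein. In the presence of the target miRNA, current blockade events are observed, indicating detection of the miRNA at the single-molecule level. Conducting buffer: 1 M KCl, 5 mM Tris, pH 7.6. Applied voltage: 100 mV. Membrane: PMOXA6-PDMS35-PMOXA6.
FIG. 17A shows an example of a polyoxazoline-based triblock copolymer for inserting a non-membrane protein as a nanopore, illustrated with PMOXA6-PDMS65-PMOXA6.
FIG. 17B shows the stability of a planar membrane over the course of 2 days. The membrane shows no sign of leakage. Membrane: PMOXA6-PDMS35-PMOXA6.
FIGS. 17C and 17D show examples of inserting non-membrane protein pores by fusing polymersomes with a planar polymer membrane, illustrated with the phi29 gp9 tail protein (FIG. 17C) and the P22 gp1 portal protein (FIG. 17D). Planar membrane and polymersome composition in FIG. 17C: PMOXA11-PDMS65-PMOXA11; planar membrane and polymersome composition in FIG. 17D: PMOXA5-PDMS13-PMOXA5.
FIG. 18 shows an example of a non-membrane protein with a Spytag/Spycatcher system for incorporating an analyte probe, illustrated with the phi29 gp10 portal protein. A Coomassie blue-stained SDS-PAGE gel shows that the Spytag incorporated in the phi29 gp10 portal protein binds the PSA-Spycatcher protein to form a complex. The functionalized pore can then be used to sense PSA, as shown in FIG. 19.
FIG. 19 shows an example of a non-membrane protein channel used for analyte detection with the spytag-spycatcher system, illustrated with the phi29 gp10 portal protein. After direct insertion, the spytag/spycatcher (PSA probe) on the phi29 gp10 portal is able to detect PSA in solution (10 ng/uL), seen as current blockade events. Conducting buffer: 1 M KCl, 5 mM Tris, pH 7.6. Applied voltage: 100 mV. Membrane: PMOXA6-PDMS35-PMOXA6.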
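Detailed description
The present disclosure provides robust nanopore systems and methods suitable for detecting different analytes. They allow detection of single molecules and of target molecules present together with other contaminants. The system provides label-free, amplification-free, real-time detection. It requires very small sample volumes and can be used for high-throughput analysis. The system can be adapted to detect, with high sensitivity and specificity, a variety of analytes (e.g., small molecules, polymers, polypeptides, and nucleotides) of different shapes, sizes, and hydrophilic/hydrophobic properties.
The present disclosure meets the need in the art by providing nanopores formed from non-membrane proteins (e.g., proteins derived from bacteriophages). To generate the disclosed nanopores, bacteriophage proteins are expressed, purified, self-assembled, and inserted in a lipid membrane or polymer membrane. However, non-membrane protein channels are more difficult to insert directly into lipid bilayers or polymer membranes. Unlike membrane protein channels, non-membrane protein channels generally lack a hydrophobic middle layer flanked by hydrophilic layers at both ends. The invention overcomes this limitation by employing a series of methods for inserting various protein channels into polymer membranes, either directly or through an efficient fusion mechanism. In addition, in some cases protein engineering, such as site-directed mutagenesis, insertion and deletion of amino acids, and introduction of functional modules, is performed to tune the nanopore properties to meet different detection needs.
The disclosed nanopore system has a wide range of applications, including but not limited to single-molecule detection, DNA/RNA/peptide sequencing, sensing of chemicals, biological agents, and polymers, and disease diagnosis. As described further below, methods, kits, and detection devices employing the disclosed nanopore system are also within the scope of the present disclosure.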
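I. Nanopore assemblies
One aspect of the present disclosure relates to nanopores formed from non-membrane proteins. Non-membrane proteins suitable for forming nanopores may be derived from cellular DNA translocases, helicases, terminases, ATPases, and fragments thereof. Non-membrane proteins suitable for forming nanopores may also include proteins involved in DNA repair, replication, recombination, chromosome segregation, DNA/RNA transport, membrane sorting, cellular reorganization, cell division, bacterial binary fission, and other processes.
A nanopore may include a plurality of subunits, each comprising a non-membrane protein. For example, a nanopore may include 10 to 15 subunits of a bacteriophage portal protein or 5 to 10 subunits of a bacteriophage tail protein. The non-membrane protein channel forming the nanopore may be derived from a bacteriophage portal protein, including but not limited to those of T3, T4, T5, T7, SPP1, P22, P2, P3, λ, μ, HK97, and C1. As another example, the non-membrane protein channel forming the nanopore may be derived from a bacteriophage tail protein, including but not limited to those of phi29, C1, Neisseria meningitidis serogroup B, T4, phiX174, λ, SPP1, T5, μ, F4-1, P2, Serratia phage KSP90, Enterobacteria phage T7M, and phage HK97.
Variants and homologs having significant identity to bacteriophage proteins (e.g., bacteriophage portal proteins or tail proteins) are also within the scope of the present disclosure, including but not limited to those of T3, T4, T5, T7, SPP1, P22, P2, P3, λ, μ, HK97, and C1. For example, such variants and homologs may have a sequence with at least about 70%, about 71%, about 72%, about 73%, about 74%, about 75%, about 76%, about 77%, about 78%, about 79%, about 80%, about 81%, about 82%, about 83%, about 84%, about 85%, about 86%, about 87%, about 88%, about 89%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, or about 99% sequence identity to the sequence of a bacteriophage portal protein described herein.
Biological pore mutagenesis can be used to optimize a protein pore for assembly or for use in the compositions or methods described herein. For example, protein engineering and mutagenesis techniques can be used to mutate a biological pore and tailor its properties to a particular application. Accordingly, the nanopore assembly may include a channel formed by a plurality of subunits. Each of the subunits comprises a polypeptide having a polypeptide sequence at least 75% identical to a polypeptide sequence selected from SEQ ID NOs: 1-35. In some embodiments, the polypeptide comprises a polypeptide sequence at least 75% identical to SEQ ID NOs: 4-12.
The subunits may comprise one or more substitutions, deletions, or insertions to facilitate functionalization of the nanopore or its insertion into a membrane. For example, a subunit may comprise at least one residue substituted with cysteine, so that a functional group can be attached to the subunit via a disulfide bond. For example, cysteine residues can be used to conjugate a chemical such as porphyrin to the nanopore. In some embodiments, the subunits may include a phi29 gp9 protein having one or more of the E595C, K321C, and K358C substitutions.
In addition, the subunits may include a phi29 gp9 protein having one or more of the K134I, D138L, D139L, D158L, E163V, E309V, D311V, K321V, K356A, K358A, D377A, D381V, N388L, R524I, R539A, and E595V substitutions. Replacing charged residues (e.g., K, R, D, E) with hydrophobic residues (e.g., A, V, L) increases the hydrophobicity of the protein both locally and globally.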
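The effect of the listed substitutions on hydrophobicity can be illustrated with a minimal Python sketch. It assumes the Kyte-Doolittle hydropathy scale as the measure of hydrophobicity; the source names no scale, and the helper names (`parse_substitution`, `hydropathy_gain`, `apply_substitutions`) are hypothetical. It only sums per-residue index changes and does not model membrane insertion.

```python
import re

# Kyte-Doolittle hydropathy index per residue (higher = more hydrophobic).
# Using this scale is an assumption; the source does not name one.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Substitutions listed in the text for the phi29 gp9 tail protein.
PHI29_GP9_SUBSTITUTIONS = (
    "K134I D138L D139L D158L E163V E309V D311V K321V "
    "K356A K358A D377A D381V N388L R524I R539A E595V"
).split()


def parse_substitution(code):
    """Split a code such as 'K134I' into (old residue, 1-based position, new residue)."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", code)
    if match is None:
        raise ValueError(f"not a substitution code: {code!r}")
    return match.group(1), int(match.group(2)), match.group(3)


def hydropathy_gain(codes):
    """Sum of (new index - old index) over all substitutions."""
    total = 0.0
    for code in codes:
        old, _, new = parse_substitution(code)
        total += KYTE_DOOLITTLE[new] - KYTE_DOOLITTLE[old]
    return total


def apply_substitutions(sequence, codes):
    """Return the mutated sequence, checking that each original residue matches."""
    residues = list(sequence)
    for code in codes:
        old, position, new = parse_substitution(code)
        if position > len(residues) or residues[position - 1] != old:
            raise ValueError(f"{code}: no {old} at position {position}")
        residues[position - 1] = new
    return "".join(residues)
```

On this scale every listed substitution has a positive gain, which is consistent with the statement that the substitutions make the anchoring band more hydrophobic. `apply_substitutions` needs the real gp9 sequence (SEQ ID listing, not reproduced here) to be useful.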
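In some embodiments, the process of incorporating the protein channel into a membrane or nanodisc can benefit from an affinity tag on the protein (e.g., a polyhistidine affinity tag) and from purification of a mixed nanopore population on an affinity column (e.g., a Ni column). Affinity tags and chemical conjugation techniques (e.g., the cysteine modification described above) can be used to attach a tether to the nanopore for use in the various methods, compositions, or devices set forth herein. For example, the resulting tether can be used to attract or attach the nanopore to a solid support or an electrode.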
Table 1. Amino acid sequences of channel proteins (table images not reproduced)
Table 2. Sequences of linkers and probes (table images not reproduced)
Table 3. Nucleic acid sequences (table images not reproduced)
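The term polypeptide "variant" as used herein is a polypeptide that typically differs from a polypeptide specifically disclosed herein by one or more substitutions, deletions, additions, and/or insertions. Such variants may be naturally occurring or may be synthetically generated, for example by modifying one or more of the above polypeptide sequences of the present disclosure and evaluating one or more biological activities of the polypeptide as described herein, and/or by using any of a number of techniques well known in the art.
For example, certain amino acids may be substituted for other amino acids in a protein structure without appreciable loss of the ability of the protein to bind other polypeptides or cells. Because it is the binding capacity and nature of a protein that define its biological functional activity, certain amino acid sequence substitutions can be made in a protein sequence, and accordingly in its underlying DNA coding sequence, to obtain a protein with similar properties. It is therefore contemplated that various changes may be made in the peptide sequences of the disclosed compositions, or in the corresponding DNA sequences encoding those peptides, without appreciable loss of their biological utility or activity.
Variant sequences include those in which conservative substitutions have been introduced by modifying the polynucleotide encoding a polypeptide of the present disclosure. Amino acids can be classified according to physical properties and their contribution to secondary and tertiary protein structure. Such conservative modifications include amino acid substitutions, additions, and deletions. A conservative amino acid substitution is one in which an amino acid residue is replaced by an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art. These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine, tryptophan), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine), β-branched side chains (e.g., threonine, valine, isoleucine), and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine).
"Sequence identity" or "homology" refers to the percentage of residues in a polynucleotide or polypeptide sequence variant that are identical to those of the non-variant sequence after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent homology. In particular embodiments, polynucleotide and polypeptide variants have at least about 70%, at least about 75%, at least about 80%, at least about 90%, at least about 95%, at least about 98%, or at least about 99% polynucleotide or polypeptide homology with a polynucleotide or polypeptide described herein.
Polypeptide variant sequences may share 70% or more (e.g., 80%, 85%, 90%, 95%, 97%, 98%, 99%, or more) sequence identity with the sequences recited in the present disclosure. Polypeptide variants may also include polypeptide fragments comprising contiguous stretches of various lengths of the amino acid sequences disclosed herein. Polypeptide variant sequences include at least about 5, 10, 15, 20, 30, 40, 50, 75, 100, 150, or more contiguous peptides of one or more of the sequences disclosed herein, as well as all intermediate lengths in between.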
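II. Nanopore assemblies functionalized with probes
In one aspect, the nanopore assembly further includes a probe for detecting an analyte. The probe is operably linked to at least one subunit. The probe is conjugated to the nanopore assembly to allow selective binding of one or more analytes. Probes can be conjugated to the nanopore assembly at various stoichiometries (molar ratios of probe to nanopore assembly). In one example, a probe may be conjugated to every subunit. In another example, only one probe is conjugated to the nanopore assembly. Nanopore assemblies that include two or more different types of probes are also contemplated. Such functionalized nanopore assemblies can be used to detect analytes such as nucleic acids, amino acids, peptides, proteins, polymers, and chemical molecules. In some embodiments, the analyte is one of PSA, CEA, AFP, VCAM, MiR-155, MiR-22, MiR-7, MiR-92a, MiR-122, MiR-192, MiR-223, MiR-26a, MiR-27a, MiR-802, or a fragment thereof.
The terms "specific binding," "selective binding," "selectively binds," and "specifically binds" mean that the probe on the nanopore assembly binds the analyte in a sample more strongly than it binds other contaminants present in the sample. For example, the probe may bind the analyte with an equilibrium dissociation constant (Kd) of less than about 10^-6 M, such as less than about 10^-7 M, 10^-8 M, 10^-9 M, or 10^-10 M, or even lower, as determined by a binding assay (e.g., ELISA, equilibrium dialysis, or surface plasmon resonance (SPR) technology in a 2000 surface plasmon resonance instrument).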
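To give a feel for what these Kd thresholds mean in practice, the following Python sketch computes the equilibrium occupancy of a probe. It assumes simple 1:1 binding with free analyte in large excess over the probe, which is an idealization not stated in the source; the function name `fraction_bound` is hypothetical.

```python
def fraction_bound(analyte_molar, kd_molar):
    """Equilibrium fraction of probe occupied for 1:1 binding: [A] / ([A] + Kd).

    Assumes the free analyte concentration is not depleted by binding,
    a reasonable idealization when a single pore carries the probe.
    """
    if analyte_molar < 0 or kd_molar <= 0:
        raise ValueError("concentration must be >= 0 and Kd must be > 0")
    return analyte_molar / (analyte_molar + kd_molar)
```

Under this model a probe is half occupied when the analyte concentration equals Kd, so a lower Kd means the same occupancy is reached at a lower analyte concentration.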
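The probe may be one of a chemical, a carbohydrate, an aptamer, a nucleic acid, a peptide, a protein, an antibody, and a receptor. In some embodiments, the probe may comprise a sequence at least 75% identical to a sequence of SEQ ID NOs: 36-79. In some embodiments, the probe is an anti-PSA antibody. The probe is operably linked to at least one subunit via a covalent bond. The probe may be attached to the subunit via one or more functional sites on the subunit. Such functional sites can be introduced by mutagenesis, for example by substitution with cysteine or with non-natural amino acids. The probe can also be attached to the channel by well-established chemistries, including but not limited to ester bonds, click chemistry, and sulfhydryl bonds. In some embodiments, the probe is operably linked at a position near the entrance of the channel or at a position inside the channel.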
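III. Nanopore assemblies with membranes
In another aspect, the present disclosure provides a nanopore assembly in which the channel is inserted in a membrane. Depending on the application, the polymer membrane can have various compositions and can be free-standing or housed in a microfluidic device. Channels embedded in polymer membranes exhibit robust electrophysiological properties, which is the foremost requirement for nanopore-based single-molecule analysis. The membrane may include a polymer membrane (e.g., a planar polymer membrane) or a lipid membrane. The polymer membrane may be symmetric or asymmetric in nature. For example, the polymer membrane may be an alternating copolymer (e.g., A-B-A-B-...) or a periodic copolymer (e.g., AA-BB-AA-BB...). In another example, the polymer membrane may be a block copolymer composed of two or more homopolymer subunits linked by covalent bonds. Alternatively, the polymer membrane may be a diblock or triblock copolymer (e.g., PMOXA-PDMS-PMOXA). The polymer membrane may also be a terpolymer composed of three different monomers.
In some embodiments, the polymer membrane comprises an alternating copolymer, a periodic copolymer, a block copolymer, a diblock copolymer, a triblock copolymer, a terpolymer, or a combination thereof. In some embodiments, the polymer membrane comprises PMOXA-PDMS-PMOXA.
In some embodiments, the channel is embedded in polymersomes of various sizes. In some embodiments, the polymersomes are fused with a planar polymer membrane to insert the channel. Polymersomes are composed of hydrophilic-hydrophobic block copolymers arranged as a bilayer vesicle system with a central aqueous core. They have a hydrophilic inner core and a lipophilic bilayer. They differ from nanoparticles in that they contain a hydrophilic core rather than the lipophilic core found in nanoparticles. Although they have a bilayer structure, they are more stable than liposomes because of their thick and rigid bilayer. Their hydrophilic core provides a protein-friendly environment.
In some embodiments, the channel is inserted into the polymer membrane in the presence of any detergent of various compositions (e.g., DDM-, DOC-, Tween-, SDS-, and Brij-based detergents). In some embodiments, the protein channel is reconstituted into polymersomes in the presence of varying amounts of glycerol, CsCl, and/or sucrose for efficient fusion.
In some embodiments, the protein channel is re-engineered by mutagenesis (e.g., containing one to several substitutions) to favor insertion in a (planar) polymer membrane. In addition, for the purpose of insertion in a polymer membrane, the protein channel is re-engineered to introduce functional sites for site-specific labeling with chemicals or biopolymers. Such functional sites include cysteine residues for conjugating chemicals (such as porphyrin) or biomolecules (such as cholesterol, any hydrophobic lipid module, or nucleic acids of various lengths). Functional sites/groups may also include non-natural amino acids and linkages produced by well-established chemistries, including but not limited to ester, click chemistry, and sulfhydryl linkages.
In some embodiments, the conjugation site is in the membrane-anchoring layer of the channel, for example residues 70-80, 110-140, 155-245, 300-340, and 410-435 of the T4 gp20 portal protein; residues 10-45 and 250-300 of the P22 gp1 portal protein; and residues 130-170, 300-325, 350-390, and 530-595 of the phi29 gp9 tail protein. Accordingly, to increase or decrease the hydrophobicity of the band for membrane-insertion purposes, re-engineering may include mutagenesis (e.g., substitution, insertion, deletion) of any of the following: residues 70-80, 110-140, 155-245, 300-340, and 410-435 of the T4 gp20 portal protein; residues 450-500 and 350-380 of the P22 gp1 portal protein; and residues 130-170, 300-325, 350-390, and 530-595 of the phi29 gp9 tail protein.
In some embodiments, the conjugation site is in the cis and trans hydrophilic layers of the channel, such as residues 465-515 and 285-305 of the T4 gp20 portal protein; residues 450-500 and 350-380 of the P22 gp1 portal protein; and residues 20-50 and 250-300 of the phi29 gp9 tail protein. Accordingly, to increase or decrease the hydrophobicity of the band for membrane-insertion purposes, re-engineering may include mutagenesis (e.g., substitution, insertion, deletion) of any of the following: residues 465-515 and 285-305 of the T4 gp20 portal protein; residues 450-500 and 350-380 of the P22 gp1 portal protein; and residues 20-50 and 250-300 of the phi29 gp9 tail protein.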
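IV. Devices and kits comprising nanopore assemblies
In another aspect, the present disclosure also provides a kit and a detection apparatus for detecting an analyte. The kit may include the nanopore assembly described above, optionally a buffer, and optionally instructions for using the nanopore assembly.
The apparatus may include the nanopore assembly described above and a support for the nanopore assembly. The detection apparatus may also include electrodes embedded in a solid support. The electrodes can be used to monitor assembly of the protein nanopore into the membrane. The electrodes can also be used for data acquisition during the analyte detection step. Electrodes used for monitoring and detection need not be embedded in the support and may, for example, be provided in a separate application-specific integrated circuit (ASIC) chip.
As used herein, the term "support" refers to a rigid substrate that is insoluble in aqueous liquid and that does not allow liquid to pass through in the absence of an aperture. Exemplary solid supports include, but are not limited to, glass and modified or functionalized glass, plastics (including acrylics, polystyrene and copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, polyurethane, Teflon™, cyclic olefins, polyimides, and the like), nylon, ceramics, resins, Zeonor, silica or silica-based materials (including silicon and modified silicon), carbon, metals, inorganic glasses, optical fiber bundles, and polymers. Particularly useful supports include modified silicon, for example a SiN film on a Si substrate. For some embodiments, the support is located within a flow cell apparatus or other vessel.
In some embodiments, the nanopores are fabricated on a substrate such as a chip, disc, block, or plate. Such substrates can be made from a variety of materials including, but not limited to, silicon, glass, ceramic, germanium, polymers (e.g., polystyrene), and/or gallium arsenide. The substrate may or may not be etched; for example, the chip may be a semiconductor chip.
In particular embodiments, the detection apparatus may include a reservoir in contact with an array of nanopores. The reservoir may house electrodes positioned to apply a current through the apertures formed by the protein nanopores.
In some embodiments, a detection apparatus of the present disclosure may include (a) an electrode; (b) a nanopore tethered to the electrode; and (c) a membrane surrounding the nanopore. Multiplexed embodiments are also provided. For example, the detection apparatus may include (a) a plurality of electrodes; (b) a plurality of nanopores, each tethered to one of the plurality of electrodes; and (c) a membrane surrounding each of the nanopores.
The nanopore can be tethered to the electrode (e.g., via an insulating pad) using covalent moieties or non-covalent binding moieties. One example of a covalent attachment is a nucleic acid tether covalently attached to the nanopore and covalently attached to the insulating pad. Other tethers can similarly be used for covalent attachment, including non-nucleic acid tethers such as polyethylene glycol or other synthetic polymers. One example of a non-covalent attachment is a nanopore bearing an attached affinity moiety, such as a polyhistidine tag, Strep-tag, or other amino acid-encoded affinity moiety. The affinity moiety can bind non-covalently to a ligand on the insulating pad, such as nickel or another divalent cation that binds polyhistidine, or biotin (or an analog thereof) that binds Strep-tag. In some embodiments, such amino acid affinity moieties need not be used.
As described herein, a nanopore (whether a hybrid nanopore or a tethered nanopore) can be coupled to a detection circuit to record the electrical signals in the methods of the present disclosure; such detection circuits include, for example, a patch-clamp circuit, a tunneling electrode circuit, or a transverse conductance measurement circuit (such as a graphene nanoribbon or a graphene nanogap). In addition, the pore can be coupled to an optical sensor that detects a label on a polynucleotide, for example a fluorescent moiety or a Raman signal-generating moiety.
The detection apparatus of the present disclosure can be used to detect any of a variety of analytes, including but not limited to ions, nucleic acids, nucleotides, polypeptides, biologically active small molecules, lipids, sugars, and the like. Accordingly, in the apparatus set forth herein, one or more of these analytes may be present in, or pass through, the aperture of the protein nanopore.
Other detection techniques that can be applied to the apparatus set forth herein include, but are not limited to, detecting events such as the motion of a molecule or a portion of that molecule, particularly where the molecule is DNA or a DNA-binding enzyme such as a polymerase. For example, Olsen et al., JACS 135:7855-7860 (2013), incorporated herein by reference, discloses bioconjugating single molecules of the Klenow fragment (KF) of DNA polymerase I into electronic nanocircuits so as to allow electronic recording of enzyme function and dynamic variability with resolution of individual nucleotide incorporation events. Or, for example, Hurt et al., JACS 131:3772-3778 (2009), incorporated herein by reference, discloses measuring the dwell time of complexes of DNA with KF atop a nanopore in an applied electric field. Or, for example, Kim et al., Sens. Actuators B Chem. 177:1075-1082 (2012), incorporated herein by reference, discloses using a current-measuring sensor in experiments involving DNA captured in an α-hemolysin nanopore. Or, for example, Garalde et al., J. Biol. Chem. 286:14480-14492 (2011), incorporated herein by reference, discloses distinguishing KF-DNA complexes on the basis of their properties when captured in an electric field atop an α-hemolysin pore. Other references that disclose measurements involving α-hemolysin include the following (all by Howorka et al. and incorporated herein by reference): PNAS 98:12996-13301 (2001); Biophysical Journal 83:3202-3210 (2002); and Nature Biotechnology 19:636-639 (2001).
In some embodiments, the invention relates to channel proteins assembled into lipid bilayer membranes. The presence of an analyte is monitored through the ionic current passing through the pore at a fixed applied potential, where an interruption of the current indicates interaction of the analyte with the channel protein. In some embodiments, a stabilized sensor chip contains a single protein nanopore. Protein nanopore sensor chips can be applied to measurements performed at the single-molecule level, that is, stochastic sensing. By monitoring the ionic current through the pore at a fixed applied potential, various analytes can be distinguished on the basis of the amplitude and duration of individual current blockade events.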
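V. Methods for detecting analytes using nanopore assemblies
In another aspect, the present disclosure provides a method of detecting/sensing an analyte. The method includes: (1) contacting a sample containing the analyte with the nanopore assembly; (2) applying a current across the channel of the nanopore assembly; (3) determining the current through the channel at one or more time intervals; and (4) comparing the current measured at the one or more time intervals with a reference current, wherein a change in current relative to the reference current indicates the presence of the analyte in the sample. In some embodiments, the reference current is measured with a sample that does not contain the analyte. In some embodiments, the nanopore assembly is placed on a support.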
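Steps (3) and (4) of the method can be sketched in Python as a comparison of sampled currents against an analyte-free reference trace. This is a minimal illustration, not the procedure used in the source: the decision rule (deviation beyond a multiple of the reference noise), the default `n_sigma` value, and the function name `analyte_detected` are all assumptions.

```python
from statistics import mean, stdev


def analyte_detected(reference_pA, measured_pA, n_sigma=5.0):
    """Return True if any measured sample deviates from the reference current.

    reference_pA: currents recorded with an analyte-free sample (step 4 reference).
    measured_pA:  currents recorded at one or more time intervals (step 3).
    n_sigma:      hypothetical threshold, in units of reference noise.
    """
    baseline = mean(reference_pA)
    noise = stdev(reference_pA)  # needs at least two reference samples
    threshold = n_sigma * noise
    return any(abs(current - baseline) > threshold for current in measured_pA)
```

A call such as `analyte_detected(reference_trace, sample_trace)` flags a trace in which at least one sample drops (or rises) well outside the reference noise band. Real recordings would normally also require a minimum event duration to reject single-sample spikes.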
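In some embodiments, the method may include measuring changes in ionic current and/or current signature induced by translocation of each unit of the analyte or polymer through the channel. In some embodiments, the method may include measuring changes in ionic current and/or current signature induced by transient or permanent binding of each unit of the analyte or polymer to a probe coupled to the channel, where the binding produces the change in current signature.
The analyte detectable by the methods disclosed herein may be any of a nucleic acid, an amino acid, a peptide, a protein, a polymer, and a chemical molecule. The nucleic acid detected in the method may be single-stranded, double-stranded, or contain both single-stranded and double-stranded sequence. A nucleic acid molecule may originate in double-stranded form (e.g., dsDNA) and may optionally be converted to single-stranded form. A nucleic acid molecule may also originate in single-stranded form (e.g., ssDNA, ssRNA), and ssDNA may optionally be converted to double-stranded form.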
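VI. Definitions
To aid understanding of the detailed description of the compositions and methods according to the present disclosure, some explicit definitions are provided to facilitate an unambiguous disclosure of its various aspects.
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs.
It should be noted that, as used in this specification and the appended claims, the singular forms "a," "an," and "the" include plural referents unless the context clearly dictates otherwise. Unless otherwise stated, the terms "including," "comprising," "containing," or "having" and variations thereof are intended to encompass the items listed thereafter and equivalents thereof as well as additional subject matter.
As used herein, the term "each," when used in reference to a collection of items, is intended to identify an individual item in the collection but does not necessarily refer to every item in the collection. Exceptions may occur if explicit disclosure or context clearly dictates otherwise.
Unless otherwise stated, the use of any and all examples or exemplary language (e.g., "such as") provided herein is intended merely to better illustrate the invention and does not pose a limitation on the scope of the invention. No language in the specification should be construed as indicating that any non-claimed element is essential to the practice of the invention.
As used herein, the terms "nanopore" and "channel" refer to a structure having a nanoscale passageway through which ionic current can flow. The inner diameter of a nanopore can vary considerably depending on the intended use of the device. Typically, a channel or nanopore has an inner diameter of at least about 0.5 nm, usually at least about 1 nm, and more usually at least about 1.5 nm, where the diameter may be as large as 50 nm or larger but in many embodiments will not exceed about 10 nm and usually will not exceed about 2 nm.
As used herein, the term "analyte" refers to a substance or chemical constituent that is undergoing analysis or is sought to be detected. The invention is not intended to be limited to a particular analyte. Representative analytes include ions, sugars, proteins, nucleic acids, and nucleic acid sequences.
As used herein, the term "membrane" refers to a sheet or other barrier that prevents the passage of electrical current or fluid. In contrast to the solid supports set forth herein, a membrane is typically flexible or compressible. The membrane can be made from lipid material, for example to form a lipid bilayer, or from non-lipid material. The membrane can take the form of, for example, a copolymer membrane formed by diblock or triblock polymers, or a monolayer formed by, for example, a bolalipid. See, for example, Rakhmatullina et al., Langmuir: the ACS Journal of Surfaces and Colloids 24:6254-6261 (2008), which is incorporated herein by reference.
As used herein, the term "lipid membrane" means a membrane made mainly of compounds comprising saturated or unsaturated, branched or unbranched, aromatic or non-aromatic hydrocarbon groups. The membrane can be composed of a variety of lipids. Examples of lipids include, but are not limited to, fatty acids, monoglycerides, diglycerides and triglycerides, glycerophospholipids, sphingolipids, steroids, lipoproteins, and glycolipids.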
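"Nucleic acid," "nucleic acid sequence," or "nucleic acid molecule" refers to deoxyribonucleotides or ribonucleotides and polymers thereof in either single- or double-stranded form. The term nucleic acid is used interchangeably with gene, complementary DNA (cDNA), messenger RNA (mRNA), oligonucleotide, and polynucleotide. The term encompasses nucleic acids containing known nucleotide analogs or modified backbone residues or linkages, which are synthetic, naturally occurring, and non-naturally occurring, which have binding properties similar to those of the reference nucleic acid, and which are metabolized in a manner similar to the reference nucleotides. Examples of such analogs include, but are not limited to, phosphorothioates, phosphoramidates, methyl phosphonates, chiral methyl phosphonates, 2-O-methyl ribonucleotides, and peptide nucleic acids (PNAs). The term encompasses molecules formed from any of the known base analogs of DNA and RNA, such as, but not limited to, 4-acetylcytosine, 8-hydroxy-N6-methyladenine, aziridinylcytosine, pseudoisocytosine, 5-(carboxyhydroxymethyl)uracil, 5-fluorouracil, 5-bromouracil, 5-carboxymethylaminomethyl-2-thiouracil, 5-carboxymethylaminomethyluracil, dihydrouracil, inosine, N6-isopentenyladenine, 1-methyladenine, 1-methylpseudouracil, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-methyladenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5'-methoxycarbonylmethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid methyl ester, uracil-5-oxyacetic acid, oxybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, N-uracil-5-oxyacetic acid methyl ester, uracil-5-oxyacetic acid, pseudouracil, queosine, 2-thiocytosine, and 2,6-diaminopurine.
Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions) and complementary sequences, as well as the sequence explicitly indicated. Specifically, in some aspects, degenerate codon substitutions are achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (Batzer, Nucleic Acid Res. 19:5081, 1991; Ohtsuka et al., J. Biol. Chem. 260:2605-8, 1985; Rossolini et al., Mol. Cell. Probes 8:91-8, 1994). The term nucleic acid is used interchangeably with gene, cDNA, mRNA, oligonucleotide, and polynucleotide.
"Polypeptide" is used in its conventional meaning, that is, as a sequence of amino acids. Polypeptides are not limited to a specific length of product. Peptides, polypeptides, and proteins are included within the definition of polypeptide, and such terms may be used interchangeably herein unless specifically indicated otherwise. The term also includes post-expression modifications of the polypeptide, for example glycosylation, acetylation, phosphorylation, and the like, as well as other modifications known in the art, both naturally occurring and non-naturally occurring. A polypeptide may be an entire protein or a subsequence thereof.
The term "identical" or percent "identity," as known in the art, refers to a relationship between the sequences of two or more polypeptide molecules or two or more nucleic acid molecules, as determined by comparing the sequences. In the art, "identity" also means the degree of sequence relatedness between nucleic acid molecules or polypeptides, as the case may be, as determined by the match between strings of two or more nucleotide or two or more amino acid sequences. "Identity" measures the percentage of identical matches between the smaller of two or more sequences with gap alignments (if any) addressed by a particular mathematical model or computer program (i.e., an "algorithm"). "Substantial identity" refers to a sequence with at least about 70%, about 71%, about 72%, about 73%, about 74%, about 75%, about 76%, about 77%, about 78%, about 79%, about 80%, about 81%, about 82%, about 83%, about 84%, about 85%, about 86%, about 87%, about 88%, about 89%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, or about 99% sequence identity to a specified sequence. In some aspects, the identity exists over a region at least about 50-100 amino acids or nucleotides in length. In other aspects, the identity exists over a region at least about 100-200 amino acids or nucleotides in length. In other aspects, the identity exists over a region at least about 200-500 amino acids or nucleotides in length. In certain aspects, percent sequence identity is determined using a computer program selected from GAP, BLASTP, BLASTN, FASTA, BLASTA, BLASTX, BestFit, and the Smith-Waterman algorithm.
The term "similarity" is a related concept but, in contrast to "identity," refers to a measure of relatedness that includes both identical matches and conservative substitution matches. If two polypeptide sequences have, for example, 10/20 identical amino acids and the remainder are all non-conservative substitutions, then the percent identity and the percent similarity are both 50%. If, in the same example, there are five more positions with conservative substitutions, the percent identity remains 50% but the percent similarity is 75% (15/20). Therefore, where there are conservative substitutions, the percent similarity between two polypeptides will be higher than the percent identity between those two polypeptides.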
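The 10/20 worked example above can be reproduced with a few lines of Python. This is only an arithmetic sketch of the two definitions; it takes position counts as input rather than performing an alignment, and the function name `identity_and_similarity` is hypothetical.

```python
def identity_and_similarity(n_positions, n_identical, n_conservative):
    """Return (percent identity, percent similarity) for an aligned region.

    n_positions:    number of compared positions in the alignment
    n_identical:    positions with identical residues
    n_conservative: positions with conservative (non-identical) substitutions
    """
    if n_positions <= 0 or n_identical + n_conservative > n_positions:
        raise ValueError("inconsistent position counts")
    identity = 100.0 * n_identical / n_positions
    similarity = 100.0 * (n_identical + n_conservative) / n_positions
    return identity, similarity
```

Calling `identity_and_similarity(20, 10, 0)` gives the first case in the text (50% for both), and `identity_and_similarity(20, 10, 5)` gives the second (50% identity, 75% similarity). Deciding which substitutions count as conservative depends on the side-chain families chosen, which this sketch leaves to the caller.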
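It should also be specifically understood that any numerical value recited herein includes all values from the lower value to the upper value; that is, all possible combinations of numerical values between the lowest value and the highest value enumerated are to be considered expressly stated in this application. For example, if a concentration range is stated as about 1% to 50%, it is intended that values such as 2% to 40%, 10% to 30%, or 1% to 3% are expressly enumerated in this specification. The values listed above are only examples of what is specifically intended.
In various aspects, ranges are expressed herein as from "about" or "approximately" one particular value and/or to "about" or "approximately" another particular value. When values are expressed as approximations by use of the antecedent "about," it will be understood that some amount of variation is included in the range.
The term "operably linked" refers to a functional linkage between a nucleic acid expression control sequence (such as a promoter or an array of transcription factor binding sites) and a second nucleic acid sequence, wherein the expression control sequence directs transcription of the nucleic acid corresponding to the second sequence.
As used herein, the term "purified" or "substantially purified" means that the desired protein is enriched by at least 20%, more preferably at least 50%, even more preferably at least 75%, and most preferably at least 90% or even 95%.
Each publication, patent application, patent, and other reference cited herein is incorporated by reference in its entirety to the extent that it is consistent with the present disclosure.
The publications disclosed herein are provided solely for their disclosure prior to the filing date of the present invention. Nothing herein is to be construed as an admission that the invention is not entitled to antedate such publication by virtue of prior invention. Further, the dates of publication provided may differ from the actual publication dates, which may need to be independently confirmed.
Unless otherwise indicated herein, the recitation of ranges of values herein is merely intended to serve as a shorthand method of referring individually to each separate value and each endpoint falling within the range, and each separate value and endpoint is incorporated into the specification as if it were individually recited herein.
All methods described herein can be performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. With respect to any of the methods provided, the steps of the method may occur simultaneously or sequentially. When the steps of a method occur sequentially, they may occur in any order unless otherwise stated.
Where a method includes a combination of steps, each and every combination or sub-combination of the steps is encompassed within the scope of the present disclosure unless otherwise stated herein.
The section headings used herein are for organizational purposes only and are not to be construed as limiting the subject matter described.
It should be understood that the examples and embodiments described herein are for illustrative purposes only, and that various modifications or changes in light of them will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and the scope of the appended claims.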
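VII. Examples
Example 1
This example describes the materials and methods used in the subsequent examples.
Materials and methods
Reconstitution of non-membrane protein channels
Non-membrane protein channels are difficult to insert directly into lipid bilayers or polymer membranes because, unlike membrane protein channels, they generally lack a hydrophobic middle layer and hydrophilic layers at both ends, and therefore require extensive engineering. For use in biosensing or sequencing, non-membrane protein channels also need to be engineered and functionalized with different probes. Although non-membrane protein channels may originate from a variety of sources with different sequences, shapes, structures, or properties, the strategies and methods for re-engineering them and enabling their use as nanopores share some common features. As shown in FIGS. 1A and 1B, three distinct domains are important for membrane anchoring, and two regions are particularly important for conjugation of functional modules for single-molecule sensing. Re-engineering these different domains to change their hydrophobic or hydrophilic properties, or to introduce conjugation sites or probes for functionalization, involves a series of molecular cloning steps. Typically, a general restriction enzyme cloning method is used for the various engineering purposes. However, other cloning methods can also be used, such as recombination cloning, [...] cloning, isothermal assembly reactions, or type IIS assembly.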
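Expression and purification of non-membrane proteins
Although non-membrane protein channels may originate from a variety of sources with different sequences, shapes, structures, or properties, the strategies and methods for expressing and purifying them share some common features.
First, the re-engineered non-membrane protein channel gene is cloned into an expression vector with or without a tag at a terminus. The vector is then transformed into a suitable protein expression host, for example an Escherichia coli (E. coli) system. After the protein channel is expressed in the host, the host is lysed and a series of steps is taken to remove host debris. Finally, the non-membrane protein can be purified by one or a combination of methods such as affinity chromatography, exchange chromatography, size-exclusion chromatography, or other commonly used purification methods.
For example, the non-membrane protein channel gene and variant genes are cloned into the PET23a vector with a His tag at the N-terminus. After transformation into BL21(DE3) cells, one BL21(DE3) colony is inoculated into 5 mL of fresh LB medium in the presence of antibiotic and grown in a shaker at 37 °C for several hours until the OD reaches 0.8. The flask is then kept at 4 °C to cool. IPTG is then added to the flask to 0.5 mM for induction. The bacteria are grown overnight in a shaker at 16 °C. The bacteria are harvested at 8000 rpm for 10 min, the supernatant is discarded, and the pellet is resuspended in lysis buffer. The bacterial suspension is sonicated until it becomes clear and no longer viscous. After sonication the solution is centrifuged at 12000 rpm for 30 min, the supernatant is discarded, and the pellet is collected. 10 mL of urea (8 M) is added to the pellet, which is shaken at low speed on a shaker until it dissolves completely in the urea. The solution is centrifuged at 12000 rpm for 10 min. The supernatant is added to 100 mL of protein refolding buffer and stirred overnight. After refolding, the solution is centrifuged at 12000 rpm for 30 min, the pellet is discarded, and the supernatant is collected. The supernatant is passed through a 0.45 µm syringe filter to remove denatured protein. The supernatant is placed in a dialysis bag, and the dialysis buffer is changed three times. The liquid in the dialysis bag is collected and centrifuged at 12000 rpm for 10 min; the pellet is discarded and the supernatant is collected. Nickel beads are equilibrated with lysis buffer, and the supernatant is added to the beads. The beads are then washed with wash buffer. The protein is eluted with 7 to 10 column volumes of elution buffer. The eluate is collected and concentrated to 5 mL. The eluate is centrifuged at 12000 rpm for 10 min, and the supernatant is then drawn up and injected with a syringe into an AKTA FPLC. Before injection, the sample loop is washed with 10 mL of lysis buffer. The protein is collected after passing through the size-exclusion column. After SEC, an SDS-PAGE gel is run to check the protein sample, which is stored at -80 °C.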
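Conjugation
Various conjugation methods can be used to functionalize non-membrane protein channels with different probes or to enhance their hydrophobicity or hydrophilicity. Although non-membrane protein channels may originate from a variety of sources with different sequences, shapes, structures, or properties, the strategies and methods used for conjugation share some common features. To conjugate a hydrophobic moiety to a non-membrane protein channel, the reaction can be carried out on cysteine groups of the protein channel. Each subunit of the protein channel contains one or more cysteines that are located in the middle layer of the channel and are solvent-accessible. The protein solution is prepared in a pH 6.8 buffer containing 0.5 M NaCl, 50 mM Tris, and 15% glycerol. The solution is degassed, and a 100-fold excess of TCEP is added. After incubation at room temperature for 20 min, 4 μL of 4 mM cholesterol-PEG-maleimide is added dropwise to the protein solution, and the reaction mixture is incubated at room temperature in the dark for 2 hours. Excess cholesterol-PEG-maleimide is removed with a NanoSep 100K spin column. Labeling of the protein is checked by 12% SDS-PAGE.
To conjugate a probe to a non-membrane protein channel, click chemistry is used, such as tetrazine-alkene ligation or azide-alkyne click chemistry. For tetrazine-alkene ligation, the protein solution is prepared in a pH 6.8 buffer containing 0.5 M NaCl, 50 mM Tris, and 15% glycerol. The solution is degassed, and a 100-fold excess of TCEP is added. After incubation for 20 min, 1.6 μL of 10 mM methyltetrazine-PEG4-maleimide is added dropwise to the protein solution, and the mixture is incubated at room temperature in the dark for 1 hour. Excess methyltetrazine-PEG4-maleimide is removed by desalting on a desalting spin column. 50 μL of probe solution is prepared in a pH 6.8 buffer containing 0.5 M NaCl and 50 mM Tris. The solution is degassed, and a 100-fold excess of TCEP is added. After incubation for 20 min, 6 μL of 25 mM TCO-PEG3-maleimide is added dropwise to the oligomer solution, and the mixture is incubated at room temperature for 2 hours. Excess TCO-PEG3-maleimide is removed with a desalting column. Labeling of the probe is verified on a 20% urea-PAGE gel.
For azide-alkyne ligation, the TCO-modified probe is mixed with the methyltetrazine-labeled protein at different molar ratios to achieve optimal protein labeling efficiency. The mixture is incubated at room temperature for 1 hour. Conjugation is verified by 12% SDS-PAGE. 40 μL of a 20 μM protein solution is prepared in a pH 6.8 buffer containing 0.5 M NaCl, 50 mM Tris, and 15% glycerol. The solution is degassed, and a 100-fold excess of TCEP is added. After incubation for 20 min, 1.6 μL of 10 mM sulfo-DBCO-PEG4-maleimide is added dropwise to the protein solution, and the mixture is incubated at room temperature in the dark for 1 hour. Excess maleimide reagent is removed by desalting on a desalting spin column. The DBCO-modified protein is mixed with the azide-modified probe at equimolar concentration, and the reaction mixture is incubated at room temperature for 2 hours. Conjugation is verified on a 12% SDS-PAGE gel.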
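Because the DBCO labeling step states both the protein amount (40 μL at 20 μM) and the reagent amount (1.6 μL at 10 mM), the implied molar excess of maleimide reagent can be checked with simple arithmetic. The Python sketch below does this; the helper names are hypothetical, and whether the 20 μM refers to subunits or assembled channels is not stated in the source, so the ratio is per stated protein concentration only.

```python
def nanomoles(volume_uL, concentration_mM):
    """Amount of substance in nmol: microlitres times millimolar equals nanomoles."""
    return volume_uL * concentration_mM


def molar_excess(reagent_nmol, protein_nmol):
    """Ratio of reagent to protein on a molar basis."""
    if protein_nmol <= 0:
        raise ValueError("protein amount must be positive")
    return reagent_nmol / protein_nmol


# Values quoted in the text for the sulfo-DBCO-PEG4-maleimide labeling step.
protein_nmol = nanomoles(40.0, 0.020)   # 40 uL of 20 uM protein
reagent_nmol = nanomoles(1.6, 10.0)     # 1.6 uL of 10 mM maleimide reagent
excess = molar_excess(reagent_nmol, protein_nmol)
```

With these inputs the protein amount is 0.8 nmol, the reagent amount is 16 nmol, and the reagent is therefore in roughly 20-fold molar excess.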
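Nanopore Experimental Setup and Data Recording
Although non-membrane protein channels may derive from various sources with different sequences, shapes, structures, or properties, all non-membrane protein channels or variants can be used in similar nanopore setups or devices. Typically, the setup includes a sensor chip or array with one or more apertures. The sensor chip can support the formation of a lipid membrane or polymer membrane, which divides the chamber into cis (top) and trans (bottom) compartments. Both compartments are filled with conducting buffer. An input electrode is embedded in one compartment and a ground electrode in the other. In addition, the setup can be combined with a fluidic system so that sample can flow from a reservoir to the sensor device. To insert a non-membrane protein channel into the lipid or polymer membrane, the protein channel, suspended in its respective storage buffer, is diluted 50- to 100-fold in conducting buffer (typically 1 M KCl or 1 M NaCl, 5 mM HEPES or Tris, pH 7.6) and added to the top compartment. Under an applied potential (constant holding voltage or ramping voltage), with or without detergent, direct insertion of the protein channel into the planar membrane can be observed. When no analyte is present, the current is stable and flat. When analyte is present, its interaction with the probe produces recorded changes in current.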
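The contrast between a flat open-pore current and analyte-induced current changes can be illustrated with a simple threshold detector. This is only a minimal sketch under stated assumptions: the function name `detect_events`, the 20% threshold, the positive-current convention, and the trace values are all hypothetical and are not taken from the patent, which does not disclose its event-detection algorithm.

```python
from typing import List, Tuple


def detect_events(current_pa: List[float], baseline_pa: float,
                  threshold_fraction: float = 0.2) -> List[Tuple[int, int]]:
    """Return (start, end) sample indices of blockade events; end is exclusive.

    A sample counts as blocked when the current falls below the open-pore
    baseline by more than threshold_fraction of that baseline. Assumes a
    positive baseline current. The 20% default is an arbitrary illustrative value.
    """
    cutoff = baseline_pa * (1.0 - threshold_fraction)
    events: List[Tuple[int, int]] = []
    start = None
    for i, value in enumerate(current_pa):
        blocked = value < cutoff
        if blocked and start is None:
            start = i                      # event begins
        elif not blocked and start is not None:
            events.append((start, i))      # event ends
            start = None
    if start is not None:                  # trace ended while still blocked
        events.append((start, len(current_pa)))
    return events


# Synthetic trace in pA, invented purely for illustration.
trace = [100.0, 100.0, 60.0, 60.0, 60.0, 100.0, 100.0, 55.0, 100.0]
print(detect_events(trace, baseline_pa=100.0))  # [(2, 5), (7, 8)]
```

Dividing each index span by the sampling rate would give the event duration, and the gaps between spans would give the time between events.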
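Data Analysis
Although non-membrane protein channels may derive from various sources with different sequences, shapes, structures, or properties, the strategies and methods used for data analysis share some common features. Typically, about 10,000 or more current blockade events (translocation or single-molecule binding events) are analyzed to ensure that the results are statistically significant. Custom algorithms based on MATLAB or Python were developed for rapid quantitative processing of events. Typically, two parameters are used: (1) the current blockade fraction, expressed as [(Current_unblocked - Current_after analyte blockade) / Current_unblocked]; and (2) the dwell times: τ_off (the duration of an event) and τ_on (the time between consecutive events). From τ_on and τ_off, k_on (the association rate constant) and k_off (the dissociation rate constant) can be obtained, and finally Kd (the equilibrium dissociation constant). A calibration curve is constructed showing the capture rate as a function of analyte concentration. After an analyte of unknown concentration is introduced into the nanopore, the mean capture rate can be calculated and the analyte concentration determined from the calibration curve. Analysis of clinical samples requires further tuning of the analysis algorithm to clearly distinguish "contaminating" or nonspecific signals from true analyte-induced events. To standardize the analyte detection system, an "endogenous" normalizer with a "spike-in" control is typically used. The platform data are then cross-validated using standard assays common in diagnostics, such as immunoassays and qRT-PCR. Finally, statistical sample size and power analysis is based on a two-sample t-test for two-group comparisons and a two-way analysis of variance (ANOVA) for combinations of two factors.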
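The two parameters described above, the blockade fraction and the dwell times, can be turned into rate constants as sketched below. The patent does not state the formulas it uses; this sketch assumes the simple bimolecular binding model commonly used in single-channel stochastic sensing, where the association rate constant is 1/(mean inter-event time × analyte concentration), the dissociation rate constant is 1/(mean event duration), and Kd is their ratio. The function names and the numerical inputs are hypothetical.

```python
from statistics import mean
from typing import Sequence, Tuple


def blockade_fraction(i_open: float, i_blocked: float) -> float:
    """(unblocked current - blocked current) / unblocked current."""
    return (i_open - i_blocked) / i_open


def rate_constants(inter_event_s: Sequence[float],
                   event_duration_s: Sequence[float],
                   analyte_molar: float) -> Tuple[float, float, float]:
    """Return (k_on in 1/(M*s), k_off in 1/s, Kd in M).

    Assumption: simple bimolecular binding, so
    k_on = 1 / (mean inter-event time * [analyte]) and
    k_off = 1 / mean event duration.
    """
    k_on = 1.0 / (mean(inter_event_s) * analyte_molar)
    k_off = 1.0 / mean(event_duration_s)
    return k_on, k_off, k_off / k_on


# Invented values for illustration only (seconds, molar).
k_on, k_off, kd = rate_constants([0.5, 1.5], [0.1, 0.3], analyte_molar=1e-8)
print(blockade_fraction(100.0, 60.0))
print(k_on, k_off, kd)
```

With real recordings the dwell times are usually fitted to exponential distributions rather than simply averaged; the mean is used here only to keep the sketch short.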
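Example 2
Expression and Purification of phi29 gp-9 Tail Protein
The purified phi29 gp-9 tail protein and its variants on SDS-PAGE are shown in Figures 3 to 4. The expression and purification steps are as follows. The phi29 gp-9 tail protein gene and variant genes are cloned into the vector pBDHT with an N-terminal His tag. After transformation into BL21(DE3), a single BL21 colony is inoculated into 5 mL of fresh LB medium containing antibiotic and grown in a shaker (220 rpm) at 37 °C for several hours until the OD reaches 0.8. The flask is then held at 4 °C to cool, and 0.5 mM IPTG is added to the flask for induction. The bacteria are grown overnight in a shaker (180 rpm) at 16 °C. The bacteria are harvested at 8000 rpm for 10 min, the supernatant is discarded, and the pellet is resuspended in lysis buffer (50 mM Tris pH 8.0, 500 mM NaCl). The bacterial suspension is sonicated until it becomes clear and non-viscous. After sonication, the solution is centrifuged at 12000 rpm for 30 min; the supernatant is discarded and the pellet is collected. 10 mL of urea (8 M) is added to the pellet, which is shaken at low speed on a shaker until it is completely dissolved in the urea. The solution is centrifuged at 12000 rpm for 10 min. The supernatant is added to 100 mL of protein refolding buffer (15% glycerol, 500 mM NaCl, 50 mM Tris, 2 M L-arginine, pH 8.0) and stirred overnight. After refolding, the solution is centrifuged at 12000 rpm for 30 min; the pellet is discarded and the supernatant is collected. The supernatant is passed through a 0.45 µm syringe filter to remove denatured protein. The supernatant is transferred to a dialysis bag, and the dialysis buffer (50 mM NaCl, 5 mM Tris) is changed three times. The liquid in the dialysis bag is collected and centrifuged at 12000 rpm for 10 min; the pellet is discarded and the supernatant is collected. Nickel beads are equilibrated with lysis buffer and the supernatant is added to the beads. The beads are washed with 50 column volumes of wash buffer (50 mM Tris pH 8.0, 500 mM NaCl, 25 mM imidazole). The protein is eluted with 7 to 10 column volumes of elution buffer (50 mM Tris pH 8.0, 500 mM NaCl, 500 mM imidazole). The eluate is collected and concentrated to 5 mL. The eluate is centrifuged at 12000 rpm for 10 min, and the supernatant is then drawn up and injected by syringe into an AKTA FPLC. Before injection, the sample loop is washed with 10 mL of lysis buffer. The protein is collected after it passes through the size-exclusion column. After SEC, an SDS-PAGE gel is run to check the protein sample, which is stored at -80 °C.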
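Example 3
Bacteriophage Tail Proteins as Nanopores
Many bacteriophages have long contractile or non-contractile tails, or short non-contractile tails. The tail plays a key role in host cell recognition, membrane penetration, and viral genome ejection. Tail proteins are derived from sources including, but not limited to, phi29, T4, T3, T5, T7, SPP1, P22, P2, P3, λ, μ, HK97 and C1.
These protein channels of the invention are ideal for biosensing and sequencing of biomolecules, such as disease-associated biomarkers and polynucleotide and polypeptide sequences. With improved membrane-insertion capability, the modified protein channels can insert efficiently into lipid or polymer membranes and act as nanopores for biosensing and sequencing. Through conjugation with various probes, the modified protein channels are able to detect specific disease-associated biomarkers with high sensitivity and specificity. The pores of the invention may be present in homo-oligomeric or hetero-oligomeric pores.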
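Representative Embodiment: Gp-9 Tail Protein from phi29
The crystal structure of the phi29 bacteriophage tail (gp9) shows that six gp9 subunits form a hexameric, cylindrical tube structure. Inside the structure, before DNA ejection is triggered, the distal end is blocked by six flexible hydrophobic loops. To deliver the genomic dsDNA into the cytoplasm of the host cell, the phi29 tail needs to penetrate the cell membrane. The tube is about 12.5 nm long, with an inner diameter of about 4 nm and an outer diameter of about 9 nm. The tube wall consists mainly of β-sheets with a thickness of about 2.5 nm.
A clone of full-length gp9 and a series of mutants were constructed (Figures 2 to 6), such as gp9Δloop[417-491], in which the disordered region (residues 417-491) is deleted. According to the crystal structure, gp9Δ417-491 is also a cylindrical, tubular homohexamer. Study of the tail protein structure shows that amino acids 130-170, 300-325, 350-390 and 530-595 lie on the surface of the middle of the channel, where they can interact with the hydrophobic layer of a membrane. Therefore, attaching hydrophobic groups at these sites, or mutating these amino acids to hydrophobic amino acids, can change the membrane-insertion capability of the channel. The hydrophobic amino acids include glycine (Gly), alanine (Ala), valine (Val), leucine (Leu), isoleucine (Ile), proline (Pro), phenylalanine (Phe), methionine (Met) and tryptophan (Trp). Specifically, mutation at one or any combination of the following sites is key: K134I, D138L, D139L, D158L, E163V, E309V, D311V, K321V, K356A, K358A, D377A, D381V, N388L, R524I, R539A, E595V. To conjugate hydrophobic groups to the middle of the channel, it is important to mutate the following sites to cysteine: E595C, K321C and K358C. In addition, amino acids 250-300 and 20-50 lie on the surfaces of the upper and lower parts of the channel, where they can interact with the hydrophilic environment.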
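The point-mutation notation used here (for example K134I: lysine at position 134 replaced by isoleucine) can be applied programmatically, which also gives a check that the stated wild-type residue really sits at the stated position. The sketch below is illustrative only; the helper names `to_one_letter` and `apply_mutations` are hypothetical. The fragment is residues 129-144 of SEQ ID NO: 3 as written in the sequence listing.

```python
import re
from typing import Iterable

THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter: str) -> str:
    """Convert space-separated three-letter residue codes to one-letter codes."""
    return "".join(THREE_TO_ONE[code] for code in three_letter.split())


def apply_mutations(fragment: str, first_residue: int,
                    mutations: Iterable[str]) -> str:
    """Apply mutations such as 'K134I' to a fragment whose first residue
    has the 1-based sequence number first_residue."""
    residues = list(fragment)
    for mutation in mutations:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
        if match is None:
            raise ValueError(f"cannot parse mutation {mutation!r}")
        wild_type, replacement = match.group(1), match.group(3)
        index = int(match.group(2)) - first_residue
        if not 0 <= index < len(residues):
            raise ValueError(f"{mutation} lies outside the fragment")
        if residues[index] != wild_type:
            raise ValueError(f"{mutation}: found {residues[index]} instead")
        residues[index] = replacement
    return "".join(residues)


# Residues 129-144 of SEQ ID NO: 3, copied from the sequence listing.
wild_type = to_one_letter(
    "Val Arg Glu His Val Lys Leu Trp Asn Asp Asp Gly Thr Pro Thr Ile")
mutant = apply_mutations(wild_type, 129, ["K134I", "D138L", "D139L"])
print(wild_type)  # VREHVKLWNDDGTPTI
print(mutant)     # VREHVILWNLLGTPTI
```

The mutated fragment agrees with residues 129-144 of SEQ ID NO: 6 in the sequence listing, which reads Val Arg Glu His Val Ile Leu Trp Asn Leu Leu Gly Thr Pro Thr Ile.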
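Viral Portal Proteins as Nanopores
Portal proteins are found not only in bacteriophages phi29, T3, T4, T5, T7, SPP1 and P22, but also in other viral systems, such as adenovirus and herpesvirus. The portal channel, known as the connector, is a pore-like protein structure with a central channel that serves as the path by which genomic DNA enters the viral capsid during packaging and exits during infection. Although structural studies indicate significant differences in sequence homology and size among the connector proteins of different viruses, they are all topologically similar, with a truncated-cone shape. The stoichiometry of connectors derived from overexpressed protein often varies with the expression conditions. If these protein channels of the invention can be inserted directly into robust polymer membranes, they may be ideal for biosensing and sequencing applications. Because the structures of connectors from different viral portal proteins show similar features, the principles outlined above apply, by extension, to all of them.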
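Representative Embodiment: P22 Portal Channel
P22 is a tailed bacteriophage that assembles an empty precursor capsid, which is subsequently packaged with viral DNA by a powerful packaging motor. The P22 portal protein forms a channel-like structure for bidirectional passage of viral DNA. The Podoviridae family of short-tailed dsDNA bacteriophages includes members of the P22-like subgroup, such as Sf6, CUS-3, ε34 and APSE-1. The portal barrel structure is highly dynamic and prone to proteolysis in solution. The P22 portal protein consists of 12 identical subunits arranged symmetrically around a central channel. The overall height is about 30 nm, with a funnel-shaped core about 17 nm in diameter connected to an α-helical tube about 20 nm long. Across the structure, the average inner diameter of the channel varies between 3.5 nm and 7.5 nm.
A series of mutants was constructed (Figures 7 to 11), with the following defining features: (1) The portal core (residues 1-602) is topologically similar to other viral protein channels, but the presence of the helical barrel tube is unique to P22. The connection between the portal core and the helical barrel can be readily cleaved by chymotrypsin in solution, which indicates that the two domains are flexible in nature. The invention includes removing the barrel residues 603-725 and replacing them with any peptide or nucleic acid sequence as a separate recognition domain. (2) The invention includes alteration (deletion, truncation, mutation) of the internal flexible loop residues 464-492 to alter electrophysiological properties and/or detection capability. (3) The invention includes the use of EDTA (60 mM or higher) to assemble the dodecameric complex: chelation of the divalent cations nonspecifically trapped at the monomer-monomer interfaces is essential for correct assembly of the dodecameric ring. (4) The invention includes changing the overall electronegativity of the channel interior by altering the five rings of residue Glu70 (which clusters with Glu423, Glu414, Glu406, Glu393 and Glu396). (5) The invention includes altering any of the hydrophobic amino acids that form a band beneath the wing domain (Phe24, Ile25, Leu28, Phe60, Phe128, Pro129 and Pro132) to change the hydrophobicity of the surface. (6) The invention includes adding several amino acids (any natural or unnatural amino acids) at the termini, for use as anchor points (such as cysteine, lysine or arginine) for adding functionality, or for altering electrophysiological properties for membrane insertion and channel stability (hydrophilic or hydrophobic tags). (7) The invention includes mutagenesis of the hydrophobic and hydrophilic layers; middle: amino acids 250 to 300, 10-45; upper: 450-500; lower: 350-380; mutation of Arg476 or the C-terminus to Cys for conjugation; middle Thr240, Val244, Arg273 for Cys mutagenesis to conjugate cholesterol.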
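Representative Embodiment: T4 Portal Channel
The T4 portal exists mainly as a 12-mer ring (with some 11-mers or 13-mers, depending on the protein expression conditions) that is 14 nm long and 7 nm wide, with an internal channel diameter of 3 nm. Because the channel is assembled from 12 subunits, altering one or more residues in one monomer triggers an effect throughout the channel, with the mutations lying in the same plane of the molecule. The invention includes (Figures 12 to 14 and Figure 16): (1) changing the overall electronegativity of the channel interior by altering the rings of charged residues; (2) altering the two basic residues R338 and K342 at the entrance of the internal channel (using any amino acid to change hydrophilicity); (3) adding several amino acids (any natural or unnatural amino acids) at the termini, for use as anchor points (such as cysteine, lysine or arginine) for adding functionality, or for altering electrophysiological properties for membrane insertion and channel stability (hydrophilic or hydrophobic tags); and (5) altering (deletion, truncation, mutation) the internal flexible loop residues 374-398 to alter electrophysiological properties and/or detection capability.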
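Example 4
Composition of Membranes Housing the Nanopore
Planar bilayer lipid membranes (BLM) or polymer membranes were generated in (a) a BCH-1A horizontal BLM cell (Eastern Scientific) or (b) a home-made custom chamber. A Teflon partition with a 100 or 200 µm aperture was placed in the apparatus to divide the BLM cell into cis (top) and trans (bottom) compartments. The home-made chamber has a pre-drilled 100 or 200 µm aperture that separates the cis compartment from the trans compartment.
Lipid membranes: planar lipid bilayers of varying composition are formed by pre-coating the aperture with lipid in hexane (concentration: 0.5 mg/mL), followed by coating with lipid in n-decane (concentration: 20-30 mg/mL). Examples of lipid compositions include: (i) zwitterionic lipids, such as 100% DPhPC or DOPC or POPC; (ii) 0%-50% anionic lipids, such as DPhPG/DOPG/POPG or DPhPS/DOPS/POPS, mixed proportionally with composition (i) (final proportions totaling 100%); (iii) 0%-25% cholesterol, mixed proportionally with compositions (i) and (ii). The exact lipid composition depends on the properties of the protein. Typical lipid membrane compositions used include: 100% DPhPC; 100% DPhPC; 30% DPhPS; 70% DPhPC:28% DPhPG:2% cholesterol.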
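Polymer membranes: planar polymer membranes of varying composition are formed by manual painting with membrane material suspended in an organic solvent, such as decane or silicone oil (based on polyphenyl-methylsiloxane or on polydimethylsiloxane, viscosity 20 mPa.s). The membrane compositions are polyoxazoline-based triblock copolymers (Figure 17), such as:
(i) PEOXA-PEO-PEOXA [poly(2-ethyloxazoline)-b-poly(ethylene oxide)-b-poly(2-ethyloxazoline)];
(ii) PMOXA-PDMS-PMOXA [poly(2-methyloxazoline)-b-poly(dimethylsiloxane)-b-poly(2-methyloxazoline)] (with ethyl-benzyl, propyl, or propyl-ethoxy linkages between the blocks);
(iii) PMOXA-PB-PMOXA [poly(2-methyloxazoline)-b-poly(1,4-butadiene)-b-poly(2-methyloxazoline)];
(iv) PMOXA-PE-PMOXA [poly(2-methyloxazoline)-b-poly(ethylene)-b-poly(2-methyloxazoline)];
(v) PMOXA-PEO-PMOXA [poly(2-methyloxazoline)-b-poly(ethylene oxide)-b-poly(2-methyloxazoline)].
Typical membranes used include PMOXA6-PDMS35-PMOXA6; PMOXA6-PDMS65-PMOXA6; PMOXA11-PDMS65-PMOXA11; PMOXA5-PDMS13-PMOXA5; and PMOXA3-PDMS38-PMOXA3 (Figure 17). The block lengths (denoted by the subscripts X and Y in PMOXAX-PDMSY-PMOXAX), which determine the length of the hydrophilic or hydrophobic blocks, can be tuned according to the following factors: (1) a certain membrane thickness is required for stable insertion of the protein pore; (2) the membrane needs to maintain very low permeability; (3) the membrane must be mechanically and chemically stable over extended periods under different solution conditions, including extreme pH (1-12) and high- or low-salt environments.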
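One rough way to compare the listed triblock copolymers is by the mass fraction of their hydrophilic PMOXA blocks, computed from the block lengths in the name. The sketch below is an illustration only, not a method from the patent: the function names are hypothetical, the repeat-unit masses are calculated from the repeat-unit formulas (C4H7NO for 2-methyl-2-oxazoline, C2H6OSi for dimethylsiloxane), and end groups and inter-block linkers are ignored.

```python
import re
from typing import List, Tuple

# Repeat-unit molar masses in g/mol, calculated from the repeat-unit formulas.
REPEAT_UNIT_MASS = {
    "PMOXA": 85.11,  # 2-methyl-2-oxazoline, C4H7NO
    "PDMS": 74.15,   # dimethylsiloxane, C2H6OSi
}
HYDROPHILIC_BLOCKS = {"PMOXA"}


def parse_blocks(name: str) -> List[Tuple[str, int]]:
    """Split a name such as 'PMOXA6-PDMS35-PMOXA6' into (polymer, length) pairs."""
    blocks = []
    for part in name.split("-"):
        match = re.fullmatch(r"([A-Z]+)(\d+)", part)
        if match is None:
            raise ValueError(f"cannot parse block {part!r}")
        blocks.append((match.group(1), int(match.group(2))))
    return blocks


def hydrophilic_mass_fraction(name: str) -> float:
    """Mass fraction contributed by hydrophilic blocks (end groups ignored)."""
    blocks = parse_blocks(name)
    total = sum(REPEAT_UNIT_MASS[p] * n for p, n in blocks)
    hydrophilic = sum(REPEAT_UNIT_MASS[p] * n
                      for p, n in blocks if p in HYDROPHILIC_BLOCKS)
    return hydrophilic / total


for polymer in ["PMOXA6-PDMS35-PMOXA6", "PMOXA11-PDMS65-PMOXA11",
                "PMOXA3-PDMS38-PMOXA3"]:
    print(polymer, round(hydrophilic_mass_fraction(polymer), 3))
```

A ratio like this says nothing about membrane thickness or permeability on its own; it is only a convenient number for ordering candidate polymers before testing them experimentally.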
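Example 5
Insertion of Protein Channels into Membranes
Typically, the protein channel (tail protein or portal channel), suspended in its respective storage buffer, is diluted 50- to 100-fold in conducting buffer (typically 1 M KCl or 1 M NaCl, 5 mM HEPES or Tris, pH 7.6) and added to the top compartment of the BLM cell. Under an applied potential (constant holding voltage or ramping voltage), direct insertion of the protein channel into the planar membrane can be observed (Figures 6, 11, 14, 16 and 19).
If desired, the membrane compositions described can also be used to produce vesicular polymersome structures of varying polydispersity in which to reconstitute the protein channels. For high insertion efficiency, varying amounts of glycerol, CsCl and/or sucrose can be encapsulated within the polymersomes. The resulting protein-polymersomes can then be fused with a planar bilayer of the same composition in a salt- and voltage-dependent manner.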
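Example 6
Conjugation of Hydrophobic Moieties to Non-Membrane Protein Channels via Reaction with Cysteine Residues
The T4 gp20 portal protein is used here as an example to illustrate how a hydrophobic moiety is conjugated to a non-membrane channel via reaction with cysteine residues. Each subunit of the protein channel contains three cysteines, which are located in the middle layer of the channel and are environmentally accessible (Figure 13). 40 µL of 20 µM protein solution is prepared in pH 6.8 buffer containing 0.5 M NaCl, 50 mM Tris, and 15% glycerol. The solution is degassed and a 100-fold excess of TCEP is added. After incubation at room temperature for 20 min, 4 µL of 4 mM cholesterol-PEG-maleimide is added dropwise to the protein solution, and the reaction mixture is incubated at room temperature in the dark for 2 hours. Excess cholesterol-PEG-maleimide is removed with a NanoSep 100K spin column. Labeling of the protein is checked by 12% SDS-PAGE (Figure 13).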
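The volumes and concentrations in this protocol imply a particular molar excess of maleimide reagent over protein, which the short calculation below makes explicit. It is a worked-arithmetic sketch under stated assumptions: the helper names are hypothetical, and the 20 µM figure is assumed to be the subunit concentration, which the text does not specify.

```python
def amount_pmol(volume_ul: float, concentration_um: float) -> float:
    """Amount of substance in pmol (1 uL x 1 uM = 1 pmol)."""
    return volume_ul * concentration_um


def fold_excess(reagent_ul: float, reagent_um: float,
                target_ul: float, target_um: float) -> float:
    """Molar ratio of reagent to target."""
    return amount_pmol(reagent_ul, reagent_um) / amount_pmol(target_ul, target_um)


# Values from the protocol: 40 uL of 20 uM protein; 4 uL of 4 mM (4000 uM)
# cholesterol-PEG-maleimide.
protein_pmol = amount_pmol(40, 20)                 # 800 pmol
maleimide_excess = fold_excess(4, 4000, 40, 20)    # 20-fold over protein

# Assumption: 20 uM is the subunit concentration; three cysteines per subunit.
excess_per_cysteine = maleimide_excess / 3

# A 100-fold excess of TCEP over protein corresponds to this many pmol.
tcep_pmol = 100 * protein_pmol                     # 80000 pmol = 80 nmol

print(protein_pmol, maleimide_excess, round(excess_per_cysteine, 2), tcep_pmol)
```

If the 20 µM figure instead refers to assembled channels, the excess per subunit and per cysteine would be proportionally lower.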
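Example 7
Insertion of Non-Membrane Protein Channels Carrying Hydrophobic Moieties into Lipid or Polymer Membranes
A planar polymer membrane was generated. Under an applied voltage (constant holding voltage or ramping voltage), direct insertion of the non-membrane protein channel carrying the hydrophobic moiety was observed after the protein channel was added to the cis chamber (Figure 14). Mutation to hydrophobic amino acids, or conjugation of hydrophobic groups to the middle layer of the phi29 gp-9 tail protein, can significantly enhance the insertion process and its stability.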
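Example 8
Conjugation of Probes to Non-Membrane Protein Channels via Click Reactions
The phi29 gp10 portal protein is used as a representative example.
Strategy 1, via tetrazine-alkene ligation: 40 µL of 20 µM protein solution is prepared in pH 6.8 buffer containing 0.5 M NaCl, 50 mM Tris, and 15% glycerol. The solution is degassed and a 100-fold excess of TCEP is added. After 20 min of incubation, 1.6 µL of 10 mM methyltetrazine-PEG4-maleimide is added dropwise to the protein solution, and the mixture is incubated at room temperature in the dark for 1 hour. Excess methyltetrazine-PEG4-maleimide is removed by desalting on a desalting spin column. 50 µL of 30 µM thiol-modified miRNA probe solution is prepared in pH 6.8 buffer containing 0.5 M NaCl and 50 mM Tris. The solution is degassed and a 100-fold excess of TCEP is added. After 20 min of incubation, 6 µL of 25 mM TCO-PEG3-maleimide is added dropwise to the oligomer solution, and the mixture is incubated at room temperature for 2 hours. Excess TCO-PEG3-maleimide is removed with a desalting column. Labeling of the miRNA probe is verified on a 20% urea-PAGE gel. To conjugate the miRNA probe to the protein, the TCO-modified miRNA probe is mixed with the methyltetrazine-labeled protein at different molar ratios to achieve optimal protein labeling efficiency. The mixture is incubated at room temperature for 1 hour. Conjugation is verified by 12% SDS-PAGE.
Strategy 2, via azide-alkyne click chemistry: 40 µL of 20 µM protein solution is prepared in pH 6.8 buffer containing 0.5 M NaCl, 50 mM Tris, and 15% glycerol. The solution is degassed and a 100-fold excess of TCEP is added. After 20 min of incubation, 1.6 µL of 10 mM sulfo-DBCO-PEG4-maleimide is added dropwise to the protein solution, and the mixture is incubated at room temperature in the dark for 1 hour. Excess maleimide reagent is removed by desalting on a desalting spin column. The DBCO-modified protein is mixed with the azide-modified miRNA probe at equimolar concentration, and the reaction mixture is incubated at room temperature for 2 hours. Conjugation is verified by 12% SDS-PAGE.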
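Example 9
Validation of DNA Probes Conjugated to Protein
The protein-miRNA probe conjugate was incubated with the target miRNA oligomer at equimolar concentration at room temperature for 30 min. Binding of the protein-miRNA probe to the target miRNA was verified by 12% SDS-PAGE (Figure 15).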
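Example 10
PSA Detection Using Engineered Non-Membrane Protein Channels
To conjugate protein probes to the nanopore channel, various methods have been tried and tested, including reaction of cysteine residues with methyltetrazine-PEG4-maleimide followed by reaction with a TCO-labeled probe, and the SpyCatcher/SpyTag protein conjugation system. Shown here is how a single-chain antibody against PSA is conjugated to the phi29 gp10 portal protein channel via SpyCatcher/SpyTag. The phi29 gp10 protein channel with a C-terminal SpyTag peptide was constructed and purified. The single-chain antibody against PSA with a C-terminal SpyCatcher was constructed and purified. The assembled protein channel-PSA single-chain antibody complex was verified by gel (Figure 18). The purified assembled complex was then inserted into a polymer membrane to test its ability to bind PSA antigen. The procedure for setting up the electrophysiology experiment was described previously. After insertion, a series of different concentrations of PSA antigen was added to the chamber, and distinct binding events were observed (Figure 19).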
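Example 11
MicroRNA Detection Using Engineered Non-Membrane Protein Channels
To conjugate nucleic acid probes to the nanopore channel, various methods have been tried and tested, as outlined in Example 6. It has been demonstrated that a miR-21 probe can be conjugated to the T4 gp20 portal protein channel. The purified conjugated complex was inserted into a polymer membrane to test its ability to bind the corresponding microRNA. The procedure for setting up the electrophysiology experiment was described previously. After insertion, a series of different concentrations of microRNA was added to the chamber, and distinct binding events were observed (Figure 16).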
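A concentration series of this kind is what a calibration curve of capture rate against analyte concentration is built from. The sketch below fits a straight line by ordinary least squares and inverts it to estimate an unknown concentration. It assumes a linear relationship, which the patent does not state, and every number in it is invented for illustration; none is a measured result.

```python
from typing import Sequence, Tuple


def fit_line(x: Sequence[float], y: Sequence[float]) -> Tuple[float, float]:
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def concentration_from_rate(rate: float, slope: float, intercept: float) -> float:
    """Invert the calibration line to estimate concentration."""
    return (rate - intercept) / slope


# Hypothetical calibration points: concentration in nM, capture rate in events/s.
concentration_nm = [1.0, 2.0, 4.0, 8.0]
capture_rate = [0.5, 1.0, 2.0, 4.0]

slope, intercept = fit_line(concentration_nm, capture_rate)
estimate_nm = concentration_from_rate(1.5, slope, intercept)
print(round(slope, 3), round(intercept, 3), round(estimate_nm, 3))
```

In practice the usable range of such a curve is limited at the low end by the background event rate and at the high end by saturation of the probe, so the fit should only be inverted within the calibrated range.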
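Other objects, features and advantages of the invention will become apparent from the following detailed description. It should be understood, however, that the detailed description and the examples, while indicating specific embodiments of the invention, are given by way of illustration only. In addition, it is contemplated that changes and modifications within the spirit and scope of the invention will become apparent to those skilled in the art from this detailed description.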
Sequence Listing
<110> Oxford Nanopore Technologies
Haque, Farzin
Wang, Shaoying
<120> Nanopore assemblies and uses thereof
<130> 183298.00002
<150> US 62/629,604
<151> 2018-02-12
<160> 93
<170> PatentIn version 3.5
<210> 1
<211> 725
<212> PRT
<213> Bacteriophage P22
<400> 1
Met Ala Asp Asn Glu Asn Arg Leu Glu Ser Ile Leu Ser Arg Phe Asp
1 5 10 15
Ala Asp Trp Thr Ala Ser Asp Glu Ala Arg Arg Glu Ala Lys Asn Asp
20 25 30
Leu Phe Phe Ser Arg Val Ser Gln Trp Asp Asp Trp Leu Ser Gln Tyr
35 40 45
Thr Thr Leu Gln Tyr Arg Gly Gln Phe Asp Val Val Arg Pro Val Val
50 55 60
Arg Lys Leu Val Ser Glu Met Arg Gln Asn Pro Ile Asp Val Leu Tyr
65 70 75 80
Arg Pro Lys Asp Gly Ala Arg Pro Asp Ala Ala Asp Val Leu Met Gly
85 90 95
Met Tyr Arg Thr Asp Met Arg His Asn Thr Ala Lys Ile Ala Val Asn
100 105 110
Ile Ala Val Arg Glu Gln Ile Glu Ala Gly Val Gly Ala Trp Arg Leu
115 120 125
Val Thr Asp Tyr Glu Asp Gln Ser Pro Thr Ser Asn Asn Gln Val Ile
130 135 140
Arg Arg Glu Pro Ile His Ser Ala Cys Ser His Val Ile Trp Asp Ser
145 150 155 160
Asn Ser Lys Leu Met Asp Lys Ser Asp Ala Arg His Cys Thr Val Ile
165 170 175
His Ser Met Ser Gln Asn Gly Trp Glu Asp Phe Ala Glu Lys Tyr Asp
180 185 190
Leu Asp Ala Asp Asp Ile Pro Ser Phe Gln Asn Pro Asn Asp Trp Val
195 200 205
Phe Pro Trp Leu Thr Gln Asp Thr Ile Gln Ile Ala Glu Phe Tyr Glu
210 215 220
Val Val Glu Lys Lys Glu Thr Ala Phe Ile Tyr Gln Asp Pro Val Thr
225 230 235 240
Gly Glu Pro Val Ser Tyr Phe Lys Arg Asp Ile Lys Asp Val Ile Asp
245 250 255
Asp Leu Ala Asp Ser Gly Phe Ile Lys Ile Ala Glu Arg Gln Ile Lys
260 265 270
Arg Arg Arg Val Tyr Lys Ser Ile Ile Thr Cys Thr Ala Val Leu Lys
275 280 285
Asp Lys Gln Leu Ile Ala Gly Glu His Ile Pro Ile Val Pro Val Phe
290 295 300
Gly Glu Trp Gly Phe Val Glu Asp Lys Glu Val Tyr Glu Gly Val Val
305 310 315 320
Arg Leu Thr Lys Asp Gly Gln Arg Leu Arg Asn Met Ile Met Ser Phe
325 330 335
Asn Ala Asp Ile Val Ala Arg Thr Pro Lys Lys Lys Pro Phe Phe Trp
340 345 350
Pro Glu Gln Ile Ala Gly Phe Glu His Met Tyr Asp Gly Asn Asp Asp
355 360 365
Tyr Pro Tyr Tyr Leu Leu Asn Arg Thr Asp Glu Asn Ser Gly Asp Leu
370 375 380
Pro Thr Gln Pro Leu Ala Tyr Tyr Glu Asn Pro Glu Val Pro Gln Ala
385 390 395 400
Asn Ala Tyr Met Leu Glu Ala Ala Thr Ser Ala Val Lys Glu Val Ala
405 410 415
Thr Leu Gly Val Asp Thr Glu Ala Val Asn Gly Gly Gln Val Ala Phe
420 425 430
Asp Thr Val Asn Gln Leu Asn Met Arg Ala Asp Leu Glu Thr Tyr Val
435 440 445
Phe Gln Asp Asn Leu Ala Thr Ala Met Arg Arg Asp Gly Glu Ile Tyr
450 455 460
Gln Ser Ile Val Asn Asp Ile Tyr Asp Val Pro Arg Asn Val Thr Ile
465 470 475 480
Thr Leu Glu Asp Gly Ser Glu Lys Asp Val Gln Leu Met Ala Glu Val
485 490 495
Val Asp Leu Ala Thr Gly Glu Lys Gln Val Leu Asn Asp Ile Arg Gly
500 505 510
Arg Tyr Glu Cys Tyr Thr Asp Val Gly Pro Ser Phe Gln Ser Met Lys
515 520 525
Gln Gln Asn Arg Ala Glu Ile Leu Glu Leu Leu Gly Lys Thr Pro Gln
530 535 540
Gly Thr Pro Glu Tyr Gln Leu Leu Leu Leu Gln Tyr Phe Thr Leu Leu
545 550 555 560
Asp Gly Lys Gly Val Glu Met Met Arg Asp Tyr Ala Asn Lys Gln Leu
565 570 575
Ile Gln Met Gly Val Lys Lys Pro Glu Thr Pro Glu Glu Gln Gln Trp
580 585 590
Leu Val Glu Ala Gln Gln Ala Lys Gln Gly Gln Gln Asp Pro Ala Met
595 600 605
Val Gln Ala Gln Gly Val Leu Leu Gln Gly Gln Ala Glu Leu Ala Lys
610 615 620
Ala Gln Asn Gln Thr Leu Ser Leu Gln Ile Asp Ala Ala Lys Val Glu
625 630 635 640
Ala Gln Asn Gln Leu Asn Ala Ala Arg Ile Ala Glu Ile Phe Asn Asn
645 650 655
Met Asp Leu Ser Lys Gln Ser Glu Phe Arg Glu Phe Leu Lys Thr Val
660 665 670
Ala Ser Phe Gln Gln Asp Arg Ser Glu Asp Ala Arg Ala Asn Ala Glu
675 680 685
Leu Leu Leu Lys Gly Asp Glu Gln Thr His Lys Gln Arg Met Asp Ile
690 695 700
Ala Asn Ile Leu Gln Ser Gln Arg Gln Asn Gln Pro Ser Gly Ser Val
705 710 715 720
Ala Glu Thr Pro Gln
725
<210> 2
<211> 601
<212> PRT
<213> Bacteriophage P22
<400> 2
Met Ala Asp Asn Glu Asn Arg Leu Glu Ser Ile Leu Ser Arg Phe Asp
1 5 10 15
Ala Asp Trp Thr Ala Ser Asp Glu Ala Arg Arg Glu Ala Lys Asn Asp
20 25 30
Leu Phe Phe Ser Arg Val Ser Gln Trp Asp Asp Trp Leu Ser Gln Tyr
35 40 45
Thr Thr Leu Gln Tyr Arg Gly Gln Phe Asp Val Val Arg Pro Val Val
50 55 60
Arg Lys Leu Val Ser Glu Met Arg Gln Asn Pro Ile Asp Val Leu Tyr
65 70 75 80
Arg Pro Lys Asp Gly Ala Arg Pro Asp Ala Ala Asp Val Leu Met Gly
85 90 95
Met Tyr Arg Thr Asp Met Arg His Asn Thr Ala Lys Ile Ala Val Asn
100 105 110
Ile Ala Val Arg Glu Gln Ile Glu Ala Gly Val Gly Ala Trp Arg Leu
115 120 125
Val Thr Asp Tyr Glu Asp Gln Ser Pro Thr Ser Asn Asn Gln Val Ile
130 135 140
Arg Arg Glu Pro Ile His Ser Ala Cys Ser His Val Ile Trp Asp Ser
145 150 155 160
Asn Ser Lys Leu Met Asp Lys Ser Asp Ala Arg His Cys Thr Val Ile
165 170 175
His Ser Met Ser Gln Asn Gly Trp Glu Asp Phe Ala Glu Lys Tyr Asp
180 185 190
Leu Asp Ala Asp Asp Ile Pro Ser Phe Gln Asn Pro Asn Asp Trp Val
195 200 205
Phe Pro Trp Leu Thr Gln Asp Thr Ile Gln Ile Ala Glu Phe Tyr Glu
210 215 220
Val Val Glu Lys Lys Glu Thr Ala Phe Ile Tyr Gln Asp Pro Val Thr
225 230 235 240
Gly Glu Pro Val Ser Tyr Phe Lys Arg Asp Ile Lys Asp Val Ile Asp
245 250 255
Asp Leu Ala Asp Ser Gly Phe Ile Lys Ile Ala Glu Arg Gln Ile Lys
260 265 270
Arg Arg Arg Val Tyr Lys Ser Ile Ile Thr Cys Thr Ala Val Leu Lys
275 280 285
Asp Lys Gln Leu Ile Ala Gly Glu His Ile Pro Ile Val Pro Val Phe
290 295 300
Gly Glu Trp Gly Phe Val Glu Asp Lys Glu Val Tyr Glu Gly Val Val
305 310 315 320
Arg Leu Thr Lys Asp Gly Gln Arg Leu Arg Asn Met Ile Met Ser Phe
325 330 335
Asn Ala Asp Ile Val Ala Arg Thr Pro Lys Lys Lys Pro Phe Phe Trp
340 345 350
Pro Glu Gln Ile Ala Gly Phe Glu His Met Tyr Asp Gly Asn Asp Asp
355 360 365
Tyr Pro Tyr Tyr Leu Leu Asn Arg Thr Asp Glu Asn Ser Gly Asp Leu
370 375 380
Pro Thr Gln Pro Leu Ala Tyr Tyr Glu Asn Pro Glu Val Pro Gln Ala
385 390 395 400
Asn Ala Tyr Met Leu Glu Ala Ala Thr Ser Ala Val Lys Glu Val Ala
405 410 415
Thr Leu Gly Val Asp Thr Glu Ala Val Asn Gly Gly Gln Val Ala Phe
420 425 430
Asp Thr Val Asn Gln Leu Asn Met Arg Ala Asp Leu Glu Thr Tyr Val
435 440 445
Phe Gln Asp Asn Leu Ala Thr Ala Met Arg Arg Asp Gly Glu Ile Tyr
450 455 460
Gln Ser Ile Val Asn Asp Ile Tyr Asp Val Pro Arg Asn Val Thr Ile
465 470 475 480
Thr Leu Glu Asp Gly Ser Glu Lys Asp Val Gln Leu Met Ala Glu Val
485 490 495
Val Asp Leu Ala Thr Gly Glu Lys Gln Val Leu Asn Asp Ile Arg Gly
500 505 510
Arg Tyr Glu Cys Tyr Thr Asp Val Gly Pro Ser Phe Gln Ser Met Lys
515 520 525
Gln Gln Asn Arg Ala Glu Ile Leu Glu Leu Leu Gly Lys Thr Pro Gln
530 535 540
Gly Thr Pro Glu Tyr Gln Leu Leu Leu Leu Gln Tyr Phe Thr Leu Leu
545 550 555 560
Asp Gly Lys Gly Val Glu Met Met Arg Asp Tyr Ala Asn Lys Gln Leu
565 570 575
Ile Gln Met Gly Val Lys Lys Pro Glu Thr Pro Glu Glu Gln Gln Trp
580 585 590
Leu Val Glu Ala Gln Gln Ala Lys Gln
595 600
<210> 3
<211> 599
<212> PRT
<213> Bacteriophage phi-29
<400> 3
Met Ala Tyr Val Pro Leu Ser Gly Thr Asn Val Arg Ile Leu Ala Asp
1 5 10 15
Val Pro Phe Ser Asn Asp Tyr Lys Asn Thr Arg Trp Phe Thr Ser Ser
20 25 30
Ser Asn Gln Tyr Asn Trp Phe Asn Ser Lys Ser Arg Val Tyr Glu Met
35 40 45
Ser Lys Val Thr Phe Met Gly Phe Arg Glu Asn Lys Pro Tyr Val Ser
50 55 60
Val Ser Leu Pro Ile Asp Lys Leu Tyr Ser Ala Ser Tyr Ile Met Phe
65 70 75 80
Gln Asn Ala Asp Tyr Gly Asn Lys Trp Phe Tyr Ala Phe Val Thr Glu
85 90 95
Leu Glu Phe Lys Asn Ser Ala Val Thr Tyr Val His Phe Glu Ile Asp
100 105 110
Val Leu Gln Thr Trp Met Phe Asp Ile Lys Phe Gln Glu Ser Phe Ile
115 120 125
Val Arg Glu His Val Lys Leu Trp Asn Asp Asp Gly Thr Pro Thr Ile
130 135 140
Asn Thr Ile Asp Glu Gly Leu Ser Tyr Gly Ser Glu Tyr Asp Ile Val
145 150 155 160
Ser Val Glu Asn His Lys Pro Tyr Asp Asp Met Met Phe Leu Val Ile
165 170 175
Ile Ser Lys Ser Ile Met His Gly Thr Pro Gly Glu Glu Glu Ser Arg
180 185 190
Leu Asn Asp Ile Asn Ala Ser Leu Asn Gly Met Pro Gln Pro Leu Cys
195 200 205
Tyr Tyr Ile His Pro Phe Tyr Lys Asp Gly Lys Val Pro Lys Thr Tyr
210 215 220
Ile Gly Asp Asn Asn Ala Asn Leu Ser Pro Ile Val Asn Met Leu Thr
225 230 235 240
Asn Ile Phe Ser Gln Lys Ser Ala Val Asn Asp Ile Val Asn Met Tyr
245 250 255
Val Thr Asp Tyr Ile Gly Leu Lys Leu Asp Tyr Lys Asn Gly Asp Lys
260 265 270
Glu Leu Lys Leu Asp Lys Asp Met Phe Glu Gln Ala Gly Ile Ala Asp
275 280 285
Asp Lys His Gly Asn Val Asp Thr Ile Phe Val Lys Lys Ile Pro Asp
290 295 300
Tyr Glu Ala Leu Glu Ile Asp Thr Gly Asp Lys Trp Gly Gly Phe Thr
305 310 315 320
Lys Asp Gln Glu Ser Lys Leu Met Met Tyr Pro Tyr Cys Val Thr Glu
325 330 335
Ile Thr Asp Phe Lys Gly Asn His Met Asn Leu Lys Thr Glu Tyr Ile
340 345 350
Asn Asn Ser Lys Leu Lys Ile Gln Val Arg Gly Ser Leu Gly Val Ser
355 360 365
Asn Lys Val Ala Tyr Ser Val Gln Asp Tyr Asn Ala Asp Ser Ala Leu
370 375 380
Ser Gly Gly Asn Arg Leu Thr Ala Ser Leu Asp Ser Ser Leu Ile Asn
385 390 395 400
Asn Asn Pro Asn Asp Ile Ala Ile Leu Asn Asp Tyr Leu Ser Ala Tyr
405 410 415
Leu Gln Gly Asn Lys Asn Ser Leu Glu Asn Gln Lys Ser Ser Ile Leu
420 425 430
Phe Asn Gly Ile Met Gly Met Ile Gly Gly Gly Ile Ser Ala Gly Ala
435 440 445
Ser Ala Ala Gly Gly Ser Ala Leu Gly Met Ala Ser Ser Val Thr Gly
450 455 460
Met Thr Ser Thr Ala Gly Asn Ala Val Leu Gln Met Gln Ala Met Gln
465 470 475 480
Ala Lys Gln Ala Asp Ile Ala Asn Ile Pro Pro Gln Leu Thr Lys Met
485 490 495
Gly Gly Asn Thr Ala Phe Asp Tyr Gly Asn Gly Tyr Arg Gly Val Tyr
500 505 510
Val Ile Lys Lys Gln Leu Lys Ala Glu Tyr Arg Arg Ser Leu Ser Ser
515 520 525
Phe Phe His Lys Tyr Gly Tyr Lys Ile Asn Arg Val Lys Lys Pro Asn
530 535 540
Leu Arg Thr Arg Lys Ala Phe Asn Tyr Val Gln Thr Lys Asp Cys Phe
545 550 555 560
Ile Ser Gly Asp Ile Asn Asn Asn Asp Leu Gln Glu Ile Arg Thr Ile
565 570 575
Phe Asp Asn Gly Ile Thr Leu Trp His Thr Asp Asn Ile Gly Asn Tyr
580 585 590
Ser Val Glu Asn Glu Leu Arg
595
<210> 4
<211> 524
<212> PRT
<213> Bacteriophage phi-29
<400> 4
Met Ala Tyr Val Pro Leu Ser Gly Thr Asn Val Arg Ile Leu Ala Asp
1 5 10 15
Val Pro Phe Ser Asn Asp Tyr Lys Asn Thr Arg Trp Phe Thr Ser Ser
20 25 30
Ser Asn Gln Tyr Asn Trp Phe Asn Ser Lys Ser Arg Val Tyr Glu Met
35 40 45
Ser Lys Val Thr Phe Met Gly Phe Arg Glu Asn Lys Pro Tyr Val Ser
50 55 60
Val Ser Leu Pro Ile Asp Lys Leu Tyr Ser Ala Ser Tyr Ile Met Phe
65 70 75 80
Gln Asn Ala Asp Tyr Gly Asn Lys Trp Phe Tyr Ala Phe Val Thr Glu
85 90 95
Leu Glu Phe Lys Asn Ser Ala Val Thr Tyr Val His Phe Glu Ile Asp
100 105 110
Val Leu Gln Thr Trp Met Phe Asp Ile Lys Phe Gln Glu Ser Phe Ile
115 120 125
Val Arg Glu His Val Lys Leu Trp Asn Asp Asp Gly Thr Pro Thr Ile
130 135 140
Asn Thr Ile Asp Glu Gly Leu Ser Tyr Gly Ser Glu Tyr Asp Ile Val
145 150 155 160
Ser Val Glu Asn His Lys Pro Tyr Asp Asp Met Met Phe Leu Val Ile
165 170 175
Ile Ser Lys Ser Ile Met His Gly Thr Pro Gly Glu Glu Glu Ser Arg
180 185 190
Leu Asn Asp Ile Asn Ala Ser Leu Asn Gly Met Pro Gln Pro Leu Cys
195 200 205
Tyr Tyr Ile His Pro Phe Tyr Lys Asp Gly Lys Val Pro Lys Thr Tyr
210 215 220
Ile Gly Asp Asn Asn Ala Asn Leu Ser Pro Ile Val Asn Met Leu Thr
225 230 235 240
Asn Ile Phe Ser Gln Lys Ser Ala Val Asn Asp Ile Val Asn Met Tyr
245 250 255
Val Thr Asp Tyr Ile Gly Leu Lys Leu Asp Tyr Lys Asn Gly Asp Lys
260 265 270
Glu Leu Lys Leu Asp Lys Asp Met Phe Glu Gln Ala Gly Ile Ala Asp
275 280 285
Asp Lys His Gly Asn Val Asp Thr Ile Phe Val Lys Lys Ile Pro Asp
290 295 300
Tyr Glu Ala Leu Glu Ile Asp Thr Gly Asp Lys Trp Gly Gly Phe Thr
305 310 315 320
Lys Asp Gln Glu Ser Lys Leu Met Met Tyr Pro Tyr Cys Val Thr Glu
325 330 335
Ile Thr Asp Phe Lys Gly Asn His Met Asn Leu Lys Thr Glu Tyr Ile
340 345 350
Asn Asn Ser Lys Leu Lys Ile Gln Val Arg Gly Ser Leu Gly Val Ser
355 360 365
Asn Lys Val Ala Tyr Ser Val Gln Asp Tyr Asn Ala Asp Ser Ala Leu
370 375 380
Ser Gly Gly Asn Arg Leu Thr Ala Ser Leu Asp Ser Ser Leu Ile Asn
385 390 395 400
Asn Asn Pro Asn Asp Ile Ala Ile Leu Asn Asp Tyr Leu Ser Ala Tyr
405 410 415
Gln Leu Thr Lys Met Gly Gly Asn Thr Ala Phe Asp Tyr Gly Asn Gly
420 425 430
Tyr Arg Gly Val Tyr Val Ile Lys Lys Gln Leu Lys Ala Glu Tyr Arg
435 440 445
Arg Ser Leu Ser Ser Phe Phe His Lys Tyr Gly Tyr Lys Ile Asn Arg
450 455 460
Val Lys Lys Pro Asn Leu Arg Thr Arg Lys Ala Phe Asn Tyr Val Gln
465 470 475 480
Thr Lys Asp Cys Phe Ile Ser Gly Asp Ile Asn Asn Asn Asp Leu Gln
485 490 495
Glu Ile Arg Thr Ile Phe Asp Asn Gly Ile Thr Leu Trp His Thr Asp
500 505 510
Asn Ile Gly Asn Tyr Ser Val Glu Asn Glu Leu Arg
515 520
<210> 5
<211> 524
<212> PRT
<213> Bacteriophage phi-29
<400> 5
Met Ala Tyr Val Pro Leu Ser Gly Thr Asn Val Arg Ile Leu Ala Asp
1 5 10 15
Val Pro Phe Ser Asn Asp Tyr Lys Asn Thr Arg Trp Phe Thr Ser Ser
20 25 30
Ser Asn Gln Tyr Asn Trp Phe Asn Ser Lys Ser Arg Val Tyr Glu Met
35 40 45
Ser Lys Val Thr Phe Met Gly Phe Arg Glu Asn Lys Pro Tyr Val Ser
50 55 60
Val Ser Leu Pro Ile Asp Lys Leu Tyr Ser Ala Ser Tyr Ile Met Phe
65 70 75 80
Gln Asn Ala Asp Tyr Gly Asn Lys Trp Phe Tyr Ala Phe Val Thr Glu
85 90 95
Leu Glu Phe Lys Asn Ser Ala Val Thr Tyr Val His Phe Glu Ile Asp
100 105 110
Val Leu Gln Thr Trp Met Phe Asp Ile Lys Phe Gln Glu Ser Phe Ile
115 120 125
Val Arg Glu His Val Lys Leu Trp Asn Asp Asp Gly Thr Pro Thr Ile
130 135 140
Asn Thr Ile Asp Glu Gly Leu Ser Tyr Gly Ser Glu Tyr Asp Ile Val
145 150 155 160
Ser Val Glu Asn His Lys Pro Tyr Asp Asp Met Met Phe Leu Val Ile
165 170 175
Ile Ser Lys Ser Ile Met His Gly Thr Pro Gly Glu Glu Glu Ser Arg
180 185 190
Leu Asn Asp Ile Asn Ala Ser Leu Asn Gly Met Pro Gln Pro Leu Cys
195 200 205
Tyr Tyr Ile His Pro Phe Tyr Lys Asp Gly Lys Val Pro Lys Thr Tyr
210 215 220
Ile Gly Asp Asn Asn Ala Asn Leu Ser Pro Ile Val Asn Met Leu Thr
225 230 235 240
Asn Ile Phe Ser Gln Lys Ser Ala Val Asn Asp Ile Val Asn Met Tyr
245 250 255
Val Thr Asp Tyr Ile Gly Leu Lys Leu Asp Tyr Lys Asn Gly Asp Lys
260 265 270
Glu Leu Lys Leu Asp Lys Asp Met Phe Glu Gln Ala Gly Ile Ala Asp
275 280 285
Asp Lys His Gly Asn Val Asp Thr Ile Phe Val Lys Lys Ile Pro Asp
290 295 300
Tyr Glu Ala Leu Glu Ile Asp Thr Gly Asp Lys Trp Gly Gly Phe Thr
305 310 315 320
Lys Asp Gln Glu Ser Lys Leu Met Met Tyr Pro Tyr Cys Val Thr Glu
325 330 335
Ile Thr Asp Phe Lys Gly Asn His Met Asn Leu Lys Thr Glu Tyr Ile
340 345 350
Asn Asn Ser Lys Leu Cys Ile Gln Val Arg Gly Ser Leu Gly Val Ser
355 360 365
Asn Lys Val Ala Tyr Ser Val Gln Asp Tyr Asn Ala Asp Ser Ala Leu
370 375 380
Ser Gly Gly Asn Arg Leu Thr Ala Ser Leu Asp Ser Ser Leu Ile Asn
385 390 395 400
Asn Asn Pro Asn Asp Ile Ala Ile Leu Asn Asp Tyr Leu Ser Ala Tyr
405 410 415
Gln Leu Thr Lys Met Gly Gly Asn Thr Ala Phe Asp Tyr Gly Asn Gly
420 425 430
Tyr Arg Gly Val Tyr Val Ile Lys Lys Gln Leu Lys Ala Glu Tyr Arg
435 440 445
Arg Ser Leu Ser Ser Phe Phe His Lys Tyr Gly Tyr Lys Ile Asn Arg
450 455 460
Val Lys Lys Pro Asn Leu Arg Thr Arg Lys Ala Phe Asn Tyr Val Gln
465 470 475 480
Thr Lys Asp Cys Phe Ile Ser Gly Asp Ile Asn Asn Asn Asp Leu Gln
485 490 495
Glu Ile Arg Thr Ile Phe Asp Asn Gly Ile Thr Leu Trp His Thr Asp
500 505 510
Asn Ile Gly Asn Tyr Ser Val Glu Asn Glu Leu Arg
515 520
<210> 6
<211> 524
<212> PRT
<213> Bacteriophage phi-29
<400> 6
Met Ala Tyr Val Pro Leu Ser Gly Thr Asn Val Arg Ile Leu Ala Asp
1 5 10 15
Val Pro Phe Ser Asn Asp Tyr Lys Asn Thr Arg Trp Phe Thr Ser Ser
20 25 30
Ser Asn Gln Tyr Asn Trp Phe Asn Ser Lys Ser Arg Val Tyr Glu Met
35 40 45
Ser Lys Val Thr Phe Met Gly Phe Arg Glu Asn Lys Pro Tyr Val Ser
50 55 60
Val Ser Leu Pro Ile Asp Lys Leu Tyr Ser Ala Ser Tyr Ile Met Phe
65 70 75 80
Gln Asn Ala Asp Tyr Gly Asn Lys Trp Phe Tyr Ala Phe Val Thr Glu
85 90 95
Leu Glu Phe Lys Asn Ser Ala Val Thr Tyr Val His Phe Glu Ile Asp
100 105 110
Val Leu Gln Thr Trp Met Phe Asp Ile Lys Phe Gln Glu Ser Phe Ile
115 120 125
Val Arg Glu His Val Ile Leu Trp Asn Leu Leu Gly Thr Pro Thr Ile
130 135 140
Asn Thr Ile Asp Glu Gly Leu Ser Tyr Gly Ser Glu Tyr Asp Ile Val
145 150 155 160
Ser Val Glu Asn His Lys Pro Tyr Asp Asp Met Met Phe Leu Val Ile
165 170 175
Ile Ser Lys Ser Ile Met His Gly Thr Pro Gly Glu Glu Glu Ser Arg
180 185 190
Leu Asn Asp Ile Asn Ala Ser Leu Asn Gly Met Pro Gln Pro Leu Cys
195 200 205
Tyr Tyr Ile His Pro Phe Tyr Lys Asp Gly Lys Val Pro Lys Thr Tyr
210 215 220
Ile Gly Asp Asn Asn Ala Asn Leu Ser Pro Ile Val Asn Met Leu Thr
225 230 235 240
Asn Ile Phe Ser Gln Lys Ser Ala Val Asn Asp Ile Val Asn Met Tyr
245 250 255
Val Thr Asp Tyr Ile Gly Leu Lys Leu Asp Tyr Lys Asn Gly Asp Lys
260 265 270
Glu Leu Lys Leu Asp Lys Asp Met Phe Glu Gln Ala Gly Ile Ala Asp
275 280 285
Asp Lys His Gly Asn Val Asp Thr Ile Phe Val Lys Lys Ile Pro Asp
290 295 300
Tyr Glu Ala Leu Glu Ile Asp Thr Gly Asp Lys Trp Gly Gly Phe Thr
305 310 315 320
Lys Asp Gln Glu Ser Lys Leu Met Met Tyr Pro Tyr Cys Val Thr Glu
325 330 335
Ile Thr Asp Phe Lys Gly Asn His Met Asn Leu Lys Thr Glu Tyr Ile
340 345 350
Asn Asn Ser Lys Leu Lys Ile Gln Val Arg Gly Ser Leu Gly Val Ser
355 360 365
Asn Lys Val Ala Tyr Ser Val Gln Asp Tyr Asn Ala Asp Ser Ala Leu
370 375 380
Ser Gly Gly Asn Arg Leu Thr Ala Ser Leu Asp Ser Ser Leu Ile Asn
385 390 395 400
Asn Asn Pro Asn Asp Ile Ala Ile Leu Asn Asp Tyr Leu Ser Ala Tyr
405 410 415
Gln Leu Thr Lys Met Gly Gly Asn Thr Ala Phe Asp Tyr Gly Asn Gly
420 425 430
Tyr Arg Gly Val Tyr Val Ile Lys Lys Gln Leu Lys Ala Glu Tyr Arg
435 440 445
Arg Ser Leu Ser Ser Phe Phe His Lys Tyr Gly Tyr Lys Ile Asn Arg
450 455 460
Val Lys Lys Pro Asn Leu Arg Thr Arg Lys Ala Phe Asn Tyr Val Gln
465 470 475 480
Thr Lys Asp Cys Phe Ile Ser Gly Asp Ile Asn Asn Asn Asp Leu Gln
485 490 495
Glu Ile Arg Thr Ile Phe Asp Asn Gly Ile Thr Leu Trp His Thr Asp
500 505 510
Asn Ile Gly Asn Tyr Ser Val Glu Asn Glu Leu Arg
515 520
<210> 7
<211> 524
<212> PRT
<213> Bacteriophage phi-29
<400> 7
Met Ala Tyr Val Pro Leu Ser Gly Thr Asn Val Arg Ile Leu Ala Asp
1 5 10 15
Val Pro Phe Ser Asn Asp Tyr Lys Asn Thr Arg Trp Phe Thr Ser Ser
20 25 30
Ser Asn Gln Tyr Asn Trp Phe Asn Ser Lys Ser Arg Val Tyr Glu Met
35 40 45
Ser Lys Val Thr Phe Met Gly Phe Arg Glu Asn Lys Pro Tyr Val Ser
50 55 60
Val Ser Leu Pro Ile Asp Lys Leu Tyr Ser Ala Ser Tyr Ile Met Phe
65 70 75 80
Gln Asn Ala Asp Tyr Gly Asn Lys Trp Phe Tyr Ala Phe Val Thr Glu
85 90 95
Leu Glu Phe Lys Asn Ser Ala Val Thr Tyr Val His Phe Glu Ile Asp
100 105 110
Val Leu Gln Thr Trp Met Phe Asp Ile Lys Phe Gln Glu Ser Phe Ile
115 120 125
Val Arg Glu His Val Ile Leu Trp Asn Leu Leu Gly Thr Pro Thr Ile
130 135 140
Asn Thr Ile Asp Glu Gly Leu Ser Tyr Gly Ser Glu Tyr Leu Ile Val
145 150 155 160
Ser Val Val Asn His Lys Pro Tyr Asp Asp Met Met Phe Leu Val Ile
165 170 175
Ile Ser Lys Ser Ile Met His Gly Thr Pro Gly Glu Glu Glu Ser Arg
180 185 190
Leu Asn Asp Ile Asn Ala Ser Leu Asn Gly Met Pro Gln Pro Leu Cys
195 200 205
Tyr Tyr Ile His Pro Phe Tyr Lys Asp Gly Lys Val Pro Lys Thr Tyr
210 215 220
Ile Gly Asp Asn Asn Ala Asn Leu Ser Pro Ile Val Asn Met Leu Thr
225 230 235 240
Asn Ile Phe Ser Gln Lys Ser Ala Val Asn Asp Ile Val Asn Met Tyr
245 250 255
Val Thr Asp Tyr Ile Gly Leu Lys Leu Asp Tyr Lys Asn Gly Asp Lys
260 265 270
Glu Leu Lys Leu Asp Lys Asp Met Phe Glu Gln Ala Gly Ile Ala Asp
275 280 285
Asp Lys His Gly Asn Val Asp Thr Ile Phe Val Lys Lys Ile Pro Asp
290 295 300
Tyr Glu Ala Leu Glu Ile Asp Thr Gly Asp Lys Trp Gly Gly Phe Thr
305 310 315 320
Lys Asp Gln Glu Ser Lys Leu Met Met Tyr Pro Tyr Cys Val Thr Glu
325 330 335
Ile Thr Asp Phe Lys Gly Asn His Met Asn Leu Lys Thr Glu Tyr Ile
340 345 350
Asn Asn Ser Lys Leu Lys Ile Gln Val Arg Gly Ser Leu Gly Val Ser
355 360 365
Asn Lys Val Ala Tyr Ser Val Gln Asp Tyr Asn Ala Asp Ser Ala Leu
370 375 380
Ser Gly Gly Asn Arg Leu Thr Ala Ser Leu Asp Ser Ser Leu Ile Asn
385 390 395 400
Asn Asn Pro Asn Asp Ile Ala Ile Leu Asn Asp Tyr Leu Ser Ala Tyr
405 410 415
Gln Leu Thr Lys Met Gly Gly Asn Thr Ala Phe Asp Tyr Gly Asn Gly
420 425 430
Tyr Arg Gly Val Tyr Val Ile Lys Lys Gln Leu Lys Ala Glu Tyr Arg
435 440 445
Arg Ser Leu Ser Ser Phe Phe His Lys Tyr Gly Tyr Lys Ile Asn Arg
450 455 460
Val Lys Lys Pro Asn Leu Arg Thr Arg Lys Ala Phe Asn Tyr Val Gln
465 470 475 480
Thr Lys Asp Cys Phe Ile Ser Gly Asp Ile Asn Asn Asn Asp Leu Gln
485 490 495
Glu Ile Arg Thr Ile Phe Asp Asn Gly Ile Thr Leu Trp His Thr Asp
500 505 510
Asn Ile Gly Asn Tyr Ser Val Glu Asn Glu Leu Arg
515 520
<210> 8
<211> 524
<212> PRT
<213> Bacteriophage phi-29
<400> 8
Met Ala Tyr Val Pro Leu Ser Gly Thr Asn Val Arg Ile Leu Ala Asp
1 5 10 15
Val Pro Phe Ser Asn Asp Tyr Lys Asn Thr Arg Trp Phe Thr Ser Ser
20 25 30
Ser Asn Gln Tyr Asn Trp Phe Asn Ser Lys Ser Arg Val Tyr Glu Met
35 40 45
Ser Lys Val Thr Phe Met Gly Phe Arg Glu Asn Lys Pro Tyr Val Ser
50 55 60
Val Ser Leu Pro Ile Asp Lys Leu Tyr Ser Ala Ser Tyr Ile Met Phe
65 70 75 80
Gln Asn Ala Asp Tyr Gly Asn Lys Trp Phe Tyr Ala Phe Val Thr Glu
85 90 95
Leu Glu Phe Lys Asn Ser Ala Val Thr Tyr Val His Phe Glu Ile Asp
100 105 110
Val Leu Gln Thr Trp Met Phe Asp Ile Lys Phe Gln Glu Ser Phe Ile
115 120 125
Val Arg Glu His Val Ile Leu Trp Asn Leu Leu Gly Thr Pro Thr Ile
130 135 140
Asn Thr Ile Asp Glu Gly Leu Ser Tyr Gly Ser Glu Tyr Leu Ile Val
145 150 155 160
Ser Val Val Asn His Lys Pro Tyr Asp Asp Met Met Phe Leu Val Ile
165 170 175
Ile Ser Lys Ser Ile Met His Gly Thr Pro Gly Glu Glu Glu Ser Arg
180 185 190
Leu Asn Asp Ile Asn Ala Ser Leu Asn Gly Met Pro Gln Pro Leu Cys
195 200 205
Tyr Tyr Ile His Pro Phe Tyr Lys Asp Gly Lys Val Pro Lys Thr Tyr
210 215 220
Ile Gly Asp Asn Asn Ala Asn Leu Ser Pro Ile Val Asn Met Leu Thr
225 230 235 240
Asn Ile Phe Ser Gln Lys Ser Ala Val Asn Asp Ile Val Asn Met Tyr
245 250 255
Val Thr Asp Tyr Ile Gly Leu Lys Leu Asp Tyr Lys Asn Gly Asp Lys
260 265 270
Glu Leu Lys Leu Asp Lys Asp Met Phe Glu Gln Ala Gly Ile Ala Asp
275 280 285
Asp Lys His Gly Asn Val Asp Thr Ile Phe Val Lys Lys Ile Pro Asp
290 295 300
Tyr Glu Ala Leu Val Ile Val Thr Gly Asp Lys Trp Gly Gly Phe Thr
305 310 315 320
Lys Asp Gln Glu Ser Lys Leu Met Met Tyr Pro Tyr Cys Val Thr Glu
325 330 335
Ile Thr Asp Phe Lys Gly Asn His Met Asn Leu Lys Thr Glu Tyr Ile
340 345 350
Asn Asn Ser Lys Leu Lys Ile Gln Val Arg Gly Ser Leu Gly Val Ser
355 360 365
Asn Lys Val Ala Tyr Ser Val Gln Asp Tyr Asn Ala Asp Ser Ala Leu
370 375 380
Ser Gly Gly Asn Arg Leu Thr Ala Ser Leu Asp Ser Ser Leu Ile Asn
385 390 395 400
Asn Asn Pro Asn Asp Ile Ala Ile Leu Asn Asp Tyr Leu Ser Ala Tyr
405 410 415
Gln Leu Thr Lys Met Gly Gly Asn Thr Ala Phe Asp Tyr Gly Asn Gly
420 425 430
Tyr Arg Gly Val Tyr Val Ile Lys Lys Gln Leu Lys Ala Glu Tyr Arg
435 440 445
Arg Ser Leu Ser Ser Phe Phe His Lys Tyr Gly Tyr Lys Ile Asn Arg
450 455 460
Val Lys Lys Pro Asn Leu Arg Thr Arg Lys Ala Phe Asn Tyr Val Gln
465 470 475 480
Thr Lys Asp Cys Phe Ile Ser Gly Asp Ile Asn Asn Asn Asp Leu Gln
485 490 495
Glu Ile Arg Thr Ile Phe Asp Asn Gly Ile Thr Leu Trp His Thr Asp
500 505 510
Asn Ile Gly Asn Tyr Ser Val Glu Asn Glu Leu Arg
515 520
<210> 9
<211> 524
<212> PRT
<213> Bacteriophage phi-29
<400> 9
Met Ala Tyr Val Pro Leu Ser Gly Thr Asn Val Arg Ile Leu Ala Asp
1 5 10 15
Val Pro Phe Ser Asn Asp Tyr Lys Asn Thr Arg Trp Phe Thr Ser Ser
20 25 30
Ser Asn Gln Tyr Asn Trp Phe Asn Ser Lys Ser Arg Val Tyr Glu Met
35 40 45
Ser Lys Val Thr Phe Met Gly Phe Arg Glu Asn Lys Pro Tyr Val Ser
50 55 60
Val Ser Leu Pro Ile Asp Lys Leu Tyr Ser Ala Ser Tyr Ile Met Phe
65 70 75 80
Gln Asn Ala Asp Tyr Gly Asn Lys Trp Phe Tyr Ala Phe Val Thr Glu
85 90 95
Leu Glu Phe Lys Asn Ser Ala Val Thr Tyr Val His Phe Glu Ile Asp
100 105 110
Val Leu Gln Thr Trp Met Phe Asp Ile Lys Phe Gln Glu Ser Phe Ile
115 120 125
Val Arg Glu His Val Lys Leu Trp Asn Asp Asp Gly Thr Pro Thr Ile
130 135 140
Asn Thr Ile Asp Glu Gly Leu Ser Tyr Gly Ser Glu Tyr Asp Ile Val
145 150 155 160
Ser Val Glu Asn His Lys Pro Tyr Asp Asp Met Met Phe Leu Val Ile
165 170 175
Ile Ser Lys Ser Ile Met His Gly Thr Pro Gly Glu Glu Glu Ser Arg
180 185 190
Leu Asn Asp Ile Asn Ala Ser Leu Asn Gly Met Pro Gln Pro Leu Cys
195 200 205
Tyr Tyr Ile His Pro Phe Tyr Lys Asp Gly Lys Val Pro Lys Thr Tyr
210 215 220
Ile Gly Asp Asn Asn Ala Asn Leu Ser Pro Ile Val Asn Met Leu Thr
225 230 235 240
Asn Ile Phe Ser Gln Lys Ser Ala Val Asn Asp Ile Val Asn Met Tyr
245 250 255
Val Thr Asp Tyr Ile Gly Leu Lys Leu Asp Tyr Lys Asn Gly Asp Lys
260 265 270
Glu Leu Lys Leu Asp Lys Asp Met Phe Glu Gln Ala Gly Ile Ala Asp
275 280 285
Asp Lys His Gly Asn Val Asp Thr Ile Phe Val Lys Lys Ile Pro Asp
290 295 300
Tyr Glu Ala Leu Glu Ile Asp Thr Gly Asp Lys Trp Gly Gly Phe Thr
305 310 315 320
Lys Asp Gln Glu Ser Lys Leu Met Met Tyr Pro Tyr Cys Val Thr Glu
325 330 335
Ile Thr Asp Phe Lys Gly Asn His Met Asn Leu Lys Thr Glu Tyr Ile
340 345 350
Asn Asn Ser Ala Leu Ala Ile Gln Val Arg Gly Ser Leu Gly Val Ser
355 360 365
Asn Lys Val Ala Tyr Ser Val Gln Asp Tyr Asn Ala Asp Ser Ala Leu
370 375 380
Ser Gly Gly Asn Arg Leu Thr Ala Ser Leu Asp Ser Ser Leu Ile Asn
385 390 395 400
Asn Asn Pro Asn Asp Ile Ala Ile Leu Asn Asp Tyr Leu Ser Ala Tyr
405 410 415
Gln Leu Thr Lys Met Gly Gly Asn Thr Ala Phe Asp Tyr Gly Asn Gly
420 425 430
Tyr Arg Gly Val Tyr Val Ile Lys Lys Gln Leu Lys Ala Glu Tyr Arg
435 440 445
Arg Ser Leu Ser Ser Phe Phe His Lys Tyr Gly Tyr Lys Ile Asn Arg
450 455 460
Val Lys Lys Pro Asn Leu Arg Thr Arg Lys Ala Phe Asn Tyr Val Gln
465 470 475 480
Thr Lys Asp Cys Phe Ile Ser Gly Asp Ile Asn Asn Asn Asp Leu Gln
485 490 495
Glu Ile Arg Thr Ile Phe Asp Asn Gly Ile Thr Leu Trp His Thr Asp
500 505 510
Asn Ile Gly Asn Tyr Ser Val Glu Asn Glu Leu Arg
515 520
<210> 10
<211> 524
<212> PRT
<213> Bacteriophage phi-29
<400> 10
Met Ala Tyr Val Pro Leu Ser Gly Thr Asn Val Arg Ile Leu Ala Asp
1 5 10 15
Val Pro Phe Ser Asn Asp Tyr Lys Asn Thr Arg Trp Phe Thr Ser Ser
20 25 30
Ser Asn Gln Tyr Asn Trp Phe Asn Ser Lys Ser Arg Val Tyr Glu Met
35 40 45
Ser Lys Val Thr Phe Met Gly Phe Arg Glu Asn Lys Pro Tyr Val Ser
50 55 60
Val Ser Leu Pro Ile Asp Lys Leu Tyr Ser Ala Ser Tyr Ile Met Phe
65 70 75 80
Gln Asn Ala Asp Tyr Gly Asn Lys Trp Phe Tyr Ala Phe Val Thr Glu
85 90 95
Leu Glu Phe Lys Asn Ser Ala Val Thr Tyr Val His Phe Glu Ile Asp
100 105 110
Val Leu Gln Thr Trp Met Phe Asp Ile Lys Phe Gln Glu Ser Phe Ile
115 120 125
Val Arg Glu His Val Lys Leu Trp Asn Asp Asp Gly Thr Pro Thr Ile
130 135 140
Asn Thr Ile Asp Glu Gly Leu Ser Tyr Gly Ser Glu Tyr Asp Ile Val
145 150 155 160
Ser Val Glu Asn His Lys Pro Tyr Asp Asp Met Met Phe Leu Val Ile
165 170 175
Ile Ser Lys Ser Ile Met His Gly Thr Pro Gly Glu Glu Glu Ser Arg
180 185 190
Leu Asn Asp Ile Asn Ala Ser Leu Asn Gly Met Pro Gln Pro Leu Cys
195 200 205
Tyr Tyr Ile His Pro Phe Tyr Lys Asp Gly Lys Val Pro Lys Thr Tyr
210 215 220
Ile Gly Asp Asn Asn Ala Asn Leu Ser Pro Ile Val Asn Met Leu Thr
225 230 235 240
Asn Ile Phe Ser Gln Lys Ser Ala Val Asn Asp Ile Val Asn Met Tyr
245 250 255
Val Thr Asp Tyr Ile Gly Leu Lys Leu Asp Tyr Lys Asn Gly Asp Lys
260 265 270
Glu Leu Lys Leu Asp Lys Asp Met Phe Glu Gln Ala Gly Ile Ala Asp
275 280 285
Asp Lys His Gly Asn Val Asp Thr Ile Phe Val Lys Lys Ile Pro Asp
290 295 300
Tyr Glu Ala Leu Glu Ile Asp Thr Gly Asp Lys Trp Gly Gly Phe Thr
305 310 315 320
Lys Asp Gln Glu Ser Lys Leu Met Met Tyr Pro Tyr Cys Val Thr Glu
325 330 335
Ile Thr Asp Phe Lys Gly Asn His Met Asn Leu Lys Thr Glu Tyr Ile
340 345 350
Asn Asn Ser Ala Leu Ala Ile Gln Val Arg Gly Ser Leu Gly Val Ser
355 360 365
Asn Lys Val Ala Tyr Ser Val Gln Ala Tyr Asn Ala Val Ser Ala Leu
370 375 380
Ser Gly Gly Leu Arg Leu Thr Ala Ser Leu Asp Ser Ser Leu Ile Asn
385 390 395 400
Asn Asn Pro Asn Asp Ile Ala Ile Leu Asn Asp Tyr Leu Ser Ala Tyr
405 410 415
Gln Leu Thr Lys Met Gly Gly Asn Thr Ala Phe Asp Tyr Gly Asn Gly
420 425 430
Tyr Arg Gly Val Tyr Val Ile Lys Lys Gln Leu Lys Ala Glu Tyr Arg
435 440 445
Arg Ser Leu Ser Ser Phe Phe His Lys Tyr Gly Tyr Lys Ile Asn Arg
450 455 460
Val Lys Lys Pro Asn Leu Arg Thr Arg Lys Ala Phe Asn Tyr Val Gln
465 470 475 480
Thr Lys Asp Cys Phe Ile Ser Gly Asp Ile Asn Asn Asn Asp Leu Gln
485 490 495
Glu Ile Arg Thr Ile Phe Asp Asn Gly Ile Thr Leu Trp His Thr Asp
500 505 510
Asn Ile Gly Asn Tyr Ser Val Glu Asn Glu Leu Arg
515 520
<210> 11
<211> 524
<212> PRT
<213> Bacteriophage phi-29
<400> 11
Met Ala Tyr Val Pro Leu Ser Gly Thr Asn Val Arg Ile Leu Ala Asp
1 5 10 15
Val Pro Phe Ser Asn Asp Tyr Lys Asn Thr Arg Trp Phe Thr Ser Ser
20 25 30
Ser Asn Gln Tyr Asn Trp Phe Asn Ser Lys Ser Arg Val Tyr Glu Met
35 40 45
Ser Lys Val Thr Phe Met Gly Phe Arg Glu Asn Lys Pro Tyr Val Ser
50 55 60
Val Ser Leu Pro Ile Asp Lys Leu Tyr Ser Ala Ser Tyr Ile Met Phe
65 70 75 80
Gln Asn Ala Asp Tyr Gly Asn Lys Trp Phe Tyr Ala Phe Val Thr Glu
85 90 95
Leu Glu Phe Lys Asn Ser Ala Val Thr Tyr Val His Phe Glu Ile Asp
100 105 110
Val Leu Gln Thr Trp Met Phe Asp Ile Lys Phe Gln Glu Ser Phe Ile
115 120 125
Val Arg Glu His Val Lys Leu Trp Asn Asp Asp Gly Thr Pro Thr Ile
130 135 140
Asn Thr Ile Asp Glu Gly Leu Ser Tyr Gly Ser Glu Tyr Asp Ile Val
145 150 155 160
Ser Val Glu Asn His Lys Pro Tyr Asp Asp Met Met Phe Leu Val Ile
165 170 175
Ile Ser Lys Ser Ile Met His Gly Thr Pro Gly Glu Glu Glu Ser Arg
180 185 190
Leu Asn Asp Ile Asn Ala Ser Leu Asn Gly Met Pro Gln Pro Leu Cys
195 200 205
Tyr Tyr Ile His Pro Phe Tyr Lys Asp Gly Lys Val Pro Lys Thr Tyr
210 215 220
Ile Gly Asp Asn Asn Ala Asn Leu Ser Pro Ile Val Asn Met Leu Thr
225 230 235 240
Asn Ile Phe Ser Gln Lys Ser Ala Val Asn Asp Ile Val Asn Met Tyr
245 250 255
Val Thr Asp Tyr Ile Gly Leu Lys Leu Asp Tyr Lys Asn Gly Asp Lys
260 265 270
Glu Leu Lys Leu Asp Lys Asp Met Phe Glu Gln Ala Gly Ile Ala Asp
275 280 285
Asp Lys His Gly Asn Val Asp Thr Ile Phe Val Lys Lys Ile Pro Asp
290 295 300
Tyr Glu Ala Leu Glu Ile Asp Thr Gly Asp Lys Trp Gly Gly Phe Thr
305 310 315 320
Lys Asp Gln Glu Ser Lys Leu Met Met Tyr Pro Tyr Cys Val Thr Glu
325 330 335
Ile Thr Asp Phe Lys Gly Asn His Met Asn Leu Lys Thr Glu Tyr Ile
340 345 350
Asn Asn Ser Ala Leu Ala Ile Gln Val Arg Gly Ser Leu Gly Val Ser
355 360 365
Asn Lys Val Ala Tyr Ser Val Gln Ala Tyr Asn Ala Val Ser Ala Leu
370 375 380
Ser Gly Gly Leu Arg Leu Thr Ala Ser Leu Asp Ser Ser Leu Ile Asn
385 390 395 400
Asn Asn Pro Asn Asp Ile Ala Ile Leu Asn Asp Tyr Leu Ser Ala Tyr
405 410 415
Gln Leu Thr Lys Met Gly Gly Asn Thr Ala Phe Asp Tyr Gly Asn Gly
420 425 430
Tyr Arg Gly Val Tyr Val Ile Lys Lys Gln Leu Lys Ala Glu Tyr Arg
435 440 445
Arg Ser Leu Ser Ser Phe Phe His Lys Tyr Gly Tyr Lys Ile Asn Arg
450 455 460
Val Lys Lys Pro Asn Leu Arg Thr Arg Lys Ala Phe Asn Tyr Val Gln
465 470 475 480
Thr Lys Asp Cys Phe Ile Ser Gly Asp Ile Asn Asn Asn Asp Leu Gln
485 490 495
Glu Ile Arg Thr Ile Phe Asp Asn Gly Ile Thr Leu Trp His Thr Asp
500 505 510
Asn Ile Gly Asn Tyr Ser Val Glu Asn Glu Leu Ile
515 520
<210> 12
<211> 548
<212> PRT
<213> Bacteriophage phi-29
<400> 12
Met Asn His Lys His His His His His His Ser Ser Gly Glu Asn Leu
1 5 10 15
Tyr Phe Gln Gly His Met Gly Ser Met Ala Tyr Val Pro Leu Ser Gly
20 25 30
Thr Asn Val Arg Ile Leu Ala Asp Val Pro Phe Ser Asn Asp Tyr Lys
35 40 45
Asn Thr Arg Trp Phe Thr Ser Ser Ser Asn Gln Tyr Asn Trp Phe Asn
50 55 60
Ser Lys Ser Arg Val Tyr Glu Met Ser Lys Val Thr Phe Met Gly Phe
65 70 75 80
Arg Glu Asn Lys Pro Tyr Val Ser Val Ser Leu Pro Ile Asp Lys Leu
85 90 95
Tyr Ser Ala Ser Tyr Ile Met Phe Gln Asn Ala Asp Tyr Gly Asn Lys
100 105 110
Trp Phe Tyr Ala Phe Val Thr Glu Leu Glu Phe Lys Asn Ser Ala Val
115 120 125
Thr Tyr Val His Phe Glu Ile Asp Val Leu Gln Thr Trp Met Phe Asp
130 135 140
Ile Lys Phe Gln Glu Ser Phe Ile Val Arg Glu His Val Lys Leu Trp
145 150 155 160
Asn Asp Asp Gly Thr Pro Thr Ile Asn Thr Ile Asp Glu Gly Leu Ser
165 170 175
Tyr Gly Ser Glu Tyr Asp Ile Val Ser Val Glu Asn His Lys Pro Tyr
180 185 190
Asp Asp Met Met Phe Leu Val Ile Ile Ser Lys Ser Ile Met His Gly
195 200 205
Thr Pro Gly Glu Glu Glu Ser Arg Leu Asn Asp Ile Asn Ala Ser Leu
210 215 220
Asn Gly Met Pro Gln Pro Leu Cys Tyr Tyr Ile His Pro Phe Tyr Lys
225 230 235 240
Asp Gly Lys Val Pro Lys Thr Tyr Ile Gly Asp Asn Asn Ala Asn Leu
245 250 255
Ser Pro Ile Val Asn Met Leu Thr Asn Ile Phe Ser Gln Lys Ser Ala
260 265 270
Val Asn Asp Ile Val Asn Met Tyr Val Thr Asp Tyr Ile Gly Leu Lys
275 280 285
Leu Asp Tyr Lys Asn Gly Asp Lys Glu Leu Lys Leu Asp Lys Asp Met
290 295 300
Phe Glu Gln Ala Gly Ile Ala Asp Asp Lys His Gly Asn Val Asp Thr
305 310 315 320
Ile Phe Val Lys Lys Ile Pro Asp Tyr Glu Ala Leu Glu Ile Asp Thr
325 330 335
Gly Asp Lys Trp Gly Gly Phe Thr Lys Asp Gln Glu Ser Lys Leu Met
340 345 350
Met Tyr Pro Tyr Cys Val Thr Glu Ile Thr Asp Phe Lys Gly Asn His
355 360 365
Met Asn Leu Lys Thr Glu Tyr Ile Asn Asn Ser Lys Leu Lys Ile Gln
370 375 380
Val Arg Gly Ser Leu Gly Val Ser Asn Lys Val Ala Tyr Ser Val Gln
385 390 395 400
Asp Tyr Asn Ala Asp Ser Ala Leu Ser Gly Gly Asn Arg Leu Thr Ala
405 410 415
Ser Leu Asp Ser Ser Leu Ile Asn Asn Asn Pro Asn Asp Ile Ala Ile
420 425 430
Leu Asn Asp Tyr Leu Ser Ala Tyr Gln Leu Thr Lys Met Gly Gly Asn
435 440 445
Thr Ala Phe Asp Tyr Gly Asn Gly Tyr Arg Gly Val Tyr Val Ile Lys
450 455 460
Lys Gln Leu Lys Ala Glu Tyr Arg Arg Ser Leu Ser Ser Phe Phe His
465 470 475 480
Lys Tyr Gly Tyr Lys Ile Asn Arg Val Lys Lys Pro Asn Leu Arg Thr
485 490 495
Arg Lys Ala Phe Asn Tyr Val Gln Thr Lys Asp Cys Phe Ile Ser Gly
500 505 510
Asp Ile Asn Asn Asn Asp Leu Gln Glu Ile Arg Thr Ile Phe Asp Asn
515 520 525
Gly Ile Thr Leu Trp His Thr Asp Asn Ile Gly Asn Tyr Ser Val Glu
530 535 540
Asn Glu Leu Arg
545
<210> 13
<211> 599
<212> PRT
<213> Bacteriophage phi-29
<400> 13
Met Ala Tyr Val Pro Leu Ser Gly Thr Asn Val Arg Ile Leu Ala Asp
1 5 10 15
Val Pro Phe Ser Asn Asp Tyr Lys Asn Thr Arg Trp Phe Thr Ser Ser
20 25 30
Ser Asn Gln Tyr Asn Trp Phe Asn Ser Lys Ser Arg Val Tyr Glu Met
35 40 45
Ser Lys Val Thr Phe Met Gly Phe Arg Glu Asn Lys Pro Tyr Val Ser
50 55 60
Val Ser Leu Pro Ile Asp Lys Leu Tyr Ser Ala Ser Tyr Ile Met Phe
65 70 75 80
Gln Asn Ala Asp Tyr Gly Asn Lys Trp Phe Tyr Ala Phe Val Thr Glu
85 90 95
Leu Glu Phe Lys Asn Ser Ala Val Thr Tyr Val His Phe Glu Ile Asp
100 105 110
Val Leu Gln Thr Trp Met Phe Asp Ile Lys Phe Gln Glu Ser Phe Ile
115 120 125
Val Arg Glu His Val Lys Leu Trp Asn Asp Asp Gly Thr Pro Thr Ile
130 135 140
Asn Thr Ile Asp Glu Gly Leu Ser Tyr Gly Ser Glu Tyr Asp Ile Val
145 150 155 160
Ser Val Glu Asn His Lys Pro Tyr Asp Asp Met Met Phe Leu Val Ile
165 170 175
Ile Ser Lys Ser Ile Met His Gly Thr Pro Gly Glu Glu Glu Ser Arg
180 185 190
Leu Asn Asp Ile Asn Ala Ser Leu Asn Gly Met Pro Gln Pro Leu Cys
195 200 205
Tyr Tyr Ile His Pro Phe Tyr Lys Asp Gly Lys Val Pro Lys Thr Tyr
210 215 220
Ile Gly Asp Asn Asn Ala Asn Leu Ser Pro Ile Val Asn Met Leu Thr
225 230 235 240
Asn Ile Phe Ser Gln Lys Ser Ala Val Asn Asp Ile Val Asn Met Tyr
245 250 255
Val Thr Asp Tyr Ile Gly Leu Lys Leu Asp Tyr Lys Asn Gly Asp Lys
260 265 270
Glu Leu Lys Leu Asp Lys Asp Met Phe Glu Gln Ala Gly Ile Ala Asp
275 280 285
Asp Lys His Gly Asn Val Asp Thr Ile Phe Val Lys Lys Ile Pro Asp
290 295 300
Tyr Glu Ala Leu Glu Ile Asp Thr Gly Asp Lys Trp Gly Gly Phe Thr
305 310 315 320
Cys Asp Gln Glu Ser Lys Leu Met Met Tyr Pro Tyr Cys Val Thr Glu
325 330 335
Ile Thr Asp Phe Lys Gly Asn His Met Asn Leu Lys Thr Glu Tyr Ile
340 345 350
Asn Asn Ser Lys Leu Lys Ile Gln Val Arg Gly Ser Leu Gly Val Ser
355 360 365
Asn Lys Val Ala Tyr Ser Val Gln Asp Tyr Asn Ala Asp Ser Ala Leu
370 375 380
Ser Gly Gly Asn Arg Leu Thr Ala Ser Leu Asp Ser Ser Leu Ile Asn
385 390 395 400
Asn Asn Pro Asn Asp Ile Ala Ile Leu Asn Asp Tyr Leu Ser Ala Tyr
405 410 415
Leu Gln Gly Asn Lys Asn Ser Leu Glu Asn Gln Lys Ser Ser Ile Leu
420 425 430
Phe Asn Gly Ile Met Gly Met Ile Gly Gly Gly Ile Ser Ala Gly Ala
435 440 445
Ser Ala Ala Gly Gly Ser Ala Leu Gly Met Ala Ser Ser Val Thr Gly
450 455 460
Met Thr Ser Thr Ala Gly Asn Ala Val Leu Gln Met Gln Ala Met Gln
465 470 475 480
Ala Lys Gln Ala Asp Ile Ala Asn Ile Pro Pro Gln Leu Thr Lys Met
485 490 495
Gly Gly Asn Thr Ala Phe Asp Tyr Gly Asn Gly Tyr Arg Gly Val Tyr
500 505 510
Val Ile Lys Lys Gln Leu Lys Ala Glu Tyr Arg Arg Ser Leu Ser Ser
515 520 525
Phe Phe His Lys Tyr Gly Tyr Lys Ile Asn Arg Val Lys Lys Pro Asn
530 535 540
Leu Arg Thr Arg Lys Ala Phe Asn Tyr Val Gln Thr Lys Asp Cys Phe
545 550 555 560
Ile Ser Gly Asp Ile Asn Asn Asn Asp Leu Gln Glu Ile Arg Thr Ile
565 570 575
Phe Asp Asn Gly Ile Thr Leu Trp His Thr Asp Asn Ile Gly Asn Tyr
580 585 590
Ser Val Glu Asn Glu Leu Arg
595
<210> 14
<211> 599
<212> PRT
<213> Bacteriophage phi-29
<400> 14
Met Ala Tyr Val Pro Leu Ser Gly Thr Asn Val Arg Ile Leu Ala Asp
1 5 10 15
Val Pro Phe Ser Asn Asp Tyr Lys Asn Thr Arg Trp Phe Thr Ser Ser
20 25 30
Ser Asn Gln Tyr Asn Trp Phe Asn Ser Lys Ser Arg Val Tyr Glu Met
35 40 45
Ser Lys Val Thr Phe Met Gly Phe Arg Glu Asn Lys Pro Tyr Val Ser
50 55 60
Val Ser Leu Pro Ile Asp Lys Leu Tyr Ser Ala Ser Tyr Ile Met Phe
65 70 75 80
Gln Asn Ala Asp Tyr Gly Asn Lys Trp Phe Tyr Ala Phe Val Thr Glu
85 90 95
Leu Glu Phe Lys Asn Ser Ala Val Thr Tyr Val His Phe Glu Ile Asp
100 105 110
Val Leu Gln Thr Trp Met Phe Asp Ile Lys Phe Gln Glu Ser Phe Ile
115 120 125
Val Arg Glu His Val Lys Leu Trp Asn Asp Asp Gly Thr Pro Thr Ile
130 135 140
Asn Thr Ile Asp Glu Gly Leu Ser Tyr Gly Ser Glu Tyr Asp Ile Val
145 150 155 160
Ser Val Glu Asn His Lys Pro Tyr Asp Asp Met Met Phe Leu Val Ile
165 170 175
Ile Ser Lys Ser Ile Met His Gly Thr Pro Gly Glu Glu Glu Ser Arg
180 185 190
Leu Asn Asp Ile Asn Ala Ser Leu Asn Gly Met Pro Gln Pro Leu Cys
195 200 205
Tyr Tyr Ile His Pro Phe Tyr Lys Asp Gly Lys Val Pro Lys Thr Tyr
210 215 220
Ile Gly Asp Asn Asn Ala Asn Leu Ser Pro Ile Val Asn Met Leu Thr
225 230 235 240
Asn Ile Phe Ser Gln Lys Ser Ala Val Asn Asp Ile Val Asn Met Tyr
245 250 255
Val Thr Asp Tyr Ile Gly Leu Lys Leu Asp Tyr Lys Asn Gly Asp Lys
260 265 270
Glu Leu Lys Leu Asp Lys Asp Met Phe Glu Gln Ala Gly Ile Ala Asp
275 280 285
Asp Lys His Gly Asn Val Asp Thr Ile Phe Val Lys Lys Ile Pro Asp
290 295 300
Tyr Glu Ala Leu Glu Ile Asp Thr Gly Asp Lys Trp Gly Gly Phe Thr
305 310 315 320
Lys Asp Gln Glu Ser Lys Leu Met Met Tyr Pro Tyr Cys Val Thr Glu
325 330 335
Ile Thr Asp Phe Lys Gly Asn His Met Asn Leu Lys Thr Glu Tyr Ile
340 345 350
Asn Asn Ser Lys Leu Cys Ile Gln Val Arg Gly Ser Leu Gly Val Ser
355 360 365
Asn Lys Val Ala Tyr Ser Val Gln Asp Tyr Asn Ala Asp Ser Ala Leu
370 375 380
Ser Gly Gly Asn Arg Leu Thr Ala Ser Leu Asp Ser Ser Leu Ile Asn
385 390 395 400
Asn Asn Pro Asn Asp Ile Ala Ile Leu Asn Asp Tyr Leu Ser Ala Tyr
405 410 415
Leu Gln Gly Asn Lys Asn Ser Leu Glu Asn Gln Lys Ser Ser Ile Leu
420 425 430
Phe Asn Gly Ile Met Gly Met Ile Gly Gly Gly Ile Ser Ala Gly Ala
435 440 445
Ser Ala Ala Gly Gly Ser Ala Leu Gly Met Ala Ser Ser Val Thr Gly
450 455 460
Met Thr Ser Thr Ala Gly Asn Ala Val Leu Gln Met Gln Ala Met Gln
465 470 475 480
Ala Lys Gln Ala Asp Ile Ala Asn Ile Pro Pro Gln Leu Thr Lys Met
485 490 495
Gly Gly Asn Thr Ala Phe Asp Tyr Gly Asn Gly Tyr Arg Gly Val Tyr
500 505 510
Val Ile Lys Lys Gln Leu Lys Ala Glu Tyr Arg Arg Ser Leu Ser Ser
515 520 525
Phe Phe His Lys Tyr Gly Tyr Lys Ile Asn Arg Val Lys Lys Pro Asn
530 535 540
Leu Arg Thr Arg Lys Ala Phe Asn Tyr Val Gln Thr Lys Asp Cys Phe
545 550 555 560
Ile Ser Gly Asp Ile Asn Asn Asn Asp Leu Gln Glu Ile Arg Thr Ile
565 570 575
Phe Asp Asn Gly Ile Thr Leu Trp His Thr Asp Asn Ile Gly Asn Tyr
580 585 590
Ser Val Glu Asn Glu Leu Arg
595
<210> 15
<211> 599
<212> PRT
<213> Bacteriophage phi-29
<400> 15
Met Ala Tyr Val Pro Leu Ser Gly Thr Asn Val Arg Ile Leu Ala Asp
1 5 10 15
Val Pro Phe Ser Asn Asp Tyr Lys Asn Thr Arg Trp Phe Thr Ser Ser
20 25 30
Ser Asn Gln Tyr Asn Trp Phe Asn Ser Lys Ser Arg Val Tyr Glu Met
35 40 45
Ser Lys Val Thr Phe Met Gly Phe Arg Glu Asn Lys Pro Tyr Val Ser
50 55 60
Val Ser Leu Pro Ile Asp Lys Leu Tyr Ser Ala Ser Tyr Ile Met Phe
65 70 75 80
Gln Asn Ala Asp Tyr Gly Asn Lys Trp Phe Tyr Ala Phe Val Thr Glu
85 90 95
Leu Glu Phe Lys Asn Ser Ala Val Thr Tyr Val His Phe Glu Ile Asp
100 105 110
Val Leu Gln Thr Trp Met Phe Asp Ile Lys Phe Gln Glu Ser Phe Ile
115 120 125
Val Arg Glu His Val Lys Leu Trp Asn Asp Asp Gly Thr Pro Thr Ile
130 135 140
Asn Thr Ile Asp Glu Gly Leu Ser Tyr Gly Ser Glu Tyr Asp Ile Val
145 150 155 160
Ser Val Glu Asn His Lys Pro Tyr Asp Asp Met Met Phe Leu Val Ile
165 170 175
Ile Ser Lys Ser Ile Met His Gly Thr Pro Gly Glu Glu Glu Ser Arg
180 185 190
Leu Asn Asp Ile Asn Ala Ser Leu Asn Gly Met Pro Gln Pro Leu Cys
195 200 205
Tyr Tyr Ile His Pro Phe Tyr Lys Asp Gly Lys Val Pro Lys Thr Tyr
210 215 220
Ile Gly Asp Asn Asn Ala Asn Leu Ser Pro Ile Val Asn Met Leu Thr
225 230 235 240
Asn Ile Phe Ser Gln Lys Ser Ala Val Asn Asp Ile Val Asn Met Tyr
245 250 255
Val Thr Asp Tyr Ile Gly Leu Lys Leu Asp Tyr Lys Asn Gly Asp Lys
260 265 270
Glu Leu Lys Leu Asp Lys Asp Met Phe Glu Gln Ala Gly Ile Ala Asp
275 280 285
Asp Lys His Gly Asn Val Asp Thr Ile Phe Val Lys Lys Ile Pro Asp
290 295 300
Tyr Glu Ala Leu Glu Ile Asp Thr Gly Asp Lys Trp Gly Gly Phe Thr
305 310 315 320
Lys Asp Gln Glu Ser Lys Leu Met Met Tyr Pro Tyr Cys Val Thr Glu
325 330 335
Ile Thr Asp Phe Lys Gly Asn His Met Asn Leu Lys Thr Glu Tyr Ile
340 345 350
Asn Asn Ser Lys Leu Lys Ile Gln Val Arg Gly Ser Leu Gly Val Ser
355 360 365
Asn Lys Val Ala Tyr Ser Val Gln Asp Tyr Asn Ala Asp Ser Ala Leu
370 375 380
Ser Gly Gly Asn Arg Leu Thr Ala Ser Leu Asp Ser Ser Leu Ile Asn
385 390 395 400
Asn Asn Pro Asn Asp Ile Ala Ile Leu Asn Asp Tyr Leu Ser Ala Tyr
405 410 415
Leu Gln Gly Asn Lys Asn Ser Leu Glu Asn Gln Lys Ser Ser Ile Leu
420 425 430
Phe Asn Gly Ile Met Gly Met Ile Gly Gly Gly Ile Ser Ala Gly Ala
435 440 445
Ser Ala Ala Gly Gly Ser Ala Leu Gly Met Ala Ser Ser Val Thr Gly
450 455 460
Met Thr Ser Thr Ala Gly Asn Ala Val Leu Gln Met Gln Ala Met Gln
465 470 475 480
Ala Lys Gln Ala Asp Ile Ala Asn Ile Pro Pro Gln Leu Thr Lys Met
485 490 495
Gly Gly Asn Thr Ala Phe Asp Tyr Gly Asn Gly Tyr Arg Gly Val Tyr
500 505 510
Val Ile Lys Lys Gln Leu Lys Ala Glu Tyr Arg Arg Ser Leu Ser Ser
515 520 525
Phe Phe His Lys Tyr Gly Tyr Lys Ile Asn Arg Val Lys Lys Pro Asn
530 535 540
Leu Arg Thr Arg Lys Ala Phe Asn Tyr Val Gln Thr Lys Asp Cys Phe
545 550 555 560
Ile Ser Gly Asp Ile Asn Asn Asn Asp Leu Gln Glu Ile Arg Thr Ile
565 570 575
Phe Asp Asn Gly Ile Thr Leu Trp His Thr Asp Asn Ile Gly Asn Tyr
580 585 590
Ser Val Cys Asn Glu Leu Arg
595
<210> 16
<211> 583
<212> PRT
<213> Bacteriophage C1
<400> 16
Met Thr Leu Ser Lys Ile Lys Leu Phe Tyr Asn Thr Pro Phe Asn Asn
1 5 10 15
Met Gln Asn Thr Leu His Phe Asn Ser Asn Glu Glu Arg Asp Ala Tyr
20 25 30
Phe Asn Ser Lys Phe Asp Val His Glu Phe Thr Ser Thr Phe Asn Tyr
35 40 45
Arg Asn Met Lys Gly Val Leu Arg Val Thr Ile Asp Leu Val Ser Asp
50 55 60
Arg Ser Cys Phe Glu Gln Leu Met Gly Val Asn Tyr Cys Gln Val Gln
65 70 75 80
Tyr Ile Gln Ser Asn Arg Val Glu Tyr Leu Phe Val Thr Asp Ile Gln
85 90 95
Gln Leu Asn Asp Lys Val Cys Glu Leu Ser Leu Val Pro Asp Val Val
100 105 110
Met Thr Tyr Thr Gln Gly Asn Val Leu Asn Thr Leu Asn Asn Val Asn
115 120 125
Val Ile Arg Gln His Tyr Thr Gln Thr Glu Tyr Glu Gln Asn Leu Glu
130 135 140
Gln Ile Arg Ser Asn Asn Asp Val Leu Ala Thr Ser Thr Met Arg Val
145 150 155 160
His Ala Ile Lys Ser Glu Leu Phe Thr Gln Leu Glu Tyr Ile Leu Thr
165 170 175
Ile Gly Ala Asn Leu Arg Lys Ser Phe Gly Thr Ala Glu Lys Pro Lys
180 185 190
Phe Pro Ser Ser Ser Gly Ser Thr His Asp Gly Ile Tyr Asn Pro Tyr
195 200 205
Asp Met Tyr Trp Phe Asn Asp Tyr Glu Ser Leu Lys Glu Val Met Asp
210 215 220
Tyr Leu Thr Gly Tyr Pro Trp Ile Gln Gln Ser Ile Lys Asn Val Thr
225 230 235 240
Ile Ile Pro Ser Gly Phe Ile Lys Gln Glu Ser Leu Asn Asp His Glu
245 250 255
Pro Val Asn Gly Gly Asp Leu Ser Val Arg Lys Leu Gly Lys Gln Gly
260 265 270
Val Ser Asn Gln Lys Asp Phe Asn Ala Ile Ser Leu Asp Tyr Gln Ser
275 280 285
Leu Met Phe Thr Leu Gly Leu Asn Pro Ile Asn Asp Lys His Leu Leu
290 295 300
Arg Pro Asn Ile Val Thr Ala Glu Leu Thr Asp Tyr Ala Gly Asn Arg
305 310 315 320
Leu Pro Ile Asp Leu Ser Leu Ile Glu Thr Asn Leu Glu Phe Asp Ser
325 330 335
Phe Val Thr Met Gly Ala Lys Asn Glu Ile Lys Val Tyr Val Lys Asn
340 345 350
Tyr Asn Ala Arg Gly Asn Asn Val Gly Gln Tyr Ile Asp Asn Ala Leu
355 360 365
Thr Ile Asn Asn Phe Asp Thr Ile Gly Phe Ser Val Asp Ser Gly Glu
370 375 380
Leu Gly Lys Ala Asn Ser Ala Tyr Ser Arg Glu Leu Ser Asn Ser Arg
385 390 395 400
Gln Met Ser Ser Arg Ile Asn Thr Val Leu Asp Asn Asp Ala Ser Val
405 410 415
Lys Asp Arg Leu Phe Asn Ala Ile Ser Leu Ser Gly Gly Leu Ser Ile
420 425 430
Lys Ser Ala Leu Ser Gly Phe Asn Asn Glu Tyr Glu His Tyr Arg Asp
435 440 445
Gln Lys Ala Gln Phe Lys Gln Met Asp Ala Leu Pro Asn Ala Ile Thr
450 455 460
Glu Gly His Val Gly Tyr Ala Pro Leu Phe Lys Gln Asp Lys Phe Gly
465 470 475 480
Val His Leu Arg Leu Gly Arg Ile Ser Gln Asp Glu Leu Asn Asn Val
485 490 495
Lys Lys Tyr Tyr Asn Met Phe Gly Tyr Glu Cys Asn Asp Tyr Ser Thr
500 505 510
Lys Leu Ser Asp Ile Thr Ser Met Ser Ile Cys Asn Trp Val Gln Phe
515 520 525
Lys Gly Ile Trp Thr Leu Pro Asn Val Asp Thr Gly His Met Asn Met
530 535 540
Leu Arg Ala Leu Phe Glu Ala Gly Val Arg Leu Trp His Lys Glu Ser
545 550 555 560
Asp Met Ile Asn Asn Thr Val Val Asn Asn Val Ile Ile Lys Ser Leu
565 570 575
Glu His His His His His His
580
<210> 17
<211> 381
<212> PRT
<213> Neisseria meningitidis (group B)
<400> 17
Met Gln Asn Asn Ser Tyr Gly Tyr Ala Val Ser Val Arg Val Gly Gly
1 5 10 15
Lys Glu His Arg His Trp Glu Arg Tyr Asp Ile Asp Ser Asp Phe Leu
20 25 30
Ile Pro Ala Asp Ser Phe Asp Phe Val Ile Gly Arg Leu Gly Pro Glu
35 40 45
Ala Ala Ile Pro Asp Leu Ser Gly Glu Ser Cys Glu Val Val Ile Asp
50 55 60
Gly Gln Ile Val Met Thr Gly Ile Ile Gly Ser Gln Arg His Gly Lys
65 70 75 80
Ser Lys Gly Ser Arg Glu Leu Ser Leu Ser Gly Arg Asp Leu Ala Gly
85 90 95
Phe Leu Val Asp Cys Ser Ala Pro Gln Leu Asn Val Lys Gly Met Thr
100 105 110
Val Leu Asp Ala Ala Lys Lys Leu Ala Ala Pro Trp Pro Gln Ile Lys
115 120 125
Ala Val Val Leu Lys Ala Glu Asn Asn Pro Ala Leu Gly Lys Ile Asp
130 135 140
Ile Glu Pro Gly Glu Thr Val Trp Gln Ala Leu Thr His Ile Ala Asn
145 150 155 160
Ser Val Gly Leu His Pro Trp Leu Glu Pro Asp Gly Thr Leu Val Val
165 170 175
Gly Gly Ala Asp Tyr Ser Ser Pro Pro Val Ala Thr Leu Cys Trp Ser
180 185 190
Arg Thr Asp Ser Arg Cys Asn Ile Glu Arg Met Asp Ile Glu Trp Asp
195 200 205
Thr Asp Asn Arg Phe Ser Glu Val Thr Phe Leu Ala Gln Ser His Gly
210 215 220
Arg Ser Gly Asp Ser Ala Lys His Asp Leu Lys Trp Val Tyr Lys Asp
225 230 235 240
Pro Thr Met Thr Leu His Arg Pro Lys Thr Val Val Val Ser Asp Ala
245 250 255
Asp Asn Leu Ala Ala Leu Gln Lys Gln Ala Lys Lys Gln Leu Ala Asp
260 265 270
Trp Arg Leu Glu Gly Phe Thr Leu Thr Ile Thr Val Gly Gly His Lys
275 280 285
Thr Arg Asp Gly Val Leu Trp Gln Pro Gly Leu Arg Val His Val Ile
290 295 300
Asp Asp Glu His Gly Ile Asp Ala Val Phe Phe Leu Met Gly Arg Arg
305 310 315 320
Phe Met Leu Ser Arg Met Asp Gly Thr Gln Thr Glu Leu Arg Leu Lys
325 330 335
Glu Asp Gly Ile Trp Thr Pro Asp Ala Tyr Pro Lys Lys Ala Glu Ala
340 345 350
Ala Arg Lys Arg Lys Gly Lys Arg Lys Gly Val Ser His Lys Gly Lys
355 360 365
Lys Gly Gly Lys Lys Gln Ala Glu Thr Ala Val Phe Glu
370 375 380
<210> 18
<211> 163
<212> PRT
<213> Bacteriophage T4
<400> 18
Met Phe Val Asp Asp Val Thr Arg Ala Phe Glu Ser Gly Asp Phe Ala
1 5 10 15
Arg Pro Asn Leu Phe Gln Val Glu Ile Ser Tyr Leu Gly Gln Asn Phe
20 25 30
Thr Phe Gln Cys Lys Ala Thr Ala Leu Pro Ala Gly Ile Val Glu Lys
35 40 45
Ile Pro Val Gly Phe Met Asn Arg Lys Ile Asn Val Ala Gly Asp Arg
50 55 60
Thr Phe Asp Asp Trp Thr Val Thr Val Met Asn Asp Glu Ala His Asp
65 70 75 80
Ala Arg Gln Lys Phe Val Asp Trp Gln Ser Ile Ala Ala Gly Gln Gly
85 90 95
Asn Glu Ile Thr Gly Gly Lys Pro Ala Glu Tyr Lys Lys Ser Ala Ile
100 105 110
Val Arg Gln Tyr Ala Arg Asp Ala Lys Thr Val Thr Lys Glu Ile Glu
115 120 125
Ile Lys Gly Leu Trp Pro Thr Asn Val Gly Glu Leu Gln Leu Asp Trp
130 135 140
Asp Ser Asn Asn Glu Ile Gln Thr Phe Glu Val Thr Leu Ala Leu Asp
145 150 155 160
Tyr Trp Glu
<210> 19
<211> 140
<212> PRT
<213> Bacteriophage phi-X174
<400> 19
Met Val Asp Ala Gly Phe Glu Asn Gln Lys Glu Leu Thr Lys Met Gln
1 5 10 15
Leu Asp Asn Gln Lys Glu Ile Ala Glu Met Gln Asn Glu Thr Gln Lys
20 25 30
Glu Ile Ala Gly Ile Gln Ser Ala Thr Ser Arg Gln Asn Thr Lys Asp
35 40 45
Gln Val Tyr Ala Gln Asn Glu Met Leu Ala Tyr Gln Gln Lys Glu Ser
50 55 60
Thr Ala Arg Val Ala Ser Ile Met Glu Asn Thr Asn Leu Ser Lys Gln
65 70 75 80
Gln Gln Val Ser Glu Ile Met Arg Gln Met Leu Thr Gln Ala Gln Thr
85 90 95
Ala Gly Gln Tyr Phe Thr Asn Asp Gln Ile Lys Glu Met Thr Arg Lys
100 105 110
Val Ser Ala Glu Val Asp Leu Val His Gln Gln Thr Gln Asn Gln Arg
115 120 125
Tyr Gly Ser Ser His Ile Gly Ala Thr Ala Lys Asp
130 135 140
<210> 20
<211> 246
<212> PRT
<213> Bacteriophage lambda
<400> 20
Met Pro Val Pro Asn Pro Thr Met Pro Val Lys Gly Ala Gly Thr Thr
1 5 10 15
Leu Trp Val Tyr Lys Gly Ser Gly Asp Pro Tyr Ala Asn Pro Leu Ser
20 25 30
Asp Val Asp Trp Ser Arg Leu Ala Lys Val Lys Asp Leu Thr Pro Gly
35 40 45
Glu Leu Thr Ala Glu Ser Tyr Asp Asp Ser Tyr Leu Asp Asp Glu Asp
50 55 60
Ala Asp Trp Thr Ala Thr Gly Gln Gly Gln Lys Ser Ala Gly Asp Thr
65 70 75 80
Ser Phe Thr Leu Ala Trp Met Pro Gly Glu Gln Gly Gln Gln Ala Leu
85 90 95
Leu Ala Trp Phe Asn Glu Gly Asp Thr Arg Ala Tyr Lys Ile Arg Phe
100 105 110
Pro Asn Gly Thr Val Asp Val Phe Arg Gly Trp Val Ser Ser Ile Gly
115 120 125
Lys Ala Val Thr Ala Lys Glu Val Ile Thr Arg Thr Val Lys Val Thr
130 135 140
Asn Val Gly Arg Pro Ser Met Ala Glu Asp Arg Ser Thr Val Thr Ala
145 150 155 160
Ala Thr Gly Met Thr Val Thr Pro Ala Ser Thr Ser Val Val Lys Gly
165 170 175
Gln Ser Thr Thr Leu Thr Val Ala Phe Gln Pro Glu Gly Val Thr Asp
180 185 190
Lys Ser Phe Arg Ala Val Ser Ala Asp Lys Thr Lys Ala Thr Val Ser
195 200 205
Val Ser Gly Met Thr Ile Thr Val Asn Gly Val Ala Ala Gly Lys Val
210 215 220
Asn Ile Pro Val Val Ser Gly Asn Gly Glu Phe Ala Ala Val Ala Glu
225 230 235 240
Ile Thr Val Thr Ala Ser
245
<210> 21
<211> 252
<212> PRT
<213> Bacteriophage SPP1
<400> 21
Asn Ile Tyr Asp Ile Leu Asp Lys Val Phe Thr Met Met Tyr Asp Gly
1 5 10 15
Gln Asp Leu Thr Asp Tyr Phe Leu Val Gln Glu Val Arg Gly Arg Ser
20 25 30
Val Tyr Ser Ile Glu Met Gly Lys Arg Thr Ile Ala Gly Val Asp Gly
35 40 45
Gly Val Ile Thr Thr Glu Ser Leu Pro Ala Arg Glu Leu Glu Val Asp
50 55 60
Ala Ile Val Phe Gly Asp Gly Thr Glu Thr Asp Leu Arg Arg Arg Ile
65 70 75 80
Glu Tyr Leu Asn Phe Leu Leu His Arg Asp Thr Asp Val Pro Ile Thr
85 90 95
Phe Ser Asp Glu Pro Ser Arg Thr Tyr Tyr Gly Arg Tyr Glu Phe Ala
100 105 110
Thr Glu Gly Asp Glu Lys Gly Gly Phe His Lys Val Thr Leu Asn Phe
115 120 125
Tyr Cys Gln Asp Pro Leu Lys Tyr Gly Pro Glu Val Thr Thr Asp Val
130 135 140
Thr Thr Ala Ser Thr Pro Val Lys Asn Thr Gly Leu Ala Val Thr Asn
145 150 155 160
Pro Thr Ile Arg Cys Val Phe Ser Thr Ser Ala Thr Glu Tyr Glu Met
165 170 175
Gln Leu Leu Asp Gly Ser Thr Val Val Lys Phe Leu Lys Val Val Tyr
180 185 190
Gly Phe Asn Thr Gly Asp Thr Leu Val Ile Asp Cys His Glu Arg Ser
195 200 205
Val Thr Leu Asn Gly Gln Asp Ile Met Pro Ala Leu Leu Ile Gln Ser
210 215 220
Asp Trp Ile Gln Leu Lys Pro Gln Val Asn Thr Tyr Leu Lys Ala Thr
225 230 235 240
Gln Pro Ser Thr Ile Val Phe Thr Glu Lys Phe Leu
245 250
<210> 22
<211> 464
<212> PRT
<213> Escherichia phage T5
<400> 22
Met Ser Leu Gln Leu Leu Arg Asn Thr Arg Ile Phe Val Ser Thr Val
1 5 10 15
Lys Thr Gly His Asn Lys Thr Asn Thr Gln Glu Ile Leu Val Gln Asp
20 25 30
Asp Ile Ser Trp Gly Gln Asp Ser Asn Ser Thr Asp Ile Thr Val Asn
35 40 45
Glu Ala Gly Pro Arg Pro Thr Arg Gly Ser Lys Arg Phe Asn Asp Ser
50 55 60
Leu Asn Ala Ala Glu Trp Ser Phe Ser Thr Tyr Ile Leu Pro Tyr Lys
65 70 75 80
Asp Lys Asn Thr Ser Lys Gln Ile Val Pro Asp Tyr Met Leu Trp His
85 90 95
Ala Leu Ser Ser Gly Arg Ala Ile Asn Leu Glu Gly Thr Thr Gly Ala
100 105 110
His Asn Asn Ala Thr Asn Phe Met Val Asn Phe Lys Asp Asn Ser Tyr
115 120 125
His Glu Leu Ala Met Leu His Ile Tyr Ile Leu Thr Asp Lys Thr Trp
130 135 140
Ser Tyr Ile Asp Ser Cys Gln Ile Asn Gln Ala Glu Val Asn Val Asp
145 150 155 160
Ile Glu Asp Ile Gly Arg Val Thr Trp Ser Gly Asn Gly Asn Gln Leu
165 170 175
Ile Pro Leu Asp Glu Gln Pro Phe Asp Pro Asp Gln Ile Gly Ile Asp
180 185 190
Asp Glu Thr Tyr Met Thr Ile Gln Gly Ser Tyr Ile Lys Asn Lys Leu
195 200 205
Thr Ile Leu Lys Ile Lys Asp Met Asp Thr Asn Lys Ser Tyr Asp Ile
210 215 220
Pro Ile Thr Gly Gly Thr Phe Thr Ile Asn Asn Asn Ile Thr Tyr Leu
225 230 235 240
Thr Pro Asn Val Met Ser Arg Val Thr Ile Pro Ile Gly Ser Phe Thr
245 250 255
Gly Ala Phe Glu Leu Thr Gly Ser Leu Thr Ala Tyr Leu Asn Asp Lys
260 265 270
Ser Leu Gly Ser Met Glu Leu Tyr Lys Asp Leu Ile Lys Thr Leu Lys
275 280 285
Val Val Asn Arg Phe Glu Ile Ala Leu Val Leu Gly Gly Glu Tyr Asp
290 295 300
Asp Glu Arg Pro Ala Ala Ile Leu Val Ala Lys Gln Ala His Val Asn
305 310 315 320
Ile Pro Thr Ile Glu Thr Asp Asp Val Leu Gly Thr Ser Val Glu Phe
325 330 335
Lys Ala Ile Pro Ser Asp Leu Asp Ala Gly Asp Glu Gly Tyr Leu Gly
340 345 350
Phe Ser Ser Lys Tyr Thr Arg Thr Thr Ile Asn Asn Leu Ile Val Asn
355 360 365
Gly Asp Gly Ala Thr Asp Ala Val Thr Ala Ile Thr Val Lys Ser Ala
370 375 380
Gly Asn Val Thr Thr Leu Asn Arg Ser Ala Thr Leu Gln Met Ser Val
385 390 395 400
Glu Val Thr Pro Ser Ser Ala Arg Asn Lys Glu Val Thr Trp Ala Ile
405 410 415
Thr Ala Gly Asp Ala Ala Thr Ile Asn Ala Thr Gly Leu Leu Arg Ala
420 425 430
Asp Ala Ser Lys Thr Gly Ala Val Thr Val Glu Ala Thr Ala Lys Asp
435 440 445
Gly Ser Gly Val Lys Gly Thr Lys Val Ile Thr Val Thr Ala Gly Gly
450 455 460
<210> 23
<211> 118
<212> PRT
<213> Escherichia phage Mu
<400> 23
Met Ala Gly Asn Gln Arg Gln Gly Val Ala Phe Ile Arg Val Asn Gly
1 5 10 15
Met Glu Leu Glu Ser Met Glu Gly Ala Ser Phe Thr Pro Ser Gly Ile
20 25 30
Thr Arg Glu Glu Val Thr Gly Ser Arg Val Tyr Gly Trp Lys Gly Lys
35 40 45
Pro Arg Ala Ala Lys Val Glu Cys Lys Ile Pro Gly Gly Gly Pro Ile
50 55 60
Gly Leu Asp Glu Ile Ile Asp Trp Glu Asn Ile Thr Val Glu Phe Gln
65 70 75 80
Ala Asp Thr Gly Glu Thr Trp Met Leu Ala Asn Ala Trp Gln Ala Asp
85 90 95
Glu Pro Lys Asn Asp Gly Gly Glu Ile Ser Leu Val Leu Met Ala Lys
100 105 110
Gln Ser Lys Arg Ile Ala
115
<210> 24
<211> 246
<212> PRT
<213> Bacteriophage lambda
<400> 24
Met Pro Val Pro Asn Pro Thr Met Pro Val Lys Gly Ala Gly Thr Thr
1 5 10 15
Leu Trp Val Tyr Lys Gly Ser Gly Asp Pro Tyr Ala Asn Pro Leu Ser
20 25 30
Asp Val Asp Trp Ser Arg Leu Ala Lys Val Lys Asp Leu Thr Pro Gly
35 40 45
Glu Leu Thr Ala Glu Ser Tyr Asp Asp Ser Tyr Leu Asp Asp Glu Asp
50 55 60
Ala Asp Trp Thr Ala Thr Gly Gln Gly Gln Lys Ser Ala Gly Asp Thr
65 70 75 80
Ser Phe Thr Leu Ala Trp Met Pro Gly Glu Gln Gly Gln Gln Ala Leu
85 90 95
Leu Ala Trp Phe Asn Glu Gly Asp Thr Arg Ala Tyr Lys Ile Arg Phe
100 105 110
Pro Asn Gly Thr Val Asp Val Phe Arg Gly Trp Val Ser Ser Ile Gly
115 120 125
Lys Ala Val Thr Ala Lys Glu Val Ile Thr Arg Thr Val Lys Val Thr
130 135 140
Asn Val Gly Arg Pro Ser Met Ala Glu Asp Arg Ser Thr Val Thr Ala
145 150 155 160
Ala Thr Gly Met Thr Val Thr Pro Ala Ser Thr Ser Val Val Lys Gly
165 170 175
Gln Ser Thr Thr Leu Thr Val Ala Phe Gln Pro Glu Gly Val Thr Asp
180 185 190
Lys Ser Phe Arg Ala Val Ser Ala Asp Lys Thr Lys Ala Thr Val Ser
195 200 205
Val Ser Gly Met Thr Ile Thr Val Asn Gly Val Ala Ala Gly Lys Val
210 215 220
Asn Ile Pro Val Val Ser Gly Asn Gly Glu Phe Ala Ala Val Ala Glu
225 230 235 240
Ile Thr Val Thr Ala Ser
245
<210> 25
<211> 301
<212> PRT
<213> Lactococcus phage F4-1
<400> 25
Met Lys Leu Asp Tyr Asn Ser Arg Glu Ile Phe Phe Gly Asn Glu Ala
1 5 10 15
Leu Ile Val Ala Asp Met Thr Lys Gly Ser Asn Gly Lys Pro Glu Phe
20 25 30
Thr Asn His Lys Ile Val Thr Gly Leu Val Ser Val Gly Ser Met Glu
35 40 45
Asp Gln Ala Glu Thr Asn Ser Tyr Pro Ala Asp Asp Val Pro Asp His
50 55 60
Gly Val Lys Lys Gly Ala Thr Leu Leu Gln Gly Glu Met Val Phe Ile
65 70 75 80
Gln Thr Asp Gln Ala Leu Lys Glu Asp Met Leu Gly Gln Gln Arg Thr
85 90 95
Glu Asn Gly Leu Gly Trp Ser Pro Thr Gly Asn Trp Lys Thr Lys Cys
100 105 110
Val Gln Tyr Leu Ile Lys Gly Arg Lys Arg Asp Lys Val Thr Gly Glu
115 120 125
Phe Val Asp Gly Tyr Arg Val Val Val Tyr Pro His Leu Thr Pro Thr
130 135 140
Ala Glu Ala Thr Lys Glu Ser Glu Thr Asp Ser Val Asp Gly Val Asp
145 150 155 160
Pro Ile Gln Trp Thr Leu Ala Val Gln Ala Thr Glu Ser Asp Ile Tyr
165 170 175
Ser Asn Gly Gly Lys Lys Val Pro Ala Ile Glu Tyr Glu Ile Trp Gly
180 185 190
Glu Gln Ala Lys Asp Phe Ala Lys Lys Met Glu Ser Gly Leu Phe Ile
195 200 205
Met Gln Pro Asp Thr Val Leu Ala Gly Ala Ile Thr Leu Val Ala Pro
210 215 220
Val Ile Pro Asn Val Thr Thr Ala Thr Lys Gly Asn Asn Asp Gly Thr
225 230 235 240
Ile Val Val Pro Asp Thr Leu Lys Asp Ser Lys Gly Gly Thr Val Lys
245 250 255
Val Thr Ser Val Ile Lys Asp Ala His Gly Lys Val Ala Thr Asn Gly
260 265 270
Gln Leu Ala Pro Gly Val Tyr Ile Val Thr Phe Ser Ala Asp Gly Tyr
275 280 285
Glu Asp Val Thr Ala Gly Val Ser Val Thr Asp His Ser
290 295 300
<210> 26
<211> 172
<212> PRT
<213> Bacteriophage P2
<400> 26
Met Ala Met Pro Arg Lys Leu Lys Leu Met Asn Val Phe Leu Asn Gly
1 5 10 15
Tyr Ser Tyr Gln Gly Val Ala Lys Ser Val Thr Leu Pro Lys Leu Thr
20 25 30
Arg Lys Leu Glu Asn Tyr Arg Gly Ala Gly Met Asn Gly Ser Ala Pro
35 40 45
Val Asp Leu Gly Leu Asp Asp Asp Ala Leu Ser Met Glu Trp Ser Leu
50 55 60
Gly Gly Phe Pro Asp Ser Val Ile Trp Glu Leu Tyr Ala Ala Thr Gly
65 70 75 80
Val Asp Ala Val Pro Ile Arg Phe Ala Gly Ser Tyr Gln Arg Asp Asp
85 90 95
Thr Gly Glu Thr Val Ala Val Glu Val Val Met Arg Gly Arg Gln Lys
100 105 110
Glu Ile Asp Thr Gly Glu Gly Lys Gln Gly Glu Asp Thr Glu Ser Lys
115 120 125
Ile Ser Val Val Cys Thr Tyr Phe Arg Leu Thr Met Asp Gly Lys Glu
130 135 140
Leu Val Glu Ile Asp Thr Ile Asn Met Ile Glu Lys Val Asn Gly Val
145 150 155 160
Asp Arg Leu Glu Gln His Arg Arg Asn Ile Gly Leu
165 170
<210> 27
<211> 177
<212> PRT
<213> Serratia phage KSP90
<400> 27
Met Ala Thr Val Asn Glu Phe Arg Gly Ala Met Ser Arg Gly Gly Gly
1 5 10 15
Val Gln Arg Gln His Arg Trp Arg Val Thr Ile Ser Phe Pro Ser Phe
20 25 30
Ala Ala Ser Ala Asp Gln Thr Arg Asp Val Cys Leu Leu Ala Val Thr
35 40 45
Thr Asn Thr Pro Thr Gly Gln Leu Gly Glu Ile Leu Val Pro Trp Gly
50 55 60
Gly Arg Glu Leu Pro Phe Pro Gly Asp Arg Arg Phe Glu Ala Leu Pro
65 70 75 80
Ile Thr Phe Ile Asn Val Val Asn Asn Gly Pro Tyr Asn Ser Met Glu
85 90 95
Val Trp Gln Gln Tyr Ile Asn Gly Ser Glu Ser Asn Arg Ala Ser Ala
100 105 110
Asn Pro Asp Glu Tyr Phe Arg Asp Val Val Leu Glu Leu Leu Asp Ala
115 120 125
Asn Asp Asn Val Thr Lys Thr Trp Thr Leu Gln Gly Ala Trp Pro Gln
130 135 140
Asn Leu Gly Gln Leu Glu Leu Asp Met Ser Ala Met Asp Ser Tyr Thr
145 150 155 160
Gln Phe Thr Cys Asp Leu Arg Tyr Phe Gln Ala Val Ser Asp Arg Ser
165 170 175
Arg
<210> 28
<211> 196
<212> PRT
<213> Enterobacteria phage T7M
<400> 28
Met Arg Ser Tyr Glu Met Asn Ile Glu Thr Ala Glu Glu Leu Ser Ala
1 5 10 15
Val Asn Asp Ile Leu Ala Ser Ile Gly Glu Pro Pro Val Ser Thr Leu
20 25 30
Glu Gly Asp Ala Asn Ala Asp Val Ala Asn Ala Arg Arg Val Leu Asn
35 40 45
Lys Ile Asn Arg Gln Ile Gln Ser Arg Gly Trp Thr Phe Asn Ile Glu
50 55 60
Glu Gly Val Thr Leu Leu Pro Asp Ala Phe Ser Gly Met Ile Pro Phe
65 70 75 80
Ser Ser Asp Tyr Leu Ser Val Met Ala Thr Ser Gly Gln Thr Gln Tyr
85 90 95
Val Asn Arg Gly Gly Tyr Leu Tyr Asp Arg Ser Ala Lys Thr Asp Arg
100 105 110
Phe Pro Ser Gly Val Gln Val Asn Leu Ile Arg Leu Arg Glu Phe Asp
115 120 125
Glu Met Pro Glu Cys Phe Arg Asn Tyr Ile Val Thr Lys Ala Ser Arg
130 135 140
Gln Phe Asn Asn Arg Phe Phe Gly Ala Pro Glu Val Asp Gly Val Leu
145 150 155 160
Gln Glu Glu Glu Gln Glu Ala Trp Ser Ala Cys Phe Glu Tyr Glu Leu
165 170 175
Asp Tyr Gly Asn Tyr Asn Met Leu Asp Gly Asp Ala Phe Thr Ser Gly
180 185 190
Leu Leu Asn Arg
195
<210> 29
<211> 108
<212> PRT
<213> Bacteriophage HK97
<400> 29
Met Ala Ile Asp Val Leu Asp Val Ile Ser Leu Ser Leu Phe Lys Gln
1 5 10 15
Gln Ile Glu Phe Glu Glu Asp Asp Arg Asp Glu Leu Ile Thr Leu Tyr
20 25 30
Ala Gln Ala Ala Phe Asp Tyr Cys Met Arg Trp Cys Asp Glu Pro Ala
35 40 45
Trp Lys Val Ala Ala Asp Ile Pro Ala Ala Val Lys Gly Ala Val Leu
50 55 60
Leu Val Phe Ala Asp Met Phe Glu His Arg Thr Ala Gln Ser Glu Val
65 70 75 80
Gln Leu Tyr Glu Asn Ala Ala Ala Glu Arg Met Met Phe Ile His Arg
85 90 95
Asn Trp Arg Gly Lys Ala Glu Ser Glu Glu Gly Ser
100 105
<210> 30
<211> 309
<212> PRT
<213> Bacteriophage phi-29
<400> 30
Met Ala Arg Lys Arg Ser Asn Thr Tyr Arg Ser Ile Asn Glu Ile Gln
1 5 10 15
Arg Gln Lys Arg Asn Arg Trp Phe Ile His Tyr Leu Asn Tyr Leu Gln
20 25 30
Ser Leu Ala Tyr Gln Leu Phe Glu Trp Glu Asn Leu Pro Pro Thr Ile
35 40 45
Asn Pro Ser Phe Leu Glu Lys Ser Ile His Gln Phe Gly Tyr Val Gly
50 55 60
Phe Tyr Lys Asp Pro Val Ile Ser Tyr Ile Ala Cys Asn Gly Ala Leu
65 70 75 80
Ser Gly Gln Arg Asp Val Tyr Asn Gln Ala Thr Val Phe Arg Ala Ala
85 90 95
Ser Pro Val Tyr Gln Lys Glu Phe Lys Leu Tyr Asn Tyr Arg Asp Met
100 105 110
Lys Glu Glu Asp Met Gly Val Val Ile Tyr Asn Asn Asp Met Ala Phe
115 120 125
Pro Thr Thr Pro Thr Leu Glu Leu Phe Ala Ala Glu Leu Ala Glu Leu
130 135 140
Lys Glu Ile Ile Ser Val Asn Gln Asn Ala Gln Lys Thr Pro Val Leu
145 150 155 160
Ile Arg Ala Asn Asp Asn Asn Gln Leu Ser Leu Lys Gln Val Tyr Asn
165 170 175
Gln Tyr Glu Gly Asn Ala Pro Val Ile Phe Ala His Glu Ala Leu Asp
180 185 190
Ser Asp Ser Ile Glu Val Phe Lys Thr Asp Ala Pro Tyr Val Val Asp
195 200 205
Lys Leu Asn Ala Gln Lys Asn Ala Val Trp Asn Glu Met Met Thr Phe
210 215 220
Leu Gly Ile Lys Asn Ala Asn Leu Glu Lys Lys Glu Arg Met Val Thr
225 230 235 240
Asp Glu Val Ser Ser Asn Asp Glu Gln Ile Glu Ser Ser Gly Thr Val
245 250 255
Phe Leu Lys Ser Arg Glu Glu Ala Cys Glu Lys Ile Asn Glu Leu Tyr
260 265 270
Gly Leu Asn Val Lys Val Lys Phe Arg Tyr Asp Ile Val Glu Gln Met
275 280 285
Arg Arg Glu Leu Gln Gln Ile Glu Asn Val Ser Arg Gly Thr Ser Asp
290 295 300
Gly Glu Thr Asn Glu
305
<210> 31
<211> 315
<212> PRT
<213> Bacteriophage phi-29
<400> 31
Thr Tyr Arg Ser Ile Asn Glu Ile Gln Arg Gln Lys Arg Asn Arg Trp
1 5 10 15
Phe Ile His Tyr Leu Asn Tyr Leu Gln Ser Leu Ala Tyr Gln Leu Phe
20 25 30
Glu Trp Glu Asn Leu Pro Pro Thr Ile Asn Pro Ser Phe Leu Glu Lys
35 40 45
Ser Ile His Gln Phe Gly Tyr Val Gly Phe Tyr Lys Asp Pro Val Ile
50 55 60
Ser Tyr Ile Ala Cys Asn Gly Ala Leu Ser Gly Gln Arg Asp Val Tyr
65 70 75 80
Asn Gln Ala Thr Val Phe Arg Ala Ala Ser Pro Val Tyr Gln Lys Glu
85 90 95
Phe Lys Leu Tyr Asn Tyr Arg Asp Met Lys Glu Glu Asp Met Gly Val
100 105 110
Val Ile Tyr Asn Asn Asp Met Ala Phe Pro Thr Thr Pro Thr Leu Glu
115 120 125
Leu Phe Ala Ala Glu Leu Ala Glu Leu Lys Glu Ile Ile Ser Val Asn
130 135 140
Gln Asn Ala Gln Lys Thr Pro Val Leu Ile Arg Ala Asn Asp Asn Asn
145 150 155 160
Gln Leu Ser Leu Lys Gln Val Tyr Asn Gln Tyr Glu Gly Asn Ala Pro
165 170 175
Val Ile Phe Ala His Glu Ala Leu Asp Ser Asp Ser Ile Glu Val Phe
180 185 190
Lys Thr Asp Ala Pro Tyr Val Val Asp Lys Leu Asn Ala Gln Lys Asn
195 200 205
Ala Val Trp Asn Glu Met Met Thr Phe Leu Gly Ile Lys Asn Ala Asn
210 215 220
Leu Glu Lys Lys Glu Arg Met Val Thr Asp Glu Val Ser Ser Asn Asp
225 230 235 240
Glu Gln Ile Glu Ser Ser Gly Thr Val Phe Leu Lys Ser Arg Glu Glu
245 250 255
Ala Cys Glu Lys Ile Asn Glu Leu Tyr Gly Leu Asn Val Lys Val Lys
260 265 270
Phe Arg Tyr Asp Ile Val Glu Gln Met Arg Arg Glu Leu Gln Gln Ile
275 280 285
Glu Asn Val Ser Arg Gly Thr Ser Asp Gly Glu Thr Asn Glu Ala His
290 295 300
Ile Val Met Val Asp Ala Tyr Lys Pro Thr Lys
305 310 315
<210> 32
<211> 315
<212> PRT
<213> Bacteriophage phi-29
<400> 32
Thr Tyr Leu Ser Ile Asn Val Ile Gln Leu Gln Lys Arg Asn Arg Trp
1 5 10 15
Phe Ile His Tyr Leu Asn Tyr Leu Gln Ser Leu Ala Tyr Gln Leu Phe
20 25 30
Glu Trp Glu Asn Leu Pro Pro Thr Ile Asn Pro Ser Phe Leu Glu Lys
35 40 45
Ser Ile His Gln Phe Gly Tyr Val Gly Phe Tyr Lys Asp Pro Val Ile
50 55 60
Ser Tyr Ile Ala Cys Asn Gly Ala Leu Ser Gly Gln Arg Asp Val Tyr
65 70 75 80
Asn Gln Ala Thr Val Phe Arg Ala Ala Ser Pro Val Tyr Gln Lys Glu
85 90 95
Phe Lys Leu Tyr Asn Tyr Arg Asp Met Lys Glu Glu Asp Met Gly Val
100 105 110
Val Ile Tyr Asn Asn Asp Met Ala Phe Pro Thr Thr Pro Thr Leu Glu
115 120 125
Leu Phe Ala Ala Glu Leu Ala Glu Leu Lys Glu Ile Ile Ser Val Asn
130 135 140
Gln Asn Ala Gln Lys Thr Pro Val Leu Ile Arg Ala Asn Asp Asn Asn
145 150 155 160
Gln Leu Ser Leu Lys Gln Val Tyr Asn Gln Tyr Glu Gly Asn Ala Pro
165 170 175
Val Ile Phe Ala His Glu Ala Leu Asp Ser Asp Ser Ile Glu Val Phe
180 185 190
Lys Thr Asp Ala Pro Tyr Val Val Asp Lys Leu Asn Ala Gln Lys Asn
195 200 205
Ala Val Trp Asn Glu Met Met Thr Phe Leu Gly Ile Lys Asn Ala Asn
210 215 220
Leu Glu Lys Lys Glu Arg Met Val Thr Asp Glu Val Ser Ser Asn Asp
225 230 235 240
Glu Gln Ile Glu Ser Ser Gly Thr Val Phe Leu Lys Ser Arg Glu Glu
245 250 255
Ala Cys Glu Lys Ile Asn Glu Leu Tyr Gly Leu Asn Val Lys Val Lys
260 265 270
Phe Arg Tyr Asp Ile Val Glu Gln Met Arg Arg Glu Leu Gln Gln Ile
275 280 285
Glu Asn Val Ser Arg Gly Thr Ser Asp Gly Glu Thr Asn Glu Ala His
290 295 300
Ile Val Met Val Asp Ala Tyr Lys Pro Thr Lys
305 310 315
<210> 33
<211> 317
<212> PRT
<213> Bacteriophage phi-29
<400> 33
Ile Leu Thr Tyr Leu Ser Ile Asn Val Ile Gln Leu Gln Lys Arg Asn
1 5 10 15
Arg Trp Phe Ile His Tyr Leu Asn Tyr Leu Gln Ser Leu Ala Tyr Gln
20 25 30
Leu Phe Glu Trp Glu Asn Leu Pro Pro Thr Ile Asn Pro Ser Phe Leu
35 40 45
Glu Lys Ser Ile His Gln Phe Gly Tyr Val Gly Phe Tyr Lys Asp Pro
50 55 60
Val Ile Ser Tyr Ile Ala Cys Asn Gly Ala Leu Ser Gly Gln Arg Asp
65 70 75 80
Val Tyr Asn Gln Ala Thr Val Phe Arg Ala Ala Ser Pro Val Tyr Gln
85 90 95
Lys Glu Phe Lys Leu Tyr Asn Tyr Arg Asp Met Lys Glu Glu Asp Met
100 105 110
Gly Val Val Ile Tyr Asn Asn Asp Met Ala Phe Pro Thr Thr Pro Thr
115 120 125
Leu Glu Leu Phe Ala Ala Glu Leu Ala Glu Leu Lys Glu Ile Ile Ser
130 135 140
Val Asn Gln Asn Ala Gln Lys Thr Pro Val Leu Ile Arg Ala Asn Asp
145 150 155 160
Asn Asn Gln Leu Ser Leu Lys Gln Val Tyr Asn Gln Tyr Glu Gly Asn
165 170 175
Ala Pro Val Ile Phe Ala His Glu Ala Leu Asp Ser Asp Ser Ile Glu
180 185 190
Val Phe Lys Thr Asp Ala Pro Tyr Val Val Asp Lys Leu Asn Ala Gln
195 200 205
Lys Asn Ala Val Trp Asn Glu Met Met Thr Phe Leu Gly Ile Lys Asn
210 215 220
Ala Asn Leu Glu Lys Lys Glu Arg Met Val Thr Asp Glu Val Ser Ser
225 230 235 240
Asn Asp Glu Gln Ile Glu Ser Ser Gly Thr Val Phe Leu Lys Ser Arg
245 250 255
Glu Glu Ala Cys Glu Lys Ile Asn Glu Leu Tyr Gly Leu Asn Val Lys
260 265 270
Val Lys Phe Arg Tyr Asp Ile Val Glu Gln Met Arg Arg Glu Leu Gln
275 280 285
Gln Ile Glu Asn Val Ser Arg Gly Thr Ser Asp Gly Glu Thr Asn Glu
290 295 300
Ala His Ile Val Met Val Asp Ala Tyr Lys Pro Thr Lys
305 310 315
<210> 34
<211> 321
<212> PRT
<213> Bacteriophage phi-29
<400> 34
Ile Leu Val Ala Ile Leu Thr Tyr Leu Ser Ile Asn Val Ile Gln Leu
1 5 10 15
Gln Lys Arg Asn Arg Trp Phe Ile His Tyr Leu Asn Tyr Leu Gln Ser
20 25 30
Leu Ala Tyr Gln Leu Phe Glu Trp Glu Asn Leu Pro Pro Thr Ile Asn
35 40 45
Pro Ser Phe Leu Glu Lys Ser Ile His Gln Phe Gly Tyr Val Gly Phe
50 55 60
Tyr Lys Asp Pro Val Ile Ser Tyr Ile Ala Cys Asn Gly Ala Leu Ser
65 70 75 80
Gly Gln Arg Asp Val Tyr Asn Gln Ala Thr Val Phe Arg Ala Ala Ser
85 90 95
Pro Val Tyr Gln Lys Glu Phe Lys Leu Tyr Asn Tyr Arg Asp Met Lys
100 105 110
Glu Glu Asp Met Gly Val Val Ile Tyr Asn Asn Asp Met Ala Phe Pro
115 120 125
Thr Thr Pro Thr Leu Glu Leu Phe Ala Ala Glu Leu Ala Glu Leu Lys
130 135 140
Glu Ile Ile Ser Val Asn Gln Asn Ala Gln Lys Thr Pro Val Leu Ile
145 150 155 160
Arg Ala Asn Asp Asn Asn Gln Leu Ser Leu Lys Gln Val Tyr Asn Gln
165 170 175
Tyr Glu Gly Asn Ala Pro Val Ile Phe Ala His Glu Ala Leu Asp Ser
180 185 190
Asp Ser Ile Glu Val Phe Lys Thr Asp Ala Pro Tyr Val Val Asp Lys
195 200 205
Leu Asn Ala Gln Lys Asn Ala Val Trp Asn Glu Met Met Thr Phe Leu
210 215 220
Gly Ile Lys Asn Ala Asn Leu Glu Lys Lys Glu Arg Met Val Thr Asp
225 230 235 240
Glu Val Ser Ser Asn Asp Glu Gln Ile Glu Ser Ser Gly Thr Val Phe
245 250 255
Leu Lys Ser Arg Glu Glu Ala Cys Glu Lys Ile Asn Glu Leu Tyr Gly
260 265 270
Leu Asn Val Lys Val Lys Phe Arg Tyr Asp Ile Val Glu Gln Met Arg
275 280 285
Arg Glu Leu Gln Gln Ile Glu Asn Val Ser Arg Gly Thr Ser Asp Gly
290 295 300
Glu Thr Asn Glu Ala His Ile Val Met Val Asp Ala Tyr Lys Pro Thr
305 310 315 320
Lys
<210> 35
<211> 309
<212> PRT
<213> Bacteriophage phi-29
<400> 35
Met Ala Arg Lys Arg Ser Asn Thr Tyr Arg Ser Ile Asn Glu Ile Gln
1 5 10 15
Arg Gln Lys Arg Asn Arg Trp Phe Ile His Tyr Leu Asn Tyr Leu Gln
20 25 30
Ser Leu Ala Tyr Gln Leu Phe Glu Trp Glu Asn Leu Pro Pro Thr Ile
35 40 45
Asn Pro Ser Phe Leu Glu Lys Ser Ile His Gln Phe Gly Tyr Val Gly
50 55 60
Phe Tyr Lys Asp Pro Val Ile Ser Tyr Ile Ala Cys Asn Gly Cys Leu
65 70 75 80
Ser Gly Gln Arg Asp Val Tyr Asn Gln Ala Thr Val Phe Arg Ala Ala
85 90 95
Ser Pro Val Tyr Gln Lys Glu Phe Lys Leu Tyr Asn Tyr Arg Asp Met
100 105 110
Lys Glu Glu Asp Met Gly Val Val Ile Tyr Asn Asn Asp Met Ala Phe
115 120 125
Pro Thr Thr Pro Thr Leu Cys Leu Phe Ala Ala Glu Leu Ala Glu Leu
130 135 140
Lys Glu Ile Ile Ser Val Asn Gln Asn Ala Gln Lys Thr Pro Val Leu
145 150 155 160
Ile Arg Ala Asn Asp Asn Asn Cys Leu Ser Leu Lys Gln Val Tyr Asn
165 170 175
Gln Tyr Glu Gly Asn Ala Pro Val Ile Phe Ala His Glu Ala Leu Asp
180 185 190
Ser Asp Ser Ile Glu Val Phe Lys Thr Asp Ala Pro Tyr Val Val Asp
195 200 205
Lys Leu Asn Ala Gln Lys Asn Ala Val Trp Asn Glu Met Met Thr Phe
210 215 220
Leu Gly Ile Lys Asn Ala Asn Leu Glu Lys Lys Glu Arg Met Val Thr
225 230 235 240
Asp Glu Val Ser Ser Asn Asp Glu Gln Ile Glu Ser Ser Gly Thr Val
245 250 255
Phe Leu Lys Ser Arg Glu Glu Ala Cys Glu Lys Ile Asn Glu Leu Tyr
260 265 270
Gly Leu Asn Val Lys Val Lys Phe Arg Tyr Asp Ile Val Glu Gln Met
275 280 285
Arg Arg Glu Leu Gln Gln Ile Glu Asn Val Ser Arg Gly Thr Ser Asp
290 295 300
Gly Glu Thr Asn Glu
305
<210> 36
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 36
Ala His Ile Val Met Val Asp Ala Tyr Lys Pro Thr Lys
1 5 10
<210> 37
<211> 129
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 37
Asp Tyr Asp Ile Pro Thr Thr Glu Asn Leu Tyr Phe Gln Gly Ala Met
1 5 10 15
Val Asp Thr Leu Ser Gly Leu Ser Ser Glu Gln Gly Gln Ser Gly Asp
20 25 30
Met Thr Ile Glu Glu Asp Ser Ala Thr His Ile Lys Phe Ser Lys Arg
35 40 45
Asp Glu Asp Gly Lys Glu Leu Ala Gly Ala Thr Met Glu Leu Arg Asp
50 55 60
Ser Ser Gly Lys Thr Ile Ser Thr Trp Ile Ser Asp Gly Gln Val Lys
65 70 75 80
Asp Phe Tyr Leu Tyr Pro Gly Lys Tyr Thr Phe Val Glu Thr Ala Ala
85 90 95
Pro Asp Gly Tyr Glu Val Ala Thr Ala Ile Thr Phe Thr Val Asn Glu
100 105 110
Gln Gly Gln Val Thr Val Asn Gly Lys Ala Thr Lys Gly Asp Ala His
115 120 125
Ile
<210> 38
<211> 249
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 38
Met Ala Leu Thr Gln Pro Ser Ser Val Ser Ala Asn Pro Gly Glu Thr
1 5 10 15
Val Lys Ile Thr Cys Ser Gly Ser Ser Gly Ser Tyr Gly Trp Tyr Gln
20 25 30
Gln Lys Ser Pro Asp Ser Ala Pro Val Thr Val Ile Tyr Gln Ser Asn
35 40 45
Gln Arg Pro Ser Asp Ile Pro Ser Arg Phe Ser Gly Ser Lys Ser Gly
50 55 60
Ser Thr Gly Thr Leu Thr Ile Thr Gly Val Gln Ala Glu Asp Glu Ala
65 70 75 80
Val Tyr Tyr Cys Gly Gly Trp Gly Ser Ser Val Gly Met Phe Gly Ala
85 90 95
Gly Thr Thr Leu Thr Val Leu Gly Gln Ser Ser Arg Ser Ser Gly Gly
100 105 110
Gly Gly Ser Ser Gly Gly Gly Gly Ser Ala Val Thr Leu Asp Glu Ser
115 120 125
Gly Gly Gly Leu Gln Thr Pro Gly Gly Ala Leu Ser Leu Val Cys Lys
130 135 140
Ala Ser Gly Phe Thr Phe Ser Ser Tyr Ala Met Gly Trp Val Arg Gln
145 150 155 160
Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Gly Ile Ser Asp Asp Gly
165 170 175
Asp Ser Tyr Ile Ser Tyr Ala Thr Ala Val Lys Gly Arg Ala Thr Ile
180 185 190
Ser Arg Asp Asn Gly Gln Ser Thr Val Arg Leu Gln Leu Asn Asn Leu
195 200 205
Arg Ala Glu Asp Thr Ala Thr Tyr Tyr Cys Ala Arg Ser His Cys Ser
210 215 220
Gly Cys Arg Asn Ala Ala Leu Ile Asp Ala Trp Gly His Gly Thr Glu
225 230 235 240
Val Ile Val Ser Ser Met Ser Tyr Tyr
245
<210> 39
<211> 7
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 39
Val His Ser Pro Asn Lys Lys
1 5
<210> 40
<211> 241
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 40
Glu Val His Leu Gln Gln Ser Leu Ala Glu Leu Val Arg Ser Gly Ala
1 5 10 15
Ser Val Lys Leu Ser Cys Thr Ala Ser Gly Phe Asn Ile Lys His Tyr
20 25 30
Tyr Met His Trp Val Lys Gln Arg Pro Glu Gln Gly Leu Glu Trp Ile
35 40 45
Gly Trp Ile Asn Pro Glu Asn Val Asp Thr Glu Tyr Ala Pro Lys Phe
50 55 60
Gln Gly Lys Ala Thr Met Thr Ala Asp Thr Ser Ser Asn Thr Ala Tyr
65 70 75 80
Leu Gln Leu Ser Ser Leu Thr Ser Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Asn His Tyr Arg Tyr Ala Val Gly Gly Ala Leu Asp Tyr Trp Gly Gln
100 105 110
Gly Thr Thr Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly
115 120 125
Gly Ser Gly Gly Gly Gly Ser Asp Ile Glu Leu Thr Gln Ser Pro Ala
130 135 140
Ile Met Ser Ala Ser Pro Gly Glu Lys Val Thr Met Thr Cys Ser Ala
145 150 155 160
Ser Ser Ser Val Ser Tyr Ile His Trp Tyr Gln Gln Lys Ser Gly Thr
165 170 175
Ser Pro Lys Arg Trp Val Tyr Asp Thr Ser Lys Leu Ala Ser Gly Val
180 185 190
Pro Ala Arg Phe Ser Gly Ser Gly Ser Gly Thr Ser Tyr Ser Leu Thr
195 200 205
Ile Ser Thr Met Glu Ala Glu Val Ala Ala Thr Tyr Tyr Cys Gln Gln
210 215 220
Trp Asn Asn Asn Pro Tyr Thr Phe Gly Gly Gly Thr Lys Leu Glu Ile
225 230 235 240
Lys
<210> 41
<211> 116
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 41
Val Lys Leu Gln Glu Ser Gly Pro Gly Leu Val Ala Pro Ser Gln Ser
1 5 10 15
Leu Ser Met Ser Cys Thr Val Ser Gly Phe Ser Leu Ser Ser Tyr Gly
20 25 30
Val His Trp Val Arg Gln Pro Pro Gly Lys Gly Leu Glu Trp Leu Gly
35 40 45
Val Ile Trp Ala Gly Gly Thr Thr Asn Tyr Asn Ser Ala Leu Met Ser
50 55 60
Arg Leu Ser Ile Ser Lys Asp Asn Ser Lys Ser Gln Val Leu Leu Lys
65 70 75 80
Met Asn Ser Leu Gln Thr Asp Asp Thr Ala Met Tyr Tyr Cys Ala Thr
85 90 95
Thr Thr Met Ile Thr Leu Met Asp Tyr Trp Gly Gln Gly Thr Thr Val
100 105 110
Thr Val Ser Ser
115
<210> 42
<211> 106
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 42
Asp Val Gln Leu Asn Gln Ala Lys Ser Ser Leu Ser Ala Ser Leu Gly
1 5 10 15
Asp Arg Val Thr Ile Ser Cys Arg Ala Ser Gln Asp Ile Ser Asn Tyr
20 25 30
Leu Asn Trp Tyr Gln Gln Lys Pro Asp Gly Thr Val Lys Leu Leu Ile
35 40 45
Tyr Tyr Thr Ser Arg Leu His Ser Gly Val Pro Pro Arg Phe Ser Gly
50 55 60
Ser Gly Ser Gly Thr Asp Tyr Ser Leu Thr Ile Ser Asn Leu Glu Gln
65 70 75 80
Glu Asp Ile Ala Thr Tyr Phe Cys Gln Gln Gly Asn Thr Val Pro Trp
85 90 95
Thr Phe Gly Gly Gly Thr Lys Leu Glu Ile
100 105
<210> 43
<211> 6
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 43
Met Ala Arg Ser Gly Leu
1 5
<210> 44
<211> 6
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 44
Met Ala Arg Ala Lys Glu
1 5
<210> 45
<211> 6
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 45
Met Ser Arg Thr Met Ser
1 5
<210> 46
<211> 6
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 46
Lys Cys Cys Tyr Ser Leu
1 5
<210> 47
<211> 8
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 47
Trp Ile Phe Pro Trp Ile Gln Leu
1 5
<210> 48
<211> 12
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 48
Trp Asp Leu Ala Trp Met Phe Arg Leu Pro Val Gly
1 5 10
<210> 49
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 49
Cys Thr Val Ala Leu Pro Gly Gly Tyr Val Arg Val Cys
1 5 10
<210> 50
<211> 12
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 50
Ile Pro Leu Val Val Pro Leu Gly Gly Ser Cys Lys
1 5 10
<210> 51
<211> 7
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 51
Lys Thr Leu Leu Pro Thr Pro
1 5
<210> 52
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 52
Cys Val Ala Tyr Cys Ile Glu His His Cys Trp Thr Cys
1 5 10
<210> 53
<211> 12
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 53
Cys Val Phe Ala His Asn Tyr Asp Tyr Leu Val Cys
1 5 10
<210> 54
<211> 10
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 54
Cys Val Phe Thr Ser Asn Tyr Ala Phe Cys
1 5 10
<210> 55
<211> 5
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 55
Ser Gly Arg Ser Ala
1 5
<210> 56
<211> 4
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 56
Trp Gly Phe Pro
1
<210> 57
<211> 7
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<220>
<221> misc_feature
<222> (1)..(1)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (3)..(4)
<223> Xaa can be any naturally occurring amino acid
<400> 57
Xaa Phe Xaa Xaa Tyr Leu Trp
1 5
<210> 58
<211> 17
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 58
Ala Glu Pro Met Pro His Ser Leu Asn Phe Ser Gln Tyr Leu Trp Tyr
1 5 10 15
Thr
<210> 59
<211> 7
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 59
Phe Ser Arg Tyr Leu Trp Ser
1 5
<210> 60
<211> 7
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 60
Ile Glu Leu Leu Gln Ala Arg
1 5
<210> 61
<211> 12
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 61
Asp Ile Thr Trp Asp Gln Leu Trp Asp Leu Met Lys
1 5 10
<210> 62
<211> 16
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 62
Ala Tyr Thr Lys Cys Ser Arg Gln Trp Arg Thr Cys Met Thr Thr His
1 5 10 15
<210> 63
<211> 15
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 63
Pro Gln Asn Ser Lys Ile Pro Gly Pro Thr Phe Leu Asp Pro His
1 5 10 15
<210> 64
<211> 15
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 64
Ser Met Glu Pro Ala Leu Pro Asp Trp Trp Trp Lys Met Phe Lys
1 5 10 15
<210> 65
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 65
Ala Asn Thr Pro Cys Gly Pro Tyr Thr His Asp Cys Pro
1 5 10
<210> 66
<211> 7
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 66
Phe Gln His Pro Ser Phe Ile
1 5
<210> 67
<211> 9
<212> PRT
<213> Artificial
<220>
<223> Synthetic
<400> 67
Cys Val Pro Glu Leu Gly His Glu Cys
1 5
<210> 68
<211> 24
<212> RNA
<213> Artificial
<220>
<223> Synthetic
<400> 68
aaccccuauc acgauuagca uuaa 24
<210> 69
<211> 24
<212> RNA
<213> Artificial
<220>
<223> Synthetic
<400> 69
agucaacauc agucugauaa gcua 24
<210> 70
<211> 22
<212> RNA
<213> Artificial
<220>
<223> Synthetic
<400> 70
acaguucuuc aacuggcagc uu 22
<210> 71
<211> 24
<212> RNA
<213> Artificial
<220>
<223> Synthetic
<400> 71
aacaacaaaa ucacuagucu ucca 24
<210> 72
<211> 22
<212> RNA
<213> Artificial
<220>
<223> Synthetic
<400> 72
acaggccggg acaagugcaa ua 22
<210> 73
<211> 22
<212> RNA
<213> Artificial
<220>
<223> Synthetic
<400> 73
caaacaccau ugucacacuc ca 22
<210> 74
<211> 21
<212> RNA
<213> Artificial
<220>
<223> Synthetic
<400> 74
ggcugucaau ucauagguca g 21
<210> 75
<211> 22
<212> RNA
<213> Artificial
<220>
<223> Synthetic
<400> 75
ugggguauuu gacaaacuga ca 22
<210> 76
<211> 22
<212> RNA
<213> Artificial
<220>
<223> Synthetic
<400> 76
agccuauccu ggauuacuug aa 22
<210> 77
<211> 21
<212> RNA
<213> Artificial
<220>
<223> Synthetic
<400> 77
gcggaacuua gccacuguga a 21
<210> 78
<211> 23
<212> RNA
<213> Artificial
<220>
<223> Synthetic
<400> 78
acaaggauga aucuuuguua cug 23
<210> 79
<211> 1155
<212> DNA
<213> Artificial
<220>
<223> Synthetic
<400> 79
atggcgttaa cccaacctag cagcgttagc gcgaatcctg gcgaaaccgt gaaaattacc 60
tgcagcggca gcagcggtag ctatggctgg tatcagcaga aaagcccgga ttcagcgcct 120
gtgaccgtga tttatcagag caaccagcgc ccgagcgata ttcctagccg ctttagcggc 180
agcaaaagcg gtagcaccgg caccttaacc attaccggtg tgcaggcgga agatgaagcg 240
gtgtattatt gcggcggttg gggttcaagc gttggcatgt ttggtgcggg taccacctta 300
accgtgttag gtcagagcag ccgttcaagc ggtggcggtg gtagcagcgg tggtggtggt 360
agcgcagtta ccctggatga aagcggtggc ggcttacaaa ctcctggtgg tgcgctgagc 420
ttagtttgta aagcgagcgg ctttaccttt agcagctatg cgatgggttg ggtgcgtcag 480
gcgcctggta aaggcttaga atgggtggcg ggcattagcg atgatggcga tagctatatt 540
agctatgcga ccgcggttaa aggtcgtgcg accattagcc gtgataacgg ccagagcacc 600
gttcgtctgc agctgaataa cctgcgcgcg gaagataccg cgacctatta ttgcgcgcgc 660
agccattgta gcggttgtcg taacgcggcg ctgattgatg catggggcca tggcaccgaa 720
gtgattgtga gcagcatgtc gtactaccat caccatcacc atcacgatta cgacatccca 780
acgaccgaaa acctgtattt tcagggcgcc atggttgata ccttatcagg tttatcaagt 840
gagcaaggtc agtccggtga tatgacaatt gaagaagata gtgctaccca tattaaattc 900
tcaaaacgtg atgaggacgg caaagagtta gctggtgcaa ctatggagtt gcgtgattca 960
tctggtaaaa ctattagtac atggatttca gatggacaag tgaaagattt ctacctgtat 1020
ccaggaaaat atacatttgt cgaaaccgca gcaccagacg gttatgaggt agcaactgct 1080
attaccttta cagttaatga gcaaggtcag gttactgtaa atggcaaagc aactaaaggt 1140
gacgctcata tttaa 1155
<210> 80
<211> 2631
<212> DNA
<213> Bacteriophage P22
<400> 80
atgaatcata aacatcatca tcatcatcac agcagcggcg aaaacctgta ttttcagggc 60
catatgggat ccgccgacaa tgaaaacagg ctggagagca tcctgtcgcg ctttgatgcg 120
gactggacag ccagtgatga agccagacga gaggcaaaga atgatctctt cttctcccgc 180
gtatctcagt gggatgactg gctatcacaa tacacaaccc tgcagtatcg cgggcagttc 240
gatgttgtac gtccagtggt gcgcaagctc gtttctgaga tgcgtcagaa ccctattgat 300
gttctgtatc gtccaaagga cggagcaaga cctgatgccg ctgatgtgct tatgggtatg 360
tatcgcacag acatgcggca taacacggct aaaatcgcgg ttaacatcgc tgttcgtgag 420
cagattgaag ctggagttgg tgcgtggcgt ctggtcactg actacgaaga ccaaagtccg 480
acgagcaaca atcaggttat ccgtcgagag cctatccata gtgcctgctc ccatgttatc 540
tgggacagca acagcaaact gatggataag tctgacgccc gtcactgcac agttatccac 600
tcaatgagcc agaatggttg ggaggatttc gcagaaaaat acgacctcga tgcggatgat 660
attccatcat tccagaaccc caacgattgg gtatttccat ggctgacgca ggacacaatt 720
cagatcgctg agttttacga agtggtcgag aagaaagaga cggcgtttat ctaccaagac 780
ccggttacgg gtgagccggt aagctacttt aagcgcgata ttaaagacgt catcgatgac 840
ctggctgata gtggatttat caaaattgca gagcgccaga ttaagcgtcg ccgggtatac 900
aaatcgatta tcacctgcac tgctgtactc aaagacaagc agctcattgc tggcgagcat 960
atccccattg ttccggtgtt cggagagtgg ggcttcgttg aagataaaga agtgtatgag 1020
ggtgtcgtcc gcctgacaaa agacggccag cgtctgcgca acatgattat gtcgttcaac 1080
gccgacatcg tggcccgcac tccgaagaag aagccgttct tctggcctga gcagattgca 1140
ggctttgagc atatgtacga cggtaacgac gattacccat actacctgct caatcgcact 1200
gacgaaaata gtggagacct tccgactcag ccgctggcat attatgaaaa cccggaagtg 1260
ccgcaagcca acgcctacat gctggaagca gcaaccagcg cagtaaaaga ggttgccact 1320
ctcggagttg atacagaagc ggtaaatggc ggacaggttg cgtttgatac cgtcaatcaa 1380
ctgaatatga gggctgacct tgagacatac gtgtttcagg ataatctggc taccgccatg 1440
cgccgtgacg gagagattta ccagtcgata gttaatgaca tctacgatgt tcctcgcaac 1500
gttacgatta cccttgagga tggcagcgag aaagatgttc agctaatggc tgaggttgtt 1560
gaccttgcta ctggagaaaa gcaggtacta aacgatatca gggggcgcta tgagtgctac 1620
acggatgttg gaccatcatt ccagtccatg aagcagcaaa accgcgcaga aattcttgag 1680
ttgctcggca agacgccaca gggaacgcca gaatatcaac tgctgttgct tcagtacttc 1740
accctgcttg atggtaaagg tgttgagatg atgcgtgact atgccaacaa gcagcttatt 1800
cagatgggcg ttaagaagcc agaaacgccc gaagagcagc aatggttagt agaggcgcaa 1860
caagccaaac aaggtggagg aggaggagga aagcttgcgt taacccaacc tagcagcgtt 1920
agcgcgaatc ctggcgaaac cgtgaaaatt acctgcagcg gcagcagcgg tagctatggc 1980
tggtatcagc agaaaagccc ggattcagcg cctgtgaccg tgatttatca gagcaaccag 2040
cgcccgagcg atattcctag ccgctttagc ggcagcaaaa gcggtagcac cggcacctta 2100
accattaccg gtgtgcaggc ggaagatgaa gcggtgtatt attgcggcgg ttggggttca 2160
agcgttggca tgtttggtgc gggtaccacc ttaaccgtgt taggtcagag cagccgttca 2220
agcggtggcg gtggtagcag cggtggtggt ggtagcgcag ttaccctgga tgaaagcggt 2280
ggcggcttac aaactcctgg tggtgcgctg agcttagttt gtaaagcgag cggctttacc 2340
tttagcagct atgcgatggg ttgggtgcgt caggcgcctg gtaaaggctt agaatgggtg 2400
gcgggcatta gcgatgatgg cgatagctat attagctatg cgaccgcggt taaaggtcgt 2460
gcgaccatta gccgtgataa cggccagagc accgttcgtc tgcagctgaa taacctgcgc 2520
gcggaagata ccgcgaccta ttattgcgcg cgcagccatt gtagcggttg tcgtaacgcg 2580
gcgctgattg atgcatgggg ccatggcacc gaagtgattg tgagcagcta a 2631
<210> 81
<211> 1905
<212> DNA
<213> Bacteriophage P22
<400> 81
atgaatcata aacatcatca tcatcatcac agcagcggcg aaaacctgta ttttcagggc 60
catatgggat ccgccgacaa tgaaaacagg ctggagagca tcctgtcgcg ctttgatgcg 120
gactggacag ccagtgatga agccagacga gaggcaaaga atgatctctt cttctcccgc 180
gtatctcagt gggatgactg gctatcacaa tacacaaccc tgcagtatcg cgggcagttc 240
gatgttgtac gtccagtggt gcgcaagctc gtttctgaga tgcgtcagaa ccctattgat 300
gttctgtatc gtccaaagga cggagcaaga cctgatgccg ctgatgtgct tatgggtatg 360
tatcgcacag acatgcggca taacacggct aaaatcgcgg ttaacatcgc tgttcgtgag 420
cagattgaag ctggagttgg tgcgtggcgt ctggtcactg actacgaaga ccaaagtccg 480
acgagcaaca atcaggttat ccgtcgagag cctatccata gtgcctgctc ccatgttatc 540
tgggacagca acagcaaact gatggataag tctgacgccc gtcactgcac agttatccac 600
tcaatgagcc agaatggttg ggaggatttc gcagaaaaat acgacctcga tgcggatgat 660
attccatcat tccagaaccc caacgattgg gtatttccat ggctgacgca ggacacaatt 720
cagatcgctg agttttacga agtggtcgag aagaaagaga cggcgtttat ctaccaagac 780
ccggttacgg gtgagccggt aagctacttt aagcgcgata ttaaagacgt catcgatgac 840
ctggctgata gtggatttat caaaattgca gagcgccaga ttaagcgtcg ccgggtatac 900
aaatcgatta tcacctgcac tgctgtactc aaagacaagc agctcattgc tggcgagcat 960
atccccattg ttccggtgtt cggagagtgg ggcttcgttg aagataaaga agtgtatgag 1020
ggtgtcgtcc gcctgacaaa agacggccag cgtctgcgca acatgattat gtcgttcaac 1080
gccgacatcg tggcccgcac tccgaagaag aagccgttct tctggcctga gcagattgca 1140
ggctttgagc atatgtacga cggtaacgac gattacccat actacctgct caatcgcact 1200
gacgaaaata gtggagacct tccgactcag ccgctggcat attatgaaaa cccggaagtg 1260
ccgcaagcca acgcctacat gctggaagca gcaaccagcg cagtaaaaga ggttgccact 1320
ctcggagttg atacagaagc ggtaaatggc ggacaggttg cgtttgatac cgtcaatcaa 1380
ctgaatatga gggctgacct tgagacatac gtgtttcagg ataatctggc taccgccatg 1440
cgccgtgacg gagagattta ccagtcgata gttaatgaca tctacgatgt tcctcgcaac 1500
gttacgatta cccttgagga tggcagcgag aaagatgttc agctaatggc tgaggttgtt 1560
gaccttgcta ctggagaaaa gcaggtacta aacgatatca gggggcgcta tgagtgctac 1620
acggatgttg gaccatcatt ccagtccatg aagcagcaaa accgcgcaga aattcttgag 1680
ttgctcggca agacgccaca gggaacgcca gaatatcaac tgctgttgct tcagtacttc 1740
accctgcttg atggtaaagg tgttgagatg atgcgtgact atgccaacaa gcagcttatt 1800
cagatgggcg ttaagaagcc agaaacgccc gaagagcagc aatggttagt agaggcgcaa 1860
caagccaaac aaggtggtgg cgtacattcg cctaacaaga agtaa 1905
<210> 82
<211> 2274
<212> DNA
<213> Bacteriophage P22
<400> 82
atgaatcata aacatcatca tcatcatcac agcagcggcg aaaacctgta ttttcagggc 60
catatgggat ccgccgacaa tgaaaacagg ctggagagca tcctgtcgcg ctttgatgcg 120
gactggacag ccagtgatga agccagacga gaggcaaaga atgatctctt cttctcccgc 180
gtatctcagt gggatgactg gctatcacaa tacacaaccc tgcagtatcg cgggcagttc 240
gatgttgtac gtccagtggt gcgcaagctc gtttctgaga tgcgtcagaa ccctattgat 300
gttctgtatc gtccaaagga cggagcaaga cctgatgccg ctgatgtgct tatgggtatg 360
tatcgcacag acatgcggca taacacggct aaaatcgcgg ttaacatcgc tgttcgtgag 420
cagattgaag ctggagttgg tgcgtggcgt ctggtcactg actacgaaga ccaaagtccg 480
acgagcaaca atcaggttat ccgtcgagag cctatccata gtgcctgctc ccatgttatc 540
tgggacagca acagcaaact gatggataag tctgacgccc gtcactgcac agttatccac 600
tcaatgagcc agaatggttg ggaggatttc gcagaaaaat acgacctcga tgcggatgat 660
attccatcat tccagaaccc caacgattgg gtatttccat ggctgacgca ggacacaatt 720
cagatcgctg agttttacga agtggtcgag aagaaagaga cggcgtttat ctaccaagac 780
ccggttacgg gtgagccggt aagctacttt aagcgcgata ttaaagacgt catcgatgac 840
ctggctgata gtggatttat caaaattgca gagcgccaga ttaagcgtcg ccgggtatac 900
aaatcgatta tcacctgcac tgctgtactc aaagacaagc agctcattgc tggcgagcat 960
atccccattg ttccggtgtt cggagagtgg ggcttcgttg aagataaaga agtgtatgag 1020
ggtgtcgtcc gcctgacaaa agacggccag cgtctgcgca acatgattat gtcgttcaac 1080
gccgacatcg tggcccgcac tccgaagaag aagccgttct tctggcctga gcagattgca 1140
ggctttgagc atatgtacga cggtaacgac gattacccat actacctgct caatcgcact 1200
gacgaaaata gtggagacct tccgactcag ccgctggcat attatgaaaa cccggaagtg 1260
ccgcaagcca acgcctacat gctggaagca gcaaccagcg cagtaaaaga ggttgccact 1320
ctcggagttg atacagaagc ggtaaatggc ggacaggttg cgtttgatac cgtcaatcaa 1380
ctgaatatga gggctgacct tgagacatac gtgtttcagg ataatctggc taccgccatg 1440
cgccgtgacg gagagattta ccagtcgata gttaatgaca tctacgatgt tcctcgcaac 1500
gttacgatta cccttgagga tggcagcgag aaagatgttc agctaatggc tgaggttgtt 1560
gaccttgcta ctggagaaaa gcaggtacta aacgatatca gggggcgcta tgagtgctac 1620
acggatgttg gaccatcatt ccagtccatg aagcagcaaa accgcgcaga aattcttgag 1680
ttgctcggca agacgccaca gggaacgcca gaatatcaac tgctgttgct tcagtacttc 1740
accctgcttg atggtaaagg tgttgagatg atgcgtgact atgccaacaa gcagcttatt 1800
cagatgggcg ttaagaagcc agaaacgccc gaagagcagc aatggttagt agaggcgcaa 1860
caagccaaac aaggtcaaca agacccggca atggttcagg ctcagggcgt actcctgcag 1920
gggcaggctg aactggctaa agctcagaac cagacgctgt ccctgcaaat cgatgcagct 1980
aaagtcgaag cgcagaacca gcttaacgct gccagaatcg cagaaatctt caacaacatg 2040
gacctcagta aacaatctga gtttagagag ttccttaaaa ccgttgcttc attccagcag 2100
gaccgcagcg aagacgctcg cgcaaatgct gagttactcc ttaaaggcga tgaacagacg 2160
cacaagcagc gaatggacat tgccaacatc ctgcaatcgc agagacaaaa tcaaccttcc 2220
ggcagtgtag ccgagacacc tcaaggtggc gtacattcgc ctaacaagaa gtaa 2274
<210> 83
<211> 1026
<212> DNA
<213> Bacteriophage phi-29
<400> 83
atgaatcata aacatcatca tcatcatcac agcagcggcg aaaacctgta ttttcagggc 60
catatgattc tgacatacct gtctatcaat gtgatacagc ttcaaaaacg gaatagatgg 120
tttattcact atctgaacta ccttcaatct ctagcctatc agctatttga gtgggagaac 180
ctaccgccta cgataaaccc tagtttctta gaaaagtcta ttcatcaatt cgggtacgtg 240
gggttctata aagaccctgt catcagttat atcgcttgta atggcgctct atcgggtcag 300
agagacgttt acaaccaagc tacagttttt agagccgcat ctcctgtgta tcaaaaagaa 360
ttcaagctat acaactatag agatatgaag gaagaagata tgggtgttgt tatctacaac 420
aatgacatgg ctttccctac cacgccaacg ctagaattgt ttgcggctga attggctgaa 480
ttaaaagaaa tcatatcggt caaccaaaac gctcaaaaga cacccgtctt aattagagca 540
aatgacaata accaactgag cttaaaacaa gtgtataacc agtatgaagg taatgcccct 600
gttatcttcg ctcacgaagc tctcgacagt gactctatag aagtgtttaa gactgatgct 660
ccctatgtgg tggacaagtt aaacgctcag aaaaatgcag tatggaatga gatgatgact 720
ttccttggca ttaagaacgc taacctagag aagaaagagc gcatggttac ggatgaagtt 780
tccagtaacg atgaacagat cgagtctagc ggcactgtat ttttgaagtc gagggaagaa 840
gcatgtgaga agattaatga gctatatggt ctcaatgtta aagttaaatt cagatatgac 900
atcgtggaac aaatgagacg tgagctacag caaatagaaa atgtttcacg tggaacatcg 960
gacggtgaaa caaatgaggc gcatattgtg atggtggatg cgtataaacc gaccaaataa 1020
ctcgag 1026
<210> 84
<211> 942
<212> DNA
<213> Bacteriophage phi-29
<400> 84
atgattctga catacctgtc tatcaatgtg atacagcttc aaaaacggaa tagatggttt 60
attcactatc tgaactacct tcaatctcta gcctatcagc tatttgagtg ggagaaccta 120
ccgcctacga taaaccctag tttcttagaa aagtctattc atcaattcgg gtacgtgggg 180
ttctataaag accctgtcat cagttatatc gcttgtaatg gcgctctatc gggtcagaga 240
gacgtttaca accaagctac agtttttaga gccgcatctc ctgtgtatca aaaagaattc 300
aagctataca actatagaga tatgaaggaa gaagatatgg gtgttgttat ctacaacaat 360
gacatggctt tccctaccac gccaacgcta gaattgtttg cggctgaatt ggctgaatta 420
aaagaaatca tatcggtcaa ccaaaacgct caaaagacac ccgtcttaat tagagcaaat 480
gacaataacc aactgagctt aaaacaagtg tataaccagt atgaaggtaa tgcccctgtt 540
atcttcgctc acgaagctct cgacagtgac tctatagaag tgtttaagac tgatgctccc 600
tatgtggtgg acaagttaaa cgctcagaaa aatgcagtat ggaatgagat gatgactttc 660
cttggcatta agaacgctaa cctagagaag aaagagcgca tggttacgga tgaagtttcc 720
agtaacgatg aacagatcga gtctagcggc actgtatttt tgaagtcgag ggaagaagca 780
tgtgagaaga ttaatgagct atatggtctc aatgttaaag ttaaattcag atatgacatc 840
gtggaacaaa tgagacgtga gctacagcaa atagaaaatg tttcacgtgg aacatcggac 900
ggtgaaacaa atgagctcga gcaccaccac caccaccact ga 942
<210> 85
<211> 927
<212> DNA
<213> Bacteriophage phi-29
<400> 85
atggcacgta aacgcagtaa cacataccga tctatcaatg agatacagcg tcaaaaacgg 60
aatagatggt ttattcacta tctgaactac cttcaatctc tagcctatca gctatttgag 120
tgggagaacc taccgcctac gataaaccct agtttcttag aaaagtctat tcatcaattc 180
gggtacgtgg ggttctataa agaccctgtc atcagttata tcgcttgtaa tggcgctcta 240
tcgggtcaga gagacgttta caaccaagct acagttttta gagccgcatc tcctgtgtat 300
caaaaagaat tcaagctata caactataga gatatgaagg aagaagatat gggtgttgtt 360
atctacaaca atgacatggc tttccctacc acgccaacgc tagaattgtt tgcggctgaa 420
ttggctgaat taaaagaaat catatcggtc aaccaaaacg ctcaaaagac acccgtctta 480
attagagcca atgacaataa ctgcctgagc ttaaaacaag tgtataacca gtatgaaggt 540
aatgcccctg ttatcttcgc tcacgaagct ctcgacagtg actctataga agtgtttaag 600
actgatgctc cctatgtggt ggacaagtta aacgctcaga aaaatgcagt atggaatgag 660
atgatgactt tccttggcat taagaacgct aacctagaga agaaagagcg catggttacg 720
gatgaagttt ccagtaacga tgaacagatc gagtctagcg gcactgtatt tttgaagtcg 780
agggaagaag catgtgagaa gattaatgag ctatatggtc tcaatgttaa agttaaattc 840
agatatgaca tcgtggaaca aatgagacgt gagctacagc aaatagaaaa tgtttcacgt 900
ggaacatcgg acggtgaaac aaatgag 927
<210> 86
<211> 927
<212> DNA
<213> Bacteriophage phi-29
<400> 86
atggcacgta aacgcagtaa cacataccga tctatcaatg agatacagcg tcaaaaacgg 60
aatagatggt ttattcacta tctgaactac cttcaatctc tagcctatca gctatttgag 120
tgggagaacc taccgcctac gataaaccct agtttcttag aaaagtctat tcatcaattc 180
gggtacgtgg ggttctataa agaccctgtc atcagttata tcgcttgtaa tggcgctcta 240
tcgggtcaga gagacgttta caaccaagct acagttttta gagccgcatc tcctgtgtat 300
caaaaagaat tcaagctata caactataga gatatgaagg aagaagatat gggtgttgtt 360
atctacaaca atgacatggc tttccctacc acgccaacgc tatgcttgtt tgcggctgaa 420
ttggctgaat taaaagaaat catatcggtc aaccaaaacg ctcaaaagac acccgtctta 480
attagagcca atgacaataa ccaactgagc ttaaaacaag tgtataacca gtatgaaggt 540
aatgcccctg ttatcttcgc tcacgaagct ctcgacagtg actctataga agtgtttaag 600
actgatgctc cctatgtggt ggacaagtta aacgctcaga aaaatgcagt atggaatgag 660
atgatgactt tccttggcat taagaacgct aacctagaga agaaagagcg catggttacg 720
gatgaagttt ccagtaacga tgaacagatc gagtctagcg gcactgtatt tttgaagtcg 780
agggaagaag catgtgagaa gattaatgag ctatatggtc tcaatgttaa agttaaattc 840
agatatgaca tcgtggaaca aatgagacgt gagctacagc aaatagaaaa tgtttcacgt 900
ggaacatcgg acggtgaaac aaatgag 927
<210> 87
<211> 927
<212> DNA
<213> Bacteriophage phi-29
<400> 87
atggcacgta aacgcagtaa cacataccga tctatcaatg agatacagcg tcaaaaacgg 60
aatagatggt ttattcacta tctgaactac cttcaatctc tagcctatca gctatttgag 120
tgggagaacc taccgcctac gataaaccct agtttcttag aaaagtctat tcatcaattc 180
gggtacgtgg ggttctataa agaccctgtc atcagttata tcgcttgtaa tggctgtcta 240
tcgggtcaga gagacgttta caaccaagct acagttttta gagccgcatc tcctgtgtat 300
caaaaagaat tcaagctata caactataga gatatgaagg aagaagatat gggtgttgtt 360
atctacaaca atgacatggc tttccctacc acgccaacgc tagaattgtt tgcggctgaa 420
ttggctgaat taaaagaaat catatcggtc aaccaaaacg ctcaaaagac acccgtctta 480
attagagcca atgacaataa ccaactgagc ttaaaacaag tgtataacca gtatgaaggt 540
aatgcccctg ttatcttcgc tcacgaagct ctcgacagtg actctataga agtgtttaag 600
actgatgctc cctatgtggt ggacaagtta aacgctcaga aaaatgcagt atggaatgag 660
atgatgactt tccttggcat taagaacgct aacctagaga agaaagagcg catggttacg 720
gatgaagttt ccagtaacga tgaacagatc gagtctagcg gcactgtatt tttgaagtcg 780
agggaagaag catgtgagaa gattaatgag ctatatggtc tcaatgttaa agttaaattc 840
agatatgaca tcgtggaaca aatgagacgt gagctacagc aaatagaaaa tgtttcacgt 900
ggaacatcgg acggtgaaac aaatgag 927
<210> 88
<211> 1575
<212> DNA
<213> Bacteriophage phi-29
<400> 88
atggcatatg taccattatc aggaacgaac gtcaggattt tagctgacgt tcctttctct 60
aatgattata aaaacacgag atggttcaca tcttcaagta atcagtataa ctggtttaac 120
agcaaatcac gtgtgtatga aatgagtaaa gtaacattca tggggtttag agaaaataaa 180
ccatatgttt cggttagtct tcccatagat aagctttaca gtgcgtcata tattatgttt 240
caaaatgcag actacggtaa caagtggttt tatgcatttg taaccgagtt agaatttaaa 300
aatagtgctg ttacctacgt tcactttgaa attgatgttc tccaaacatg gatgttcgat 360
attaaatttc aagaatcatt cattgtgagg gagcacgtta aattatggaa tgacgacggg 420
acaccgacta tcaacacaat tgatgagggt ctcagctacg gaagtgaata cgacatagtt 480
tctgtagaaa accataaacc atacgacgac atgatgtttc tcgtgattat ttccaaaagc 540
attatgcatg ggacgccggg agaagaggaa agcaggctaa atgacataaa cgcaagcctg 600
aacggcatgc cgcaacctct ctgctactat attcacccat tctacaaaga tggtaaagtt 660
cctaaaacgt atatcggaga taacaacgct aacttgtctc ctattgtcaa tatgctcacc 720
aatatctttt cacagaagag cgctgttaac gatattgtca atatgtatgt gactgattat 780
attggtttga agcttgacta taaaaatggt gataaagaat tgaagctcga taaagacatg 840
tttgaacagg cgggtatagc tgacgataaa cacggtaacg ttgacaccat ctttgtgaag 900
aaaatacctg attatgaagc cctagaaata gacacaggtg ataaatgggg tggcttcaca 960
aaagaccaag aaagcaaact gatgatgtac ccttactgcg ttacggaaat aactgacttt 1020
aaaggcaacc atatgaatct gaaaaccgag tacatcaata acagtaaact atgtatacag 1080
gttaggggtt cactaggggt cagtaacaag gttgcctaca gtgttcagga ttataacgca 1140
gatagcgcat tgagtggcgg caatagattg actgcgtctc tagattcatc cttaatcaac 1200
aacaacccaa atgacatagc aatactaaat gactatctat ctgcttatca gttaacgaaa 1260
atgggcggca acacagcgtt tgattacggg aatgggtaca gaggtgtgta cgtcatcaaa 1320
aagcaattga aggctgaata cagacgaagt ctatcaagtt tcttccataa atacggatac 1380
aagattaaca gggtaaagaa accaaattta agaacacgaa aagcatttaa ctatgttcag 1440
acaaaagact gtttcatttc aggggacatc aataacaatg acttacagga aataagaaca 1500
attttcgata atggtattac tctttggcat actgacaaca tcggaaatta cagcgtcgag 1560
aatgaattga ggtga 1575
<210> 89
<211> 1575
<212> DNA
<213> Bacteriophage phi-29
<400> 89
atggcatatg taccattatc aggaacgaac gtcaggattt tagctgacgt tcctttctct 60
aatgattata aaaacacgag atggttcaca tcttcaagta atcagtataa ctggtttaac 120
agcaaatcac gtgtgtatga aatgagtaaa gtaacattca tggggtttag agaaaataaa 180
ccatatgttt cggttagtct tcccatagat aagctttaca gtgcgtcata tattatgttt 240
caaaatgcag actacggtaa caagtggttt tatgcatttg taaccgagtt agaatttaaa 300
aatagtgctg ttacctacgt tcactttgaa attgatgttc tccaaacatg gatgttcgat 360
attaaatttc aagaatcatt cattgtgagg gagcacgtta ttttatggaa tctgctgggg 420
acaccgacta tcaacacaat tgatgagggt ctcagctacg gaagtgaata cgacatagtt 480
tctgtagaaa accataaacc atacgacgac atgatgtttc tcgtgattat ttccaaaagc 540
attatgcatg ggacgccggg agaagaggaa agcaggctaa atgacataaa cgcaagcctg 600
aacggcatgc cgcaacctct ctgctactat attcacccat tctacaaaga tggtaaagtt 660
cctaaaacgt atatcggaga taacaacgct aacttgtctc ctattgtcaa tatgctcacc 720
aatatctttt cacagaagag cgctgttaac gatattgtca atatgtatgt gactgattat 780
attggtttga agcttgacta taaaaatggt gataaagaat tgaagctcga taaagacatg 840
tttgaacagg cgggtatagc tgacgataaa cacggtaacg ttgacaccat ctttgtgaag 900
aaaatacctg attatgaagc cctagaaata gacacaggtg ataaatgggg tggcttcaca 960
aaagaccaag aaagcaaact gatgatgtac ccttactgcg ttacggaaat aactgacttt 1020
aaaggcaacc atatgaatct gaaaaccgag tacatcaata acagtaaact aaagatacag 1080
gttaggggtt cactaggggt cagtaacaag gttgcctaca gtgttcagga ttataacgca 1140
gatagcgcat tgagtggcgg caatagattg actgcgtctc tagattcatc cttaatcaac 1200
aacaacccaa atgacatagc aatactaaat gactatctat ctgcttatca gttaacgaaa 1260
atgggcggca acacagcgtt tgattacggg aatgggtaca gaggtgtgta cgtcatcaaa 1320
aagcaattga aggctgaata cagacgaagt ctatcaagtt tcttccataa atacggatac 1380
aagattaaca gggtaaagaa accaaattta agaacacgaa aagcatttaa ctatgttcag 1440
acaaaagact gtttcatttc aggggacatc aataacaatg acttacagga aataagaaca 1500
attttcgata atggtattac tctttggcat actgacaaca tcggaaatta cagcgtcgag 1560
aatgaattga ggtga 1575
<210> 90
<211> 1575
<212> DNA
<213> Bacteriophage phi-29
<400> 90
atggcatatg taccattatc aggaacgaac gtcaggattt tagctgacgt tcctttctct 60
aatgattata aaaacacgag atggttcaca tcttcaagta atcagtataa ctggtttaac 120
agcaaatcac gtgtgtatga aatgagtaaa gtaacattca tggggtttag agaaaataaa 180
ccatatgttt cggttagtct tcccatagat aagctttaca gtgcgtcata tattatgttt 240
caaaatgcag actacggtaa caagtggttt tatgcatttg taaccgagtt agaatttaaa 300
aatagtgctg ttacctacgt tcactttgaa attgatgttc tccaaacatg gatgttcgat 360
attaaatttc aagaatcatt cattgtgagg gagcacgtta ttttatggaa tctgctgggg 420
acaccgacta tcaacacaat tgatgagggt ctcagctacg gaagtgaata cctgatagtt 480
tctgtagtta accataaacc atacgacgac atgatgtttc tcgtgattat ttccaaaagc 540
attatgcatg ggacgccggg agaagaggaa agcaggctaa atgacataaa cgcaagcctg 600
aacggcatgc cgcaacctct ctgctactat attcacccat tctacaaaga tggtaaagtt 660
cctaaaacgt atatcggaga taacaacgct aacttgtctc ctattgtcaa tatgctcacc 720
aatatctttt cacagaagag cgctgttaac gatattgtca atatgtatgt gactgattat 780
attggtttga agcttgacta taaaaatggt gataaagaat tgaagctcga taaagacatg 840
tttgaacagg cgggtatagc tgacgataaa cacggtaacg ttgacaccat ctttgtgaag 900
aaaatacctg attatgaagc cctagaaata gacacaggtg ataaatgggg tggcttcaca 960
aaagaccaag aaagcaaact gatgatgtac ccttactgcg ttacggaaat aactgacttt 1020
aaaggcaacc atatgaatct gaaaaccgag tacatcaata acagtaaact aaagatacag 1080
gttaggggtt cactaggggt cagtaacaag gttgcctaca gtgttcagga ttataacgca 1140
gatagcgcat tgagtggcgg caatagattg actgcgtctc tagattcatc cttaatcaac 1200
aacaacccaa atgacatagc aatactaaat gactatctat ctgcttatca gttaacgaaa 1260
atgggcggca acacagcgtt tgattacggg aatgggtaca gaggtgtgta cgtcatcaaa 1320
aagcaattga aggctgaata cagacgaagt ctatcaagtt tcttccataa atacggatac 1380
aagattaaca gggtaaagaa accaaattta agaacacgaa aagcatttaa ctatgttcag 1440
acaaaagact gtttcatttc aggggacatc aataacaatg acttacagga aataagaaca 1500
attttcgata atggtattac tctttggcat actgacaaca tcggaaatta cagcgtcgag 1560
aatgaattga ggtga 1575
<210> 91
<211> 1575
<212> DNA
<213> Bacteriophage phi-29
<400> 91
atggcatatg taccattatc aggaacgaac gtcaggattt tagctgacgt tcctttctct 60
aatgattata aaaacacgag atggttcaca tcttcaagta atcagtataa ctggtttaac 120
agcaaatcac gtgtgtatga aatgagtaaa gtaacattca tggggtttag agaaaataaa 180
ccatatgttt cggttagtct tcccatagat aagctttaca gtgcgtcata tattatgttt 240
caaaatgcag actacggtaa caagtggttt tatgcatttg taaccgagtt agaatttaaa 300
aatagtgctg ttacctacgt tcactttgaa attgatgttc tccaaacatg gatgttcgat 360
attaaatttc aagaatcatt cattgtgagg gagcacgtta ttttatggaa tctgctgggg 420
acaccgacta tcaacacaat tgatgagggt ctcagctacg gaagtgaata cctgatagtt 480
tctgtagtta accataaacc atacgacgac atgatgtttc tcgtgattat ttccaaaagc 540
attatgcatg ggacgccggg agaagaggaa agcaggctaa atgacataaa cgcaagcctg 600
aacggcatgc cgcaacctct ctgctactat attcacccat tctacaaaga tggtaaagtt 660
cctaaaacgt atatcggaga taacaacgct aacttgtctc ctattgtcaa tatgctcacc 720
aatatctttt cacagaagag cgctgttaac gatattgtca atatgtatgt gactgattat 780
attggtttga agcttgacta taaaaatggt gataaagaat tgaagctcga taaagacatg 840
tttgaacagg cgggtatagc tgacgataaa cacggtaacg ttgacaccat ctttgtgaag 900
aaaatacctg attatgaagc cctagttata gttacaggtg ataaatgggg tggcttcaca 960
aaagaccaag aaagcaaact gatgatgtac ccttactgcg ttacggaaat aactgacttt 1020
aaaggcaacc atatgaatct gaaaaccgag tacatcaata acagtaaact aaagatacag 1080
gttaggggtt cactaggggt cagtaacaag gttgcctaca gtgttcagga ttataacgca 1140
gatagcgcat tgagtggcgg caatagattg actgcgtctc tagattcatc cttaatcaac 1200
aacaacccaa atgacatagc aatactaaat gactatctat ctgcttatca gttaacgaaa 1260
atgggcggca acacagcgtt tgattacggg aatgggtaca gaggtgtgta cgtcatcaaa 1320
aagcaattga aggctgaata cagacgaagt ctatcaagtt tcttccataa atacggatac 1380
aagattaaca gggtaaagaa accaaattta agaacacgaa aagcatttaa ctatgttcag 1440
acaaaagact gtttcatttc aggggacatc aataacaatg acttacagga aataagaaca 1500
attttcgata atggtattac tctttggcat actgacaaca tcggaaatta cagcgtcgag 1560
aatgaattga ggtga 1575
<210> 92
<211> 1647
<212> DNA
<213> Bacteriophage phi-29
<400> 92
atgaatcata aacatcatca tcatcatcac agcagcggcg aaaacctgta ttttcagggc 60
catatgggat ccatggcata tgtaccatta tcaggaacga acgtcaggat tttagctgac 120
gttcctttct ctaatgatta taaaaacacg agatggttca catcttcaag taatcagtat 180
aactggttta acagcaaatc acgtgtgtat gaaatgagta aagtaacatt catggggttt 240
agagaaaata aaccatatgt ttcggttagt cttcccatag ataagcttta cagtgcgtca 300
tatattatgt ttcaaaatgc agactacggt aacaagtggt tttatgcatt tgtaaccgag 360
ttagaattta aaaatagtgc tgttacctac gttcactttg aaattgatgt tctccaaaca 420
tggatgttcg atattaaatt tcaagaatca ttcattgtga gggagcacgt taaattatgg 480
aatgacgacg ggacaccgac tatcaacaca attgatgagg gtctcagcta cggaagtgaa 540
tacgacatag tttctgtaga aaaccataaa ccatacgacg acatgatgtt tctcgtgatt 600
atttccaaaa gcattatgca tgggacgccg ggagaagagg aaagcaggct aaatgacata 660
aacgcaagcc tgaacggcat gccgcaacct ctctgctact atattcaccc attctacaaa 720
gatggtaaag ttcctaaaac gtatatcgga gataacaacg ctaacttgtc tcctattgtc 780
aatatgctca ccaatatctt ttcacagaag agcgctgtta acgatattgt caatatgtat 840
gtgactgatt atattggttt gaagcttgac tataaaaatg gtgataaaga attgaagctc 900
gataaagaca tgtttgaaca ggcgggtata gctgacgata aacacggtaa cgttgacacc 960
atctttgtga agaaaatacc tgattatgaa gccctagaaa tagacacagg tgataaatgg 1020
ggtggcttca caaaagacca agaaagcaaa ctgatgatgt acccttactg cgttacggaa 1080
ataactgact ttaaaggcaa ccatatgaat ctgaaaaccg agtacatcaa taacagtaaa 1140
ctaaagatac aggttagggg ttcactaggg gtcagtaaca aggttgccta cagtgttcag 1200
gattataacg cagatagcgc attgagtggc ggcaatagat tgactgcgtc tctagattca 1260
tccttaatca acaacaaccc aaatgacata gcaatactaa atgactatct atctgcttat 1320
cagttaacga aaatgggcgg caacacagcg tttgattacg ggaatgggta cagaggtgtg 1380
tacgtcatca aaaagcaatt gaaggctgaa tacagacgaa gtctatcaag tttcttccat 1440
aaatacggat acaagattaa cagggtaaag aaaccaaatt taagaacacg aaaagcattt 1500
aactatgttc agacaaaaga ctgtttcatt tcaggggaca tcaataacaa tgacttacag 1560
gaaataagaa caattttcga taatggtatt actctttggc atactgacaa catcggaaat 1620
tacagcgtcg agaatgaatt gaggtga 1647
<210> 93
<211> 1800
<212> DNA
<213> Bacteriophage phi-29
<400> 93
atggcatatg taccattatc aggaacgaac gtcaggattt tagctgacgt tcctttctct 60
aatgattata aaaacacgag atggttcaca tcttcaagta atcagtataa ctggtttaac 120
agcaaatcac gtgtgtatga aatgagtaaa gtaacattca tggggtttag agaaaataaa 180
ccatatgttt cggttagtct tcccatagat aagctttaca gtgcgtcata tattatgttt 240
caaaatgcag actacggtaa caagtggttt tatgcatttg taaccgagtt agaatttaaa 300
aatagtgctg ttacctacgt tcactttgaa attgatgttc tccaaacatg gatgttcgat 360
attaaatttc aagaatcatt cattgtgagg gagcacgtta aattatggaa tgacgacggg 420
acaccgacta tcaacacaat tgatgagggt ctcagctacg gaagtgaata cgacatagtt 480
tctgtagaaa accataaacc atacgacgac atgatgtttc tcgtgattat ttccaaaagc 540
attatgcatg ggacgccggg agaagaggaa agcaggctaa atgacataaa cgcaagcctg 600
aacggcatgc cgcaacctct ctgctactat attcacccat tctacaaaga tggtaaagtt 660
cctaaaacgt atatcggaga taacaacgct aacttgtctc ctattgtcaa tatgctcacc 720
aatatctttt cacagaagag cgctgttaac gatattgtca atatgtatgt gactgattat 780
attggtttga agcttgacta taaaaatggt gataaagaat tgaagctcga taaagacatg 840
tttgaacagg cgggtatagc tgacgataaa cacggtaacg ttgacaccat ctttgtgaag 900
aaaatacctg attatgaagc cctagaaata gacacaggtg ataaatgggg tggcttcaca 960
aaagaccaag aaagcaaact gatgatgtac ccttactgcg ttacggaaat aactgacttt 1020
aaaggcaacc atatgaatct gaaaaccgag tacatcaata acagtaaact aaagatacag 1080
gttaggggtt cactaggggt cagtaacaag gttgcctaca gtgttcagga ttataacgca 1140
gatagcgcat tgagtggcgg caatagattg actgcgtctc tagattcatc cttaatcaac 1200
aacaacccaa atgacatagc aatactaaat gactatctat ctgcttattt acagggcaac 1260
aaaaattcac tagagaacca aaaatcgtct atccttttta atggcattat gggtatgatc 1320
ggcggaggta tatcagcggg agcaagtgcg gcaggaggtt cagccctagg gatggcttca 1380
tcagttacag ggatgacaag cactgcgggt aatgctgttc tacagatgca agcgatgcaa 1440
gccaagcaag ccgatatagc aaacattccg ccgcagttaa cgaaaatggg cggcaacaca 1500
gcgtttgatt acgggaatgg gtacagaggt gtgtacgtca tcaaaaagca attgaaggct 1560
gaatacagac gaagtctatc aagtttcttc cataaatacg gatacaagat taacagggta 1620
aagaaaccaa atttaagaac acgaaaagca tttaactatg ttcagacaaa agactgtttc 1680
atttcagggg acatcaataa caatgactta caggaaataa gaacaatttt cgataatggt 1740
attactcttt ggcatactga caacatcgga aattacagcg tcgagaatga attgaggtga 1800
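The entries above follow the usual sequence-listing layout: groups of ten bases per line, followed by a running base count. A minimal Python sketch of how such lines could be joined and checked, and how two equal-length variants could be compared codon by codon, is given below. The function names `parse_listing_lines` and `codon_differences` are hypothetical helpers introduced only for illustration; they are not part of the patent.

```python
import re


def parse_listing_lines(lines):
    """Join sequence-listing lines and verify each trailing running count."""
    sequence = ""
    for line in lines:
        # The last whitespace-separated token is the running base count.
        *groups, count = line.split()
        chunk = "".join(groups)
        if not re.fullmatch(r"[acgtn]+", chunk):
            raise ValueError(f"unexpected characters in line: {line!r}")
        sequence += chunk
        if len(sequence) != int(count):
            raise ValueError(
                f"running count {count} does not match length {len(sequence)}"
            )
    return sequence


def codon_differences(seq_a, seq_b):
    """Return (codon_number, codon_a, codon_b) for each differing codon (1-based)."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have equal length")
    diffs = []
    for start in range(0, len(seq_a), 3):
        codon_a = seq_a[start:start + 3]
        codon_b = seq_b[start:start + 3]
        if codon_a != codon_b:
            diffs.append((start // 3 + 1, codon_a, codon_b))
    return diffs


if __name__ == "__main__":
    # First two lines of SEQ ID NO: 83 as printed in the listing.
    head = [
        "atgaatcata aacatcatca tcatcatcac agcagcggcg aaaacctgta ttttcagggc 60",
        "catatgattc tgacatacct gtctatcaat gtgatacagc ttcaaaaacg gaatagatgg 120",
    ]
    print(len(parse_listing_lines(head)))

    # Bases 361-420 of SEQ ID NO: 85 and SEQ ID NO: 86 (spaces removed).
    frag_85 = "atctacaacaatgacatggctttccctaccacgccaacgctagaattgtttgcggctgaa"
    frag_86 = "atctacaacaatgacatggctttccctaccacgccaacgctatgcttgtttgcggctgaa"
    print(codon_differences(frag_85, frag_86))
```

Codon numbers returned for a fragment are relative to the start of that fragment. Because the fragment above begins at base 361, which is in frame, adding 120 to the reported codon number gives the position within the full sequence.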

Claims (25)

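1. A method of detecting an analyte, comprising:
(a) contacting a sample containing the analyte with a nanopore assembly, the nanopore assembly comprising a channel formed by a plurality of subunits and a probe for detecting the analyte, wherein each subunit comprises a mutant phi 29 tail protein comprising a variant of the sequence shown in SEQ ID NO: 3, the variant comprising a substitution at one or more of the following positions: K134I, D138L, D139L, D158L, E163V, E309V, D311V, K321V, K356A, K358A, D377A, D381V, N388L, R524I, R539A and E595V;
(b) applying an electric current across the channel of the nanopore assembly;
(c) determining the current through the channel at one or more time intervals; and
(d) comparing the current measured at the one or more time intervals with a reference current, wherein a change in current relative to the reference current indicates the presence of the analyte in the sample.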
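2. The method of claim 1, wherein the reference current is measured with a sample that does not contain the analyte.
3. The method of claim 1, wherein the nanopore assembly is placed on a support.
4. The method of claim 3, wherein the channel is inserted in a polymer membrane or a lipid membrane.
5. The method of claim 1, wherein the channel is inserted in a polymer membrane or a lipid membrane.
6. The method of claim 5, wherein the polymer membrane comprises an alternating copolymer, a periodic copolymer, a block copolymer, a diblock copolymer, a triblock copolymer, a terpolymer, or a combination thereof.
7. The method of claim 5, wherein the polymer membrane comprises PMOXA-PDMS-PMOXA.
8. The method of claim 1, wherein the nanopore assembly further comprises at least one of cholesterol and porphyrin.
9. The method of claim 5, wherein the nanopore assembly further comprises at least one of cholesterol and porphyrin.
10. The method of claim 1, wherein the probe is operably linked to at least one subunit of the phi 29 tail protein.
11. The method of claim 9, wherein the probe is operably linked to at least one subunit of the phi 29 tail protein.
12. The method of claim 10, wherein the probe is selected from a carbohydrate, an aptamer, a nucleic acid, a peptide, a protein, an antibody and a receptor.
13. The method of claim 10, wherein the probe is operably linked to at least one of the subunits via a covalent bond.
14. The method of claim 13, wherein the covalent bond comprises a disulfide bond, an ester bond or a thiol bond.
15. The method of claim 10, wherein the probe is operably linked at a position near the entrance of the channel or at a position inside the channel.
16. The method of any one of the preceding claims, wherein the analyte is selected from a nucleic acid, an amino acid, a peptide, a protein and a polymer.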
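17. A mutant phi 29 tail protein comprising a variant of the sequence shown in SEQ ID NO: 3, the variant comprising a substitution at one or more of the following positions: K134I, D138L, D139L, D158L, E163V, E309V, D311V, K321V, K356A, K358A, D377A, D381V, N388L, R524I, R539A and E595V.
18. The mutant phi 29 tail protein of claim 17, wherein at least one residue is substituted with cysteine.
19. A nanopore assembly comprising the mutant phi 29 tail protein of claim 17 or 18 and a probe for detecting an analyte.
20. The nanopore assembly of claim 19, wherein the probe is operably linked to at least one subunit of the phi 29 tail protein.
21. The nanopore assembly of claim 19, wherein the probe is selected from a carbohydrate, an aptamer, a nucleic acid, a peptide, a protein, an antibody and a receptor.
22. The nanopore assembly of claim 19, wherein the probe is operably linked to at least one subunit of the phi 29 tail protein via a covalent bond.
23. The nanopore assembly of claim 22, wherein the covalent bond comprises a disulfide bond, an ester bond or a thiol bond.
24. The nanopore assembly of claim 19, wherein the probe is operably linked at a position near the entrance of the channel or at a position inside the channel.
25. The nanopore assembly of any one of claims 19-24, wherein the analyte is selected from a nucleic acid, an amino acid, a peptide, a protein and a polymer.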
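Steps (c) and (d) of claim 1, together with claim 2, amount to comparing currents sampled at time intervals against a reference current obtained from an analyte-free sample. A minimal Python sketch of that comparison follows. The function names, the picoampere units, and the 10% relative-change threshold are all assumptions made for illustration; the claims do not specify a threshold or any particular decision rule.

```python
from statistics import mean


def reference_from_blank(blank_currents_pa):
    """Estimate the reference current as the mean of analyte-free readings."""
    return mean(blank_currents_pa)


def flag_current_changes(currents_pa, reference_pa, threshold_fraction=0.1):
    """Return indices of time intervals whose current deviates from the reference.

    threshold_fraction is a hypothetical cutoff on the relative change;
    it is not taken from the patent.
    """
    if reference_pa == 0:
        raise ValueError("reference current must be non-zero")
    return [
        index
        for index, current in enumerate(currents_pa)
        if abs(current - reference_pa) / abs(reference_pa) > threshold_fraction
    ]


if __name__ == "__main__":
    # Made-up readings in picoamperes, for illustration only.
    reference = reference_from_blank([100.0, 102.0, 98.0])
    flagged = flag_current_changes([99.0, 101.0, 70.0, 72.0, 100.0], reference)
    # A non-empty result would be read as a current change relative to the reference.
    print(reference, flagged)
```

A fixed relative threshold is the simplest possible rule. A real measurement would likely need noise-aware statistics and a minimum event duration, neither of which is described in the claims.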
CN201980020065.3A 2018-02-12 2019-02-11 Nanopore assemblies and uses thereof Active CN112423878B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201862629604P 2018-02-12 2018-02-12
US62/629,604 2018-02-12
PCT/US2019/017432 WO2019157424A1 (en) 2018-02-12 2019-02-11 Nanopore assemblies and uses thereof

Publications (2)

Publication Number Publication Date
CN112423878A CN112423878A (zh) 2021-02-26
CN112423878B true CN112423878B (zh) 2024-06-18

Family

ID=67548611

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201980020065.3A Active CN112423878B (zh) 2018-02-12 2019-02-11 Nanopore assemblies and uses thereof

Country Status (7)

Country Link
US (1) US20200399693A1 (zh)
EP (1) EP3752284A4 (zh)
JP (1) JP7353290B2 (zh)
CN (1) CN112423878B (zh)
AU (1) AU2019218263A1 (zh)
CA (1) CA3091126A1 (zh)
WO (1) WO2019157424A1 (zh)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112831395B (zh) * 2019-11-25 2024-01-16 Shenzhen BGI Life Science Research Institute Cell membrane-like membrane for nanopore sequencing
GB202118908D0 (en) 2021-12-23 2022-02-09 Oxford Nanopore Tech Ltd Method
GB202118906D0 (en) 2021-12-23 2022-02-09 Oxford Nanopore Tech Ltd Method
CN115877018A (zh) * 2022-08-05 2023-03-31 West China Hospital, Sichuan University Use of a porin in the preparation of a kit for detecting dehydroepiandrosterone sulfate
GB202216162D0 (en) 2022-10-31 2022-12-14 Oxford Nanopore Tech Plc Method
GB202307486D0 (en) 2023-05-18 2023-07-05 Oxford Nanopore Tech Plc Method

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104220874A (zh) * 2012-02-15 2014-12-17 Oxford Nanopore Technologies Ltd Aptamer method
WO2016115522A2 (en) * 2015-01-16 2016-07-21 University Of Kentucky Research Foundation Lipid bilayer-integrated spp1 connector protein nanopore and spp1 connector protein variants for use as lipid bilayer-integrated nanopore

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005530142A (ja) 2002-06-13 2005-10-06 リオトロピック セラピュティックス アイエヌシー. 標的物質を保持したナノ多孔質粒子
US8968545B2 (en) * 2012-04-02 2015-03-03 Lux Bio Group, Inc. Apparatus and method for molecular separation, purification, and sensing
AU2009320006A1 (en) * 2008-10-30 2010-06-03 Peixuan Guo Membrane-integrated viral DNA-packaging motor protein connector biosensor for DNA sequencing and other uses
GB201313477D0 (en) * 2013-07-29 2013-09-11 Univ Leuven Kath Nanopore biosensors for detection of proteins and nucleic acids
SG10202111841TA (en) * 2014-02-19 2021-12-30 Univ Washington Nanopore-based analysis of protein characteristics
WO2016187159A2 (en) * 2015-05-15 2016-11-24 Two Pore Guys, Inc. Methods and compositions for target detection in a nanopore using a labelled polymer scaffold
CN108350038A (zh) * 2015-09-18 2018-07-31 University of Kentucky Research Foundation Protein variants for use as lipid bilayer-integrated nanopores, and methods thereof
WO2017167811A1 (en) * 2016-03-31 2017-10-05 Genia Technologies, Inc. Nanopore protein conjugates and uses thereof
WO2017184866A1 (en) * 2016-04-21 2017-10-26 F. Hoffmann-La Roche Ag Alpha-hemolysin variants and uses thereof

Also Published As

Publication number Publication date
CA3091126A1 (en) 2019-08-15
US20200399693A1 (en) 2020-12-24
JP7353290B2 (ja) 2023-09-29
JP2021514189A (ja) 2021-06-10
EP3752284A1 (en) 2020-12-23
CN112423878A (zh) 2021-02-26
WO2019157424A1 (en) 2019-08-15
AU2019218263A1 (en) 2020-09-03
EP3752284A4 (en) 2022-05-25

Similar Documents

Publication Publication Date Title
CN112423878B (zh) Nanopore assemblies and uses thereof
US20240199711A1 (en) Mutant lysenin pores
US11945840B2 (en) Protein pores
JP2021078500A (ja) 変異体ポア
CN102834716B (zh) Artificial mycolic acid membranes
CN113195736A (zh)
US10648966B2 (en) Lipid bilayer-integrated SPP1 connector protein nanopore and SPP1 connector protein variants for use as lipid bilayer-integrated nanopore
Pavlenok et al. Control of subunit stoichiometry in single-chain MspA nanopores
AU2021206796A1 (en) Compositions and methods for improving nanopore sequencing
CN103242435B (zh) 一种亲和兼容性链霉亲和素突变体及其制备方法
CN113677693A (zh)
KR100979282B1 (ko) 실리카 결합단백질을 이용한 바이오-실리카 칩 및 그제조방법
Haque et al. Membrane-embedded channel of bacteriophage phi29 DNA-packaging motor for translocation and sensing of double-stranded DNA
WO2024033421A2 (en) Novel pore monomers and pores
Geng Membrane embedded channel of bacteriophage phi29 DNA packaging motor for single molecule sensing and nanomedicine
WO2024033422A1 (en) Novel pore monomers and pores
CN118056019A (zh) Engineered nanopore with a negatively charged polymer threaded through the channel

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
Address after: Oxford
Applicant after: Oxford nanopore Technology Public Co.,Ltd.
Address before: Oxford
Applicant before: Oxford nanopore technology Co.
GR01 Patent grant