TW202317601A - Compositions and methods for modulating myc expression - Google Patents
Compositions and methods for modulating myc expression Download PDFInfo
- Publication number
- TW202317601A TW202317601A TW111112954A TW111112954A TW202317601A TW 202317601 A TW202317601 A TW 202317601A TW 111112954 A TW111112954 A TW 111112954A TW 111112954 A TW111112954 A TW 111112954A TW 202317601 A TW202317601 A TW 202317601A
- Authority
- TW
- Taiwan
- Prior art keywords
- sequence
- expression
- seq
- nucleic acid
- inhibitor
- Prior art date
Links
Images
Landscapes
- Medicines Containing Material From Animals Or Micro-Organisms (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
Description
基因表現的異常調控為許多疾病(例如哺乳動物,例如人類)之潛在病因,例如贅瘤、神經病症、代謝障礙及肥胖症。轉錄因子MYC的調節異常在多種人類腫瘤及慢性肝病中起主要作用。MYC蛋白因多種因素(例如缺乏所定義的配位體結合位點,該配位體結合位點為維持正常組織而必需的生理功能)而被視為「不可成藥」。適於調節MYC基因表現的技術為治療此等疾病提供可行的替代方法。需要新穎的工具、系統及方法來穩定地改變(例如降低)疾病相關基因(諸如MYC)的表現。Aberrant regulation of gene expression is an underlying cause of many diseases such as neoplasms, neurological disorders, metabolic disorders and obesity in mammals such as humans. Dysregulation of the transcription factor MYC plays a major role in various human tumors and chronic liver diseases. The MYC protein is considered "undruggable" due to a number of factors, such as the lack of a defined ligand-binding site that is necessary for physiological functions to maintain normal tissues. Technologies suitable for modulating the expression of the MYC gene provide viable alternatives for the treatment of these diseases. Novel tools, systems and methods are needed to stably alter (eg, reduce) the expression of disease-associated genes such as MYC.
本發明尤其提供可用於調節(例如降低)靶基因(例如MYC)表現的表現抑制子及表現抑制系統。In particular, the present invention provides expression suppressors and expression suppression systems useful for modulating (eg, reducing) the expression of a target gene (eg, MYC).
在一些態樣中,本發明提供一種表現抑制子,其包含結合至靶基因啟動子(例如MYC啟動子)的靶向部分及視情況存在的效應部分,其中該表現抑制子能夠降低靶基因(例如MYC)表現。In some aspects, the invention provides an expression suppressor comprising a targeting moiety that binds to a target gene promoter (e.g., the MYC promoter) and optionally an effector moiety, wherein the expression suppressor is capable of reducing the target gene ( such as MYC) performance.
在一些態樣中,本發明提供一種表現抑制子,其包含:結合靶基因座(例如MYC)的靶向部分及包含MQ1的效應部分或其片段或變異體,其中該表現抑制子能夠降低靶基因(例如MYC)表現。In some aspects, the invention provides an expression suppressor comprising: a targeting moiety that binds to a target locus (e.g., MYC) and an effector portion comprising MQ1, or a fragment or variant thereof, wherein the expression suppressor is capable of reducing the target Gene (eg MYC) expression.
在一些態樣中,本發明提供一種表現抑制子,其包含:結合至位於MYC之超級增強子區域中之調控元件的靶向部分及視情況存在的效應部分,其中該表現抑制子能夠降低MYC表現。In some aspects, the invention provides an expression inhibitor comprising: a targeting moiety and optionally an effector moiety that binds to a regulatory element located in a super-enhancer region of MYC, wherein the expression inhibitor is capable of reducing MYC Performance.
在一些態樣中,本發明提供一種表現抑制子,其包含:結合至位於靶基因(例如MYC)之超級增強子區域中之調控元件的靶向部分及效應部分(例如KRAB或MQ1,或其片段或變異體),其中該表現抑制子能夠降低靶基因(例如MYC)表現。In some aspects, the invention provides an expression suppressor comprising: a targeting moiety and an effector moiety (such as KRAB or MQ1, or fragment or variant), wherein the expression suppressor is capable of reducing the expression of a target gene (eg MYC).
在一些態樣中,本發明提供一種表現抑制子,其包含:結合位於靶基因(例如MYC)之超級增強子區域中之調控元件的靶向部分,其中該靶向部分包含鋅指域,其中該表現抑制子能夠降低靶基因(例如MYC)表現。In some aspects, the invention provides an expression suppressor comprising: a targeting moiety that binds a regulatory element located in a super-enhancer region of a target gene (e.g., MYC), wherein the targeting moiety comprises a zinc finger domain, wherein The expression suppressor is capable of reducing the expression of a target gene (eg MYC).
在一些態樣中,本發明提供一種表現抑制子,其包含:結合位於MYC之超級增強子區域中之調控元件的靶向部分,其中該靶向部分包含鋅指域或TAL效應域;以及效應部分,其中該效應部分包含轉錄抑制子(例如KRAB或其片段或變異體)或DNA甲基轉移酶(例如MQ1或其片段或變異體);其中該表現抑制子能夠降低MYC表現。In some aspects, the invention provides an expression suppressor comprising: a targeting moiety that binds a regulatory element located in a super-enhancer region of MYC, wherein the targeting moiety comprises a zinc finger domain or a TAL effector domain; and an effector Part, wherein the effector part comprises a transcriptional repressor (such as KRAB or a fragment or variant thereof) or a DNA methyltransferase (such as MQ1 or a fragment or variant thereof); wherein the expression repressor is capable of reducing MYC expression.
在一些態樣中,本發明提供一種表現抑制子,其包含:結合靶基因座(例如MYC)的靶向部分,其中該靶向部分包含鋅指域,其中該表現抑制子能夠降低靶基因(例如MYC)表現。In some aspects, the invention provides an expression suppressor comprising: a targeting moiety that binds to a target locus (e.g., MYC), wherein the targeting moiety comprises a zinc finger domain, wherein the expression suppressor is capable of reducing the target gene ( such as MYC) performance.
在一些態樣中,本發明提供表現抑制子,其包含:結合基因體基因座的靶向部分,該基因座包含SEQ ID NO: 1、3、97、98、99、100、101、102、103、104、105、106、107、109、110或75、76、78、79、80、81、84、85、86之序列中的至少14、15、16、17、18、19或20個核苷酸,其中該表現抑制子能夠降低MYC表現。In some aspects, the invention provides an expression suppressor comprising: a targeting moiety that binds to a gene body locus comprising SEQ ID NO: 1, 3, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 109, 110 or at least 14, 15, 16, 17, 18, 19 or 20 of the sequence of 75, 76, 78, 79, 80, 81, 84, 85, 86 Nucleotides, wherein the expression inhibitor is capable of reducing MYC expression.
在一些態樣中,本發明提供一種表現抑制子,其包含:結合基因體基因座的靶向部分,該基因座包含SEQ ID NO: 2或77、82、83之序列中的至少16、17、18、19或20個核苷酸,且其中該表現抑制子能夠降低靶基因(例如MYC)表現。在一些實施例中,表現抑制子包含效應部分。In some aspects, the invention provides an expression suppressor comprising: a targeting moiety that binds to a gene body locus comprising at least 16, 17 of the sequence of SEQ ID NO: 2 or 77, 82, 83 , 18, 19 or 20 nucleotides, and wherein the expression suppressor is capable of reducing expression of a target gene (eg MYC). In some embodiments, the expression suppressor comprises an effector moiety.
在一些態樣中,本發明提供包含靶向部分的表現抑制子,其中該靶向部分結合SEQ ID NO: 4上游或下游1400 nt內的基因體基因座。In some aspects, the invention provides expression suppressors comprising a targeting moiety, wherein the targeting moiety binds to a gene body locus within 1400 nt upstream or downstream of SEQ ID NO:4.
在一些態樣中,本發明提供包含靶向部分的表現抑制子,其中該靶向部分結合包含SEQ ID NO: 4、77、82或83之序列中之至少14、15、16、17、18、19或20個核苷酸的基因體基因座。In some aspects, the invention provides an expression inhibitor comprising a targeting moiety, wherein the targeting moiety binds at least 14, 15, 16, 17, 18 of a sequence comprising SEQ ID NO: 4, 77, 82, or 83 , 19 or 20 nucleotide gene body loci.
在一些態樣中,本發明提供包含靶向部分的表現抑制子,其中該靶向部分結合包含SEQ ID NO: 83、96或108之序列中之至少14、15、16、17、18、19或20個核苷酸的基因體基因座。In some aspects, the invention provides an expression inhibitor comprising a targeting moiety, wherein the targeting moiety binds at least 14, 15, 16, 17, 18, 19 of a sequence comprising SEQ ID NO: 83, 96 or 108 or 20 nucleotide gene body loci.
在一些態樣中,本發明提供一種系統,其包含:包含第一靶向部分及視情況存在之第一效應部分的第一表現抑制子,其中該第一表現抑制子結合至可操作地連接至靶基因(例如MYC)的轉譯調控元件(例如啟動子或轉錄起點(TSS)),或結合至與轉譯調控元件近接的序列;以及包含第二靶向部分及視情況存在之第二效應部分的第二表現抑制子,其中該第二表現抑制子結合至包含靶基因(例如MYC)之錨定序列介導性接合體(ASMC)中的錨定序列,或結合至與錨定序列近接的序列。In some aspects, the invention provides a system comprising: a first expression inhibitor comprising a first targeting moiety and optionally a first effector moiety, wherein the first expression inhibitor is bound to an operably linked To a translational regulatory element (such as a promoter or transcriptional start (TSS)) of a target gene (such as MYC), or to a sequence in close proximity to a translational regulatory element; and comprising a second targeting moiety and optionally a second effector moiety wherein the second expression suppressor binds to an anchor sequence in an anchor sequence-mediated adapter (ASMC) comprising a target gene (e.g., MYC), or to an anchor sequence in close proximity to the anchor sequence sequence.
在一些態樣中,本發明提供一種系統,其包含:包含第一靶向部分及視情況存在之第一效應部分的第一表現抑制子,其中該第一表現抑制子結合至可操作地連接至靶基因(例如MYC)的轉譯調控元件(例如啟動子或轉錄起點(TSS)),或結合至與轉譯調控元件近接的序列;以及包含第二靶向部分及視情況存在之第二效應部分的第二表現抑制子,其中該第二表現抑制子結合至位於靶基因(例如MYC)之超級增強子區域中的基因體基因座。In some aspects, the invention provides a system comprising: a first expression inhibitor comprising a first targeting moiety and optionally a first effector moiety, wherein the first expression inhibitor is bound to an operably linked To a translational regulatory element (such as a promoter or transcriptional start (TSS)) of a target gene (such as MYC), or to a sequence in close proximity to a translational regulatory element; and comprising a second targeting moiety and optionally a second effector moiety wherein the second expression suppressor binds to a gene body locus located in a super-enhancer region of a target gene (eg, MYC).
在某些實施例中,第一靶向部分特異性結合第一DNA序列且第二靶向部分特異性結合不同於第一DNA序列的第二DNA序列。在某些實施例中,第一效應部分不同於第二效應部分。In certain embodiments, the first targeting moiety specifically binds a first DNA sequence and the second targeting moiety specifically binds a second DNA sequence different from the first DNA sequence. In certain embodiments, the first effect moiety is different than the second effect moiety.
在一些態樣中,本發明提供一種表現抑制子,其包含:結合至可操作地連接至靶基因(例如MYC)之轉譯調控元件(例如啟動子或轉錄起點(TSS))或結合至與該轉譯調控元件近接之序列的靶向部分,該靶向部分包含CRISPR/Cas分子,例如包含無催化活性CRISPR/Cas蛋白;以及包含MQ1或其功能變異體或片段的效應部分。In some aspects, the invention provides an expression repressor comprising: binding to a translational regulatory element (such as a promoter or transcription start (TSS)) operably linked to a target gene (such as MYC) or binding to the A targeting portion of a sequence in close proximity to the translational regulatory element, the targeting portion comprising a CRISPR/Cas molecule, for example comprising a catalytically inactive CRISPR/Cas protein; and an effector portion comprising MQ1 or a functional variant or fragment thereof.
在一些態樣中,本發明提供一種表現抑制子,其包含:結合至位於靶基因(例如MYC)之超級增強子區域中之基因體基因座的靶向部分,該靶向部分包含CRISPR/Cas分子,例如包含無催化活性CRISPR/Cas蛋白;以及包含KRAB、MQ1或其功能變異體或片段的效應部分,其中該表現抑制子能夠降低靶基因(例如MYC)表現。In some aspects, the invention provides an expression suppressor comprising: a targeting moiety comprising CRISPR/Cas that binds to a gene body locus located in a super-enhancer region of a target gene (e.g., MYC) Molecules, for example comprising a catalytically inactive CRISPR/Cas protein; and effector moieties comprising KRAB, MQ1 or functional variants or fragments thereof, wherein the expression suppressor is capable of reducing expression of a target gene (eg MYC).
在一些態樣中,本發明提供一種表現抑制子,其包含:結合至包含靶基因(例如MYC)之錨定序列介導性接合體(ASMC)中之錨定序列或結合至與錨定序列近接之序列的靶向部分,該靶向部分包含CRISPR/Cas分子,例如包含無催化活性CRISPR/Cas蛋白;以及包含KRAB或其功能變異體或片段的效應部分。In some aspects, the invention provides an expression suppressor comprising: binding to an anchor sequence in an anchor sequence-mediated adapter (ASMC) comprising a target gene (eg, MYC) or binding to an anchor sequence A targeting moiety of adjacent sequences, the targeting moiety comprising a CRISPR/Cas molecule, for example comprising a catalytically inactive CRISPR/Cas protein; and an effector moiety comprising KRAB or a functional variant or fragment thereof.
在一些態樣中,本發明提供一種表現抑制子,其包含:包含鋅指分子的靶向部分,該靶向部分結合至可操作地連接至靶基因(例如MYC)的轉譯調控元件(例如啟動子或轉錄起點(TSS))或結合至與該轉譯調控元件近接的序列;以及包含MQ1或其功能變異體或片段的效應部分。In some aspects, the invention provides an expression suppressor comprising: a targeting moiety comprising a zinc finger molecule that binds to a translational regulatory element (eg, a promoter or transcriptional start (TSS)) or to a sequence in close proximity to the translational regulatory element; and an effector portion comprising MQ1 or a functional variant or fragment thereof.
在一些態樣中,本發明提供一種表現抑制子,其包含:包含鋅指分子的靶向部分,該靶向部分結合至包含靶基因(例如MYC)之錨定序列介導性接合體(ASMC)中的錨定序列,或結合至與錨定序列近接的序列;及包含KRAB或其功能變異體或片段的效應部分。In some aspects, the invention provides an expression suppressor comprising: a targeting moiety comprising a zinc finger molecule that binds to an anchor sequence-mediated adapter (ASMC) comprising a target gene (eg, MYC) ), or bind to a sequence in close proximity to the anchor sequence; and an effector portion comprising KRAB or a functional variant or fragment thereof.
在一些態樣中,本發明提供一種表現抑制子,其包含:包含鋅指分子的靶向部分,該靶向部分結合至位於靶基因(例如MYC)之超級增強子區域中的基因體基因座;及包含KRAB或其功能變異體或片段的效應部分。In some aspects, the invention provides an expression suppressor comprising: a targeting moiety comprising a zinc finger molecule that binds to a gene body locus located in a super-enhancer region of a target gene (eg, MYC) ; and an effector portion comprising KRAB or a functional variant or fragment thereof.
在一些態樣中,本發明係關於一種編碼第一表現抑制子、第二表現抑制子、此兩者的核酸,或其組分(例如gRNA、mRNA)。在一些實施例中,編碼表現抑制系統的核酸為多順反子序列。在一些實施例中,多順反子序列為雙順反子序列。In some aspects, the invention relates to a nucleic acid encoding a first suppressor of expression, a second suppressor of expression, both, or a component thereof (eg, gRNA, mRNA). In some embodiments, the nucleic acid encoding the expression suppression system is a polycistronic sequence. In some embodiments, the polycistronic sequence is a dicistronic sequence.
在一些態樣中,本發明係關於一種載體,其包含本文所述之核酸、系統或表現抑制子。在另一態樣中,本發明係關於一種脂質奈米粒子,其包含本文所述之載體、核酸、系統或表現抑制子。在另一態樣中,本發明係關於一種反應混合物,其包含本文所述之表現抑制子、系統、核酸、載體或脂質奈米粒子。在另一態樣中,本發明係關於一種醫藥組合物,其包含本文所述之表現抑制子、系統、核酸、載體、脂質奈米粒子或反應混合物。In some aspects, the invention relates to a vector comprising a nucleic acid, system or expression suppressor described herein. In another aspect, the present invention relates to a lipid nanoparticle comprising the vector, nucleic acid, system or expression inhibitor described herein. In another aspect, the invention relates to a reaction mixture comprising the expression inhibitors, systems, nucleic acids, vectors or lipid nanoparticles described herein. In another aspect, the present invention relates to a pharmaceutical composition comprising the expression inhibitor, system, nucleic acid, vector, lipid nanoparticle or reaction mixture described herein.
在一些態樣中,本發明係關於一種降低靶基因表現的方法,其包含提供本文所述之表現抑制子或表現抑制系統及使靶基因及/或一或多種可操作地連接的轉錄控制元件與表現抑制子或表現抑制系統接觸,藉此降低靶基因表現。In some aspects, the invention relates to a method of reducing expression of a target gene comprising providing an expression suppressor or expression suppression system as described herein and enabling the target gene and/or one or more operably linked transcriptional control elements Exposure to an expression suppressor or expression suppression system, thereby reducing expression of the target gene.
在一些態樣中,本發明係關於一種治療個體之與靶基因(例如MYC)過度表現相關之病狀的方法,其包含將本文所述之表現抑制子或系統、核酸或載體投與該個體,藉此治療該病狀。In some aspects, the invention relates to a method of treating a condition associated with overexpression of a target gene (eg, MYC) in an individual comprising administering to the individual an expression inhibitor or system, nucleic acid or vector described herein , to treat the condition.
在一些態樣中,本發明係關於一種治療個體之與靶基因(例如MYC)之異常調控相關之病狀的方法,其包含向該個體投與本文所述之表現抑制子、系統、核酸或載體,藉此治療該病狀。In some aspects, the invention relates to a method of treating a condition in an individual that is associated with aberrant regulation of a target gene (e.g., MYC), comprising administering to the individual an expression inhibitor, system, nucleic acid, or expression inhibitor described herein. carrier, whereby the condition is treated.
在一些態樣中,本發明提供一種降低靶基因(例如MYC)在細胞中之表現的方法,該方法包含:使該細胞與一系統接觸,藉此降低靶基因(例如MYC)在細胞中之表現,該系統包含:包含第一靶向部分及視情況存在之第一效應部分的第一表現抑制子,其中該第一表現抑制子結合至可操作地連接至靶基因(例如MYC)的轉譯調控元件(例如啟動子或轉錄起點(TSS));以及包含第二靶向部分及視情況存在之第二效應部分的第二表現抑制子,其中該第二表現抑制子結合至包含靶基因(例如MYC)之錨定序列介導性接合體(ASMC)中的錨定序列或結合至與錨定序列近接的序列。In some aspects, the invention provides a method of reducing expression of a target gene (eg, MYC) in a cell, the method comprising: contacting the cell with a system whereby expression of the target gene (eg, MYC) in the cell is decreased expression, the system comprising: a first expression repressor comprising a first targeting moiety and optionally a first effector moiety, wherein the first expression repressor binds to a translational expression operably linked to a target gene (e.g., MYC) a regulatory element (such as a promoter or a transcription start (TSS)); and a second expression repressor comprising a second targeting moiety and optionally a second effector moiety, wherein the second expression repressor binds to a gene comprising the target gene ( For example, the anchor sequence in the anchor sequence-mediated adapter (ASMC) of MYC) or binds to a sequence in close proximity to the anchor sequence.
在一些態樣中,本發明提供一種降低靶基因(例如MYC)在細胞中之表現的方法,該方法包含:使該細胞與一系統接觸,藉此降低靶基因(例如MYC)在細胞中的表現,該系統包含:包含第一靶向部分及視情況存在之第一效應部分的第一表現抑制子,其中該第一表現抑制子結合至可操作地連接至靶基因(例如MYC)的轉譯調控元件(例如啟動子或轉錄起點(TSS));以及包含第二靶向部分及視情況存在之第二效應部分的第二表現抑制子,其中該第二表現抑制子結合至位於靶基因(例如MYC)之超級增強子區域中的基因體基因座。In some aspects, the invention provides a method of reducing expression of a target gene (eg, MYC) in a cell, the method comprising: contacting the cell with a system whereby expression of the target gene (eg, MYC) in the cell is decreased expression, the system comprising: a first expression repressor comprising a first targeting moiety and optionally a first effector moiety, wherein the first expression repressor binds to a translational expression operably linked to a target gene (e.g., MYC) a regulatory element (such as a promoter or a transcription start (TSS)); and a second expression repressor comprising a second targeting moiety and optionally a second effector moiety, wherein the second expression repressor binds to a gene located on the target gene ( For example, a gene body locus in the super-enhancer region of MYC).
本發明另外部分地提供一種套組,其包含:a)容器,該容器包含含有表現抑制子的組合物,該表現抑制子包含結合至靶基因、啟動子(例如MYC)的靶向部分及能夠調節(例如降低)靶基因(例如MYC)表現的效應部分;及b)一組說明書,其包含用該組合物調節靶基因(例如MYC)在細胞內之表現的至少一種方法。The invention additionally provides in part a kit comprising: a) a container comprising a composition comprising an expression suppressor comprising a targeting moiety that binds to a target gene, a promoter (eg MYC) and capable of an effector portion that modulates (eg reduces) expression of a target gene (eg MYC); and b) a set of instructions comprising at least one method of modulating expression of a target gene (eg MYC) in a cell using the composition.
本發明另外部分地提供一種套組,其包含:a)容器,該容器包含含有表現抑制子的組合物,該表現抑制子包含結合至位於靶基因(例如MYC)之超級增強子區域中之基因座的靶向部分及能夠調節(例如降低)靶基因(例如MYC)表現的效應部分;以及b)一組說明書,其包含用該組合物調節靶基因(例如MYC)在細胞內之表現的至少一種方法。The invention additionally provides, in part, a kit comprising: a) a container comprising a composition comprising an expression suppressor comprising a gene that binds to a super-enhancer region located in a target gene (eg, MYC) A targeting moiety of a locus and an effector moiety capable of modulating (e.g. reducing) the expression of a target gene (e.g. MYC); and b) a set of instructions comprising at least one method for regulating the expression of a target gene (e.g. MYC) in a cell with the composition a way.
在一些態樣中,套組包含a)容器,該容器包含含有一系統的組合物,該系統包含兩種表現抑制子,包含含有第一靶向部分及視情況存在之第一效應部分的第一表現抑制子,其中該第一表現抑制子結合至可操作地連接至靶基因(例如MYC)的轉譯調控元件(例如啟動子或轉錄起點(TSS))或結合至與轉譯調控元件近接的序列;以及含有第二靶向部分及視情況存在之第二效應部分的表現抑制子,其中第二表現抑制子結合至包含靶基因(例如MYC)之錨定序列介導性接合體(ASMC)中的錨定序列或結合至與錨定序列近接的序列。In some aspects, the kit comprises a) a container comprising a composition comprising a system comprising two expression inhibitors comprising a first targeting moiety and optionally a first effector moiety. An expression repressor, wherein the first expression repressor binds to a translational regulatory element (e.g., a promoter or a transcriptional start (TSS)) operably linked to a target gene (e.g., MYC) or to a sequence in close proximity to a translational regulatory element and an expression suppressor comprising a second targeting moiety and optionally a second effector moiety, wherein the second expression suppressor binds to an anchor sequence-mediated adapter (ASMC) comprising a target gene (eg, MYC) anchor sequence or binds to a sequence in close proximity to the anchor sequence.
在一些態樣中,套組包含a)容器,該容器包含含有一系統的組合物,該系統包含兩種表現抑制子,包含含有第一靶向部分及視情況存在之第一效應部分的第一表現抑制子,其中該第一表現抑制子結合至可操作地連接至靶基因(例如MYC)的轉譯調控元件(例如啟動子或轉錄起點(TSS))或結合至與轉譯調控元件近接的序列;以及含有第二靶向部分及視情況存在之第二效應部分的表現抑制子,其中該第二表現抑制子結合至位於靶基因(例如MYC)之超級增強子區域中的基因體基因座。In some aspects, the kit comprises a) a container comprising a composition comprising a system comprising two expression inhibitors comprising a first targeting moiety and optionally a first effector moiety. An expression repressor, wherein the first expression repressor binds to a translational regulatory element (e.g., a promoter or a transcriptional start (TSS)) operably linked to a target gene (e.g., MYC) or to a sequence in close proximity to a translational regulatory element and an expression suppressor comprising a second targeting moiety and optionally a second effector moiety, wherein the second expression suppressor binds to a gene body locus located in a super-enhancer region of a target gene (eg, MYC).
在一些實施例中,套組進一步包含b)一組說明書,其包含用該組合物治療疾病或調節(例如降低)靶基因(例如MYC)在細胞內之表現的至少一種方法。在一些實施例中,套組可視情況包括用於該組合物的遞送媒劑(例如脂質奈米粒子)。可提供懸浮於賦形劑及/或遞送媒劑中的試劑,或試劑可作為各別組分提供,之後可與賦形劑及/或遞送媒劑合併。在一些實施例中,套組可視情況含有與組合物共投與的其他治療劑,以影響所需靶基因表現,例如調節MYC基因表現。雖然說明材料典型地為書面或印刷材料,但其不限於此。本發明考慮了能夠儲存此類說明書且將其傳達至最終使用者之任何介質。此類介質包括(但不限於)電子儲存介質(例如磁盤、磁帶、盒式磁盤、晶片)、光學介質(例如CD ROM)及其類似物。此類介質可包括提供此類說明材料之網際網路站點的地址。In some embodiments, the kit further comprises b) a set of instructions comprising at least one method of using the composition to treat a disease or modulate (eg, reduce) the expression of a target gene (eg, MYC) in a cell. In some embodiments, the kit optionally includes a delivery vehicle (eg, lipid nanoparticles) for the composition. The agent may be provided suspended in the excipient and/or delivery vehicle, or the agent may be provided as a separate component which may then be combined with the excipient and/or delivery vehicle. In some embodiments, the kit optionally contains other therapeutic agents co-administered with the composition to affect desired target gene expression, eg, modulate MYC gene expression. While instructional material is typically written or printed material, it is not limited to such. The present invention contemplates any medium capable of storing and communicating such instructions to the end user. Such media include, but are not limited to, electronic storage media (eg, magnetic disks, tapes, cartridges, wafers), optical media (eg, CD ROM), and the like. Such media may include addresses to Internet sites that provide such instructional materials.
任一種前述方法或組合物的其他特徵包括以下列舉的一或多個實施例。Additional features of any of the foregoing methods or compositions include one or more of the embodiments listed below.
熟習此項技術者將認識到,或僅使用常規實驗便能夠確定本文所述之本發明之特定實施例的許多等效物。以下列舉的實施例意欲涵蓋此類等效物。Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. The examples listed below are intended to cover such equivalents.
本文提及的所有公開案、專利申請案、專利及其他參考文獻(例如序列資料庫參考編號)均以全文引用的方式併入。舉例而言,本文(例如本文的任何表中)所提及之所有GenBank、Unigene及Entrez序列以引用之方式併入。除非另外指明,否則本文中(包括本文的任何表中)指定之序列寄存編號係指截至2020年12月15日之資料庫條目。當一種基因或蛋白質提及複數個序列寄存編號時,涵蓋所有序列變異體。All publications, patent applications, patents, and other references (eg, sequence database reference numbers) mentioned herein are incorporated by reference in their entirety. For example, all GenBank, Unigene and Entrez sequences mentioned herein (eg, in any table herein) are incorporated by reference. Unless otherwise indicated, serial deposit numbers specified herein (including in any tables herein) refer to database entries as of December 15, 2020. When multiple sequence accession numbers are mentioned for a gene or protein, all sequence variants are covered.
所列舉的實施例1. 一種表現抑制子,其包含:
結合至MYC啟動子的靶向部分,及
視情況存在的效應部分,
其中該表現抑制子能夠減少MYC表現。
2. 如實施例1之表現抑制子,其中該靶向部分結合SEQ ID NO: 4、199或201上游或下游1400、1200、1000、800、600、400或200 nt內的基因體基因座。
3. 如實施例1之表現抑制子,其中該靶向部分結合的基因體基因座包含序列SEQ ID NO: 4、77、82、83、85、199或201的至少14、15、16、17、18、19或20個核苷酸。
4. 一種表現抑制子,其包含:
靶向部分,其結合的基因體基因座包含序列SEQ ID NO: 3、97、98、99、100、101、102、103、104、105、106、107、109、110、75、76、78、79、80、81、84、85、86、190、191、192、200或202的至少14、15、16、17、18、19或20個核苷酸;及
視情況存在的效應部分,
其中該表現抑制子能夠減少MYC表現。
5. 一種表現抑制子,其包含:
靶向部分,其結合的基因體基因座包含序列SEQ ID NO: 2、77、82、83、199或201之至少16、17、18、19或20個核苷酸,及
視情況存在的效應部分,
其中該表現抑制子能夠減少MYC表現。
6. 一種表現抑制子,其包含:
結合MYC基因座的靶向部分,及
包含MQ1或其片段或變異體的效應部分,
其中該表現抑制子能夠減少MYC表現。
7. 一種表現抑制子,其包含:
靶向部分,其結合MYC超級增強子區域中之基因座,
視情況存在的效應部分,例如包含DNA甲基轉移酶的效應部分,其中視情況地,該效應部分包含MQ1或其片段或變異體,
其中該表現抑制子能夠減少MYC表現。
8. 一種表現抑制子,其包含:
靶向部分,其結合MYC超級增強子區域中之基因座,
包含轉錄抑制子的效應部分,其中視情況,該效應部分包含KRAB或其片段或變異體,
其中該表現抑制子能夠減少MYC表現。
9. 如實施例7或8之表現抑制子,其中該靶向部分結合的基因體基因座包含SEQ ID NO: 96-110、83、199、201中之任一者之序列的至少14、15、16、17、18、19或20個核苷酸。
10. 如實施例7至9中任一例之表現抑制子,其中該靶向部分結合的基因體基因座包含使用hg19參考基因體之序列GRCh37:chr8:129162465-129212140中之至少14、15、16、17、18、19或20個核苷酸。
11. 如實施例7至10中任一例之表現抑制子,其中該靶向部分結合的基因體基因座包含序列SEQ ID NO: 96或108中之至少14、15、16、17、18、19或20個核苷酸。
12. 如實施例7至11中任一例之表現抑制子,其中該靶向部分包含鋅指域或TAL效應域。
13. 一種表現抑制子,其包含:
靶向部分,其結合基因座,例如MYC基因座,
包含EZH2或其片段或變異體的第一效應部分,及
包含KRAB或其片段或變異體的第二效應部分,
其中該表現抑制子能夠減少該基因座的表現,例如減少MYC表現。
14. 如實施例13之表現抑制子,其中該靶向部分結合MYC啟動子、超級增強子區域或錨定序列。
15. 如實施例13或14之表現抑制子,其中該靶向部分包含TAL效應域、CRISPR/Cas域或鋅指域。
16. 如實施例13至15中任一例之表現抑制子,其中第一效應部分位於第二效應子的N端,或其中第一效應子位於第二效應部分的C端。
17. 一種表現抑制子,其包含:
結合MYC基因座的靶向部分,其中該靶向部分包含鋅指域,及
視情況存在的效應部分,
其中該表現抑制子能夠減少MYC表現。
18. 一種表現抑制子,其包含:
靶向部分,其包含CRISPR/Cas域,例如包含無催化活性的CRISPR/Cas蛋白質,該靶向部分結合至可操作地連接至MYC基因的轉譯調控元件(例如啟動子、增強子、超級增強子或轉錄起點(TSS))或近接於該轉譯調控元件的序列;及
包含MQ1或其功能變異體或片段的效應部分。
19. 一種表現抑制子,其包含:
靶向部分,其包含CRISPR/Cas域,例如包含無催化活性的CRISPR/Cas蛋白質,該靶向部分結合至可操作地連接至MYC基因的轉譯調控元件(例如啟動子、增強子或轉錄起點(TSS))或近接於該轉譯調控元件的序列;及
包含MQ1或其功能變異體或片段的效應部分。
20. 一種表現抑制子,其包含:
靶向部分,其包含CRISPR/Cas域,例如包含無催化活性的CRISPR/Cas蛋白質,該靶向部分結合至可操作地連接至MYC基因的轉譯調控元件(例如啟動子、增強子或轉錄起點(TSS))或近接於該轉譯調控元件的序列;及
包含KRAB或其功能變異體或片段的效應部分。
21. 一種表現抑制子,其包含:
靶向部分,其包含CRISPR/Cas域,例如包含無催化活性的CRISPR/Cas蛋白質,其結合至包含MYC基因之錨定序列介導性接合體(ASMC)的錨定序列,或結合至近接於該錨定序列的序列;及
包含KRAB或其功能變異體或片段的效應部分。
22. 一種表現抑制子,其包含:
包含鋅指域的靶向部分,該鋅指域結合至可操作地連接至MYC基因的轉譯調控元件(例如啟動子、增強子或轉錄起點(TSS))或近接於該轉譯調控元件的序列;及
包含MQ1或其功能變異體或片段的效應部分。
23. 一種表現抑制子,其包含:
包含鋅指域的靶向部分,該鋅指域結合至可操作地連接至MYC基因的轉譯調控元件(例如啟動子、增強子或轉錄起點(TSS))或近接於該轉譯調控元件的序列;及
包含KRAB或其功能變異體或片段的效應部分。
24. 一種表現抑制子,其包含:
靶向部分,其結合的小鼠基因體基因座包含SEQ ID NO: 190-192中之任一者之序列的至少14、15、16、17、18、19或20個核苷酸;及
視情況存在的效應部分,
其中該表現抑制子能夠減少MYC表現。
25. 如技術方案24之表現抑制子,其中該效應部分包含DNA甲基轉移酶,例如MQ1或其片段或變異體。
26. 如實施例24或25之表現抑制子,其中該靶向部分包含TAL效應域、CRISPR/Cas域、鋅指域、tetR域、巨核酸酶域或寡核苷酸。
27. 如實施例24至26中任一例之表現抑制子,其中該靶向部分包含鋅指域或TAL效應域。
28. 如實施例24至27中任一例之表現抑制子,其中該表現抑制子包含選自SEQ ID NO: 160-165中之任一者的胺基酸序列,或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
29. 如實施例24至28中任一例之表現抑制子,其中該表現抑制子係由以下編碼:選自SEQ ID NO: 166-168中之任一者的核苷酸序列,或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
30. 如實施例24至29中任一例之表現抑制子,其中該靶向部分包含根據SEQ ID NO: 154-156中之任一者的胺基酸序列,或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
31. 如實施例24至30中任一例之表現抑制子,其中該靶向部分包含根據SEQ ID NO: 157-159中之任一者的核酸序列,或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
32. 如實施例24至31中任一例之表現抑制子,其中該效應部分為持久效應部分。
33. 如實施例24至32中任一例之表現抑制子,其中該效應部分為短暫效應部分。
34. 如實施例24至33中任一例之表現抑制子,其中該表現抑制子為融合分子。
35. 如實施例24至34中任一例之表現抑制子,其中該靶向部分包含鋅指域,且該效應部分包含表觀遺傳修飾部分,例如DNA甲基轉移酶,例如MQ1或其片段或變異體。
36. 如實施例18至20、22或23中任一例之表現抑制子,其中該調控元件為一串調控元件的一部分。
37. 如實施例18至20、22或23中之任一者的表現抑制子,其中該調控元件位於非編碼區域中。
38. 如實施例18至20、22或23中任一例之表現抑制子,其中該調控元件為遠端強化子,例如位於遠離靶基因啟動子(例如MYC)至少1,000 nt處的遠端增強子。
39. 如實施例18至20、22、23或36至38中任一例之表現抑制子,其中該調控元件增強靶基因(例如MYC)表現。
40. 如實施例18至20、22、23或36至39中任一例之表現抑制子,其中該調控元件含有一或多個突變。
41. 如實施例18至20、22、23或36至40中任一例之表現抑制子,其中該調控元件含有至少一種疾病相關單核苷酸多形性(SNP)。
42. 如實施例18至20、22、23或36至41中任一例之表現抑制子,其中該轉錄調控元件經由增強子對接位點與靶基因(例如MYC)之啟動子相互作用。
43. 如實施例42之表現抑制子,其中該增強子對接位點包含根據SEQ ID NO: 71-74中之任一者的核苷酸序列。
44. 一種表現抑制子,其包含:
包含鋅指域的靶向部分,該鋅指域結合至包含MYC基因之錨定序列介導性接合體(ASMC)的錨定序列或結合至近接於該錨定序列的序列;及
包含KRAB或其功能變異體或片段的效應部分。
45. 如實施例1至23或36至43中任一例之表現抑制子,其中該表現抑制子包含選自SEQ ID NO: 22-37、129、133、134、139-149或177-186中之任一者的胺基酸序列,或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
46. 如實施例1至23或36至45中任一例之表現抑制子,其中該表現抑制子係由以下編碼:選自SEQ ID NO: 55-70、130、189或193-197中之任一者的核苷酸序列,或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
47. 如實施例1至23或36至46中任一例之表現抑制子,其中該靶向部分包含根據SEQ ID NO: 5-16或169-172中之任一者的胺基酸序列,或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
48. 如任一前述實施例之表現抑制子,其中該效應部分包含根據SEQ ID NO: 18、19或87之胺基酸序列,或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
49. 如實施例1至12、17至19、22、36至42或44至47中任一例之表現抑制子,其中該效應部分為持久效應部分。
50. 如實施例1至23或36至48中任一例之表現抑制子,其中該效應部分為短暫效應部分。
51. 如實施例1至12、17至19、22、36至42或44至48中任一例之表現抑制子,其中該效應部分包含DNA甲基轉移酶,例如MQ1或其片段或變異體。
52. 如實施例1至23、36至47或49中任一例之表現抑制子,其中該效應部分包含轉錄抑制子,例如包含KRAB或其片段或變異體。
53. 如任一前述實施例之表現抑制子,其中該靶向部分包含TAL效應域、CRISPR/Cas域、鋅指域、tetR域、巨核酸酶域或寡核苷酸。
54. 如實施例53之表現抑制子,其中該CRISPR/Cas域結合gRNA,例如所結合之基因體基因座包含SEQ ID NO: 1-4中任一者之序列之至少14、15、16、17、18、19或20個核苷酸的gRNA,例如其中該gRNA包含包括SEQ ID NO: 1-4中任一者之序列之至少14、15、16、17、18、19或20個核苷酸的序列。
55. 如實施例53之表現抑制子,其中該CRISPR/Cas域結合gRNA,例如所結合之基因體基因座包含SEQ ID NO: 96-110中任一者之序列之至少14、15、16、17、18、19或20個核苷酸的gRNA,例如其中該gRNA包含包括SEQ ID NO: 96-110中任一者之序列之至少14、15、16、17、18、19或20個核苷酸的序列。
56. 如實施例53至55中任一例之表現抑制子,其中該CRISPR/Cas域包含選自表1的Cas蛋白質或Cpf1蛋白質或其任一者之變異體(例如突變體)。
57. 如實施例53至56中任一例之表現抑制子,其中該CRISPR/Cas域包含無催化活性的CRISPR/Cas蛋白質,例如dCas9。
58. 如實施例53之表現抑制子,其中該鋅指域結合的基因體基因座包含SEQ ID NO: 96-110中任一者之序列的至少14、15、16、17、18、19或20個核苷酸,例如其中該gRNA包含包括SEQ ID NO: 96-110中任一者之序列之至少14、15、16、17、18、19或20個核苷酸的序列。
59. 如實施例17、22、26至53或57中任一例之表現抑制子,其中該鋅指域包含1、2、3、4、5、6、7、8、9或10個鋅指(且視情況不超過11、10、9、8、7、6或5個鋅指)。
60. 如實施例17、22、26至53、57或58中任一例之表現抑制子,其中該鋅指域包含1-10、1-9、1-8、1-7、1-6、1-5、1-4、1-3、1-2、2-10、2-9、2-8、2-7、2-6、2-5、2-4、2-3、3-10、3-9、3-8、3-7、3-6、3-5、3-4、4-10、4-9、4-8、4-7、4-6、4-5、5-10、5-9、5-8、5-7、5-6、6-10、6-9、6-8、6-7、7-10、7-9、7-8、8-10、8-9或9-10個鋅指。
61. 如實施例17、22、26至53或57至59中任一例之表現抑制子,其中該鋅指域包含3或9個鋅指。
62. 如任一前述實施例之表現抑制子,其為融合分子。
63. 如任一前述實施例之表現抑制子,其包含位於靶向域與效應域之間的連接子,視情況其中該連接子包含根據SEQ ID NO: 137或SEQ ID NO: 138之胺基序列。
64. 如實施例1至17、20、21、23、44至48、50或52至57中任一例之表現抑制子,其中該靶向部分包含無催化活性的CRISPR/Cas域(例如dCas9)且該效應部分包含轉錄抑制子,例如KRAB或其片段或變異體。
65. 如實施例1至17、20、21、23、44至48、50、52或53至64中任一例之表現抑制子,其中該靶向部分包含鋅指域,且該效應部分包含轉錄抑制子,例如KRAB或其片段或變異體。
66. 如實施例17、36至43、45至47、53或58至63中任一例之表現抑制子,其中該靶向部分包含鋅指域,且該表現抑制子不包含效應部分。
67. 如實施例1至12、18至19、22、36至43、45至49、51或53至57中任一例之表現抑制子,其中該靶向部分包含無催化活性的CRISPR/Cas域(例如dCas9)且該效應部分包含表觀遺傳修飾部分,例如DNA甲基轉移酶,例如MQ1或其片段或變異體。
68. 如實施例1至12、17至19、22、36至43、45至49、51、53或58至63中任一例之表現抑制子,其中該靶向部分包含鋅指域,且該效應部分包含表觀遺傳修飾部分,例如DNA甲基轉移酶,例如MQ1或其片段或變異體。
69. 如任一前述實施例之表現抑制子,其包含SEQ ID NO: 22-37、129、133、134、139-149或177-186中任一者之胺基酸序列,或與其至少80%、85%、90%、95%、96%、97%、98%、99%一致的序列。
70. 如任一前述實施例之表現抑制子,其:(i)包含一或多個核定位信號序列(NLS),或(ii)不包含NLS。
71. 如任一前述實施例之表現抑制子,其包含位於N端的第一NLS,例如其中該第一NLS具有序列SEQ ID NO: 88。
72. 如任一前述實施例之表現抑制子,其包含位於C端的NLS,例如第二NLS,例如具有序列SEQ ID NO: 89。
73. 如任一前述實施例之表現抑制子,其中第一NLS與第二NLS具有相同序列。
74. 如實施例71至73中任一例之表現抑制子,其中第一NLS與第二NLS具有不同序列。
75. 如任一前述實施例之表現抑制子,其包含抗原決定基標籤。
76. 如實施例75之表現抑制子,其中該抗原決定基標籤為HA標籤。
77. 如任一前述實施例之表現抑制子,其中該錨定序列包含序列SEQ ID NO: 71或72,或相對於其具有不超過8、7、6、5、4、3、2或1個變化的序列。
78. 如實施例1至77中任一例之表現抑制子,其中該錨定序列包含根據SEQ ID NO: 73或74之序列,或相對於其具有不超過8、7、6、5、4、3、2或1個變化的序列。
79. 如任一前述實施例之表現抑制子,其中該錨定序列與MYC基因位於同一染色體上。
80. 如任一前述實施例之表現抑制子,其中該錨定序列位於MYC基因上游(例如TSS上游或啟動子上游)。
81. 如任一前述實施例之表現抑制子,其中該錨定序列與MYC基因(例如MYC基因之TSS或啟動子)相距至少1、5、10、50、100或1000千鹼基。
82. 如任一前述實施例之表現抑制子,其中該錨定序列與MYC基因(例如MYC基因之TSS或啟動子)相距0.1-0.5、0.1-1、0.1-5、0.1-10、0.1-50、0.1-100、0.1-500、0.1-1000、0.5-1、0.5-5、0.5-10、0.5-50、0.5-100、0.5-500、0.5-1000、1-5、1-10、1-50、1-100、1-500、1-1000、5-10、5-50、5-100、5-500、5-1000、10-50、10-100、10-500、10-1000、50-100、50-500、50-1000、100-500、100-1000或500-1000千鹼基。
83. 如實施例1至79或81至82中任一例之表現抑制子,其中靶序列位於MYC基因下游(例如TSS下游或啟動子下游)。
84. 如任一前述實施例之表現抑制子,其中該靶向部分結合至位於以下染色體座標處的序列或與其近接的序列:128746342-128746364、128746321-128746343、128746525-128746547、128748014-128748036、129188878-129188900、129188958-129188980、129188960-129188982、129189067-129189089、129189457-129189479、129189554-129189576、129189679-129189701、129209511-129209533、129209643-129209665、129209658-129209680、129209856-129209878、129189452-129189474、129189190-129189212、129189274-129189296、129189421-129189443、128746405-128746425、128748069-128748089、129188825-129188845或129188822-129188842。
85. 如任一前述實施例之表現抑制子,其中相較於表現抑制子不存在下發生的甲基化,該表現抑制子結合至該標靶基因座(例如MYC)使標靶基因座(例如MYC)中之位點的甲基化增加10、20、30、40、50、60、70、80、90或100%,例如如藉由ELISA所量測或如實例7或28中之任一例中所述,其中視情況,針對甲基化分析的位點為根據hg19參考基因體的chr8:129188693-129189048,例如包含根據SEQ ID NO: 123之序列。
86. 如任一前述實施例之表現抑制子,其中該表現抑制子結合至標靶基因座(例如MYC)使標靶基因座(例如MYC)中之位點的甲基化增加以下時段:至少1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24或25天,或至少1、2、3、4、5、6、7、8、9或10次細胞分裂,例如如實例28中所述。
87. 如任一前述實施例之表現抑制子,其中相較於表現抑制子不存在下的表現,該表現抑制子結合至MYC基因座使細胞中的MYC表現減弱10、20、30、40、50、60、70、80、90或100%,例如如藉由ELISA所量測或如實例2至7或9中之任一例中所述。
88. 如任一前述實施例之表現抑制子,其中該表現抑制子結合至MYC基因座使MYC表現明顯地減少以下時段:至少1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24或25天,或至少1、2、3、4、5、6、7、8、9或10次細胞分裂,例如如藉由ELISA所量測或如實例2至7或9中之任一例中所述。
89. 如任一前述實施例之表現抑制子,其中在轉染後的第1、2、3、4、5、6、7、8、10、12、16、20、24、28、32、36、40、44、48、52、56、60、64、68、72、76、80或96小時,該表現抑制子結合至MYC基因座明顯地減弱MYC表現。
90. 如實施例1至23或36至89中任一例之表現抑制子,其中該靶向部分結合至人類基因體基因座。
91. 如實施例24至43、49、51、53、56至57、59至62、66至68、70至89中任一例之表現抑制子,其中該靶向部分結合至小鼠基因體基因座。
92. 如任一前述實施例之表現抑制子,其中該表現抑制子結合至MYC基因座降低了包含MYC基因座之細胞(例如癌細胞)的存活率。
93. 如任一前述實施例之表現抑制子,其中使複數個細胞與該表現抑制子或編碼該表現抑制子之核酸接觸降低了該複數個細胞之存活率。
94. 如任一前述實施例之表現抑制子,其中存活率相較於第一表現抑制子不存在下的存活率降低10、20、30、40、50、60、70、80、90或100%,例如如藉由CellTiter Glo所量測或如實例2至7中之任一例所述。
95. 如任一前述實施例之表現抑制子,其中該表現抑制子的投與使得靶細胞(例如癌細胞)中的至少5%、6%、7%、8%、9%、10%、12%、15%、17%、20%、25%、30%、40%、45%、50%、55%、60%、65%、75%發生細胞凋亡。
96. 如任一前述實施例之表現抑制子,其中該複數個細胞包含複數個癌細胞及複數個非癌細胞及/或複數個感染細胞及複數個未感染細胞。
97. 如任一前述實施例之表現抑制子,其中該複數個細胞與該表現抑制子或編碼該表現抑制子之核酸接觸使複數個癌細胞之存活率降低,降幅超過其使複數個非癌細胞之存活率達成的降幅。
98. 如任一前述實施例之表現抑制子,其中該複數個細胞與該表現抑制子或編碼該表現抑制子的核酸接觸使複數個癌細胞的存活率降低,降幅超過其使複數個非癌細胞之存活率達成之降幅的1.05x (亦即,1.05倍)、1.1x、1.15x、1.2x、1.25x、1.3x、1.35x、1.4x、1.45x、1.5x、1.6x、1.7x、1.8x、1.9x、2x、3x、4x、5x、6x、7x、8x、9x、10x、20x、50x或100x。
99. 如實施例92至97中任一例之表現抑制子,其中該癌細胞為肺癌細胞、胃癌細胞、胃腸癌細胞、結腸直腸癌細胞、胰臟癌細胞或肝癌細胞。
100. 如實施例92至99中任一例之表現抑制子,其中該癌症為肝細胞癌(HCC)、纖維板層肝細胞癌(FHCC)、膽管癌、血管肉瘤、繼發性肝癌、肺癌、非小細胞肺癌(NSCLC)、腺癌、小細胞肺癌(SCLC)、大細胞(未分化)癌瘤、三陰性乳癌、胃腺癌、子宮內膜癌或胰臟癌。
101. 如任一前述實施例之表現抑制子,其當與複數個感染細胞及複數個未感染細胞接觸時,使複數個感染細胞之存活率降低,降幅超過其使複數個未感染細胞之存活率達成的降幅。
102. 如任一前述實施例之表現抑制子,其中該感染為病毒感染。
103. 如實施例102之表現抑制子,其中該病毒感染為肝炎,例如B型肝炎。
104. 如實施例92至103中任一例之表現抑制子,其中感染的細胞為人類肝細胞。
105. 如任一前述實施例之表現抑制子,藉由使用LNP遞送編碼表現抑制子之mRNA,該表現抑制子在癌細胞(例如HCC細胞)存活率的分析中(例如在根據實例12的分析中)測試時,具有0.04-0.4、0.04-0.1、0.1-0.2、0.2-0.3或0.3-0.4 µg/mL的EC50。
106. 如實施例1至104中任一例之表現抑制子,藉由使用LNP遞送編碼該表現抑制子的mRNA,該表現抑制子在癌細胞(例如肺癌細胞)存活率分析中(例如在根據實例18的分析中)測試時,具有0.1-2.5、0.5-2.2、1.0-1.5、1.2-2 µg/mL的EC50。
107. 如任一前述實施例之表現抑制子,藉由使用LNP遞送編碼該表現抑制子的mRNA,該表現抑制子在關於降低癌細胞(例如HCC細胞)中之MYC mRNA水準的分析中(例如在根據實例12的分析中)測試時,具有0.004-0.08、0.004-0.01、0.01-0.02、0.02-0.04或0.04-0.08 µg/mL的EC50。
108. 如任一前述實施例之表現抑制子,藉由使用LNP遞送編碼該表現抑制子的mRNA,該表現抑制子在關於降低癌細胞(例如肺癌細胞)中之MYC mRNA水準的分析中(例如在根據實例18的分析中)測試時,具有0.04-0.1、0.04-0.09、0.05-0.09或0.06-0.8 µg/mL的EC50。
109. 如任一前述實施例之表現抑制子,相較於未處理細胞中的蛋白質水準,該表現抑制子使細胞中之靶基因(例如MYC)所編碼的蛋白質水準降低至少10%、至少20%、至少30%、至少40%、至少50%、至少60%、至少70%、至少80%或至少90%。
110. 如任一前述實施例之表現抑制子,其能夠使例如人類個體或哺乳動物模型中的腫瘤體積減小。
111. 如任一前述實施例之表現抑制子,其中該表現抑制子能夠使腫瘤體積減小的程度類似於或大於化學治療劑,例如在哺乳動物模型中,例如當在治療開始之後的第20天量測時,例如其中該表現抑制子以每5天3 mg/kg之劑量投與。
112. 如任一前述實施例之表現抑制子,其中相較於PBS對照物,該表現抑制子能夠減小腫瘤體積,例如在哺乳動物模型中,例如當治療開始之後的第20天量測時,例如其中該表現抑制子投與4次劑量,每5天一次,隨後每3天一次,投與3次劑量:1 mg/kg、1.5 mg/kg或3 mg/kg。
113. 如任一前述實施例之表現抑制子,其中相較於PBS治療的對照組,腫瘤體積減小至少約10%、20%、30%或40%,例如在治療開始之後的第20天。
114. 如實施例111至113中任一例之表現抑制子,其中該化學治療劑為索拉非尼(sorafenib)或順鉑(cisplatin)。
115. 如任一前述實施例之表現抑制子,其中系統能夠使腫瘤體積減小的程度類似於或大於小分子MYC抑制劑。
116. 如實施例115之表現抑制子,其中該小分子MYC抑制劑為MYCi975,其中視情況,腫瘤體積相較於MYCi975治療的對照組減小至少約10%、20%、30%或40%,例如在治療開始之後的第20天。
117. 如任一前述實施例之表現抑制子,相較於治療開始時,其未使體重減少,或其使體重減少小於3%、2%或1%。
118. 一種系統,其包含:
任一前述實施例之第一表現抑制子,及
第二表現抑制子,例如本文所述之第二表現抑制子,例如任一前述實施例之第二表現抑制子。
119. 一種系統,其包含:
第一表現抑制子,其包含第一靶向部分及視情況存在的第一效應部分,其中該第一表現抑制子結合至可操作地連接至MYC基因的轉錄調控元件(例如啟動子、增強子或轉錄起點(TSS))或結合至近接於該轉錄調控元件的序列;以及
第二表現抑制子,其包含第二靶向部分及視情況存在的第二效應部分,其中該第二表現抑制子結合至包含MYC基因之錨定序列介導性接合體(ASMC)中的錨定序列或結合至近接於該錨定序列的序列。
120. 如實施例118或119之系統,
其中該轉錄調控元件包含啟動子,且
其中該錨定序列包含CTCF結合模體。
121. 如實施例118至120中任一例之系統,其中第二表現抑制子結合至與CTCF結合模體鄰接的下游區域。
122. 如實施例118至120中任一例之系統,其中第二表現抑制子結合至與CTCF結合模體鄰接的上游區域。
123. 如實施例118至122中任一例之系統,其中
該第一表現抑制子包含靶向部分,該靶向部分結合的基因體基因座包含序列SEQ ID NO: 2、3、4、71至86或200至206中的至少16、17、18、19或20個核苷酸;及
該第二表現抑制子包含靶向部分,該靶向部分結合的基因體基因座包含序列SEQ ID NO: 2、3、4、71至86或200至206中的至少16、17、18、19或20個核苷酸。
124. 如實施例118至123中任一例之系統,其中
該第一表現抑制子包含的靶向部分結合包含SEQ ID NO: 96-110中之任一者之序列之至少16、17、18、19或20個核苷酸的基因體基因座。
125. 如實施例118至124中任一例之系統,其中,
該第一表現抑制子包含靶向部分,該靶向部分結合的基因體基因座包含序列SEQ ID NO: 71、SEQ ID NO: 72或SEQ ID NO: 83中的至少16、17、18、19或20個核苷酸;及
該第二表現抑制子包含靶向部分,該靶向部分結合的基因體基因座包含序列SEQ ID NO:77中的至少16、17、18、19或20個核苷酸。
126. 一種系統,其包含:
第一表現抑制子,其包含第一靶向部分及視情況存在的第一效應部分,其中該第一表現抑制子結合至可操作地連接至MYC基因的啟動子或結合至近接於該啟動子的序列,及
第二表現抑制子,其包含第二靶向部分及視情況存在的第二效應部分,其中該第二表現抑制子結合至MYC基因的增強子(例如超級增強子)。
127. 如實施例126之系統,其中,
該第一表現抑制子包含的靶向部分結合包含序列SEQ ID NO:204之至少16、17、18、19或20個核苷酸的基因體基因座,且
該第二表現抑制子包含的靶向部分結合包含SEQ ID NO: 199或201中之任一者之序列中之至少16、17、18、19或20個核苷酸的基因體基因座。
128. 一種用於減少MYC表現的系統,該系統包含:
a)第一表現抑制子,其包含:
i)第一靶向部分,其具有根據SEQ ID NO: 13的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,及
ii)第一效應部分,其具有根據SEQ ID NO: 19或87的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列;以及
b)第二表現抑制子,其包含:
i)第二靶向部分,其具有根據SEQ ID NO: 7、169或171的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,及
ii)第二效應部分,其具有根據SEQ ID NO: 18的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
129. 如實施例128之系統,其中該第一表現抑制子進一步包含第一核定位信號,例如SV40 NLS,例如根據SEQ ID NO: 135之序列或與其至少80、85、90、95、99或100%一致之序列,例如位於第一靶向部分之N端。
130. 如實施例128或129之系統,其中該第一表現抑制子進一步包含第二核定位信號,例如核質蛋白NLS,例如根據SEQ ID NO: 136之序列或與其至少80、85、90、95、99或100%一致之序列,例如位於第一效應部分之C端。
131. 如實施例128至130中任一項之系統,其中該第二表現抑制子進一步包含第一核定位信號,例如SV40 NLS,例如根據SEQ ID NO: 135之序列或與其至少80、85、90、95、99或100%一致之序列,例如位於第二靶向部分之N端。
132. 如實施例128至131中任一例之系統,其中第二表現抑制子進一步包含第二核定位信號,例如核質蛋白NLS,例如根據SEQ ID NO: 136之序列或與其至少80、85、90、95、99或100%一致之序列,例如位於第二效應部分之C端。
133. 如實施例128至132中任一例之系統,其中第一表現抑制子進一步包含位於第一靶向部分與第一效應部分之間的第一連接子,其中視情況,第一連接子具有根據SEQ ID NO: 137之胺基酸序列或與其至少80、85、90、95、99或100%一致的序列。
134. 如實施例128至133中任一例之系統,其中第二表現抑制子進一步包含位於第二靶向部分與第二效應部分之間的第二連接子,其中視情況,第二連接子具有根據SEQ ID NO: 138之胺基酸序列或與其至少80、85、90、95、99或100%一致的序列。
135. 如實施例128至134中任一例之系統,其中該第一表現抑制子進一步包含第一效應部分C端的胺基酸序列,例如至多30、25、20或18個胺基酸的序列,例如根據SEQ ID NO: 126的序列或與其至少80、85、90、95、99或100%一致的序列。
136. 如實施例128至132中任一例之系統,其中該第二表現抑制子進一步包含第二靶向部分N端的胺基酸序列,例如至多30、25、20或18個胺基酸的序列,例如根據SEQ ID NO: 128的序列或與其至少80、85、90、95、99或100%一致的序列。
137. 如實施例128至136中任一例之系統,其中該第一表現抑制子具有根據SEQ ID NO: 30或129的胺基酸序列,或與其至少80、85、90、95、99或100%一致的序列。
138. 如實施例128至137中任一例之系統,其中第二表現抑制子具有根據SEQ ID NO: 24之胺基酸序列或與其至少80、85、90、95、99或100%一致的序列。
139. 如實施例128至137中任一例之系統,其中第二靶向部分包含根據SEQ ID NO: 169之胺基酸序列或與其至少80、85、90、95、99或100%一致的序列。
140. 如實施例128至137中任一例之系統,其中第二靶向部分包含根據SEQ ID NO: 171之胺基酸序列或與其至少80、85、90、95、99或100%一致的序列。
141. 如實施例128至140中任一例之系統,其中第二表現抑制子具有根據SEQ ID NO: 177或183之胺基酸序列或與其至少80、85、90、95、99或100%一致的序列。
142. 如實施例128至140中任一例之系統,其中第二表現抑制子具有根據SEQ ID NO: 179、185之胺基酸序列或與其至少80、85、90、95、99或100%一致的序列。
143. 一種核酸,其編碼如實施例128至142中任一例之系統中的第一表現抑制子及第二抑制子。
144. 一種核酸,其編碼用於減少MYC表現的系統,該核酸包含:
a)編碼第一表現抑制子的第一區域,該第一表現抑制子包含:
i)第一靶向部分,其具有根據SEQ ID NO: 13的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,及
ii)第一效應部分,其具有根據SEQ ID NO: 19或87的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列;以及
b)編碼第二表現抑制子的第二區域,該第二表現抑制子包含:
i)第二靶向部分,其具有根據SEQ ID NO: 7的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,及
ii)第二效應部分,其具有根據SEQ ID NO: 18的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
145. 如實施例144之核酸,其中第一區域位於第二區域的5'。
146. 如實施例144之核酸,其中第一區域位於第二區域的3'。
147. 如實施例145或146之核酸,其中第一區域進一步包含核苷酸序列,該核苷酸序列編碼第一核定位信號,例如SV40 NLS,例如根據SEQ ID NO: 135的序列或與其至少80、85、90、95、99或100%一致的序列,例如位於第一靶向部分的N端。
148. 如實施例145至147中任一例之核酸,其中第一區域進一步包含核苷酸序列,該核苷酸序列編碼第二核定位信號,例如核質蛋白NLS,例如根據SEQ ID NO: 136的序列或與其至少80、85、90、95、99或100%一致的序列,例如位於第一效應部分的C端。
149. 如實施例145至148中任一例之核酸,其中第二區域進一步包含核苷酸序列,該核苷酸序列編碼第一核定位信號,例如SV40 NLS,例如根據SEQ ID NO: 135之序列或與其至少80、85、90、95、99或100%一致的序列,例如位於第二靶向部分之N端。
150. 如實施例145至149中任一例之核酸,其中第二區域進一步包含核苷酸序列,該核苷酸序列編碼第二核定位信號,例如核質蛋白NLS,例如根據SEQ ID NO: 136之序列或與其至少80、85、90、95、99或100%一致的序列,例如位於第二效應部分之C端。
151. 如實施例145至150中任一例之核酸,其中第一區域進一步包含編碼第一連接子之核苷酸序列,該第一連接子位於該第一靶向部分與該第一效應部分之間,其中視情況,該第一連接子具有根據SEQ ID NO: 137之胺基酸序列或與其至少80、85、90、95、99或100%一致的序列。
152. 如實施例145至151中任一例之核酸,其中第二區域進一步包含編碼第二連接子的核苷酸序列,該第二連接子位於第二靶向部分與第二效應部分之間,其中視情況,該第二連接子具有根據SEQ ID NO: 138之胺基酸序列或與其至少80%、85%、90%、95%、99%或100%一致的序列。
153. 如實施例145至152中任一例之核酸,其中第一區域進一步包含核苷酸序列,該核苷酸序列編碼第一效應部分C端之胺基酸序列,例如至多30、25、20或18個胺基酸之序列,例如根據SEQ ID NO: 126的序列或與其至少80、85、90、95、99或100%一致的序列。
154. 如實施例145至153中任一例之核酸,其中第二區域進一步包含核苷酸序列,該核苷酸序列編碼第二靶向部分N端之胺基酸序列,例如至多30、25、20或18個胺基酸之序列,例如根據SEQ ID NO: 128的序列或與其至少80、85、90、95、99或100%一致的序列。
155. 如實施例145至154中任一例之核酸,其中該第一表現抑制子具有根據SEQ ID NO: 30或129之胺基酸序列或與其至少80、85、90、95、99或100%一致的序列。
156. 如實施例145至155中任一例之核酸,其中該第二表現抑制因子具有根據SEQ ID NO: 24之胺基酸序列或與其至少80、85、90、95、99或100%一致的序列。
157. 如實施例145至156中任一例之核酸,其中第一區域包含編碼第一靶向部分之核苷酸序列,其中編碼第一靶向部分之核苷酸序列包含根據SEQ ID NO: 46或131之序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
158. 如實施例145至157中任一例之核酸,其中第一區域包含編碼第一效應部分之核苷酸序列,其中編碼第一效應部分之核苷酸序列包含根據SEQ ID NO: 52或132之序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
159. 如實施例145至158中任一例之核酸,其中第二區域包含編碼第二靶向部分之核苷酸序列,其中編碼第二靶向部分之核苷酸序列包含根據SEQ ID NO: 40之序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
160. 如實施例145至159中任一例之核酸,其中第一區域包含編碼第一效應部分之核苷酸序列,其中編碼第一效應部分之核苷酸序列包含根據SEQ ID NO: 51之序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
161. 如實施例145至160中任一例之核酸,其中第一區域包含根據SEQ ID NO: 63或130之核苷酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,其中視情況存在聚腺苷酸序列。
162. 如實施例145至161中任一例之核酸,其中第二區域包含根據SEQ ID NO: 57之核苷酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,其中視情況存在聚腺苷酸序列。
163. 一種核酸,其編碼用於減少MYC表現的系統,該核酸包含:
a)編碼第一表現抑制子的第一區域,該第一表現抑制子包含:
i)第一靶向部分,其具有根據SEQ ID NO: 13的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,及
ii)第一效應部分,其具有根據SEQ ID NO: 19或87的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,以及
b)編碼第二表現抑制子的第二區域,該第二表現抑制子包含:
i)第二靶向部分,其具有根據SEQ ID NO: 169的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,及
ii)第二效應部分,其具有根據SEQ ID NO: 18的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
164. 一種核酸,其編碼用於減少MYC表現的系統,該核酸包含:
a)編碼第一表現抑制子的第一區域,該第一表現抑制子包含:
i)第一靶向部分,其具有根據SEQ ID NO: 13的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,及
ii)第一效應部分,其具有根據SEQ ID NO: 19或87的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列;以及
b)編碼第二表現抑制子的第二區域,該第二表現抑制子包含:
i)第二靶向部分,其具有根據SEQ ID NO: 171的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,及
ii)第二效應部分,其具有根據SEQ ID NO: 18的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
165. 如實施例163或164之核酸,其中第一區域位於第二區域的5'。
166. 如實施例163或164之核酸,其中第一區域位於第二區域的3'。
167. 如實施例163至166中任一例之核酸,其中第一區域進一步包含核苷酸序列,該核苷酸序列編碼第一核定位信號,例如SV40 NLS,例如根據SEQ ID NO: 135的序列或與其至少80、85、90、95、99或100%一致的序列,例如位於第一靶向部分之N端。
168. 如實施例163至167中任一例之核酸,其中第一區域進一步包含核苷酸序列,該核苷酸序列編碼第二核定位信號,例如核質蛋白NLS,例如根據SEQ ID NO: 136的序列或與其至少80、85、90、95、99或100%一致的序列,例如位於第一效應部分的C端。
169. 如實施例163至168中任一例之核酸,其中第二區域進一步包含核苷酸序列,該核苷酸序列編碼第一核定位信號,例如SV40,例如根據SEQ ID NO: 135的序列或與其至少80、85、90、95、99或100%一致的序列,例如位於第二靶向部分的N端。
170. 如實施例163至169中任一例之核酸,其中第二區域進一步包含核苷酸序列,該核苷酸序列編碼第二核定位信號,例如核質蛋白NLS,例如根據SEQ ID NO: 136之序列或與其至少80、85、90、95、99或100%一致的序列,例如位於第二效應部分之C端。
171. 如實施例163至170中任一例之核酸,其中第一區域進一步包含編碼第一連接子的核苷酸序列,該第一連接子位於第一靶向部分與第一效應部分之間,其中視情況,該第一連接子具有根據SEQ ID NO: 137的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列。
172. 如實施例163至171中任一例之核酸,其中第二區域進一步包含編碼第二連接子的核苷酸序列,該第二連接子位於第二靶向部分與第二效應部分之間,其中視情況,該第二連接子具有根據SEQ ID NO: 138的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列。
173. 如實施例163至171中任一例之核酸,其中第一區域進一步包含核苷酸序列,該核苷酸序列編碼第一效應部分C端之胺基酸序列,例如至多30、25、20或18個胺基酸的序列,例如根據SEQ ID NO: 126之序列或與其至少80、85、90、95、99或100%一致的序列。
174. 如實施例163至173中任一例之核酸,其中第二區域進一步包含核苷酸序列,該核苷酸序列編碼第二靶向部分N端之胺基酸序列,例如至多30、25、20或18個胺基酸的序列,例如根據SEQ ID NO: 128之序列或與其至少80、85、90、95、99或100%一致的序列。
175. 如實施例163至174中任一例之核酸,其中第一表現抑制子具有根據SEQ ID NO: 30或129的胺基酸序列,或與其至少80、85、90、95、99或100%一致的序列。
176. 如實施例144至175中任一例之核酸,其中第二表現抑制子具有根據SEQ ID NO: 177或183的胺基酸序列,或與其至少80、85、90、95、99或100%一致的序列。
177. 如實施例144至176中任一例之核酸,其中第二表現抑制子具有根據SEQ ID NO: 179或185的胺基酸序列,或與其至少80、85、90、95、99或100%一致的序列。
178. 如實施例144至177中任一例之核酸,其中第一表現抑制子包含根據SEQ ID NO: 30或129的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,且第二表現抑制子具有根據SEQ ID NO: 24、141、177、179、183或185的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列。
179. 如實施例144至178中任一例之核酸,其中第一區域包含編碼第一靶向部分的核苷酸序列,其中編碼第一靶向部分的核苷酸序列包含根據SEQ ID NO: 46或131的序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
180. 如實施例144至179中任一例之核酸,其中第一區域包含編碼第一效應部分的核苷酸序列,其中編碼第一效應部分的核苷酸序列包含根據SEQ ID NO: 52或132的序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
181. 如實施例144至180中任一例之核酸,其中第二區域包含根據SEQ ID NO: 173的核苷酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,其中視情況存在聚腺苷酸序列。
182. 如實施例144至181中任一例之核酸,其中第二區域包含根據SEQ ID NO: 175的核苷酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,其中視情況存在聚腺苷酸序列。
183. 如實施例144至182中任一例之核酸,其中第二區域包含編碼第二效應部分的核苷酸序列,其中編碼第二效應部分的核苷酸序列包含根據SEQ ID NO: 51的序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
184. 如實施例144至183中任一例之核酸,其中第一區域包含根據SEQ ID NO: 63或130的核苷酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,其中視情況存在聚腺苷酸序列。
185. 如實施例144至184中任一例之核酸,其中第二區域包含根據SEQ ID NO: 189的核苷酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,其中視情況存在聚腺苷酸序列。
186. 如實施例144至185中任一例之核酸,其中第二區域包含根據SEQ ID NO: 194的核苷酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,其中視情況存在聚腺苷酸序列。
187. 如實施例144至186中任一例之核酸,其中第一區域包含編碼第一效應部分的核苷酸序列,其中編碼第一效應部分的核苷酸序列包含根據SEQ ID NO: 52或132的序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
188. 如實施例144至187中任一例之核酸,其中第一區域包含編碼第一靶向部分的核苷酸序列,其中編碼第一靶向部分的核苷酸序列包含根據SEQ ID NO: 46或131的序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
189. 如實施例144至188中任一例之核酸,其中第二區域包含編碼第二效應部分的核苷酸序列,其中編碼第二效應部分的核苷酸序列包含根據SEQ ID NO: 51的序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
190. 如實施例144至189中任一例之核酸,其中第二區域包含根據SEQ ID NO: 189的核苷酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
191. 如實施例144至190中任一例之核酸,其中第二區域包含根據SEQ ID NO: 194的核苷酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
192. 如實施例144至191中任一例之核酸,其具有根據SEQ ID NO: 93、112的核苷酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
193. 如實施例144至192中任一例之核酸,其具有根據SEQ ID NO: 196或197的核苷酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
194. 如實施例118至193中任一例之系統或核酸,其中第一表現抑制子包含第一效應部分。
195. 如實施例118至194中任一例之系統或核酸,其中第二表現抑制子包含第二效應部分。
196. 如實施例118至195中任一例之系統或核酸,其中第一效應部分具有與第二效應部分不同之胺基酸序列。
197. 如實施例118至196中任一例之系統或核酸,其中第一效應部分為持久效應部分。
198. 如實施例118至125或144至197中任一例之系統或核酸,其中第一效應部分為短暫效應部分。
199. 如實施例118至198中任一例之系統或核酸,其中第一效應部分為表觀遺傳修飾部分。
200. 如實施例118至143、163至197或199中任一例之系統或核酸,其中第一效應部分包含組蛋白甲基轉移酶。
201. 如實施例200之系統或核酸,其中第一效應部分包含選自以下的蛋白質:SETDB1、SETDB2、EHMT2 (亦即,G9A)、EHMT1 (亦即,GLP)、SUV39H1、EZH2、EZH1、SUV39H2、SETD8、SUV420H1、SUV420H2或其任一者的功能變異體或片段,例如其任一者的SET域。
202. 如實施例118至143、163至197或199中任一例之系統或核酸,其中第一效應部分包含組蛋白去甲基酶(例如離胺酸去甲基酶)。
203. 如實施例202之系統或核酸,其中第一效應部分包含選自以下的蛋白質:KDM1A (亦即,LSD1)、KDM1B (亦即,LSD2)、KDM2A、KDM2B、KDM5A、KDM5B、KDM5C、KDM5D、KDM4B、NO66 (或其任一者的功能變異體或片段)。
204. 如實施例118至143、163至197或199中任一例之系統或核酸,其中第一效應部分包含組蛋白去乙醯酶。
205. 如實施例204之系統或核酸,其中第一效應部分包含選自以下的蛋白質:HDAC1、HDAC2、HDAC3、HDAC4、HDAC5、HDAC6、HDAC7、HDAC8、HDAC9、HDAC10、HDAC11、SIRT1、SIRT2、SIRT3、SIRT4、SIRT5、SIRT6、SIRT7、SIRT8、SIRT9,或其任一者的功能變異體或片段。
206. 如實施例118至197或200中任一例之系統或核酸,其中第一效應部分包含DNA甲基轉移酶。
207. 如實施例206之系統或核酸,其中第一效應部分包含選自以下的蛋白質:MQ1、DNMT1、DNMT3A1、DNMT3A2、DNMT3B1、DNMT3B2、DNMT3B3、DNMT3B4、DNMT3B5、DNMT3B6、DNMT3L或其任一者的功能變異體或片段。
208. 如實施例118至143、160至196或198中任一例之系統或核酸,其中第一效應部分為轉錄抑制子部分,例如包含轉錄抑制子。
209. 如實施例198或199之系統或核酸,其中第一效應部分包含選自以下的蛋白質:KRAB、MeCP2、HP1、RBBP4、REST、FOG1、SUZ12或其任一者之功能變異體或片段。
210. 如實施例118至209中任一例之系統或核酸,其中第一效應部分促進轉錄調控元件或與其接近之序列的表觀遺傳修飾。
211. 如實施例118至210中任一例之系統或核酸,其中第一效應部分催化轉錄調控元件或與其近接之序列的表觀遺傳修飾。
212. 如實施例118至125、194或197至211中任一例之系統或核酸,其中第二表現抑制子不包含效應部分。
213. 如實施例118至212中任一例之系統或核酸,其中第二效應部分為短暫效應部分。
214. 如實施例118至125或194至211中任一例之系統或核酸,其中第二效應部分為持久效應部分。
215. 如實施例118至211或214中任一例之系統或核酸,其中第二效應部分為表觀遺傳修飾部分。
216. 如實施例118至125、194至211或214至215中任一例之系統或核酸,其中第二效應部分包含組蛋白甲基轉移酶。
217. 如實施例216之系統或核酸,其中第二效應部分包含選自以下的蛋白質:SETDB1、SETDB2、EHMT2 (亦即,G9A)、EHMT1 (亦即,GLP)、SUV39H1、EZH2、EZH1、SUV39H2、SETD8、SUV420H1、SUV420H2或其任一者的功能變異體或片段,例如其任一者的SET域。
218. 如實施例118至125、194至211或214至215中任一例之系統或核酸,其中第二效應部分包含組蛋白去甲基酶(例如離胺酸去甲基酶)。
219. 如實施例218之系統或核酸,其中第二效應部分包含選自以下的蛋白質:KDM1A (亦即,LSD1)、KDM1B (亦即,LSD2)、KDM2A、KDM2B、KDM5A、KDM5B、KDM5C、KDM5D、KDM4B、NO66 (或其任一者的功能變異體或片段)。
220. 如實施例118至125、194至211或214至215中任一例之系統或核酸,其中第二效應部分包含組蛋白去乙醯酶。
221. 如實施例220之系統或核酸,其中第二效應部分包含選自以下的蛋白質:HDAC1、HDAC2、HDAC3、HDAC4、HDAC5、HDAC6、HDAC7、HDAC8、HDAC9、HDAC10、HDAC11、SIRT1、SIRT2、SIRT3、SIRT4、SIRT5、SIRT6、SIRT7、SIRT8、SIRT9,或其任一者的功能變異體或片段。
222. 如實施例118至125、194至211或214至215中任一例之系統或核酸,其中第二效應部分包含DNA甲基轉移酶。
223. 如實施例222之系統或核酸,其中第二效應部分包含選自以下的蛋白質:MQ1、DNMT1、DNMT3A1、DNMT3A2、DNMT3B1、DNMT3B2、DNMT3B3、DNMT3B4、DNMT3B5、DNMT3B6、DNMT3L或其任一者的功能變異體或片段。
224. 如實施例118至211或213中任一例之系統或核酸,其中第二效應部分為轉錄抑制子部分。
225. 如實施例224之系統或核酸,其中第二效應部分促進錨定序列或與其近接之序列的表觀遺傳修飾。
226. 如實施例223或224之系統或核酸,其中第二效應部分結合至一或多種內源性表觀遺傳修飾蛋白或一或多種內源性轉錄修飾蛋白。
227. 如實施例223至226中任一例之系統或核酸,其中第二效應部分包含KRAB、MeCP2、HP1、RBBP4、REST、FOG1、SUZ12或其任一者之功能變異體或片段。
228. 如實施例118至197、199至207、210至211、213或224至227中任一例之系統或核酸,其中:
第一效應部分為持久效應部分,且
第二效應部分為短暫效應部分。
229. 如實施例228之系統或核酸,其中第一效應部分為表觀遺傳修飾部分。
230. 如實施例227或228之系統或核酸,其中第二效應部分為轉錄抑制子部分。
231. 如實施例227至230中任一例之系統或核酸,其中:
該第一效應部分包含組蛋白甲基轉移酶、組蛋白去甲基酶、組蛋白去乙醯酶、DNA甲基轉移酶、其任一者之功能變異體或片段,或其任一者之組合,且
該第二效應部分包含轉錄抑制子或其任一者之功能變異體或片段。
232. 如實施例118至125、194、197、199至207、210至212或190中任一例之系統或核酸,其中:
該第一效應部分包含組蛋白甲基轉移酶、組蛋白去甲基酶、組蛋白去乙醯酶、DNA甲基轉移酶、其任一者之功能變異體或片段,或其任一者之組合,且
該第二表現抑制子不包含第二效應部分。
233. 如實施例118至125、199至207、210至211、213至214或224至231中任一例之系統或核酸,其中:
該第一效應部分包含SETDB1、SETDB2、EHMT2 (亦即,G9A)、EHMT1 (亦即,GLP)、SUV39H1、EZH2、EZH1、SUV39H2、SETD8、SUV420H1、SUV420H2、KDM1A (亦即,LSD1)、KDM1B (亦即,LSD2)、KDM2A、KDM2B、KDM5A、KDM5B、KDM5C、KDM5D、KDM4B、NO66、HDAC1、HDAC2、HDAC3、HDAC4、HDAC5、HDAC6、HDAC7、HDAC8、HDAC9、HDAC10、HDAC11、SIRT1、SIRT2、SIRT3、SIRT4、SIRT5、SIRT6、SIRT7、SIRT8、SIRT9、MQ1、DNMT1、DNMT3A1、DNMT3A2、DNMT3B1、DNMT3B2、DNMT3B3、DNMT3B4、DNMT3B5、DNMT3B6、DNMT3L、其任一者之功能變異體或片段,或其任一者的組合,且
該第二效應部分包含KRAB、MeCP2、HP1、RBBP4、REST、FOG1、SUZ12、其任一者之功能變異體或片段,或其任一者之組合。
234. 如實施例118至197、199、206至207、210至211、213、215、224至231或233中任一例之系統或核酸,其中:
該第一效應部分包含DNA甲基轉移酶,且
該第二效應部分包含轉錄抑制子。
235. 如實施例118至125、194、197、200、206至207、210至212或232中任一例之系統或核酸,其中:
該第一效應部分包含DNA甲基轉移酶,且
該第二表現抑制子不包含第二效應部分。
236. 如實施例118至125、200、206至207、210至235中任一例之系統或核酸,其中第一效應部分包含MQ1、DNMT1、DNMT3A1、DNMT3A2、DNMT3B1、DNMT3B2、DNMT3B3、DNMT3B4、DNMT3B5、DNMT3B6、DNMT3L或其任一者的功能變異體或片段。
237. 如實施例118至211、214、224至234或236中任一例之系統或核酸,其中第二效應部分包含KRAB、MeCP2、HP1、RBBP4、REST、FOG1、SUZ12或其任一者之功能變異體或片段。
238. 如實施例118至211、199、206至207、210至211、213、224至234或236至237中任一例之系統或核酸,其中:
該第一效應部分包含MQ1或其任一者之功能變異體或片段,且
該第二效應子包含KRAB或其任一者之功能變異體或片段。
239. 如實施例118至125、194、197、199至207、或210至212、229、232、235或236中任一例之系統或核酸,其中:
該第一效應部分包含MQ1或其任一者之功能變異體或片段,且
該第二表現抑制子不包含第二效應部分。
240. 如實施例118至200中任一例之系統或核酸,其中第一表現抑制子包含選自SEQ ID NO: 22-37、129、133、134、139-149、177-180或183-186中之任一者的胺基酸序列或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
241. 如實施例118至198、200、206至211、213至216、222至223、236至237或240中任一例之系統或核酸,其中第二表現抑制子包含選自SEQ ID NO: 22-37、129、133、134、139-149、177-180或183-186中之任一者的胺基酸序列,或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
242. 如實施例118至198、200、206至211、213至216、222至223、236至237或240至241中任一例之系統或核酸,其中第一表現抑制子包含胺基酸序列SEQ ID NO: 30、129、133,或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,且第二表現抑制子包含胺基酸序列SEQ ID NO: 24、134、141、177、179、183或185,或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
243. 如實施例118至198、200、206至211、213至216、222至223、236至237或240至242中任一例之系統或核酸,其中該第一表現抑制子由以下編碼:第一核苷酸序列SEQ ID NO: 63或130,或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,且該第二表現抑制子由以下編碼:第二核苷酸序列SEQ ID NO: 57、189或194,或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
244. 如實施例118至198、200、206至211、213至216、222至223、236至237或240至243中任一例之系統或核酸,其中第一抑制子及第二抑制子由以下編碼:核酸序列SEQ ID NO: 93、94、112、113、196或197,或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
245. 如實施例244之系統或核酸,其包含胺基酸序列SEQ ID NO: 91、92、121、122、181、182、187或188,或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
246. 如實施例118至197、199、206至207、210至211、213、215、224至231、233至234、236至237或240至244中任一例之系統或核酸,其中:
該第一表現抑制子自N端至C端包含:
(i)第一核定位信號,例如SV40 NLS;例如根據SEQ ID NO: 135的序列;
(ii)第一靶向部分,例如鋅指結合域,例如ZF9;例如根據SEQ ID NO: 13的序列;
(iii)第一效應部分,例如DNA甲基轉移酶,例如MQ1;例如根據SEQ ID NO: 19或87的序列;
(iv)第二核定位信號,例如核質蛋白NLS;例如根據SEQ ID NO: 136的序列;
且該第二表現抑制子自N端至C端包含:
(v)第三核定位信號,例如SV40NLS;例如根據SEQ ID NO: 135的序列;
(vi)第二靶向部分,例如鋅指結合域,例如ZF3;例如根據SEQ ID NO: 7的序列;
(vii)第二效應部分,例如KRAB,例如根據SEQ ID NO: 18的序列;及
(viii)第四核定位信號,例如核質蛋白NLS,例如根據SEQ ID NO: 136的序列。
247. 如實施例118至197、199、206至207、210至211、213、215、224至231、233至234、236至237或240至244中任一例之系統或核酸,其中:
該第一表現抑制子自N端至C端包含:
(i)第一核定位信號,例如SV40 NLS;例如根據SEQ ID NO: 135的序列;
(ii)第一靶向部分,例如鋅指結合域,例如ZF9;例如根據SEQ ID NO: 13的序列;
(iii)第一效應部分,例如DNA甲基轉移酶,例如MQ1;例如根據SEQ ID NO: 19或87的序列;
(iv)第二核定位信號,例如核質蛋白NLS;例如根據SEQ ID NO: 136的序列;
且該第二表現抑制子自N端至C端包含:
(v)第三核定位信號,例如SV40NLS;例如根據SEQ ID NO: 135的序列;
(vi)第二靶向部分,例如鋅指結合域,例如ZF54;例如根據SEQ ID NO: 169的序列;
(vii)第二效應部分,例如KRAB,例如根據SEQ ID NO: 18的序列;及
(viii)第四核定位信號,例如核質蛋白NLS,例如根據SEQ ID NO: 136的序列。
248. 如實施例118至197、199、206至207、210至211、213、215、224至231、233至234、236至237或240至244中任一例之系統或核酸,其中:
該第一表現抑制子自N端至C端包含:
(i)第一核定位信號,例如SV40 NLS;例如根據SEQ ID NO: 135的序列;
(ii)第一靶向部分,例如鋅指結合域,例如ZF9;例如根據SEQ ID NO: 13的序列;
(iii)第一效應部分,例如DNA甲基轉移酶,例如MQ1;例如根據SEQ ID NO: 19或87的序列;
(iv)第二核定位信號,例如核質蛋白NLS;例如根據SEQ ID NO: 136的序列;
且該第二表現抑制子自N端至C端包含:
(v)第三核定位信號,例如SV40NLS;例如根據SEQ ID NO: 135的序列;
(vi)第二靶向部分,例如鋅指結合域,例如ZF67;例如根據SEQ ID NO: 171的序列;
(vii)第二效應部分,例如KRAB,例如根據SEQ ID NO: 18的序列;及
(viii)第四核定位信號,例如核質蛋白NLS,例如根據SEQ ID NO: 136的序列。
249. 如實施例118至248中任一例之系統,其中該系統能夠使MYC表現減少的程度大於單獨的第一表現抑制子或單獨的第二表現抑制子。
250. 如實施例128至194或242至249中任一例之系統,其中該系統能夠使MYC表現減少的程度大於單獨或組合的SEQ ID: 22、23、25-29、31-37之表現抑制子中的任一者。
251. 如實施例118至250中任一例之系統,其能夠減小例如人類個體或哺乳動物模型中之腫瘤體積。
252. 如實施例128至193或242至209中任一例之系統,其中該系統能夠使腫瘤體積減小的程度類似於或大於化學治療劑,例如哺乳動物模型中,例如當在治療開始之後的第20天量測時,例如其中表現抑制子以每5天3 mg/kg之劑量投與,例如在如實例15所述的模型系統中。
253. 如實施例128至193或242至252中任一例之系統,其中該系統能夠使腫瘤體積減小的程度大於化學治療劑,例如哺乳動物模型中,例如當在治療開始之後的第15天量測時,例如其中表現抑制子以每5天6 mg/kg之劑量投與,例如在如實例14所述的模型系統中。
254. 如實施例128至193或242至253中任一例之系統,其中相較於PBS治療的對照組,腫瘤體積減小至少約10%、20%、30%、40%、50%或60%,例如在治療開始之後的第20天。
255. 如實施例254之系統,其中該化學治療劑為索拉非尼或順鉑。
256. 如實施例128至193或242至253中任一例之系統,其中該系統能夠使腫瘤體積減小的程度類似於或大於小分子MYC抑制劑。
257. 如實施例256之系統,其中該小分子MYC抑制劑為MYCi975,其中視情況,腫瘤體積相較於MYCi975治療的對照組減小至少約10%、20%、30%或40%,例如在治療開始之後的第20天。
258. 如具體實例118至257中任一例之系統,與治療開始時相比,其不引起體重降低,或引起體重的降幅小於3%、2%或1%。
259. 如實施例118至258中任一例之系統或核酸,其中該第一靶向部分選自TAL效應域、CRISPR/Cas域、鋅指域、tetR域、巨核酸酶一種寡核苷酸。
260. 如實施例11至260中任一例之系統或核酸,其中第二靶向部分選自TAL效應域、CRISPR/Cas域、鋅指域、tetR域、巨核酸酶或寡核苷酸。
261. 如實施例118至260中任一例之系統或核酸,其中第一靶向部分包含CRISPR/Cas域(例如第一CRISPR/Cas域)。
262. 如實施例118至261中任一例之系統或核酸,其中第二靶向部分包含第二CRISPR/Cas域(例如第二CRISPR/Cas域)。
263. 如實施例262之系統或核酸,其中:i)第一CRISPR/Cas域結合第一嚮導RNA,ii)第二CRISPR/Cas域結合第二嚮導RNA,或iii) (i)與(ii)兩者。
264. 如實施例262或263之系統或核酸,其中第一CRISPR/Cas域不結合第二嚮導RNA或以至少10、20、50、100、1000或10,000 nM的K
D結合,且第二CRISPR/Cas域不結合第一嚮導RNA或以至少10、20、50、100、1000或10,000 nM的K
D結合。
265. 如實施例260至264中任一例之系統或核酸,其中第一CRISPR/Cas域包含與第二CRISPR/Cas域不同的胺基酸序列。
266. 如實施例260至265中任一例之系統或核酸,其中該第一或第二CRISPR/Cas域包含選自表1之Cas蛋白質或Cpf1蛋白質或其任一者之變異體(例如突變體)的胺基酸序列。
267. 如實施例260至266中任一例之系統或核酸,其中該第一CRISPR/Cas域包含選自表1之Cas蛋白質或Cpf1蛋白質或其任一者之變異體(例如突變體)的胺基酸序列,且該第二CRISPR/Cas域包含選自表1之不同Cas蛋白質或Cpf1蛋白質或其任一者之變異體(例如突變體)的胺基酸序列。
268. 如實施例118至260中任一例之系統或核酸,其中第一靶向部分包含鋅指域(例如第一鋅指域)。
269. 如實施例118至260或268中任一例之系統或核酸,其中第二靶向部分包含鋅指域(例如第二鋅指域)。
270. 如實施例118至261或268至269中任一例之系統或核酸,其中第一靶向部分包含第一鋅指域且第二靶向部分包含第二鋅指域。
271. 如實施例268至270中任一例之系統或核酸,其中該第一鋅指域與該第二鋅指域結合相同基因體基因座,例如具有相同胺基酸序列。
272. 如實施例268至271中任一例之系統或核酸,其中第一鋅指域與第二鋅指域具有不同的胺基酸序列或結合不同的基因體基因座。
273. 如實施例118至261或267至272中任一例之系統或核酸,其中第一鋅指分子包含至少1、2、3、4、5、7、8、9或10個鋅指(且視情況不超過11、10、9、8、7、6或5個鋅指)。
274. 如實施例267至273中任一例之系統或核酸,其中第一鋅指分子包含1至10、1至9、1至8、1至7、1至6、1至5、1至4、1至3、1至2、2至10、2至9、2至8、2至7、2至6、2至5、2至4、2至3、3至10、3至9、3至8、3至7、3至6、3至5、3至4、4至10、4至9、4至8、4至7、4至6、4至5、5至10、5至9、5至8、5至7、5至6、6至10、6至9、6至8、6至7、7至10、7至9、7至8、8至10、8至9或9至10個鋅指。
275. 如實施例268至274中任一例之系統或核酸,其中該第一鋅指域包含3或9個鋅指。
276. 如實施例268至275中任一例之系統或核酸,其中第二鋅指域包含至少1、2、3、4、5、7、8、9或10個鋅指(且視情況不超過11、10、9、8、7、6或5個鋅指)。
277. 如實施例268至276中任一例之系統或核酸,其中第二鋅指域包含1至10、1至9、1至8、1至7、1至6、1至5、1至4、1至3、1至2、2至10、2至9、2至8、2至7、2至6、2至5、2至4、2至3、3至10、3至9、3至8、3至7、3至6、3至5、3至4、4至10、4至9、4至8、4至7、4至6、4至5、5至10、5至9、5至8、5至7、5至6、6至10、6至9、6至8、6至7、7至10、7至9、7至8、8至10、8至9或9至10個鋅指。
278. 如實施例268至277中任一例之系統或核酸,其中該第二鋅指域包含3或9個鋅指。
279. 如實施例11至278中任一例之系統或核酸,其中第一靶向部分包含TAL效應域(例如第一TAL效應域)。
280. 如實施例118至260或279中任一例之系統或核酸,其中第二靶向部分包含TAL效應域(例如第二TAL效應域)。
281. 如實施例279或280中任一例之系統或核酸,其中第一TAL效應域包含至少2、4、6、8、10、12、14、16、18、20、22、24、26、28、30、32、34、36、38或40個中心重複(且視情況不超過45、40、35、30、25、20、15或10個中心重複)。
282. 如實施例279至281中任一例之系統或核酸,其中第一TAL效應域包含2至40、5至40、10至40、15至40、20至40、25至40、30至40、35至40、2至35、5至35、10至35、15至35、20至35、25至35、30至35、2至30、5至30、10至30、15至30、20至30、25至30、2至25、5至25、10至25、15至25、20至25、2至20、5至20、10至20、15至20、2至15、5至15、10至15、2至10、5至10或2至5個中心重複。
283. 如實施例279至282中任一例之系統或核酸,其中第二TAL效應域包含至少2、4、6、8、10、12、14、16、18、20、22、24、26、28、30、32、34、36、38或40個中心重複(且視情況不超過45、40、35、30、25、20、15或10個中心重複)。
284. 如實施例279至283中任一例之系統或核酸,其中第二TAL效應域包含2至40、5至40、10至40、15至40、20至40、25至40、30至40、35至40、2至35、5至35、10至35、15至35、20至35、25至35、30至35、2至30、5至30、10至30、15至30、20至30、25至30、2至25、5至25、10至25、15至25、20至25、2至20、5至20、10至20、15至20、2至15、5至15、10至15、2至10、5至10或2至5個中心重複。
285. 如實施例118至284中任一例之系統或核酸,其中第一靶向部分包含核酸(例如第一核酸)。
286. 如實施例129至285中任一例之系統,其中第二靶向部分包含核酸(例如第二核酸)。
287. 如實施例129至286中任一例之系統或核酸,其中第一靶向部分包含多肽(例如第一多肽)。
288. 如實施例129至287中任一例之系統或核酸,其中第二靶向部分包含多肽(例如第二多肽)。
289. 如實施例287或288之系統,其中核酸共價連接至多肽。
290. 如實施例288或289之系統,其中核酸與多肽非共價結合。
291. 如實施例275至290中任一例之系統或核酸,其中該核酸包含與轉錄調控元件或與其近接之序列互補的序列,或相對於該轉錄調控元件或與其近接的序列,包含不超過10、9、8、7、6、5、4、3、2或1個錯配。
292. 如實施例275至291中任一例之系統或核酸,其中該核酸包含與錨定序列或與其近接之序列互補的序列,或相對於該錨定序列或與其近接的序列,包含不超過10、9、8、7、6、5、4、3、2或1個錯配。
293. 如實施例275至292中任一例之系統,其中該核酸包含DNA、肽核酸(PNA)、肽-寡核苷酸偶聯物、鎖定核酸(LNA)、橋接核酸(BNA)、聚醯胺、成三螺旋體寡核苷酸、反義寡核苷酸、tRNA、mRNA、rRNA、miRNA、gRNA、siRNA或其他RNAi分子。
294. 如實施例275至293中任一例之系統,其中該核酸包含gRNA。
295. 如實施例275至294中任一例之系統,其中該核酸包含與SEQ ID NO: 1-4中之任一者至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過1、2、3、4、5、6、7、8、9或10。
296. 如實施例275至295中任一例之系統,其中第一核酸包含與SEQ ID NO: 1-4中之任一者至少80、85、90、95、99或100%一致的序列或與其有差異之位置數不超過1、2、3、4、5、6、7、8、9或10,且第二核酸包含與SEQ ID NO: 1-4中之任一者至少80、85、90、95、99或100%一致的序列或與其有差異之位置數不超過1、2、3、4、5、6、7、8、9或10。
297. 如實施例275至295中任一例之系統,其中第一核酸包含與SEQ ID NO: 96-110中之任一者至少80、85、90、95、99或100%一致的序列或與其有差異之位置數不超過1、2、3、4、5、6、7、8、9或10,且第二核酸包含與SEQ ID NO: 96-110中之任一者至少80、85、90、95、99或100%一致的序列或與其有差異之位置數不超過1、2、3、4、5、6、7、8、9或10。
298. 如實施例118至297中任一例之系統,其中該轉錄調控元件包含啟動子。
299. 如實施例118至298中任一例之系統,其中該轉錄調控元件包含增強子;例如超級增強子。
300. 如實施例118至299中任一例之系統,其中該錨定序列包含CTCF結合模體。
301. 如實施例118至300中任一例之系統,其中該錨定序列包含YY1結合模體。
302. 如實施例118至301中任一例之系統,其中該錨定序列包含序列SEQ ID NO: 71或72,或相對於其具有不超過8、7、6、5、4、3、2或1個變化的序列。
303. 如實施例118至302中任一例之系統,其中該錨定序列包含根據SEQ ID NO: 73或74的序列,或相對於其具有不超過8、7、6、5、4、3、2或1個變化的序列。
304. 如實施例118至303中任一例之系統,其中該錨定序列與MYC基因位於同一染色體上。
305. 如實施例118至304中任一例之系統,其中該錨定序列位於MYC基因上游(例如TSS上游或啟動子上游)。
306. 如實施例118至305中任一例之系統,其中該錨定序列與MYC基因(例如MYC基因之TSS或啟動子)相距至少1、5、10、50、100或1000千鹼基。
307. 如實施例118至306中任一例之系統,其中該錨定序列與MYC基因(例如MYC基因之TSS或啟動子)相距0.1至0.5、0.1至1、0.1至5、0.1至10、0.1至50、0.1至100、0.1至500、0.1至1000、0.5至1、0.5至5、0.5至10、0.5至50、0.5至100、0.5至500、0.5至1000、1至5、1至10、1至50、1至100、1至500、1至1000、5至10、5至50、5至100、5至500、5至1000、10至50、10至100、10至500、10至1000、50至 100、50至500、50至1000、100至500、100至1000或500至1000千鹼基。
308. 如實施例118至303或305至307中任一例之系統,其中該錨定序列與該MYC基因位於不同染色體上。
309. 如實施例118至308中任一例之系統,其中第二靶向部分以足以與內源多肽(例如CTCF或YY1)競爭結合的親和力結合至錨定序列或與錨定序列近接的序列。
310. 如實施例118至309中任一例之系統,其中該第一靶向部分結合至以下染色體座標的序列或與其近接的序列:128746342-128746364、128746321-128746343或128746525-128746547。
311. 如實施例118至309中任一例之系統,其中該第一靶向部分結合至以下染色體座標的序列或與其近接的序列:128746405-128746425、128748069-128748089、129188825-129188845或129188822-129188842。
312. 如實施例118至311中任一例之系統,其中第二靶向部分結合至染色體座標128748014-128748036的序列或與其近接的序列。
313. 如實施例118至311中任一例之系統,其中該第二靶向部分結合至以下染色體座標的序列或與其近接的序列:128746405-128746425、128748069-128748089、129188825-129188845或129188822-129188842。
314. 如實施例118至314中任一例之系統,其中第一表現抑制子為融合分子。
315. 如實施例118至314中任一例之系統,其中第二表現抑制子為融合分子。
316. 如實施例118至315中任一例之系統,其中第一表現抑制子包含連接子。
317. 如實施例118至316中任一例之系統,其中第二表現抑制子包含連接子。
318. 如實施例118至267或285至317中任一例之系統,其中:
第一表現抑制子包含:靶向部分,其包含第一CRISPR/Cas分子,例如包含無催化活性的第一CRISPR/Cas蛋白質;及效應部分,其包含表觀遺傳修飾部分;且
第二表現抑制子包含:靶向部分,其包含第二CRISPR/Cas分子,例如包含無催化活性的第二CRISPR/Cas蛋白質;及視情況存在的效應部分,其包含轉錄抑制子。
319. 如實施例118至260、268至278或285至317中任一例之系統,其中:
第一表現抑制子包含含有第一鋅指域的靶向部分及含有表觀遺傳修飾部分的效應部分;且
第二表現抑制子包含含有第二鋅指域的靶向部分及視情況存在之含有轉錄抑制子的效應部分。
320. 如實施例118至120、262、268或275至318中任一例之系統,其中:
第一表現抑制子包含:靶向部分,其包含CRISPR/Cas分子,例如包含無催化活性的CRISPR/Cas蛋白質;及效應部分,其包含表觀遺傳修飾部分;且
第二表現抑制子包含含有鋅指域的靶向部分及視情況存在之含有轉錄抑制子的效應部分。
321. 如實施例118至260、268、或275至318中任一例之系統,其中:
第一表現抑制子包含含有鋅指域的靶向部分及含有表觀遺傳修飾部分的效應部分;且
第二表現抑制子包含:靶向部分,其包含CRISPR/Cas域,例如包含無催化活性的CRISPR/Cas蛋白質;及視情況存在的效應部分,其包含轉錄抑制子。
322. 如實施例260、268至278、或275至318中任一例之系統,其中鋅指域(例如第一或第二鋅指域)包含1至10、1至9、1至8、1至7、1至6、1至5、1至4、1至3、1至2、2至10、2至9、2至8、2至7、2至6、2至5、2至4、2至3、3至10、3至9、3至8、3至7、3至6、3至5、3至4、4至10、4至9、4至8、4至7、4至6、4至5、5至10、5至9、5至8、5至 7、5至6、6至10、6至9、6至8、6至7、7至10、7至9、7至8、8至10、8至9或9至10個鋅指,例如3或9個鋅指。
323. 如實施例322中任一例之系統,其中該表觀遺傳修飾部分包含DNA甲基轉移酶。
324. 如實施例118至323中任一例之系統,其中該表觀遺傳修飾部分包含MQ1或其功能變異體或片段。
325. 如實施例118至324中任一例之系統,其中該第二表現抑制子包含含有轉錄抑制子之效應部分。
326. 如實施例118至323中任一例之系統,其中該轉錄抑制子包含KRAB或其功能變異體或片段。
327. 如實施例118至326中任一例之系統,其中第一表現抑制子包含SEQ ID NO: 28-33或35-37、145-149、151、152中之任一者的胺基酸序列,或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
328. 如實施例118至327中任一例之系統,其中第二表現抑制子包含SEQ ID NO: 22-27、34、139-144、150、177-180、183-186中之任一者的胺基酸序列,或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
329. 如實施例118至328中任一例之系統,其中第一表現抑制子結合至轉錄調控元件或與其近接的序列減少MYC在細胞中之表現。
330. 如實施例327之系統,其中相較於第一表現抑制子不存在下的表現,該系統達成的表現減少10、20、30、40、50、60、70、80、90或100%,例如如藉由QPCR或ELISA所量測。
331. 如實施例326或327之系統,其中第一表現抑制子結合至轉錄調控元件使MYC表現明顯減少以下時段:至少1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24或25天,或至少1、2、3、4、5、6、7、8、9或10次細胞分裂,例如如藉由QPCR或ELISA所量測。
332. 如實施例329至331中任一例之系統,其中在轉染後的第1、2、3、4、5、6、7、8、10、12、16、20、24、28、32、36、40、44、48、52、56、60、64、68、72、76、80或96小時,第一表現抑制子結合至轉錄調控元件明顯地減少MYC表現。
333. 如實施例328至332中任一例之系統,其中第二表現抑制子結合至錨定序列或與其近接的序列減少MYC在細胞中的表現。
334. 如實施例333之系統,其中相較於第二表現抑制子不存在下的表現,該系統達成的表現減少10、20、30、40、50、60、70、80、90或100%,例如如藉由QPCR或ELISA所量測。
335. 如實施例333或334之系統,其中第二表現抑制子結合至錨定序列或與其近接之序列使MYC表現減少以下時段:至少1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24或25天,或至少1、2、3、4、5、6、7、8、9或10次細胞分裂,例如如藉由QPCR或ELISA所量測。
336. 如實施例334至335中任一例之系統,其中在轉染後的第1、2、3、4、5、6、7、8、10、12、16、20、24、28、32、36、40、44、48、52、56、60、64、68、72、76、80或96小時,第二表現抑制子結合至錨定序列或與其近接的序列明顯地減少MYC表現。
337. 如實施例329至336中任一例之系統,其中第一表現抑制子結合至轉錄調控元件或與其近接的序列及第二表現抑制子結合至錨定序列或與其近接的序列減少MYC在細胞中的表現。
338. 如實施例329至337中任一例之系統,其中在轉染後的第1、2、3、4、5、6、7、8、10、12、16、20、24、28、32、36、40、44、48、52、56、60、64、68、72、76、80或96小時,第一表現抑制子結合至轉錄調控元件或與其近接的序列及第二表現抑制子結合至錨定序列或與其近接的序列明顯地減少MYC表現。
339. 如實施例337或338中任一例之系統,其中相較於第一與第二表現抑制子不存在下的表現,該系統達成的表現減少10、20、30、40、50、60、70、80、90或100%,例如如藉由QPCR或ELISA所量測。
340. 如實施例329至339中任一例之系統,其中第一表現抑制子結合至轉錄調控元件或與其近接的序列及第二表現抑制子結合至錨定序列或與其近接的序列使MYC表現明顯地減少以下時段:至少1、2、3、4、5、6、7、8、9、10、11或12個小時,或至少1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24或25天,或至少1、2、3、4、5、6、7、8、9或10次細胞分裂,例如如藉由QPCR或ELISA所量測。
341. 如實施例329至340中任一例之系統,其中第一表現抑制子結合至轉錄調控元件或與其近接的序列及第二表現抑制子結合至錨定序列或與其近接的序列所引起的表現減少大於由第一表現抑制子結合至轉錄調控元件或與其近接的序列或第二表現抑制子結合至錨定序列或與其近接的序列個別地引起的表現減少。
342. 如實施例341之系統,其中第一表現抑制子結合至轉錄調控元件或與其近接的序列及第二表現抑制子結合至錨定序列或與其近接的序列使表現減少,減少的幅度為由第一表現抑制子結合至轉錄調控元件或與其近接的序列或第二表現抑制子結合至錨定序列或與其近接的序列個別地引起減少之幅度的1.05x (亦即,1.05倍)、1.1x、1.15x、1.2x、1.25x、1.3x、1.35x、1.4x、1.45x、1.5x、1.6x、1.7x、1.8x、1.9x、2x、3x、4x、5x、6x、7x、8x、9x、10x、20x、50x或100x,例如如藉由QPCR或ELISA所量測。
343. 如實施例329至342中任一例之系統,其中相較於由第一表現抑制子結合至轉錄調控元件或與其近接的序列或第二表現抑制子結合至錨定序列或與其近接的序列個別地引起的表現減少,第一表現抑制子結合至轉錄調控元件或與其近接的序列及第二表現抑制子結合至錨定序列或與其近接的序列所引起的表現減少保持更長的時間(例如更多的小時數、天數或細胞分裂數)。
344. 如實施例343之系統,其中第一表現抑制子結合至轉錄調控元件或與其近接的序列及第二表現抑制子結合至錨定序列或與其近接的序列使表現減少,表現減少的時間為由第一表現抑制子結合至轉錄調控元件或與其近接的序列或第二表現抑制子結合至錨定序列或與其近接的序列個別地引起表現減少之時間的1.05x (亦即,1.05倍)、1.1x、1.15x、1.2x、1.25x、1.3x、1.35x、1.4x、1.45x、1.5x、1.6x、1.7x、1.8x、1.9x、2x、3x、4x、5x、6x、7x、8x、9x、10x、20x、50x或100x,例如如藉由QPCR或ELISA所量測。
345. 如實施例329至344中任一例之系統,其中第一表現抑制子結合至啟動子或與其近接的序列及第二表現抑制子結合至超級增強子或與其近接的序列使MYC表現減少以下時段:至少1、2、3、4、5、6、7、8、9、10、11或12小時或至少1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24或25天,或至少1、2、3、4、5、6、7、8、9或10次細胞分裂,例如如藉由QPCR或ELISA所量測。
346. 如實施例329至345中任一例之系統,其中第一表現抑制子結合至啟動子或與其近接的序列及第二表現抑制子結合至超級增強子或與其近接的序列所引起的表現減少大於由第一表現抑制子結合至啟動子或與其近接的序列或第二表現抑制子結合至超級增強子或與其近接的序列個別地引起的表現減少。
347. 如實施例346之系統,其中第一表現抑制子結合至啟動子或與其近接的序列及第二表現抑制子結合至超級增強子或與其近接的序列使表現減少,減少的幅度為由第一表現抑制子結合至啟動子或與其近接的序列或第二表現抑制子結合至超級增強子或與其近接的序列個別地引起減少之幅度的1.05x (亦即,1.05倍)、1.1x、1.15x、1.2x、1.25x、1.3x、1.35x、1.4x、1.45x、1.5x、1.6x、1.7x、1.8x、1.9x、2x、3x、4x、5x、6x、7x、8x、9x、10x、20x、50x或100x,例如如藉由QPCR或ELISA所量測。
348. 如實施例329至347中任一例之系統,其中相較於由第一表現抑制子結合至啟動子或與其近接的序列或第二表現抑制子結合至超級增強子或與其近接的序列個別地引起表現減少的時間,第一表現抑制子結合至啟動子或與其近接的序列及第二表現抑制子結合至超級增強子或與其近接的序列所引起的表現減少保持更長的時間(例如更多的小時數、天數或細胞分裂數)。
349. 如實施例348之系統,其中第一表現抑制子結合至啟動子或與其近接的序列及第二表現抑制子結合至超級增強子或與其近接的序列使表現減少的時間為由第一表現抑制子結合至啟動子或與其近接的序列或第二表現抑制子結合至超級增強子或與其近接的序列個別地引起表現減少之時間的1.05x (亦即,1.05倍)、1.1x、1.15x、1.2x、1.25x、1.3x、1.35x、1.4x、1.45x、1.5x、1.6x、1.7x、1.8x、1.9x、2x、3x、4x、5x、6x、7x、8x、9x、10x、20x、50x或100x長(例如以小時數、天數或細胞分裂數量度),例如如藉由QPCR或ELISA所量測。
350. 如實施例329至349中任一例之系統,其中表現無限期地(例如大於實驗上可量度的時段)明顯減少。
351. 如實施例329至350中任一例之系統,其中第一表現抑制子結合至轉錄調控元件或與其近接的序列使包含轉錄調控元件或與其近接之序列的細胞存活率降低。
352. 如實施例329至351中任一例之系統,其中複數個細胞與第一表現抑制子或編碼第一表現抑制子的核酸接觸使該複數個細胞的存活率降低,視情況其中複數個細胞包含癌細胞及非癌細胞及/或感染細胞及未感染細胞。
353. 如實施例352之系統,其中相較於第一表現抑制子不存在下的存活率,該系統達成的存活率降低10、20、30、40、50、60、70、80、90或100%,例如如藉由CellTiter Glo所量測。
354. 如實施例329至353中任一例之系統,其中第一表現抑制子的投與使得靶細胞(例如癌細胞)中的至少5%、6%、7%、8%、9% 10%、12%、15%、17% 20%、25% 30%、40%、45%、50%、55%、60%、65%、75%發生細胞凋亡。
355. 如實施例329至354中任一例之系統,其中第二表現抑制子結合至錨定序列或與其近接的序列使包含錨定序列或與其近接之序列的細胞存活率降低。
356. 如實施例329至355中任一例之系統,其中使複數個細胞與第二表現抑制子或編碼第二表現抑制子之核酸接觸降低了該複數個細胞之存活率。
357. 如實施例329至356中任一例之系統,其中第二表現抑制子結合至超級增強子或與其近接的序列使包含轉錄調控元件或與其近接之序列的細胞存活率降低。
358. 如實施例329至357中任一例之系統,其中複數個細胞與第二表現抑制子或編碼第一表現抑制子的核酸接觸使該複數個細胞的存活率降低,視情況其中複數個細胞包含癌細胞及非癌細胞及/或感染細胞及未感染細胞。
359. 如實施例358之系統,其中相較於第二表現抑制子不存在下的存活率,該系統達成的存活率降低10、20、30、40、50、60、70、80、90或100%,例如如藉由CellTiter Glo所量測。
360. 如實施例329至359中任一例之系統,其中第二表現抑制子的投與使得靶細胞(例如癌細胞)中的至少5%、6%、7%、8%、9%、10%、12%、15%、17%、20%、25%、30%、40%、45%、50%、55%、60%、65%、75%發生細胞凋亡。
361. 如實施例329至360中任一例之系統,其中第一表現抑制子結合至轉錄調控元件或與其近接的序列及第二表現抑制子結合至錨定序列或與其近接的序列使包含錨定序列或與其近接之序列之細胞的存活率降低。
362. 如實施例329至361中任一例之系統,其中第一表現抑制子結合至啟動子或與其近接的序列及第二表現抑制子結合至超級增強子或與其近接的序列使細胞存活率降低。
363. 如實施例329至362中任一例之系統,其中複數個細胞與系統或編碼該系統的核酸接觸使該複數個細胞的存活率降低。
364. 如實施例329至363之系統,其中相較於該系統不存在下的存活率,該系統達成的存活率降低10、20、30、40、50、60、70、80、90或100%,例如如藉由CellTiter Glo所量測。
365. 如實施例329至364中任一例之系統,其中第一表現抑制子結合至轉錄調控元件或與其近接的序列及第二表現抑制子結合至錨定序列或與其近接的序列所引起的存活率降幅大於由第一表現抑制子結合至轉錄調控元件或與其近接的序列或第二表現抑制子結合至錨定序列或與其近接的序列個別地引起的存活率降幅。
366. 如實施例329至365中任一例之系統,其中第一表現抑制子結合至啟動子或與其近接的序列及第二表現抑制子結合至超級增強子或與其近接的序列所引起的存活率降幅大於由第一表現抑制子結合至啟動子或與其近接的序列或第二表現抑制子結合至超級增強子或與其近接的序列個別地引起的存活率降幅。
367. 如實施例366之系統,其中第一表現抑制子結合至轉錄調控元件或與其近接的序列及第二表現抑制子結合至錨定序列或與其近接的序列使存活率降低的幅度為由第一表現抑制子結合至轉錄調控元件或與其近接的序列或第二表現抑制子結合至錨定序列或與其近接的序列個別地引起存活率降低幅度的1.05x (亦即,1.05倍)、1.1x、1.15x、1.2x、1.25x、1.3x、1.35x、1.4x、1.45x、1.5x、1.6x、1.7x、1.8x、1.9x、2x、3x、4x、5x、6x、7x、8x、9x、10x、20x、50x或100x,例如如藉由CellTiter Glo所量測。
368. 如實施例366或367之系統,其中第一表現抑制子結合至啟動子或與其近接的序列及第二表現抑制子結合至超級增強子或與其近接的序列使存活率降低的幅度為由第一表現抑制子結合至啟動子或與其近接的序列或第二表現抑制子結合至超級增強子或與其近接的序列個別地引起存活率降低幅度的1.05x (亦即,1.05倍)、1.1x、1.15x、1.2x、1.25x、1.3x、1.35x、1.4x、1.45x、1.5x、1.6x、1.7x、1.8x、1.9x、2x、3x、4x、5x、6x、7x、8x、9x、10x、20x、50x或100x,例如如藉由CellTiter Glo所量測。
369. 如實施例329至368中任一例之系統,其中第一表現抑制子及第二表現抑制子的投與使得靶細胞(例如癌細胞)中的至少5%、6%、7%、8%、9%、10%、12%、15%、17%、20%、25%、30%、40%、45%、50%、55%、60%、65%、75%發生細胞凋亡。
370. 如實施例329至369之系統,其中該複數個細胞包含複數個癌細胞及複數個非癌細胞。
371. 如實施例370之系統,其中複數個細胞與系統或編碼該系統的核酸接觸使複數個癌細胞的存活率降低,降幅超過其使複數個非癌細胞之存活率降低的幅度。
372. 如實施例370或371之系統,其中複數個細胞與系統或編碼該系統的核酸接觸使複數個癌細胞之存活率降低,降幅為其使複數個非癌細胞之存活率降低之幅度的1.05x (亦即,1.05倍)、1.1x、1.15x、1.2x、1.25x、1.3x、1.35x、1.4x、1.45x、1.5x、1.6x、1.7x、1.8x、1.9x、2x、3x、4x、5x、6x、7x、8x、9x、10x、20x、50x或100x。
373. 如任一前述實施例之表現抑制子或系統,其使非癌細胞(例如原代肝細胞)之存活率降低不超過5、10、15或20%,例如根據實例29分析時。
374. 如實施例320之表現抑制子或系統,其中在細胞與該表現抑制子或系統接觸之後的第72小時分析存活率。
375. 如實施例374之表現抑制子或系統,其中該分析包含使非癌細胞與2.5、2、1.25、1、0.6或0.5 μg/ml之該表現抑制子或系統接觸。
376. 如實施例352至375中任一例之系統,該系統當與複數個感染細胞及複數個未感染細胞接觸時,使複數個感染細胞之存活率降低的幅度超過其使複數個未感染細胞之存活率降低的幅度且/或使複數個癌細胞之存活率降低的幅度超過其使複數個非癌細胞之存活率降低的幅度。
377. 如實施例352至376中任一例之系統,其中該癌症為肝細胞癌(HCC)、纖維板層肝細胞癌(FHCC)、膽管癌、血管肉瘤、繼發性肝癌、非小細胞肺癌(NSCLC)、腺癌、小細胞肺癌(SCLC)、大細胞(未分化)癌瘤、三陰性乳癌、胃腺癌、子宮內膜癌或胰臟癌。
378. 如實施例352至377中任一例之系統,其中該癌細胞為肺癌細胞、胃癌細胞、胃腸癌細胞、結腸直腸癌細胞、胰臟癌細胞或肝癌細胞。
379. 如實施例352至378中任一例之系統,其中該等細胞為人類肺上皮細胞或人類肺纖維母細胞。
380. 如實施例352至379中任一例之系統,其中該感染為病毒感染。
381. 如實施例380之表現抑制子,其中該病毒感染為肝炎,例如B型肝炎。
382. 如實施例378至381中任一例之系統,其中該等感染細胞為人類肝細胞。
383. 如實施例352至382中任一例之系統,其中該病毒感染為慢性感染。
384. 一種融合蛋白,其包含:
第一胺基酸區域,其包含編碼如實施例118至383中任一例之系統中之第一表現抑制子的序列;及
第二胺基酸區域,其包含編碼如實施例118至383中任一例之系統中之第二表現抑制子的序列。
385. 如實施例384之融合蛋白,其包含第三胺基酸區域,其中該第三胺基酸區域位於第一胺基酸區域與第二胺基酸區域之間。
386. 如實施例385之融合蛋白,其中第三胺基酸區域包含蛋白酶裂解肽序列,例如自裂解肽序列,例如T2A自裂解肽序列,例如根據SEQ ID NO: 120的序列。
387. 如實施例386之融合蛋白,其中第三胺基酸區域包含蛋白酶裂解肽序列,例如自裂解肽序列,例如串聯2A肽序列,例如tPT2A序列,例如根據SEQ ID NO: 124的序列。
388. 如實施例385之融合蛋白,其中該肽序列包含T2A肽序列及P2A肽序列。
389. 如實施例384至388中任一例之融合蛋白,其中:
該第一表現抑制子包含根據SEQ ID NO: 30或129的胺基酸序列,或與其至少80、85、90、95或99%一致的序列;且
該第二表現抑制子包含根據SEQ ID NO: 24或142的胺基酸序列或與其至少80、85、90、95或99%一致的序列。
390. 如實施例384至388中任一例之融合蛋白,其中:
該第一表現抑制子包含根據SEQ ID NO: 30或129的胺基酸序列或與其至少80、85、90、95或99%一致的序列;且
該第二表現抑制子包含根據SEQ ID NO: 177或183的胺基酸序列或與其至少80、85、90、95或99%一致的序列。
391. 如實施例384至388中任一例之融合蛋白,其中:
該第一表現抑制子包含根據SEQ ID NO: 30或129的胺基酸序列或與其至少80、85、90、95或99%一致的序列;且
該第二表現抑制子包含根據SEQ ID NO: 179或185的胺基酸序列或與其至少80、85、90、95或99%一致的序列。
392. 如實施例384至391中任一例之融合蛋白,其包含胺基酸序列SEQ ID NO: 91、92、121或122,或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
393. 如實施例384至392中任一例之融合蛋白,其包含胺基酸序列SEQ ID NO: 181、182、187或188,或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
394. 一種核酸,其包含編碼如實施例118至393中任一例之系統的序列。
395. 一種核酸,其包含編碼如實施例394之系統的序列。
396. 如實施例394或395之核酸,其包含:
第一區域,其包含編碼如實施例118至393中任一例之系統中之第一表現抑制子的序列;及
第二區域,其包含編碼如實施例118至393中任一例之系統中之第二表現抑制子的序列。
397. 如實施例394至396中任一例之核酸,其包含第三區域,其中該第三區域位於第一區域與第二區域之間。
398. 如實施例394至397中任一例之核酸,其中該第三區域編碼核糖體跳讀序列。
399. 如實施例397或398之核酸,其中該第三區域編碼tPT2A肽序列,例如根據SEQ ID NO: 124的序列。
400. 如實施例397至399中任一例之核酸,其中該第三區域編碼蛋白酶裂解肽序列,例如自裂解肽序列,例如T2A自裂解肽序列,例如根據SEQ ID NO: 95的序列。
401. 如實施例397至400中任一例之核酸,其中該第三區域編碼蛋白酶裂解肽序列,例如自裂解肽序列,例如串聯2A肽序列,例如tPT2A肽序列,例如根據SEQ ID NO: 124的序列。
402. 如實施例394至401中任一例之核酸,其中
該第一表現抑制子包含根據SEQ ID NO: 30或129的胺基酸序列或與其至少80、85、90、95或99%一致的序列;且
該第二表現抑制子包含根據SEQ ID NO: 24、142的胺基酸序列或與其至少80、85、90、95或99%一致的序列。
403. 如實施例394至401中任一例之核酸,其中
該第一表現抑制子包含根據SEQ ID NO: 30、129的胺基酸序列或與其至少80、85、90、95或99%一致的序列;且
該第二表現抑制子包含根據SEQ ID NO: 177、179、183或185的胺基酸序列或與其至少80、85、90、95或99%一致的序列。
404. 如實施例394至403中任一例之核酸,其編碼SEQ ID NO: 91、92、121、122之胺基酸序列或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
405. 如實施例394至404中任一例之核酸,其編碼胺基酸序列SEQ ID NO: 181、182、187、188,或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
406. 如實施例394至405中任一例之核酸,其包含核苷酸序列SEQ ID NO: 93、94、112或113或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
407. 如實施例394至406中任一例之核酸,其包含核苷酸序列SEQ ID NO: 196、197,或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
408. 一種核酸,其包含編碼如實施例1至407中任一例之表現抑制子或表現抑制系統的序列。
409. 如實施例394至408中任一例之核酸,其為RNA,例如mRNA。
410. 如實施例394至409中任一例之核酸,其包含N7甲基化鳥苷,例如連接至RNA的5'端,例如經由逆向5'至5'三磷酸酯鍵聯。
411. 如實施例394至410中任一例之核酸,其包含5' UTR。
412. 如實施例394至411中任一例之核酸,其包含Kozak序列,例如介於5' UTR與編碼表現抑制子的序列之間。
413. 一種系統,其包含:
第一核酸,其包含編碼如實施例118至393中任一例之系統中之第一表現抑制子的序列;及
第二核酸,其包含編碼第二表現抑制子(例如如實施例118至393中任一例之系統中的第二表現抑制子)的序列。
414. 如實施例413之系統,其中該第一核酸具有核苷酸序列SEQ ID NO: 63、130,或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,且該第二核酸具有核苷酸序列SEQ ID NO: 57,或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
415. 如實施例414之系統,其中該第一核酸具有核苷酸序列SEQ ID NO: 63、130,或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,且該第二核酸具有核苷酸序列SEQ ID NO: 189或194,或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
416. 如實施例415之系統,其中該第一核酸具有核苷酸序列SEQ ID NO: 189、194,或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列,且該第二核酸具有核苷酸序列SEQ ID NO: 63或130,或與其至少80、85、90、95或99%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
417. 如實施例394至416中任一例之核酸或系統,其中該核酸包含mRNA。
418. 一種載體,其包含編碼如任一前述實施例之系統或表現抑制子的核酸。
419. 一種脂質奈米粒子,其包含如任一前述實施例之系統、核酸、mRNA或載體。
420. 如實施例419之脂質奈米粒子,其包含可離子化脂質,例如陽離子脂質,例如MC3、SSOP。
421. 如實施例419或420之脂質奈米粒子,其進一步包含以下中的一或多者:中性脂質、可離子化含胺脂質、生物可降解炔基化脂質、類固醇、磷脂、多元不飽和脂質、結構脂質(例如固醇)、PEG、膽固醇,或聚合物偶聯的脂質。
422. 一種反應混合物,其包含如任一前述實施例之表現抑制子、系統、核酸、載體或脂質奈米粒子。
423. 如實施例422之反應混合物,其進一步包含細胞。
424. 一種醫藥組合物,其包含如任一前述實施例之表現抑制子、系統、核酸、載體、脂質奈米粒子或反應混合物。
425. 一種減少細胞中之MYC基因表現的方法,該方法包含:
使該細胞(例如癌細胞)與如實施例1至424中任一例之表現抑制子、系統、一或多種編碼該系統或表現抑制子之核酸、載體、脂質奈米粒子或醫藥組合物接觸,
藉此減少該細胞中的MYC基因表現。
426. 一種治療有需要之個體之癌症的方法,該方法包含:
將如實施例1至424中任一例之表現抑制子、系統、核酸、載體、脂質奈米粒子或醫藥組合物投與該個體,
藉此治療該個體之癌症。
427. 一種減少有需要之個體之腫瘤生長的方法,該方法包含:
將如實施例1至424中任一例之表現抑制子、系統、核酸、載體、脂質奈米粒子或醫藥組合物投與該個體,
藉此減小該個體的腫瘤尺寸。
428. 如實施例427之方法,其中腫瘤生長的減少包含腫瘤體積相較於治療開始時的腫瘤體積減小。
429. 如實施例428之方法,其中該個體之腫瘤生長的減少幅度大於未治療之個體。
430. 一種增強或恢復癌症對激酶抑制劑(例如索拉非尼)敏感性的方法,該方法包含將本文所述之表現抑制子或系統投與患有該癌症的個體。
431. 如實施例430之方法,其中該表現抑制子或系統的投與使激酶抑制劑的IC
50降低10%、20%、30%或40%,例如在癌細胞存活率分析中,例如在根據實例38的分析中。
432. 如實施例430或431之方法,其中該激酶抑制劑抑制VEGFR、PDGFR或RAF激酶中的一或多者(例如全部)。
433. 一種增強或恢復癌症對溴域抑制劑(例如BET抑制劑,例如JQ1)敏感性的方法,該方法包含將本文所述(例如實施例1至423中之任一例)之表現抑制子、系統或核酸投與患有該癌症的個體,其中視情況,該表現抑制子或系統的投與使該溴域抑制劑的IC
50降低10%、20%、30%、40%、50%、60%、70%、80%、90%或95%,例如在癌細胞存活率分析中,例如在根據實例39的分析中。
434. 如實施例433之方法,其中該溴域抑制劑為或包含JQ1、BET672或必納昔布(birabresib)。
435. 一種增強或恢復癌症對MEK抑制劑(例如曲美替尼(Trametinib))敏感性的方法,該方法包含將本文所述(例如實施例1至423中之任一例)之表現抑制子、系統或核酸投與患有該癌症的個體,其中視情況,該表現抑制子或系統的投與使該MEK抑制劑的IC
50降低10%、20%、30%、40%、50%、60%、70%、80%、90%或95%,例如在癌細胞存活率分析中,例如在根據實例51的分析中。
436. 如實施例427至435中任一例之方法,其中該個體之腫瘤生長的減少幅度大於或類似於當個體用化學治療劑或小分子MYC抑制劑治療時的腫瘤尺寸減小幅度。
437. 如實施例436之方法,其中該化學治療劑為索拉非尼或順鉑。
438. 如實施例437之方法,其中該小分子MYC抑制劑為MYCi975。
439. 一種減小有需要之個體之腫瘤尺寸的方法,該方法包含:
將如1至424之表現抑制子、系統、核酸、載體、脂質奈米粒子或醫藥組合物投與該個體,其中腫瘤尺寸的減小幅度大於或類似於當個體用化學治療劑治療時的腫瘤尺寸減小幅度。
440. 如439之方法,其中該化學治療劑為索拉非尼或順鉑。
441. 如任一前述實施例之方法,其中與化學治療劑或小分子MYC抑制劑治療時相比,該個體未經歷任何顯著副作用。
442. 如實施例436至441中任一例之方法,其中該化學治療劑為索拉非尼或順鉑。
443. 如實施例442之方法,其中該小分子MYC抑制劑為MYCi975。
444. 如實施例426至443中任一例之方法,其中該癌症為I期、II期、III期或IV期癌症。
445. 如任一前述實施例之方法,其中該個體之體重在治療前與治療後保持大致相同。
446. 如前述實施例中任一例之方法,其中該個體未經歷體重的降低,或其中該個體的體重相較於治療開始時降幅小於3%、2%或1%。
447. 如前述實施例中任一項之方法,其中相較於該個體在治療之前的體重,該個體在治療後未經歷體重的降低或增加。
448. 一種治療有需要之個體之肝病的方法,該方法包含:
向該個體投與表現抑制子,其中該表現抑制子包含靶向部分,該靶向部分結合MYC基因座(例如MYC轉錄區域、MYC啟動子,或包含MYC基因之錨定序列介導性接合體(ASMC)中的錨定序列或近接於該錨定序列的序列);及視情況存在的效應部分,例如本文所述之效應部分;
藉此治療該個體的該肝病。
449. 如實施例447之方法,其進一步包含向該個體投與第二表現抑制子,該第二表現抑制子包含:靶向部分,該靶向部分結合至包含靶基因(例如MYC)之錨定序列介導性接合體(ASMC)中的錨定序列;及視情況存在的第二效應部分,例如本文所述之效應部分,例如KRAB;
藉此治療該個體的該肝病。
450. 一種治療有需要之個體之肝病的方法,該方法包含:
將如實施例1至424中任一例之表現抑制子、系統、核酸、載體、脂質奈米粒子或醫藥組合物投與該個體,
藉此治療該個體之該肝病。
451. 如實施例450之方法,其中該肝病為慢性肝病。
452. 如實施例450或451之方法,其中該肝病與病毒或酒精有關。
453. 如實施例450至452中任一例之方法,其中該肝病為肝炎或肝細胞癌。
454. 如實施例453之方法,其中該肝細胞癌選自HCC亞型S1、HCC亞型S2或HCC亞型S3。
455. 如實施例453或454之方法,其中該肝細胞癌為HCC S1。
456. 如實施例453或454之方法,其中該肝細胞癌為HCC S2。
457. 如實施例450至456中任一例之方法,其中該肝病係由B型肝炎病毒或C型肝炎病毒引起。
458. 一種治療有需要之個體之肺病的方法,該方法包含:
向該個體投與表現抑制子,其中該表現抑制子包含靶向部分,該靶向部分結合MYC基因座(例如MYC轉錄區域、MYC啟動子,或包含MYC基因之錨定序列介導性接合體(ASMC)中的錨定序列或近接於該錨定序列的序列);及視情況存在的效應部分,例如本文所述之效應部分;
藉此治療該個體之該肺病。
459. 如實施例458之方法,其進一步包含向該個體投與第二表現抑制子,該第二表現抑制子包含:靶向部分,該靶向部分結合位於靶基因(例如MYC)之超級增強子區域中的基因體基因座;及視情況存在的第二效應部分,例如本文所述之效應部分,例如KRAB;
藉此治療該個體之該肺病。
460. 一種治療有需要之個體之肺病的方法,該方法包含:
將如實施例1至424中任一例之表現抑制子、系統、核酸、載體、脂質奈米粒子或醫藥組合物投與該個體,
藉此治療該個體之該肺病。
461. 如實施例459或460之方法,其中該肺病為癌症,例如肺癌,例如肺癌瘤,例如非小細胞肺癌或小細胞肺癌。
462. 如實施例425至461中任一例之方法,其中接觸或投藥包含靜脈內投與個體。
463. 如實施例425至462中任一例之方法,其中接觸或投藥包含腫瘤內遞送(例如注射)。
464. 如實施例425至463中任一例之方法,其中該癌症之特徵為MYC表現相對於參考水準(例如相對於參考細胞的MYC表現,例如該個體之在其他方面類似的非癌細胞)增加。
465. 如實施例426至464中任一例之方法,其中該癌症之特徵為MYC基因之一部分或全部複製。
466. 如實施例426至465中任一例之方法,其中該癌症係選自結腸直腸癌、乳癌、AML、前列腺癌、神經母細胞瘤、肺癌、子宮內膜癌、肝癌、淋巴瘤(例如伯基特淋巴瘤)、子宮頸癌或胃癌。
467. 如實施例426至466中任一例之方法,其中該癌症為人類絨毛膜促性腺激素(hCG)分泌性癌症。
468. 如實施例426至467中任一例之方法,其中該癌症為肝癌。
469. 如實施例426至468中任一例之方法,其中該癌症為非反應性癌症,例如非反應性肝癌。
470. 如實施例426至469中任一例之方法,其中該癌症為非小細胞肺癌或小細胞肺癌。
471. 如實施例426至470中任一例之方法,其中該癌症過度表現α-胎蛋白(AFP)(例如相對於參考細胞的AFP表現,例如該個體之在其他方面類似的非癌細胞)。
472. 如實施例431至471中任一例之方法,其中該癌症之細胞的特徵為存在超級增強子,例如包含MYC基因或包含含有MYC基因之錨定序列介導性接合體,其中視情況,該癌症係選自肝癌、結腸直腸癌、乳癌、AML、前列腺癌、神經母細胞瘤、肺癌或子宮內膜癌。
472. 如實施例471之方法,其中該表現抑制子(例如第二表現抑制子)結合至包含MYC基因之錨定序列介導性接合體(ASMC)中的錨定序列或結合至近接於該錨定序列的序列。
473. 如實施例426至472中任一例之方法,其中該癌症之細胞的特徵為缺乏超級增強子,該超級增強子包含MYC基因或包含含有MYC基因的錨定序列介導性接合體。
474. 如實施例473之方法,其中該表現抑制子(例如該第一表現抑制子)結合MYC啟動子。
475. 如實施例426至474中任一例之方法,其中該癌症包含:包含超級增強子的細胞,該超級增強子包含MYC基因或包含含有MYC基因的錨定序列介導性接合體;及不包含超級增強子的細胞,該超級增強子包含MYC基因或包含含有MYC基因的錨定序列介導性接合體。
476. 如請求項426至475中任一例之方法,其中該癌症包含以MYC表現相對於參考水準(例如相對於參考細胞的MYC表現,例如該個體之在其他方面類似的非癌細胞)增加為特徵的細胞,及不以MYC表現相對於參考水準(例如相對於參考細胞的MYC表現,例如該個體之在其他方面類似的非癌細胞)增加為特徵(例如具有正常的MYC表現)的細胞。
477. 如實施例426至476中任一例之方法,其中該表現抑制子、系統、核酸、載體、脂質奈米粒子或醫藥組合物作為單一療法投與。
478. 如實施例426至477中任一例之方法,其包含向該個體投與複數次劑量的該表現抑制子、系統、核酸、載體、脂質奈米粒子或醫藥組合物,例如至少2、3、4、5或6次劑量。
479. 如實施例426至479中任一例之方法,其包含以5天時間間隔向該個體投與複數次劑量的該表現抑制子、系統、核酸、載體、脂質奈米粒子或醫藥組合物。
480. 如實施例426至479中任一例之方法,其包含:
a)首先向該個體投與第一複數次劑量的本文所述(例如實施例1至424中任一例)之表現抑制子或系統,其中視情況,在第一複數次劑量中之前一劑量之後的第5天投與第一複數次劑量中的各後一劑量;
b)其次,停止投與該表現抑制子或系統一段時間(「藥物假期」),例如約2週,及
c)第三,向該個體投與第二複數次劑量的該表現抑制子或系統,其中視情況,在該第二複數次劑量中之前一劑量之後的第5天,投與該第二複數次劑量中的後一劑量。
481. 如實施例480之方法,其中該第一複數次劑量包含4次劑量。
482. 如實施例479或480之方法,其中該第二複數次劑量包含2次劑量。
483. 如實施例480至482中任一例之方法,其中該個體在藥物假期期間完全不接受治療劑。
484. 如實施例480至483中任一例之方法,其中該個體在藥物假期期間接受第二治療劑。
485. 如實施例480至484中任一例之方法,其中該藥物假期為第一複數次劑量中之各劑量投與之間的時間的至少兩倍長。
486. 如實施例480至486中任一例之方法,其中該藥物假期為第二複數次劑量中之各劑量投與之間的時間的至少兩倍。
487. 如實施例426至486中任一例之方法,其中在該表現抑制子或系統治療之後,腫瘤體積降低至不可偵測的水準。
488. 如請求項426至487中任一項之方法,在該表現抑制子或系統的治療停止之後,腫瘤體積降低(例如降至不可偵測的水準)。
489. 如實施例425至488中任一例之方法,其中該癌症對該表現抑制子或系統不具有抗性,或在10、20、30、40、50或60天的時段內對該表現抑制子或系統不具有抗性。
490. 如實施例425至489中任一例之方法,其中該等癌細胞具有功能性細胞凋亡路徑。
491. 如實施例425至490中任一例之方法,其中該等癌細胞具有功能性凋亡蛋白酶3。
492. 如實施例491之方法,其中該表現抑制子或系統投與該個體後,癌細胞中的凋亡蛋白酶3上調。
493. 如實施例425至492中任一例之方法,其中該表現抑制子或系統投與該個體後,癌細胞中的Ki67下調。
494. 如實施例425至493中任一例之方法,其中該表現抑制子或系統投與該個體後,癌細胞增殖衰減。
495. 如實施例425至494中任一例之方法,其中該方法進一步包含
a.使該細胞與第二治療劑接觸,或
b.向該個體投與第二治療劑。
496. 如實施例495之方法,其中該第二治療劑不為結合至MYC啟動子的表現抑制子。
497. 如實施例495或496之方法,其中該第二治療劑不為如實施例1至424中任一例之表現抑制子、系統、融合蛋白、核酸、載體、反應混合物、醫藥組合物或脂質奈米粒子。
498. 如實施例494至496中任一例之方法,其中該第二治療劑為如實施例1至424中任一例之表現抑制子、系統、融合蛋白、核酸、載體、反應混合物、醫藥組合物或脂質奈米粒子。
499. 如實施例495至497中任一例之方法,其中該第二治療劑為免疫療法、免疫檢查點及抗血管內皮細胞生長因子療法中之一者或兩者、全身化學療法、酪胺酸激酶抑制劑(例如索拉非尼)、促分裂原活化蛋白激酶激酶抑制劑(MEK抑制劑)(例如曲美替尼),或溴域抑制劑(例如BET抑制劑,例如JQ1,例如必納昔布)。
500. 如實施例495至499中任一例之方法,其中該第二治療劑為酪胺酸激酶抑制劑,例如索拉非尼。
501. 如實施例495至499中任一例之方法,其中該第二治療劑為溴域抑制劑,例如BET抑制劑,例如JQ1、必納昔布或BET 672。
502. 如實施例495至499中任一例之方法,其中該第二治療劑為促分裂原活化蛋白激酶激酶抑制劑(MEK抑制劑),例如曲美替尼。
503. 如實施例495至502中任一例之方法,其中該方法進一步包含向該個體投與另一種療法。
504. 如實施例504之方法,其中該另一種療法包含手術切除正位肝臟移植、射頻消融、光動力學療法(PDT)、雷射療法、近接療法、放射療法、經導管動脈化療栓塞或放射栓塞,或立體定向放射療法。
505. 如實施例495至504中任一例之方法,其中該第二治療劑係選自檢查點抑制劑或小分子。
506. 如實施例495至505中任一例之方法,其中該第二治療劑為化學治療劑,例如激酶抑制劑或溴域抑制劑,例如BET抑制劑。
507. 如實施例505或506之方法,其中該第二治療劑係選自索拉非尼、JQ1、BET672、必納昔布,或曲美替尼。
508. 如實施例495至506中任一例之方法,其中該表現抑制子、系統或核酸與該第二治療劑同時投與。
509. 如實施例495至508中任一例之方法,其中該表現抑制子、系統或核酸與該第二治療劑依序投與。
509. 如實施例503至509中任一例之方法,其中同時投與另一種療法。
511. 如實施例503至510中任一例之方法,其中依序投與另一種療法。
512. 如實施例495至511中任一例之方法,其中第二治療劑與如實施例1至424中任一例之表現抑制子、系統、核酸、載體、脂質奈米粒子、醫藥組合物或反應混合物同時投與。
513. 如實施例495至512中任一例之方法,其中第二治療劑與如實施例1至424中任一例之表現抑制子、系統、核酸、載體、脂質奈米粒子、醫藥組合物或反應混合物連貫投與。
514. 如實施例495至513中任一例之方法,其中靜脈內投與該表現抑制子、系統或核酸,且經口投與第二療法。
515. 如任一前述實施例之方法,其中該癌症為抗藥性或難治性癌症。
516. 如任一前述實施例之方法,其中該癌症對激酶抑制劑(例如抑制VEGFR、PDGFR或RAF激酶中之一或多者的激酶抑制劑,例如索拉非尼)具有抗藥性或難治性。
517. 如任一前述實施例之方法,其中該個體的MYC超級增強子擴增。
518. 一種套組,其包含:包含組合物的容器,該組合物包含如實施例1至424中任一例之表現抑制子、系統、編碼該系統或表現抑制子的一或多種核酸、載體、脂質奈米粒子、反應混合物或醫藥組合物;及一組說明書,包含至少一種使用該組合物調節(例如減少)細胞內之MYC基因表現的方法。
Examples listed CLAIMS 1. An expression suppressor comprising: a targeting moiety bound to a MYC promoter, and optionally an effector moiety, wherein the expression suppressor is capable of reducing MYC expression. 2. The expression suppressor of
定義 一 (A/an) 、該 (the) :如本文所用,單數形式「一」及「該」包括複數個提及物,除非上下文另外明確規定。 Definitions A (A/an) , the (the) : As used herein, the singular forms "a" and "the" include plural referents unless the context clearly dictates otherwise.
藥劑 :如本文所用,術語「藥劑」可用於指任何化學類別的化合物或實體,包括例如多肽、核酸、醣、脂質、小分子、金屬,或其組合或複合物。如熟習此項技術者自上下文將顯而易知,在一些實施例中,該術語可用於指一種實體,該實體為或包含細胞或生物體,或其部分、萃取物或組分。替代地或另外,如熟習此項技術者根據上下文將瞭解,在一些實施例中,該術語可用於指在自然中發現且/或從自然獲得的天然產物。在一些實施例中,再次如熟習此項技術者根據上下文將瞭解,該術語可用於指一或多種人造實體,其經設計、工程改造且/或經由人類手工操作來產生且/或在自然中未發現。在一些實施例中,藥劑可以分離或純形式使用;在一些實施例中,藥劑可以粗物質形式使用。在一些實施例中,潛在藥劑可以集合或文庫形式提供,例如其可經篩選以鑑別或表徵其內之活性劑。在一些實施例中,術語「藥劑」可指一種化合物或實體,其為或包含聚合物;在一些實施例中,該術語可指包含一或多種聚合物部分的化合物或實體。在一些實施例中,術語「藥劑」可指不為聚合物及/或基本上不含任何聚合物及/或基本上不含一或多種特定聚合部分之化合物或實體。在一些實施例中,該術語可指缺乏或基本上不含任何聚合部分之化合物或實體。 Agent : As used herein, the term "agent" may be used to refer to any chemical class of compounds or entities including, for example, polypeptides, nucleic acids, sugars, lipids, small molecules, metals, or combinations or complexes thereof. As will be apparent to those skilled in the art from the context, in some embodiments, the term may be used to refer to an entity that is or comprises a cell or organism, or a portion, extract or component thereof. Alternatively or additionally, in some embodiments, the term may be used to refer to a natural product found in and/or obtained from nature, as those skilled in the art will appreciate from the context. In some embodiments, again as those skilled in the art will understand from the context, the term may be used to refer to one or more man-made entities that are designed, engineered, and/or produced by human manipulation and/or are found in nature Not found. In some embodiments, the agent may be used in isolated or pure form; in some embodiments, the agent may be used in crude form. In some embodiments, potential agents may be provided as a collection or library, eg, that may be screened to identify or characterize the active agents therein. In some embodiments, the term "agent" may refer to a compound or entity that is or comprises a polymer; in some embodiments, the term may refer to a compound or entity comprising one or more polymer moieties. In some embodiments, the term "agent" may refer to a compound or entity that is not a polymer and/or is substantially free of any polymer and/or is substantially free of one or more specific polymeric moieties. In some embodiments, the term may refer to a compound or entity that lacks or is substantially free of any polymeric moieties.
錨定序列 :如本文所用,術語「錨定序列」係指被成核劑識別的核酸序列,該成核劑充分結合以形成錨定序列介導性接合體,例如複合物。在一些實施例中,錨定序列包含一或多個CTCF結合模體。在一些實施例中,錨定序列不位於基因編碼區內。在一些實施例中,錨定序列位於基因間區域內。在一些實施例中,錨定序列不位於增強子或啟動子內。在一些實施例中,錨定序列位於與任何轉錄起點相距至少400 bp、至少450 bp、至少500 bp、至少550 bp、至少600 bp、至少650 bp、至少700 bp、至少750 bp、至少800 bp、至少850 bp、至少900 bp、至少950 bp或至少1 kb處。在一些實施例中,錨定序列位於與基因體印跡、單對偶基因表現及/或單對偶基因表觀遺傳標記不相關的區域內。在一些實施例中,錨定序列具有一或多種選自以下的功能,結合內源成核多肽(例如CTCF)、與第二錨定序列相互作用以形成錨定序列介導性接合體,或隔絕錨定序列介導性接合體外部的增強子。在本發明的一些實施例中,提供可特異性靶向一或多種特定錨定序列、而不靶向其他錨定序列(例如在不同背景中可以含有成核劑(例如CTCF)結合模體的序列)的技術;此類所靶向的錨定序列可稱為「標靶錨定序列」。在一些實施例中,調節標靶錨定序列的序列及/或活性,而不調節可作為標靶錨定序列存在於同一系統中(例如存在於同一細胞中及/或在一些實施例中,存在於同一核酸分子(例如同一染色體)上)之一或多種其他錨定序列的序列及/或活性。在一些實施例中,錨定序列包含或為成核多肽結合模體。在一些實施例中,錨定序列鄰接於成核多肽結合模體。 Anchor sequence : As used herein, the term "anchor sequence" refers to a nucleic acid sequence that is recognized by a nucleating agent that binds sufficiently to form an anchor sequence-mediated adapter, eg, a complex. In some embodiments, the anchor sequence comprises one or more CTCF binding motifs. In some embodiments, the anchor sequence is not located within the coding region of the gene. In some embodiments, the anchor sequence is located within an intergenic region. In some embodiments, the anchor sequence is not located within an enhancer or promoter. In some embodiments, the anchor sequence is located at least 400 bp, at least 450 bp, at least 500 bp, at least 550 bp, at least 600 bp, at least 650 bp, at least 700 bp, at least 750 bp, at least 800 bp from any transcription start , at least 850 bp, at least 900 bp, at least 950 bp, or at least 1 kb. In some embodiments, the anchor sequence is located within a region that is not associated with gene body imprinting, single allele expression, and/or single allele epigenetic marks. In some embodiments, the anchor sequence has one or more functions selected from binding an endogenous nucleating polypeptide (eg, CTCF), interacting with a second anchor sequence to form an anchor sequence-mediated adapter, or Sequesters anchor sequence-mediated enhancers outside of the adapter. In some embodiments of the invention, there are provided binding motifs that can specifically target one or more specific anchor sequences but not other anchor sequences (for example, in different contexts may contain nucleating agents such as CTCF). sequences); such targeted anchor sequences may be referred to as "target anchor sequences". In some embodiments, modulation of the sequence and/or activity of a target anchor sequence without modulation may be present in the same system as the target anchor sequence (e.g., in the same cell and/or in some embodiments, The sequence and/or activity of one or more other anchor sequences present on the same nucleic acid molecule (eg, the same chromosome). In some embodiments, the anchor sequence comprises or is a nucleating polypeptide binding motif. In some embodiments, the anchor sequence is adjacent to the nucleating polypeptide binding motif.
錨定序列介導性接合體 : 如本文所用,術語「錨定序列介導性接合體」係指DNA結構,在一些情況下係指複合物,其經由一或多種多肽(諸如成核多肽)或一或多種蛋白質及/或核酸實體(諸如RNA或DNA)對DNA中之至少兩種錨定序列的物理相互作用或結合來存在及/或維持,該一或多種多肽或一或多種蛋白質及/或核酸實體結合錨定序列以能夠使錨定序列之間達成空間近接及功能關聯(參見例如圖1)。 Anchor sequence-mediated adapters : As used herein, the term "anchor sequence-mediated adapters" refers to DNA structures, and in some cases complexes, which are linked via one or more polypeptides (such as nucleating polypeptides) or one or more proteins and/or nucleic acid entities (such as RNA or DNA) to the physical interaction or combination of at least two anchor sequences in DNA to exist and/or maintain, the one or more polypeptides or one or more proteins and and/or nucleic acid entities bind anchor sequences to enable spatial proximity and functional association between the anchor sequences (see eg Figure 1).
相關 :如該術語在本文中所用,若兩個事件或實體中之一者的存在、水準、形式及/或功能與另一者相關,則該兩個事件或實體彼此「相關」。舉例而言,在一些實施例中,若特定實體(例如多肽、基因標誌、代謝物、微生物等)的存在、水準、形式及/或功能與特定疾病、病症或病狀的發生率及/或對疾病、病症或病狀的易感性(例如在整個相關群體中)相關,則認為該特定實體與該疾病、病症或病狀相關。在一些實施例中,兩個或更多個實體若直接或間接相互作用,使得其彼此處於及/或保持實體近接,則彼此在實體上「締合」。在一些實施例中,彼此在實體上締合的兩個或更多個實體彼此共價連接;在一些實施例中,彼此在實體上締合的兩個或更多個實體彼此不共價連接,而以非共價形式締合,例如藉助於氫鍵、凡得瓦爾力相互作用(van der Waals interaction)、疏水相互作用、磁力及其組合。在一些實施例中,當核酸至少部分地處於標靶基因體或轉錄複合物內且標靶基因體或轉錄複合物的形成或破壞對DNA序列中之基因的表現有影響時,DNA序列與標靶基因體或轉錄複合物「相關」。 Related : As the term is used herein, two events or entities are "related" to each other if the existence, level, form and/or function of one is related to the other. For example, in some embodiments, if the presence, level, form and/or function of a particular entity (e.g., polypeptide, genetic marker, metabolite, microorganism, etc.) is associated with the incidence and/or A susceptibility to a disease, disorder or condition is associated (eg, across a related population) then the particular entity is considered to be associated with the disease, disorder or condition. In some embodiments, two or more entities are physically "associated" with each other if they interact, directly or indirectly, such that they are in and/or remain in physical proximity to each other. In some embodiments, two or more entities that are physically associated with each other are covalently linked to each other; in some embodiments, two or more entities that are physically associated with each other are not covalently linked to each other , while being associated in a non-covalent manner, such as by means of hydrogen bonding, van der Waals interactions, hydrophobic interactions, magnetic forces, and combinations thereof. In some embodiments, when the nucleic acid is at least partially within a target gene body or transcription complex and the formation or disruption of the target gene body or transcription complex has an effect on the expression of the gene in the DNA sequence, the DNA sequence and the target The target gene body or transcription complex is "associated".
域 :如本文所用,術語「域」係指實體的一段或一部分。在一些實施例中,「域」與實體的特定結構特徵及/或功能特徵相關,使得當域在實體上與其親代實體的其餘部分分離時,其基本上或完全地保持特定結構特徵及/或功能特徵。或者或另外,在一些實施例中,域可為實體的一部分或包括實體的一部分,其當與(親代)實體分離且與不同(接受者)實體連接時,基本上保持及/或賦予接受者實體一或多種結構特徵及/或功能特徵,該等特徵為其在親代實體中的特徵。在一些實施例中,域為或包含分子(例如小分子、碳水化合物、脂質、核酸、多肽等)的一段或一部分。在一些實施例中,域為多肽的一段或包含多肽的一段。在一些此類實施例中,域的特徵為特定的結構元件(例如特定胺基酸序列或序列模體、α-螺旋特徵、β-薄片特徵、捲曲螺旋特徵、隨機螺旋特徵等),及/或特定功能特徵(例如結合活性、酶活性、摺疊活性、信號傳導活性等)。 Domain : As used herein, the term "domain" means a segment or portion of an entity. In some embodiments, a "domain" relates to specific structural and/or functional characteristics of an entity such that when a domain is physically separated from the rest of its parent entity, it substantially or completely retains the specific structural and/or functional characteristics. or functional characteristics. Alternatively or additionally, in some embodiments, a domain may be or comprise a portion of an entity which, when separated from the (parent) entity and connected to a different (recipient) entity, substantially retains and/or confers acceptance One or more structural and/or functional characteristics of the parent entity that characterize it in the parent entity. In some embodiments, a domain is or comprises a segment or portion of a molecule (eg, small molecule, carbohydrate, lipid, nucleic acid, polypeptide, etc.). In some embodiments, a domain is or comprises a stretch of a polypeptide. In some such embodiments, the domains are characterized by specific structural elements (e.g., specific amino acid sequences or sequence motifs, alpha-helical features, beta-sheet features, coiled-coil features, random helical features, etc.), and/or Or specific functional characteristics (eg, binding activity, enzymatic activity, folding activity, signaling activity, etc.).
效應部分 :如本文所用,術語「效應部分」係指當定位於細胞核中的適當位點時,能夠改變靶基因表現的域。在一些實施例中,效應部分募集轉錄機器的組分。在一些實施例中,效應部分抑制轉錄因子或表現抑制因子的組分募集。在一些實施例中,效應部分包含表觀遺傳修飾部分(例如以表觀遺傳方式修飾標靶DNA序列)。 Effector portion : As used herein, the term “effector portion” refers to a domain that, when localized at the appropriate site in the nucleus, is capable of altering the expression of a target gene. In some embodiments, the effector moiety recruits components of the transcription machinery. In some embodiments, the effector moiety inhibits the recruitment of components of transcription factors or expression repressors. In some embodiments, the effector moiety comprises an epigenetic modification moiety (eg, epigenetically modifies a target DNA sequence).
表觀遺傳修飾部分 :如本文所用,「表觀遺傳修飾部分」係指當表觀遺傳修飾部分適當地定位於核酸(例如藉由靶向部分)時,改變以下的域:i)染色質的結構,例如二維結構;及/或ii)表觀遺傳標記物(例如DNA甲基化、組蛋白甲基化、組蛋白乙醯化、組蛋白類小泛素化、組蛋白磷酸化及RNA相關靜默中的一或多者)。在一些實施例中,表觀遺傳修飾部分包含影響一或多種表觀遺傳標記物(例如提高或降低其水準)的酶或其功能片段或變異體。在一些實施例中,表觀遺傳修飾部分包含DNA甲基轉移酶、組蛋白甲基轉移酶、CREB結合蛋白(CBP),或其任一者之功能片段。 Epigenetic modification moiety : As used herein, "epigenetic modification moiety" refers to a domain that, when an epigenetic modification moiety is properly positioned on a nucleic acid (eg, by a targeting moiety), alters: i) the domain of chromatin structure, such as two-dimensional structure; and/or ii) epigenetic marks (such as DNA methylation, histone methylation, histone acetylation, histone microubiquitination, histone phosphorylation, and RNA one or more of the associated silences). In some embodiments, the epigenetic modification moiety comprises an enzyme or functional fragment or variant thereof that affects (eg, increases or decreases the level of) one or more epigenetic markers. In some embodiments, the epigenetic modification moiety comprises a DNA methyltransferase, a histone methyltransferase, a CREB binding protein (CBP), or a functional fragment of any thereof.
表現控制序列 :如本文所用,術語「表現控制序列」係指增加或減少基因轉錄的核酸序列且包括(但不限於)啟動子及增強子。「增強序列」係指表現控制序列的一種亞型且增加基因轉錄的可能性。「靜默或抑制子序列」係指表現控制序列的一種亞型且降低基因轉錄的可能性。 Expression control sequence : As used herein, the term "expression control sequence" refers to a nucleic acid sequence that increases or decreases transcription of a gene and includes, but is not limited to, promoters and enhancers. "Enhancer sequence" refers to a subtype of expression control sequence and increases the likelihood of gene transcription. A "silencing or suppressor sequence" refers to a subtype of a sequence that expresses a control sequence and reduces the likelihood of gene transcription.
表現抑制子: 如本文所用,術語「表現抑制子」係指具有一或多種功能的藥劑或實體,其減少細胞中的靶基因表現且特異性結合至DNA序列(例如與靶基因或可操作地連接至靶基因之轉錄控制元件相關的DNA序列)。表現抑制子包含至少一個靶向部分及視情況存在的一個效應部分。 Expression suppressor: As used herein, the term "expression suppressor" refers to an agent or entity having one or more functions that reduces the expression of a target gene in a cell and that specifically binds to a DNA sequence (e.g., with a target gene or operably DNA sequences associated with transcriptional control elements linked to target genes). Expression suppressors comprise at least one targeting moiety and optionally an effector moiety.
表現抑制系統 : 如本文所用,術語「表現抑制系統」係指減少細胞中之靶基因表現的複數種表現抑制子。在一些實施例中,表現抑制系統包含第一表現抑制子及第二表現抑制子,其中第一表現抑制子與第二表現抑制子(或編碼第一表現抑制子及第二表現抑制子的核酸)一起存在於單一組合物、混合物或醫藥組合物中。在一些實施例中,表現抑制系統包含第一表現抑制子及第二表現抑制子,其中第一表現抑制子及第二表現抑制子(或編碼第一表現抑制子及第二表現抑制子的核酸)存在於各別的組合物或醫藥組合物中。在一些實施例中,第一表現抑制子與第二表現抑制子同時存在於相同細胞中。在一些實施例中,第一表現抑制子與第二表現抑制子並非同時存在於相同細胞中,例如其依序存在。舉例而言,第一表現抑制子可在細胞中存在第一時段,且接著,第二表現抑制子可在細胞中存在第二時段,其中第一時段與第二時段可重疊或不重疊。 Expression suppression system : As used herein, the term "expression suppression system" refers to a plurality of expression suppressors that reduce the expression of a target gene in a cell. In some embodiments, the expression suppression system comprises a first expression inhibitor and a second expression inhibitor, wherein the first expression inhibitor and the second expression inhibitor (or nucleic acids encoding the first expression inhibitor and the second expression inhibitor ) are present together in a single composition, mixture or pharmaceutical composition. In some embodiments, the expression suppression system comprises a first expression inhibitor and a second expression inhibitor, wherein the first expression inhibitor and the second expression inhibitor (or nucleic acids encoding the first expression inhibitor and the second expression inhibitor ) is present in the respective composition or pharmaceutical composition. In some embodiments, the first suppressor of expression is present in the same cell as the second suppressor of expression. In some embodiments, the first suppressor of expression and the second suppressor of expression are not present in the same cell at the same time, eg, they are present sequentially. For example, a first expressed suppressor can be present in the cell for a first period of time, and then a second expressed suppressor can be present in the cell for a second period of time, wherein the first and second periods of time may or may not overlap.
融合分子 :如本文所用,術語「融合分子」係指一種化合物,其包含共價連接的兩個或更多個部分,例如靶向部分及效應部分。融合分子及其部分可包含多肽、核酸、聚糖、小分子或本文所述之其他組分的任何組合(例如靶向部分可包含核酸且效應部分可包含多肽)。在一些實施例中,融合分子為融合蛋白,例如包含一或多個經由肽鍵共價連接的多肽域。在一些實施例中,融合分子為偶聯物分子,其包含藉由除肽鍵或磷酸二酯鍵之外之共價鍵連接的靶向部分與效應部分(例如包含核酸的靶向部分與包含多肽的效應部分藉由除肽鍵或磷酸二酯鍵之外的共價鍵連接)。在一些實施例中,表現抑制子為融合分子或包含融合分子。 Fusion molecule : As used herein, the term "fusion molecule" refers to a compound comprising two or more moieties covalently linked, eg, a targeting moiety and an effector moiety. Fusion molecules and portions thereof can comprise polypeptides, nucleic acids, glycans, small molecules, or any combination of other components described herein (eg, targeting moieties can comprise nucleic acids and effector moieties can comprise polypeptides). In some embodiments, fusion molecules are fusion proteins, eg, comprising one or more polypeptide domains covalently linked via peptide bonds. In some embodiments, the fusion molecule is a conjugate molecule comprising a targeting moiety and an effector moiety linked by a covalent bond other than a peptide bond or a phosphodiester bond (e.g., a targeting moiety comprising a nucleic acid and a targeting moiety comprising The effector moieties of the polypeptides are linked by covalent bonds other than peptide bonds or phosphodiester bonds). In some embodiments, the expression inhibitor is or comprises a fusion molecule.
基因體複合物 :如本文所用,術語「基因體複合物」為一種複合物,其經由複數種蛋白質及/或其他組分(潛在地包括基因體序列元件)之間的相互作用使一或多個染色體上彼此間隔開的兩個基因體序列元件結合在一起。在一些實施例中,基因體序列元件為複合物之一或多種蛋白質組分所結合的錨定序列。在一些實施例中,基因體複合物可包含錨定序列介導性接合體。在一些實施例中,基因體序列元件可為或包含CTCF結合模體、啟動子及/或增強子。在一些實施例中,基因組序列元件包括啟動子及/或調控位點(例如強化子)中之至少一者或兩者。在一些實施例中,複合物形成在基因體序列元件處成核且/或藉由一或多種蛋白質組分結合至基因體序列元件而成核。如熟習此項技術者所瞭解,在一些實施例中,基因體位點經由複合物形成而發生的共定位(例如接合)使基因體序列元件處或附近的DNA拓樸結構(在一些實施例中,包括其間的DNA拓樸結構)發生改變。在一些實施例中,基因體複合物包含含有一或多個環的錨定序列介導性接合體。在一些實施例中,如本文所述的基因體複合物藉由成核多肽(諸如CTCF及/或黏合素)發生成核。在一些實施例中,如本文所述的基因體複合物可包括例如以下中的一或多者:CTCF、黏合素、非編碼RNA (例如eRNA)、轉錄機器蛋白質(例如RNA聚合酶、一或多種轉錄因子,例如選自由以下組成之群:TFIIA、TFIIB、TFIID、TFIIE、TFIIF、TFIIH等)、轉錄調控因子(例如介體、P300、增強子結合蛋白、抑制子結合蛋白、組蛋白修飾因子等)等。在一些實施例中,如本文所述的基因體複合物包括一或多種多肽組分及/或一或多種核酸組分(例如一或多種RNA組分),在一些實施例中,其彼此可相互作用及/或與一或多個基因體序列元件(例如錨定序列、啟動子序列、調控序列(例如增強子序列))相互作用以便將一段基因體DNA圍束成拓樸組態(例如環),當不形成複合物時,其不會呈現拓樸組態。 Genome complex : As used herein, the term "genome complex" is a complex that makes one or more The combination of two gene body sequence elements spaced apart from each other on a chromosome. In some embodiments, the gene body sequence element is an anchor sequence to which one or more protein components of the complex bind. In some embodiments, a gene body complex may comprise an anchor sequence-mediated adapter. In some embodiments, a gene body sequence element can be or comprise a CTCF binding motif, a promoter and/or an enhancer. In some embodiments, genomic sequence elements include at least one or both of a promoter and/or regulatory sites (eg, enhancers). In some embodiments, complex formation nucleates at the genome sequence element and/or is nucleated by binding of one or more protein components to the genome sequence element. As will be appreciated by those skilled in the art, in some embodiments, colocalization (e.g., conjugation) of gene body loci via complex formation causes DNA topology at or near sequence elements of the gene body (in some embodiments, , including the DNA topology in between) changes. In some embodiments, the gene body complex comprises an anchor sequence-mediated adapter comprising one or more loops. In some embodiments, a gene body complex as described herein is nucleated by a nucleating polypeptide such as CTCF and/or cohesin. In some embodiments, a gene body complex as described herein may include, for example, one or more of: CTCF, cohesin, non-coding RNA (e.g., eRNA), transcription machinery proteins (e.g., RNA polymerase, one or Various transcription factors, e.g. selected from the group consisting of: TFIIA, TFIIB, TFIID, TFIIE, TFIIF, TFIIH, etc.), transcriptional regulatory factors (e.g., mediators, P300s, enhancer binding proteins, repressor binding proteins, histone modifiers etc. In some embodiments, a gene body complex as described herein includes one or more polypeptide components and/or one or more nucleic acid components (e.g., one or more RNA components), which, in some embodiments, can interact with each other Interact and/or interact with one or more genome sequence elements (e.g., anchor sequences, promoter sequences, regulatory sequences (e.g., enhancer sequences)) in order to constrain a section of genome DNA into a topological configuration (e.g., ring), which does not assume a topological configuration when not forming a complex.
部分 :如本文所用,術語「部分」係指定義的化學基團或實體,其具有特定結構及/或活性,如本文所述。 Moiety : As used herein, the term "moiety" refers to a defined chemical group or entity that has a specific structure and/or activity as described herein.
調節劑 :如本文所用,術語「調節劑」係指包含一或多個靶向部分及一或多個效應部分的藥劑,其能夠改變(例如增強或減少)靶基因(例如MYC)表現。 Modulator : As used herein, the term "modulator" refers to an agent comprising one or more targeting moieties and one or more effector moieties, which is capable of altering (eg enhancing or decreasing) the expression of a target gene (eg MYC).
MYC :如本文所用,術語「MYC基因座」係指編碼MYC多肽(例如NCBI寄存編號NP002458.2中所揭示的多肽,或其突變體)的人類基因體之一部分、可操作地連接至MYC的啟動子(「MYC啟動子」),及形成包含MYC基因之ASMC的錨定序列。在一些實施例中,MYC基因座編碼具有NCBI寄存編號NM-002467的核酸。在一些實施例中,MYC基因為原癌基因,且在一些實施例中,MYC基因為致癌基因。在某些情況下,MYC基因發現於染色體8上的8q24.21處。在某些情況下,MYC基因始於距短臂末端128,816,862 bp處且終止於距短臂末端128,822,856 bp處。在某些情況下,MYC基因為約6 kb。在某些情況下,MYC基因編碼至少八種各別mRNA序列:5種交替剪接變異體及3種非剪接變異體。
MYC : As used herein, the term "MYC locus" refers to a portion of the human genome that encodes a MYC polypeptide (such as the polypeptide disclosed in NCBI Accession No. NP002458.2, or a mutant thereof), operably linked to the MYC promoter ("MYC promoter"), and an anchor sequence that forms ASMCs comprising the MYC gene. In some embodiments, the MYC locus encodes a nucleic acid having NCBI Accession No. NM-002467. In some embodiments, the MYC gene is a proto-oncogene, and in some embodiments, the MYC gene is an oncogene. In certain instances, the MYC gene is found on
核酸 :如本文所用,術語「核酸(nucleic acid)」在其最廣泛意義上係指被併入寡核苷酸鏈或可併入寡核苷酸鏈中的任何化合物及/或物質。在一些實施例中,核酸為經由磷酸二酯鍵聯併入或可併入寡核苷酸鏈中之化合物及/或物質。如自上下文將瞭解,在一些實施例中,「 核酸」係指個別核酸殘基(例如核苷酸及/或核苷);在一些實施例中,「 核酸」係指包含個別核酸殘基的寡核苷酸鏈。在一些實施例中,「 核酸」為或包含RNA;在一些實施例中,「 核酸」為或包含DNA。在一些實施例中,核酸為、包含一或多個天然核酸殘基或由一或多個天然核酸殘基組成。在一些實施例中,核酸為、包含一或多種核酸類似物或由一或多種核酸類似物組成。在一些實施例中,核酸類似物與核酸的不同之處在於其不使用磷酸二酯主鏈。舉例而言,在一些實施例中,核酸為、包含或由一或多個「 肽核酸」組成,其在此項技術中已知且主鏈中以肽鍵代替磷酸二酯鍵,其皆被視為在本發明之範疇內。或者或另外,在一些實施例中,核酸具有一或多個硫代磷酸酯鍵聯及/或5'-N-胺基亞磷酸酯鍵聯而非磷酸二酯鍵。在一些實施例中,核酸為、包含或由一或多種天然核苷(例如腺苷、胸苷、鳥苷、胞苷、尿苷、去氧腺苷、去氧胸苷、去氧鳥苷及去氧胞苷)組成。在一些實施例中,核酸為、包含或由一或多種核苷類似物(例如2-胺基腺苷、2-硫胸苷、肌苷、吡咯并嘧啶、3-甲基腺苷、5-甲基胞苷、C-5丙炔基-胞苷、C-5丙炔基-尿苷、2-胺基腺苷、C5-溴尿苷、C5-氟尿苷、C5-碘尿苷、C5-丙炔基-尿苷、C5-丙炔基-胞苷、C5-甲基胞苷、2-胺基腺苷、7-去氮腺苷、7-去氮鳥苷、8-側氧基腺苷、8-側氧基鳥苷、0(6)-甲基鳥嘌呤、2-硫胞苷、甲基化鹼基、插入鹼基及其組合)組成。在一些實施例中,相較於天然核酸中的糖,核酸包含一或多種經修飾之糖(例如2'-氟核糖、核糖、2'-去氧核糖、阿拉伯糖及己糖)。在一些實施例中,核酸具有編碼功能性基因產物(諸如RNA或蛋白質)的核苷酸序列。在一些實施例中,核酸包括一或多個內含子。在一些實施例中,核酸藉由以下中之一或多者製備:自天然來源分離、基於互補模板藉由聚合進行酶合成(活體內或活體外)、在重組細胞或系統中複製,及化學合成。在一些實施例中,核酸具有至少3、4、5、6、7、8、9、10、15、20、25、30、35、40、45、50、55、60、65、70、75、80、85、90、95、100、110、120、130、140、150、160、170、180、190、20、225、250、275、300、325、350、375、400、425、450、475、500、600、700、800、900、1000、1500、2000、2500、3000、3500、4000、4500、5000或更多個殘基長度。在一些實施例中,核酸部分地或全部地為單股;在一些實施例中,核酸部分地或全部地為雙股。在一些實施例中,核酸具有包含至少一個元件的核苷酸序列,該至少一個元件編碼多肽或為編碼多肽之序列的補體。在一些實施例中,核酸具有酶活性。 Nucleic acid : As used herein, the term "nucleic acid" in its broadest sense refers to any compound and/or substance that is or can be incorporated into an oligonucleotide chain. In some embodiments, nucleic acids are compounds and/or substances that are incorporated or can be incorporated into oligonucleotide chains via phosphodiester linkages. As will be understood from the context, in some embodiments, " nucleic acid " refers to individual nucleic acid residues (e.g., nucleotides and /or nucleosides); oligonucleotide chain. In some embodiments, a " nucleic acid " is or comprises RNA; in some embodiments, a " nucleic acid " is or comprises DNA. In some embodiments, the nucleic acid is, comprises, or consists of one or more natural nucleic acid residues. In some embodiments, the nucleic acid is, comprises, or consists of one or more nucleic acid analogs. In some embodiments, nucleic acid analogs differ from nucleic acids in that they do not utilize a phosphodiester backbone. For example, in some embodiments, the nucleic acid is, comprises, or consists of one or more " peptide nucleic acids " known in the art and having peptide bonds in the backbone instead of phosphodiester bonds, both of which are referred to as deemed to be within the scope of the present invention. Alternatively or additionally, in some embodiments, the nucleic acid has one or more phosphorothioate linkages and/or 5'-N-aminophosphite linkages instead of phosphodiester linkages. In some embodiments, the nucleic acid is, comprises, or consists of one or more natural nucleosides (e.g., adenosine, thymidine, guanosine, cytidine, uridine, deoxyadenosine, deoxythymidine, deoxyguanosine, and deoxycytidine) composition. In some embodiments, the nucleic acid is, comprises, or consists of one or more nucleoside analogs (e.g., 2-aminoadenosine, 2-thiothymidine, inosine, pyrrolopyrimidine, 3-methyladenosine, 5- Methylcytidine, C-5 propynyl-cytidine, C-5 propynyl-uridine, 2-aminoadenosine, C5-bromouridine, C5-fluorouridine, C5-iodouridine, C5-propynyl-uridine, C5-propynyl-cytidine, C5-methylcytidine, 2-aminoadenosine, 7-deazaadenosine, 7-deazaguanosine, 8-oxo base adenosine, 8-side oxyguanosine, 0(6)-methylguanine, 2-thiacytidine, methylated base, inserted base and combinations thereof). In some embodiments, the nucleic acid comprises one or more modified sugars (eg, 2'-fluororibose, ribose, 2'-deoxyribose, arabinose, and hexose) compared to the sugars in natural nucleic acids. In some embodiments, a nucleic acid has a nucleotide sequence that encodes a functional gene product such as RNA or protein. In some embodiments, a nucleic acid includes one or more introns. In some embodiments, nucleic acids are prepared by one or more of isolation from natural sources, enzymatic synthesis (in vivo or in vitro) by polymerization based on complementary templates, replication in recombinant cells or systems, and chemical synthesis. synthesis. In some embodiments, the nucleic acid has at least 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75 ,80,85,90,95,100,110,120,130,140,150,160,170,180,190,20,225,250,275,300,325,350,375,400,425,450 , 475, 500, 600, 700, 800, 900, 1000, 1500, 2000, 2500, 3000, 3500, 4000, 4500, 5000 or more residues in length. In some embodiments, the nucleic acid is partially or fully single-stranded; in some embodiments, the nucleic acid is partially or fully double-stranded. In some embodiments, a nucleic acid has a nucleotide sequence comprising at least one element that encodes a polypeptide or is the complement of a sequence encoding a polypeptide. In some embodiments, the nucleic acid has enzymatic activity.
成核多肽 :如本文所用,術語「成核多肽」,或如本文所用,「接合體成核多肽」係指直接地或間接地與錨定序列締合且可與一或多種接合體成核多肽相互作用(可與錨定序列或其他核酸相互作用)以形成二聚體(或更高有序結構)的蛋白質,該二聚體包含彼此可相同或可不相同的此類接合體成核多肽。當與不同錨定序列相關的接合體成核多肽彼此間締合使得不同錨定序列在實體上維持彼此間近接時,藉此產生的結構為錨定序列介導之接合體。亦即,一種成核多肽-錨定序列與另一成核多肽-錨定序列在實體上緊密近接而相互作用,產生始於且終止於錨定序列的錨定序列介導性接合體(例如在一些情況下,為DNA環)。熟習此項技術者閱讀本說明書將立即瞭解,諸如「成核多肽」、「成核分子」、「成核蛋白質」、「接合體成核蛋白質」等術語有時可用於指接合體成核多肽。熟習此項技術者閱讀本說明書將類似地立即瞭解,兩種或更多種接合體成核多肽的組合體集合(在一些實施例中,其可包括相同藥劑的多個複本及/或在一些實施例中,複數種不同藥劑中之一或多者或每一者)可稱為「複合物」、「二聚體」、「多聚體」等。 Nucleating Polypeptide : As used herein, the term "nucleating polypeptide" or, as used herein, "adaptor nucleating polypeptide" refers to a polypeptide that associates directly or indirectly with an anchor sequence and can nucleate with one or more adapters. Proteins that interact with polypeptides (may interact with anchor sequences or other nucleic acids) to form dimers (or higher ordered structures) comprising such adapter nucleating polypeptides that may or may not be identical to each other . When adapter nucleating polypeptides associated with different anchor sequences associate with each other such that the different anchor sequences are maintained in physical proximity to each other, the resulting structure is an anchor sequence-mediated adapter. That is, one nucleating polypeptide-anchor sequence interacts with another nucleating polypeptide-anchor sequence in close physical proximity, resulting in an anchor sequence-mediated adapter that begins and ends at the anchor sequence (e.g. In some cases, DNA circles). Those skilled in the art will immediately understand upon reading this specification that terms such as "nucleating polypeptide", "nucleating molecule", "nucleating protein", "conjugate nucleating protein" and the like are sometimes used to refer to conjugate nucleating polypeptides . Those skilled in the art will similarly immediately understand upon reading this specification that a composite collection of two or more adapter-nucleating polypeptides (which in some embodiments may include multiple copies of the same agent and/or in some In embodiments, one or more or each of the plurality of different agents may be referred to as a "complex", "dimer", "multimer" and the like.
可操作地連接 :如本文所用,片語「可操作地連接」係指一種並接,其中所述組分呈允許其以其預定方式發揮作用的關係。「 可操作地連接」至功能元件(例如基因)的轉錄控制元件係以在與轉錄控制元件相容之條件下達成功能元件(例如基因)表現及/或活性的方式關聯。在一些實施例中,「 可操作地連接」的轉錄控制元件與所關注之編碼元件(例如基因)鄰接(例如共價連接);在一些實施例中,可操作地連接的轉錄控制元件相對於所關注之功能元件(例如基因)以反式發揮作用或在距其一定距離處以其他方式發揮作用。在一些實施例中,可操作地連接意謂同一核酸分子上包含兩種核酸序列。在另一實施例中,可操作地連接可進一步意謂兩種核酸序列在同一核酸分子上彼此近接,例如彼此在1000、500、100、50或10個鹼基對內或彼此直接鄰接。 Operably linked : As used herein, the phrase "operably linked" refers to a juxtaposition wherein the components described are in a relationship permitting them to function in their intended manner. A transcriptional control element " operably linked " to a functional element (eg, a gene) is associated in such a way that expression and/or activity of the functional element (eg, gene) is achieved under conditions compatible with the transcriptional control element. In some embodiments, an " operably linked " transcriptional control element is adjacent (eg, covalently linked) to a coding element of interest (eg, a gene); in some embodiments, an operably linked transcriptional control element is relative to A functional element of interest (eg, a gene) functions in trans or otherwise at a distance from it. In some embodiments, operably linked means comprising two nucleic acid sequences on the same nucleic acid molecule. In another embodiment, operably linked may further mean that two nucleic acid sequences are in proximity to each other on the same nucleic acid molecule, for example within 1000, 500, 100, 50 or 10 base pairs of each other or directly adjacent to each other.
肽、多肽、蛋白質 :如本文所用,術語「肽」、「多肽」及「蛋白質」係指包含藉由肽鍵或藉由除肽鍵之外的方式共價連接之胺基酸殘基的化合物。蛋白質或肽必須含有至少兩個胺基酸,且對可以構成蛋白質或肽序列之胺基酸的最大數目無限制。多肽包括包含兩個或更多個胺基酸的任何肽或蛋白質,該等胺基酸彼此藉由肽鍵連接或藉由除肽鍵之外的方式連接。如本文所用,該術語係指短鏈(其在此項技術中通常亦稱為例如肽、寡肽及寡聚物)與長鏈(其在此項技術中一般稱為蛋白質,其存在多種類型)。 Peptide, polypeptide, protein : As used herein, the terms "peptide", "polypeptide" and "protein" refer to a compound comprising amino acid residues covalently linked by peptide bonds or by means other than peptide bonds . A protein or peptide must contain at least two amino acids, and there is no limit to the maximum number of amino acids that can make up a protein or peptide sequence. Polypeptides include any peptide or protein comprising two or more amino acids linked to each other by peptide bonds or by means other than peptide bonds. As used herein, the term refers to short chains (which are also commonly referred to in the art as, for example, peptides, oligopeptides, and oligomers) and long chains (which are commonly referred to in the art as proteins, which exist in various types ).
醫藥組合物 :如本文所用,術語「醫藥組合物」係指與一或多種醫藥學上可接受之載劑一起調配的活性劑(例如調節劑,例如破壞劑)。在一些實施例中,活性劑以適合投與之單位劑量的量存在於治療方案中,其在投與相關群體時顯示統計學上顯著之達成預定治療作用的機率。在一些實施例中,醫藥組合物可專門調配成固體或液體形式投與,包括針對以下調適之彼等形式:經口投藥,例如頓服藥(水性或非水性溶液或懸浮液)、錠劑(例如針對頰內、舌下及全身性吸收之錠劑)、大丸劑、散劑、顆粒、施用於舌部之糊劑;非經腸投藥,例如皮下、肌肉內、靜脈內或硬膜外注射,例如作為無菌溶液或懸浮液,或持續釋放型調配物;局部施用,例如以乳膏、軟膏或控制釋放型貼片或噴霧形式施用於皮膚、肺或口腔;陰道內或直腸內投藥,例如子宮托、乳膏或發泡體形式;舌下;眼部;經皮;或鼻、肺及/或其他黏膜表面。 Pharmaceutical composition : As used herein, the term "pharmaceutical composition" refers to an active agent (eg, modulator, eg, disruptor) formulated together with one or more pharmaceutically acceptable carriers. In some embodiments, the active agent is present in the therapeutic regimen in an amount suitable for administration in a unit dosage that exhibits a statistically significant probability of achieving a predetermined therapeutic effect when administered to a relevant population. In some embodiments, pharmaceutical compositions may be specially formulated for administration in solid or liquid form, including those forms adapted for oral administration, such as drenches (aqueous or non-aqueous solutions or suspensions), lozenges ( e.g. lozenges for buccal, sublingual and systemic absorption), boluses, powders, granules, pastes for tongue application; parenteral administration, e.g. subcutaneous, intramuscular, intravenous or epidural injection, For example, as a sterile solution or suspension, or a sustained-release formulation; topical application, for example, to the skin, lungs or mouth in the form of a cream, ointment, or controlled-release patch or spray; intravaginal or rectal administration, for example, to the uterus In the form of pads, creams, or foams; sublingually; ocularly; transdermally; or on nasal, pulmonary, and/or other mucosal surfaces.
近接 :如本文所用,「近接」係指兩個位點(例如核酸位點)接近,使得表現抑制子在第一位點的結合及/或表現抑制子對第一位點的修飾就對另一位點的結合及/或修飾而言將產生相同或基本上相同的作用。舉例而言,靶向部分可結合至近接於增強子(第二位點)的第一位點,且與該靶向部分締合的效應部分可以表觀遺傳方式修飾第一位點,以便調節增強子對靶基因表現的影響,從而與已結合及/或修飾的第二位點(增強子序列)基本上相同。在一些實施例中,近接於靶基因(例如外顯子、內含子,或靶基因內的剪接位點)、近接於可操作地連接至靶基因(例如MYC)之轉錄控制元件或近接於錨定序列的位點為距離靶基因(例如外顯子、內含子,或靶基因內的剪接位點)、轉錄控制元件或錨定序列小於5000、4000、3000、2000、1000、900、800、700、600、500、400、300、200、100、50或25個鹼基對(且視情況,距離靶基因(例如外顯子、內含子,或靶基因內的剪接位點)、轉錄控制元件或錨定序列至少20、25、50、100、200或300個鹼基對)。 Proximity : As used herein, "proximity" refers to the proximity of two sites (e.g., nucleic acid sites) such that binding of an expression inhibitor at the first site and/or modification of the first site by the expression inhibitor has no effect on the other. Combining and/or modifying at one site will produce the same or substantially the same effect. For example, a targeting moiety can bind to a first site proximate to an enhancer (second site), and an effector moiety associated with the targeting moiety can epigenetically modify the first site so as to regulate The effect of the enhancer on the expression of the target gene is thus substantially the same as the second site (enhancer sequence) that has been incorporated and/or modified. In some embodiments, the target gene is located in close proximity to a target gene (such as an exon, an intron, or a splice site within a target gene), close to a transcriptional control element operably linked to a target gene (such as MYC), or close to a The location of the anchor sequence is less than 5000, 4000, 3000, 2000, 1000, 900, 800, 700, 600, 500, 400, 300, 200, 100, 50, or 25 base pairs (and, optionally, distance from the target gene (e.g., exons, introns, or splice sites within the target gene) , transcriptional control elements or anchor sequences of at least 20, 25, 50, 100, 200 or 300 base pairs).
特異性: 熟習此項技術者應瞭解,如本文所用,術語「特異性」當提及一種具有活性的藥劑時,意謂該藥劑在潛在標靶實體或狀態之間有區別。舉例而言,在一些實施例中,若藥劑在一或多種競爭替代標靶存在下優先與其標靶結合,則將其稱為「特異性」結合至該標靶。在一些實施例中,特異性相互作用取決於標靶實體之特定結構特徵(例如抗原決定基、裂隙、結合位點)的存在。應瞭解,特異性不必為絕對的。在一些實施例中,特異性可作為結合劑對一或多種其他潛在標靶實體(例如競爭者)之特異性來評價。在一些實施例中,特異性係作為參考特異性結合劑之特異性來評價。在一些實施例中,特異性係作為參考非特異性結合劑之特異性來評價。在一些實施例中,藥劑或實體在結合至其標靶實體之條件下不可偵測地結合至競爭替代標靶。在一些實施例中,相較於競爭替代標靶,結合劑以更高締合速率、更低解離速率、增加之親和力、減少之解離及/或增加之穩定性結合至其標靶實體。 Specificity: Those skilled in the art will understand that, as used herein, the term "specificity" when referring to an active agent means that the agent differentiates between potential target entities or states. For example, in some embodiments, an agent is said to "specifically" bind to a target if it binds preferentially to its target in the presence of one or more competing alternative targets. In some embodiments, specific interactions depend on the presence of specific structural features (eg, epitopes, clefts, binding sites) of the target entity. It will be appreciated that specificity need not be absolute. In some embodiments, specificity can be assessed as the specificity of a binding agent for one or more other potential target entities (eg, competitors). In some embodiments, specificity is assessed as a reference to the specificity of a specific binding agent. In some embodiments, specificity is assessed as a reference to the specificity of a non-specific binding agent. In some embodiments, the agent or entity does not detectably bind to a competing surrogate target under conditions that bind to its target entity. In some embodiments, a binding agent binds to its target entity with a higher on-rate, lower off-rate, increased affinity, decreased dissociation, and/or increased stability compared to a competing surrogate target.
特異性結合 :如本文所用,術語「特異性結合」係指在發生結合之環境中在可能的結合搭配物之間作出區分的能力。在一些實施例中,當存在其他潛在標靶時與一種特定標靶相互作用之結合劑稱為「 特異性結合」至與其相互作用之標靶。在一些實施例中,藉由偵測或測定結合劑與其搭配物之間的締合程度來評估特異性結合;在一些實施例中,藉由偵測或測定結合劑-搭配物複合物的解離程度來評估特異性結合。在一些實施例中,藉由偵測或測定結合劑與替代相互作用競爭的能力來評估特異性結合,該替代相互作用發生於其搭配物與另一實體之間。在一些實施例中,藉由在整個濃度範圍內進行此類偵測或測定來評估特異性結合。 Specific binding : As used herein, the term "specific binding" refers to the ability to distinguish between possible binding partners in the context in which binding occurs. In some embodiments, a binding agent that interacts with one particular target in the presence of other potential targets is said to " specifically bind " to the target with which it interacts. In some embodiments, specific binding is assessed by detecting or measuring the degree of association between the binding agent and its partner; in some embodiments, by detecting or measuring the dissociation of the binding agent-partner complex. to assess specific binding. In some embodiments, specific binding is assessed by detecting or determining the ability of a binding agent to compete with an alternative interaction between its partner and another entity. In some embodiments, specific binding is assessed by performing such detection or determination over a range of concentrations.
基本上 :如本文所用,術語「基本上」係指展現所關注特徵或特性之總程度或近總程度或水準的定性條件。一般技術者應瞭解,生物學及化學現象很少(若曾有)進行完全及/或繼續進行至完全或很少達成或避免絕對結果。因此,術語「基本上」在本文的一些實施例中可用於捕捉多種生物學及化學現象中所固有之完整性的潛在缺乏。 Substantially : As used herein, the term "substantially" refers to the qualitative condition of exhibiting the total or nearly total degree or level of the characteristic or characteristic of interest. Those of ordinary skill will appreciate that biological and chemical phenomena seldom, if ever, proceed to completion and/or proceed to completion or rarely achieve or avoid an absolute result. Thus, the term "substantially" may be used in some embodiments herein to capture a potential lack of integrity inherent in a variety of biological and chemical phenomena.
症狀減少 :如本文所用,當特定疾病、病症或病狀之一或多種症狀的量級(例如強度、嚴重度等)及/或頻率減少時,可使用片語「症狀減少」。在一些實施例中,將特定症狀發作的延遲視為症狀頻率降低之一種形式。 Reduction of symptoms : As used herein, the phrase "reduction of symptoms" may be used when the magnitude (eg, intensity, severity, etc.) and/or frequency of one or more symptoms of a particular disease, disorder, or condition is reduced. In some embodiments, a delay in the onset of a particular symptom is considered a form of decreased frequency of symptoms.
標靶 :一種藥劑或實體若其在其彼此接觸的件下特異性結合至靶向劑或實體,則根據本發明被視為「靶向」另一種藥劑或實體。在一些實施例中,舉例而言,抗體(或其抗原結合片段)靶向其同源抗原決定基或抗原。在一些實施例中,具有特定序列的核酸靶向序列基本上互補的核酸。 Target : An agent or entity is considered to "target" another agent or entity according to the present invention if it specifically binds to the targeting agent or entity under conditions where they are in contact with each other. In some embodiments, for example, an antibody (or antigen-binding fragment thereof) targets its cognate epitope or antigen. In some embodiments, a nucleic acid having a specific sequence targets a nucleic acid that is substantially complementary to the sequence.
靶基因 :如本文所用,術語「靶基因」意謂為了調節(例如調節表現)而被靶向的基因。在一些實施例中,靶基因為標靶基因體複合物的一部分(例如其基因體序列的至少一部分作為標靶基因體複合物之一部分的基因,例如位於錨定序列介導性接合體的內部),如本文所述的一或多種調節劑靶向該基因體複合物。在一些實施例中,調節包含抑制靶基因的表現。在一些實施例中,藉由使靶基因或可操作地連接至靶基因的轉錄控制元件與本文所述之表現抑制系統(例如表現抑制子)接觸來調節靶基因。在一些實施例中,靶基因異常表現(例如過度表現)於細胞中,例如個體(例如患者)的細胞。 Target gene : As used herein, the term "target gene" means a gene that is targeted for regulation (eg, modulation of expression). In some embodiments, the target gene is part of a target gene body complex (e.g., a gene whose gene body sequence is part of a target gene body complex, e.g., within an anchor sequence-mediated adapter ), one or more modulators as described herein target the gene body complex. In some embodiments, modulating comprises suppressing expression of a target gene. In some embodiments, the target gene is regulated by contacting the target gene, or a transcriptional control element operably linked to the target gene, with an expression repression system described herein (eg, an expression repressor). In some embodiments, the target gene is aberrantly expressed (eg, overexpressed) in a cell, such as a cell of an individual (eg, a patient).
靶向部分 :如本文所用,術語「靶向部分」意謂特異性靶向(例如結合)基因體序列元件(例如表現控制序列或錨定序列)的藥劑或實體。在一些實施例中,基因體序列元件近接於且/或可操作地連接至靶基因(例如MYC)。 Targeting moiety : As used herein, the term "targeting moiety" means an agent or entity that specifically targets (eg, binds to) a gene body sequence element (eg, an expression control sequence or an anchor sequence). In some embodiments, the gene body sequence element is in proximity to and/or is operably linked to a target gene (eg, MYC).
治療劑 :如本文所用,片語「治療劑」係指當投與個體時,具有治療作用且/或引發所要生物學及/或藥理學作用的藥劑。在一些實施例中,治療劑為可用於緩解、改善、減輕、抑制、預防疾病、病症及/或病狀、延遲其發作、降低其嚴重度及/或減小其一或多種症狀或特徵之發生率的任何物質。在一些實施例中,治療劑包含本文所述之表現抑制系統,例如表現抑制子。在一些實施例中,治療劑包含編碼本文所述之表現抑制系統(例如表現抑制子)的核酸。在一些實施例中,治療劑包含本文所述之醫藥組合物。 Therapeutic agent : As used herein, the phrase "therapeutic agent" refers to an agent that, when administered to a subject, has a therapeutic effect and/or elicits a desired biological and/or pharmacological effect. In some embodiments, a therapeutic agent is one that is useful in alleviating, ameliorating, alleviating, inhibiting, preventing, delaying the onset of, reducing the severity of, and/or reducing one or more symptoms or characteristics of a disease, disorder, and/or condition. incidence of any substance. In some embodiments, the therapeutic agent comprises an expression inhibition system described herein, eg, an expression inhibitor. In some embodiments, a therapeutic agent comprises a nucleic acid encoding an expression suppression system (eg, an expression inhibitor) described herein. In some embodiments, the therapeutic agent comprises a pharmaceutical composition described herein.
治療有效量 :如本文所用,術語「治療有效量」意謂當作為治療方案之一部分投與時可引發所要生物反應的物質(例如治療劑、組合物及/或調配物)之量。在一些實施例中,物質之治療有效量為當投與患有或易患疾病、病症及/或病狀之個體時,足以治療、診斷、預防該疾病、病症及/或病狀及/或延遲其發作的量。如一般熟習此項技術者將瞭解,物質的有效量可依據諸如所需生物學終點、待遞送的物質、靶細胞或組織等因素而變化。舉例而言,在一些實施例中,調配物中之化合物治療疾病、病症及/或病狀之有效量為緩解、改善、減輕、抑制、預防疾病、病症及/或病狀、延遲其發作、降低其嚴重度及/或降低其一或多種症狀或特徵之發生率的量。在一些實施例中,治療有效量係以單次劑量投與;在一些實施例中,遞送治療有效量需要多次單位劑量。 Therapeutically effective amount : As used herein, the term "therapeutically effective amount" means an amount of a substance (eg, a therapeutic agent, composition and/or formulation) that elicits a desired biological response when administered as part of a treatment regimen. In some embodiments, a therapeutically effective amount of a substance is sufficient to treat, diagnose, prevent the disease, disorder and/or condition and/or when administered to an individual suffering from or susceptible to a disease, disorder and/or condition Amount to delay its onset. As will be appreciated by those of ordinary skill in the art, effective amounts of a substance may vary depending on factors such as the desired biological endpoint, the substance to be delivered, the target cell or tissue, and the like. For example, in some embodiments, the effective amount of the compound in the formulation to treat the disease, disorder and/or condition is to relieve, ameliorate, alleviate, inhibit, prevent the disease, disorder and/or condition, delay its onset, An amount that reduces its severity and/or reduces the incidence of one or more symptoms or characteristics thereof. In some embodiments, a therapeutically effective amount is administered in a single dose; in some embodiments, multiple unit doses are required to deliver a therapeutically effective amount.
相關申請案的交叉參考本申請案主張2021年6月21日申請之美國臨時申請案63/212,991、2021年11月18日申請之美國臨時申請案63/281,022及2021年12月15日申請之專利合作條約申請案PCT/US21/10059的優先權,其全部內容以引用的方式併入本文中。
CROSS REFERENCE TO RELATED APPLICATIONS This application claims
序列表本申請案含有序列表,該序列表已以ASCII格式、以電子方式提交且以全文引用的方式併入本文中。2021年12月6日創建之該ASCII複本命名為O2057-7029WO_SL.txt且大小為624,274個位元組。 SEQUENCE LISTING This application contains a Sequence Listing, which has been filed electronically in ASCII format and is hereby incorporated by reference in its entirety. This ASCII copy created on December 6, 2021 is named O2057-7029WO_SL.txt and is 624,274 bytes in size.
本發明提供經由利用本文所述之表現抑制子或系統調節(例如降低)細胞(例如個體或患者中之細胞)中之靶基因(例如MYC)表現的技術。The present invention provides techniques for modulating (eg, reducing) the expression of a target gene (eg, MYC) in a cell (eg, a cell in an individual or patient) through the use of the expression suppressors or systems described herein.
基因表現的異常調控可導致多種不同疾病及症候群,包括癌症、自體免疫、心血管疾病及肥胖症。特定而言,長久以來已知轉錄因子的過度表現會造成腫瘤發生,且近期研究表明過度表現的致癌轉錄因子可改變細胞的核心自調控迴路。Dysregulation of gene expression can lead to a variety of diseases and syndromes, including cancer, autoimmunity, cardiovascular disease and obesity. In particular, overexpression of transcription factors has long been known to contribute to tumorigenesis, and recent studies have shown that overexpression of oncogenic transcription factors can alter core autoregulatory circuits of the cell.
轉錄因子及主要細胞調控因子MYC在逾50%的人類癌症中頻繁地受到異常調控且在致瘤過程的幾乎每個方面皆起主要作用。除早期反應基因之外,MYC典型地上調基因表現。MYC為最頻繁擴增的致癌基因,且其基因產物的表現升高與腫瘤攻擊性及不良臨床結果相關。c-MYC的水準升高可促進多種組織中的腫瘤發生。大多數腫瘤細胞的生長及增殖依賴於轉錄因子c-MYC。MYC過度表現亦與慢性肝病相關,例如病毒及酒精相關肝病。MYC過度表現量因特定的癌症亞型而異。不希望受理論束縛,調節(例如降低)罹患MYC異常調控病症之個體(例如整個個體,或特定靶組織或組織)中的MYC水準被認為可減輕或消除MYC異常調控病症之症狀。The transcription factor and master cellular regulator MYC is frequently dysregulated in more than 50% of human cancers and plays a central role in nearly every aspect of the tumorigenic process. In addition to early response genes, MYC typically upregulates gene expression. MYC is the most frequently amplified oncogene, and elevated expression of its gene product correlates with tumor aggressiveness and adverse clinical outcome. Elevated levels of c-MYC promote tumorigenesis in a variety of tissues. The growth and proliferation of most tumor cells depend on the transcription factor c-MYC. MYC overexpression is also associated with chronic liver diseases, such as viral and alcohol-related liver diseases. The amount of MYC overexpression varies across specific cancer subtypes. Without wishing to be bound by theory, modulating (eg, reducing) MYC levels in an individual (eg, an entire individual, or a specific target tissue or tissues) suffering from a MYC dysregulated disorder is believed to reduce or eliminate symptoms of the MYC dysregulated disorder.
本發明部分地提供一種表現抑制子,其包含:結合至靶基因啟動子(例如MYC啟動子或可操作地連接至靶基因,例如MYC基因)的靶向部分,及當藉由靶向部分定位時能夠調節(例如降低)靶基因(例如MYC)表現的效應部分。在一些實施例中,本文所揭示之表現抑制子經由靶向部分特異性結合至可操作地連接至靶基因(例如MYC)的表現控制元件(例如啟動子或增強子、抑制子或緘默子),且效應部分調節靶基因(例如MYC)的表現。在一些實施例中,本文所揭示之表現抑制子經由靶向部分特異性結合至包含靶基因(例如MYC)之錨定序列介導性接合體(ASMC)中的錨定序列或與錨定序列近接的序列,且效應部分調節靶基因(例如MYC)的表現。在一些實施例中,本文所揭示之表現抑制子特異性結合至位於靶基因(例如MYC)之超級增強子區域中的基因體基因座,且效應部分調節靶基因(例如MYC)的表現。The present invention provides, in part, an expression suppressor comprising: a targeting moiety that binds to a target gene promoter (such as the MYC promoter or is operably linked to a target gene, such as the MYC gene), and when positioned by the targeting moiety is an effector moiety capable of modulating (eg, reducing) the expression of a target gene (eg, MYC). In some embodiments, an expression suppressor disclosed herein specifically binds to an expression control element (e.g., a promoter or enhancer, repressor, or silencer) operably linked to a target gene (e.g., MYC) via a targeting moiety , and the effector moiety regulates the expression of target genes such as MYC. In some embodiments, the expression suppressor disclosed herein specifically binds to or binds to an anchor sequence in an anchor sequence-mediated adapter (ASMC) comprising a target gene (eg, MYC) via a targeting moiety. Proximate sequences, and effector moieties regulate the expression of target genes such as MYC. In some embodiments, an expression suppressor disclosed herein specifically binds to a gene body locus located in a super-enhancer region of a target gene (eg, MYC), and the effector moiety modulates expression of the target gene (eg, MYC).
本發明另外部分地提供一種表現抑制系統,其包含兩種或更多種各自包含靶向部分及視情況存在之效應部分的表現抑制子。在一些實施例中,靶向部分靶向兩種或更多種不同序列(例如各表現抑制因子可靶向不同序列)。在一些實施例中,第一表現抑制子結合至可操作地連接至靶基因(例如MYC)的轉譯調控元件(例如啟動子或轉錄起點(TSS)),且第二表現抑制子結合至包含靶基因(例如MYC)之錨定序列介導性接合體(ASMC)中的錨定序列。在一些實施例中,第一表現抑制子結合至可操作地連接至靶基因(例如MYC)的轉譯調控元件(例如啟動子或轉錄起點(TSS)),且第二表現抑制子結合至可操作地連接至靶基因(例如MYC)的表現控制元件(例如增強子、超級增強子、抑制子或緘默子)。在一些實施例中,第一表現抑制子結合至包含靶基因(例如MYC)之錨定序列介導性接合體(ASMC)中的錨定序列,且第二表現抑制子結合至可操作地連接至靶基因的表現控制元件(例如增強子、超級增強子、抑制子或緘默子)。一般而言,藉由表現抑制系統調節靶基因(例如MYC)的表現涉及使第一表現抑制子及第二表現抑制子分別結合至第一及第二DNA序列。第一及第二DNA序列的結合使第一及第二效應部分的功能定位於彼等位點。不希望受理論所束縛,在一些實施例中,利用第一與第二抑制子部分的功能穩定地抑制與第一及/或第二DNA序列相連或包含第一及/或第二DNA序列之靶基因的表現,例如其中第一及/或第二DNA序列為或包含靶基因或一或多個可操作地連接之轉錄控制元件的序列。在一些實施例中,表現抑制系統係由雙順反子核酸序列編碼。The invention additionally provides in part an expression inhibition system comprising two or more expression inhibitors each comprising a targeting moiety and optionally an effector moiety. In some embodiments, the targeting moiety targets two or more different sequences (eg, each expression inhibitor can target a different sequence). In some embodiments, the first expression repressor binds to a translational regulatory element (e.g., a promoter or transcription start (TSS)) operably linked to a target gene (e.g., MYC), and the second expression repressor binds to a gene comprising the target Anchor sequence in the anchor sequence-mediated adapter complex (ASMC) of a gene such as MYC. In some embodiments, the first expression repressor binds to a translational regulatory element (e.g., a promoter or transcription start (TSS)) operably linked to a target gene (e.g., MYC), and the second expression repressor binds to an operably linked Expression control elements (such as enhancers, super-enhancers, suppressors, or silencers) that are positively linked to target genes (such as MYC). In some embodiments, a first expression suppressor binds to an anchor sequence in an anchor sequence-mediated adapter (ASMC) comprising a target gene (e.g., MYC), and a second expression suppressor binds to an operably linked Expression control elements (eg, enhancers, super-enhancers, suppressors, or silencers) to target genes. In general, modulation of expression of a target gene (eg, MYC) by an expression repression system involves binding a first expression repressor and a second expression repressor to first and second DNA sequences, respectively. The combination of the first and second DNA sequences localizes the function of the first and second effector moieties to those sites. Without wishing to be bound by theory, in some embodiments, the function of the first and second repressor moieties is utilized to stably inhibit a gene linked to or comprising the first and/or second DNA sequence. Expression of a target gene, eg, wherein the first and/or second DNA sequence is or comprises the sequence of the target gene or one or more operably linked transcriptional control elements. In some embodiments, the expression suppression system is encoded by a bicistronic nucleic acid sequence.
本發明進一步提供編碼該等表現抑制子及/或表現抑制因子系統的核酸、包含表現抑制子及/或表現抑制系統的組合物,及遞送該等核酸的方法。進一步提供利用本文所述之表現抑制子或表現抑制系統增強細胞中之靶基因表現(例如MYC基因表現)的方法。The present invention further provides nucleic acids encoding the expression inhibitors and/or expression inhibitor systems, compositions comprising the expression inhibitors and/or expression inhibitor systems, and methods of delivering the nucleic acids. Further provided are methods of enhancing expression of a target gene (eg, MYC gene expression) in a cell using the expression suppressors or expression suppression systems described herein.
表現抑制子 如本文所述,本發明部分地提供用於調節(例如降低)靶基因(例如MYC)表現的表現抑制子。在一些實施例中,表現抑制子可包含結合至靶基因啟動子(例如MYC啟動子)的靶向部分及視情況存在的效應部分。在一些實施例中,靶向部分特異性結合標靶DNA序列,例如MYC DNA序列,藉此將表現抑制子的功能定位於DNA序列。在一些實施例中,表現抑制子包含一個靶向部分及一個效應部分。在一些實施例中,表現抑制子包含一個靶向部分及複數個效應部分(例如1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20個或更多個效應域(且視情況,少於20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3或2個效應域))。Expression Suppressors As described herein, the present invention provides, in part, expression suppressors for modulating (e.g., reducing) the expression of a target gene (e.g., MYC). In some embodiments, an expression suppressor may comprise a targeting moiety that binds to a target gene promoter (eg, the MYC promoter) and optionally an effector moiety. In some embodiments, the targeting moiety specifically binds to a target DNA sequence, such as a MYC DNA sequence, thereby localizing the expression suppressor function to the DNA sequence. In some embodiments, the expression suppressor comprises a targeting moiety and an effector moiety. In some embodiments, the expression inhibitor comprises a targeting moiety and a plurality of effector moieties (eg, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 , 16, 17, 18, 19, 20 or more effector domains (and optionally, fewer than 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3 or 2 effector fields)).
表現抑制子可包含複數個效應部分,其中各效應部分包含與其他效應部分不同的功能。舉例而言,表現抑制子可包含兩個效應部分,其中第一效應部分包含DNA甲基化酶功能,且第二效應部分包含轉錄抑制子功能。在一些實施例中,表現抑制子包含在降低靶基因(例如MYC)表現方面具有彼此互補之功能的效應部分,其中該等功能合起來能夠抑制表現且視情況不抑制表現或當個別地存在時對表現的抑制可忽略。在一些實施例中,表現抑制子包含複數個效應部分,其中各效應部分與各其他效應部分互補,各效應部分減少靶基因(例如MYC)表現。An expression suppressor may comprise a plurality of effector moieties, wherein each effector moiety comprises a different function than the other effector moieties. For example, an expression repressor can comprise two effector moieties, wherein a first effector moiety comprises a DNA methylase function and a second effector moiety comprises a transcription repressor function. In some embodiments, the expression suppressor comprises effector moieties that have functions that are complementary to each other in reducing expression of a target gene (e.g., MYC), wherein the functions together are capable of inhibiting expression and optionally do not inhibit expression or when present individually Inhibition of performance is negligible. In some embodiments, the expression suppressor comprises a plurality of effector moieties, wherein each effector moiety is complementary to each other effector moiety, each effector moiety reduces expression of a target gene (eg, MYC).
在一些實施例中,表現抑制子包含效應部分的組合,在減少靶基因(例如MYC)表現方面,該等效應部分的功能彼此協同發揮作用。不希望受理論束縛,在一些實施例中,對基因體基因座的表觀遺傳修飾累積起來,因為多種個別的轉錄活化表觀遺傳標記物(例如多種不同類型的表觀遺傳標記物及/或指定類型的更大範圍標記)合起來抑制表現比單獨的個別修飾更有效(例如使表現更大幅度的減少及/或使表現更持久的減少)。在一些實施例中,表現抑制子包含複數個效應部分,其中各效應部分與各其他效應部分互補,例如各效應部分減少靶基因(例如MYC)表現。在一些實施例中,表現抑制子(包含複數個彼此協同作用的效應部分)在抑制靶基因(例如MYC)表現方面比包含個別效應部分的表現抑制子更有效。在一些實施例中,包含該等複數個效應部分的表現抑制子有效降低靶基因(例如MYC)表現的有效性為包含個別效應部分之表現抑制子的至少1.05x (亦即,1.05倍)、1.1x、1.15x、1.2x、1.25x、1.3x、1.35x、1.4x、1.45x、1.5x、1.55x、1.6x、1.65x、1.7x、1.75x、1.8x、1.85x、1.9x、1.95x、2x、3x、4x、5x、6x、7x、8x、9x、10x、20x、30x、40x、50x、60x、70x、80x、90x或100x。In some embodiments, the expression suppressor comprises a combination of effector moieties that function synergistically with each other in reducing the expression of a target gene (eg, MYC). Without wishing to be bound by theory, in some embodiments, epigenetic modifications to gene body loci accumulate as multiple individual transcriptional activation epigenetic markers (e.g., multiple different types of epigenetic markers and/or A wider range of markers of a given type) together are more effective in suppressing a representation than the individual modifications alone (eg, a greater reduction in the representation and/or a longer-lasting reduction in the representation). In some embodiments, the expression suppressor comprises a plurality of effector moieties, wherein each effector moiety is complementary to each other effector moiety, eg, each effector moiety reduces expression of a target gene (eg, MYC). In some embodiments, an expression suppressor comprising a plurality of effector moieties that act synergistically with each other is more effective at inhibiting expression of a target gene (eg, MYC) than an expression suppressor comprising individual effector moieties. In some embodiments, an expression suppressor comprising the plurality of effector moieties is at least 1.05x (i.e., 1.05-fold) less effective at reducing expression of a target gene (e.g., MYC) than an expression suppressor comprising individual effector moieties. 1.1x, 1.15x, 1.2x, 1.25x, 1.3x, 1.35x, 1.4x, 1.45x, 1.5x, 1.55x, 1.6x, 1.65x, 1.7x, 1.75x, 1.8x, 1.85x, 1.9x , 1.95x, 2x, 3x, 4x, 5x, 6x, 7x, 8x, 9x, 10x, 20x, 30x, 40x, 50x, 60x, 70x, 80x, 90x, or 100x.
在一些實施例中,表現抑制子包含一或多個靶向部分,例如Cas域、TAL效應域或Zn指域。在一個實施例中,當表現抑制系統包含相同類型的兩個或更多個靶向部分(例如兩個或更多個Cas域)時,該等靶向部分特異性結合兩種或更多種不同序列。舉例而言,在包含兩個或更多個Cas域的表現抑制系統中,可選擇或改變兩個或更多個Cas域,使得其僅明顯地結合與其靶序列對應的gRNA (例如與對應於另一Cas域之標靶的結合不明顯)。In some embodiments, the expression suppressor comprises one or more targeting moieties, such as a Cas domain, a TAL effector domain, or a Zn finger domain. In one embodiment, when the expression inhibition system comprises two or more targeting moieties of the same type (eg, two or more Cas domains), the targeting moieties specifically bind two or more different sequence. For example, in an expression suppression system comprising two or more Cas domains, two or more Cas domains can be selected or altered such that they only significantly bind gRNAs corresponding to their target sequences (e.g., corresponding to Binding to the target of another Cas domain was not evident).
在一些實施例中,表現抑制子包含共價連接(例如藉由肽鍵共價連接)的靶向部分及效應部分。在一些實施例中,靶向部分及效應部分位於同一多肽鏈上,例如藉由一或多個肽鍵及/或一個連接子連接。在一些實施例中,表現抑制子為或包含融合分子,例如包含藉由肽鍵及/或連接子連接之靶向部分及效應部分。在一些實施例中,表現抑制子包含安置於同一多肽鏈上之效應部分之N端的靶向部分。在一些實施例中,表現抑制子包含安置於同一多肽鏈上之效應部分之C端的靶向部分。在一些實施例中,表現抑制子包含藉由非肽鍵共價連接的靶向部分及效應部分。在一些實施例中,靶向部分與效應部分藉由非肽鍵偶聯。在一些實施例中,表現抑制子包含一個靶向部分及複數個效應部分,其中靶向部分與複數個效應部分共價連接,例如藉由肽鍵共價連接(例如靶向部分與複數個效應部分皆藉由一系列共價鍵連接,但各個別部分與每個其他效應部分可以不共享共價鍵)。In some embodiments, an expression inhibitor comprises a targeting moiety and an effector moiety covalently linked (eg, by a peptide bond). In some embodiments, the targeting moiety and the effector moiety are on the same polypeptide chain, eg, linked by one or more peptide bonds and/or a linker. In some embodiments, the expression inhibitor is or comprises a fusion molecule, for example comprising a targeting moiety and an effector moiety linked by a peptide bond and/or a linker. In some embodiments, the expression inhibitor comprises a targeting moiety disposed N-terminal to the effector moiety on the same polypeptide chain. In some embodiments, the expression inhibitor comprises a targeting moiety disposed C-terminal to the effector moiety on the same polypeptide chain. In some embodiments, the expression inhibitor comprises a targeting moiety and an effector moiety covalently linked by a non-peptide bond. In some embodiments, targeting moieties and effector moieties are coupled via non-peptide bonds. In some embodiments, the expression inhibitor comprises a targeting moiety and a plurality of effector moieties, wherein the targeting moiety is covalently linked to the plurality of effector moieties, such as by peptide bonds (eg, the targeting moiety is covalently linked to the plurality of effector moieties). The moieties are all linked by a series of covalent bonds, but each individual moiety may not share a covalent bond with every other effector moiety).
在其他實施例中,表現抑制子包含非共價連接(例如彼此非共價結合)的靶向部分及效應部分。在一些實施例中,表現抑制子包含非共價結合至效應部分的靶向部分或反之亦然。在一些實施例中,表現抑制子包含一個靶向部分及複數個效應部分,其中靶向部分與至少一個效應部分非共價連接,例如彼此非共價結合,且其中靶向部分與至少一個其他效應部分共價連接,例如藉由肽鍵共價連接。In other embodiments, the expression inhibitor comprises a targeting moiety and an effector moiety that are non-covalently linked (eg, non-covalently bound to each other). In some embodiments, the expression inhibitor comprises a targeting moiety non-covalently bound to the effector moiety or vice versa. In some embodiments, the expression inhibitor comprises a targeting moiety and a plurality of effector moieties, wherein the targeting moiety is non-covalently linked to at least one effector moiety, such as to each other, and wherein the targeting moiety is non-covalently linked to at least one other The effector moiety is covalently linked, eg, by a peptide bond.
一般而言,如本文所述的表現抑制子結合(例如經由靶向部分)近接於及/或可操作地連接至靶基因(例如MYC)的基因體序列元件。在一些實施例中,表現抑制子對基因體序列元件的結合調節(例如降低)靶基因(例如MYC)的表現。舉例而言,包含效應部分之表現抑制子結合至基因體序列元件可調節(例如降低)靶基因(例如MYC)的表現,該效應部分募集轉錄機構的組分或抑制轉錄機構的組分募集。作為另一實例,包含具有酶活性之效應部分(例如表觀遺傳修飾部分)之表現抑制子的結合可經由效應部分的酶活性定位來調節(例如降低)靶基因(例如MYC)的表現。作為另一實例,表現抑制子對基因體序列元件的結合與表現抑制子的酶活性定位可對靶基因(例如MYC)之表現促成所得的調節(例如降低)。In general, an expression suppressor as described herein binds (eg, via a targeting moiety) to a gene body sequence element in close proximity to and/or operably linked to a target gene (eg, MYC). In some embodiments, binding of a gene body sequence element by an expression suppressor modulates (eg, reduces) expression of a target gene (eg, MYC). For example, binding of an expression suppressor comprising an effector moiety that recruits or inhibits the recruitment of a component of the transcriptional machinery to a gene body sequence element can modulate (eg, decrease) the expression of a target gene (eg, MYC). As another example, the incorporation of an expression suppressor comprising an effector moiety having enzymatic activity (eg, an epigenetic modification moiety) can modulate (eg, decrease) the expression of a target gene (eg, MYC) via localization of the enzymatic activity of the effector moiety. As another example, binding of an expression repressor to a gene body sequence element and localization of the enzymatic activity of the expression repressor can contribute to the resulting modulation (eg, reduction) in the expression of a target gene (eg, MYC).
在一些實施例中,表現抑制子包含效應部分,其中該效應部分包含選自以下的蛋白質:MQ1、DNMT1、DNMT3A1、DNMT3A2、DNMT3B1、DNMT3B2、DNMT3B3、DNMT3B4、DNMT3B5、DNMT3B6、DNMT3L、HDAC1、HDAC2、HDAC3、HDAC4、HDAC5、HDAC6、HDAC7、HDAC8、HDAC9、HDAC10、HDAC11、SIRT1、SIRT2、SIRT3、SIRT4、SIRT5、SIRT6、SIRT7、SIRT8、SIRT9、KDM1A (亦即,LSD1)、KDM1B (亦即,LSD2)、KDM2A、KDM2B、KDM5A、KDM5B、KDM5C、KDM5D、KDM4B、NO66、SETDB1、SETDB2、EHMT2 (亦即,G9A)、EHMT1 (亦即,GLP)、SUV39H1、EZH2、EZH1、SUV39H2、SETD8、SUV420H1、SUV420H2、KRAB (例如KRAB域)、MeCP2、HP1、RBBP4、REST、FOG1、SUZ12或其功能變異體或片段。In some embodiments, the expression suppressor comprises an effector portion, wherein the effector portion comprises a protein selected from the group consisting of MQ1, DNMT1, DNMT3A1, DNMT3A2, DNMT3B1, DNMT3B2, DNMT3B3, DNMT3B4, DNMT3B5, DNMT3B6, DNMT3L, HDAC1, HDAC2, HDAC3, HDAC4, HDAC5, HDAC6, HDAC7, HDAC8, HDAC9, HDAC10, HDAC11, SIRT1, SIRT2, SIRT3, SIRT4, SIRT5, SIRT6, SIRT7, SIRT8, SIRT9, KDM1A (ie, LSD1), KDM1B (ie, LSD2 ), KDM2A, KDM2B, KDM5A, KDM5B, KDM5C, KDM5D, KDM4B, NO66, SETDB1, SETDB2, EHMT2 (ie, G9A), EHMT1 (ie, GLP), SUV39H1, EZH2, EZH1, SUV39H2, SETD8, SUV420H1, SUV420H2, KRAB (eg KRAB domain), MeCP2, HP1, RBBP4, REST, FOG1, SUZ12 or functional variants or fragments thereof.
在一些實施例中,表現抑制子包含第一效應部分及第二效應部分,其中該第一效應部分包含選自以下的蛋白質:MQ1、DNMT1、DNMT3A1、DNMT3A2、DNMT3B1、DNMT3B2、DNMT3B3、DNMT3B4、DNMT3B5、DNMT3B6、DNMT3L、HDAC1、HDAC2、HDAC3、HDAC4、HDAC5、HDAC6、HDAC7、HDAC8、HDAC9、HDAC10、HDAC11、SIRT1、SIRT2、SIRT3、SIRT4、SIRT5、SIRT6、SIRT7、SIRT8、SIRT9、KDM1A (亦即,LSD1)、KDM1B (亦即,LSD2)、KDM2A、KDM2B、KDM5A、KDM5B、KDM5C、KDM5D、KDM4B、NO66、SETDB1、SETDB2、EHMT2 (亦即,G9A)、EHMT1 (亦即,GLP)、SUV39H1、EZH2、EZH1、SUV39H2、SETD8、SUV420H1、SUV420H2、KRAB (例如KRAB域)、MeCP2、HP1、RBBP4、REST、FOG1、SUZ12或其功能變異體或片段,且第二效應部分包含選自以下的不同蛋白質:MQ1、DNMT1、DNMT3A1、DNMT3A2、DNMT3B1、DNMT3B2、DNMT3B3、DNMT3B4、DNMT3B5、DNMT3B6、DNMT3L、HDAC1、HDAC2、HDAC3、HDAC4、HDAC5、HDAC6、HDAC7、HDAC8、HDAC9、HDAC10、HDAC11、SIRT1、SIRT2、SIRT3、SIRT4、SIRT5、SIRT6、SIRT7、SIRT8、SIRT9、KDM1A (亦即,LSD1)、KDM1B (亦即,LSD2)、KDM2A、KDM2B、KDM5A、KDM5B、KDM5C、KDM5D、KDM4B、NO66、SETDB1、SETDB2、EHMT2 (亦即,G9A)、EHMT1 (亦即,GLP)、SUV39H1、EZH2、EZH1、SUV39H2、SETD8、SUV420H1、SUV420H2、KRAB (例如KRAB域)、MeCP2、HP1、RBBP4、REST、FOG1、SUZ12或其功能變異體或片段。In some embodiments, the expression suppressor comprises a first effector moiety and a second effector moiety, wherein the first effector moiety comprises a protein selected from the group consisting of: MQ1, DNMT1, DNMT3A1, DNMT3A2, DNMT3B1, DNMT3B2, DNMT3B3, DNMT3B4, DNMT3B5 , DNMT3B6, DNMT3L, HDAC1, HDAC2, HDAC3, HDAC4, HDAC5, HDAC6, HDAC7, HDAC8, HDAC9, HDAC10, HDAC11, SIRT1, SIRT2, SIRT3, SIRT4, SIRT5, SIRT6, SIRT7, SIRT8, SIRT9, KDM1A (i.e., LSD1), KDM1B (ie, LSD2), KDM2A, KDM2B, KDM5A, KDM5B, KDM5C, KDM5D, KDM4B, NO66, SETDB1, SETDB2, EHMT2 (ie, G9A), EHMT1 (ie, GLP), SUV39H1, EZH2 , EZH1, SUV39H2, SETD8, SUV420H1, SUV420H2, KRAB (e.g. KRAB domain), MeCP2, HP1, RBBP4, REST, FOG1, SUZ12 or a functional variant or fragment thereof, and the second effector moiety comprises a different protein selected from: MQ1, DNMT1, DNMT3A1, DNMT3A2, DNMT3B1, DNMT3B2, DNMT3B3, DNMT3B4, DNMT3B5, DNMT3B6, DNMT3L, HDAC1, HDAC2, HDAC3, HDAC4, HDAC5, HDAC6, HDAC7, HDAC8, HDAC9, HDAC10, HDAC11, SIRT1, SIRT2, SIRT3, SIRT4, SIRT5, SIRT6, SIRT7, SIRT8, SIRT9, KDM1A (aka LSD1), KDM1B (aka LSD2), KDM2A, KDM2B, KDM5A, KDM5B, KDM5C, KDM5D, KDM4B, NO66, SETDB1, SETDB2, EHMT2 ( i.e., G9A), EHMT1 (i.e., GLP), SUV39H1, EZH2, EZH1, SUV39H2, SETD8, SUV420H1, SUV420H2, KRAB (eg, KRAB domain), MeCP2, HP1, RBBP4, REST, FOG1, SUZ12, or functional variants thereof body or fragment.
在一些實施例中,本發明提供編碼如本文所述之表現抑制子、表現抑制系統、靶向部分及/或效應部分的核酸序列。熟習此項技術者已知RNA的核酸序列與相應DNA序列一致,但其中胸腺嘧啶(T)典型地被尿嘧啶(U)置換。應理解,當核苷酸序列由DNA序列(例如包含、A、T、G、C)表示時,本發明亦提供相應的RNA序列(例如包含A、U、G、C),其中「U」置換「T」。本文中使用習知符號來描述聚核苷酸序列:單股聚核苷酸序列之左側末端為5'端;雙股聚核苷酸序列之左側方向稱為5'方向。In some embodiments, the present invention provides nucleic acid sequences encoding expression inhibitors, expression inhibition systems, targeting moieties and/or effector moieties as described herein. The nucleic acid sequence of RNA is known to those skilled in the art to be identical to the corresponding DNA sequence, but in which thymine (T) is typically replaced by uracil (U). It should be understood that when a nucleotide sequence is represented by a DNA sequence (such as comprising, A, T, G, C), the present invention also provides a corresponding RNA sequence (such as comprising A, U, G, C), wherein "U" Replace "T". Conventional symbols are used herein to describe polynucleotide sequences: the left end of a single-stranded polynucleotide sequence is the 5' end; the left direction of a double-stranded polynucleotide sequence is referred to as the 5' direction.
熟習此項技術者應瞭解,由於遺傳密碼的簡併,因此可產生編碼包含如本文所述之靶向部分及/或效應部分之表現抑制子的多種核苷酸序列,其中一些與本文所揭示之核酸序列具有相似性,例如90%、95%、96%、97%、98%或99%一致。舉例而言,密碼子AGA、AGG、CGA、CGC、CGG及CGU皆編碼胺基酸精胺酸。因此,在本發明之核酸分子中的每個位置,在精胺酸由密碼子說明的情況下,可將密碼子改變成上述任一種相應密碼子而不改變所編碼的多肽。Those skilled in the art will appreciate that due to the degeneracy of the genetic code, a variety of nucleotide sequences encoding expression inhibitors comprising targeting moieties and/or effector moieties as described herein can be generated, some of which are similar to those disclosed herein. The nucleic acid sequences have similarity, such as 90%, 95%, 96%, 97%, 98% or 99% identity. For example, the codons AGA, AGG, CGA, CGC, CGG, and CGU all encode the amino acid arginine. Thus, at each position in the nucleic acid molecules of the invention, where arginine is specified by a codon, the codon can be changed to any of the corresponding codons described above without altering the encoded polypeptide.
在一些實施例中,編碼包含靶向部分及/或效應部分之表現抑制子的核酸內聚性可為密碼子優化之編碼區(根據哺乳動物(例如人類)中之密碼子使用來優化)的一部分或全部。在一些實施例中,編碼靶向部分及/或效應部分的核酸內聚性經密碼子優化以增強蛋白質表現及/或增加蛋白質表現的持續時間。在一些實施例中,藉由密碼子優化之核酸序列產生的蛋白質比該蛋白質當藉由未經密碼子優化之核酸序列編碼時的水準高至少1%、至少2%、至少5%、至少10%、至少15%、至少20%、至少30%、至少40%或至少50%。In some embodiments, the cohesiveness of a nucleic acid encoding an expression suppressor comprising a targeting moiety and/or an effector moiety may be codon-optimized in a coding region (optimized according to codon usage in mammals, such as humans). part or all. In some embodiments, the cohesion of the nucleic acid encoding the targeting moiety and/or the effector moiety is codon-optimized to enhance protein expression and/or increase the duration of protein expression. In some embodiments, the protein produced by the codon-optimized nucleic acid sequence is at least 1%, at least 2%, at least 5%, at least 10% higher than the level of the protein when encoded by the non-codon-optimized nucleic acid sequence %, at least 15%, at least 20%, at least 30%, at least 40%, or at least 50%.
表現抑制系統 本發明之表現抑制系統可包含兩種或更多種表現抑制子。在一些實施例中,表現抑制系統包含2、3、4、5、6、7、8、9、10、11、12種或更多種表現抑制子(且視情況不超過15、14、13、12、11、10、9、8、7、6、5、4、3或2)。在一些實施例中,表現抑制系統靶向兩種或更多種不同序列(例如第1及第2、第3、第4、第5、第6、第7、第8、第9、第10、第11、第12及/或其他DNA序列,且視情況不超過第20、第19、第18、第17、第16、第15、第14、第13、第12、第11、第10、第9、第8、第6、第5、第4、第3或第2序列)。在一些實施例中,表現抑制系統包含複數個表現抑制子,其中複數個表現抑制子中的每個成員對複數個表現抑制子中之另一個成員的結合偵測不到,例如不結合。在一些實施例中,表現抑制系統包含第一表現抑制子及第二表現抑制子,其中第一表現抑制子對第二表現抑制子的結合偵測不到,例如不結合。Expression Inhibition System The expression inhibition system of the present invention may comprise two or more expression inhibitors. In some embodiments, the expression suppression system comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more expression suppressors (and optionally no more than 15, 14, 13 , 12, 11, 10, 9, 8, 7, 6, 5, 4, 3 or 2). In some embodiments, the expression suppression system targets two or more different sequences (e.g., 1st and 2nd, 3rd, 4th, 5th, 6th, 7th, 8th, 9th, 10th , No. 11, No. 12 and/or other DNA sequences, and not exceeding No. 20, No. 19, No. 18, No. 17, No. 16, No. 15, No. 14, No. 13, No. 12, No. 11, No. 10 as the case may be , 9th, 8th, 6th, 5th, 4th, 3rd or 2nd sequence). In some embodiments, the expression suppression system comprises a plurality of expression suppressors, wherein each member of the plurality of expression suppressors is undetectable, eg, does not bind, another member of the plurality of expression suppressors. In some embodiments, the expression suppression system comprises a first expression inhibitor and a second expression inhibitor, wherein binding of the first expression inhibitor to the second expression inhibitor is undetectable, eg, does not bind.
在一些實施例中,本發明之表現抑制系統包含兩種或更多種表現抑制子,其中表現抑制子一起存在於組合物、醫藥組合物或混合物中。在一些實施例中,本發明之表現抑制系統包含兩種或更多種表現抑制子,其中一或多個表現抑制子不與至少一個其他表現抑制子混合。舉例而言,表現抑制系統可包含第一表現抑制子及第二表現抑制子,其中第一表現抑制子於一個細胞之核中的存在與第二表現抑制子於同一細胞之核中的存在不重疊,其中表現抑制系統經由第一與第二表現抑制子之不重疊存在來達成MYC基因表現的降低。在一些實施例中,表現抑制系統使MYC基因表現達成的降幅大於單獨的第一或第二表現抑制子所達成之MYC基因表現的降幅。In some embodiments, the expression inhibitory systems of the invention comprise two or more expression inhibitors, wherein the expression inhibitors are present together in a composition, pharmaceutical composition or mixture. In some embodiments, the expression inhibitory systems of the invention comprise two or more expression inhibitors, wherein one or more expression inhibitors are not mixed with at least one other expression inhibitor. For example, an expression inhibitory system can comprise a first expression inhibitor and a second expression inhibitor, wherein the presence of the first expression inhibitor in the nucleus of a cell is different from the presence of the second expression inhibitor in the nucleus of the same cell. Overlap, wherein the expression repression system achieves a reduction in MYC gene expression through the non-overlapping presence of first and second expression repressors. In some embodiments, the expression suppression system achieves a reduction in MYC gene expression that is greater than the reduction in MYC gene expression achieved by the first or second expression suppressor alone.
在一些實施例中,表現抑制系統中的表現抑制子各自包含不同的靶向部分(例如第一、第二、第三或其他表現抑制子各自包含彼此不同的靶向部分)。舉例而言,表現抑制系統可包含第一表現抑制子及第二表現抑制子,其中該第一表現抑制子包含第一靶向部分(例如Cas9域、TAL效應域或Zn指域),且第二表現抑制子包含不同於第一靶向部分的第二靶向部分(例如Cas9域、TAL效應域或Zn指域)。在一些實施例中,不同可意謂包含不同類型的靶向部分,例如第一靶向部分包含Cas9域,且第二DNA靶向部分包含Zn指域。在其他實施例中,不同可意謂包含相同類型之靶向部分的不同變異體,例如第一靶向部分包含第一Cas9域(例如來自第一物種)且第二靶向部分包含第二Cas9域(例如來自第二物種)。在一個實施例中,當表現抑制系統包含相同類型的兩個或更多個靶向部分(例如兩個或更多個Cas9或ZF域)時,靶向部分特異性結合兩種或更多種不同序列。舉例而言,在包含兩個或更多個Cas9域的表現抑制系統中,可選擇或改變兩個或更多個Cas9域,使得其僅明顯地結合與其靶序列對應的gRNA (例如與對應於另一Cas9域之標靶的結合不明顯)。在另一實例中,在包含兩個或更多個效應部分的表現抑制系統中,可選擇或改變兩個或更多個效應部分,使得其僅明顯地結合至其靶序列(例如與另一效應部分之靶序列的結合不明顯)。In some embodiments, the expression inhibitors in the expression inhibition system each comprise a different targeting moiety (eg, the first, second, third or other expression inhibitors each comprise a different targeting moiety from the other). For example, an expression suppression system can comprise a first expression inhibitor and a second expression inhibitor, wherein the first expression inhibitor comprises a first targeting moiety (e.g., a Cas9 domain, a TAL effector domain, or a Zn finger domain), and the second The two-expression suppressor comprises a second targeting moiety (eg Cas9 domain, TAL effector domain or Zn finger domain) different from the first targeting moiety. In some embodiments, different may mean comprising different types of targeting moieties, for example a first targeting moiety comprising a Cas9 domain and a second DNA targeting moiety comprising a Zn finger domain. In other embodiments, different may mean different variants comprising the same type of targeting moiety, for example a first targeting moiety comprising a first Cas9 domain (e.g. from a first species) and a second targeting moiety comprising a second Cas9 domain (eg from a second species). In one embodiment, when the expression inhibition system comprises two or more targeting moieties of the same type (eg, two or more Cas9 or ZF domains), the targeting moieties specifically bind two or more different sequence. For example, in an expression suppression system comprising two or more Cas9 domains, two or more Cas9 domains can be selected or altered such that they only significantly bind gRNAs corresponding to their target sequences (e.g., those corresponding to Binding of the target of another Cas9 domain was not evident). In another example, in an expression inhibition system comprising two or more effector moieties, the two or more effector moieties can be selected or altered such that they only significantly bind to their target sequence (e.g., with another binding of the target sequence of the effector moiety was not evident).
在一些實施例中,表現抑制系統包含三種或更多種表現抑制子且兩種或更多種表現抑制子包含相同的靶向部分。舉例而言,表現抑制系統可包含三種表現抑制子,其中第一與第二表現抑制子均包含第一靶向部分且第三表現抑制子包含不同的第二靶向部分。在另一實例中,表現抑制系統可包含四種表現抑制子,其中第一與第二表現抑制子均包含第一靶向部分且第三及第四表現抑制子包含不同的第二靶向部分。在另一實例中,表現抑制系統可包含五種表現抑制子,其中第一與第二表現抑制子均包含第一靶向部分,第三與第四表現抑制子均包含不同的第二靶向部分,且第五表現抑制子包含不同的第三靶向部分。如上文所述,不同可意謂包含不同類型的靶向部分或包含相同類型之靶向部分的不同變異體。In some embodiments, the expression inhibition system comprises three or more expression inhibitors and the two or more expression inhibitors comprise the same targeting moiety. For example, an expression inhibitory system may comprise three expression inhibitors, wherein the first and second expression inhibitors both comprise a first targeting moiety and the third expression inhibitor comprises a different second targeting moiety. In another example, the expression inhibitory system can comprise four expression inhibitors, wherein the first and second expression inhibitors each comprise a first targeting moiety and the third and fourth expression inhibitors comprise a different second targeting moiety . In another example, the expression inhibitory system may comprise five expression inhibitors, wherein the first and second expression inhibitors each comprise a first targeting moiety, and the third and fourth expression inhibitors each comprise a different second targeting moiety. moiety, and the fifth expression suppressor comprises a different third targeting moiety. As mentioned above, different can mean comprising different types of targeting moieties or comprising different variants of the same type of targeting moieties.
在一些實施例中,表現抑制系統中的表現抑制子各自結合至不同DNA序列(例如第一、第二、第三或其他表現抑制子各自結合彼此不同的DNA序列)。舉例而言,表現抑制系統可包含第一表現抑制子及第二表現抑制子,其中第一表現抑制子結合至第一DNA序列,且第二表現抑制子結合至第二DNA序列。在一些實施例中,不同可意謂:一種表現抑制子所結合之DNA序列與另一種表現抑制子所結合之DNA序列之間存在至少一個不一致的位置,或一種表現抑制子所結合之DNA序列中存在的至少一個位置不存在於另一種表現抑制子所結合的DNA序列中。In some embodiments, the expression inhibitors in the expression inhibition system each bind to a different DNA sequence (eg, the first, second, third, or other expression inhibitors each bind to a different DNA sequence from the other). For example, an expression inhibitory system can comprise a first expression inhibitor and a second expression inhibitor, wherein the first expression inhibitor binds to a first DNA sequence and the second expression inhibitor binds to a second DNA sequence. In some embodiments, differing may mean that there is at least one position of discordance between the DNA sequence bound by one expression inhibitor and the DNA sequence bound by another expression inhibitor, or that the DNA sequence bound by an expression inhibitor At least one position present in is not present in the DNA sequence to which another expression suppressor binds.
在一些實施例中,第一DNA序列可位於第一基因體DNA股上且第二DNA序列可位於第二基因體DNA股上。在一些實施例中,第一DNA序列可位於與第二DNA序列相同的基因體DNA股上。In some embodiments, the first DNA sequence can be located on a first genome DNA strand and the second DNA sequence can be located on a second genome DNA strand. In some embodiments, the first DNA sequence can be located on the same genomic DNA strand as the second DNA sequence.
在一些實施例中,表現抑制系統包含三種或更多種表現抑制子且兩種或更多種表現抑制子包含相同的DNA序列。舉例而言,表現抑制系統可包含三種表現抑制子,其中第一與第二表現抑制子均包含第一DNA序列且第三表現抑制子包含不同的第二DNA序列。在另一實例中,表現抑制系統可包含四種表現抑制子,其中第一與第二表現抑制子均結合第一DNA序列且第三與第四表現抑制子均結合第二DNA序列。在另一實例中,表現抑制系統可包含五種表現抑制子,其中第一與第二表現抑制子均結合第一DNA序列,第三與第四表現抑制子均結合第二DNA序列,且第五表現抑制子結合第三DNA序列。如上文所述,不同可意謂:一種表現抑制子所結合之DNA序列與另一種表現抑制子所結合之DNA序列之間存在至少一個不一致的位置,或一種表現抑制子所結合之DNA序列中存在的至少一個位置不存在於另一種表現抑制子所結合的DNA序列中。In some embodiments, the expression suppression system comprises three or more expression suppressors and the two or more expression suppressors comprise the same DNA sequence. For example, the expression suppression system may comprise three expression suppressors, wherein the first and second expression suppressors both comprise a first DNA sequence and the third expression suppressor comprises a second, different DNA sequence. In another example, the expression suppression system can comprise four expression suppressors, wherein both the first and second expression suppressors bind to a first DNA sequence and the third and fourth expression suppressors both bind to a second DNA sequence. In another example, the expression suppression system can comprise five expression suppressors, wherein the first and second expression suppressors both bind to a first DNA sequence, the third and fourth expression suppressors both bind to a second DNA sequence, and the first The five-expression suppressor binds a third DNA sequence. As mentioned above, different can mean: there is at least one position of discord between the DNA sequence bound by one expression inhibitor and the DNA sequence bound by another expression inhibitor, or in the DNA sequence bound by one expression inhibitor. At least one position present is absent in the DNA sequence to which another expression suppressor binds.
在一些實施例中,表現抑制系統包含兩種或更多種(例如兩種)表現抑制子且複數種(例如兩種)表現抑制子包含結合至不同DNA序列的靶向部分。在此類實施例中,第一靶向部分可結合至第一DNA序列且第二DNA靶向部分可結合至第二DNA序列,其中第一與第二DNA序列不同且不重疊。在一些此類實施例中,第一DNA序列與第二DNA序列相隔至少1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、25、30、35、40、45、50、55、60、65、70、75、80、85、90、95或100個鹼基對(且視情況,不超過500、400、300、200、100、95、90、85、80、75、70、65、60、55或50個鹼基對)。在一些此類實施例中,第一DNA序列與第二DNA序列相隔1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、25、30、35、40、45、50、55、60、65、70、75、80、85、90、95或100個鹼基對(且視情況,無鹼基對,例如第一與第二序列彼此直接鄰接)。In some embodiments, the expression suppression system comprises two or more (eg, two) expression inhibitors and the plurality (eg, two) of the expression inhibitors comprises targeting moieties that bind to different DNA sequences. In such embodiments, the first targeting moiety can bind to a first DNA sequence and the second DNA targeting moiety can bind to a second DNA sequence, wherein the first and second DNA sequences are different and non-overlapping. In some such embodiments, the first DNA sequence is separated from the second DNA sequence by at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16 , 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100 base pairs (and, as the case may be, not More than 500, 400, 300, 200, 100, 95, 90, 85, 80, 75, 70, 65, 60, 55, or 50 base pairs). In some such embodiments, the first DNA sequence is separated from the second DNA sequence by 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100 base pairs (and optionally, base-free base pair, e.g. the first and second sequences are directly adjacent to each other).
在一些實施例中,表現抑制系統中的表現抑制子各自包含不同的靶向部分(例如第一、第二、第三或其他表現抑制子各自包含彼此不同的效應部分)。舉例而言,表現抑制系統可包含第一表現抑制子及第二表現抑制子,其中第一表現抑制子包含第一效應部分(例如包含DNA甲基轉移酶或其功能片段),且第二表現抑制子包含與第一效應部分不同的第二效應部分(例如包含轉錄抑制子(例如KRAB)或其功能片段)。在一些實施例中,不同可意謂包含不同類型的效應部分。在其他實施例中,不同可意謂包含相同類型之效應部分的不同變異體,例如第一效應部分包含第一DNA甲基轉移酶(例如具有第一位點特異性或胺基酸序列)且第二效應部分包含第二DNA甲基轉移酶(例如具有第二位點特異性或胺基酸序列)。In some embodiments, the expression inhibitors in the expression inhibition system each comprise a different targeting moiety (eg, the first, second, third or other expression inhibitors each comprise a different effector moiety from the other). For example, an expression repression system can comprise a first expression repressor and a second expression repressor, wherein the first expression repressor comprises a first effector moiety (e.g., comprising a DNA methyltransferase or a functional fragment thereof), and the second expression repressor The repressor comprises a second effector moiety different from the first effector moiety (eg comprising a transcriptional repressor (eg KRAB) or a functional fragment thereof). In some embodiments, different may mean comprising different types of effector moieties. In other embodiments, different may mean different variants comprising the same type of effector moiety, e.g. a first effector moiety comprising a first DNA methyltransferase (e.g. having a first site specificity or amino acid sequence) and The second effector moiety comprises a second DNA methyltransferase (eg, having a second site specificity or amino acid sequence).
在一些實施例中,表現抑制系統包含含有第一效應部分的第一表現抑制子及含有第二效應部分的第二表現抑制子,其中該第一效應部分包含選自以下的蛋白質:MQ1、DNMT1、DNMT3A1、DNMT3A2、DNMT3B1、DNMT3B2、DNMT3B3、DNMT3B4、DNMT3B5、DNMT3B6、DNMT3L、HDAC1、HDAC2、HDAC3、HDAC4、HDAC5、HDAC6、HDAC7、HDAC8、HDAC9、HDAC10、HDAC11、SIRT1、SIRT2、SIRT3、SIRT4、SIRT5、SIRT6、SIRT7、SIRT8、SIRT9、KDM1A (亦即,LSD1)、KDM1B (亦即,LSD2)、KDM2A、KDM2B、KDM5A、KDM5B、KDM5C、KDM5D、KDM4B、NO66、SETDB1、SETDB2、EHMT2 (亦即,G9A)、EHMT1 (亦即,GLP)、SUV39H1、EZH2、EZH1、SUV39H2、SETD8、SUV420H1、SUV420H2、KRAB、MeCP2、HP1、RBBP4、REST、FOG1、SUZ12或其功能變異體或片段;且第二效應部分包含選自以下的不同蛋白質:MQ1、DNMT1、DNMT3A1、DNMT3A2、DNMT3B1、DNMT3B2、DNMT3B3、DNMT3B4、DNMT3B5、DNMT3B6、DNMT3L、HDAC1、HDAC2、HDAC3、HDAC4、HDAC5、HDAC6、HDAC7、HDAC8、HDAC9、HDAC10、HDAC11、SIRT1、SIRT2、SIRT3、SIRT4、SIRT5、SIRT6、SIRT7、SIRT8、SIRT9、KDM1A (亦即,LSD1)、KDM1B (亦即,LSD2)、KDM2A、KDM2B、KDM5A、KDM5B、KDM5C、KDM5D、KDM4B、NO66、SETDB1、SETDB2、EHMT2 (亦即,G9A)、EHMT1 (亦即,GLP)、SUV39H1、EZH2、EZH1、SUV39H2、SETD8、SUV420H1、SUV420H2、KRAB、MeCP2、HP1、RBBP4、REST、FOG1、SUZ12或其功能變異體或片段。In some embodiments, the expression inhibition system comprises a first expression inhibitor comprising a first effector moiety and a second expression inhibitor comprising a second effector moiety, wherein the first effector moiety comprises a protein selected from the group consisting of: MQ1, DNMT1 , DNMT3A1, DNMT3A2, DNMT3B1, DNMT3B2, DNMT3B3, DNMT3B4, DNMT3B5, DNMT3B6, DNMT3L, HDAC1, HDAC2, HDAC3, HDAC4, HDAC5, HDAC6, HDAC7, HDAC8, HDAC9, HDAC10, HDAC11, SIRT1, SIRT2, SIRT3, SIRT4, SIRT5 , SIRT6, SIRT7, SIRT8, SIRT9, KDM1A (ie, LSD1), KDM1B (ie, LSD2), KDM2A, KDM2B, KDM5A, KDM5B, KDM5C, KDM5D, KDM4B, NO66, SETDB1, SETDB2, EHMT2 (ie, G9A), EHMT1 (i.e., GLP), SUV39H1, EZH2, EZH1, SUV39H2, SETD8, SUV420H1, SUV420H2, KRAB, MeCP2, HP1, RBBP4, REST, FOG1, SUZ12, or functional variants or fragments thereof; and the second effector Portion comprising different proteins selected from: MQ1, DNMT1, DNMT3A1, DNMT3A2, DNMT3B1, DNMT3B2, DNMT3B3, DNMT3B4, DNMT3B5, DNMT3B6, DNMT3L, HDAC1, HDAC2, HDAC3, HDAC4, HDAC5, HDAC6, HDAC7, HDAC8, HDAC9, HDAC10 , HDAC11, SIRT1, SIRT2, SIRT3, SIRT4, SIRT5, SIRT6, SIRT7, SIRT8, SIRT9, KDM1A (aka LSD1), KDM1B (aka LSD2), KDM2A, KDM2B, KDM5A, KDM5B, KDM5C, KDM5D, KDM4B , NO66, SETDB1, SETDB2, EHMT2 (ie, G9A), EHMT1 (ie, GLP), SUV39H1, EZH2, EZH1, SUV39H2, SETD8, SUV420H1, SUV420H2, KRAB, MeCP2, HP1, RBBP4, REST, FOG1, SUZ12 or a functional variant or fragment thereof.
在一些實施例中,第一或第二效應部分包含DNA甲基轉移酶活性(例如MQ1、DNMT1、DNMT3A1、DNMT3A2、DNMT3B1、DNMT3B2、DNMT3B3、DNMT3B4、DNMT3B5、DNMT3B6、DNMT3L或其任一者之功能變異體或片段,且另一效應部分包含轉錄抑制活性(例如KRAB、MeCP2、HP1、RBBP4、REST、FOG1、SUZ12或其任一者之功能變異體或片段),且第一或第二效應部分包含組蛋白甲基轉移酶活性,且另一效應部分包含組蛋白去乙醯酶活性(例如HDAC1、HDAC2、HDAC3、HDAC4、HDAC5、HDAC6、HDAC7、HDAC8、HDAC9、HDAC10、HDAC11、SIRT1、SIRT2、SIRT3、SIRT4、SIRT5、SIRT6、SIRT7、SIRT8、SIRT9或其任一者之功能變異體或片段)。在一些實施例中,第一或第二效應部分包含組蛋白甲基轉移酶活性且另一效應部分包含DNA甲基轉移酶活性(例如MQ1、DNMT1、DNMT3A1、DNMT3A2、DNMT3B1、DNMT3B2、DNMT3B3、DNMT3B4、DNMT3B5、DNMT3B6、DNMT3L或其任一者的功能變異體或片段)。在一些實施例中,第一或第二效應部分包含DNA甲基轉移酶活性且另一效應部分包含轉錄抑制活性。在一些實施例中,第一或第二效應部分包含組蛋白甲基轉移酶活性且另一效應部分包含轉錄抑制活性(例如KRAB、MeCP2、HP1、RBBP4、REST、FOG1、SUZ12或其任一者之功能變異體或片段)。在一些實施例中,第一或第二效應部分包含轉錄抑制活性且另一效應部分包含不同的轉錄抑制活性。在一些實施例中,第一或第二效應部分包含DNA甲基轉移酶活性且另一效應部分包含相同的DNA甲基轉移酶活性。在一些實施例中,第一或第二效應部分包含DNA甲基轉移酶活性且另一效應部分包含組蛋白去乙醯酶活性。在一些實施例中,第一或第二效應部分包含組蛋白去甲基酶活性且另一效應部分包含DNA甲基轉移酶活性。在一些實施例中,第一或第二效應部分包含組蛋白甲基轉移酶活性且另一效應部分包含DNA去甲基酶活性。在一些實施例中,第一或第二效應部分包含組蛋白去甲基酶活性且另一效應部分包含轉錄抑制活性。在一些實施例中,第一或第二效應部分包含組蛋白去甲基酶活性且另一效應部分包含不同的組蛋白去甲基酶活性。在一些實施例中,第一或第二效應部分包含組蛋白去甲基酶活性且另一效應部分包含相同的組蛋白去甲基酶活性。在一些實施例中,第一或第二效應部分包含組蛋白去乙醯酶活性且另一效應部分包含DNA甲基轉移酶活性。在一些實施例中,第一或第二效應部分包含組蛋白去乙醯酶活性且另一效應部分包含DNA去甲基酶活性。在一些實施例中,第一或第二效應部分包含組蛋白去乙醯酶活性且另一效應部分包含轉錄抑制活性。在一些實施例中,第一或第二效應部分包含組蛋白去乙醯酶活性且另一效應部分包含不同的組蛋白去乙醯酶活性。在一些實施例中,第一或第二效應部分包含組蛋白去乙醯酶活性且另一效應部分包含相同的組蛋白去乙醯酶活性。在一些實施例中,第一或第二效應部分包含DNA甲基轉移酶活性且另一效應部分包含DNA去甲基酶活性。在一些實施例中,第一或第二效應部分包含DNA去甲基酶活性且另一效應部分包含轉錄抑制活性。在一些實施例中,第一或第二效應部分包含DNA甲基轉移酶活性且另一效應部分包含不同的DNA甲基轉移酶活性。在一些實施例中,第一或第二效應部分包含DNA甲基轉移酶活性且另一效應部分包含相同的DNA甲基轉移酶活性。在一些實施例中,第一或第二效應部分包含DNA去甲基酶活性且另一效應部分包含轉錄抑制活性。在一些實施例中,第一或第二效應部分包含DNA去甲基酶活性且另一效應部分包含不同的DNA去甲基酶活性。在一些實施例中,第一或第二效應部分包含DNA去甲基酶活性且另一效應部分包含相同的DNA去甲基酶活性。在一些實施例中,第一或第二效應部分包含轉錄抑制活性且另一效應部分包含不同的轉錄抑制活性。在一些實施例中,第一或第二效應部分包含轉錄抑制活性且另一效應部分包含相同的轉錄抑制活性。In some embodiments, the first or second effector moiety comprises a DNA methyltransferase activity (e.g., the function of MQ1, DNMT1, DNMT3A1, DNMT3A2, DNMT3B1, DNMT3B2, DNMT3B3, DNMT3B4, DNMT3B5, DNMT3B6, DNMT3L, or any of them variant or fragment, and the other effector portion comprises transcriptional repression activity (e.g. KRAB, MeCP2, HP1, RBBP4, REST, FOG1, SUZ12 or a functional variant or fragment of any of them), and the first or second effector portion comprising histone methyltransferase activity and another effector moiety comprising histone deacetylase activity (e.g. HDAC1, HDAC2, HDAC3, HDAC4, HDAC5, HDAC6, HDAC7, HDAC8, HDAC9, HDAC10, HDAC11, SIRT1, SIRT2, SIRT3, SIRT4, SIRT5, SIRT6, SIRT7, SIRT8, SIRT9, or a functional variant or fragment thereof). In some embodiments, the first or second effector moiety comprises histone methyltransferase activity and the other The effector moiety comprises DNA methyltransferase activity (e.g., MQ1, DNMT1, DNMT3A1, DNMT3A2, DNMT3B1, DNMT3B2, DNMT3B3, DNMT3B4, DNMT3B5, DNMT3B6, DNMT3L, or a functional variant or fragment of any of them). In some embodiments , the first or second effector moiety comprises DNA methyltransferase activity and the other effector moiety comprises transcriptional repression activity. In some embodiments, the first or second effector moiety comprises histone methyltransferase activity and the other effector moiety comprises The moiety comprises transcriptional repressive activity (e.g., KRAB, MeCP2, HP1, RBBP4, REST, FOG1, SUZ12, or a functional variant or fragment of any of them). In some embodiments, the first or second effector moiety comprises transcriptional repressive activity And another effector moiety comprises different transcriptional repression activities. In some embodiments, the first or second effector moiety comprises DNA methyltransferase activity and another effector moiety comprises the same DNA methyltransferase activity. In some implementations In some embodiments, the first or second effector moiety comprises DNA methyltransferase activity and the other effector moiety comprises histone deacetylase activity. In some embodiments, the first or second effector moiety comprises histone demethylation enzymatic activity and the other effector moiety comprises DNA methyltransferase activity. In some embodiments, the first or second effector moiety comprises histone methyltransferase activity and the other effector moiety comprises DNA demethylase activity. In In some embodiments, the first or second effector moiety comprises histone demethylase activity and the other effector moiety comprises transcriptional repression activity. In some embodiments, the first or second effector moiety comprises histone demethylase activity and another effector moiety comprising different histone demethylase activities. In some embodiments, the first or second effector moiety comprises histone demethylase activity and the other effector moiety comprises the same histone demethylase activity. In some embodiments, the first or second effector moiety comprises histone deacetylase activity and the other effector moiety comprises DNA methyltransferase activity. In some embodiments, the first or second effector moiety comprises histone deacetylase activity and the other effector moiety comprises DNA demethylase activity. In some embodiments, the first or second effector moiety comprises histone deacetylase activity and the other effector moiety comprises transcriptional repression activity. In some embodiments, the first or second effector moiety comprises a histone deacetylase activity and the other effector moiety comprises a different histone deacetylase activity. In some embodiments, the first or second effector moiety comprises histone deacetylase activity and the other effector moiety comprises the same histone deacetylase activity. In some embodiments, the first or second effector moiety comprises DNA methyltransferase activity and the other effector moiety comprises DNA demethylase activity. In some embodiments, the first or second effector moiety comprises DNA demethylase activity and the other effector moiety comprises transcriptional repression activity. In some embodiments, the first or second effector moiety comprises a DNA methyltransferase activity and the other effector moiety comprises a different DNA methyltransferase activity. In some embodiments, the first or second effector moiety comprises DNA methyltransferase activity and the other effector moiety comprises the same DNA methyltransferase activity. In some embodiments, the first or second effector moiety comprises DNA demethylase activity and the other effector moiety comprises transcriptional repression activity. In some embodiments, the first or second effector moiety comprises DNA demethylase activity and the other effector moiety comprises a different DNA demethylase activity. In some embodiments, the first or second effector moiety comprises DNA demethylase activity and the other effector moiety comprises the same DNA demethylase activity. In some embodiments, the first or second effector moiety comprises transcriptional repression activity and the other effector moiety comprises a different transcriptional repression activity. In some embodiments, the first or second effector moiety comprises transcriptional repression activity and the other effector moiety comprises the same transcriptional repression activity.
在一些實施例中,表現抑制系統包含三種或更多種表現抑制子且兩種或更多種表現抑制子包含相同的DNA靶向部分。舉例而言,表現抑制系統可包含三種表現抑制子,其中第一與第二表現抑制子均包含第一效應部分且第三表現抑制子包含不同的第二效應部分。在另一實例中,表現抑制系統可包含四種表現抑制子,其中第一與第二表現抑制子均包含第一效應部分且第三及第四表現抑制子包含不同的第二效應部分。在另一實例中,表現抑制系統可包含五種表現抑制子,其中第一與第二表現抑制子均包含第一效應部分,第三與第四表現抑制子均包含不同的第二效應部分,且第五表現抑制子包含不同的第三效應部分。如上文所述,不同可意謂包含不同類型的效應部分或包含相同類型之效應部分的不同變異體。In some embodiments, the expression suppression system comprises three or more expression suppressors and the two or more expression suppressors comprise the same DNA targeting moiety. For example, an expression inhibitory system may comprise three expression inhibitors, wherein a first and a second expression inhibitor both comprise a first effector moiety and a third expression inhibitor comprises a different second effector moiety. In another example, the expression inhibitory system may comprise four expression inhibitors, wherein the first and second expression inhibitors each comprise a first effector moiety and the third and fourth expression inhibitors comprise a different second effector moiety. In another example, the expression inhibitory system may comprise five expression inhibitors, wherein the first and second expression inhibitors each comprise a first effector moiety, the third and fourth expression inhibitors each comprise a different second effector moiety, And the fifth expression suppressor comprises a different third effector moiety. As mentioned above, different can mean comprising different types of effector moieties or comprising different variants of the same type of effector moieties.
在一些實施例中,表現抑制系統中的兩種或更多種(例如所有)表現抑制子彼此間未共價結合,例如各種表現抑制子與任何其他表現抑制子未共價結合。在另一個實施例中,表現抑制系統中的兩種或更多種表現抑制子彼此共價結合。在一個實施例中,表現抑制系統包含安置於同一多肽上的第一表現抑制子及第二表現抑制子,例如呈融合分子形式,例如藉由肽鍵且視情況藉由連接子連接。在一些實施例中,肽為自裂解肽,例如T2A自裂解肽。在一個實施例中,表現抑制系統包含藉由非肽鍵連接(例如彼此偶聯)的第一表現抑制子及第二表現抑制子。In some embodiments, two or more (eg, all) of the expression inhibitors in the expression inhibition system are not covalently bound to each other, eg, each expression inhibitor is not covalently bound to any other expression inhibitor. In another embodiment, two or more expression inhibitors in the expression inhibition system are covalently bound to each other. In one embodiment, the expression inhibition system comprises a first expression inhibitor and a second expression inhibitor disposed on the same polypeptide, for example in the form of a fusion molecule, linked for example by a peptide bond and optionally by a linker. In some embodiments, the peptide is a self-cleaving peptide, such as a T2A self-cleaving peptide. In one embodiment, the expression inhibitory system comprises a first expression inhibitor and a second expression inhibitor linked (eg, coupled to each other) by a non-peptide bond.
連接子 如本文所揭示之表現抑制子或表現抑制系統可包含一或多個連接子。連接子可使靶向部分與效應部分連接、使效應部分與另一效應部分連接,或使靶向部分與另一靶向部分連接。連接子可為化學鍵,例如一或多個共價鍵或非共價鍵。在一些實施例中,連接子為共價的。在一些實施例中,連接子為非共價的。在一些實施例中,連接子為肽連接子。此類連接子的長度可為2-30、5-30、10-30、15-30、20-30、25-30、2-25、5-25、10-25、15-25、20-25、2-20、5-20、10-20、15-20、2-15、5-15、10-15、2-10、5-10或2-5個胺基酸,或長度大於或等於2、5、10、15、20、25或30個胺基酸(且視情況,長度至多50、40、30、25、20、15、10或5個胺基酸)。在一些實施例中,連接子可用於將第一域或部分與第二域或部分隔開,例如將DNA靶向部分與效應部分隔開。在一些實施例中,舉例而言,連接子可位於DNA靶向部分與效應部分之間,例如以提供二級及三級結構的分子柔順性。連接子可包含本文所述之柔順性、剛性及/或可裂解連接子。在一些實施例中,連接子包括至少一個甘胺酸、丙胺酸及絲胺酸胺基酸以提供柔順性。在一些實施例中,連接子為疏水性連接子,諸如包括帶負電的磺酸酯基團、聚乙二醇(PEG)基團或焦磷酸二酯基團。在一些實施例中,連接子可裂解以選擇性地自調節劑釋放部分(例如多肽),但充分穩定以防止過早裂解。 Linker An expression suppressor or expression suppressor system as disclosed herein may comprise one or more linkers. A linker can link a targeting moiety to an effector moiety, an effector moiety to another effector moiety, or a targeting moiety to another targeting moiety. A linker can be a chemical bond, such as one or more covalent or non-covalent bonds. In some embodiments, linkers are covalent. In some embodiments, linkers are non-covalent. In some embodiments, the linker is a peptide linker. Such linkers can be 2-30, 5-30, 10-30, 15-30, 20-30, 25-30, 2-25, 5-25, 10-25, 15-25, 20- 25, 2-20, 5-20, 10-20, 15-20, 2-15, 5-15, 10-15, 2-10, 5-10, or 2-5 amino acids, or a length greater than or Equal to 2, 5, 10, 15, 20, 25 or 30 amino acids (and optionally up to 50, 40, 30, 25, 20, 15, 10 or 5 amino acids in length). In some embodiments, a linker can be used to separate a first domain or moiety from a second domain or moiety, eg, a DNA targeting moiety from an effector moiety. In some embodiments, for example, a linker can be positioned between the DNA targeting moiety and the effector moiety, eg, to provide molecular flexibility for secondary and tertiary structure. Linkers can comprise flexible, rigid and/or cleavable linkers as described herein. In some embodiments, the linker includes at least one glycine, alanine, and serine amino acids to provide flexibility. In some embodiments, the linker is a hydrophobic linker, such as comprising a negatively charged sulfonate group, a polyethylene glycol (PEG) group, or a pyrophosphodiester group. In some embodiments, the linker is cleavable to selectively release moieties (eg, polypeptides) from the modulator, but is sufficiently stable to prevent premature cleavage.
在一些實施例中,本文所述之表現抑制子中的一或多個部分及/或域經由一或多個連接子連接。在一些實施例中,表現抑制可包含位於靶向部分與效應部分之間的連接子。在一些實施例中,連接子可具有ASGSGGGSGGARD (SEQ ID NO: 137)或ASGSGGGSGG (SEQ ID NO: 138)之序列。在一些實施例中,包含第一及第二抑制子的系統可包含位於第一靶向部分與第一效應部分之間的第一連接子及位於第二靶向部分與第二效應部分之間的第二連接子。在一些實施例中,第一與第二連接子可相同。在一些實施例中,第一與第二連接子可不同。在一些實施例中,第一連接子可包含根據SEQ ID NO: 137的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,且第二連接子可包含根據SEQ ID NO: 138的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列。In some embodiments, one or more moieties and/or domains in the expression inhibitors described herein are linked via one or more linkers. In some embodiments, expression inhibition may comprise a linker positioned between the targeting moiety and the effector moiety. In some embodiments, the linker may have the sequence of ASGSGGGSGGARD (SEQ ID NO: 137) or ASGSGGGSGG (SEQ ID NO: 138). In some embodiments, a system comprising a first and a second inhibitor can comprise a first linker between the first targeting moiety and the first effector moiety and a second targeting moiety and the second effector moiety. the second linker. In some embodiments, the first and second linkers can be the same. In some embodiments, the first and second linkers can be different. In some embodiments, the first linker can comprise an amino acid sequence according to SEQ ID NO: 137 or a sequence at least 80, 85, 90, 95, 99 or 100% identical thereto, and the second linker can comprise an amino acid sequence according to The amino acid sequence of SEQ ID NO: 138 or a sequence at least 80, 85, 90, 95, 99 or 100% identical thereto.
如熟習此項技術者已知,常用的柔性連接子具有主要由Gly及Ser殘基之鏈段組成的序列(「GS」連接子)。柔性連接子可適用於將需要某種程度之移動或相互作用的域/部分連接且可包括小的、非極性(例如Gly)或極性(例如Ser或Thr)胺基酸。併入Ser或Thr亦可藉由與水分子形成氫鍵來維持連接子在水溶液中之穩定性,且因此減少連接子與部分/域之間的不利相互作用。在一些實施例中,連接子為GS連接子或其變異體,例如G4S (SEQ ID NO: 207)。As known to those skilled in the art, commonly used flexible linkers have a sequence consisting essentially of stretches of Gly and Ser residues ("GS" linkers). Flexible linkers may be suitable for linking domains/moieties that require some degree of movement or interaction and may include small, non-polar (eg Gly) or polar (eg Ser or Thr) amino acids. Incorporation of Ser or Thr may also maintain the stability of the linker in aqueous solution by forming hydrogen bonds with water molecules, and thus reduce adverse interactions between the linker and the moiety/domain. In some embodiments, the linker is a GS linker or a variant thereof, such as G4S (SEQ ID NO: 207).
剛性連接子適用於保持域/部分之間的固定距離且維持其獨立功能。當各域的空間間隔對於保持融合物中之一或多種組分的穩定性或生物活性至關重要時,剛性連接子亦可為適用的。剛性連接子可具有α螺旋結構或富Pro序列(XP) n,其中X表示任何胺基酸,較佳為Ala、Lys或Glu。 Rigid linkers are suitable for maintaining a fixed distance between domains/parts and maintaining their independent functions. Rigid linkers may also be suitable when the spatial separation of the domains is critical to maintaining the stability or biological activity of one or more components in the fusion. The rigid linker can have an α-helical structure or a Pro-rich sequence (XP) n , where X represents any amino acid, preferably Ala, Lys or Glu.
可裂解連接子可在活體內釋放自由功能域。在一些實施例中,連接子可在特定條件下裂解,諸如在還原試劑或蛋白酶存在下裂解。活體內可裂解連接子可利用二硫鍵之可逆性質。一個實例包括兩個Cys殘基之間的凝血酶敏感性序列(例如PRS)。CPRSC之活體外凝血酶處理引起凝血酶敏感性序列裂解,而可逆的二硫鍵保持完整。此類連接子已知且描述於例如Chen等人, 2013. Fusion Protein Linkers: Property, Design and Functionality. Adv Drug Deliv Rev. 65(10): 1357-1369。融合物中連接子之活體內裂解亦可藉由蛋白酶進行,該等蛋白酶在活體內在某些條件下、在特定細胞或組織中表現,或限定於某些細胞隔室內。多種蛋白酶之特異性使得連接子在限定的隔室中較慢地裂解。在一些實施例中,可裂解連接子可為自裂解連接子,例如T2A肽連接子。在一些實施例中,連接子可包含「核糖體跳讀」序列,例如tPT2A連接子。Cleavable linkers release free functional domains in vivo. In some embodiments, linkers are cleavable under specific conditions, such as in the presence of reducing reagents or proteases. In vivo cleavable linkers can take advantage of the reversible nature of disulfide bonds. An example includes a thrombin sensitive sequence (eg PRS) between two Cys residues. In vitro thrombin treatment of CPRSCs causes cleavage of the thrombin-sensitive sequence, while the reversible disulfide bonds remain intact. Such linkers are known and described, for example, in Chen et al., 2013. Fusion Protein Linkers: Property, Design and Functionality. Adv Drug Deliv Rev. 65(10): 1357-1369. In vivo cleavage of linkers in fusions can also be performed by proteases that are expressed in vivo under certain conditions, in specific cells or tissues, or restricted to certain cellular compartments. The specificity of various proteases results in slower cleavage of linkers in defined compartments. In some embodiments, the cleavable linker may be a self-cleavable linker, such as a T2A peptide linker. In some embodiments, a linker may comprise a "ribosomal skipping" sequence, such as a tPT2A linker.
適用於本文所述之連接子之分子實例包括帶負電的磺酸酯基團;脂質,諸如聚(--CH 2--)烴鏈,諸如聚乙二醇(PEG)基團、其不飽和變化形式、其羥基化變化形式、其醯胺化變化形式或另外含N的變化形式;非碳連接子;碳水化合物連接子;磷酸二酯連接子,或能夠使表現抑制子之兩種或更多種組分共價連接的其他分子。亦包括非共價連接子,諸如多肽連接(例如經由多肽之疏水性區域或多肽之疏水性延伸部分連接)之疏水性脂質球,諸如富含白胺酸、異白胺酸、纈胺酸或可能亦富含丙胺酸、苯丙胺酸或甚至酪胺酸、甲硫胺酸、甘胺酸或其他疏水性殘基的一系列殘基。表現抑制子中的組分可利用基於電荷的化學試劑連接,使得表現抑制子中的帶正電組分連接至帶負電荷的另一組分。 Examples of molecules suitable for use in the linkers described herein include negatively charged sulfonate groups; lipids, such as poly(-- CH2-- ) hydrocarbon chains, such as polyethylene glycol (PEG) groups, unsaturated A variant, a hydroxylated variant thereof, an amidated variant thereof, or another N-containing variant; a non-carbon linker; a carbohydrate linker; a phosphodiester linker, or two or more capable of expressing an inhibitor Other molecules to which various components are covalently linked. Also included are non-covalent linkers, such as hydrophobic lipid globules, such as leucine, isoleucine, valine or A series of residues that may also be rich in alanine, phenylalanine or even tyrosine, methionine, glycine or other hydrophobic residues. Components in the expression suppressor can be linked using charge-based chemistry such that a positively charged component in the expression suppressor is linked to another negatively charged component.
靶向部分 本發明提供例如包含靶向部分的表現抑制子,其特異性靶向(例如結合)近接於及/或可操作地連接至靶基因的基因體序列元件(例如啟動子、TSS或錨定序列)。靶向部分可特異性結合DNA序列,例如與靶基因(例如MYC)相關的DNA序列。特異性結合DNA序列的任何分子或化合物可用作靶向部分。Targeting Moieties The invention provides, for example, expression suppressors comprising a targeting moiety that specifically targets (e.g., binds) a gene body sequence element (e.g., a promoter, TSS or anchor) in close proximity to and/or operably linked to a target gene. sequence). A targeting moiety can specifically bind a DNA sequence, such as a DNA sequence associated with a target gene (eg, MYC). Any molecule or compound that specifically binds a DNA sequence can be used as a targeting moiety.
在一些實施例中,靶向部分靶向(例如結合)基因體複合物(例如ASMC)中的組分。在一些實施例中,靶向部分靶向(例如結合)可操作地連接至靶基因的表現控制序列(例如啟動子或增強子)。在一些實施例中,靶向部分靶向(例如結合)靶基因或靶基因的一部分。靶向部分的標靶可稱為其標靶組分。標靶組分可為可操作地連接至靶基因的任何基因體序列元件或靶基因本身,包括(但不限於)啟動子、增強子、錨定序列、外顯子、內含子、UTR編碼序列、剪接位點或轉錄起點。在一些實施例中,靶向部分特異性結合至一或多種標靶錨定序列(例如細胞內)而不結合至非靶向錨定序列(例如同一細胞內)。In some embodiments, a targeting moiety targets (eg, binds to) a component in a gene body complex (eg, ASMC). In some embodiments, a targeting moiety targets (eg, binds to) an expression control sequence (eg, a promoter or enhancer) operably linked to a target gene. In some embodiments, a targeting moiety targets (eg, binds to) a target gene or a portion of a target gene. The target of a targeting moiety can be referred to as its targeting component. A targeting component can be any gene body sequence element operably linked to a target gene or the target gene itself, including (but not limited to) promoters, enhancers, anchor sequences, exons, introns, UTR coding sequence, splice site or start of transcription. In some embodiments, a targeting moiety specifically binds to one or more target anchor sequences (eg, within a cell) and does not bind to non-target anchor sequences (eg, within the same cell).
在一些實施例中,靶向部分可為或包含CRISPR/Cas域、TAL效應域、Zn指域、肽核酸(PNA)或核酸分子。在一些實施例中,表現抑制子包含一個效應部分。在一些實施例中,表現抑制子包含複數個靶向部分,其中各靶向部分對另一靶向部分的結合偵測不到,例如不結合。在一些實施例中,表現抑制系統包含複數個表現抑制子,其中複數個表現抑制子中的各成員包含靶向部分,其中各靶向部分對另一靶向部分的結合偵測不到,例如不結合。在一些實施例中,表現抑制系統包含含有第一靶向部分的第一表現抑制子及含有第二靶向部分的第二表現抑制子,其中第一靶向部分對第二靶向部分的結合偵測不到,例如不結合。在一些實施例中,表現抑制系統包含含有第一靶向部分的第一表現抑制子及含有第二靶向部分的第二表現抑制子,其中第一靶向部分對另一個第一靶向部分的結合偵測不到,例如不結合,且第二靶向部分對另一個第二靶向部分的結合偵測不到,例如不結合。在一些實施例中,本文所述之組合物及方法中使用的靶向部分在單體(例如非二聚)狀態下發揮功能(例如結合至DNA序列)。In some embodiments, a targeting moiety can be or comprise a CRISPR/Cas domain, a TAL effector domain, a Zn finger domain, a peptide nucleic acid (PNA), or a nucleic acid molecule. In some embodiments, the expression suppressor comprises an effector moiety. In some embodiments, the expression inhibitor comprises a plurality of targeting moieties, wherein binding of each targeting moiety to another targeting moiety is undetectable, eg, does not bind. In some embodiments, the expression inhibition system comprises a plurality of expression inhibitors, wherein each member of the plurality of expression inhibitors comprises a targeting moiety, wherein binding of each targeting moiety to another targeting moiety is undetectable, e.g. Not combined. In some embodiments, the expression inhibition system comprises a first expression inhibitor comprising a first targeting moiety and a second expression inhibitor comprising a second targeting moiety, wherein binding of the first targeting moiety to the second targeting moiety Not detectable, eg, not bound. In some embodiments, the expression inhibition system comprises a first expression inhibitor comprising a first targeting moiety and a second expression inhibitor comprising a second targeting moiety, wherein the first targeting moiety is opposite to the other first targeting moiety. The binding of the second targeting moiety is undetectable, eg, does not bind, and the binding of the second targeting moiety to another second targeting moiety is undetectable, eg, does not bind. In some embodiments, targeting moieties used in the compositions and methods described herein function (eg, bind to DNA sequences) in the monomeric (eg, non-dimeric) state.
在一些實施例中,靶向部分對標靶組分的結合使標靶組分對另一種轉錄因子、基因體複合物組分或基因體序列元件的結合親和力減小。在一些實施例中,靶向部分結合至其靶序列的K D小於或等於500、450、400、350、300、250、200、150、100、50、40、30、20、10、9、8、7、6、5、4、3、2、1、0.9、0.8、0.7、0.6、0.5、0.4、0.3、0.2、0.1、0.09、0.08、0.07、0.06、0.05、0.04、0.03、0.02、0.01、0.005、0.002或0.001 nM (且視情況,K D為至少50、40、30、20、10、9、8、7、6、5、4、3、2、1、0.9、0.8、0.7、0.6、0.5、0.4、0.3、0.2、0.1、0.09、0.08、0.07、0.06、0.05、0.04、0.03、0.02、0.01、0.005、0.002或0.001 nM)。在一些實施例中,靶向部分結合至其靶序列的K D為0.001 nM至500 nM,例如0.1 nM至5 nM,例如約0.5 nM。在一些實施例中,靶向部分結合至非標靶序列的K D為至少500、600、700、800、900、1000、2000、5000、10,000或100,000 nM (且視情況,對非標靶序列的結合不明顯)。在一些實施例中,靶向部分不結合至非標靶序列。 In some embodiments, binding of a targeting moiety to a target component reduces the binding affinity of the target component for another transcription factor, component of a gene body complex, or sequence element of a gene body. In some embodiments, the targeting moiety binds to its target sequence with a KD less than or equal to 500, 450, 400, 350, 300, 250, 200, 150, 100, 50, 40, 30, 20, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1, 0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.09, 0.08, 0.07, 0.06, 0.05, 0.04, 0.03, 0.02, 0.01, 0.005, 0.002, or 0.001 nM (and, optionally, a K of at least 50, 40, 30, 20, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1, 0.9, 0.8, 0.7 , 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.09, 0.08, 0.07, 0.06, 0.05, 0.04, 0.03, 0.02, 0.01, 0.005, 0.002 or 0.001 nM). In some embodiments, the targeting moiety binds to its target sequence with a KD of 0.001 nM to 500 nM, such as 0.1 nM to 5 nM, such as about 0.5 nM. In some embodiments, the targeting moiety binds to a non-target sequence with a KD of at least 500, 600, 700, 800, 900, 1000, 2000, 5000, 10,000, or 100,000 nM (and optionally, for the non-target sequence combination is not obvious). In some embodiments, targeting moieties do not bind to non-target sequences.
在一些實施例中,靶向部分包含與標靶組分(例如靶基因(例如MYC)的調控元件(例如啟動子或增強子))互補的核酸序列。在一些實施例中,靶向部分包含與標靶組分至少10%、15%、20%、25%、30%、35%、40%、45%、50%、55%、60%、65%、70%、75%、80%、85%、90%、95%、99%或100%互補的核酸序列。In some embodiments, the targeting moiety comprises a nucleic acid sequence complementary to a target component, eg, a regulatory element (eg, a promoter or enhancer) of a target gene (eg, MYC). In some embodiments, the targeting moiety comprises at least 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65% %, 70%, 75%, 80%, 85%, 90%, 95%, 99%, or 100% complementary nucleic acid sequences.
在一些實施例中,靶向部分可為或包含CRISPR/Cas域、TAL效應域、Zn指域或核酸分子。In some embodiments, the targeting moiety can be or comprise a CRISPR/Cas domain, a TAL effector domain, a Zn finger domain, or a nucleic acid molecule.
在一些實施例中,表現抑制子中的靶向部分包含不超過100、90、80、70、60、50、40、30或20個核苷酸(且視情況,至少10、20、30、40、50、60、70、80或90個核苷酸)。在一些實施例中,融合分子中的表現抑制子或效應部分包含不超過2000、1900、1800、1700、1600、1500、1400、1300、1200、1100、1000、900、800、700、600、500、400、300、200或100個胺基酸(且視情況,至少50、100、200、300、400、500、600、700、800、900、1000、1100、1200、1300、1400、1500、1600、1700、1800或1900個胺基酸)。在一些實施例中,融合分子中的表現抑制子或效應部分包含100-2000、100-1900、100-1800、100-1700、100-1600、100-1500、100-1400、100-1300、100-1200、100-1100、100-1000、100-900、100-800、100-700、100-600、100-500、100-400、100-300、100-200、200-2000、200-1900、200-1800、200-1700、200-1600、200-1500、200-1400、200-1300、200-1200、200-1100、200-1000、200-900、200-800、200-700、200-600、200-500、200-400、200-300、300-2000、300-1900、300-1800、300-1700、300-1600、300-1500、300-1400、300-1300、300-1200、300-1100、300-1000、300-900、300-800、300-700、300-600、300-500、300-400、400-2000、400-1900、400-1800、400-1700、400-1600、400-1500、400-1400、400-1300、400-1200、400-1100、400-1000、400-900、400-800、400-700、400-600、400-500、500-2000、500-1900、500-1800、500-1700、500-1600、500-1500、500-1400、500-1300、500-1200、500-1100、500-1000、500-900、500-800、500-700、500-600、600-2000、600-1900、600-1800、600-1700、600-1600、600-1500、600-1400、600-1300、600-1200、600-1100、600-1000、600-900、600-800、600-700、700-2000、700-1900、700-1800、700-1700、700-1600、700-1500、700-1400、700-1300、700-1200、700-1100、700-1000、700-900、700-800、800-2000、800-1900、800-1800、800-1700、800-1600、800-1500、800-1400、800-1300、800-1200、800-1100、800-1000、800-900、900-2000、900-1900、900-1800、900-1700、900-1600、900-1500、900-1400、900-1300、900-1200、900-1100、900-1000、1000-2000、1000-1900、1000-1800、1000-1700、1000-1600、1000-1500、1000-1400、1000-1300、1000-1200、1000-1100、1100-2000、1100-1900、1100-1800、1100-1700、1100-1600、1100-1500、1100-1400、1100-1300、1100-1200、1200-2000、1200-1900、1200-1800、1200-1700、1200-1600、1200-1500、1200-1400、1200-1300、1300-2000、1300-1900、1300-1800、1300-1700、1300-1600、1300-1500、1300-1400、1400-2000、1400-1900、1400-1800、1400-1700、1400-1600、1400-1500、1500-2000、1500-1900、1500-1800、1500-1700、1500-1600、1600-2000、1600-1900、1600-1800、1600-1700、1700-2000、1700-1900、1700-1800、1800-2000、1800-1900或1900-2000個胺基酸。In some embodiments, the targeting moiety in the expression suppressor comprises no more than 100, 90, 80, 70, 60, 50, 40, 30 or 20 nucleotides (and optionally at least 10, 20, 30, 40, 50, 60, 70, 80 or 90 nucleotides). In some embodiments, the expression inhibitor or effector moiety in the fusion molecule comprises no more than 2000, 1900, 1800, 1700, 1600, 1500, 1400, 1300, 1200, 1100, 1000, 900, 800, 700, 600, 500 , 400, 300, 200 or 100 amino acids (and optionally at least 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800 or 1900 amino acids). In some embodiments, the expression inhibitor or effector moiety in the fusion molecule comprises 100-2000, 100-1900, 100-1800, 100-1700, 100-1600, 100-1500, 100-1400, 100-1300, 100 -1200, 100-1100, 100-1000, 100-900, 100-800, 100-700, 100-600, 100-500, 100-400, 100-300, 100-200, 200-2000, 200-1900 , 200-1800, 200-1700, 200-1600, 200-1500, 200-1400, 200-1300, 200-1200, 200-1100, 200-1000, 200-900, 200-800, 200-700, 200 -600, 200-500, 200-400, 200-300, 300-2000, 300-1900, 300-1800, 300-1700, 300-1600, 300-1500, 300-1400, 300-1300, 300-1200 , 300-1100, 300-1000, 300-900, 300-800, 300-700, 300-600, 300-500, 300-400, 400-2000, 400-1900, 400-1800, 400-1700, 400 -1600, 400-1500, 400-1400, 400-1300, 400-1200, 400-1100, 400-1000, 400-900, 400-800, 400-700, 400-600, 400-500, 500-2000 , 500-1900, 500-1800, 500-1700, 500-1600, 500-1500, 500-1400, 500-1300, 500-1200, 500-1100, 500-1000, 500-900, 500-800, 500 -700, 500-600, 600-2000, 600-1900, 600-1800, 600-1700, 600-1600, 600-1500, 600-1400, 600-1300, 600-1200, 600-1100, 600-1000 , 600-900, 600-800, 600-700, 700-2000, 700-1900, 700-1800, 700-1700, 700-1600, 700-1500, 700-1400, 700-1300, 700-1200, 700 -1100, 700-1000, 700-900, 700-800, 800-2000, 800-1900, 800-1800, 800-1700, 800-1600, 800-1500, 800-1400, 800-1300, 800-1200 , 800-1100, 800-1000, 800-900, 900-2000, 900-1900, 900-1800, 900-1700, 900-1600, 900-1500, 900-1400, 900-1300, 900-1200, 900 -1100, 900-1000, 1000-2000, 1000-1900, 1000-1800, 1000-1700, 1000-1600, 1000-1500, 1000-1400, 1000-1300, 1000-1200, 1000-1100, 1100-20 00 ,1100-1900,1100-1800,1100-1700,1100-1600,1100-1500,1100-1400,1100-1300,1100-1200,1200-2000,1200-1900,1200-1800,1200-1700, 1200 -1600, 1200-1500, 1200-1400, 1200-1300, 1300-2000, 1300-1900, 1300-1800, 1300-1700, 1300-1600, 1300-1500, 1300-1400, 1400-2000, 1400-1 900 ,1400-1800,1400-1700,1400-1600,1400-1500,1500-2000,1500-1900,1500-1800,1500-1700,1500-1600,1600-2000,1600-1900,1600-1800, 1600 - 1700, 1700-2000, 1700-1900, 1700-1800, 1800-2000, 1800-1900 or 1900-2000 amino acids.
表現抑制子或包含如本文所揭示之抑制子的系統可包含核酸,例如一或多個核酸。術語「核酸」係指併入或可併入寡核苷酸鏈中的任何化合物。在一些實施例中,核酸為經由磷酸二酯鍵聯併入或可併入寡核苷酸鏈中之化合物及/或物質。如自上下文將瞭解,在一些實施例中,「核酸」係指個別核酸殘基(例如核苷酸及/或核苷);在一些實施例中,「核酸」係指包含個別核酸殘基的寡核苷酸鏈。在一些實施例中,「核酸」為或包含RNA;在一些實施例中,「核酸」為或包含DNA。在一些實施例中,核酸為或包含超過50%的核糖核苷酸且在本文中稱為核糖核酸(RNA)。在一些實施例中,核酸為、包含一或多個天然核酸殘基或由一或多個天然核酸殘基組成。在一些實施例中,核酸為、包含一或多種核酸類似物或由一或多種核酸類似物組成。在一些實施例中,核酸類似物與核酸的不同之處在於其不使用磷酸二酯主鏈。舉例而言,在一些實施例中,核酸為、包含一或多種「肽核酸」或由一或多種「肽核酸」組成,該等肽核酸在此項技術中已知且在主鏈中具有肽鍵而非磷酸二酯鍵且被認為在本發明之範疇內。或者或另外,在一些實施例中,核酸具有一或多個硫代磷酸酯鍵聯及/或5'-N-胺基亞磷酸酯鍵聯而非磷酸二酯鍵。在一些實施例中,核酸為、包含或由一或多種天然核苷(例如腺苷、胸苷、鳥苷、胞苷、尿苷、去氧腺苷、去氧胸苷、去氧鳥苷及去氧胞苷)組成。在一些實施例中,核酸為、包含或由一或多種核苷類似物(例如2-胺基腺苷、2-硫胸苷、肌苷、吡咯并嘧啶、3-甲基腺苷、5-甲基胞苷、C-5丙炔基-胞苷、C-5丙炔基-尿苷、2-胺基腺苷、C5-溴尿苷、C5-氟尿苷、C5-碘尿苷、C5-丙炔基-尿苷、C5-丙炔基-胞苷、C5-甲基胞苷、2-胺基腺苷、7-去氮腺苷、7-去氮鳥苷、8-側氧基腺苷、8-側氧基鳥苷、0(6)-甲基鳥嘌呤、2-硫胞苷、甲基化鹼基、插入鹼基及其組合)組成。在一些實施例中,相較於天然核酸中的糖,核酸包含一或多種經修飾之糖(例如2'-氟核糖、核糖、2'-去氧核糖、阿拉伯糖及己醣)。在一些實施例中,核酸具有編碼功能性基因產物(諸如RNA或蛋白質)的核苷酸序列。在一些實施例中,核酸包括一或多個內含子。在一些實施例中,核酸藉由以下中之一或多者製備:自天然來源分離、基於互補模板藉由聚合進行酶合成(活體內或活體外)、在重組細胞或系統中複製,及化學合成。如本文所用,「重組」當用於描述核酸時,係指非天然存在的任何核酸。在一些實施例中,核酸具有至少3、4、5、6、7、8、9、10、15、20、25、30、35、40、45、50、55、60、65、70、75、80、85、90、95、100、110、120、130、140、150、160、170、180、190、20、225、250、275、300、325、350、375、400、425、450、475、500、600、700、800、900、1000、1500、2000、2500、3000、3500、4000、4500、5000個或更多個殘基的長度。在一些實施例中,核酸可具有約2至約5000 nts、約10至約100 nts、約50至約150 nts、約100至約200 nts、約150至約250 nts、約200至約300 nts、約250至約350 nts、約300至約500 nts、約10至約1000 nts、約50至約1000 nts、約100至約1000 nts、約1000至約2000 nts、約2000至約3000 nts、約3000至約4000 nts、約4000至約5000 nts或其間任何範圍內的長度。在一些實施例中,核酸部分地或全部地為單股;在一些實施例中,核酸部分地或全部地為雙股。在一些實施例中,核酸具有包含至少一個元件的核苷酸序列,該至少一個元件編碼多肽或為編碼多肽之序列的補體。在一些實施例中,核酸具有酶活性。A system expressing a suppressor or comprising a suppressor as disclosed herein may comprise a nucleic acid, eg, one or more nucleic acids. The term "nucleic acid" refers to any compound that is or can be incorporated into an oligonucleotide chain. In some embodiments, nucleic acids are compounds and/or substances that are incorporated or can be incorporated into oligonucleotide chains via phosphodiester linkages. As will be understood from the context, in some embodiments, "nucleic acid" refers to individual nucleic acid residues (e.g., nucleotides and/or nucleosides); oligonucleotide chain. In some embodiments, a "nucleic acid" is or comprises RNA; in some embodiments, a "nucleic acid" is or comprises DNA. In some embodiments, the nucleic acid is or comprises more than 50% ribonucleotides and is referred to herein as ribonucleic acid (RNA). In some embodiments, the nucleic acid is, comprises, or consists of one or more natural nucleic acid residues. In some embodiments, the nucleic acid is, comprises, or consists of one or more nucleic acid analogs. In some embodiments, nucleic acid analogs differ from nucleic acids in that they do not utilize a phosphodiester backbone. For example, in some embodiments, the nucleic acid is, comprises, or consists of one or more "peptide nucleic acids" known in the art and having a peptide in the backbone linkages other than phosphodiester linkages and are considered within the scope of the present invention. Alternatively or additionally, in some embodiments, the nucleic acid has one or more phosphorothioate linkages and/or 5'-N-aminophosphite linkages instead of phosphodiester linkages. In some embodiments, the nucleic acid is, comprises, or consists of one or more natural nucleosides (e.g., adenosine, thymidine, guanosine, cytidine, uridine, deoxyadenosine, deoxythymidine, deoxyguanosine, and deoxycytidine) composition. In some embodiments, the nucleic acid is, comprises, or consists of one or more nucleoside analogs (e.g., 2-aminoadenosine, 2-thiothymidine, inosine, pyrrolopyrimidine, 3-methyladenosine, 5- Methylcytidine, C-5 propynyl-cytidine, C-5 propynyl-uridine, 2-aminoadenosine, C5-bromouridine, C5-fluorouridine, C5-iodouridine, C5-propynyl-uridine, C5-propynyl-cytidine, C5-methylcytidine, 2-aminoadenosine, 7-deazaadenosine, 7-deazaguanosine, 8-oxo base adenosine, 8-side oxyguanosine, 0(6)-methylguanine, 2-thiacytidine, methylated base, inserted base and combinations thereof). In some embodiments, the nucleic acid comprises one or more modified sugars (eg, 2'-fluororibose, ribose, 2'-deoxyribose, arabinose, and hexose) compared to the sugars in natural nucleic acids. In some embodiments, a nucleic acid has a nucleotide sequence that encodes a functional gene product such as RNA or protein. In some embodiments, a nucleic acid includes one or more introns. In some embodiments, nucleic acids are prepared by one or more of isolation from natural sources, enzymatic synthesis (in vivo or in vitro) by polymerization based on complementary templates, replication in recombinant cells or systems, and chemical synthesis. synthesis. As used herein, "recombinant" when used to describe a nucleic acid refers to any nucleic acid that does not occur in nature. In some embodiments, the nucleic acid has at least 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75 ,80,85,90,95,100,110,120,130,140,150,160,170,180,190,20,225,250,275,300,325,350,375,400,425,450 , 475, 500, 600, 700, 800, 900, 1000, 1500, 2000, 2500, 3000, 3500, 4000, 4500, 5000 or more residues in length. In some embodiments, the nucleic acid can have a length of about 2 to about 5000 nts, about 10 to about 100 nts, about 50 to about 150 nts, about 100 to about 200 nts, about 150 to about 250 nts, about 200 to about 300 nts , about 250 to about 350 nts, about 300 to about 500 nts, about 10 to about 1000 nts, about 50 to about 1000 nts, about 100 to about 1000 nts, about 1000 to about 2000 nts, about 2000 to about 3000 nts, A length of about 3000 to about 4000 nts, about 4000 to about 5000 nts, or any range therebetween. In some embodiments, the nucleic acid is partially or fully single-stranded; in some embodiments, the nucleic acid is partially or fully double-stranded. In some embodiments, a nucleic acid has a nucleotide sequence comprising at least one element that encodes a polypeptide or is the complement of a sequence encoding a polypeptide. In some embodiments, the nucleic acid has enzymatic activity.
在一些實施例中,靶向部分包含或為核酸序列、蛋白質、蛋白質融合物,或膜易位多肽。在一些實施例中,靶向部分選自外源接合成核分子、編碼接合成核分子的核酸,或序列靶向性多肽與接合成核分子的融合物。接合成核分子可為例如CTCF、黏合素、USF1、YY1、TATA-box結合蛋白相關因子3 (TAF3)、ZNF143結合模體。在一些實施例中,靶向部分包含或為聚合物或聚合部分,例如核苷酸聚合物(諸如寡核苷酸)、肽核酸、肽-核酸混聚物、肽或多肽、聚醯胺、碳水化合物等。In some embodiments, the targeting moiety comprises or is a nucleic acid sequence, protein, protein fusion, or membrane translocating polypeptide. In some embodiments, the targeting moiety is selected from an exogenous engaging nuclear molecule, a nucleic acid encoding an engaging nuclear molecule, or a fusion of a sequence-targeting polypeptide and an engaging nuclear molecule. Engaging nucleating molecules can be, for example, CTCF, cohesin, USF1, YY1, TATA-box binding protein-associated factor 3 (TAF3), ZNF143 binding motifs. In some embodiments, the targeting moiety comprises or is a polymer or polymeric moiety, such as a nucleotide polymer (such as an oligonucleotide), a peptide nucleic acid, a peptide-nucleic acid hybrid, a peptide or polypeptide, a polyamide, carbohydrates etc.
在一些實施例中,靶向部分包含或為核酸。在一些實施例中,效應部分包含或為核酸。在一些實施例中,可包括於一個部分中的核酸可為或包含DNA、RNA及/或人工或合成核酸或核酸類似物或模擬物。舉例而言,在一些實施例中,核酸可為或包括以下中的一或多者:基因體DNA (gDNA)、互補DNA (cDNA)、肽核酸(PNA)、肽-核酸混聚物、肽-寡核苷酸偶聯物、鎖定核酸(LNA)、橋接式核酸(BNA)、聚醯胺、形成三螺旋體的寡核苷酸、反義寡核苷酸、tRNA、mRNA、rRNA、miRNA、siRNA或其他RNAi分子(例如靶向如本文所述的非編碼RNA及/或靶向與如本文所述之標靶基因體複合物相關之特定基因的表現產物)等。適用於調節劑的核酸序列可包括經修飾的寡核苷酸(例如化學修飾,諸如改變主鏈鍵聯、糖分子及/或核酸鹼基的修飾)及/或人工核酸。在一些實施例中,核酸序列包括(但不限於)基因體DNA、cDNA、肽核酸(PNA)或肽寡核苷酸偶聯物、鎖定核酸(LNA)、橋接式核酸(BNA)、聚醯胺、形成三螺旋體的寡核苷酸、經修飾的DNA、反義DNA寡核苷酸、tRNA、mRNA、rRNA、經修飾的RNA、miRNA、gRNA,及siRNA或其他RNA或DNA分子。在一些實施例中,核酸可包括不為天然存在之DNA或RNA殘基的一或多個殘基,可包括不為磷酸二酯鍵的一或多個鍵聯(例如可為例如硫代磷酸酯鍵等),且/或可包括一或多個修飾,諸如2'O修飾,諸如2'-OmeP。適用於製備合成核酸的多種核酸結構在此項技術中已知(參見例如WO2017/062862l及WO2014/012081),熟習此項技術者將瞭解此等核酸結構可根據本發明加以利用。In some embodiments, targeting moieties comprise or are nucleic acids. In some embodiments, an effector moiety comprises or is a nucleic acid. In some embodiments, nucleic acids that may be included in a portion can be or include DNA, RNA and/or artificial or synthetic nucleic acids or nucleic acid analogs or mimetics. For example, in some embodiments, the nucleic acid can be or include one or more of the following: genomic DNA (gDNA), complementary DNA (cDNA), peptide nucleic acid (PNA), peptide-nucleic acid hybrids, peptide - oligonucleotide conjugates, locked nucleic acid (LNA), bridged nucleic acid (BNA), polyamide, triplex forming oligonucleotides, antisense oligonucleotides, tRNA, mRNA, rRNA, miRNA, siRNA or other RNAi molecules (eg, targeting non-coding RNA as described herein and/or targeting the expression product of a specific gene associated with a target gene body complex as described herein), and the like. Nucleic acid sequences suitable for modulators may include modified oligonucleotides (eg, chemical modifications, such as modifications that alter backbone linkages, sugar molecules, and/or nucleic acid bases) and/or artificial nucleic acids. In some embodiments, nucleic acid sequences include, but are not limited to, genomic DNA, cDNA, peptide nucleic acid (PNA) or peptide-oligonucleotide conjugates, locked nucleic acid (LNA), bridged nucleic acid (BNA), polyamide Amines, triplex-forming oligonucleotides, modified DNA, antisense DNA oligonucleotides, tRNA, mRNA, rRNA, modified RNA, miRNA, gRNA, and siRNA or other RNA or DNA molecules. In some embodiments, a nucleic acid may include one or more residues that are not naturally occurring DNA or RNA residues, may include one or more linkages that are not phosphodiester bonds (such as may be, for example, phosphorothioate ester bond, etc.), and/or may include one or more modifications, such as a 2'O modification, such as 2'-OmeP. A variety of nucleic acid structures suitable for use in the preparation of synthetic nucleic acids are known in the art (see eg WO2017/0628621 and WO2014/012081), and those skilled in the art will appreciate that such nucleic acid structures can be utilized in accordance with the present invention.
核酸的一些實例包括(但不限於)與靶基因(例如MYC)雜交的核酸(例如如本文在別處所述的gRNA或反義ssDNA)、與外源核酸雜交的核酸(諸如病毒DNA或RNA)、與RNA雜交的核酸、干擾基因轉錄的核酸、干擾RNA轉譯的核酸、使RNA穩定或使RNA不穩定(諸如經由靶向以達成降解)的核酸、干擾DNA或RNA結合因子(經由干擾其表現或其功能)的核酸、與細胞內蛋白質或蛋白質複合物連接且調節其功能的核酸等。Some examples of nucleic acids include, but are not limited to, nucleic acids (such as gRNA or antisense ssDNA as described elsewhere herein) that hybridize to a target gene (such as MYC), nucleic acids that hybridize to exogenous nucleic acids (such as viral DNA or RNA) , nucleic acids that hybridize to RNA, nucleic acids that interfere with gene transcription, nucleic acids that interfere with RNA translation, nucleic acids that stabilize or destabilize RNA (such as by targeting for degradation), interfere with DNA or RNA binding factors (by interfering with their expression or its functions), nucleic acids that link to intracellular proteins or protein complexes and regulate their functions, etc.
在一些實施例中,表現抑制子包含一或多種核苷類似物。在一些實施例中,核酸序列另外或作為替代方案可包括一或多種天然核苷,例如嘌呤或嘧啶,例如腺嘌呤、胞嘧啶、鳥嘌呤、胸腺嘧啶及尿嘧啶;一或多種核苷類似物。在一些實施例中,核酸序列包括一或多種核苷類似物。核苷類似物可包括(但不限於)諸如以下核苷類似物:5-氟尿嘧啶、5-溴尿嘧啶、5-氯尿嘧啶、5-碘尿嘧啶、次黃嘌呤、黃嘌呤、4-乙醯胞嘧啶、4-甲基苯并咪唑、5-(羧基羥基甲基)尿嘧啶、5-羧甲基胺基甲基-2-硫尿苷、5-羧甲基胺基甲基尿嘧啶、二氫尿嘧啶、二氫尿苷、β-D-半乳糖苷基Q核苷、肌苷、N6-異戊烯基腺嘌呤、1-甲基鳥嘌呤、1-甲基肌苷、2,2-二甲基鳥嘌呤、2-甲基腺嘌呤、2-甲基鳥嘌呤、3-甲基胞嘧啶、5-甲基胞嘧啶、N6-腺嘌呤、7-甲基鳥嘌呤、5-甲基胺基甲基尿嘧啶、5-甲氧基胺基甲基-2-硫尿嘧啶、β-D-甘露糖苷基Q核苷、5'-甲氧基羧基甲基尿嘧啶、5-甲氧基尿嘧啶、2-甲基硫基-N6-異戊烯基腺嘌呤、尿嘧啶-5-氧基乙酸(v)、懷丁氧苷(wybutoxosine)、假尿嘧啶、Q核苷(queosine)、2-硫胞嘧啶、5-甲基-2-硫尿嘧啶、2-硫尿嘧啶、4-硫尿嘧啶、5-甲基尿嘧啶、尿嘧啶-5-氧基乙酸甲酯、尿嘧啶-5-氧基乙酸(v)、5-甲基-2-硫基尿嘧啶、3-(3-胺基-3-N-2-羧丙基)尿嘧啶、(acp3)w、2,6-二胺基嘌呤、3-硝基吡咯、肌苷、硫尿苷、Q核苷(queuosine)、懷俄苷(wyosine)、二胺基嘌呤、異鳥嘌呤、異胞嘧啶、二胺基嘧啶、2,4-二氟甲苯、異喹啉、吡咯并[2,3-β]吡啶,及可與嘌呤或嘧啶側鏈鹼基配對的任何其他核苷類似物。In some embodiments, the expression suppressor comprises one or more nucleoside analogs. In some embodiments, the nucleic acid sequence may additionally or alternatively comprise one or more natural nucleosides, such as purines or pyrimidines, such as adenine, cytosine, guanine, thymine, and uracil; one or more nucleoside analogs; . In some embodiments, the nucleic acid sequence includes one or more nucleoside analogs. Nucleoside analogs may include, but are not limited to, nucleoside analogs such as: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-ethyl Acylcytosine, 4-methylbenzimidazole, 5-(carboxyhydroxymethyl)uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil , dihydrouracil, dihydrouridine, β-D-galactosyl Q nucleoside, inosine, N6-prenyl adenine, 1-methylguanine, 1-methylinosine, 2 ,2-Dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5 -Methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, β-D-mannosyl Q nucleoside, 5'-methoxycarboxymethyluracil, 5 -Methoxyuracil, 2-methylthio-N6-prenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, Q nucleoside (queosine), 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, methyl uracil-5-oxyacetate , uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl)uracil, (acp3)w , 2,6-diaminopurine, 3-nitropyrrole, inosine, thiouridine, Q nucleoside (queuosine), wyosine (wyosine), diaminopurine, isoguanine, isocytosine, Diaminopyrimidine, 2,4-difluorotoluene, isoquinoline, pyrrolo[2,3-β]pyridine, and any other nucleoside analog that can base pair with a purine or pyrimidine side chain.
CRISPR/Cas域 在一些實施例中,靶向部分為或包含CRISPR/Cas域。CRISPR/Cas蛋白可包含CRISPR/Cas效應子及視情況存在的一或多種其他域。CRISPR/Cas域典型地與涉及叢集之調控性間雜短回文重複(CRISPR)系統的蛋白質(例如Cas蛋白)具有結構及/或功能相似性。CRISPR/Cas域視情況包含嚮導RNA,例如單嚮導RNA (sgRNA)。在一些實施例中,CRISPR/Cas域所含的gRNA被CRISPR/Cas域非共價結合。CRISPR/Cas domains In some embodiments, the targeting moiety is or comprises a CRISPR/Cas domain. A CRISPR/Cas protein can comprise a CRISPR/Cas effector and optionally one or more other domains. CRISPR/Cas domains typically share structural and/or functional similarities to proteins involved in the Clustered Regulatory Interspersed Short Palindromic Repeat (CRISPR) system, such as Cas proteins. The CRISPR/Cas domain optionally includes a guide RNA, such as a single guide RNA (sgRNA). In some embodiments, the gRNA contained by the CRISPR/Cas domain is non-covalently bound by the CRISPR/Cas domain.
CRISPR系統為最初在細菌及古菌中發現的適應性防禦系統。CRISPR系統利用RNA導引的核酸酶(稱為CRISPR相關的核酸內切酶或「Cas」核酸內切酶(例如Cas9或Cpf1))使外來DNA裂解。舉例而言,在典型的CRISPR/Cas系統中,內切核酸酶藉由靶向單股或雙股DNA序列的序列特異性非編碼「嚮導RNA」被引向標靶核苷酸序列(例如基因體中序列將被編輯的位點)。已鑑定出三類(I-III) CRISPR系統。II類CRISPR系統使用單一Cas核酸內切酶(而非多種Cas蛋白)。一個II類CRISPR系統包括II型Cas核酸內切酶,諸如Cas9、CRISPR RNA (「crRNA」)及反式活化crRNA (「tracrRNA」)。crRNA含有「嚮導RNA」,典型地為對應於標靶DNA序列的約20個核苷酸RNA序列。crRNA亦含有結合至tracrRNA的區域,以形成被核糖核酸酶III裂解之部分雙股結構,產生crRNA/tracrRNA雜交體。crRNA/tracrRNA雜交體接著導引Cas9核酸內切酶識別且裂解標靶DNA序列。標靶DNA序列通常必須鄰接於指定Cas核酸內切酶所特有的「原間隔序列相鄰模體」(「PAM」);然而,PAM序列出現在整個指定基因體中。自多種原核生物物種鑑定的CRISPR核酸內切酶具有獨特的PAM序列要求;PAM序列之實例包括5'-NGG (釀膿鏈球菌(Streptococcus pyogenes))、5'-NNAGAA (嗜熱鏈球菌(Streptococcus thermophilus) CRISPR1)、5'-NGGNG (嗜熱鏈球菌CRISPR3)及5'-NNNGATT (腦膜炎奈瑟氏菌(Neisseria meningiditis))。一些核酸內切酶(例如Cas9核酸內切酶)與富含G之PAM位點(例如5'-NGG)相關,且在PAM位點上游(5') 3個核苷酸位置處對標靶DNA進行鈍端裂解。另一種II類CRISPR系統包括V型核酸內切酶Cpf1,其小於Cas9;實例包括AsCpf1 (來自胺基酸球菌屬種(Acidaminococcus sp.))及LbCpf1 (來自毛螺菌科種(Lachnospiraceae sp.))。與Cpf1相關的CRISPR陣列在無需tracrRNA的情況下被加工成成熟crRNA;換言之,Cpf1系統僅需Cpf1核酸酶及crRNA便使標靶DNA序列裂解。Cpf1核酸內切酶與富含T之PAM位點(例如5'-TTN)相關。Cpf1亦可識別5'-CTA PAM模體。Cpf1藉由引入具有4或5個核苷酸5'突出端之偏移或交錯雙股斷裂而使標靶DNA裂解,例如在編碼股上之PAM位點下游(3') 18個核苷酸處及互補股上之PAM位點下游23個核苷酸處,利用5個核苷酸偏移切割或交錯切割使標靶DNA裂解;此類偏移裂解產生之5個核苷酸突出端允許藉由同源重組、藉由DNA插入進行的基因體編輯比在鈍端裂解之DNA處進行的插入更精確。參見例如Zetsche等人 (2015) Cell, 163:759 – 771。The CRISPR system is an adaptive defense system originally discovered in bacteria and archaea. CRISPR systems utilize RNA-guided nucleases known as CRISPR-associated endonucleases or “Cas” endonucleases (such as Cas9 or Cpf1) to cleave foreign DNA. For example, in a typical CRISPR/Cas system, an endonuclease is guided to a target nucleotide sequence (such as a gene site in the body where the sequence will be edited). Three classes (I-III) of CRISPR systems have been identified. Class II CRISPR systems use a single Cas endonuclease (rather than multiple Cas proteins). A class II CRISPR system includes type II Cas endonucleases, such as Cas9, CRISPR RNA ("crRNA"), and trans-activating crRNA ("tracrRNA"). crRNA contains a "guide RNA," typically an RNA sequence of about 20 nucleotides corresponding to a target DNA sequence. crRNA also contains a region that binds to tracrRNA to form a partially double-stranded structure that is cleaved by RNase III, resulting in a crRNA/tracrRNA hybrid. The crRNA/tracrRNA hybrid then directs the Cas9 endonuclease to recognize and cleave the target DNA sequence. The target DNA sequence must generally be contiguous to a "protospacer adjacent motif" ("PAM") that is characteristic of a given Cas endonuclease; however, PAM sequences occur throughout a given gene body. CRISPR endonucleases identified from various prokaryotic species have unique PAM sequence requirements; examples of PAM sequences include 5'-NGG (Streptococcus pyogenes), 5'-NNAGAA (Streptococcus thermophilus thermophilus) CRISPR1), 5'-NGGNG (Streptococcus thermophilus CRISPR3) and 5'-NNNGATT (Neisseria meningiditis). Some endonucleases (e.g. Cas9 endonuclease) associate with a G-rich PAM site (e.g. 5'-NGG) and target at a
本發明所提供之技術中可使用多種CRISPR相關(Cas)基因或蛋白質且Cas蛋白的選擇視方法的特定條件而定。Cas蛋白的特定實例包括II類系統,包括Cas1、Cas2、Cas3、Cas4、Cas5、Cas6、Cas7、Cas8、Cas9、Cas10、Cpf1、C2C1或C2C3。在一些實施例中,Cas蛋白(例如Cas9蛋白)可來自多種原核生物物種中的任一者。在一些實施例中,選擇特定的Cas蛋白(例如特定的Cas9蛋白)識別特定的原間隔子鄰接模體(PAM)序列。在一些實施例中,DNA靶向部分包括序列靶向性多肽,諸如Cas蛋白,例如Cas9。在某些實施例中,Cas蛋白(例如Cas9蛋白)可自細菌或古菌獲得或利用已知方法合成。在某些實施例中,Cas蛋白可來自革蘭氏陽性細菌(gram-positive bacteria)或革蘭氏陰性細菌。在某些實施例中,Cas蛋白可來自鏈球菌屬(例如釀膿鏈球菌或嗜熱鏈球菌)、弗朗西斯菌屬(Francisella)(例如新兇手弗朗西斯菌(F. novicida))、葡萄球菌(例如金黃色葡萄球菌(S. aureus))、胺基酸球菌屬(例如胺基酸球菌屬BV3L6)、奈瑟氏菌屬(例如奈瑟氏腦膜炎菌(N. meningitidis))、隱球菌屬(Cryptococcus)、棒狀桿菌屬(Corynebacterium)、嗜血桿菌屬(Haemophilus)、真桿菌屬(Eubacterium)、巴斯德菌屬(Pasteurella)、普氏菌屬(Prevotella)、韋榮氏球菌屬(Veillonella)或海桿菌屬(Marinobacter)。Various CRISPR-associated (Cas) genes or proteins can be used in the technology provided by the present invention, and the selection of Cas protein depends on the specific conditions of the method. Specific examples of Cas proteins include class II systems, including Cas1, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9, Cas10, Cpf1, C2C1 or C2C3. In some embodiments, the Cas protein (eg, Cas9 protein) can be from any of a variety of prokaryotic species. In some embodiments, a particular Cas protein (eg, a particular Cas9 protein) is selected to recognize a particular protospacer adjacent motif (PAM) sequence. In some embodiments, the DNA targeting moiety comprises a sequence targeting polypeptide, such as a Cas protein, eg Cas9. In certain embodiments, Cas proteins (eg, Cas9 proteins) can be obtained from bacteria or archaea or synthesized using known methods. In certain embodiments, the Cas protein can be from gram-positive bacteria or gram-negative bacteria. In certain embodiments, the Cas protein may be from Streptococcus (eg, Streptococcus pyogenes or Streptococcus thermophilus), Francisella (eg, F. novicida), Staphylococcus (eg, Staphylococcus aureus (S. aureus)), Aminococcus (e.g. Aminococcus BV3L6), Neisseria (e.g. Neisseria meningitidis (N. meningitidis)), Cryptococcus ( Cryptococcus), Corynebacterium, Haemophilus, Eubacterium, Pasteurella, Prevotella, Veillonella ) or Marinobacter.
在一些實施例中,Cas蛋白需要標靶DNA序列中存在或鄰接原間隔序列相鄰模體(PAM)以便Cas蛋白結合及/或發揮功能。在一些實施例中,PAM自5'至3'為或包含NGG、YG、NNGRRT、NNNRRT、NGA、TYCV、TATV、NTTN或NNNGATT,其中N表示任何核苷酸,Y表示C或T,R表示A或G,且V表示A或C或G。在一些實施例中,Cas蛋白為表1中所列的蛋白質。在一些實施例中,Cas蛋白包含改變其PAM的一或多個突變。在一些實施例中,Cas蛋白包含E1369R、E1449H及R1556A突變或類似的針對與該等位置對應之胺基酸的取代。在一些實施例中,Cas蛋白包含E782K、N968K及R1015H突變或類似的針對與該等位置對應之胺基酸的取代。在一些實施例中,Cas蛋白包含D1135V、R1335Q及T1337R突變或類似的針對與該等位置對應之胺基酸的取代。在一些實施例中,Cas蛋白包含S542R及K607R突變或類似的針對與該等位置對應之胺基酸的取代。在一些實施例中,Cas蛋白包含S542R、K548V及N552R突變或類似的針對與該等位置對應之胺基酸的取代。
表1
在一些實施例中,Cas蛋白經修飾以使核酸酶不活化,例如缺乏核酸酶的Cas。在一些實施例中,Cas蛋白為Cas9蛋白。儘管野生型Cas9在gRNA靶向的特定DNA序列處產生雙股斷裂(DSB),但可使用具有經修改之功能的多種CRISPR核酸內切酶,例如:Cas9的「切口酶」版本僅產生單股斷裂;無催化活性的Cas9 (「dCas9」)不切割標靶DNA。在一些實施例中,dCas結合至DNA序列可藉由空間位阻來干擾該位點處的轉錄。在一些實施例中,DNA靶向部分為或包含無催化活性的Cas,例如dCas。在此項技術中已知多種無催化活性的Cas蛋白。在一些實施例中,dCas9包含位於Cas蛋白之各核酸內切酶域中的突變,例如D10A及H840A突變。In some embodiments, the Cas protein is modified to inactivate nucleases, eg, Cas lacking nucleases. In some embodiments, the Cas protein is a Cas9 protein. While wild-type Cas9 produces double-stranded breaks (DSBs) at the specific DNA sequence targeted by the gRNA, a variety of CRISPR endonucleases with modified functions are available, e.g. a "nickase" version of Cas9 that produces only single strands Fragmentation; Catalytically inactive Cas9 (“dCas9”) does not cleave target DNA. In some embodiments, dCas binding to a DNA sequence can interfere with transcription at that site through steric hindrance. In some embodiments, the DNA targeting moiety is or comprises a catalytically inactive Cas, such as dCas. Various catalytically inactive Cas proteins are known in the art. In some embodiments, dCas9 comprises mutations in each endonuclease domain of the Cas protein, such as D10A and H840A mutations.
在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含D11A突變或類似的針對與該位置對應之胺基酸的取代。在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含H969A突變或類似的針對與該位置對應之胺基酸的取代。在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含N995A突變或類似的針對與該位置對應之胺基酸的取代。在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含D11A、H969A和N995A突變或類似的針對與該等位置對應之胺基酸的取代。In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises a D11A mutation or similar substitution for the amino acid corresponding to this position. In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises the H969A mutation or similar substitution for the amino acid corresponding to this position. In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises the N995A mutation or similar substitution for the amino acid corresponding to this position. In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises the D11A, H969A, and N995A mutations or similar substitutions for the amino acids corresponding to these positions.
在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含D10A突變或類似的針對與該位置對應之胺基酸的取代。在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含H557A突變或類似的針對與該位置對應之胺基酸的取代。在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含D10A和H557A突變或類似的針對與該等位置對應之胺基酸的取代。In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises a D10A mutation or similar substitution for the amino acid corresponding to this position. In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises the H557A mutation or similar substitution for the amino acid corresponding to this position. In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises the D10A and H557A mutations or similar substitutions for the amino acids corresponding to these positions.
在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含D839A突變或類似的針對與該位置對應之胺基酸的取代。在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含H840A突變或類似的針對與該位置對應之胺基酸的取代。在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含N863A突變或類似的針對與該位置對應之胺基酸的取代。在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含D10A、D839A、H840A和N863A突變或類似的針對與該等位置對應之胺基酸的取代。In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises a D839A mutation or similar substitution for the amino acid corresponding to this position. In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises the H840A mutation or similar substitution for the amino acid corresponding to this position. In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises the N863A mutation or similar substitution for the amino acid corresponding to this position. In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises D10A, D839A, H840A, and N863A mutations or similar substitutions for the amino acids corresponding to these positions.
在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含E993A突變或類似的針對與該位置對應之胺基酸的取代。In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises the E993A mutation or similar substitution for the amino acid corresponding to this position.
在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含D917A突變或類似的針對與該位置對應之胺基酸的取代。在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含E1006A突變或類似的針對與該位置對應之胺基酸的取代。在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含D1255A突變或類似的針對與該位置對應之胺基酸的取代。在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含D917A、E1006A和D1255A突變或類似的針對與該等位置對應之胺基酸的取代。In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises a D917A mutation or similar substitution for the amino acid corresponding to this position. In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises the E1006A mutation or similar substitution for the amino acid corresponding to this position. In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises a D1255A mutation or similar substitution for the amino acid corresponding to this position. In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises the D917A, E1006A, and D1255A mutations or similar substitutions for the amino acids corresponding to these positions.
在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含D16A突變或類似的針對與該位置對應之胺基酸的取代。在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含D587A突變或類似的針對與該位置對應之胺基酸的取代。在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含H588A突變或類似的針對與該位置對應之胺基酸的取代。在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含N611A突變或類似的針對與該位置對應之胺基酸的取代。在一些實施例中,無催化活性的Cas9蛋白(例如dCas9)包含D16A、D587A、H588A和N611A突變或類似的針對與該等位置對應之胺基酸的取代。In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises a D16A mutation or similar substitution for the amino acid corresponding to this position. In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises a D587A mutation or similar substitution for the amino acid corresponding to this position. In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises the H588A mutation or similar substitution for the amino acid corresponding to this position. In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises the N611A mutation or similar substitution for the amino acid corresponding to this position. In some embodiments, the catalytically inactive Cas9 protein (eg, dCas9) comprises the D16A, D587A, H588A, and N611A mutations or similar substitutions for the amino acids corresponding to these positions.
在另一態樣中,本發明係關於一種表現抑制子或多肽,其包含一或多個(例如一個)靶向部分及一或多個效應部分,其中一或多個靶向部分為或包含CRISPR/Cas域,包含Cas蛋白,例如無催化活性的Cas9蛋白,例如dCas9,或其功能變異體或片段。在一些實施例中,dCas9包含胺基酸序列SEQ ID NO: 17: In another aspect, the invention relates to an expression inhibitor or polypeptide comprising one or more (eg, one) targeting moieties and one or more effector moieties, wherein one or more targeting moieties are or comprise A CRISPR/Cas domain comprising a Cas protein, such as a catalytically inactive Cas9 protein, such as dCas9, or a functional variant or fragment thereof. In some embodiments, dCas9 comprises the amino acid sequence of SEQ ID NO: 17:
在一些實施例中,dCas9由核酸序列SEQ ID NO: 50編碼: In some embodiments, dCas9 is encoded by the nucleic acid sequence SEQ ID NO: 50:
在一些實施例中,靶向部分可包含含有gRNA或連接(例如共價連接)至gRNA的Cas域。gRNA為短合成RNA,其由為了Cas蛋白結合而必需的「支架」序列及使用者定義之針對基因體標靶的約20種核苷酸靶向序列構成。實務上,嚮導RNA序列通常設計成具有17至24個核苷酸(例如19、20或21個核苷酸)的長度且與靶向核酸序列互補。用於設計有效嚮導RNA的定製gRNA產生器及算法可市購。亦已使用嵌合「單嚮導RNA」(「sgRNA」)實現基因編輯,該單嚮導RNA為模擬天然存在之crRNA-tracrRNA複合物且含有tracrRNA (用於結合核酸酶)與至少一種crRNA (以將核酸酶導引至用於編輯的標靶序列)的經工程改造(合成)之單一RNA分子。亦已證明經化學修飾之sgRNA有效用於Cas蛋白;參見例如Hendel等人(2015) Nature Biotechnol., 985 - 991。例示性嚮導RNA序列揭示於表2及表13中。In some embodiments, a targeting moiety can comprise a Cas domain comprising or linked (eg, covalently linked) to a gRNA. gRNAs are short synthetic RNAs that consist of a "scaffold" sequence necessary for Cas protein binding and user-defined ~20 nucleotide targeting sequences for gene body targets. In practice, guide RNA sequences are usually designed to have a length of 17 to 24 nucleotides (eg, 19, 20 or 21 nucleotides) and are complementary to the target nucleic acid sequence. Custom gRNA generators and algorithms for designing efficient guide RNAs are commercially available. Gene editing has also been achieved using chimeric "single-guide RNA" ("sgRNA"), which mimics the naturally occurring crRNA-tracrRNA complex and contains tracrRNA (for binding nucleases) and at least one crRNA (for binding A single RNA molecule engineered (synthesized) by a nuclease that guides to a target sequence for editing. Chemically modified sgRNAs have also been shown to be effective for Cas proteins; see eg Hendel et al. (2015) Nature Biotechnol., 985-991. Exemplary guide RNA sequences are disclosed in Table 2 and Table 13.
在一些實施例中,gRNA包含與靶基因相關DNA序列互補的核酸序列。在一些實施例中,DNA序列為可操作地連接至靶基因的表現控制元件、包含該元件或與該元件重疊。在一些實施例中,gRNA包含與靶基因相關DNA序列至少90、95、99或100%互補的核酸序列。在一些實施例中,聯合包含Cas分子之DNA靶向部分使用的gRNA為sgRNA。In some embodiments, the gRNA comprises a nucleic acid sequence complementary to a DNA sequence associated with a target gene. In some embodiments, the DNA sequence is, comprises, or overlaps with an expression control element operably linked to a target gene. In some embodiments, the gRNA comprises a nucleic acid sequence that is at least 90, 95, 99, or 100% complementary to a DNA sequence associated with a target gene. In some embodiments, the gRNA used in combination with a DNA targeting moiety comprising a Cas molecule is an sgRNA.
在一些實施例中,聯合CRISPR/Cas域使用的gRNA特異性結合與CTCF相關的靶序列。在一些實施例中,聯合CRISPR/Cas域使用的gRNA特異性結合與啟動子相關的靶序列。在一些實施例中,gRNA結合表2或表13中所列之靶序列。在一些實施例中,本文所述之表現抑制子結合至表2或表13中所列之靶序列。
表2:例示性gRNA序列
在一些實施例中,表現抑制系統包含含有第一DNA靶向部分的第一表現抑制子及含有第二DNA靶向部分的第二表現抑制子,其中該第一DNA靶向部分包含或為第一CRISPR/Cas域且該第二DNA靶向部分包含或為第二CRISPR/Cas域。在一些實施例中,第一CRISPR/Cas域包含第一CRISPR/Cas蛋白及第一嚮導RNA,且第二CRISPR/Cas域包含第二CRISPR/Cas蛋白及第二嚮導RNA。在一些實施例中,第一CRISPR/Cas蛋白對第二嚮導RNA的結合不明顯(例如不結合),例如以至少10、20、50、100、1000或10,000 nM的KD結合,且第二CRISPR/Cas蛋白對第一嚮導RNA的結合不明顯(例如不結合),例如以至少10、20、50、100、1000或10,000 nM的K D結合。 In some embodiments, the expression suppression system comprises a first expression suppressor comprising a first DNA targeting moiety and a second expression suppressor comprising a second DNA targeting moiety, wherein the first DNA targeting moiety comprises or is a second DNA targeting moiety. a CRISPR/Cas domain and the second DNA targeting moiety comprises or is a second CRISPR/Cas domain. In some embodiments, the first CRISPR/Cas domain comprises a first CRISPR/Cas protein and a first guide RNA, and the second CRISPR/Cas domain comprises a second CRISPR/Cas protein and a second guide RNA. In some embodiments, the first CRISPR/Cas protein binds insignificantly (e.g., does not bind) to the second guide RNA, e.g., with a KD of at least 10, 20, 50, 100, 1000, or 10,000 nM, and the second CRISPR The Cas protein binds insignificantly (eg, does not bind) to the first guide RNA, eg, with a KD of at least 10, 20, 50, 100, 1000, or 10,000 nM.
TAL效應域 在一些實施例中,DNA靶向部分為或包含TAL效應域。TAL效應域,例如特異性結合DNA序列的TAL效應域,包含複數個TAL效應子重複序列或其片段,且視情況包含天然存在之TAL效應子重複序列的一或多個其他部分(例如複數個TAL效應域的N端及/或C端),其中各TAL效應子重複序列識別核苷酸。TAL效應蛋白可包含TAL效應域及視情況存在的一或多種其他域。多種TAL效應域已為熟習此項技術者所知且可市購,例如購自Thermo Fisher Scientific。TAL effector domains In some embodiments, the DNA targeting moiety is or comprises a TAL effector domain. A TAL effector domain, e.g., a TAL effector domain that specifically binds a DNA sequence, comprises a plurality of TAL effector repeats or fragments thereof, and optionally one or more other portions of a naturally occurring TAL effector repeat (e.g., a plurality of N-terminal and/or C-terminal of TAL effector domain), wherein each TAL effector repeat sequence recognizes nucleotides. A TAL effector protein may comprise a TAL effector domain and optionally one or more other domains. A variety of TAL effector domains are known to those skilled in the art and are commercially available, eg, from Thermo Fisher Scientific.
TALE為細菌病原體之多種菌種所分泌的天然效應蛋白,該等菌種包括植物病原體調節宿主植物中之基因表現且促進細菌拓殖及存活的黃單孢菌屬( Xanthomonas)。TAL效應子的特異性結合係基於中心重複域,其為典型地33或34個胺基酸的串聯排列、幾乎一致的重複序列(重複-可變雙殘基,RVD域)。 TALEs are natural effector proteins secreted by various species of bacterial pathogens, including the phytopathogen Xanthomonas that modulate gene expression in host plants and promote bacterial colonization and survival. The specific binding of TAL effectors is based on a central repeat domain, which is a tandem arrangement of typically 33 or 34 amino acids, nearly identical repeat sequences (repeat-variable diresidue, RVD domains).
TAL效應子家族成員不同之處主要在於其重複的數目及次序。重複的數目在1.5至33.5個重複的範圍內且C端重複通常具有較短的長度(例如約20個胺基酸)且一般稱為「半重複」。TAL效應子中之每個重複的特徵為一個重複與一個鹼基對的相關性,其中不同的重複類型展現不同的鹼基對特異性(一個重複識別靶基因序列上的一個鹼基對)。一般而言,重複數目愈小,則蛋白質-DNA相互作用愈弱。6.5個重複的數目已顯示足以活化報導基因的轉錄(Scholze等人, 2010)。Members of the TAL effector family differ mainly in the number and order of their repeats. The number of repeats ranges from 1.5 to 33.5 repeats and the C-terminal repeats are usually of shorter length (eg, about 20 amino acids) and are generally referred to as "half-repeats." Each repeat in a TAL effector is characterized by an association of one repeat to one base pair, where different repeat types exhibit different base pair specificities (one repeat recognizes one base pair on the target gene sequence). In general, the smaller the number of repeats, the weaker the protein-DNA interaction. A number of 6.5 repeats has been shown to be sufficient to activate transcription of the reporter gene (Scholze et al., 2010).
重複序列間的變化主要存在於胺基酸位置12及13處,其因此稱為「高變」且對與標靶DNA啟動子序列相互作用之特異性負責,如列舉例示性重複可變雙殘基(RVD)及其與核酸鹼基標靶之對應關係的表3中所示。
表3 - RVD及核酸鹼基特異性
相應地,可調節TAL效應子中的重複序列以靶向特定的DNA序列。其他研究已表明RVD NK可靶向G。TAL效應子的靶點亦傾向於包括與第一重複序列所靶向之5'鹼基側接的T,但此識別的確切機制未知。迄今為止已知超過113種TAL效應序列。來自黃單孢菌屬之TAL效應子的非限制性實例包括Hax2、Hax3、Hax4、AvrXa7、AvrXa10及AvrBs3。Accordingly, repetitive sequences in TAL effectors can be modulated to target specific DNA sequences. Other studies have shown that RVD NKs can target G. The targets of TAL effectors also tend to include a T flanking the 5' base targeted by the first repeat, but the exact mechanism of this recognition is unknown. More than 113 TAL effector sequences are known to date. Non-limiting examples of TAL effectors from Xanthomonas include Hax2, Hax3, Hax4, AvrXa7, AvrXa10, and AvrBs3.
相應地,本發明之TAL效應域中的TAL效應子重複序列可來源於任何細菌菌種(例如黃單孢菌屬菌種,諸如非洲水稻黃單孢菌水稻變種( Xanthomonas oryzae pv . Oryzae)菌株(Yu等人2011)、野油菜黃單孢菌蘿蔔致病變種( Xanthomonas campestris pv . raphani)菌株756C及水稻黃單孢菌條斑病菌菌株( Xanthomonas oryzae pv . Oryzicolastrain) BLS256 (Bogdanove等人2011))的TAL效應子。如本文所用,根據本發明之TAL效應域包含RVD域以及亦來自天然存在之TAL效應子的側接序列(RVD域之N端及/或C端一側的序列)。其可包含比天然存在之TAL效應域中的RVD更多或更少的重複序列。本發明之TAL效應域基於上述代碼及此項技術中已知的其他代碼設計成靶向指定的DNA序列。TAL效應子重複(例如單體或模組)及其特定序列的數目係基於所需DNA靶序列來選擇。舉例而言,可移除或添加TAL效應子重複以便適合特定的靶序列。在一個實施例中,本發明之TAL效應域包含6.5至33.5個TAL效應子重複。在一個實施例中,本發明之TAL效應域包含8至33.5個TAL效應子重複,例如10至25個TAL效應子重複,例如10至14個TAL效應子重複。 Correspondingly, the TAL effector repeat sequence in the TAL effector domain of the present invention can be derived from any bacterial species (such as Xanthomonas oryzae pv . Oryzae strains) (Yu et al. 2011), Xanthomonas campestris pv . raphani strain 756C and Xanthomonas oryzae pv . Oryzicola strain BLS256 (Bogdanove et al. 2011 )) TAL effectors. As used herein, a TAL effector domain according to the invention comprises a RVD domain and flanking sequences (sequences on the N-terminal and/or C-terminal side of the RVD domain) also from naturally occurring TAL effectors. It may contain more or fewer repeats than RVDs in naturally occurring TAL effector domains. The TAL effector domains of the present invention are designed to target specified DNA sequences based on the codes described above and others known in the art. The number of TAL effector repeats (eg, monomers or modules) and their specific sequences are selected based on the desired DNA target sequence. For example, TAL effector repeats can be removed or added to suit a particular target sequence. In one embodiment, the TAL effector domain of the invention comprises 6.5 to 33.5 TAL effector repeats. In one embodiment, the TAL effector domain of the present invention comprises 8 to 33.5 TAL effector repeats, such as 10 to 25 TAL effector repeats, such as 10 to 14 TAL effector repeats.
在一些實施例中,TAL效應域包含與DNA靶序列完全匹配對應的TAL效應子重複序列。在一些實施例中,重複序列與DNA靶序列上之標靶鹼基對之間的錯配只要其讓表現抑制系統(例如包含TAL效應域的表現抑制子)發揮功能即被允許。一般而言,TALE結合與錯配數目負相關。在一些實施例中,相對於標靶DNA序列,本發明之表現抑制子中的TAL效應域包含不超過7個錯配、6個錯配、5個錯配、4個錯配、3個錯配、2個錯配或1個錯配,及視情況無錯配。不希望受理論束縛,一般而言,TAL效應域中的TAL效應子重複數目愈小,則耐受的錯配數目愈小且仍允許表現抑制系統(例如包含TAL效應域的表現抑制子)發揮功能。結合親和力被認為依賴於匹配的重複序列-DNA組合之總和。舉例而言,具有25個或更多個TAL效應子重複的TAL效應域能夠耐受至多7個錯配。In some embodiments, the TAL effector domain comprises a TAL effector repeat sequence corresponding to an exact match to the DNA target sequence. In some embodiments, mismatches between the repeat sequence and a target base pair on the DNA target sequence are allowed as long as they enable the function of an expression repression system (eg, an expression repressor comprising a TAL effector domain). In general, TALE binding is inversely correlated with the number of mismatches. In some embodiments, the TAL effector domain in the expression suppressor of the invention comprises no more than 7 mismatches, 6 mismatches, 5 mismatches, 4 mismatches, 3 mismatches relative to the target DNA sequence match, 2 mismatches or 1 mismatch, and no mismatch as appropriate. Without wishing to be bound by theory, in general, the smaller the number of TAL effector repeats in a TAL effector domain, the smaller the number of mismatches tolerated and still allow expression suppression systems (e.g., expression suppressors comprising TAL effector domains) to function. Function. Binding affinity is thought to depend on the sum of matched repeat-DNA combinations. For example, a TAL effector domain with 25 or more TAL effector repeats can tolerate up to 7 mismatches.
除TAL效應子重複之外,本發明之TAL效應域可包含來源於天然存在之TAL效應子的其他序列。TAL效應域中之TAL效應子重複部分的每一側所包括的C端及/或N端序列長度可變化且可由熟習此項技術者選擇,例如基於Zhang等人(2011)的研究。Zhang等人已鑑定出基於Hax3衍生之TAL效應子之蛋白質的多種C端及N端截斷突變體且已鑑定出對最佳結合至靶序列及因此對轉錄活化有貢獻的關鍵元件。一般而言,發現轉錄活性與N端長度負相關。就C端而言,鑑定出對Hax 3序列之前68個胺基酸內之DNA結合殘基重要的元件。相應地,在一些實施例中,本發明之表現抑制子的TAL效應域中包括天然存在之TAL效應子中之TAL效應子重複序列的C端一側的前68個胺基酸。相應地,在一個實施例中,本發明之TAL效應域包含:1)來源於天然存在之TAL效應子的一或多個TAL效應子重複序列;2)來自天然存在之TAL效應子、位於TAL效應子重複序列之N端一側的至少70、80、90、100、110、120、130、140、150、170、180、190、200、220、230、240、250、260、270、280個或更多個胺基酸;及/或3)來自天然存在之TAL效應子、位於TAL效應子重複序列之C端一側的至少68、80、90、100、110、120、130、140、150、170、180、190、200、220、230、240、250、260個或更多個胺基酸。In addition to TAL effector repeats, the TAL effector domains of the invention may comprise other sequences derived from naturally occurring TAL effectors. The C-terminal and/or N-terminal sequences included on each side of the TAL effector repeat in the TAL effector domain can vary in length and can be selected by those skilled in the art, eg based on the work of Zhang et al. (2011). Zhang et al. have identified various C- and N-terminal truncation mutants of Hax3-derived TAL effector-based proteins and have identified key elements that contribute to optimal binding to target sequences and thus transcriptional activation. In general, transcriptional activity was found to be inversely correlated with N-terminal length. For the C-terminus, elements important for DNA binding residues within the first 68 amino acids of the
在一些實施例中,調節劑包含靶向部分,該靶向部分包含經工程改造之DNA結合域(DBD),例如包含TAL效應子重複序列的TAL效應子,該TAL效應子重複序列結合至靶序列,例如可操作地連接至靶基因(例如MYC)的啟動子或轉錄起點(TSS)序列,例如與轉譯調控元件近接的序列,例如包含靶基因(例如MYC)之錨定序列介導性接合體(ASMC)中的錨定序列,例如與錨定序列近接的序列。在一些實施例中,TAL效應域可經工程改造以將表觀遺傳效應部分載送至靶點。In some embodiments, the modulator comprises a targeting moiety comprising an engineered DNA binding domain (DBD), such as a TAL effector comprising a TAL effector repeat that binds to the target Sequences, e.g., a promoter or transcription start (TSS) sequence operably linked to a target gene (e.g., MYC), e.g., sequences in close proximity to translational regulatory elements, e.g., an anchor sequence comprising a target gene (e.g., MYC) to mediate ligation An anchor sequence in a body (ASMC), such as a sequence that is in close proximity to the anchor sequence. In some embodiments, TAL effector domains can be engineered to deliver epigenetic effector moieties to the target.
Zn指域 在一些實施例中,DNA靶向部分為或包含Zn指域。Zn指域包含Zn指,例如天然存在之Zn指,例如天然存在之Zn指或經工程改造之Zn指,或其片段。多種Zn指已為熟習此項技術者所知且可市購,例如購自Sigma-Aldrich。一般而言,Zn指域包含複數個Zn指,其中每個Zn指識別三個核苷酸。Zn指蛋白可包含Zn指域及視情況存在的一或多種其他域。Zn finger domains In some embodiments, the DNA targeting moiety is or comprises a Zn finger domain. A Zn finger domain comprises a Zn finger, such as a naturally occurring Zn finger, such as a naturally occurring Zn finger or an engineered Zn finger, or a fragment thereof. Various Zn fingers are known to those skilled in the art and are commercially available, for example from Sigma-Aldrich. Generally, a Zn finger domain comprises a plurality of Zn fingers, wherein each Zn finger recognizes three nucleotides. A Zn finger protein may comprise a Zn finger domain and optionally one or more other domains.
在一些實施例中,Zn指分子包含非天然存在之Zn指蛋白,其經工程改造以結合至所選的標靶DNA序列。參見例如Beerli等人, (2002) Nature Biotechnol. 20:135-141;Pabo等人, (2001) Ann. Rev. Biochem. 70:313-340;Isalan等人, (2001) Nature Biotechnol. 19:656-660;Segal等人, (2001) Curr. Opin. Biotechnol. 12:632-637;Choo等人, (2000) Curr. Opin. Struct. Biol. 10:411-416;美國專利第6,453,242號、第6,534,261號、第6,599,692號、第6,503,717號、第6,689,558號、第7,030,215號、第6,794,136號、第7,067,317號、第7,262,054號、第7,070,934號、第7,361,635號、第7,253,273號及美國專利公開案第2005/0064474號、第2007/0218528號、第2005/0267061號,該等文獻皆以全文引用之方式併入本文中。In some embodiments, the Zn finger molecule comprises a non-naturally occurring Zn finger protein engineered to bind to a selected target DNA sequence. See eg Beerli et al., (2002) Nature Biotechnol. 20:135-141; Pabo et al., (2001) Ann. Rev. Biochem. 70:313-340; Isalan et al., (2001) Nature Biotechnol. 19:656 -660; Segal et al., (2001) Curr. Opin. Biotechnol. 12:632-637; Choo et al., (2000) Curr. Opin. Struct. Biol. 10:411-416; 6,534,261, 6,599,692, 6,503,717, 6,689,558, 7,030,215, 6,794,136, 7,067,317, 7,262,054, 7,070,934, 7,361,635, 7 , 253,273 and US Patent Publication No. 2005/ No. 0064474, No. 2007/0218528, and No. 2005/0267061, all of which are incorporated herein by reference in their entirety.
與天然存在之Zn指相比,經工程改造之Zn指可具有新穎的結合特異性。工程化方法包括(但不限於)合理設計及各種類型之選擇。合理設計包括例如使用包含三聯體(或四聯體)核苷酸序列及個別Zn指胺基酸序列之資料庫,其中各三聯體或四聯體核苷酸序列與結合特定三聯體或四聯體序列之鋅指的一或多個胺基酸序列相關。參見例如美國專利第6,453,242號及第6,534,261號,該等專利以全文引用的方式併入本文中。The engineered Zn fingers can have novel binding specificities compared to naturally occurring Zn fingers. Engineering methods include, but are not limited to, rational design and selection of various types. Rational design includes, for example, the use of a database comprising triplet (or quadruplet) nucleotide sequences and individual Zn finger amino acid sequences, wherein each triplet or quadruple nucleotide sequence is associated with a particular triplet or quadruple. One or more amino acid sequences of the zinc fingers of the body sequence are related. See, eg, US Patent Nos. 6,453,242 and 6,534,261, which are incorporated herein by reference in their entirety.
例示性選擇方法,包括噬菌體呈現及雙雜交體系統,揭示於美國專利第5,789,538號、第5,925,523號、第6,007,988號、第6,013,453號、第6,410,248號、第6,140,466號、第6,200,759號及第6,242,568號;以及國際專利公開案第WO 98/37186號、第WO 98/53057號、第WO 00/27878號及第WO 01/88197號及第GB 2,338,237號。另外,針對鋅指蛋白之結合特異性的增強已描述於例如國際專利公開案第WO 02/077227號中。Exemplary selection methods, including phage display and two-hybrid systems, are disclosed in US Pat. and International Patent Publication Nos. WO 98/37186, WO 98/53057, WO 00/27878, WO 01/88197 and GB 2,338,237. Additionally, enhancement of binding specificity for zinc finger proteins has been described, for example, in International Patent Publication No. WO 02/077227.
另外,如此等及其他參考文獻中所揭示,鋅指及/或多指鋅指域可利用任何適合的連接子序列(包括例如長度為5個或更多個胺基酸的連接子)連接在一起。關於長度為6個或更多個胺基酸的例示性連接子序列,亦參見美國專利第6,479,626號、第6,903,185號及第7,153,949號。本文所述之蛋白質可在蛋白質之個別鋅指之間包括適合連接子之任何組合。另外,針對鋅指結合域之結合特異性的增強已描述於例如共同擁有之國際專利公開案第WO 02/077227號中。In addition, as disclosed in these and other references, zinc fingers and/or multi-finger zinc finger domains may be linked using any suitable linker sequence, including, for example, linkers of 5 or more amino acids in length. Together. See also US Patent Nos. 6,479,626, 6,903,185 and 7,153,949 for exemplary linker sequences of 6 or more amino acids in length. The proteins described herein can include any combination of suitable linkers between individual zinc fingers of the protein. Additionally, enhancement of binding specificity for zinc finger binding domains has been described, for example, in co-owned International Patent Publication No. WO 02/077227.
用於設計及構築融合蛋白(及編碼其的聚核苷酸)的Zn指及方法已為熟習此項技術者所知且詳細地描述於美國專利第6,140,0815號、第789,538號、第6,453,242號、第6,534,261號、第5,925,523號、第6,007,988號、第6,013,453號及第6,200,759號;國際專利公開案第WO 95/19431號、第WO 96/06166號、第WO 98/53057號、第WO 98/54311號、第WO 00/27878號、第WO 01/60970號、第WO 01/88197號、第WO 02/099084號、第WO 98/53058號、第WO 98/53059號、第WO 98/53060號、第WO 02/016536號及第WO 03/016496號。Zn fingers and methods for designing and constructing fusion proteins (and polynucleotides encoding them) are known to those skilled in the art and are described in detail in U.S. Pat. Nos. 6,140,0815, 789,538, 6,453,242 No. 6,534,261, 5,925,523, 6,007,988, 6,013,453 and 6,200,759; International Patent Publication Nos. WO 95/19431, WO 96/06166, WO 98/53057, WO 98 /54311, WO 00/27878, WO 01/60970, WO 01/88197, WO 02/099084, WO 98/53058, WO 98/53059, WO 98/ 53060, WO 02/016536 and WO 03/016496.
在某些實施例中,DNA靶向部分包含Zn指域,該Zn指域包含結合(以序列特異性方式)至標靶DNA序列的經工程改造之鋅指。在一些實施例中,Zn指域包含一個Zn指或其片段。在一些實施例中,Zn指域包含複數個Zn指(或其片段),例如2、3、4、5、6個或更多個Zn指(及視情況,不超過12、11、10、9、8、7、6、5、4、3或2個Zn指)。在一些實施例中,Zn指域包含至少三個Zn指。在一些實施例中,Zn指域包含四個、五個或六個Zn指。在一些實施例中,Zn指域包含8、9、10、11或12個Zn指。在一些實施例中,包含三個Zn指的Zn指域識別包含9或10個核苷酸的標靶DNA序列。在一些實施例中,包含四個Zn指的Zn指域識別包含12至14個核苷酸的標靶DNA序列。在一些實施例中,包含六個Zn指的Zn指域識別包含18至21個核苷酸的標靶DNA序列。In certain embodiments, a DNA targeting moiety comprises a Zn finger domain comprising an engineered zinc finger that binds (in a sequence-specific manner) to a target DNA sequence. In some embodiments, the Zn finger domain comprises a Zn finger or a fragment thereof. In some embodiments, the Zn finger domain comprises a plurality of Zn fingers (or fragments thereof), such as 2, 3, 4, 5, 6 or more Zn fingers (and optionally no more than 12, 11, 10, 9, 8, 7, 6, 5, 4, 3 or 2 Zn fingers). In some embodiments, the Zn finger domain comprises at least three Zn fingers. In some embodiments, the Zn finger domain contains four, five or six Zn fingers. In some embodiments, the Zn finger domain comprises 8, 9, 10, 11 or 12 Zn fingers. In some embodiments, a Zn finger domain comprising three Zn fingers recognizes a target DNA sequence comprising 9 or 10 nucleotides. In some embodiments, a Zn finger domain comprising four Zn fingers recognizes a target DNA sequence comprising 12 to 14 nucleotides. In some embodiments, a Zn finger domain comprising six Zn fingers recognizes a target DNA sequence comprising 18 to 21 nucleotides.
在一些實施例中,靶向域包含雙手Zn指蛋白。雙手鋅指蛋白為其中鋅指的兩個叢集被中間胺基酸分隔的彼等蛋白質,使得兩個鋅指域結合至兩個不連續的標靶DNA序列。雙手型鋅指結合蛋白的一個實例為SIP1,其中四個鋅指的叢集位於蛋白質的胺基端且三個Zn指的叢集位於羧基末端(參見Remade等人(1999) EMBO Journal 18(18):5073-5084)。此等域中之鋅指的每個叢集能夠結合至獨特的靶序列且兩個靶序列之間的間距可包含多個核苷酸。In some embodiments, the targeting domain comprises a two-handed Zn finger protein. Two-handed zinc finger proteins are those proteins in which the two clusters of zinc fingers are separated by an intermediate amino acid, such that the two zinc finger domains bind to two discrete target DNA sequences. An example of a two-handed zinc finger binding protein is SIP1, in which a cluster of four zinc fingers is located at the amino terminus of the protein and a cluster of three Zn fingers is located at the carboxy terminus (see Remade et al. (1999) EMBO Journal 18(18) :5073-5084). Each cluster of zinc fingers in these domains is capable of binding to a unique target sequence and the spacing between two target sequences can comprise multiple nucleotides.
在一些實施例中,表現抑制子包含靶向部分,該靶向部分包含經工程改造之DNA結合域(DBD),例如包含Zn指(ZFN)的Zn指域,其結合至靶序列,例如可操作地連接至靶基因(例如MYC)的啟動子或轉錄起點(TSS),例如與轉譯調控元件近接的序列,例如包含靶基因(例如MYC)之錨定序列介導性接合體(ASMC)中的錨定序列,例如與錨定序列近接的序列。在一些實施例中,ZFN可經工程改造以將表觀遺傳效應分子載送至靶點。在一些實施例中,靶向部分包含含有2、3、4、5、6、7或8個鋅指的Zn指域。本文所揭示之例示性靶向部分的胺基酸序列列舉於表4中。編碼本文所揭示之例示性靶向部分的核苷酸序列列舉於表5中。在一些實施例中,本文所述之表現抑制子或系統包含具有表4中所示之序列或與其至少70%、75%、80%、85%、90%、95%、98%或99%一致之序列的靶向部分。在一些實施例中,本文所述之核酸包含表5中所示的序列,或與其至少70%、75%、80%、85%、90%、95%、98%或99%一致的序列。
表4:例示性靶向部分的胺基酸序列
在一些實施例中,表現抑制系統包含靶向部分,該靶向部分包含經工程改造之DNA結合域(DBD),例如包含Zn指(ZFN)的Zn指域,其結合至靶序列,例如可操作地連接至靶基因(例如MYC)的啟動子或轉錄起點(TSS)序列,例如與轉譯調控元件近接的序列,例如包含靶基因(例如MYC)之錨定序列介導性接合體(ASMC)中的錨定序列,例如與小鼠基因體中之錨定序列近接的序列。在一些實施例中,ZFN可經工程改造以將表觀遺傳效應分子載送至靶點。在一些實施例中,靶向部分包含含有2、3、4、5、6、7或8個鋅指的Zn指域。本文所揭示之例示性靶向部分的胺基酸序列列舉於表14中。編碼本文所揭示之例示性靶向部分的核苷酸序列列舉於表15中。在一些實施例中,本文所述之表現抑制子或系統包含具有表14中所示之序列或與其至少70%、75%、80%、85%、90%、95%、98%或99%一致之序列的靶向部分。在一些實施例中,本文所述之核酸包含表15中所示的序列,或與其至少70%、75%、80%、85%、90%、95%、98%或99%一致的序列。 表14:例示性小鼠特異性靶向部分的胺基酸序列
核酸分子在一些實施例中,靶向部分為或包含來自核酸酶的DNA結合域。舉例而言,已知歸巢核酸內切酶及巨核酸酶(諸如I-SceI、I-CeuI、PI-PspI、PI-Sce、I-SceIV、I-CsmI、I-PanI、I-SceII、I-PpoI、I-SceIII、I-CreI、I-TevI、I-TevII及I-TevIII)的識別序列。亦參見美國專利第5,420,032號、第6,833,252號;Belfort等人, (1997) Nucleic Acids Res. 25:3379-3388; Dujon等人, (1989) Gene 82:115-118; Perler等人, (1994) Nucleic Acids Res. 22:1125-1127; Jasin (1996) Trends Genet. 12:224-228; Gimble等人, (1996); J. Mol. Biol. 263:163-180; Argast等人, (1998) J. Mol. Biol. 280:345-353及New England Biolabs目錄。另外,歸巢核酸內切酶及巨核酸酶的DNA結合特異性可經工程改造以結合非天然靶點。參見例如Chevalier等人, (2002) Molec. Cell 10:895-905; Epinat等人, (2003) Nucleic Acids Res. 31:2952-2962; Ashworth等人, (2006) Nature 441:656-659; Paques等人, (2007) Current Gene Therapy 7:49-66;美國專利公開案第2007/0117128號。 Nucleic Acid Molecules In some embodiments, the targeting moiety is or comprises a DNA binding domain from a nuclease. For example, known homing endonucleases and meganucleases (such as I-SceI, I-CeuI, PI-PspI, PI-Sce, I-SceIV, I-CsmI, I-PanI, I-SceII, Recognition sequences of I-PpoI, I-SceIII, I-CreI, I-TevI, I-TevII and I-TevIII). See also U.S. Patent Nos. 5,420,032, 6,833,252; Belfort et al., (1997) Nucleic Acids Res. 25:3379-3388; Dujon et al., (1989) Gene 82:115-118; Perler et al., (1994) Nucleic Acids Res. 22:1125-1127; Jasin (1996) Trends Genet. 12:224-228; Gimble et al., (1996); J. Mol. Biol. 263:163-180; Argast et al., (1998) J. Mol. Biol. 280:345-353 and New England Biolabs catalog. In addition, the DNA binding specificities of homing endonucleases and meganucleases can be engineered to bind non-natural targets. See, for example, Chevalier et al., (2002) Molec. Cell 10:895-905; Epinat et al., (2003) Nucleic Acids Res. 31:2952-2962; Ashworth et al., (2006) Nature 441:656-659; Paques et al., (2007) Current Gene Therapy 7:49-66; US Patent Publication No. 2007/0117128.
在一些實施例中,DNA靶向部分包含或為核酸。在一些實施例中,可包括於DNA靶向部分中的核酸可為或包含DNA、RNA及/或人工或合成核酸或核酸類似物或模擬物。舉例而言,在一些實施例中,核酸可為或可包括以下中的一或多者:基因體DNA (gDNA)、互補DNA (cDNA)、肽核酸(PNA)、肽-寡核苷酸偶聯物、鎖定核酸(LNA)、橋接式核酸(BNA)、聚醯胺、形成三螺旋體的寡核苷酸、反義寡核苷酸、tRNA、mRNA、rRNA、miRNA、gRNA、siRNA或其他RNAi分子(例如靶向如本文所述的非編碼RNA及/或靶向與如本文所述之標靶基因體複合物相關之特定基因的表現產物)等。在一些實施例中,核酸可包括不為天然存在之DNA或RNA殘基的一或多個殘基,可包括不為磷酸二酯鍵的一或多個鍵聯(例如可為例如硫代磷酸酯鍵等),且/或可包括一或多個修飾,諸如2'O修飾,諸如2'-OmeP。適用於製備合成核酸的多種核酸結構在此項技術中已知(參見例如WO2017/062862l及WO2014/012081),熟習此項技術者將瞭解此等核酸結構可根據本發明加以利用。In some embodiments, a DNA targeting moiety comprises or is a nucleic acid. In some embodiments, nucleic acids that can be included in a DNA targeting moiety can be or comprise DNA, RNA and/or artificial or synthetic nucleic acids or nucleic acid analogs or mimetics. For example, in some embodiments, the nucleic acid can be or include one or more of: genomic DNA (gDNA), complementary DNA (cDNA), peptide nucleic acid (PNA), peptide-oligonucleotide Conjugates, locked nucleic acids (LNA), bridged nucleic acids (BNA), polyamides, triplex-forming oligonucleotides, antisense oligonucleotides, tRNA, mRNA, rRNA, miRNA, gRNA, siRNA, or other RNAi Molecules (eg targeting non-coding RNAs as described herein and/or targeting expression products of specific genes associated with target gene body complexes as described herein) and the like. In some embodiments, a nucleic acid may include one or more residues that are not naturally occurring DNA or RNA residues, may include one or more linkages that are not phosphodiester bonds (such as may be, for example, phosphorothioate ester bond, etc.), and/or may include one or more modifications, such as a 2'O modification, such as 2'-OmeP. A variety of nucleic acid structures suitable for use in the preparation of synthetic nucleic acids are known in the art (see eg WO2017/0628621 and WO2014/012081), and those skilled in the art will appreciate that such nucleic acid structures can be utilized in accordance with the present invention.
適用於表現抑制子(例如DNA靶向部分)的核酸可包括(但不限於) DNA、RNA、經修飾的寡核苷酸(例如化學修飾,諸如改變主鏈鍵聯、糖分子及/或核酸鹼基的修飾),及人工核酸。在一些實施例中,核酸包括(但不限於)基因體DNA、cDNA、肽核酸(PNA)或肽寡核苷酸偶聯物、鎖定核酸(LNA)、橋接式核酸(BNA)、聚醯胺、形成三螺旋體的寡核苷酸、經修飾的DNA、反義DNA寡核苷酸、tRNA、mRNA、rRNA、經修飾的RNA、miRNA、gRNA,及siRNA或其他RNA或DNA分子。Nucleic acids suitable for expression of inhibitors (e.g., DNA targeting moieties) can include, but are not limited to, DNA, RNA, modified oligonucleotides (e.g., chemical modifications, such as altering backbone linkages, sugar molecules, and/or nucleic acid base modification), and artificial nucleic acids. In some embodiments, nucleic acids include, but are not limited to, gene body DNA, cDNA, peptide nucleic acid (PNA) or peptide-oligonucleotide conjugates, locked nucleic acid (LNA), bridged nucleic acid (BNA), polyamide , triplex forming oligonucleotides, modified DNA, antisense DNA oligonucleotides, tRNA, mRNA, rRNA, modified RNA, miRNA, gRNA, and siRNA or other RNA or DNA molecules.
在一些實施例中,DNA靶向部分包含長度為約15-200、20-200、30-200、40-200、50-200、60-200、70-200、80-200、90-200、100-200、110-200、120-200、130-200、140-200、150-200、160-200、170-200、180-200、190-200、215-190、20-190、30-190、40-190、50-190、60-190、70-190、80-190、90-190、100-190、110-190、120-190、130-190、140-190、150-190、160-190、170-190、180-190、15-180、20-180、30-180、40-180、50-180、60-180、70-180、80-180、90-180、100-180、110-180、120-180、130-180、140-180、150-180、160-180、170-180、15-170、20-170、30-170、40-170、50-170、60-170、70-170、80-170、90-170、100-170、110-170、120-170、130-170、140-170、150-170、160-170、15-160、20-160、30-160、40-160、50-160、60-160、70-160、80-160、90-160、100-160、110-160、120-160、130-160、140-160、150-160、215-150、20-150、30-150、40-150、50-150、60-150、70-150、80-150、90-150、100-150、110-150、120-150、130-150、140-150、15-140、20-140、30-140、40-140、50-140、60-140、70-140、80-140、90-140、100-140、110-140、120-140、130-140、15-130、20-130、30-130、40-130、50-130、60-130、70-130、80-130、90-130、100-130、110-130、120-130、215-120、20-120、30-120、40-120、50-120、60-120、70-120、80-120、90-120、100-120、110-120、15-110、20-110、30-110、40-110、50-110、60-110、70-110、80-110、90-110、100-110、15-100、20-100、30-100、40-100、50-100、60-100、70-100、80-100、90-100、15-90、20-90、30-90、40-90、50-90、60-90、70-90、80-90、15-80、20-80、30-80、40-80、50-80、60-80、70-80、15-70、20-70、30-70、40-70、50-70、60-70、15-60、20-60、30-60、40-60、50-60、15-50、20-50、30-50、40-50、15-40、20-40、30-40、15-30、20-30或15-20個核苷酸或其間任何範圍的核酸。In some embodiments, the DNA targeting moiety comprises a length of about 15-200, 20-200, 30-200, 40-200, 50-200, 60-200, 70-200, 80-200, 90-200, 100-200, 110-200, 120-200, 130-200, 140-200, 150-200, 160-200, 170-200, 180-200, 190-200, 215-190, 20-190, 30- 190, 40-190, 50-190, 60-190, 70-190, 80-190, 90-190, 100-190, 110-190, 120-190, 130-190, 140-190, 150-190, 160-190, 170-190, 180-190, 15-180, 20-180, 30-180, 40-180, 50-180, 60-180, 70-180, 80-180, 90-180, 100- 180, 110-180, 120-180, 130-180, 140-180, 150-180, 160-180, 170-180, 15-170, 20-170, 30-170, 40-170, 50-170, 60-170, 70-170, 80-170, 90-170, 100-170, 110-170, 120-170, 130-170, 140-170, 150-170, 160-170, 15-160, 20- 160, 30-160, 40-160, 50-160, 60-160, 70-160, 80-160, 90-160, 100-160, 110-160, 120-160, 130-160, 140-160, 150-160, 215-150, 20-150, 30-150, 40-150, 50-150, 60-150, 70-150, 80-150, 90-150, 100-150, 110-150, 120- 150, 130-150, 140-150, 15-140, 20-140, 30-140, 40-140, 50-140, 60-140, 70-140, 80-140, 90-140, 100-140, 110-140, 120-140, 130-140, 15-130, 20-130, 30-130, 40-130, 50-130, 60-130, 70-130, 80-130, 90-130, 100- 130, 110-130, 120-130, 215-120, 20-120, 30-120, 40-120, 50-120, 60-120, 70-120, 80-120, 90-120, 100-120, 110-120, 15-110, 20-110, 30-110, 40-110, 50-110, 60-110, 70-110, 80-110, 90-110, 100-110, 15-100, 20- 100, 30-100, 40-100, 50-100, 60-100, 70-100, 80-100, 90-100, 15-90, 20-90, 30-90, 40-90, 50-90, 60-90, 70-90, 80-90, 15-80, 20-80, 30-80, 40-80, 50-80, 60-80, 70-80, 15-70, 20-70, 30- 70, 40-70, 50-70, 60-70, 15-60, 20-60, 30-60, 40-60, 50-60, 15-50, 20-50, 30-50, 40-50, A nucleic acid of 15-40, 20-40, 30-40, 15-30, 20-30 or 15-20 nucleotides or any range therebetween.
效應部分 在一些實施例中,本發明之表現抑制子包含一或多個效應部分。在一些實施例中,效應部分當作為本文所述之表現抑制子或表現抑制系統的一部分使用時,減少靶基因在細胞中的表現。Effector Moieties In some embodiments, the expression inhibitors of the invention comprise one or more effector moieties. In some embodiments, the effector moiety, when used as part of an expression suppressor or expression suppression system described herein, reduces the expression of a target gene in a cell.
在一些實施例中,效應部分具有與靶向部分結合無關的功能。舉例而言,效應部分可靶向(例如結合)靶向部分所靶向之基因體序列元件或與該基因體序列元件近接的基因體複合物組分,或募集轉錄因子。作為另一實例,效應部分可包含酶活性,例如基因修飾功能。In some embodiments, the effector moiety has a function independent of binding of the targeting moiety. For example, an effector moiety can target (eg, bind to) a gene body sequence element targeted by a targeting moiety or a component of a gene body complex in proximity to the gene body sequence element, or recruit a transcription factor. As another example, an effector moiety may comprise enzymatic activity, such as a gene modification function.
在一些實施例中,效應部分包含表觀遺傳修飾部分。在一些實施例中,效應部分包含DNA修飾功能,例如DNA甲基轉移酶。在一些實施例中,效應部分為或包含選自以下的蛋白質:MQ1、DNMT1、DNMT3A1、DNMT3A2、DNMT3B1、DNMT3B2、DNMT3B3、DNMT3B4、DNMT3B5、DNMT3B6、DNMT3L,或其任一者的功能變異體或片段。In some embodiments, the effector moiety comprises an epigenetic modification moiety. In some embodiments, the effector moiety comprises a DNA modifying function, such as a DNA methyltransferase. In some embodiments, the effector moiety is or comprises a protein selected from MQ1, DNMT1, DNMT3A1, DNMT3A2, DNMT3B1, DNMT3B2, DNMT3B3, DNMT3B4, DNMT3B5, DNMT3B6, DNMT3L, or a functional variant or fragment of any of them .
在一些實施例中,效應部分包含轉錄抑制子。在一些實施例中,轉錄抑制子阻斷刺激或促進例如靶基因轉錄之因子募集。在一些實施例中,轉錄抑制子募集抑制例如靶基因轉錄的因子。在一些實施例中,效應部分(例如轉錄抑制子)為或包含選自以下的蛋白質:KRAB、MeCP2、HP1、RBBP4、REST、FOG1、SUZ12,或其任一者的功能變異體或片段。In some embodiments, the effector moiety comprises a transcriptional repressor. In some embodiments, transcriptional repressors block the recruitment of factors that stimulate or promote, for example, transcription of a target gene. In some embodiments, a transcriptional repressor recruits factors that inhibit, for example, transcription of a target gene. In some embodiments, the effector moiety (eg, transcriptional repressor) is or comprises a protein selected from KRAB, MeCP2, HP1, RBBP4, REST, FOG1, SUZ12, or a functional variant or fragment of any of them.
在一些實施例中,效應部分促進表觀遺傳修飾,例如直接地或間接地促進表觀遺傳修飾。舉例而言,效應部分可藉由募集對染色質進行表觀遺傳修飾的內源蛋白來間接地促進表觀遺傳修飾。效應部分可藉由催化表觀遺傳修飾來直接促進表觀遺傳修飾,其中該效應部分包含酶活性且將表觀遺傳標記直接置於染色質上。In some embodiments, the effector moiety promotes an epigenetic modification, eg, directly or indirectly. For example, effector moieties can indirectly contribute to epigenetic modification by recruiting endogenous proteins that epigenetically modify chromatin. The effector moiety can directly promote epigenetic modification by catalyzing the epigenetic modification, wherein the effector moiety comprises enzymatic activity and places the epigenetic mark directly on the chromatin.
在一些實施例中,效應部分包含組蛋白修飾功能,例如組蛋白甲基轉移酶、組蛋白去甲基酶或組蛋白去乙醯酶活性。在一些實施例中,效應部分為或包含選自以下的蛋白質:KDM1A (亦即,LSD1)、KDM1B (亦即,LSD2)、KDM2A、KDM2B、KDM5A、KDM5B、KDM5C、KDM5D、KDM4B、NO66,或其任一者的功能變異體或片段。在一些實施例中,效應部分為或包含選自以下的蛋白質:HDAC1、HDAC2、HDAC3、HDAC4、HDAC5、HDAC6、HDAC7、HDAC8、HDAC9、HDAC10、HDAC11、SIRT1、SIRT2、SIRT3、SIRT4、SIRT5、SIRT6、SIRT7、SIRT8、SIRT9,或其任一者的功能變異體或片段。In some embodiments, the effector moiety comprises a histone modifying function, eg, histone methyltransferase, histone demethylase, or histone deacetylase activity. In some embodiments, the effector moiety is or comprises a protein selected from the group consisting of: KDM1A (ie, LSD1), KDM1B (ie, LSD2), KDM2A, KDM2B, KDM5A, KDM5B, KDM5C, KDM5D, KDM4B, NO66, or Functional variants or fragments of either. In some embodiments, the effector moiety is or comprises a protein selected from the group consisting of: HDAC1, HDAC2, HDAC3, HDAC4, HDAC5, HDAC6, HDAC7, HDAC8, HDAC9, HDAC10, HDAC11, SIRT1, SIRT2, SIRT3, SIRT4, SIRT5, SIRT6 , SIRT7, SIRT8, SIRT9, or a functional variant or fragment of any of them.
在一些實施例中,效應部分包含具有本文所述之功能的蛋白質。在一些實施例中,效應部分為或包含選自以下的蛋白質:KRAB (例如根據NP_056209.2或由NM_015394.5編碼之蛋白質);SET域(例如以下中的SET域:SETDB1 (例如根據NP_001353347.1或由NM_001366418.1編碼之蛋白質);EZH2 (例如根據NP-004447.2或由NM_004456.5編碼之蛋白質);G9A (例如根據NP_001350618.1或由NM_001363689.1編碼之蛋白質);或SUV39H1 (例如根據NP_003164.1或由NM_003173.4編碼之蛋白質));組蛋白去甲基酶LSD1 (例如根據NP_055828.2或由NM_015013.4編碼之蛋白質);FOG1 (例如FOG1的N端殘基)(例如根據NP_722520.2或由NM_153813.3編碼之蛋白質);或KAP1 (例如根據NP_005753.1或由NM_005762.3編碼之蛋白質);其任一者的功能片段或變異體,或序列與任一種上述序列至少80、85、90、91、92、93、94、95、96、97、98或99%一致的多肽。在一些實施例中,效應部分為或包含選自以下的蛋白質:DNMT3A (例如人類DNMT3A)(例如根據NP_072046.2或由NM_022552.4編碼之蛋白質);DNMT3B (例如根據NP_008823.1或由NM_006892.4編碼之蛋白質);DNMT3L (例如根據NP_787063.1或由NM_175867.3編碼之蛋白質);DNMT3A/3L複合物細菌MQ1 (例如根據CAA35058.1或P15840.3);其任一者的功能片段,或序列與任一種上述序列至少80、85、90、91、92、93、94、95、96、97、98或99%一致的多肽。In some embodiments, the effector moiety comprises a protein having a function described herein. In some embodiments, the effector moiety is or comprises a protein selected from: KRAB (e.g., according to NP_056209.2 or a protein encoded by NM_015394.5); a SET domain (e.g., a SET domain in: SETDB1 (e.g., according to NP_001353347. 1 or a protein encoded by NM_001366418.1); EZH2 (for example according to NP-004447.2 or a protein encoded by NM_004456.5); G9A (for example according to NP_001350618.1 or a protein encoded by NM_001363689.1); or SUV39H1 (for example according to NP_003164.1 or a protein encoded by NM_003173.4)); histone demethylase LSD1 (eg according to NP_055828.2 or a protein encoded by NM_015013.4); FOG1 (eg N-terminal residues of FOG1) (eg according to NP_722520.2 or a protein encoded by NM_153813.3); or KAP1 (eg, according to NP_005753.1 or a protein encoded by NM_005762.3); a functional fragment or variant of any of them, or a sequence at least the same as any of the
在另一態樣中,本發明係關於包含一或多個(例如一個)靶向部分及一或多個效應部分的表現抑制子或多肽,其中一或多個效應部分為或包含Krueppel相關盒(KRAB),例如根據NP_056209.2或由NM_015394.5編碼之蛋白質,或其功能變異體或片段。在一些實施例中,KRAB為合成KRAB構築體。在一些實施例中,KRAB包含胺基酸序列SEQ ID NO: 18: In another aspect, the invention relates to expression inhibitors or polypeptides comprising one or more (e.g., one) targeting moieties and one or more effector moieties, wherein one or more effector moieties are or comprise a Krueppel-associated cassette (KRAB), eg the protein according to NP_056209.2 or encoded by NM_015394.5, or a functional variant or fragment thereof. In some embodiments, the KRAB is a synthetic KRAB construct. In some embodiments, KRAB comprises the amino acid sequence of SEQ ID NO: 18:
在一些實施例中,KRAB效應部分由核苷酸序列SEQ ID NO: 51編碼。在一些實施例中,本文所述之核苷酸序列包含序列SEQ ID NO: 51或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。 In some embodiments, the KRAB effector portion is encoded by the nucleotide sequence of SEQ ID NO: 51. In some embodiments, the nucleotide sequences described herein comprise the sequence SEQ ID NO: 51 or a sequence at least 80, 85, 90, 95, 99 or 100% identical thereto, or differ therefrom by no more than 20 positions , 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1.
在一些實施例中,本文所述之多肽或表現抑制子中所用的KRAB為變異體,例如相對於SEQ ID NO: 18之KRAB序列,包含一或多個突變的變異體。在一些實施例中,相對於SEQ ID NO: 18,KRAB變異體包含一或多個胺基酸取代、缺失或插入。In some embodiments, the KRAB used in the polypeptides or expression inhibitors described herein is a variant, eg, a variant comprising one or more mutations relative to the KRAB sequence of SEQ ID NO: 18. In some embodiments, the KRAB variant comprises one or more amino acid substitutions, deletions or insertions relative to SEQ ID NO: 18.
在一些實施例中,多肽或表現抑制子為包含效應部分(為或包含KRAB)及DNA靶向部分的融合蛋白。在一些實施例中,靶向部分為或包含鋅指域、TAL域或CRISPR/Cas域,例如包含CRISPR/Cas蛋白,例如dCas9蛋白。在一些實施例中,多肽或表現抑制子包含本文所述之其他部分。在一些實施例中,多肽或表現抑制子減少靶基因(例如MYC)的表現。在一些實施例中,該多肽或表現抑制子可用於調節(例如降低)基因表現的方法、治療病狀的方法或對靶基因(例如MYC)或本文所述之轉錄控制元件進行表觀遺傳修飾(例如代替表現抑制系統)的方法中。在一些實施例中,表現抑制系統包含兩種或更多種(例如兩種、三種或四種)表現抑制子,其中第一表現抑制子包含效應部分,該效應部分包含SEQ ID NO: 18之KRAB序列或其功能變異體或片段。In some embodiments, the polypeptide or expression inhibitor is a fusion protein comprising an effector moiety (being or comprising KRAB) and a DNA targeting moiety. In some embodiments, the targeting moiety is or comprises a zinc finger domain, a TAL domain or a CRISPR/Cas domain, eg comprising a CRISPR/Cas protein, eg a dCas9 protein. In some embodiments, the polypeptide or expression inhibitor comprises other moieties described herein. In some embodiments, the polypeptide or expression suppressor reduces the expression of a target gene (eg, MYC). In some embodiments, the polypeptide or inhibitor of expression can be used in a method of modulating (e.g., reducing) gene expression, in a method of treating a condition, or in making epigenetic modifications to a target gene (e.g., MYC) or a transcriptional control element described herein (e.g. in place of performance suppression systems). In some embodiments, the expression inhibition system comprises two or more (e.g., two, three or four) expression inhibitors, wherein the first expression inhibitor comprises an effector moiety comprising a sequence of SEQ ID NO: 18. KRAB sequences or functional variants or fragments thereof.
在另一態樣中,本發明係關於包含一或多個(例如一個)靶向部分及一或多個效應部分的表現抑制子或多肽,其中一或多個效應部分為或包含MQ1,例如細菌MQ1,或其功能變異體或片段。在一些實施例中,MQ1為柔膜菌螺原體( Mollicutes spiroplasma) MQ1。在一些實施例中,MQ1為胡蜂螺原體( Spiroplasma monobiae) MQ1。在一些實施例中,MQ1為來自菌株ATCC 33825且/或與Uniprot ID P15840對應的MQ1。在一些實施例中,MQ1包含胺基酸序列SEQ ID NO: 19。在一些實施例中,MQ1包含胺基酸序列SEQ ID NO: 87。在一些實施例中,本文所述之效應域包含SEQ ID NO: 19或87,或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。 In another aspect, the invention relates to expression inhibitors or polypeptides comprising one or more (e.g. one) targeting moieties and one or more effector moieties, wherein one or more effector moieties are or comprise MQ1, e.g. Bacterial MQ1, or a functional variant or fragment thereof. In some embodiments, MQ1 is Mollicutes spiroplasma MQ1. In some embodiments, MQ1 is Spiroplasma monobiae MQ1. In some embodiments, the MQ1 is a MQ1 from strain ATCC 33825 and/or corresponding to Uniprot ID P15840. In some embodiments, MQ1 comprises the amino acid sequence of SEQ ID NO: 19. In some embodiments, MQ1 comprises the amino acid sequence of SEQ ID NO: 87. In some embodiments, an effector domain described herein comprises SEQ ID NO: 19 or 87, or a sequence at least 80, 85, 90, 95, 99, or 100% identical thereto, or differs therefrom in no more than 20 positions , 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1.
在一些實施例中,MQ1由核苷酸序列SEQ ID NO: 52或132編碼。在一些實施例中,本文所述之核酸包含SEQ ID NO: 52、132之序列,或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。 In some embodiments, MQ1 is encoded by the nucleotide sequence of SEQ ID NO: 52 or 132. In some embodiments, the nucleic acids described herein comprise the sequence of SEQ ID NO: 52, 132, or a sequence at least 80, 85, 90, 95, 99, or 100% identical thereto, or differ therefrom by no more than A sequence of 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1.
在一些實施例中,本文所述之多肽或表現抑制子中所用的MQ1為變異體,例如相對於野生型MQ1 (例如SEQ ID NO: 19),包含一或多個突變。在一些實施例中,相對於野生型MQ1,例如SEQ ID NO: 19之MQ1,MQ1變異體包含一或多個胺基酸取代、缺失或插入。在一些實施例中,MQ1變異體包含K297P取代。在一些實施例中,MQ1變異體包含N299C取代。在一些實施例中,MQ1變異體包含E301Y取代。在一些實施例中,MQ1變異體包含Q147L取代(例如相對於野生型MQ1,具有減小的DNA甲基轉移酶活性)。在一些實施例中,MQ1變異體包含K297P、N299C及E301Y取代(例如相對於野生型MQ1,具有減小的DNA結合親和力)。在一些實施例中,MQ1變異體包含Q147L、K297P、N299C及E301Y取代(例如相對於野生型MQ1,具有減小的DNA甲基轉移酶活性及DNA結合親和力)。In some embodiments, the MQ1 used in the polypeptides or expression suppressors described herein is a variant, eg, comprising one or more mutations relative to wild-type MQ1 (eg, SEQ ID NO: 19). In some embodiments, the MQ1 variant comprises one or more amino acid substitutions, deletions or insertions relative to wild-type MQ1, eg, MQ1 of SEQ ID NO: 19. In some embodiments, the MQ1 variant comprises a K297P substitution. In some embodiments, the MQ1 variant comprises a N299C substitution. In some embodiments, the MQ1 variant comprises an E301Y substitution. In some embodiments, the MQ1 variant comprises a Q147L substitution (eg, has reduced DNA methyltransferase activity relative to wild-type MQ1 ). In some embodiments, the MQ1 variant comprises K297P, N299C, and E301Y substitutions (eg, has reduced DNA binding affinity relative to wild-type MQ1 ). In some embodiments, the MQ1 variant comprises Q147L, K297P, N299C, and E301Y substitutions (eg, has reduced DNA methyltransferase activity and DNA binding affinity relative to wild-type MQ1 ).
在一些實施例中,多肽或表現抑制子為包含效應部分及靶向部分的融合蛋白,該效應部分為或包含MQ1,且該靶向部分為或包含鋅指域、TAL域或CRISPR/Cas域、dCas9域。在一些實施例中,多肽或表現抑制子包含本文所述之其他部分。在一些實施例中,多肽或表現抑制子減少靶基因(例如MYC)的表現。在一些實施例中,該多肽或表現抑制子可用於調節(例如降低)基因表現的方法、治療病狀的方法或對靶基因(例如MYC)或本文所述之轉錄控制元件進行表觀遺傳修飾(例如代替表現抑制系統)的方法中。在一些實施例中,表現抑制系統包含兩種或更多種(例如兩種、三種或四種)表現抑制子,其中第一表現抑制子包含效應部分,該效應部分包含MQ1,例如細菌MQ1,或其功能變異體或片段。In some embodiments, the polypeptide or expression inhibitor is a fusion protein comprising an effector portion and a targeting portion, the effector portion being or comprising MQ1, and the targeting portion being or comprising a zinc finger domain, a TAL domain or a CRISPR/Cas domain , dCas9 domain. In some embodiments, the polypeptide or expression inhibitor comprises other moieties described herein. In some embodiments, the polypeptide or expression suppressor reduces the expression of a target gene (eg, MYC). In some embodiments, the polypeptide or inhibitor of expression can be used in a method of modulating (e.g., reducing) gene expression, in a method of treating a condition, or in making epigenetic modifications to a target gene (e.g., MYC) or a transcriptional control element described herein (e.g. in place of performance suppression systems). In some embodiments, the expression suppression system comprises two or more (e.g., two, three or four) expression suppressors, wherein the first expression suppressor comprises an effector moiety comprising MQ1, such as bacterial MQ1, or a functional variant or fragment thereof.
在另一態樣中,本發明係關於包含一或多個(例如一個)靶向部分及一或多個效應部分的表現抑制子或多肽,其中一或多個效應部分為或包含DNMT1,例如人類DNMT1,或其功能變異體或片段。在一些實施例中,DNMT1為人類DNMT1,例如對應於基因ID 1786,例如對應於UniProt ID P26358.2。在一些實施例中,DNMT1包含胺基酸序列SEQ ID NO: 20。在一些實施例中,本文所述之效應域包含根據SEQ ID NO: 20的序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列: In another aspect, the invention relates to expression inhibitors or polypeptides comprising one or more (e.g. one) targeting moieties and one or more effector moieties, wherein one or more effector moieties are or comprise DNMT1, e.g. Human DNMT1, or a functional variant or fragment thereof. In some embodiments, DNMT1 is human DNMT1, eg, corresponding to Gene ID 1786, eg, corresponding to UniProt ID P26358.2. In some embodiments, DNMT1 comprises the amino acid sequence of SEQ ID NO: 20. In some embodiments, the effector domain described herein comprises a sequence according to SEQ ID NO: 20 or a sequence at least 80, 85, 90, 95, 99 or 100% identical thereto, or differs therefrom by no more than 20 positions , 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 sequence:
在一些實施例中,DNMT1由核苷酸序列SEQ ID NO: 53編碼。在一些實施例中,本文所述之核酸包含序列SEQ ID NO: 53或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列 In some embodiments, DNMT1 is encoded by the nucleotide sequence of SEQ ID NO: 53. In some embodiments, a nucleic acid described herein comprises the sequence SEQ ID NO: 53 or a sequence at least 80, 85, 90, 95, 99 or 100% identical thereto, or differs therefrom by no more than 20, 19, Sequence of 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1
在一些實施例中,本文所述之多肽或表現抑制子中所用的DNMT1為變異體,例如相對於SEQ ID NO: 20之DNMT序列,包含一或多個突變。在一些實施例中,相對於野生型DNMT1,效應域包含一或多個胺基酸取代、缺失或插入。在一些實施例中,多肽為包含抑制域(其為或包含DNMT1)及靶向部分的融合蛋白。在一些實施例中,靶向部分為或包含鋅指域、TAL域或CRISPR/Cas域,例如dCas9域。在一些實施例中,表現抑制系統包含兩種或更多種(例如兩種、三種或四種)表現抑制子,其中第一表現抑制子包含效應部分,該效應部分包含DNMT1,或其功能變異體或片段。In some embodiments, the DNMT1 used in the polypeptides or expression inhibitors described herein is a variant, eg, comprising one or more mutations relative to the DNMT sequence of SEQ ID NO: 20. In some embodiments, the effector domain comprises one or more amino acid substitutions, deletions or insertions relative to wild-type DNMT1. In some embodiments, the polypeptide is a fusion protein comprising an inhibitory domain that is or comprises DNMT1 and a targeting moiety. In some embodiments, the targeting moiety is or comprises a zinc finger domain, a TAL domain, or a CRISPR/Cas domain, such as a dCas9 domain. In some embodiments, the expression suppression system comprises two or more (e.g., two, three or four) expression suppressors, wherein the first expression suppressor comprises an effector moiety comprising DNMT1, or a functional variant thereof body or fragment.
在另一態樣中,本發明係關於包含一或多個(例如一個)靶向部分及一或多個效應部分的表現抑制子或多肽,其中該一或多個效應部分為或包含DNMT3a/3L複合物,或其功能變異體或片段。在一些實施例中,DNMT3a/3L複合物融合構築體。在一些實施例中,DNMT3a/3L複合物包含DNMT3A (例如人類DNMT3A)(例如根據NP_072046.2或由NM_022552.4編碼之蛋白質)。在一些實施例中,DNMT3a/3L複合物包含DNMT3L (例如根據NP_787063.1或由NM_175867.3編碼之蛋白質)。在一些實施例中,DNMT3a/3L包含胺基酸序列SEQ ID NO: 21或SEQ ID NO: 114。在一些實施例中,本文所述之效應域包含SEQ ID NO: 21或SEQ ID NO: 114,或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。 In another aspect, the invention relates to expression inhibitors or polypeptides comprising one or more (eg, one) targeting moieties and one or more effector moieties, wherein the one or more effector moieties are or comprise DNMT3a/ 3L complex, or a functional variant or fragment thereof. In some embodiments, the DNMT3a/3L complex is a fusion construct. In some embodiments, the DNMT3a/3L complex comprises DNMT3A (eg, human DNMT3A) (eg, a protein according to NP_072046.2 or encoded by NM_022552.4). In some embodiments, the DNMT3a/3L complex comprises DNMT3L (eg, a protein according to NP_787063.1 or encoded by NM_175867.3). In some embodiments, DNMT3a/3L comprises the amino acid sequence of SEQ ID NO: 21 or SEQ ID NO: 114. In some embodiments, an effector domain described herein comprises SEQ ID NO: 21 or SEQ ID NO: 114, or a sequence at least 80, 85, 90, 95, 99, or 100% identical thereto, or a position that differs therefrom A sequence of numbers not exceeding 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1.
在一些實施例中,DNMT3a/3L由核苷酸序列SEQ ID NO: 54編碼。在一些實施例中,本文所述之核酸包含序列SEQ ID NO: 54或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。 In some embodiments, DNMT3a/3L is encoded by the nucleotide sequence of SEQ ID NO: 54. In some embodiments, the nucleic acids described herein comprise the sequence SEQ ID NO: 54 or a sequence at least 80, 85, 90, 95, 99 or 100% identical thereto, or differ therefrom by no more than 20, 19, The sequence of 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1.
在一些實施例中,本文所述之多肽或表現抑制子中所用的DNMT3a/3L為變異體,例如相對於SEQ ID NO: 21或SEQ ID NO: 114之DNMT3a/3L,包含一或多個突變。在一些實施例中,相對於SEQ ID NO: 21或SEQ ID NO: 114,DNMT3a/3L變異體包含一或多個胺基酸取代、缺失或插入。在一些實施例中,多肽或表現抑制子為包含效應部分(為或包含DNMT3a/3L)及靶向部分的融合蛋白。在一些實施例中,靶向部分為或包含鋅指域、TAL域或CRISPR/Cas域,例如dCas9域。在一些實施例中,表現抑制系統包含兩種或更多種(例如兩種、三種或四種)表現抑制子,其中第一表現抑制子包含效應部分,該效應部分包含DNMT3a/3L,或其功能變異體或片段。In some embodiments, the DNMT3a/3L used in the polypeptides or expression inhibitors described herein is a variant, e.g., comprising one or more mutations relative to DNMT3a/3L of SEQ ID NO: 21 or SEQ ID NO: 114 . In some embodiments, the DNMT3a/3L variant comprises one or more amino acid substitutions, deletions or insertions relative to SEQ ID NO: 21 or SEQ ID NO: 114. In some embodiments, the polypeptide or expression inhibitor is a fusion protein comprising an effector moiety (being or comprising DNMT3a/3L) and a targeting moiety. In some embodiments, the targeting moiety is or comprises a zinc finger domain, a TAL domain, or a CRISPR/Cas domain, such as a dCas9 domain. In some embodiments, the expression suppression system comprises two or more (e.g., two, three or four) expression suppressors, wherein the first expression suppressor comprises an effector moiety comprising DNMT3a/3L, or Functional variants or fragments.
在一些實施例中,效應部分為或包含多肽。在一些實施例中,效應部分為或包含核酸。在一些實施例中,效應部分為化學試劑,例如調節胞嘧啶I或腺嘌呤(A)的化學試劑(例如亞硫酸氫鈉、亞硫酸氫銨)。在一些實施例中,效應部分具有酶活性(例如甲基轉移酶、去甲基酶、核酸酶(例如Cas9)或去胺酶活性)。效應部分可為或包含以下中的一或多者:小分子、肽、核酸、奈米粒子、適體,或具有不良PK/PD的醫藥劑。In some embodiments, the effector moiety is or comprises a polypeptide. In some embodiments, an effector moiety is or comprises a nucleic acid. In some embodiments, the effector moiety is a chemical reagent, such as a chemical reagent that modulates cytosine I or adenine (A) (eg, sodium bisulfite, ammonium bisulfite). In some embodiments, the effector moiety has enzymatic activity (eg, methyltransferase, demethylase, nuclease (eg, Cas9), or deaminase activity). The effector moiety can be or comprise one or more of the following: small molecules, peptides, nucleic acids, nanoparticles, aptamers, or pharmaceutical agents with undesirable PK/PD.
在一些實施例中,效應部分可包含肽配位體、全長蛋白質、蛋白質片段、抗體、抗體片段及/或靶向適體。在一些實施例中,蛋白質可結合受體,諸如細胞外受體、神經肽、激素肽、肽藥物、毒性肽、病毒或微生物肽、合成肽,或促效劑或拮抗劑肽。In some embodiments, effector moieties may comprise peptide ligands, full-length proteins, protein fragments, antibodies, antibody fragments, and/or targeting aptamers. In some embodiments, proteins can bind receptors, such as extracellular receptors, neuropeptides, hormone peptides, peptide drugs, toxic peptides, viral or microbial peptides, synthetic peptides, or agonist or antagonist peptides.
在一些實施例中,效應部分可包含抗原、抗體、抗體片段(諸如單域抗體)、配位體或受體,諸如升糖素樣肽-1 (GLP-1)、GLP-2受體2、膽囊收縮素B (CCKB)或生長抑素受體;肽治療劑,諸如結合至特定細胞表面受體(諸如G蛋白偶聯受體(GPCR))或離子通道的彼等物;合成肽或來自天然生物活性肽的肽類似物、抗微生物肽、成孔肽、靶向腫瘤的肽或細胞毒性肽,或降解肽或自摧毀肽,諸如誘導細胞凋亡的肽信號或光敏肽。In some embodiments, the effector moiety may comprise an antigen, an antibody, an antibody fragment (such as a single domain antibody), a ligand, or a receptor, such as glucagon-like peptide-1 (GLP-1 ), GLP-2
如本文所述之效應部分中所用的肽或蛋白質部分亦可包括小抗原結合肽,例如抗原結合抗體或抗體樣片段,諸如單鏈抗體、奈米抗體(參見例如Steeland等人, 2016. Nanobodies as therapeutics: big opportunities for small antibodies. Drug Discov Today: 21(7):1076-113)。此類小抗原結合肽可結合胞溶質抗原、細胞核抗原或細胞器內抗原。Peptides or protein moieties as used in effector moieties as described herein may also include small antigen-binding peptides, such as antigen-binding antibodies or antibody-like fragments, such as single-chain antibodies, nanobodies (see e.g. Steeland et al., 2016. Nanobodies as therapeutics: big opportunities for small antibodies. Drug Discov Today: 21(7):1076-113). Such small antigen-binding peptides can bind cytosolic, nuclear or intracellular antigens.
在一些實施例中,效應部分包含負顯性組分(例如負顯性部分),例如識別且結合序列(例如錨定序列,例如CTCF結合模體)、但具有非活性(例如突變)二聚域(例如不能形成功能錨定序列介導性接合體的二聚域)的蛋白質,或結合至基因體複合物之組分(例如轉錄因子亞單元等)的蛋白質,從而防止形成功能轉錄因子等。舉例而言,可改變CTCF的鋅指域,以便其結合特異性錨定序列(藉由添加識別側接核酸的鋅指),同時改變均二聚域,以防止經工程改造之CTCF與內源形式之CTCF之間發生相互作用。在一些實施例中,負顯性組分包含合成的成核多肽,其對標靶錨定序列介導性接合體內的錨定序列具有選定的結合親和力。在一些實施例中,結合親和力可比與標靶錨定序列締合之內源成核多肽(例如CTCF)的結合親和力的至少10%、15%、20%、25%、30%、35%、40%、45%、50%、55%、60%、65%、70%、75%、80%、85%、90%、95%或更高或更低。合成成核多肽與相應的內源成核多肽可具有30-90%、30-85%、30-80%、30-70%、50-80%、50-90%胺基酸序列一致性。成核多肽可經由諸如競爭性結合來調節(例如破壞)例如與內源成核多肽競爭結合至其錨定序列。In some embodiments, the effector moiety comprises a dominant negative component (e.g., a dominant negative portion), e.g., a recognition and binding sequence (e.g., an anchor sequence, e.g., a CTCF binding motif), but has an inactive (e.g., mutated) dimerization domains (such as dimerization domains that cannot form functional anchor sequence-mediated adapters), or proteins that bind to components of gene body complexes (such as transcription factor subunits, etc.), thereby preventing the formation of functional transcription factors, etc. . For example, the zinc finger domain of CTCF can be altered so that it binds specific anchor sequences (by adding zinc fingers that recognize flanking nucleic acids), while the homodimerization domain can be altered to prevent the engineered CTCF from interacting with endogenous Interactions occur between forms of CTCF. In some embodiments, the dominant-negative component comprises a synthetic nucleating polypeptide that has a selected binding affinity for the anchor sequence within the target anchor sequence-mediated adapter. In some embodiments, the binding affinity is comparable to at least 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or higher or lower. A synthetic nucleating polypeptide may have 30-90%, 30-85%, 30-80%, 30-70%, 50-80%, 50-90% amino acid sequence identity to a corresponding endogenous nucleating polypeptide. A nucleating polypeptide can modulate (eg, disrupt) eg, compete with an endogenous nucleating polypeptide for binding to its anchor sequence via, for example, competitive binding.
在一些實施例中,效應部分包含抗體或其片段。在一些實施例中,經由使用效應部分來改變靶基因(例如MYC)表現,該等效應部分為或包含一或多種抗體或其片段。在一些實施例中,經由使用效應部分來改變基因表現,該等效應部分為或包含一或多種抗體(或其片段)及dCas9。In some embodiments, the effector moiety comprises an antibody or fragment thereof. In some embodiments, target gene (eg, MYC) expression is altered through the use of effector moieties that are or comprise one or more antibodies or fragments thereof. In some embodiments, gene expression is altered through the use of effector moieties that are or comprise one or more antibodies (or fragments thereof) and dCas9.
在一些實施例中,效應部分中所用的抗體或其片段可為單株的。抗體可為融合物、嵌合抗體、非人類化抗體、部分或完全人類化抗體等。如熟習此項技術者所瞭解,所用抗體的形式可為相同或不同的,此視指定的標靶而定。In some embodiments, the antibody or fragment thereof used in the effector portion may be monoclonal. Antibodies can be fusions, chimeric antibodies, non-humanized antibodies, partially or fully humanized antibodies, and the like. As will be appreciated by those skilled in the art, the format of the antibody used may be the same or different, depending on the intended target.
在一些實施例中,效應部分包含接合成核分子、編碼接合成核分子的核酸,或其組合。接合成核分子可為例如CTCF、黏合素、USF1、YY1、TATA盒結合蛋白相關因子3 (TAF3)、ZNF143結合模體,或促進錨定序列介導性接合體形成的另一多肽。接合成核分子可為內源多肽或其他蛋白質,諸如轉錄因子,例如自體免疫調控因子(AIRE)、另一種因子,例如X-不活化特異性轉錄本(XIST),或經工程改造之多肽,該多肽經工程改造以識別所關注之特異性DNA序列,例如具有用於序列識別的鋅指、白胺酸拉鏈或bHLH域。接合成核分子可調節錨定序列介導性接合體內或周圍的DNA相互作用(例如與靶向部分所靶向的基因體序列元件相關或包含靶向部分所靶向的基因體序列元件)。舉例而言,接合成核分子可將其他因子募集至改變錨定序列介導性接合體形成或斷裂的錨定序列。In some embodiments, the effector moiety comprises an engaging nucleating molecule, a nucleic acid encoding an engaging nucleating molecule, or a combination thereof. The conjugative nucleating molecule can be, for example, CTCF, cohesin, USF1, YY1, TATA box-binding protein-associated factor 3 (TAF3), ZNF143 binding motif, or another polypeptide that facilitates anchor sequence-mediated conjugate formation. Engaging nuclear molecules can be endogenous polypeptides or other proteins, such as transcription factors, such as autoimmune regulatory element (AIRE), another factor, such as X-inactivation-specific transcripts (XIST), or engineered polypeptides , the polypeptide is engineered to recognize a specific DNA sequence of interest, for example with zinc fingers, leucine zippers or bHLH domains for sequence recognition. The conjugating nuclear molecule can modulate DNA interactions in or around the anchor sequence-mediated conjugator (eg, associated with or comprising a gene body sequence element targeted by the targeting moiety). For example, conjugation nucleases can recruit other factors to anchor sequences that alter anchor sequence-mediated conjugate formation or fragmentation.
接合成核分子亦可具有均二聚或雜二聚的二聚域。一或多種接合成核分子(例如經工程改造之內源接合成核分子)可相互作用而形成錨定序列介導性接合體。在一些實施例中,接合成核分子經工程改造以進一步包括使錨定序列介導性接合體穩定的穩定域,例如內聚相互作用域。在一些實施例中,接合成核分子經工程改造以結合靶序列,例如調節靶序列結合親和力。在一些實施例中,接合成核分子經選擇或工程改造而對錨定序列介導性接合體內待錨定序列具有選定的結合親和力。Conjugated nucleating molecules may also have homodimeric or heterodimeric dimerization domains. One or more conjugating nucleating molecules (eg, engineered endogenous conjugating nucleating molecules) can interact to form an anchor sequence-mediated adapter. In some embodiments, the conjugative nucleating molecule is engineered to further comprise a stabilizing domain, such as a cohesive interaction domain, that stabilizes the anchor sequence-mediated conjugator. In some embodiments, engaging nucleosynthetic molecules are engineered to bind a target sequence, eg, modulate target sequence binding affinity. In some embodiments, the conjugative nucleating molecule is selected or engineered to have a selected binding affinity for the sequence to be anchored within the anchor sequence-mediated conjugator.
接合成核分子及其相應的錨定序列可經由使用CTCF中含有不活化突變之細胞及染色體構象捕捉方法或基於3C之方法加以鑑定,例如Hi-C或高通量定序,以檢查拓樸相關域,例如在缺乏CTCF的情況下,遠端DNA區域或基因座之間的拓樸相互作用。亦可鑑定長程DNA相互作用。其他分析可包括ChIA-PET分析,其利用誘鉺(諸如黏合素、YY1或USF1、ZNF143結合模體,及MS)鑑定與誘鉺相關的複合物。Conjugated nucleogenic molecules and their corresponding anchor sequences can be identified by using cell and chromosome conformation capture methods containing inactivating mutations in CTCF or 3C-based methods such as Hi-C or high-throughput sequencing to examine topology Related domains, such as topological interactions between distal DNA regions or loci in the absence of CTCF. Long-range DNA interactions can also be identified. Other assays may include ChIA-PET analysis using baits such as cohesin, YY1 or USF1, ZNF143 binding motifs, and MS to identify complexes associated with baits.
在一些實施例中,效應部分包含蛋白質的DNA結合域。在一些實施例中,效應部分中的DNA結合域增強或改變調節劑的靶向,但不能單獨達成調節劑的完全靶向(例如靶向部分仍需達成調節劑的靶向)。在一些實施例中,DNA結合域增強調節劑的靶向。在一些實施例中,DNA結合域增強調節劑的功效。DNA結合蛋白具有熟習此項技術者已知的不同結構模體,其在結合DNA方面起重要作用。在一些實施例中,DNA結合域包含螺旋-轉角-螺旋(HTH)模體(抑制子蛋白質中通用的DNA識別模體)。此類模體包含兩個螺旋,其中之一識別側鏈提供結合特異性的DNA (亦稱為識別螺旋)。此類模體常用於調控涉及發育過程的蛋白質。有時,超過一種蛋白質競爭相同序列或識別相同DNA片段。不同蛋白質的不同之處可在於其對相同序列或DNA構象的親和力分別經由氫鍵、鹽橋及凡得瓦爾相互作用(Van der Waals interactions)達成。In some embodiments, the effector moiety comprises the DNA binding domain of a protein. In some embodiments, the DNA binding domain in the effector moiety enhances or alters the targeting of the modulator, but does not alone achieve full targeting of the modulator (eg, the targeting moiety is still required to achieve targeting of the modulator). In some embodiments, the DNA binding domain enhances targeting of the modulator. In some embodiments, the DNA binding domain enhances the efficacy of the modulator. DNA binding proteins have different structural motifs known to those skilled in the art, which play an important role in binding DNA. In some embodiments, the DNA binding domain comprises a helix-turn-helix (HTH) motif, a common DNA recognition motif in repressor proteins. Such motifs comprise two helices, one of which recognizes the DNA whose side chain provides binding specificity (also known as the recognition helix). Such motifs are commonly used to regulate proteins involved in developmental processes. Sometimes, more than one protein competes for the same sequence or recognizes the same DNA segment. Different proteins may differ in their affinity for the same sequence or DNA conformation via hydrogen bonds, salt bridges and Van der Waals interactions, respectively.
在一些實施例中,DNA結合域包含螺旋-髮夾-螺旋(HhH)模體。具有HhH結構模體的DNA結合蛋白可涉及非序列特異性DNA結合,該結合係經由蛋白質主鏈氮與DNA磷酸酯基團之間形成氫鍵而發生。In some embodiments, the DNA binding domain comprises a helix-hairpin-helix (HhH) motif. DNA-binding proteins with the HhH structural motif can be involved in non-sequence-specific DNA binding via hydrogen bond formation between protein backbone nitrogens and DNA phosphate groups.
在一些實施例中,DNA結合域包含螺旋-環-螺旋(HLH)模體。具有HLH結構模體的DNA結合蛋白為轉錄調控性蛋白質且主要與寬範圍的發育過程有關。就殘基而言,HLH結構模體比HTH或HhH模體長。許多此等蛋白質相互作用而形成均二聚體及雜二聚體。結構模體係由兩個長螺旋區域構成,其中N端螺旋結合至DNA,而複合區域允許蛋白質發生二聚。In some embodiments, the DNA binding domain comprises a helix-loop-helix (HLH) motif. DNA-binding proteins with the HLH structural motif are transcriptional regulatory proteins and are primarily associated with a wide range of developmental processes. In terms of residues, the HLH structural motif is longer than the HTH or HhH motifs. Many of these proteins interact to form homodimers and heterodimers. The structural module system consists of two long helical regions, where the N-terminal helix binds to DNA, and the composite region allows protein dimerization.
在一些實施例中,DNA結合域包含白胺酸拉鏈模體。在一些轉錄因子中,針對DNA的二聚體結合位點形成白胺酸拉鏈。此模體包括兩個兩親媒性螺旋(一個來自各亞單元),該兩個螺旋彼此相互作用,產生左手捲曲螺旋超二級結構。白胺酸拉鏈為一個螺旋中之有規律間隔之白胺酸殘基與來自相鄰螺旋之白胺酸的交錯接合。涉及白胺酸拉鏈的螺旋大部分展現七肽序列(abcdefg),其中殘基 a及 d具疏水性且其他殘基具親水性。白胺酸拉鏈模體可介導均二聚體或雜二聚體形成。 In some embodiments, the DNA binding domain comprises a leucine zipper motif. In some transcription factors, a leucine zipper is formed against the dimer binding site of DNA. This motif includes two amphipathic helices (one from each subunit) that interact with each other to generate a left-handed coiled-coil supersecondary structure. A leucine zipper is an interleaving junction of regularly spaced leucine residues in one helix with leucine from an adjacent helix. The helices involved in the leucine zipper mostly exhibit a heptad sequence (abcdefg) where residues a and d are hydrophobic and the others are hydrophilic. The leucine zipper motif can mediate homodimer or heterodimer formation.
在一些實施例中,DNA結合域包含Zn指域,其中Zn ++離子與2個Cys及2個His殘基配位。此類轉錄因子包括具有化學計量ββ 'α大三聚體。Zn ++配位的表觀效應為使小複合物結構而非疏水性核心殘基穩定。各Zn指係以構形相同方式與雙螺旋之較大溝槽中的連續三重鹼基對鏈段發生相互作用。蛋白質-DNA相互作用係根據兩個因素測定:(i) α-螺旋與DNA鏈段之間的氫鍵相互作用,大部分為Arg殘基與鳥嘌呤鹼基之間的氫鍵相互作用。(ii)與DNA磷酸酯主鏈的H鍵相互作用,大部分為與Arg及His的H鍵相互作用。替代的Zn指模體將Zn ++與6個Cys螯合在一起。 In some embodiments, the DNA binding domain comprises a Zn finger domain where the Zn ++ ion is coordinated to 2 Cys and 2 His residues. Such transcription factors include large trimers with stoichiometry ββ'α. The apparent effect of Zn ++ coordination is to stabilize the small complex structure rather than the hydrophobic core residues. Each Zn finger interacts in a conformationally identical manner with consecutive triple base pair segments in the larger groove of the duplex. Protein-DNA interactions are determined based on two factors: (i) hydrogen-bonding interactions between α-helices and DNA segments, mostly between Arg residues and guanine bases. (ii) Interact with the H-bonds of the DNA phosphate main chain, mostly with the H-bonds of Arg and His. An alternative Zn finger motif chelates Zn ++ with 6 Cys.
在一些實施例中,DNA結合域包含TATA盒結合蛋白(TBP)。TBP最初鑑定為II類起始因子TFIID的組分。此等結合蛋白參與所有三種核RNA聚合酶的轉錄,充當其每一者中的亞單元。TBP結構顯示具有89-90個胺基酸的兩個α/β結構域。TBP的C端或核心區域以高親和力結合至TATA共同序列(TATAa/tAa/t,SEQ ID NO: 210),該共同序列識別小溝決定子且促進DNA彎曲。TBP類似於分子鞍。結合側襯有10股反向平行β-薄片中的中心8個股。上表面含有四個α-螺旋且結合至轉錄機構之各種組分。In some embodiments, the DNA binding domain comprises a TATA box binding protein (TBP). TBP was originally identified as a component of the class II initiation factor TFIID. These binding proteins are involved in the transcription of all three nuclear RNA polymerases, serving as subunits in each of them. The TBP structure shows two α/β domains of 89-90 amino acids. The C-terminal or core region of TBP binds with high affinity to the TATA consensus sequence (TATAa/tAa/t, SEQ ID NO: 210), which recognizes minor groove determinants and promotes DNA bending. TBP resembles a molecular saddle. The bonding side is lined with the central 8 strands of 10 antiparallel β-sheets. The upper surface contains four α-helices and binds to various components of the transcription machinery.
在一些實施例中,DNA結合域為或包含轉錄因子。轉錄因子(TF)可為模組化蛋白質,其含有負責特異性識別鹼基序列的DNA結合域及可活化或抑制轉錄的一或多種效應域。TF與染色質相互作用且募集充當或共活化因子或共抑制因子的蛋白質複合物。In some embodiments, the DNA binding domain is or comprises a transcription factor. A transcription factor (TF) can be a modular protein that contains a DNA-binding domain responsible for specific recognition of base sequences and one or more effector domains that can activate or repress transcription. TFs interact with chromatin and recruit protein complexes that act as either co-activators or co-repressors.
在一些實施例中,效應部分包含一或多種RNA (例如gRNA)及dCas9。在一些實施例中,一或多種RNA經由dCas9及標靶特異性嚮導RNA來靶向基因體序列元件。如熟習此項技術者所瞭解,用於靶向的RNA可為相同或不同的,此視指定的標靶而定。效應部分可包含適體,諸如寡核苷酸適體或肽適體。適體部分為寡核苷酸或肽適體。In some embodiments, the effector moiety comprises one or more RNAs (eg, gRNAs) and dCas9. In some embodiments, one or more RNAs are targeted to a genome sequence element via dCas9 and a target-specific guide RNA. As will be appreciated by those skilled in the art, the RNAs used for targeting may be the same or different, depending on the designated target. The effector moiety may comprise an aptamer, such as an oligonucleotide aptamer or a peptide aptamer. The aptamer portion is an oligonucleotide or peptide aptamer.
效應部分可包含寡核苷酸適體。寡核苷酸適體為單股DNA或RNA (ssDNA或ssRNA)分子,其可以高親和力及特異性結合至預先選擇的標靶,包括蛋白質及肽。The effector portion may comprise an oligonucleotide aptamer. Oligonucleotide aptamers are single-stranded DNA or RNA (ssDNA or ssRNA) molecules that can bind to preselected targets, including proteins and peptides, with high affinity and specificity.
寡核苷酸適體為核酸物種,其可經由重複數輪之活體外選擇或等效地經由SELEX (指數級富集的配位體系統進化)加以工程改造以便結合至各種分子標靶,諸如小分子、蛋白質、核酸,及甚至細胞、組織及生物體。適體提供有區別的分子識別且可由化學合成產生。另外,適體具有所需儲存特性,且在治療應用中引發極少免疫原性或無免疫原性。Oligonucleotide aptamers are nucleic acid species that can be engineered to bind to various molecular targets, such as Small molecules, proteins, nucleic acids, and even cells, tissues, and organisms. Aptamers provide differential molecular recognition and can be produced by chemical synthesis. In addition, aptamers have desirable storage properties and elicit little or no immunogenicity in therapeutic applications.
DNA與RNA適體對各種標靶均顯示穩固的結合親和力。舉例而言,DNA及RNA適體已選用於t溶菌酶、凝血酶、人類免疫缺乏病毒反式作用反應元件(HIV TAR)、血晶素、干擾素γ、血管內皮生長因子(VEGF)、前列腺特異性抗原(PSA)、多巴胺及非經典致癌基因熱休克因子1 (HSF1)。Both DNA and RNA aptamers display robust binding affinities for a variety of targets. For example, DNA and RNA aptamers have been selected for tlysozyme, thrombin, human immunodeficiency virus trans-acting response element (HIV TAR), hemin, interferon gamma, vascular endothelial growth factor (VEGF), prostate Specific antigen (PSA), dopamine, and the nonclassical oncogene heat shock factor 1 (HSF1).
基於適體之血漿蛋白質概況分析用的診斷技術包括適體血漿蛋白質體學。此技術能夠在未來實現多生物標記蛋白質量測,從而可有助於診斷性區分疾病相對於健康狀態。Diagnostic techniques for aptamer-based plasma protein profiling include aptamer plasma proteomics. This technology could enable multi-biomarker protein quantification in the future, which could help to diagnostically differentiate disease from healthy states.
效應部分可包含肽適體部分。肽適體具有一(或多)個可變短肽域,包括具有低分子量12-14 kDa之肽。肽適體可設計成特異性結合且干擾細胞內之蛋白質-蛋白質相互作用。The effector portion may comprise a peptide aptamer portion. Peptide aptamers have one (or more) variable short peptide domains, including peptides with low molecular weights of 12-14 kDa. Peptide aptamers can be designed to specifically bind to and interfere with protein-protein interactions within cells.
肽適體為經選擇或經工程改造以結合特定靶分子之人工蛋白質。此等蛋白質包括可變序列之一或多種肽複合物。其典型地自組合文庫中分離且通常隨後藉由定向突變或數輪可變區誘變及選擇來改良。肽適體可活體內結合細胞蛋白質標靶且發揮生物效應,包括干擾其靶分子與其他蛋白質的正常蛋白質相互作用。特定而言,針對連接至轉錄因子活化域之靶蛋白來篩選連接至轉錄因子結合域之可變肽適體複合物。肽適體經由此選擇策略活體內結合至其標靶係根據下游酵母標記基因之表現來偵測。此類實驗鑑定出適體所結合的特定蛋白質,及使適體斷裂而產生指定表型的蛋白質相互作用。另外,經適當官能部分衍生的肽適體可促使其標靶蛋白質發生特異性轉譯後修飾,或改變標靶的亞細胞定位。肽適體亦可活體外識別標靶。其已用於代替生物感測器的抗體且用於偵測含有非活性與活性蛋白質形式之群體中的蛋白質之活性同功異型物。稱為蝌蚪(其中肽適體「頭」共價連接至獨特的序列雙股DNA「尾」)之衍生物允許藉由對其DNA尾進行PCR (使用例如定量式即時聚合酶鏈反應)來定量混合物中的稀少靶分子。Peptide aptamers are artificial proteins that are selected or engineered to bind specific target molecules. These proteins include one or more peptide complexes of variable sequences. They are typically isolated from combinatorial libraries and often subsequently improved by directed mutagenesis or rounds of variable region mutagenesis and selection. Peptide aptamers can bind cellular protein targets in vivo and exert biological effects, including interfering with normal protein interactions of their target molecules with other proteins. In particular, variable peptide aptamer complexes linked to transcription factor binding domains are screened against target proteins linked to transcription factor activation domains. In vivo binding of peptide aptamers to their targets via this selection strategy is detected based on the expression of downstream yeast marker genes. Such experiments identify the specific proteins to which the aptamer binds, and the protein interactions that fragment the aptamer to produce a defined phenotype. In addition, peptide aptamers derivatized with appropriate functional moieties can induce specific post-translational modification of their target proteins, or alter the subcellular localization of targets. Peptide aptamers can also recognize targets in vitro. It has been used to replace antibodies in biosensors and to detect active isoforms of proteins in populations containing inactive and active protein forms. Derivatives called tadpoles (in which the peptide aptamer "head" is covalently linked to a unique sequence double-stranded DNA "tail") allow quantification by PCR (using, for example, quantitative real-time polymerase chain reaction) of their DNA tails Rare target molecules in a mixture.
可使用不同系統進行肽適體選擇,但當前最常使用酵母雙雜交系統。肽適體亦可選自組合肽文庫,其藉由噬菌體呈現及其他表面呈現技術構築而成,諸如mRNA呈現、核糖體呈現、細菌呈現及酵母呈現。此等實驗程序亦稱為生物淘選(biopannings)。在自生物淘洗獲得之肽中,可將模擬表位(mimotope)視為一種肽適體。自組合肽文庫中淘選的肽已儲存於名為MimoDB之特殊資料庫中。Peptide aptamer selection can be performed using different systems, but the yeast two-hybrid system is currently most commonly used. Peptide aptamers can also be selected from combinatorial peptide libraries constructed by phage display and other surface display techniques such as mRNA display, ribosome display, bacterial display and yeast display. These experimental procedures are also known as biopannings. Among peptides obtained from biopanning, mimotopes can be considered as a type of peptide aptamer. Peptides panned from combinatorial peptide libraries have been deposited in a special database called MimoDB.
例示性效應部分可包括(但不限於):泛素、作為泛素連接酶抑制劑的雙環肽、轉錄因子、DNA及蛋白質修飾酶,諸如拓樸異構酶;拓樸異構酶抑制劑,諸如拓朴替康(topotecan);DNA甲基轉移酶,諸如DNMT家族(例如DNMT3A、DNMT3B、DNMT3a/3L、MQ1);蛋白質甲基轉移酶(例如病毒離胺酸甲基轉移酶(vSET)、蛋白質-離胺酸N-甲基轉移酶(SMYD2)、去胺酶(例如APOBEC、UG1)、組蛋白甲基轉移酶,諸如zeste同源物2增強子(EZH2)、PRMT1、組蛋白-離胺酸-N-甲基轉移酶(Setdb1)、組蛋白甲基轉移酶(SET2)、真染色質組蛋白-離胺酸N-甲基轉移酶2 (G9a)、組蛋白-離胺酸N-甲基轉移酶(SUV39H1)及G9a);組蛋白去乙醯酶(例如HDAC1、HDAC2、HDAC3)、具有DNA脫甲基化作用的酶(例如催化5-甲基胞嘧啶氧化成5-羥甲基胞嘧啶及更高價氧化衍生物的TET家族酶);蛋白質去甲基酶,諸如KDM1A及離胺酸特異性組蛋白去甲基酶1 (LSD1);解螺旋酶,諸如DHX9、去乙醯酶(例如長壽蛋白(sirtuin) 1、2、3、4、5、6或7)、激酶、磷酸酶、DNA插入劑(諸如溴化乙錠、SYBR綠及普羅黃素(proflavine));流出泵抑制劑,諸如肽模擬物,如苯丙胺酸精胺醯基β-萘基醯胺或喹啉衍生物;核受體活化劑及抑制劑、蛋白酶體抑制劑、針對諸如涉及溶酶體貯積病之酶的競爭性抑制劑、蛋白質合成抑制劑、核酸酶(例如Cpf1、Cas9、鋅指核酸酶)、來自蛋白質的特定域(諸如KRAB域),其一或多者的融合物(例如dCas9-DNMT、dCas9-MQ1、dCas9-KRAB)。Exemplary effector moieties may include, but are not limited to: ubiquitin, bicyclic peptides as ubiquitin ligase inhibitors, transcription factors, DNA and protein modifying enzymes such as topoisomerases; topoisomerase inhibitors, such as topotecan; DNA methyltransferases such as the DNMT family (e.g. DNMT3A, DNMT3B, DNMT3a/3L, MQ1); protein methyltransferases (e.g. viral lysine methyltransferase (vSET), Protein-lysine N-methyltransferase (SMYD2), deaminases (e.g. APOBEC, UG1), histone methyltransferases such as enhancer of zeste homolog 2 (EZH2), PRMT1, histone-lysine Amino acid-N-methyltransferase (Setdb1), Histone methyltransferase (SET2), Euchromatin histone-lysine N-methyltransferase 2 (G9a), Histone-lysine N - Methyltransferases (SUV39H1) and G9a); histone deacetylases (eg HDAC1, HDAC2, HDAC3), enzymes with DNA demethylation (eg catalyzing the oxidation of 5-methylcytosine to 5-hydroxy TET family enzymes of methylcytosine and higher oxidized derivatives); protein demethylases such as KDM1A and lysine-specific histone demethylase 1 (LSD1); helicases such as DHX9, deethylase Acylases (such as sirtuins 1, 2, 3, 4, 5, 6, or 7), kinases, phosphatases, DNA intercalators (such as ethidium bromide, SYBR green, and proflavine); Efflux pump inhibitors such as peptidomimetics such as phenylalanine arginyl β-naphthylamide or quinoline derivatives; nuclear receptor activators and inhibitors, proteasome inhibitors, Competitive inhibitors of diseased enzymes, protein synthesis inhibitors, nucleases (e.g. Cpf1, Cas9, zinc finger nucleases), specific domains from proteins (such as KRAB domains), fusions of one or more thereof (e.g. dCas9-DNMT, dCas9-MQ1, dCas9-KRAB).
在一些實施例中,可藉由熟習此項技術者已知之方法測定適用作效應部分的候選域。舉例而言,可藉由分析來測試候選效應部分,不論當候選效應部分存在於細胞核中且適當地定位(例如定位至靶基因或可操作地連接至該靶基因的轉錄控制元件,例如經由靶向部分)時,候選效應部分減少靶基因於細胞中的表現,例如降低靶基因所編碼之RNA轉錄本的水準(例如如RNASeq或北方墨點法所量測)或降低靶基因所編碼之蛋白質的水準(例如如ELISA所量測)。In some embodiments, candidate domains suitable for use as effector moieties can be determined by methods known to those skilled in the art. For example, a candidate effector moiety can be tested by assay whether the candidate effector moiety is present in the nucleus and appropriately localized (e.g. localized to a target gene or operably linked to a transcriptional control element of the target gene, e.g. via a target candidate effector portion) reduces expression of the target gene in the cell, such as reducing the level of RNA transcripts encoded by the target gene (as measured, for example, by RNASeq or northern blotting) or reducing the protein encoded by the target gene levels (eg as measured by ELISA).
在一些實施例中,表現抑制子包含複數個效應部分,其中各效應部分對另一效應部分的結合偵測不到,例如不結合。在一些實施例中,表現抑制系統包含含有第一效應部分的第一表現抑制子及含有第二效應部分的第二表現抑制子,其中第一效應部分對第二效應部分的結合偵測不到,例如不結合。In some embodiments, the expression inhibitor comprises a plurality of effector moieties, wherein each effector moiety has no detectable binding to another effector moiety, eg, does not bind. In some embodiments, the expression inhibition system comprises a first expression inhibitor comprising a first effector moiety and a second expression inhibitor comprising a second effector moiety, wherein binding of the first effector moiety to the second effector moiety is undetectable , such as not binding.
在一些實施例中,表現抑制系統包含複數個表現抑制子,其中複數個表現抑制子中的各成員包含效應部分,其中各效應部分對另一效應部分的結合偵測不到,例如不結合。在一些實施例中,表現抑制系統包含含有第一效應部分的第一表現抑制子及含有第二效應部分的第二表現抑制子,其中第一效應部分對第二效應部分的結合偵測不到,例如不結合。在一些實施例中,表現抑制系統包含含有第一效應部分的第一表現抑制子及含有第二效應部分的第二表現抑制子,其中第一效應部分對另一個第一效應部分的結合偵測不到,例如不結合,且第二效應部分對另一個第二效應部分的結合偵測不到,例如不結合。在一些實施例中,用於本文所述之組合物及方法中的效應部分在單體(例如非二聚)狀態下發揮功能。In some embodiments, the expression inhibition system comprises a plurality of expression inhibitors, wherein each member of the plurality of expression inhibitors comprises an effector moiety, wherein each effector moiety is undetectable, eg, does not bind, another effector moiety. In some embodiments, the expression inhibition system comprises a first expression inhibitor comprising a first effector moiety and a second expression inhibitor comprising a second effector moiety, wherein binding of the first effector moiety to the second effector moiety is undetectable , such as not binding. In some embodiments, the expression inhibition system comprises a first expression inhibitor comprising a first effector moiety and a second expression inhibitor comprising a second effector moiety, wherein binding of the first effector moiety to another first effector moiety is detected does not, eg does not bind, and the binding of the second effector moiety to another second effector moiety is undetectable, eg does not bind. In some embodiments, effector moieties used in the compositions and methods described herein function in the monomeric (eg, non-dimeric) state.
在一些實施例中,效應部分為或包含表觀遺傳修飾部分,例如調節染色質之二維結構(亦即,以改變其二維呈現之方式調節染色質結構)的表觀遺傳修飾部分。In some embodiments, the effector moiety is or comprises an epigenetic modifying moiety, eg, an epigenetic modifying moiety that modulates the two-dimensional structure of chromatin (ie, modulates chromatin structure in a manner that alters its two-dimensional presentation).
適用於本發明之方法及組合物中的表觀遺傳修飾部分包括影響表觀遺傳標記物的藥劑,例如DNA甲基化、組蛋白甲基化、組蛋白乙醯化、組蛋白類小泛素化、組蛋白磷酸化及RNA相關靜默。可靶向如本文所述之基因體序列元件的例示性表觀遺傳酶包括DNA甲基化酶(例如DNMT3a、DNMT3b、DNMT3a/3L、MQ1)、DNA去甲基化(例如TET家族)、組蛋白甲基轉移酶、組蛋白去乙醯酶(例如HDAC1、HDAC2、HDAC3)、長壽蛋白1、2、3、4、5、6或7、離胺酸特異性組蛋白去甲基酶1 (LSD1)、組蛋白-離胺酸-N-甲基轉移酶(Setdb1)、真染色質組蛋白-離胺酸N-甲基轉移酶2 (G9a)、組蛋白-離胺酸N-甲基轉移酶(SUV39H1)、zeste同源物2增強子(EZH2)、病毒離胺酸甲基轉移酶(vSET)、組蛋白甲基轉移酶(SET2),及蛋白質-離胺酸N-甲基轉移酶(SMYD2)。此類表觀遺傳調節劑之實例描述於例如de Groote等人, Nuc. Acids Res. (2012):1-18中。Epigenetic modification moieties suitable for use in the methods and compositions of the invention include agents that affect epigenetic markers such as DNA methylation, histone methylation, histone acetylation, histone ubiquitin Histone phosphorylation, histone phosphorylation, and RNA-associated silencing. Exemplary epigenetic enzymes that can target sequence elements of the genome as described herein include DNA methylases (e.g. DNMT3a, DNMT3b, DNMT3a/3L, MQ1), DNA demethylation (e.g. TET family), tissue Protein methyltransferase, histone deacetylase (e.g. HDAC1, HDAC2, HDAC3),
在一些實施例中,本文中適用的表現抑制子(例如包含表觀遺傳修飾部分的表現抑制子)包含或為Koferle等人, Genome Medicine 7.59 (2015):1-3中所述的構築體,該文獻以引用的方式併入本文中。舉例而言,在一些實施例中,表現抑制子包含或為Koferle等人之表1中所發現的構築體,例如表1中所述的組蛋白去乙醯酶、組蛋白甲基轉移酶、DNA去甲基化或H3K4及/或H3K9組蛋白去甲基酶(例如dCas9-p300、TALE-TET1、ZF-DNMT3A或TALE-LSD1)。In some embodiments, an expression suppressor useful herein (e.g., an expression suppressor comprising an epigenetic modification moiety) comprises or is a construct described in Koferle et al., Genome Medicine 7.59 (2015): 1-3, This document is incorporated herein by reference. For example, in some embodiments, the expression inhibitor comprises or is a construct found in Table 1 of Koferle et al., such as histone deacetylase, histone methyltransferase, DNA demethylation or H3K4 and/or H3K9 histone demethylases (eg dCas9-p300, TALE-TET1, ZF-DNMT3A or TALE-LSD1).
在一些實施例中,效應部分包含基因編輯系統的組分,例如CRISPR/Cas域,例如Zn指域,例如TAL效應域。在一些實施例中,表觀遺傳修飾部分可包含多肽(例如肽或蛋白質部分),該多肽連接至gRNA及標靶核酸酶,例如Cas9,例如野生型Cas9、切口酶Cas9 (例如Cas9 D10A)、無催化活性的Cas9 (dCas9)、eSpCas9、Cpf1、C2C1或C2C3,或編碼此類核酸酶的核酸。In some embodiments, the effector moiety comprises a component of a gene editing system, such as a CRISPR/Cas domain, such as a Zn finger domain, such as a TAL effector domain. In some embodiments, the epigenetic modification moiety may comprise a polypeptide (e.g., a peptide or protein moiety) linked to a gRNA and a target nuclease, e.g., Cas9, e.g., wild-type Cas9, nickase Cas9 (e.g., Cas9 D10A), Catalytically inactive Cas9 (dCas9), eSpCas9, Cpf1, C2C1, or C2C3, or a nucleic acid encoding such a nuclease.
如本文所用,「效應域之生物活性部分」為維持效應域(例如「最小」或「核心」域)之功能(例如完全地、部分地、最低程度地維持該功能)的部分。在一些實施例中,dCas9與表觀遺傳修飾劑(諸如DNA甲基化酶或具有DNA去甲基化作用的酶,例如DNMT3a、DNMT3b、DNMT3L、DNMT抑制劑、其組合、TET家族酶、蛋白質乙醯基轉移酶或去乙醯酶、dCas9-DNMT3a/3L、dCas9-DNMT3a/3L/KRAB、dCas9/VP64)之一或多個效應域之全部或一部分的融合形成嵌合蛋白,其連接至多肽且適用於本文所述的方法。包含此類嵌合蛋白的效應部分稱為基因修飾部分(由於其使用基因編輯系統組分Cas9)或表觀遺傳修飾部分(由於其使用表觀遺傳修飾劑的效應域)。As used herein, a "biologically active portion of an effector domain" is a portion that maintains the function (eg, fully, partially, minimally) of an effector domain (eg, a "minimal" or "core" domain). In some embodiments, dCas9 is combined with an epigenetic modifier such as a DNA methylase or an enzyme with DNA demethylation, e.g., DNMT3a, DNMT3b, DNMT3L, DNMT inhibitors, combinations thereof, TET family enzymes, proteins Acetyltransferase or deacetylase, dCas9-DNMT3a/3L, dCas9-DNMT3a/3L/KRAB, dCas9/VP64) fusion of all or part of one or more effector domains to form a chimeric protein linked to Polypeptides and are suitable for use in the methods described herein. Effector portions comprising such chimeric proteins are referred to as genetically modified portions (due to their use of the gene editing system component Cas9) or epigenetically modified portions (due to their use of effector domains of epigenetic modifiers).
在一些實施例中,描述了包含gRNA特異性靶向靶基因的所提供技術。在一些實施例中,靶基因為致癌基因、腫瘤抑制因子,或MYC異常調控病症相關基因。在一些實施例中,靶基因為MYC。In some embodiments, provided techniques comprising a gRNA specifically targeting a target gene are described. In some embodiments, the target gene is an oncogene, a tumor suppressor, or a gene associated with a disorder in which MYC is abnormally regulated. In some embodiments, the target gene is MYC.
在一些實施例中,本文提供的技術包括將本文所述之一或多個基因修飾部分(例如CRISPR系統的組分)遞送至個體的方法,例如遞送至個體之細胞核或組織的方法,此係藉由將此類部分與靶向部分作為融合分子之一部分連接來達成。在一些實施例中,本文提供的技術包括藉由將一或多個基因修飾部分(例如CRISPR系統的組分)囊封於脂質奈米粒子中而將本文所述之一或多個基因修飾部分(例如CRISPR系統的組分)遞送至個體(例如遞送至個體之細胞核或組織)的方法。In some embodiments, the technologies provided herein include methods of delivering one or more genetic modification moieties described herein (e.g., components of a CRISPR system) to an individual, e.g., to the nucleus or tissue of an individual, by This is accomplished by linking such moieties to the targeting moiety as part of a fusion molecule. In some embodiments, the techniques provided herein include incorporating one or more genetic modification moieties described herein by encapsulating one or more genetic modification moieties (e.g., components of a CRISPR system) in lipid nanoparticles A method of delivery (eg, a component of a CRISPR system) to an individual (eg, to a nucleus or a tissue of an individual).
其他部分 表現抑制子可進一步包含一或多個其他部分(例如除一或多個靶向部分及一或多個效應部分之外)。在一些實施例中,其他部分係選自標記或監測部分、可裂解部分(例如位於DNA靶向部分與效應部分之間,或位於多肽之N端或C端多可裂解部分)、小分子、膜易位多肽,或醫藥劑部分。Other Moieties The expression suppressor may further comprise one or more other moieties (eg, in addition to one or more targeting moieties and one or more effector moieties). In some embodiments, the additional moiety is selected from a labeling or monitoring moiety, a cleavable moiety (e.g., between a DNA targeting moiety and an effector moiety, or at the N-terminal or C-terminal multiple cleavable moiety of a polypeptide), small molecules, Membrane translocation polypeptides, or pharmaceutical agent moieties.
例示性表現抑制子 以下例示性表現抑制子僅為了說明的目的而展示且不希望具有限制性。Exemplary Expression Inhibitors The following exemplary expression inhibitors are shown for illustrative purposes only and are not intended to be limiting.
在一些實施例中,表現抑制子包含含有dCas9 (例如金黃色葡萄球菌dCas9)的靶向部分及含有MQ1 (例如細菌MQ1)的效應部分。在一些實施例中,表現抑制子由核酸序列SEQ ID NO: 68 (例如編碼表現抑制子的核酸(例如cDNA))編碼。在一些實施例中,表現抑制子由核酸序列SEQ ID NO: 119編碼。在一些實施例中,本文所述之核酸包含核酸序列SEQ ID NO: 68、119或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20, 19, 18, 17, 16、 15、 14、 13、 12、 11、 10、 9、 8、 7、 6、 5、 4、 3、 2或1的序列。In some embodiments, the expression suppressor comprises a targeting moiety comprising dCas9 (eg, S. aureus dCas9) and an effector moiety comprising MQ1 (eg, bacterial MQ1). In some embodiments, the expression suppressor is encoded by the nucleic acid sequence of SEQ ID NO: 68 (eg, nucleic acid (eg, cDNA) encoding the expression suppressor). In some embodiments, the expression suppressor is encoded by the nucleic acid sequence of SEQ ID NO: 119. In some embodiments, the nucleic acid described herein comprises a nucleic acid sequence of SEQ ID NO: 68, 119 or a sequence at least 80, 85, 90, 95, 99 or 100% identical thereto, or the number of positions differing therefrom is no more than 20 , 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 sequence.
dCas9-MQ1核苷酸序列: dCas9-MQ1 nucleotide sequence:
在一些實施例中,表現抑制子包含胺基酸序列SEQ ID NO: 35或151。在一些實施例中,本文所述之表現抑制子包含胺基酸序列SEQ ID NO: 35、151或與其具有至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。In some embodiments, the expression suppressor comprises the amino acid sequence of SEQ ID NO: 35 or 151. In some embodiments, the expression suppressor described herein comprises the amino acid sequence SEQ ID NO: 35, 151 or a sequence at least 80, 85, 90, 95, 99 or 100% identical thereto, or differs therefrom A sequence whose number of positions does not exceed 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1.
dCas9-MQ1蛋白質序列: dCas9-MQ1 protein sequence:
在一些實施例中,表現抑制子包含含有dCas9 (例如釀膿鏈球菌dCas9)的靶向部分及含有KRAB (例如KRAB域)的效應部分。在一些實施例中,表現抑制子由核酸序列SEQ ID NO: 67 (例如編碼表現抑制子的核酸(例如cDNA))編碼。在一些實施例中,本文所述之核酸包含核酸序列SEQ ID NO: 67或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。In some embodiments, the expression suppressor comprises a targeting moiety comprising dCas9 (eg, S. pyogenes dCas9) and an effector moiety comprising KRAB (eg, a KRAB domain). In some embodiments, the expression suppressor is encoded by the nucleic acid sequence of SEQ ID NO: 67 (eg, nucleic acid (eg, cDNA) encoding the expression suppressor). In some embodiments, the nucleic acid described herein comprises the nucleic acid sequence SEQ ID NO: 67 or a sequence at least 80, 85, 90, 95, 99 or 100% identical thereto, or the number of positions differing therefrom is no more than 20, 19 , 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1.
dCas9-KRAB核苷酸序列: dCas9-KRAB nucleotide sequence:
在一些實施例中,表現抑制子包含胺基酸序列SEQ ID NO: 34或150。在一些實施例中,本文所述之核酸包含胺基酸序列SEQ ID NO: 34、150或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。In some embodiments, the expression suppressor comprises the amino acid sequence of SEQ ID NO: 34 or 150. In some embodiments, a nucleic acid described herein comprises the amino acid sequence of SEQ ID NO: 34, 150, or a sequence at least 80, 85, 90, 95, 99, or 100% identical thereto, or differs therefrom in an infinite number of positions. More than 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 sequence.
dCas9-KRAB蛋白質序列: dCas9-KRAB protein sequence:
在一些實施例中,表現抑制子包含含有dCas9 (例如金黃色葡萄球菌dCas9)的DNA靶向部分及含有DNMT1 (例如人類DNMT1)的效應部分。在一些實施例中,表現抑制子由核酸序列SEQ ID NO: 69 (例如編碼表現抑制子的核酸(例如cDNA))編碼。在一些實施例中,本文所述之核酸包含核酸序列SEQ ID NO: 69或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1個位置的序列。In some embodiments, the expression suppressor comprises a DNA targeting moiety comprising dCas9 (eg, S. aureus dCas9) and an effector moiety comprising DNMT1 (eg, human DNMT1). In some embodiments, the expression inhibitor is encoded by the nucleic acid sequence of SEQ ID NO: 69 (eg, a nucleic acid (eg, cDNA) encoding an expression inhibitor). In some embodiments, the nucleic acid described herein comprises the nucleic acid sequence SEQ ID NO: 69 or a sequence at least 80, 85, 90, 95, 99 or 100% identical thereto, or the number of positions differing therefrom is no more than 20, 19 , 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 position sequence.
dCas9-DNMT1核苷酸序列 dCas9-DNMT1 nucleotide sequence
在一些實施例中,表現抑制子包含胺基酸序列SEQ ID NO: 36或152。在一些實施例中,本文所述之表現抑制子包含胺基酸序列SEQ ID NO: 36、152或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。In some embodiments, the expression suppressor comprises the amino acid sequence of SEQ ID NO: 36 or 152. In some embodiments, the expression suppressor described herein comprises the amino acid sequence of SEQ ID NO: 36, 152 or a sequence at least 80, 85, 90, 95, 99 or 100% identical thereto, or a position different therefrom A sequence of numbers not exceeding 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1.
dCas9-DNMT1蛋白質序列: dCas9-DNMT1 protein sequence:
在一些實施例中,表現抑制子包含含有dCas9 (例如金黃色葡萄球菌dCas9)的DNA靶向部分及含有DNMT13a/3L的效應部分。在一些實施例中,表現抑制子由核酸序列SEQ ID NO: 70 (例如編碼表現抑制子的核酸(例如cDNA))編碼。在一些實施例中,本文所述之核酸包含核酸序列SEQ ID NO: 70或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。In some embodiments, the expression suppressor comprises a DNA targeting moiety comprising dCas9 (eg, S. aureus dCas9) and an effector moiety comprising DNMT13a/3L. In some embodiments, the expression inhibitor is encoded by the nucleic acid sequence of SEQ ID NO: 70 (eg, a nucleic acid (eg, cDNA) encoding an expression inhibitor). In some embodiments, the nucleic acid described herein comprises a nucleic acid sequence of SEQ ID NO: 70 or a sequence at least 80, 85, 90, 95, 99 or 100% identical thereto, or the number of positions differing therefrom is no more than 20, 19 , 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1.
dCas9-DNMT3a/3L核苷酸序列 dCas9-DNMT3a/3L nucleotide sequence
在一些實施例中,表現抑制子包含胺基酸序列SEQ ID NO: 37或153。在一些實施例中,本文所述之表現抑制子包含胺基酸序列SEQ ID NO: 37或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。In some embodiments, the expression suppressor comprises the amino acid sequence of SEQ ID NO: 37 or 153. In some embodiments, the expression inhibitors described herein comprise the amino acid sequence of SEQ ID NO: 37 or a sequence at least 80, 85, 90, 95, 99, or 100% identical thereto, or differ therefrom in an infinite number of positions. More than 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 sequence.
dCas9-DNMT3a/3L蛋白質序列 dCas9-DNMT3a/3L protein sequence
在一些實施例中,表現抑制子包含含有Zn指域的靶向部分及含有KRAB (例如KRAB域)的效應部分。在一些實施例中,表現抑制子係由SEQ ID NO: 55、56、57、58、59、60、189、194、195及196中之任一者的核酸序列(例如編碼表現抑制子的核酸(例如cDNA))編碼。此等例示性表現抑制子之核酸序列揭示於表6中。在一些實施例中,本文所述之核酸包含SEQ ID NO:55-60、189、194-196中之任一者的核酸序列,或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。在一些實施例中,核酸序列包含聚腺苷酸序列,且在其他實施例中,核酸缺乏聚腺苷酸序列。
表 6 : 例示性 ZF - KRAB 效應子之核苷酸序列
在一些實施例中,表現抑制子包含含有Zn指域(例如具有根據SEQ ID NO: 5-10或169-172中之任一者的胺基酸序列)的靶向部分及含有KRAB (例如胺基酸序列SEQ ID NO: 18)(例如KRAB域)的效應部分。在一些實施例中,本文所述之表現抑制子包含SEQ ID NO: 22、23、24、25、26、27、139-144、177-180或183-186中之任一者的胺基序列。此等例示性表現抑制子的蛋白質序列揭示於表7中。在一些實施例中,本文所述之表現抑制子包含SEQ ID NO: 22-27、139-144、177-180、183-186中之任一者的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
表7:例示性鋅指-KRAB效應子的胺基酸序列
在一些實施例中,表現抑制子包含含有Zn指域(例如由SEQ ID NO: 44-49或115中之任一者的核苷酸序列編碼的Zn指域)的靶向部分及含有MQ1 (例如細菌MQ1 (例如由核苷酸序列SEQ ID NO: 52編碼的細菌MQ1))的效應部分。在一些實施例中,表現抑制子係由核酸序列SEQ ID NO: 61、62、63、64、65、66、116、117、118或130編碼。此等例示性表現抑制子之核酸序列揭示於表8中。在一些實施例中,本文所述之核酸包含SEQ ID NO: 61-66、116-118、130中之任一者的核酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。在一些實施例中,核酸序列包含聚腺苷酸序列,且在其他實施例中,核酸缺乏聚腺苷酸序列。舉例而言,在一些實施例中,本文所述之核酸包含根據SEQ ID NO: 61-66、116-118或130中之任一者的序列(或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列),但缺乏3'聚腺苷酸序列或包含長度較短之3'聚腺苷酸序列。
表8:例示性ZF-MQ1效應子之核苷酸序列
在一些實施例中,表現抑制子包含含有Zn指域(例如SEQ ID NO: 11-14中之任一者的胺基酸序列)的靶向部分及含有MQ1 (例如細菌MQ1 (例如SEQ ID NO: 19))的效應部分。在一些實施例中,表現抑制子包含SEQ ID NO: 28、29、30、31、32、33、129及145-149中之任一者的胺基序列。此等例示性表現抑制子的蛋白質序列揭示於表9中。在一些實施例中,本文所述之表現抑制子包含SEQ ID NO: 28-33、129中之任一者的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。
表9.例示性ZF-MQ1效應子的胺基酸序列
在一些實施例中,表現抑制子包含含有Zn指域(例如具有SEQ ID NO: 11-14中之任一者的胺基酸序列)的靶向部分及含有MQ1 (例如細菌MQ1 (例如SEQ ID NO: 87))的效應部分。In some embodiments, the expression suppressor comprises a targeting moiety comprising a Zn finger domain (eg, having the amino acid sequence of any one of SEQ ID NOs: 11-14) and a targeting moiety comprising MQ1 (eg, bacterial MQ1 (eg, SEQ ID NO: 11-14) NO: 87)) Effects section.
在一些實施例中,表現抑制子包含含有Zn指域(例如由SEQ ID NO: 166-168中之任一者的核苷酸序列編碼的Zn指域)的靶向部分及含有MQ1 (例如細菌MQ1 (例如由核苷酸序列SEQ ID NO: 52編碼的細菌MQ1))的效應部分。在一些實施例中,表現抑制子係由SEQ ID NO: 157、158或159之核酸序列編碼。此等例示性表現抑制子之核酸序列揭示於表16中。在一些實施例中,本文所述之核酸包含SEQ ID NO: 166-168中之任一者的核酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。在一些實施例中,核酸序列包含聚腺苷酸序列,且在其他實施例中,核酸缺乏聚腺苷酸序列。舉例而言,在一些實施例中,本文所述之核酸包含根據SEQ ID NO: 166-168中之任一者的序列(或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列),但缺乏3'聚腺苷酸序列或包含長度較短之3'聚腺苷酸序列。
表16:例示性小鼠特異性ZF-MQ1效應子之核苷酸序列
在一些實施例中,表現抑制子包含含有Zn指域(例如SEQ ID NO: 154-156中之任一者的胺基酸序列)的靶向部分及含有MQ1 (例如細菌MQ1 (例如SEQ ID NO: 19))的效應部分。在一些實施例中,表現抑制子包含SEQ ID NO: 160-165中之任一者的胺基序列。此等例示性表現抑制子的蛋白質序列揭示於表17中。在一些實施例中,本文所述之表現抑制子包含SEQ ID NO: 160-165中之任一者的胺基酸序列或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13,12、11、10、9、8、7、6、5、4、3、2或1的序列。
表17:例示性ZF-MQ1效應子的胺基酸序列
在一些實施例中,本發明提供一種表現抑制系統,其包含:含有第一ZF的第一靶向部分、含有DNA甲基轉移酶(例如MQ1或其功能片段)的第一效應部分、含有第二ZF的第二靶向部分,及含有KRAB (例如KRAB域)的第二效應部分。在一些實施例中,表現抑制系統係由編碼第一靶向部分及第一效應部分的第一核酸編碼,其中表現係由第一啟動子或IRES驅動;以及編碼第二靶向部分及第二效應部分的第二核酸,其中表現係由第二啟動子或IRES驅動。在一些實施例中,使用單順反子序列。在一些實施例中,編碼表現抑制系統的核酸為多順反子序列。在一些實施例中,多順反子序列為雙順反子序列。在一些實施例中,多順反子序列包含編碼第一表現抑制子的序列及編碼第二表現抑制子的序列。在一些實施例中,多順反子序列編碼可自裂解肽序列,例如2A肽序列,例如T2A肽序列、P2A序列。在一些實施例中,多順反子序列編碼T2A肽序列及P2A肽序列。在一些實施例中,多順反子序列編碼串聯2A序列,例如tPT2A序列。在一些實施例中,多順反子構築體自5'至3'編碼:(i)第一核定位信號,例如SV40 NLS;(ii)第一靶向部分,例如DNA結合域,例如鋅指結合域,例如ZF-9;(iii)第一效應部分,例如DNA甲基轉移酶,例如MQ1;(iv)第二核定位信號,例如核質蛋白NLS;(v)連接子,例如tPT2A連接子;(vi)第三核定位信號,例如SV40NLS;(vii)第二靶向部分,例如DNA結合域,例如鋅指結合域,例如ZF-3;(viii)第二效應部分,例如轉錄抑制部分,例如KRAB;及(ix)第四核定位信號,例如核質蛋白NLS。在一些實施例中,雙順反子構築體進一步包含聚腺苷酸尾。在一些實施例中,在雙順反子基因構築體轉錄後,產生編碼第一表現抑制子及第二表現抑制子的單一mRNA轉錄本,該轉錄本在轉譯後裂解,例如在2A肽內的甘胺酸殘基之後裂解,產生第一表現抑制子及第二表現抑制子作為兩種各別蛋白質。在一些實施例中,第一與第二表現抑制子藉由「核糖體跳讀」分隔。在一些實施例中,在核糖體跳讀之後,第一表現抑制子及/或第二表現抑制子保持2A肽的片段。在一些實施例中,第一與第二表現抑制子的表現量相等。在一些實施例中,第一與第二表現抑制子的表現量不同。在一些實施例中,第一表現抑制子的蛋白質水準在(大於或小於)第二表現抑制子之蛋白質水準的1%、2%、5%或10%內。In some embodiments, the present invention provides an expression inhibition system comprising: a first targeting moiety comprising a first ZF, a first effector moiety comprising a DNA methyltransferase (e.g., MQ1 or a functional fragment thereof), comprising a second a second targeting portion of a ZF, and a second effector portion comprising KRAB (eg, a KRAB domain). In some embodiments, the expression suppression system is encoded by a first nucleic acid encoding a first targeting moiety and a first effector moiety, wherein expression is driven by a first promoter or IRES; and encoding a second targeting moiety and a second The second nucleic acid of the effector portion, wherein expression is driven by a second promoter or IRES. In some embodiments, monocistronic sequences are used. In some embodiments, the nucleic acid encoding the expression suppression system is a polycistronic sequence. In some embodiments, the polycistronic sequence is a dicistronic sequence. In some embodiments, the polycistronic sequence comprises a sequence encoding a first expression suppressor and a sequence encoding a second expression suppressor. In some embodiments, the polycistronic sequence encodes a self-cleavable peptide sequence, eg, a 2A peptide sequence, eg, a T2A peptide sequence, a P2A sequence. In some embodiments, the polycistronic sequence encodes a T2A peptide sequence and a P2A peptide sequence. In some embodiments, the polycistronic sequence encodes a tandem 2A sequence, such as a tPT2A sequence. In some embodiments, the polycistronic construct encodes from 5' to 3': (i) a first nuclear localization signal, such as SV40 NLS; (ii) a first targeting moiety, such as a DNA binding domain, such as a zinc finger Binding domains, such as ZF-9; (iii) first effector moieties, such as DNA methyltransferases, such as MQ1; (iv) secondary nuclear localization signals, such as nucleoplasmin NLS; (v) linkers, such as tPT2A linkers (vi) a third nuclear localization signal, such as SV40NLS; (vii) a second targeting moiety, such as a DNA binding domain, such as a zinc finger binding domain, such as ZF-3; (viii) a second effector moiety, such as transcriptional repression Moieties, such as KRAB; and (ix) a fourth nuclear localization signal, such as the nucleoplasmic protein NLS. In some embodiments, the bicistronic construct further comprises a polyA tail. In some embodiments, following transcription of the bicistronic gene construct, a single mRNA transcript encoding a first repressor of expression and a second repressor of expression is generated that is cleaved post-translationally, e.g., within a 2A peptide. The glycine residues are then cleaved, yielding the first and second expression suppressors as two separate proteins. In some embodiments, the first and second expression suppressors are separated by "ribosome skipping". In some embodiments, the first expression suppressor and/or the second expression suppressor retains a fragment of the 2A peptide after ribosomal skipping. In some embodiments, the first and second expressed inhibitors are expressed in equal amounts. In some embodiments, the first and second expressed inhibitors are expressed in different amounts. In some embodiments, the protein level of the first expressed suppressor is within (greater than or less than) 1%, 2%, 5% or 10% of the protein level of the second expressed suppressor.
在一些實施例中,與其中第一及第二表現抑制子由單順反子核酸編碼的其他類似系統相比,由雙順反子核酸編碼的系統使靶基因(例如MYC)在細胞中的表現減少至少1%、至少2%、至少3%、至少4%、至少5%、至少10%、至少20%、至少30%、至少40%、至少50%。In some embodiments, a system encoded by a bicistronic nucleic acid enables expression of a target gene (e.g., MYC) in a cell as compared to an otherwise similar system in which the first and second expression suppressors are encoded by a monocistronic nucleic acid. Performance is reduced by at least 1%, at least 2%, at least 3%, at least 4%, at least 5%, at least 10%, at least 20%, at least 30%, at least 40%, at least 50%.
在一些實施例中,雙順反子序列編碼SEQ ID NO: 91、92、121、122、181、182、187、188之胺基酸,或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。在一些實施例中,表現抑制系統包含含有Zn指域(例如包含SEQ ID NO: 7或13中之任一者的胺基酸序列)的靶向部分及含有MQ1 (例如細菌MQ1 (例如SEQ ID NO: 19)或KRAB,例如KRAB域(例如SEQ ID NO: 18)的效應部分。在一些實施例中,表現抑制子包含SEQ ID NO: 91、92、121、122、181、182、187、188中之任一者的胺基序列。此等例示性表現抑制系統的蛋白質序列揭示於表10中。在一些實施例中,本文所述之表現抑制系統包含SEQ ID NO: 91-92、121-122、181、182、187、188中之任一者的胺基酸序列,或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。In some embodiments, the bicistronic sequence encodes the amino acid of SEQ ID NO: 91, 92, 121, 122, 181, 182, 187, 188, or at least 80, 85, 90, 95, 99, or 100 % Consistent sequence, or the number of positions that differ from it does not exceed 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, A sequence of 2 or 1. In some embodiments, the expression inhibition system comprises a targeting moiety comprising a Zn finger domain (eg, comprising the amino acid sequence of any one of SEQ ID NO: 7 or 13) and a targeting moiety comprising MQ1 (eg, bacterial MQ1 (eg, SEQ ID NO: 7 or 13) NO: 19) or KRAB, such as the effector portion of a KRAB domain (eg, SEQ ID NO: 18). In some embodiments, the expression suppressor comprises SEQ ID NO: 91, 92, 121, 122, 181, 182, 187, 188. The protein sequences of these exemplary expression inhibition systems are disclosed in Table 10. In some embodiments, the expression inhibition systems described herein comprise SEQ ID NOs: 91-92, 121 - The amino acid sequence of any one of 122, 181, 182, 187, 188, or a sequence that is at least 80, 85, 90, 95, 99 or 100% identical to it, or the number of positions that differ from it does not exceed 20 , 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1.
在一些實施例中,雙順反子序列包含核酸序列SEQ ID NO: 93或112 (例如編碼表現抑制子的核酸(例如cDNA))或SEQ ID NO: 94或113 (例如編碼表現抑制子的核酸(例如cDNA))。在一些實施例中,雙順反子序列包含核酸序列SEQ ID NO: 196 (例如編碼表現抑制子的核酸(例如cDNA))或SEQ ID NO: 197 (例如編碼表現抑制子的核酸(例如cDNA))。在一些實施例中,本文所述之核酸包含核酸序列SEQ ID NO: 93、94、112、113、196或197,或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列。編碼此等例示性表現抑制系統的核酸序列揭示於表10中。在一些實施例中,核酸序列包含聚腺苷酸序列,且在其他實施例中,核酸缺乏聚腺苷酸序列。
表10.例示性表現抑制系統的胺基酸序列及編碼例示性表現抑制系統的核酸序列
在一些實施例中,表現抑制子包含核定位序列(NLS)。在一些實施例中,表現抑制子包含NLS,例如在N端包含SV40 NLS。在一些實施例中,表現抑制子包含NLS,例如在C端包含核質蛋白NLS。在一些實施例中,表現抑制子在N端包含第一NLS且在C端包含第二NLS。在一些實施例中,第一與第二NLS具有相同序列。在一些實施例中,第一與第二NLS具有不同序列。在一些實施例中,表現抑制抑制子包含SV40 NLS,例如表現抑制子包含根據PKKKRK (SEQ ID NO: 135)的序列。在一些實施例中,N端序列包含NLS及間隔子,例如具有根據MAPKKKRKVGIHGVPAAGSSGS (SEQ ID NO: 88)的序列。在一些實施例中,表現抑制子包含含有以下中之一或多者(例如任何兩者或全部三者)的C端序列:間隔子、核質蛋白核定位序列及HA標籤:例如SGGKRPAATKKAGQAKKKGSYPYDVPDYA (SEQ ID NO: 89)。在一些實施例中,表現抑制子包含抗原決定基標籤,例如HA標籤:YPYDVPDYA (SEQ ID NO: 90)。舉例而言,表現抑制子可包含抗原決定基標籤的兩個複本。In some embodiments, the expression suppressor comprises a nuclear localization sequence (NLS). In some embodiments, the expression suppressor comprises an NLS, for example comprising a SV40 NLS at the N-terminus. In some embodiments, the expression suppressor comprises an NLS, eg, a nucleoplasmin NLS at the C-terminus. In some embodiments, the expression suppressor comprises a first NLS at the N-terminus and a second NLS at the C-terminus. In some embodiments, the first and second NLS have the same sequence. In some embodiments, the first and second NLS have different sequences. In some embodiments, the expression suppressor comprises a SV40 NLS, eg, the expression suppressor comprises a sequence according to PKKKRK (SEQ ID NO: 135). In some embodiments, the N-terminal sequence comprises an NLS and a spacer, for example having a sequence according to MAPKKKRKVGIHGVPAAGSSGS (SEQ ID NO: 88). In some embodiments, the expression suppressor comprises a C-terminal sequence comprising one or more (e.g., any two or all three) of: a spacer, a nucleoplasmin nuclear localization sequence, and an HA tag: e.g., SGGKRPAATKKAGQAKKKGSYPYDVPDYA (SEQ ID NO: 89). In some embodiments, the expression suppressor comprises an epitope tag, eg, HA tag: YPYDVPDYA (SEQ ID NO: 90). For example, an expression suppressor can comprise two copies of an epitope tag.
儘管抗原決定基標籤適用於多種研究背景中,但在治療背景中有時需要省略抗原決定基標籤。相應地,在一些實施例中,表現抑制子缺乏抗原決定基標籤。在一些實施例中,本文所述之表現抑制子包含本文提供的序列(或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列),但其缺乏SEQ ID NO: 90之HA標籤。在一些實施例中,本文所述之核酸包含本文提供的序列(或與其至少80、85、90、95、99或100%一致的序列,或與其有差異之位置數不超過20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2或1的序列),但其缺乏編碼SEQ ID NO: 90之HA標籤的區域。在一些實施例中,表現抑制子包含核質蛋白NLS,例如表現抑制子包含KRPAATKKAGQAKKK (SEQ ID NO: 136)之序列。在一些實施例中,表現抑制子不包含NLS。在一些實施例中,表現抑制子不包含抗原決定基標籤。在一些實施例中,表現抑制子不包含HA標籤。在一些實施例中,表現抑制子不包含根據SEQ ID NO: 90之HA標籤序列。Although epitope tags are useful in a variety of research settings, it is sometimes necessary to omit epitope tags in therapeutic settings. Accordingly, in some embodiments, the expression suppressor lacks an epitope tag. In some embodiments, an expression inhibitor described herein comprises a sequence provided herein (or a sequence at least 80, 85, 90, 95, 99, or 100% identical thereto, or differs therefrom by no more than 20, 19 , 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1), but it lacks the HA of SEQ ID NO: 90 Label. In some embodiments, a nucleic acid described herein comprises a sequence provided herein (or a sequence at least 80, 85, 90, 95, 99, or 100% identical thereto, or differs therefrom by no more than 20, 19, 18 positions). , 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1), but it lacks the HA tag encoding SEQ ID NO: 90 Area. In some embodiments, the expression suppressor comprises nucleoplasmic protein NLS, for example, the expression suppressor comprises the sequence of KRPAATKKAGQAKKK (SEQ ID NO: 136). In some embodiments, the expression suppressor does not comprise NLS. In some embodiments, the expression suppressor does not comprise an epitope tag. In some embodiments, the expression suppressor does not comprise an HA tag. In some embodiments, the expression suppressor does not comprise the HA tag sequence according to SEQ ID NO:90.
在一些實施例中,本發明提供包含自裂解肽的表現抑制系統。最初發現於小核糖核酸病毒中的自裂解肽為長度在19至22個胺基酸之間的肽且通常發現於小核糖核酸病毒家族之一些成員中的兩種蛋白質之間。小核糖核酸病毒能夠利用自裂解蛋白質、自相同mRNA產生等莫耳水準的多個基因。已知此類自裂解蛋白質發現於病毒的其他屬種中,且熟習此項技術者在需要時基於本文提供的資訊能夠容易確定適用於本文所揭示之自裂解蛋白質的取代。在一些實施例中,表現抑制系統包含自裂解肽,例如2A自裂解肽。在一些實施例中,2A肽包含單個裂解位點,例如2A肽,例如P2A、T2A、E2A或F2A肽。在一些實施例中,自裂解肽(例如2A肽)包含兩個裂解位點,例如pPT2A或P2A-T2A-E2A。在一些實施例中,表現抑制系統包含含有複數個裂解位點的自裂解肽,例如T2A自裂解肽及P2A自裂解肽。在一些實施例中,2A肽在轉譯之後裂解。在一些實施例中,自裂解肽在裂解之後產生兩個或更多個片段。在一些實施例中,2A肽片段包含SEQ ID NO: 126-128之序列。在一些實施例中,2A自裂解肽包含SEQ ID NO: 120、124、125之序列或其衍生物。在一些實施例中,SEQ ID NO: 95包含自裂解肽序列。 In some embodiments, the invention provides expression inhibition systems comprising self-cleaving peptides. Self-cleaving peptides, originally found in picornaviruses, are peptides between 19 and 22 amino acids in length and are usually found between two proteins in some members of the picornavirus family. Picornaviruses are able to utilize multiple genes at equimolar levels from self-cleaved proteins, produced from the same mRNA. Such self-cleaving proteins are known to be found in other species of viruses, and those skilled in the art can readily determine suitable substitutions for the self-cleaving proteins disclosed herein, if desired, based on the information provided herein. In some embodiments, the expression inhibition system comprises a self-cleaving peptide, such as a 2A self-cleaving peptide. In some embodiments, the 2A peptide comprises a single cleavage site, eg, a 2A peptide, eg, a P2A, T2A, E2A or F2A peptide. In some embodiments, a self-cleaving peptide (eg, a 2A peptide) comprises two cleavage sites, eg, pPT2A or P2A-T2A-E2A. In some embodiments, the expression inhibition system comprises a self-cleaving peptide containing multiple cleavage sites, such as a T2A self-cleaving peptide and a P2A self-cleaving peptide. In some embodiments, the 2A peptide is cleaved after translation. In some embodiments, a self-cleaving peptide produces two or more fragments upon cleavage. In some embodiments, the 2A peptide fragment comprises the sequence of SEQ ID NO: 126-128. In some embodiments, the 2A self-cleaving peptide comprises the sequence of SEQ ID NO: 120, 124, 125 or derivatives thereof. In some embodiments, SEQ ID NO: 95 comprises a self-cleaving peptide sequence.
當然應瞭解,雖然2A序列(例如tPT2A序列(例如根據SEQ ID NO: 124)在科學文獻及本文中可稱為自裂解肽,但此係依據非限制性理論。根據另一非限制性理論,在一些實施例中,2A序列經由核糖體跳讀來起作用。舉例而言,編碼2A序列的mRNA可誘導核糖體跳讀,其中核糖體未能形成肽鍵,同時轉譯2A區域,引起轉譯產物的第一部分釋放。核糖體接著產生轉譯產物的第二部分。總體而言,已明確確定,置於第一序列與第二序列之間的2A序列引起包含第一序列之第一蛋白質及包含第二序列之各別第二蛋白質產生。就藉以達成此的分子機制而言,本發明不受任何特定理論束縛。It will of course be understood that although a 2A sequence such as a tPT2A sequence (e.g. according to SEQ ID NO: 124) may be referred to as a self-cleaving peptide in the scientific literature and herein, this is by non-limiting theory. According to another non-limiting theory, In some embodiments, the 2A sequence functions via ribosomal skipping. For example, an mRNA encoding a 2A sequence can induce ribosome skipping, wherein the ribosome fails to form a peptide bond while simultaneously translating the 2A region, resulting in a translation product The first part of the translation product is released. The ribosome then produces the second part of the translation product. Overall, it has been clearly established that the 2A sequence placed between the first sequence and the second sequence causes a first protein comprising the first sequence and a protein comprising the second Respective second proteins of the two sequences are produced.The present invention is not bound by any particular theory as to the molecular mechanism by which this is achieved.
功能特徵本發明之表現抑制子或系統可用於減少靶基因(例如MYC)在細胞中的表現。一般而言,如本文所述的表現抑制子或系統結合(例如經由靶向部分)近接至及/或可操作地連接至靶基因(例如MYC)的基因體序列元件。在一些實施例中,表現抑制子或系統對基因體序列元件的結合調節(例如降低)靶基因(例如MYC)的表現。舉例而言,包含效應部分的表現抑制子或系統結合至基因體序列元件可調節(例如減少)靶基因(例如MYC)表現,該效應部分抑制轉錄機構的組分募集。作為另一實例,包含具有酶活性之效應部分(例如表觀遺傳修飾部分)之表現抑制子或系統的結合可經由效應部分的酶活性定位來調節(例如降低)靶基因(例如MYC)的表現。作為另一實例,表現抑制子或系統對基因體序列元件的結合與表現抑制子或系統的酶活性定位可對靶基因(例如MYC)之表現促成所得的調節(例如降低)。 Functional Features The expression suppressors or systems of the invention can be used to reduce the expression of a target gene (eg, MYC) in a cell. In general, an expression suppressor or system as described herein binds (eg, via a targeting moiety) in proximity to and/or is operably linked to a gene body sequence element of a target gene (eg, MYC). In some embodiments, binding of a gene body sequence element by an expression suppressor or system modulates (eg, reduces) the expression of a target gene (eg, MYC). For example, target gene (eg, MYC) expression can be modulated (eg, reduced) by expression suppressors or systemic binding to gene body sequence elements comprising an effector moiety that inhibits the recruitment of components of the transcriptional machinery. As another example, the combination of an expression suppressor or system comprising an effector moiety having enzymatic activity (e.g., an epigenetic modification moiety) can modulate (e.g., reduce) the expression of a target gene (e.g., MYC) via localization of the effector moiety's enzymatic activity . As another example, binding of an expression repressor or system to a gene body sequence element and localization of the enzymatic activity of the expression repressor or system can result in a resulting modulation (eg, reduction) in the expression of a target gene (eg, MYC).
在一些實施例中,降低表現包含降低靶基因(例如MYC)所編碼之RNA (例如mRNA)的水準。在一些實施例中,降低表現包含降低靶基因(例如MYC)所編碼之蛋白質的水準。在一些實施例中,降低表現包含降低靶基因(例如MYC)所編碼之mRNA與蛋白質的水準。在一些實施例中,與未接觸本文所揭示之表現抑制子或表現抑制系統或不包含本文所揭示之表現抑制子或表現抑制系統之細胞中的靶基因表現量相比,接觸本文所揭示之表現抑制子或表現抑制系統或包含本文所揭示之表現抑制子或表現抑制系統之細胞中的靶基因表現降低至少1.05x (亦即,1.05倍)、1.1x、1.15x、1.2x、1.25x、1.3x、1.35x、1.4x、1.45x、1.5x、1.55x、1.6x、1.65x、1.7x、1.75x、1.8x、1.85x、1.9x、1.95x、2x、3x、4x、5x、6x、7x、8x、9x、10x、20x、30x、40x、50x、60x、70x、80x、90x或100x。靶基因(例如MYC)的表現可藉由熟習此項技術者已知的方法分析,包括RT-PCR、ELISA、西方墨點法及實例2至9之方法。可藉由評價血液(例如全血)中的靶基因(例如MYC)水準,例如藉由以下文獻中的方法來評估個體中的靶基因(例如MYC)表現量,例如患者,例如患有MYC異常調控病症的患者,例如患有肝疾病的患者、患有贅瘤及/或病毒或酒精相關肝疾病的患者,例如患有肝癌的患者,例如患有肝癌亞型S1或肝癌亞型S2的患者:Oglesbee等人, Clin Chem. 2013年10月;59(10):1461-9. 數位物件識別碼: 10.1373/clinchem.2013.207472 或Deutsch等人, J Neurol Neurosurg Psychiatry. 2014年9月;85(9):994- 1002. 數位物件識別碼: 10.1136/jnnp-2013-306788,該等文獻之內容以全文引用的方式併入本文中。In some embodiments, reducing expression comprises reducing the level of RNA (eg, mRNA) encoded by a target gene (eg, MYC). In some embodiments, reducing expression comprises reducing the level of a protein encoded by a target gene (eg, MYC). In some embodiments, reducing expression comprises reducing the levels of mRNA and protein encoded by a target gene (eg, MYC). In some embodiments, exposure to an expression suppressor or expression suppression system disclosed herein is compared to the expression of a target gene in a cell that has not been exposed to or does not comprise an expression suppressor or expression suppression system disclosed herein, and is exposed to an expression suppressor or expression suppression system disclosed herein. Expression suppressors or expression suppression systems or cells comprising expression suppressors or expression suppression systems disclosed herein reduce target gene expression by at least 1.05x (i.e., 1.05-fold), 1.1x, 1.15x, 1.2x, 1.25x , 1.3x, 1.35x, 1.4x, 1.45x, 1.5x, 1.55x, 1.6x, 1.65x, 1.7x, 1.75x, 1.8x, 1.85x, 1.9x, 1.95x, 2x, 3x, 4x, 5x , 6x, 7x, 8x, 9x, 10x, 20x, 30x, 40x, 50x, 60x, 70x, 80x, 90x, or 100x. Expression of a target gene (eg, MYC) can be analyzed by methods known to those skilled in the art, including RT-PCR, ELISA, Western blotting, and the methods of Examples 2-9. The expression level of a target gene (such as MYC) in an individual, such as a patient, such as suffering from MYC abnormality, can be assessed by evaluating the level of the target gene (such as MYC) in blood (such as whole blood), such as by the method in the following literature Patients with a modulating disorder, such as patients with liver disease, patients with neoplastic and/or viral or alcohol-related liver diseases, such as patients with liver cancer, such as patients with liver cancer subtype S1 or liver cancer subtype S2 : Oglesbee et al., Clin Chem. 2013 Oct;59(10):1461-9. Digital Object ID: 10.1373/clinchem.2013.207472 or Deutsch et al., J Neurol Neurosurg Psychiatry. 2014 Sep;85(9 ):994-1002. Digital Object Identification Number: 10.1136/jnnp-2013-306788, the contents of which are incorporated herein by reference in their entirety.
本發明之表現抑制子或系統可用於在某一時段內減少靶基因(例如MYC)在細胞中的表現。在一些實施例中,接觸表現抑制子或系統或包含表現抑制子或系統之細胞中的靶基因(例如MYC)表現明顯地減少至少1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23或24小時,或至少1、2、3、4、5、6、7、10、14或15天,或至少1、2、3、4或5週,或至少1、2、3、4、5、6、7、8、9、10、11或12個月,或至少1、2、3、4或5年(例如無限期地)。視情況,接觸表現抑制子或系統或包含表現抑制子或系統之細胞中的靶基因(例如MYC)表現明顯地減少不超過10、9、8、7、6、5、4、3、2或1年。在一些實施例中,接觸表現抑制子或系統或包含表現抑制子或系統之細胞中的靶基因(例如MYC)表現在至少1、2、3、4、5、6、7、8、9或10次細胞分裂期間明顯減少。本發明之表現抑制子或系統可用於使標靶啟動子(例如MYC啟動子)中的CpG核苷酸發生甲基化。在一些實施例中,MYC表現發生的轉錄變化與CpG甲基化百分比相關。在一些實施例中,本文所揭示之表現抑制子或系統處理後,甲基化保持至少1天、至少2天、至少5天、至少7天、至少10天、至少15天或至少20天。The expression suppressors or systems of the invention can be used to reduce the expression of a target gene (eg, MYC) in a cell over a certain period of time. In some embodiments, contacting the expression suppressor or system or a target gene (e.g., MYC) in a cell comprising the expression suppressor or system significantly reduces expression by at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, or 24 hours, or at least 1, 2, 3, 4, 5, 6, 7, 10 , 14 or 15 days, or at least 1, 2, 3, 4, or 5 weeks, or at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12 months, or at least 1 , 2, 3, 4 or 5 years (eg indefinitely). Optionally, exposure to the expression suppressor or system or a target gene (e.g. MYC) expression in cells comprising the expression suppressor or system significantly reduces expression by no more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 year. In some embodiments, the target gene (e.g., MYC) in contact with the expressed suppressor or system or a cell comprising the expressed suppressor or system is expressed at least 1, 2, 3, 4, 5, 6, 7, 8, 9 or Significantly decreased during 10 cell divisions. The expression suppressor or system of the invention can be used to methylate CpG nucleotides in a target promoter, such as the MYC promoter. In some embodiments, the transcriptional change in MYC expression correlates with percent CpG methylation. In some embodiments, methylation is maintained for at least 1 day, at least 2 days, at least 5 days, at least 7 days, at least 10 days, at least 15 days, or at least 20 days following treatment with an expression inhibitor or system disclosed herein.
本發明之表現抑制子或系統可用於減少包含標靶基因座(例如MYC基因座)之細胞的存活率。在一些實施例中,本發明之表現抑制子或系統可用於減少包含標靶基因座(例如MYC基因座)之複數個細胞的存活率。在一些實施例中,與未接觸表現抑制子或系統或不包含表現抑制子或系統之對照細胞群中的活細胞數目相比,接觸表現抑制子或系統或包含表現抑制子或系統之活細胞的數目明顯減少10、20、30、40、50、60、70、80、90或100%。The expression suppressors or systems of the invention can be used to reduce the survival of cells comprising a targeted locus (eg, the MYC locus). In some embodiments, an expression suppressor or system of the invention can be used to reduce the survival of a plurality of cells comprising a targeted locus (eg, the MYC locus). In some embodiments, the number of viable cells exposed to the expressed inhibitor or system or comprising the expressed inhibitor or system is compared to the number of viable cells in a control cell population not contacted with the expressed inhibitor or system or not comprising the expressed inhibitor or system The number of significantly reduced by 10, 20, 30, 40, 50, 60, 70, 80, 90 or 100%.
在一些實施例中,本發明之表現抑制子或系統可用於減少包含癌細胞及非癌細胞之複數個細胞的存活率。在一些實施例中,本發明之表現抑制子或系統可用於降低複數個癌細胞的存活率,降幅超過其降低複數個非癌細胞之存活率的降幅。在一些實施例中,本發明之表現抑制子或系統可用於降低複數個癌細胞的存活率,降幅為其降低複數個非癌細胞之存活率之降幅的1.05x (亦即,1.05倍)、1.1x、1.15x、1.2x、1.25x、1.3x、1.35x、1.4x、1.45x、1.5x、1.6x、1.7x、1.8x、1.9x、2x、3x、4x、5x、6x、7x、8x、9x、10x、20x、50x或100x。In some embodiments, the expression inhibitors or systems of the invention can be used to reduce the survival of a plurality of cells, including cancer cells and non-cancer cells. In some embodiments, an expression inhibitor or system of the invention can be used to reduce the survival rate of a plurality of cancer cells more than it reduces the survival rate of a plurality of non-cancerous cells. In some embodiments, the expression inhibitors or systems of the invention can be used to reduce the survival of a plurality of cancer cells by 1.05x (i.e., 1.05 times), the reduction in the survival of a plurality of non-cancer cells, 1.1x, 1.15x, 1.2x, 1.25x, 1.3x, 1.35x, 1.4x, 1.45x, 1.5x, 1.6x, 1.7x, 1.8x, 1.9x, 2x, 3x, 4x, 5x, 6x, 7x , 8x, 9x, 10x, 20x, 50x or 100x.
在一些實施例中,本發明之表現抑制子或系統可用於降低包含感染細胞及未感染細胞之複數個細胞的存活率。在一些實施例中,本發明之表現抑制子或系統可用於降低複數個感染細胞的存活率,降幅超過其降低複數個未感染細胞之存活率的降幅。在一些實施例中,本發明之表現抑制子或系統可用於降低複數個感染細胞的存活率,降幅為其降低複數個未感染細胞之存活率之降幅的1.05x (亦即,1.05倍)、1.1x、1.15x、1.2x、1.25x、1.3x、1.35x、1.4x、1.45x、1.5x、1.6x、1.7x、1.8x、1.9x、2x、3x、4x、5x、6x、7x、8x、9x、10x、20x、50x或100x。In some embodiments, the expression inhibitors or systems of the invention can be used to reduce the viability of a plurality of cells, including both infected and uninfected cells. In some embodiments, an expression inhibitor or system of the invention can be used to reduce the survival of infected cells more than it reduces the survival of uninfected cells. In some embodiments, an expression inhibitor or system of the invention can be used to reduce the survival of infected cells by 1.05x (i.e., 1.05 times), the reduction in the survival of uninfected cells by a factor of 1.05, 1.1x, 1.15x, 1.2x, 1.25x, 1.3x, 1.35x, 1.4x, 1.45x, 1.5x, 1.6x, 1.7x, 1.8x, 1.9x, 2x, 3x, 4x, 5x, 6x, 7x , 8x, 9x, 10x, 20x, 50x or 100x.
表現抑制系統可包含複數個表現抑制子,其中各表現抑制子包含功能與另一表現抑制子之效應部分不同的效應部分。舉例而言,表現抑制系統可包含兩個表現抑制子,其中第一表現抑制子包含含有表觀遺傳修飾部分(例如DNA甲基轉移酶,例如MQ1)的第一效應部分且第二表現抑制子包含含有轉錄抑制子(例如KRAB)的第二效應部分。在一些實施例中,第二表現抑制子不包含第二效應部分。在一些實施例中,表現抑制系統包含含有效應部分之組合的表現抑制子,該等效應部分的功能就抑制靶基因(例如MYC)表現而言彼此互補,其中該等功能合起來能夠抑制表現且視情況當個別地存在時不抑制或可忽略地抑制表現。在一些實施例中,表現抑制系統包含複數個表現抑制子,其中各表現抑制子包含與各其他表現抑制子之效應部分互補的效應部分,例如各效應部分降低靶基因(例如MYC)表現。An expression inhibitory system may comprise a plurality of expression inhibitors, wherein each expression inhibitor comprises an effector moiety that is functionally different from the effector moiety of another expression inhibitor. For example, an expression repression system may comprise two expression repressors, wherein a first expression repressor comprises a first effector moiety comprising an epigenetic modification moiety (e.g., a DNA methyltransferase, e.g., MQ1) and a second expression repressor A second effector moiety comprising a transcriptional repressor (eg KRAB) is included. In some embodiments, the second expression suppressor does not comprise a second effector moiety. In some embodiments, the expression suppression system comprises an expression suppressor comprising a combination of effector moieties whose functions are complementary to each other in suppressing expression of a target gene (e.g., MYC), wherein the functions together suppress expression and Performance is not suppressed or negligibly suppressed when present individually, as appropriate. In some embodiments, the expression suppression system comprises a plurality of expression suppressors, wherein each expression suppressor comprises an effector moiety complementary to the effector moiety of each other expression suppressor, eg, each effector moiety reduces expression of a target gene (eg, MYC).
在一些實施例中,表現抑制系統包含含有效應部分之組合的表現抑制子,該等效應部分的功能就抑制靶基因(例如MYC)表現而言彼此協同作用。不希望受理論束縛,對基因體基因座的表觀遺傳修飾可累積起來,因為多種個別的抑制性表觀遺傳標記物(例如多種不同類型的表觀遺傳標記物及/或指定類型的更大範圍標記)合起來抑制表現比單獨的個別修飾更有效(例如使表現更大幅度的減少及/或使表現更持久的減少)。在一些實施例中,表現抑制系統包含複數個表現抑制子,其中各表現抑制子包含與各其他表現抑制子之效應部分協同作用的效應部分,例如各效應部分降低靶基因(例如MYC)表現。In some embodiments, the expression suppression system comprises an expression suppressor comprising a combination of effector moieties that function synergistically with each other in inhibiting expression of a target gene (eg, MYC). Without wishing to be bound by theory, epigenetic modifications to gene body loci can accumulate as multiple individual repressive epigenetic markers (e.g. multiple different types of epigenetic markers and/or larger range markers) together are more effective in suppressing expression than the individual modifications alone (eg, a greater reduction in expression and/or a longer-lasting reduction in expression). In some embodiments, the expression suppression system comprises a plurality of expression suppressors, wherein each expression suppressor comprises an effector moiety that acts synergistically with the effector moieties of each other expression suppressor, eg, each effector moiety reduces the expression of a target gene (eg, MYC).
在一些實施例中,表現抑制子或系統調節(例如降低)靶基因(例如MYC)表現,其藉由改變與靶基因(例如MYC)或可操作地連接至其之表現控制序列相關的一或多種表觀遺傳標記物來達成。在一些實施例中,改變包含降低與靶基因(例如MYC)或可操作地連接至其之表現控制序列相關的表觀遺傳標記物水準。表觀遺傳標記物包括(但不限於) DNA甲基化、組蛋白甲基化及組蛋白去乙醯化。In some embodiments, the expression suppressor or system modulates (e.g., reduces) the expression of a target gene (e.g., MYC) by altering one or more of the expression control sequences associated with the target gene (e.g., MYC) or operably linked thereto. multiple epigenetic markers. In some embodiments, altering comprises reducing the level of an epigenetic marker associated with a target gene (eg, MYC) or an expression control sequence operably linked thereto. Epigenetic marks include, but are not limited to, DNA methylation, histone methylation, and histone deacetylation.
在一些實施例中,改變表觀遺傳標記物的水準使與靶基因(例如MYC)或可操作地連接至其的表現控制序列相關的表觀遺傳標記物水準降低,比未接觸表現抑制子或系統或不包含表現抑制子或系統之細胞中之與靶基因(例如MYC)或可操作地連接至其之表現控制序列相關的表觀遺傳標記物水準低至少1.05x (亦即,1.05倍)、1.1x、1.15x、1.2x、1.25x、1.3x、1.35x、1.4x、1.45x、1.5x、1.55x、1.6x、1.65x、1.7x、1.75x、1.8x、1.85x、1.9x、1.95x、2x、3x、4x、5x、6x、7x、8x、9x、10x、20x、30x、40x、50x、60x、70x、80x、90x或100x。表觀遺傳標記物水準可藉由熟習此項技術者已知的方法分析,包括全基因體亞硫酸氫鹽定序、簡化表示亞硫酸氫鹽定序、亞硫酸氫鹽擴增子定序、甲基化陣列、焦磷酸定序、ChIP-seq或ChIP-qPCR。在一些實施例中,可利用亞硫酸氫鹽基因體定序,分析表觀遺傳標記物(例如DNA甲基化)在根據hg19參考基因體的精確基因體座標(例如根據hg19參考基因體之chr8:129188693-129189048之間)處的變化(例如增加或降低)。在一些實施例中,可利用亞硫酸氫鹽基因體定序,分析表觀遺傳標記物(例如DNA甲基化)在根據SEQ ID NO: 123的基因體位置處的變化(例如增加或降低)。 In some embodiments, altering the level of an epigenetic marker reduces the level of an epigenetic marker associated with a target gene (e.g., MYC) or an expression control sequence operably linked thereto, compared to uncontacted expression suppressors or The level of an epigenetic marker associated with a target gene (eg, MYC) or an expression control sequence operably linked thereto is at least 1.05x (i.e., 1.05-fold) lower in the system or in cells that do not comprise the expression suppressor or system , 1.1x, 1.15x, 1.2x, 1.25x, 1.3x, 1.35x, 1.4x, 1.45x, 1.5x, 1.55x, 1.6x, 1.65x, 1.7x, 1.75x, 1.8x, 1.85x, 1.9 x, 1.95x, 2x, 3x, 4x, 5x, 6x, 7x, 8x, 9x, 10x, 20x, 30x, 40x, 50x, 60x, 70x, 80x, 90x, or 100x. Epigenetic marker levels can be analyzed by methods known to those skilled in the art, including whole genome bisulfite sequencing, simplified representation bisulfite sequencing, bisulfite amplicon sequencing, Methylation array, pyrosequencing, ChIP-seq or ChIP-qPCR. In some embodiments, bisulfite genome sequencing can be used to analyze epigenetic markers (e.g., DNA methylation) at precise gene body coordinates according to the hg19 reference gene body (e.g., chr8 according to the hg19 reference gene body). : between 129188693-129189048) (eg increase or decrease). In some embodiments, bisulfite gene body sequencing can be used to analyze changes (e.g., increases or decreases) in epigenetic markers (e.g., DNA methylation) at gene body positions according to SEQ ID NO: 123 .
本發明之表現抑制子或系統可用於在某一時段內改變與靶基因(例如MYC)或可操作地連接至其之表現控制序列相關之表觀遺傳標記物在細胞中的水準。在一些實施例中,接觸表現抑制子或系統或包含表現抑制子或系統之細胞中之與靶基因或可操作地連接至其之表現控制序列相關的表觀遺傳標記物水準明顯地降低至少1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23或24小時,或至少1、2、3、4、5、6、7、10或14天,或至少1、2、3、4或5週,或至少1、2、3、4、5、6、7、8、9、10、11或12個月,或至少1、2、3、4或5年(例如無限期地)。視情況,接觸表現抑制子或系統或包含表現抑制子或系統之細胞中之與靶基因(例如MYC)或可操作地連接至其之表現控制序列相關的表觀遺傳標記物水準明顯降低不超過10、9、8、7、6、5、4、3、2或1年。The expression suppressors or systems of the invention can be used to alter the level in a cell of an epigenetic marker associated with a target gene (eg, MYC) or an expression control sequence operably linked thereto over a period of time. In some embodiments, contacting the expression suppressor or system or a cell comprising the expression suppressor or system significantly reduces the level of an epigenetic marker associated with the target gene or an expression control sequence operably linked thereto by at least 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, or 24 hours, or at least 1, 2, 3, 4, 5, 6, 7, 10, or 14 days, or at least 1, 2, 3, 4, or 5 weeks, or at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or 12 months, or at least 1, 2, 3, 4 or 5 years (eg indefinitely). Optionally, contacting the expression suppressor or system or a cell comprising the expression suppressor or system significantly reduces the level of an epigenetic marker associated with a target gene (e.g., MYC) or an expression control sequence operably linked thereto by no more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 year.
抑制子組合suppressor combination
在一些實施例中,表現抑制系統包含含有第一效應部分的第一表現抑制子及含有第二效應部分的第二表現抑制子,其中第一效應部分與第二效應部分彼此不同。在一些實施例中,第一效應部分為或包含第一表觀遺傳修飾部分(例如增加或減少第一表觀遺傳標記物)或其功能片段且第二效應部分為或包含第二表觀遺傳修飾部分(例如增加或減少第二表觀遺傳標記物)或其功能片段。在一些實施例中,第一效應部分為或包含DNA甲基轉移酶或其功能片段且第二效應部分為或包含KRAB或其功能片段。在一些實施例中,第一效應部分為或包含組蛋白去乙醯酶或其功能片段且第二效應部分為或包含KRAB或其功能片段。在一些實施例中,第一效應部分為或包含組蛋白甲基轉移酶或其功能片段且第二效應部分n為或包含KRAB或其功能片段。在一些實施例中,第一效應部分為或包含組蛋白去甲基酶或其功能片段且第二效應部分為或包含KRAB或其功能片段。In some embodiments, the expression suppression system comprises a first expression inhibitor comprising a first effector moiety and a second expression inhibitor comprising a second effector moiety, wherein the first effector moiety and the second effector moiety are different from each other. In some embodiments, the first effector moiety is or comprises a first epigenetic modification moiety (eg, increases or decreases a first epigenetic marker) or a functional fragment thereof and the second effector moiety is or comprises a second epigenetic modification moiety A modifying moiety (eg, increasing or decreasing a second epigenetic marker) or a functional fragment thereof. In some embodiments, the first effector moiety is or comprises a DNA methyltransferase or a functional fragment thereof and the second effector moiety is or comprises a KRAB or a functional fragment thereof. In some embodiments, the first effector moiety is or comprises a histone deacetylase or a functional fragment thereof and the second effector moiety is or comprises KRAB or a functional fragment thereof. In some embodiments, the first effector moiety is or comprises a histone methyltransferase or a functional fragment thereof and the second effector moiety n is or comprises KRAB or a functional fragment thereof. In some embodiments, the first effector moiety is or comprises a histone demethylase or a functional fragment thereof and the second effector moiety is or comprises KRAB or a functional fragment thereof.
在一些實施例中,第一效應部分為或包含MQ1、DNMT1、DNMT3A1、DNMT3A2、DNMT3B1、DNMT3B2、DNMT3B3、DNMT3B4、DNMT3B5、DNMT3B6、DNMT3L、HDAC1、HDAC2、HDAC3、HDAC4、HDAC5、HDAC6、HDAC7、HDAC8、HDAC9、HDAC10、HDAC11、SIRT1、SIRT2、SIRT3、SIRT4、SIRT5、SIRT6、SIRT7、SIRT8、SIRT9、KDM1A (亦即,LSD1)、KDM1B (亦即,LSD2)、KDM2A、KDM2B、KDM5A、KDM5B、KDM5C、KDM5D、KDM4B、NO66、SETDB1、SETDB2、EHMT2 (亦即,G9A)、EHMT1 (亦即,GLP)、SUV39H1、EZH2、EZH1、SUV39H2、SETD8、SUV420H1、SUV420H2或其任一者的功能片段,且第二效應部分為或包含KRAB (例如KRAB域)、MeCP2、HP1、RBBP4、REST、FOG1、SUZ12或其任一者的功能片段。In some embodiments, the first effector moiety is or comprises MQ1, DNMT1, DNMT3A1, DNMT3A2, DNMT3B1, DNMT3B2, DNMT3B3, DNMT3B4, DNMT3B5, DNMT3B6, DNMT3L, HDAC1, HDAC2, HDAC3, HDAC4, HDAC5, HDAC6, HDAC7, HDAC8 , HDAC9, HDAC10, HDAC11, SIRT1, SIRT2, SIRT3, SIRT4, SIRT5, SIRT6, SIRT7, SIRT8, SIRT9, KDM1A (aka LSD1), KDM1B (aka LSD2), KDM2A, KDM2B, KDM5A, KDM5B, KDM5C , KDM5D, KDM4B, NO66, SETDB1, SETDB2, EHMT2 (ie, G9A), EHMT1 (ie, GLP), SUV39H1, EZH2, EZH1, SUV39H2, SETD8, SUV420H1, SUV420H2, or a functional fragment of any of them, and The second effector moiety is or comprises KRAB (eg, KRAB domain), MeCP2, HP1, RBBP4, REST, FOG1, SUZ12, or a functional fragment of any of them.
在一些實施例中,第一效應部分為或包含KRAB (例如KRAB域)、MeCP2、HP1、RBBP4、REST、FOG1、SUZ12或其任一者的功能片段,且第二效應部分為或包含MQ1、DNMT1、DNMT3A1、DNMT3A2、DNMT3B1、DNMT3B2、DNMT3B3、DNMT3B4、DNMT3B5、DNMT3B6、DNMT3L、HDAC1、HDAC2、HDAC3、HDAC4、HDAC5、HDAC6、HDAC7、HDAC8、HDAC9、HDAC10、HDAC11、SIRT1、SIRT2、SIRT3、SIRT4、SIRT5、SIRT6、SIRT7、SIRT8、SIRT9、KDM1A (亦即,LSD1)、KDM1B (亦即,LSD2)、KDM2A、KDM2B、KDM5A、KDM5B、KDM5C、KDM5D、KDM4B、NO66、SETDB1、SETDB2、EHMT2 (亦即,G9A)、EHMT1 (亦即,GLP)、SUV39H1、EZH2、EZH1、SUV39H2、SETD8、SUV420H1、SUV420H2或其任一者的功能片段。In some embodiments, the first effector moiety is or comprises a functional fragment of KRAB (e.g., a KRAB domain), MeCP2, HP1, RBBP4, REST, FOG1, SUZ12, or any one thereof, and the second effector moiety is or comprises MQ1, DNMT1, DNMT3A1, DNMT3A2, DNMT3B1, DNMT3B2, DNMT3B3, DNMT3B4, DNMT3B5, DNMT3B6, DNMT3L, HDAC1, HDAC2, HDAC3, HDAC4, HDAC5, HDAC6, HDAC7, HDAC8, HDAC9, HDAC10, HDAC11, SIRT1, SIRT2, SIRT3, SIRT4, SIRT5, SIRT6, SIRT7, SIRT8, SIRT9, KDM1A (aka LSD1), KDM1B (aka LSD2), KDM2A, KDM2B, KDM5A, KDM5B, KDM5C, KDM5D, KDM4B, NO66, SETDB1, SETDB2, EHMT2 (aka , G9A), EHMT1 (ie, GLP), SUV39H1, EZH2, EZH1, SUV39H2, SETD8, SUV420H1, SUV420H2, or a functional fragment of any of them.
在一些實施例中,第一效應部分為或包含MQ1或其功能變異體或片段且第二效應部分為或包含KRAB或其功能變異體或片段。In some embodiments, the first effector moiety is or comprises MQ1 or a functional variant or fragment thereof and the second effector moiety is or comprises KRAB or a functional variant or fragment thereof.
在一些實施例中,第一效應部分為或包含DNMT3A或其功能變異體或片段且第二效應部分為或包含KRAB或其功能變異體或片段。In some embodiments, the first effector moiety is or comprises DNMT3A or a functional variant or fragment thereof and the second effector moiety is or comprises KRAB or a functional variant or fragment thereof.
在一些實施例中,第一效應部分為或包含DNMT3B或其功能變異體或片段且第二效應部分為或包含KRAB或其功能變異體或片段。In some embodiments, the first effector moiety is or comprises DNMT3B or a functional variant or fragment thereof and the second effector moiety is or comprises KRAB or a functional variant or fragment thereof.
在一些實施例中,第一效應部分為或包含DNMT3L或其功能變異體或片段且第二效應部分為或包含KRAB或其功能變異體或片段。In some embodiments, the first effector moiety is or comprises DNMT3L or a functional variant or fragment thereof and the second effector moiety is or comprises KRAB or a functional variant or fragment thereof.
在一些實施例中,第一效應部分為或包含DNMT3a/3L複合物或其功能變異體或片段且第二效應部分為或包含KRAB或其功能變異體或片段。In some embodiments, the first effector moiety is or comprises a DNMT3a/3L complex or a functional variant or fragment thereof and the second effector moiety is or comprises KRAB or a functional variant or fragment thereof.
靶點 本文所揭示之表現抑制子或表現抑制系統可用於調節(例如降低)靶基因(例如MYC)於細胞(例如個體或患者之細胞)中的表現。靶基因(例如MYC)可為熟習此項技術者已知的任何基因。在一些實施例中,靶基因(例如MYC)與個體(例如哺乳動物,例如人類、牛、馬、綿羊、雞、大鼠、小鼠、貓或犬)的疾病或病狀相關。靶基因可包括編碼序列,例如外顯子,及/或非編碼序列,例如內含子、3'UTR或5'UTR。在一些實施例中,靶基因可操作地連接至轉錄控制元件。Targets The expression suppressors or expression suppression systems disclosed herein can be used to modulate (eg, reduce) the expression of a target gene (eg, MYC) in cells (eg, cells of an individual or patient). The target gene (eg MYC) can be any gene known to those skilled in the art. In some embodiments, the target gene (eg, MYC) is associated with a disease or condition in an individual (eg, a mammal, eg, a human, cow, horse, sheep, chicken, rat, mouse, cat, or dog). A target gene may include coding sequences, such as exons, and/or non-coding sequences, such as introns, 3'UTRs or 5'UTRs. In some embodiments, the target gene is operably linked to a transcriptional control element.
適用於本文所述之表現抑制子或表現抑制系統的靶向部分可結合(例如特異性結合)至:靶基因(例如MYC)內的任何位點、可操作地連接至靶基因(例如MYC)之轉錄控制元件、錨定序列(例如與靶基因近接的錨定序列或與可操作地連接至靶基因(例如MYC)之錨定序列介導性接合體(例如若接合體之斷裂改變了靶基因(例如MYC)的表現,則錨定序列介導性接合體可操作地連接至靶基因)相關的錨定序列),或位於超級增強子區域中的調控元件(例如位於MYC之超級增強子區域中的調控元件)。Targeting moieties suitable for use in the expression suppressors or expression suppression systems described herein can bind (e.g., specifically bind) to: any site within a target gene (e.g., MYC), be operably linked to a target gene (e.g., MYC) Transcriptional control elements, anchor sequences (e.g., anchor sequences in close proximity to the target gene, or anchor sequences operatively linked to the target gene (e.g., MYC) mediate adapters (e.g., if fragmentation of the adapter changes the target expression of a gene (such as MYC), the anchor sequence-mediated adapter is operably linked to the target gene) associated anchor sequence), or a regulatory element located in a super-enhancer region (such as a super-enhancer located in MYC Regulatory elements in the region).
在一些實施例中,本文所述之表現抑制子係在某一位點或在鄰近於該位點的位置結合。舉例而言,靶向部分可結合至鄰近於抑制子(第二位點)的第一位點,且與該靶向部分相連的效應部分可以表觀遺傳方式修飾第一位點,以便調節增強子對靶基因表現的影響,從而與已結合及/或修飾的第二位點(增強子序列)基本上相同。在一些實施例中,近接於靶基因(例如外顯子、內含子,或靶基因內的剪接位點)、近接於可操作地連接至靶基因(例如MYC)之轉錄控制元件或近接於錨定序列的位點為來自靶基因(例如MYC)(例如外顯子、內含子,或靶基因(例如MYC)內的剪接位點)、轉錄控制元件或錨定序列的小於5000、4000、3000、2000、1000、900、800、700、600、500、400、300、200、100、50或25個鹼基對(且視情況,來自靶基因(例如MYC)(例如外顯子、內含子,或靶基因內的剪接位點)、轉錄控制元件或錨定序列的至少20、25、50、100、200或300個鹼基對)。In some embodiments, an expression inhibitor described herein binds at or adjacent to a site. For example, a targeting moiety can bind to a first site adjacent to a repressor (second site), and an effector moiety linked to the targeting moiety can epigenetically modify the first site so as to modulate potentiation The effect of the promoter on the expression of the target gene is substantially the same as the second site (enhancer sequence) that has been bound and/or modified. In some embodiments, the target gene is located in close proximity to a target gene (such as an exon, an intron, or a splice site within a target gene), close to a transcriptional control element operably linked to a target gene (such as MYC), or close to a The site of the anchor sequence is less than 5000, 4000 from the target gene (eg MYC) (eg exon, intron, or splice site within the target gene (eg MYC), transcriptional control element or anchor sequence , 3000, 2000, 1000, 900, 800, 700, 600, 500, 400, 300, 200, 100, 50, or 25 base pairs (and optionally, from a target gene (e.g., MYC) (e.g., an exon, Introns, or splice sites within the target gene), transcriptional control elements, or at least 20, 25, 50, 100, 200, or 300 base pairs of an anchor sequence).
在一些實施例中,靶向部分結合至靶基因,例如MYC。在一些實施例中,DNA靶向部分結合至靶基因(例如MYC)之外顯子內的位點。在一些實施例中,靶向部分結合至靶基因(例如MYC)之內含子內的位點。在一些實施例中,靶向部分結合至靶基因(例如MYC)之外顯子與內含子邊界處的位點,例如剪接位點。在一些實施例中,靶向部分結合至靶基因(例如MYC)之5'UTR內的位點。在一些實施例中,靶向部分結合至靶基因(例如MYC)之3'UTR內的位點。靶基因包括(但不限於)編碼MYC的基因。In some embodiments, the targeting moiety binds to a target gene, such as MYC. In some embodiments, the DNA targeting moiety binds to a site within the exon of a target gene (eg, MYC). In some embodiments, the targeting moiety binds to a site within an intron of a target gene (eg, MYC). In some embodiments, the targeting moiety binds to a site, such as a splice site, at an exon-intron boundary of a target gene (eg, MYC). In some embodiments, the targeting moiety binds to a site within the 5'UTR of a target gene (eg, MYC). In some embodiments, the targeting moiety binds to a site within the 3'UTR of a target gene (eg, MYC). Target genes include, but are not limited to, the gene encoding MYC.
在一些實施例中,靶向部分結合至可操作地連接至靶基因(例如MYC)的轉錄控制元件。在一些實施例中,靶向部分結合至可操作地連接至靶基因(例如MYC)之啟動子的一部分或該啟動子內的位點。在一些實施例中,靶向部分結合至靶基因(例如MYC)之轉錄起點。在一些實施例中,靶向部分結合至可操作地連接至靶基因(例如MYC)之增強子的一部分或該增強子內的位點。在一些實施例中,基因體複合物(例如ASMC)使兩個或更多個基因體序列元件共定位,其中該兩個或更多個基因體序列元件包括啟動子。啟動子典型地為起始相連關基因轉錄的序列元件。啟動子典型地靠近基因的5'端,距離其轉錄起點不遠。如一般技術者所瞭解,真核細胞中之蛋白質編碼基因的轉錄典型地始於一般轉錄因子(例如TFIID、TFIIE、TFIIH、FUSE、CT元件等)及介體結合至核心啟動子序列成為起始前複合物,該起始前複合物將RNA聚合酶II引向轉錄起點,且在許多情形中保持結合至核心啟動子序列,甚至在RNA聚合酶逃逸且起始初級轉錄本延伸之後。在一些實施例中,啟動子包括諸如TATA、Inr、DPE或BRE之序列元件,但熟習此項技術者充分瞭解此類序列不一定為定義啟動子而必需的。熟習此項技術者熟悉與基因相連的多種正轉錄控制元件(例如增強子)或負轉錄控制元件(例如抑制子或靜默子)。在一些實施例中,轉錄控制元件為轉錄因子結合位點。典型地,當同源調控蛋白結合至此類轉錄控制元件時,來自相連基因的轉錄被改變(例如增加或減少)。在一些實施例中,靶向部分結合至位於基因體座標GRCh37:chr8:129162465-129212140內的基因體序列。In some embodiments, the targeting moiety binds to a transcriptional control element operably linked to a target gene (eg, MYC). In some embodiments, the targeting moiety binds to a portion of, or a site within, a promoter operably linked to a target gene (eg, MYC). In some embodiments, targeting moieties bind to the start of transcription of a target gene (eg, MYC). In some embodiments, the targeting moiety binds to a portion of, or a site within, an enhancer operably linked to a target gene (eg, MYC). In some embodiments, a gene body complex (eg, ASMC) colocalizes two or more gene body sequence elements, wherein the two or more gene body sequence elements include a promoter. A promoter is typically a sequence element that initiates transcription of an associated gene. A promoter is typically near the 5' end of a gene, not far from its start of transcription. As is understood by those of ordinary skill, transcription of protein-coding genes in eukaryotic cells typically begins with binding of common transcription factors (eg, TFIID, TFIIE, TFIIH, FUSE, CT elements, etc.) and mediators to the core promoter sequence to initiate The pre-initiation complex, the pre-initiation complex, directs RNA polymerase II towards the start of transcription and in many cases remains bound to the core promoter sequence even after RNA polymerase escapes and initiates primary transcript elongation. In some embodiments, a promoter includes sequence elements such as TATA, Inr, DPE, or BRE, but those skilled in the art are well aware that such sequences are not necessarily necessary to define a promoter. Those skilled in the art are familiar with a variety of positive transcriptional control elements (such as enhancers) or negative transcriptional control elements (such as repressors or silencers) associated with genes. In some embodiments, the transcriptional control element is a transcription factor binding site. Typically, when a cognate regulatory protein binds to such a transcriptional control element, transcription from the linked gene is altered (eg, increased or decreased). In some embodiments, the targeting moiety binds to a gene body sequence located within gene body coordinates GRCh37:chr8:129162465-129212140.
在一些實施例中,靶向部分結合至基因體序列元件所包含或部分包含的靶序列。在一些實施例中,基因體序列元件為或包含表現控制序列。在一些實施例中,基因體序列元件為或包含靶基因,例如MYC或靶基因(例如MYC)的一部分。在一些實施例中,靶向部分結合至長度為至少10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34或35個鹼基(且視情況,長度不超過40、39、38、37、36、35、34、33、32、31、30、29、28、27、26、25、24、23、22、21或20個鹼基)的靶序列。在一些實施例中,靶向部分結合至長度為10-30、15-30、15-25、18-24、19-23、20-23、21-23或22-23個鹼基的靶序列。在一些實施例中,靶序列具有10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39或40個鹼基的長度。在一些實施例中,基因體序列元件為或包含錨定序列。In some embodiments, a targeting moiety binds to a target sequence comprised or partially comprised by a gene body sequence element. In some embodiments, the genome sequence element is or comprises an expression control sequence. In some embodiments, the gene body sequence element is or comprises a target gene, eg, MYC, or a portion of a target gene, eg, MYC. In some embodiments, the targeting moiety binds to a length of at least 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, or 35 bases (and up to 40, 39, 38, 37, 36, 35, 34, 33, 32, 31, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21 or 20 bases) of the target sequence. In some embodiments, the targeting moiety binds to a target sequence that is 10-30, 15-30, 15-25, 18-24, 19-23, 20-23, 21-23, or 22-23 bases in length . In some embodiments, the target sequence has 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 , 31, 32, 33, 34, 35, 36, 37, 38, 39 or 40 bases in length. In some embodiments, the genome sequence element is or comprises an anchor sequence.
各ASMC包含一或多個錨定序列,例如複數個。在一些實施例中,可操控或改變錨定序列以調節(例如破壞)天然存在之基因體複合物(例如ASMC)或形成新基因體複合物(例如ASMC)(例如與外源或改變的錨定序列一起形成非天然存在之基因體複合物(例如ASMC))。在一些實施例中,錨定序列介導性接合體可斷裂以改變(例如抑制,例如降低)靶基因表現。此斷裂可調節基因表現,例如藉由改變DNA的拓樸結構,例如藉由調節靶基因與轉錄控制元件(例如增強及靜默/抑制序列)相互作用的能力來調節基因表現。Each ASMC comprises one or more anchor sequences, eg, a plurality. In some embodiments, the anchor sequence can be manipulated or altered to modulate (e.g., disrupt) a naturally occurring gene body complex (e.g., ASMC) or to form a new gene body complex (e.g., ASMC) (e.g., with an exogenous or altered anchor Sequences together form non-naturally occurring gene body complexes (eg, ASMC)). In some embodiments, anchor sequence-mediated adapters are cleavable to alter (eg, inhibit, eg, reduce) target gene expression. Such breaks can modulate gene expression, eg, by altering the topology of the DNA, eg, by modulating the ability of the target gene to interact with transcriptional control elements (eg, enhancing and silencing/repressing sequences).
在一些實施例中,靶向部分結合至錨定序列,例如近接於靶基因(例如MYC)或與可操作地連接至靶基因(例如MYC)之錨定序列介導性接合體(ASMC)相關的錨定序列(例如若接合體的斷裂改變靶基因(例如MYC)表現,則錨定序列介導性接合體可操作地連接至靶基因,例如MYC)。一般而言,錨定序列為基因體複合物組分(例如成核多肽)所特異性結合的基因體序列元件。在一些實施例中,基因體複合物組分結合至錨定序列引起成核複合物形成,例如ASMC形成。在一些實施例中,靶向部分結合至靶基因,例如MYC基因座。基因座通常定義為包含轉錄區域、啟動子,及包含靶基因(例如MYC)之ASMC的錨定位點。在一些實施例中,靶向部分結合至包含SEQ ID NO: 75-86或199-206中之任一者的序列。在一些實施例中,第一靶向部分結合至包含SEQ ID NO: 75-86中之任一者的序列且第二靶向部分結合至包含SEQ ID NO: 75-86中之任一者的序列,其中第一及第二靶向部分結合至相同序列。在一些實施例中,第一靶向部分結合至包含SEQ ID NO: 75-86中之任一者的序列且第二靶向部分結合至包含SEQ ID NO: 75-86中之任一者的序列,其中第一與第二靶向部分結合至不同序列。在一些實施例,第一靶向部分結合至包含SEQ ID NO: 83、203或206中之任一者的序列,且第二靶向部分結合至包含SEQ ID NO: 77的序列。在一些實施例中,第一靶向部分結合至包含SEQ ID NO: 77之序列且第二靶向部分結合至包含SEQ ID NO: 83、203或206中之任一者的序列。在一些實施例,第一靶向部分結合至包含SEQ ID NO: 83、203或206中之任一者的序列,且第二靶向部分結合至包含SEQ ID NO: 199、204或205中之任一者的序列。在一些實施例,第一靶向部分結合至包含SEQ ID NO: 199、204或205中之任一者的序列,且第二靶向部分結合至包含SEQ ID NO: 83、203或206中之任一者的序列。在一些實施例,第一靶向部分結合至包含SEQ ID NO: 83、203或206中之任一者的序列,且第二靶向部分結合至包含SEQ ID NO: 201的序列。在一些實施例中,編碼第一及第二表現抑制子的核酸包含編碼第一表現抑制子的第一區域,其中該第一區域位於編碼第二表現抑制子之第二區域的上游。在一些實施例中,編碼第一及第二表現抑制子的核酸包含編碼第一表現抑制子的第一區域,其中該第一區域在編碼第二表現抑制子之第二區域的下游。在一些實施例中,第一靶向部分結合至包含SEQ ID NO: 75-86或199-206中之任一者的序列,且第二靶向部分(例如包含gRNA的CRISPR/Cas域)結合至包含SEQ ID NO: 1-4中之任一者的序列。在一些實施例中,靶向部分結合至包含SEQ ID NO: 96-110中之任一者的序列。在一些實施例中,第一靶向部分結合至包含SEQ ID NO: 96-110中之任一者的序列且第二靶向部分結合至包含SEQ ID NO: 96-110中之任一者的序列,其中第一及第二靶向部分結合至相同序列。在一些實施例中,第一靶向部分結合至包含SEQ ID NO: 96-110中之任一者的序列且第二靶向部分結合至包含SEQ ID NO: 96-110中之任一者的序列,其中第一與第二靶向部分結合至不同序列。在一些實施例中,第一靶向部分結合至包含SEQ ID NO: 96-110中之任一者的序列,且第二靶向部分(例如包含gRNA的CRISPR/Cas域)結合至包含SEQ ID NO: 1-4中之任一者的序列。在一些實施例中,第一靶向部分結合至包含表2、12或13中所揭示之任一種SEQ ID No.的序列,且第二靶向部分(例如包含gRNA的CRISPR/Cas域)結合至包含表2、12或13中所揭示之任一種SEQ ID No.的序列。In some embodiments, the targeting moiety binds to the anchor sequence, e.g., in proximity to the target gene (e.g., MYC) or associated with an anchor sequence-mediated adapter (ASMC) operably linked to the target gene (e.g., MYC) (eg, if disruption of the adapter alters expression of the target gene (eg, MYC), the anchor sequence mediates the operative linkage of the adapter to the target gene, such as MYC). In general, an anchor sequence is a gene body sequence element to which a gene body complex component (eg, a nucleating polypeptide) specifically binds. In some embodiments, binding of a gene body complex component to an anchor sequence results in nucleation complex formation, eg, ASMC formation. In some embodiments, the targeting moiety binds to a target gene, such as the MYC locus. A locus is generally defined as comprising a transcribed region, a promoter, and an anchor site for ASMC comprising a target gene (eg, MYC). In some embodiments, the targeting moiety binds to a sequence comprising any of SEQ ID NOs: 75-86 or 199-206. In some embodiments, the first targeting moiety binds to a sequence comprising any of SEQ ID NOs: 75-86 and the second targeting moiety binds to a sequence comprising any of SEQ ID NOs: 75-86. sequence, wherein the first and second targeting moieties bind to the same sequence. In some embodiments, the first targeting moiety binds to a sequence comprising any of SEQ ID NOs: 75-86 and the second targeting moiety binds to a sequence comprising any of SEQ ID NOs: 75-86. sequence, wherein the first and second targeting moieties bind to different sequences. In some embodiments, the first targeting moiety binds to a sequence comprising any one of SEQ ID NO: 83, 203, or 206 and the second targeting moiety binds to a sequence comprising SEQ ID NO: 77. In some embodiments, the first targeting moiety binds to a sequence comprising SEQ ID NO: 77 and the second targeting moiety binds to a sequence comprising any of SEQ ID NO: 83, 203, or 206. In some embodiments, the first targeting moiety binds to a sequence comprising any of SEQ ID NOs: 83, 203, or 206, and the second targeting moiety binds to a sequence comprising any of SEQ ID NOs: 199, 204, or 205. sequence of either. In some embodiments, the first targeting moiety binds to a sequence comprising any of SEQ ID NOs: 199, 204, or 205, and the second targeting moiety binds to a sequence comprising any of SEQ ID NOs: 83, 203, or 206. sequence of either. In some embodiments, the first targeting moiety binds to a sequence comprising any one of SEQ ID NO: 83, 203, or 206 and the second targeting moiety binds to a sequence comprising SEQ ID NO: 201. In some embodiments, the nucleic acid encoding the first and second expression inhibitors comprises a first region encoding the first expression inhibitor, wherein the first region is upstream of the second region encoding the second expression inhibitor. In some embodiments, the nucleic acid encoding the first and second expression inhibitors comprises a first region encoding the first expression inhibitor, wherein the first region is downstream of the second region encoding the second expression inhibitor. In some embodiments, the first targeting moiety binds to a sequence comprising any of SEQ ID NOs: 75-86 or 199-206, and the second targeting moiety (e.g., a CRISPR/Cas domain comprising a gRNA) binds to a sequence comprising any one of SEQ ID NO: 1-4. In some embodiments, the targeting moiety binds to a sequence comprising any one of SEQ ID NOs: 96-110. In some embodiments, the first targeting moiety binds to a sequence comprising any of SEQ ID NOs: 96-110 and the second targeting moiety binds to a sequence comprising any of SEQ ID NOs: 96-110 sequence, wherein the first and second targeting moieties bind to the same sequence. In some embodiments, the first targeting moiety binds to a sequence comprising any of SEQ ID NOs: 96-110 and the second targeting moiety binds to a sequence comprising any of SEQ ID NOs: 96-110 sequence, wherein the first and second targeting moieties bind to different sequences. In some embodiments, the first targeting moiety binds to a sequence comprising any of SEQ ID NOs: 96-110, and the second targeting moiety (e.g., a CRISPR/Cas domain comprising a gRNA) binds to a sequence comprising any of SEQ ID NO: 96-110. NO: the sequence of any one of 1-4. In some embodiments, the first targeting moiety binds to a sequence comprising any one of the SEQ ID Nos disclosed in Table 2, 12 or 13, and the second targeting moiety (e.g., a CRISPR/Cas domain comprising a gRNA) binds to a sequence comprising any one of the SEQ ID No. disclosed in Table 2, 12 or 13.
例示性靶序列揭示於表12中。
表12:例示性靶序列
在一些實施例中,表現抑制子結合具有本文所示之序列(例如SEQ ID NO: 1-4、75-86、96-110或199-206中之任一者)的基因體基因座。應瞭解,在許多情況下,所結合的基因體基因座包含雙股DNA,且此基因座可藉由示出其有義股或其反義股的序列來描述。因此,具有指定間隔子序列的gRNA可促使表現抑制子結合至特定基因體基因座,其中該基因體基因座中的一個股具有類似於或等同於間隔子序列的序列,且該基因體基因座中的另一個股具有互補序列。典型地,gRNA對基因體基因座的結合將涉及基因體基因座的一些展開及gRNA間隔子與間隔子互補股的配對。In some embodiments, the expression suppressor binds to a gene body locus having a sequence set forth herein (eg, any of SEQ ID NOs: 1-4, 75-86, 96-110, or 199-206). It will be appreciated that in many cases the associated gene body locus comprises double-stranded DNA, and that this locus can be described by showing the sequence of its sense or antisense strand. Thus, a gRNA with a specified spacer sequence can induce expression suppressor binding to a specific gene body locus, wherein one strand in the gene body locus has a sequence similar to or identical to the spacer sequence, and the gene body locus The other strand in has a complementary sequence. Typically, binding of a gRNA to a gene body locus will involve some unwinding of the gene body locus and pairing of the gRNA spacer with the spacer complementary strand.
在一些實施例中,靶向部分結合至錨定序列,例如近接於靶基因(例如MYC)或與可操作地連接至靶基因(例如MYC)之錨定序列介導性接合體(ASMC)相關的錨定序列(例如若接合體的斷裂改變小鼠基因體中之靶基因(例如MYC)的表現,則錨定序列介導性接合體可操作地連接至靶基因,例如MYC)。一般而言,錨定序列為基因體複合物組分(例如成核多肽)所特異性結合的基因體序列元件。在一些實施例中,基因體複合物組分結合至錨定序列引起成核複合物形成,例如ASMC形成。在一些實施例中,靶向部分結合至靶基因,例如MYC基因座。基因座通常定義為包含轉錄區域、啟動子,及包含靶基因(例如MYC)之ASMC的錨定位點。在一些實施例中,靶向部分結合至包含SEQ ID NO: 190-192中之任一者的序列。在一些實施例中,靶向部分結合至包含表18中所揭示之任一種SEQ ID No.的序列。小鼠基因體中的例示性靶序列揭示於表18中。
表18:小鼠基因體中的例示性靶序列
在一些實施例中,表現抑制子結合具有本文所示之序列(例如SEQ ID NO: 190-192中之任一者)的基因體基因座。應瞭解,在許多情況下,所結合的基因體基因座包含雙股DNA,且此基因座可藉由示出其有義股或其反義股的序列來描述。In some embodiments, the expression suppressor binds to a gene body locus having a sequence set forth herein (eg, any of SEQ ID NOs: 190-192). It will be appreciated that in many cases the associated gene body locus comprises double-stranded DNA, and that this locus can be described by showing the sequence of its sense or antisense strand.
在一個實施例中,錨定序列介導性接合體包含環,諸如染色體內環。在某些實施例中,錨定序列介導性接合體具有複數個環。一或多個環可包括第一錨定序列、核酸序列、轉錄控制序列及第二錨定序列。在另一個實施例中,至少一個環依次包括第一錨定序列、轉錄控制序列及第二錨定序列,或第一錨定序列、核酸序列及第二錨定序列。在又另一個實施例中核酸序列與轉錄控制序列中的一或兩者位於環內或環外。在另一實施例中,一或多個環包含轉錄控制序列。In one embodiment, the anchor sequence-mediated adapter comprises a loop, such as an intrachromosomal loop. In certain embodiments, the anchor sequence-mediated adapter has a plurality of loops. One or more loops may include a first anchor sequence, a nucleic acid sequence, a transcription control sequence, and a second anchor sequence. In another embodiment, at least one loop comprises a first anchor sequence, a transcriptional control sequence and a second anchor sequence, or a first anchor sequence, a nucleic acid sequence and a second anchor sequence, in sequence. In yet another embodiment, one or both of the nucleic acid sequence and the transcriptional control sequence are located within or outside the loop. In another embodiment, one or more loops comprise transcriptional control sequences.
在一些實施例中,錨定序列介導性接合體包括TATA盒、CAAT盒、GC盒或CAP位點。在一些實施例中,錨定序列介導性接合體包含複數個環,且其中錨定序列介導性接合體在一或多個環中包含錨定序列、核酸序列及轉錄控制序列中的至少一者。In some embodiments, the anchor sequence-mediated adapter comprises a TATA box, a CAAT box, a GC box, or a CAP site. In some embodiments, the anchor sequence-mediated adapter comprises a plurality of loops, and wherein the anchor sequence-mediated adapter comprises at least one of an anchor sequence, a nucleic acid sequence, and a transcription control sequence in one or more loops one.
在一些實施例中,藉由取代、添加或缺失錨定序列內的一或多個核苷酸來修飾染色質結構。在一些實施例中,藉由取代、添加或缺失錨定序列介導性接合體之錨定序列內的一或多個核苷酸來修飾染色質結構。在一些實施例中,藉由包括活化環或排除抑制環來抑制轉錄。在一個此類實施例中,錨定序列介導性接合體不包括減少核酸序列轉錄的轉錄控制序列。在一些實施例中,藉由包括抑制環或排除活化環來抑制轉錄。在一個此類實施例中,錨定序列介導性接合體包括減少核酸序列轉錄的轉錄控制序列。In some embodiments, the chromatin structure is modified by substituting, adding, or deleting one or more nucleotides within the anchor sequence. In some embodiments, the chromatin structure is modified by substituting, adding, or deleting one or more nucleotides within the anchor sequence of the anchor sequence-mediated adapter. In some embodiments, transcription is inhibited by including an activation loop or excluding a repression loop. In one such embodiment, the anchor sequence-mediated adapter does not include a transcription control sequence that reduces transcription of the nucleic acid sequence. In some embodiments, transcription is inhibited by including an inhibitory loop or excluding an activation loop. In one such embodiment, the anchor sequence-mediated adapter includes a transcription control sequence that reduces transcription of the nucleic acid sequence.
錨定序列彼此可不鄰接。在使用非鄰接錨定序列的實施例中,第一錨定序列與第二錨定序列可相隔約500 bp至約500 Mb、約750 bp至約200 Mb、約1 kb至約100 Mb、約25 kb至約50 Mb、約50 kb至約1 Mb、約100 kb至約750 kb、約150 kb至約15 500 kb,或約175 kb至約500 kb。在一些實施例中,第一錨定序列與第二錨定序列相隔約500 bp、600 bp、700 bp、800 bp、900 bp、1 kb、5 kb、10 kb、15 kb、20 kb、25 kb、30 kb、35 kb、40 kb、45 kb、50 kb、55 kb、60 kb、65 kb、70 kb、75 kb、80 kb、85 kb、90 kb、95 kb、100 kb、125 kb、150 kb、175 kb、200 kb、225 kb、250 kb、275 kb、300 kb、350 kb、400 kb、500 kb、600 kb、700 kb、800 kb、900 kb、1 Mb、2 Mb、3 Mb、4 Mb、5 Mb、6 Mb、7 Mb、8 Mb、9 Mb、10 Mb、15 Mb、20 Mb、25 Mb、50 Mb、75 Mb、20 100 Mb、200 Mb、300 Mb、400 Mb、500 Mb或其間的任何尺寸。The anchor sequences may not be contiguous to each other. In embodiments using non-contiguous anchor sequences, the first and second anchor sequences can be separated by about 500 bp to about 500 Mb, about 750 bp to about 200 Mb, about 1 kb to about 100 Mb, about 1 kb to about 100 Mb, about 25 kb to about 50 Mb, about 50 kb to about 1 Mb, about 100 kb to about 750 kb, about 150 kb to about 15,500 kb, or about 175 kb to about 500 kb. In some embodiments, the first anchor sequence is separated from the second anchor sequence by about 500 bp, 600 bp, 700 bp, 800 bp, 900 bp, 1 kb, 5 kb, 10 kb, 15 kb, 20 kb, 25 kb, kb, 30 kb, 35 kb, 40 kb, 45 kb, 50 kb, 55 kb, 60 kb, 65 kb, 70 kb, 75 kb, 80 kb, 85 kb, 90 kb, 95 kb, 100 kb, 125 kb, 150 kb, 175 kb, 200 kb, 225 kb, 250 kb, 275 kb, 300 kb, 350 kb, 400 kb, 500 kb, 600 kb, 700 kb, 800 kb, 900 kb, 1 Mb, 2 Mb, 3 Mb , 4 Mb, 5 Mb, 6 Mb, 7 Mb, 8 Mb, 9 Mb, 10 Mb, 15 Mb, 20 Mb, 25 Mb, 50 Mb, 75 Mb, 20 100 Mb, 200 Mb, 300 Mb, 400 Mb, 500 Mb or any size in between.
在一些更多實施例中,靶向部分引入以下中之至少一者:至少一個外源錨定序列;至少一個接合成核分子結合位點的變化,諸如藉由改變接合成核分子的結合親和力;至少一種共同核苷酸序列的取向變化,諸如CTCF結合模體、YY1結合模體、ZNF143結合模體或本文中提及的其他結合模體;以及至少一種錨定序列的取代、添加或缺失,諸如CTCF結合模體、YY1結合模體、ZNF143結合模體或本文中提及的其他結合模體。In some further embodiments, the targeting moiety introduces at least one of: at least one exogenous anchor sequence; at least one alteration of the binding site of the engaging nuclear molecule, such as by altering the binding affinity of the engaging nuclear molecule ; a change in the orientation of at least one common nucleotide sequence, such as a CTCF binding motif, a YY1 binding motif, a ZNF143 binding motif or other binding motifs mentioned herein; and a substitution, addition or deletion of at least one anchor sequence , such as CTCF binding motifs, YY1 binding motifs, ZNF143 binding motifs or other binding motifs mentioned herein.
在一些實施例中,錨定序列包含成核多肽結合模體,例如CTCF結合模體:N(T/C/G)N(G/A/T)CC(A/T/G)(C/G)(C/T/A)AG(G/A)(G/T)GG(C/A/T)(G/A)(C/G)(C/T/A)(G/A/C) (SEQ ID NO: 71),其中N為任何核苷酸。In some embodiments, the anchor sequence comprises a nucleating polypeptide binding motif, such as a CTCF binding motif: N(T/C/G)N(G/A/T)CC(A/T/G)(C/ G)(C/T/A)AG(G/A)(G/T)GG(C/A/T)(G/A)(C/G)(C/T/A)(G/A/ C) (SEQ ID NO: 71), wherein N is any nucleotide.
CTCF結合模體亦可呈相反取向,例如(G/A/C)(C/T/A)(C/G)(G/A)(C/A/T)GG(G/T)(G/A)GA(C/T/A)(C/G)(A/T/G)CC(G/A/T)N(T/C/G)N (SEQ ID NO: 72)。其中N為任何核苷酸。The CTCF binding motif can also be in the opposite orientation, for example (G/A/C)(C/T/A)(C/G)(G/A)(C/A/T)GG(G/T)(G /A)GA(C/T/A)(C/G)(A/T/G)CC(G/A/T)N(T/C/G)N (SEQ ID NO: 72). wherein N is any nucleotide.
在一些實施例中,錨定序列包含SEQ ID NO: 71或SEQ ID NO: 72或與SEQ ID NO: 71或SEQ ID NO: 72至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%一致的序列。In some embodiments, the anchor sequence comprises SEQ ID NO: 71 or SEQ ID NO: 72 or at least 75%, at least 80%, at least 90%, at least 95%, At least 96%, at least 97%, at least 98%, at least 99% identical sequences.
在一些實施例中,錨定序列包含成核多肽結合模體,例如YY1結合模體:CCGCCATNTT (SEQ ID NO: 73),其中N為任何核苷酸。In some embodiments, the anchor sequence comprises a nucleating polypeptide binding motif, such as the YY1 binding motif: CCGCCANTTT (SEQ ID NO: 73), wherein N is any nucleotide.
YY1結合模體亦可呈相反取向,例如AANATGGCGG (SEQ ID NO: 74),其中N為任何核苷酸。The YY1 binding motif can also be in the opposite orientation, such as AANATGGCGG (SEQ ID NO: 74), wherein N is any nucleotide.
在一些實施例中,錨定序列介導性接合體至少包含第一錨定序列及第二錨定序列。舉例而言,在一些實施例中,第一錨定序列及第二錨定序列可各自包含成核多肽結合模體,例如各包含CTCF結合模體。In some embodiments, the anchor sequence-mediated adapter comprises at least a first anchor sequence and a second anchor sequence. For example, in some embodiments, the first anchor sequence and the second anchor sequence can each comprise a nucleating polypeptide binding motif, eg, each comprise a CTCF binding motif.
在一些實施例中,第一錨定序列及第二錨定序列包含不同序列,例如第一錨定序列包含CTCF結合模體,且第二錨定序列包含除CTCF結合模體之外的錨定序列。在一些實施例中,各錨定序列包含成核多肽結合模體及位於成核多肽結合模體之一側或兩側的一或多個側接核苷酸。In some embodiments, the first anchor sequence and the second anchor sequence comprise different sequences, for example the first anchor sequence comprises a CTCF binding motif and the second anchor sequence comprises an anchor other than the CTCF binding motif sequence. In some embodiments, each anchor sequence comprises a nucleating polypeptide binding motif and one or more flanking nucleotides flanking one or both sides of the nucleating polypeptide binding motif.
可形成ASMC的兩個CTCF結合模體(例如鄰接或非鄰接的CTCF結合模體)可以任何取向存在於基因體中,例如相同取向(串聯) 5'-3' (左串聯,例如兩個包含SEQ ID NO: 71的CTCF結合模體)或3'-5' (右串聯,例如兩個包含SEQ ID NO: 72的CTCF結合模體),或收斂取向,其中一個CTCF結合模體包含SEQ ID NO: 71且另一個其餘模體包含SEQ ID NO: 72。Two CTCF binding motifs that can form ASMCs (e.g. contiguous or non-contiguous CTCF binding motifs) can be present in the gene body in any orientation, e.g. same orientation (tandem) 5'-3' (left tandem, e.g. two containing CTCF binding motif of SEQ ID NO: 71) or 3'-5' (right tandem, e.g. two CTCF binding motifs comprising SEQ ID NO: 72), or convergent orientation, wherein one CTCF binding motif comprises SEQ ID NO: 71 and another remaining motif comprising SEQ ID NO: 72.
在一些實施例中,錨定序列包含與靶基因(例如MYC)相關的CTCF結合模體,其中該靶基因與疾病、病症及/或病狀相關,例如MYC異常調控病症,例如肝病(例如肝癌)或肺癌。In some embodiments, the anchor sequence comprises a CTCF binding motif associated with a target gene (e.g., MYC), wherein the target gene is associated with a disease, disorder, and/or condition, e.g., a MYC dysregulated disorder, e.g., liver disease (e.g., liver cancer ) or lung cancer.
在一些實施例中,本發明之方法包含調節(例如破壞)基因體複合物(例如ASMC),例如藉由修飾染色質結構,藉由取代、添加或缺失錨定序列(例如成核多肽結合模體)內的一或多個核苷酸。可特異性靶向一或多個核苷酸,例如靶向變化,以便在錨定序列(例如成核多肽結合模體)內達成取代、添加或缺失。In some embodiments, the methods of the invention comprise modulating (e.g., disrupting) a gene body complex (e.g., ASMC), e.g., by modifying chromatin structure, by substituting, adding, or deleting anchor sequences (e.g., nucleating polypeptide binding one or more nucleotides in the body). One or more nucleotides can be specifically targeted, eg, targeted for changes, to effect a substitution, addition or deletion within an anchor sequence (eg, a nucleating polypeptide binding motif).
在一些實施例中,可藉由改變至少一種成核多肽結合模體之取向來改變基因體複合物(例如ASMC)。在一些實施例中,錨定序列包含成核多肽結合模體,例如CTCF結合模體,且靶向部分將變化引入至少一種成核多肽結合模體,例如改變針對成核多肽的結合親和力。In some embodiments, a gene body complex (eg, ASMC) can be altered by altering the orientation of at least one nucleating polypeptide binding motif. In some embodiments, the anchor sequence comprises a nucleating polypeptide binding motif, such as a CTCF binding motif, and the targeting moiety introduces a change in at least one nucleating polypeptide binding motif, such as altering the binding affinity for the nucleating polypeptide.
在一些實施例中,本文所述之表現抑制子或系統投與之前,靶基因(例如MYC)具有定義的表現狀態,例如病變狀態。舉例而言,靶基因(例如MYC)在病變細胞中可具有高表現量。藉由破壞錨定序列介導性接合體,可減少靶基因(例如MYC)的表現。In some embodiments, the target gene (eg, MYC) has a defined expression state, such as a disease state, prior to administration of an expression suppressor or system described herein. For example, a target gene (eg, MYC) may be highly expressed in diseased cells. By disrupting the anchor sequence-mediated adapter, the expression of target genes such as MYC can be reduced.
適用於本文所述之表現抑制系統中之表現抑制子的靶向部分可結合(例如特異性結合)至包含至少10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40、41、42、43、44、45、46、47、48、49或50個核苷酸或鹼基對(且視情況,不超過50、49、48、47、46、45、44、43、42、41、40、39、38、37、36、35、34、33、32、31、30、29、28、27、26、25、24、23、22、21、20、19、18、17、16、15、14、13、12、11或10個核苷酸或鹼基對)的位點。在一些實施例中,DNA靶向部分結合至包含10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40、41、42、43、44、45、46、47、48、49或50個核苷酸或鹼基對的位點。Targeting moieties for expression inhibitors suitable for use in the expression inhibition systems described herein can bind (e.g., specifically bind) to a group comprising at least 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 , 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45 , 46, 47, 48, 49, or 50 nucleotides or base pairs (and, as the case may be, no more than 50, 49, 48, 47, 46, 45, 44, 43, 42, 41, 40, 39, 38 , 37, 36, 35, 34, 33, 32, 31, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 19, 18, 17, 16, 15, 14, 13 , 12, 11 or 10 nucleotides or base pairs). In some embodiments, the DNA targeting moiety binds to a , 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 nucleotides or bases Base pair positions.
本發明之表現抑制系統可包含兩種或更多種表現抑制子。在一些實施例中,表現抑制系統中的表現抑制子各自包含不同的靶向部分。The expression inhibition system of the present invention may comprise two or more expression inhibitors. In some embodiments, the expression inhibitors in the expression inhibition system each comprise a different targeting moiety.
在一些實施例中,表現抑制系統包含:第一表現抑制子,其包含結合靶基因(例如外顯子、內含子,或外顯子與內含子邊界(例如剪接位點))的靶向部分;及第二表現抑制子,其包含結合靶基因(例如外顯子、內含子,或外顯子與內含子邊界(例如剪接位點))的靶向部分。在一些實施例中,表現抑制系統包含:第一表現抑制子,其包含結合靶基因(例如外顯子、內含子,或外顯子與內含子邊界(例如剪接位點))的靶向部分;及第二表現抑制子,其包含結合至可操作地連接至靶基因(例如MYC)之轉錄控制元件(例如啟動子或增強子)的靶向部分。在一些實施例中,表現抑制系統包含:第一表現抑制子,其包含結合至可操作地連接至靶基因之轉錄控制元件(例如啟動子或增強子)的靶向部分;及第二表現抑制子,其包含結合至可操作地連接至靶基因之轉錄控制元件(例如啟動子或增強子)的靶向部分。在一些實施例中,表現抑制系統包含:第一表現抑制子,其包含結合至與靶基因(例如MYC)近接或與可操作地連接至靶基因(例如MYC)之錨定序列介導性接合體相關之錨定序列的靶向部分;及第二表現抑制子,其包含結合至可操作地連接至靶基因(例如MYC)之轉錄控制元件(例如啟動子或增強子)的靶向部分。在一些實施例中,表現抑制系統包含:第一表現抑制子,其包含結合至與靶基因(例如MYC)近接或與可操作地連接至靶基因(例如MYC)之錨定序列介導性接合體相關之錨定序列的靶向部分;及第二表現抑制子,其包含結合至靶基因(例如MYC)(例如外顯子、內含子,或外顯子與內含子邊界(例如剪接位點))的靶向部分。在一些實施例中,表現抑制系統包含:第一表現抑制子,其包含結合至與靶基因(例如MYC)近接或與可操作地連接至靶基因(例如MYC)之錨定序列介導性接合體相關之錨定序列的靶向部分;及第二表現抑制子,其包含結合至與靶基因(例如MYC)近接或與可操作地連接至靶基因(例如MYC)之錨定序列介導性接合體相關之錨定序列的靶向部分。In some embodiments, the expression suppression system comprises: a first expression suppressor comprising a target that binds to a target gene (e.g., an exon, an intron, or an exon-intron boundary (e.g., a splice site)). and a second expression suppressor comprising a targeting moiety that binds to a target gene (eg, an exon, an intron, or an exon-intron boundary (eg, a splice site)). In some embodiments, the expression suppression system comprises: a first expression suppressor comprising a target that binds to a target gene (e.g., an exon, an intron, or an exon-intron boundary (e.g., a splice site)). and a second expression suppressor comprising a targeting moiety that binds to a transcriptional control element (eg, a promoter or enhancer) operably linked to a target gene (eg, MYC). In some embodiments, the expression suppression system comprises: a first expression suppressor comprising a targeting moiety that binds to a transcriptional control element (such as a promoter or enhancer) operably linked to a target gene; and a second expression suppressor A promoter comprising a targeting moiety that binds to a transcriptional control element (eg, a promoter or enhancer) operably linked to a target gene. In some embodiments, the expression suppression system comprises: a first expression suppressor comprising an anchor sequence mediated engagement that binds to a target gene (e.g., MYC) in proximity to or operably linked to a target gene (e.g., MYC) and a second expression suppressor comprising a targeting moiety that binds to a transcriptional control element (such as a promoter or enhancer) operably linked to a target gene (such as MYC). In some embodiments, the expression suppression system comprises: a first expression suppressor comprising an anchor sequence mediated engagement that binds to a target gene (e.g., MYC) in proximity to or operably linked to a target gene (e.g., MYC) targeting moieties of body-associated anchor sequences; and a second expression suppressor comprising binding to target genes (e.g. MYC) (e.g. exons, introns, or exon-intron boundaries (e.g. splicing) The targeting moiety of the site)). In some embodiments, the expression suppression system comprises: a first expression suppressor comprising an anchor sequence mediated engagement that binds to a target gene (e.g., MYC) in proximity to or operably linked to a target gene (e.g., MYC) a targeting portion of an anchor sequence associated with a body; and a second expression suppressor comprising a mediator that binds to an anchor sequence that is in close proximity to or operably linked to a target gene (such as MYC) The targeting moiety of the adapter-associated anchor sequence.
在一些實施例中,表現抑制系統包含:第一表現抑制子,其包含結合至第一位點(例如可操作地連接至靶基因(例如MYC)的啟動子中的第一位點)的靶向部分;及第二表現抑制子,其包含結合至第二位點(例如可操作地連接至靶基因(例如MYC)的啟動子中的第二位點)的靶向部分。第一位點與第二位點可為不同且不重疊的位點,例如第一位點與第二位點不共享共同的任何序列。第一位點與第二位點可為不同但重疊的位點,例如第一位點及第二位點包含不同序列,但共享共同的一些序列。In some embodiments, the expression suppression system comprises: a first expression suppressor comprising a target bound to a first site, such as a first site in a promoter operably linked to a target gene (eg, MYC) and a second expression suppressor comprising a targeting moiety that binds to a second site, eg, a second site in a promoter operably linked to a target gene (eg, MYC). The first and second positions may be distinct and non-overlapping positions, eg, the first and second positions do not share any sequence in common. The first site and the second site can be different but overlapping sites, eg, the first site and the second site comprise different sequences but share some sequences in common.
在一些實施例中,靶基因為MYC。在一些實施例中,MYC位於人類染色體8。在一些實施例中,如本文所述的表現抑制子或表現抑制系統結合至MYC的轉錄起點(TSS)。In some embodiments, the target gene is MYC. In some embodiments, MYC is located on
其他組合物 核酸及載體 本發明進一步部分地關於編碼本文所述之表現抑制子或表現抑制系統的核酸。在一些實施例中,表現抑制子可經由包含編碼表現抑制子之核酸的組合物提供,其中該核酸與足以達成表現抑制子在所關注之系統(例如特定細胞、組織、生物體等)中表現的其他序列相關。在一些實施例中,表現抑制系統可經由包含編碼表現抑制系統(例如表現抑制系統中的表現抑制子)之核酸的組合物提供,其中該核酸與足以達成表現抑制系統(例如表現抑制系統中的表現抑制子)在所關注之系統(例如特定細胞、組織、生物體等)中表現的其他序列相關。 Other compositions Nucleic acid and carrier The invention further relates, in part, to nucleic acids encoding the expression suppressors or expression suppression systems described herein. In some embodiments, the expression suppressor may be provided via a composition comprising a nucleic acid encoding the expression suppressor, wherein the nucleic acid is associated with sufficient expression of the expression suppressor in a system of interest (e.g., a particular cell, tissue, organism, etc.) other serial correlations. In some embodiments, the expression suppression system may be provided via a composition comprising a nucleic acid encoding the expression suppression system (e.g., an expression suppressor in the expression suppression system), wherein the nucleic acid is sufficiently compatible to achieve the expression suppression system (e.g., Expression inhibitors) other sequence associations expressed in a system of interest (eg, a particular cell, tissue, organism, etc.).
在一些特定實施例中,本發明提供編碼表現抑制子或其多肽部分之核酸的組合物。在一些此類實施例中,所提供的核酸可為或包括如本文所述的DNA、RNA或任何其他核酸部分或實體,且可藉由本文所述或者此項技術中可獲得之任何技術製備(例如合成、選殖、擴增、活體外或活體內轉錄等)。在一些實施例中,編碼表現抑制子或其多肽部分的所提供核酸可在操作上與一或多個複製、整合及/或表現信號締合,該等信號適當地且/或足以達成所提供之核酸在所關注之系統(例如特定細胞、組織、生物體等)中的整合、複製及/或表現。In some specific embodiments, the invention provides compositions of nucleic acids encoding expression inhibitors or polypeptide portions thereof. In some such embodiments, provided nucleic acids can be or include DNA, RNA, or any other nucleic acid portion or entity as described herein, and can be prepared by any technique described herein or available in the art (eg, synthesis, cloning, amplification, in vitro or in vivo transcription, etc.). In some embodiments, provided nucleic acids encoding expression inhibitors, or polypeptide portions thereof, may be operatively associated with one or more replication, integration and/or expression signals that are appropriate and/or sufficient to achieve the provided The integration, replication and/or expression of the nucleic acid in the concerned system (such as a specific cell, tissue, organism, etc.).
在一些實施例中,用於遞送本文所述之表現抑制子的組合物為或包含載體,例如病毒載體,該載體包含編碼如本文所述之表現抑制子或表現抑制子之一或多種組分的一或多種核酸。In some embodiments, a composition for delivery of an expression inhibitor as described herein is or comprises a vector, such as a viral vector, comprising an expression inhibitor as described herein or one or more components encoding an expression inhibitor as described herein of one or more nucleic acids.
在一些特定實施例中,本發明提供編碼表現抑制系統、一或多種表現抑制子或其多肽部分的核酸之組合物。在一些此類實施例中,所提供的核酸可為或包括如本文所述的DNA、RNA或任何其他核酸部分或實體,且可藉由本文所述或者此項技術中可獲得之任何技術製備(例如合成、選殖、擴增、活體外或活體內轉錄等)。在一些實施例中,編碼表現抑制系統、一或多種表現抑制子或其多肽部分的所提供核酸可在操作上與一或多種複製、整合及/或表現信號締合,該等信號適當地且/或足以達成所提供之核酸在所關注之系統(例如特定細胞、組織、生物體等)中的整合、複製及/或表現。In some specific embodiments, the invention provides compositions of nucleic acids encoding an expression suppression system, one or more expression suppressors, or polypeptide portions thereof. In some such embodiments, provided nucleic acids can be or include DNA, RNA, or any other nucleic acid portion or entity as described herein, and can be prepared by any technique described herein or available in the art (eg, synthesis, cloning, amplification, in vitro or in vivo transcription, etc.). In some embodiments, provided nucleic acids encoding an expression suppression system, one or more expression suppressors, or polypeptide portions thereof, may be operatively associated with one or more replication, integration, and/or expression signals, suitably and /or sufficient to achieve integration, replication and/or expression of the provided nucleic acid in the system of interest (eg, specific cells, tissues, organisms, etc.).
在一些實施例中,用於遞送本文所述之表現抑制系統的組合物為或包含載體,例如病毒載體,該載體包含編碼表現抑制系統之一或多種組分(例如如本文所述之表現抑制系統中的表現抑制子)的一或多種核酸。In some embodiments, a composition for delivery of an expression suppression system described herein is or comprises a vector, such as a viral vector, comprising a vector encoding one or more components of an expression suppression system (e.g., an expression suppression system as described herein). One or more nucleic acids expressing a suppressor in the system.
在一些實施例中,用於遞送本文所述之表現抑制子的組合物為或包含RNA,例如mRNA,該RNA包含編碼如本文所述之表現抑制子或表現抑制子之一或多種組分的一或多種核酸。In some embodiments, a composition for delivering an expression suppressor as described herein is or comprises RNA, such as mRNA, comprising a protein encoding an expression suppressor or one or more components of an expression suppressor as described herein. one or more nucleic acids.
在一些實施例中,用於遞送本文所述之表現抑制系統的組合物為或包含RNA,例如mRNA,其包含編碼表現抑制系統之一或多種組分(例如如本文所述之表現抑制系統中的表現抑制子)的一或多種核酸。In some embodiments, a composition for delivery of an expression suppression system described herein is or comprises RNA, such as mRNA, comprising a protein encoding one or more components of an expression suppression system (e.g., as in an expression suppression system described herein). One or more nucleic acids of expression inhibitors of .
如本文所述之核酸或編碼本文所述之蛋白質的核酸可併入載體中。載體,包括來源於逆轉錄病毒(諸如慢病毒)之載體,為達成長期基因轉移之適合工具,因為其允許轉殖基因長期穩定整合且在子細胞中繁殖。載體之實例包括表現載體、複製載體、探針產生載體及定序載體。表現載體可以病毒載體形式提供給細胞。病毒載體技術在此項技術中已熟知且描述於多個病毒學及分子生物學手冊中。適用作載體之病毒包括(但不限於)逆轉錄病毒、腺病毒、腺相關病毒、疱疹病毒及慢病毒。一般而言,適合載體含有在至少一種生物體中發揮功能之複製起點、啟動子序列、適宜限制性核酸內切酶位點及一或多種可選擇標記物。A nucleic acid as described herein or a nucleic acid encoding a protein described herein can be incorporated into a vector. Vectors, including those derived from retroviruses such as lentiviruses, are suitable tools for long-term gene transfer because they allow long-term stable integration of the transgene and propagation in daughter cells. Examples of vectors include expression vectors, replication vectors, probe generation vectors, and sequencing vectors. Expression vectors can be provided to cells as viral vectors. Viral vector technology is well known in the art and described in various virology and molecular biology manuals. Viruses suitable for use as vectors include, but are not limited to, retroviruses, adenoviruses, adeno-associated viruses, herpes viruses, and lentiviruses. In general, suitable vectors contain an origin of replication functional in at least one organism, a promoter sequence, suitable restriction endonuclease sites, and one or more selectable markers.
天然或合成核酸的表現典型地藉由將編碼所關注之基因的核酸可操作地連接至啟動子及將構築體併入表現載體來達成。載體可適於在真核生物中複製且整合。典型的選殖載體含有轉錄及轉譯終止子、起始序列及適用於表現所需核酸序列的啟動子。Expression of natural or synthetic nucleic acids is typically achieved by operably linking the nucleic acid encoding the gene of interest to a promoter and incorporating the construct into an expression vector. Vectors may be suitable for replication and integration in eukaryotes. A typical cloning vector contains transcriptional and translational terminators, initiation sequences and a promoter suitable for expression of the desired nucleic acid sequence.
其他啟動子元件(例如增強序列)可調控轉錄起始頻率。典型地,此等序列位於轉錄起點上游30-110 bp的區域中,但多種啟動子最近已顯示亦可在轉錄起始位點下游含有功能元件。Other promoter elements, such as enhancer sequences, can regulate the frequency of transcription initiation. Typically, these sequences are located in a region 30-110 bp upstream of the start of transcription, but various promoters have recently been shown to also contain functional elements downstream of the transcription start site.
在一些實施例中,本文所述之表現抑制子或系統作用於增強序列。在一些實施例中,增強序列為增強子、延伸段增強子、影子增強子、基因座控制區域(LCR)或超級增強子。在一些實施例中,超級增強子包含增強子及其他調控元件之叢集。在一些實施例中,此等序列位於轉錄起點上游或下游.2-2 Mb的區域中。在一些實施例中,該區域為非編碼區域。在一些實施例中,該區域含有至少一種與產生癌症之較高風險相關的SNP。在一些實施例中,該區域與靶基因(例如MYC)之長程調控相關。在一些實施例中,該區域具有細胞類型特異性。在一些實施例中,超級增強子藉由募集靶基因啟動子(例如MYC啟動子)來調節(例如增加或減少)靶基因表現,例如MYC表現。在一些實施例中,超級增強子與靶基因啟動子(例如MYC啟動子)經由增強子對接位點發生相互作用。在一些實施例中,增強子對接位點為錨定序列。在一些實施例中,增強子對接位點位於離靶基因啟動子(例如MYC啟動子)至少100 bp、200 bp、500 bp、1000 bp、1500 bp、2000 bp或3000 bp處。在一些實施例中,超級增強子區域具有至少100 bp、至少200 bp、至少300 bp、至少500 bp、至少1 kb、至少2 kb、至少3 kb、至少5 kb、至少10 kb、至少15 kb、至少20 kb或至少25 kb長度。In some embodiments, the expression inhibitors or systems described herein act on enhancer sequences. In some embodiments, the enhancer sequence is an enhancer, stretch enhancer, shadow enhancer, locus control region (LCR), or super enhancer. In some embodiments, a super-enhancer comprises a cluster of enhancers and other regulatory elements. In some embodiments, such sequences are located in a region .2-2 Mb upstream or downstream of the start of transcription. In some embodiments, this region is a non-coding region. In some embodiments, the region contains at least one SNP associated with a higher risk of developing cancer. In some embodiments, the region is associated with long-range regulation of a target gene (eg, MYC). In some embodiments, the region is cell type specific. In some embodiments, a super-enhancer modulates (eg, increases or decreases) target gene expression, eg, MYC expression, by recruiting a target gene promoter (eg, MYC promoter). In some embodiments, a super-enhancer interacts with a target gene promoter (eg, MYC promoter) via an enhancer docking site. In some embodiments, the enhancer docking site is an anchor sequence. In some embodiments, the enhancer docking site is located at least 100 bp, 200 bp, 500 bp, 1000 bp, 1500 bp, 2000 bp, or 3000 bp from the target gene promoter (eg, the MYC promoter). In some embodiments, the super-enhancer region has at least 100 bp, at least 200 bp, at least 300 bp, at least 500 bp, at least 1 kb, at least 2 kb, at least 3 kb, at least 5 kb, at least 10 kb, at least 15 kb , at least 20 kb or at least 25 kb in length.
啟動子元件之間的間距通常為靈活的,因此當元件相對彼此倒置或移動時,啟動子功能得以保持。在胸苷激酶(tk)啟動子中,啟動子元件之間的間距在活性開始下降之前可以增加至相隔50 bp。視啟動子而定,個別元件似乎可協作地或獨立地起活化轉錄作用。The spacing between promoter elements is usually flexible so that when elements are inverted or moved relative to each other, promoter function is maintained. In the thymidine kinase (tk) promoter, the spacing between promoter elements can increase to 50 bp apart before activity begins to decline. Depending on the promoter, individual elements appear to act cooperatively or independently to activate transcription.
適合啟動子之一個實例為即刻早期巨細胞病毒(CMV)啟動子序列。此啟動子序列為強組成型啟動子序列,其能夠驅使可操作地連接至其之任何聚核苷酸序列高量表現。在之一些實施例中,適合啟動子為延伸生長因子-1α (EF-1α)。然而,亦可使用其他組成型啟動子序列,包括(但不限於)猴病毒40 (SV40)早期啟動子、小鼠乳腺腫瘤病毒(MMTV)、人類免疫缺乏病毒(HIV)長末端重複序列(LTR)啟動子、MoMuLV啟動子、禽類白血病病毒啟動子、埃-巴二氏病毒(Epstein-Barr virus)即刻早期啟動子、勞斯肉瘤病毒(Rous sarcoma virus)啟動子,以及人類基因啟動子,諸如(但不限於)肌動蛋白啟動子、肌凝蛋白啟動子、血紅素啟動子及肌酸激酶啟動子。One example of a suitable promoter is the immediate early cytomegalovirus (CMV) promoter sequence. This promoter sequence is a strong constitutive promoter sequence capable of driving high expression of any polynucleotide sequence operably linked to it. In some embodiments, the suitable promoter is elongation growth factor-1α (EF-1α). However, other constitutive promoter sequences can also be used, including but not limited to Simian Virus 40 (SV40) early promoter, mouse mammary tumor virus (MMTV), human immunodeficiency virus (HIV) long terminal repeat (LTR) ) promoter, MoMuLV promoter, avian leukemia virus promoter, Epstein-Barr virus (Epstein-Barr virus) immediate early promoter, Rous sarcoma virus (Rous sarcoma virus) promoter, and human gene promoters such as But not limited to, actin promoter, myosin promoter, heme promoter, and creatine kinase promoter.
本發明不應解釋為限於使用任何特定啟動子或啟動子類別(例如組成型啟動子)。舉例而言,在一些實施例中,考慮誘導型啟動子作為本發明之一部分。在一些實施例中,使用誘導型啟動子提供分子開關,該分子開關能夠開啟可操作地連接至其的聚核苷酸序列之表現(當需要此類表現時)。在一些實施例中,使用誘導型啟動子提供分子開關,當不需要表現時,該分子開關能夠關閉表現。誘導型啟動子之實例包括(但不限於)金屬硫蛋白啟動子、糖皮質激素啟動子、孕酮啟動子及四環素啟動子。The present invention should not be construed as being limited to use with any particular promoter or class of promoters (eg, constitutive promoters). For example, inducible promoters are contemplated as part of the invention in some embodiments. In some embodiments, an inducible promoter is used to provide a molecular switch capable of turning on the expression of a polynucleotide sequence operably linked thereto when such expression is desired. In some embodiments, an inducible promoter is used to provide a molecular switch capable of turning off expression when it is not desired. Examples of inducible promoters include, but are not limited to, the metallothionein promoter, the glucocorticoid promoter, the progesterone promoter, and the tetracycline promoter.
在一些實施例中,待引入之表現載體亦可含有可選標記基因或報導基因或兩者以有助於自設法經由病毒載體轉染或感染之細胞群中鑑別及選擇表現細胞。在一些態樣中,可選標記物可攜載於單獨的DNA段上且用於共轉染程序。可選標記物與報導基因均可側接適當的轉錄控制序列以能夠在宿主細胞中表現。適用的可選標記物可包括例如抗生素抗性基因,諸如neo等。In some embodiments, the expression vector to be introduced may also contain a selectable marker gene or a reporter gene or both to facilitate the identification and selection of expressing cells from a population of cells seeking to be transfected or infected by the viral vector. In some aspects, selectable markers can be carried on separate DNA segments and used in co-transfection procedures. Both the selectable marker and the reporter gene may be flanked by appropriate transcriptional control sequences to enable expression in the host cell. Suitable selectable markers may include, for example, antibiotic resistance genes such as neo and the like.
在一些實施例中,報導基因可以用於鑑別潛在地經轉染的細胞及/或用於評價轉錄控制序列的功能。一般而言,報導基因為不存在於(報導基因之)接受者來源中或由(報導基因之)接受者來源表現的基因,且該基因所編碼之多肽的表現體現為一些容易偵測的特性,例如酶活性或可視化螢光。在DNA已引入接受者細胞中之後的適合時間分析報導基因之表現。適合的報導基因可包括編碼螢光素酶、β-半乳糖苷酶、氯黴素乙醯基轉移酶、分泌性鹼性磷酸酶的基因,或綠色螢光蛋白基因(例如Ui-Tei等人, 2000 FEBS Letters 479: 79-82)。適合的表現系統已熟知且可使用已知技術製備或商業購買。一般而言,顯示報導基因之最高表現量、具有最小5'側接區域的構築體經鑑別為啟動子。此類啟動子區域可連接至報導基因且用於評價藥劑調節啟動子驅動之轉錄的能力。In some embodiments, reporter genes can be used to identify potentially transfected cells and/or to assess the function of transcriptional control sequences. In general, a reporter gene is a gene that is absent from or expressed by the source of the recipient (of the reporter gene), and the expression of the polypeptide encoded by the gene shows some easily detectable characteristics , such as enzymatic activity or visualizing fluorescence. Expression of the reporter gene is analyzed at a suitable time after the DNA has been introduced into the recipient cells. Suitable reporter genes may include genes encoding luciferase, β-galactosidase, chloramphenicol acetyltransferase, secreted alkaline phosphatase, or the green fluorescent protein gene (e.g. Ui-Tei et al. , 2000 FEBS Letters 479: 79-82). Suitable presentation systems are well known and can be prepared using known techniques or purchased commercially. In general, the construct showing the highest expression of the reporter gene with the smallest 5' flanking region was identified as the promoter. Such promoter regions can be linked to reporter genes and used to assess the ability of agents to modulate promoter-driven transcription.
細胞本發明另外部分地關於包含本文所述之表現抑制子或表現抑制系統的細胞。熟習此項技術者已知的任何細胞,例如細胞株,例如適於表現重組多肽的細胞株,適於包含本文所述之表現抑制子或表現抑制系統。在一些實施例中,細胞(例如細胞株)可用於表現表現抑制子或表現抑制系統,例如本文所述之表現抑制子。在一些實施例中,細胞(例如細胞株)可用於表現或擴增編碼表現抑制子或表現抑制系統(例如本文所述之表現抑制子)的核酸,例如載體。在一些實施例中,細胞包含編碼表現抑制子或表現抑制系統(例如本文所述之表現抑制子)的核酸。 Cells The present invention additionally relates in part to cells comprising the expression suppressors or expression suppression systems described herein. Any cell, such as a cell line known to those skilled in the art, eg, a cell line suitable for expressing a recombinant polypeptide, is suitable for comprising an expression suppressor or expression suppression system as described herein. In some embodiments, cells (eg, cell lines) can be used to express an expression suppressor or expression suppression system, such as the expression suppressors described herein. In some embodiments, cells (eg, cell lines) can be used to express or amplify nucleic acids, eg, vectors, encoding expression suppressors or expression suppressor systems (eg, expression suppressors described herein). In some embodiments, a cell comprises a nucleic acid encoding an expression suppressor or expression suppressor system (eg, an expression suppressor described herein).
在一些實施例中,細胞包含編碼表現抑制系統之第一組分(例如第一表現抑制子)的第一核酸,及編碼表現抑制系統之第二組分(例如第二表現抑制子)的第二核酸。在其中細胞包含編碼包含兩個或更多個表現抑制子之表現抑制系統之核酸的一些實施例中,編碼各表現抑制子的序列安置於各別的核酸分子上,例如安置於不同載體上,例如編碼第一表現抑制子的第一載體及編碼第二表現抑制子的第二載體。在一些實施例中,編碼各表現抑制子的序列安置於相同核酸分子上,例如相同載體上。在一些實施例中,編碼表現抑制系統之核酸中的一些或全部整合於細胞之基因體DNA中。在一些實施例中,編碼表現抑制系統中之第一表現抑制子的核酸整合於細胞之基因體DNA中,且編碼表現抑制系統中之第二表現抑制子的核酸不整合於細胞之基因體DNA中(例如位於載體上)。在一些實施例中,編碼表現抑制系統中之第一及第二表現抑制子的核酸整合於細胞之基因體DNA中,例如整合於基因體DNA中之相同(例如相鄰或共定位)位點或不同位點。In some embodiments, the cell comprises a first nucleic acid encoding a first component of the expression suppression system (e.g., a first expression suppressor), and a first nucleic acid encoding a second component of the expression suppression system (e.g., a second expression suppressor). Two nucleic acids. In some embodiments wherein the cell comprises a nucleic acid encoding an expression inhibition system comprising two or more expression inhibitors, the sequences encoding each expression inhibitor are placed on separate nucleic acid molecules, for example on different vectors, For example, a first vector encoding a first expression suppressor and a second vector encoding a second expression suppressor. In some embodiments, the sequences encoding the respective expression inhibitors are placed on the same nucleic acid molecule, eg, on the same vector. In some embodiments, some or all of the nucleic acid encoding the expression suppression system is integrated in the gene body DNA of the cell. In some embodiments, the nucleic acid encoding the first expression suppressor in the expression suppression system is integrated in the gene body DNA of the cell, and the nucleic acid encoding the second expression suppressor in the expression suppression system is not integrated in the gene body DNA of the cell in (e.g. on a carrier). In some embodiments, the nucleic acids encoding the first and second expression suppressors in the expression suppression system are integrated in the genomic DNA of the cell, e.g., at the same (e.g., adjacent or colocalized) site in the genomic DNA or different locations.
可包含且/或表現本文所述之表現抑制系統或表現抑制子的細胞實例包括(但不限於)肝細胞、神經元細胞、內皮細胞、肌細胞及淋巴球。Examples of cells that may contain and/or express an expression suppression system or expression suppressor described herein include, but are not limited to, hepatocytes, neuronal cells, endothelial cells, myocytes, and lymphocytes.
本發明另外部分地關於藉由本文所述之方法或製程製備的細胞。在一些實施例中,本發明提供一種如下產生的細胞:提供本文所述之表現抑制子或表現抑制系統,提供細胞,及使該細胞與表現抑制子(或編碼表現抑制子的核酸,或包含該表現抑制子或核酸的組合物)或表現抑制系統(或編碼表現抑制系統的核酸,或包含該表現抑制系統或核酸的組合物)接觸。在一些實施例中,使細胞與表現抑制子接觸包含使該細胞與編碼表現抑制子的核酸在允許細胞產生表現抑制子的條件下接觸。在一些實施例中,使細胞與表現抑制子接觸包含使包含細胞的生物體與表現抑制子或編碼表現抑制子的核酸在允許細胞產生表現抑制子的條件下接觸。The invention additionally pertains to cells produced by the methods or processes described herein. In some embodiments, the invention provides a cell produced by providing an expression suppressor or expression suppression system described herein, providing the cell, and contacting the cell with the expression suppressor (or nucleic acid encoding the expression suppressor, or comprising The expression inhibitor or nucleic acid composition) or the expression inhibition system (or the nucleic acid encoding the expression inhibition system, or the composition comprising the expression inhibition system or nucleic acid) is contacted. In some embodiments, contacting the cell with the expression inhibitor comprises contacting the cell with a nucleic acid encoding the expression inhibitor under conditions that allow the cell to produce the expression inhibitor. In some embodiments, contacting the cell with the expression inhibitor comprises contacting an organism comprising the cell with the expression inhibitor or a nucleic acid encoding the expression inhibitor under conditions that allow the cell to produce the expression inhibitor.
不希望受理論所束縛,與尚未接觸表現抑制子或表現抑制系統的類似細胞相比,與本文所述之表現抑制子或表現抑制系統接觸的細胞可展現:靶基因(例如MYC)表現的降低及/或與以下相關之表觀遺傳標記物的改變:靶基因(例如MYC)、可操作地連接至靶基因(例如MYC)的轉錄控制元件,或近接於靶基因或與可操作地連接至靶基因(例如MYC)之錨定序列介導性接合體相關的錨定序列。在一些實施例中,展現靶基因(例如MYC)表現之該降低及/或表觀遺傳標記物之改變的細胞不包含表現抑制子或表現抑制系統。表觀之降低及/或遺傳標記物表現之改變可持久存在,例如在與表現抑制子或表現抑制系統接觸之後,存在至少1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23或24小時,或至少1、2、3、4、5、6、7、10或14天,或至少1、2、3、4或5週,或至少1、2、3、4、5、6、7、8、9、10、11或12個月,或至少1、2、3、4或5年(例如無限期地)。Without wishing to be bound by theory, cells contacted with an expression suppressor or expression suppression system described herein may exhibit a reduction in the expression of a target gene (e.g., MYC) compared to similar cells that have not been contacted with the expression suppressor or expression suppression system and/or changes in epigenetic markers associated with the target gene (eg MYC), transcriptional control elements operably linked to the target gene (eg MYC), or in proximity to the target gene or operably linked to the target gene The anchor sequence of the target gene (eg MYC) mediates the anchor sequence associated with the adapter. In some embodiments, cells exhibiting such reduction in expression of a target gene (eg, MYC) and/or changes in epigenetic markers do not comprise expression suppressors or expression suppression systems. The reduction in appearance and/or alteration of expression of a genetic marker may be persistent, for example, following contact with an expression suppressor or expression suppression system, in the presence of at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, or 24 hours, or at least 1, 2, 3, 4, 5, 6, 7, 10, or 14 hours days, or at least 1, 2, 3, 4, or 5 weeks, or at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12 months, or at least 1, 2, 3 , 4 or 5 years (eg indefinitely).
在一些實施例中,在表現抑制子或表現抑制系統不再存在於細胞中之後,表現抑制子或表現抑制系統先前接觸之細胞中的表現之降低及/或表觀遺傳標記物之改變得以保持,例如在表現抑制子或表現抑制系統不再存在於細胞中之後,保持至少1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23或24小時,或至少1、2、3、4、5、6、7、10或14天,或至少1、2、3、4或5週,或至少1、2、3、4、5、6、7、8、9、10、11或12個月,或至少1、2、3、4或5年(例如無限期地)。In some embodiments, the reduction in expression and/or changes in epigenetic markers in cells previously contacted with the expression inhibitor or expression inhibition system are maintained after the expression inhibitor or expression inhibition system is no longer present in the cell , for example, after the expression suppressor or expression suppression system is no longer present in the cell, for at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, or 24 hours, or at least 1, 2, 3, 4, 5, 6, 7, 10, or 14 days, or at least 1, 2, 3, 4, or 5 weeks, or at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12 months, or at least 1, 2, 3, 4, or 5 years (eg, indefinitely).
製造表現抑制系統及/或表現抑制子的方法 在一些實施例中,表現抑制子包含或為蛋白質且因此可藉由蛋白質製造方法產生。在一些實施例中,表現抑制系統(例如表現抑制系統中的表現抑制子)包含一或多種蛋白質且因此可藉由蛋白質製造方法產生。如熟習此項技術者將瞭解,蛋白質或多肽(其可包括於如本文所述之調節劑中)製造方法為此項技術中之常規方法。一般參見Smales及James (編), Therapeutic Proteins: Methods and Protocols (Methods in Molecular Biology), Humana Press (2005); 及Crommelin, Sindelar及Meibohm (編), Pharmaceutical Biotechnology: Fundamentals and Applications, Springer (2013)。Methods of Making Expression Suppressor Systems and/or Expression Suppressors In some embodiments, expression suppressors comprise or are proteins and thus can be produced by protein production methods. In some embodiments, the expression suppression system (eg, the expression suppressor in the expression suppression system) comprises one or more proteins and thus can be produced by protein production methods. As will be appreciated by those skilled in the art, methods of making proteins or polypeptides (which may be included in modulators as described herein) are routine in the art. See generally Smales and James (eds), Therapeutic Proteins: Methods and Protocols (Methods in Molecular Biology), Humana Press (2005); and Crommelin, Sindelar and Meibohm (eds), Pharmaceutical Biotechnology: Fundamentals and Applications, Springer (2013).
本發明組合物中的蛋白質或多肽可利用標準固相技術、以生物化學方式合成。此類方法包括排他性固相合成、部分固相合成方法、片段縮合、經典溶液合成。當肽相對較短(例如10 kDa)時及/或當其無法藉由重組技術產生(亦即,不由核酸序列編碼)且因此涉及不同化學試劑時,可使用此等方法。The proteins or polypeptides in the compositions of the invention can be biochemically synthesized using standard solid phase techniques. Such methods include exclusive solid phase synthesis, partial solid phase synthesis methods, fragment condensation, classical solution synthesis. These methods can be used when the peptide is relatively short (eg, 10 kDa) and/or when it cannot be produced by recombinant techniques (ie, not encoded by a nucleic acid sequence) and thus involves different chemical reagents.
固相合成程序在此項技術中已熟知且進一步描述於John Morrow Stewart及Janis Dillaha Young, Solid Phase Peptide Syntheses, 第2版, Pierce Chemical Company, 1984; 及Coin, I.等人, Nature Protocols, 2:3247-3256, 2007。Solid phase synthesis procedures are well known in the art and are further described in John Morrow Stewart and Janis Dillaha Young, Solid Phase Peptide Syntheses, 2nd Edition, Pierce Chemical Company, 1984; and Coin, I. et al., Nature Protocols, 2 :3247-3256, 2007.
對於較長肽而言,可使用重組方法。重組治療多肽製造方法在此項技術中為常規方法。一般參見Smales及James (編), Therapeutic Proteins: Methods and Protocols (Methods in Molecular Biology), Humana Press (2005); 及Crommelin, Sindelar & Meibohm (編), Pharmaceutical Biotechnology: Fundamentals and Applications, Springer (2013)。For longer peptides, recombinant methods can be used. Methods of manufacturing recombinant therapeutic polypeptides are routine in the art. See generally Smales and James (eds), Therapeutic Proteins: Methods and Protocols (Methods in Molecular Biology), Humana Press (2005); and Crommelin, Sindelar & Meibohm (eds), Pharmaceutical Biotechnology: Fundamentals and Applications, Springer (2013).
用於產生治療性醫藥蛋白質或多肽之例示性方法涉及在哺乳動物細胞中表現,但亦可使用昆蟲細胞、酵母、細菌或其他細胞在適當啟動子控制下產生重組蛋白。哺乳動物表現載體可包含非轉錄元件,諸如複製起點、適合啟動子及其他5'或3'側接非轉錄序列,及5'或3'非轉譯序列,諸如必需核糖體結合位點、聚腺苷酸化位點、剪接供體及受體位點,及終止序列。來源於SV40病毒基因體的DNA序列,例如SV40起點、早期啟動子、剪接位點及聚腺苷酸化位點,可用於提供供異源DNA序列表現而必需的其他基因元件。適用於細菌、真菌、酵母及哺乳動物細胞宿主的選殖及表現載體描述於Green及Sambrook, Molecular Cloning: A Laboratory Manual (第四版), Cold Spring Harbor Laboratory Press (2012)中。Exemplary methods for producing therapeutic pharmaceutical proteins or polypeptides involve expression in mammalian cells, but insect cells, yeast, bacteria or other cells can also be used to produce recombinant proteins under the control of appropriate promoters. Mammalian expression vectors may contain non-transcribed elements, such as an origin of replication, a suitable promoter, and other 5' or 3' flanking non-transcribed sequences, and 5' or 3' non-translated sequences, such as essential ribosome binding sites, polyadenylation nucleotide acidification sites, splice donor and acceptor sites, and termination sequences. DNA sequences derived from the SV40 viral genome, such as the SV40 origin, early promoter, splice sites, and polyadenylation sites, can be used to provide other genetic elements necessary for expression of heterologous DNA sequences. Selection and expression vectors suitable for bacterial, fungal, yeast, and mammalian cell hosts are described in Green and Sambrook, Molecular Cloning: A Laboratory Manual (Fourth Edition), Cold Spring Harbor Laboratory Press (2012).
在需要大量蛋白質或多肽的情況下,其可使用諸如以下文獻所述的技術產生:Brian Bray, Nature Reviews Drug Discovery, 2:587-593, 2003; 及Weissbach及Weissbach, 1988, Methods for Plant Molecular Biology, Academic Press, NY, 第VIII章, 第421-463頁。Where large quantities of proteins or polypeptides are desired, they can be produced using techniques such as those described in: Brian Bray, Nature Reviews Drug Discovery, 2:587-593, 2003; and Weissbach and Weissbach, 1988, Methods for Plant Molecular Biology , Academic Press, NY, Chapter VIII, pp. 421-463.
可利用各種哺乳動物細胞培養系統表現及製造重組蛋白。哺乳動物表現系統之實例包括CHO細胞、COS細胞、HeLA及BHK細胞株。用於產生蛋白質治療劑之宿主細胞培養方法描述於Zhou及Kantardjieff (編), Mammalian Cell Cultures for Biologics Manufacturing (Advances in Biochemical Engineering/Biotechnology), Springer (2014)中。本文所述之組合物可包括編碼重組蛋白的載體,諸如病毒載體,例如慢病毒載體。在一些實施例中,載體(例如病毒載體)可包含編碼重組蛋白的核酸。本文所述之組合物可包括脂質奈米粒子,該脂質奈米粒子囊封編碼重組蛋白的載體,諸如病毒載體,例如慢病毒載體。在一些實施例中,囊封載體(例如病毒載體)的脂質奈米粒子可包含編碼重組蛋白的核酸。Recombinant proteins can be expressed and produced using a variety of mammalian cell culture systems. Examples of mammalian expression systems include CHO cells, COS cells, HeLA and BHK cell lines. Host cell culture methods for the production of protein therapeutics are described in Zhou and Kantardjieff (eds), Mammalian Cell Cultures for Biologics Manufacturing (Advances in Biochemical Engineering/Biotechnology), Springer (2014). The compositions described herein may include vectors encoding recombinant proteins, such as viral vectors, eg lentiviral vectors. In some embodiments, a vector (eg, a viral vector) may comprise nucleic acid encoding a recombinant protein. The compositions described herein may include lipid nanoparticles that encapsulate recombinant protein-encoding vectors, such as viral vectors, eg lentiviral vectors. In some embodiments, a lipid nanoparticle encapsulating a vector (eg, a viral vector) can comprise a nucleic acid encoding a recombinant protein.
蛋白質治療劑之純化描述於Franks, Protein Biotechnology: Isolation, Characterization, and Stabilization, Humana Press (2013); 及Cutler, Protein Purification Protocols (Methods in Molecular Biology), Humana Press (2010)。蛋白質治療劑之調配描述於Meyer (編), Therapeutic Protein Drug Products: Practical Approaches to formulation in the Laboratory, Manufacturing, and the Clinic, Woodhead Publishing Series (2012)。Purification of protein therapeutics is described in Franks, Protein Biotechnology: Isolation, Characterization, and Stabilization, Humana Press (2013); and Cutler, Protein Purification Protocols (Methods in Molecular Biology), Humana Press (2010). The formulation of protein therapeutics is described in Meyer (ed.), Therapeutic Protein Drug Products: Practical Approaches to formulation in the Laboratory, Manufacturing, and the Clinic, Woodhead Publishing Series (2012).
蛋白質包含一或多個胺基酸。胺基酸包括可併入多肽鏈中(例如經由一或多個肽鍵的形成)的任何化合物及/或物質。在一些實施例中,胺基酸具有通式結構H 2N-C(H)I-COOH。在一些實施例中,胺基酸為天然存在之胺基酸。在一些實施例中,胺基酸為非天然胺基酸;在一些實施例中,胺基酸為D-胺基酸;在一些實施例中,胺基酸為L-胺基酸。「標準胺基酸」係指通常發現於天然存在之肽中之二十種標準L-胺基酸中的任一者。「非標準胺基酸」係指除標準胺基酸之外的任何胺基酸,無論其以合成方式製備或自天然來源獲得。在一些實施例中,相較於上述通式結構,胺基酸(包括多肽中之羧基及/或胺基端胺基酸)可含有結構修飾。舉例而言,在一些實施例中,相較於通式結構,可藉由(例如胺基、羧酸基、一或多個質子及/或羥基)甲基化、醯胺化、乙醯化、聚乙二醇化、糖基化、磷酸化及/或取代來修飾胺基酸。在一些實施例中,與含有在其他方面相同的未經修飾胺基酸之多肽相比,此類修飾可例如改變含有經修飾胺基酸之多肽的循環半衰期。在一些實施例中,與含有在其他方面相同的未經修飾胺基酸之多肽相比,此類修飾不能顯著改變含有經修飾氨基酸之多肽的相關活性。如自上下文顯見,在一些實施例中,術語「胺基酸」可用於指游離胺基酸;在一些實施例中,其可用於指多肽之胺基酸殘基。 Proteins contain one or more amino acids. Amino acids include any compound and/or substance that can be incorporated into a polypeptide chain, eg, via the formation of one or more peptide bonds. In some embodiments, the amino acid has the general structure H2NC (H)I-COOH. In some embodiments, the amino acid is a naturally occurring amino acid. In some embodiments, the amino acid is an unnatural amino acid; in some embodiments, the amino acid is a D-amino acid; in some embodiments, the amino acid is an L-amino acid. "Standard amino acid" refers to any of the twenty standard L-amino acids commonly found in naturally occurring peptides. "Non-standard amino acid" means any amino acid other than a standard amino acid, whether prepared synthetically or obtained from a natural source. In some embodiments, amino acids (including carboxyl and/or amino terminal amino acids in polypeptides) may contain structural modifications compared to the above general structure. For example, in some embodiments, compared with the general structure, methylation, amidation, acetylation (such as amine group, carboxylic acid group, one or more protons and/or hydroxyl group) can be used , PEGylation, glycosylation, phosphorylation and/or substitution to modify amino acids. In some embodiments, such modifications can, for example, alter the circulating half-life of a polypeptide containing a modified amino acid compared to a polypeptide containing the otherwise identical unmodified amino acid. In some embodiments, such modifications do not significantly alter the associated activity of a polypeptide containing the modified amino acid as compared to a polypeptide containing the otherwise identical unmodified amino acid. As apparent from the context, in some embodiments, the term "amino acid" may be used to refer to a free amino acid; in some embodiments, it may be used to refer to an amino acid residue of a polypeptide.
醫藥組合物、調配物、遞送及投藥 本發明另外部分地關於包含表現抑制子或表現抑制系統(例如本文所述之表現抑制子)的醫藥組合物;包含編碼表現抑制子或表現抑制系統(例如本文所述之表現抑制子)之核酸的醫藥組合物;及/或將表現抑制子或表現抑制系統(例如本文所述之表現抑制子)遞送至細胞、組織、器官及/或個體的組合物。Pharmaceutical Compositions, Formulations, Delivery and Administration The present invention additionally relates in part to pharmaceutical compositions comprising expression inhibitors or expression inhibition systems such as those described herein; comprising encoding expression inhibitors or expression inhibition systems (e.g. Pharmaceutical compositions of nucleic acids expressing inhibitors described herein); and/or compositions for delivery of expressing inhibitors or expression inhibiting systems (such as expressing inhibitors described herein) to cells, tissues, organs and/or individuals .
如本文所用,術語「醫藥組合物」係指連同醫藥學上可接受之一或多種載劑(例如熟習此項技術者已知的醫藥學上可接受之載劑)一起調配的活性劑(例如表現抑制子或表現受體之核酸,例如表現抑制系統,例如表現抑制系統中的表現抑制子,或編碼其的核酸)。在一些實施例中,活性劑以適合投與之單位劑量的量存在於治療方案中,其在投與相關群體時顯示統計學上顯著之達成預定治療作用的機率。在一些實施例中,包含本發明之表現抑制子的醫藥組合物包含表現抑制子或編碼其的核酸。在一些實施例中,包含本發明之表現抑制系統的醫藥組合物包含表現抑制系統中之各種表現抑制子或編碼其的核酸(例如若表現抑制系統包含第一表現抑制子及第二表現抑制子,則醫藥組合物包含第一及第二表現抑制子)。在一些實施例中,醫藥組合物包含表現抑制系統中之少於全部的表現抑制子,該表現抑制系統包含複數種表現抑制子。舉例而言,表現抑制系統可包含第一表現抑制子及第二表現抑制子,且第一醫藥組合物可包含第一表現抑制子或編碼其的核酸且第二醫藥組合物可包含第二表現抑制子或編碼其的核酸。在一些實施例中,醫藥組合物可包含一或多種表現抑制子或編碼其之核酸的共調配物。As used herein, the term "pharmaceutical composition" refers to an active agent (such as A nucleic acid that expresses a suppressor or expresses a receptor, such as an expression suppressor system, such as an express suppressor in an expression suppressor system, or a nucleic acid encoding the same). In some embodiments, the active agent is present in the therapeutic regimen in an amount suitable for administration in a unit dosage that exhibits a statistically significant probability of achieving a predetermined therapeutic effect when administered to a relevant population. In some embodiments, a pharmaceutical composition comprising an expressed inhibitor of the invention comprises an expressed inhibitor or a nucleic acid encoding the same. In some embodiments, pharmaceutical compositions comprising the expression inhibition system of the present invention comprise various expression inhibitors in the expression inhibition system or nucleic acids encoding them (for example, if the expression inhibition system comprises a first expression inhibitor and a second expression inhibitor , the pharmaceutical composition comprises the first and the second expression inhibitor). In some embodiments, the pharmaceutical composition comprises less than all of the expression inhibitors in an expression inhibition system comprising a plurality of expression inhibitors. For example, an expression inhibitory system can comprise a first expression inhibitor and a second expression inhibitor, and a first pharmaceutical composition can comprise the first expression inhibitor or a nucleic acid encoding the same and a second pharmaceutical composition can comprise the second expression inhibitor. A suppressor or a nucleic acid encoding the same. In some embodiments, a pharmaceutical composition may comprise a co-formulation of one or more expression inhibitors or nucleic acids encoding the same.
在一些實施例中,醫藥組合物可專門調配成固體或液體形式投與,包括針對以下調適之彼等形式:經口投藥,例如頓服藥(水性或非水性溶液或懸浮液)、錠劑(例如針對頰內、舌下及全身性吸收之錠劑)、大丸劑、散劑、顆粒、施用於舌部之糊劑;非經腸投藥,例如皮下、肌肉內、靜脈內或硬膜外注射,例如作為無菌溶液或懸浮液,或持續釋放型調配物;局部施用,例如以乳膏、軟膏或控制釋放型貼片或噴霧形式施用於皮膚、肺或口腔;陰道內或直腸內投藥,例如子宮托、乳膏或發泡體形式;舌下;眼部;經皮;或鼻、肺及/或其他黏膜表面。In some embodiments, pharmaceutical compositions may be specially formulated for administration in solid or liquid form, including those forms adapted for oral administration, such as drenches (aqueous or non-aqueous solutions or suspensions), lozenges ( e.g. lozenges for buccal, sublingual and systemic absorption), boluses, powders, granules, pastes for tongue application; parenteral administration, e.g. subcutaneous, intramuscular, intravenous or epidural injection, For example, as a sterile solution or suspension, or a sustained-release formulation; topical application, for example, to the skin, lungs or mouth in the form of a cream, ointment, or controlled-release patch or spray; intravaginal or rectal administration, for example, to the uterus In the form of pads, creams, or foams; sublingually; ocularly; transdermally; or on nasal, pulmonary, and/or other mucosal surfaces.
如本文所用,術語「醫藥學上可接受」係指在合理醫學判斷範疇內,適合與人類及動物組織接觸而無過度毒性、刺激、過敏反應或其他問題或併發症、與合理益處/風險比相稱的彼等化合物、材料、組合物及/或劑型。As used herein, the term "pharmaceutically acceptable" means, within the scope of sound medical judgment, suitable for contact with human and animal tissues without undue toxicity, irritation, allergic reaction or other problems or complications, and reasonable benefit/risk ratio. commensurate with those compounds, materials, compositions and/or dosage forms.
如本文所用,術語「醫藥學上可接受之載劑」意謂涉及將本發明化合物自身體之一個器官或部分載運或輸送至身體之另一器官或部分的醫藥學上可接受之材料、組合物或媒劑,諸如液體或固體填充劑、稀釋劑、賦形劑或溶劑囊封材料。各載劑就與調配物之其他成分相容且對患者無害而言必須為「可接受的」。在一些實施例中,舉例而言,可充當醫藥學上可接受之載劑的材料包括:糖,諸如乳糖、葡萄糖及蔗糖;澱粉,諸如玉米澱粉及馬鈴薯澱粉;纖維素及其衍生物,諸如羧甲基纖維素鈉、乙基纖維素及乙酸纖維素;粉末狀黃蓍膠;麥芽;明膠;滑石;賦形劑,諸如可可脂及栓劑蠟;油類,諸如花生油、棉籽油、紅花油、芝麻油、橄欖油、玉米油及大豆油;二醇類,諸如丙二醇;多元醇,諸如甘油、山梨糖醇、甘露糖醇及聚乙二醇;酯類,諸如油酸乙酯及月桂酸乙酯;瓊脂;緩衝劑,諸如氫氧化鎂及氫氧化鋁;褐藻酸;無熱原質水;等張生理鹽水;林格氏溶液(Ringer's solution);乙醇;pH緩衝溶液;聚酯、聚碳酸酯及/或聚酸酐;及醫藥調配物中採用之其他無毒相容物質。As used herein, the term "pharmaceutically acceptable carrier" means a pharmaceutically acceptable material, combination or material involved in carrying or transporting a compound of the present invention from one organ or part of the body to another organ or part of the body. Substance or vehicle, such as liquid or solid filler, diluent, excipient or solvent encapsulating material. Each carrier must be "acceptable" in the sense of being compatible with the other ingredients of the formulation and not injurious to the patient. In some embodiments, materials that may serve as pharmaceutically acceptable carriers include, for example: sugars, such as lactose, glucose, and sucrose; starches, such as corn starch and potato starch; cellulose and its derivatives, such as Sodium carboxymethylcellulose, ethylcellulose, and cellulose acetate; powdered gum tragacanth; malt; gelatin; talc; excipients such as cocoa butter and suppository waxes; oils such as peanut oil, cottonseed oil, safflower Oils, sesame oil, olive oil, corn oil, and soybean oil; glycols, such as propylene glycol; polyols, such as glycerin, sorbitol, mannitol, and polyethylene glycol; esters, such as ethyl oleate and lauric acid Ethyl esters; agar; buffers such as magnesium hydroxide and aluminum hydroxide; alginic acid; pyrogen-free water; isotonic saline; Ringer's solution; ethanol; pH buffer solution; polyester, poly Carbonates and/or polyanhydrides; and other non-toxic compatible substances used in pharmaceutical formulations.
如本文所用,術語「醫藥學上可接受之鹽」係指此類化合物之鹽,其適合在醫藥背景下使用,亦即,在合理醫學判斷之範疇內,適合與人類及低等動物之組織接觸使用而無異常毒性、刺激、過敏反應及其類似者且與合理的益處/風險比相稱的鹽。醫藥學上可接受的鹽在此項技術中已熟知。舉例而言,S. M. Berge等人在J. Pharmaceutical Sciences, 66: 1-19 (1977)中詳細描述了醫藥學上可接受之鹽。在一些實施例中,醫藥學上可接受之鹽包括(但不限於)無毒酸加成鹽,其為胺基與無機酸或與有機酸形成之鹽或藉由使用此項技術中所用之其他方法(諸如離子交換)形成之鹽,無機酸諸如鹽酸、氫溴酸、磷酸、硫酸及過氯酸,有機酸諸如乙酸、順丁烯二酸、酒石酸、檸檬酸、丁二酸或丙二酸。在一些實施例中,醫藥學上可接受之鹽包括(但不限於)己二酸鹽、褐藻酸鹽、抗壞血酸鹽、天冬胺酸鹽、苯磺酸鹽、苯甲酸鹽、硫酸氫鹽、硼酸鹽、丁酸鹽、樟腦酸鹽、樟腦磺酸鹽、檸檬酸鹽、環戊烷丙酸鹽、二葡糖酸鹽、十二烷基硫酸鹽、乙烷磺酸鹽、甲酸鹽、反丁烯二酸鹽、葡庚糖酸鹽、甘油磷酸鹽、葡糖酸鹽、半硫酸鹽、庚酸鹽、己酸鹽、氫碘酸鹽、2-羥基-乙烷磺酸鹽、乳糖酸鹽、乳酸鹽、月桂酸鹽、月桂基硫酸鹽、蘋果酸鹽、順丁烯二酸鹽、丙二酸鹽、甲烷磺酸鹽、2-萘磺酸鹽、菸鹼酸鹽、硝酸鹽、油酸鹽、草酸鹽、棕櫚酸鹽、雙羥萘酸鹽、果膠酸鹽、過硫酸鹽、3-苯基丙酸鹽、磷酸鹽、苦味酸鹽、特戊酸鹽、丙酸鹽、硬脂酸鹽、丁二酸鹽、硫酸鹽、酒石酸鹽、硫氰酸鹽、對甲苯磺酸鹽、十一烷酸鹽、戊酸鹽及其類似鹽。代表性鹼金屬或鹼土金屬鹽包括鈉鹽、鋰鹽、鉀鹽、鈣鹽、鎂鹽及類似鹽。在一些實施例中,適當時,醫藥學上可接受之鹽包括無毒銨、四級銨及使用相對離子形成之胺陽離子,該等相對離子諸如鹵離子、氫氧根、羧酸根、甲酸根、硫酸根、磷酸根、硝酸根、具有1至6個碳原子之烷基、磺酸根及芳基磺酸根。As used herein, the term "pharmaceutically acceptable salts" refers to salts of such compounds which are suitable for use in a medical context, that is, which, within the scope of sound medical judgment, are suitable for use with tissues of humans and lower animals. Salts that are used by contact without unusual toxicity, irritation, allergic reaction and the like and commensurate with a reasonable benefit/risk ratio. Pharmaceutically acceptable salts are well known in the art. For example, S. M. Berge et al. describe pharmaceutically acceptable salts in detail in J. Pharmaceutical Sciences, 66: 1-19 (1977). In some embodiments, pharmaceutically acceptable salts include, but are not limited to, non-toxic acid addition salts, which are salts of an amino group with an inorganic acid or with an organic acid or obtained by using other compounds used in the art. Salts formed by methods such as ion exchange, inorganic acids such as hydrochloric acid, hydrobromic acid, phosphoric acid, sulfuric acid and perchloric acid, organic acids such as acetic acid, maleic acid, tartaric acid, citric acid, succinic acid or malonic acid . In some embodiments, pharmaceutically acceptable salts include, but are not limited to, adipate, alginate, ascorbate, aspartate, besylate, benzoate, bisulfate , Borate, Butyrate, Camphorate, Camphorsulfonate, Citrate, Cyclopentanepropionate, Digluconate, Lauryl Sulfate, Ethanesulfonate, Formate , fumarate, glucoheptonate, glycerophosphate, gluconate, hemisulfate, heptanoate, hexanoate, hydroiodide, 2-hydroxy-ethanesulfonate, Lactobionate, lactate, laurate, lauryl sulfate, malate, maleate, malonate, methanesulfonate, 2-naphthalenesulfonate, nicotinate, nitric acid Salt, oleate, oxalate, palmitate, pamoate, pectate, persulfate, 3-phenylpropionate, phosphate, picrate, pivalate, propionate salts, stearates, succinates, sulfates, tartrates, thiocyanates, p-toluenesulfonates, undecanoates, valerates and similar salts. Representative alkali or alkaline earth metal salts include sodium, lithium, potassium, calcium, magnesium, and the like. In some embodiments, where appropriate, pharmaceutically acceptable salts include nontoxic ammonium, quaternary ammonium, and amine cations formed using counterions such as halides, hydroxides, carboxylates, formates, Sulfate, phosphate, nitrate, alkyl having 1 to 6 carbon atoms, sulfonate and arylsulfonate.
在多個實施例中,本發明提供本文所述之醫藥組合物,其具有醫藥學上可接受之賦形劑。醫藥學上可接受之賦形劑包括適用於製備通常安全、無毒且合乎需要之醫藥組合物的賦形劑,且包括對於獸醫學用途以及人類醫藥用途可接受之賦形劑。此類賦形劑可為固體、液體、半固體,或在氣溶膠組合物情況下為氣體。In various embodiments, the present invention provides a pharmaceutical composition described herein having a pharmaceutically acceptable excipient. Pharmaceutically acceptable excipients include excipients suitable for the manufacture of generally safe, nontoxic and desirable pharmaceutical compositions, and include excipients acceptable for veterinary use as well as human pharmaceutical use. Such excipients may be solid, liquid, semi-solid, or, in the case of aerosol compositions, a gas.
可依循習知製藥技術製備醫藥製劑:對於錠劑形式,涉及研磨、混合、造粒及必要時之壓縮;或對於硬明膠膠囊形式,涉及研磨、混合及填充。當使用液體載劑時,製劑將呈糖漿、酏劑、乳液或水性或非水性溶液或懸浮液形式。此類液體調配物可直接經口投與。The pharmaceutical preparations can be prepared following conventional pharmaceutical techniques involving milling, mixing, granulating and, if necessary, compression for tablet form or milling, mixing and filling for hard gelatin capsule form. When a liquid carrier is used, the formulation will be in the form of a syrup, elixir, emulsion or aqueous or non-aqueous solution or suspension. Such liquid formulations can be administered directly orally.
在一些實施例中,醫藥組合物可調配成經由任何投藥途徑遞送至細胞及/或個體。向個體投與之模式可包括注射、輸注、吸入、鼻內、眼內、體表遞送、插管內遞送或攝入。注射包括(但不限於)靜脈內、肌肉內、動脈內、鞘內、心室內、囊內、眶內、心內、皮內、腹膜內、經氣管、皮下、表皮下、關節內、囊下、蛛網膜下、脊柱內、腦脊髓內,及胸骨內注射及輸注。在一些實施例中,投藥包括氣溶膠吸入,例如利用霧化。在一些實施例中,投藥為全身的(例如經口、直腸、鼻、舌下、頰內或非經腸)、腸內的(例如全系統效應,但經由胃腸道遞送)或局部的(例如局部施用於皮膚、玻璃體內注射)。在一些實施例中,一或多種組合物全身性投與。在一些實施例中,投藥為非腸胃外的且治療劑為非經腸治療劑。在一些特定實施例中,投藥可經由支氣管(例如藉由支氣管滴注)、頰內、真皮(可為或包含例如體表至真皮、皮內、真皮間、經皮等中之一或多者)、腸內、動脈內、皮內、胃內、髓內、肌內、鼻內、腹膜內、鞘內、靜脈內、心室內、特定器官內(例如肝內)、黏膜、鼻、經口、直腸、皮下、舌下、體表、氣管(例如藉由氣管內滴注)、陰道、經玻璃體等。在一些實施例中,投藥可為單次劑量。在一些實施例中,投藥可涉及間歇性給藥(例如時間分隔開之複數次劑量)及/或定期給藥(例如以恆定時段分隔開之個別劑量)。在一些實施例中,投藥可涉及連續給藥(例如灌注)至少所選時段。在一些實施例中,在一次治療期間或在作為治療療程的一段時間內,可向個體給與六次、八次、十次、12次、15次或20次或更多次投藥。In some embodiments, pharmaceutical compositions can be formulated for delivery to cells and/or individuals via any route of administration. The mode of administration to a subject may include injection, infusion, inhalation, intranasal, intraocular, topical delivery, intracatheter delivery, or ingestion. Injections include (but are not limited to) intravenous, intramuscular, intraarterial, intrathecal, intraventricular, intracapsular, intraorbital, intracardiac, intradermal, intraperitoneal, transtracheal, subcutaneous, subcutaneous, intraarticular, subcapsular , subarachnoid, intraspinal, intracerebrospinal, and intrasternal injection and infusion. In some embodiments, administration comprises aerosol inhalation, eg, by nebulization. In some embodiments, administration is systemic (e.g., oral, rectal, nasal, sublingual, buccal, or parenteral), enteral (e.g., systemic effects, but delivered via the gastrointestinal tract), or topical (e.g., Topical application to the skin, intravitreal injection). In some embodiments, one or more compositions are administered systemically. In some embodiments, the administration is non-parenteral and the therapeutic agent is a parenteral therapeutic agent. In some specific embodiments, administration may be via one or more of bronchi (e.g., by bronchial instillation), buccal, dermis (which may be or include, e.g., body surface to dermis, intradermal, interdermal, transdermal, etc. ), enteral, intraarterial, intradermal, intragastric, intramedullary, intramuscular, intranasal, intraperitoneal, intrathecal, intravenous, intraventricular, intraspecific (e.g. intrahepatic), mucosal, nasal, oral , rectal, subcutaneous, sublingual, superficial, tracheal (eg by intratracheal instillation), vaginal, transvitreous, etc. In some embodiments, administration can be a single dose. In some embodiments, administration can involve intermittent administration (eg, multiple doses spaced apart in time) and/or periodic administration (eg, individual doses separated by a constant period of time). In some embodiments, administration can involve continuous administration (eg, infusion) for at least a selected period of time. In some embodiments, six, eight, ten, 12, 15, or 20 or more administrations may be administered to an individual during a single treatment or over a period of time as a course of treatment.
在一些實施例中,可視需要投藥,例如只要與疾病、病症或病狀相關的症狀持續存在。在一些實施例中,可在個體生命之剩餘時間內指定重複投藥。治療期可變化且可為例如一天、兩天、三天、一週、兩週、一個月、兩個月、三個月、六個月、一年或更長。In some embodiments, administration is optional, eg, as long as symptoms associated with the disease, disorder or condition persist. In some embodiments, repeated dosing may be prescribed for the remainder of the subject's life. The treatment period can vary and can be, for example, one day, two days, three days, one week, two weeks, one month, two months, three months, six months, one year or longer.
劑量 所投與之藥劑或組合物的劑量可基於以下而變化:例如所治療之病狀、疾病之嚴重度、個體的個別參數,包括年齡、生理狀況、體型及體重、治療持續時間、待進行之治療類型(若存在)、特定投藥途徑及類似因素。因此,本文所述之藥劑的投與劑量可視此類不同參數而定。所投與之組合物的劑量亦可視其他因素而變化,如個體之性別、一般醫學狀況及所治療之病症嚴重程度。可能需要向個體提供一定劑量的本文所揭示之調節劑或調節劑組合,該劑量作為單次靜脈內輸注係在約1 mg/kg至6 mg/kg範圍內,但環境決定時,亦可投與更低或更高劑量。必要時可重複給藥,例如每天一次(例如1-30天)、每3天一次(例如1-30天)、每5天一次(例如1-30天)、每週一次(例如1-6週或2-5週)。在一些實施例中,劑量可包括(但不限於) 1.0 mg/kg-6mg/kg、1.0 mg/kg-5 mg/kg、1.0 mg/kg-4 mg/kg、1.0-3.0mg/kg、1.5 mg/kg-3.0mg/kg、1.0 mg/kg-1.5 mg/kg、1.5 mg/kg-3 mg/kg、3 mg/kg-4 mg/kg、4 mg/kg-5 mg/kg或5 mg/kg-6 mg/kg。劑量可投與多次,例如一週一次或兩次,或每1或2週一次。在一些實施例中,向個體提供一定劑量的本文所揭示調節劑或調節劑組合,該劑量作為多次靜脈內輸注係在約1 mg/kg至6 mg/kg範圍內,但環境決定時,亦可投與更低或更高劑量。 dose The dose of the agent or composition administered may vary based on, for example, the condition being treated, the severity of the disease, individual parameters of the individual including age, physiological condition, size and weight, duration of treatment, the The type of treatment (if any), the particular route of administration, and similar factors. Accordingly, the administered doses of the agents described herein can be determined according to such various parameters. The dosage of the composition administered may also vary depending on other factors, such as the sex of the subject, the general medical condition and the severity of the condition being treated. It may be desirable to provide an individual with a modulator or combination of modulators disclosed herein at a dose in the range of about 1 mg/kg to 6 mg/kg as a single intravenous infusion, but as circumstances dictate, can also be administered with lower or higher doses. Dosing can be repeated if necessary, for example once a day (for example 1-30 days), once every 3 days (for example 1-30 days), once every 5 days (for example 1-30 days), once a week (for example 1-6 week or 2-5 weeks). In some embodiments, doses may include, but are not limited to, 1.0 mg/kg-6 mg/kg, 1.0 mg/kg-5 mg/kg, 1.0 mg/kg-4 mg/kg, 1.0-3.0 mg/kg, 1.5 mg/kg-3.0 mg/kg, 1.0 mg/kg-1.5 mg/kg, 1.5 mg/kg-3 mg/kg, 3 mg/kg-4 mg/kg, 4 mg/kg-5 mg/kg or 5 mg/kg-6 mg/kg. Doses may be administered multiple times, such as once or twice a week, or every 1 or 2 weeks. In some embodiments, an individual is provided with a modulator or combination of modulators disclosed herein at a dose in the range of about 1 mg/kg to 6 mg/kg as multiple intravenous infusions, but as circumstances dictate, Lower or higher doses may also be administered.
如本文所揭示之調節劑或調節劑組合可每3-5天投與一次劑量,重複總共至少3次劑量。或者,如本文所揭示之調節劑或調節劑組合可以3 mg/kg、每5天一次投與25天。或者,如本文所揭示之調節劑或調節劑組合可以1.0-5.0 mg/kg、每3-5天一次投與1-10次劑量。或者,如本文所揭示之調節劑或調節劑組合可以1.0-3.0 mg/kg、每5天一次投與3次劑量,接著每3天一次投與3次劑量。或者,如本文所揭示之調節劑或調節劑組合可以1.0-3.0 mg/kg、每5天一次投與4次劑量,接著每3天一次投與3次劑量。或者,如本文所揭示之調節劑或調節劑組合可以6 mg/kg、每5天一次投與1-10次劑量。或者,如本文所揭示之調節劑或調節劑組合可以3 mg/kg、每5天一次投與1-10次劑量。或者,如本文所揭示之調節劑或調節劑組合可以1.5 mg/kg、每5天一次投與2次劑量;以3 mg/kg、每5天一次投與3次劑量;以3 mg/kg、每3天一次投與1次劑量。或者,如本文所揭示之調節劑或調節劑組合可以6 mg/kg、每5天一次,或以1.5 mg/kg、一天一次投與5天,停藥2天。給藥時程可視情況以其他時間間隔重複且可經由各種非經腸途徑,在適當調節劑量及時程的情況下給與劑量。在一些實施例中,調節劑或調節劑組合的給與可包括1.0 mg/kg至6.0 mg/kg之間的劑量,其視情況每週給與一次、每週給與兩次,或每隔一週給與一次。一般技術者將認識到,在選擇如本文所揭示之調節劑或調節劑組合的劑量時,可考慮多種因素,且在治療過程期間,可增加或減少投藥劑量及/或頻率。視需要可重複給藥,證據表明至少2至8次劑量之後觀測到腫瘤體積減小。與順鉑、索拉非尼(sorafenib)或小分子比較物相比,本文所揭示之投藥劑量及時程對個體之總體重顯示最小影響。本發明方法可包括每隔一定時間使用CT及/或PET/CT或MRI量測腫瘤反應。亦可監測腫瘤標記物的血液水準。需要時,可根據成像結果及/或標記物血液水準來調節劑量及/或投藥時程。A modulator or combination of modulators as disclosed herein may be administered in one dose every 3-5 days, repeated for a total of at least 3 doses. Alternatively, a modulator or combination of modulators as disclosed herein can be administered at 3 mg/kg every 5 days for 25 days. Alternatively, a modulator or combination of modulators as disclosed herein may be administered in 1-10 doses of 1.0-5.0 mg/kg once every 3-5 days. Alternatively, a modulator or combination of modulators as disclosed herein may be administered in 3 doses of 1.0-3.0 mg/kg every 5 days, followed by 3 doses every 3 days. Alternatively, a modulator or combination of modulators as disclosed herein may be administered in 4 doses of 1.0-3.0 mg/kg every 5 days, followed by 3 doses every 3 days. Alternatively, a modulator or combination of modulators as disclosed herein may be administered in 1-10 doses of 6 mg/kg every 5 days. Alternatively, a modulator or combination of modulators as disclosed herein may be administered in 1-10 doses at 3 mg/kg every 5 days. Alternatively, a modulator or combination of modulators as disclosed herein may be administered in 2 doses at 1.5 mg/kg every 5 days; 3 doses at 3 mg/kg every 5 days; at 3 mg/
在一些實施例中,本文所揭示之組合物可與一或多種選自以下的治療劑或方法組合投與以治療疾病或病症:手術切除;酪胺酸激酶抑制劑(TKI),例如索拉非尼;溴域抑制劑,例如BET抑制劑,例如JQ1,例如BET672,例如必納昔布(birabresib);MEK抑制劑(例如曲美替尼(Trametinib))、正位肝臟移植、射頻消融、免疫療法、免疫檢查點加抗血管內皮細胞生長因子組合療法、光動力療法(PDT)、雷射療法、近接療法、放射療法、經導管動脈化學或放射栓塞、立體定向放射療法、化學療法及/或全身化學療法。下表21揭示例示性治療劑。In some embodiments, the compositions disclosed herein may be administered in combination with one or more therapeutic agents or methods selected from the group consisting of surgical resection; tyrosine kinase inhibitors (TKIs), such as Sola Fenib; bromodomain inhibitors, e.g. BET inhibitors, e.g. JQ1, e.g. BET672, e.g. birabresib; MEK inhibitors (e.g. Trametinib), orthotopic liver transplantation, radiofrequency ablation, Immunotherapy, immune checkpoint plus anti-vascular endothelial growth factor combination therapy, photodynamic therapy (PDT), laser therapy, brachytherapy, radiotherapy, transcatheter arterial chemo or radioembolization, stereotactic radiotherapy, chemotherapy and/or or systemic chemotherapy. Table 21 below discloses exemplary therapeutic agents.
表21:小分子化合物,例如與本文所述之表現抑制子用於組合療法
本發明之醫藥組合物可以治療有效量遞送。精確的治療有效量為就治療功效而言將在指定個體中產生最有效結果之組合物的量。此量將視多種因素而變,包括(但不限於)治療性化合物之特徵(包括活性、藥物動力學、藥效學及生物可用性)、個體之生理狀況(包括年齡、性別、疾病類型及階段、一般身體狀況、對指定劑量之反應及藥物類型)、調配物中之醫藥學上可接受之一或多種載劑的性質,及投藥途徑。The pharmaceutical compositions of the invention can be delivered in therapeutically effective amounts. A precise therapeutically effective amount is that amount of the composition which will produce the most effective results in a given individual with respect to therapeutic efficacy. This amount will vary depending on many factors including, but not limited to, the characteristics of the therapeutic compound (including activity, pharmacokinetics, pharmacodynamics, and bioavailability), the physiological condition of the individual (including age, sex, type and stage of disease) , general physical condition, response to the prescribed dose and type of drug), the nature of the pharmaceutically acceptable carrier or carriers in the formulation, and the route of administration.
在一些態樣中,本發明提供遞送治療劑的方法,包含向個體投與如本文所述的組合物,其中調節劑為治療劑且/或其中相對於治療劑不存在下的基因表現,治療劑的遞送引起基因表現發生變化。In some aspects, the invention provides methods of delivering a therapeutic agent comprising administering to an individual a composition as described herein, wherein the modulator is a therapeutic agent and/or wherein the therapeutic agent is expressed relative to gene expression in the absence of the therapeutic agent. The delivery of an agent causes changes in gene expression.
如本文之各種實施例中所提供之方法可用於本文所述的任一些態樣中。在一些實施例中,一或多種組合物靶向特定細胞或一或多種特定組織。Methods as provided in the various embodiments herein can be used in any of the aspects described herein. In some embodiments, one or more compositions target specific cells or one or more specific tissues.
舉例而言,在一些實施例中,一或多種組合物靶向肝、上皮、結締組織、肌肉、生殖及/或神經組織或細胞。在一些實施例中,組合物靶向特定器官系統的細胞或組織,例如心血管系統(心臟、血管);消化系統(食道、胃、肝臟、膽囊、胰臟、腸、結腸、直腸及肛門);內分泌系統(下丘腦、腦垂體腺體、松果體或松果體腺體、甲狀腺、副甲狀腺、腎上腺);排泄系統(腎臟、輸尿管、膀胱);淋巴系統(淋巴、淋巴結、淋巴管、扁桃體、腺樣體、胸腺、脾臟);表皮系統(皮膚、毛髮、指甲);肌肉系統(例如骨骼肌);神經系統(腦、脊髓、神經);生殖系統(卵巢、子宮、乳腺、睪丸、輸精管、儲精囊、前列腺);呼吸系統(咽、喉、氣管、支氣管、肺、隔膜);骨骼系統(骨骼、軟骨);及/或其組合。For example, in some embodiments, one or more compositions target liver, epithelial, connective tissue, muscle, reproductive and/or neural tissues or cells. In some embodiments, the composition targets cells or tissues of specific organ systems, such as the cardiovascular system (heart, blood vessels); digestive system (esophagus, stomach, liver, gallbladder, pancreas, intestine, colon, rectum, and anus) ; endocrine system (hypothalamus, pituitary gland, pineal or pineal gland, thyroid, parathyroid, adrenal gland); excretory system (kidneys, ureters, bladder); lymphatic system (lymph, lymph nodes, lymphatic vessels, tonsils, adenoids, thymus, spleen); epidermal system (skin, hair, nails); muscular system (e.g. skeletal muscle); nervous system (brain, spinal cord, nerves); reproductive system (ovaries, uterus, breast, testes, Vas deferens, seminal vesicles, prostate); respiratory system (pharynx, larynx, trachea, bronchi, lungs, diaphragm); skeletal system (bones, cartilage); and/or combinations thereof.
在一些實施例中,本發明之組合物穿越血腦障壁、胎盤膜或血液-睪丸障壁。In some embodiments, compositions of the invention cross the blood-brain barrier, placental membrane, or blood-testicular barrier.
在一些實施例中,如本文所提供的醫藥組合物係全身性投與。In some embodiments, pharmaceutical compositions as provided herein are administered systemically.
在一些實施例中,投藥為非腸胃外的且治療劑為非經腸治療劑。In some embodiments, the administration is non-parenteral and the therapeutic agent is a parenteral therapeutic agent.
本文所提供之方法及組合物可包含藉由足以緩解疾病、病症及/或病狀之症狀的療法投與的醫藥組合物。在一些態樣中,本發明提供藉由投與如本文所述之組合物來遞送治療劑的方法。The methods and compositions provided herein may include pharmaceutical compositions administered by therapy sufficient to alleviate the symptoms of a disease, disorder, and/or condition. In some aspects, the invention provides methods of delivering a therapeutic agent by administering a composition as described herein.
本發明之醫藥用途可包括如本文所述之組合物(例如調節劑,例如干擾劑)。Medical uses of the invention may include compositions as described herein (eg, modulators, eg, interfering agents).
在一些實施例中,相較於單獨的活性劑,本發明之醫藥組合物具有改善的PK/PD,例如增強的藥物動力學或藥效學,諸如改善的靶向、吸收或傳輸(例如改善至少5%、10%、15%、20%、30%、40%、50%、60%、75%、80%、90%或更多)。在一些實施例中,相較於單獨的治療劑,醫藥組合物的不良效應減少,諸如向非標靶位置的擴散、脫靶活性或毒性代謝減少(例如相較於單獨的活性劑,減少至少5%、10%、15%、20%、30%、40%、50%、60%、75%、80%、90%或更多)。在一些實施例中,相較於單獨的活性劑,組合物增加治療劑的功效且/或減少治療劑的毒性(例如至少5%、10%、15%、20%、30%、40%、50%、60%、75%、80%、90%或更多)。In some embodiments, the pharmaceutical compositions of the invention have improved PK/PD, e.g., enhanced pharmacokinetics or pharmacodynamics, such as improved targeting, absorption or delivery (e.g., improved At least 5%, 10%, 15%, 20%, 30%, 40%, 50%, 60%, 75%, 80%, 90% or more). In some embodiments, the pharmaceutical composition has reduced adverse effects, such as reduced diffusion to off-target sites, off-target activity, or toxic metabolism, as compared to the therapeutic agent alone (e.g., a reduction of at least 5 compared to the active agent alone). %, 10%, 15%, 20%, 30%, 40%, 50%, 60%, 75%, 80%, 90% or more). In some embodiments, the composition increases the efficacy of the therapeutic agent and/or reduces the toxicity of the therapeutic agent (e.g., at least 5%, 10%, 15%, 20%, 30%, 40%, 50%, 60%, 75%, 80%, 90% or more).
所調配的本文所述醫藥組合物可例如包括載劑,諸如醫藥載劑及/或聚合載劑,例如奈米粒子、脂質體或囊泡,且藉由已知方法遞送至有需要之個體(例如人類或非人類農畜或家畜,例如牛、犬、貓、馬、家禽)。此類方法包括轉染(例如脂質介導、陽離子聚合物、磷酸鈣);電穿孔或膜破壞(例如核轉染)及病毒遞送(例如慢病毒、逆轉錄病毒、腺病毒、AAV)之其他方法。遞送方法亦描述於例如Gori等人, Delivery and Specificity of CRISPR/Cas9 Genome Editing Technologies for Human Gene Therapy. Human Gene Therapy. 2015年7月, 26(7): 443-451. 數位物件識別碼:10.1089/hum.2015.074; 及Zuris等人, Cationic lipid-mediated delivery of proteins enables efficient protein-based genome editing in vitro and in vivo. Nat Biotechnol. 2014年10月30日;33(1):73-80。The formulated pharmaceutical compositions described herein may, for example, include a carrier, such as a pharmaceutical carrier and/or a polymeric carrier, such as nanoparticles, liposomes or vesicles, and be delivered to an individual in need thereof by known methods ( For example human or non-human farm or domestic animals such as cattle, dogs, cats, horses, poultry). Such methods include transfection (e.g., lipid-mediated, cationic polymers, calcium phosphate); electroporation or membrane disruption (e.g., nucleofection) and viral delivery (e.g., lentivirus, retrovirus, adenovirus, AAV) among others. method. Delivery methods are also described, e.g., in Gori et al., Delivery and Specificity of CRISPR/Cas9 Genome Editing Technologies for Human Gene Therapy. Human Gene Therapy. 2015 Jul, 26(7): 443-451. DOI: 10.1089/ hum.2015.074; and Zuris et al., Cationic lipid-mediated delivery of proteins enables efficient protein-based genome editing in vitro and in vivo. Nat Biotechnol. 2014
脂質奈米粒子 如本文所述的表現抑制子或表現抑制系統可使用包括粒子的任何生物學遞送系統/調配物(例如奈米粒子遞送系統)遞送。奈米粒子包括維度(例如直徑)在約1奈米與約1000奈米之間、約1奈米與約500奈米尺寸之間、約1與約100 nm之間、約30 nm與約200 nm之間、約50 nm與約300 nm之間、約75 nm與約200 nm之間、約100 nm與約200 nm之間及其間任何範圍的粒子。奈米粒子具有奈米級維度之複合結構。在一些實施例中,奈米粒子典型地呈球形,但視奈米粒子組成而定,可呈不同形態。奈米粒子之與奈米粒子外部環境接觸的部分通常鑑別為奈米粒子之表面。在一些實施例中,奈米粒子具有25 nm與200 nm之間的最大維度範圍。如本文所述的奈米粒子包含可以任何形式提供的遞送系統,包括(但不限於)固體、半固體、乳液或膠態奈米粒子。奈米粒子遞送系統可包括(但不限於)基於脂質之系統、脂質體、微胞、微囊泡、胞外體或基因槍。在一個實施例中,奈米粒子為脂質奈米粒子(LNP)。在一些實施例中,LNP為包含複數個脂質分子之粒子,該等脂質分子彼此藉由分子間力以物理方式締合。 lipid nanoparticles Expression inhibitors or expression inhibitory systems as described herein can be delivered using any biological delivery system/formulation including particles (eg, nanoparticle delivery systems). Nanoparticles include dimensions (e.g., diameters) between about 1 nm and about 1000 nm, between about 1 nm and about 500 nm in size, between about 1 and about 100 nm, between about 30 nm and about 200 nm Particles between about 50 nm and about 300 nm, between about 75 nm and about 200 nm, between about 100 nm and about 200 nm, and any range therebetween. Nanoparticles have a composite structure with nanoscale dimensions. In some embodiments, nanoparticles are typically spherical in shape, but may have different morphologies depending on the composition of the nanoparticles. The portion of the nanoparticle that is in contact with the external environment of the nanoparticle is generally identified as the surface of the nanoparticle. In some embodiments, nanoparticles have a maximum dimensional range between 25 nm and 200 nm. Nanoparticles as described herein comprise delivery systems that may be provided in any form, including but not limited to solid, semi-solid, emulsion or colloidal nanoparticles. Nanoparticle delivery systems may include, but are not limited to, lipid-based systems, liposomes, micelles, microvesicles, exosomes, or gene guns. In one embodiment, the nanoparticles are lipid nanoparticles (LNPs). In some embodiments, the LNP is a particle comprising a plurality of lipid molecules physically associated with each other by intermolecular forces.
在一些實施例中,LNP可包含多種組分,例如3至4種組分。在一個實施例中,表現抑制子或包含該表現抑制子(或編碼其的核酸,或包含該表現抑制子核酸的醫藥組合物)的醫藥組合物囊封於LNP中。在一個實施例中,表現抑制系統或包含該表現抑制系統(或編碼其之核酸,或包含該表現抑制系統核酸之醫藥組合物)的醫藥組合物囊封於LNP中。在一些實施例中,編碼第一表現抑制子的核酸與編碼第二表現抑制子的核酸存在於同一LNP中。在一些實施例中,編碼第一表現抑制子的核酸與編碼第二表現抑制子的核酸存在於不同LNP中。可根據Rosin等人, Molecular Therapy, 第19卷, 第12期, 第1286-2200頁, 2011年12月)來使用及/或調適LNP製備及調節劑囊封。在一些實施例中,本文所揭示之脂質奈米粒子組合物適用於mRNA所編碼之蛋白質的表現。在一些實施例中,核酸當存在於脂質奈米粒子中時,在水溶液中抵抗核酸酶的降解作用。In some embodiments, the LNP may comprise multiple components, eg, 3 to 4 components. In one embodiment, the expression inhibitor or a pharmaceutical composition comprising the expression inhibitor (or nucleic acid encoding the same, or a pharmaceutical composition comprising the expression inhibitor nucleic acid) is encapsulated in LNP. In one embodiment, the expression inhibition system or the pharmaceutical composition comprising the expression inhibition system (or the nucleic acid encoding it, or the pharmaceutical composition comprising the expression inhibition system nucleic acid) is encapsulated in LNP. In some embodiments, the nucleic acid encoding the first inhibitor of expression is present in the same LNP as the nucleic acid encoding the second inhibitor of expression. In some embodiments, the nucleic acid encoding the first inhibitor of expression is present in a different LNP than the nucleic acid encoding the second inhibitor of expression. LNP preparation and modulator encapsulation can be used and/or adapted according to Rosin et al., Molecular Therapy, Vol. 19, No. 12, pp. 1286-2200, Dec. 2011). In some embodiments, the lipid nanoparticle compositions disclosed herein are suitable for the expression of proteins encoded by mRNA. In some embodiments, the nucleic acid, when present in the lipid nanoparticles, is resistant to degradation by nucleases in aqueous solution.
在一些實施例中,LNP調配物可包括CCD脂質、中性脂質及/或輔助脂質。在一些實施例中,LNP調配物包含可離子化脂質。在一些實施例中,可離子化脂質可為陽離子脂質、可離子化陽離子脂質,或可容易質子化的含胺脂質。在一些實施例中,脂質為陽離子脂質,視pH而定,其可以帶正電或中性形式存在。在一些實施例中,陽離子脂質為能夠在例如生理條件下帶正電之脂質。在一些實施例中,脂質粒子包含陽離子脂質與以下中之一或多者的調配物:中性脂質、可離子化含胺脂質、可生物降解的炔烴脂質、類固醇、包括多元不飽和脂質之磷脂、結構性脂質(例如固醇)、PEG、膽固醇及聚合物偶聯的脂質。In some embodiments, LNP formulations can include CCD lipids, neutral lipids, and/or helper lipids. In some embodiments, the LNP formulation comprises ionizable lipids. In some embodiments, the ionizable lipid can be a cationic lipid, an ionizable cationic lipid, or an amine-containing lipid that is readily protonatable. In some embodiments, the lipids are cationic lipids, which can exist in positively charged or neutral form, depending on pH. In some embodiments, cationic lipids are lipids capable of being positively charged, eg, under physiological conditions. In some embodiments, the lipid particle comprises a cationic lipid in a formulation with one or more of: neutral lipids, ionizable amine-containing lipids, biodegradable alkyne lipids, steroids, including polyunsaturated lipids Phospholipids, structured lipids (such as sterols), PEG, cholesterol, and polymer-conjugated lipids.
在一些實施例中,LNP調配物(例如MC3及/或SSOP)包括膽固醇、PEG及/或輔助脂質。LNP可為例如在一些實施例中大體上呈球形的微球體(包括單層及多層囊泡、層狀相脂質雙層)。In some embodiments, LNP formulations (eg, MC3 and/or SSOP) include cholesterol, PEG, and/or helper lipids. LNPs can be, for example, substantially spherical microspheres (including unilamellar and multilamellar vesicles, lamellar phase lipid bilayers) in some embodiments.
在一些實施例中,LNP可包含水性核心,例如包含編碼如本文所揭示之表現抑制子或系統的核酸。在本發明之一些實施例中,LNP調配物之負載包括至少一種嚮導RNA。在一些實施例中,負載(例如編碼如本文所揭示之表現抑制子或系統的核酸)可吸附至LNP (例如包含陽離子脂質的LNP)的表面。在一些實施例中,負載(例如編碼如本文所揭示之表現抑制子或系統的核酸)可與LNP締合。在一些實施例中,負載(例如編碼如本文所揭示之表現抑制子或系統的核酸)可囊封(例如完全地囊封及/或部分地囊封)於LNP中。In some embodiments, a LNP may comprise an aqueous core, eg, comprising a nucleic acid encoding an expression inhibitor or system as disclosed herein. In some embodiments of the invention, the payload of the LNP formulation includes at least one guide RNA. In some embodiments, a payload (eg, a nucleic acid encoding an expression inhibitor or system as disclosed herein) can be adsorbed to the surface of a LNP (eg, a LNP comprising a cationic lipid). In some embodiments, a payload (eg, a nucleic acid encoding an expression inhibitor or system as disclosed herein) can be associated with a LNP. In some embodiments, payloads (eg, nucleic acids encoding expression inhibitors or systems as disclosed herein) can be encapsulated (eg, fully and/or partially encapsulated) in LNPs.
在一些實施例中,包含負載的LNP可針對全身遞送來投與,例如遞送治療有效劑量的負載,從而可使得活性劑在生物體內廣泛暴露。脂質奈米粒子的全身遞送可藉由此項技術中已知的任何方式達成,包括例如靜脈內、動脈內、皮下及腹膜內遞送。在一些實施例中,脂質奈米粒子的全身遞送係藉由靜脈內遞送達成。在一些實施例中,包含負載的LNP可針對局部遞送來投與,例如將活性劑直接遞送至生物體內的靶點。在一些實施例中,LNP可局部遞送至疾病位點(例如腫瘤)、其他靶點(例如發炎位點),或遞送至標靶器官,例如肝臟、肺、胃、大腸、胰臟、子宮、乳房、淋巴結及其類似物。在一些實施例中,如本文所揭示之LNP可局部遞送至特定細胞,例如肝細胞、星形細胞、庫弗細胞(Kupffer cells)、內皮細胞、肺泡及/或上皮細胞。在一些實施例中,如本文所揭示之LNP可局部遞送至特定腫瘤位點,例如皮下、正位。In some embodiments, LNPs comprising a payload can be administered for systemic delivery, eg, delivery of a therapeutically effective dose of the payload, such that broad exposure of the active agent in the organism can be achieved. Systemic delivery of lipid nanoparticles can be achieved by any means known in the art, including, for example, intravenous, intraarterial, subcutaneous and intraperitoneal delivery. In some embodiments, systemic delivery of lipid nanoparticles is achieved by intravenous delivery. In some embodiments, LNPs comprising a cargo can be administered for local delivery, eg, delivery of an active agent directly to a target in an organism. In some embodiments, LNP can be delivered locally to a site of disease (e.g., a tumor), other target (e.g., a site of inflammation), or to a target organ, such as liver, lung, stomach, large intestine, pancreas, uterus, Breast, lymph nodes and the like. In some embodiments, LNPs as disclosed herein can be delivered locally to specific cells, such as hepatocytes, astrocytes, Kupffer cells, endothelial cells, alveoli, and/or epithelial cells. In some embodiments, LNPs as disclosed herein can be delivered locally to a specific tumor site, eg, subcutaneously, orthotopically.
LNP可作為分散相調配於乳液、微胞中,或作為內相調配於懸浮液中。在一些實施例中,LNP為生物可降解的。在一些實施例中,LNP在治療有效劑量下、在活體內不能累積至細胞毒性水準或產生毒性。在一些實施例中,LNP在治療有效劑量下重複投與之後,在活體內不能累積至細胞毒性水準或產生毒性。在一些實施例中,LNP在治療有效劑量下不產生導致實質性副作用的先天性免疫反應。LNP can be formulated as a dispersed phase in emulsions, micelles, or as an internal phase in suspensions. In some embodiments, LNPs are biodegradable. In some embodiments, the LNP does not accumulate to cytotoxic levels or become toxic in vivo at therapeutically effective doses. In some embodiments, the LNP does not accumulate to cytotoxic levels or become toxic in vivo following repeated administration at a therapeutically effective dose. In some embodiments, the LNP does not produce an innate immune response that results in substantial side effects at therapeutically effective doses.
在一些實施例中,所用LNP包含式4-(二甲胺基)丁酸(6Z,9Z,28Z,31Z)-三十七碳-6,9,28,31-四烯-19-基酯或ssPalmO-苯基-P4C2 (ssPalmO-Phe,SS-OP)。在一些實施例中,LNP調配物包含4-(二甲胺基)丁酸式(6Z,9Z,28Z,31Z)-三十七碳-6,9,28,31-四烯-19-基酯(MC3)、1,2-二油醯基-sn-甘油-3-磷酸膽鹼(DOPC)、膽固醇、1,2-二肉豆蔻醯基-rac-甘油-3-甲氧基聚乙二醇-2000 (PEG2k-DMG),例如MC3 LNP或ssPalmO-苯基-P4C2 (ssPalmO-Phe,SS-OP), 1,2-二油醯基-sn-甘油-3-磷酸膽鹼(DOPC)、膽固醇、1,2-二肉豆蔻醯基-rac-甘油-3-甲氧基聚乙二醇-2000 (PEG2k-DMG),例如SSOP-LNP。In some embodiments, the LNP used comprises the formula 4-(dimethylamino)butyric acid (6Z,9Z,28Z,31Z)-heptakadec-6,9,28,31-tetraen-19-yl ester or ssPalmO-Phenyl-P4C2 (ssPalmO-Phe, SS-OP). In some embodiments, the LNP formulation comprises 4-(dimethylamino)butanoic acid (6Z,9Z,28Z,31Z)-heptakadec-6,9,28,31-tetraen-19-yl Esters (MC3), 1,2-dioleyl-sn-glycero-3-phosphocholine (DOPC), cholesterol, 1,2-dimyristyl-rac-glycero-3-methoxypolyethylene Diol-2000 (PEG2k-DMG), such as MC3 LNP or ssPalmO-phenyl-P4C2 (ssPalmO-Phe, SS-OP), 1,2-dioleyl-sn-glycero-3-phosphocholine (DOPC ), cholesterol, 1,2-dimyristyl-rac-glycerol-3-methoxypolyethylene glycol-2000 (PEG2k-DMG), eg SSOP-LNP.
脂質體為球狀囊泡結構,其由包圍內部水性隔室之單層或多層脂質雙層及相對不可滲透之外部親脂性磷脂雙層構成。脂質體可為陰離子、中性或陽離子型。脂質體具有生物相容性,無毒性,可遞送親水性與親脂性藥物分子,防止其負載被血漿酶降解,且傳輸其負載穿越生物膜及血腦障壁(BBB)(欲回顧,參見例如Spuch及Navarro, Journal of Drug Delivery, 第2011卷, Article ID 469679, 第12頁, 2011. 數位物件識別碼:10.1155/2011/469679)。Liposomes are spherical vesicular structures composed of a single or multilamellar lipid bilayer surrounding an inner aqueous compartment and a relatively impermeable outer lipophilic phospholipid bilayer. Liposomes can be anionic, neutral or cationic. Liposomes are biocompatible, non-toxic, deliver hydrophilic and lipophilic drug molecules, protect their load from degradation by plasma enzymes, and transport their load across biomembranes and the blood-brain barrier (BBB) (for review, see e.g. Spuch and Navarro, Journal of Drug Delivery, Volume 2011, Article ID 469679, p. 12, 2011. Digital Object ID: 10.1155/2011/469679).
囊泡可由若干種不同類型的脂質製成;然而,磷脂最常用於產生脂質體作為藥物載劑。囊泡可包含但不限於DOTMA、DOTAP、DOTIM、DDAB,其單獨或連同膽固醇一起產生DOTMA及膽固醇、DOTAP及膽固醇、DOTIM及膽固醇以及DDAB及膽固醇。用於製備多層囊泡脂質之方法係此項技術中已知的(參見例如美國專利第6,693,086號,其關於多層囊泡脂質製備之教示內容以引用的方式併入本文中)。雖然當脂質膜與水溶液混合時可自發形成囊泡,但其亦可藉由使用均質器、音波處理器或擠出設備施加振盪形式的力來加快(欲回顧,參見例如Spuch及Navarro, Journal of Drug Delivery, 第2011卷, 論文 ID 469679, 第12頁, 2011. 數位物件識別碼:10.1155/2011/469679)。擠出式脂質可經由減小尺寸之過濾器擠出來製備,如Templeton等人, Nature Biotech, 15:647-652, 1997中所述,關於擠出式脂質製備的教示內容以引用的方式併入本文中。Vesicles can be made from several different types of lipids; however, phospholipids are most commonly used to generate liposomes as drug carriers. Vesicles may include, but are not limited to, DOTMA, DOTAP, DOTIM, DDAB, which alone or together with cholesterol yield DOTMA and cholesterol, DOTAP and cholesterol, DOTIM and cholesterol, and DDAB and cholesterol. Methods for preparing multilamellar vesicle lipids are known in the art (see, eg, US Patent No. 6,693,086, which is incorporated herein by reference for its teachings regarding the preparation of multilamellar vesicle lipids). Although vesicle formation can occur spontaneously when a lipid film is mixed with an aqueous solution, it can also be accelerated by applying an oscillating form of force using a homogenizer, sonicator, or extrusion device (for a review, see, e.g., Spuch and Navarro, Journal of Drug Delivery, Volume 2011, Paper ID 469679,
本文所提供之方法及組合物可包含藉由足以緩解疾病、病症及/或病狀之症狀的療法投與的醫藥組合物。在一些態樣中,本發明提供藉由投與如本文所述之組合物來遞送治療劑的方法。The methods and compositions provided herein may include pharmaceutical compositions administered by therapy sufficient to alleviate the symptoms of a disease, disorder, and/or condition. In some aspects, the invention provides methods of delivering a therapeutic agent by administering a composition as described herein.
用途 本發明另外關於本文所揭示之表現抑制子或表現抑制系統的用途。此外,在一些實施例中,所提供的此類技術可用於達成例如細胞中之靶基因(例如MYC)表現的調節,例如抑制,例如能夠控制靶基因,例如控制MYC活性、遞送及基因表現頻率。在一些實施例中,細胞為哺乳動物(例如人類)細胞。在一些實施例中,細胞為體細胞。在一些具體實例中,細胞為原代細胞。舉例而言,在一些實施例中,細胞為哺乳動物體細胞。在一些實施例中,哺乳動物體細胞為原代細胞。在一些實施例中,哺乳動物體細胞為非胚胎細胞。Uses The present invention additionally relates to the use of the expression inhibitors or expression inhibition systems disclosed herein. Furthermore, in some embodiments, such technology provided can be used to achieve, for example, modulation, e.g., suppression, of the expression of a target gene (e.g., MYC) in a cell, e.g., enabling control of a target gene, e.g., control of MYC activity, delivery, and frequency of gene expression . In some embodiments, the cells are mammalian (eg, human) cells. In some embodiments, the cells are somatic cells. In some embodiments, the cells are primary cells. For example, in some embodiments, the cells are mammalian somatic cells. In some embodiments, the mammalian somatic cells are primary cells. In some embodiments, the mammalian somatic cells are non-embryonic cells.
調節基因表現regulate gene expression
本發明另外部分地關於一種調節(例如降低)靶基因(例如MYC)表現的方法,該方法包含:提供本文所述之表現抑制子(或編碼其的核酸,或包含該表現抑制子核酸的醫藥組合物)或表現抑制系統(或編碼其的核酸,或包含該表現抑制系統或核酸的醫藥組合物),以及使靶基因(例如MYC)及/或可操作地連接的轉錄控制元件與表現抑制子或表現抑制系統接觸。在一些實施例中,調節(例如降低)靶基因(例如MYC)表現包含:相較於參考值(例如靶基因(例如MYC)在表現抑制子或表現抑制系統缺乏情況下的轉錄),調節靶基因(例如MYC)的轉錄。在一些實施例中,離體使用調節(例如降低)靶基因(例如MYC)表現的方法,例如針對來自個體(例如哺乳動物個體,例如人類個體)之細胞使用。在一些實施例中,活體內使用調節(例如降低)靶基因(例如MYC)表現的方法,例如針對哺乳動物個體(例如人類個體)使用。在一些實施例中,活體外使用調節(例如降低)靶基因例如MYC)表現的方法,例如針對本文所述之細胞或細胞株使用。The present invention also relates in part to a method of modulating (e.g., reducing) the expression of a target gene (e.g., MYC), the method comprising: providing an expression suppressor (or a nucleic acid encoding it, or a medicament comprising the expression suppressor nucleic acid) described herein composition) or expression inhibition system (or nucleic acid encoding it, or a pharmaceutical composition comprising the expression inhibition system or nucleic acid), and the target gene (such as MYC) and/or operably linked transcriptional control elements and expression inhibition Sub or performance inhibition system contacts. In some embodiments, modulating (eg, reducing) expression of a target gene (eg, MYC) comprises: modulating target gene (eg, MYC) transcription compared to a reference value (eg, transcription of a target gene (eg, MYC) in the absence of an expression repressor or expression repression system). Transcription of genes such as MYC. In some embodiments, methods of modulating (eg, reducing) the expression of a target gene (eg, MYC) are used ex vivo, eg, on cells from an individual (eg, a mammalian individual, eg, a human individual). In some embodiments, methods of modulating (eg, reducing) the expression of a target gene (eg, MYC) are used in vivo, eg, in a mammalian subject (eg, a human subject). In some embodiments, methods of modulating (eg, reducing) the expression of a target gene, eg, MYC, are used in vitro, eg, on a cell or cell line as described herein.
本發明另外部分地關於一種治療個體之與靶基因(例如MYC)之異常調控(例如過度表現)相關之病狀的方法,包含向該個體投與本文所述之表現抑制子(或編碼其的核酸,或包含該表現抑制子核酸的醫藥組合物)或表現抑制系統(或編碼其的核酸,或包含該表現抑制系統或核酸的醫藥組合物)。與特定基因之過度表現相關的病狀已為熟習此項技術者所知。此類病狀包括(但不限於)代謝障礙、癌症(例如實體腫瘤)及肝炎。The invention also relates in part to a method of treating a condition in an individual associated with aberrant regulation (e.g., overexpression) of a target gene (e.g., MYC), comprising administering to the individual an expression suppressor as described herein (or a gene encoding the same) nucleic acid, or a pharmaceutical composition comprising the expression inhibitor nucleic acid) or an expression inhibition system (or a nucleic acid encoding it, or a pharmaceutical composition comprising the expression inhibition system or nucleic acid). Pathologies associated with overexpression of particular genes are known to those skilled in the art. Such conditions include, but are not limited to, metabolic disorders, cancer (eg, solid tumors), and hepatitis.
如本文所提供的方法及組合物可藉由穩定地或暫時地改變(例如減少)靶基因(例如MYC)轉錄來治療與靶基因(例如MYC)之過度表現或異常調控相關的病狀。在一些實施例中,此類調節保持至少約1小時至約30天、或至少約2小時、6小時、12小時、18小時、24小時、2天、3天、4天、5天、6天、7天、8天、9天、10天、11天、12天、13天、14天、15天、16天、17天、18天、19天、20天、21天、22天、23天、24天、25天、26天、27天、28天、29天、30天或更長時間或其間的任何時間。在一些實施例中,此類調節保持至少1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23或24小時,或至少1、2、3、4、5、6或7天,或至少1、2、3、4或5週,或至少1、2、3、4、5、6、7、8、9、10、11或12個月,或至少1、2、3、4或5年(例如永久地或無限期地)。視情況,此類調節保持不超過10、9、8、7、6、5、4、3、2或1年。The methods and compositions as provided herein can treat conditions associated with overexpression or dysregulation of a target gene (eg, MYC) by stably or temporarily altering (eg, reducing) transcription of the target gene (eg, MYC). In some embodiments, such modulation is maintained for at least about 1 hour to about 30 days, or at least about 2 hours, 6 hours, 12 hours, 18 hours, 24 hours, 2 days, 3 days, 4 days, 5 days, 6 days. days, 7 days, 8 days, 9 days, 10 days, 11 days, 12 days, 13 days, 14 days, 15 days, 16 days, 17 days, 18 days, 19 days, 20 days, 21 days, 22 days, 23 days, 24 days, 25 days, 26 days, 27 days, 28 days, 29 days, 30 days or more or any time in between. In some embodiments, such adjustments maintain at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 , 21, 22, 23, or 24 hours, or at least 1, 2, 3, 4, 5, 6, or 7 days, or at least 1, 2, 3, 4, or 5 weeks, or at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or 12 months, or at least 1, 2, 3, 4 or 5 years (e.g. permanently or indefinitely). Such adjustments are maintained for no more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 year, as appropriate.
在一些實施例中,相對於組合物未接觸或未經方法處理之細胞中的靶基因表現,本文所提供之方法或組合物可使靶基因(例如MYC)在細胞中之表現減少至少10、20、30、40、50、60、70、80、90或100% (且視情況,高達100%)。In some embodiments, the methods or compositions provided herein reduce the expression of a target gene (e.g., MYC) in cells by at least 10, 20, 30, 40, 50, 60, 70, 80, 90 or 100% (and up to 100% as appropriate).
在一些實施例中,本文提供的方法可藉由破壞與該靶基因相關的基因體複合物(例如錨定序列介導性接合體)來調節(例如減少)靶基因(例如MYC)表現。與錨定序列介導性接合體相關的基因可至少部分地處於接合體內(亦即,逐序列位於第一與第二錨定序列之間),或其可處於接合體外部,使得其並非逐序列位於第一與第二錨定序列之間,而是位於同一染色體上且至少與第一或第二錨定序列充分近接,以便可藉由控制錨定序列介導性接合體之拓樸結構來調節其表現。一般熟習此項技術者應瞭解,在一些實施例中,兩個元件之間(例如基因與錨定序列介導性接合體之間)之三維空間距離的相關性可大於就鹼基對而言的距離。在一些實施例中,處於外部但相關的基因位於第一或第二錨定序列的2 Mb內、1.9 Mb內、1.8 Mb內、1.7 Mb內、1.6 Mb內、1.5 Mb內、1.4 Mb內、1.3 Mb、1.3 Mb內、1.2 Mb內、1.1 Mb內、1 Mb內、900 kb內、800 kb內、700 kb內、500 kb內、400 kb內、300 kb內、200 kb內、100 kb內、50 kb內、20 kb內、10 kb內或5 kb內。In some embodiments, the methods provided herein modulate (eg, reduce) expression of a target gene (eg, MYC) by disrupting a gene body complex (eg, anchor sequence-mediated adapter) associated with the target gene. The gene associated with the anchor sequence-mediated adapter may be at least partially within the adapter (i.e., positioned sequentially between the first and second anchor sequences), or it may be outside the adapter such that it is not sequentially located within the adapter. The sequence is located between the first and second anchor sequences, but is located on the same chromosome and at least sufficiently close to the first or second anchor sequences so that the topology of the adapter can be mediated by controlling the anchor sequences to adjust its performance. Those of ordinary skill in the art will appreciate that in some embodiments, the three-dimensional distance between two elements (e.g., between a gene and an anchor sequence-mediated adapter) can be more correlated than in base pairs distance. In some embodiments, the external but related gene is located within 2 Mb, within 1.9 Mb, within 1.8 Mb, within 1.7 Mb, within 1.6 Mb, within 1.5 Mb, within 1.4 Mb, of the first or second anchor sequence 1.3 Mb, 1.3 Mb, 1.2 Mb, 1.1 Mb, 1 Mb, 900 kb, 800 kb, 700 kb, 500 kb, 400 kb, 300 kb, 200 kb, 100 kb , within 50 kb, within 20 kb, within 10 kb, or within 5 kb.
在一些實施例中,調節基因(例如MYC)表現包含改變轉錄控制序列對基因(例如MYC)的可及性。轉錄控制序列(不論處於錨定序列介導性接合體內部或外部)可為增強序列或靜默(或抑制)序列。In some embodiments, modulating expression of a gene (eg, MYC) comprises altering the accessibility of a transcriptional control sequence to the gene (eg, MYC). Transcriptional control sequences (whether internal or external to an anchor sequence-mediated adapter) can be enhancing sequences or silencing (or repressing) sequences.
在一些實施例中,所提供的此類技術可用於治療有需要之個體(例如患者)的基因異常調控病症,例如MYC基因異常調控病症,例如與MYC基因異常調控相關的症狀。在一些實施例中,所提供的此類技術可用於治療有需要之個體(例如患者)的MYC基因異常調控病症或與MYC基因異常調控病症相關的症狀。在一些實施例中,該病症與MYC異常調控(例如MYC過度表現)相關。在一些實施例中,該病症與AFP異常調控(例如AFP過度表現)相關。在一些實施例中,所提供的此類技術可用於使靶基因(例如MYC)之啟動子發生甲基化,以治療有需要之個體(例如患者)的基因異常調控病症,例如MYC基因異常調控病症,例如與MYC基因異常調控相關的症狀。在一些實施例中,所提供的此類技術可選擇性地影響細胞存活率,該細胞異常地表現由靶基因(例如MYC)編碼的多肽。In some embodiments, such provided techniques can be used to treat a gene dysregulation disorder, such as a MYC gene dysregulation disorder, such as a symptom associated with a MYC gene dysregulation, in an individual (eg, a patient) in need thereof. In some embodiments, the provided technology can be used to treat a disorder of MYC gene dysregulation or a symptom associated with a disorder of MYC gene dysregulation in an individual (eg, a patient) in need thereof. In some embodiments, the disorder is associated with dysregulation of MYC (eg, MYC overexpression). In some embodiments, the disorder is associated with dysregulation of AFP (eg, AFP overexpression). In some embodiments, the provided technology can be used to methylate the promoter of a target gene (such as MYC) to treat a disorder of gene dysregulation in an individual (such as a patient) in need, such as aberrant regulation of the MYC gene Conditions, such as symptoms associated with abnormal regulation of the MYC gene. In some embodiments, such techniques are provided to selectively affect the survival of cells that aberrantly express a polypeptide encoded by a target gene (eg, MYC).
在一些實施例中,所提供的此類技術可用於治療有需要之個體(例如患者)的肝病或與肝病相關的病症,例如症狀。在一些實施例中,所提供的此類技術可用於治療有需要之個體(例如患者)的肺病或與肝病相關的病症,例如症狀。在一些實施例中,所提供的此類技術可用於治療有需要之個體(例如患者)的贅瘤病症,例如與贅瘤病症相關的病症或症狀。在一些實施例中,所提供的此類技術可用於治療有需要之個體(例如患者)的病毒感染相關病症,例如與病毒感染相關病症相關的病症或症狀。在一些實施例中,所提供的此類技術可用於治療有需要之個體(例如患者)的酒精誤用相關病症,例如與酒精誤用相關病症有關的病症或症狀。在一些實施例中,所提供的此類技術可用於治療有需要之個體(例如患者)之與病毒感染或酒精誤用相關的贅瘤病症,例如與病毒感染或酒精誤用相關之贅瘤病症有關的病症或症狀。In some embodiments, such technology provided can be used to treat liver disease or a condition, eg, a symptom, associated with liver disease in an individual (eg, a patient) in need thereof. In some embodiments, provided such techniques are useful for treating conditions, eg, symptoms, of lung or liver disease in individuals (eg, patients) in need thereof. In some embodiments, such provided techniques are useful for treating a neoplastic disorder, such as a condition or symptom associated with a neoplastic disorder, in an individual (eg, a patient) in need thereof. In some embodiments, such provided techniques can be used to treat a viral infection-associated condition, eg, a condition or symptom associated with a viral infection-associated condition, in a subject (eg, a patient) in need thereof. In some embodiments, such provided techniques can be used to treat an alcohol misuse-related disorder, such as a disorder or symptom associated with an alcohol misuse-related disorder, in a subject (eg, a patient) in need thereof. In some embodiments, such technology provided can be used to treat a neoplastic disorder associated with a viral infection or alcohol misuse, such as a neoplastic disorder associated with a viral infection or alcohol misuse, in an individual (e.g., a patient) in need thereof. Condition or symptom.
在一些實施例中,所治療之病狀為贅瘤。在一些實施例中,所治療之病狀為腫瘤發生。在一些實施例中,所治療之病狀為癌症。在一些實施例中,癌症與不良預後相關。在一些實施例中,癌症與MYC異常調控(例如MYC過度表現)相關。在一些實施例中,癌症與AFP異常調控(例如AFP過度表現)相關。在一些實施例中,癌症為乳癌、肝癌、大腸直腸癌、肺癌、胰臟癌、胃癌及/或子宮癌。在一些實施例中,癌症與感染(例如病毒,例如細菌)相關。在一些實施例中,癌症與酒精濫用相關。在一些實施例中,癌症為肝癌。In some embodiments, the condition treated is a neoplasia. In some embodiments, the condition treated is tumorigenesis. In some embodiments, the condition treated is cancer. In some embodiments, cancer is associated with poor prognosis. In some embodiments, the cancer is associated with MYC dysregulation (eg, MYC overexpression). In some embodiments, the cancer is associated with dysregulation of AFP (eg, AFP overexpression). In some embodiments, the cancer is breast cancer, liver cancer, colorectal cancer, lung cancer, pancreatic cancer, gastric cancer and/or uterine cancer. In some embodiments, the cancer is associated with an infection (eg, a virus, such as a bacterium). In some embodiments, the cancer is associated with alcohol abuse. In some embodiments, the cancer is liver cancer.
在一些實施例中,癌細胞為肺癌細胞、胃癌細胞、胃腸癌細胞、大腸直腸癌細胞、胰臟癌細胞或肝癌細胞。在一些實施例中,癌症為肝細胞癌(HCC)、纖維板層肝細胞癌(FHCC)、膽管癌、血管肉瘤、繼發性肝癌、非小細胞肺癌(NSCLC)、腺癌、小細胞肺癌(SCLC)、大細胞(未分化)癌瘤、三陰性乳癌、胃腺癌、子宮內膜癌或胰臟癌。In some embodiments, the cancer cells are lung cancer cells, gastric cancer cells, gastrointestinal cancer cells, colorectal cancer cells, pancreatic cancer cells, or liver cancer cells. In some embodiments, the cancer is hepatocellular carcinoma (HCC), fibrolamellar hepatocellular carcinoma (FHCC), cholangiocarcinoma, angiosarcoma, secondary liver cancer, non-small cell lung cancer (NSCLC), adenocarcinoma, small cell lung cancer ( SCLC), large cell (undifferentiated) carcinoma, triple-negative breast cancer, gastric adenocarcinoma, endometrial cancer, or pancreatic cancer.
在一些實施例中,所治療之病狀為肝病。在一些實施例中,所治療之病狀與MYC異常調控(例如MYC過度表現)相關。在一些實施例中,所治療之病狀為慢性疾病。在一些實施例中,所治療之病狀為慢性肝病。在一些實施例中,所治療之病狀為病毒感染。在一些實施例中,所治療之病狀為酒精誤用相關病症。In some embodiments, the condition treated is liver disease. In some embodiments, the condition being treated is associated with dysregulation of MYC (eg, MYC overexpression). In some embodiments, the condition being treated is a chronic disease. In some embodiments, the condition treated is chronic liver disease. In some embodiments, the condition treated is a viral infection. In some embodiments, the condition treated is an alcohol misuse related disorder.
在一些實施例中,所治療之病狀為肺病。在一些實施例中,所治療之病狀與MYC異常調控(例如MYC過度表現)相關。在一些實施例中,所治療之病狀為慢性疾病。在一些實施例中,所治療之病狀為慢性肺病。在一些實施例中,所提供的此類技術可用於治療或減少肺癌生長、轉移、抗藥性及/或癌症幹細胞(CSC)維持。在一些實施例中,所治療之病狀為癌瘤,例如非小細胞肺癌(NSCLC)。在一些實施例中,慢性肺病與菸草誤用相關。In some embodiments, the condition treated is pulmonary disease. In some embodiments, the condition being treated is associated with dysregulation of MYC (eg, MYC overexpression). In some embodiments, the condition being treated is a chronic disease. In some embodiments, the condition being treated is chronic lung disease. In some embodiments, such techniques provided are useful for treating or reducing lung cancer growth, metastasis, drug resistance, and/or cancer stem cell (CSC) maintenance. In some embodiments, the condition treated is cancer, such as non-small cell lung cancer (NSCLC). In some embodiments, chronic lung disease is associated with tobacco misuse.
在一些實施例中,癌症肝癌亞型S1 (HCC S1)、肝癌亞型S2 (HCC S2)或肝癌亞型S3 (HCC S2)。在一些實施例中,HCC亞型與MYC過度表現相關。在一些實施例中,癌症為HCC S1或HCC S2。在一些實施例中,癌症亞型與侵襲性腫瘤及不良的臨床結果相關。In some embodiments, the cancer is hepatocellular carcinoma subtype S1 (HCC S1), hepatic carcinoma subtype S2 (HCC S2), or hepatic carcinoma subtype S3 (HCC S2). In some embodiments, the HCC subtype is associated with MYC overexpression. In some embodiments, the cancer is HCC S1 or HCC S2. In some embodiments, cancer subtypes are associated with aggressive tumors and poor clinical outcome.
在一些實施例中,本發明提供可針對個體、基於個體中之HCC亞型設計的治療方案,例如基於個體中之HCC亞型定製治療攻擊性的個人化方案。在一些實施例中,本發明提供一種使用本文所揭示之表現抑制子或表現抑制系統治療的方法,該方法包含鑑別患者中的HCC亞型及基於HCC亞型鑑別來確定該等表現抑制子及/或表現抑制系統的劑量及投藥時程。In some embodiments, the present invention provides a treatment regimen that can be designed for an individual based on the subtype of HCC in the individual, for example, a personalized regimen that tailors the aggressiveness of the treatment based on the subtype of HCC in the individual. In some embodiments, the present invention provides a method of treatment using an expression suppressor or expression suppressor system disclosed herein, the method comprising identifying HCC subtypes in a patient and determining the expression suppressors based on the HCC subtype identification and / or the dosage and administration schedule of the expression inhibitory system.
本文描述向個體遞送藥劑或如本文所揭示之組合物以治療病症的方法,使得該個體遭受的副作用或全身毒性與化學療法治療相比最小。在一些實施例中,個體當用本文所述之藥劑及/或組合物治療時,未經歷與化學療法典型相關的任何顯著副作用。在一些實施例中,個體當用本文所述之藥劑及/或組合物治療時,未經歷顯著副作用,包括(但不限於)禿髮、噁心、嘔吐、食慾不振、酸痛、嗜中性球減少症、貧血、血小板減少症、眩暈、疲勞、便秘、口腔潰瘍、皮膚發癢、脫皮、神經及肌肉損傷、聽覺變化、體重減輕、腹瀉、免疫抑制、淤血、心臟損傷、出血、肝損傷、腎損傷、水腫、口腔及咽喉生瘡、不育、纖維化、脫髮、濕性脫皮、黏膜乾燥、眩暈及腦病變。在一些實施例中,個體當用本文所述之藥劑及/或組合物治療時,未顯示顯著的體重減輕。Described herein are methods of delivering an agent, or a composition as disclosed herein, to an individual to treat a disorder such that the individual suffers minimal side effects or systemic toxicity compared to chemotherapy treatment. In some embodiments, the individual does not experience any significant side effects typically associated with chemotherapy when treated with the agents and/or compositions described herein. In some embodiments, the subject does not experience significant side effects when treated with the agents and/or compositions described herein, including, but not limited to, alopecia, nausea, vomiting, loss of appetite, soreness, neutropenia anemia, thrombocytopenia, dizziness, fatigue, constipation, mouth sores, itchy skin, peeling, nerve and muscle damage, hearing changes, weight loss, diarrhea, immunosuppression, congestion, heart damage, bleeding, liver damage, kidney Injuries, edema, mouth and throat sores, infertility, fibrosis, hair loss, wet peeling, dry mucous membranes, dizziness, and brain lesions. In some embodiments, the individual does not exhibit significant weight loss when treated with the agents and/or compositions described herein.
本文所述之藥劑及組合物可投與個體,例如哺乳動物,例如活體內投與,以治療或預防如本文所述的多種病症。該病症包括涉及以MYC之表現模式改變為特徵之細胞的病症。The agents and compositions described herein can be administered to an individual, such as a mammal, eg, in vivo, to treat or prevent a variety of disorders as described herein. The disorder includes a disorder involving cells characterized by an altered expression pattern of MYC.
表觀遺傳修飾本發明另外部分地關於一種對靶基因、可操作地連接至靶基因之轉錄控制元件或錨定序列(例如近接於靶基因或與可操作地連接至靶基因之錨定序列介導性接合體相關的錨定序列)進行表觀遺傳修飾的方法,該方法包含:提供表現抑制子(或編碼其的核酸)或表現抑制系統(例如表現抑制子),或編碼其的核酸,或包含該表現抑制子(或編碼其的核酸)或表現抑制系統或核酸的醫藥組合物;以及使靶基因或可操作地連接至靶基因的轉錄控制元件與表現抑制子或表現抑制系統接觸,藉此對靶基因(例如MYC)或可操作地連接至靶基因(例如MYC)的轉錄控制元件進行表觀遺傳修飾。 Epigenetic modification The present invention is additionally directed in part to a target gene, a transcriptional control element or an anchor sequence operably linked to a target gene (e.g., in close proximity to or with an anchor sequence operably linked to a target gene) An anchor sequence associated with a conductive adapter) for epigenetic modification, the method comprising: providing an expression repressor (or a nucleic acid encoding it) or an expression repression system (such as an expression repressor), or a nucleic acid encoding it, or a pharmaceutical composition comprising the expression inhibitor (or nucleic acid encoding it) or expression inhibition system or nucleic acid; and contacting the target gene or a transcriptional control element operably linked to the target gene with the expression inhibitor or expression inhibition system, Epigenetic modifications are thereby made to a target gene (eg MYC) or a transcriptional control element operably linked to a target gene (eg MYC).
在一些實施例中,對靶基因(例如MYC)或可操作地連接至靶基因(例如MYC)之轉錄控制元件進行表觀遺傳修飾的方法包含增加或減少靶基因(例如MYC)或可操作地連接至靶基因(例如MYC)之轉錄控制元件的DNA甲基化。在一些實施例中,對靶基因(例如MYC)或可操作地連接至靶基因(例如MYC)之轉錄控制元件進行表觀遺傳修飾的方法包含增加或減少與靶基因(例如MYC)或可操作地連接至靶基因(例如MYC)之轉錄控制元件相關之組蛋白的組蛋白甲基化。在一些實施例中,對靶基因(例如MYC)或可操作地連接至靶基因(例如MYC)之轉錄控制元件進行表觀遺傳修飾的方法包含減少與靶基因(例如MYC)或可操作地連接至靶基因(例如MYC)之轉錄控制元件相關之組蛋白的組蛋白乙醯化。在一些實施例中,對靶基因(例如MYC)或可操作地連接至靶基因(例如MYC)之轉錄控制元件進行表觀遺傳修飾的方法包含增加或減少與靶基因(例如MYC)或可操作地連接至靶基因(例如MYC)之轉錄控制元件相關之組蛋白的組蛋白類小泛素化。在一些實施例中,對靶基因(例如MYC)或可操作地連接至靶基因(例如MYC)之轉錄控制元件進行表觀遺傳修飾的方法包含增加或減少與靶基因(例如MYC)或可操作地連接至靶基因(例如MYC)之轉錄控制元件相關之組蛋白的組蛋白磷酸化。In some embodiments, the method of epigenetically modifying a target gene (eg, MYC) or a transcriptional control element operably linked to a target gene (eg, MYC) comprises increasing or decreasing a target gene (eg, MYC) or operably linked to a target gene (eg, MYC) DNA methylation of transcriptional control elements linked to target genes such as MYC. In some embodiments, the method of epigenetically modifying a target gene (eg, MYC) or a transcriptional control element operably linked to a target gene (eg, MYC) comprises increasing or decreasing a transcriptional control element operably linked to a target gene (eg, MYC) or Histone methylation of histones associated with transcriptional control elements that are positively linked to target genes such as MYC. In some embodiments, the method of epigenetically modifying a target gene (eg, MYC) or a transcriptional control element operably linked to a target gene (eg, MYC) comprises reducing Histone acetylation to histones associated with transcriptional control elements of target genes (eg MYC). In some embodiments, the method of epigenetically modifying a target gene (eg, MYC) or a transcriptional control element operably linked to a target gene (eg, MYC) comprises increasing or decreasing a transcriptional control element operably linked to a target gene (eg, MYC) or Histone-like microubiquitination of histones associated with transcriptional control elements that are positively linked to target genes such as MYC. In some embodiments, the method of epigenetically modifying a target gene (eg, MYC) or a transcriptional control element operably linked to a target gene (eg, MYC) comprises increasing or decreasing a transcriptional control element operably linked to a target gene (eg, MYC) or Histone phosphorylation of histones associated with transcriptional control elements that are positively linked to target genes such as MYC.
在一些實施例中,相對於組合物未接觸或未經該方法處理之細胞中的該位點之表觀遺傳修飾水準,對靶基因(例如MYC)或可操作地連接至靶基因(例如MYC)之轉錄控制元件進行表觀遺傳修飾的方法可使表觀遺傳修飾水準降低至少10、20、30、40、50、60、70、80、90或100% (且視情況,高達100%)。在一些實施例中,相對於組合物未接觸或未經該方法處理之細胞中的該位點之表觀遺傳修飾水準,對靶基因(例如MYC)或可操作地連接至靶基因(例如MYC)之轉錄控制元件進行表觀遺傳修飾的方法可使表觀遺傳修飾水準增加至少10、20、30、40、50、60、70、80、90、100、120、140、160、180、200、300、400、500、600、700、800、900或1000% (且視情況,高達200、300、400、500、600、700、800、900、1000或2000%)。在一些實施例中,對靶基因(例如MYC)或可操作地連接至靶基因(例如MYC)之轉錄控制元件的表觀遺傳修飾可修改靶基因(例如MYC)之表現量,例如如本文所述。In some embodiments, the target gene (e.g., MYC) or operably linked to the target gene (e.g., MYC A method of epigenetic modification of a transcriptional control element of ) that reduces the level of epigenetic modification by at least 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100% (and optionally, up to 100%) . In some embodiments, the target gene (e.g., MYC) or operably linked to the target gene (e.g., MYC ) method for epigenetic modification of a transcriptional control element that increases the level of epigenetic modification by at least 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 120, 140, 160, 180, 200 . In some embodiments, epigenetic modification of a target gene (e.g., MYC) or a transcriptional control element operably linked to a target gene (e.g., MYC) modifies the expression of the target gene (e.g., MYC), e.g., as described herein stated.
在一些實施例中,藉由本文所述之方法產生的表觀遺傳修飾保持至少約1小時至約30天,或至少約2小時、6小時、12小時、18小時、24小時、2天、3天、4天、5天、6天、7天、8天、9天、10天、11天、12天、13天、14天、15天、16天、17天、18天、19天、20天、21天、22天、23天、24天、25天、26天、27天、28天、29天、30天或更長,或其間的任何時間。在一些實施例中,此類調節保持至少1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23或24小時,或至少1、2、3、4、5、6或7天,或至少1、2、3、4或5週,或至少1、2、3、4、5、6、7、8、9、10、11或12個月,或至少1、2、3、4或5年(例如無限期地)。視情況,此類調節保持不超過10、9、8、7、6、5、4、3、2或1年。In some embodiments, epigenetic modifications produced by the methods described herein are maintained for at least about 1 hour to about 30 days, or at least about 2 hours, 6 hours, 12 hours, 18 hours, 24 hours, 2 days, 3 days, 4 days, 5 days, 6 days, 7 days, 8 days, 9 days, 10 days, 11 days, 12 days, 13 days, 14 days, 15 days, 16 days, 17 days, 18 days, 19 days , 20 days, 21 days, 22 days, 23 days, 24 days, 25 days, 26 days, 27 days, 28 days, 29 days, 30 days or longer, or any period in between. In some embodiments, such adjustments maintain at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 , 21, 22, 23, or 24 hours, or at least 1, 2, 3, 4, 5, 6, or 7 days, or at least 1, 2, 3, 4, or 5 weeks, or at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or 12 months, or at least 1, 2, 3, 4 or 5 years (eg indefinitely). Such adjustments are maintained for no more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 year, as appropriate.
在一些實施例中,對靶基因(例如MYC)或可操作地連接至靶基因(例如MYC)之轉錄控制元件進行表觀遺傳修飾之方法中使用的表現抑制子或表現抑制系統包含含有效應部分的表現抑制子,該效應部分為或包含表觀遺傳修飾部分。In some embodiments, the expression suppressor or expression suppression system used in the method of epigenetic modification of a target gene (eg, MYC) or a transcriptional control element operably linked to a target gene (eg, MYC) comprises an effector moiety comprising The expression suppressor of, the effector portion is or comprises an epigenetic modification portion.
舉例而言,效應部分可為或包含具有DNA甲基轉移酶活性的表觀遺傳修飾部分,且內源或天然存在之靶序列(例如靶基因,例如MYC,或轉錄控制元件)可經改變以增加其甲基化(例如減少轉錄因子與靶基因(例如MYC)或轉錄控制元件之一部分的相互作用,減少成核蛋白質對錨定序列的結合,及/或破壞或阻礙錨定序列介導性接合體),或可經改變以減少其甲基化(例如增加轉錄因子與靶基因(例如MYC)或轉錄控制元件之一部分的相互作用,增加成核蛋白質對錨定序列的結合,及/或促進或增加錨定序列介導性接合體的強度)。For example, an effector moiety can be or comprise an epigenetic modification moiety having DNA methyltransferase activity, and an endogenous or naturally occurring target sequence (e.g., a target gene, such as MYC, or a transcriptional control element) can be altered to Increase its methylation (e.g., reduce the interaction of a transcription factor with a target gene (e.g., MYC) or part of a transcriptional control element, reduce binding of nucleating proteins to anchor sequences, and/or disrupt or prevent anchor sequence-mediated adapters), or can be altered to reduce their methylation (e.g., to increase the interaction of a transcription factor with a target gene (e.g., MYC) or part of a transcriptional control element, to increase the binding of a nucleating protein to an anchor sequence, and/or promote or increase the strength of anchor sequence-mediated adapters).
套組本發明另外部分地關於一種套組,其包含表現抑制子或表現抑制系統,例如本文所述之表現抑制子。在一些實施例中,套組包含表現抑制子或表現抑制系統(例如表現抑制系統中的表現抑制子)及該表現抑制子或表現抑制系統之使用說明書。在一些實施例中,套組包含編碼表現抑制子的核酸或編碼表現抑制系統或其組分(例如表現抑制系統中之表現抑制子)的核酸及該表現抑制子(及/或該核酸)及/或該表現抑制系統(及/或該核酸)的使用說明書。在一些實施例中,套組包含細胞,該細胞包含編碼表現抑制子的核酸或編碼表現抑制系統或其組分(例如表現抑制系統中的表現抑制子)的核酸;及該細胞、核酸及/或該表現抑制子或表現抑制系統的使用說明書。 Kits The present invention additionally relates in part to a kit comprising an expression suppressor or expression suppressor system, such as the expression suppressors described herein. In some embodiments, a kit comprises an expression inhibitor or expression inhibition system (eg, an expression inhibitor in an expression inhibition system) and instructions for use of the expression inhibitor or expression inhibition system. In some embodiments, the kit comprises a nucleic acid encoding an expression inhibitor or a nucleic acid encoding an expression inhibition system or a component thereof (e.g., an expression inhibitor in an expression inhibition system) and the expression inhibitor (and/or the nucleic acid) and and/or instructions for use of the expression suppression system (and/or the nucleic acid). In some embodiments, the kit comprises a cell comprising a nucleic acid encoding an expression inhibitor or a nucleic acid encoding an expression inhibition system or a component thereof (e.g., an expression inhibitor in an expression inhibition system); and the cell, the nucleic acid, and/or Or instructions for use of the expression suppressor or expression suppression system.
在一些態樣中,套組包含a)容器,該容器包含含有一系統的組合物,該系統包含兩種表現抑制子,包含含有第一靶向部分及視情況存在之第一效應部分的第一表現抑制子,其中該第一表現抑制子結合至可操作地連接至靶基因(例如MYC)的轉譯調控元件(例如啟動子或轉錄起點(TSS))或結合至與轉譯調控元件近接的序列;以及含有第二靶向部分及視情況存在之第二效應部分的表現抑制子,其中第二表現抑制子結合至包含靶基因(例如MYC)之錨定序列介導性接合體(ASMC)中的錨定序列或結合至與錨定序列近接的序列。In some aspects, the kit comprises a) a container comprising a composition comprising a system comprising two expression inhibitors comprising a first targeting moiety and optionally a first effector moiety. An expression repressor, wherein the first expression repressor binds to a translational regulatory element (e.g., a promoter or a transcriptional start (TSS)) operably linked to a target gene (e.g., MYC) or to a sequence in close proximity to a translational regulatory element and an expression suppressor comprising a second targeting moiety and optionally a second effector moiety, wherein the second expression suppressor binds to an anchor sequence-mediated adapter (ASMC) comprising a target gene (eg, MYC) anchor sequence or binds to a sequence in close proximity to the anchor sequence.
在一些態樣中,套組包含a)容器,該容器包含含有一系統的組合物,該系統包含兩種表現抑制子,包含含有第一靶向部分及視情況存在之第一效應部分的第一表現抑制子,其中該第一表現抑制子結合至可操作地連接至靶基因(例如MYC)的轉譯調控元件(例如啟動子或轉錄起點(TSS))或結合至與轉譯調控元件近接的序列;以及含有第二靶向部分及視情況存在之第二效應部分的表現抑制子,其中該第二表現抑制子結合至位於靶基因(例如MYC)之超級增強子區域中的基因體基因座。In some aspects, the kit comprises a) a container comprising a composition comprising a system comprising two expression inhibitors comprising a first targeting moiety and optionally a first effector moiety. An expression repressor, wherein the first expression repressor binds to a translational regulatory element (e.g., a promoter or a transcriptional start (TSS)) operably linked to a target gene (e.g., MYC) or to a sequence in close proximity to a translational regulatory element and an expression suppressor comprising a second targeting moiety and optionally a second effector moiety, wherein the second expression suppressor binds to a gene body locus located in a super-enhancer region of a target gene (eg, MYC).
在一些實施例中,套組進一步包含b)一組說明書,其包含用該組合物治療疾病或調節(例如降低)靶基因(例如MYC)在細胞內之表現的至少一種方法。在一些實施例中,套組可視情況包括用於該組合物的遞送媒劑(例如脂質奈米粒子)。可提供懸浮於賦形劑及/或遞送媒劑中的試劑,或試劑可作為各別組分提供,之後可與賦形劑及/或遞送媒劑合併。在一些實施例中,套組可視情況含有與組合物共投與的其他治療劑,以影響所需靶基因表現,例如調節MYC基因表現。雖然說明材料典型地為書面或印刷材料,但其不限於此。考慮能夠儲存此類說明書且將其傳達至最終使用者之任何介質。此類介質包括(但不限於)電子儲存介質(例如磁盤、磁帶、盒式磁盤、晶片)、光學介質(例如CD ROM)及其類似物。此類介質可包括提供此類說明材料之網際網路站點的地址。In some embodiments, the kit further comprises b) a set of instructions comprising at least one method of using the composition to treat a disease or modulate (eg, reduce) the expression of a target gene (eg, MYC) in a cell. In some embodiments, the kit optionally includes a delivery vehicle (eg, lipid nanoparticles) for the composition. The agent may be provided suspended in the excipient and/or delivery vehicle, or the agent may be provided as a separate component which may then be combined with the excipient and/or delivery vehicle. In some embodiments, the kit optionally contains other therapeutic agents co-administered with the composition to affect desired target gene expression, eg, modulate MYC gene expression. While instructional material is typically written or printed material, it is not limited to such. Any medium capable of storing such instructions and communicating them to end users is contemplated. Such media include, but are not limited to, electronic storage media (eg, magnetic disks, tapes, cartridges, wafers), optical media (eg, CD ROM), and the like. Such media may include addresses to Internet sites that provide such instructional materials.
在一些實施例中,套組包含表現抑制子、表現抑制系統(例如本文所述之表現抑制子)的單位劑型,或編碼表現抑制系統(例如本文所述之表現抑制子)之核酸(例如載體)的單位劑型。In some embodiments, the kit comprises a unit dosage form of an expression inhibitor, an expression inhibitor system, such as an expression inhibitor described herein, or a nucleic acid (e.g., a vector) encoding an expression inhibitor system, such as an expression inhibitor described herein. ) in unit dosage form.
以下實例係為了進一步說明本發明之一些實施例而提供,但不希望其限制本發明之範疇;應瞭解,根據其例示性性質,可替代地使用熟習此項技術者已知之其他程序、方法或技術。The following examples are provided to further illustrate some embodiments of the present invention, but are not intended to limit the scope of the present invention; technology.
實例example 實例example 11 :: CTCFCTCF 模體的靶向修飾引起Targeted modification of the motif causes MYCMYC 下調lower down
此實例描述利用Cas9對CTCF模體或與CTCF模體鄰接的區域進行核酸酶編輯以下調MYC表現。This example describes nuclease editing of the CTCF motif or regions adjacent to the CTCF motif using Cas9 to downregulate MYC expression.
在此實例中,為了鑑別出導引表觀遺傳效應子介導MYC基因下調的最佳標靶區域,設計與外顯子1中所含之啟動子區域CpG島互補的四種sgRNA。c-MYC基因含有較長的非轉譯外顯子1 (Spencer CA, Groudine M 1991),外顯子2及3含有主要編碼區。為了鑑別出適合的下調區域,使用與MYC啟動子CpG島區域互補之四種sgRNA所靶向的CRISPR-dCas9-KRAB及CRISPR-dCas9-MQ1 (來自細菌柔膜菌螺原體的DNMT)效應子進行基因體編輯。使用(1) dCas9 (無效應子)與sgRNA的組合及(2)未處理的細胞作為對照,以評估MYC表現的變化。對容易生長且可轉染的細胞K562及HEK293進行此等初始篩選以選擇sgRNA且確定sgRNA GD-28617對MYC mRNA下調介導最強的作用(資料未展示)。In this example, four sgRNAs complementary to the CpG islands of the promoter region contained in
利用2.5 μg/ml SSOP調配物將靶向CTCF模體(GD-28616)或緊鄰於CTCF模體之上游區域(GD-28859)(CTCF錨定位點)的Cas9轉染至三種人類HCC模型(HepG2、Hep3B及SKHEP1)中(表11及圖1A-B)。Cas9 (與GD-28616的組合)對CTCF模體的破壞引起所有三種HCC細胞株(HepG2、Hep3B及SKHEP1)中的MYC表現下調32-39%。對CTCF模體相鄰區域(GD-28859)的破壞將三種株系中之兩者(HepG2及Hep3B)中的MYC表現下調35-45% (圖2A)。如AmpSeq所評估的編輯效率證實細胞株發生77-100%編輯(圖2B)。
表11:嚮導
此實例描述藉由使與dCas9sgRNA融合的KRAB效應子靶向CTCF模體(GD-28616)或緊鄰於CTCF模體的上游區域(GD-28859)或MYC啟動子(GD-28617)而下調MYC1表現。為了靶向HCC中的MYC表現,藉由將CRISPR-dCas9系統繫拴至KRAB來修改該系統。This example describes the downregulation of MYC1 expression by targeting a KRAB effector fused to dCas9sgRNA to the CTCF motif (GD-28616) or immediately upstream of the CTCF motif (GD-28859) or the MYC promoter (GD-28617) . To target MYC expression in HCC, the CRISPR-dCas9 system was modified by tethering the system to KRAB.
在此實例中,將dCas9-KRAB mRNA與個別sgRNA (表1及圖1A-B)一起遞送至人類HCC細胞株(HepG2、Hep3B及SKHEP1),從而使其靶向CTCF模體(GD-28616)、CTCF「錨定」位點(GD-28859)或MYC啟動子(GD-28617)。利用SSOP形式的LNP遞送,將效應子mRNA與sgRNA共遞送。作為對照,使用(1) dCas9 (無效應子)與sgRNA的組合及(2)未處理的細胞評估變化。將HCC細胞於生長培養基中接種於96孔盤中(5000個細胞/孔)。接著將LNP調配物(2-2.5 μg/ml)添加至細胞中以轉染mRNA及sgRNA。所有處理均以生物學方式重複三次。評估24至168小時範圍內之時間點的MYC表現及細胞存活率。使用RNeasy ®plus 96孔套組(Qiagen),依循製造商方案,分離來自四個獨立實驗的RNA。使用LunaScript ®RT SuperMix套組(NEB)將RNA樣品逆轉錄成cDNA且使用MYC特異性TaqMan TM引子/探針組分析及TaqMan TMFast Advanced預混液(Thermo Scientific),藉由定量PCR (qPCR)加以分析(技術實驗重複三次)。使用ΔΔCt方法,相對於GAPDH參考基因的表現來定量MYC表現。未處理的樣品及dCas9 (無效應子)樣品用作校準物。 In this example, dCas9-KRAB mRNA was delivered to human HCC cell lines (HepG2, Hep3B, and SKHEP1) along with individual sgRNAs (Table 1 and Figure 1A-B) to target the CTCF motif (GD-28616) , CTCF "anchor" site (GD-28859) or MYC promoter (GD-28617). Effector mRNA was co-delivered with sgRNA using LNP delivery in the form of SSOP. As controls, changes were assessed using (1) dCas9 (no effector) in combination with sgRNA and (2) untreated cells. HCC cells were seeded in 96-well plates (5000 cells/well) in growth medium. LNP formulations (2-2.5 μg/ml) were then added to cells to transfect mRNA and sgRNA. All treatments were biologically replicated three times. MYC expression and cell viability were assessed at time points ranging from 24 to 168 hours. RNA from four independent experiments was isolated using the RNeasy® plus 96-well kit (Qiagen) following the manufacturer's protocol. RNA samples were reverse transcribed into cDNA using the LunaScript® RT SuperMix Kit (NEB) and analyzed by quantitative PCR (qPCR) using MYC-specific TaqMan ™ Primer/Probe Set Analysis and TaqMan ™ Fast Advanced Master Mix (Thermo Scientific). Analysis (technical experiments repeated three times). MYC expression was quantified relative to the expression of the GAPDH reference gene using the ΔΔCt method. Untreated samples and dCas9 (no effector) samples were used as calibrators.
實驗資料表明,在48/72小時時間點,LNP介導之dCas9-KRAB/GD-28616轉染使Hep3B及SKHEP1中的MYC表現下調11-34%。在48/72小時時間點,LNP介導之dCas9-KRAB/GD-28859轉染使所有3種HCC模型中的MYC表現下調18-44% (圖3)。截至第168小時,此效應在兩種株系(Hep3B及SKHEP1)中減小,但HepG2株系中的MYC表現在第168小時下降28% (圖3)。在48/72小時時間點,經由dCas9-KRAB/GD-28617將dCas9-KRAB引向MYC啟動子使所有3種HCC模型中的MYC表現下調24-58%。截至第168小時,此效應在兩種株系(Hep3B及SKHEP1)中減小,但HepG2中的MYC在第168小時下降43% (圖3)。 實例3:與鋅指域融合的KRAB效應子對MYC1表現的下調作用 Experimental data showed that LNP-mediated dCas9-KRAB/GD-28616 transfection downregulated MYC expression in Hep3B and SKHEP1 by 11-34% at the time point of 48/72 hours. At the 48/72 hour time point, LNP-mediated dCas9-KRAB/GD-28859 transfection downregulated MYC expression by 18-44% in all 3 HCC models (Figure 3). By 168 hours, this effect was reduced in both lines (Hep3B and SKHEP1), but MYC expression in the HepG2 line was reduced by 28% at 168 hours (Figure 3). Directing dCas9-KRAB to the MYC promoter via dCas9-KRAB/GD-28617 downregulated MYC expression by 24-58% in all 3 HCC models at the 48/72 hour time point. By 168 hours, this effect was reduced in both lines (Hep3B and SKHEP1), but MYC in HepG2 was reduced by 43% at 168 hours (Figure 3). Example 3: Down-regulation of MYC1 expression by KRAB effectors fused to zinc finger domains
此實例描述藉由使與鋅指域融合的KRAB效應子靶向緊鄰於CTCF模體的上游區域來下調MYC1表現(GD-28859)。在此實例中,指向鋅指的KRAB效應子(ZF-KRAB效應子)設計成結合位於GD-28859所靶向之DNA區域處的錨定位點(圖4A)。設計7種構築體(dCas-KRAB/GD-59、ZF1-KRAB、ZF2-KRAB、ZF3-KRAB、ZF4-KRAB、ZF5-KRAB及ZF6-KRAB)且在人類HCC模型(HepG2、Hep3B、SKHEP1)中進行篩選。此等實驗的陰性對照包括未處理的細胞且使用dCas9-KRAB/GD-28859作為陽性對照。使用Rneasy ®Plus 96孔套組(Qiagen),依循製造商方案,分離來自兩個獨立實驗(各以生物學方式重複三次)的RNA。使用LunaScript ®RT SuperMix套組(NEB)將RNA樣品逆轉錄成cDNA且使用MYC特異性Taqman引子/探針組分析及TaqMan TMFast Advanced預混液(Thermo Scientific),藉由定量PCR (qPCR)加以分析(技術實驗重複三次)。使用ΔΔCt方法,相對於GAPDH參考基因的表現來定量MYC表現,使用未處理的樣品及dCas9 (無效應子)樣品作為校準物。使用CellTiter-GLO ®發光細胞存活率分析(Promega #G9241),藉由ATP定量來評估細胞存活率。將HCC細胞於生長培養基中接種於96孔盤中(5000個細胞/孔)。接著將LNP調配物(0.6-2.5 μg/ml)添加至細胞中以轉染mRNA及sgRNA。所有處理均以生物學方式重複三次。 This example describes the downregulation of MYC1 expression by targeting a KRAB effector fused to a zinc finger domain immediately upstream of a CTCF motif (GD-28859). In this example, zinc finger-directed KRAB effectors (ZF-KRAB effectors) were designed to bind anchor sites located at the DNA region targeted by GD-28859 (Figure 4A). Seven constructs (dCas-KRAB/GD-59, ZF1-KRAB, ZF2-KRAB, ZF3-KRAB, ZF4-KRAB, ZF5-KRAB, and ZF6-KRAB) were designed and tested in human HCC models (HepG2, Hep3B, SKHEP1) filter in. Negative controls for these experiments included untreated cells and dCas9-KRAB/GD-28859 was used as a positive control. RNA from two independent experiments (each biologically replicated in triplicate) was isolated using the RNeasy® Plus 96-well kit (Qiagen) following the manufacturer's protocol. RNA samples were reverse transcribed into cDNA using the LunaScript® RT SuperMix Kit (NEB) and analyzed by quantitative PCR (qPCR) using MYC-specific Taqman Primer/Probe Set Analysis and TaqMan ™ Fast Advanced Master Mix (Thermo Scientific) (Technical experiment repeated three times). MYC expression was quantified relative to the expression of the GAPDH reference gene using the ΔΔCt method, using untreated samples and dCas9 (no effector) samples as calibrators. Cell viability was assessed by ATP quantification using the CellTiter-GLO ® Luminescent Cell Viability Assay (Promega #G9241). HCC cells were seeded in 96-well plates (5000 cells/well) in growth medium. LNP formulations (0.6-2.5 μg/ml) were then added to cells to transfect mRNA and sgRNA. All treatments were biologically replicated three times.
此等資料表明,ZF2-KRAB、ZF3-KRAB及ZF4-KRAB使Hep3B細胞中之MYC下調的程度等效於或大於dCas9-KRAB/GD-28859,其中ZF3-KRAB具有最強的下調作用(圖4B)。在其他兩種HCC模型HepG2及SKHEP1中,ZF3-KRAB亦顯示使MYC下調的程度等效於或大於dCas9-KRAB/GD-28859 (圖4C)。亦測定ZF3-無效應子(NE)對MYC表現具有下調作用,原因可能為調控位點出現空間位阻。ZF3-KRAB與ZF3-NE在Hep3B中、在24、72及120小時的時程內均對MYC表現及存活率展現下調作用(圖4D)。 實例4:與dCas9sgRNA融合之MQ1效應子對MYC1表現的下調作用 These data indicate that ZF2-KRAB, ZF3-KRAB and ZF4-KRAB downregulate MYC in Hep3B cells to a degree equivalent to or greater than that of dCas9-KRAB/GD-28859, among which ZF3-KRAB has the strongest downregulation effect (Fig. 4B ). In two other HCC models, HepG2 and SKHEP1, ZF3-KRAB was also shown to downregulate MYC to an equivalent or greater extent than dCas9-KRAB/GD-28859 (Fig. 4C). It was also determined that ZF3-neutral effector (NE) has a down-regulation effect on MYC expression, which may be due to steric hindrance at the regulatory site. Both ZF3-KRAB and ZF3-NE exhibited downregulation of MYC expression and survival in Hep3B over a time course of 24, 72 and 120 hours (Fig. 4D). Example 4: Down-regulation of MYC1 expression by MQ1 effectors fused to dCas9sgRNA
此實例描述藉由使與dCas9sgRNA融合的MQ1效應子靶向CTCF模體(GD-28616)或緊鄰於CTCF模體的上游區域(GD-28859)、GD-28862或MYC啟動子(GD-28617)而下調MYC1表現。This example describes targeting the CTCF motif (GD-28616) or immediately upstream of the CTCF motif (GD-28859), GD-28862, or the MYC promoter (GD-28617) by targeting an MQ1 effector fused to a dCas9 sgRNA while down-regulating MYC1 expression.
在此實例中,將dCas9-MQ1 mRNA與個別sgRNA (表11及圖1A-B)一起遞送至人類HCC細胞株(HepG2、Hep3B及SKHEP1),從而使其靶向CTCF「錨定」位點或MYC啟動子。利用SSOP形式的LNP遞送,將效應子mRNA與sgRNA共遞送。作為對照,使用(1) dCas9 (無效應子)與sgRNA的組合及(2)未處理的細胞評估變化。依循基本上如實例2中所述的方案來進行實驗。In this example, dCas9-MQ1 mRNA was delivered to human HCC cell lines (HepG2, Hep3B, and SKHEP1) along with individual sgRNAs (Table 11 and Figure 1A-B), allowing them to target the CTCF "anchor" site or MYC promoter. Effector mRNA was co-delivered with sgRNA using LNP delivery in the form of SSOP. As controls, changes were assessed using (1) dCas9 (no effector) in combination with sgRNA and (2) untreated cells. Experiments were performed following a protocol essentially as described in Example 2.
結果表明,dCas9-MQ1藉由GD-16、GD-59及GD-62靶向CTCF,以細胞株特異性方式引起可變的下調及上調。dCas9-MQ1藉由GD-17靶向啟動子,在第72小時引起MYC下調50-90%,且在所有三種HCC模型中保持至第168小時(圖5)。儘管MYC下調對SK-HEP-1存活率的影響最小,但在第72小時及第168小時,MYC下調使HepG2及Hep3B的存活率顯著降低。相較於未處理的對照物,dCas9-sgRNA對照物對表現或存活率無影響。 實例5:與鋅指域融合的MQ1效應子對MYC1表現的下調作用 The results showed that dCas9-MQ1 targets CTCF via GD-16, GD-59, and GD-62, causing variable down-regulation and up-regulation in a cell line-specific manner. Targeting the promoter by GD-17, dCas9-MQ1 caused a 50-90% downregulation of MYC at 72 hours and was maintained until 168 hours in all three HCC models (Fig. 5). Although MYC downregulation had minimal effect on SK-HEP-1 survival, MYC downregulation significantly decreased HepG2 and Hep3B survival at 72 hours and 168 hours. The dCas9-sgRNA control had no effect on performance or survival compared to untreated controls. Example 5: Down-regulation of MYC1 expression by MQ1 effectors fused to zinc finger domains
此實例描述藉由使與鋅指域融合的MQ1效應子靶向MYC啟動子(GD-28617)而下調MYC1表現。This example describes the downregulation of MYC1 expression by targeting an MQ1 effector fused to a zinc finger domain to the MYC promoter (GD-28617).
在此實例中,指向鋅指的MQ1效應子(ZF-MQ1效應子)設計成結合GD-28617所靶向的DNA區域(圖4A及圖6A)。圍繞GD-28617設計6種構築體(ZF7-MQ1、ZF8-MQ1、ZF9-MQ1、ZF10-MQ1、ZF11-MQ1及ZF12-MQ1)以結合實例4篩選中所鑑別的區域且在人類HCC模型(HepG2、Hep3B、SKHEP1)中篩選。使用未處理的細胞及經單獨ZF蛋白質(ZF-無效應子或ZF-NE)轉染的細胞作為實驗的陰性對照且使用dCas9-KRAB/GD-28859作為陽性對照。依循如實例3中所述之方案。In this example, a zinc finger-directed MQ1 effector (ZF-MQ1 effector) was designed to bind the DNA region targeted by GD-28617 (Figure 4A and Figure 6A). Six constructs (ZF7-MQ1 , ZF8-MQ1 , ZF9-MQ1 , ZF10-MQ1 , ZF11-MQ1 , and ZF12-MQ1 ) were designed around GD-28617 to bind the regions identified in the Example 4 screen and in human HCC models ( Screening in HepG2, Hep3B, SKHEP1). Untreated cells and cells transfected with ZF protein alone (ZF-null effector or ZF-NE) were used as negative controls for the experiment and dCas9-KRAB/GD-28859 was used as positive control. The protocol as described in Example 3 was followed.
結果表明,ZF8-MQ1、ZF9-MQ1及ZF11-MQ1以最大程度下調Hep3B細胞中的MYC,其中ZF9-MQ1的下調作用最強(圖6B)。資料進一步表明,成功地減少MYC表現之ZF-MQ1分子(ZF8-MQ1、ZF9-MQ1及ZF11-MQ1)的基因體結合位點與不改變MYC表現之ZF-MQ1分子(ZF7-MQ1、ZF10-MQ1及ZF12-MQ1)的基因體結合位點緊密鄰近。此可為以下各者的影響:結合效率、3-D結合取向、區域甲基化延伸或特異性效應子相對於MYC啟動子中之調控元件定位的結果。ZF-效應子結合位點的稍微偏移似乎給MYC基因調控帶來顯著後果。發現ZF9-MQ1強烈下調MYC mRNA且降低所有三種HCC模型的存活率,且在ZF9-MQ1構築體存在下發現的效應比dCas9-MQ1/GD-28617系統可指向的效應大得多。 實例6:與其他ZF-MQ1效應子相比,ZF9-MQ1對MYC1表現展現最強的下調作用 The results showed that ZF8-MQ1, ZF9-MQ1 and ZF11-MQ1 down-regulated MYC in Hep3B cells to the greatest extent, and ZF9-MQ1 had the strongest down-regulation effect ( FIG. 6B ). The data further indicate that the gene body binding sites of ZF-MQ1 molecules that successfully reduce MYC expression (ZF8-MQ1, ZF9-MQ1, and ZF11-MQ1) are more The gene body binding sites of MQ1 and ZF12-MQ1) are in close proximity. This could be an effect of binding efficiency, 3-D binding orientation, region methylation stretching, or a consequence of the positioning of specific effectors relative to regulatory elements in the MYC promoter. A slight shift in the ZF-effector binding site appears to have dramatic consequences for MYC gene regulation. ZF9-MQ1 was found to strongly downregulate MYC mRNA and reduce survival in all three HCC models, and the effect found in the presence of the ZF9-MQ1 construct was much larger than could be pointed to by the dCas9-MQ1/GD-28617 system. Example 6: ZF9-MQ1 exhibits the strongest downregulation of MYC1 expression compared to other ZF-MQ1 effectors
此實例描述相較於所測試的其他ZF-MQ1效應子,ZF9-MQ1藉由使與鋅指域融合的MQ1效應子靶向MYC啟動子而在下調MYC1表現方面發揮最強的作用(GD-28617)。This example describes that ZF9-MQ1 exerts the strongest effect in downregulating MYC1 expression by targeting the MQ1 effector fused to the zinc finger domain to the MYC promoter compared to other ZF-MQ1 effectors tested (GD-28617 ).
在此實例中,將ZF9-MQ1下調所有HCC模型(HepG2、Hep3B及SKHEP1)中之MYC表現的作用分別與ZF12-MQ1、ZF8-MQ1及dCas9-MQ1/GD-28617的作用進行比較。使用未處理的細胞及經單獨ZF蛋白質(ZF-無效應子或ZF-NE)轉染的細胞作為實驗的陰性對照且使用dCas9-KRAB/GD-28859作為陽性對照。依循如實例3中所述之方案。In this example, the effect of ZF9-MQ1 on downregulating MYC expression in all HCC models (HepG2, Hep3B, and SKHEP1) was compared to that of ZF12-MQ1, ZF8-MQ1, and dCas9-MQ1/GD-28617, respectively. Untreated cells and cells transfected with ZF protein alone (ZF-null effector or ZF-NE) were used as negative controls for the experiment and dCas9-KRAB/GD-28859 was used as positive control. The protocol as described in Example 3 was followed.
與ZF12-MQ1相比,ZF9-MQ1顯示可強烈下調MYC且使所檢查之所有三種HCC模型(Hep3B、HepG2及SKHEP1)的存活率降低(圖7A-C)。ZF9-MQ1調控的MYC1表現下調相對大於在ZF8-MQ1存在下所見的下調(圖7D-F)且在ZF9-MQ1構築體存在下所見的效應比dCas9-MQ1/GD-28617系統可指向的效應大得多(圖7G-I)。資料進一步顯示,雖然ZF9-MQ1下調MYC表現的速度快達24小時,但第72小時或之後才觀測到對存活率的影響。相比之下,ZF8-MQ1使MYC mRNA達成顯著低得多的變化,且使細胞存活率達成大得多的直接降低。 實例7:dCas9-MQ1對MYC表現的影響顯著大於人類dCas9-DNMT1或dCas9-DNMT-3A-3L Compared to ZF12-MQ1, ZF9-MQ1 was shown to strongly downregulate MYC and reduce survival in all three HCC models examined (Hep3B, HepG2 and SKHEP1) (Fig. 7A-C). The downregulation of MYC1 regulated by ZF9-MQ1 was relatively greater than that seen in the presence of ZF8-MQ1 (Fig. 7D-F) and the effect seen in the presence of the ZF9-MQ1 construct was larger than that which the dCas9-MQ1/GD-28617 system could point to Much larger (Fig. 7G-I). The data further showed that while ZF9-MQ1 downregulated MYC expression as fast as 24 hours, the effect on survival was not observed until 72 hours or later. In contrast, ZF8-MQ1 achieved a significantly lower change in MYC mRNA and a much greater immediate decrease in cell viability. Example 7: dCas9-MQ1 has a significantly greater effect on MYC expression than human dCas9-DNMT1 or dCas9-DNMT-3A-3L
此實例表明dCas9-MQ1對MYC表現的影響顯著大於人類dCas9-DNMT1或dCas9-DNMT-3A-3L。This example demonstrates that dCas9-MQ1 has a significantly greater effect on MYC expression than human dCas9-DNMT1 or dCas9-DNMT-3A-3L.
在此實例中,藉由將CRISPR-dCas9系統繫拴至一系列表觀遺傳抑制子(包括KRAB、人類DNMT1、人類DNMT3A-3L融合物、人類DNMT3Bm原核DNMT及MQ1)來修飾該系統。此等分子藉由募集抑制性複合物(KRAB)或使DNA之CpG核苷酸(DNMT1、DNMT3A-3L、DNMT3B及MQ1)甲基化來調節轉錄抑制。利用一組人類HCC細胞株(包括HepG2、Hep3B及SKHEP1),在肝瘤模型中進行效應子篩選。將GD-28617 sgRNA與dCas9-DNMT mRNA組合且藉由LNP轉染而共遞送至此等細胞株中。CRISPR-Cas9研究藉由量測sgRNA/Cas9之編輯效率來評估LNP遞送功效,證明使用此系統達成90-99%編輯效率(資料未展示)。轉染後,接著在多個時間點,藉由qPCR分析細胞中的MYC mRNA表現,且藉由CellTiter-GLO ®分析細胞的存活率。為了藉由亞硫酸氫鹽定序進行標靶及全域甲基化分析,使用Lucigen DNA萃取套組分離基因體DNA。 In this example, the CRISPR-dCas9 system was modified by tethering the system to a series of epigenetic suppressors including KRAB, human DNMT1, human DNMT3A-3L fusion, human DNMT3Bm prokaryotic DNMT, and MQ1. These molecules regulate transcriptional repression by recruiting the repressive complex (KRAB) or methylating CpG nucleotides of DNA (DNMT1, DNMT3A-3L, DNMT3B and MQ1). Using a panel of human HCC cell lines including HepG2, Hep3B, and SKHEP1, effector screening was performed in a hepatoma model. GD-28617 sgRNA was combined with dCas9-DNMT mRNA and co-delivered into these cell lines by LNP transfection. CRISPR-Cas9 studies evaluated LNP delivery efficacy by measuring the editing efficiency of sgRNA/Cas9, demonstrating that 90-99% editing efficiency was achieved using this system (data not shown). After transfection, the expression of MYC mRNA in the cells was analyzed by qPCR at various time points, and the viability of the cells was analyzed by CellTiter-GLO ® . For targeted and global methylation analysis by bisulfite sequencing, genomic DNA was isolated using the Lucigen DNA Extraction Kit.
此等研究表明,dCas9-MQ1對MYC表現的影響顯著大於所檢查的任一種人類dCas9-DNMT或dCas9-KRAB (資料未展示)。第72小時,dCas9-MQ1使所有三種株系中的mRNA達成50-90%減少(圖8A)。儘管MYC下調對SK-HEP-1存活率的影響最小,但在第72小時及第168小時,MYC下調使HepG2及Hep3B的存活率顯著降低(圖8B)。相較於未處理的對照物,dCas9-sgRNA對照物對表現或存活率無影響。These studies demonstrated that dCas9-MQ1 had a significantly greater effect on MYC expression than either human dCas9-DNMT or dCas9-KRAB examined (data not shown). At 72 hours, dCas9-MQ1 achieved a 50-90% reduction in mRNA in all three lines (Fig. 8A). Although MYC downregulation had minimal effect on SK-HEP-1 survival, MYC downregulation significantly decreased HepG2 and Hep3B survival at 72 hours and 168 hours (Fig. 8B). The dCas9-sgRNA control had no effect on performance or survival compared to untreated controls.
由於在MYC下調後,SK-HEP-1模型的存活率展現最小的變化,因此使用此株系評估針對MYC表現之下調作用的耐久性。第5天,MYC mRNA相較於單獨的dCas9 DBD及未處理的對照降低80%。MYC mRNA在第7天及第11天分別減少約70%及約55%。遠至第15天,轉錄本維持約40%的下調(圖8C)。利用亞硫酸氫鹽基因體定序(以單鹼基對解析度量測5-甲基胞嘧啶的定性及定量方式),確定dCas9-MQ1/GD-28617處理將從頭甲基化引向靶向區域,且確定此等轉錄變化與標靶區域中之CpG甲基化百分比緊密相關且證實甲基化保持至第15天(圖8D)。
實例8:dCas9-MQ1/GD-17處理在活體內抑制腫瘤生長
Since the SK-HEP-1 model exhibited minimal changes in survival following MYC downregulation, this line was used to assess durability against downregulation of MYC expression. On
此實例描述對皮下Hep3B異種移植物進行dCas9-MQ1/GD-17處理的活體內分析,相較於對照物處理(PBS及/或dCas9/GD-安全港),其引起腫瘤生長的抑制。This example describes in vivo analysis of dCas9-MQ1/GD-17 treatment of subcutaneous Hep3B xenografts, which resulted in inhibition of tumor growth compared to control treatment (PBS and/or dCas9/GD-safe harbor).
在此實例中,在第1天及第7天,藉由瘤內注射(20 µl/小鼠)將0.6 mg/ml LNP (MC3)調配效應子(dCas9-MQ1/GD-17)遞送至測試組動物中的腫瘤位點。將PBS或0.6 mg/ml LNP (MC3)對照效應子dCas9/GD-安全港(20 µl/小鼠)注射至對照組小鼠的腫瘤位點。各對照組及測試組由具有SubQ Hep3B異種移植物(250 mm
3)的6隻動物組成。15天期間內每3天量測各組的腫瘤體積變化。15天結束時,利用成對T檢驗來量測平均腫瘤體積的變化且作圖。與兩個對照組相比,經dCas9-MQ1/GD-17處理的小鼠顯示腫瘤體積減小(圖9)。
實例9:在B型肝炎感染的背景下,dCas9-MQ1/GD-17使人類肝細胞中的MYC下調
In this example, 0.6 mg/ml LNP (MC3) formulated effector (dCas9-MQ1/GD-17) was delivered to the assay by intratumoral injection (20 µl/mouse) on
此實例表明,在B型肝炎感染的背景下,dCas9-MQ1/GD-17使人類肝細胞中的MYC下調。This example demonstrates that dCas9-MQ1/GD-17 downregulates MYC in human hepatocytes in the context of hepatitis B infection.
在此實例中,用HBV感染人類肝細胞且使未感染的肝細胞與經感染的肝細胞接種且生長8天的時段。9天結束時,與未感染的細胞相比,HBV感染的人類肝細胞顯示更高的MYC表現。用具有對照效應子(dCas9 + 安全港sgRNA (GD-SH))或效應子dCas9-MQ1/GD-17的LNP轉染未感染的肝細胞與經HBV感染的肝細胞且允許再生長48小時。接著在48小時之後,藉由qPCR評估MYC表現(圖8)。以生物學方式重複研究三次。使用Rneasy ®Plus 96孔套組(Qiagen),依循製造商方案分離RNA。使用LunaScript ®RT SuperMix套組(NEB)將RNA樣品逆轉錄成cDNA且使用MYC特異性TaqMan TM引子/探針組分析及TaqMan TMFast Advanced預混液(Thermo Scientific),藉由定量PCR (qPCR)加以分析(技術實驗重複三次)。使用ΔΔCt方法,相對於GAPDH參考基因的表現來定量MYC表現,且使用dCas9 + GD-SH樣品作為校準物。 In this example, human hepatocytes were infected with HBV and uninfected hepatocytes were inoculated with infected hepatocytes and grown for a period of 8 days. At the end of 9 days, HBV-infected human hepatocytes showed higher MYC expression compared to uninfected cells. Uninfected and HBV-infected hepatocytes were transfected with LNPs with control effector (dCas9 + safe harbor sgRNA (GD-SH)) or effector dCas9-MQ1/GD-17 and allowed to grow for an additional 48 hours. Then after 48 hours, MYC expression was assessed by qPCR ( FIG. 8 ). The study was biologically replicated three times. RNA was isolated using the RNeasy® Plus 96-well kit (Qiagen) following the manufacturer's protocol. RNA samples were reverse transcribed into cDNA using the LunaScript® RT SuperMix Kit (NEB) and analyzed by quantitative PCR (qPCR) using MYC-specific TaqMan ™ Primer/Probe Set Analysis and TaqMan ™ Fast Advanced Master Mix (Thermo Scientific). Analysis (technical experiments repeated three times). MYC expression was quantified relative to the expression of the GAPDH reference gene using the ΔΔCt method and dCas9 + GD-SH samples were used as calibrator.
48小時之後如藉由qPCR所評估且相對於未感染之人類肝細胞對照組標準化的MYC表現表明,dCas9-MQ1/GD-17 (啟動子)使未感染之細胞及經感染之細胞中的MYC下調(圖10)。After 48 hours dCas9-MQ1/GD-17 (promoter) increased MYC expression in uninfected and infected cells as assessed by qPCR and normalized to the uninfected human hepatocyte control group. down-regulation (Figure 10).
實例10:鋅指域與靶向MYC ASMC之轉錄效應子的組合使MYC表現下調Example 10: Combination of Zinc Finger Domains and Transcriptional Effectors Targeting MYC ASMCs Downregulates MYC Expression
此實例表明,與鋅指域融合的KRAB效應子(或無效應子或NE)靶向緊鄰於CTCF模體的上游區域(ZF3-NE或ZF3-KRAB)及與鋅指域融合的MQ1效應子靶向MYC啟動子(ZF9-MQ1)使MYC mRNA表現下調。與此實例中測試的其他效應子(個別地或組合)相比,ZF9-MQ1與ZF3-KRAB的組合更有效地下調MYC表現。This example demonstrates that KRAB effectors (or no effectors or NEs) fused to zinc finger domains target the immediately upstream region of the CTCF motif (ZF3-NE or ZF3-KRAB) and MQ1 effectors fused to zinc finger domains Targeting the MYC promoter (ZF9-MQ1) downregulates MYC mRNA expression. The combination of ZF9-MQ1 and ZF3-KRAB downregulated MYC expression more effectively than the other effectors tested in this example (individually or in combination).
將靶向錨定CTCF的ZF3-NE或ZF3-KRAB效應子與ZF9-MQ1組合,其設計成結合且靶向HCC細胞株Hep3B中的MYC啟動子。使用dCas-KRAB/GD-28859及dCas9-MQ1/GD-28617作為兩個區域的陽性對照。此等實驗的陰性對照包括未處理的細胞及經ZF5-NE及綠色螢光蛋白(GFP)轉染的細胞。利用SSOP形式的LNP遞送,將效應子mRNA共遞送。將HCC細胞於生長培養基中接種於96孔盤中(約5000個細胞/孔)。接著將LNP調配物(0.6 μg/ml)添加至細胞中以轉染mRNA,接著針對不同時間點進行培育。使用Rneasy ®Plus 96孔套組(Qiagen),依循製造商方案,分離來自三個生物學複本的RNA。使用LunaScript ®RT SuperMix套組(NEB)將RNA樣品逆轉錄成cDNA且使用MYC特異性TaqMan TM引子/探針組分析及TaqMan TMFast Advanced預混液(Thermo Scientific),藉由定量PCR (qPCR)加以分析(技術實驗重複三次)。使用ΔΔCt方法,相對於GAPDH參考基因的表現來定量MYC表現,使用未處理的樣品及dCas9 (無效應子)樣品作為校準物。 ZF3-NE or ZF3-KRAB effectors targeting anchored CTCF were combined with ZF9-MQ1, which was designed to bind and target the MYC promoter in the HCC cell line Hep3B. Use dCas-KRAB/GD-28859 and dCas9-MQ1/GD-28617 as positive controls for both regions. Negative controls for these experiments included untreated cells and cells transfected with ZF5-NE and green fluorescent protein (GFP). Effector mRNAs were co-delivered using LNP delivery in the form of SSOP. HCC cells were seeded in 96-well plates (approximately 5000 cells/well) in growth medium. LNP formulations (0.6 μg/ml) were then added to cells to transfect mRNA, followed by incubation for different time points. RNA from three biological replicates was isolated using the RNeasy® Plus 96-well kit (Qiagen) following the manufacturer's protocol. RNA samples were reverse transcribed into cDNA using the LunaScript® RT SuperMix Kit (NEB) and analyzed by quantitative PCR (qPCR) using MYC-specific TaqMan ™ Primer/Probe Set Analysis and TaqMan ™ Fast Advanced Master Mix (Thermo Scientific). Analysis (technical experiments repeated three times). MYC expression was quantified relative to the expression of the GAPDH reference gene using the ΔΔCt method, using untreated samples and dCas9 (no effector) samples as calibrators.
此等資料顯示,ZF3-KRAB加ZF9-MQ1使MYC下調的程度大於dCas9-KRAB/GD-28859、ZF3-KRAB、ZF3-NE、dCas9-MQ1/GD-28617、單獨ZF9-MQ1,或ZF3-NE,或ZF5-NE加ZF9-MQ1組合(圖11)。單獨的GFP及ZF5-NE對MYC表現的影響可忽略。所測試之時程(24、48及72小時)內的資料類似,其中在24小時之後出現最大抑制,持續至至少72小時。 實例11:ZF9-MQ1與ZF3-KRAB的組合使三種HCC模型中的MYC表現下調. These data show that ZF3-KRAB plus ZF9-MQ1 downregulates MYC to a greater extent than dCas9-KRAB/GD-28859, ZF3-KRAB, ZF3-NE, dCas9-MQ1/GD-28617, ZF9-MQ1 alone, or ZF3- NE, or ZF5-NE plus ZF9-MQ1 combination (Figure 11). GFP and ZF5-NE alone had negligible effects on MYC expression. The data were similar over the time courses tested (24, 48 and 72 hours), with maximal inhibition occurring after 24 hours and lasting for at least 72 hours. Example 11: Combination of ZF9-MQ1 and ZF3-KRAB downregulates MYC expression in three HCC models.
此實例表明,與單獨或組合之其他所測試效應子相比,ZF9-MQ1與ZF3-KRAB的組合使其他HCC細胞株(Hep3B、HepG2及SKHEP1)中的MYC mRNA表現更大幅度地下調。This example demonstrates that the combination of ZF9-MQ1 and ZF3-KRAB caused a greater downregulation of MYC mRNA in other HCC cell lines (Hep3B, HepG2 and SKHEP1 ) than the other effectors tested alone or in combination.
將靶向錨定CTCF的ZF3-NE或ZF3-KRAB效應子與ZF9-MQ1組合,其設計成結合且靶向三種HCC細胞株Hep3B、HepG2及SKHEP1中的MYC啟動子。此等實驗的陰性對照包括經ZF5-NE轉染的細胞。利用SSOP形式的LNP遞送,將效應子mRNA共遞送。將HCC細胞於生長培養基中接種於96孔盤中(約5000個細胞/孔)。接著將LNP調配物(0.6 μg/ml)添加至細胞中以轉染mRNA,接著針對不同時間點進行培育。使用Rneasy ®Plus 96孔套組(Qiagen),依循製造商方案,分離來自三個生物學複本的RNA。使用LunaScript ®RT SuperMix套組(NEB)將RNA樣品逆轉錄成cDNA且使用MYC特異性TaqMan TM引子/探針組分析及TaqMan TMFast Advanced預混液(Thermo Scientific),藉由定量PCR (qPCR)加以分析(技術實驗重複三次)。使用ΔΔCt方法,相對於GAPDH參考基因的表現來定量MYC表現,使用未處理的樣品及dCas9 (無效應子)樣品作為校準物。 ZF3-NE or ZF3-KRAB effectors targeting anchored CTCF were combined with ZF9-MQ1, which was designed to bind and target the MYC promoter in three HCC cell lines Hep3B, HepG2 and SKHEP1. Negative controls for these experiments included ZF5-NE transfected cells. Effector mRNAs were co-delivered using LNP delivery in the form of SSOP. HCC cells were seeded in 96-well plates (approximately 5000 cells/well) in growth medium. LNP formulations (0.6 μg/ml) were then added to cells to transfect mRNA, followed by incubation for different time points. RNA from three biological replicates was isolated using the RNeasy® Plus 96-well kit (Qiagen) following the manufacturer's protocol. RNA samples were reverse transcribed into cDNA using the LunaScript® RT SuperMix Kit (NEB) and analyzed by quantitative PCR (qPCR) using MYC-specific TaqMan ™ Primer/Probe Set Analysis and TaqMan ™ Fast Advanced Master Mix (Thermo Scientific). Analysis (technical experiments repeated three times). MYC expression was quantified relative to the expression of the GAPDH reference gene using the ΔΔCt method, using untreated samples and dCas9 (no effector) samples as calibrators.
此等資料顯示,ZF3-KRAB加ZF9-MQ1使MYC下調的程度大於單獨的ZF9-MQ1或ZF3-NE加ZF9-MQ1組合(圖12)。測試時程(24、48及72小時)內的資料類似。 實例12:ZF9-MQ1在其他HCC模型中達成之存活率及MYC表現的劑量反應曲線. These data show that ZF3-KRAB plus ZF9-MQ1 downregulates MYC to a greater extent than ZF9-MQ1 alone or the combination of ZF3-NE plus ZF9-MQ1 ( FIG. 12 ). Data were similar for the duration of the test (24, 48, and 72 hours). Example 12: Dose-response curves of ZF9-MQ1 achieved survival and MYC expression in other HCC models.
此實例描述利用劑量反應曲線、藉由ZF9-MQ1下調五種HCC細胞株中的Myc表現及細胞存活率。This example describes the downregulation of Myc expression and cell viability by ZF9-MQ1 in five HCC cell lines using dose response curves.
在此實例中,設計成結合且靶向MYC啟動子的ZF9-MQ1以多種濃度給與五種HCC細胞株:Hep3B、HepG2、SKHEP1、SNU-182及SNU-449 (圖13A)。未處理之細胞用作陰性對照。使用SSOP形式的LNP遞送來遞送效應子mRNA。將HCC細胞於生長培養基中接種於96孔盤中(約5000個細胞/孔)。接著將LNP調配物(始於5或0.6 μg/ml)添加至3個孔中,接著各在後續孔中約1:2稀釋成6至10次劑量,以便轉染mRNA,接著培育72小時。收集不同複本盤以獲得存活率及RNA。使用Promega的Celltiter GLO分析套組,根據製造商方案量測存活率。使用Rneasy ®Plus 96孔套組(Qiagen),依循製造商方案,分離來自三個生物學複本的RNA。使用LunaScript ®RT SuperMix套組(NEB)將RNA樣品逆轉錄成cDNA且使用MYC特異性TaqMan TM引子/探針組分析及TaqMan TMFast Advanced預混液(Thermo Scientific),藉由定量PCR (qPCR)加以分析(技術實驗重複三次)。使用ΔΔCt方法,相對於GAPDH參考基因的表現來定量MYC表現,使用未處理的樣品及dCas9 (無效應子)樣品作為校準物。依據劑量反應曲線計算EC50值。 In this example, ZF9-MQ1 designed to bind and target the MYC promoter was administered at various concentrations to five HCC cell lines: Hep3B, HepG2, SKHEP1, SNU-182 and SNU-449 ( FIG. 13A ). Untreated cells were used as negative controls. LNP delivery in the form of SSOP was used to deliver effector mRNA. HCC cells were seeded in 96-well plates (approximately 5000 cells/well) in growth medium. LNP formulations (starting at 5 or 0.6 μg/ml) were then added to 3 wells, followed by 6 to 10 doses each approximately 1:2 diluted in subsequent wells to transfect mRNA, followed by incubation for 72 hours. Different replica discs were collected for survival and RNA. Viability was measured using Promega's Celltiter GLO assay kit according to the manufacturer's protocol. RNA from three biological replicates was isolated using the Rneasy® Plus 96-well kit (Qiagen) following the manufacturer's protocol. RNA samples were reverse transcribed into cDNA using the LunaScript® RT SuperMix Kit (NEB) and analyzed by quantitative PCR (qPCR) using MYC-specific TaqMan ™ Primer/Probe Set Analysis and TaqMan ™ Fast Advanced Master Mix (Thermo Scientific). Analysis (technical experiments repeated three times). MYC expression was quantified relative to the expression of the GAPDH reference gene using the ΔΔCt method, using untreated samples and dCas9 (no effector) samples as calibrators. EC50 values were calculated based on dose-response curves.
此等資料顯示,ZF9-MQ1在測試的所有五種HCC細胞株中下調MYC表現且減小存活率。第72小時,ZF9-MQ1下調此等五種HCC細胞株中的MYC表現,其EC50範圍在0.0057-0.065 μg/ml LNP/mRNA之間,其關於活體外存活率的EC50效應在0.049-0.29 μg/ml範圍內(圖13B-F)。
實例13:ZF9-MQ1在裸小鼠皮下生長之Hep 3B模型中的活體內功效
These data show that ZF9-MQ1 downregulates MYC expression and reduces survival in all five HCC cell lines tested. At 72 hours, ZF9-MQ1 downregulated MYC expression in these five HCC cell lines with EC50 ranging from 0.0057-0.065 μg/ml LNP/mRNA, and its EC50 effect on in vitro survival rate was 0.049-0.29 μg /ml range (Fig. 13B-F).
Example 13: In vivo efficacy of ZF9-MQ1 in the
此實例表明用ZF9-MQ1處理抑制雌性裸小鼠中所建立之Hep 3B腫瘤的生長。This example demonstrates that treatment with ZF9-MQ1 inhibits the growth of established
將Hep 3B腫瘤細胞植入三十隻雌性裸小鼠的左側腹以誘發腫瘤。
量測腫瘤體積的變化且當平均腫瘤體積達到約100-150 mm 3時,開始治療。將小鼠分成治療組及對照組,使得各組中之平均腫瘤體積大致相等。向對照組注射PBS或MYCi975 (一種小分子比較物)。各組中的小鼠瘤內注射PBS (3次劑量,每5天注射,接著切換為2次劑量,靜脈內注射)、靜脈內注射ZF9-MQ1 (每5天注射,3 mg/kg)或腹膜內注射MYCi975 (每週5天,每天一次)。所有動物每日稱重且目測評估。各組腫瘤體積變化每週量測3次。在22天過程中,利用成對T檢驗量測體重相對於基線的變化及平均腫瘤體積且作圖(圖14A-B)。 Changes in tumor volume were measured and treatment was initiated when the average tumor volume reached approximately 100-150 mm3 . Mice were divided into treatment and control groups such that the mean tumor volumes in each group were approximately equal. Control groups were injected with PBS or MYCi975 (a small molecule comparator). Mice in each group were injected intratumorally with PBS (3 doses, injected every 5 days, followed by switching to 2 doses, injected intravenously), injected intravenously with ZF9-MQ1 (injected every 5 days, 3 mg/kg) or MYCi975 was injected intraperitoneally (once a day, 5 days a week). All animals were weighed daily and assessed visually. The tumor volume changes in each group were measured 3 times a week. Over the course of 22 days, changes in body weight from baseline and mean tumor volume were measured and plotted using paired T-tests (Fig. 14A-B).
結果顯示,與PBS對照物治療之小鼠相比,ZF9-MQ1能夠顯著減少腫瘤生長(自第6天起)。ZF9-MQ1減少腫瘤生長的程度大於小分子比較物(MYCi975)(圖14A)。另外,最後一次劑量之後第48小時的IHC染色顯示ZF9-MQ1多肽表現、MYC表現減少及增殖減少,如藉由Ki67所量測(資料未展示)。ZF9-MQ1對整個動物體重的作用最小(圖14B)。
實例14:ZF9-MQ1 + ZF3-KRAB在Fox Chase CB17 SCID小鼠中正位生長之Hep 3B模型中的活體內功效
The results showed that ZF9-MQ1 was able to significantly reduce tumor growth (from
此實例展現雙順反子ZF9-MQ1 + ZF3-KRAB在給與雌性Fox Chase CB17 SCID小鼠之後在正位Hep3B-luc模型中的長期抗腫瘤功效及耐久性。This example demonstrates the long-term antitumor efficacy and durability of bicistronic ZF9-MQ1 + ZF3-KRAB in the orthotopic Hep3B-luc model after administration to female Fox Chase CB17 SCID mice.
Hep-3B-luc細胞注射於SCID小鼠之肝臟的左上葉中。隨機分組之各組的平均腹視全身腫瘤相關生物發光(TABL)為約2.8x10
9p/s。在細胞植入後的第7天,將小鼠隨機分配至兩組各用PBS及ZF9-MQ1+ZF3-KRAB治療的12隻小鼠及一組用索拉非尼治療的6隻小鼠。腫瘤細胞植入後的第8天(在圖上標記為給藥第1天)開始治療。小鼠在靜脈內用PBS治療(4次劑量,每5天治療,接著2次劑量,每3天治療)、靜脈內用LNP (MC3) ZF9-MQ1 + ZF3-KRAB治療(2次劑量,每5天1.5 mg/kg;3次劑量,每5天3 mg/kg;1次劑量,每3天3 mg/kg),用索拉非尼(每日50 mg/kg)經口治療。所有動物每日稱重且目測評估。每週2次藉由生物發光量測腫瘤尺寸。
Hep-3B-luc cells were injected in the left upper lobe of the liver of SCID mice. Mean ventral whole-body tumor-associated bioluminescence (TABL) was approximately 2.8x109 p/s for each randomized group. On
結果顯示,與PBS對照物治療之小鼠相比,用ZF9-MQ1 + ZF3-KRAB治療使腫瘤生長(自第21天起)顯著減少。ZF9-MQ1 + ZF3-KRAB減少的腫瘤生長類似於索拉非尼(圖15A)。ZF9-MQ1 + ZF3-KRAB治療對整個動物體重的影響最小(圖15B)。
實例15:ZF9-MQ1及共調配之ZF9-MQ1 + ZF3-KRAB在裸小鼠中皮下生長之Hep 3B模型中的活體內功效
Results showed that treatment with ZF9-MQ1 + ZF3-KRAB significantly reduced tumor growth (from day 21) compared to PBS control treated mice. ZF9-MQ1 + ZF3-KRAB reduced tumor growth similar to Sorafenib (Fig. 15A). ZF9-MQ1 + ZF3-KRAB treatment had minimal effect on whole animal body weight (Fig. 15B).
Example 15: In vivo efficacy of ZF9-MQ1 and co-formulated ZF9-MQ1 + ZF3-KRAB in a
此實例表明ZF9-MQ1及共調配之ZF9-MQ1 + ZF3-KRAB能夠以劑量依賴性方式抑制雌性裸小鼠中的Hep 3B腫瘤生長。This example demonstrates that ZF9-MQ1 and co-formulated ZF9-MQ1 + ZF3-KRAB are able to inhibit
將Hep 3B腫瘤細胞植入七十二隻雌性裸小鼠的左側腹中以誘發腫瘤。量測腫瘤體積的變化且當平均腫瘤體積達到約200 mm
3時,開始治療。將小鼠分成九個治療組(每組8隻小鼠),使得每組的平均腫瘤體積大致相等。每組中的小鼠靜脈內注射PBS (3次劑量,每5天注射;接著3次劑量,每3天注射)、1 mg/kg及3 mg/kg的ZF9-MQ1 (3次劑量,每5天注射;接著3次劑量,每3天注射)、1 mg/kg及3 mg/kg的共調配ZF9-MQ1 + ZF3-KRAB (3次劑量,每5天注射;接著3次劑量,每3天注射)、1 mg/kg及3 mg/kg的陰性對照mRNA (3次劑量,每5天注射;接著3次劑量,每3天注射)、腹膜內注射100 mg/kg的MYCi975 (每週5天,每天一次)或腹膜內注射1 mg/kg的順鉑(每15天一次)。所有動物每日稱重且目測評估。腫瘤體積變化每週量測3次。
結果顯示,與陰性對照物治療之小鼠相比,1 mg/kg的ZF9-MQ1個別地及ZF9-MQ1與ZF3-KRAB之組合能夠減少腫瘤生長(圖16A)。另外,與陰性對照物治療之小鼠相比,3 mg/kg的ZF9-MQ1個別地(自第13天起)及共調配之ZF9-MQ1 + ZF3-KRAB組合(自第6天起)能夠顯著減少腫瘤生長(圖16B)。共調配的ZF9-MQ1 +/ ZF3-KRAB在1 mg/kg與3 mg/kg劑量下,均能夠以類似於或大於順鉑或小分子比較物(MYCi975)的程度減少腫瘤生長。與1 mg/kg及3 mg/kg劑量之順鉑與MYCi975相比,共調配的ZF9-MQ1 +/- ZF3-KRAB對整個動物體重的影響最小(圖16C-D)。 實例16:鋅指域 + 靶向MYC隔離之基因體域(IGD)之DNA甲基轉移酶的組合使MYC表現下調且減少各種肺癌細胞株的細胞存活率 The results showed that 1 mg/kg of ZF9-MQ1 alone and the combination of ZF9-MQ1 and ZF3-KRAB was able to reduce tumor growth compared to negative control treated mice ( FIG. 16A ). In addition, ZF9-MQ1 at 3 mg/kg individually (from day 13) and the co-formulated ZF9-MQ1 + ZF3-KRAB combination (from day 6) were able to Tumor growth was significantly reduced (Fig. 16B). Co-formulated ZF9-MQ1 +/ ZF3-KRAB was able to reduce tumor growth to a similar or greater extent than cisplatin or the small molecule comparator (MYCi975) at both 1 mg/kg and 3 mg/kg doses. Co-formulated ZF9-MQ1 +/- ZF3-KRAB had minimal effect on whole animal body weight compared to 1 mg/kg and 3 mg/kg doses of cisplatin and MYCi975 (Fig. 16C-D). Example 16: Combination of Zinc Finger Domain + DNA Methyltransferase Targeting the MYC Sequestered Gene Body Domain (IGD) Downregulates MYC Expression and Decreases Cell Survival in Various Lung Cancer Cell Lines
此實例表明,藉由使與鋅指域融合的MQ1效應子靶向NSCLC株系(A549、NCI-H2009、NCI-H358HCC95)中的MYC啟動子(ZF9-MQ1)而下調Myc1 mRNA表現使得多種肺癌細胞株的細胞存活率降低。This example demonstrates that downregulation of Myc1 mRNA expression by targeting MQ1 effectors fused to zinc finger domains to the MYC promoter (ZF9-MQ1) in NSCLC lines (A549, NCI-H2009, NCI-H358HCC95) leads to multiple lung cancers The cell viability of the cell line is reduced.
為了結合及靶向肺癌細胞株中之MYC啟動子而設計的ZF9-MQ1與陰性對照物一起遞送,陰性對照物包括未處理的細胞及經綠色螢光蛋白(GFP)轉染的細胞。利用如上文實例12及14中所述的SSOP LNP遞送來遞送編碼ZF9-MQ1或GFP的mRNA。將肺癌細胞於生長培養基中接種於96孔盤中(約10000個細胞/孔)。接著將LNP調配物(1 μg/ml)添加至細胞中以轉染mRNA,接著分別培育72小時及120小時。收集不同的複本盤以測定存活率及mRNA表現的變化。使用Promega的Celltiter-GLO ®分析套組,根據製造商方案量測存活率。使用Rneasy ®Plus 96孔套組(Qiagen),依循製造商方案,分離來自三個生物學複本的RNA。使用LunaScript ®RT SuperMix套組(NEB)將RNA樣品逆轉錄成cDNA且使用MYC特異性TaqMan TM引子/探針組分析及TaqMan TMFast Advanced預混液(Thermo Scientific),藉由定量PCR (qPCR)加以分析(技術實驗重複三次)。使用ΔΔCt方法,相對於GAPDH參考基因的表現來定量MYC表現,使用未處理的樣品作為校準物。 ZF9-MQ1, designed to bind and target the MYC promoter in lung cancer cell lines, was delivered with negative controls including untreated cells and cells transfected with green fluorescent protein (GFP). mRNA encoding ZF9-MQ1 or GFP was delivered using SSOP LNP delivery as described in Examples 12 and 14 above. Lung cancer cells were seeded in 96-well dishes (approximately 10000 cells/well) in growth medium. LNP formulations (1 μg/ml) were then added to cells to transfect mRNA, followed by incubation for 72 hours and 120 hours, respectively. Different replica plates were pooled to determine survival and changes in mRNA expression. Viability was measured using Promega's Celltiter-GLO ® assay kit according to the manufacturer's protocol. RNA from three biological replicates was isolated using the RNeasy® Plus 96-well kit (Qiagen) following the manufacturer's protocol. RNA samples were reverse transcribed into cDNA using the LunaScript® RT SuperMix Kit (NEB) and analyzed by quantitative PCR (qPCR) using MYC-specific TaqMan ™ Primer/Probe Set Analysis and TaqMan ™ Fast Advanced Master Mix (Thermo Scientific). Analysis (technical experiments repeated three times). MYC expression was quantified relative to the expression of the GAPDH reference gene using the ΔΔCt method, using untreated samples as calibrators.
此等資料表明,ZF9-MQ1能夠降低四種肺癌細胞株中的MYC mRNA水準>80%且此與所有四種細胞株之肺癌細胞存活率的顯著降低一致(圖17A-H)。 實例17:ZF9-MQ1誘導NSCLC細胞株NCI-H2009發生細胞凋亡 These data indicated that ZF9-MQ1 was able to reduce MYC mRNA levels in four lung cancer cell lines by >80% and this was consistent with a significant reduction in lung cancer cell survival in all four cell lines (Fig. 17A-H). Example 17: ZF9-MQ1 induces apoptosis in NSCLC cell line NCI-H2009
此實例展現ZF9-MQ1對肺癌細胞之細胞凋亡的作用。This example demonstrates the effect of ZF9-MQ1 on apoptosis of lung cancer cells.
存活率分析,諸如Cell Titer GLO (實例16中所用),藉由基於ATP水準來測定孔中剩餘的相對細胞數來量測存活率。此等分析無法在細胞增殖損失與不同類型的細胞死亡之間作出區分(例如壞死性凋亡相對於細胞凋亡)。Viability assays, such as Cell Titer GLO (used in Example 16), measure viability by determining the relative number of cells remaining in the well based on ATP levels. These analyzes fail to distinguish between loss of cell proliferation and different types of cell death (eg necroptosis versus apoptosis).
用ZF9-MQ1轉染之後,利用針對磷脂結合蛋白V蛋白質之螢光標記抗體及碘化丙錠(PI)核染料定量細胞凋亡的細胞。除未處理的細胞之外,使用編碼不含效應蛋白之ZF9鋅指域(ZF9-NE)的mRNA作為陰性對照。將肺細胞株NCI-H2009於生長培養基中接種於12孔盤中(每孔50,000個細胞)。接著將具有mRNA的LNP調配物(1 μg/ml)添加至細胞中以轉染ZF9-MQ1或ZF9-NE mRNA,接著培育96小時。收集細胞且使用BD磷脂結合蛋白V: FITC細胞凋亡偵測套組(BDB556570)加以染色且藉由流式細胞術加以分析。對於磷脂結合蛋白V-FITC及PI呈陽性的細胞歸類為細胞凋亡。Following transfection with ZF9-MQ1, apoptotic cells were quantified using a fluorescently labeled antibody against the phospholipid binding protein V protein and propidium iodide (PI) nuclear stain. In addition to untreated cells, mRNA encoding the ZF9 zinc finger domain without effector protein (ZF9-NE) was used as a negative control. Lung cell line NCI-H2009 was seeded in 12-well dishes (50,000 cells per well) in growth medium. LNP formulations (1 μg/ml) with mRNA were then added to cells to transfect ZF9-MQ1 or ZF9-NE mRNA, followed by 96 hours of incubation. Cells were harvested and stained using the BD Phospholipid Binding Protein V: FITC Apoptosis Detection Kit (BDB556570) and analyzed by flow cytometry. Cells positive for phospholipid-binding protein V-FITC and PI were classified as apoptotic.
此等資料表明,96小時之後,在陰性對照(經ZF9-NE處理的細胞及未處理的細胞)中,僅18%細胞為細胞凋亡的(圖18A-B、18D)。相比之下,在經ZF9-MQ1處理的NCI-H2009培養物中,40%細胞為細胞凋亡的(圖18C-D)。此表明ZF9-MQ1可誘導肺癌細胞發生細胞凋亡。These data indicate that after 96 hours, only 18% of the cells in the negative controls (ZF9-NE treated cells and untreated cells) were apoptotic (Fig. 18A-B, 18D). In contrast, in ZF9-MQ1 -treated NCI-H2009 cultures, 40% of cells were apoptotic (Fig. 18C-D). This indicates that ZF9-MQ1 can induce apoptosis in lung cancer cells.
實例18:使用ZF9-MQ1之其他NSCLC細胞株之存活率及MYC表現的劑量反應曲線.Example 18: Dose-response curves of survival rate and MYC expression of other NSCLC cell lines using ZF9-MQ1.
此實例表明ZF9-MQ1以劑量反應性方式使NSCLC細胞株的MYC1表現及細胞存活率下調。This example demonstrates that ZF9-MQ1 downregulates MYC1 expression and cell viability in NSCLC cell lines in a dose-responsive manner.
設計成結合且靶向MYC啟動子的ZF9-MQ1以多種濃度給與兩種NSCLC細胞株:A549及HCC95。未處理的細胞用作陰性對照。利用SSOP形式的LNP遞送來遞送效應子mRNA。將肺癌細胞於生長培養基中接種於96孔盤中(約10000個細胞/孔)。接著將LNP調配物(始於5 μg/ml)添加至3個孔中,接著各在後續孔中約1:2稀釋成10種劑量以便轉染mRNA,且培育72小時。收集不同的複本盤以測定存活率及mRNA表現的變化。ZF9-MQ1 designed to bind and target the MYC promoter was administered at various concentrations to two NSCLC cell lines: A549 and HCC95. Untreated cells were used as negative controls. LNP delivery in the form of SSOP was utilized to deliver effector mRNA. Lung cancer cells were seeded in 96-well dishes (approximately 10000 cells/well) in growth medium. The LNP formulation (starting at 5 μg/ml) was then added to 3 wells, then diluted approximately 1 :2 into 10 doses each in subsequent wells for mRNA transfection, and incubated for 72 hours. Different replica plates were pooled to determine survival and changes in mRNA expression.
使用Promega的Celltiter-GLO ®分析套組,根據製造商方案量測存活率。使用Rneasy ®Plus 96孔套組(Qiagen),依循製造商方案,分離來自三個生物學複本的RNA。使用LunaScript ®RT SuperMix套組(NEB)將RNA樣品逆轉錄成cDNA且使用MYC特異性TaqMan TM引子/探針組分析及TaqMan TMFast Advanced預混液(Thermo Scientific),藉由定量PCR (qPCR)加以分析(技術實驗重複三次)。使用ΔΔCt方法,相對於GAPDH參考基因的表現來定量MYC表現,使用未處理的樣品及dCas9 (無效應子)樣品作為校準物。 Viability was measured using Promega's Celltiter-GLO ® assay kit according to the manufacturer's protocol. RNA from three biological replicates was isolated using the RNeasy® Plus 96-well kit (Qiagen) following the manufacturer's protocol. RNA samples were reverse transcribed into cDNA using the LunaScript® RT SuperMix Kit (NEB) and analyzed by quantitative PCR (qPCR) using MYC-specific TaqMan ™ Primer/Probe Set Analysis and TaqMan ™ Fast Advanced Master Mix (Thermo Scientific). Analysis (technical experiments repeated three times). MYC expression was quantified relative to the expression of the GAPDH reference gene using the ΔΔCt method, using untreated samples and dCas9 (no effector) samples as calibrators.
資料表明,第72小時,ZF9-MQ1使A549與HCC95的MYC均下調,其EC50為0.08 μg/ml LNP /mRNA,其對存活率具有約25倍高的活體外EC50效應(2 μg/ml)(圖19A-B)。The data showed that at 72 hours, ZF9-MQ1 down-regulated both A549 and HCC95 MYC, and its EC50 was 0.08 μg/ml LNP/mRNA, which had about 25 times higher in vitro EC50 effect on survival rate (2 μg/ml) (FIGS. 19A-B).
實例19:ZF9-MQ1使NSCLC細胞株NCI-H2009中的MYC蛋白質水準降低逾80%Example 19: ZF9-MQ1 reduces MYC protein levels in NSCLC cell line NCI-H2009 by more than 80%
此實例經由免疫墨點技術展現MYC蛋白質水準的變化作為ZF9-MQ1治療的反應。This example demonstrates changes in MYC protein levels as a response to ZF9-MQ1 treatment via immunoblotting techniques.
除陰性對照ZF9-NE及未處理的細胞之外,遞送被設計成結合且靶向肺癌細胞株中之MYC啟動子的ZF9-MQ1。將肺細胞株NCI-H2009於生長培養基中接種於12孔盤中(每孔50,000個細胞)。接著將LNP調配物(1 μg/ml)添加至細胞中以轉染ZF9-MQ1或ZF9-NE mRNA,接著培育96小時。接著將細胞溶解於RIPA緩衝液中且利用Pierce BCA蛋白質分析(23225)定量蛋白質水準。各樣品負載等量的蛋白質且使用NuPAGE
TM小型凝膠系統(Thermo Fisher)、根據尺寸加以分離。接著使用iBlot™ 2凝膠轉移裝置(Thermo Fisher)將蛋白質轉移至PVDF膜上。用ABCAM抗MYC抗體(ab32072)探測膜隔夜。Cell Signaling抗肌動蛋白抗體(8H10D10)用作內參考物。接著使用針對MYC的螢光二級抗體及肌動蛋白抗體物種、使用LI-COR成像系統將信號可視化且定量。
In addition to the negative control ZF9-NE and untreated cells, ZF9-MQ1 designed to bind to and target the MYC promoter in lung cancer cell lines was delivered. Lung cell line NCI-H2009 was seeded in 12-well dishes (50,000 cells per well) in growth medium. LNP formulations (1 μg/ml) were then added to cells to transfect ZF9-MQ1 or ZF9-NE mRNA, followed by 96 hours of incubation. Cells were then lysed in RIPA buffer and protein levels quantified using the Pierce BCA protein assay (23225). Each sample was loaded with an equal amount of protein and separated by size using the NuPAGE ™ Mini Gel System (Thermo Fisher). Proteins were then transferred to PVDF membranes using the
資料顯示,肺癌細胞株處理後第96小時,ZF9-MQ1處理使MYC蛋白質水準降低逾80% (圖20A-B),類似於mRNA表現量的降低(實例17)。 實例20:ZF9-MQ1在裸小鼠皮下生長之NCI-H2009模型中的活體內功效 The data showed that at 96 hours after the lung cancer cell lines were treated, ZF9-MQ1 treatment reduced the MYC protein level by more than 80% (FIG. 20A-B), similar to the reduction of mRNA expression (Example 17). Example 20: In vivo efficacy of ZF9-MQ1 in the NCI-H2009 model grown subcutaneously in nude mice
此實例表明,ZF9-MQ1抑制雌性裸小鼠中所建立的NCI-H2009腫瘤生長。This example demonstrates that ZF9-MQ1 inhibits the growth of established NCI-H2009 tumors in female nude mice.
藉由將NCI-H2009腫瘤細胞植入左側腹中來誘導雌性裸小鼠發生疾病。當平均腫瘤體積達到約100-150 mm 3時,開始治療。將小鼠分成治療組,使得各組之平均腫瘤體積大致相等。mRNA於MC3 LNP中遞送。小鼠靜脈內注射3 mg/kg之ZF9-MQ1或含有非編碼mRNA之MC3 LNP,或注射PBS (每5天一次,4次劑量,接著每3天一次,3次劑量;小鼠總共給與7次劑量)。所有動物每日稱重且目測評估。每週量測腫瘤尺寸3次。 Disease was induced in female nude mice by implanting NCI-H2009 tumor cells into the left flank. Treatment was initiated when the average tumor volume reached approximately 100-150 mm. Mice were divided into treatment groups such that the mean tumor volumes in each group were approximately equal. mRNA was delivered in MC3 LNP. Mice were injected intravenously with 3 mg/kg of ZF9-MQ1 or MC3 LNP containing non-coding mRNA, or with PBS (4 doses every 5 days, followed by 3 doses every 3 days; mice were given a total of 7 doses). All animals were weighed daily and assessed visually. Tumor size was measured 3 times a week.
結果顯示,與PBS對照物治療之小鼠相比,ZF9-MQ1能夠顯著減少腫瘤生長(自第8天起)(圖21)。ZF9-MQ1對整個動物體重的影響最小。 實例21:嚮導RNA與靶向MYC IGD超級增強子之轉錄抑制子(經由dCas9)的組合利用SSOP LNP使MYC表現下調 The results showed that ZF9-MQ1 was able to significantly reduce tumor growth (from day 8) compared to PBS control treated mice (Figure 21). ZF9-MQ1 had minimal effects on whole animal body weight. Example 21: Combination of guide RNA and transcriptional repressor targeting MYC IGD super-enhancer (via dCas9) Downregulation of MYC expression using SSOP LNP
此實例表明,當調控A549細胞株中之MYC表現的肺特異性超級增強子被KRAB效應蛋白靶向時,MYC mRNA表現下調。This example demonstrates that MYC mRNA expression is downregulated when the lung-specific super-enhancer that regulates MYC expression in the A549 cell line is targeted by KRAB effector proteins.
根據整個肺超級增強子區域來設計嚮導RNA (表13)且與KRAB抑制蛋白組合測試,該抑制蛋白與酶失效CAS9偶聯。以GFP之mRNA遞送作為陰性對照,利用SSOP形式的LNP遞送,將效應子mRNA與嚮導RNA共遞送。NCSLC細胞株A549於生長培養基中接種於96孔盤中(約10000個細胞/孔)。接著將LNP調配物(2.5 μg/ml)添加至細胞中以轉染效應子mRNA/嚮導RNA,接著培育72小時。使用Rneasy
®Plus 96孔套組(Qiagen),依循製造商方案,分離來自三個生物學複本的RNA。使用LunaScript
®RT SuperMix套組(NEB)將RNA樣品逆轉錄成cDNA且使用MYC特異性TaqMan
TM引子/探針組分析及TaqMan
TMFast Advanced預混液(Thermo Scientific),藉由定量PCR (qPCR)加以分析(技術實驗重複三次)。使用ΔΔCt方法,相對於GAPDH參考基因的表現來定量MYC表現,使用未處理的樣品作為校準物。
表13:嚮導
此等資料顯示,嚮導RNA GD-29833及29914當與dCAS9-KRAB效應子mRNA一起遞送時,可下調MYC mRNA水準,突顯了利用此遠端調控元件減少致癌MYC的能力(圖22)。 實例22:嚮導RNA與靶向MYC IGD超級增強子之轉錄抑制子(dCas9)的組合利用MC3 LNP使MYC表現下調 These data show that guide RNAs GD-29833 and 29914 downregulate MYC mRNA levels when delivered together with dCAS9-KRAB effector mRNA, highlighting the ability to use this remote regulatory element to reduce oncogenic MYC (Figure 22). Example 22: Combination of guide RNA and transcriptional repressor (dCas9) targeting the MYC IGD super-enhancer uses MC3 LNP to downregulate MYC expression
此實例描述利用替代脂質MC3 (相對於實例21中的SSOP)、藉由使KRAB效應蛋白靶向調控MYC表現的肺特異性超級增強子來下調MYC mRNA表現。This example describes the use of the surrogate lipid MC3 (relative to SSOP in Example 21) to downregulate MYC mRNA expression by targeting KRAB effector proteins to lung-specific super-enhancers that regulate MYC expression.
根據整個肺超級增強子區域來設計嚮導RNA (表13)且與KRAB抑制蛋白組合測試,該抑制蛋白與酶失效CAS9偶聯。以GFP之mRNA遞送作為陰性對照,利用MC3形式的LNP遞送,將效應子mRNA與嚮導RNA共遞送。NCSLC細胞株A549於生長培養基中接種於96孔盤中(約10000個細胞/孔)。接著將LNP調配物(2.5 μg/ml)添加至細胞中以轉染效應子mRNA/嚮導RNA,接著培育72小時。使用Rneasy ®Plus 96孔套組(Qiagen),依循製造商方案,分離來自三個生物學複本的RNA。使用LunaScript ®RT SuperMix套組(NEB)將RNA樣品逆轉錄成cDNA且使用MYC特異性TaqMan TM引子/探針組分析及TaqMan TMFast Advanced預混液(Thermo Scientific),藉由定量PCR (qPCR)加以分析(技術實驗重複三次)。使用ΔΔCt方法,相對於GAPDH參考基因的表現來定量MYC表現,使用未處理的樣品作為校準物。 Guide RNAs (Table 13) were designed based on the entire lung super-enhancer region and tested in combination with the KRAB repressor protein coupled to the enzyme-deactivating CAS9. Effector mRNA was co-delivered with guide RNA using LNP delivery in the form of MC3, using mRNA delivery of GFP as a negative control. NCSLC cell line A549 was seeded in 96-well plates (approximately 10,000 cells/well) in growth medium. LNP formulation (2.5 μg/ml) was then added to cells to transfect effector mRNA/guide RNA followed by 72 hours of incubation. RNA from three biological replicates was isolated using the RNeasy® Plus 96-well kit (Qiagen) following the manufacturer's protocol. RNA samples were reverse transcribed into cDNA using the LunaScript® RT SuperMix Kit (NEB) and analyzed by quantitative PCR (qPCR) using MYC-specific TaqMan ™ Primer/Probe Set Analysis and TaqMan ™ Fast Advanced Master Mix (Thermo Scientific). Analysis (technical experiments repeated three times). MYC expression was quantified relative to the expression of the GAPDH reference gene using the ΔΔCt method, using untreated samples as calibrators.
此等資料顯示,嚮導RNA GD-29833及29914當與dCAS9-KRAB效應子mRNA一起遞送時,可下調MYC mRNA水準,突顯了利用此遠端調控元件減少致癌MYC的能力(圖23)。利用SSOP (實例21)與MC3脂質轉染(實例22),均發現此作用。 實例23:嚮導RNA與靶向MYC IGD超級增強子之轉錄抑制子(dCas9)的組合使NSCLC中的MYC表現下調 These data show that guide RNAs GD-29833 and 29914 downregulate MYC mRNA levels when delivered together with dCAS9-KRAB effector mRNA, highlighting the ability to use this remote regulatory element to reduce oncogenic MYC (Figure 23). This effect was found with both SSOP (Example 21) and MC3 lipofection (Example 22). Example 23: Combination of guide RNA and transcriptional repressor (dCas9) targeting the MYC IGD super-enhancer downregulates MYC expression in NSCLC
此實例描述藉由使各種轉錄效應蛋白(EZH2、EZH2-KRAB或MQ1)靶向調控MYC表現的肺特異性超級增強子來下調MYC mRNA表現。This example describes the downregulation of MYC mRNA expression by targeting various transcriptional effector proteins (EZH2, EZH2-KRAB or MQ1 ) to lung-specific super-enhancers that regulate MYC expression.
靶向MYC超級增強子的嚮導RNA (GD-29833及GD-29914)與抑制蛋白(包括組蛋白甲基轉移酶EZH2 (單獨或連同KRAB一起)及與酶失效CAS9偶聯的DNA甲基轉移酶MQ1)組合測試。以GFP之mRNA遞送作為陰性對照,利用SSOP形式的LNP遞送,將效應子mRNA與嚮導RNA共遞送。NCSLC細胞株A549或NCI-H2009於生長培養基中接種於96孔盤中(約10000個細胞/孔)。接著將LNP調配物(2.5 μg/ml)添加至細胞中以轉染效應子mRNA/嚮導RNA,接著培育72小時。使用Rneasy ®Plus 96孔套組(Qiagen),依循製造商方案,分離來自三個生物學複本的RNA。使用LunaScript ®RT SuperMix套組(NEB)將RNA樣品逆轉錄成cDNA且使用MYC特異性TaqMan TM引子/探針組分析及TaqMan TMFast Advanced預混液(Thermo Scientific),藉由定量PCR (qPCR)加以分析(技術實驗重複三次)。使用ΔΔCt方法,相對於GAPDH參考基因的表現來定量MYC表現,使用未處理的樣品作為校準物。 Guide RNAs targeting the MYC super-enhancer (GD-29833 and GD-29914) and repressor proteins including the histone methyltransferase EZH2 (alone or in combination with KRAB) and DNA methyltransferase coupled to enzyme-deactivating CAS9 MQ1) Combination test. Effector mRNA was co-delivered with guide RNA using LNP delivery in the form of SSOP, with mRNA delivery of GFP serving as a negative control. NCSLC cell lines A549 or NCI-H2009 were seeded in 96-well plates (approximately 10,000 cells/well) in growth medium. LNP formulation (2.5 μg/ml) was then added to cells to transfect effector mRNA/guide RNA followed by 72 hours of incubation. RNA from three biological replicates was isolated using the Rneasy® Plus 96-well kit (Qiagen) following the manufacturer's protocol. RNA samples were reverse transcribed into cDNA using the LunaScript® RT SuperMix Kit (NEB) and analyzed by quantitative PCR (qPCR) using MYC-specific TaqMan ™ Primer/Probe Set Analysis and TaqMan ™ Fast Advanced Master Mix (Thermo Scientific). Analysis (technical experiments repeated three times). MYC expression was quantified relative to the expression of the GAPDH reference gene using the ΔΔCt method, using untreated samples as calibrator.
此等資料表明,嚮導RNA GD-29833及29914當與所有3種效應蛋白(EZH2、EZH2-KRAB及MQ1)一起遞送、針對2種不同NSCLC細胞株(A549及NCI-H2009)測試時,可有效地下調MYC mRNA水準(圖24A-B)。 實例24:嚮導RNA與靶向MYC IGD超級增強子之轉錄抑制子(dCas9)的組合在第120小時使NSCLC中的MYC表現進一步下調 These data demonstrate that guide RNAs GD-29833 and 29914 are effective when delivered with all 3 effector proteins (EZH2, EZH2-KRAB and MQ1) and tested against 2 different NSCLC cell lines (A549 and NCI-H2009) Downregulation of MYC mRNA levels (Fig. 24A-B). Example 24: Combination of guide RNA and transcriptional repressor (dCas9) targeting the MYC IGD super-enhancer further downregulates MYC expression in NSCLC at 120 hours
此實例表明,在靶向超級增強子的嚮導與KRAB或MQ1效應蛋白一起轉染肺癌細胞株(A549及NCI-H2009)之後的第120小時,觀測到MYC mRNA表現的降幅增加。This example shows that at 120 hours after transfection of lung cancer cell lines (A549 and NCI-H2009) with super-enhancer-targeting guides together with KRAB or MQ1 effector proteins, an increased decrease in MYC mRNA expression was observed.
靶向MYC超級增強子的嚮導RNA與KRAB抑制蛋白或MQ1 DNA甲基轉移酶組合測試。以GFP之mRNA遞送作為陰性對照,利用SSOP形式的LNP遞送,將效應子mRNA與嚮導RNA共遞送。NCSLC細胞株A549或NCI-H2009於生長培養基中接種於12孔盤中(約50000個細胞/孔)。接著將LNP調配物(2.5 μg/ml)添加至細胞中以轉染效應子mRNA/嚮導RNA,接著培育120小時。使用Rneasy ®Plus 96孔套組(Qiagen),依循製造商方案,分離來自三個生物學複本的RNA。使用LunaScript ®RT SuperMix套組(NEB)將RNA樣品逆轉錄成cDNA且使用MYC特異性TaqMan TM引子/探針組分析及TaqMan TMFast Advanced預混液(Thermo Scientific),藉由定量PCR (qPCR)加以分析(技術實驗重複三次)。使用ΔΔCt方法,相對於GAPDH參考基因的表現來定量MYC表現,使用未處理的樣品作為校準物。 Guide RNAs targeting the MYC super-enhancer were tested in combination with KRAB repressor protein or MQ1 DNA methyltransferase. Effector mRNA was co-delivered with guide RNA using LNP delivery in the form of SSOP, with mRNA delivery of GFP serving as a negative control. NCSLC cell lines A549 or NCI-H2009 were seeded in 12-well plates (approximately 50,000 cells/well) in growth medium. LNP formulation (2.5 μg/ml) was then added to cells to transfect effector mRNA/guide RNA followed by 120 hours of incubation. RNA from three biological replicates was isolated using the RNeasy® Plus 96-well kit (Qiagen) following the manufacturer's protocol. RNA samples were reverse transcribed into cDNA using the LunaScript® RT SuperMix Kit (NEB) and analyzed by quantitative PCR (qPCR) using MYC-specific TaqMan ™ Primer/Probe Set Analysis and TaqMan ™ Fast Advanced Master Mix (Thermo Scientific). Analysis (technical experiments repeated three times). MYC expression was quantified relative to the expression of the GAPDH reference gene using the ΔΔCt method, using untreated samples as calibrators.
此等資料表明,第120小時,與KRAB或MQ1一起遞送的嚮導RNA GD-29833及29914可顯著下調2種NSCLC細胞株(A549及NCI-H2009)中的MYC mRNA水準(圖25A-B)。另外,所觀測到的下調類似於ZF9-MQ1處理NCI-H2009所觀測到的下調(圖25B)。 實例25:將dCAS9-MQ1引向MYC超級增強子使得超級增強子靶點及MYC啟動子區域處的DNA甲基化增加。 These data indicated that guide RNAs GD-29833 and 29914 delivered together with KRAB or MQ1 could significantly downregulate MYC mRNA levels in 2 NSCLC cell lines (A549 and NCI-H2009) at 120 hours (Fig. 25A-B). In addition, the downregulation observed was similar to that observed for ZF9-MQ1 treatment of NCI-H2009 (Figure 25B). Example 25: Directing dCAS9-MQ1 to the MYC super-enhancer resulted in increased DNA methylation at the super-enhancer target as well as at the MYC promoter region.
此實例表明,可將dCas9-MQ1引向MYC超級增強子,從而使得靶點及MYC啟動子區域發生甲基化。This example demonstrates that dCas9-MQ1 can be directed to the MYC super-enhancer, resulting in methylation of the target site as well as the MYC promoter region.
藉由將CRISPR-dCas9系統繫拴至表觀遺傳抑制子MQ1來修飾該系統。此等分子調節轉錄抑制,從而使DNA的CpG核苷酸發生甲基化。以GFP或dCAS9-無效應子構築體(dCas9-NE)之mRNA遞送作為陰性對照,利用SSOP形式的LNP遞送,將效應子mRNA與超級增強子嚮導RNA (29833及29914)共遞送。NCSLC細胞株NCI-H2009於生長培養基中接種於6孔盤中(約100000個細胞/孔)。接著將LNP調配物(2.5 μg/ml)添加至細胞中以轉染效應子mRNA/嚮導RNA,接著培育72小時。使用Lucigen QuickExtract™ DNA提取套組分離出DNA且藉由對啟動子及超級增強子區域使用標靶亞硫酸氫鹽定序來測定甲基化區域。The CRISPR-dCas9 system was modified by tethering the system to the epigenetic suppressor MQ1. These molecules regulate transcriptional repression, resulting in the methylation of CpG nucleotides of DNA. Effector mRNA was co-delivered with super-enhancer guide RNAs (29833 and 29914) using LNP delivery in the SSOP format, using mRNA delivery of GFP or dCAS9-null effector construct (dCas9-NE) as a negative control. The NCSLC cell line NCI-H2009 was seeded in 6-well plates (approximately 100,000 cells/well) in growth medium. LNP formulation (2.5 μg/ml) was then added to cells to transfect effector mRNA/guide RNA followed by 72 hours of incubation. DNA was isolated using the Lucigen QuickExtract™ DNA Extraction Kit and methylated regions were determined by using targeted bisulfite sequencing on promoter and super-enhancer regions.
此等研究表明,dCas9-MQ1使NSCLC中的靶點甲基化增加60%且亦將甲基化引向遠端啟動子區域(增加至約50%)(圖26A-B)。 實例 26 : 嚮導 RNA 與靶向 MYC IGD 超級增強子之轉錄抑制子 ( 經由「失效」的 CAS9 ) 的 組合使 NSCLC 細胞株 NCI - H2009 中的 MYC 蛋白質水準降低 These studies showed that dCas9-MQ1 increased target methylation in NSCLC by 60% and also directed methylation to distal promoter regions (increased to approximately 50%) (Fig. 26A-B). Example 26 : Combination of guide RNA and transcriptional repressor targeting MYC IGD super-enhancer ( via "disabled" CAS9 ) reduces MYC protein levels in NSCLC cell line NCI - H2009
此實例展現MYC蛋白質水準的變化,此係藉由免疫墨點法技術將嚮導RNA靶向NSCLC細胞中的MYC超級增強子來達成。This example demonstrates changes in MYC protein levels by targeting guide RNAs to the MYC super-enhancer in NSCLC cells by immunoblotting techniques.
設計成結合且靶向肺癌細胞株中之MYC超級增強子的GD-29833與dCAS9-KRAB或dCAS9-MQ1效應子mRNA共遞送。將肺細胞株NCI-H2009於生長培養基中接種於12孔盤中(每孔50,000個細胞)。接著將LNP調配物(1 μg/ml)添加至細胞中以轉染嚮導mRNA及效應子mRNA,接著培育96小時。dCAS9-無效應子(dCAS9-NE)構築體用作陰性對照。接著將細胞溶解於RIPA緩衝液中且利用Pierce BCA蛋白質分析(23225)定量蛋白質水準。各樣品負載等量的蛋白質且使用NuPAGE
TM小型凝膠系統(Thermo Fisher)、根據尺寸加以分離。接著使用Invitrogen iBlot™ 2凝膠轉移裝置(Thermo Fisher)將蛋白質轉移至PVDF膜上。用ABCAM抗MYC抗體(ab32072)探測膜隔夜。Cell Signaling抗肌動蛋白抗體(8H10D10)用作內參考物。接著使用針對MYC的螢光二級抗體及肌動蛋白抗體物種、使用LI-COR成像系統將信號可視化且定量。
GD-29833, designed to bind to and target the MYC super-enhancer in lung cancer cell lines, was co-delivered with dCAS9-KRAB or dCAS9-MQ1 effector mRNA. Lung cell line NCI-H2009 was seeded in 12-well dishes (50,000 cells per well) in growth medium. LNP formulations (1 μg/ml) were then added to cells to transfect guide and effector mRNAs, followed by incubation for 96 hours. The dCAS9-null effector (dCAS9-NE) construct was used as a negative control. Cells were then lysed in RIPA buffer and protein levels quantified using the Pierce BCA protein assay (23225). Each sample was loaded with an equal amount of protein and separated by size using the NuPAGE ™ Mini Gel System (Thermo Fisher). Proteins were then transferred to PVDF membranes using the
此等資料表明,第96小時,將嚮導與轉錄抑制子一起引向MYC肺超級增強子使NCI-H2009肺癌細胞株中的MYC蛋白質水準降低高達50%(圖27A-B),類似於mRNA表現量的降低(實例16)。 實例27:全細胞溶解物中存在ZF9-MQ1蛋白與Hep3B細胞株中的MYC蛋白下調相關 These data demonstrate that directing the guide along with the transcriptional repressor to the MYC lung super-enhancer reduces MYC protein levels by up to 50% in the NCI-H2009 lung cancer cell line at 96 hours (Fig. 27A-B), similar to mRNA expression Quantitative reduction (Example 16). Example 27: The presence of ZF9-MQ1 protein in whole cell lysates is associated with downregulation of MYC protein in Hep3B cell line
實例描述在ZF9-MQ1處理之後的時程內,測定Hep 3B細胞中之ZF9-MQ1及MYC蛋白表現量的變化。The examples describe changes in the expression of ZF9-MQ1 and MYC proteins in
ZF9-MQ1處理之後的第6、16及48小時,評估ZF9-MQ1及MYC蛋白質表現量(西方墨點法),在處理開始之後的第24小時移除LNP且置換培養基。使用RIPA緩衝液提取蛋白質且利用BCA分析(Thermo Fisher)定量總蛋白質。總蛋白質在NuPAGE
TMBis-Tris凝膠(Thermo Fisher)上經由MOPS電泳緩衝液跑電泳,且使用iBlot™ 2凝膠轉移裝置(Thermo Fisher)轉移。
At 6, 16 and 48 hours after ZF9-MQ1 treatment, ZF9-MQ1 and MYC protein expression levels (Western blotting) were assessed, and LNPs were removed and medium replaced at 24 hours after the start of treatment. Protein was extracted using RIPA buffer and total protein was quantified using BCA assay (Thermo Fisher). Total proteins were run on NuPAGE ™ Bis-Tris gels (Thermo Fisher) through MOPS running buffer and transferred using the
MYC西方墨點法:β-肌動蛋白抗體用在594 nm波長下發射之螢光團標記的二級抗體染色,且MYC抗體(Abcam)用在488 nm波長下發射之螢光團標記的二級抗體染色。利用近紅外(NIR)螢光的Odyssey ®CLx成像系統(LI-COR)用於擷取蛋白質影像,經由LI-COR軟體、利用該等蛋白質影像進行定量。自各MYC及肌動蛋白色帶的曲線下面積(AUC)中減去背景面積,接著皆相對於各時間點的陰性對照加以標準化。 MYC western blotting: β-actin antibody was stained with a secondary antibody labeled with a fluorophore emitting at 594 nm, and the MYC antibody (Abcam) was stained with a secondary antibody labeled with a fluorophore emitting at 488 nm. grade antibody staining. An Odyssey ® CLx imaging system using near-infrared (NIR) fluorescence (LI-COR) was used to capture protein images, which were used for quantification via LI-COR software. Background areas were subtracted from the area under the curve (AUC) for each MYC and actin band, then both were normalized to the negative control at each time point.
ZF9-MQ1 (經血球凝集素[HA]抗原決定基標記的對照物)西方墨點法:β-肌動蛋白抗體用在594 nm波長下發射之螢光團標記的二級抗體染色,且HA抗體用在488 nm波長下發射之螢光團標記的二級抗體染色。利用NIR螢光的Odyssey ®CLx成像系統(LI-COR)用於擷取蛋白質影像,經由LI-COR軟體、利用該等蛋白質影像進行定量。自各HA及肌動蛋白色帶的AUC中減去背景面積,接著皆相對於各時間點的陰性對照加以標準化。 ZF9-MQ1 (control labeled with hemagglutinin [HA] epitope) Western blot: β-actin antibody stained with a secondary antibody labeled with a fluorophore emitting at 594 nm, and HA Antibodies were stained with secondary antibodies labeled with a fluorophore emitting at a wavelength of 488 nm. The Odyssey ® CLx Imaging System with NIR Fluorescence (LI-COR) was used to capture protein images, which were used for quantification via LI-COR software. The background area was subtracted from the AUC of each HA and actin band, then both were normalized to the negative control at each time point.
資料顯示,用ZF9-MQ1處理細胞之後,全細胞溶解物中存在的ZF9-MQ1蛋白質與MYC蛋白質表現量均減少(圖28A-B),且全細胞溶解物中存在的ZF9-MQ1蛋白質與MYC蛋白質的下調相關(圖28C)。 實例28:MYC表現減少的持續時間及甲基化狀態 The data showed that after the cells were treated with ZF9-MQ1, the expression levels of ZF9-MQ1 protein and MYC protein present in the whole cell lysates were both reduced (Figure 28A-B), and the ZF9-MQ1 protein and MYC protein present in the whole cell Downregulation of the protein was associated (Fig. 28C). Example 28: Duration and Methylation Status of MYC Expression Reduction
實驗評估ZF9-MQ1處理之後,MYC表現減少的耐久性。另外,此實驗展現且評估MYC表現與標靶基因座處增加之DNA甲基化的相關性。The experiments assessed the durability of MYC exhibiting reductions following ZF9-MQ1 treatment. Additionally, this experiment demonstrates and evaluates the correlation of MYC expression with increased DNA methylation at targeted loci.
存活率展現最小變化、但MYC下調40-50%的SKHEP-1細胞株(實例4)用於評估反應的耐久性。SK-HEP-1用LNP/ZF9-MQ1或ZF-無效應子轉染(陰性對照),接著在處理24小時之後,用新培養基置換。在指定的時間點收集細胞用於提取mRNA及基因體DNA (Qiagen RNA/DNA套組)。為了評估MYC mRNA,處理全細胞RNA以產生互補DNA (cDNA)(使用聚腺苷酸引子),接著用於逆轉錄聚合酶鏈反應(RT-PCR)分析,該分析使用對人類MYC mRNA轉錄本具有特異性的TaqMan TM探針(Thermo Fisher)。為了評估標靶基因座處的甲基化狀態,利用標靶亞硫酸氫鹽基因體定序、以單DNA鹼基對解析度來量測5-甲基胞嘧啶。 The SKHEP-1 cell line (Example 4) exhibiting minimal change in viability, but 40-50% downregulation of MYC, was used to assess durability of response. SK-HEP-1 was transfected with LNP/ZF9-MQ1 or ZF-null effector (negative control), followed by replacement with new medium after 24 hours of treatment. Cells were harvested at indicated time points for extraction of mRNA and genomic DNA (Qiagen RNA/DNA kit). To assess MYC mRNA, whole-cell RNA was processed to generate complementary DNA (cDNA) (using polyA primers), followed by reverse transcription-polymerase chain reaction (RT-PCR) analysis using the human MYC mRNA transcript Specific TaqMan ™ probes (Thermo Fisher). To assess methylation status at targeted loci, 5-methylcytosine was measured at single DNA base pair resolution using targeted bisulfite genome sequencing.
第3天,相較於陰性對照(ZF-無效應子)及未處理的細胞(未圖示),MYC mRNA減少89%。mRNA表現的下調緩慢增加,MYC轉錄本在第15天維持45%下調(圖29A)。另外,ZF9-MQ1的表現將從頭CpG甲基化引向MYC啟動子區域。直至第15天,MYC轉錄變化與甲基化百分比相關(圖29B)。
實例29:使用雙順反子ZF9-MQ1_ZF3-KRAB之原代肝細胞的C-MYC表現及細胞存活率
On
該實例評價雙順反子ZF9-MQ1_ZF3-KRAB對原代肝細胞之MYC mRNA及存活率的影響。將低溫保存的人類原代肝細胞(Lonza)解凍且添加至預溫熱的解凍培養基中且第24小時時接種。將細胞再懸浮於接種培養基中且計數以製備成指定濃度(10 6個細胞/毫升)。接著將五十(50) μL細胞溶液添加至96孔盤(含有額外50 μL接種培養基)中達到100 μL總體積且培育隔夜。將LNP調配的mRNA (GFP、ZF-NE、ZF9-MQ1、ZF3-KRAB、ZF9-MQ1 + ZF3-KRAB,或雙順反子ZF9-MQ1_ZF3-KRAB)以0.6 µg/ml、1.25 µg/ml及2.5 µg/ml濃度添加至含有細胞的100 µL額外培養基中。培育細胞72小時,第6小時開始置換維持培養基且隨後每日置換。處理之後的第72小時,評估MYC mRNA表現量(RT-PCR)及細胞存活率(CellTiter-GLO ®)。 This example evaluates the effect of bicistronic ZF9-MQ1_ZF3-KRAB on MYC mRNA and viability of primary hepatocytes. Cryopreserved primary human hepatocytes (Lonza) were thawed and added to pre-warmed thawed medium and seeded at 24 hours. Cells were resuspended in seeding medium and counted to prepare the indicated concentrations ( 106 cells/ml). Fifty (50) μL of the cell solution was then added to the 96-well plate (containing an additional 50 μL of seeding medium) to a total volume of 100 μL and incubated overnight. LNP-modulated mRNA (GFP, ZF-NE, ZF9-MQ1, ZF3-KRAB, ZF9-MQ1 + ZF3-KRAB, or bicistronic ZF9-MQ1_ZF3-KRAB) at 0.6 µg/ml, 1.25 µg/ml and A concentration of 2.5 µg/ml was added to 100 µL of additional medium containing the cells. Cells were incubated for 72 hours with replacement of maintenance medium beginning at 6 hours and then daily thereafter. At 72 hours after treatment, MYC mRNA expression (RT-PCR) and cell viability (CellTiter-GLO ® ) were assessed.
與GFP、ZF-NE或ZF3-KRAB相比,經ZF9-MQ1、ZF9-MQ1 + ZF3-KRAB或雙順反子ZF9-MQ1_ZF3-KRAB處理的原代肝細胞顯示MYC mRNA表現減少(圖30A)。總體而言,該處理對存活率顯示的影響最小,表明正常細胞隨之發生的MYC表現減少低於HCC細胞株(圖30B)。Primary hepatocytes treated with ZF9-MQ1, ZF9-MQ1 + ZF3-KRAB, or bicistronic ZF9-MQ1_ZF3-KRAB showed reduced MYC mRNA expression compared to GFP, ZF-NE, or ZF3-KRAB (Fig. 30A) . Overall, this treatment showed minimal effect on survival, indicating that normal cells had a lower consequent reduction in MYC expression than HCC cell lines (Fig. 30B).
在另一實驗中,低溫保存的PHH在預溫熱的肝細胞解凍培養基中解凍且在室溫下以100 g離心8分鐘。將細胞集結粒再懸浮於肝細胞接種培養基中。接著對細胞計數,且量測其基線存活率。製備細胞稀釋液,且將50,000個細胞接種於雙重複96孔盤中。培育細胞隔夜。次日完全移除接種培養基且添加預溫熱的肝細胞培養基。細胞用2.0、1.0或0.5 μg/mL的雙順反子ZF9-MQ1_ZF3-KRAB、ZF9-MQ1、ZF3-KRAB、ZF9-NE或對照GFP mRNA加以處理,重複三次,且培育6小時。接著移除處理培養基且用200 μL新鮮肝細胞培養基置換。轉染後再培育細胞66小時(總共72小時)。處理後,一個培養盤用CellTiter-GLO®試劑溶解,使用Glo Max Discovery讀盤器定量發光。藉由抽吸自第二個細胞培養盤中移除培養基且用RLT Plus溶解緩衝液溶解細胞。使用Rneasy® Plus 96套組,根據製造商說明書進行mRNA提取。提取之後,使用LunaScript® RT SuperMix套組(NEB)將mRNA轉化為cDNA。使用MYC (標靶)及GAPDH (參考)探針,經由ΔΔCT qPCR分析cDNA。In another experiment, cryopreserved PHH was thawed in pre-warmed hepatocyte thawing medium and centrifuged at 100 g for 8 min at room temperature. The cell pellet was resuspended in hepatocyte seeding medium. Cells were then counted and their baseline viability was measured. Cell dilutions were prepared and 50,000 cells were seeded in duplicate 96-well plates. Cells were incubated overnight. The seeding medium was completely removed the next day and pre-warmed hepatocyte medium was added. Cells were treated with 2.0, 1.0 or 0.5 μg/mL of bicistronic ZF9-MQ1_ZF3-KRAB, ZF9-MQ1, ZF3-KRAB, ZF9-NE or control GFP mRNA three times and incubated for 6 hours. Treatment medium was then removed and replaced with 200 μL of fresh hepatocyte medium. Cells were incubated for 66 hours after transfection (total 72 hours). After treatment, one plate is lysed with CellTiter-GLO® Reagent, and luminescence is quantified using a Glo Max Discovery plate reader. Media was removed from the second cell culture plate by aspiration and cells were lysed with RLT Plus Lysis Buffer. mRNA extraction was performed using the
與對照相比,經雙順反子ZF9-MQ1_ZF3-KRAB處理的PHH顯示MYC mRNA減少(圖30C)。ZF9-MQ1 (所有劑量)及ZF3-KRAB (2 µg/mL)下調MYC mRNA,對細胞存活率的影響最小(圖30D)。ZF9-NE不影響MYC mRNA或存活率(圖30C-D)。總體而言,雙順反子ZF9-MQ1_ZF3-KRAB處理對存活率顯示的影響最小,表明正常細胞中之MYC表現的減少不影響細胞存活率(圖30A-D)。 實例30:ZF9-MQ1+ZF3-KRAB在裸小鼠皮下生長之NCI-H2009模型中的活體內功效 PHH treated with the bicistronic ZF9-MQ1_ZF3-KRAB showed a decrease in MYC mRNA compared to controls (Fig. 30C). ZF9-MQ1 (all doses) and ZF3-KRAB (2 µg/mL) downregulated MYC mRNA with minimal effect on cell viability (Fig. 30D). ZF9-NE did not affect MYC mRNA or survival (Fig. 30C-D). Overall, bicistronic ZF9-MQ1_ZF3-KRAB treatment showed minimal effect on survival, suggesting that reduction of MYC expression in normal cells did not affect cell survival (Fig. 30A-D). Example 30: In vivo efficacy of ZF9-MQ1+ZF3-KRAB in the NCI-H2009 model grown subcutaneously in nude mice
此實例表明,ZF9-MQ1+ZF3-KRAB抑制雌性裸小鼠中所建立的NCI-H2009腫瘤生長。This example demonstrates that ZF9-MQ1+ZF3-KRAB inhibits the growth of established NCI-H2009 tumors in female nude mice.
藉由將NCI-H2009腫瘤細胞植入左側腹中來誘導雌性裸小鼠發生疾病。當平均腫瘤體積達到約100-150 mm 3時,開始治療。將小鼠分成治療組,使得各組之平均腫瘤體積大致相等。mRNA於MC3 LNP中遞送。小鼠每5天靜脈內注射3 mg/kg的ZF9-MQ1+ZF3-KRAB,或每5天靜脈內注射3 mg/kg的含有非編碼mRNA之MC3 LNP,或每15天腹膜內注射1 mg/kg的順鉑,或每5天注射PBS。 Disease was induced in female nude mice by implanting NCI-H2009 tumor cells into the left flank. Treatment was initiated when the average tumor volume reached approximately 100-150 mm. Mice were divided into treatment groups such that the mean tumor volumes in each group were approximately equal. mRNA was delivered in MC3 LNP. Mice were injected intravenously with 3 mg/kg of ZF9-MQ1+ZF3-KRAB every 5 days, or with MC3 LNP containing non-coding mRNA at 3 mg/kg intravenously every 5 days, or with 1 mg intraperitoneally every 15 days /kg of cisplatin, or inject PBS every 5 days.
結果顯示,ZF9-MQ1+ZF3-KRAB療法在投與三次之後,使腫瘤尺寸顯示統計學上顯著的降低,腫瘤體積在第25天與對照相比降低63%(圖31A),對經治療之小鼠的體重無顯著影響(圖31B)。在此研究中,ZF9-MQ1+ZF3-KRAB療法與順鉑療法對腫瘤體積的影響相等(圖31A)。
實例31:ZF9-MQ1 + ZF3-KRAB共調配物在Fox Chase CB17 SCID小鼠中正位生長之Hep 3B模型中的活體內功效
The results showed that ZF9-MQ1+ZF3-KRAB therapy showed a statistically significant reduction in tumor size after three administrations, and the tumor volume was reduced by 63% on
此實例展現ZF9-MQ1 + ZF3-KRAB共調配物在給與雌性Fox Chase CB17 SCID小鼠之後在正位Hep3B-luc模型中的長期抗腫瘤功效及耐久性。This example demonstrates the long-term antitumor efficacy and durability of ZF9-MQ1 + ZF3-KRAB co-formulations in the orthotopic Hep3B-luc model after administration to female Fox Chase CB17 SCID mice.
Hep-3B-luc細胞注射於SCID小鼠之肝臟的左上葉中。隨機分組之各組的平均腹視全身腫瘤相關生物發光(TABL)為約2.8x10
9p/s。小鼠隨機分配至四組12隻小鼠,該等小鼠各用PBS、ZF9-MQ1+ZF3-KRAB (較高劑量,亦即,6 mg/kg)、ZF9-MQ1+ZF3-KRAB (中劑量,亦即,3 mg/kg)、ZF9-MQ1+ZF3-KRAB (低劑量,亦即,3 mg/kg)治療,且分配至一組6隻小鼠,該等小鼠在細胞植入後的第7天用索拉非尼治療。腫瘤細胞植入後的第8天(在圖上標記為給藥第1天)開始治療。小鼠靜脈內以PBS (每5天一次,4次劑量,接著為每3天一次,2次劑量)、LNP (MC3) ZF9-MQ1 + ZF3-KRAB (1.5 mg/kg,每5天一次)治療,靜脈內以LNP (MC3) ZF9-MQ1 + ZF3-KRAB (3 mg/kg,每5天一次)治療,靜脈內以LNP (MC3) ZF9-MQ1 + ZF3-KRAB (6 mg/kg,每5天一次)治療,且以索拉非尼(50 mg/kg,每日一次)經口治療。所有動物每日稱重且目測評估。每週2次藉由生物發光量測腫瘤尺寸。
Hep-3B-luc cells were injected in the left upper lobe of the liver of SCID mice. Mean ventral whole-body tumor-associated bioluminescence (TABL) was approximately 2.8x109 p/s for each randomized group. Mice were randomly assigned to four groups of 12 mice each treated with PBS, ZF9-MQ1+ZF3-KRAB (higher dose, i.e., 6 mg/kg), ZF9-MQ1+ZF3-KRAB (medium dose, that is, 3 mg/kg), ZF9-MQ1+ZF3-KRAB (low dose, that is, 3 mg/kg) treatment, and assigned to a group of 6 mice, these mice were implanted with cells After the 7th day of treatment with sorafenib. Treatment started on
結果顯示,ZF9-MQ1 + ZF3-KRAB療法在投與兩次之後,與腫瘤尺寸受到顯著抑制相關。與陰性對照相比,1.5 mg/kg劑量治療使得腫瘤生長截至第23天受到約63%抑制;與陰性對照相比,3 mg/kg治療使得腫瘤生長截至第23天受到約54%抑制(圖32A)。類似地,6 mg/kg劑量ZF9-MQ1 + ZF3-KRAB療法在投與兩次之後,與統計學上顯著的腫瘤尺寸減小相關,使得第23天的腫瘤體積比陰性對照低63% (圖32A)。3 mg/kg ZF9-MQ1 + ZF3-KRAB療法與索拉非尼療法等效(圖32A)。經ZF9-MQ1 + ZF3-KRAB治療之小鼠的體重未經歷顯著降低(圖32B)。經索拉非尼治療之小鼠的體重初始經歷下降,隨後總體重因腫瘤塊增大而潛在地增加(圖32B)。此等資料表明ZF9-MQ1 + ZF3-KRAB療法在此研究中的耐受性良好。
實例32:編碼ZF9-MQ1及ZF3-KRAB的雙順反子mRNA減少MYC表現及細胞存活率
Results showed that ZF9-MQ1 + ZF3-KRAB therapy was associated with significant suppression of tumor size after two administrations. Compared with the negative control, treatment at 1.5 mg/kg dose resulted in approximately 63% inhibition of tumor growth by
此實例對雙順反子構築體ZF9-MQ1_ZF3-KRAB與單一構築體ZF3-KRAB及ZF9-MQ1以及ZF3-KRAB與ZF9-MQ1之共調配物的功效進行比較。此等構築體經由囊封於脂質奈米粒子(LNP)中之mRNA遞送至肝細胞癌細胞。This example compares the efficacy of co-formulations of the bicistronic construct ZF9-MQ1_ZF3-KRAB with the single constructs ZF3-KRAB and ZF9-MQ1 and ZF3-KRAB with ZF9-MQ1. These constructs were delivered to hepatocellular carcinoma cells via mRNA encapsulated in lipid nanoparticles (LNP).
ZF9-MQ1、ZF3-KRAB、雙順反子ZF9-MQ1_ZF3-KRAB及共調配的ZF9-MQ1與ZF3-KRAB構築體係藉由將各別mRNA囊封於LNP中來製備。藉由將每孔10,000個細胞接種於96孔盤中來轉染Hep 3B細胞且進一步用0.6 μg/mL及2 μg/mL mRNA/LNP處理。ZF9-MQ1 , ZF3-KRAB, bicistronic ZF9-MQ1_ZF3-KRAB and co-formulated ZF9-MQ1 and ZF3-KRAB constructs were prepared by encapsulating the respective mRNAs in LNPs.
轉染後的第48小時,分析MYC mRNA及細胞存活率。使用Promega的Celltiter-GLO ®分析套組,根據製造商方案量測存活率。使用Rneasy ®Plus 96孔套組(Qiagen),依循製造商方案,分離來自三個生物學複本的RNA。使用LunaScript ®RT SuperMix套組(NEB)將RNA樣品逆轉錄成cDNA且使用MYC特異性TaqMan TM引子/探針組分析及TaqMan TMFast Advanced預混液(Thermo Scientific),藉由定量PCR (qPCR)加以分析(技術實驗重複三次)。使用ΔΔCt方法,相對於GAPDH參考基因的表現來定量MYC表現(使用TaqMan TM引子/探針)。利用未處理的細胞將MYC表現標準化。 At 48 hours after transfection, MYC mRNA and cell viability were analyzed. Viability was measured using Promega's Celltiter-GLO ® assay kit according to the manufacturer's protocol. RNA from three biological replicates was isolated using the RNeasy® Plus 96-well kit (Qiagen) following the manufacturer's protocol. RNA samples were reverse transcribed into cDNA using the LunaScript® RT SuperMix Kit (NEB) and analyzed by quantitative PCR (qPCR) using MYC-specific TaqMan ™ Primer/Probe Set Analysis and TaqMan ™ Fast Advanced Master Mix (Thermo Scientific). Analysis (technical experiments repeated three times). MYC expression (using TaqMan ™ primers/probes) was quantified relative to the expression of the GAPDH reference gene using the ΔΔCt method. MYC expression was normalized to untreated cells.
此等資料表明,雙順反子構築體ZF9-MQ1_ZF3-KRAB使Hep 3B細胞之MYC mRNA及細胞存活率下調的程度大於單獨的單一構築體(ZF3-KRAB或ZF9-MQ1)(圖33A-33B)。第48小時,雙順反子ZF9-MQ1_ZF3-KRAB在0.6 µg/ml與2 µg/ml濃度下均使總MYC mRNA水準降低99% (圖33A)。雙順反子ZF9-MQ1_ZF3-KRAB在2 µg/ml與0.6 µg/ml濃度下分別使Hep3B細胞存活率降低約80%及27% (圖33B)。另外,雙順反子構築體療法與ZF3-KRAB及ZF9-MQ1構築體的共調配物同樣有效。
實例33:雙順反子ZF9-MQ1_ZF3-KRAB以劑量依賴性方式降低所有HCC亞型的MYC mRNA及HCC細胞存活率
These data demonstrate that the bicistronic construct ZF9-MQ1_ZF3-KRAB downregulates MYC mRNA and cell viability in
此實例評價雙順反子ZF9-MQ1_ZF3-KRAB對所有HCC亞型S1及S2的效能。用雙順反子ZF9-MQ1_ZF3-KRAB處理HCC S1亞型細胞株SKHEP-1、SNU-449及SNU-182以及S2亞型細胞株Hep 3B及Hep G2。在處理72小時之後,吾等評價雙順反子構築體ZF9-MQ1_ZF3-KRAB在連續稀釋之濃度下的MYC mRNA及細胞存活率。This example evaluates the potency of the bicistronic ZF9-MQ1_ZF3-KRAB against all HCC subtypes S1 and S2. The HCC S1 subtype cell lines SKHEP-1, SNU-449 and SNU-182 and the S2 subtype
將HCC細胞於生長培養基中接種於96孔盤中(約10,000個細胞/孔)。接著將LNP調配物(始於2.5 μg/ml)添加至3個孔中,接著各在後續孔中約1:2稀釋成10個劑量點以便轉染mRNA,接著培育72小時。收集不同複本盤以獲得存活率及RNA。使用Promega的Celltiter-GLO ®分析套組,根據製造商方案量測存活率。使用Rneasy ®Plus 96孔套組(Qiagen),依循製造商方案,分離來自三個生物學複本的RNA。使用LunaScript ®SuperMix套組(NEB)將RNA樣品逆轉錄成cDNA且使用MYC特異性TaqMan TM引子/探針組分析及TaqMan TMFast Advanced預混液(Thermo Scientific),藉由定量PCR (qPCR)加以分析(技術實驗重複三次)。使用ΔΔCt方法,相對於GAPDH參考基因的表現來定量MYC表現(使用TaqMan TM引子/探針)。利用未處理的細胞將MYC表現標準化。 HCC cells were seeded in 96-well plates (approximately 10,000 cells/well) in growth medium. LNP formulations (starting at 2.5 μg/ml) were then added to 3 wells, then diluted approximately 1 :2 into 10 dose points each in subsequent wells for transfection of mRNA, followed by 72 hours of incubation. Different replica discs were collected for survival and RNA. Viability was measured using Promega's Celltiter-GLO ® assay kit according to the manufacturer's protocol. RNA from three biological replicates was isolated using the RNeasy® Plus 96-well kit (Qiagen) following the manufacturer's protocol. RNA samples were reverse transcribed into cDNA using the LunaScript® SuperMix Kit (NEB) and analyzed by quantitative PCR (qPCR) using MYC-specific TaqMan ™ Primer/Probe Set Analysis and TaqMan ™ Fast Advanced Master Mix (Thermo Scientific) (Technical experiment repeated three times). MYC expression (using TaqMan ™ primers/probes) was quantified relative to the expression of the GAPDH reference gene using the ΔΔCt method. MYC expression was normalized to untreated cells.
結果表明,雙順反子ZF9-MQ1_ZF3-KRAB處理對所有HCC亞型的細胞存活率皆顯示作用(圖34A-C)。關於抑制MYC mRNA的EC50值在<1-20 ng/mL範圍內,而S1與S2亞型之間無此傾向(圖34D)。類似地,與MYC mRNA EC50相比,50%細胞存活率的減小換算成更高的數值範圍(120-200 ng/mL)。在S1與S2亞型之間,MYC mRNA表現或細胞存活率無顯著差異。三種S1及兩種S2亞型HCC腫瘤細胞株的EC50值證明,雙順反子ZF9-MQ1_ZF3-KRAB有效針對兩種HCC亞型(圖34D)。 實例34:雙順反子ZF9-MQ1_ZF3-KRAB對HCC細胞誘導的細胞凋亡 The results showed that bicistronic ZF9-MQ1_ZF3-KRAB treatment showed an effect on cell viability in all HCC subtypes (Fig. 34A-C). EC50 values for inhibition of MYC mRNA ranged from <1-20 ng/mL with no trend between S1 and S2 subtypes (Fig. 34D). Similarly, the 50% reduction in cell viability translated into a higher range of values (120-200 ng/mL) compared to the MYC mRNA EC50. There were no significant differences in MYC mRNA expression or cell viability between the S1 and S2 subtypes. The EC50 values of the three S1 and two S2 subtype HCC tumor cell lines demonstrated that the bicistronic ZF9-MQ1_ZF3-KRAB was effective against the two HCC subtypes (Fig. 34D). Example 34: Apoptosis induced by bicistronic ZF9-MQ1_ZF3-KRAB on HCC cells
此實例描述雙順反子ZF9-MQ1_ZF3-KRAB對HCC細胞產生的細胞凋亡作用。存活率分析(諸如CellTiter-GLO ®)僅評估孔中剩餘的細胞相對數,其基於細胞增殖損失與細胞死亡之間無法區分的ATP水準來評估。 This example describes the apoptotic effect of bicistronic ZF9-MQ1_ZF3-KRAB on HCC cells. Viability assays (such as CellTiter-GLO ® ) only assess the relative number of cells remaining in the well, based on ATP levels that are indistinguishable between loss of cell proliferation and cell death.
在此實例中,用雙順反子ZF9-MQ1_ZF3-KRAB轉染之後,使用連至磷脂結合蛋白V蛋白(磷脂結合蛋白V FITC)及碘化丙錠(PI)的螢光標記抗體定量以下三種HCC細胞株中的凋亡細胞:Hep 3B、Hep G2及SK-HEP-1。除未處理之細胞之外,非編碼mRNA用作陰性對照。HCC細胞於生長培養基中接種於12孔盤中(每孔個50,000個細胞)。接著將LNP調配物(1 μg/ml)添加至細胞中以轉染mRNA且培育48小時。收集細胞且使用BD磷脂結合蛋白V: FITC細胞凋亡偵測套組(BDB556570)加以染色且藉由流式細胞術加以分析。對於磷脂結合蛋白V FITC及PI呈陽性的細胞歸類為細胞凋亡。In this example, following transfection with the bicistronic ZF9-MQ1_ZF3-KRAB, the following three Apoptotic cells in HCC cell lines:
此等資料顯示在雙順反子ZF9-MQ1_ZF3-KRAB處理第48小時,偵測到Hep 3B及Hep G2細胞株的凋亡細胞>75%且偵測到SK-HEP-1細胞株的凋亡細胞為15% (圖35)。與未處理的細胞(5-20%背景細胞凋亡)相比,非編碼mRNA對照物對細胞無影響(圖35)。表明雙順反子ZF9-MQ1_ZF3-KRAB能夠誘導培養中之HCC細胞株發生細胞凋亡。These data show that at 48 hours after bicistronic ZF9-MQ1_ZF3-KRAB treatment, >75% of apoptotic cells were detected in
實例35:雙順反子ZF9-MQ1_ZF3-KRAB以持久方式降低MYC mRNA水準Example 35: The bicistronic ZF9-MQ1_ZF3-KRAB reduces MYC mRNA levels in a durable manner
在此實例中,雙順反子ZF9-MQ1_ZF3-KRAB SSOP LNP處理一次之後,評價雙順反子ZF9-MQ1_ZF3-KRAB實現MYC mRNA下調的耐久性。雙順反子ZF9-MQ1_ZF3-KRAB藉由將甲基化及抑制性組蛋白標記引向MYC IGD來抑制MYC。為了確定彼等修飾可有效發揮作用的持續時間,使用SK-HEP-1細胞,因為在雙順反子ZF9-MQ1_ZF3-KRAB處理之後,彼等修飾對細胞存活率展現的影響最小。此研究的目標為確定雙順反子ZF9-MQ1_ZF3-KRAB處理一次是否能使MYC mRNA抑制維持約2週。In this example, the durability of MYC mRNA downregulation achieved by the bicistronic ZF9-MQ1_ZF3-KRAB was evaluated following one treatment with the bicistronic ZF9-MQ1_ZF3-KRAB SSOP LNP. The bicistronic ZF9-MQ1_ZF3-KRAB inhibits MYC by directing methylation and repressive histone marks to MYC IGD. To determine the duration for which these modifications are effective, SK-HEP-1 cells were used, since these modifications exhibited minimal effects on cell viability after bicistronic ZF9-MQ1_ZF3-KRAB treatment. The goal of this study was to determine whether a single treatment with the bicistronic ZF9-MQ1_ZF3-KRAB could maintain MYC mRNA repression for approximately 2 weeks.
將SK-HEP-1細胞於2 mL生長培養基中以每孔200,000個細胞接種於6孔盤中。接著用0.6 μg/mL雙順反子ZF9-MQ1_ZF3-KRAB或對照的非編碼mRNA LNP處理細胞。第1天,用雙順反子ZF9-MQ1_ZF3-KRAB處理後,以胰蛋白酶處理細胞且分成三個樣品;1個樣品用於RNA提取、1個樣品用於基因體DNA (gDNA)提取、1個樣品保存供將來的時間點用。處理後的第3天、第6天、第9天及第12天重複此過程。第15天,將剩餘的細胞等分用於RNA及gDNA提取。使用Rneasy
®Plus 96孔套組(Qiagen),依循製造商方案,分離來自三個生物學複本的RNA。使用LunaScript
®RT SuperMix套組(NEB)將RNA樣品逆轉錄成cDNA且使用MYC特異性TaqMan
TM引子/探針組分析及TaqMan
TMFast Advanced預混液(Thermo Scientific),藉由定量PCR (qPCR)加以分析(技術實驗重複三次)。使用ΔΔCt方法,相對於GAPDH參考基因的表現來定量MYC表現(使用TaqMan
TM引子/探針)。利用未處理的細胞將MYC表現標準化。
SK-HEP-1 cells were seeded in 6-well plates at 200,000 cells per well in 2 mL of growth medium. Cells were then treated with 0.6 μg/mL bicistronic ZF9-MQ1_ZF3-KRAB or control non-coding mRNA LNP. On
此資料展現雙順反子ZF9-MQ1_ZF3-KRAB介導MYC基因表現調節的耐久性。雙順反子ZF9-MQ1_ZF3-KRAB處理SKHEP1細胞一次之後,MYC mRNA水準在第1天降低且在處理之後保持抑制長達15天(圖36)。
實例36:雙順反子ZF9-MQ1_ZF3-KRAB減少HCC細胞株的MYC mRNA及蛋白質表現及細胞存活率
This data demonstrates the durability of bicistronic ZF9-MQ1_ZF3-KRAB-mediated regulation of MYC gene expression. After one treatment of SKHEP1 cells with bicistronic ZF9-MQ1_ZF3-KRAB, MYC mRNA levels decreased at
為了理解雙順反子ZF9-MQ1_ZF3-KRAB對多種HCC細胞的藥效學,在雙順反子ZF9-MQ1_ZF3-KRAB處理之後的第6-96小時,評估HCC細胞中之MYC mRNA表現及蛋白質水準、細胞存活率,及雙順反子ZF9-MQ1_ZF3-KRAB表現。To understand the pharmacodynamics of bicistronic ZF9-MQ1_ZF3-KRAB on various HCC cells, MYC mRNA expression and protein levels were assessed in HCC cells at 6-96 hours after bicistronic ZF9-MQ1_ZF3-KRAB treatment , cell viability, and bicistronic ZF9-MQ1_ZF3-KRAB expression.
在各時間點,將Hep 3B或SK-HEP-1細胞於培養基中以每孔10,000個細胞接種於一組兩個96孔盤中且以每孔400,000個細胞接種於6孔盤中。用1 μg/mL的雙順反子ZF9-MQ1_ZF3-KRAB或非編碼mRNA處理96孔盤,重複三次,以進行細胞存活率及mRNA分析。用1 μg/mL的雙順反子ZF9-MQ1_ZF3-KRAB或非編碼mRNA處理6孔盤,重複兩次,以進行蛋白質分析。在處理之後的第6、24、48、72或96小時培育細胞,在各時間點保留一小組細胞作為未處理的陰性對照。在各時間點,用CellTiter-GLO
®溶解一個96孔盤且使用GLOMAX
®定量發光。用RLT Plus溶解緩衝液溶解第二個96孔盤,以便使用Rneasy
®Plus 96套組進行mRNA提取。使溶解的樣品結合至RNA管柱,用緩衝液洗滌且溶離出來。接著使用RT LunaScript
®將mRNA轉化為cDNA。接著使用MYC (標靶)及GAPDH (參考)探針,經由ΔΔCT qPCR分析cDNA。在各時間點用RIPA緩衝液溶解6孔盤中的細胞用於蛋白質分離。使用Pierce BCA蛋白質分析(23225)定量總蛋白質。各樣品負載等量的蛋白質且使用NuPAGE
TM小型凝膠系統(Thermo Fisher)、根據尺寸加以分離。接著使用iBlot™ 2凝膠轉移裝置(Thermo Fisher)將蛋白質轉移至PVDF膜上。用抗MYC抗體(Abcam ab32072)探測膜隔夜。抗β-肌動蛋白抗體(Cell Signaling 8H10D10)用作內參考物。接著使用針對MYC的螢光二級抗體及β-肌動蛋白初級抗體、使用LI-COR成像系統將信號可視化且定量。
At each time point,
與非編碼短mRNA或未處理的細胞相比,雙順反子ZF9-MQ1_ZF3-KRAB在第6小時使兩種細胞株中的MYC mRNA及蛋白質表現均減少,隨後96小時保持下調(圖37)。用雙順反子ZF9-MQ1_ZF3-KRAB處理之後的第48小時,觀測到兩種細胞株的細胞存活率降低(圖37)。較短的非編碼陰性對照物對細胞存活率沒有影響。 實例37:雙順反子ZF9-MQ1_ZF3-KRAB mRNA表現ZF9-MQ1與ZF3-KRAB,如藉由HA標記的蛋白質可視化 Bicistronic ZF9-MQ1_ZF3-KRAB reduced MYC mRNA and protein expression in both cell lines at 6 hours compared to non-coding short mRNA or untreated cells, and remained down-regulated for 96 hours thereafter (Figure 37) . At 48 hours after treatment with the bicistronic ZF9-MQ1_ZF3-KRAB, a decrease in cell viability was observed for both cell lines (Figure 37). The shorter non-coding negative control had no effect on cell viability. Example 37: Bicistronic ZF9-MQ1_ZF3-KRAB mRNA expresses ZF9-MQ1 and ZF3-KRAB as visualized by HA-tagged protein
在此實例中,經由西方墨點分析來證實雙順反子ZF9-MQ1_ZF3-KRAB mRNA/LNP所編碼之蛋白質的表現。在細胞內部,雙順反子ZF9-MQ1_ZF3-KRAB產生兩種ZF蛋白質(ZF3-KRAB及ZF9-MQ1),在此實驗中,該兩種蛋白質用HA標記,從而允許在轉染HCC細胞之後對蛋白質表現進行定量。In this example, the expression of the protein encoded by the bicistronic ZF9-MQ1_ZF3-KRAB mRNA/LNP was confirmed by Western blot analysis. Inside the cell, the bicistronic ZF9-MQ1_ZF3-KRAB produces two ZF proteins (ZF3-KRAB and ZF9-MQ1), which in this experiment were tagged with HA, allowing for the identification of HCC cells after transfection. Protein expression was quantified.
在各時間點,將Hep 3B或SK-HEP-1細胞於培養基中以每孔400,000個細胞接種於6孔盤中。用1 μg/mL的雙順反子ZF9-MQ1_ZF3-KRAB處理6孔盤,重複兩次,以進行蛋白質分析。培育細胞6或24小時。接著將細胞溶解於RIPA緩衝液中且利用Pierce BCA蛋白質分析(23225)定量總蛋白質水準。各樣品負載等量的蛋白質且使用NuPAGE
TM小型凝膠系統(Thermo Fisher)、根據尺寸加以分離。接著使用iBlot™ 2凝膠轉移裝置(Thermo Fisher)將蛋白質轉移至PVDF膜上。用Abcam HA抗體探測膜隔夜。抗β-肌動蛋白抗體(Cell Signaling 8H10D10)用作內參考物。使用針對MYC的螢光二級抗體及β-肌動蛋白抗體、使用LI-COR成像系統可視化信號且加以定量。
At each time point,
在轉染之後的第6小時與第24小時,由雙順反子ZF9-MQ1_ZF3-KRAB mRNA編碼的ZF3-KRAB與ZF9-MQ1蛋白質在西方墨點上藉由HA標籤可視化(圖38)。觀測ZF3-KRAB與ZF9-MQ1構築體在兩個時間點的聚集。 實例38:雙順反子ZF9-MQ1_ZF3-KRAB增強HCC細胞對索拉非尼的反應 ZF3-KRAB and ZF9-MQ1 proteins encoded by bicistronic ZF9-MQ1_ZF3-KRAB mRNA were visualized by HA-tag on western blots at 6 and 24 hours after transfection ( FIG. 38 ). Aggregation of the ZF3-KRAB and ZF9-MQ1 constructs was observed at two time points. Example 38: Bicistronic ZF9-MQ1_ZF3-KRAB enhances the response of HCC cells to Sorafenib
在此實例中,評價雙順反子ZF9-MQ1_ZF3-KRAB對小分子索拉非尼(多激酶抑制劑)於HCC細胞株中之功效的影響。索拉非尼用作HCC的標準照護療法且高MYC水準可預測索拉非尼抗藥性。假設雙順反子ZF9-MQ1_ZF3-KRAB可經由其下調MYC而使HCC對索拉非尼再敏感。劑量反應分析係用於評價雙順反子ZF9-MQ1_ZF3-KRAB療法是否能降低索拉非尼的IC 50。 In this example, the effect of the bicistronic ZF9-MQ1_ZF3-KRAB on the efficacy of the small molecule sorafenib, a multikinase inhibitor, in HCC cell lines was evaluated. Sorafenib is used as standard-of-care therapy in HCC and high MYC levels predict sorafenib resistance. We hypothesized that the bicistronic ZF9-MQ1_ZF3-KRAB could resensitize HCC to sorafenib via its downregulation of MYC. Dose-response analysis was used to evaluate whether bicistronic ZF9-MQ1_ZF3-KRAB therapy can reduce the IC 50 of sorafenib.
為了評價雙順反子ZF9-MQ1_ZF3-KRAB與索拉非尼之間的潛在協同作用,將Hep 3B或SK-HEP-1細胞以每孔10,000個細胞接種於96孔盤中。接著用劑量範圍在0.1與25 μM之間的索拉非尼(1:2連續稀釋)處理細胞。接著以0.1或0.6 μg/mL之劑量將載運雙順反子ZF9-MQ1_ZF3-KRAB的脂質混合物添加至一小組孔中。一組細胞用單獨的索拉非尼處理以用作對照。處理72小時之後,用CellTiter-GLO
®試劑溶解細胞且使用Glo Max定量發光。藉由將未處理的值取平均值且將各實驗的螢光素酶值除以該平均值來計算相對的細胞存活率。
表19:雙順反子ZF9-MQ1_ZF3-KRAB對索拉非尼IC
50的影響
資料顯示,當索拉非尼與0.6 µg/ml雙順反子ZF9-MQ1_ZF3-KRAB組合投與時,索拉非尼針對SKHEP1的IC
50自12.3減小至10.7 µM (圖39A及表19)且針對Hep 3B的IC
50自4.4減小至2.9 µM (圖39B及表19)。當索拉非尼與0.1 µg/ml雙順反子ZF9-MQ1_ZF3-KRAB組合投與時,索拉非尼針對Hep 3B或SK-HEP-1細胞的IC
50未出現顯著變化(圖39A-39B)。索拉非尼與雙順反子ZF9-MQ1_ZF3-KRAB的組合比單獨的索拉非尼更有效(圖39A-39B)。此資料表明雙順反子ZF9-MQ1_ZF3-KRAB與索拉非尼之間的協同活性。
實例39:雙順反子ZF9-MQ1_ZF3-KRAB增強HCC細胞對JQ1的反應
The data showed that when Sorafenib was administered in combination with 0.6 µg/ml bicistronic ZF9-MQ1_ZF3-KRAB, the IC 50 of Sorafenib against SKHEP1 decreased from 12.3 to 10.7 µM (Figure 39A and Table 19) And the IC 50 against
此實例評價雙順反子ZF9-MQ1_ZF3-KRAB與JQ1 (BET抑制劑)之組合針對多種HCC細胞株的功效。BET蛋白質已顯示對MYC轉錄具有重要作用。臨床中當前正針對血液學適應症評價新一代BET抑制劑;然而,其毒性概況已限制其使用。在降低的劑量水準下可增強BET抑制劑效能的組合療法可改善BET抑制劑耐受性。This example evaluates the efficacy of the combination of bicistronic ZF9-MQ1_ZF3-KRAB and JQ1 (BET inhibitor) against various HCC cell lines. BET proteins have been shown to have an important role in MYC transcription. A new generation of BET inhibitors is currently being evaluated in the clinic for hematological indications; however, their toxicity profile has limited their use. Combination therapies that enhance the efficacy of BET inhibitors at reduced dose levels may improve BET inhibitor tolerability.
為了評價雙順反子ZF9-MQ1_ZF3-KRAB與JQ1之間的潛在協同作用,將Hep 3B或SK-HEP-1細胞以每孔10,000個細胞接種於96孔盤中。接著用劑量範圍在0.1與25 μM之間的JQ1 (1:2連續稀釋)處理細胞。接著以0.1或0.6 μg/mL之劑量將載運雙順反子ZF9-MQ1_ZF3-KRAB的脂質混合物添加至一組孔中。一組細胞用單獨的JQ1處理以用作對照。處理72小時之後,用CellTiter-GLO
®試劑溶解細胞且使用Glo Max定量發光。藉由將未處理的值取平均值且將各實驗的螢光素酶值除以該平均值來計算相對的細胞存活率。
表 20 : 雙順反子 ZF9 - MQ1 _ ZF3 - KRAB 對 JQ1 IC50 的 影響
0.6 μg/mL的雙順反子ZF9-MQ1_ZF3-KRAB與JQ1組合使JQ1針對SK-HEP1的IC
50自>25減小至1.1 µM (圖40A及表20)且使JQ1針對Hep 3B的IC
50自6.6減小至0.2 μM (圖40B及表20)。0.1 μg/mL的雙順反子ZF9-MQ1_ZF3-KRAB與JQ1組合使JQ1針對SK-HEP1的IC
50自>25減小至1.9 µM (圖40A及表20)且使JQ1針對Hep 3B的IC
50自6.6減小至0.6 μM (圖40B及表20)。此資料表明雙順反子構築體ZF9-MQ1_ZF3-KRAB與JQ1之間的協同活性。
實例40:篩選為了靶向MYC基因體基因座而設計的小鼠替代構築體
Combination of bicistronic ZF9-MQ1_ZF3-KRAB at 0.6 μg/mL with JQ1 decreased the IC 50 of JQ1 against SK-HEP1 from >25 to 1.1 μM (Figure 40A and Table 20) and increased the IC 50 of JQ1 against
此實例(小鼠肝細胞癌模型Hepa1-6)用於評價靶向小鼠中之MYC IGD的構築體。開發此等構築體(ZF15-MQ1、ZF16-MQ1及ZF17-MQ1)作為靶向小鼠基因體之雙順反子ZF9-MQ1_ZF3-KRAB的替代物。吾等評價此等構築體下調MYC mRNA表現且降低小鼠HCC細胞存活率的能力。This example (mouse hepatocellular carcinoma model Hepal-6) was used to evaluate constructs targeting MYC IGD in mice. These constructs (ZF15-MQ1 , ZF16-MQ1 and ZF17-MQ1 ) were developed as alternatives to the bicistronic ZF9-MQ1_ZF3-KRAB targeting the mouse gene body. We evaluated the ability of these constructs to downregulate MYC mRNA expression and reduce mouse HCC cell survival.
產生一組小鼠替代構築體(ZF15-MQ1、ZF16-MQ1及ZF17-MQ1)且藉由將每孔10,000個細胞接種於雙重複盤中而在Hepa1-6細胞中篩選,用於mRNA提取或細胞存活率分析。用0.6或1.2 μg/mL的候選ZF處理96孔盤,重複三次。培育細胞72小時。培育期之後,用CellTiter-GLO ®溶解一個96孔盤且使用GloMax ®Discovery讀盤器定量發光。用RLT Plus溶解緩衝液溶解第二個96孔盤,以便使用Rneasy ®Plus 96套組進行mRNA提取。使溶解的樣品結合至RNA管柱,用緩衝液洗滌且溶離出來。接著使用RT LunaScript ®將mRNA轉化為cDNA。接著使用MYC (標靶)及GAPDH (參考)探針,經由ΔΔCT qPCR分析cDNA。 A panel of mouse replacement constructs (ZF15-MQ1, ZF16-MQ1, and ZF17-MQ1) was generated and screened in Hepa1-6 cells by seeding 10,000 cells per well in duplicate dishes for mRNA extraction or Cell Viability Analysis. Treat 96-well plates with 0.6 or 1.2 μg/mL of candidate ZFs in triplicate. Cells were incubated for 72 hours. Following the incubation period, a 96-well plate was lysed with CellTiter-GLO ® and luminescence quantified using a GloMax ® Discovery plate reader. Dissolve a second 96-well plate with RLT Plus Lysis Buffer for mRNA extraction using the Rneasy ® Plus 96 Kit. The solubilized sample was bound to an RNA column, washed with buffer and eluted. The mRNA is then converted to cDNA using RT LunaScript® . The cDNA was then analyzed by ΔΔCT qPCR using MYC (target) and GAPDH (reference) probes.
此篩選表明,ZF17-MQ1顯示小鼠MYC mRNA下調(圖41A),與此對應的是存活率降低(圖41B)。 實例41:小鼠替代構築體ZF17-MQ1對HCC細胞存活率及MYC mRNA及蛋白質表現的影響 This screen showed that ZF17-MQ1 exhibited downregulation of mouse MYC mRNA (FIG. 41A), which corresponded to reduced survival (FIG. 41B). Example 41: Effect of Mouse Alternative Construct ZF17-MQ1 on HCC Cell Survival Rate and MYC mRNA and Protein Expression
在此實例中,使用小鼠肝細胞癌模型Hepa1-6評價ZF17-MQ1構築體使MYC mRNA及蛋白質表現及小鼠HCC細胞存活率下調的作用。In this example, the effect of the ZF17-MQ1 construct on downregulation of MYC mRNA and protein expression and mouse HCC cell survival was evaluated using the mouse hepatocellular carcinoma model Hepa1-6.
用ZF17-MQ1或GFP mRNA處理96小時後,分析MYC mRNA及細胞存活率。Hepa1-6細胞以每孔10,000個細胞接種於雙重複盤中以進行mRNA提取或細胞存活率分析。用0.6或1.2 μg/mL的ZF17-MQ1處理96孔盤,重複三次。培育之後,用CellTiter-GLO ®溶解一個96孔盤且使用GloMax ®Discovery讀盤器定量發光。用RLT Plus溶解緩衝液溶解第二個96孔盤,以便使用Rneasy ®Plus 96套組進行mRNA提取。使溶解的樣品結合至RNA管柱,用緩衝液洗滌且溶離出來。接著使用RT Lunascript將mRNA轉化為cDNA。接著使用MYC (標靶)及GAPDH (參考)探針,經由ΔΔCT qPCR分析cDNA。 After treatment with ZF17-MQ1 or GFP mRNA for 96 hours, MYC mRNA and cell viability were analyzed. Hepa1-6 cells were plated in duplicate at 10,000 cells per well for mRNA extraction or cell viability analysis. Treat 96-well plates with 0.6 or 1.2 μg/mL of ZF17-MQ1 in triplicate. After incubation, a 96-well plate was lysed with CellTiter-GLO ® and luminescence was quantified using a GloMax ® Discovery plate reader. Dissolve a second 96-well plate with RLT Plus Lysis Buffer for mRNA extraction using the Rneasy ® Plus 96 Kit. The solubilized sample was bound to an RNA column, washed with buffer and eluted. The mRNA was then converted to cDNA using RT Lunascript. The cDNA was then analyzed by ΔΔCT qPCR using MYC (target) and GAPDH (reference) probes.
藉由在12孔盤中用ZF17-MQ1或對照GFP mRNA轉染100,000個細胞而處理24及48小時之後,分析MYC蛋白質水準。將細胞溶解於RIPA緩衝液中且利用Pierce BCA蛋白質分析(23225)定量總蛋白質水準。各樣品負載等量的蛋白質且使用NuPAGE TM小型凝膠系統(Thermo Fisher)、根據尺寸加以分離。接著使用iBlot™凝膠轉移裝置(Thermo Fisher)將蛋白質轉移至PVDF膜上。用抗MYC抗體(Abcam ab32072)探測膜隔夜。抗β-肌動蛋白抗體(Cell Signaling 8H10D10)用作內參考物。接著使用針對MYC的螢光二級抗體及β-肌動蛋白抗體、使用LI-COR成像系統將信號可視化且定量。 MYC protein levels were analyzed after 24 and 48 hours of treatment by transfecting 100,000 cells with ZF17-MQ1 or control GFP mRNA in 12-well plates. Cells were lysed in RIPA buffer and total protein levels were quantified using the Pierce BCA protein assay (23225). Each sample was loaded with an equal amount of protein and separated by size using the NuPAGE ™ Mini Gel System (Thermo Fisher). Proteins were then transferred to PVDF membranes using the iBlot™ Gel Transfer Apparatus (Thermo Fisher). Membranes were probed overnight with anti-MYC antibody (Abcam ab32072). Anti-β-actin antibody (Cell Signaling 8H10D10) was used as internal reference. The signal was then visualized and quantified using a fluorescent secondary antibody against MYC and a β-actin antibody using a LI-COR imaging system.
資料表明,ZF17-MQ1藉由靶向MYC IGD而能夠充當雙順反子ZF9-MQ1_ZF3-KRAB的小鼠替代物。ZF17-MQ1對小鼠HCC細胞Hepa1-6的處理在第24及48小時顯著下調MYC蛋白(圖42A)。ZF17-MQ1對小鼠HCC細胞Hepa1-6的處理顯示MYC mRNA在第96小時顯著下調(圖42C)且細胞存活率在第96小時降低(圖42D)。 實例42:Hepa1-6皮下同基因模型顯示ZF17-MQ1的功效 The data suggest that ZF17-MQ1 can act as a mouse surrogate for the bicistronic ZF9-MQ1_ZF3-KRAB by targeting MYC IGD. Treatment of mouse HCC cells Hepa1-6 with ZF17-MQ1 significantly down-regulated MYC protein at 24 and 48 hours ( FIG. 42A ). ZF17-MQ1 treatment of mouse HCC cells Hepa1-6 showed a significant downregulation of MYC mRNA at 96 hours ( FIG. 42C ) and decreased cell viability at 96 hours ( FIG. 42D ). Example 42: Hepa1-6 subcutaneous isogenic model shows efficacy of ZF17-MQ1
此實例展現ZF17-MQ1在免疫勝任動物中的功效。特定言之,將Hepa1-6植入同基因接受者C57BL/6正常小鼠中。This example demonstrates the efficacy of ZF17-MQ1 in immunocompetent animals. Specifically, Hepal-6 was implanted into isogenic recipient C57BL/6 normal mice.
藉由將Hepa1-6腫瘤細胞植入左側腹中來誘導雌性C57BL/6小鼠發生疾病。當平均腫瘤體積達到約150 mm 3時,開始治療。將小鼠分成治療組(PBS或ZF17-MQ1各有9隻小鼠,索拉非尼有6隻小鼠),使得各組的平均腫瘤體積大致相等。小鼠靜脈內注射3 mg/kg的PBS或ZF17-MQ1。50 mg/kg的陽性對照標準照護療法藥物索拉非尼每天經由經口管飼給與。ZF17-MQ1每5天給與一次,共4次劑量,接著停藥2週,再開始治療兩次以上。所有動物每日稱重且目測評估。經由測徑規每週量測腫瘤尺寸3次。 Disease was induced in female C57BL/6 mice by implanting Hepal-6 tumor cells into the left flank. Treatment was initiated when the average tumor volume reached approximately 150 mm. Mice were divided into treatment groups (nine mice each for PBS or ZF17-MQ1 and six mice for sorafenib) such that the mean tumor volumes in each group were approximately equal. Mice were injected intravenously with 3 mg/kg of PBS or ZF17-MQ1. The positive control standard of care drug Sorafenib at 50 mg/kg was given daily via oral gavage. ZF17-MQ1 was given once every 5 days for a total of 4 doses, followed by a 2-week break before starting treatment two more times. All animals were weighed daily and assessed visually. Tumor size was measured 3 times per week via calipers.
結果顯示,4次劑量之後,ZF17-MQ1顯著減小動物腫瘤負荷(圖43)。藥物假期之後,在約4週之後,小鼠的再治療引起完全的腫瘤耗乏(圖43)。此等資料表明,ZF17-MQ1可有效地減少免疫勝任動物中之HCC異種移植物的腫瘤負荷。The results showed that after 4 doses, ZF17-MQ1 significantly reduced the tumor burden of animals ( FIG. 43 ). Following the drug holiday, re-treatment of the mice resulted in complete tumor depletion after approximately 4 weeks (Fig. 43). These data demonstrate that ZF17-MQ1 can effectively reduce the tumor burden of HCC xenografts in immunocompetent animals.
實例43:作為潛在新穎的HCC療法,對MYC致癌基因進行表觀遺傳調節Example 43: Epigenetic modulation of the MYC oncogene as a potential novel HCC therapy
此實例描述藉由量測MYC mRNA及細胞存活率來表徵HCC細胞株(Hep 3B、Hep G2、SK-HEP-1、SNU-182及SNU-449)中的雙順反子ZF9-MQ1-ZF3-KRAB。This example describes the characterization of bicistronic ZF9-MQ1-ZF3 in HCC cell lines (
藉由量測MYC mRNA及細胞存活率來表徵HCC細胞株(Hep 3B、Hep G2、SK-HEP-1、SNU-182及SNU-449)中的雙順反子ZF9-MQ1_ZF3KRAB構築體。測試雙順反子ZF9-MQ1_ZF3KRAB的持久表觀遺傳(例如DNA/染色質甲基化)及總轉錄本(例如RNA-seq)變化。使用各種蛋白質體方法量測MYC蛋白質水準及路徑信號傳導的變化。最後,藉由評估腫瘤體積、腫瘤相關生物發光(BLI)及免疫組織化學(IHC)來分析雙順反子ZF9-MQ1_ZF3KRAB在皮下(subQ)及正位HCC模型中的活體內活性。The bicistronic ZF9-MQ1_ZF3KRAB construct in HCC cell lines (
吾等鑑別出包括雙順反子ZF9-MQ1_ZF3-KRAB在內的構築體,其靶向MYC IGD上的多個基因座且有效地減少HCC細胞的MYC mRNA、蛋白質及細胞存活率,同時避開正常細胞。在HCC細胞中,雙順反子ZF9-MQ1_ZF3KRAB的中值EC50就抑制MYC mRNA而言,為<0.001 ng/mL且就抑制細胞存活率而言,為120 ng/mL。重要的是,雙順反子ZF9-MQ1_ZF3KRAB的作用保持逾2週,從而提供持久的MYC mRNA抑制。截至第23天,與陰性對照相比,存在於LNP中之雙順反子ZF9-MQ1_ZF3KRAB以3及6 mg/kg Q5D靜脈內遞送於無胸腺裸小鼠之Hep 3B subQ模型中分別展現54%及63%之統計顯著性腫瘤生長抑制(TGI)。經雙順反子ZF9-MQ1_ZF3KRAB治療之小鼠的體重(BW)與陰性對照或索拉非尼治療之小鼠相比,未出現顯著降低。經雙順反子ZF9-MQ1_ZF3KRAB及對照物治療之腫瘤的IHC顯示MYC及Ki67 (增殖標記物)顯著下調以及凋亡蛋白酶3 (細胞凋亡標記物)上調。在Hep 3B正位模型中,3 mg/kg雙順反子ZF9-MQ1_ZF3KRAB Q5D顯示之BLI與50 mg/kg QD之索拉非尼出現類似的減少,而BW不降低。
實例44:小鼠替代構築體對LL2細胞之MYC表現及細胞存活率的影響
We identified constructs including the bicistronic ZF9-MQ1_ZF3-KRAB that target multiple loci on MYC IGD and potently reduce MYC mRNA, protein, and cell survival in HCC cells while avoiding normal cells. In HCC cells, the bicistronic ZF9-MQ1_ZF3KRAB had a median EC50 of <0.001 ng/mL for inhibition of MYC mRNA and 120 ng/mL for inhibition of cell viability. Importantly, the effect of the bicistronic ZF9-MQ1_ZF3KRAB was maintained for more than 2 weeks, providing durable MYC mRNA suppression. By
該實例係關於評價小鼠替代構築體(ZF15-MQ1、ZF16-MQ1及ZF17-MQ1)對LL2細胞之MYC表現及細胞存活率的影響。This example is about evaluating the effect of mouse alternative constructs (ZF15-MQ1 , ZF16-MQ1 and ZF17-MQ1 ) on MYC expression and cell viability of LL2 cells.
LL2細胞以每孔5,000個細胞接種於雙重複96孔盤中以進行mRNA提取或細胞存活率分析。用0.625 µg/mL (mRNA及存活率讀數)及1.25 µg/ml (mRNA及存活率讀數)的負載ZF15-MQ1、ZF16-MQ1或ZF17-MQ1之LNP處理96孔盤,重複三次,且培育。6孔盤每孔接種200,000個細胞且用1.25 µg/ml的上述構築體轉染(西方墨點法讀數)。經GFP處理之細胞用作陰性對照。培育之後,用CellTiter-GLO ®溶解一個96孔盤且使用GloMax ®Discovery讀盤器定量發光。用RLT Plus溶解緩衝液溶解第二個96孔盤,以便使用Rneasy ®Plus 96套組進行mRNA提取。使溶解的樣品結合至RNA管柱,用緩衝液洗滌且溶離出來。接著使用聚腺苷酸引子及RT Lunascript將mRNA轉化為cDNA。接著使用cDNA,使用對小鼠MYC mRNA轉錄本具有特異性的TaqMan TM探針(Thermo Fisher)進行逆轉錄聚合酶鏈反應(RT-PCR)分析。利用GAPDH mRNA轉錄本水準,對所有組進行標準化。進行西方墨點法時,β-肌動蛋白抗體用在594 nm波長下發射之螢光團標記的二級抗體染色,且MYC抗體(Abcam)用在488 nm波長下發射之螢光團標記的二級抗體偵測。利用近紅外(NIR)螢光的Odyssey ®CLx成像系統(LI-COR)用於擷取蛋白質影像。 LL2 cells were plated in duplicate 96-well plates at 5,000 cells per well for mRNA extraction or cell viability analysis. 96-well plates were treated with 0.625 µg/mL (mRNA and viability readout) and 1.25 µg/ml (mRNA and viability readout) of LNP loaded with ZF15-MQ1, ZF16-MQ1 or ZF17-MQ1 in triplicate and incubated. 6-well plates were seeded with 200,000 cells per well and transfected with 1.25 µg/ml of the above constructs (western blot read). GFP-treated cells were used as negative controls. After incubation, a 96-well plate was lysed with CellTiter-GLO ® and luminescence was quantified using a GloMax ® Discovery plate reader. Dissolve a second 96-well plate with RLT Plus Lysis Buffer for mRNA extraction using the Rneasy ® Plus 96 Kit. The solubilized sample was bound to an RNA column, washed with buffer and eluted. The mRNA was then converted to cDNA using a polyA primer and RT Lunascript. The cDNA was then used for reverse transcription polymerase chain reaction (RT-PCR) analysis using TaqMan ™ probes (Thermo Fisher) specific for mouse MYC mRNA transcripts. All groups were normalized using GAPDH mRNA transcript levels. For western blotting, the β-actin antibody was stained with a secondary antibody labeled with a fluorophore emitting at 594 nm, and the MYC antibody (Abcam) was labeled with a fluorophore emitting at 488 nm. Secondary antibody detection. The Odyssey ® CLx Imaging System (LI-COR) using near-infrared (NIR) fluorescence was used to capture protein images.
資料表明,在LL2細胞中,經ZF17-MQ1處理之細胞相較於未處理或經GFP處理之調節細胞顯示降低的MYC蛋白質水準(圖44A)。與未處理之細胞中所觀測的水準相比,ZF17-MQ1及ZF16-MQ1分別使LL2細胞中的MYC mRNA水準降低>99.9%或74% (圖44B)。另外,所有三種構築體皆能夠以大於未處理及經GFP處理之細胞的程度減小LL2細胞的細胞存活率(圖44C)。與未處理的細胞相比,構築體ZF17-MQ1、ZF16-MQ1及ZF15-MQ1分別使LL2細胞的存活率減小高達74%、65%及30%。 實例 45 : 經 ZF17 - MQ1 處理之 LL2 及 CT26 細胞中的 MYC 轉錄本下調 The data indicated that in LL2 cells, ZF17-MQ1 -treated cells showed reduced MYC protein levels compared to untreated or GFP-treated regulatory cells (Figure 44A). ZF17-MQ1 and ZF16-MQ1 reduced MYC mRNA levels in LL2 cells by >99.9% or 74%, respectively, compared to levels observed in untreated cells (Figure 44B). In addition, all three constructs were able to reduce cell viability of LL2 cells to a greater extent than untreated and GFP-treated cells (Fig. 44C). Constructs ZF17-MQ1 , ZF16-MQ1 and ZF15-MQ1 reduced the viability of LL2 cells by up to 74%, 65% and 30%, respectively, compared to untreated cells. Example 45 : Down-regulation of MYC transcripts in LL2 and CT26 cells treated with ZF17 - MQ1
此實例係關於評價ZF17-MQ1構築體下調CT26及LL2細胞之MYC mRNA表現且降低其細胞存活率的功效。This example is about evaluating the efficacy of the ZF17-MQ1 construct to downregulate MYC mRNA expression and reduce cell viability in CT26 and LL2 cells.
CT26及LL2細胞以每孔2,500個細胞接種於雙重複盤中以進行mRNA提取或細胞存活率分析。用1.25 µg/mL及2.5 µg/ml的負載ZF17-MQ1之LNP處理96孔盤,重複三次,且培育。未處理的細胞及經GFP處理之細胞用作對照。培育之後,用CellTiter-GLO ®溶解一個96孔盤且使用GloMax ®Discovery讀盤器定量發光。用RLT Plus溶解緩衝液溶解第二個96孔盤,以便使用Rneasy ®Plus 96套組進行mRNA提取。使溶解的樣品結合至RNA管柱,用緩衝液洗滌且溶離出來。接著使用聚腺苷酸引子及RT Lunascript將mRNA轉化為cDNA。接著使用cDNA,使用對小鼠MYC mRNA轉錄本具有特異性的TaqMan TM探針(Thermo Fisher)進行逆轉錄聚合酶鏈反應(RT-PCR)分析。利用GAPDH mRNA轉錄本水準,對所有組進行標準化。 CT26 and LL2 cells were plated in duplicate at 2,500 cells per well for mRNA extraction or cell viability analysis. 96-well plates were treated with 1.25 µg/mL and 2.5 µg/ml of ZF17-MQ1-loaded LNPs in triplicate and incubated. Untreated cells and GFP-treated cells were used as controls. After incubation, a 96-well plate was lysed with CellTiter-GLO ® and luminescence was quantified using a GloMax ® Discovery plate reader. Dissolve a second 96-well plate with RLT Plus Lysis Buffer for mRNA extraction using the Rneasy ® Plus 96 Kit. The solubilized sample was bound to an RNA column, washed with buffer and eluted. The mRNA was then converted to cDNA using a polyA primer and RT Lunascript. The cDNA was then used for reverse transcription polymerase chain reaction (RT-PCR) analysis using TaqMan ™ probes (Thermo Fisher) specific for mouse MYC mRNA transcripts. All groups were normalized using GAPDH mRNA transcript levels.
資料顯示ZF17-MQ1下調MYC mRNA且使LL2及CT26細胞之細胞存活率降低的程度大於未處理的細胞及經GFP處理的細胞(陰性對照組)。與未處理之細胞中所觀測的水準相比,ZF17-MQ1在2.5 µg/mL下分別使LL2及CT26細胞中的MYC mRNA水準降低93%及85% (圖45A)。另外,與未處理的細胞相比,在此等條件下,ZF17-MQ1使LL2及CT26細胞的細胞存活率分別降低87%及93% (圖45B)。 實例 46 : 經 ZF17 - MQ1 處理之 LL2 及 CMT167 細胞中的 MYC 轉錄本下調 The data showed that ZF17-MQ1 down-regulated MYC mRNA and decreased the cell viability of LL2 and CT26 cells to a greater extent than untreated cells and GFP-treated cells (negative control group). At 2.5 µg/mL, ZF17-MQ1 reduced MYC mRNA levels in LL2 and CT26 cells by 93% and 85%, respectively, compared to the levels observed in untreated cells (Fig. 45A). In addition, under these conditions, ZF17-MQ1 reduced the cell viability of LL2 and CT26 cells by 87% and 93%, respectively, compared to untreated cells ( FIG. 45B ). Example 46 : Down-regulation of MYC transcripts in LL2 and CMT167 cells treated with ZF17 - MQ1
此實例係關於評價ZF17-MQ1構築體下調CMT167及LL2細胞之MYC mRNA表現且降低其細胞存活率的功效。This example is about evaluating the efficacy of the ZF17-MQ1 construct to downregulate MYC mRNA expression and reduce cell viability in CMT167 and LL2 cells.
CMT167及LL2細胞以每孔5,000個細胞接種於雙重複盤中以進行mRNA提取或細胞存活率分析。用1.0 µg/mL的負載GFP、非編碼RNA或ZF17-MQ1之LNP處理96孔盤,重複三次,且培育。未處理的細胞及經GFP處理之細胞用作對照。培育之後,用CellTiter-GLO ®溶解一個96孔盤且使用GloMax ®Discovery讀盤器定量發光。用RLT Plus溶解緩衝液溶解第二個96孔盤,以便使用Rneasy ®Plus 96套組進行mRNA提取。使溶解的樣品結合至RNA管柱,用緩衝液洗滌且溶離出來。接著使用聚腺苷酸引子及RT Lunascript將mRNA轉化為cDNA。接著使用cDNA,使用對小鼠MYC mRNA轉錄本具有特異性的TaqMan TM探針(Thermo Fisher)進行逆轉錄聚合酶鏈反應(RT-PCR)分析。利用GAPDH mRNA轉錄本水準,對所有組進行標準化。 CMT167 and LL2 cells were plated in duplicate at 5,000 cells per well for mRNA extraction or cell viability analysis. The 96-well plate was treated with 1.0 μg/mL of LNP loaded with GFP, non-coding RNA or ZF17-MQ1, repeated three times, and incubated. Untreated cells and GFP-treated cells were used as controls. After incubation, a 96-well plate was lysed with CellTiter-GLO ® and luminescence was quantified using a GloMax ® Discovery plate reader. Dissolve a second 96-well plate with RLT Plus Lysis Buffer for mRNA extraction using the Rneasy ® Plus 96 Kit. The solubilized sample was bound to an RNA column, washed with buffer and eluted. The mRNA was then converted to cDNA using a polyA primer and RT Lunascript. The cDNA was then used for reverse transcription polymerase chain reaction (RT-PCR) analysis using TaqMan ™ probes (Thermo Fisher) specific for mouse MYC mRNA transcripts. All groups were normalized using GAPDH mRNA transcript levels.
資料顯示ZF17-MQ1下調CMT167及LL2細胞之MYC mRNA且使其細胞存活率降低的程度大於未處理的細胞及經GFP處理的細胞(陰性對照組)。與未處理之細胞中所觀測的水準相比,ZF17-MQ1分別使CMT167及LL2細胞中的MYC mRNA水準降低62%及73% (圖46)。另外,與未處理的細胞相比,在此等條件下,ZF17-MQ1使CMT167及LL2細胞的細胞存活率分別降低54%及57% (圖46)。 實例47:ZF9-MQ1對原代細胞存活率顯示的影響極小 The data showed that ZF17-MQ1 down-regulated the MYC mRNA of CMT167 and LL2 cells and reduced the cell viability to a greater extent than untreated cells and GFP-treated cells (negative control group). ZF17-MQ1 reduced MYC mRNA levels in CMT167 and LL2 cells by 62% and 73%, respectively, compared to the levels observed in untreated cells (Figure 46). In addition, under these conditions, ZF17-MQ1 reduced the cell viability of CMT167 and LL2 cells by 54% and 57%, respectively, compared to untreated cells ( FIG. 46 ). Example 47: ZF9-MQ1 shows minimal effect on primary cell viability
此實例評價ZF9-MQ1構築體對原代細胞存活率的影響。This example evaluates the effect of the ZF9-MQ1 construct on primary cell viability.
以每孔7,500個細胞(小氣管原代上皮細胞)或5000個細胞(肺葉原代上皮細胞及肺原代纖維母細胞)將小氣管原代上皮細胞、肺葉原代上皮細胞及肺原代纖維母細胞接種於雙重複盤中用於細胞存活率分析。用1.0 µg/mL的負載GFP或ZF9-MQ1之LNP處理96孔盤,重複三次,且培育。未處理的細胞及經GFP處理之細胞用作對照。培育之後,用CellTiter-GLO ®溶解一個96孔盤且使用GloMax ®Discovery讀盤器定量發光。用RLT Plus溶解緩衝液溶解第二個96孔盤,以便使用Rneasy ®Plus 96套組進行mRNA提取。使溶解的樣品結合至RNA管柱,用緩衝液洗滌且溶離出來。接著使用聚腺苷酸引子及RT Lunascript將mRNA轉化為cDNA。接著使用cDNA,使用對人類MYC mRNA轉錄本具有特異性的TaqMan TM探針(Thermo Fisher)進行逆轉錄聚合酶鏈反應(RT-PCR)分析。利用GAPDH mRNA轉錄本水準,對所有組進行標準化。 Primary small airway epithelial cells, primary pulmonary lobe epithelial cells, and primary lung fibroblasts were cultured at 7,500 cells (primary airway epithelial cells) or 5,000 cells (primary pulmonary lobe epithelial cells and primary pulmonary fibroblasts) per well. Blast cells were plated in duplicate for cell viability analysis. The 96-well plate was treated with 1.0 μg/mL of GFP- or ZF9-MQ1-loaded LNP, repeated three times, and incubated. Untreated cells and GFP-treated cells were used as controls. After incubation, a 96-well plate was lysed with CellTiter-GLO ® and luminescence was quantified using a GloMax ® Discovery plate reader. Dissolve a second 96-well plate with RLT Plus Lysis Buffer for mRNA extraction using the Rneasy ® Plus 96 Kit. The solubilized sample was bound to an RNA column, washed with buffer and eluted. The mRNA was then converted to cDNA using a polyA primer and RT Lunascript. The cDNA was then used for reverse transcription polymerase chain reaction (RT-PCR) analysis using TaqMan ™ probes (Thermo Fisher) specific for human MYC mRNA transcripts. All groups were normalized using GAPDH mRNA transcript levels.
資料顯示與未處理的細胞相比,ZF9-MQ1使小氣管原代上皮細胞、肺葉原代上皮細胞及肺原代纖維母細胞中的MYC mRNA水準分別下調94%、96%、96%水準(圖47)。然而,存活率比對照細胞僅降低16%、9%及22%,表明與針對H2009癌細胞的先前研究結果相比,ZF9-MQ1對正常肺上皮細胞或纖維母細胞之細胞存活率僅具有中等的影響。 實例48:ZF9-MQ1與JQ1的共處理對A549存活率顯示大於相加的效應 The data showed that compared with untreated cells, ZF9-MQ1 reduced the levels of MYC mRNA in small airway primary epithelial cells, lung lobe primary epithelial cells, and lung primary fibroblasts by 94%, 96%, and 96% respectively ( Figure 47). However, the survival rate was only reduced by 16%, 9% and 22% compared with the control cells, indicating that ZF9-MQ1 has only moderate effect on the cell survival rate of normal lung epithelial cells or fibroblasts compared with the results of previous studies on H2009 cancer cells. Impact. Example 48: Co-treatment of ZF9-MQ1 and JQ1 shows greater than additive effect on A549 survival
此實例評價ZF9-MQ1構築體當與不同濃度的JQ1抑制劑組合使用時,對A549細胞存活率的影響。This example evaluates the effect of ZF9-MQ1 constructs on A549 cell viability when used in combination with different concentrations of JQ1 inhibitors.
以每孔4,000個細胞將A549細胞接種於雙重複盤中用於細胞存活率分析。將0.5 µg/mL或1.0 µg/mL的負載GFP或ZF9-MQ1之LNP與濃度增加的BET抑制劑JQ1 (濃度直至6.25 μM)組合處理96孔盤,重複三次,且培育。未處理的細胞及經GFP處理之細胞用作對照。培育72小時之後,用CellTiter-GLO ®溶解一個96孔盤且使用GloMax ®Discovery讀盤器定量發光。 A549 cells were seeded in duplicate dishes at 4,000 cells per well for cell viability analysis. 96-well plates were treated with 0.5 µg/mL or 1.0 µg/mL of GFP- or ZF9-MQ1-loaded LNP in combination with increasing concentrations of the BET inhibitor JQ1 (concentration up to 6.25 µM) in triplicate and incubated. Untreated cells and GFP-treated cells were used as controls. After 72 hours of incubation, a 96-well plate was lysed with CellTiter-GLO ® and luminescence quantified using a GloMax ® Discovery plate reader.
資料顯示ZF9-MQ1及JQ1各自分別抑制A549細胞的細胞存活率(圖48A)。ZF9-MQ1 (0.5或1 µg/ml)與JQ1 (濃度直至6.25 μM)組合時,與根據其個別活性所預測的效應相比,對抑制A549存活率顯示出大於相加的效應,表明ZF9-MQ1與JQ1組合協同地抑制A549細胞的存活率(圖48B-48C)。 實例49:ZF9-MQ1與BET762的共處理對A549存活率顯示大於相加的效應 The data showed that each of ZF9-MQ1 and JQ1 inhibited the cell viability of A549 cells ( FIG. 48A ). ZF9-MQ1 (0.5 or 1 µg/ml) in combination with JQ1 (concentrations up to 6.25 μM) showed a greater than additive effect on inhibiting A549 survival compared to the effect predicted from their individual activities, suggesting that ZF9- The combination of MQ1 and JQ1 synergistically inhibited the survival of A549 cells (FIGS. 48B-48C). Example 49: Co-treatment of ZF9-MQ1 and BET762 shows greater than additive effect on A549 survival
此實例評價ZF9-MQ1構築體當與不同濃度的BET762抑制劑組合使用時,對A549細胞存活率的影響。This example evaluates the effect of ZF9-MQ1 constructs on A549 cell viability when used in combination with different concentrations of BET762 inhibitors.
以每孔4,000個細胞將A549細胞接種於雙重複盤中用於細胞存活率分析。將0.5 µg/mL或1.0 µg/mL的負載GFP或ZF9-MQ1之LNP與濃度增加(濃度直至1.25 μM)的BET抑制劑BET762組合處理96孔盤,重複三次,且培育。未處理的細胞及經GFP處理之細胞用作對照。培育72小時之後,用CellTiter-GLO ®溶解一個96孔盤且使用GloMax ®Discovery讀盤器定量發光。 A549 cells were seeded in duplicate dishes at 4,000 cells per well for cell viability analysis. 96-well plates were treated with 0.5 µg/mL or 1.0 µg/mL of GFP- or ZF9-MQ1-loaded LNP in combination with increasing concentrations (up to 1.25 μM) of the BET inhibitor BET762, three times, and incubated. Untreated cells and GFP-treated cells were used as controls. After 72 hours of incubation, a 96-well plate was lysed with CellTiter-GLO ® and luminescence quantified using a GloMax ® Discovery plate reader.
資料顯示ZF9-MQ1及BET762各自分別抑制A549細胞的細胞存活率(圖49A)。ZF9-MQ1 (0.5 µg/ml)與BET762 (對於經0.5 µg/ml ZF9-MQ1處理的細胞而言,濃度直至1.25uM,且對於經1.0 µg/ml ZF9-MQ1處理的細胞而言,濃度直至0.625 μM)當組合時,與根據其個別活性所預測的效應相比,對抑制A549存活率顯示大於相加的效應(圖49B)。ZF9-MQ1 (1.0 µg/ml)與BET762 (濃度至多0.625 µM)當組合時,與根據其個別活性所預測的效應相比,對抑制A549存活率顯示大於相加的效應(圖49C)。資料表明,ZF9-MQ1與BET762組合協同地抑制A549細胞的存活率(圖49B-C)。The data showed that ZF9-MQ1 and BET762 each inhibited the cell viability of A549 cells ( FIG. 49A ). ZF9-MQ1 (0.5 µg/ml) with BET762 (up to 1.25uM for cells treated with 0.5 µg/ml ZF9-MQ1 and up to 1.0 µg/ml for cells treated with ZF9-MQ1 0.625 μΜ) when combined showed a greater than additive effect on inhibiting A549 survival compared to the effect predicted from their individual activities (Figure 49B). ZF9-MQ1 (1.0 µg/ml) in combination with BET762 (concentrations up to 0.625 µM) showed a greater than additive effect on inhibiting A549 survival compared to the effect predicted from their individual activities (Fig. 49C). The data indicated that the combination of ZF9-MQ1 and BET762 synergistically inhibited the survival of A549 cells (Figure 49B-C).
實例50:ZF9-MQ1與必納昔布(Birabresib)的共處理對A549存活率顯示大於相加的效應Example 50: Co-treatment of ZF9-MQ1 with Birabresib shows a greater than additive effect on A549 survival
此實例評價ZF9-MQ1構築體當與不同濃度的BET抑制劑組合使用時,對A549細胞存活率的影響。This example evaluates the effect of the ZF9-MQ1 construct on A549 cell viability when used in combination with different concentrations of BET inhibitors.
以每孔4,000個細胞將A549細胞接種於雙重複盤中用於細胞存活率分析。用0.5 µg/mL或1.0 µg/mL的負載GFP或ZF9-MQ1之LNP與濃度增加(對於經0.5 µg /ml ZF9-MQ1處理的細胞而言,濃度直至0.625 μM;且對於經1.0 µg/ml ZF9-MQ1處理的細胞而言,濃度直至0.313 μM)之BET抑制劑必納昔布的組合對96孔盤進行處理,重複三次,且培育。未處理的細胞及經GFP處理之細胞用作對照。培育72小時之後,用CellTiter-GLO ®溶解一個96孔盤且使用GloMax ®Discovery讀盤器定量發光。 A549 cells were seeded in duplicate dishes at 4,000 cells per well for cell viability analysis. With 0.5 µg/mL or 1.0 µg/mL of GFP- or ZF9-MQ1-loaded LNP with increasing concentrations (up to 0.625 µM for cells treated with 0.5 µg/ml ZF9-MQ1; and for cells treated with 1.0 µg/ml For ZF9-MQ1-treated cells, 96-well plates were treated with combinations of the BET inhibitor binacoxib at concentrations up to 0.313 μM) in triplicate and incubated. Untreated cells and GFP-treated cells were used as controls. After 72 hours of incubation, a 96-well plate was lysed with CellTiter-GLO ® and luminescence quantified using a GloMax ® Discovery plate reader.
資料顯示ZF9-MQ1及必納昔布各自分別抑制A549細胞的細胞存活率(圖50A)。ZF9-MQ1 (0.5 µg/ml)與必納昔布(濃度至多0.625 µM)當組合時,與根據其個別活性所預測的效應相比,對抑制A549存活率顯示大於相加的效應(圖50B)。ZF9-MQ1 (1.0 µg/ml)與必納昔布(濃度至多0.313 µM)當組合時,與根據其個別活性所預測的效應相比,對抑制A549存活率顯示大於相加的效應(圖50C)。資料表明,ZF9-MQ1與必納昔布組合協同地抑制A549細胞的存活率(圖50B-50C)。 實例51:ZF9-MQ1與曲美替尼的共處理對A549存活率顯示大於相加的效應 The data showed that ZF9-MQ1 and binacoxib each inhibited the cell viability of A549 cells ( FIG. 50A ). ZF9-MQ1 (0.5 µg/ml) when combined with binacoxib (concentrations up to 0.625 µM) showed a greater than additive effect on the inhibition of A549 survival compared to the effect predicted from their individual activities (Figure 50B ). ZF9-MQ1 (1.0 µg/ml) in combination with binacoxib (concentrations up to 0.313 µM) showed a greater than additive effect on the inhibition of A549 survival compared to the effect predicted from their individual activities (Fig. 50C ). The data indicated that the combination of ZF9-MQ1 and binacoxib synergistically inhibited the viability of A549 cells (Figures 50B-50C). Example 51: Co-treatment of ZF9-MQ1 with trametinib shows greater than additive effect on A549 survival
此實例評價ZF9-MQ1構築體當與不同濃度的MEK抑制劑曲美替尼組合使用時,對A549細胞存活率的影響。This example evaluates the effect of the ZF9-MQ1 construct on A549 cell viability when used in combination with different concentrations of the MEK inhibitor Trametinib.
以每孔4,000個細胞將A549細胞接種於雙重複盤中用於細胞存活率分析。將0.5 µg/mL或1.0 µg/mL的負載GFP或ZF9-MQ1之LNP與濃度增加(濃度直至0.05 μM)的MEK抑制劑曲美替尼組合處理96孔盤,重複三次,且培育。未處理的細胞及經GFP處理之細胞用作對照。培育72小時之後,用CellTiter-GLO ®溶解一個96孔盤且使用GloMax ®Discovery讀盤器定量發光。 A549 cells were seeded in duplicate dishes at 4,000 cells per well for cell viability analysis. 96-well plates were treated with 0.5 µg/mL or 1.0 µg/mL of GFP- or ZF9-MQ1-loaded LNP in combination with increasing concentrations (up to 0.05 μM) of the MEK inhibitor trametinib in triplicate and incubated. Untreated cells and GFP-treated cells were used as controls. After 72 hours of incubation, a 96-well plate was lysed with CellTiter-GLO ® and luminescence quantified using a GloMax ® Discovery plate reader.
資料顯示ZF9-MQ1及曲美替尼各自分別抑制A549細胞的細胞存活率(圖51A)。ZF9-MQ1 (0.5 µg/ml)與曲美替尼(濃度至多0.05 µM)當組合時,與根據其個別活性所預測的效應相比,對抑制A549存活率顯示大於相加的效應(圖51B)。ZF9-MQ1 (1.0 µg/ml)與曲美替尼(濃度至多0.05 µM)當組合時,與根據其個別活性所預測的效應相比,對抑制A549存活率顯示大於相加的效應(圖51C)。資料表明,ZF9-MQ1與曲美替尼組合協同地抑制A549細胞的存活率(圖51B-51C)。 實例52:使用靶向超級增強子區域之一組ZF-KRAB構築體下調多種細胞株中之MYC的研究 The data showed that ZF9-MQ1 and trametinib each inhibited the cell viability of A549 cells ( FIG. 51A ). ZF9-MQ1 (0.5 µg/ml) when combined with trametinib (concentrations up to 0.05 µM) showed a greater than additive effect on inhibiting A549 survival compared to the effect predicted from their individual activities (Figure 51B ). ZF9-MQ1 (1.0 µg/ml) in combination with trametinib (concentrations up to 0.05 µM) showed a greater than additive effect on inhibiting A549 survival compared to the effect predicted from their individual activities (Fig. 51C ). The data indicated that the combination of ZF9-MQ1 and Trametinib synergistically inhibited the survival of A549 cells (FIGS. 51B-51C). Example 52: Down-regulation of MYC in various cell lines using a panel of ZF-KRAB constructs targeting one of the super-enhancer regions
此實例評價ZF9-MQ1及靶向超級增強子區域之一組鋅指構築體ZF9-MQ1、ZF54-KRAB、ZF61-KRAB、ZF67-KRAB及ZF68-KRAB下調H2009、H460及H226細胞中之MYC的作用。This example evaluates the ability of ZF9-MQ1 and a set of zinc finger constructs ZF9-MQ1 , ZF54-KRAB, ZF61-KRAB, ZF67-KRAB and ZF68-KRAB targeting super-enhancer regions to downregulate MYC in H2009, H460 and H226 cells effect.
以每孔5,000個細胞將H2009及H226細胞接種於雙重複盤中用於mRNA分析。用1.0 µg/mL的負載GFP、ZF9-MQ1、ZF54-KRAB、ZF67-KRAB或ZF68 KRAB之LNP處理96孔盤,重複三次,且培育。亦用1.0 µg/mL的負載非編碼RNA之LNP處理H226細胞,重複三次,且培育。以每孔5,000個細胞將H460細胞接種於雙重複盤中用於mRNA分析。用1.0 µg/mL的負載GFP、ZF9-MQ1、ZF61-KRAB、ZF67-KRAB或ZF68-KRAB的LNP處理96孔盤,重複三次,且培育。未處理的細胞及經GFP處理之細胞用作對照。培育72小時之後,用RLT Plus溶解緩衝液溶解細胞,以便使用Rneasy ®Plus 96套組進行mRNA提取。使溶解的樣品結合至RNA管柱,用緩衝液洗滌且溶離出來。接著使用聚腺苷酸引子及RT Lunascript將mRNA轉化為cDNA。接著使用cDNA,使用對人類MYC mRNA轉錄本具有特異性的TaqMan TM探針(Thermo Fisher)進行逆轉錄聚合酶鏈反應(RT-PCR)分析。利用GAPDH mRNA轉錄本水準,對所有組進行標準化。 H2009 and H226 cells were seeded in duplicate dishes at 5,000 cells per well for mRNA analysis. The 96-well plate was treated with 1.0 μg/mL of LNP loaded with GFP, ZF9-MQ1, ZF54-KRAB, ZF67-KRAB or ZF68 KRAB, repeated three times, and incubated. H226 cells were also treated with 1.0 µg/mL of non-coding RNA-loaded LNP three times and incubated. H460 cells were seeded in duplicate dishes at 5,000 cells per well for mRNA analysis. The 96-well plate was treated with 1.0 µg/mL of LNP loaded with GFP, ZF9-MQ1, ZF61-KRAB, ZF67-KRAB or ZF68-KRAB in triplicate and incubated. Untreated cells and GFP-treated cells were used as controls. After 72 hours of incubation, cells were lysed with RLT Plus Lysis Buffer for mRNA extraction using the RNeasy ® Plus 96 Kit. The solubilized sample was bound to an RNA column, washed with buffer and eluted. The mRNA was then converted to cDNA using a polyA primer and RT Lunascript. The cDNA was then used for reverse transcription polymerase chain reaction (RT-PCR) analysis using TaqMan ™ probes (Thermo Fisher) specific for human MYC mRNA transcripts. All groups were normalized using GAPDH mRNA transcript levels.
資料顯示,相較於未處理的細胞,ZF9-MQ1、ZF54-KRAB、ZF67-KRAB及ZF68-KRAB使H2009中的MYC mRNA水準下調至少42% (圖52A)且使H226細胞中的MYC mRNA水準下調至少27% (圖52B-52C)。另外,相較於未處理的細胞,ZF9-MQ1、ZF61-KRAB、ZF67-KRAB及ZF68-KRAB使H460細胞中的MYC mRNA水準下調至少26% (圖52D)。 實例53:相較於單獨的ZF9-MQ1,ZF54-KRAB與ZF9-MQ1的共處理使MYC mRNA水準進一步降低 The data showed that ZF9-MQ1 , ZF54-KRAB, ZF67-KRAB, and ZF68-KRAB downregulated MYC mRNA levels in H2009 by at least 42% compared to untreated cells ( FIG. 52A ) and upregulated MYC mRNA levels in H226 cells. Downregulated by at least 27% (FIGS. 52B-52C). In addition, ZF9-MQ1 , ZF61-KRAB, ZF67-KRAB and ZF68-KRAB downregulated MYC mRNA levels by at least 26% in H460 cells compared to untreated cells ( FIG. 52D ). Example 53: Co-treatment of ZF54-KRAB with ZF9-MQ1 further reduces MYC mRNA levels compared to ZF9-MQ1 alone
此實例評價ZF9-MQ1與ZF54-KRAB組合療法下調H2009細胞中之MYC的影響。This example evaluates the effect of ZF9-MQ1 and ZF54-KRAB combination therapy to downregulate MYC in H2009 cells.
以每孔5,000個細胞將H2009細胞接種於96孔盤中用於mRNA分析。盤壁用1.0 µg/mL或2.0 µg/mL GFP、ZF9-MQ1、負載式LNP與濃度增加之負載ZF9-MQ1之LNP組合處理,重複三次,且培育72小時。未處理的細胞及經GFP處理之細胞用作對照。培育之後,用RLT Plus溶解緩衝液溶解細胞以便使用Rneasy ®Plus 96套組進行mRNA提取。使溶解的樣品結合至RNA管柱,用緩衝液洗滌且溶離出來。接著使用聚腺苷酸引子及RT Lunascript將mRNA轉化為cDNA。接著使用cDNA,使用對人類MYC mRNA轉錄本具有特異性的TaqMan TM探針(Thermo Fisher)進行逆轉錄聚合酶鏈反應(RT-PCR)分析。利用GAPDH mRNA轉錄本水準,對所有組進行標準化。 H2009 cells were seeded in 96-well plates at 5,000 cells per well for mRNA analysis. Disc walls were treated with 1.0 µg/mL or 2.0 µg/mL of GFP, ZF9-MQ1, loaded LNP, and increasing concentrations of ZF9-MQ1-loaded LNP, repeated three times, and incubated for 72 hours. Untreated cells and GFP-treated cells were used as controls. After incubation, cells were lysed with RLT Plus Lysis Buffer for mRNA extraction using the RNeasy ® Plus 96 Kit. The solubilized sample was bound to an RNA column, washed with buffer and eluted. The mRNA was then converted to cDNA using a polyA primer and RT Lunascript. The cDNA was then used for reverse transcription polymerase chain reaction (RT-PCR) analysis using TaqMan ™ probes (Thermo Fisher) specific for human MYC mRNA transcripts. All groups were normalized using GAPDH mRNA transcript levels.
資料顯示相對於未處理的對照細胞,ZF9-MQ1及ZF54-KRAB在所測試的最高濃度下各自分別下調H2009細胞中的MYC mRNA,分別下調99%或62%。當小於0.313 µg/mL的ZF9-MQ1與1或2 µg/mL的ZF54-KRAB組合時,MYC mRNA下調的程度大於單獨處理所觀測到的下調程度,表明ZF9-MQ1及ZF54-KRAB各自對活體外下調H2009細胞株中之MYC mRNA水準有貢獻(圖53)。 實例54:ZF9-MQ1與ZF54-KRAB的1:1組合在長達至少6天的所有時間點抑制MYC表現 The data showed that ZF9-MQ1 and ZF54-KRAB each down-regulated MYC mRNA in H2009 cells by 99% or 62%, respectively, at the highest concentrations tested, relative to untreated control cells. When less than 0.313 µg/mL of ZF9-MQ1 was combined with 1 or 2 µg/mL of ZF54-KRAB, the degree of MYC mRNA downregulation was greater than that observed with individual treatments, indicating that ZF9-MQ1 and ZF54-KRAB each had External down-regulation of MYC mRNA levels in H2009 cell lines contributed ( FIG. 53 ). Example 54: 1:1 combination of ZF9-MQ1 and ZF54-KRAB inhibits MYC expression at all time points up to at least 6 days
此實例評價ZF9-MQ1與ZF54-KRAB組合療法下調H1299細胞中之MYC的耐久性。This example evaluates the durability of ZF9-MQ1 and ZF54-KRAB combination therapy to downregulate MYC in H1299 cells.
以每孔10,000個細胞將H1299細胞接種於96孔盤中用於mRNA分析。96孔盤壁用1.0 µg/mL的負載GFP、ZF54-KRAB或ZF9-MQ1 + ZF54-KRAB之LNP處理,重複三次,且培育。未處理的細胞及經GFP處理之細胞用作對照。轉染後的第6小時、第1天、第2天、第3天或第6天進行量測。培育之後,用RLT Plus溶解緩衝液溶解一個96孔盤,以便使用Rneasy ®Plus 96套組進行mRNA提取。使溶解的樣品結合至RNA管柱,用緩衝液洗滌且溶離出來。接著使用聚腺苷酸引子及RT Lunascript將mRNA轉化為cDNA。接著使用cDNA,使用對人類MYC mRNA轉錄本具有特異性的TaqMan TM探針(Thermo Fisher)進行逆轉錄聚合酶鏈反應(RT-PCR)分析。利用GAPDH mRNA轉錄本水準,對所有組進行標準化。 H1299 cells were seeded in 96-well plates at 10,000 cells per well for mRNA analysis. The walls of 96-well plates were treated with 1.0 μg/mL of LNP loaded with GFP, ZF54-KRAB or ZF9-MQ1 + ZF54-KRAB, repeated three times, and incubated. Untreated cells and GFP-treated cells were used as controls. Measurements were taken at 6 hours, 1 day, 2 days, 3 days or 6 days after transfection. After incubation, lyse a 96-well plate with RLT Plus Lysis Buffer for mRNA extraction using the Rneasy ® Plus 96 Kit. The solubilized sample was bound to an RNA column, washed with buffer and eluted. The mRNA was then converted to cDNA using a polyA primer and RT Lunascript. The cDNA was then used for reverse transcription polymerase chain reaction (RT-PCR) analysis using TaqMan ™ probes (Thermo Fisher) specific for human MYC mRNA transcripts. All groups were normalized using GAPDH mRNA transcript levels.
資料顯示相對於未處理的對照細胞,ZF9-MQ1在第48小時使H1299細胞中的MYC mRNA下調95%且在第144小時將下調維持在對照水準的90%。ZF9-MQ1 + ZF54-KRAB組合第48小時使MYC mRNA水準降低至98%且在第144小時將下調維持在對照水準的93% (圖54)。另外,資料顯示ZF9-MQ1及ZF9-MQ1與ZF54-KRAB的組合使H1299細胞中的MYC mRNA水準下調至少6天(圖54)。 實例 55 : 雙順反子 ZF9 - MQ1 _ ZF54 - KRAB 比 ZF9 - MQ1 更早地 抑制 MYC 水準 The data show that ZF9-MQ1 downregulates MYC mRNA in H1299 cells by 95% at 48 hours relative to untreated control cells and maintains the downregulation at 90% of control levels at 144 hours. The ZF9-MQ1 + ZF54-KRAB combination reduced MYC mRNA levels to 98% at 48 hours and maintained the downregulation at 93% of control levels at 144 hours (Figure 54). In addition, the data showed that ZF9-MQ1 and the combination of ZF9-MQ1 and ZF54-KRAB downregulated MYC mRNA levels in H1299 cells for at least 6 days (Figure 54). Example 55 : Bicistronic ZF9 - MQ1_ZF54 - KRAB Suppresses MYC Levels Earlier Than ZF9 - MQ1 _ _
此實例評價與單獨的ZF9-MQ1及ZF54-KRAB構築體相比,雙順反子構築體ZF9-MQ1_ZF54-KRAB及ZF54-KRAB_ZF9-MQ1的功效。This example evaluates the efficacy of the bicistronic constructs ZF9-MQ1_ZF54-KRAB and ZF54-KRAB_ZF9-MQ1 compared to the ZF9-MQ1 and ZF54-KRAB constructs alone.
以每孔5,000個細胞將H2009細胞接種於96孔盤中用於mRNA分析。96孔盤壁用0.5 µg/mL或1.0 µg/mL的負載GFP、ZF9-MQ1、ZF54-KRAB、ZF9-MQ1_ZF54-KRAB或ZF54-KRAB_ZF9-MQ1之LNP處理,重複三次,且培育。未處理的細胞及經GFP處理之細胞用作對照。在轉染後第24小時及第48小時進行量測。培育之後,用RLT Plus溶解緩衝液溶解一個96孔盤,以便使用Rneasy ®Plus 96套組進行mRNA提取。使溶解的樣品結合至RNA管柱,用緩衝液洗滌且溶離出來。接著使用聚腺苷酸引子及RT Lunascript將mRNA轉化為cDNA。接著使用cDNA,使用對人類MYC mRNA轉錄本具有特異性的TaqMan TM探針(Thermo Fisher)進行逆轉錄聚合酶鏈反應(RT-PCR)分析。利用GAPDH mRNA轉錄本水準,對所有組進行標準化。 H2009 cells were seeded in 96-well plates at 5,000 cells per well for mRNA analysis. The wall of the 96-well plate was treated with 0.5 μg/mL or 1.0 μg/mL of LNP loaded with GFP, ZF9-MQ1, ZF54-KRAB, ZF9-MQ1_ZF54-KRAB or ZF54-KRAB_ZF9-MQ1, repeated three times, and incubated. Untreated cells and GFP-treated cells were used as controls. Measurements were taken at 24 hours and 48 hours after transfection. After incubation, lyse a 96-well plate with RLT Plus Lysis Buffer for mRNA extraction using the Rneasy ® Plus 96 Kit. The solubilized sample was bound to an RNA column, washed with buffer and eluted. The mRNA was then converted to cDNA using a polyA primer and RT Lunascript. The cDNA was then used for reverse transcription polymerase chain reaction (RT-PCR) analysis using TaqMan ™ probes (Thermo Fisher) specific for human MYC mRNA transcripts. All groups were normalized using GAPDH mRNA transcript levels.
資料顯示相較於未處理的細胞,ZF9-MQ1及ZF54-KRAB在引入H2009細胞之後的第24小時,使MYC mRNA水準分別下調高達83%及55%。在處理之後的第48小時,經ZF9-MQ1處理之細胞中的MYC mRNA水準進一步降低另外13%,為未處理對照組的96%,而ZF54-KRAB未使MYC水準進一步下調。在處理後的第24小時,經ZF9-MQ1_ZF54-KRAB及ZF54-KRAB_ZF9-MQ1處理之細胞中的MYC mRNA水準分別降低至對照細胞的95%及96%。資料表明此等控制劑能夠比ZF9-MQ1更早地降低H2009細胞中的MYC mRNA水準,在第24小時使經處理之細胞中的MYC下調程度大於經ZF9-MQ1處理的細胞。(圖55)。 實例56:ZF9-MQ1療法抑制帶有H460 SQ腫瘤之小鼠中之腫瘤生長的作用 The data showed that compared with untreated cells, ZF9-MQ1 and ZF54-KRAB down-regulated MYC mRNA levels by up to 83% and 55%, respectively, at 24 hours after introduction into H2009 cells. At 48 hours after treatment, MYC mRNA levels in ZF9-MQ1-treated cells were further reduced by another 13%, 96% of untreated controls, while ZF54-KRAB did not further down-regulate MYC levels. At 24 hours after treatment, MYC mRNA levels in ZF9-MQ1_ZF54-KRAB and ZF54-KRAB_ZF9-MQ1-treated cells were reduced to 95% and 96% of control cells, respectively. The data indicated that these control agents were able to reduce MYC mRNA levels in H2009 cells earlier than ZF9-MQ1 , downregulating MYC in treated cells to a greater extent than ZF9-MQ1 -treated cells at 24 hours. (Figure 55). Example 56: Effect of ZF9-MQ1 Therapy Inhibits Tumor Growth in Mice Bearing H460 SQ Tumors
此實例表明,ZF9-MQ1抑制裸小鼠中所建立的皮下H460腫瘤生長。This example demonstrates that ZF9-MQ1 inhibits the growth of established subcutaneous H460 tumors in nude mice.
藉由將皮下H460腫瘤細胞植入左側腹中來誘導裸小鼠發生疾病。當平均腫瘤體積達到約100-150 mm 3時,開始治療。將小鼠分成治療組,使得各組之平均腫瘤體積大致相等。用PBS (媒劑)、索拉非尼(標準照護療法)、調配於MC3 LNP中的非編碼RNA或調配於MC3 LNP中的ZF9-MQ1 mRNA治療腫瘤。PBS、非編碼RNA (SEQ ID NO: 198) LNP及ZF9-MQ1 LNP以3 mg/kg給與,每五天投與一次。每日給與50 mg/kg索拉非尼。每週量測腫瘤長度及寬度兩次。腫瘤體積按照(寬度2×長度)/2計算。 Disease was induced in nude mice by subcutaneous implantation of H460 tumor cells into the left flank. Treatment was initiated when the average tumor volume reached approximately 100-150 mm. Mice were divided into treatment groups such that the mean tumor volumes in each group were approximately equal. Tumors were treated with PBS (vehicle), sorafenib (standard of care therapy), noncoding RNA formulated in MC3 LNP, or ZF9-MQ1 mRNA formulated in MC3 LNP. PBS, non-coding RNA (SEQ ID NO: 198) LNP and ZF9-MQ1 LNP were administered at 3 mg/kg every five days. Give 50 mg/kg sorafenib daily. Tumor length and width were measured twice a week. Tumor volume was calculated as (width2×length)/2.
結果顯示ZF9-MQ1療法抑制H460皮下腫瘤模型的腫瘤生長。PBS治療組、mRNA陰性對照物治療組及ZF09-MQ1治療組在研究結束時的平均腫瘤體積分別為1921 mm 3、1829 mm 3及702 mm 3。索拉非尼治療之小鼠的平均腫瘤體積(752mm 3)與ZF9-MQ1-LNP治療之小鼠不可區分,表明ZF9-MQ1在此模型中展現與標準照護療法藥劑索拉非尼相似或更好的活性。此等結果表明,相較於陰性對照組,負載ZF9-MQ1的LNP有效地抑制活體內H460皮下腫瘤模型中的腫瘤生長(圖56)。 The results showed that ZF9-MQ1 therapy inhibited tumor growth in the H460 subcutaneous tumor model. The mean tumor volumes at the end of the study were 1921 mm 3 , 1829 mm 3 and 702 mm 3 in the PBS-treated group, the mRNA negative control group and the ZF09-MQ1-treated group, respectively. The mean tumor volume (752 mm 3 ) of Sorafenib-treated mice was indistinguishable from that of ZF9-MQ1-LNP-treated mice, suggesting that ZF9-MQ1 exhibits similar or greater good activity. These results indicated that ZF9-MQ1 loaded LNPs effectively inhibited tumor growth in the in vivo H460 subcutaneous tumor model compared to the negative control group ( FIG. 56 ).
等效物應瞭解,雖然本發明已結合其實施方式描述,但前述描述意欲說明且不限制本發明之範疇,本發明之範疇係由所附申請專利範圍之範疇限定。一些態樣、優點及潤飾屬於以下申請專利範圍之範疇內。 EQUIVALENTS It should be understood that while the invention has been described in conjunction with embodiments thereof, the foregoing description is intended to illustrate and not limit the scope of the invention, which is defined by the scope of the appended claims. Some aspects, advantages and modifications belong to the scope of the following claims.
本發明實施例之以下詳細描述當結合附圖閱讀時將得到更好的理解。為了說明本發明之目的,附圖中展示當前例示性實施例。然而,應瞭解,本發明不限於附圖中所示實施例之精確佈置及執行。
圖1A描繪基於MYC啟動子之持久嵌段及CTCF/TF位點之短暫(48/72小時)嵌段之雙目標方法的示意圖,該持久嵌段使用與DNA甲基轉移酶融合的DBD,該短暫嵌段使用DBD或與短期效應子融合的DBD。
圖1B描繪MYC基因之靶點(CTCF及啟動子)處的嚮導RNA定位及染色質環境。自上而下,圖形表示HepG2細胞之MYC基因座中的H3K4me3 (組蛋白H3 Ky三甲基化)程度;H3K9me3 (組蛋白H3 K9三甲基化)程度(複本1);H3K9me3 (組蛋白H3 K9三甲基化)程度(複本2);H3K27me3 (組蛋白H3 K27三甲基化)程度;H3K27ac (組蛋白H3 K27乙醯化)程度;GROseq_正向股程度(轉錄活性RNA pol II對正向股的結合);GROseq_反向股程度(轉錄活性RNA pol II對反向股的結合);RNAseq_rep2含量(使用RNAseq量測的MYC轉錄本含量,複本2);如藉由WGBS (全基因體亞硫酸氫鹽定序)所量測的DNA甲基化程度,及CTCF結合程度。四種gRNA的位置用箭頭指示。gRNA GD-28859、GD-28616、GD-28862靶向MYC上游之錨定位點或其附近,且gRNA GD-28617靶向MYC啟動子。在本發明中,GD-28859亦稱為GD-59;GD-28616亦稱為GD-16;GD-28862亦稱為GD-62;且GD-28617亦稱為GD-17。
圖1C顯示例示性雙順反子構築體之示意圖。構築體的5'端具有由N7-甲基化鳥苷限定的帽結構,該鳥苷經由反向5'至5'三磷酸酯鍵聯連接至mRNA的第一核苷酸。在一些實施例中,該帽結構促進蛋白質轉譯及穩定性。帽結構下游為設計成促進高水準蛋白質轉譯的非轉譯區(5' UTR),該非轉譯區之後為典型的「Kozak」序列,該序列被核糖體識別以起始蛋白質的轉譯。繼「Kozak」序列之後為CDS,CDS為單一連續序列,其包含含有第一靶向部分及第一效應部分的第一表現抑制子以及含有經tPT2A「核糖體跳讀」序列(連接子)分隔之第二靶向部分及第二效應部分的第二表現抑制子。不希望受理論所束縛,當核糖體到達tPT2A連接子時,其開始將連接子轉譯成胺基酸。由P2A連接子產生的前18個胺基酸仍處於第一表現抑制子(例如包含ZF DBD及MQ1)的C端,核糖體接著將其釋放。核糖體接著繼續行進直至其到達T2A連接子,且將T2A連接子的前17個殘基轉譯且釋放。接著轉譯包含單一胺基酸的第二多肽,且接著開始轉譯第二表現抑制子(例如包含第二ZF DBD及KRAB)。CDS之後為設計成有助於高水準轉譯以及使mRNA穩定的3' UTR。最後,mRNA的最末端3'端為聚腺苷酸尾。在一些實施例中,聚腺苷酸尾促進蛋白質轉譯及mRNA穩定性。
圖2A顯示Cas9-核酸酶對CTCF模體的編輯引起MYC表現下調。Cas9 (與GD-28616的組合)對CTCF模體的破壞引起所有三種HCC細胞株(HepG2、Hep3B及SKHEP1)中的MYC表現下調32-39%。對CTCF模體相鄰區域(GD-28859)的破壞將三種細胞株中之兩者(HepG2及Hep3B)中的MYC表現調控35-45%。
圖2B顯示如藉由AmpSeq所評估的編輯效率,證實所有細胞株被編輯77-100%。
圖3顯示dCas9-KRAB當針對啟動子或相連的CTCF模體時,下調MYC表現。在48/72小時時間點,LNP介導dCas9-KRAB/GD-28616轉染使Hep3B及SKHEP1中的MYC表現下調11-34%。在48/72小時時間點,LNP介導dCas9-KRAB/GD-28859轉染使所有3種HCC模型中的MYC表現下調18-44%。在48/72小時時間點,經由dCas9-KRAB/GD-28617將dCas9-KRAB引向MYC啟動子使所有3種HCC模型中的MYC表現下調24-58%。
圖4A描繪與啟動子相連之CTCF模體處的sgRNA定位及鋅指設計。圖式揭示SEQ ID NO: 208。
圖4B顯示針對與啟動子相連之CTCF的ZF-KRAB構築體實現了Hep3B中的MYC下調。ZF2-KRAB、ZF3-KRAB及ZF4-KRAB使Hep3B細胞中之MYC下調的程度等效於或大於dCas9-KRAB/GD-28859,其中ZF3-KRAB具有最強的下調作用。
圖4C顯示ZF3-非效應子及ZF3-KRAB使多種人類HCC模型(HepG2、Hep3B及SKHEP1)中的MYC表現下調。在其他兩種HCC模型HepG2及SKHEP1中,ZF3-KRAB亦顯示使MYC下調的程度等效於或大於ZF3-非效應子及ZF5-非效應子。
圖4D顯示在不同時間點(24小時、72小時及120小時),ZF3-非效應子及ZF3-KRAB對Hep3B細胞的MYC表現及存活率展現等效作用。
圖5顯示dCas9-MQ1當靶向多種HCC模型(HepG2、Hep3B及SKHEP1)中的MYC啟動子時,下調MYC表現。
圖6A描繪MYC啟動子處的sgRNA定位及鋅指設計。圖式揭示SEQ ID NO: 209。
圖6B描繪針對MYC啟動子之6種ZF-MQ1構築體,根據Hep3B中之MYC下調作用加以篩選。ZF8-MQ1、ZF9-MQ1及ZF11-MQ1使Hep3B細胞中的MYC下調至最大程度,其中ZF9-KRAB MQ1具有最強的下調作用。
圖7A顯示與ZF12-MQ1相比,ZF9-MQ1顯著下調MYC表現且減小Hep3B存活率。
圖7B顯示與ZF12-MQ1相比,ZF9-MQ1顯著下調MYC表現且減小HepG2的存活率。
圖7C顯示與ZF12-MQ1相比,ZF9-MQ1顯著下調MYC表現且減小SKHEP1的存活率。
圖7D顯示與ZF8-MQ1相比,ZF9-MQ1更有效地下調MYC表現且減小Hep3B的存活率。
圖7E顯示與ZF8-MQ1相比,ZF9-MQ1更有效地下調MYC表現且減小HepG2的存活率。
圖7F顯示與ZF8-MQ1相比,ZF9-MQ1更有效地下調MYC表現且減小SKHEP1的存活率。
圖7G顯示與dCas9-MQ1/GD17相比,ZF9-MQ1顯著下調MYC表現且減小Hep3B的存活率。
圖7H顯示與dCas9-MQ1/GD17相比,ZF9-MQ1顯著下調MYC表現且減小HepG2的存活率。
圖7I顯示與dCas9-MQ1/GD17相比,ZF9-MQ1顯著下調MYC表現且減小SKHEP1的存活率。
圖8A顯示dCas9-MQ1在第72小時使所有三種細胞株(Hep3B、HepG2及SKHEP1)中的mRNA實現50-90%減少。
圖8B顯示儘管MYC下調對SK-HEP-1存活率的影響最小,但在第72小時及第168小時,MYC下調使HepG2及Hep3B的存活率顯著降低。
圖8C顯示MYC mRNA在第7天及第11天分別減少約70%及約55%。遠至第15天,轉錄本維持約40%的下調。
圖8D顯示用dCas9-MQ1/GD-28617處理將從頭甲基化引向標靶區域且此等轉錄變化與標靶區域中的CpG甲基化百分比緊密相關且證實甲基化保持至第15天。
圖9顯示用dCas9-MQ1/GD-17處理抑制活體內腫瘤生長。
圖10顯示dCas9-MQ1/GD-17在B型肝炎感染之背景下使人類肝細胞中的MYC下調。
圖11顯示靶向與CTCF模體直接鄰接之上游區域的與鋅指域融合之KRAB效應子(或非效應子或NE)(ZF3-NE或ZF3-KRAB)且靶向MYC啟動子的與鋅指域融合之MQ1效應子(ZF9-MQ1)下調MYC1 mRNA表現。
圖12顯示ZF3-KRAB + ZF9-MQ1使MYC下調的程度大於單獨的ZF9-MQ1,或ZF3-NE + ZF9-MQ1組合。
圖13A顯示設計成結合且靶向MYC啟動子的ZF9-MQ1以多種濃度給與五種HCC細胞株:Hep3B、HepG2、SKHEP1、SNU-182及SNU-449。
圖13B-F顯示ZF9-MQ1下調MYC表現且減小所測試之所有五種HCC細胞株的存活率且ZF9-MQ1以0.028 μg/ml LNP/mRNA之中值EC50下調MYC表現,且在活體外、在第72小時對HepG2細胞株的存活率具有約10倍高的中值EC50效應(0.13 μg/ml)。
圖14顯示與PBS對照物治療之小鼠相比,ZF9-MQ1能夠顯著減少腫瘤生長(自第6天起)且ZF9-MQ1減少腫瘤生長超過小分子比較物(MYCi975)(A)。與PBS或MYCi975 (B)相比,ZF9-MQ1對整個動物體重的影響最小。
圖15A顯示1.5 mg/kg (每5天一次,2次劑量)、3 mg/kg (每5天一次,3次劑量)、3 mg/kg (每3天一次,1次劑量)的ZF9-MQ1與ZF3-KRAB組合使腫瘤生長減少的程度與索拉非尼(sorafenib)類似。
圖15B顯示與索拉非尼治療對整個動物體重的影響相比,用ZF9-MQ1與ZF3-KRAB之組合治療對整個動物體重的影響最小。
圖16A顯示與陰性對照物治療之小鼠相比,ZF9-MQ1 (自第13天起)及ZF9-MQ1與ZF3-KRAB之共調配物(自第6天起)在1 mg/kg下能夠顯著減少腫瘤生長。
圖16B顯示與陰性對照物治療之小鼠相比,ZF9-MQ1個別地及ZF9-MQ1與ZF3-KRAB之共調配物在3 mg/kg下能夠減少腫瘤生長。
圖16C顯示ZF9-MQ1與ZF3-KRAB之共調配物在1 mg/kg與3 mg/kg劑量下,能夠以類似於或大於順鉑(cisplatin)或小分子比較物(MYCi975)的程度減少腫瘤生長。
圖16D顯示與順鉑或MYCi975治療對整個動物體重的影響相比,ZF9-MQ1與ZF3-KRAB之共調配物在1 mg/kg與3 mg/kg劑量下對整個動物體重的影響最小。
圖17A顯示處理後第120小時,ZF9-MQ1使A549細胞株中的MYC mRNA水準降低逾80%。
圖17B顯示處理後第120小時,ZF9-MQ1使NCI-H2009細胞株中的MYC mRNA水準降低逾80%。
圖17C顯示處理後第120小時,ZF9-MQ1使NCI-H358細胞株中的MYC mRNA水準降低逾80%。
圖17D顯示處理後第72小時,ZF9-MQ1使HCC95細胞株中的MYC mRNA水準降低逾80%。
圖17E顯示處理後第120小時,ZF9-MQ1使得A549細胞株的細胞存活率降低。
圖17F顯示處理後第120小時,ZF9-MQ1使得NCI-H2009細胞株的細胞存活率降低。
圖17G顯示處理後第120小時,ZF9-MQ1使得NCI-H358細胞株的細胞存活率降低。
圖17H顯示處理後第72小時,ZF9-MQ1使得HCC95細胞株的細胞存活率降低。
圖18A顯示處理後第96小時,未處理細胞群中約17.5%細胞發生細胞凋亡。
圖18B顯示處理後第96小時,經ZF9-NE處理之細胞群中約18%細胞發生細胞凋亡。
圖18C顯示處理後第96小時,經ZF9-MQ1處理之細胞群中約38.9%細胞發生細胞凋亡。
圖18D顯示處理後第96小時,經ZF9-MQ1處理之細胞群中約38.9%細胞發生細胞凋亡,相比之下,未處理細胞與經ZF9-NE處理之細胞群中約18%細胞發生細胞凋亡,表明ZF9-MQ1能夠誘導肺癌細胞發生細胞凋亡。
圖19 A及B顯示ZF9-MQ1以0.08 μg/ml LNP/mRNA之EC50下調MYC,且在活體外、在第72小時對此等A549 (圖19A)及HCC95 (圖19B)細胞株的存活率具有約25倍高的EC50效應(2 μg/ml)。
圖20A及B顯示肺癌細胞株處理後第96小時,ZF9-MQ1處理使MYC蛋白含量降低逾80%。
圖21顯示與PBS對照物治療之小鼠相比,ZF9-MQ1能夠顯著減少腫瘤生長(自第8天起)。亦觀測到ZF9-MQ1對整個動物體重的影響最小。
圖22顯示嚮導RNA GD-29833及29914當與dCAS9-KRAB效應子mRNA一起使用LNP以SSOP形式遞送時,可有效地下調MYC mRNA水準,突顯了使用此遠端調控元件減少致癌MYC的能力。
圖23顯示嚮導RNA GD-29833及29914當與dCAS9-KRAB效應子mRNA一起使用LNP以MC3形式遞送時,可有效地下調MYC mRNA水準,突顯了使用此遠端調控元件減少致癌MYC的能力。
圖24A顯示嚮導RNA GD-29833及29914當與所有3種效應蛋白(EZH2、EZH2-KRAB及MQ1)一起遞送時,可有效地下調A549細胞株中的MYC mRNA水準。
圖24B顯示嚮導RNA GD-29833及29914當與所有3種效應蛋白(EZH2、EZH2-KRAB及MQ1)一起遞送時,可有效地下調NCI-H2009細胞株中的MYC mRNA水準。
圖25A顯示與KRAB或MQ1一起遞送的嚮導RNA GD-29833及29914在處理後的第120小時可顯著下調A549細胞株中的MYC mRNA水準。
圖25B顯示與KRAB或MQ1一起遞送的嚮導RNA GD-29833及29914在處理後的第120小時可顯著下調NCI-H2009細胞株中的MYC mRNA水準且該下調類似於ZF9-MQ1處理後所觀測到的下調。
圖26A顯示dCas9-MQ1使NSCLC中的靶點甲基化增加至約60%。
圖26B顯示dCas9-MQ1將甲基化引向遠端啟動子區域(增加至約50%)。
圖27A-B顯示經由轉錄抑制子將嚮導引向MYC肺超級增強子在第96小時使NCI-H2009肺癌細胞株中的MYC蛋白水準降低。
圖28A顯示ZF9-MQ1處理Hep3B細胞之後,全細胞溶解物中存在的ZF9-MQ1蛋白逐漸減少。
圖28B顯示ZF9-MQ1處理Hep3B細胞之後,全細胞溶解物中的MYC蛋白表現逐漸下調。
圖28C顯示ZF9-MQ1處理Hep3B細胞之後,全細胞溶解物中存在的ZF9-MQ1蛋白與MYC蛋白的下調相關。
圖29A顯示在ZF9-MQ1處理之後的若干時間點直至第15天,SK-HEP細胞株之MYC轉錄本中的mRNA表現下調,下調45%。
圖29B顯示直至第15天,MYC轉錄變化與甲基化百分比相關。
圖30A顯示與單獨的GFP、ZF-NE或ZF3-KRAB相比,經0.6 µg/ml、1.25 µg/ml及2.5 µg/ml濃度之ZF9-MQ1、ZF9-MQ1 + ZF3-KRAB或雙順反子ZF9-MQ1_ZF3-KRAB處理的原代肝細胞顯示MYC mRNA表現減少。
圖30B顯示用0.6 µg/ml、1.25 µg/ml及2.5 µg/ml濃度之ZF9-MQ1、ZF9-MQ1 + ZF3-KRAB或雙順反子ZF9-MQ1_ZF3-KRAB處理對原代肝細胞的存活率影響最小,證明正常細胞隨之發生的MYC表現減少低於HCC細胞株。
圖30C顯示與單獨的GFP、ZF-NE或ZF3-KRAB相比,經0.5 µg/ml、1.0 µg/ml及2.0 µg/ml濃度之ZF9-MQ1、ZF9-MQ1 + ZF3-KRAB或雙順反子ZF9-MQ1_ZF3-KRAB處理的原代肝細胞顯示MYC mRNA表現減少。
圖30D顯示用0.5 µg/ml、1.0 µg/ml及2.0 µg/ml濃度之ZF9-MQ1、ZF9-MQ1 + ZF3-KRAB或雙順反子ZF9-MQ1_ZF3-KRAB處理對原代肝細胞的存活率影響最小,證明正常細胞隨之發生的MYC表現減少低於HCC細胞株。
圖31A顯示ZF9-MQ1 + ZF3-KRAB療法在投與三次之後,腫瘤尺寸顯示統計學上顯著的減小,第25天腫瘤體積比對照組低63%,且ZF9-MQ1 + ZF3-KRAB療法對腫瘤體積的作用等效於順鉑療法。
圖31B顯示經ZF9-MQ1 + ZF3-KRAB治療之小鼠的體重未經歷顯著的降低。
圖32A顯示1.5 mg/kg ZF9-MQ1 + ZF3-KRAB療法投與兩次之後,使得腫瘤尺寸出現統計學上顯著的減小,第23天將腫瘤生長抑制63% (與陰性對照組相比)。3 mg/kg ZF9-MQ1 + ZF3-KRAB在投與兩次之後,使得腫瘤尺寸出現統計學上顯著之減小,第23天將腫瘤生長抑制54% (與陰性對照組相比),且6 mg/kg劑量的ZF9-MQ1 + ZF3-KRAB療法在投與兩次之後,使得腫瘤尺寸出現統計學上顯著之減小,第23天將腫瘤體積降低63% (與陰性對照組相比)。
圖32B顯示經ZF9-MQ1 + ZF3-KRAB治療之小鼠的體重未經歷顯著的減小。經索拉非尼治療之小鼠的體重初始經歷下降,隨後總體重因腫瘤塊增大而潛在地增加。
圖33A顯示雙順反子構築體ZF9-MQ1_ZF3-KRAB在0.6 µg/ml及2.0 µg/ml濃度下使Hep 3B細胞中的MYC mRNA表現下調的程度大於單獨的單一構築體(ZF3-KRAB或ZF9-MQ1)。第48小時,雙順反子ZF9-MQ1_ZF3-KRAB在0.6 µg/ml與2 µg/ml濃度下均使總MYC mRNA水準降低99%。
圖33B顯示雙順反子構築體ZF9-MQ1_ZF3-KRAB使Hep 3B細胞的細胞存活率下調的程度大於單獨的單一構築體(ZF3-KRAB或ZF9-MQ1)。雙順反子ZF9-MQ1_ZF3-KRAB在0.6 µg/ml與2 µg/ml濃度下分別使Hep3B細胞的存活率降低約80%及27%。
圖34A顯示雙順反子構築體ZF9-MQ1_ZF3-KRAB以劑量依賴性方式下調Hep3B細胞的MYC mRNA及細胞存活率。
圖34B顯示雙順反子構築體ZF9-MQ1_ZF3-KRAB以劑量依賴性方式下調HepG2細胞的MYC mRNA及細胞存活率。
圖34C顯示雙順反子構築體ZF9-MQ1_ZF3-KRAB以劑量依賴性方式下調SKHEP1細胞的MYC mRNA及細胞存活率。
圖34D顯示雙順反子ZF9-MQ1_ZF3-KRAB有效針對HCC S1與S2亞型。
圖35顯示在雙順反子ZF9-MQ1_ZF3-KRAB處理第48小時,偵測到Hep 3B及Hep G2細胞株的凋亡細胞>75%且偵測到SK-HEP-1細胞株的凋亡細胞為約15%。與未處理的細胞(5-20%背景細胞凋亡)相比,非編碼mRNA對照物對細胞無影響。
圖36顯示雙順反子ZF9-MQ1_ZF3-KRAB構築體處理1次之後,SKHEP1細胞中的MYC mRNA水準在第一天降低且在處理後保持抑制狀態長達十五天。
圖37顯示與非編碼短mRNA或未處理的細胞相比,雙順反子ZF9-MQ1_ZF3-KRAB處理在第6小時使Hep3B與SKHEP1細胞株中的MYC mRNA及蛋白質表現均減少,隨後96小時保持下調。
圖38顯示在轉染之後的第6小時與第24小時時間點,由雙順反子ZF9-MQ1_ZF3-KRAB mRNA編碼的OEC ZF3-KRAB與ZF9-MQ1蛋白質在西方墨點上藉由HA標籤可視化。
圖39A顯示當索拉非尼與0.6 µg/ml雙順反子ZF9-MQ1_ZF3-KRAB組合投與時,索拉非尼針對SKHEP1的IC
50自12.3 µM減小至10.7 µM。當索拉非尼與0.1 µg/ml雙順反子ZF9-MQ1_ZF3-KRAB組合投與時,索拉非尼針對SKHEP1細胞的IC
50未出現顯著變化。
圖39B顯示當索拉非尼與0.6 µg/ml雙順反子ZF9-MQ1_ZF3-KRAB組合投與時,索拉非尼針對Hep 3B的IC50自4.4 µM減小至2.9 µM。當索拉非尼與0.1 µg/ml雙順反子ZF9-MQ1_ZF3-KRAB組合投與時,索拉非尼針對Hep 3B細胞的IC
50未出現顯著變化。
圖40A顯示當分別用0.6 µg/ml及0.1 µg/ml濃度的雙順反子ZF9-MQ1_ZF3-KRAB處理時,JQ1針對SKHEP1細胞的IC
50減小。
圖40B顯示當分別用0.6 µg/ml及0.1 µg/ml濃度的雙順反子ZF9-MQ1_ZF3-KRAB處理時,JQ1針對Hep 3B細胞的IC
50減小。
圖41A顯示與未處理的細胞相比,ZF17-MQ1在0.6 µg/ml與1.2 µg/ml濃度下均能夠下調小鼠Hepa1-6細胞中的MYC mRNA表現。
圖41B顯示與未處理的細胞相比,ZF17-MQ1在0.6 µg/ml與1.2 µg/ml濃度下均能夠降低小鼠Hepa1-6細胞的細胞存活率。
圖42A顯示ZF17-MQ1對小鼠HCC細胞Hepa1-6的處理在第24及48小時顯著下調MYC蛋白。
圖42B顯示ZF17-MQ1對小鼠HCC細胞Hepa1-6的處理在第24及48小時顯著下調MYC蛋白。
圖42C顯示ZF17-MQ1對小鼠HCC細胞Hepa1-6的處理在第96小時顯著下調MYC mRNA。
圖42D顯示ZF17-MQ1對小鼠HCC細胞Hepa1-6的處理在第96小時使細胞存活率顯著降低。
圖43顯示ZF17-MQ1在4次劑量之後顯著減小動物腫瘤負荷且在兩週藥物假期之後,以ZF17-MQ1再治療小鼠使得腫瘤在約4週之後完全耗乏。
圖44A顯示相較於未處理或經GFP處理的細胞,經ZF17-MQ1處理之細胞(LL2細胞)中的MYC蛋白水準降低。
圖44B顯示與未處理之細胞中所觀測的水準相比,ZF17-MQ1及ZF16-MQ1分別使LL2細胞中的MYC mRNA水準降低>99.9%或74%。
圖44C顯示與未處理的細胞及經GFP處理的細胞相比,所有三種構築體ZF15-MQ1、ZF16-MQ1及ZF17-MQ1皆能夠以更大的程度降低LL2細胞的細胞存活率。
圖45A顯示ZF17-MQ1在1.25 µg/mL與2.5 µg/mL濃度下均降低MYC mRNA水準。與未處理之細胞中所觀測的水準相比,ZF17-MQ1在2.5 µg/mL下使LL2及CT26細胞中的MYC mRNA水準分別降低93%及85%。
圖45B顯示ZF17-MQ1在兩種濃度下均降低細胞存活率。與未處理的細胞相比,在此等條件下,ZF17-MQ1使LL2及CT26細胞的細胞存活率分別降低87%及93%。
圖46顯示ZF17-MQ1下調CMT167及LL2細胞之MYC mRNA且使其細胞存活率降低的程度大於未處理的細胞及經GFP處理的細胞(陰性對照組)。與未處理之細胞中所觀測的水準相比,ZF17-MQ1使CMT167及LL2細胞中的MYC mRNA水準分別降低62%及73%。另外,與未處理的細胞相比,在此等條件下,ZF17-MQ1使CMT167及LL2細胞的細胞存活率分別降低54%及57%。
圖47顯示與未處理的細胞相比,ZF9-MQ1使小氣管原代上皮細胞、肺葉原代上皮細胞及肺原代纖維母細胞中的MYC mRNA水準分別下調94%、96%、96%水準。然而,存活率比對照細胞僅降低16%、9%及22%。
圖48A顯示ZF9-MQ1及JQ1各自分別抑制A549細胞的細胞存活率。
圖48B顯示ZF9-MQ1 (0.5 µg/ml)及JQ1 (濃度至多6.25 µM)在抑制A549存活率方面,相較於根據其個別活性所預測的效應,呈現大於相加的效應。
圖48C顯示ZF9-MQ1 (1.0 µg/ml)及JQ1 (濃度至多6.25 µM)在抑制A549存活率方面,相較於根據其個別活性所預測的效應,呈現大於相加的效應。
圖49A顯示ZF9-MQ1及BET762各自分別抑制A549細胞的細胞存活率。
圖49B顯示ZF9-MQ1 (0.5 µg/ml)及BET762 (濃度至多1.25 µM)在抑制A549存活率方面,相較於根據其個別活性所預測的效應,呈現大於相加的效應。
圖49C顯示ZF9-MQ1 (1.0 µg/ml)及BET762 (濃度至多0.625 µM)在抑制A549存活率方面,相較於根據其個別活性所預測的效應,呈現大於相加的效應。
圖50A顯示ZF9-MQ1及必納昔布(Birabresib)各自分別抑制A549細胞的細胞存活率。
圖50B顯示ZF9-MQ1 (0.5 µg/ml)及必納昔布 (濃度至多0.625 µM)在抑制A549存活率方面,相較於根據其個別活性所預測的效應,呈現大於相加的效應。
圖50C顯示ZF9-MQ1 (1.0 µg/ml)及必納昔布(濃度至多0.313 µM)在抑制A549存活率方面,相較於根據其個別活性所預測的效應,呈現大於相加的效應。
圖51A顯示ZF9-MQ1及曲美替尼(Trametinib)各自分別抑制A549細胞的細胞存活率。
圖51B顯示ZF9-MQ1 (0.5 µg/ml)及曲美替尼(濃度至多0.05 µM)在抑制A549存活率方面,相較於根據其個別活性所預測的效應,呈現大於相加的效應。
圖51C顯示ZF9-MQ1 (1.0 µg/ml)及曲美替尼(濃度至多0.05 µM)在抑制A549存活率方面,相較於根據其個別活性所預測的效應,呈現大於相加的效應。
圖52A顯示與未處理的細胞相比,所有構築體ZF9-MQ1、ZF54-KRAB、ZF67-KRAB及ZF68-KRAB在處理後的第72小時皆能夠使H2009細胞中的MYC mRNA水準下調至少42%。
圖52B顯示與未處理的細胞相比,構築體ZF9-MQ1、ZF67-KRAB及ZF68-KRAB在處理後的第72小時能夠使H226細胞中的MYC mRNA水準下調至少27%。
圖52C顯示與未處理的細胞相比,構築體ZF9-MQ1與ZF54-KRAB在處理後的第72小時均能夠使H226細胞中的MYC mRNA水準下調至少27%。
圖52D顯示與未處理的細胞相比,構築體ZF9-MQ1、ZF61-KRAB、ZF67-KRAB及ZF68-KRAB在處理後的第72小時能夠使H460細胞中的MYC mRNA水準下調至少26%。
圖53顯示相對於未處理的對照細胞,ZF9-MQ1及ZF54-KRAB在所測試的最高濃度下各自分別下調H2009細胞中的MYC mRNA,分別下調99%或62%。當小於0.313 µg/mL的ZF9-MQ1與1或2 µg/mL ZF54-KRAB組合時,MYC mRNA下調的程度大於針對單獨療法所觀測的程度。
圖54顯示相對於未處理的對照細胞,ZF9-MQ1在第48小時使H1299細胞中的MYC mRNA下調95%且在第144小時將下調維持在對照水準的90%。ZF9-MQ1 + ZF54-KRAB組合第48小時使MYC mRNA水準降低至98%且在第144小時將下調維持在對照水準的93% (圖54)。另外,資料顯示ZF9-MQ1及ZF9-MQ1與ZF54-KRAB的組合使H1299細胞中的MYC mRNA水準下調至少6天。
圖55顯示相較於未處理的細胞,ZF9-MQ1及ZF54-KRAB在引入H2009細胞之後的第24小時,使MYC mRNA水準分別下調高達83%及55%。在處理之後的第48小時,經ZF9-MQ1處理之細胞中的MYC mRNA水準進一步降低另外13%,降至未處理對照組的96%,而ZF54-KRAB未使MYC水準進一步下調。在處理後的第24小時,經ZF9-MQ1_ZF54-KRAB及ZF54-KRAB_ZF9-MQ1處理之細胞中的MYC mRNA水準分別降低至對照細胞的95%及96%。資料表明此等控制劑能夠比ZF9-MQ1更早地降低H2009細胞中的MYC mRNA水準,在第24小時使經處理之細胞中的MYC下調程度大於經ZF9-MQ1處理的細胞。
圖56顯示ZF9-MQ1療法抑制H460皮下腫瘤模型中之腫瘤生長的程度類似於或高於索拉非尼誘導的腫瘤生長抑制。
The following detailed description of the embodiments of the present invention will be better understood when read in conjunction with the accompanying drawings. For the purpose of illustrating the invention, current exemplary embodiments are shown in the drawings. It should be understood, however, that the invention is not limited to the precise arrangements and execution of the embodiments shown in the drawings. Figure 1A depicts a schematic diagram of a dual-target approach based on a persistent block of the MYC promoter and a transient (48/72 hours) block of the CTCF/TF site using a DBD fused to a DNA methyltransferase, which Short-lived blocks use DBDs or DBDs fused to short-lived effectors. Figure 1B depicts guide RNA localization and chromatin environment at the target sites (CTCF and promoter) of the MYC gene. From top to bottom, the graphs represent the extent of H3K4me3 (histone H3 Ky trimethylation) in the MYC locus of HepG2 cells; the extent of H3K9me3 (histone H3 K9 trimethylation) (replica 1); the H3K9me3 (histone H3 K9 trimethylation) degree (replica 2); H3K27me3 (histone H3 K27 trimethylation) degree; H3K27ac (histone H3 K27 acetylation) degree; GROseq_forward strand degree (transcriptionally active RNA pol II pair Forward strand binding); GROseq_reverse strand level (binding of transcriptionally active RNA pol II to reverse strand); RNAseq_rep2 content (MYC transcript content measured using RNAseq, replica 2); as measured by WGBS (whole The degree of DNA methylation measured by Genome Bisulfite Sequencing), and the degree of CTCF binding. The positions of the four gRNAs are indicated by arrows. gRNA GD-28859, GD-28616, GD-28862 targeted the anchor site upstream of MYC or its vicinity, and gRNA GD-28617 targeted the MYC promoter. In the present invention, GD-28859 is also called GD-59; GD-28616 is also called GD-16; GD-28862 is also called GD-62; and GD-28617 is also called GD-17. Figure 1C shows a schematic of exemplary bicistronic constructs. The 5' end of the construct has a cap structure defined by an N7-methylated guanosine linked to the first nucleotide of the mRNA via an inverted 5' to 5' triphosphate linkage. In some embodiments, the cap structure promotes protein translation and stability. Downstream of the cap structure is an untranslated region (5'UTR) designed to promote high levels of protein translation, followed by a typical "Kozak" sequence that is recognized by ribosomes to initiate protein translation. The "Kozak" sequence is followed by the CDS, which is a single contiguous sequence comprising a first expression suppressor containing a first targeting moiety and a first effector moiety and containing a tPT2A "ribosomal skipping" sequence (linker) separated The second expression inhibitor of the second targeting moiety and the second effector moiety. Without wishing to be bound by theory, when the ribosome reaches the tPT2A linker, it begins to translate the linker into an amino acid. The first 18 amino acids from the P2A linker remain C-terminal to the first expressed inhibitor (eg comprising ZF DBD and MQ1 ), which is then released by the ribosome. The ribosome then proceeds until it reaches the T2A linker, and the first 17 residues of the T2A linker are translated and released. A second polypeptide comprising a single amino acid is then translated, and then translation of a second expression suppressor (eg, comprising a second ZF DBD and KRAB) is initiated. The CDS is followed by a 3' UTR designed to facilitate high levels of translation and stabilize the mRNA. Finally, the very terminal 3' end of the mRNA is a polyA tail. In some embodiments, the polyA tail promotes protein translation and mRNA stability. Figure 2A shows that Cas9-nuclease editing of the CTCF motif results in downregulation of MYC expression. Disruption of the CTCF motif by Cas9 (in combination with GD-28616) caused a 32-39% downregulation of MYC expression in all three HCC cell lines (HepG2, Hep3B, and SKHEP1). Disruption of the region adjacent to the CTCF motif (GD-28859) modulated MYC expression by 35-45% in two of the three cell lines (HepG2 and Hep3B). Figure 2B shows editing efficiency as assessed by AmpSeq, demonstrating that all cell lines were 77-100% edited. Figure 3 shows that dCas9-KRAB downregulates MYC expression when directed against the promoter or linked CTCF motif. LNP-mediated dCas9-KRAB/GD-28616 transfection downregulated MYC expression in Hep3B and SKHEP1 by 11-34% at the 48/72 hour time point. LNP-mediated dCas9-KRAB/GD-28859 transfection downregulated MYC expression by 18-44% in all 3 HCC models at the 48/72 hour time point. Directing dCas9-KRAB to the MYC promoter via dCas9-KRAB/GD-28617 downregulated MYC expression by 24-58% in all 3 HCC models at the 48/72 hour time point. Figure 4A depicts sgRNA positioning and zinc finger design at the CTCF motif linked to the promoter. The scheme reveals SEQ ID NO: 208. Figure 4B shows that the ZF-KRAB construct against CTCF linked to the promoter achieves downregulation of MYC in Hep3B. ZF2-KRAB, ZF3-KRAB and ZF4-KRAB can down-regulate MYC in Hep3B cells to the same or greater extent than dCas9-KRAB/GD-28859, and ZF3-KRAB has the strongest down-regulation effect. Figure 4C shows that ZF3-non-effector and ZF3-KRAB downregulate MYC expression in various human HCC models (HepG2, Hep3B and SKHEP1). In two other HCC models, HepG2 and SKHEP1, ZF3-KRAB was also shown to downregulate MYC to an extent equal to or greater than ZF3-non-effector and ZF5-non-effector. Figure 4D shows that ZF3-non-effector and ZF3-KRAB exhibit equivalent effects on MYC expression and viability of Hep3B cells at different time points (24 hours, 72 hours and 120 hours). Figure 5 shows that dCas9-MQ1 downregulates MYC expression when targeting the MYC promoter in various HCC models (HepG2, Hep3B, and SKHEP1). Figure 6A depicts sgRNA positioning and zinc finger design at the MYC promoter. The scheme reveals SEQ ID NO: 209. Figure 6B depicts six ZF-MQ1 constructs directed against the MYC promoter, screened for MYC downregulation in Hep3B. ZF8-MQ1, ZF9-MQ1 and ZF11-MQ1 down-regulated MYC in Hep3B cells to the greatest extent, and ZF9-KRAB MQ1 had the strongest down-regulation effect. Figure 7A shows that ZF9-MQ1 significantly downregulates MYC expression and reduces Hep3B survival compared to ZF12-MQ1. Figure 7B shows that ZF9-MQ1 significantly downregulates MYC expression and reduces survival of HepG2 compared to ZF12-MQ1. Figure 7C shows that ZF9-MQ1 significantly downregulates MYC expression and reduces survival of SKHEP1 compared to ZF12-MQ1. Figure 7D shows that ZF9-MQ1 downregulates MYC expression more effectively and reduces Hep3B survival compared to ZF8-MQ1. Figure 7E shows that ZF9-MQ1 downregulates MYC expression more effectively and reduces survival of HepG2 compared to ZF8-MQ1. Figure 7F shows that ZF9-MQ1 downregulates MYC expression more effectively and reduces the survival rate of SKHEP1 compared to ZF8-MQ1. Figure 7G shows that ZF9-MQ1 significantly downregulates MYC expression and reduces Hep3B survival compared to dCas9-MQ1/GD17. Figure 7H shows that ZF9-MQ1 significantly downregulates MYC expression and reduces survival of HepG2 compared to dCas9-MQ1/GD17. Figure 7I shows that ZF9-MQ1 significantly downregulates MYC expression and reduces the survival rate of SKHEP1 compared to dCas9-MQ1/GD17. Figure 8A shows that dCas9-MQ1 achieved a 50-90% reduction in mRNA in all three cell lines (Hep3B, HepG2 and SKHEP1) at 72 hours. Figure 8B shows that although MYC downregulation had minimal effect on SK-HEP-1 survival, at 72 hours and 168 hours, MYC downregulation significantly decreased the survival rates of HepG2 and Hep3B. Figure 8C shows that MYC mRNA was reduced by about 70% and about 55% on
<![CDATA[<110> 美商旗艦先鋒創新公司(FLAGSHIP PIONEERING INNOVATIONS V, INC.)]]>
<![CDATA[<120> 用於調節MYC表現之組合物及方法]]>
<![CDATA[<130> O2057-7029WO]]>
<![CDATA[<140> TW 111112954]]>
<![CDATA[<141> 2022-04-01]]>
<![CDATA[<150> 63/281,022]]>
<![CDATA[<151> 2021-11-18]]>
<![CDATA[<150> 63/212,991]]>
<![CDATA[<151> 2021-06-21]]>
<![CDATA[<150> PCT/US21/10059]]>
<![CDATA[<151> 2021-12-15]]>
<![CDATA[<160> 210 ]]>
<![CDATA[<170> PatentIn 3.5版]]>
<![CDATA[<210> 1]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 1]]>
atgatctctg ctgccagtag 20
<![CDATA[<210> 2]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 2]]>
atcgcgcctg gatgtcaacg 20
<![CDATA[<210> 3]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 3]]>
attgtgcagt gcatcggatt 20
<![CDATA[<210> 4]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 4]]>
gtcaaacagt actgctacgg 20
<![CDATA[<210> 5]]>
<![CDATA[<211> 209]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 5]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Arg Ser Asp Lys Leu Thr Glu His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Lys
35 40 45
Asn Ser Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Thr Thr Gly Ala Leu Thr Glu His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Asp Ser Gly Asn Leu Arg Val His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg
145 150 155 160
Ala His Leu Glu Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu Leu Val
180 185 190
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[<210> 6]]>
<![CDATA[<21]]>1> 209]]>
<br/><![CDATA[<212> PRT]]>
<br/><![CDATA[<213> 人工序列]]>
<br/>
<br/><![CDATA[<220>]]>
<br/><![CDATA[<223> 人工序列之描述:合成多肽]]>
<br/>
<br/><![CDATA[<400> 6]]>
<br/><![CDATA[Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Ser Pro Ala Asp Leu Thr Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro
35 40 45
Gly His Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asn Leu Val
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Gln Ser Ser Ser Leu Val Arg His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Thr Ser His Ser Leu Thr Glu His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn
145 150 155 160
Asp Ala Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Ala Leu Thr
180 185 190
Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[<210> 7]]>
<![CDATA[<211> 209]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 7]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg
35 40 45
Ala His Leu Glu Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro Gly His Leu Val
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Gln Leu Ala His Leu Arg Ala His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ala
145 150 155 160
Asp Asn Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala
180 185 190
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[<210> 8]]>
<![CDATA[<211> 209]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> ]]>8
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser His Thr Gly His Leu Leu Glu His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr
35 40 45
Gly Asn Leu Thr Val His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Lys Lys Asp Leu Thr
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Gln Arg Ala His Leu Glu Arg His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser
145 150 155 160
Asp Lys Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Thr Asp Thr Leu Arg
180 185 190
Asp His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[<210> 9]]>
<![CDATA[<211> 209]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 9]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser His Thr Gly His Leu Leu Glu His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser
35 40 45
Gly Asn Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Asn Leu Val
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro
145 150 155 160
Gly His Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr His Leu Asp Leu Ile
180 185 190
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[<210> 10]]>
<![CDATA[<211> 209]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 10]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Asp Pro Gly Ala Leu Val Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg
35 40 45
Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr
65 70 75 80
Asn His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Arg Ser Asp His Leu Thr Asn His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro
145 150 155 160
Gly His Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val
180 185 190
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[<210> 11]]>
<![CDATA[<211> 209]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 11]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Arg Asn Asp Ala Leu Thr Glu His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Cys
35 40 45
Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro Gly His Leu Val
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Gln Ser Gly His Leu Thr Glu His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Lys
145 150 155 160
Asn Ser Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr
180 185 190
Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[<210> 12]]>
<![CDATA[<211> 209]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 12]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Arg Ser Asp Lys Leu Thr Glu His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Arg
35 40 45
Asp Glu Leu Asn Val His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr
65 70 75 80
Asn His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Ser Pro Ala Asp Leu Thr Arg His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Arg Ser Asp His Leu Thr Asn His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Ser Lys
145 150 155 160
Lys Ala Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr His Leu Asp Leu Ile
180 185 190
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[<210> 13]]>
<![CDATA[<211> 209]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 13]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Arg Ser Asp Asp Leu Val Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Glu
35 40 45
Asp Asn Leu His Thr His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr
65 70 75 80
Thr His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Gln Leu Ala His Leu Arg Ala His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg
145 150 155 160
Ala Asn Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg
180 185 190
Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[<210> 14]]>
<![CDATA[<211> 209]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 14]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Gln Ser Gly Asp Leu Arg Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser
35 40 45
Gly His Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg
65 70 75 80
Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Arg Asn Asp Thr Leu Thr Glu His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ala
145 150 155 160
Asp Asn Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr His Leu Asp Leu Ile
180 185 190
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[<210> 15]]>
<![CDATA[<211> 209]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 15]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser His Thr Gly His Leu Leu Glu His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Ser Lys
35 40 45
Lys Ala Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser His Thr Gly His Leu Leu Glu His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Arg Asn Asp Ala Leu Thr Glu His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser
145 150 155 160
Gly Asp Leu Arg Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Ser Gly Asn Leu Arg
180 185 190
Val His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[<210> 16]]>
<![CDATA[<211> 209]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 16]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Gln Ser Ser Ser Leu Val Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser
35 40 45
Asp His Leu Thr Asn His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg
65 70 75 80
Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Gln Ser Ser Asn Leu Val Arg His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Arg Ser Asp Asn Leu Val Arg His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser
145 150 155 160
Asp Glu Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg
180 185 190
Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[<210> 17]]>
<![CDATA[<211> 1367]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 17]]>
Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val Gly
1 5 10 15
Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe Lys
20 25 30
Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile Gly
35 40 45
Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu Lys
50 55 60
Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys Tyr
65 70 75 80
Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser Phe
85 90 95
Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys His
100 105 110
Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr His
115 120 125
Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp Ser
130 135 140
Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His Met
145 150 155 160
Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro Asp
165 170 175
Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr Asn
180 185 190
Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala Lys
195 200 205
Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn Leu
210 215 220
Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn Leu
225 230 235 240
Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe Asp
245 250 255
Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp Asp
260 265 270
Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp Leu
275 280 285
Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp Ile
290 295 300
Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser Met
305 310 315 320
Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys Ala
325 330 335
Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe Asp
340 345 350
Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser Gln
355 360 365
Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp Gly
370 375 380
Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg Lys
385 390 395 400
Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu Gly
405 410 415
Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe Leu
420 425 430
Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile Pro
435 440 445
Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp Met
450 455 460
Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu Val
465 470 475 480
Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr Asn
485 490 495
Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser Leu
500 505 510
Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys Tyr
515 520 525
Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln Lys
530 535 540
Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr Val
545 550 555 560
Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp Ser
565 570 575
Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly Thr
580 585 590
Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp Asn
595 600 605
Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr Leu
610 615 620
Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala His
625 630 635 640
Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr Thr
645 650 655
Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp Lys
660 665 670
Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe Ala
675 680 685
Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe Lys
690 695 700
Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu His
705 710 715 720
Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly Ile
725 730 735
Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly Arg
740 745 750
His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln Thr
755 760 765
Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile Glu
770 775 780
Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro Val
785 790 795 800
Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu Gln
805 810 815
Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg Leu
820 825 830
Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys Asp
835 840 845
Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Ala Arg Gly
850 855 860
Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys Asn
865 870 875 880
Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys Phe
885 890 895
Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp Lys
900 905 910
Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr Lys
915 920 925
His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp Glu
930 935 940
Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser Lys
945 950 955 960
Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg Glu
965 970 975
Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val Val
980 985 990
Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe Val
995 1000 1005
Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala Lys
1010 1015 1020
Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe Tyr
1025 1030 1035
Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala Asn
1040 1045 1050
Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu Thr
1055 1060 1065
Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val Arg
1070 1075 1080
Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr Glu
1085 1090 1095
Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro Lys Arg
1100 1105 1110
Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro Lys
1115 1120 1125
Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val Leu
1130 1135 1140
Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys Ser
1145 1150 1155
Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser Phe
1160 1165 1170
Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys Glu
1175 1180 1185
Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu Phe
1190 1195 1200
Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Gly Glu
1205 1210 1215
Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val Asn
1220 1225 1230
Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser Pro
1235 1240 1245
Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys His
1250 1255 1260
Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys Arg
1265 1270 1275
Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala Tyr
1280 1285 1290
Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn Ile
1295 1300 1305
Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala Phe
1310 1315 1320
Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr Ser Thr
1325 1330 1335
Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr Gly
1340 1345 1350
Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
<![CDATA[<210> 18]]>
<![CDATA[<211> 96]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 18]]>
Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys
1 5 10 15
Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr
20 25 30
Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn
35 40 45
Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg
50 55 60
Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln
65 70 75 80
Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val
85 90 95
<![CDATA[<210> 19]]>
<![CDATA[<211> 385]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 19]]>
Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val Phe Glu Ala Phe
1 5 10 15
Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys Val Arg Lys Asp
20 25 30
Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val Pro Ala Ile Val
35 40 45
Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys Leu Glu Tyr Lys
50 55 60
Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu Asn Lys Thr Leu
65 70 75 80
Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr Trp Lys Arg Lys
85 90 95
Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile Lys Leu Ser Glu
100 105 110
Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr Lys Arg Thr Leu
115 120 125
Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys Gln Asp Leu Ser
130 135 140
Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser Gly Thr Arg Ser
145 150 155 160
Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser Thr Glu Lys Asn
165 170 175
Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly Ala Leu Leu His
180 185 190
Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln Lys Leu Glu Ser
195 200 205
Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala Ala Asp Phe Gly
210 215 220
Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser Thr Leu Asn Glu
225 230 235 240
Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys Ser Ile Lys Lys
245 250 255
Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu Asn Asn Leu Leu
260 265 270
Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser Asn Ile Asn Lys
275 280 285
Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu Gly Tyr Val Tyr
290 295 300
Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser Gly Ala Asn Ser
305 310 315 320
Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys Met Asn Ser Asp
325 330 335
Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp Gly Lys Arg Val
340 345 350
Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile Phe Val Cys Gly
355 360 365
Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile Asp Lys Ile Gly
370 375 380
Gly
385
<![CDATA[<210> 20]]>
<![CDATA[<211> 480]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 20]]>
Val Asp Leu Arg Thr Leu Asp Val Phe Ser Gly Cys Gly Gly Leu Ser
1 5 10 15
Glu Gly Phe His Gln Ala Gly Ile Ser Asp Thr Leu Trp Ala Ile Glu
20 25 30
Met Trp Asp Pro Ala Ala Gln Ala Phe Arg Leu Asn Asn Pro Gly Ser
35 40 45
Thr Val Phe Thr Glu Asp Cys Asn Ile Leu Leu Lys Leu Val Met Ala
50 55 60
Gly Glu Thr Thr Asn Ser Arg Gly Gln Arg Leu Pro Gln Lys Gly Asp
65 70 75 80
Val Glu Met Leu Cys Gly Gly Pro Pro Cys Gln Gly Phe Ser Gly Met
85 90 95
Asn Arg Phe Asn Ser Arg Thr Tyr Ser Lys Phe Lys Asn Ser Leu Val
100 105 110
Val Ser Phe Leu Ser Tyr Cys Asp Tyr Tyr Arg Pro Arg Phe Phe Leu
115 120 125
Leu Glu Asn Val Arg Asn Phe Val Ser Phe Lys Arg Ser Met Val Leu
130 135 140
Lys Leu Thr Leu Arg Cys Leu Val Arg Met Gly Tyr Gln Cys Thr Phe
145 150 155 160
Gly Val Leu Gln Ala Gly Gln Tyr Gly Val Ala Gln Thr Arg Arg Arg
165 170 175
Ala Ile Ile Leu Ala Ala Ala Pro Gly Glu Lys Leu Pro Leu Phe Pro
180 185 190
Glu Pro Leu His Val Phe Ala Pro Arg Ala Cys Gln Leu Ser Val Val
195 200 205
Val Asp Asp Lys Lys Phe Val Ser Asn Ile Thr Arg Leu Ser Ser Gly
210 215 220
Pro Phe Arg Thr Ile Thr Val Arg Asp Thr Met Ser Asp Leu Pro Glu
225 230 235 240
Val Arg Asn Gly Ala Ser Ala Leu Glu Ile Ser Tyr Asn Gly Glu Pro
245 250 255
Gln Ser Trp Phe Gln Arg Gln Leu Arg Gly Ala Gln Tyr Gln Pro Ile
260 265 270
Leu Arg Asp His Ile Cys Lys Asp Met Ser Ala Leu Val Ala Ala Arg
275 280 285
Met Arg His Ile Pro Leu Ala Pro Gly Ser Asp Trp Arg Asp Leu Pro
290 295 300
Asn Ile Glu Val Arg Leu Ser Asp Gly Thr Met Ala Arg Lys Leu Arg
305 310 315 320
Tyr Thr His His Asp Arg Lys Asn Gly Arg Ser Ser Ser Gly Ala Leu
325 330 335
Arg Gly Val Cys Ser Cys Val Glu Ala Gly Lys Ala Cys Asp Pro Ala
340 345 350
Ala Arg Gln Phe Asn Thr Leu Ile Pro Trp Cys Leu Pro His Thr Gly
355 360 365
Asn Arg His Asn His Trp Ala Gly Leu Tyr Gly Arg Leu Glu Trp Asp
370 375 380
Gly Phe Phe Ser Thr Thr Val Thr Asn Pro Glu Pro Met Gly Lys Gln
385 390 395 400
Gly Arg Val Leu His Pro Glu Gln His Arg Val Val Ser Val Arg Glu
405 410 415
Cys Ala Arg Ser Gln Gly Phe Pro Asp Thr Tyr Arg Leu Phe Gly Asn
420 425 430
Ile Leu Asp Lys His Arg Gln Val Gly Asn Ala Val Pro Pro Pro Leu
435 440 445
Ala Lys Ala Ile Gly Leu Glu Ile Lys Leu Cys Met Leu Ala Lys Ala
450 455 460
Arg Glu Ser Ala Ser Ala Lys Ile Lys Glu Glu Glu Ala Ala Lys Asp
465 470 475 480
<![CDATA[<210> 21]]>
<![CDATA[<211> 457]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 21]]>
Glu Trp Gly Pro Phe Asp Leu Val Ile Gly Gly Ser Pro Cys Asn Asp
1 5 10 15
Leu Ser Ile Val Asn Pro Ala Arg Lys Gly Leu Tyr Glu Gly Thr Gly
20 25 30
Arg Leu Phe Phe Glu Phe Tyr Arg Leu Leu His Asp Ala Arg Pro Lys
35 40 45
Glu Gly Asp Asp Arg Pro Phe Phe Trp Leu Phe Glu Asn Val Val Ala
50 55 60
Met Gly Val Ser Asp Lys Arg Asp Ile Ser Arg Phe Leu Glu Ser Asn
65 70 75 80
Pro Val Met Ile Asp Ala Lys Glu Val Ser Ala Ala His Arg Ala Arg
85 90 95
Tyr Phe Trp Gly Asn Leu Pro Gly Met Asn Arg Pro Leu Ala Ser Thr
100 105 110
Val Asn Asp Lys Leu Glu Leu Gln Glu Cys Leu Glu His Gly Arg Ile
115 120 125
Ala Lys Phe Ser Lys Val Arg Thr Ile Thr Thr Arg Ser Asn Ser Ile
130 135 140
Lys Gln Gly Lys Asp Gln His Phe Pro Val Phe Met Asn Glu Lys Glu
145 150 155 160
Asp Ile Leu Trp Cys Thr Glu Met Glu Arg Val Phe Gly Phe Pro Val
165 170 175
His Tyr Thr Asp Val Ser Asn Met Ser Arg Leu Ala Arg Gln Arg Leu
180 185 190
Leu Gly Arg Ser Trp Ser Val Pro Val Ile Arg His Leu Phe Ala Pro
195 200 205
Leu Lys Glu Tyr Phe Ala Cys Val Ser Ser Gly Asn Ser Asn Ala Asn
210 215 220
Ser Arg Gly Pro Ser Phe Ser Ser Gly Leu Val Pro Leu Ser Leu Arg
225 230 235 240
Gly Ser His Met Asn Pro Leu Glu Met Phe Glu Thr Val Pro Val Trp
245 250 255
Arg Arg Gln Pro Val Arg Val Leu Ser Leu Phe Glu Asp Ile Lys Lys
260 265 270
Glu Leu Thr Ser Leu Gly Phe Leu Glu Ser Gly Ser Asp Pro Gly Gln
275 280 285
Leu Lys His Val Val Asp Val Thr Asp Thr Val Arg Lys Asp Val Glu
290 295 300
Glu Trp Gly Pro Phe Asp Leu Val Tyr Gly Ala Thr Pro Pro Leu Gly
305 310 315 320
His Thr Cys Asp Arg Pro Pro Ser Trp Tyr Leu Phe Gln Phe His Arg
325 330 335
Leu Leu Gln Tyr Ala Arg Pro Lys Pro Gly Ser Pro Arg Pro Phe Phe
340 345 350
Trp Met Phe Val Asp Asn Leu Val Leu Asn Lys Glu Asp Leu Asp Val
355 360 365
Ala Ser Arg Phe Leu Glu Met Glu Pro Val Thr Ile Pro Asp Val His
370 375 380
Gly Gly Ser Leu Gln Asn Ala Val Arg Val Trp Ser Asn Ile Pro Ala
385 390 395 400
Ile Arg Ser Arg His Trp Ala Leu Val Ser Glu Glu Glu Leu Ser Leu
405 410 415
Leu Ala Gln Asn Lys Gln Ser Ser Lys Leu Ala Ala Lys Trp Pro Thr
420 425 430
Lys Leu Val Lys Asn Cys Phe Leu Pro Leu Arg Glu Tyr Phe Lys Tyr
435 440 445
Phe Ser Thr Glu Leu Thr Ser Ser Leu
450 455
<![CDATA[<210> 22]]>
<![CDATA[<211> 367]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 22]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Thr Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Thr Lys Asn Ser Leu Thr Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Gln Ser Gly Asp Leu Arg Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Ala
115 120 125
Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Asp Ser Gly Asn Leu Arg Val His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Thr Ser Gly Glu Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[<210> 23]]>
<![CDATA[<211> 367]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 23]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Ser Pro Ala Asp Leu Thr Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp Asn Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Ser
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser His Ser Leu Thr Glu His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Asn Asp Ala Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Asn Asp Ala Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[<210> 24]]>
<![CDATA[<211> 367]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 24]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Pro Gly His Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[<210> 25]]>
<![CDATA[<211> 367]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 25]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser His Thr Gly His Leu Leu Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Lys Lys Asp Leu Thr Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp
115 120 125
Leu Arg Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ser Asp Lys Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Thr Asp Thr Leu Arg Asp His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[<210> 26]]>
<![CDATA[<211> 367]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 26]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser His Thr Gly His Leu Leu Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Thr Ser Gly Asn Leu Thr Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Thr Ser Gly Asn Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His
115 120 125
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Thr His Leu Asp Leu Ile Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[<210> 27]]>
<![CDATA[<211> 367]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 27]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro Gly Ala Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Asn His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His
115 120 125
Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr Asn His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[<210> 28]]>
<![CDATA[<211> 661]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 28]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Ala Leu Thr Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Pro Gly His Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly His
115 120 125
Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Thr Lys Asn Ser Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp
645 650 655
Val Pro Asp Tyr Ala
660
<![CDATA[<210> 29]]>
<![CDATA[<211> 661]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 29]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Thr Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Arg Asp Glu Leu Asn Val His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Asn His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Ser Pro Ala Asp
115 120 125
Leu Thr Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr Asn His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Ser Lys Lys Ala Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Thr His Leu Asp Leu Ile Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp
645 650 655
Val Pro Asp Tyr Ala
660
<![CDATA[<210> 30]]>
<![CDATA[<211> 661]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 30]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asp Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn
115 120 125
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp
645 650 655
Val Pro Asp Tyr Ala
660
<![CDATA[<210> 31]]>
<![CDATA[<211> 661]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 31]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Ser Gly His Leu Thr Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro Gly His
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Thr Leu Thr Glu His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Thr His Leu Asp Leu Ile Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp
645 650 655
Val Pro Asp Tyr Ala
660
<![CDATA[<210> 32]]>
<![CDATA[<211> 661]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 32]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser His Thr Gly His Leu Leu Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Ser Lys Lys Ala Leu Thr Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser His Thr Gly His
115 120 125
Leu Leu Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Ala Leu Thr Glu His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Ser Gly Asn Leu Arg Val His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp
645 650 655
Val Pro Asp Tyr Ala
660
<![CDATA[<210> 33]]>
<![CDATA[<211> 661]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 33]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Ser Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Ser Asp His Leu Thr Asn His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Gln Leu Ala His Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Asn
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asn Leu Val Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ser Asp Glu Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Gln Leu Ala His Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp
645 650 655
Val Pro Asp Tyr Ala
660
<![CDATA[<210> 34]]>
<![CDATA[<211> 1528]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 34]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val
20 25 30
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
35 40 45
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
50 55 60
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
65 70 75 80
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
85 90 95
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
100 105 110
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
115 120 125
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
130 135 140
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
145 150 155 160
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
165 170 175
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
180 185 190
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
195 200 205
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
210 215 220
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
225 230 235 240
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
245 250 255
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
260 265 270
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
275 280 285
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
290 295 300
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
305 310 315 320
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
325 330 335
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
340 345 350
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
355 360 365
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
370 375 380
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
385 390 395 400
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
405 410 415
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
420 425 430
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
435 440 445
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
450 455 460
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
465 470 475 480
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
485 490 495
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
500 505 510
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
515 520 525
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
530 535 540
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
545 550 555 560
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
565 570 575
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
580 585 590
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
595 600 605
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
610 615 620
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
625 630 635 640
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
645 650 655
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
660 665 670
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
675 680 685
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
690 695 700
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
705 710 715 720
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
725 730 735
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
740 745 750
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
755 760 765
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
770 775 780
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
785 790 795 800
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
805 810 815
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
820 825 830
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
835 840 845
Leu Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys
850 855 860
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Ala Arg
865 870 875 880
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
885 890 895
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
900 905 910
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
915 920 925
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
930 935 940
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
945 950 955 960
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
965 970 975
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
980 985 990
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
995 1000 1005
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu
1010 1015 1020
Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile
1025 1030 1035
Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe
1040 1045 1050
Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu
1055 1060 1065
Ala Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly
1070 1075 1080
Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr
1085 1090 1095
Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys
1100 1105 1110
Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro
1115 1120 1125
Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp
1130 1135 1140
Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser
1145 1150 1155
Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu
1160 1165 1170
Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser
1175 1180 1185
Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr
1190 1195 1200
Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser
1205 1210 1215
Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala
1220 1225 1230
Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr
1235 1240 1245
Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly
1250 1255 1260
Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His
1265 1270 1275
Lys His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser
1280 1285 1290
Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser
1295 1300 1305
Ala Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu
1310 1315 1320
Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala
1325 1330 1335
Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr
1340 1345 1350
Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile
1355 1360 1365
Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly
1370 1375 1380
Asp Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys
1385 1390 1395
Lys Lys Ala Ser Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr
1400 1405 1410
Leu Val Thr Phe Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu
1415 1420 1425
Trp Lys Leu Leu Asp Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val
1430 1435 1440
Met Leu Glu Asn Tyr Lys Asn Leu Val Ser Leu Gly Tyr Gln Leu
1445 1450 1455
Thr Lys Pro Asp Val Ile Leu Arg Leu Glu Lys Gly Glu Glu Pro
1460 1465 1470
Trp Leu Val Glu Arg Glu Ile His Gln Glu Thr His Pro Asp Ser
1475 1480 1485
Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Ser Gly Gly Lys Arg
1490 1495 1500
Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly
1505 1510 1515
Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1520 1525
<![CDATA[<210> 35]]>
<![CDATA[<211> 1820]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 35]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val
20 25 30
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
35 40 45
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
50 55 60
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
65 70 75 80
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
85 90 95
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
100 105 110
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
115 120 125
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
130 135 140
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
145 150 155 160
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
165 170 175
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
180 185 190
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
195 200 205
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
210 215 220
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
225 230 235 240
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
245 250 255
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
260 265 270
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
275 280 285
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
290 295 300
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
305 310 315 320
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
325 330 335
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
340 345 350
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
355 360 365
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
370 375 380
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
385 390 395 400
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
405 410 415
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
420 425 430
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
435 440 445
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
450 455 460
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
465 470 475 480
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
485 490 495
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
500 505 510
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
515 520 525
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
530 535 540
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
545 550 555 560
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
565 570 575
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
580 585 590
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
595 600 605
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
610 615 620
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
625 630 635 640
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
645 650 655
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
660 665 670
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
675 680 685
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
690 695 700
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
705 710 715 720
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
725 730 735
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
740 745 750
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
755 760 765
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
770 775 780
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
785 790 795 800
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
805 810 815
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
820 825 830
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
835 840 845
Leu Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys
850 855 860
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Ala Arg
865 870 875 880
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
885 890 895
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
900 905 910
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
915 920 925
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
930 935 940
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
945 950 955 960
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
965 970 975
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
980 985 990
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
995 1000 1005
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu
1010 1015 1020
Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile
1025 1030 1035
Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe
1040 1045 1050
Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu
1055 1060 1065
Ala Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly
1070 1075 1080
Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr
1085 1090 1095
Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys
1100 1105 1110
Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro
1115 1120 1125
Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp
1130 1135 1140
Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser
1145 1150 1155
Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu
1160 1165 1170
Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser
1175 1180 1185
Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr
1190 1195 1200
Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser
1205 1210 1215
Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala
1220 1225 1230
Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr
1235 1240 1245
Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly
1250 1255 1260
Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His
1265 1270 1275
Lys His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser
1280 1285 1290
Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser
1295 1300 1305
Ala Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu
1310 1315 1320
Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala
1325 1330 1335
Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr
1340 1345 1350
Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile
1355 1360 1365
Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly
1370 1375 1380
Asp Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys
1385 1390 1395
Lys Lys Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu
1400 1405 1410
Arg Val Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala
1415 1420 1425
Leu Glu Lys Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala
1430 1435 1440
Glu Trp Tyr Val Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn
1445 1450 1455
Asn Phe His Thr Lys Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu
1460 1465 1470
Met Ile Asp Tyr Leu Glu Asn Lys Thr Leu Ser Trp Asn Ser Lys
1475 1480 1485
Asn Pro Val Ser Asn Gly Tyr Trp Lys Arg Lys Lys Asp Asp Glu
1490 1495 1500
Leu Lys Ile Ile Tyr Asn Ala Ile Lys Leu Ser Glu Lys Glu Gly
1505 1510 1515
Asn Ile Phe Asp Ile Arg Asp Leu Tyr Lys Arg Thr Leu Lys Asn
1520 1525 1530
Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys Gln Asp Leu Ser Gln
1535 1540 1545
Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser Gly Thr Arg Ser
1550 1555 1560
Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser Thr Glu Lys
1565 1570 1575
Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly Ala Leu
1580 1585 1590
Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln Lys
1595 1600 1605
Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
1610 1615 1620
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile
1625 1630 1635
Ser Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys
1640 1645 1650
Pro Lys Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys
1655 1660 1665
Asp Ile Leu Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys
1670 1675 1680
Lys Thr Lys Ser Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser
1685 1690 1695
Lys Phe Asn Ser Glu Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly
1700 1705 1710
Pro Thr Leu Thr Ala Ser Gly Ala Asn Ser Arg Ile Lys Ile Lys
1715 1720 1725
Asp Gly Ser Asn Ile Arg Lys Met Asn Ser Asp Glu Thr Phe Leu
1730 1735 1740
Tyr Ile Gly Phe Asp Ser Gln Asp Gly Lys Arg Val Asn Glu Ile
1745 1750 1755
Glu Phe Leu Thr Glu Asn Gln Lys Ile Phe Val Cys Gly Asn Ser
1760 1765 1770
Ile Ser Val Glu Val Leu Glu Ala Ile Ile Asp Lys Ile Gly Gly
1775 1780 1785
Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly
1790 1795 1800
Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp
1805 1810 1815
Tyr Ala
1820
<![CDATA[<210> 36]]>
<![CDATA[<211> 1922]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 36]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val
20 25 30
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
35 40 45
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
50 55 60
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
65 70 75 80
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
85 90 95
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
100 105 110
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
115 120 125
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
130 135 140
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
145 150 155 160
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
165 170 175
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
180 185 190
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
195 200 205
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
210 215 220
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
225 230 235 240
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
245 250 255
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
260 265 270
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
275 280 285
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
290 295 300
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
305 310 315 320
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
325 330 335
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
340 345 350
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
355 360 365
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
370 375 380
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
385 390 395 400
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
405 410 415
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
420 425 430
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
435 440 445
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
450 455 460
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
465 470 475 480
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
485 490 495
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
500 505 510
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
515 520 525
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
530 535 540
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
545 550 555 560
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
565 570 575
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
580 585 590
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
595 600 605
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
610 615 620
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
625 630 635 640
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
645 650 655
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
660 665 670
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
675 680 685
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
690 695 700
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
705 710 715 720
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
725 730 735
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
740 745 750
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
755 760 765
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
770 775 780
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
785 790 795 800
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
805 810 815
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
820 825 830
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
835 840 845
Leu Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys
850 855 860
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Ala Arg
865 870 875 880
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
885 890 895
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
900 905 910
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
915 920 925
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
930 935 940
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
945 950 955 960
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
965 970 975
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
980 985 990
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
995 1000 1005
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu
1010 1015 1020
Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile
1025 1030 1035
Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe
1040 1045 1050
Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu
1055 1060 1065
Ala Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly
1070 1075 1080
Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr
1085 1090 1095
Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys
1100 1105 1110
Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro
1115 1120 1125
Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp
1130 1135 1140
Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser
1145 1150 1155
Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu
1160 1165 1170
Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser
1175 1180 1185
Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr
1190 1195 1200
Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser
1205 1210 1215
Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala
1220 1225 1230
Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr
1235 1240 1245
Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly
1250 1255 1260
Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His
1265 1270 1275
Lys His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser
1280 1285 1290
Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser
1295 1300 1305
Ala Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu
1310 1315 1320
Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala
1325 1330 1335
Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr
1340 1345 1350
Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile
1355 1360 1365
Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly
1370 1375 1380
Asp Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln
1385 1390 1395
Ala Lys Lys Lys Lys Ser Gly Gly Gly Gly Ser Val Asp Leu Arg
1400 1405 1410
Thr Leu Asp Val Phe Ser Gly Cys Gly Gly Leu Ser Glu Gly Phe
1415 1420 1425
His Gln Ala Gly Ile Ser Asp Thr Leu Trp Ala Ile Glu Met Trp
1430 1435 1440
Asp Pro Ala Ala Gln Ala Phe Arg Leu Asn Asn Pro Gly Ser Thr
1445 1450 1455
Val Phe Thr Glu Asp Cys Asn Ile Leu Leu Lys Leu Val Met Ala
1460 1465 1470
Gly Glu Thr Thr Asn Ser Arg Gly Gln Arg Leu Pro Gln Lys Gly
1475 1480 1485
Asp Val Glu Met Leu Cys Gly Gly Pro Pro Cys Gln Gly Phe Ser
1490 1495 1500
Gly Met Asn Arg Phe Asn Ser Arg Thr Tyr Ser Lys Phe Lys Asn
1505 1510 1515
Ser Leu Val Val Ser Phe Leu Ser Tyr Cys Asp Tyr Tyr Arg Pro
1520 1525 1530
Arg Phe Phe Leu Leu Glu Asn Val Arg Asn Phe Val Ser Phe Lys
1535 1540 1545
Arg Ser Met Val Leu Lys Leu Thr Leu Arg Cys Leu Val Arg Met
1550 1555 1560
Gly Tyr Gln Cys Thr Phe Gly Val Leu Gln Ala Gly Gln Tyr Gly
1565 1570 1575
Val Ala Gln Thr Arg Arg Arg Ala Ile Ile Leu Ala Ala Ala Pro
1580 1585 1590
Gly Glu Lys Leu Pro Leu Phe Pro Glu Pro Leu His Val Phe Ala
1595 1600 1605
Pro Arg Ala Cys Gln Leu Ser Val Val Val Asp Asp Lys Lys Phe
1610 1615 1620
Val Ser Asn Ile Thr Arg Leu Ser Ser Gly Pro Phe Arg Thr Ile
1625 1630 1635
Thr Val Arg Asp Thr Met Ser Asp Leu Pro Glu Val Arg Asn Gly
1640 1645 1650
Ala Ser Ala Leu Glu Ile Ser Tyr Asn Gly Glu Pro Gln Ser Trp
1655 1660 1665
Phe Gln Arg Gln Leu Arg Gly Ala Gln Tyr Gln Pro Ile Leu Arg
1670 1675 1680
Asp His Ile Cys Lys Asp Met Ser Ala Leu Val Ala Ala Arg Met
1685 1690 1695
Arg His Ile Pro Leu Ala Pro Gly Ser Asp Trp Arg Asp Leu Pro
1700 1705 1710
Asn Ile Glu Val Arg Leu Ser Asp Gly Thr Met Ala Arg Lys Leu
1715 1720 1725
Arg Tyr Thr His His Asp Arg Lys Asn Gly Arg Ser Ser Ser Gly
1730 1735 1740
Ala Leu Arg Gly Val Cys Ser Cys Val Glu Ala Gly Lys Ala Cys
1745 1750 1755
Asp Pro Ala Ala Arg Gln Phe Asn Thr Leu Ile Pro Trp Cys Leu
1760 1765 1770
Pro His Thr Gly Asn Arg His Asn His Trp Ala Gly Leu Tyr Gly
1775 1780 1785
Arg Leu Glu Trp Asp Gly Phe Phe Ser Thr Thr Val Thr Asn Pro
1790 1795 1800
Glu Pro Met Gly Lys Gln Gly Arg Val Leu His Pro Glu Gln His
1805 1810 1815
Arg Val Val Ser Val Arg Glu Cys Ala Arg Ser Gln Gly Phe Pro
1820 1825 1830
Asp Thr Tyr Arg Leu Phe Gly Asn Ile Leu Asp Lys His Arg Gln
1835 1840 1845
Val Gly Asn Ala Val Pro Pro Pro Leu Ala Lys Ala Ile Gly Leu
1850 1855 1860
Glu Ile Lys Leu Cys Met Leu Ala Lys Ala Arg Glu Ser Ala Ser
1865 1870 1875
Ala Lys Ile Lys Glu Glu Glu Ala Ala Lys Asp Gly Gly Gly Gly
1880 1885 1890
Ser Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys
1895 1900 1905
Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1910 1915 1920
<![CDATA[<210> 37]]>
<![CDATA[<211> 1986]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 37]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val
20 25 30
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
35 40 45
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
50 55 60
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
65 70 75 80
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
85 90 95
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
100 105 110
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
115 120 125
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
130 135 140
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
145 150 155 160
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
165 170 175
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
180 185 190
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
195 200 205
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
210 215 220
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
225 230 235 240
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
245 250 255
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
260 265 270
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
275 280 285
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
290 295 300
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
305 310 315 320
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
325 330 335
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
340 345 350
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
355 360 365
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
370 375 380
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
385 390 395 400
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
405 410 415
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
420 425 430
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
435 440 445
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
450 455 460
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
465 470 475 480
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
485 490 495
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
500 505 510
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
515 520 525
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
530 535 540
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
545 550 555 560
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
565 570 575
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
580 585 590
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
595 600 605
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
610 615 620
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
625 630 635 640
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
645 650 655
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
660 665 670
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
675 680 685
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
690 695 700
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
705 710 715 720
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
725 730 735
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
740 745 750
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
755 760 765
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
770 775 780
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
785 790 795 800
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
805 810 815
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
820 825 830
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
835 840 845
Leu Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys
850 855 860
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Ala Arg
865 870 875 880
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
885 890 895
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
900 905 910
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
915 920 925
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
930 935 940
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
945 950 955 960
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
965 970 975
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
980 985 990
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
995 1000 1005
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu
1010 1015 1020
Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile
1025 1030 1035
Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe
1040 1045 1050
Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu
1055 1060 1065
Ala Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly
1070 1075 1080
Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr
1085 1090 1095
Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys
1100 1105 1110
Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro
1115 1120 1125
Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp
1130 1135 1140
Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser
1145 1150 1155
Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu
1160 1165 1170
Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser
1175 1180 1185
Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr
1190 1195 1200
Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser
1205 1210 1215
Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala
1220 1225 1230
Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr
1235 1240 1245
Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly
1250 1255 1260
Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His
1265 1270 1275
Lys His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser
1280 1285 1290
Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser
1295 1300 1305
Ala Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu
1310 1315 1320
Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala
1325 1330 1335
Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr
1340 1345 1350
Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile
1355 1360 1365
Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly
1370 1375 1380
Asp Ser Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly
1385 1390 1395
Gly Gly Ser Gly Pro Lys Lys Lys Arg Lys Val Ala Ala Ala Gly
1400 1405 1410
Ser Asn His Asp Gln Glu Phe Asp Pro Pro Lys Val Tyr Pro Pro
1415 1420 1425
Val Pro Ala Glu Lys Arg Lys Pro Ile Arg Val Leu Ser Leu Phe
1430 1435 1440
Asp Gly Ile Ala Thr Gly Leu Leu Val Leu Lys Asp Leu Gly Ile
1445 1450 1455
Gln Val Asp Arg Tyr Ile Ala Ser Glu Val Cys Glu Asp Ser Ile
1460 1465 1470
Thr Val Gly Met Val Arg His Gln Gly Lys Ile Met Tyr Val Gly
1475 1480 1485
Asp Val Arg Ser Val Thr Gln Lys His Ile Gln Glu Trp Gly Pro
1490 1495 1500
Phe Asp Leu Val Ile Gly Gly Ser Pro Cys Asn Asp Leu Ser Ile
1505 1510 1515
Val Asn Pro Ala Arg Lys Gly Leu Tyr Glu Gly Thr Gly Arg Leu
1520 1525 1530
Phe Phe Glu Phe Tyr Arg Leu Leu His Asp Ala Arg Pro Lys Glu
1535 1540 1545
Gly Asp Asp Arg Pro Phe Phe Trp Leu Phe Glu Asn Val Val Ala
1550 1555 1560
Met Gly Val Ser Asp Lys Arg Asp Ile Ser Arg Phe Leu Glu Ser
1565 1570 1575
Asn Pro Val Met Ile Asp Ala Lys Glu Val Ser Ala Ala His Arg
1580 1585 1590
Ala Arg Tyr Phe Trp Gly Asn Leu Pro Gly Met Asn Arg Pro Leu
1595 1600 1605
Ala Ser Thr Val Asn Asp Lys Leu Glu Leu Gln Glu Cys Leu Glu
1610 1615 1620
His Gly Arg Ile Ala Lys Phe Ser Lys Val Arg Thr Ile Thr Thr
1625 1630 1635
Arg Ser Asn Ser Ile Lys Gln Gly Lys Asp Gln His Phe Pro Val
1640 1645 1650
Phe Met Asn Glu Lys Glu Asp Ile Leu Trp Cys Thr Glu Met Glu
1655 1660 1665
Arg Val Phe Gly Phe Pro Val His Tyr Thr Asp Val Ser Asn Met
1670 1675 1680
Ser Arg Leu Ala Arg Gln Arg Leu Leu Gly Arg Ser Trp Ser Val
1685 1690 1695
Pro Val Ile Arg His Leu Phe Ala Pro Leu Lys Glu Tyr Phe Ala
1700 1705 1710
Cys Val Ser Ser Gly Asn Ser Asn Ala Asn Ser Arg Gly Pro Ser
1715 1720 1725
Phe Ser Ser Gly Leu Val Pro Leu Ser Leu Arg Gly Ser His Met
1730 1735 1740
Asn Pro Leu Glu Met Phe Glu Thr Val Pro Val Trp Arg Arg Gln
1745 1750 1755
Pro Val Arg Val Leu Ser Leu Phe Glu Asp Ile Lys Lys Glu Leu
1760 1765 1770
Thr Ser Leu Gly Phe Leu Glu Ser Gly Ser Asp Pro Gly Gln Leu
1775 1780 1785
Lys His Val Val Asp Val Thr Asp Thr Val Arg Lys Asp Val Glu
1790 1795 1800
Glu Trp Gly Pro Phe Asp Leu Val Tyr Gly Ala Thr Pro Pro Leu
1805 1810 1815
Gly His Thr Cys Asp Arg Pro Pro Ser Trp Tyr Leu Phe Gln Phe
1820 1825 1830
His Arg Leu Leu Gln Tyr Ala Arg Pro Lys Pro Gly Ser Pro Arg
1835 1840 1845
Pro Phe Phe Trp Met Phe Val Asp Asn Leu Val Leu Asn Lys Glu
1850 1855 1860
Asp Leu Asp Val Ala Ser Arg Phe Leu Glu Met Glu Pro Val Thr
1865 1870 1875
Ile Pro Asp Val His Gly Gly Ser Leu Gln Asn Ala Val Arg Val
1880 1885 1890
Trp Ser Asn Ile Pro Ala Ile Arg Ser Arg His Trp Ala Leu Val
1895 1900 1905
Ser Glu Glu Glu Leu Ser Leu Leu Ala Gln Asn Lys Gln Ser Ser
1910 1915 1920
Lys Leu Ala Ala Lys Trp Pro Thr Lys Leu Val Lys Asn Cys Phe
1925 1930 1935
Leu Pro Leu Arg Glu Tyr Phe Lys Tyr Phe Ser Thr Glu Leu Thr
1940 1945 1950
Ser Ser Leu Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala
1955 1960 1965
Gly Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro
1970 1975 1980
Asp Tyr Ala
1985
<![CDATA[<210> 38]]>
<![CDATA[<211> 627]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 38]]>
ctggagcccg gcgagaaacc ctacaagtgc cccgagtgcg gcaaatcctt ctctagaagc 60
gacaaactga ccgaacatca gaggacccac accggcgaga agccttataa gtgtcccgaa 120
tgcggcaaat ccttcagcac caagaactct ctgacagaac accagagaac acataccgga 180
gagaaacctt ataaatgccc cgagtgcggc aagtccttct cccagtccgg cgatctgagg 240
agacaccaaa gaacacatac cggcgaaaag ccttacaagt gccccgagtg tggaaagagc 300
ttctccacca ccggcgctct gaccgagcac cagagaacac acaccggcga gaaaccctat 360
aaatgtcccg agtgtggcaa atccttcagc gacagcggca atctgagagt gcaccaaaga 420
acccataccg gcgaaaaacc ctacaaatgc cccgagtgcg gcaaatcctt cagccagagg 480
gcccatctgg agaggcacca aaggacacac accggagaaa agccctacaa gtgtcccgag 540
tgtggaaaaa gctttagcac aagcggcgag ctggtgaggc atcaaaggac ccacaccggc 600
gaaaagccca ccggcaaaaa gaccagc 627
<![CDATA[<210> 39]]>
<![CDATA[<211> 627]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工]]>序列
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 39]]>
ctggagcccg gcgagaagcc ctacaagtgc cccgagtgcg gaaagtcctt cagctccccc 60
gccgatctga caagacatca gagaacccat accggcgaga aaccttacaa atgccccgaa 120
tgtggcaagt cctttagcga tcccggacat ctggtgaggc accagaggac acacaccggc 180
gaaaagccct ataaatgtcc cgagtgtgga aagagctttt ctagaagcga caatctcgtg 240
agacaccaga gaacccacac cggagagaag ccttacaagt gccccgagtg cggcaaatcc 300
ttcagccaga gctcctctct ggtgaggcac caaaggaccc acaccggcga gaaaccttat 360
aagtgtcccg agtgtggcaa aagcttcagc acctcccact ctctgaccga gcatcaaaga 420
acccacaccg gcgaaaaacc ttataaatgc cccgagtgtg gcaaatcctt cagcagaaat 480
gacgctctga cagagcacca aagaacacat accggagaaa agccctacaa atgccccgag 540
tgtggaaaat ccttttctag aaacgatgct ctgaccgaac accaaagaac acacaccggc 600
gaaaagccta ccggaaaaaa gaccagc 627
<![CDATA[<210> 40]]>
<![CDATA[<211> 627]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 40]]>
ctggagcccg gcgaaaaacc ttacaagtgc cccgagtgcg gaaagagctt cagcagaagc 60
gacaaactgg tgaggcatca aaggacacat accggagaga agccctataa gtgccccgaa 120
tgtggcaaat ccttttccca gagggctcat ctggaaagac accagaggac ccataccggc 180
gaaaaaccct acaaatgtcc cgagtgtgga aagagctttt ccgatcccgg ccatctggtc 240
agacatcaga ggacacatac cggcgaaaag ccttacaagt gtcccgaatg cggaaaatcc 300
ttctccagaa gcgacaagct ggtgaggcac caaagaaccc acaccggcga aaaaccctat 360
aaatgccccg agtgcggcaa gtcctttagc cagctggccc atctgagagc ccaccagaga 420
acacacaccg gagagaagcc ttataagtgt cccgagtgcg gaaagtcctt ctctagagcc 480
gacaatctga ccgaacatca aaggacacac accggcgaga aaccttataa atgccccgag 540
tgcggaaaaa gcttttccga ctgcagagat ctggctagac accagagaac ccacaccggc 600
gagaaaccca ccggcaaaaa gaccagc 627
<![CDATA[<210> 41]]>
<![CDATA[<211> 627]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 41]]>
ctggagcccg gcgaaaagcc ttataaatgt cccgaatgcg gcaagagctt tagccacacc 60
ggccatctgc tggaacacca aaggacccat accggcgaaa agccctataa gtgccccgag 120
tgtggcaaga gcttcagcac caccggcaat ctgacagtcc atcagaggac ccacaccgga 180
gagaaaccct ataaatgccc cgagtgtgga aagtccttct ccgacaagaa ggatctgaca 240
agacaccaga ggacccatac cggcgagaaa ccctacaaat gccccgagtg cggcaaatcc 300
ttctcccaga gcggcgatct gaggagacat caaagaacac ataccggcga aaaaccctat 360
aagtgccccg aatgcggcaa gtccttcagc cagagggccc atctggaaag gcatcagagg 420
acacacaccg gcgagaagcc ttacaaatgt cccgagtgcg gaaagagctt ctctagaagc 480
gacaagctga ccgagcatca gaggacccac accggagaaa aaccttacaa gtgccccgag 540
tgcggcaaaa gcttcagcag aaccgacaca ctgagagatc accaaaggac acacaccggc 600
gagaaaccca ccggcaaaaa gaccagc 627
<![CDATA[<210> 42]]>
<![CDATA[<211> 627]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 42]]>
ctggagcccg gcgagaagcc ttataagtgc cccgagtgtg gcaagagctt tagccacacc 60
ggccatctgc tggagcatca aaggacacac accggagaaa agccctataa gtgccccgag 120
tgtggcaaat ccttcagcac ctccggcaat ctcaccgaac accagagaac acacaccgga 180
gaaaaacctt acaaatgtcc cgagtgtgga aagagctttt ccaccagcgg caatctggtg 240
agacatcaaa gaacacatac cggcgaaaaa ccctataaat gccccgagtg tggaaaatcc 300
ttctcccaac tggcccatct gagggcccac cagaggacac ataccggaga aaaaccctac 360
aaatgccccg aatgcggaaa aagcttctcc gagagaagcc atctgagaga gcaccaaagg 420
acccataccg gagaaaagcc ttacaagtgt cccgagtgcg gaaaaagctt tagcgatccc 480
ggacatctgg tgagacatca gagaacccac accggcgaaa agccttataa atgtcccgaa 540
tgtggcaagt cctttagcac ccatctggat ctgattagac accaaagaac ccacaccggc 600
gagaaaccca ccggaaaaaa gaccagc 627
<![CDATA[<210> 43]]>
<![CDATA[<211> 627]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 43]]>
ctggagcccg gcgaaaagcc ttacaaatgt cccgagtgcg gaaagtcctt cagcgacccc 60
ggcgctctgg tgagacatca aagaacacat accggcgaga aaccttataa atgccccgaa 120
tgtggaaaat ccttcagcga aagaagccat ctgagggaac accagaggac ccacaccggc 180
gaaaaacctt ataagtgccc cgaatgcgga aaaagctttt ctagaagcga tcatctgacc 240
aaccatcaga gaacacacac cggcgaaaag ccctataaat gtcccgagtg tggcaaatcc 300
tttagcgaga ggtcccatct gagagagcac cagaggacac ataccggaga gaagccctac 360
aagtgccccg agtgtggcaa gagctttagc agaagcgacc atctgaccaa tcatcaaagg 420
acccacaccg gagagaagcc ttacaagtgt cccgagtgcg gaaagtcctt ttccgatccc 480
ggccacctcg tgaggcacca aagaacccat accggcgaga aaccctacaa atgccccgag 540
tgtggaaaga gcttctccag aagcgacaag ctggtgaggc atcagaggac acacaccggc 600
gaaaaaccca ccggcaagaa aaccagc 627
<![CDATA[<210> 44]]>
<![CDATA[<211> 627]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 44]]>
ctggagcccg gagagaagcc ctacaaatgc cccgagtgtg gaaagagctt ctctagaaat 60
gacgctctga cagaacacca gaggacccat accggcgaga aaccttacaa atgccccgag 120
tgcggaaaaa gctttagcga ttgcagagat ctggctagac atcagagaac acacaccggc 180
gagaagccct ataagtgccc cgaatgcggc aagagcttta gcgaccccgg ccatctggtg 240
agacatcaaa ggacacatac cggagaaaaa ccttacaagt gccccgagtg cggaaagtcc 300
ttctcccaga gcggccatct caccgagcat caaaggaccc acaccggcga aaagccttat 360
aaatgtcccg aatgtggcaa gtccttctct agagaggata atctgcacac ccatcagagg 420
acccacaccg gcgaaaagcc ttataaatgc cccgaatgtg gaaagtcctt ttccaccaag 480
aactctctga ccgagcatca gaggacacac accggagaga aaccctataa atgtcccgag 540
tgtggcaaga gcttcagcag agccgacaat ctgacagagc accaaagaac acataccggc 600
gaaaagccca ccggcaaaaa gaccagc 627
<![CDATA[<210> 45]]>
<![CDATA[<211> 627]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 45]]>
ctggagcccg gcgagaaacc ctacaagtgc cccgagtgtg gcaaatcctt ctctagatcc 60
gacaaactga ccgaacatca gaggacccat accggcgaaa aaccttataa atgtcccgag 120
tgcggaaagt ccttctctag aagggacgag ctgaacgtgc atcagagaac acataccggc 180
gagaagccct ataaatgccc cgaatgcggc aaaagcttct ctagaagcga tcatctgacc 240
aaccaccaga gaacccatac cggagaaaag ccttacaagt gtcccgaatg tggaaaatcc 300
ttcagctccc ccgccgatct gaccagacac caaaggaccc acaccggcga gaagccctat 360
aaatgccccg agtgcggcaa gagcttttcc agatccgacc atctgaccaa tcatcaaaga 420
acccacaccg gcgaaaagcc ttataaatgt cccgagtgcg gcaaatcctt ttccagcaag 480
aaggctctga ccgagcatca aaggacccat accggcgaga agccttacaa atgccccgag 540
tgtggaaagt cctttagcac ccatctggat ctgattagac accagaggac acacaccgga 600
gagaaaccca ccggcaaaaa gaccagc 627
<![CDATA[<210> 46]]>
<![CDATA[<211> 627]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 46]]>
ctggagcccg gcgagaaacc ttacaaatgc cccgagtgcg gcaagagctt cagcagaagc 60
gacgatctgg tgaggcacca aagaacccac accggcgaaa aaccttacaa gtgtcccgaa 120
tgcggaaagt ccttcagcag agaggacaat ctgcacaccc accagagaac acacaccgga 180
gaaaagcctt acaagtgccc cgaatgcggc aaatcctttt ctagaagcga tcatctgacc 240
acccaccaaa gaacacatac cggcgagaag ccttacaaat gtcccgagtg cggaaagtcc 300
ttctcccaga gagccaatct gagggctcat caaaggaccc ataccggcga aaagccctac 360
aaatgccccg agtgcggaaa atccttcagc cagctggccc atctgagagc ccaccaaagg 420
acacacaccg gagagaaacc ctataagtgc cccgagtgtg gaaaaagctt ttcccagagg 480
gccaatctga gggcccatca gaggacccat accggagaga agccttataa atgtcccgag 540
tgcggaaaaa gcttcagcga gaggagccat ctgagggaac atcaaagaac ccacaccggc 600
gaaaaaccca ccggaaaaaa gaccagc 627
<![CDATA[<210> 47]]>
<![CDATA[<211> 627]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223]]>> 人工序列之描述:合成聚核苷酸]]>
<br/>
<br/><![CDATA[<400> 47]]>
<br/><![CDATA[ctggagcccg gcgagaaacc ctacaagtgc cccgagtgtg gaaaaagctt tagccaaagc 60
ggcgatctga ggagacacca aagaacacac accggcgaga agccctacaa atgtcccgag 120
tgcggaaaga gcttcagcca gagcggccat ctgaccgagc atcagagaac ccataccggc 180
gaaaaacctt ataagtgccc cgagtgtgga aagtccttct ccgagagatc ccatctgaga 240
gaacaccaga ggacacacac cggcgaaaaa ccttataagt gtcccgagtg cggaaagtcc 300
ttcagcgatc ccggccatct ggtgagacat caaaggacac ataccggcga aaaaccttat 360
aagtgtcccg aatgcggcaa gagctttagc agaaacgaca cactcaccga acaccagagg 420
acccacaccg gcgagaaacc ctacaaatgc cccgagtgcg gcaaatcctt ttctagagcc 480
gacaatctga ccgaacacca gaggacccat accggagaaa agccttacaa atgtcccgag 540
tgtggcaaat ccttctccac ccatctggat ctgattagac accaaagaac acataccgga 600
gaaaagccca ccggaaaaaa gaccagc 627
<![CDATA[<210> 48]]>
<![CDATA[<211> 627]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 48]]>
ctggagcccg gcgaaaaacc ctataagtgc cccgaatgtg gaaagagctt cagccatacc 60
ggccatctgc tggaacacca aaggacacac accggcgaga aaccttacaa gtgtcccgag 120
tgcggaaaaa gcttctcctc caaaaaggct ctcaccgagc accagagaac acataccggc 180
gaaaagcctt ataagtgccc cgagtgtggc aaatcctttt ccgactgtag agatctggcc 240
agacatcaaa gaacccacac cggagagaaa ccttataaat gccccgagtg cggcaagtcc 300
tttagccata ccggccatct gctggagcac cagaggaccc ataccggcga gaagccttac 360
aaatgccccg agtgcggcaa aagcttcagc agaaatgacg ctctgaccga gcatcaaagg 420
acccataccg gcgaaaagcc ctacaagtgt cccgagtgtg gaaagtcctt ctcccagagc 480
ggcgatctga ggagacacca gagaacacac accggcgaga aaccctataa atgtcccgag 540
tgcggaaaga gctttagcga cagcggcaat ctgagggtgc atcaaagaac acacaccggc 600
gaaaaaccca ccggaaaaaa gacaagc 627
<![CDATA[<210> 49]]>
<![CDATA[<211> 538]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 49]]>
caccggcgaa aagccttata agtgccccga gtgcggcaag tccttctcta gaagcgatca 60
cctcaccaat catcagagga cacataccgg agagaagccc tataagtgcc ccgagtgcgg 120
caagagcttt agccagctgg ctcatctgag agctcaccaa agaacccata ccggcgagaa 180
gccttacaaa tgccccgagt gtggaaaatc cttttcccag tccagcaacc tcgtcagaca 240
tcaaaggacc cataccggcg aaaagcctta caagtgtccc gagtgcggaa agtccttctc 300
tagatccgac aacctcgtga ggcaccagag aacccacacc ggcgagaaac cttacaaatg 360
tcccgagtgt ggcaaaagct tttctagaag cgacgagctg gtgagacatc aaagaaccca 420
taccggcgaa aaaccttata agtgtcccga gtgcggcaaa tcctttagcc agctggccca 480
tctgagggcc caccagagaa cacataccgg cgaaaaaccc accggcaaaa agacaagc 538
<![CDATA[<210> 50]]>
<![CDATA[<211> 4101]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 50]]>
gacaagaagt acagcatcgg cctggccatc ggcaccaaca gcgtgggctg ggccgtgatc 60
accgacgagt acaaggtgcc cagcaagaag ttcaaggtgc tgggcaacac cgaccggcac 120
agcatcaaga agaacctgat cggcgccctg ctgttcgaca gcggcgagac cgccgaggcc 180
acccggctga agcggaccgc ccggcggcgg tacacccggc ggaagaaccg gatctgctac 240
ctgcaggaga tcttcagcaa cgagatggcc aaggtggacg acagcttctt ccaccggctg 300
gaggagagct tcctggtgga ggaggacaag aagcacgagc ggcaccccat cttcggcaac 360
atcgtggacg aggtggccta ccacgagaag taccccacca tctaccacct gcggaagaag 420
ctggtggaca gcaccgacaa ggccgacctg cggctgatct acctggccct ggcccacatg 480
atcaagttcc ggggccactt cctgatcgag ggcgacctga accccgacaa cagcgacgtg 540
gacaagctgt tcatccagct ggtgcagacc tacaaccagc tgttcgagga gaaccccatc 600
aacgccagcg gcgtggacgc caaggccatc ctgagcgccc ggctgagcaa gagccggcgg 660
ctggagaacc tgatcgccca gctgcccggc gagaagaaga acggcctgtt cggcaacctg 720
atcgccctga gcctgggcct gacccccaac ttcaagagca acttcgacct ggccgaggac 780
gccaagctgc agctgagcaa ggacacctac gacgacgacc tggacaacct gctggcccag 840
atcggcgacc agtacgccga cctgttcctg gccgccaaga acctgagcga cgccatcctg 900
ctgagcgaca tcctgcgggt gaacaccgag atcaccaagg cccccctgag cgccagcatg 960
atcaagcggt acgacgagca ccaccaggac ctgaccctgc tgaaggccct ggtgcggcag 1020
cagctgcccg agaagtacaa ggagatcttc ttcgaccaga gcaagaacgg ctacgccggc 1080
tacatcgacg gcggcgccag ccaggaggag ttctacaagt tcatcaagcc catcctggag 1140
aagatggacg gcaccgagga gctgctggtg aagctgaacc gggaggacct gctgcggaag 1200
cagcggacct tcgacaacgg cagcatcccc caccagatcc acctgggcga gctgcacgcc 1260
atcctgcggc ggcaggagga cttctacccc ttcctgaagg acaaccggga gaagatcgag 1320
aagatcctga ccttccggat cccctactac gtgggccccc tggcccgggg caacagccgg 1380
ttcgcctgga tgacccggaa atccgaggag accatcaccc cctggaactt cgaggaggtg 1440
gtggacaagg gcgccagcgc ccagagcttc atcgagcgga tgaccaactt cgacaagaac 1500
ctgcccaacg agaaggtgct gcccaagcac agcctgctgt acgagtactt caccgtgtac 1560
aacgagctga ccaaggtgaa gtacgtgacc gagggcatgc ggaagcccgc cttcctgagc 1620
ggcgagcaga agaaggccat cgtggacctg ctgttcaaga ccaaccggaa ggtgaccgtg 1680
aagcagctga aggaggacta cttcaagaag atcgagtgct tcgacagcgt ggagatcagc 1740
ggcgtggagg accggttcaa cgccagcctg ggcacctacc acgacctgct gaagatcatc 1800
aaggacaagg acttcctgga caacgaggag aacgaggaca tcctggagga catcgtgctg 1860
accctgaccc tgttcgagga ccgggagatg atcgaggagc ggctgaaaac ctacgcccac 1920
ctgttcgacg acaaggtgat gaagcagctg aagcggcggc ggtacaccgg ctggggccgg 1980
ctgagccgga agctgatcaa cggcatccgg gacaagcaga gcggcaagac catcctggac 2040
ttcctgaaat ccgacggctt cgccaaccgg aacttcatgc agctgatcca cgacgacagc 2100
ctgaccttca aggaggacat ccagaaggcc caggtgagcg gccagggcga cagcctgcac 2160
gagcacatcg ccaacctggc cggcagcccc gccatcaaga agggcatcct gcagaccgtg 2220
aaggtggtgg acgagctggt gaaggtgatg ggccggcaca agcccgagaa catcgtgatc 2280
gagatggccc gggagaacca gaccacccag aagggccaga agaacagccg ggagcggatg 2340
aagcggatcg aggagggcat caaggagctg ggcagccaga tcctgaagga gcaccccgtg 2400
gagaacaccc agctgcagaa cgagaagctg tacctgtact acctgcagaa cggccgggac 2460
atgtacgtgg accaggagct ggacatcaac cggctgagcg actacgacgt ggccgccatc 2520
gtgccccaga gcttcctgaa ggacgacagc atcgacaaca aggtgctgac ccggagcgac 2580
aaggcccggg gcaagagcga caacgtgccc agcgaggagg tggtgaagaa gatgaagaac 2640
tactggcggc agctgctgaa cgccaagctg atcacccagc ggaagttcga caacctgacc 2700
aaggccgagc ggggcggcct gagcgagctg gacaaggccg gcttcatcaa gcggcagctg 2760
gtggagaccc ggcagatcac caagcacgtg gcccagatcc tggacagccg gatgaacacc 2820
aagtacgacg agaacgacaa gctgatccgg gaggtgaagg tgatcaccct gaaatccaag 2880
ctggtgagcg acttccggaa ggacttccag ttctacaagg tgcgggagat caacaactac 2940
caccacgccc acgacgccta cctgaacgcc gtggtgggca ccgccctgat caagaagtac 3000
cccaagctgg agagcgagtt cgtgtacggc gactacaagg tgtacgacgt gcggaagatg 3060
atcgccaaga gcgagcagga gatcggcaag gccaccgcca agtacttctt ctacagcaac 3120
atcatgaact tcttcaagac cgagatcacc ctggccaacg gcgagatccg gaagcggccc 3180
ctgatcgaga ccaacggcga gaccggcgag atcgtgtggg acaagggccg ggacttcgcc 3240
accgtgcgga aggtgctgag catgccccag gtgaacatcg tgaagaaaac cgaggtgcag 3300
accggcggct tcagcaagga gagcatcctg cccaagcgga acagcgacaa gctgatcgcc 3360
cggaagaagg actgggaccc caagaagtac ggcggcttcg acagccccac cgtggcctac 3420
agcgtgctgg tggtggccaa ggtggagaag ggcaagagca agaagctgaa atccgtgaag 3480
gagctgctgg gcatcaccat catggagcgg agcagcttcg agaagaaccc catcgacttc 3540
ctggaggcca agggctacaa ggaggtgaag aaggacctga tcatcaagct gcccaagtac 3600
agcctgttcg agctggagaa cggccggaag cggatgctgg ccagcgccgg cgagctgcag 3660
aagggcaacg agctggccct gcccagcaag tacgtgaact tcctgtacct ggccagccac 3720
tacgagaagc tgaagggcag ccccgaggac aacgagcaga agcagctgtt cgtggagcag 3780
cacaagcact acctggacga gatcatcgag cagatcagcg agttcagcaa gcgggtgatc 3840
ctggccgacg ccaacctgga caaggtgctg agcgcctaca acaagcaccg ggacaagccc 3900
atccgggagc aggccgagaa catcatccac ctgttcaccc tgaccaacct gggcgccccc 3960
gccgccttca agtacttcga caccaccatc gaccggaagc ggtacaccag caccaaggag 4020
gtgctggacg ccaccctgat ccaccagagc atcaccggcc tgtacgagac ccggatcgac 4080
ctgagccagc tgggcggcga c 4101
<![CDATA[<210> 51]]>
<![CDATA[<211> 288]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 51]]>
gacgccaaga gcctgaccgc ctggagccgg accctggtga ccttcaagga cgtgttcgtg 60
gacttcaccc gggaggagtg gaagctgctg gacaccgccc agcagatcct gtaccggaac 120
gtgatgctgg agaactacaa gaacctggtg agcctgggct accagctgac caagcccgac 180
gtgatcctgc ggctggagaa gggcgaggag ccctggctgg tggagcggga gatccaccag 240
gagacccacc ccgacagcga gaccgccttc gagatcaaga gcagcgtg 288
<![CDATA[<210> 52]]>
<![CDATA[<211> 1155]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 52]]>
agcaaggtgg agaacaagac caagaagctg cgggtgttcg aggccttcgc cggcatcggc 60
gcccagcgga aggccctgga gaaggtgcgg aaggacgagt acgagatcgt gggcctggcc 120
gagtggtacg tgcccgccat cgtgatgtac caggccatcc acaacaactt ccacaccaag 180
ctggagtaca agagcgtgag ccgggaggag atgatcgact acctggagaa caagaccctg 240
agctggaaca gcaagaaccc cgtgagcaac ggctactgga agcggaagaa ggacgacgag 300
ctgaagatca tctacaacgc catcaagctg agcgagaagg agggcaacat cttcgacatc 360
cgggacctgt acaagcggac cctgaagaac atcgacctgc tgacctacag cttcccctgc 420
caggacctga gccagcaggg catccagaag ggcatgaagc ggggcagcgg cacccggagc 480
ggcctgctgt gggagatcga gcgggccctg gacagcaccg agaagaacga cctgcccaag 540
tacctgctga tggagaacgt gggcgccctg ctgcacaaga agaacgagga ggagctgaac 600
cagtggaagc agaagctgga gagcctgggc taccagaaca gcatcgaggt gctgaacgcc 660
gccgacttcg gcagcagcca ggcccggcgg cgggtgttca tgatcagcac cctgaacgag 720
ttcgtggagc tgcccaaggg cgacaagaag cccaagagca tcaagaaggt gctgaacaag 780
atcgtgagcg agaaggacat cctgaacaac ctgctgaagt acaacctgac cgagttcaag 840
aaaaccaaga gcaacatcaa caaggccagc ctgatcggct acagcaagtt caacagcgag 900
ggctacgtgt acgaccccga gttcaccggc cccaccctga ccgccagcgg cgccaacagc 960
cggatcaaga tcaaggacgg cagcaacatc cggaagatga acagcgacga gaccttcctg 1020
tacatcggct tcgacagcca ggacggcaag cgggtgaacg agatcgagtt cctgaccgag 1080
aaccagaaga tcttcgtgtg cggcaacagc atcagcgtgg aggtgctgga ggccatcatc 1140
gacaagatcg gcggc 1155
<![CDATA[<210> 53]]>
<![CDATA[<211> 1440]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 53]]>
gtggatctga ggacactcga cgtgtttagc ggatgcggcg gactctccga aggcttccac 60
caagccggaa tttccgacac actctgggcc attgagatgt gggaccccgc cgctcaagcc 120
ttcagactga ataatcccgg ctccaccgtg ttcaccgagg actgcaacat tctgctgaag 180
ctggtgatgg ctggcgaaac caccaactct agaggccaga ggctgcccca gaagggagat 240
gtggaaatgc tctgtggagg ccctccttgc caaggcttct ccggcatgaa caggttcaac 300
tctagaacat acagcaagtt caagaactct ctggtcgtga gctttctgag ctactgcgac 360
tactatagac ctaggttctt tctgctggag aacgtgagaa atttcgtgtc cttcaagagg 420
agcatggtgc tgaagctgac actgaggtgt ctggtgagga tgggctacca gtgcacattc 480
ggagtgctgc aagctggcca gtacggcgtg gcccagacca gaaggagggc catcattctg 540
gctgctgccc ccggcgagaa actccctctg ttccccgagc ccctccacgt gttcgcccct 600
agagcttgcc agctgagcgt ggtggtcgac gataagaagt tcgtgagcaa catcacaagg 660
ctgtccagcg gacccttcag aaccattacc gtgagggata ccatgtccga cctccccgag 720
gtgaggaatg gcgccagcgc tctggagatt tcctacaacg gcgaacctca gagctggttc 780
caaaggcagc tgagaggcgc tcagtatcag cccattctga gggaccacat ctgcaaagat 840
atgagcgctc tggtggccgc tagaatgaga catattcctc tggcccccgg cagcgactgg 900
agagatctgc ccaatattga ggtgagactc agcgacggaa caatggctag aaaactgagg 960
tacacccatc atgatagaaa gaacggaagg agcagcagcg gcgctctgag aggagtgtgt 1020
agctgcgtgg aagctggcaa ggcttgcgat cccgccgcta ggcagttcaa taccctcatc 1080
ccttggtgtc tgcctcacac cggcaacaga cacaatcatt gggctggact gtatggaagg 1140
ctcgaatggg acggcttttt cagcaccacc gtgaccaatc ccgaacctat gggcaagcaa 1200
ggaagggtgc tccaccccga gcagcataga gtcgtgtccg tgagagaatg cgctagaagc 1260
caaggcttcc ccgacaccta tagactgttc ggcaacattc tggataagca cagacaagtg 1320
ggaaatgctg tccctcctcc tctggccaag gctatcggac tggagatcaa gctgtgtatg 1380
ctcgccaaag ctagggagag cgcttccgcc aagattaagg aggaggaggc cgccaaggac 1440
<![CDATA[<210> 54]]>
<![CDATA[<211> 1626]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 54]]>
aaccacgacc aggagttcga cccccccaag gtgtaccccc ccgtgcccgc cgagaagcgg 60
aagcccatcc gggtgctgag cctgttcgac ggcatcgcca ccggcctgct ggtgctgaag 120
gacctgggca tccaggtgga ccggtacatc gccagcgagg tgtgcgagga cagcatcacc 180
gtgggcatgg tgcggcacca gggcaagatc atgtacgtgg gcgacgtgcg gagcgtgacc 240
cagaagcaca tccaggagtg gggccccttc gacctggtga tcggcggcag cccctgcaac 300
gacctgagca tcgtgaaccc cgcccggaag ggcctgtacg agggcaccgg ccggctgttc 360
ttcgagttct accggctgct gcacgacgcc cggcccaagg agggcgacga ccggcccttc 420
ttctggctgt tcgagaacgt ggtggccatg ggcgtgagcg acaagcggga catcagccgg 480
ttcctggaga gcaaccccgt gatgatcgac gccaaggagg tgagcgccgc ccaccgggcc 540
cggtacttct ggggcaacct gcccggcatg aaccggcccc tggccagcac cgtgaacgac 600
aagctggagc tgcaggagtg cctggagcac ggccggatcg ccaagttcag caaggtgcgg 660
accatcacca cccggagcaa cagcatcaag cagggcaagg accagcactt ccccgtgttc 720
atgaacgaga aggaggacat cctgtggtgc accgagatgg agcgggtgtt cggcttcccc 780
gtgcactaca ccgacgtgag caacatgagc cggctggccc ggcagcggct gctgggccgg 840
agctggagcg tgcccgtgat ccggcacctg ttcgcccccc tgaaggagta cttcgcctgc 900
gtgagcagcg gcaacagcaa cgccaacagc cggggcccca gcttcagcag cggcctggtg 960
cccctgagcc tgcggggcag ccacatgaat cctctggaga tgttcgagac agtgcccgtg 1020
tggagaaggc aacccgtgag ggtgctgagc ctcttcgagg acattaagaa ggagctgacc 1080
tctctgggct ttctggaatc cggcagcgac cccggccagc tgaaacacgt ggtggacgtg 1140
accgacacag tgaggaagga cgtggaagag tggggcccct ttgacctcgt gtatggagcc 1200
acacctcctc tcggccacac atgcgatagg cctcccagct ggtatctctt ccagttccac 1260
agactgctcc agtacgccag acctaagccc ggcagcccca gacccttctt ctggatgttc 1320
gtggacaatc tggtgctgaa caaggaggat ctggatgtgg ccagcagatt tctggagatg 1380
gaacccgtga caatccccga cgtgcatggc ggctctctgc agaacgccgt gagagtgtgg 1440
tccaacatcc ccgccattag aagcagacac tgggctctgg tgagcgagga ggaactgtct 1500
ctgctggccc agaataagca gtcctccaag ctggccgcca agtggcccac caagctggtg 1560
aagaactgct ttctgcctct gagggagtat ttcaagtatt tcagcaccga actgaccagc 1620
agcctg 1626
<![CDATA[<210> 55]]>
<![CDATA[<211> 1345]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 55]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgagaa accctacaag tgccccgagt gcggcaaatc cttctctaga agcgacaaac 180
tgaccgaaca tcagaggacc cacaccggcg agaagcctta taagtgtccc gaatgcggca 240
aatccttcag caccaagaac tctctgacag aacaccagag aacacatacc ggagagaaac 300
cttataaatg ccccgagtgc ggcaagtcct tctcccagtc cggcgatctg aggagacacc 360
aaagaacaca taccggcgaa aagccttaca agtgccccga gtgtggaaag agcttctcca 420
ccaccggcgc tctgaccgag caccagagaa cacacaccgg cgagaaaccc tataaatgtc 480
ccgagtgtgg caaatccttc agcgacagcg gcaatctgag agtgcaccaa agaacccata 540
ccggcgaaaa accctacaaa tgccccgagt gcggcaaatc cttcagccag agggcccatc 600
tggagaggca ccaaaggaca cacaccggag aaaagcccta caagtgtccc gagtgtggaa 660
aaagctttag cacaagcggc gagctggtga ggcatcaaag gacccacacc ggcgaaaagc 720
ccaccggcaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gacgccaaga 780
gcctgaccgc ctggagccgg accctggtga ccttcaagga cgtgttcgtg gacttcaccc 840
gggaggagtg gaagctgctg gacaccgccc agcagatcct gtaccggaac gtgatgctgg 900
agaactacaa gaacctggtg agcctgggct accagctgac caagcccgac gtgatcctgc 960
ggctggagaa gggcgaggag ccctggctgg tggagcggga gatccaccag gagacccacc 1020
ccgacagcga gaccgccttc gagatcaaga gcagcgtgag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1320
aaaaaaaaaa aaaaaaaaaa aaaaa 1345
<![CDATA[<210> 56]]>
<![CDATA[<211> 1345]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 56]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgagaa gccctacaag tgccccgagt gcggaaagtc cttcagctcc cccgccgatc 180
tgacaagaca tcagagaacc cataccggcg agaaacctta caaatgcccc gaatgtggca 240
agtcctttag cgatcccgga catctggtga ggcaccagag gacacacacc ggcgaaaagc 300
cctataaatg tcccgagtgt ggaaagagct tttctagaag cgacaatctc gtgagacacc 360
agagaaccca caccggagag aagccttaca agtgccccga gtgcggcaaa tccttcagcc 420
agagctcctc tctggtgagg caccaaagga cccacaccgg cgagaaacct tataagtgtc 480
ccgagtgtgg caaaagcttc agcacctccc actctctgac cgagcatcaa agaacccaca 540
ccggcgaaaa accttataaa tgccccgagt gtggcaaatc cttcagcaga aatgacgctc 600
tgacagagca ccaaagaaca cataccggag aaaagcccta caaatgcccc gagtgtggaa 660
aatccttttc tagaaacgat gctctgaccg aacaccaaag aacacacacc ggcgaaaagc 720
ctaccggaaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gacgccaaga 780
gcctgaccgc ctggagccgg accctggtga ccttcaagga cgtgttcgtg gacttcaccc 840
gggaggagtg gaagctgctg gacaccgccc agcagatcct gtaccggaac gtgatgctgg 900
agaactacaa gaacctggtg agcctgggct accagctgac caagcccgac gtgatcctgc 960
ggctggagaa gggcgaggag ccctggctgg tggagcggga gatccaccag gagacccacc 1020
ccgacagcga gaccgccttc gagatcaaga gcagcgtgag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1320
aaaaaaaaaa aaaaaaaaaa aaaaa 1345
<![CDATA[<210> 57]]>
<![CDATA[<211> 1345]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 57]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgaaaa accttacaag tgccccgagt gcggaaagag cttcagcaga agcgacaaac 180
tggtgaggca tcaaaggaca cataccggag agaagcccta taagtgcccc gaatgtggca 240
aatccttttc ccagagggct catctggaaa gacaccagag gacccatacc ggcgaaaaac 300
cctacaaatg tcccgagtgt ggaaagagct tttccgatcc cggccatctg gtcagacatc 360
agaggacaca taccggcgaa aagccttaca agtgtcccga atgcggaaaa tccttctcca 420
gaagcgacaa gctggtgagg caccaaagaa cccacaccgg cgaaaaaccc tataaatgcc 480
ccgagtgcgg caagtccttt agccagctgg cccatctgag agcccaccag agaacacaca 540
ccggagagaa gccttataag tgtcccgagt gcggaaagtc cttctctaga gccgacaatc 600
tgaccgaaca tcaaaggaca cacaccggcg agaaacctta taaatgcccc gagtgcggaa 660
aaagcttttc cgactgcaga gatctggcta gacaccagag aacccacacc ggcgagaaac 720
ccaccggcaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gacgccaaga 780
gcctgaccgc ctggagccgg accctggtga ccttcaagga cgtgttcgtg gacttcaccc 840
gggaggagtg gaagctgctg gacaccgccc agcagatcct gtaccggaac gtgatgctgg 900
agaactacaa gaacctggtg agcctgggct accagctgac caagcccgac gtgatcctgc 960
ggctggagaa gggcgaggag ccctggctgg tggagcggga gatccaccag gagacccacc 1020
ccgacagcga gaccgccttc gagatcaaga gcagcgtgag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1320
aaaaaaaaaa aaaaaaaaaa aaaaa 1345
<![CDATA[<210> 58]]>
<![CDATA[<211> 1345]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223]]>> 人工序列之描述:合成聚核苷酸]]>
<br/>
<br/><![CDATA[<400> 58]]>
<br/><![CDATA[aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgaaaa gccttataaa tgtcccgaat gcggcaagag ctttagccac accggccatc 180
tgctggaaca ccaaaggacc cataccggcg aaaagcccta taagtgcccc gagtgtggca 240
agagcttcag caccaccggc aatctgacag tccatcagag gacccacacc ggagagaaac 300
cctataaatg ccccgagtgt ggaaagtcct tctccgacaa gaaggatctg acaagacacc 360
agaggaccca taccggcgag aaaccctaca aatgccccga gtgcggcaaa tccttctccc 420
agagcggcga tctgaggaga catcaaagaa cacataccgg cgaaaaaccc tataagtgcc 480
ccgaatgcgg caagtccttc agccagaggg cccatctgga aaggcatcag aggacacaca 540
ccggcgagaa gccttacaaa tgtcccgagt gcggaaagag cttctctaga agcgacaagc 600
tgaccgagca tcagaggacc cacaccggag aaaaacctta caagtgcccc gagtgcggca 660
aaagcttcag cagaaccgac acactgagag atcaccaaag gacacacacc ggcgagaaac 720
ccaccggcaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gacgccaaga 780
gcctgaccgc ctggagccgg accctggtga ccttcaagga cgtgttcgtg gacttcaccc 840
gggaggagtg gaagctgctg gacaccgccc agcagatcct gtaccggaac gtgatgctgg 900
agaactacaa gaacctggtg agcctgggct accagctgac caagcccgac gtgatcctgc 960
ggctggagaa gggcgaggag ccctggctgg tggagcggga gatccaccag gagacccacc 1020
ccgacagcga gaccgccttc gagatcaaga gcagcgtgag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1320
aaaaaaaaaa aaaaaaaaaa aaaaa 1345
<![CDATA[<210> 59]]>
<![CDATA[<211> 1345]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 59]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgagaa gccttataag tgccccgagt gtggcaagag ctttagccac accggccatc 180
tgctggagca tcaaaggaca cacaccggag aaaagcccta taagtgcccc gagtgtggca 240
aatccttcag cacctccggc aatctcaccg aacaccagag aacacacacc ggagaaaaac 300
cttacaaatg tcccgagtgt ggaaagagct tttccaccag cggcaatctg gtgagacatc 360
aaagaacaca taccggcgaa aaaccctata aatgccccga gtgtggaaaa tccttctccc 420
aactggccca tctgagggcc caccagagga cacataccgg agaaaaaccc tacaaatgcc 480
ccgaatgcgg aaaaagcttc tccgagagaa gccatctgag agagcaccaa aggacccata 540
ccggagaaaa gccttacaag tgtcccgagt gcggaaaaag ctttagcgat cccggacatc 600
tggtgagaca tcagagaacc cacaccggcg aaaagcctta taaatgtccc gaatgtggca 660
agtcctttag cacccatctg gatctgatta gacaccaaag aacccacacc ggcgagaaac 720
ccaccggaaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gacgccaaga 780
gcctgaccgc ctggagccgg accctggtga ccttcaagga cgtgttcgtg gacttcaccc 840
gggaggagtg gaagctgctg gacaccgccc agcagatcct gtaccggaac gtgatgctgg 900
agaactacaa gaacctggtg agcctgggct accagctgac caagcccgac gtgatcctgc 960
ggctggagaa gggcgaggag ccctggctgg tggagcggga gatccaccag gagacccacc 1020
ccgacagcga gaccgccttc gagatcaaga gcagcgtgag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1320
aaaaaaaaaa aaaaaaaaaa aaaaa 1345
<![CDATA[<210> 60]]>
<![CDATA[<211> 1345]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 60]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgaaaa gccttacaaa tgtcccgagt gcggaaagtc cttcagcgac cccggcgctc 180
tggtgagaca tcaaagaaca cataccggcg agaaacctta taaatgcccc gaatgtggaa 240
aatccttcag cgaaagaagc catctgaggg aacaccagag gacccacacc ggcgaaaaac 300
cttataagtg ccccgaatgc ggaaaaagct tttctagaag cgatcatctg accaaccatc 360
agagaacaca caccggcgaa aagccctata aatgtcccga gtgtggcaaa tcctttagcg 420
agaggtccca tctgagagag caccagagga cacataccgg agagaagccc tacaagtgcc 480
ccgagtgtgg caagagcttt agcagaagcg accatctgac caatcatcaa aggacccaca 540
ccggagagaa gccttacaag tgtcccgagt gcggaaagtc cttttccgat cccggccacc 600
tcgtgaggca ccaaagaacc cataccggcg agaaacccta caaatgcccc gagtgtggaa 660
agagcttctc cagaagcgac aagctggtga ggcatcagag gacacacacc ggcgaaaaac 720
ccaccggcaa gaaaaccagc gctagcggca gcggcggcgg cagcggcggc gacgccaaga 780
gcctgaccgc ctggagccgg accctggtga ccttcaagga cgtgttcgtg gacttcaccc 840
gggaggagtg gaagctgctg gacaccgccc agcagatcct gtaccggaac gtgatgctgg 900
agaactacaa gaacctggtg agcctgggct accagctgac caagcccgac gtgatcctgc 960
ggctggagaa gggcgaggag ccctggctgg tggagcggga gatccaccag gagacccacc 1020
ccgacagcga gaccgccttc gagatcaaga gcagcgtgag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1320
aaaaaaaaaa aaaaaaaaaa aaaaa 1345
<![CDATA[<210> 61]]>
<![CDATA[<211> 2227]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 61]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggagagaa gccctacaaa tgccccgagt gtggaaagag cttctctaga aatgacgctc 180
tgacagaaca ccagaggacc cataccggcg agaaacctta caaatgcccc gagtgcggaa 240
aaagctttag cgattgcaga gatctggcta gacatcagag aacacacacc ggcgagaagc 300
cctataagtg ccccgaatgc ggcaagagct ttagcgaccc cggccatctg gtgagacatc 360
aaaggacaca taccggagaa aaaccttaca agtgccccga gtgcggaaag tccttctccc 420
agagcggcca tctcaccgag catcaaagga cccacaccgg cgaaaagcct tataaatgtc 480
ccgaatgtgg caagtccttc tctagagagg ataatctgca cacccatcag aggacccaca 540
ccggcgaaaa gccttataaa tgccccgaat gtggaaagtc cttttccacc aagaactctc 600
tgaccgagca tcagaggaca cacaccggag agaaacccta taaatgtccc gagtgtggca 660
agagcttcag cagagccgac aatctgacag agcaccaaag aacacatacc ggcgaaaagc 720
ccaccggcaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gcccgggaca 780
gcaaggtgga gaacaagacc aagaagctgc gggtgttcga ggccttcgcc ggcatcggcg 840
cccagcggaa ggccctggag aaggtgcgga aggacgagta cgagatcgtg ggcctggccg 900
agtggtacgt gcccgccatc gtgatgtacc aggccatcca caacaacttc cacaccaagc 960
tggagtacaa gagcgtgagc cgggaggaga tgatcgacta cctggagaac aagaccctga 1020
gctggaacag caagaacccc gtgagcaacg gctactggaa gcggaagaag gacgacgagc 1080
tgaagatcat ctacaacgcc atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctgta caagcggacc ctgaagaaca tcgacctgct gacctacagc ttcccctgcc 1200
aggacctgag ccagcagggc atccagaagg gcatgaagcg gggcagcggc acccggagcg 1260
gcctgctgtg ggagatcgag cgggccctgg acagcaccga gaagaacgac ctgcccaagt 1320
acctgctgat ggagaacgtg ggcgccctgc tgcacaagaa gaacgaggag gagctgaacc 1380
agtggaagca gaagctggag agcctgggct accagaacag catcgaggtg ctgaacgccg 1440
ccgacttcgg cagcagccag gcccggcggc gggtgttcat gatcagcacc ctgaacgagt 1500
tcgtggagct gcccaagggc gacaagaagc ccaagagcat caagaaggtg ctgaacaaga 1560
tcgtgagcga gaaggacatc ctgaacaacc tgctgaagta caacctgacc gagttcaaga 1620
aaaccaagag caacatcaac aaggccagcc tgatcggcta cagcaagttc aacagcgagg 1680
gctacgtgta cgaccccgag ttcaccggcc ccaccctgac cgccagcggc gccaacagcc 1740
ggatcaagat caaggacggc agcaacatcc ggaagatgaa cagcgacgag accttcctgt 1800
acatcggctt cgacagccag gacggcaagc gggtgaacga gatcgagttc ctgaccgaga 1860
accagaagat cttcgtgtgc ggcaacagca tcagcgtgga ggtgctggag gccatcatcg 1920
acaagatcgg cggccccagc agcggcggca agcggcccgc cgccaccaag aaggccggcc 1980
aggccaagaa gaagaagggc agctacccct acgacgtgcc cgactacgcc tgagcggccg 2040
cttaattaag ctgccttctg cggggcttgc cttctggcca tgcccttctt ctctcccttg 2100
cacctgtacc tcttggtctt tgaataaagc ctgagtagga agtctagaaa aaaaaaaaaa 2160
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2220
aaaaaaa 2227
<![CDATA[<210> 62]]>
<![CDATA[<211> 2227]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 62]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgagaa accctacaag tgccccgagt gtggcaaatc cttctctaga tccgacaaac 180
tgaccgaaca tcagaggacc cataccggcg aaaaacctta taaatgtccc gagtgcggaa 240
agtccttctc tagaagggac gagctgaacg tgcatcagag aacacatacc ggcgagaagc 300
cctataaatg ccccgaatgc ggcaaaagct tctctagaag cgatcatctg accaaccacc 360
agagaaccca taccggagaa aagccttaca agtgtcccga atgtggaaaa tccttcagct 420
cccccgccga tctgaccaga caccaaagga cccacaccgg cgagaagccc tataaatgcc 480
ccgagtgcgg caagagcttt tccagatccg accatctgac caatcatcaa agaacccaca 540
ccggcgaaaa gccttataaa tgtcccgagt gcggcaaatc cttttccagc aagaaggctc 600
tgaccgagca tcaaaggacc cataccggcg agaagcctta caaatgcccc gagtgtggaa 660
agtcctttag cacccatctg gatctgatta gacaccagag gacacacacc ggagagaaac 720
ccaccggcaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gcccgggaca 780
gcaaggtgga gaacaagacc aagaagctgc gggtgttcga ggccttcgcc ggcatcggcg 840
cccagcggaa ggccctggag aaggtgcgga aggacgagta cgagatcgtg ggcctggccg 900
agtggtacgt gcccgccatc gtgatgtacc aggccatcca caacaacttc cacaccaagc 960
tggagtacaa gagcgtgagc cgggaggaga tgatcgacta cctggagaac aagaccctga 1020
gctggaacag caagaacccc gtgagcaacg gctactggaa gcggaagaag gacgacgagc 1080
tgaagatcat ctacaacgcc atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctgta caagcggacc ctgaagaaca tcgacctgct gacctacagc ttcccctgcc 1200
aggacctgag ccagcagggc atccagaagg gcatgaagcg gggcagcggc acccggagcg 1260
gcctgctgtg ggagatcgag cgggccctgg acagcaccga gaagaacgac ctgcccaagt 1320
acctgctgat ggagaacgtg ggcgccctgc tgcacaagaa gaacgaggag gagctgaacc 1380
agtggaagca gaagctggag agcctgggct accagaacag catcgaggtg ctgaacgccg 1440
ccgacttcgg cagcagccag gcccggcggc gggtgttcat gatcagcacc ctgaacgagt 1500
tcgtggagct gcccaagggc gacaagaagc ccaagagcat caagaaggtg ctgaacaaga 1560
tcgtgagcga gaaggacatc ctgaacaacc tgctgaagta caacctgacc gagttcaaga 1620
aaaccaagag caacatcaac aaggccagcc tgatcggcta cagcaagttc aacagcgagg 1680
gctacgtgta cgaccccgag ttcaccggcc ccaccctgac cgccagcggc gccaacagcc 1740
ggatcaagat caaggacggc agcaacatcc ggaagatgaa cagcgacgag accttcctgt 1800
acatcggctt cgacagccag gacggcaagc gggtgaacga gatcgagttc ctgaccgaga 1860
accagaagat cttcgtgtgc ggcaacagca tcagcgtgga ggtgctggag gccatcatcg 1920
acaagatcgg cggccccagc agcggcggca agcggcccgc cgccaccaag aaggccggcc 1980
aggccaagaa gaagaagggc agctacccct acgacgtgcc cgactacgcc tgagcggccg 2040
cttaattaag ctgccttctg cggggcttgc cttctggcca tgcccttctt ctctcccttg 2100
cacctgtacc tcttggtctt tgaataaagc ctgagtagga agtctagaaa aaaaaaaaaa 2160
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2220
aaaaaaa 2227
<![CDATA[<210> 63]]>
<![CDATA[<211> 2227]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 63]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgagaa accttacaaa tgccccgagt gcggcaagag cttcagcaga agcgacgatc 180
tggtgaggca ccaaagaacc cacaccggcg aaaaacctta caagtgtccc gaatgcggaa 240
agtccttcag cagagaggac aatctgcaca cccaccagag aacacacacc ggagaaaagc 300
cttacaagtg ccccgaatgc ggcaaatcct tttctagaag cgatcatctg accacccacc 360
aaagaacaca taccggcgag aagccttaca aatgtcccga gtgcggaaag tccttctccc 420
agagagccaa tctgagggct catcaaagga cccataccgg cgaaaagccc tacaaatgcc 480
ccgagtgcgg aaaatccttc agccagctgg cccatctgag agcccaccaa aggacacaca 540
ccggagagaa accctataag tgccccgagt gtggaaaaag cttttcccag agggccaatc 600
tgagggccca tcagaggacc cataccggag agaagcctta taaatgtccc gagtgcggaa 660
aaagcttcag cgagaggagc catctgaggg aacatcaaag aacccacacc ggcgaaaaac 720
ccaccggaaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gcccgggaca 780
gcaaggtgga gaacaagacc aagaagctgc gggtgttcga ggccttcgcc ggcatcggcg 840
cccagcggaa ggccctggag aaggtgcgga aggacgagta cgagatcgtg ggcctggccg 900
agtggtacgt gcccgccatc gtgatgtacc aggccatcca caacaacttc cacaccaagc 960
tggagtacaa gagcgtgagc cgggaggaga tgatcgacta cctggagaac aagaccctga 1020
gctggaacag caagaacccc gtgagcaacg gctactggaa gcggaagaag gacgacgagc 1080
tgaagatcat ctacaacgcc atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctgta caagcggacc ctgaagaaca tcgacctgct gacctacagc ttcccctgcc 1200
aggacctgag ccagcagggc atccagaagg gcatgaagcg gggcagcggc acccggagcg 1260
gcctgctgtg ggagatcgag cgggccctgg acagcaccga gaagaacgac ctgcccaagt 1320
acctgctgat ggagaacgtg ggcgccctgc tgcacaagaa gaacgaggag gagctgaacc 1380
agtggaagca gaagctggag agcctgggct accagaacag catcgaggtg ctgaacgccg 1440
ccgacttcgg cagcagccag gcccggcggc gggtgttcat gatcagcacc ctgaacgagt 1500
tcgtggagct gcccaagggc gacaagaagc ccaagagcat caagaaggtg ctgaacaaga 1560
tcgtgagcga gaaggacatc ctgaacaacc tgctgaagta caacctgacc gagttcaaga 1620
aaaccaagag caacatcaac aaggccagcc tgatcggcta cagcaagttc aacagcgagg 1680
gctacgtgta cgaccccgag ttcaccggcc ccaccctgac cgccagcggc gccaacagcc 1740
ggatcaagat caaggacggc agcaacatcc ggaagatgaa cagcgacgag accttcctgt 1800
acatcggctt cgacagccag gacggcaagc gggtgaacga gatcgagttc ctgaccgaga 1860
accagaagat cttcgtgtgc ggcaacagca tcagcgtgga ggtgctggag gccatcatcg 1920
acaagatcgg cggccccagc agcggcggca agcggcccgc cgccaccaag aaggccggcc 1980
aggccaagaa gaagaagggc agctacccct acgacgtgcc cgactacgcc tgagcggccg 2040
cttaattaag ctgccttctg cggggcttgc cttctggcca tgcccttctt ctctcccttg 2100
cacctgtacc tcttggtctt tgaataaagc ctgagtagga agtctagaaa aaaaaaaaaa 2160
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2220
aaaaaaa 2227
<![CDATA[<210> 64]]>
<![CDATA[<211> 2080]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 64]]>
agtgtggaaa aagctttagc caaagcggcg atctgaggag acaccaaaga acacacaccg 60
gcgagaagcc ctacaaatgt cccgagtgcg gaaagagctt cagccagagc ggccatctga 120
ccgagcatca gagaacccat accggcgaaa aaccttataa gtgccccgag tgtggaaagt 180
ccttctccga gagatcccat ctgagagaac accagaggac acacaccggc gaaaaacctt 240
ataagtgtcc cgagtgcgga aagtccttca gcgatcccgg ccatctggtg agacatcaaa 300
ggacacatac cggcgaaaaa ccttataagt gtcccgaatg cggcaagagc tttagcagaa 360
acgacacact caccgaacac cagaggaccc acaccggcga gaaaccctac aaatgccccg 420
agtgcggcaa atccttttct agagccgaca atctgaccga acaccagagg acccataccg 480
gagaaaagcc ttacaaatgt cccgagtgtg gcaaatcctt ctccacccat ctggatctga 540
ttagacacca aagaacacat accggagaaa agcccaccgg aaaaaagacc agcgctagcg 600
gcagcggcgg cggcagcggc ggcgcccggg acagcaaggt ggagaacaag accaagaagc 660
tgcgggtgtt cgaggccttc gccggcatcg gcgcccagcg gaaggccctg gagaaggtgc 720
ggaaggacga gtacgagatc gtgggcctgg ccgagtggta cgtgcccgcc atcgtgatgt 780
accaggccat ccacaacaac ttccacacca agctggagta caagagcgtg agccgggagg 840
agatgatcga ctacctggag aacaagaccc tgagctggaa cagcaagaac cccgtgagca 900
acggctactg gaagcggaag aaggacgacg agctgaagat catctacaac gccatcaagc 960
tgagcgagaa ggagggcaac atcttcgaca tccgggacct gtacaagcgg accctgaaga 1020
acatcgacct gctgacctac agcttcccct gccaggacct gagccagcag ggcatccaga 1080
agggcatgaa gcggggcagc ggcacccgga gcggcctgct gtgggagatc gagcgggccc 1140
tggacagcac cgagaagaac gacctgccca agtacctgct gatggagaac gtgggcgccc 1200
tgctgcacaa gaagaacgag gaggagctga accagtggaa gcagaagctg gagagcctgg 1260
gctaccagaa cagcatcgag gtgctgaacg ccgccgactt cggcagcagc caggcccggc 1320
ggcgggtgtt catgatcagc accctgaacg agttcgtgga gctgcccaag ggcgacaaga 1380
agcccaagag catcaagaag gtgctgaaca agatcgtgag cgagaaggac atcctgaaca 1440
acctgctgaa gtacaacctg accgagttca agaaaaccaa gagcaacatc aacaaggcca 1500
gcctgatcgg ctacagcaag ttcaacagcg agggctacgt gtacgacccc gagttcaccg 1560
gccccaccct gaccgccagc ggcgccaaca gccggatcaa gatcaaggac ggcagcaaca 1620
tccggaagat gaacagcgac gagaccttcc tgtacatcgg cttcgacagc caggacggca 1680
agcgggtgaa cgagatcgag ttcctgaccg agaaccagaa gatcttcgtg tgcggcaaca 1740
gcatcagcgt ggaggtgctg gaggccatca tcgacaagat cggcggcccc agcagcggcg 1800
gcaagcggcc cgccgccacc aagaaggccg gccaggccaa gaagaagaag ggcagctacc 1860
cctacgacgt gcccgactac gcctgagcgg ccgcttaatt aagctgcctt ctgcggggct 1920
tgccttctgg ccatgccctt cttctctccc ttgcacctgt acctcttggt ctttgaataa 1980
agcctgagta ggaagtctag aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2040
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2080
<![CDATA[<210> 65]]>
<![CDATA[<211> 1929]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 65]]>
gccttataag tgccccgagt gtggcaaatc cttttccgac tgtagagatc tggccagaca 60
tcaaagaacc cacaccggag agaaacctta taaatgcccc gagtgcggca agtcctttag 120
ccataccggc catctgctgg agcaccagag gacccatacc ggcgagaagc cttacaaatg 180
ccccgagtgc ggcaaaagct tcagcagaaa tgacgctctg accgagcatc aaaggaccca 240
taccggcgaa aagccctaca agtgtcccga gtgtggaaag tccttctccc agagcggcga 300
tctgaggaga caccagagaa cacacaccgg cgagaaaccc tataaatgtc ccgagtgcgg 360
aaagagcttt agcgacagcg gcaatctgag ggtgcatcaa agaacacaca ccggcgaaaa 420
acccaccgga aaaaagacaa gcgctagcgg cagcggcggc ggcagcggcg gcgcccggga 480
cagcaaggtg gagaacaaga ccaagaagct gcgggtgttc gaggccttcg ccggcatcgg 540
cgcccagcgg aaggccctgg agaaggtgcg gaaggacgag tacgagatcg tgggcctggc 600
cgagtggtac gtgcccgcca tcgtgatgta ccaggccatc cacaacaact tccacaccaa 660
gctggagtac aagagcgtga gccgggagga gatgatcgac tacctggaga acaagaccct 720
gagctggaac agcaagaacc ccgtgagcaa cggctactgg aagcggaaga aggacgacga 780
gctgaagatc atctacaacg ccatcaagct gagcgagaag gagggcaaca tcttcgacat 840
ccgggacctg tacaagcgga ccctgaagaa catcgacctg ctgacctaca gcttcccctg 900
ccaggacctg agccagcagg gcatccagaa gggcatgaag cggggcagcg gcacccggag 960
cggcctgctg tgggagatcg agcgggccct ggacagcacc gagaagaacg acctgcccaa 1020
gtacctgctg atggagaacg tgggcgccct gctgcacaag aagaacgagg aggagctgaa 1080
ccagtggaag cagaagctgg agagcctggg ctaccagaac agcatcgagg tgctgaacgc 1140
cgccgacttc ggcagcagcc aggcccggcg gcgggtgttc atgatcagca ccctgaacga 1200
gttcgtggag ctgcccaagg gcgacaagaa gcccaagagc atcaagaagg tgctgaacaa 1260
gatcgtgagc gagaaggaca tcctgaacaa cctgctgaag tacaacctga ccgagttcaa 1320
gaaaaccaag agcaacatca acaaggccag cctgatcggc tacagcaagt tcaacagcga 1380
gggctacgtg tacgaccccg agttcaccgg ccccaccctg accgccagcg gcgccaacag 1440
ccggatcaag atcaaggacg gcagcaacat ccggaagatg aacagcgacg agaccttcct 1500
gtacatcggc ttcgacagcc aggacggcaa gcgggtgaac gagatcgagt tcctgaccga 1560
gaaccagaag atcttcgtgt gcggcaacag catcagcgtg gaggtgctgg aggccatcat 1620
cgacaagatc ggcggcccca gcagcggcgg caagcggccc gccgccacca agaaggccgg 1680
ccaggccaag aagaagaagg gcagctaccc ctacgacgtg cccgactacg cctgagcggc 1740
cgcttaatta agctgccttc tgcggggctt gccttctggc catgcccttc ttctctccct 1800
tgcacctgta cctcttggtc tttgaataaa gcctgagtag gaagtctaga aaaaaaaaaa 1860
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1920
aaaaaaaaa 1929
<![CDATA[<210> 66]]>
<![CDATA[<211> 1779]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 66]]>
gacccatacc ggcgaaaagc cttacaagtg tcccgagtgc ggaaagtcct tctctagatc 60
cgacaacctc gtgaggcacc agagaaccca caccggcgag aaaccttaca aatgtcccga 120
gtgtggcaaa agcttttcta gaagcgacga gctggtgaga catcaaagaa cccataccgg 180
cgaaaaacct tataagtgtc ccgagtgcgg caaatccttt agccagctgg cccatctgag 240
ggcccaccag agaacacata ccggcgaaaa acccaccggc aaaaagacaa gcgctagcgg 300
cagcggcggc ggcagcggcg gcgcccggga cagcaaggtg gagaacaaga ccaagaagct 360
gcgggtgttc gaggccttcg ccggcatcgg cgcccagcgg aaggccctgg agaaggtgcg 420
gaaggacgag tacgagatcg tgggcctggc cgagtggtac gtgcccgcca tcgtgatgta 480
ccaggccatc cacaacaact tccacaccaa gctggagtac aagagcgtga gccgggagga 540
gatgatcgac tacctggaga acaagaccct gagctggaac agcaagaacc ccgtgagcaa 600
cggctactgg aagcggaaga aggacgacga gctgaagatc atctacaacg ccatcaagct 660
gagcgagaag gagggcaaca tcttcgacat ccgggacctg tacaagcgga ccctgaagaa 720
catcgacctg ctgacctaca gcttcccctg ccaggacctg agccagcagg gcatccagaa 780
gggcatgaag cggggcagcg gcacccggag cggcctgctg tgggagatcg agcgggccct 840
ggacagcacc gagaagaacg acctgcccaa gtacctgctg atggagaacg tgggcgccct 900
gctgcacaag aagaacgagg aggagctgaa ccagtggaag cagaagctgg agagcctggg 960
ctaccagaac agcatcgagg tgctgaacgc cgccgacttc ggcagcagcc aggcccggcg 1020
gcgggtgttc atgatcagca ccctgaacga gttcgtggag ctgcccaagg gcgacaagaa 1080
gcccaagagc atcaagaagg tgctgaacaa gatcgtgagc gagaaggaca tcctgaacaa 1140
cctgctgaag tacaacctga ccgagttcaa gaaaaccaag agcaacatca acaaggccag 1200
cctgatcggc tacagcaagt tcaacagcga gggctacgtg tacgaccccg agttcaccgg 1260
ccccaccctg accgccagcg gcgccaacag ccggatcaag atcaaggacg gcagcaacat 1320
ccggaagatg aacagcgacg agaccttcct gtacatcggc ttcgacagcc aggacggcaa 1380
gcgggtgaac gagatcgagt tcctgaccga gaaccagaag atcttcgtgt gcggcaacag 1440
catcagcgtg gaggtgctgg aggccatcat cgacaagatc ggcggcccca gcagcggcgg 1500
caagcggccc gccgccacca agaaggccgg ccaggccaag aagaagaagg gcagctaccc 1560
ctacgacgtg cccgactacg cctgagcggc cgcttaatta agctgccttc tgcggggctt 1620
gccttctggc catgcccttc ttctctccct tgcacctgta cctcttggtc tttgaataaa 1680
gcctgagtag gaagtctaga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1740
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 1779
<![CDATA[<210> 67]]>
<![CDATA[<211> 4829]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 67]]>
aaggaaataa gagagaaaag aagagtaaga agaaatataa gagccaccat ggcccccaag 60
aagaagcgga aggtgggcat ccacggcgtg cccgccgccg acaagaagta cagcatcggc 120
ctggccatcg gcaccaacag cgtgggctgg gccgtgatca ccgacgagta caaggtgccc 180
agcaagaagt tcaaggtgct gggcaacacc gaccggcaca gcatcaagaa gaacctgatc 240
ggcgccctgc tgttcgacag cggcgagacc gccgaggcca cccggctgaa gcggaccgcc 300
cggcggcggt acacccggcg gaagaaccgg atctgctacc tgcaggagat cttcagcaac 360
gagatggcca aggtggacga cagcttcttc caccggctgg aggagagctt cctggtggag 420
gaggacaaga agcacgagcg gcaccccatc ttcggcaaca tcgtggacga ggtggcctac 480
cacgagaagt accccaccat ctaccacctg cggaagaagc tggtggacag caccgacaag 540
gccgacctgc ggctgatcta cctggccctg gcccacatga tcaagttccg gggccacttc 600
ctgatcgagg gcgacctgaa ccccgacaac agcgacgtgg acaagctgtt catccagctg 660
gtgcagacct acaaccagct gttcgaggag aaccccatca acgccagcgg cgtggacgcc 720
aaggccatcc tgagcgcccg gctgagcaag agccggcggc tggagaacct gatcgcccag 780
ctgcccggcg agaagaagaa cggcctgttc ggcaacctga tcgccctgag cctgggcctg 840
acccccaact tcaagagcaa cttcgacctg gccgaggacg ccaagctgca gctgagcaag 900
gacacctacg acgacgacct ggacaacctg ctggcccaga tcggcgacca gtacgccgac 960
ctgttcctgg ccgccaagaa cctgagcgac gccatcctgc tgagcgacat cctgcgggtg 1020
aacaccgaga tcaccaaggc ccccctgagc gccagcatga tcaagcggta cgacgagcac 1080
caccaggacc tgaccctgct gaaggccctg gtgcggcagc agctgcccga gaagtacaag 1140
gagatcttct tcgaccagag caagaacggc tacgccggct acatcgacgg cggcgccagc 1200
caggaggagt tctacaagtt catcaagccc atcctggaga agatggacgg caccgaggag 1260
ctgctggtga agctgaaccg ggaggacctg ctgcggaagc agcggacctt cgacaacggc 1320
agcatccccc accagatcca cctgggcgag ctgcacgcca tcctgcggcg gcaggaggac 1380
ttctacccct tcctgaagga caaccgggag aagatcgaga agatcctgac cttccggatc 1440
ccctactacg tgggccccct ggcccggggc aacagccggt tcgcctggat gacccggaaa 1500
tccgaggaga ccatcacccc ctggaacttc gaggaggtgg tggacaaggg cgccagcgcc 1560
cagagcttca tcgagcggat gaccaacttc gacaagaacc tgcccaacga gaaggtgctg 1620
cccaagcaca gcctgctgta cgagtacttc accgtgtaca acgagctgac caaggtgaag 1680
tacgtgaccg agggcatgcg gaagcccgcc ttcctgagcg gcgagcagaa gaaggccatc 1740
gtggacctgc tgttcaagac caaccggaag gtgaccgtga agcagctgaa ggaggactac 1800
ttcaagaaga tcgagtgctt cgacagcgtg gagatcagcg gcgtggagga ccggttcaac 1860
gccagcctgg gcacctacca cgacctgctg aagatcatca aggacaagga cttcctggac 1920
aacgaggaga acgaggacat cctggaggac atcgtgctga ccctgaccct gttcgaggac 1980
cgggagatga tcgaggagcg gctgaaaacc tacgcccacc tgttcgacga caaggtgatg 2040
aagcagctga agcggcggcg gtacaccggc tggggccggc tgagccggaa gctgatcaac 2100
ggcatccggg acaagcagag cggcaagacc atcctggact tcctgaaatc cgacggcttc 2160
gccaaccgga acttcatgca gctgatccac gacgacagcc tgaccttcaa ggaggacatc 2220
cagaaggccc aggtgagcgg ccagggcgac agcctgcacg agcacatcgc caacctggcc 2280
ggcagccccg ccatcaagaa gggcatcctg cagaccgtga aggtggtgga cgagctggtg 2340
aaggtgatgg gccggcacaa gcccgagaac atcgtgatcg agatggcccg ggagaaccag 2400
accacccaga agggccagaa gaacagccgg gagcggatga agcggatcga ggagggcatc 2460
aaggagctgg gcagccagat cctgaaggag caccccgtgg agaacaccca gctgcagaac 2520
gagaagctgt acctgtacta cctgcagaac ggccgggaca tgtacgtgga ccaggagctg 2580
gacatcaacc ggctgagcga ctacgacgtg gccgccatcg tgccccagag cttcctgaag 2640
gacgacagca tcgacaacaa ggtgctgacc cggagcgaca aggcccgggg caagagcgac 2700
aacgtgccca gcgaggaggt ggtgaagaag atgaagaact actggcggca gctgctgaac 2760
gccaagctga tcacccagcg gaagttcgac aacctgacca aggccgagcg gggcggcctg 2820
agcgagctgg acaaggccgg cttcatcaag cggcagctgg tggagacccg gcagatcacc 2880
aagcacgtgg cccagatcct ggacagccgg atgaacacca agtacgacga gaacgacaag 2940
ctgatccggg aggtgaaggt gatcaccctg aaatccaagc tggtgagcga cttccggaag 3000
gacttccagt tctacaaggt gcgggagatc aacaactacc accacgccca cgacgcctac 3060
ctgaacgccg tggtgggcac cgccctgatc aagaagtacc ccaagctgga gagcgagttc 3120
gtgtacggcg actacaaggt gtacgacgtg cggaagatga tcgccaagag cgagcaggag 3180
atcggcaagg ccaccgccaa gtacttcttc tacagcaaca tcatgaactt cttcaagacc 3240
gagatcaccc tggccaacgg cgagatccgg aagcggcccc tgatcgagac caacggcgag 3300
accggcgaga tcgtgtggga caagggccgg gacttcgcca ccgtgcggaa ggtgctgagc 3360
atgccccagg tgaacatcgt gaagaaaacc gaggtgcaga ccggcggctt cagcaaggag 3420
agcatcctgc ccaagcggaa cagcgacaag ctgatcgccc ggaagaagga ctgggacccc 3480
aagaagtacg gcggcttcga cagccccacc gtggcctaca gcgtgctggt ggtggccaag 3540
gtggagaagg gcaagagcaa gaagctgaaa tccgtgaagg agctgctggg catcaccatc 3600
atggagcgga gcagcttcga gaagaacccc atcgacttcc tggaggccaa gggctacaag 3660
gaggtgaaga aggacctgat catcaagctg cccaagtaca gcctgttcga gctggagaac 3720
ggccggaagc ggatgctggc cagcgccggc gagctgcaga agggcaacga gctggccctg 3780
cccagcaagt acgtgaactt cctgtacctg gccagccact acgagaagct gaagggcagc 3840
cccgaggaca acgagcagaa gcagctgttc gtggagcagc acaagcacta cctggacgag 3900
atcatcgagc agatcagcga gttcagcaag cgggtgatcc tggccgacgc caacctggac 3960
aaggtgctga gcgcctacaa caagcaccgg gacaagccca tccgggagca ggccgagaac 4020
atcatccacc tgttcaccct gaccaacctg ggcgcccccg ccgccttcaa gtacttcgac 4080
accaccatcg accggaagcg gtacaccagc accaaggagg tgctggacgc caccctgatc 4140
caccagagca tcaccggcct gtacgagacc cggatcgacc tgagccagct gggcggcgac 4200
aagcggcccg ccgccaccaa gaaggccggc caggccaaga agaagaaggc cagcgacgcc 4260
aagagcctga ccgcctggag ccggaccctg gtgaccttca aggacgtgtt cgtggacttc 4320
acccgggagg agtggaagct gctggacacc gcccagcaga tcctgtaccg gaacgtgatg 4380
ctggagaact acaagaacct ggtgagcctg ggctaccagc tgaccaagcc cgacgtgatc 4440
ctgcggctgg agaagggcga ggagccctgg ctggtggagc gggagatcca ccaggagacc 4500
caccccgaca gcgagaccgc cttcgagatc aagagcagcg tgagcggcgg caagcggccc 4560
gccgccacca agaaggccgg ccaggccaag aagaagaagg gcagctaccc ctacgacgtg 4620
cccgactacg cctgagcggc cgcttaatta agctgccttc tgcggggctt gccttctggc 4680
catgcccttc ttctctccct tgcacctgta cctcttggtc tttgaataaa gcctgagtag 4740
gaagtctaga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4800
aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 4829
<![CDATA[<210> 68]]>
<![CDATA[<211> 2591]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 68]]>
gagttcgtgt acggcgacta caaggtgtac gacgtgcgga agatgatcgc caagagcgag 60
caggagatcg gcaaggccac cgccaagtac ttcttctaca gcaacatcat gaacttcttc 120
aagaccgaga tcaccctggc caacggcgag atccggaagc ggcccctgat cgagaccaac 180
ggcgagaccg gcgagatcgt gtgggacaag ggccgggact tcgccaccgt gcggaaggtg 240
ctgagcatgc cccaggtgaa catcgtgaag aaaaccgagg tgcagaccgg cggcttcagc 300
aaggagagca tcctgcccaa gcggaacagc gacaagctga tcgcccggaa gaaggactgg 360
gaccccaaga agtacggcgg cttcgacagc cccaccgtgg cctacagcgt gctggtggtg 420
gccaaggtgg agaagggcaa gagcaagaag ctgaaatccg tgaaggagct gctgggcatc 480
accatcatgg agcggagcag cttcgagaag aaccccatcg acttcctgga ggccaagggc 540
tacaaggagg tgaagaagga cctgatcatc aagctgccca agtacagcct gttcgagctg 600
gagaacggcc ggaagcggat gctggccagc gccggcgagc tgcagaaggg caacgagctg 660
gccctgccca gcaagtacgt gaacttcctg tacctggcca gccactacga gaagctgaag 720
ggcagccccg aggacaacga gcagaagcag ctgttcgtgg agcagcacaa gcactacctg 780
gacgagatca tcgagcagat cagcgagttc agcaagcggg tgatcctggc cgacgccaac 840
ctggacaagg tgctgagcgc ctacaacaag caccgggaca agcccatccg ggagcaggcc 900
gagaacatca tccacctgtt caccctgacc aacctgggcg cccccgccgc cttcaagtac 960
ttcgacacca ccatcgaccg gaagcggtac accagcacca aggaggtgct ggacgccacc 1020
ctgatccacc agagcatcac cggcctgtac gagacccgga tcgacctgag ccagctgggc 1080
ggcgacaagc ggcccgccgc caccaagaag gccggccagg ccaagaagaa gaaggcccgg 1140
gacagcaagg tggagaacaa gaccaagaag ctgcgggtgt tcgaggcctt cgccggcatc 1200
ggcgcccagc ggaaggccct ggagaaggtg cggaaggacg agtacgagat cgtgggcctg 1260
gccgagtggt acgtgcccgc catcgtgatg taccaggcca tccacaacaa cttccacacc 1320
aagctggagt acaagagcgt gagccgggag gagatgatcg actacctgga gaacaagacc 1380
ctgagctgga acagcaagaa ccccgtgagc aacggctact ggaagcggaa gaaggacgac 1440
gagctgaaga tcatctacaa cgccatcaag ctgagcgaga aggagggcaa catcttcgac 1500
atccgggacc tgtacaagcg gaccctgaag aacatcgacc tgctgaccta cagcttcccc 1560
tgccaggacc tgagccagca gggcatccag aagggcatga agcggggcag cggcacccgg 1620
agcggcctgc tgtgggagat cgagcgggcc ctggacagca ccgagaagaa cgacctgccc 1680
aagtacctgc tgatggagaa cgtgggcgcc ctgctgcaca agaagaacga ggaggagctg 1740
aaccagtgga agcagaagct ggagagcctg ggctaccaga acagcatcga ggtgctgaac 1800
gccgccgact tcggcagcag ccaggcccgg cggcgggtgt tcatgatcag caccctgaac 1860
gagttcgtgg agctgcccaa gggcgacaag aagcccaaga gcatcaagaa ggtgctgaac 1920
aagatcgtga gcgagaagga catcctgaac aacctgctga agtacaacct gaccgagttc 1980
aagaaaacca agagcaacat caacaaggcc agcctgatcg gctacagcaa gttcaacagc 2040
gagggctacg tgtacgaccc cgagttcacc ggccccaccc tgaccgccag cggcgccaac 2100
agccggatca agatcaagga cggcagcaac atccggaaga tgaacagcga cgagaccttc 2160
ctgtacatcg gcttcgacag ccaggacggc aagcgggtga acgagatcga gttcctgacc 2220
gagaaccaga agatcttcgt gtgcggcaac agcatcagcg tggaggtgct ggaggccatc 2280
atcgacaaga tcggcggccc cagcagcggc ggcaagcggc ccgccgccac caagaaggcc 2340
ggccaggcca agaagaagaa gggcagctac ccctacgacg tgcccgacta cgcctgagcg 2400
gccgcttaat taagctgcct tctgcggggc ttgccttctg gccatgccct tcttctctcc 2460
cttgcacctg tacctcttgg tctttgaata aagcctgagt aggaagtcta gaaaaaaaaa 2520
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2580
aaaaaaaaaa a 2591
<![CDATA[<210> 69]]>
<![CDATA[<211> 6010]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 69]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccga caagaagtac agcatcggcc 120
tggccatcgg caccaacagc gtgggctggg ccgtgatcac cgacgagtac aaggtgccca 180
gcaagaagtt caaggtgctg ggcaacaccg accggcacag catcaagaag aacctgatcg 240
gcgccctgct gttcgacagc ggcgagaccg ccgaggccac ccggctgaag cggaccgccc 300
ggcggcggta cacccggcgg aagaaccgga tctgctacct gcaggagatc ttcagcaacg 360
agatggccaa ggtggacgac agcttcttcc accggctgga ggagagcttc ctggtggagg 420
aggacaagaa gcacgagcgg caccccatct tcggcaacat cgtggacgag gtggcctacc 480
acgagaagta ccccaccatc taccacctgc ggaagaagct ggtggacagc accgacaagg 540
ccgacctgcg gctgatctac ctggccctgg cccacatgat caagttccgg ggccacttcc 600
tgatcgaggg cgacctgaac cccgacaaca gcgacgtgga caagctgttc atccagctgg 660
tgcagaccta caaccagctg ttcgaggaga accccatcaa cgccagcggc gtggacgcca 720
aggccatcct gagcgcccgg ctgagcaaga gccggcggct ggagaacctg atcgcccagc 780
tgcccggcga gaagaagaac ggcctgttcg gcaacctgat cgccctgagc ctgggcctga 840
cccccaactt caagagcaac ttcgacctgg ccgaggacgc caagctgcag ctgagcaagg 900
acacctacga cgacgacctg gacaacctgc tggcccagat cggcgaccag tacgccgacc 960
tgttcctggc cgccaagaac ctgagcgacg ccatcctgct gagcgacatc ctgcgggtga 1020
acaccgagat caccaaggcc cccctgagcg ccagcatgat caagcggtac gacgagcacc 1080
accaggacct gaccctgctg aaggccctgg tgcggcagca gctgcccgag aagtacaagg 1140
agatcttctt cgaccagagc aagaacggct acgccggcta catcgacggc ggcgccagcc 1200
aggaggagtt ctacaagttc atcaagccca tcctggagaa gatggacggc accgaggagc 1260
tgctggtgaa gctgaaccgg gaggacctgc tgcggaagca gcggaccttc gacaacggca 1320
gcatccccca ccagatccac ctgggcgagc tgcacgccat cctgcggcgg caggaggact 1380
tctacccctt cctgaaggac aaccgggaga agatcgagaa gatcctgacc ttccggatcc 1440
cctactacgt gggccccctg gcccggggca acagccggtt cgcctggatg acccggaaat 1500
ccgaggagac catcaccccc tggaacttcg aggaggtggt ggacaagggc gccagcgccc 1560
agagcttcat cgagcggatg accaacttcg acaagaacct gcccaacgag aaggtgctgc 1620
ccaagcacag cctgctgtac gagtacttca ccgtgtacaa cgagctgacc aaggtgaagt 1680
acgtgaccga gggcatgcgg aagcccgcct tcctgagcgg cgagcagaag aaggccatcg 1740
tggacctgct gttcaagacc aaccggaagg tgaccgtgaa gcagctgaag gaggactact 1800
tcaagaagat cgagtgcttc gacagcgtgg agatcagcgg cgtggaggac cggttcaacg 1860
ccagcctggg cacctaccac gacctgctga agatcatcaa ggacaaggac ttcctggaca 1920
acgaggagaa cgaggacatc ctggaggaca tcgtgctgac cctgaccctg ttcgaggacc 1980
gggagatgat cgaggagcgg ctgaaaacct acgcccacct gttcgacgac aaggtgatga 2040
agcagctgaa gcggcggcgg tacaccggct ggggccggct gagccggaag ctgatcaacg 2100
gcatccggga caagcagagc ggcaagacca tcctggactt cctgaaatcc gacggcttcg 2160
ccaaccggaa cttcatgcag ctgatccacg acgacagcct gaccttcaag gaggacatcc 2220
agaaggccca ggtgagcggc cagggcgaca gcctgcacga gcacatcgcc aacctggccg 2280
gcagccccgc catcaagaag ggcatcctgc agaccgtgaa ggtggtggac gagctggtga 2340
aggtgatggg ccggcacaag cccgagaaca tcgtgatcga gatggcccgg gagaaccaga 2400
ccacccagaa gggccagaag aacagccggg agcggatgaa gcggatcgag gagggcatca 2460
aggagctggg cagccagatc ctgaaggagc accccgtgga gaacacccag ctgcagaacg 2520
agaagctgta cctgtactac ctgcagaacg gccgggacat gtacgtggac caggagctgg 2580
acatcaaccg gctgagcgac tacgacgtgg ccgccatcgt gccccagagc ttcctgaagg 2640
acgacagcat cgacaacaag gtgctgaccc ggagcgacaa ggcccggggc aagagcgaca 2700
acgtgcccag cgaggaggtg gtgaagaaga tgaagaacta ctggcggcag ctgctgaacg 2760
ccaagctgat cacccagcgg aagttcgaca acctgaccaa ggccgagcgg ggcggcctga 2820
gcgagctgga caaggccggc ttcatcaagc ggcagctggt ggagacccgg cagatcacca 2880
agcacgtggc ccagatcctg gacagccgga tgaacaccaa gtacgacgag aacgacaagc 2940
tgatccggga ggtgaaggtg atcaccctga aatccaagct ggtgagcgac ttccggaagg 3000
acttccagtt ctacaaggtg cgggagatca acaactacca ccacgcccac gacgcctacc 3060
tgaacgccgt ggtgggcacc gccctgatca agaagtaccc caagctggag agcgagttcg 3120
tgtacggcga ctacaaggtg tacgacgtgc ggaagatgat cgccaagagc gagcaggaga 3180
tcggcaaggc caccgccaag tacttcttct acagcaacat catgaacttc ttcaagaccg 3240
agatcaccct ggccaacggc gagatccgga agcggcccct gatcgagacc aacggcgaga 3300
ccggcgagat cgtgtgggac aagggccggg acttcgccac cgtgcggaag gtgctgagca 3360
tgccccaggt gaacatcgtg aagaaaaccg aggtgcagac cggcggcttc agcaaggaga 3420
gcatcctgcc caagcggaac agcgacaagc tgatcgcccg gaagaaggac tgggacccca 3480
agaagtacgg cggcttcgac agccccaccg tggcctacag cgtgctggtg gtggccaagg 3540
tggagaaggg caagagcaag aagctgaaat ccgtgaagga gctgctgggc atcaccatca 3600
tggagcggag cagcttcgag aagaacccca tcgacttcct ggaggccaag ggctacaagg 3660
aggtgaagaa ggacctgatc atcaagctgc ccaagtacag cctgttcgag ctggagaacg 3720
gccggaagcg gatgctggcc agcgccggcg agctgcagaa gggcaacgag ctggccctgc 3780
ccagcaagta cgtgaacttc ctgtacctgg ccagccacta cgagaagctg aagggcagcc 3840
ccgaggacaa cgagcagaag cagctgttcg tggagcagca caagcactac ctggacgaga 3900
tcatcgagca gatcagcgag ttcagcaagc gggtgatcct ggccgacgcc aacctggaca 3960
aggtgctgag cgcctacaac aagcaccggg acaagcccat ccgggagcag gccgagaaca 4020
tcatccacct gttcaccctg accaacctgg gcgcccccgc cgccttcaag tacttcgaca 4080
ccaccatcga ccggaagcgg tacaccagca ccaaggaggt gctggacgcc accctgatcc 4140
accagagcat caccggcctg tacgagaccc ggatcgacct gagccagctg ggcggcgaca 4200
gcggcggcaa gcggcccgcc gccaccaaga aggccggcca ggccaagaag aagaagtcgg 4260
gcgggggtgg ctcagtggat ctgaggacac tcgacgtgtt tagcggatgc ggcggactct 4320
ccgaaggctt ccaccaagcc ggaatttccg acacactctg ggccattgag atgtgggacc 4380
ccgccgctca agccttcaga ctgaataatc ccggctccac cgtgttcacc gaggactgca 4440
acattctgct gaagctggtg atggctggcg aaaccaccaa ctctagaggc cagaggctgc 4500
cccagaaggg agatgtggaa atgctctgtg gaggccctcc ttgccaaggc ttctccggca 4560
tgaacaggtt caactctaga acatacagca agttcaagaa ctctctggtc gtgagctttc 4620
tgagctactg cgactactat agacctaggt tctttctgct ggagaacgtg agaaatttcg 4680
tgtccttcaa gaggagcatg gtgctgaagc tgacactgag gtgtctggtg aggatgggct 4740
accagtgcac attcggagtg ctgcaagctg gccagtacgg cgtggcccag accagaagga 4800
gggccatcat tctggctgct gcccccggcg agaaactccc tctgttcccc gagcccctcc 4860
acgtgttcgc ccctagagct tgccagctga gcgtggtggt cgacgataag aagttcgtga 4920
gcaacatcac aaggctgtcc agcggaccct tcagaaccat taccgtgagg gataccatgt 4980
ccgacctccc cgaggtgagg aatggcgcca gcgctctgga gatttcctac aacggcgaac 5040
ctcagagctg gttccaaagg cagctgagag gcgctcagta tcagcccatt ctgagggacc 5100
acatctgcaa agatatgagc gctctggtgg ccgctagaat gagacatatt cctctggccc 5160
ccggcagcga ctggagagat ctgcccaata ttgaggtgag actcagcgac ggaacaatgg 5220
ctagaaaact gaggtacacc catcatgata gaaagaacgg aaggagcagc agcggcgctc 5280
tgagaggagt gtgtagctgc gtggaagctg gcaaggcttg cgatcccgcc gctaggcagt 5340
tcaataccct catcccttgg tgtctgcctc acaccggcaa cagacacaat cattgggctg 5400
gactgtatgg aaggctcgaa tgggacggct ttttcagcac caccgtgacc aatcccgaac 5460
ctatgggcaa gcaaggaagg gtgctccacc ccgagcagca tagagtcgtg tccgtgagag 5520
aatgcgctag aagccaaggc ttccccgaca cctatagact gttcggcaac attctggata 5580
agcacagaca agtgggaaat gctgtccctc ctcctctggc caaggctatc ggactggaga 5640
tcaagctgtg tatgctcgcc aaagctaggg agagcgcttc cgccaagatt aaggaggagg 5700
aggccgccaa ggacggaggt ggcggatcgg gaaagcggcc cgccgccacc aagaaggccg 5760
gtcaggccaa gaagaagaag ggcagctacc cctacgacgt gcccgactac gcctgagcgg 5820
ccgcttaatt aagctgcctt ctgcggggct tgccttctgg ccatgccctt cttctctccc 5880
ttgcacctgt acctcttggt ctttgaataa agcctgagta ggaagtctag aaaaaaaaaa 5940
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 6000
aaaaaaaaaa 6010
<![CDATA[<210> 70]]>
<![CDATA[<211> 6202]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 70]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccga caagaagtac agcatcggcc 120
tggccatcgg caccaacagc gtgggctggg ccgtgatcac cgacgagtac aaggtgccca 180
gcaagaagtt caaggtgctg ggcaacaccg accggcacag catcaagaag aacctgatcg 240
gcgccctgct gttcgacagc ggcgagaccg ccgaggccac ccggctgaag cggaccgccc 300
ggcggcggta cacccggcgg aagaaccgga tctgctacct gcaggagatc ttcagcaacg 360
agatggccaa ggtggacgac agcttcttcc accggctgga ggagagcttc ctggtggagg 420
aggacaagaa gcacgagcgg caccccatct tcggcaacat cgtggacgag gtggcctacc 480
acgagaagta ccccaccatc taccacctgc ggaagaagct ggtggacagc accgacaagg 540
ccgacctgcg gctgatctac ctggccctgg cccacatgat caagttccgg ggccacttcc 600
tgatcgaggg cgacctgaac cccgacaaca gcgacgtgga caagctgttc atccagctgg 660
tgcagaccta caaccagctg ttcgaggaga accccatcaa cgccagcggc gtggacgcca 720
aggccatcct gagcgcccgg ctgagcaaga gccggcggct ggagaacctg atcgcccagc 780
tgcccggcga gaagaagaac ggcctgttcg gcaacctgat cgccctgagc ctgggcctga 840
cccccaactt caagagcaac ttcgacctgg ccgaggacgc caagctgcag ctgagcaagg 900
acacctacga cgacgacctg gacaacctgc tggcccagat cggcgaccag tacgccgacc 960
tgttcctggc cgccaagaac ctgagcgacg ccatcctgct gagcgacatc ctgcgggtga 1020
acaccgagat caccaaggcc cccctgagcg ccagcatgat caagcggtac gacgagcacc 1080
accaggacct gaccctgctg aaggccctgg tgcggcagca gctgcccgag aagtacaagg 1140
agatcttctt cgaccagagc aagaacggct acgccggcta catcgacggc ggcgccagcc 1200
aggaggagtt ctacaagttc atcaagccca tcctggagaa gatggacggc accgaggagc 1260
tgctggtgaa gctgaaccgg gaggacctgc tgcggaagca gcggaccttc gacaacggca 1320
gcatccccca ccagatccac ctgggcgagc tgcacgccat cctgcggcgg caggaggact 1380
tctacccctt cctgaaggac aaccgggaga agatcgagaa gatcctgacc ttccggatcc 1440
cctactacgt gggccccctg gcccggggca acagccggtt cgcctggatg acccggaaat 1500
ccgaggagac catcaccccc tggaacttcg aggaggtggt ggacaagggc gccagcgccc 1560
agagcttcat cgagcggatg accaacttcg acaagaacct gcccaacgag aaggtgctgc 1620
ccaagcacag cctgctgtac gagtacttca ccgtgtacaa cgagctgacc aaggtgaagt 1680
acgtgaccga gggcatgcgg aagcccgcct tcctgagcgg cgagcagaag aaggccatcg 1740
tggacctgct gttcaagacc aaccggaagg tgaccgtgaa gcagctgaag gaggactact 1800
tcaagaagat cgagtgcttc gacagcgtgg agatcagcgg cgtggaggac cggttcaacg 1860
ccagcctggg cacctaccac gacctgctga agatcatcaa ggacaaggac ttcctggaca 1920
acgaggagaa cgaggacatc ctggaggaca tcgtgctgac cctgaccctg ttcgaggacc 1980
gggagatgat cgaggagcgg ctgaaaacct acgcccacct gttcgacgac aaggtgatga 2040
agcagctgaa gcggcggcgg tacaccggct ggggccggct gagccggaag ctgatcaacg 2100
gcatccggga caagcagagc ggcaagacca tcctggactt cctgaaatcc gacggcttcg 2160
ccaaccggaa cttcatgcag ctgatccacg acgacagcct gaccttcaag gaggacatcc 2220
agaaggccca ggtgagcggc cagggcgaca gcctgcacga gcacatcgcc aacctggccg 2280
gcagccccgc catcaagaag ggcatcctgc agaccgtgaa ggtggtggac gagctggtga 2340
aggtgatggg ccggcacaag cccgagaaca tcgtgatcga gatggcccgg gagaaccaga 2400
ccacccagaa gggccagaag aacagccggg agcggatgaa gcggatcgag gagggcatca 2460
aggagctggg cagccagatc ctgaaggagc accccgtgga gaacacccag ctgcagaacg 2520
agaagctgta cctgtactac ctgcagaacg gccgggacat gtacgtggac caggagctgg 2580
acatcaaccg gctgagcgac tacgacgtgg ccgccatcgt gccccagagc ttcctgaagg 2640
acgacagcat cgacaacaag gtgctgaccc ggagcgacaa ggcccggggc aagagcgaca 2700
acgtgcccag cgaggaggtg gtgaagaaga tgaagaacta ctggcggcag ctgctgaacg 2760
ccaagctgat cacccagcgg aagttcgaca acctgaccaa ggccgagcgg ggcggcctga 2820
gcgagctgga caaggccggc ttcatcaagc ggcagctggt ggagacccgg cagatcacca 2880
agcacgtggc ccagatcctg gacagccgga tgaacaccaa gtacgacgag aacgacaagc 2940
tgatccggga ggtgaaggtg atcaccctga aatccaagct ggtgagcgac ttccggaagg 3000
acttccagtt ctacaaggtg cgggagatca acaactacca ccacgcccac gacgcctacc 3060
tgaacgccgt ggtgggcacc gccctgatca agaagtaccc caagctggag agcgagttcg 3120
tgtacggcga ctacaaggtg tacgacgtgc ggaagatgat cgccaagagc gagcaggaga 3180
tcggcaaggc caccgccaag tacttcttct acagcaacat catgaacttc ttcaagaccg 3240
agatcaccct ggccaacggc gagatccgga agcggcccct gatcgagacc aacggcgaga 3300
ccggcgagat cgtgtgggac aagggccggg acttcgccac cgtgcggaag gtgctgagca 3360
tgccccaggt gaacatcgtg aagaaaaccg aggtgcagac cggcggcttc agcaaggaga 3420
gcatcctgcc caagcggaac agcgacaagc tgatcgcccg gaagaaggac tgggacccca 3480
agaagtacgg cggcttcgac agccccaccg tggcctacag cgtgctggtg gtggccaagg 3540
tggagaaggg caagagcaag aagctgaaat ccgtgaagga gctgctgggc atcaccatca 3600
tggagcggag cagcttcgag aagaacccca tcgacttcct ggaggccaag ggctacaagg 3660
aggtgaagaa ggacctgatc atcaagctgc ccaagtacag cctgttcgag ctggagaacg 3720
gccggaagcg gatgctggcc agcgccggcg agctgcagaa gggcaacgag ctggccctgc 3780
ccagcaagta cgtgaacttc ctgtacctgg ccagccacta cgagaagctg aagggcagcc 3840
ccgaggacaa cgagcagaag cagctgttcg tggagcagca caagcactac ctggacgaga 3900
tcatcgagca gatcagcgag ttcagcaagc gggtgatcct ggccgacgcc aacctggaca 3960
aggtgctgag cgcctacaac aagcaccggg acaagcccat ccgggagcag gccgagaaca 4020
tcatccacct gttcaccctg accaacctgg gcgcccccgc cgccttcaag tacttcgaca 4080
ccaccatcga ccggaagcgg tacaccagca ccaaggaggt gctggacgcc accctgatcc 4140
accagagcat caccggcctg tacgagaccc ggatcgacct gagccagctg ggcggcgaca 4200
gcgccggcgg cggcggcagc ggcggcggcg gcagcggcgg cggcggcagc ggccccaaga 4260
agaagcggaa ggtggccgcc gccggcagca accacgacca ggagttcgac ccccccaagg 4320
tgtacccccc cgtgcccgcc gagaagcgga agcccatccg ggtgctgagc ctgttcgacg 4380
gcatcgccac cggcctgctg gtgctgaagg acctgggcat ccaggtggac cggtacatcg 4440
ccagcgaggt gtgcgaggac agcatcaccg tgggcatggt gcggcaccag ggcaagatca 4500
tgtacgtggg cgacgtgcgg agcgtgaccc agaagcacat ccaggagtgg ggccccttcg 4560
acctggtgat cggcggcagc ccctgcaacg acctgagcat cgtgaacccc gcccggaagg 4620
gcctgtacga gggcaccggc cggctgttct tcgagttcta ccggctgctg cacgacgccc 4680
ggcccaagga gggcgacgac cggcccttct tctggctgtt cgagaacgtg gtggccatgg 4740
gcgtgagcga caagcgggac atcagccggt tcctggagag caaccccgtg atgatcgacg 4800
ccaaggaggt gagcgccgcc caccgggccc ggtacttctg gggcaacctg cccggcatga 4860
accggcccct ggccagcacc gtgaacgaca agctggagct gcaggagtgc ctggagcacg 4920
gccggatcgc caagttcagc aaggtgcgga ccatcaccac ccggagcaac agcatcaagc 4980
agggcaagga ccagcacttc cccgtgttca tgaacgagaa ggaggacatc ctgtggtgca 5040
ccgagatgga gcgggtgttc ggcttccccg tgcactacac cgacgtgagc aacatgagcc 5100
ggctggcccg gcagcggctg ctgggccgga gctggagcgt gcccgtgatc cggcacctgt 5160
tcgcccccct gaaggagtac ttcgcctgcg tgagcagcgg caacagcaac gccaacagcc 5220
ggggccccag cttcagcagc ggcctggtgc ccctgagcct gcggggcagc cacatgaatc 5280
ctctggagat gttcgagaca gtgcccgtgt ggagaaggca acccgtgagg gtgctgagcc 5340
tcttcgagga cattaagaag gagctgacct ctctgggctt tctggaatcc ggcagcgacc 5400
ccggccagct gaaacacgtg gtggacgtga ccgacacagt gaggaaggac gtggaagagt 5460
ggggcccctt tgacctcgtg tatggagcca cacctcctct cggccacaca tgcgataggc 5520
ctcccagctg gtatctcttc cagttccaca gactgctcca gtacgccaga cctaagcccg 5580
gcagccccag acccttcttc tggatgttcg tggacaatct ggtgctgaac aaggaggatc 5640
tggatgtggc cagcagattt ctggagatgg aacccgtgac aatccccgac gtgcatggcg 5700
gctctctgca gaacgccgtg agagtgtggt ccaacatccc cgccattaga agcagacact 5760
gggctctggt gagcgaggag gaactgtctc tgctggccca gaataagcag tcctccaagc 5820
tggccgccaa gtggcccacc aagctggtga agaactgctt tctgcctctg agggagtatt 5880
tcaagtattt cagcaccgaa ctgaccagca gcctgagcgg cggcaagcgg cccgccgcca 5940
ccaagaaggc cggccaggcc aagaagaaga agggcagcta cccctacgac gtgcccgact 6000
acgcctgagc ggccgcttaa ttaagctgcc ttctgcgggg cttgccttct ggccatgccc 6060
ttcttctctc ccttgcacct gtacctcttg gtctttgaat aaagcctgag taggaagtct 6120
agaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 6180
aaaaaaaaaa aaaaaaaaaa aa 6202
<![CDATA[<210> 71]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成寡核苷酸]]>
<![CDATA[<220>]]>
<![CDATA[<221> modified_base]]>
<![CDATA[<222> (]]>1)..(1)
<![CDATA[<223> a、c、t、g、未知或其他]]>
<![CDATA[<220>]]>
<![CDATA[<221> modified_base]]>
<![CDATA[<222> (3)..(3)]]>
<![CDATA[<223> a、c、t、g、未知或其他]]>
<![CDATA[<400> 71]]>
nbndccdsha grkgghrshv 20
<![CDATA[<210> 72]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成寡核苷酸]]>
<![CDATA[<220]]>>]]>
<br/><![CDATA[<221> modified_base]]>
<br/><![CDATA[<222> (18)..(18)]]>
<br/><![CDATA[<223> a、c、t、g、未知或其他]]>
<br/>
<br/><![CDATA[<220>]]>
<br/><![CDATA[<221> modified_base]]>
<br/><![CDATA[<222> (20)..(20)]]>
<br/><![CDATA[<223> a、c、t、g、未知或其他]]>
<br/>
<br/><![CDATA[<400> 72]]>
<br/><![CDATA[vhsrhggkrg ahsdccdnbn 20
<![CDATA[<210> 73]]>
<![CDATA[<211> 10]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成寡核苷酸]]>
<![CDATA[<220>]]>
<![CDATA[<221> modified_base]]>
<![CDATA[<222> (8)..(8)]]>
<![CDATA[<223> a、c、t、g、未知或其他]]>
<![CDATA[<400> 73]]>
ccgccatntt 10
<![CDATA[<210> 74]]>
<![CDATA[<211> 10]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成寡核苷酸]]>
<![CDATA[<220>]]>
<![CDATA[<221> modified_base]]>
<![CDATA[<222> (3)..(3)]]>
<![CDATA[<223> a、c、t、g、未知或其他]]>
<![CDATA[<400> 74]]>
aanatggcgg 10
<![CDATA[<210> 75]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 75]]>
gctggaaacc ttgcacctcg g 21
<![CDATA[<210> 76]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 76]]>
ctgctgccag tagagggcac a 21
<![CDATA[<210> 77]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 77]]>
gcccagagag ggggcggagg g 21
<![CDATA[<210> 78]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 78]]>
acgcggggag caaccaatcg c 21
<![CDATA[<210> 79]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 79]]>
actggcagca gagatcatcg c 21
<![CDATA[<210> 80]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 80]]>
gggggcagga gcaggagcgt c 21
<![CDATA[<210> 81]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 81]]>
cagccttagc gaggcgccct g 21
<![CDATA[<210> 82]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 82]]>
actcacagga caaggatgcg g 21
<![CDATA[<210> 83]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 83]]>
agcaaaagaa aatggtaggc g 21
<![CDATA[<210> 84]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 84]]>
actcagccgg gcagccgagc a 21
<![CDATA[<210> 85]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 85]]>
cgtaccaggc tgcagggcgc c 21
<![CDATA[<210> 86]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213]]>> 智人]]>
<br/>
<br/><![CDATA[<400> 86]]>
<br/><![CDATA[agagtggagg aaagaagggt a 21
<![CDATA[<210> 87]]>
<![CDATA[<211> 386]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 87]]>
Met Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val Phe Glu Ala
1 5 10 15
Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys Val Arg Lys
20 25 30
Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val Pro Ala Ile
35 40 45
Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys Leu Glu Tyr
50 55 60
Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu Asn Lys Thr
65 70 75 80
Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr Trp Lys Arg
85 90 95
Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile Lys Leu Ser
100 105 110
Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr Lys Arg Thr
115 120 125
Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys Gln Asp Leu
130 135 140
Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser Gly Thr Arg
145 150 155 160
Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser Thr Glu Lys
165 170 175
Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly Ala Leu Leu
180 185 190
His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln Lys Leu Glu
195 200 205
Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala Ala Asp Phe
210 215 220
Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser Thr Leu Asn
225 230 235 240
Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys Ser Ile Lys
245 250 255
Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu Asn Asn Leu
260 265 270
Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser Asn Ile Asn
275 280 285
Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu Gly Tyr Val
290 295 300
Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser Gly Ala Asn
305 310 315 320
Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys Met Asn Ser
325 330 335
Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp Gly Lys Arg
340 345 350
Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile Phe Val Cys
355 360 365
Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile Asp Lys Ile
370 375 380
Gly Gly
385
<![CDATA[<210> 88]]>
<![CDATA[<211> 22]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成肽]]>
<![CDATA[<400> 88]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser
20
<![CDATA[<210> 89]]>
<![CDATA[<211> 29]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成肽]]>
<![CDATA[<400> 89]]>
Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys
1 5 10 15
Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
20 25
<![CDATA[<210> 90]]>
<![CDATA[<211> 9]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成肽]]>
<![CDATA[<400> 90]]>
Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1 5
<![CDATA[<210> 91]]>
<![CDATA[<211> 1123]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 91]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asp Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn
115 120 125
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp
645 650 655
Val Pro Asp Tyr Ala Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
660 665 670
Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn
675 680 685
Pro Gly Pro Thr Ser Ala Gly Lys Leu Gly Ser Gly Glu Gly Arg Gly
690 695 700
Ser Leu Leu Thr Cys Gly Asp Val Glu Glu Asn Pro Gly Pro Leu Glu
705 710 715 720
Gly Ser Ser Gly Ser Gly Ser Pro Lys Lys Lys Arg Lys Val Gly Ile
725 730 735
His Gly Val Pro Ala Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu
740 745 750
Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp
755 760 765
Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys
770 775 780
Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg
785 790 795 800
His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys
805 810 815
Gly Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His Gln Arg Thr
820 825 830
His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe
835 840 845
Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu
850 855 860
Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala
865 870 875 880
His Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys
885 890 895
Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr Glu
900 905 910
His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys
915 920 925
Gly Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr
930 935 940
His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser
945 950 955 960
Gly Gly Gly Ser Gly Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg
965 970 975
Thr Leu Val Thr Phe Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu
980 985 990
Trp Lys Leu Leu Asp Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met
995 1000 1005
Leu Glu Asn Tyr Lys Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr
1010 1015 1020
Lys Pro Asp Val Ile Leu Arg Leu Glu Lys Gly Glu Glu Pro Trp
1025 1030 1035
Leu Val Glu Arg Glu Ile His Gln Glu Thr His Pro Asp Ser Glu
1040 1045 1050
Thr Ala Phe Glu Ile Lys Ser Ser Val Ser Gly Gly Lys Arg Pro
1055 1060 1065
Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser
1070 1075 1080
Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Ser Gly Gly Lys Arg
1085 1090 1095
Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly
1100 1105 1110
Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1115 1120
<![CDATA[<210> 92]]>
<![CDATA[<211> 1123]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 92]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Pro Gly His Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln
340 345 350
Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ala Thr Asn Phe Ser
370 375 380
Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn Pro Gly Pro Thr Ser
385 390 395 400
Ala Gly Lys Leu Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys
405 410 415
Gly Asp Val Glu Glu Asn Pro Gly Pro Leu Glu Gly Ser Ser Gly Ser
420 425 430
Gly Ser Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
435 440 445
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
450 455 460
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asp Leu Val Arg His
465 470 475 480
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
485 490 495
Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His
500 505 510
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
515 520 525
Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly Glu Lys
530 535 540
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn
545 550 555 560
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
565 570 575
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
580 585 590
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
595 600 605
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
610 615 620
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
625 630 635 640
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
645 650 655
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
660 665 670
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
675 680 685
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
690 695 700
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
705 710 715 720
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
725 730 735
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
740 745 750
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
755 760 765
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
770 775 780
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
785 790 795 800
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
805 810 815
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
820 825 830
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
835 840 845
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
850 855 860
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
865 870 875 880
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
885 890 895
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
900 905 910
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
915 920 925
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
930 935 940
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
945 950 955 960
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
965 970 975
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
980 985 990
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
995 1000 1005
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln
1010 1015 1020
Asp Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln
1025 1030 1035
Lys Ile Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu
1040 1045 1050
Ala Ile Ile Asp Lys Ile Gly Gly Pro Ser Gly Gly Lys Arg Pro
1055 1060 1065
Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser
1070 1075 1080
Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Ser Gly Gly Lys Arg
1085 1090 1095
Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly
1100 1105 1110
Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1115 1120
<![CDATA[<210> 93]]>
<![CDATA[<211> 3332]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 93]]>
gcggcaaatc cttttctaga agcgatcatc tgaccaccca ccaaagaaca cataccggcg 60
agaagcctta caaatgtccc gagtgcggaa agtccttctc ccagagagcc aatctgaggg 120
ctcatcaaag gacccatacc ggcgaaaagc cctacaaatg ccccgagtgc ggaaaatcct 180
tcagccagct ggcccatctg agagcccacc aaaggacaca caccggagag aaaccctata 240
agtgccccga gtgtggaaaa agcttttccc agagggccaa tctgagggcc catcagagga 300
cccataccgg agagaagcct tataaatgtc ccgagtgcgg aaaaagcttc agcgagagga 360
gccatctgag ggaacatcaa agaacccaca ccggcgaaaa acccaccgga aaaaagacca 420
gcgctagcgg cagcggcggc ggcagcggcg gcgcccggga cagcaaggtg gagaacaaga 480
ccaagaagct gcgggtgttc gaggccttcg ccggcatcgg cgcccagcgg aaggccctgg 540
agaaggtgcg gaaggacgag tacgagatcg tgggcctggc cgagtggtac gtgcccgcca 600
tcgtgatgta ccaggccatc cacaacaact tccacaccaa gctggagtac aagagcgtga 660
gccgggagga gatgatcgac tacctggaga acaagaccct gagctggaac agcaagaacc 720
ccgtgagcaa cggctactgg aagcggaaga aggacgacga gctgaagatc atctacaacg 780
ccatcaagct gagcgagaag gagggcaaca tcttcgacat ccgggacctg tacaagcgga 840
ccctgaagaa catcgacctg ctgacctaca gcttcccctg ccaggacctg agccagcagg 900
gcatccagaa gggcatgaag cggggcagcg gcacccggag cggcctgctg tgggagatcg 960
agcgggccct ggacagcacc gagaagaacg acctgcccaa gtacctgctg atggagaacg 1020
tgggcgccct gctgcacaag aagaacgagg aggagctgaa ccagtggaag cagaagctgg 1080
agagcctggg ctaccagaac agcatcgagg tgctgaacgc cgccgacttc ggcagcagcc 1140
aggcccggcg gcgggtgttc atgatcagca ccctgaacga gttcgtggag ctgcccaagg 1200
gcgacaagaa gcccaagagc atcaagaagg tgctgaacaa gatcgtgagc gagaaggaca 1260
tcctgaacaa cctgctgaag tacaacctga ccgagttcaa gaaaaccaag agcaacatca 1320
acaaggccag cctgatcggc tacagcaagt tcaacagcga gggctacgtg tacgaccccg 1380
agttcaccgg ccccaccctg accgccagcg gcgccaacag ccggatcaag atcaaggacg 1440
gcagcaacat ccggaagatg aacagcgacg agaccttcct gtacatcggc ttcgacagcc 1500
aggacggcaa gcgggtgaac gagatcgagt tcctgaccga gaaccagaag atcttcgtgt 1560
gcggcaacag catcagcgtg gaggtgctgg aggccatcat cgacaagatc ggcggcccca 1620
gcagcggcgg caagcggccc gccgccacca agaaggccgg ccaggccaag aagaagaagg 1680
gcagctaccc ctacgacgtg cccgactacg ccgggtccta cccgtacgac gtgcccgatt 1740
acgccgccac caacttctcg ctgctgaagc aggccggaga tgtggaagaa aaccctggac 1800
ctaccagtgc cggaaagctc ggtagcggag agggtcgggg aagcctgctt acttgcggcg 1860
acgtggaaga gaaccccggt ccgctggagg gttcgtccgg ctccggatcc cccaagaaga 1920
agcggaaggt gggcatccac ggcgtgcccg ccgccggcag cagcggatcc ctggagcccg 1980
gcgaaaaacc ttacaagtgc cccgagtgcg gaaagagctt cagcagaagc gacaaactgg 2040
tgaggcatca aaggacacat accggagaga agccctataa gtgccccgaa tgtggcaaat 2100
ccttttccca gagggctcat ctggaaagac accagaggac ccataccggc gaaaaaccct 2160
acaaatgtcc cgagtgtgga aagagctttt ccgatcccgg ccatctggtc agacatcaga 2220
ggacacatac cggcgaaaag ccttacaagt gtcccgaatg cggaaaatcc ttctccagaa 2280
gcgacaagct ggtgaggcac caaagaaccc acaccggcga aaaaccctat aaatgccccg 2340
agtgcggcaa gtcctttagc cagctggccc atctgagagc ccaccagaga acacacaccg 2400
gagagaagcc ttataagtgt cccgagtgcg gaaagtcctt ctctagagcc gacaatctga 2460
ccgaacatca aaggacacac accggcgaga aaccttataa atgccccgag tgcggaaaaa 2520
gcttttccga ctgcagagat ctggctagac accagagaac ccacaccggc gagaaaccca 2580
ccggcaaaaa gaccagcgct agcggcagcg gcggcggcag cggcggcgac gccaagagcc 2640
tgaccgcctg gagccggacc ctggtgacct tcaaggacgt gttcgtggac ttcacccggg 2700
aggagtggaa gctgctggac accgcccagc agatcctgta ccggaacgtg atgctggaga 2760
actacaagaa cctggtgagc ctgggctacc agctgaccaa gcccgacgtg atcctgcggc 2820
tggagaaggg cgaggagccc tggctggtgg agcgggagat ccaccaggag acccaccccg 2880
acagcgagac cgccttcgag atcaagagca gcgtgagcgg cggcaagcgg cccgccgcca 2940
ccaagaaggc cggccaggcc aagaagaaga agggcagcta cccctacgac gtgcccgact 3000
acgcctgaag cagcggcggc aagcggcccg ccgccaccaa gaaggccggc caggccaaga 3060
agaagaaggg cagctacccc tacgacgtgc ccgactacgc ctgagcggcc gcttaattaa 3120
gctgccttct gcggggcttg ccttctggcc atgcccttct tctctccctt gcacctgtac 3180
ctcttggtct ttgaataaag cctgagtagg aagtctagaa aaaaaaaaaa aaaaaaaaaa 3240
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3300
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 3332
<![CDATA[<210> 94]]>
<![CDATA[<211> 3012]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 94]]>
tataaatgcc ccgagtgcgg aaaaagcttt tccgactgca gagatctggc tagacaccag 60
agaacccaca ccggcgagaa acccaccggc aaaaagacca gcgctagcgg cagcggcggc 120
ggcagcggcg gcgacgccaa gagcctgacc gcctggagcc ggaccctggt gaccttcaag 180
gacgtgttcg tggacttcac ccgggaggag tggaagctgc tggacaccgc ccagcagatc 240
ctgtaccgga acgtgatgct ggagaactac aagaacctgg tgagcctggg ctaccagctg 300
accaagcccg acgtgatcct gcggctggag aagggcgagg agccctggct ggtggagcgg 360
gagatccacc aggagaccca ccccgacagc gagaccgcct tcgagatcaa gagcagcgtg 420
agcagcggcg gcaagcggcc cgccgccacc aagaaggccg gccaggccaa gaagaagaag 480
ggcagctacc cctacgacgt gcccgactac gccgggtcct acccgtacga cgtgcccgat 540
tacgccgcca ccaacttctc gctgctgaag caggccggag atgtggaaga aaaccctgga 600
cctaccagtg ccggaaagct cggtagcgga gagggtcggg gaagcctgct tacttgcggc 660
gacgtggaag agaaccccgg tccgctggag ggttcgtccg gctccggatc ccccaagaag 720
aagcggaagg tgggcatcca cggcgtgccc gccgccggca gcagcggatc cctggagccc 780
ggcgagaaac cttacaaatg ccccgagtgc ggcaagagct tcagcagaag cgacgatctg 840
gtgaggcacc aaagaaccca caccggcgaa aaaccttaca agtgtcccga atgcggaaag 900
tccttcagca gagaggacaa tctgcacacc caccagagaa cacacaccgg agaaaagcct 960
tacaagtgcc ccgaatgcgg caaatccttt tctagaagcg atcatctgac cacccaccaa 1020
agaacacata ccggcgagaa gccttacaaa tgtcccgagt gcggaaagtc cttctcccag 1080
agagccaatc tgagggctca tcaaaggacc cataccggcg aaaagcccta caaatgcccc 1140
gagtgcggaa aatccttcag ccagctggcc catctgagag cccaccaaag gacacacacc 1200
ggagagaaac cctataagtg ccccgagtgt ggaaaaagct tttcccagag ggccaatctg 1260
agggcccatc agaggaccca taccggagag aagccttata aatgtcccga gtgcggaaaa 1320
agcttcagcg agaggagcca tctgagggaa catcaaagaa cccacaccgg cgaaaaaccc 1380
accggaaaaa agaccagcgc tagcggcagc ggcggcggca gcggcggcgc ccgggacagc 1440
aaggtggaga acaagaccaa gaagctgcgg gtgttcgagg ccttcgccgg catcggcgcc 1500
cagcggaagg ccctggagaa ggtgcggaag gacgagtacg agatcgtggg cctggccgag 1560
tggtacgtgc ccgccatcgt gatgtaccag gccatccaca acaacttcca caccaagctg 1620
gagtacaaga gcgtgagccg ggaggagatg atcgactacc tggagaacaa gaccctgagc 1680
tggaacagca agaaccccgt gagcaacggc tactggaagc ggaagaagga cgacgagctg 1740
aagatcatct acaacgccat caagctgagc gagaaggagg gcaacatctt cgacatccgg 1800
gacctgtaca agcggaccct gaagaacatc gacctgctga cctacagctt cccctgccag 1860
gacctgagcc agcagggcat ccagaagggc atgaagcggg gcagcggcac ccggagcggc 1920
ctgctgtggg agatcgagcg ggccctggac agcaccgaga agaacgacct gcccaagtac 1980
ctgctgatgg agaacgtggg cgccctgctg cacaagaaga acgaggagga gctgaaccag 2040
tggaagcaga agctggagag cctgggctac cagaacagca tcgaggtgct gaacgccgcc 2100
gacttcggca gcagccaggc ccggcggcgg gtgttcatga tcagcaccct gaacgagttc 2160
gtggagctgc ccaagggcga caagaagccc aagagcatca agaaggtgct gaacaagatc 2220
gtgagcgaga aggacatcct gaacaacctg ctgaagtaca acctgaccga gttcaagaaa 2280
accaagagca acatcaacaa ggccagcctg atcggctaca gcaagttcaa cagcgagggc 2340
tacgtgtacg accccgagtt caccggcccc accctgaccg ccagcggcgc caacagccgg 2400
atcaagatca aggacggcag caacatccgg aagatgaaca gcgacgagac cttcctgtac 2460
atcggcttcg acagccagga cggcaagcgg gtgaacgaga tcgagttcct gaccgagaac 2520
cagaagatct tcgtgtgcgg caacagcatc agcgtggagg tgctggaggc catcatcgac 2580
aagatcggcg gccccagcgg cggcaagcgg cccgccgcca ccaagaaggc cggccaggcc 2640
aagaagaaga agggcagcta cccctacgac gtgcccgact acgcctgaag cagcggcggc 2700
aagcggcccg ccgccaccaa gaaggccggc caggccaaga agaagaaggg cagctacccc 2760
tacgacgtgc ccgactacgc ctgagcggcc gcttaattaa gctgccttct gcggggcttg 2820
ccttctggcc atgcccttct tctctccctt gcacctgtac ctcttggtct ttgaataaag 2880
cctgagtagg aagtctagaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2940
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3000
aaaaaaaaaa aa 3012
<![CDATA[<210> 95]]>
<![CDATA[<211> 118]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 95]]>
Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln
1 5 10 15
Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
20 25 30
Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ala Thr Asn Phe Ser
35 40 45
Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn Pro Gly Pro Thr Ser
50 55 60
Ala Gly Lys Leu Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys
65 70 75 80
Gly Asp Val Glu Glu Asn Pro Gly Pro Leu Glu Gly Ser Ser Gly Ser
85 90 95
Gly Ser Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
100 105 110
Ala Gly Ser Ser Gly Ser
115
<![CDATA[<210> 96]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 96]]>
tgccacttcc ccactaaccc 20
<![CDATA[<210> 97]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 97]]>
ggccacacaa ggaagctgca 20
<![CDATA[<210> 98]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 98]]>
ccacacaagg aagctgcagg 20
<![CDATA[<210> 99]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 99]]>
tgattggaat gcaacccgaa 20
<![CDATA[<210> 100]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 100]]>
ttttgccctt gctaccccaa 20
<![CDATA[<210> 101]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 101]]>
agctgatggt atccactagg 20
<![CDATA[<210> 102]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 102]]>
cacatccaag aatgtagtgg 20
<![CDATA[<210> 103]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 103]]>
gatacagcca caaagctcac 20
<![CDATA[<210> 104]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 104]]>
attacataac agaatccagg 20
<![CDATA[<210> 105]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 105]]>
cccttgactg tgctgccacc 20
<![CDATA[<210> 106]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 106]]>
cagacgagga acctgaaccc 20
<![CDATA[<210> 107]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 107]]>
agaatccctt ggggtagcaa 20
<![CDATA[<210> 108]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 108]]>
cagcactctc gctgaccgca 20
<![CDATA[<210> 109]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 109]]>
gttgagtcat gtgtactctg 20
<![CDATA[<210> 110]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 110]]>
aggaacagga tgttacaact 20
<![CDATA[<210> 111]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 111]]>
ggggccacta gggacaggat 20
<![CDATA[<210> 112]]>
<![CDATA[<211> 3650]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 112]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgagaa accttacaaa tgccccgagt gcggcaagag cttcagcaga agcgacgatc 180
tggtgaggca ccaaagaacc cacaccggcg aaaaacctta caagtgtccc gaatgcggaa 240
agtccttcag cagagaggac aatctgcaca cccaccagag aacacacacc ggagaaaagc 300
cttacaagtg ccccgaatgc ggcaaatcct tttctagaag cgatcatctg accacccacc 360
aaagaacaca taccggcgag aagccttaca aatgtcccga gtgcggaaag tccttctccc 420
agagagccaa tctgagggct catcaaagga cccataccgg cgaaaagccc tacaaatgcc 480
ccgagtgcgg aaaatccttc agccagctgg cccatctgag agcccaccaa aggacacaca 540
ccggagagaa accctataag tgccccgagt gtggaaaaag cttttcccag agggccaatc 600
tgagggccca tcagaggacc cataccggag agaagcctta taaatgtccc gagtgcggaa 660
aaagcttcag cgagaggagc catctgaggg aacatcaaag aacccacacc ggcgaaaaac 720
ccaccggaaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gcccgggaca 780
gcaaggtgga gaacaagacc aagaagctgc gggtgttcga ggccttcgcc ggcatcggcg 840
cccagcggaa ggccctggag aaggtgcgga aggacgagta cgagatcgtg ggcctggccg 900
agtggtacgt gcccgccatc gtgatgtacc aggccatcca caacaacttc cacaccaagc 960
tggagtacaa gagcgtgagc cgggaggaga tgatcgacta cctggagaac aagaccctga 1020
gctggaacag caagaacccc gtgagcaacg gctactggaa gcggaagaag gacgacgagc 1080
tgaagatcat ctacaacgcc atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctgta caagcggacc ctgaagaaca tcgacctgct gacctacagc ttcccctgcc 1200
aggacctgag ccagcagggc atccagaagg gcatgaagcg gggcagcggc acccggagcg 1260
gcctgctgtg ggagatcgag cgggccctgg acagcaccga gaagaacgac ctgcccaagt 1320
acctgctgat ggagaacgtg ggcgccctgc tgcacaagaa gaacgaggag gagctgaacc 1380
agtggaagca gaagctggag agcctgggct accagaacag catcgaggtg ctgaacgccg 1440
ccgacttcgg cagcagccag gcccggcggc gggtgttcat gatcagcacc ctgaacgagt 1500
tcgtggagct gcccaagggc gacaagaagc ccaagagcat caagaaggtg ctgaacaaga 1560
tcgtgagcga gaaggacatc ctgaacaacc tgctgaagta caacctgacc gagttcaaga 1620
aaaccaagag caacatcaac aaggccagcc tgatcggcta cagcaagttc aacagcgagg 1680
gctacgtgta cgaccccgag ttcaccggcc ccaccctgac cgccagcggc gccaacagcc 1740
ggatcaagat caaggacggc agcaacatcc ggaagatgaa cagcgacgag accttcctgt 1800
acatcggctt cgacagccag gacggcaagc gggtgaacga gatcgagttc ctgaccgaga 1860
accagaagat cttcgtgtgc ggcaacagca tcagcgtgga ggtgctggag gccatcatcg 1920
acaagatcgg cggccccagc agcggcggca agcggcccgc cgccaccaag aaggccggcc 1980
aggccaagaa gaagaagggc agctacccct acgacgtgcc cgactacgcc gggtcctacc 2040
cgtacgacgt gcccgattac gccgccacca acttctcgct gctgaagcag gccggagatg 2100
tggaagaaaa ccctggacct accagtgccg gaaagctcgg tagcggagag ggtcggggaa 2160
gcctgcttac ttgcggcgac gtggaagaga accccggtcc gctggagggt tcgtccggct 2220
ccggatcccc caagaagaag cggaaggtgg gcatccacgg cgtgcccgcc gccggcagca 2280
gcggatccct ggagcccggc gaaaaacctt acaagtgccc cgagtgcgga aagagcttca 2340
gcagaagcga caaactggtg aggcatcaaa ggacacatac cggagagaag ccctataagt 2400
gccccgaatg tggcaaatcc ttttcccaga gggctcatct ggaaagacac cagaggaccc 2460
ataccggcga aaaaccctac aaatgtcccg agtgtggaaa gagcttttcc gatcccggcc 2520
atctggtcag acatcagagg acacataccg gcgaaaagcc ttacaagtgt cccgaatgcg 2580
gaaaatcctt ctccagaagc gacaagctgg tgaggcacca aagaacccac accggcgaaa 2640
aaccctataa atgccccgag tgcggcaagt cctttagcca gctggcccat ctgagagccc 2700
accagagaac acacaccgga gagaagcctt ataagtgtcc cgagtgcgga aagtccttct 2760
ctagagccga caatctgacc gaacatcaaa ggacacacac cggcgagaaa ccttataaat 2820
gccccgagtg cggaaaaagc ttttccgact gcagagatct ggctagacac cagagaaccc 2880
acaccggcga gaaacccacc ggcaaaaaga ccagcgctag cggcagcggc ggcggcagcg 2940
gcggcgacgc caagagcctg accgcctgga gccggaccct ggtgaccttc aaggacgtgt 3000
tcgtggactt cacccgggag gagtggaagc tgctggacac cgcccagcag atcctgtacc 3060
ggaacgtgat gctggagaac tacaagaacc tggtgagcct gggctaccag ctgaccaagc 3120
ccgacgtgat cctgcggctg gagaagggcg aggagccctg gctggtggag cgggagatcc 3180
accaggagac ccaccccgac agcgagaccg ccttcgagat caagagcagc gtgagcggcg 3240
gcaagcggcc cgccgccacc aagaaggccg gccaggccaa gaagaagaag ggcagctacc 3300
cctacgacgt gcccgactac gcctgaagca gcggcggcaa gcggcccgcc gccaccaaga 3360
aggccggcca ggccaagaag aagaagggca gctaccccta cgacgtgccc gactacgcct 3420
gagcggccgc ttaattaagc tgccttctgc ggggcttgcc ttctggccat gcccttcttc 3480
tctcccttgc acctgtacct cttggtcttt gaataaagcc tgagtaggaa gtctagaaaa 3540
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3600
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3650
<![CDATA[<210> 113]]>
<![CDATA[<211> 3650]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 113]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgaaaa accttacaag tgccccgagt gcggaaagag cttcagcaga agcgacaaac 180
tggtgaggca tcaaaggaca cataccggag agaagcccta taagtgcccc gaatgtggca 240
aatccttttc ccagagggct catctggaaa gacaccagag gacccatacc ggcgaaaaac 300
cctacaaatg tcccgagtgt ggaaagagct tttccgatcc cggccatctg gtcagacatc 360
agaggacaca taccggcgaa aagccttaca agtgtcccga atgcggaaaa tccttctcca 420
gaagcgacaa gctggtgagg caccaaagaa cccacaccgg cgaaaaaccc tataaatgcc 480
ccgagtgcgg caagtccttt agccagctgg cccatctgag agcccaccag agaacacaca 540
ccggagagaa gccttataag tgtcccgagt gcggaaagtc cttctctaga gccgacaatc 600
tgaccgaaca tcaaaggaca cacaccggcg agaaacctta taaatgcccc gagtgcggaa 660
aaagcttttc cgactgcaga gatctggcta gacaccagag aacccacacc ggcgagaaac 720
ccaccggcaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gacgccaaga 780
gcctgaccgc ctggagccgg accctggtga ccttcaagga cgtgttcgtg gacttcaccc 840
gggaggagtg gaagctgctg gacaccgccc agcagatcct gtaccggaac gtgatgctgg 900
agaactacaa gaacctggtg agcctgggct accagctgac caagcccgac gtgatcctgc 960
ggctggagaa gggcgaggag ccctggctgg tggagcggga gatccaccag gagacccacc 1020
ccgacagcga gaccgccttc gagatcaaga gcagcgtgag cagcggcggc aagcggcccg 1080
ccgccaccaa gaaggccggc caggccaaga agaagaaggg cagctacccc tacgacgtgc 1140
ccgactacgc cgggtcctac ccgtacgacg tgcccgatta cgccgccacc aacttctcgc 1200
tgctgaagca ggccggagat gtggaagaaa accctggacc taccagtgcc ggaaagctcg 1260
gtagcggaga gggtcgggga agcctgctta cttgcggcga cgtggaagag aaccccggtc 1320
cgctggaggg ttcgtccggc tccggatccc ccaagaagaa gcggaaggtg ggcatccacg 1380
gcgtgcccgc cgccggcagc agcggatccc tggagcccgg cgagaaacct tacaaatgcc 1440
ccgagtgcgg caagagcttc agcagaagcg acgatctggt gaggcaccaa agaacccaca 1500
ccggcgaaaa accttacaag tgtcccgaat gcggaaagtc cttcagcaga gaggacaatc 1560
tgcacaccca ccagagaaca cacaccggag aaaagcctta caagtgcccc gaatgcggca 1620
aatccttttc tagaagcgat catctgacca cccaccaaag aacacatacc ggcgagaagc 1680
cttacaaatg tcccgagtgc ggaaagtcct tctcccagag agccaatctg agggctcatc 1740
aaaggaccca taccggcgaa aagccctaca aatgccccga gtgcggaaaa tccttcagcc 1800
agctggccca tctgagagcc caccaaagga cacacaccgg agagaaaccc tataagtgcc 1860
ccgagtgtgg aaaaagcttt tcccagaggg ccaatctgag ggcccatcag aggacccata 1920
ccggagagaa gccttataaa tgtcccgagt gcggaaaaag cttcagcgag aggagccatc 1980
tgagggaaca tcaaagaacc cacaccggcg aaaaacccac cggaaaaaag accagcgcta 2040
gcggcagcgg cggcggcagc ggcggcgccc gggacagcaa ggtggagaac aagaccaaga 2100
agctgcgggt gttcgaggcc ttcgccggca tcggcgccca gcggaaggcc ctggagaagg 2160
tgcggaagga cgagtacgag atcgtgggcc tggccgagtg gtacgtgccc gccatcgtga 2220
tgtaccaggc catccacaac aacttccaca ccaagctgga gtacaagagc gtgagccggg 2280
aggagatgat cgactacctg gagaacaaga ccctgagctg gaacagcaag aaccccgtga 2340
gcaacggcta ctggaagcgg aagaaggacg acgagctgaa gatcatctac aacgccatca 2400
agctgagcga gaaggagggc aacatcttcg acatccggga cctgtacaag cggaccctga 2460
agaacatcga cctgctgacc tacagcttcc cctgccagga cctgagccag cagggcatcc 2520
agaagggcat gaagcggggc agcggcaccc ggagcggcct gctgtgggag atcgagcggg 2580
ccctggacag caccgagaag aacgacctgc ccaagtacct gctgatggag aacgtgggcg 2640
ccctgctgca caagaagaac gaggaggagc tgaaccagtg gaagcagaag ctggagagcc 2700
tgggctacca gaacagcatc gaggtgctga acgccgccga cttcggcagc agccaggccc 2760
ggcggcgggt gttcatgatc agcaccctga acgagttcgt ggagctgccc aagggcgaca 2820
agaagcccaa gagcatcaag aaggtgctga acaagatcgt gagcgagaag gacatcctga 2880
acaacctgct gaagtacaac ctgaccgagt tcaagaaaac caagagcaac atcaacaagg 2940
ccagcctgat cggctacagc aagttcaaca gcgagggcta cgtgtacgac cccgagttca 3000
ccggccccac cctgaccgcc agcggcgcca acagccggat caagatcaag gacggcagca 3060
acatccggaa gatgaacagc gacgagacct tcctgtacat cggcttcgac agccaggacg 3120
gcaagcgggt gaacgagatc gagttcctga ccgagaacca gaagatcttc gtgtgcggca 3180
acagcatcag cgtggaggtg ctggaggcca tcatcgacaa gatcggcggc cccagcggcg 3240
gcaagcggcc cgccgccacc aagaaggccg gccaggccaa gaagaagaag ggcagctacc 3300
cctacgacgt gcccgactac gcctgaagca gcggcggcaa gcggcccgcc gccaccaaga 3360
aggccggcca ggccaagaag aagaagggca gctaccccta cgacgtgccc gactacgcct 3420
gagcggccgc ttaattaagc tgccttctgc ggggcttgcc ttctggccat gcccttcttc 3480
tctcccttgc acctgtacct cttggtcttt gaataaagcc tgagtaggaa gtctagaaaa 3540
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3600
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3650
<![CDATA[<210> 114]]>
<![CDATA[<211> 542]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 114]]>
Asn His Asp Gln Glu Phe Asp Pro Pro Lys Val Tyr Pro Pro Val Pro
1 5 10 15
Ala Glu Lys Arg Lys Pro Ile Arg Val Leu Ser Leu Phe Asp Gly Ile
20 25 30
Ala Thr Gly Leu Leu Val Leu Lys Asp Leu Gly Ile Gln Val Asp Arg
35 40 45
Tyr Ile Ala Ser Glu Val Cys Glu Asp Ser Ile Thr Val Gly Met Val
50 55 60
Arg His Gln Gly Lys Ile Met Tyr Val Gly Asp Val Arg Ser Val Thr
65 70 75 80
Gln Lys His Ile Gln Glu Trp Gly Pro Phe Asp Leu Val Ile Gly Gly
85 90 95
Ser Pro Cys Asn Asp Leu Ser Ile Val Asn Pro Ala Arg Lys Gly Leu
100 105 110
Tyr Glu Gly Thr Gly Arg Leu Phe Phe Glu Phe Tyr Arg Leu Leu His
115 120 125
Asp Ala Arg Pro Lys Glu Gly Asp Asp Arg Pro Phe Phe Trp Leu Phe
130 135 140
Glu Asn Val Val Ala Met Gly Val Ser Asp Lys Arg Asp Ile Ser Arg
145 150 155 160
Phe Leu Glu Ser Asn Pro Val Met Ile Asp Ala Lys Glu Val Ser Ala
165 170 175
Ala His Arg Ala Arg Tyr Phe Trp Gly Asn Leu Pro Gly Met Asn Arg
180 185 190
Pro Leu Ala Ser Thr Val Asn Asp Lys Leu Glu Leu Gln Glu Cys Leu
195 200 205
Glu His Gly Arg Ile Ala Lys Phe Ser Lys Val Arg Thr Ile Thr Thr
210 215 220
Arg Ser Asn Ser Ile Lys Gln Gly Lys Asp Gln His Phe Pro Val Phe
225 230 235 240
Met Asn Glu Lys Glu Asp Ile Leu Trp Cys Thr Glu Met Glu Arg Val
245 250 255
Phe Gly Phe Pro Val His Tyr Thr Asp Val Ser Asn Met Ser Arg Leu
260 265 270
Ala Arg Gln Arg Leu Leu Gly Arg Ser Trp Ser Val Pro Val Ile Arg
275 280 285
His Leu Phe Ala Pro Leu Lys Glu Tyr Phe Ala Cys Val Ser Ser Gly
290 295 300
Asn Ser Asn Ala Asn Ser Arg Gly Pro Ser Phe Ser Ser Gly Leu Val
305 310 315 320
Pro Leu Ser Leu Arg Gly Ser His Met Asn Pro Leu Glu Met Phe Glu
325 330 335
Thr Val Pro Val Trp Arg Arg Gln Pro Val Arg Val Leu Ser Leu Phe
340 345 350
Glu Asp Ile Lys Lys Glu Leu Thr Ser Leu Gly Phe Leu Glu Ser Gly
355 360 365
Ser Asp Pro Gly Gln Leu Lys His Val Val Asp Val Thr Asp Thr Val
370 375 380
Arg Lys Asp Val Glu Glu Trp Gly Pro Phe Asp Leu Val Tyr Gly Ala
385 390 395 400
Thr Pro Pro Leu Gly His Thr Cys Asp Arg Pro Pro Ser Trp Tyr Leu
405 410 415
Phe Gln Phe His Arg Leu Leu Gln Tyr Ala Arg Pro Lys Pro Gly Ser
420 425 430
Pro Arg Pro Phe Phe Trp Met Phe Val Asp Asn Leu Val Leu Asn Lys
435 440 445
Glu Asp Leu Asp Val Ala Ser Arg Phe Leu Glu Met Glu Pro Val Thr
450 455 460
Ile Pro Asp Val His Gly Gly Ser Leu Gln Asn Ala Val Arg Val Trp
465 470 475 480
Ser Asn Ile Pro Ala Ile Arg Ser Arg His Trp Ala Leu Val Ser Glu
485 490 495
Glu Glu Leu Ser Leu Leu Ala Gln Asn Lys Gln Ser Ser Lys Leu Ala
500 505 510
Ala Lys Trp Pro Thr Lys Leu Val Lys Asn Cys Phe Leu Pro Leu Arg
515 520 525
Glu Tyr Phe Lys Tyr Phe Ser Thr Glu Leu Thr Ser Ser Leu
530 535 540
<![CDATA[<210> 115]]>
<![CDATA[<211> 627]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 115]]>
ctggagcccg gcgagaaacc ctataaatgc cccgaatgcg gaaaaagctt cagccagtcc 60
agctctctgg tgagacatca gaggacacac accggcgaaa agccttataa gtgccccgag 120
tgcggcaagt ccttctctag aagcgatcac ctcaccaatc atcagaggac acataccgga 180
gagaagccct ataagtgccc cgagtgcggc aagagcttta gccagctggc tcatctgaga 240
gctcaccaaa gaacccatac cggcgagaag ccttacaaat gccccgagtg tggaaaatcc 300
ttttcccagt ccagcaacct cgtcagacat caaaggaccc ataccggcga aaagccttac 360
aagtgtcccg agtgcggaaa gtccttctct agatccgaca acctcgtgag gcaccagaga 420
acccacaccg gcgagaaacc ttacaaatgt cccgagtgtg gcaaaagctt ttctagaagc 480
gacgagctgg tgagacatca aagaacccat accggcgaaa aaccttataa gtgtcccgag 540
tgcggcaaat cctttagcca gctggcccat ctgagggccc accagagaac acataccggc 600
gaaaaaccca ccggcaaaaa gacaagc 627
<![CDATA[<210> 116]]>
<![CDATA[<211> 2227]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 116]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgagaa accctacaag tgccccgagt gtggaaaaag ctttagccaa agcggcgatc 180
tgaggagaca ccaaagaaca cacaccggcg agaagcccta caaatgtccc gagtgcggaa 240
agagcttcag ccagagcggc catctgaccg agcatcagag aacccatacc ggcgaaaaac 300
cttataagtg ccccgagtgt ggaaagtcct tctccgagag atcccatctg agagaacacc 360
agaggacaca caccggcgaa aaaccttata agtgtcccga gtgcggaaag tccttcagcg 420
atcccggcca tctggtgaga catcaaagga cacataccgg cgaaaaacct tataagtgtc 480
ccgaatgcgg caagagcttt agcagaaacg acacactcac cgaacaccag aggacccaca 540
ccggcgagaa accctacaaa tgccccgagt gcggcaaatc cttttctaga gccgacaatc 600
tgaccgaaca ccagaggacc cataccggag aaaagcctta caaatgtccc gagtgtggca 660
aatccttctc cacccatctg gatctgatta gacaccaaag aacacatacc ggagaaaagc 720
ccaccggaaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gcccgggaca 780
gcaaggtgga gaacaagacc aagaagctgc gggtgttcga ggccttcgcc ggcatcggcg 840
cccagcggaa ggccctggag aaggtgcgga aggacgagta cgagatcgtg ggcctggccg 900
agtggtacgt gcccgccatc gtgatgtacc aggccatcca caacaacttc cacaccaagc 960
tggagtacaa gagcgtgagc cgggaggaga tgatcgacta cctggagaac aagaccctga 1020
gctggaacag caagaacccc gtgagcaacg gctactggaa gcggaagaag gacgacgagc 1080
tgaagatcat ctacaacgcc atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctgta caagcggacc ctgaagaaca tcgacctgct gacctacagc ttcccctgcc 1200
aggacctgag ccagcagggc atccagaagg gcatgaagcg gggcagcggc acccggagcg 1260
gcctgctgtg ggagatcgag cgggccctgg acagcaccga gaagaacgac ctgcccaagt 1320
acctgctgat ggagaacgtg ggcgccctgc tgcacaagaa gaacgaggag gagctgaacc 1380
agtggaagca gaagctggag agcctgggct accagaacag catcgaggtg ctgaacgccg 1440
ccgacttcgg cagcagccag gcccggcggc gggtgttcat gatcagcacc ctgaacgagt 1500
tcgtggagct gcccaagggc gacaagaagc ccaagagcat caagaaggtg ctgaacaaga 1560
tcgtgagcga gaaggacatc ctgaacaacc tgctgaagta caacctgacc gagttcaaga 1620
aaaccaagag caacatcaac aaggccagcc tgatcggcta cagcaagttc aacagcgagg 1680
gctacgtgta cgaccccgag ttcaccggcc ccaccctgac cgccagcggc gccaacagcc 1740
ggatcaagat caaggacggc agcaacatcc ggaagatgaa cagcgacgag accttcctgt 1800
acatcggctt cgacagccag gacggcaagc gggtgaacga gatcgagttc ctgaccgaga 1860
accagaagat cttcgtgtgc ggcaacagca tcagcgtgga ggtgctggag gccatcatcg 1920
acaagatcgg cggccccagc agcggcggca agcggcccgc cgccaccaag aaggccggcc 1980
aggccaagaa gaagaagggc agctacccct acgacgtgcc cgactacgcc tgagcggccg 2040
cttaattaag ctgccttctg cggggcttgc cttctggcca tgcccttctt ctctcccttg 2100
cacctgtacc tcttggtctt tgaataaagc ctgagtagga agtctagaaa aaaaaaaaaa 2160
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2220
aaaaaaa 2227
<![CDATA[<210> 117]]>
<![CDATA[<211> 2227]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 117]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgaaaa accctataag tgccccgaat gtggaaagag cttcagccat accggccatc 180
tgctggaaca ccaaaggaca cacaccggcg agaaacctta caagtgtccc gagtgcggaa 240
aaagcttctc ctccaaaaag gctctcaccg agcaccagag aacacatacc ggcgaaaagc 300
cttataagtg ccccgagtgt ggcaaatcct tttccgactg tagagatctg gccagacatc 360
aaagaaccca caccggagag aaaccttata aatgccccga gtgcggcaag tcctttagcc 420
ataccggcca tctgctggag caccagagga cccataccgg cgagaagcct tacaaatgcc 480
ccgagtgcgg caaaagcttc agcagaaatg acgctctgac cgagcatcaa aggacccata 540
ccggcgaaaa gccctacaag tgtcccgagt gtggaaagtc cttctcccag agcggcgatc 600
tgaggagaca ccagagaaca cacaccggcg agaaacccta taaatgtccc gagtgcggaa 660
agagctttag cgacagcggc aatctgaggg tgcatcaaag aacacacacc ggcgaaaaac 720
ccaccggaaa aaagacaagc gctagcggca gcggcggcgg cagcggcggc gcccgggaca 780
gcaaggtgga gaacaagacc aagaagctgc gggtgttcga ggccttcgcc ggcatcggcg 840
cccagcggaa ggccctggag aaggtgcgga aggacgagta cgagatcgtg ggcctggccg 900
agtggtacgt gcccgccatc gtgatgtacc aggccatcca caacaacttc cacaccaagc 960
tggagtacaa gagcgtgagc cgggaggaga tgatcgacta cctggagaac aagaccctga 1020
gctggaacag caagaacccc gtgagcaacg gctactggaa gcggaagaag gacgacgagc 1080
tgaagatcat ctacaacgcc atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctgta caagcggacc ctgaagaaca tcgacctgct gacctacagc ttcccctgcc 1200
aggacctgag ccagcagggc atccagaagg gcatgaagcg gggcagcggc acccggagcg 1260
gcctgctgtg ggagatcgag cgggccctgg acagcaccga gaagaacgac ctgcccaagt 1320
acctgctgat ggagaacgtg ggcgccctgc tgcacaagaa gaacgaggag gagctgaacc 1380
agtggaagca gaagctggag agcctgggct accagaacag catcgaggtg ctgaacgccg 1440
ccgacttcgg cagcagccag gcccggcggc gggtgttcat gatcagcacc ctgaacgagt 1500
tcgtggagct gcccaagggc gacaagaagc ccaagagcat caagaaggtg ctgaacaaga 1560
tcgtgagcga gaaggacatc ctgaacaacc tgctgaagta caacctgacc gagttcaaga 1620
aaaccaagag caacatcaac aaggccagcc tgatcggcta cagcaagttc aacagcgagg 1680
gctacgtgta cgaccccgag ttcaccggcc ccaccctgac cgccagcggc gccaacagcc 1740
ggatcaagat caaggacggc agcaacatcc ggaagatgaa cagcgacgag accttcctgt 1800
acatcggctt cgacagccag gacggcaagc gggtgaacga gatcgagttc ctgaccgaga 1860
accagaagat cttcgtgtgc ggcaacagca tcagcgtgga ggtgctggag gccatcatcg 1920
acaagatcgg cggccccagc agcggcggca agcggcccgc cgccaccaag aaggccggcc 1980
aggccaagaa gaagaagggc agctacccct acgacgtgcc cgactacgcc tgagcggccg 2040
cttaattaag ctgccttctg cggggcttgc cttctggcca tgcccttctt ctctcccttg 2100
cacctgtacc tcttggtctt tgaataaagc ctgagtagga agtctagaaa aaaaaaaaaa 2160
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2220
aaaaaaa 2227
<![CDATA[<210> 118]]>
<![CDATA[<211> 2227]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 118]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgagaa accctataaa tgccccgaat gcggaaaaag cttcagccag tccagctctc 180
tggtgagaca tcagaggaca cacaccggcg aaaagcctta taagtgcccc gagtgcggca 240
agtccttctc tagaagcgat cacctcacca atcatcagag gacacatacc ggagagaagc 300
cctataagtg ccccgagtgc ggcaagagct ttagccagct ggctcatctg agagctcacc 360
aaagaaccca taccggcgag aagccttaca aatgccccga gtgtggaaaa tccttttccc 420
agtccagcaa cctcgtcaga catcaaagga cccataccgg cgaaaagcct tacaagtgtc 480
ccgagtgcgg aaagtccttc tctagatccg acaacctcgt gaggcaccag agaacccaca 540
ccggcgagaa accttacaaa tgtcccgagt gtggcaaaag cttttctaga agcgacgagc 600
tggtgagaca tcaaagaacc cataccggcg aaaaacctta taagtgtccc gagtgcggca 660
aatcctttag ccagctggcc catctgaggg cccaccagag aacacatacc ggcgaaaaac 720
ccaccggcaa aaagacaagc gctagcggca gcggcggcgg cagcggcggc gcccgggaca 780
gcaaggtgga gaacaagacc aagaagctgc gggtgttcga ggccttcgcc ggcatcggcg 840
cccagcggaa ggccctggag aaggtgcgga aggacgagta cgagatcgtg ggcctggccg 900
agtggtacgt gcccgccatc gtgatgtacc aggccatcca caacaacttc cacaccaagc 960
tggagtacaa gagcgtgagc cgggaggaga tgatcgacta cctggagaac aagaccctga 1020
gctggaacag caagaacccc gtgagcaacg gctactggaa gcggaagaag gacgacgagc 1080
tgaagatcat ctacaacgcc atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctgta caagcggacc ctgaagaaca tcgacctgct gacctacagc ttcccctgcc 1200
aggacctgag ccagcagggc atccagaagg gcatgaagcg gggcagcggc acccggagcg 1260
gcctgctgtg ggagatcgag cgggccctgg acagcaccga gaagaacgac ctgcccaagt 1320
acctgctgat ggagaacgtg ggcgccctgc tgcacaagaa gaacgaggag gagctgaacc 1380
agtggaagca gaagctggag agcctgggct accagaacag catcgaggtg ctgaacgccg 1440
ccgacttcgg cagcagccag gcccggcggc gggtgttcat gatcagcacc ctgaacgagt 1500
tcgtggagct gcccaagggc gacaagaagc ccaagagcat caagaaggtg ctgaacaaga 1560
tcgtgagcga gaaggacatc ctgaacaacc tgctgaagta caacctgacc gagttcaaga 1620
aaaccaagag caacatcaac aaggccagcc tgatcggcta cagcaagttc aacagcgagg 1680
gctacgtgta cgaccccgag ttcaccggcc ccaccctgac cgccagcggc gccaacagcc 1740
ggatcaagat caaggacggc agcaacatcc ggaagatgaa cagcgacgag accttcctgt 1800
acatcggctt cgacagccag gacggcaagc gggtgaacga gatcgagttc ctgaccgaga 1860
accagaagat cttcgtgtgc ggcaacagca tcagcgtgga ggtgctggag gccatcatcg 1920
acaagatcgg cggccccagc agcggcggca agcggcccgc cgccaccaag aaggccggcc 1980
aggccaagaa gaagaagggc agctacccct acgacgtgcc cgactacgcc tgagcggccg 2040
cttaattaag ctgccttctg cggggcttgc cttctggcca tgcccttctt ctctcccttg 2100
cacctgtacc tcttggtctt tgaataaagc ctgagtagga agtctagaaa aaaaaaaaaa 2160
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2220
aaaaaaa 2227
<![CDATA[<210> 119]]>
<![CDATA[<211> 5705]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 119]]>
aaggaaataa gagagaaaag aagagtaaga agaaatataa gagccaccat ggcccccaag 60
aagaagcgga aggtgggcat ccacggcgtg cccgccgccg acaagaagta cagcatcggc 120
ctggccatcg gcaccaacag cgtgggctgg gccgtgatca ccgacgagta caaggtgccc 180
agcaagaagt tcaaggtgct gggcaacacc gaccggcaca gcatcaagaa gaacctgatc 240
ggcgccctgc tgttcgacag cggcgagacc gccgaggcca cccggctgaa gcggaccgcc 300
cggcggcggt acacccggcg gaagaaccgg atctgctacc tgcaggagat cttcagcaac 360
gagatggcca aggtggacga cagcttcttc caccggctgg aggagagctt cctggtggag 420
gaggacaaga agcacgagcg gcaccccatc ttcggcaaca tcgtggacga ggtggcctac 480
cacgagaagt accccaccat ctaccacctg cggaagaagc tggtggacag caccgacaag 540
gccgacctgc ggctgatcta cctggccctg gcccacatga tcaagttccg gggccacttc 600
ctgatcgagg gcgacctgaa ccccgacaac agcgacgtgg acaagctgtt catccagctg 660
gtgcagacct acaaccagct gttcgaggag aaccccatca acgccagcgg cgtggacgcc 720
aaggccatcc tgagcgcccg gctgagcaag agccggcggc tggagaacct gatcgcccag 780
ctgcccggcg agaagaagaa cggcctgttc ggcaacctga tcgccctgag cctgggcctg 840
acccccaact tcaagagcaa cttcgacctg gccgaggacg ccaagctgca gctgagcaag 900
gacacctacg acgacgacct ggacaacctg ctggcccaga tcggcgacca gtacgccgac 960
ctgttcctgg ccgccaagaa cctgagcgac gccatcctgc tgagcgacat cctgcgggtg 1020
aacaccgaga tcaccaaggc ccccctgagc gccagcatga tcaagcggta cgacgagcac 1080
caccaggacc tgaccctgct gaaggccctg gtgcggcagc agctgcccga gaagtacaag 1140
gagatcttct tcgaccagag caagaacggc tacgccggct acatcgacgg cggcgccagc 1200
caggaggagt tctacaagtt catcaagccc atcctggaga agatggacgg caccgaggag 1260
ctgctggtga agctgaaccg ggaggacctg ctgcggaagc agcggacctt cgacaacggc 1320
agcatccccc accagatcca cctgggcgag ctgcacgcca tcctgcggcg gcaggaggac 1380
ttctacccct tcctgaagga caaccgggag aagatcgaga agatcctgac cttccggatc 1440
ccctactacg tgggccccct ggcccggggc aacagccggt tcgcctggat gacccggaaa 1500
tccgaggaga ccatcacccc ctggaacttc gaggaggtgg tggacaaggg cgccagcgcc 1560
cagagcttca tcgagcggat gaccaacttc gacaagaacc tgcccaacga gaaggtgctg 1620
cccaagcaca gcctgctgta cgagtacttc accgtgtaca acgagctgac caaggtgaag 1680
tacgtgaccg agggcatgcg gaagcccgcc ttcctgagcg gcgagcagaa gaaggccatc 1740
gtggacctgc tgttcaagac caaccggaag gtgaccgtga agcagctgaa ggaggactac 1800
ttcaagaaga tcgagtgctt cgacagcgtg gagatcagcg gcgtggagga ccggttcaac 1860
gccagcctgg gcacctacca cgacctgctg aagatcatca aggacaagga cttcctggac 1920
aacgaggaga acgaggacat cctggaggac atcgtgctga ccctgaccct gttcgaggac 1980
cgggagatga tcgaggagcg gctgaaaacc tacgcccacc tgttcgacga caaggtgatg 2040
aagcagctga agcggcggcg gtacaccggc tggggccggc tgagccggaa gctgatcaac 2100
ggcatccggg acaagcagag cggcaagacc atcctggact tcctgaaatc cgacggcttc 2160
gccaaccgga acttcatgca gctgatccac gacgacagcc tgaccttcaa ggaggacatc 2220
cagaaggccc aggtgagcgg ccagggcgac agcctgcacg agcacatcgc caacctggcc 2280
ggcagccccg ccatcaagaa gggcatcctg cagaccgtga aggtggtgga cgagctggtg 2340
aaggtgatgg gccggcacaa gcccgagaac atcgtgatcg agatggcccg ggagaaccag 2400
accacccaga agggccagaa gaacagccgg gagcggatga agcggatcga ggagggcatc 2460
aaggagctgg gcagccagat cctgaaggag caccccgtgg agaacaccca gctgcagaac 2520
gagaagctgt acctgtacta cctgcagaac ggccgggaca tgtacgtgga ccaggagctg 2580
gacatcaacc ggctgagcga ctacgacgtg gccgccatcg tgccccagag cttcctgaag 2640
gacgacagca tcgacaacaa ggtgctgacc cggagcgaca aggcccgggg caagagcgac 2700
aacgtgccca gcgaggaggt ggtgaagaag atgaagaact actggcggca gctgctgaac 2760
gccaagctga tcacccagcg gaagttcgac aacctgacca aggccgagcg gggcggcctg 2820
agcgagctgg acaaggccgg cttcatcaag cggcagctgg tggagacccg gcagatcacc 2880
aagcacgtgg cccagatcct ggacagccgg atgaacacca agtacgacga gaacgacaag 2940
ctgatccggg aggtgaaggt gatcaccctg aaatccaagc tggtgagcga cttccggaag 3000
gacttccagt tctacaaggt gcgggagatc aacaactacc accacgccca cgacgcctac 3060
ctgaacgccg tggtgggcac cgccctgatc aagaagtacc ccaagctgga gagcgagttc 3120
gtgtacggcg actacaaggt gtacgacgtg cggaagatga tcgccaagag cgagcaggag 3180
atcggcaagg ccaccgccaa gtacttcttc tacagcaaca tcatgaactt cttcaagacc 3240
gagatcaccc tggccaacgg cgagatccgg aagcggcccc tgatcgagac caacggcgag 3300
accggcgaga tcgtgtggga caagggccgg gacttcgcca ccgtgcggaa ggtgctgagc 3360
atgccccagg tgaacatcgt gaagaaaacc gaggtgcaga ccggcggctt cagcaaggag 3420
agcatcctgc ccaagcggaa cagcgacaag ctgatcgccc ggaagaagga ctgggacccc 3480
aagaagtacg gcggcttcga cagccccacc gtggcctaca gcgtgctggt ggtggccaag 3540
gtggagaagg gcaagagcaa gaagctgaaa tccgtgaagg agctgctggg catcaccatc 3600
atggagcgga gcagcttcga gaagaacccc atcgacttcc tggaggccaa gggctacaag 3660
gaggtgaaga aggacctgat catcaagctg cccaagtaca gcctgttcga gctggagaac 3720
ggccggaagc ggatgctggc cagcgccggc gagctgcaga agggcaacga gctggccctg 3780
cccagcaagt acgtgaactt cctgtacctg gccagccact acgagaagct gaagggcagc 3840
cccgaggaca acgagcagaa gcagctgttc gtggagcagc acaagcacta cctggacgag 3900
atcatcgagc agatcagcga gttcagcaag cgggtgatcc tggccgacgc caacctggac 3960
aaggtgctga gcgcctacaa caagcaccgg gacaagccca tccgggagca ggccgagaac 4020
atcatccacc tgttcaccct gaccaacctg ggcgcccccg ccgccttcaa gtacttcgac 4080
accaccatcg accggaagcg gtacaccagc accaaggagg tgctggacgc caccctgatc 4140
caccagagca tcaccggcct gtacgagacc cggatcgacc tgagccagct gggcggcgac 4200
aagcggcccg ccgccaccaa gaaggccggc caggccaaga agaagaaggc ccgggacagc 4260
aaggtggaga acaagaccaa gaagctgcgg gtgttcgagg ccttcgccgg catcggcgcc 4320
cagcggaagg ccctggagaa ggtgcggaag gacgagtacg agatcgtggg cctggccgag 4380
tggtacgtgc ccgccatcgt gatgtaccag gccatccaca acaacttcca caccaagctg 4440
gagtacaaga gcgtgagccg ggaggagatg atcgactacc tggagaacaa gaccctgagc 4500
tggaacagca agaaccccgt gagcaacggc tactggaagc ggaagaagga cgacgagctg 4560
aagatcatct acaacgccat caagctgagc gagaaggagg gcaacatctt cgacatccgg 4620
gacctgtaca agcggaccct gaagaacatc gacctgctga cctacagctt cccctgccag 4680
gacctgagcc agcagggcat ccagaagggc atgaagcggg gcagcggcac ccggagcggc 4740
ctgctgtggg agatcgagcg ggccctggac agcaccgaga agaacgacct gcccaagtac 4800
ctgctgatgg agaacgtggg cgccctgctg cacaagaaga acgaggagga gctgaaccag 4860
tggaagcaga agctggagag cctgggctac cagaacagca tcgaggtgct gaacgccgcc 4920
gacttcggca gcagccaggc ccggcggcgg gtgttcatga tcagcaccct gaacgagttc 4980
gtggagctgc ccaagggcga caagaagccc aagagcatca agaaggtgct gaacaagatc 5040
gtgagcgaga aggacatcct gaacaacctg ctgaagtaca acctgaccga gttcaagaaa 5100
accaagagca acatcaacaa ggccagcctg atcggctaca gcaagttcaa cagcgagggc 5160
tacgtgtacg accccgagtt caccggcccc accctgaccg ccagcggcgc caacagccgg 5220
atcaagatca aggacggcag caacatccgg aagatgaaca gcgacgagac cttcctgtac 5280
atcggcttcg acagccagga cggcaagcgg gtgaacgaga tcgagttcct gaccgagaac 5340
cagaagatct tcgtgtgcgg caacagcatc agcgtggagg tgctggaggc catcatcgac 5400
aagatcggcg gccccagcag cggcggcaag cggcccgccg ccaccaagaa ggccggccag 5460
gccaagaaga agaagggcag ctacccctac gacgtgcccg actacgcctg agcggccgct 5520
taattaagct gccttctgcg gggcttgcct tctggccatg cccttcttct ctcccttgca 5580
cctgtacctc ttggtctttg aataaagcct gagtaggaag tctagaaaaa aaaaaaaaaa 5640
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 5700
aaaaa 5705
<![CDATA[<210> 120]]>
<![CDATA[<211> 18]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成肽]]>
<![CDATA[<400> 120]]>
Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu Asn Pro
1 5 10 15
Gly Pro
<![CDATA[<210> 121]]>
<![CDATA[<211> 1092]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 121]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asp Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn
115 120 125
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp
645 650 655
Val Pro Asp Tyr Ala Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
660 665 670
Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn
675 680 685
Pro Gly Pro Thr Ser Ala Gly Lys Leu Gly Ser Gly Glu Gly Arg Gly
690 695 700
Ser Leu Leu Thr Cys Gly Asp Val Glu Glu Asn Pro Gly Pro Leu Glu
705 710 715 720
Gly Ser Ser Gly Ser Gly Ser Pro Lys Lys Lys Arg Lys Val Gly Ile
725 730 735
His Gly Val Pro Ala Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu
740 745 750
Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp
755 760 765
Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys
770 775 780
Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg
785 790 795 800
His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys
805 810 815
Gly Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His Gln Arg Thr
820 825 830
His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe
835 840 845
Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu
850 855 860
Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala
865 870 875 880
His Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys
885 890 895
Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr Glu
900 905 910
His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys
915 920 925
Gly Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr
930 935 940
His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser
945 950 955 960
Gly Gly Gly Ser Gly Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg
965 970 975
Thr Leu Val Thr Phe Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu
980 985 990
Trp Lys Leu Leu Asp Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met
995 1000 1005
Leu Glu Asn Tyr Lys Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr
1010 1015 1020
Lys Pro Asp Val Ile Leu Arg Leu Glu Lys Gly Glu Glu Pro Trp
1025 1030 1035
Leu Val Glu Arg Glu Ile His Gln Glu Thr His Pro Asp Ser Glu
1040 1045 1050
Thr Ala Phe Glu Ile Lys Ser Ser Val Ser Gly Gly Lys Arg Pro
1055 1060 1065
Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser
1070 1075 1080
Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1085 1090
<![CDATA[<210> 122]]>
<![CDATA[<211> 1092]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 122]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Pro Gly His Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln
340 345 350
Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ala Thr Asn Phe Ser
370 375 380
Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn Pro Gly Pro Thr Ser
385 390 395 400
Ala Gly Lys Leu Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys
405 410 415
Gly Asp Val Glu Glu Asn Pro Gly Pro Leu Glu Gly Ser Ser Gly Ser
420 425 430
Gly Ser Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
435 440 445
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
450 455 460
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asp Leu Val Arg His
465 470 475 480
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
485 490 495
Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His
500 505 510
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
515 520 525
Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly Glu Lys
530 535 540
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn
545 550 555 560
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
565 570 575
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
580 585 590
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
595 600 605
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
610 615 620
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
625 630 635 640
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
645 650 655
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
660 665 670
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
675 680 685
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
690 695 700
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
705 710 715 720
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
725 730 735
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
740 745 750
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
755 760 765
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
770 775 780
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
785 790 795 800
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
805 810 815
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
820 825 830
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
835 840 845
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
850 855 860
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
865 870 875 880
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
885 890 895
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
900 905 910
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
915 920 925
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
930 935 940
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
945 950 955 960
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
965 970 975
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
980 985 990
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
995 1000 1005
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln
1010 1015 1020
Asp Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln
1025 1030 1035
Lys Ile Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu
1040 1045 1050
Ala Ile Ile Asp Lys Ile Gly Gly Pro Ser Gly Gly Lys Arg Pro
1055 1060 1065
Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser
1070 1075 1080
Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1085 1090
<![CDATA[<210> 123]]>
<![CDATA[<211> 355]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 123]]>
cagagaagga ggaagttaat tcacattctt aattttttct aagggcaaaa aaaaaaaaaa 60
aatgcaccag ctcattttcc atctctgctt gggtcatcag tgtgcattgt gagcctgtac 120
aaaggcctta gacggggaat gctgccgaga gcatcacctt ttatgtcttc ttttatatga 180
aatgtgccac ttccccacta accctggctc tgggctctgc ctctgctctc ctgatggtgt 240
gtttatggtg gattcagcat tctgggccac acaaggaagc tgcagggggt gtccaagttc 300
acatgtcccc gcattccagg cgaatgtttc tgacattgag caatgatatg gctct 355
<![CDATA[<210> 124]]>
<![CDATA[<211> 46]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 124]]>
Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn
1 5 10 15
Pro Gly Pro Thr Ser Ala Gly Lys Leu Gly Ser Gly Glu Gly Arg Gly
20 25 30
Ser Leu Leu Thr Cys Gly Asp Val Glu Glu Asn Pro Gly Pro
35 40 45
<![CDATA[<210> 125]]>
<![CDATA[<211> 19]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成肽]]>
<![CDATA[<400> 125]]>
Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn
1 5 10 15
Pro Gly Pro
<![CDATA[<210> 126]]>
<![CDATA[<211> 18]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成肽]]>
<![CDATA[<400> 126]]>
Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn
1 5 10 15
Pro Gly
<![CDATA[<210> 127]]>
<![CDATA[<211> 27]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成肽]]>
<![CDATA[<400> 127]]>
Pro Thr Ser Ala Gly Lys Leu Gly Ser Gly Glu Gly Arg Gly Ser Leu
1 5 10 15
Leu Thr Cys Gly Asp Val Glu Glu Asn Pro Gly
20 25
<![CDATA[<210> 128]]>
<![CDATA[<211> 1]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成肽]]>
<![CDATA[<400> 128]]>
Pro
1
<![CDATA[<210> 129]]>
<![CDATA[<211> 652]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 129]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asp Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn
115 120 125
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser
645 650
<![CDATA[<210> 130]]>
<![CDATA[<211> 2234]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 130]]>
gggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgagaa accttacaaa tgccccgagt gcggcaagag cttcagcaga agcgacgatc 180
tggtgaggca ccaaagaacc cacaccggcg aaaaacctta caagtgtccc gaatgcggaa 240
agtccttcag cagagaggac aatctgcaca cccaccagag aacacacacc ggagaaaagc 300
cttacaagtg ccccgaatgc ggcaaatcct tttctagaag cgatcatctg accacccacc 360
aaagaacaca taccggcgag aagccttaca aatgtcccga gtgcggaaag tccttctccc 420
agagagccaa tctgagggct catcaaagga cccataccgg cgaaaagccc tacaaatgcc 480
ccgagtgcgg aaaatccttc agccagctgg cccatctgag agcccaccaa aggacacaca 540
ccggagagaa accctataag tgccccgagt gtggaaaaag cttttcccag agggccaatc 600
tgagggccca tcagaggacc cataccggag agaagcctta taaatgtccc gagtgcggaa 660
aaagcttcag cgagaggagc catctgaggg aacatcaaag aacccacacc ggcgaaaaac 720
ccaccggaaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gcccgggaca 780
gcaaggtgga gaacaagacc aagaagctgc gggtgttcga ggccttcgcc ggcatcggcg 840
cccagcggaa ggccctggag aaggtgcgga aggacgagta cgagatcgtg ggcctggccg 900
agtggtacgt gcccgccatc gtgatgtacc aggccatcca caacaacttc cacaccaagc 960
tggagtacaa gagcgtgagc cgggaggaga tgatcgacta cctggagaac aagaccctga 1020
gctggaacag caagaacccc gtgagcaacg gctactggaa gcggaagaag gacgacgagc 1080
tgaagatcat ctacaacgcc atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctgta caagcggacc ctgaagaaca tcgacctgct gacctacagc ttcccctgcc 1200
aggacctgag ccagcagggc atccagaagg gcatgaagcg gggcagcggc acccggagcg 1260
gcctgctgtg ggagatcgag cgggccctgg acagcaccga gaagaacgac ctgcccaagt 1320
acctgctgat ggagaacgtg ggcgccctgc tgcacaagaa gaacgaggag gagctgaacc 1380
agtggaagca gaagctggag agcctgggct accagaacag catcgaggtg ctgaacgccg 1440
ccgacttcgg cagcagccag gcccggcggc gggtgttcat gatcagcacc ctgaacgagt 1500
tcgtggagct gcccaagggc gacaagaagc ccaagagcat caagaaggtg ctgaacaaga 1560
tcgtgagcga gaaggacatc ctgaacaacc tgctgaagta caacctgacc gagttcaaga 1620
aaaccaagag caacatcaac aaggccagcc tgatcggcta cagcaagttc aacagcgagg 1680
gctacgtgta cgaccccgag ttcaccggcc ccaccctgac cgccagcggc gccaacagcc 1740
ggatcaagat caaggacggc agcaacatcc ggaagatgaa cagcgacgag accttcctgt 1800
acatcggctt cgacagccag gacggcaagc gggtgaacga gatcgagttc ctgaccgaga 1860
accagaagat cttcgtgtgc ggcaacagca tcagcgtgga ggtgctggag gccatcatcg 1920
acaagatcgg cggccccagc agcggcggca agcggcccgc cgccaccaag aaggccggcc 1980
aggccaagaa gaagaagggc agctgagcgg ccgcttaatt aagctgcctt ctgcggggct 2040
tgccttctgg ccatgccctt cttctctccc ttgcacctgt acctcttggt ctttgaataa 2100
agcctgagta ggaagtctag aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2160
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2220
aaaaaaaaaa aaaa 2234
<![CDATA[<210> 131]]>
<![CDATA[<211> 627]]>
<![CDATA[<212> RNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 131]]>
cuggagcccg gcgagaaacc uuacaaaugc cccgagugcg gcaagagcuu cagcagaagc 60
gacgaucugg ugaggcacca aagaacccac accggcgaaa aaccuuacaa gugucccgaa 120
ugcggaaagu ccuucagcag agaggacaau cugcacaccc accagagaac acacaccgga 180
gaaaagccuu acaagugccc cgaaugcggc aaauccuuuu cuagaagcga ucaucugacc 240
acccaccaaa gaacacauac cggcgagaag ccuuacaaau gucccgagug cggaaagucc 300
uucucccaga gagccaaucu gagggcucau caaaggaccc auaccggcga aaagcccuac 360
aaaugccccg agugcggaaa auccuucagc cagcuggccc aucugagagc ccaccaaagg 420
acacacaccg gagagaaacc cuauaagugc cccgagugug gaaaaagcuu uucccagagg 480
gccaaucuga gggcccauca gaggacccau accggagaga agccuuauaa augucccgag 540
ugcggaaaaa gcuucagcga gaggagccau cugagggaac aucaaagaac ccacaccggc 600
gaaaaaccca ccggaaaaaa gaccagc 627
<![CDATA[<210> ]]>132
<![CDATA[<211> 1155]]>
<![CDATA[<212> RNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 132]]>
agcaaggugg agaacaagac caagaagcug cggguguucg aggccuucgc cggcaucggc 60
gcccagcgga aggcccugga gaaggugcgg aaggacgagu acgagaucgu gggccuggcc 120
gagugguacg ugcccgccau cgugauguac caggccaucc acaacaacuu ccacaccaag 180
cuggaguaca agagcgugag ccgggaggag augaucgacu accuggagaa caagacccug 240
agcuggaaca gcaagaaccc cgugagcaac ggcuacugga agcggaagaa ggacgacgag 300
cugaagauca ucuacaacgc caucaagcug agcgagaagg agggcaacau cuucgacauc 360
cgggaccugu acaagcggac ccugaagaac aucgaccugc ugaccuacag cuuccccugc 420
caggaccuga gccagcaggg cauccagaag ggcaugaagc ggggcagcgg cacccggagc 480
ggccugcugu gggagaucga gcgggcccug gacagcaccg agaagaacga ccugcccaag 540
uaccugcuga uggagaacgu gggcgcccug cugcacaaga agaacgagga ggagcugaac 600
caguggaagc agaagcugga gagccugggc uaccagaaca gcaucgaggu gcugaacgcc 660
gccgacuucg gcagcagcca ggcccggcgg cggguguuca ugaucagcac ccugaacgag 720
uucguggagc ugcccaaggg cgacaagaag cccaagagca ucaagaaggu gcugaacaag 780
aucgugagcg agaaggacau ccugaacaac cugcugaagu acaaccugac cgaguucaag 840
aaaaccaaga gcaacaucaa caaggccagc cugaucggcu acagcaaguu caacagcgag 900
ggcuacgugu acgaccccga guucaccggc cccacccuga ccgccagcgg cgccaacagc 960
cggaucaaga ucaaggacgg cagcaacauc cggaagauga acagcgacga gaccuuccug 1020
uacaucggcu ucgacagcca ggacggcaag cgggugaacg agaucgaguu ccugaccgag 1080
aaccagaaga ucuucgugug cggcaacagc aucagcgugg aggugcugga ggccaucauc 1140
gacaagaucg gcggc 1155
<![CDATA[<210> 133]]>
<![CDATA[<211> 690]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 133]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asp Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn
115 120 125
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp
645 650 655
Val Pro Asp Tyr Ala Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
660 665 670
Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn
675 680 685
Pro Gly
690
<![CDATA[<210> 134]]>
<![CDATA[<211> 366]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 134]]>
Pro Leu Glu Gly Ser Ser Gly Ser Gly Ser Pro Lys Lys Lys Arg Lys
1 5 10 15
Val Gly Ile His Gly Val Pro Ala Ala Gly Ser Ser Gly Ser Leu Glu
20 25 30
Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
35 40 45
Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
50 55 60
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala His
65 70 75 80
Leu Glu Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
85 90 95
Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His
100 105 110
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
115 120 125
Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His
130 135 140
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
145 150 155 160
Gln Leu Ala His Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys
165 170 175
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ala Asp Asn
180 185 190
Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
195 200 205
Pro Glu Cys Gly Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala Arg His
210 215 220
Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr Ser Ala
225 230 235 240
Ser Gly Ser Gly Gly Gly Ser Gly Gly Asp Ala Lys Ser Leu Thr Ala
245 250 255
Trp Ser Arg Thr Leu Val Thr Phe Lys Asp Val Phe Val Asp Phe Thr
260 265 270
Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala Gln Gln Ile Leu Tyr Arg
275 280 285
Asn Val Met Leu Glu Asn Tyr Lys Asn Leu Val Ser Leu Gly Tyr Gln
290 295 300
Leu Thr Lys Pro Asp Val Ile Leu Arg Leu Glu Lys Gly Glu Glu Pro
305 310 315 320
Trp Leu Val Glu Arg Glu Ile His Gln Glu Thr His Pro Asp Ser Glu
325 330 335
Thr Ala Phe Glu Ile Lys Ser Ser Val Ser Gly Gly Lys Arg Pro Ala
340 345 350
Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser
355 360 365
<![CDATA[<210> 135]]>
<![CDATA[<211> 6]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成肽]]>
<![CDATA[<400> 135]]>
Pro Lys Lys Lys Arg Lys
1 5
<![CDATA[<210> 136]]>
<![CDATA[<211> 15]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成肽]]>
<![CDATA[<400> 136]]>
Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys
1 5 10 15
<![CDATA[<210> 137]]>
<![CDATA[<211> 13]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成肽]]>
<![CDATA[<400> 137]]>
Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp
1 5 10
<![CDATA[<210> 138]]>
<![CDATA[<211> 10]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成肽]]>
<![CDATA[<400> 138]]>
Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly
1 5 10
<![CDATA[<210> 139]]>
<![CDATA[<211> 358]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 139]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Thr Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Thr Lys Asn Ser Leu Thr Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Gln Ser Gly Asp Leu Arg Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Ala
115 120 125
Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Asp Ser Gly Asn Leu Arg Val His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Thr Ser Gly Glu Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[<210> 140]]>
<![CDATA[<211> 358]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 140]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Ser Pro Ala Asp Leu Thr Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp Asn Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Ser
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser His Ser Leu Thr Glu His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Asn Asp Ala Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Asn Asp Ala Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[<210> 141]]>
<![CDATA[<211> 358]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 141]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Pro Gly His Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[<210> 142]]>
<![CDATA[<211> 358]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 142]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser His Thr Gly His Leu Leu Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Lys Lys Asp Leu Thr Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp
115 120 125
Leu Arg Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ser Asp Lys Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Thr Asp Thr Leu Arg Asp His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[<210> 143]]>
<![CDATA[<211> 358]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 143]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser His Thr Gly His Leu Leu Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Thr Ser Gly Asn Leu Thr Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Thr Ser Gly Asn Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His
115 120 125
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Thr His Leu Asp Leu Ile Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[<210> 144]]>
<![CDATA[<211> 358]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 144]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro Gly Ala Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Asn His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His
115 120 125
Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr Asn His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[<210> 145]]>
<![CDATA[<211> 652]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 145]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Ala Leu Thr Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Pro Gly His Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly His
115 120 125
Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Thr Lys Asn Ser Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser
645 650
<![CDATA[<210> 146]]>
<![CDATA[<211> 652]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 146]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Thr Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Arg Asp Glu Leu Asn Val His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Asn His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Ser Pro Ala Asp
115 120 125
Leu Thr Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr Asn His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Ser Lys Lys Ala Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Thr His Leu Asp Leu Ile Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser
645 650
<![CDATA[<210> 147]]>
<![CDATA[<211> 652]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 147]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Ser Gly His Leu Thr Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro Gly His
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Thr Leu Thr Glu His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Thr His Leu Asp Leu Ile Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser
645 650
<![CDATA[<210> 148]]>
<![CDATA[<211> 652]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 148]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser His Thr Gly His Leu Leu Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Ser Lys Lys Ala Leu Thr Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser His Thr Gly His
115 120 125
Leu Leu Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Ala Leu Thr Glu His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Ser Gly Asn Leu Arg Val His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser
645 650
<![CDATA[<210> 149]]>
<![CDATA[<211> 652]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 149]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Ser Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Ser Asp His Leu Thr Asn His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Gln Leu Ala His Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Asn
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asn Leu Val Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ser Asp Glu Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Gln Leu Ala His Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser
645 650
<![CDATA[<210> 150]]>
<![CDATA[<211> 1519]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 150]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val
20 25 30
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
35 40 45
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
50 55 60
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
65 70 75 80
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
85 90 95
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
100 105 110
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
115 120 125
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
130 135 140
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
145 150 155 160
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
165 170 175
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
180 185 190
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
195 200 205
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
210 215 220
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
225 230 235 240
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
245 250 255
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
260 265 270
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
275 280 285
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
290 295 300
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
305 310 315 320
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
325 330 335
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
340 345 350
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
355 360 365
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
370 375 380
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
385 390 395 400
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
405 410 415
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
420 425 430
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
435 440 445
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
450 455 460
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
465 470 475 480
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
485 490 495
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
500 505 510
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
515 520 525
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
530 535 540
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
545 550 555 560
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
565 570 575
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
580 585 590
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
595 600 605
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
610 615 620
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
625 630 635 640
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
645 650 655
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
660 665 670
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
675 680 685
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
690 695 700
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
705 710 715 720
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
725 730 735
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
740 745 750
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
755 760 765
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
770 775 780
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
785 790 795 800
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
805 810 815
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
820 825 830
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
835 840 845
Leu Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys
850 855 860
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Ala Arg
865 870 875 880
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
885 890 895
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
900 905 910
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
915 920 925
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
930 935 940
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
945 950 955 960
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
965 970 975
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
980 985 990
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
995 1000 1005
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu
1010 1015 1020
Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile
1025 1030 1035
Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe
1040 1045 1050
Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu
1055 1060 1065
Ala Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly
1070 1075 1080
Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr
1085 1090 1095
Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys
1100 1105 1110
Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro
1115 1120 1125
Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp
1130 1135 1140
Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser
1145 1150 1155
Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu
1160 1165 1170
Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser
1175 1180 1185
Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr
1190 1195 1200
Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser
1205 1210 1215
Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala
1220 1225 1230
Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr
1235 1240 1245
Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly
1250 1255 1260
Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His
1265 1270 1275
Lys His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser
1280 1285 1290
Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser
1295 1300 1305
Ala Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu
1310 1315 1320
Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala
1325 1330 1335
Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr
1340 1345 1350
Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile
1355 1360 1365
Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly
1370 1375 1380
Asp Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys
1385 1390 1395
Lys Lys Ala Ser Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr
1400 1405 1410
Leu Val Thr Phe Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu
1415 1420 1425
Trp Lys Leu Leu Asp Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val
1430 1435 1440
Met Leu Glu Asn Tyr Lys Asn Leu Val Ser Leu Gly Tyr Gln Leu
1445 1450 1455
Thr Lys Pro Asp Val Ile Leu Arg Leu Glu Lys Gly Glu Glu Pro
1460 1465 1470
Trp Leu Val Glu Arg Glu Ile His Gln Glu Thr His Pro Asp Ser
1475 1480 1485
Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Ser Gly Gly Lys Arg
1490 1495 1500
Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly
1505 1510 1515
Ser
<![CDATA[<210> 151]]>
<![CDATA[<211> 1811]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 151]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val
20 25 30
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
35 40 45
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
50 55 60
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
65 70 75 80
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
85 90 95
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
100 105 110
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
115 120 125
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
130 135 140
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
145 150 155 160
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
165 170 175
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
180 185 190
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
195 200 205
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
210 215 220
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
225 230 235 240
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
245 250 255
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
260 265 270
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
275 280 285
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
290 295 300
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
305 310 315 320
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
325 330 335
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
340 345 350
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
355 360 365
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
370 375 380
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
385 390 395 400
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
405 410 415
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
420 425 430
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
435 440 445
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
450 455 460
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
465 470 475 480
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
485 490 495
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
500 505 510
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
515 520 525
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
530 535 540
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
545 550 555 560
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
565 570 575
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
580 585 590
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
595 600 605
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
610 615 620
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
625 630 635 640
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
645 650 655
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
660 665 670
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
675 680 685
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
690 695 700
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
705 710 715 720
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
725 730 735
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
740 745 750
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
755 760 765
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
770 775 780
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
785 790 795 800
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
805 810 815
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
820 825 830
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
835 840 845
Leu Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys
850 855 860
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Ala Arg
865 870 875 880
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
885 890 895
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
900 905 910
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
915 920 925
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
930 935 940
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
945 950 955 960
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
965 970 975
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
980 985 990
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
995 1000 1005
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu
1010 1015 1020
Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile
1025 1030 1035
Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe
1040 1045 1050
Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu
1055 1060 1065
Ala Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly
1070 1075 1080
Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr
1085 1090 1095
Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys
1100 1105 1110
Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro
1115 1120 1125
Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp
1130 1135 1140
Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser
1145 1150 1155
Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu
1160 1165 1170
Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser
1175 1180 1185
Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr
1190 1195 1200
Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser
1205 1210 1215
Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala
1220 1225 1230
Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr
1235 1240 1245
Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly
1250 1255 1260
Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His
1265 1270 1275
Lys His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser
1280 1285 1290
Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser
1295 1300 1305
Ala Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu
1310 1315 1320
Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala
1325 1330 1335
Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr
1340 1345 1350
Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile
1355 1360 1365
Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly
1370 1375 1380
Asp Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys
1385 1390 1395
Lys Lys Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu
1400 1405 1410
Arg Val Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala
1415 1420 1425
Leu Glu Lys Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala
1430 1435 1440
Glu Trp Tyr Val Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn
1445 1450 1455
Asn Phe His Thr Lys Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu
1460 1465 1470
Met Ile Asp Tyr Leu Glu Asn Lys Thr Leu Ser Trp Asn Ser Lys
1475 1480 1485
Asn Pro Val Ser Asn Gly Tyr Trp Lys Arg Lys Lys Asp Asp Glu
1490 1495 1500
Leu Lys Ile Ile Tyr Asn Ala Ile Lys Leu Ser Glu Lys Glu Gly
1505 1510 1515
Asn Ile Phe Asp Ile Arg Asp Leu Tyr Lys Arg Thr Leu Lys Asn
1520 1525 1530
Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys Gln Asp Leu Ser Gln
1535 1540 1545
Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser Gly Thr Arg Ser
1550 1555 1560
Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser Thr Glu Lys
1565 1570 1575
Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly Ala Leu
1580 1585 1590
Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln Lys
1595 1600 1605
Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
1610 1615 1620
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile
1625 1630 1635
Ser Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys
1640 1645 1650
Pro Lys Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys
1655 1660 1665
Asp Ile Leu Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys
1670 1675 1680
Lys Thr Lys Ser Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser
1685 1690 1695
Lys Phe Asn Ser Glu Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly
1700 1705 1710
Pro Thr Leu Thr Ala Ser Gly Ala Asn Ser Arg Ile Lys Ile Lys
1715 1720 1725
Asp Gly Ser Asn Ile Arg Lys Met Asn Ser Asp Glu Thr Phe Leu
1730 1735 1740
Tyr Ile Gly Phe Asp Ser Gln Asp Gly Lys Arg Val Asn Glu Ile
1745 1750 1755
Glu Phe Leu Thr Glu Asn Gln Lys Ile Phe Val Cys Gly Asn Ser
1760 1765 1770
Ile Ser Val Glu Val Leu Glu Ala Ile Ile Asp Lys Ile Gly Gly
1775 1780 1785
Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly
1790 1795 1800
Gln Ala Lys Lys Lys Lys Gly Ser
1805 1810
<![CDATA[<210> 152]]>
<![CDATA[<211> 1913]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 152]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val
20 25 30
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
35 40 45
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
50 55 60
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
65 70 75 80
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
85 90 95
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
100 105 110
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
115 120 125
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
130 135 140
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
145 150 155 160
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
165 170 175
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
180 185 190
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
195 200 205
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
210 215 220
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
225 230 235 240
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
245 250 255
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
260 265 270
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
275 280 285
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
290 295 300
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
305 310 315 320
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
325 330 335
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
340 345 350
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
355 360 365
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
370 375 380
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
385 390 395 400
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
405 410 415
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
420 425 430
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
435 440 445
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
450 455 460
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
465 470 475 480
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
485 490 495
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
500 505 510
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
515 520 525
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
530 535 540
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
545 550 555 560
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
565 570 575
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
580 585 590
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
595 600 605
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
610 615 620
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
625 630 635 640
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
645 650 655
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
660 665 670
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
675 680 685
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
690 695 700
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
705 710 715 720
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
725 730 735
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
740 745 750
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
755 760 765
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
770 775 780
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
785 790 795 800
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
805 810 815
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
820 825 830
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
835 840 845
Leu Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys
850 855 860
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Ala Arg
865 870 875 880
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
885 890 895
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
900 905 910
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
915 920 925
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
930 935 940
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
945 950 955 960
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
965 970 975
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
980 985 990
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
995 1000 1005
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu
1010 1015 1020
Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile
1025 1030 1035
Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe
1040 1045 1050
Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu
1055 1060 1065
Ala Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly
1070 1075 1080
Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr
1085 1090 1095
Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys
1100 1105 1110
Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro
1115 1120 1125
Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp
1130 1135 1140
Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser
1145 1150 1155
Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu
1160 1165 1170
Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser
1175 1180 1185
Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr
1190 1195 1200
Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser
1205 1210 1215
Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala
1220 1225 1230
Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr
1235 1240 1245
Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly
1250 1255 1260
Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His
1265 1270 1275
Lys His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser
1280 1285 1290
Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser
1295 1300 1305
Ala Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu
1310 1315 1320
Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala
1325 1330 1335
Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr
1340 1345 1350
Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile
1355 1360 1365
Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly
1370 1375 1380
Asp Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln
1385 1390 1395
Ala Lys Lys Lys Lys Ser Gly Gly Gly Gly Ser Val Asp Leu Arg
1400 1405 1410
Thr Leu Asp Val Phe Ser Gly Cys Gly Gly Leu Ser Glu Gly Phe
1415 1420 1425
His Gln Ala Gly Ile Ser Asp Thr Leu Trp Ala Ile Glu Met Trp
1430 1435 1440
Asp Pro Ala Ala Gln Ala Phe Arg Leu Asn Asn Pro Gly Ser Thr
1445 1450 1455
Val Phe Thr Glu Asp Cys Asn Ile Leu Leu Lys Leu Val Met Ala
1460 1465 1470
Gly Glu Thr Thr Asn Ser Arg Gly Gln Arg Leu Pro Gln Lys Gly
1475 1480 1485
Asp Val Glu Met Leu Cys Gly Gly Pro Pro Cys Gln Gly Phe Ser
1490 1495 1500
Gly Met Asn Arg Phe Asn Ser Arg Thr Tyr Ser Lys Phe Lys Asn
1505 1510 1515
Ser Leu Val Val Ser Phe Leu Ser Tyr Cys Asp Tyr Tyr Arg Pro
1520 1525 1530
Arg Phe Phe Leu Leu Glu Asn Val Arg Asn Phe Val Ser Phe Lys
1535 1540 1545
Arg Ser Met Val Leu Lys Leu Thr Leu Arg Cys Leu Val Arg Met
1550 1555 1560
Gly Tyr Gln Cys Thr Phe Gly Val Leu Gln Ala Gly Gln Tyr Gly
1565 1570 1575
Val Ala Gln Thr Arg Arg Arg Ala Ile Ile Leu Ala Ala Ala Pro
1580 1585 1590
Gly Glu Lys Leu Pro Leu Phe Pro Glu Pro Leu His Val Phe Ala
1595 1600 1605
Pro Arg Ala Cys Gln Leu Ser Val Val Val Asp Asp Lys Lys Phe
1610 1615 1620
Val Ser Asn Ile Thr Arg Leu Ser Ser Gly Pro Phe Arg Thr Ile
1625 1630 1635
Thr Val Arg Asp Thr Met Ser Asp Leu Pro Glu Val Arg Asn Gly
1640 1645 1650
Ala Ser Ala Leu Glu Ile Ser Tyr Asn Gly Glu Pro Gln Ser Trp
1655 1660 1665
Phe Gln Arg Gln Leu Arg Gly Ala Gln Tyr Gln Pro Ile Leu Arg
1670 1675 1680
Asp His Ile Cys Lys Asp Met Ser Ala Leu Val Ala Ala Arg Met
1685 1690 1695
Arg His Ile Pro Leu Ala Pro Gly Ser Asp Trp Arg Asp Leu Pro
1700 1705 1710
Asn Ile Glu Val Arg Leu Ser Asp Gly Thr Met Ala Arg Lys Leu
1715 1720 1725
Arg Tyr Thr His His Asp Arg Lys Asn Gly Arg Ser Ser Ser Gly
1730 1735 1740
Ala Leu Arg Gly Val Cys Ser Cys Val Glu Ala Gly Lys Ala Cys
1745 1750 1755
Asp Pro Ala Ala Arg Gln Phe Asn Thr Leu Ile Pro Trp Cys Leu
1760 1765 1770
Pro His Thr Gly Asn Arg His Asn His Trp Ala Gly Leu Tyr Gly
1775 1780 1785
Arg Leu Glu Trp Asp Gly Phe Phe Ser Thr Thr Val Thr Asn Pro
1790 1795 1800
Glu Pro Met Gly Lys Gln Gly Arg Val Leu His Pro Glu Gln His
1805 1810 1815
Arg Val Val Ser Val Arg Glu Cys Ala Arg Ser Gln Gly Phe Pro
1820 1825 1830
Asp Thr Tyr Arg Leu Phe Gly Asn Ile Leu Asp Lys His Arg Gln
1835 1840 1845
Val Gly Asn Ala Val Pro Pro Pro Leu Ala Lys Ala Ile Gly Leu
1850 1855 1860
Glu Ile Lys Leu Cys Met Leu Ala Lys Ala Arg Glu Ser Ala Ser
1865 1870 1875
Ala Lys Ile Lys Glu Glu Glu Ala Ala Lys Asp Gly Gly Gly Gly
1880 1885 1890
Ser Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys
1895 1900 1905
Lys Lys Lys Gly Ser
1910
<![CDATA[<210> 153]]>
<![CDATA[<211> 1977]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 153]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val
20 25 30
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
35 40 45
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
50 55 60
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
65 70 75 80
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
85 90 95
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
100 105 110
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
115 120 125
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
130 135 140
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
145 150 155 160
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
165 170 175
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
180 185 190
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
195 200 205
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
210 215 220
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
225 230 235 240
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
245 250 255
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
260 265 270
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
275 280 285
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
290 295 300
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
305 310 315 320
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
325 330 335
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
340 345 350
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
355 360 365
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
370 375 380
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
385 390 395 400
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
405 410 415
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
420 425 430
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
435 440 445
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
450 455 460
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
465 470 475 480
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
485 490 495
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
500 505 510
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
515 520 525
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
530 535 540
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
545 550 555 560
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
565 570 575
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
580 585 590
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
595 600 605
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
610 615 620
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
625 630 635 640
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
645 650 655
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
660 665 670
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
675 680 685
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
690 695 700
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
705 710 715 720
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
725 730 735
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
740 745 750
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
755 760 765
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
770 775 780
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
785 790 795 800
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
805 810 815
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
820 825 830
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
835 840 845
Leu Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys
850 855 860
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Ala Arg
865 870 875 880
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
885 890 895
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
900 905 910
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
915 920 925
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
930 935 940
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
945 950 955 960
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
965 970 975
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
980 985 990
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
995 1000 1005
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu
1010 1015 1020
Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile
1025 1030 1035
Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe
1040 1045 1050
Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu
1055 1060 1065
Ala Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly
1070 1075 1080
Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr
1085 1090 1095
Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys
1100 1105 1110
Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro
1115 1120 1125
Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp
1130 1135 1140
Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser
1145 1150 1155
Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu
1160 1165 1170
Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser
1175 1180 1185
Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr
1190 1195 1200
Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser
1205 1210 1215
Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala
1220 1225 1230
Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr
1235 1240 1245
Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly
1250 1255 1260
Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His
1265 1270 1275
Lys His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser
1280 1285 1290
Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser
1295 1300 1305
Ala Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu
1310 1315 1320
Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala
1325 1330 1335
Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr
1340 1345 1350
Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile
1355 1360 1365
Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly
1370 1375 1380
Asp Ser Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly
1385 1390 1395
Gly Gly Ser Gly Pro Lys Lys Lys Arg Lys Val Ala Ala Ala Gly
1400 1405 1410
Ser Asn His Asp Gln Glu Phe Asp Pro Pro Lys Val Tyr Pro Pro
1415 1420 1425
Val Pro Ala Glu Lys Arg Lys Pro Ile Arg Val Leu Ser Leu Phe
1430 1435 1440
Asp Gly Ile Ala Thr Gly Leu Leu Val Leu Lys Asp Leu Gly Ile
1445 1450 1455
Gln Val Asp Arg Tyr Ile Ala Ser Glu Val Cys Glu Asp Ser Ile
1460 1465 1470
Thr Val Gly Met Val Arg His Gln Gly Lys Ile Met Tyr Val Gly
1475 1480 1485
Asp Val Arg Ser Val Thr Gln Lys His Ile Gln Glu Trp Gly Pro
1490 1495 1500
Phe Asp Leu Val Ile Gly Gly Ser Pro Cys Asn Asp Leu Ser Ile
1505 1510 1515
Val Asn Pro Ala Arg Lys Gly Leu Tyr Glu Gly Thr Gly Arg Leu
1520 1525 1530
Phe Phe Glu Phe Tyr Arg Leu Leu His Asp Ala Arg Pro Lys Glu
1535 1540 1545
Gly Asp Asp Arg Pro Phe Phe Trp Leu Phe Glu Asn Val Val Ala
1550 1555 1560
Met Gly Val Ser Asp Lys Arg Asp Ile Ser Arg Phe Leu Glu Ser
1565 1570 1575
Asn Pro Val Met Ile Asp Ala Lys Glu Val Ser Ala Ala His Arg
1580 1585 1590
Ala Arg Tyr Phe Trp Gly Asn Leu Pro Gly Met Asn Arg Pro Leu
1595 1600 1605
Ala Ser Thr Val Asn Asp Lys Leu Glu Leu Gln Glu Cys Leu Glu
1610 1615 1620
His Gly Arg Ile Ala Lys Phe Ser Lys Val Arg Thr Ile Thr Thr
1625 1630 1635
Arg Ser Asn Ser Ile Lys Gln Gly Lys Asp Gln His Phe Pro Val
1640 1645 1650
Phe Met Asn Glu Lys Glu Asp Ile Leu Trp Cys Thr Glu Met Glu
1655 1660 1665
Arg Val Phe Gly Phe Pro Val His Tyr Thr Asp Val Ser Asn Met
1670 1675 1680
Ser Arg Leu Ala Arg Gln Arg Leu Leu Gly Arg Ser Trp Ser Val
1685 1690 1695
Pro Val Ile Arg His Leu Phe Ala Pro Leu Lys Glu Tyr Phe Ala
1700 1705 1710
Cys Val Ser Ser Gly Asn Ser Asn Ala Asn Ser Arg Gly Pro Ser
1715 1720 1725
Phe Ser Ser Gly Leu Val Pro Leu Ser Leu Arg Gly Ser His Met
1730 1735 1740
Asn Pro Leu Glu Met Phe Glu Thr Val Pro Val Trp Arg Arg Gln
1745 1750 1755
Pro Val Arg Val Leu Ser Leu Phe Glu Asp Ile Lys Lys Glu Leu
1760 1765 1770
Thr Ser Leu Gly Phe Leu Glu Ser Gly Ser Asp Pro Gly Gln Leu
1775 1780 1785
Lys His Val Val Asp Val Thr Asp Thr Val Arg Lys Asp Val Glu
1790 1795 1800
Glu Trp Gly Pro Phe Asp Leu Val Tyr Gly Ala Thr Pro Pro Leu
1805 1810 1815
Gly His Thr Cys Asp Arg Pro Pro Ser Trp Tyr Leu Phe Gln Phe
1820 1825 1830
His Arg Leu Leu Gln Tyr Ala Arg Pro Lys Pro Gly Ser Pro Arg
1835 1840 1845
Pro Phe Phe Trp Met Phe Val Asp Asn Leu Val Leu Asn Lys Glu
1850 1855 1860
Asp Leu Asp Val Ala Ser Arg Phe Leu Glu Met Glu Pro Val Thr
1865 1870 1875
Ile Pro Asp Val His Gly Gly Ser Leu Gln Asn Ala Val Arg Val
1880 1885 1890
Trp Ser Asn Ile Pro Ala Ile Arg Ser Arg His Trp Ala Leu Val
1895 1900 1905
Ser Glu Glu Glu Leu Ser Leu Leu Ala Gln Asn Lys Gln Ser Ser
1910 1915 1920
Lys Leu Ala Ala Lys Trp Pro Thr Lys Leu Val Lys Asn Cys Phe
1925 1930 1935
Leu Pro Leu Arg Glu Tyr Phe Lys Tyr Phe Ser Thr Glu Leu Thr
1940 1945 1950
Ser Ser Leu Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala
1955 1960 1965
Gly Gln Ala Lys Lys Lys Lys Gly Ser
1970 1975
<![CDATA[<210> 154]]>
<![CDATA[<211> 209]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 154]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Thr Ser Gly Glu Leu Val Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg
35 40 45
Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Thr Leu Thr
65 70 75 80
Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr Glu His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Thr Ser Gly Ser Leu Val Arg His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Ser Pro
145 150 155 160
Ala Asp Leu Thr Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Ser Gly Asn Leu Arg
180 185 190
Val His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[<210> 155]]>
<![CDATA[<211> 209]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 155]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg
35 40 45
Ala Asn Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr
65 70 75 80
Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Gln Ser Ser Ser Leu Val Arg His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Asp Lys Lys Asp Leu Thr Arg His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Ser Pro
145 150 155 160
Ala Asp Leu Thr Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly His Leu Thr
180 185 190
Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[<210> 156]]>
<![CDATA[<211> 209]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 156]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Gln Ser Gly Asp Leu Arg Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser
35 40 45
Gly Asp Leu Arg Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg
65 70 75 80
Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser His Arg Thr Thr Leu Thr Asn His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Ser
145 150 155 160
Gly Asn Leu Arg Val His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Ser Leu Val
180 185 190
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[<210> 157]]>
<![CDATA[<211> 627]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 157]]>
cttgagcccg gagaaaagcc atacaaatgt cctgaatgcg gaaagtcatt ttctacgagc 60
ggcgaactcg tgcggcacca aaggactcat accggcgaaa agccttacaa atgcccggag 120
tgcggaaaaa gcttctccga gcgctcgcac ttgcgggaac accagcgaac ccacacaggg 180
gagaaaccgt ataagtgccc agagtgcggc aaatcgttct cccggaacga caccctgacc 240
gaacaccaac gcactcatac tggcgaaaaa ccttacaagt gccctgagtg tggaaagagc 300
ttctcccgcg ccgataacct gaccgagcac cagcggaccc ataccgggga aaagccgtac 360
aagtgtccgg aatgcggcaa aagcttcagc acctcgggtt ccctggtccg gcatcagaga 420
actcacaccg gagagaaacc ctataagtgt cctgagtgcg ggaagtcctt ttcatcgccc 480
gcggacctga ctagacacca gaggacccac accggggaga agccctacaa gtgccccgaa 540
tgtggaaagt ccttctccga ctccggcaac ctccgggtgc accagcgcac ccacactgga 600
gagaagccga ccggaaagaa aacttcc 627
<![CDATA[<210> 158]]>
<![CDATA[<211> 627]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 158]]>
ctggaacccg gagaaaaacc ttataagtgc cctgaatgcg gaaagtcatt ctcgaggtcg 60
gacaagctcg tgcggcacca gaggacacac accggggaaa agccatacaa gtgtcccgaa 120
tgtggaaagt ccttcagcca acgcgccaac ctgagagctc atcagcggac tcacactggc 180
gaaaaaccgt acaaatgccc cgaatgcggc aaaagcttct cccgcgccga caacttgacc 240
gagcaccagc ggacccatac cggcgaaaag ccgtacaagt gcccggagtg tgggaagtcg 300
ttcagccagt cctcttccct cgtgcgccac caacgcaccc atactgggga gaagccctat 360
aagtgtcctg agtgtggcaa atcattcagc gataagaagg atcttacccg gcaccaacgg 420
actcataccg gagagaagcc ttacaagtgc cccgagtgcg gaaagagctt ctcgtccccg 480
gcggacctga ctagacacca gcgcacccac accggagaaa agccctacaa gtgcccagag 540
tgcgggaagt ccttttccca atccggtcac ctgactgagc accagagaac ccacacggga 600
gagaaaccga ccggaaagaa aacctcc 627
<![CDATA[<210> 159]]>
<![CDATA[<211> 627]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 159]]>
ttggaacccg gagaaaagcc atacaaatgc cccgaatgcg gaaagtcgtt cagccagtcc 60
ggcgacctca gacggcacca acggactcac accggcgaaa aaccgtacaa gtgcccagag 120
tgcggcaaaa gctttagcca gtcgggcgat ctgcggagac atcagcgcac tcacactggt 180
gaaaagccct acaagtgtcc tgagtgcggg aagtccttca gcgagcgctc ccatcttcgc 240
gagcaccaga gaacccacac tggagaaaaa ccttataagt gccctgagtg tggcaaatcc 300
ttctcaacca ccggcaacct gactgtgcac cagcggaccc acacagggga gaagccttac 360
aagtgcccgg agtgtgggaa gtcattctcc catcggacga ccctgaccaa ccaccagagg 420
acccatactg gcgaaaagcc gtataagtgt ccggagtgcg gaaagagctt ctccgactcc 480
ggaaacctca gggtgcacca acgcacccac accggagaga agccgtacaa atgtcccgaa 540
tgtggaaagt ccttctccca atcctcttcg ctggtccggc accagcgaac tcataccggg 600
gaaaagccca ccggaaagaa aacctcg 627
<![CDATA[<210> 160]]>
<![CDATA[<211> 659]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 160]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Asn Asp Thr Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ala Asp Asn
115 120 125
Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Ser Leu Val Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Ser Pro Ala Asp Leu Thr Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Ser Gly Asn Leu Arg Val His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys
625 630 635 640
Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro
645 650 655
Asp Tyr Ala
<![CDATA[<210> 161]]>
<![CDATA[<211> 659]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 161]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Ser
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Asp Lys Lys Asp Leu Thr Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Ser Pro Ala Asp Leu Thr Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Gln Ser Gly His Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys
625 630 635 640
Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro
645 650 655
Asp Tyr Ala
<![CDATA[<210> 162]]>
<![CDATA[<211> 659]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 162]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn
115 120 125
Leu Thr Val His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser His Arg Thr Thr Leu Thr Asn His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Asp Ser Gly Asn Leu Arg Val His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Gln Ser Ser Ser Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys
625 630 635 640
Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro
645 650 655
Asp Tyr Ala
<![CDATA[<210> 163]]>
<![CDATA[<211> 650]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 163]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Asn Asp Thr Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ala Asp Asn
115 120 125
Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Ser Leu Val Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Ser Pro Ala Asp Leu Thr Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Ser Gly Asn Leu Arg Val His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys
625 630 635 640
Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser
645 650
<![CDATA[<210> 164]]>
<![CDATA[<211> 650]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 164]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Ser
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Asp Lys Lys Asp Leu Thr Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Ser Pro Ala Asp Leu Thr Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Gln Ser Gly His Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys
625 630 635 640
Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser
645 650
<![CDATA[<210> 165]]>
<![CDATA[<211> 650]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 165]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn
115 120 125
Leu Thr Val His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser His Arg Thr Thr Leu Thr Asn His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Asp Ser Gly Asn Leu Arg Val His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Gln Ser Ser Ser Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys
625 630 635 640
Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser
645 650
<![CDATA[<210> 166]]>
<![CDATA[<211> 2255]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 166]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tcccttgagc 120
ccggagaaaa gccatacaaa tgtcctgaat gcggaaagtc attttctacg agcggcgaac 180
tcgtgcggca ccaaaggact cataccggcg aaaagcctta caaatgcccg gagtgcggaa 240
aaagcttctc cgagcgctcg cacttgcggg aacaccagcg aacccacaca ggggagaaac 300
cgtataagtg cccagagtgc ggcaaatcgt tctcccggaa cgacaccctg accgaacacc 360
aacgcactca tactggcgaa aaaccttaca agtgccctga gtgtggaaag agcttctccc 420
gcgccgataa cctgaccgag caccagcgga cccataccgg ggaaaagccg tacaagtgtc 480
cggaatgcgg caaaagcttc agcacctcgg gttccctggt ccggcatcag agaactcaca 540
ccggagagaa accctataag tgtcctgagt gcgggaagtc cttttcatcg cccgcggacc 600
tgactagaca ccagaggacc cacaccgggg agaagcccta caagtgcccc gaatgtggaa 660
agtccttctc cgactccggc aacctccggg tgcaccagcg cacccacact ggagagaagc 720
cgaccggaaa gaaaacttcc gcctccggtt cgggaggagg ctcaggagga gcgagagatt 780
ccaaggtcga gaacaagacc aagaagctgc gggtgttcga ggcctttgct ggcatcggag 840
cccagaggaa ggccctcgag aaggtccgca aggatgagta cgagatcgtg ggactcgcgg 900
agtggtacgt gcccgccatt gtgatgtacc aggccatcca taacaacttc cacactaagc 960
tggagtacaa gtccgtgtcc cgggaggaaa tgattgacta cctggagaat aagaccctgt 1020
catggaactc taagaacccc gtgtcgaacg gttactggaa gagaaagaag gatgacgaac 1080
tgaagattat ctacaacgcg atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctcta caagcgcacc ttgaagaaca tcgatctgct gacctactcc ttcccgtgcc 1200
aagacctgag ccagcagggc atccagaagg ggatgaaacg gggctccggt actcgcagcg 1260
gcttgctgtg ggaaattgag cgggccctgg atagcaccga gaagaacgac ttgccgaagt 1320
atcttctcat ggaaaacgtc ggggctctcc ttcacaagaa gaacgaggaa gaactgaacc 1380
agtggaagca aaagctggaa tccctcggat accagaactc cattgaggtc ctgaacgccg 1440
ccgacttcgg atcgtcgcaa gccagacgga gggtgttcat gattagcact ctgaacgaat 1500
tcgtggaact gccgaagggc gacaagaagc ctaagtccat caagaaggtg ctgaacaaga 1560
tcgtgtccga gaaggacatt ctcaacaatc tgctgaagta caacctgaca gagttcaaga 1620
aaaccaagtc caacatcaac aaggcctcct tgattggtta ctcaaagttc aacagcgagg 1680
gatacgtgta cgaccccgaa ttcactggac ccactctgac cgcctccgga gcaaactcta 1740
ggattaagat caaggacggc tccaacatcc ggaagatgaa ctccgacgaa acctttctgt 1800
acatcggctt cgactcgcaa gacggaaagc gcgtgaacga gatcgaattt cttaccgaaa 1860
accagaagat cttcgtgtgc ggcaattcaa tctccgtgga agtcctggaa gcgattatcg 1920
acaagatcgg aggcagtggt ggaaagcgcc cagcagccac taagaaggcc ggacaggcca 1980
agaagaagaa gggatcctac ccttacgatg tgccggatta cgcttgagcg gccgcttaat 2040
taagctgcct tctgcggggc ttgccttctg gccatgccct tcttctctcc cttgcacctg 2100
tacctcttgg tctttgaata aagcctgagt aggaagtcta gaaaaaaaaa aaaaaaaaaa 2160
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2220
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa 2255
<![CDATA[<210> 167]]>
<![CDATA[<211> 2255]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 167]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggaac 120
ccggagaaaa accttataag tgccctgaat gcggaaagtc attctcgagg tcggacaagc 180
tcgtgcggca ccagaggaca cacaccgggg aaaagccata caagtgtccc gaatgtggaa 240
agtccttcag ccaacgcgcc aacctgagag ctcatcagcg gactcacact ggcgaaaaac 300
cgtacaaatg ccccgaatgc ggcaaaagct tctcccgcgc cgacaacttg accgagcacc 360
agcggaccca taccggcgaa aagccgtaca agtgcccgga gtgtgggaag tcgttcagcc 420
agtcctcttc cctcgtgcgc caccaacgca cccatactgg ggagaagccc tataagtgtc 480
ctgagtgtgg caaatcattc agcgataaga aggatcttac ccggcaccaa cggactcata 540
ccggagagaa gccttacaag tgccccgagt gcggaaagag cttctcgtcc ccggcggacc 600
tgactagaca ccagcgcacc cacaccggag aaaagcccta caagtgccca gagtgcggga 660
agtccttttc ccaatccggt cacctgactg agcaccagag aacccacacg ggagagaaac 720
cgaccggaaa gaaaacctcc gcctccggtt cgggaggagg ctcaggagga gcgagagatt 780
ccaaggtcga gaacaagacc aagaagctgc gggtgttcga ggcctttgct ggcatcggag 840
cccagaggaa ggccctcgag aaggtccgca aggatgagta cgagatcgtg ggactcgcgg 900
agtggtacgt gcccgccatt gtgatgtacc aggccatcca taacaacttc cacactaagc 960
tggagtacaa gtccgtgtcc cgggaggaaa tgattgacta cctggagaat aagaccctgt 1020
catggaactc taagaacccc gtgtcgaacg gttactggaa gagaaagaag gatgacgaac 1080
tgaagattat ctacaacgcg atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctcta caagcgcacc ttgaagaaca tcgatctgct gacctactcc ttcccgtgcc 1200
aagacctgag ccagcagggc atccagaagg ggatgaaacg gggctccggt actcgcagcg 1260
gcttgctgtg ggaaattgag cgggccctgg atagcaccga gaagaacgac ttgccgaagt 1320
atcttctcat ggaaaacgtc ggggctctcc ttcacaagaa gaacgaggaa gaactgaacc 1380
agtggaagca aaagctggaa tccctcggat accagaactc cattgaggtc ctgaacgccg 1440
ccgacttcgg atcgtcgcaa gccagacgga gggtgttcat gattagcact ctgaacgaat 1500
tcgtggaact gccgaagggc gacaagaagc ctaagtccat caagaaggtg ctgaacaaga 1560
tcgtgtccga gaaggacatt ctcaacaatc tgctgaagta caacctgaca gagttcaaga 1620
aaaccaagtc caacatcaac aaggcctcct tgattggtta ctcaaagttc aacagcgagg 1680
gatacgtgta cgaccccgaa ttcactggac ccactctgac cgcctccgga gcaaactcta 1740
ggattaagat caaggacggc tccaacatcc ggaagatgaa ctccgacgaa acctttctgt 1800
acatcggctt cgactcgcaa gacggaaagc gcgtgaacga gatcgaattt cttaccgaaa 1860
accagaagat cttcgtgtgc ggcaattcaa tctccgtgga agtcctggaa gcgattatcg 1920
acaagatcgg aggcagtggt ggaaagcgcc cagcagccac taagaaggcc ggacaggcca 1980
agaagaagaa gggatcctac ccttacgatg tgccggatta cgcttgagcg gccgcttaat 2040
taagctgcct tctgcggggc ttgccttctg gccatgccct tcttctctcc cttgcacctg 2100
tacctcttgg tctttgaata aagcctgagt aggaagtcta gaaaaaaaaa aaaaaaaaaa 2160
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2220
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa 2255
<![CDATA[<210> 168]]>
<![CDATA[<211> 2255]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 168]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccttggaac 120
ccggagaaaa gccatacaaa tgccccgaat gcggaaagtc gttcagccag tccggcgacc 180
tcagacggca ccaacggact cacaccggcg aaaaaccgta caagtgccca gagtgcggca 240
aaagctttag ccagtcgggc gatctgcgga gacatcagcg cactcacact ggtgaaaagc 300
cctacaagtg tcctgagtgc gggaagtcct tcagcgagcg ctcccatctt cgcgagcacc 360
agagaaccca cactggagaa aaaccttata agtgccctga gtgtggcaaa tccttctcaa 420
ccaccggcaa cctgactgtg caccagcgga cccacacagg ggagaagcct tacaagtgcc 480
cggagtgtgg gaagtcattc tcccatcgga cgaccctgac caaccaccag aggacccata 540
ctggcgaaaa gccgtataag tgtccggagt gcggaaagag cttctccgac tccggaaacc 600
tcagggtgca ccaacgcacc cacaccggag agaagccgta caaatgtccc gaatgtggaa 660
agtccttctc ccaatcctct tcgctggtcc ggcaccagcg aactcatacc ggggaaaagc 720
ccaccggaaa gaaaacctcg gcctccggtt cgggaggagg ctcaggagga gcgagagatt 780
ccaaggtcga gaacaagacc aagaagctgc gggtgttcga ggcctttgct ggcatcggag 840
cccagaggaa ggccctcgag aaggtccgca aggatgagta cgagatcgtg ggactcgcgg 900
agtggtacgt gcccgccatt gtgatgtacc aggccatcca taacaacttc cacactaagc 960
tggagtacaa gtccgtgtcc cgggaggaaa tgattgacta cctggagaat aagaccctgt 1020
catggaactc taagaacccc gtgtcgaacg gttactggaa gagaaagaag gatgacgaac 1080
tgaagattat ctacaacgcg atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctcta caagcgcacc ttgaagaaca tcgatctgct gacctactcc ttcccgtgcc 1200
aagacctgag ccagcagggc atccagaagg ggatgaaacg gggctccggt actcgcagcg 1260
gcttgctgtg ggaaattgag cgggccctgg atagcaccga gaagaacgac ttgccgaagt 1320
atcttctcat ggaaaacgtc ggggctctcc ttcacaagaa gaacgaggaa gaactgaacc 1380
agtggaagca aaagctggaa tccctcggat accagaactc cattgaggtc ctgaacgccg 1440
ccgacttcgg atcgtcgcaa gccagacgga gggtgttcat gattagcact ctgaacgaat 1500
tcgtggaact gccgaagggc gacaagaagc ctaagtccat caagaaggtg ctgaacaaga 1560
tcgtgtccga gaaggacatt ctcaacaatc tgctgaagta caacctgaca gagttcaaga 1620
aaaccaagtc caacatcaac aaggcctcct tgattggtta ctcaaagttc aacagcgagg 1680
gatacgtgta cgaccccgaa ttcactggac ccactctgac cgcctccgga gcaaactcta 1740
ggattaagat caaggacggc tccaacatcc ggaagatgaa ctccgacgaa acctttctgt 1800
acatcggctt cgactcgcaa gacggaaagc gcgtgaacga gatcgaattt cttaccgaaa 1860
accagaagat cttcgtgtgc ggcaattcaa tctccgtgga agtcctggaa gcgattatcg 1920
acaagatcgg aggcagtggt ggaaagcgcc cagcagccac taagaaggcc ggacaggcca 1980
agaagaagaa gggatcctac ccttacgatg tgccggatta cgcttgagcg gccgcttaat 2040
taagctgcct tctgcggggc ttgccttctg gccatgccct tcttctctcc cttgcacctg 2100
tacctcttgg tctttgaata aagcctgagt aggaagtcta gaaaaaaaaa aaaaaaaaaa 2160
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2220
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa 2255
<![CDATA[<210> 169]]>
<![CDATA[<211> 204]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 169]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser
35 40 45
Asp Asn Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu Leu Val Arg His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Thr Thr Gly Asn Leu Thr Val His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser
145 150 155 160
Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Thr Asp Thr Leu Arg
180 185 190
Asp His Gln Arg Thr His Thr Gly Lys Lys Thr Ser
195 200
<![CDATA[<210> 170]]>
<![CDATA[<211> 204]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 170]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Gln Lys Ser Ser Leu Ile Ala His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser His Lys
35 40 45
Asn Ala Leu Gln Asn His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Asn Leu Val
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Arg Asn Asp Ala Leu Thr Glu His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Asp Lys Lys Asp Leu Thr Arg His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ala
145 150 155 160
Gly His Leu Ala Ser His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Lys Lys Asp Leu Thr
180 185 190
Arg His Gln Arg Thr His Thr Gly Lys Lys Thr Ser
195 200
<![CDATA[<210> 171]]>
<![CDATA[<211> 204]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400]]>> 171]]>
<br/><![CDATA[Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Arg Ser Asp Asn Leu Val Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Cys
35 40 45
Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu Leu Val
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Thr
145 150 155 160
Asp Thr Leu Arg Asp His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Glu Asp Asn Leu His
180 185 190
Thr His Gln Arg Thr His Thr Gly Lys Lys Thr Ser
195 200
<![CDATA[<210> 172]]>
<![CDATA[<211> 204]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 172]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Lys
35 40 45
Ser Ser Leu Ile Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Arg Asp Glu Leu Asn
65 70 75 80
Val His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Gln Ala Gly His Leu Ala Ser His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser
145 150 155 160
Gly Asn Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ala Gly His Leu Ala
180 185 190
Ser His Gln Arg Thr His Thr Gly Lys Lys Thr Ser
195 200
<![CDATA[<210> 173]]>
<![CDATA[<211> 612]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 173]]>
ctggagcctg gagagaaacc ctacaaatgc ccggaatgcg ggaagtcctt ttccgaacgc 60
tcgcacctga gggaacacca gagaactcac accggcgaaa aaccctataa gtgcccagaa 120
tgcggaaaga gcttttcacg gtcggacaac ctcgtgcggc accaacgcac tcataccgga 180
gagaagccgt acaagtgtcc tgagtgcgga aagtcattct ccgactgccg ggatttggcc 240
cgccaccaaa gaacacacac tggcgaaaag ccctacaagt gcccggagtg tggaaagtcc 300
ttcagcactt ccggagagct ggtccggcac cagaggaccc acaccgggga gaagccttac 360
aaatgtccag agtgcggtaa aagcttctcc accaccggca acctcaccgt gcaccagcgg 420
acccacactg gagaaaagcc gtataaatgc cccgaatgcg gcaagagctt ctcgcgatcc 480
gataagcttg tgcggcatca gagaacgcac actggggaaa agccttataa gtgtccggag 540
tgcggcaaat ccttctcccg cactgacacc ctgcgggacc atcagcgcac ccataccggc 600
aaaaagacct ct 612
<![CDATA[<210> 174]]>
<![CDATA[<211> 61]]>2
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 174]]>
cttgaacccg gggagaagcc ctacaagtgc ccggaatgcg gaaagagctt cagccagaag 60
tcctcgctga tcgcgcacca gaggactcac accggcgaaa agccatacaa gtgtcctgag 120
tgtggcaaat ccttctcgca caagaacgca ctgcagaacc accagagaac ccacaccggg 180
gaaaagccgt ataagtgccc cgaatgtgga aagtcgttca gccagtcatc caacctcgtg 240
cgccatcaaa ggactcatac cggagagaaa ccttacaaat gccctgaatg cggcaaatct 300
ttctcccgga atgatgccct gaccgagcac cagcgcactc acacgggaga gaagccgtac 360
aaatgtccgg agtgcggaaa gtccttctcc gacaagaagg acttgaccag acaccagcgg 420
acccatactg gcgaaaaacc ctataagtgt ccagagtgcg ggaagtcctt tagccaagcc 480
ggtcacctcg cttcccacca acggacccac acaggagaaa agccttataa atgccccgag 540
tgcggcaaaa gcttctccga taagaaggac ctgactcggc atcagcgcac ccataccgga 600
aagaaaacct ca 612
<![CDATA[<210> 175]]>
<![CDATA[<211> 612]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 175]]>
ctggagcctg gcgaaaaacc ctataagtgc ccagaatgcg gaaagagctt ttcacggtcg 60
gacaacctcg tgcggcacca acgcactcat accggagaga agccgtacaa gtgtcctgag 120
tgcggaaagt cattctccga ctgccgggat ttggcccgcc accaaagaac acacactggc 180
gaaaagccct acaagtgccc ggagtgtgga aagtccttca gcacttccgg agagctggtc 240
cggcaccaga ggacccacac cggggagaag ccttacaaat gtccagagtg cggtaaaagc 300
ttctccacca ccggcaacct caccgtgcac cagcggaccc acactggaga aaagccgtat 360
aaatgccccg aatgcggcaa gagcttctcg cgatccgata agcttgtgcg gcatcagaga 420
acgcacactg gggaaaagcc ttataagtgt ccggagtgcg gcaaatcctt ctcccgcact 480
gacaccctgc gggaccacca gagaacccat actggcgaga agccatacaa atgcccggaa 540
tgtggaaaga gtttctcgcg cgaggacaac ctccacaccc atcagcgcac ccataccggc 600
aaaaagacct ct 612
<![CDATA[<210> 176]]>
<![CDATA[<211> 612]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 176]]>
ctggaacccg gagagaaacc ctacaaatgc ccagagtgcg gcaaatcgtt ctcacggtcc 60
gatcacctca ccacccacca gaggacccat accggggaga agccttacaa gtgtcctgag 120
tgtggaaagt ccttcagcca aaagtcctcg ctgatcgcac accagcgcac gcacactggg 180
gaaaagccat ataaatgccc ggagtgtggc aaatccttct cccgccgcga cgaactgaac 240
gtgcaccaga gaacccacac tggagagaag ccgtataagt gtccggagtg cggaaagagc 300
ttctcgcaat ccggggacct tcggagacat cagaggacac acactggcga aaagccctat 360
aagtgccctg agtgcgggaa gtcctttagc caagccggtc acctggcctc ccaccaacgg 420
actcacaccg gcgaaaaacc gtacaagtgc cccgaatgcg gaaagtcgtt ctctacctcc 480
ggaaacttga ccgaacacca gcggacccac accggagaaa agccgtacaa atgtcctgaa 540
tgcggcaaaa gcttcagcca ggccggtcat ctcgcgagcc atcagcggac tcatactggc 600
aaaaagacct ca 612
<![CDATA[<210> 177]]>
<![CDATA[<211> 367]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 177]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Ser Asp Asn Leu Val Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Thr Asp Thr Leu Arg Asp His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[<210> 178]]>
<![CDATA[<211> 367]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 178]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Gln Lys Ser Ser Leu Ile Ala His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser His Lys Asn Ala Leu Gln Asn His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Gln Ser Ser Asn Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Ala
115 120 125
Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Asp Lys Lys Asp Leu Thr Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Ala Gly His Leu Ala Ser His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Lys Lys Asp Leu Thr Arg His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[<210> 179]]>
<![CDATA[<211> 367]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 179]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asn Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Thr Ser Gly Glu Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn
115 120 125
Leu Thr Val His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Thr Asp Thr Leu Arg Asp His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[<210> 180]]>
<![CDATA[<211> 367]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 180]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr Thr His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Lys Ser Ser Leu Ile Ala His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Arg Asp Glu Leu Asn Val His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp
115 120 125
Leu Arg Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Ala Gly His Leu Ala Ser His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Thr Ser Gly Asn Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Gln Ala Gly His Leu Ala Ser His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[<210> 181]]>
<![CDATA[<211> 1082]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 181]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asp Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn
115 120 125
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys
625 630 635 640
Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val
645 650 655
Pro Asp Tyr Ala Gly Ser Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala
660 665 670
Gly Asp Val Glu Glu Asn Pro Gly Pro Thr Ser Ala Gly Lys Leu Gly
675 680 685
Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu
690 695 700
Asn Pro Gly Pro Leu Glu Gly Ser Ser Gly Ser Gly Ser Pro Lys Lys
705 710 715 720
Lys Arg Lys Val Gly Ile His Gly Val Pro Ala Ala Gly Ser Ser Gly
725 730 735
Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys
740 745 750
Ser Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr
755 760 765
Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg
770 775 780
Ser Asp Asn Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro
785 790 795 800
Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Cys Arg Asp Leu
805 810 815
Ala Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro
820 825 830
Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu Leu Val Arg His Gln
835 840 845
Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys
850 855 860
Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His Gln Arg Thr His Thr
865 870 875 880
Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg
885 890 895
Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro
900 905 910
Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Thr Asp Thr Leu
915 920 925
Arg Asp His Gln Arg Thr His Thr Gly Lys Lys Thr Ser Ala Ser Gly
930 935 940
Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp Ala Lys Ser Leu Thr
945 950 955 960
Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp Val Phe Val Asp Phe
965 970 975
Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala Gln Gln Ile Leu Tyr
980 985 990
Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu Val Ser Leu Gly Tyr
995 1000 1005
Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu Glu Lys Gly Glu
1010 1015 1020
Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu Thr His Pro
1025 1030 1035
Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro Ser Ser
1040 1045 1050
Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys
1055 1060 1065
Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1070 1075 1080
<![CDATA[<210> 182]]>
<![CDATA[<211> 1082]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 182]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Ser Asp Asn Leu Val Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Thr Asp Thr Leu Arg Asp His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly
355 360 365
Ser Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu
370 375 380
Asn Pro Gly Pro Thr Ser Ala Gly Lys Leu Gly Ser Gly Glu Gly Arg
385 390 395 400
Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu Asn Pro Gly Pro Leu
405 410 415
Glu Gly Ser Ser Gly Ser Gly Ser Pro Lys Lys Lys Arg Lys Val Gly
420 425 430
Ile His Gly Val Pro Ala Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly
435 440 445
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser
450 455 460
Asp Asp Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
465 470 475 480
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Glu Asp Asn Leu His
485 490 495
Thr His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
500 505 510
Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr Thr His Gln Arg
515 520 525
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
530 535 540
Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His Thr Gly
545 550 555 560
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu
565 570 575
Ala His Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
580 585 590
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg
595 600 605
Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
610 615 620
Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg
625 630 635 640
Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly
645 650 655
Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Ser Lys Val Glu Asn Lys
660 665 670
Thr Lys Lys Leu Arg Val Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln
675 680 685
Arg Lys Ala Leu Glu Lys Val Arg Lys Asp Glu Tyr Glu Ile Val Gly
690 695 700
Leu Ala Glu Trp Tyr Val Pro Ala Ile Val Met Tyr Gln Ala Ile His
705 710 715 720
Asn Asn Phe His Thr Lys Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu
725 730 735
Met Ile Asp Tyr Leu Glu Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn
740 745 750
Pro Val Ser Asn Gly Tyr Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys
755 760 765
Ile Ile Tyr Asn Ala Ile Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe
770 775 780
Asp Ile Arg Asp Leu Tyr Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu
785 790 795 800
Thr Tyr Ser Phe Pro Cys Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys
805 810 815
Gly Met Lys Arg Gly Ser Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile
820 825 830
Glu Arg Ala Leu Asp Ser Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu
835 840 845
Leu Met Glu Asn Val Gly Ala Leu Leu His Lys Lys Asn Glu Glu Glu
850 855 860
Leu Asn Gln Trp Lys Gln Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser
865 870 875 880
Ile Glu Val Leu Asn Ala Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg
885 890 895
Arg Val Phe Met Ile Ser Thr Leu Asn Glu Phe Val Glu Leu Pro Lys
900 905 910
Gly Asp Lys Lys Pro Lys Ser Ile Lys Lys Val Leu Asn Lys Ile Val
915 920 925
Ser Glu Lys Asp Ile Leu Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu
930 935 940
Phe Lys Lys Thr Lys Ser Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr
945 950 955 960
Ser Lys Phe Asn Ser Glu Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly
965 970 975
Pro Thr Leu Thr Ala Ser Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp
980 985 990
Gly Ser Asn Ile Arg Lys Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile
995 1000 1005
Gly Phe Asp Ser Gln Asp Gly Lys Arg Val Asn Glu Ile Glu Phe
1010 1015 1020
Leu Thr Glu Asn Gln Lys Ile Phe Val Cys Gly Asn Ser Ile Ser
1025 1030 1035
Val Glu Val Leu Glu Ala Ile Ile Asp Lys Ile Gly Gly Pro Ser
1040 1045 1050
Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys
1055 1060 1065
Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1070 1075 1080
<![CDATA[<210> 183]]>
<![CDATA[<211> 358]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 183]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Ser Asp Asn Leu Val Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Thr Asp Thr Leu Arg Asp His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[<210> 184]]>
<![CDATA[<211> 358]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 184]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Gln Lys Ser Ser Leu Ile Ala His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser His Lys Asn Ala Leu Gln Asn His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Gln Ser Ser Asn Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Ala
115 120 125
Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Asp Lys Lys Asp Leu Thr Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Ala Gly His Leu Ala Ser His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Lys Lys Asp Leu Thr Arg His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[<210> 185]]>
<![CDATA[<211> 358]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 185]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asn Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Thr Ser Gly Glu Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn
115 120 125
Leu Thr Val His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Thr Asp Thr Leu Arg Asp His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[<210> 186]]>
<![CDATA[<211> 358]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 186]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr Thr His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Lys Ser Ser Leu Ile Ala His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Arg Asp Glu Leu Asn Val His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp
115 120 125
Leu Arg Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Ala Gly His Leu Ala Ser His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Thr Ser Gly Asn Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Gln Ala Gly His Leu Ala Ser His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[<210> 187]]>
<![CDATA[<211> 1073]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 187]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asp Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn
115 120 125
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys
625 630 635 640
Lys Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val
645 650 655
Pro Asp Tyr Ala Gly Ser Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala
660 665 670
Gly Asp Val Glu Glu Asn Pro Gly Pro Thr Ser Ala Gly Lys Leu Gly
675 680 685
Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu
690 695 700
Asn Pro Gly Pro Leu Glu Gly Ser Ser Gly Ser Gly Ser Pro Lys Lys
705 710 715 720
Lys Arg Lys Val Gly Ile His Gly Val Pro Ala Ala Gly Ser Ser Gly
725 730 735
Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys
740 745 750
Ser Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr
755 760 765
Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg
770 775 780
Ser Asp Asn Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro
785 790 795 800
Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Cys Arg Asp Leu
805 810 815
Ala Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro
820 825 830
Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu Leu Val Arg His Gln
835 840 845
Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys
850 855 860
Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His Gln Arg Thr His Thr
865 870 875 880
Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg
885 890 895
Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro
900 905 910
Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Thr Asp Thr Leu
915 920 925
Arg Asp His Gln Arg Thr His Thr Gly Lys Lys Thr Ser Ala Ser Gly
930 935 940
Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp Ala Lys Ser Leu Thr
945 950 955 960
Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp Val Phe Val Asp Phe
965 970 975
Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala Gln Gln Ile Leu Tyr
980 985 990
Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu Val Ser Leu Gly Tyr
995 1000 1005
Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu Glu Lys Gly Glu
1010 1015 1020
Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu Thr His Pro
1025 1030 1035
Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro Ser Ser
1040 1045 1050
Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys
1055 1060 1065
Lys Lys Lys Gly Ser
1070
<![CDATA[<210> 188]]>
<![CDATA[<211> 1073]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成多肽]]>
<![CDATA[<400> 188]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Ser Asp Asn Leu Val Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Thr Asp Thr Leu Arg Asp His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly
355 360 365
Ser Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu
370 375 380
Asn Pro Gly Pro Thr Ser Ala Gly Lys Leu Gly Ser Gly Glu Gly Arg
385 390 395 400
Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu Asn Pro Gly Pro Leu
405 410 415
Glu Gly Ser Ser Gly Ser Gly Ser Pro Lys Lys Lys Arg Lys Val Gly
420 425 430
Ile His Gly Val Pro Ala Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly
435 440 445
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser
450 455 460
Asp Asp Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
465 470 475 480
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Glu Asp Asn Leu His
485 490 495
Thr His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
500 505 510
Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr Thr His Gln Arg
515 520 525
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
530 535 540
Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His Thr Gly
545 550 555 560
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu
565 570 575
Ala His Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
580 585 590
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg
595 600 605
Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
610 615 620
Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg
625 630 635 640
Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly
645 650 655
Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Ser Lys Val Glu Asn Lys
660 665 670
Thr Lys Lys Leu Arg Val Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln
675 680 685
Arg Lys Ala Leu Glu Lys Val Arg Lys Asp Glu Tyr Glu Ile Val Gly
690 695 700
Leu Ala Glu Trp Tyr Val Pro Ala Ile Val Met Tyr Gln Ala Ile His
705 710 715 720
Asn Asn Phe His Thr Lys Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu
725 730 735
Met Ile Asp Tyr Leu Glu Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn
740 745 750
Pro Val Ser Asn Gly Tyr Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys
755 760 765
Ile Ile Tyr Asn Ala Ile Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe
770 775 780
Asp Ile Arg Asp Leu Tyr Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu
785 790 795 800
Thr Tyr Ser Phe Pro Cys Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys
805 810 815
Gly Met Lys Arg Gly Ser Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile
820 825 830
Glu Arg Ala Leu Asp Ser Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu
835 840 845
Leu Met Glu Asn Val Gly Ala Leu Leu His Lys Lys Asn Glu Glu Glu
850 855 860
Leu Asn Gln Trp Lys Gln Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser
865 870 875 880
Ile Glu Val Leu Asn Ala Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg
885 890 895
Arg Val Phe Met Ile Ser Thr Leu Asn Glu Phe Val Glu Leu Pro Lys
900 905 910
Gly Asp Lys Lys Pro Lys Ser Ile Lys Lys Val Leu Asn Lys Ile Val
915 920 925
Ser Glu Lys Asp Ile Leu Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu
930 935 940
Phe Lys Lys Thr Lys Ser Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr
945 950 955 960
Ser Lys Phe Asn Ser Glu Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly
965 970 975
Pro Thr Leu Thr Ala Ser Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp
980 985 990
Gly Ser Asn Ile Arg Lys Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile
995 1000 1005
Gly Phe Asp Ser Gln Asp Gly Lys Arg Val Asn Glu Ile Glu Phe
1010 1015 1020
Leu Thr Glu Asn Gln Lys Ile Phe Val Cys Gly Asn Ser Ile Ser
1025 1030 1035
Val Glu Val Leu Glu Ala Ile Ile Asp Lys Ile Gly Gly Pro Ser
1040 1045 1050
Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys
1055 1060 1065
Lys Lys Lys Gly Ser
1070
<![CDATA[<210> 189]]>
<![CDATA[<211> 1376]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 189]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ctggagagaa accctacaaa tgcccggaat gcgggaagtc cttttccgaa cgctcgcacc 180
tgagggaaca ccagagaact cacaccggcg aaaaacccta taagtgccca gaatgcggaa 240
agagcttttc acggtcggac aacctcgtgc ggcaccaacg cactcatacc ggagagaagc 300
cgtacaagtg tcctgagtgc ggaaagtcat tctccgactg ccgggatttg gcccgccacc 360
aaagaacaca cactggcgaa aagccctaca agtgcccgga gtgtggaaag tccttcagca 420
cttccggaga gctggtccgg caccagagga cccacaccgg ggagaagcct tacaaatgtc 480
cagagtgcgg taaaagcttc tccaccaccg gcaacctcac cgtgcaccag cggacccaca 540
ctggagaaaa gccgtataaa tgccccgaat gcggcaagag cttctcgcga tccgataagc 600
ttgtgcggca tcagagaacg cacactgggg aaaagcctta taagtgtccg gagtgcggca 660
aatccttctc ccgcactgac accctgcggg accatcagcg cacccatacc ggcaaaaaga 720
cctctgctag cggcagcggc ggcggcagcg gcggcgcccg ggacgacgcc aagagcctga 780
ccgcctggag ccggaccctg gtgaccttca aggacgtgtt cgtggacttc acccgggagg 840
agtggaagct gctggacacc gcccagcaga tcctgtaccg gaacgtgatg ctggagaact 900
acaagaacct ggtgagcctg ggctaccagc tgaccaagcc cgacgtgatc ctgcggctgg 960
agaagggcga ggagccctgg ctggtggagc gggagatcca ccaggaaacc caccccgaca 1020
gcgaaaccgc cttcgagatc aagagcagcg tgcccagcag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1320
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 1376
<![CDATA[<210> 190]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 190]]>
aacacagttc agccgagcgc t 21
<![CDATA[<210> 191]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 191]]>
cgaacaaccg tacagaaagg g 21
<![CDATA[<210> 192]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 192]]>
gtaaacagta atagcgcagc a 21
<![CDATA[<210> 193]]>
<![CDATA[<211> 1376]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 193]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tcccttgaac 120
ccggggagaa gccctacaag tgcccggaat gcggaaagag cttcagccag aagtcctcgc 180
tgatcgcgca ccagaggact cacaccggcg aaaagccata caagtgtcct gagtgtggca 240
aatccttctc gcacaagaac gcactgcaga accaccagag aacccacacc ggggaaaagc 300
cgtataagtg ccccgaatgt ggaaagtcgt tcagccagtc atccaacctc gtgcgccatc 360
aaaggactca taccggagag aaaccttaca aatgccctga atgcggcaaa tctttctccc 420
ggaatgatgc cctgaccgag caccagcgca ctcacacggg agagaagccg tacaaatgtc 480
cggagtgcgg aaagtccttc tccgacaaga aggacttgac cagacaccag cggacccata 540
ctggcgaaaa accctataag tgtccagagt gcgggaagtc ctttagccaa gccggtcacc 600
tcgcttccca ccaacggacc cacacaggag aaaagcctta taaatgcccc gagtgcggca 660
aaagcttctc cgataagaag gacctgactc ggcatcagcg cacccatacc ggaaagaaaa 720
cctcagctag cggcagcggc ggcggcagcg gcggcgcccg ggacgacgcc aagagcctga 780
ccgcctggag ccggaccctg gtgaccttca aggacgtgtt cgtggacttc acccgggagg 840
agtggaagct gctggacacc gcccagcaga tcctgtaccg gaacgtgatg ctggagaact 900
acaagaacct ggtgagcctg ggctaccagc tgaccaagcc cgacgtgatc ctgcggctgg 960
agaagggcga ggagccctgg ctggtggagc gggagatcca ccaggaaacc caccccgaca 1020
gcgaaaccgc cttcgagatc aagagcagcg tgcccagcag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1320
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 1376
<![CDATA[<210> 194]]>
<![CDATA[<211> 1376]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 194]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ctggcgaaaa accctataag tgcccagaat gcggaaagag cttttcacgg tcggacaacc 180
tcgtgcggca ccaacgcact cataccggag agaagccgta caagtgtcct gagtgcggaa 240
agtcattctc cgactgccgg gatttggccc gccaccaaag aacacacact ggcgaaaagc 300
cctacaagtg cccggagtgt ggaaagtcct tcagcacttc cggagagctg gtccggcacc 360
agaggaccca caccggggag aagccttaca aatgtccaga gtgcggtaaa agcttctcca 420
ccaccggcaa cctcaccgtg caccagcgga cccacactgg agaaaagccg tataaatgcc 480
ccgaatgcgg caagagcttc tcgcgatccg ataagcttgt gcggcatcag agaacgcaca 540
ctggggaaaa gccttataag tgtccggagt gcggcaaatc cttctcccgc actgacaccc 600
tgcgggacca ccagagaacc catactggcg agaagccata caaatgcccg gaatgtggaa 660
agagtttctc gcgcgaggac aacctccaca cccatcagcg cacccatacc ggcaaaaaga 720
cctctgctag cggcagcggc ggcggcagcg gcggcgcccg ggacgacgcc aagagcctga 780
ccgcctggag ccggaccctg gtgaccttca aggacgtgtt cgtggacttc acccgggagg 840
agtggaagct gctggacacc gcccagcaga tcctgtaccg gaacgtgatg ctggagaact 900
acaagaacct ggtgagcctg ggctaccagc tgaccaagcc cgacgtgatc ctgcggctgg 960
agaagggcga ggagccctgg ctggtggagc gggagatcca ccaggaaacc caccccgaca 1020
gcgaaaccgc cttcgagatc aagagcagcg tgcccagcag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1320
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 1376
<![CDATA[<210> 195]]>
<![CDATA[<211> 1376]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 195]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggaac 120
ccggagagaa accctacaaa tgcccagagt gcggcaaatc gttctcacgg tccgatcacc 180
tcaccaccca ccagaggacc cataccgggg agaagcctta caagtgtcct gagtgtggaa 240
agtccttcag ccaaaagtcc tcgctgatcg cacaccagcg cacgcacact ggggaaaagc 300
catataaatg cccggagtgt ggcaaatcct tctcccgccg cgacgaactg aacgtgcacc 360
agagaaccca cactggagag aagccgtata agtgtccgga gtgcggaaag agcttctcgc 420
aatccgggga ccttcggaga catcagagga cacacactgg cgaaaagccc tataagtgcc 480
ctgagtgcgg gaagtccttt agccaagccg gtcacctggc ctcccaccaa cggactcaca 540
ccggcgaaaa accgtacaag tgccccgaat gcggaaagtc gttctctacc tccggaaact 600
tgaccgaaca ccagcggacc cacaccggag aaaagccgta caaatgtcct gaatgcggca 660
aaagcttcag ccaggccggt catctcgcga gccatcagcg gactcatact ggcaaaaaga 720
cctcagctag cggcagcggc ggcggcagcg gcggcgcccg ggacgacgcc aagagcctga 780
ccgcctggag ccggaccctg gtgaccttca aggacgtgtt cgtggacttc acccgggagg 840
agtggaagct gctggacacc gcccagcaga tcctgtaccg gaacgtgatg ctggagaact 900
acaagaacct ggtgagcctg ggctaccagc tgaccaagcc cgacgtgatc ctgcggctgg 960
agaagggcga ggagccctgg ctggtggagc gggagatcca ccaggaaacc caccccgaca 1020
gcgaaaccgc cttcgagatc aagagcagcg tgcccagcag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1320
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 1376
<![CDATA[<210> 196]]>
<![CDATA[<211> 3521]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 196]]>
gggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gccccaaaga 60
agaagagaaa ggtcggaatt catggcgtgc ccgcagccgg cagcagcggt tccctggagc 120
ccggcgagaa accttacaaa tgccccgagt gcggcaagag cttcagcaga agcgacgatc 180
tggtgaggca ccaaagaacc cacaccggcg aaaaacctta caagtgtccc gaatgcggaa 240
agtccttcag cagagaggac aatctgcaca cccaccagag aacacacacc ggagaaaagc 300
cttacaagtg ccccgaatgc ggcaaatcct tttctagaag cgatcatctg accacccacc 360
aaagaacaca taccggcgag aagccttaca aatgtcccga gtgcggaaag tccttctccc 420
agagagccaa tctgagggct catcaaagga cccataccgg cgaaaagccc tacaaatgcc 480
ccgagtgcgg aaaatccttc agccagctgg cccatctgag agcccaccaa aggacacaca 540
ccggagagaa accctataag tgccccgagt gtggaaaaag cttttcccag agggccaatc 600
tgagggccca tcagaggacc cataccggag agaagcctta taaatgtccc gagtgcggaa 660
aaagcttcag cgagaggagc catctgaggg aacatcaaag aacccacacc ggcgaaaaac 720
ccaccggaaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gcccgggaca 780
gcaaggtgga gaacaagacc aagaagctgc gggtgttcga ggccttcgcc ggcatcggcg 840
cccagcggaa ggccctggag aaggtgcgga aggacgagta cgagatcgtg ggcctggccg 900
agtggtacgt gcccgccatc gtgatgtacc aggccatcca caacaacttc cacaccaagc 960
tggagtacaa gagcgtgagc cgggaggaga tgatcgacta cctggagaac aagaccctga 1020
gctggaacag caagaacccc gtgagcaacg gctactggaa gcggaagaag gacgacgagc 1080
tgaagatcat ctacaacgcc atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctgta caagcggacc ctgaagaaca tcgacctgct gacctacagc ttcccctgcc 1200
aggacctgag ccagcagggc atccagaagg gcatgaagcg gggcagcggc acccggagcg 1260
gcctgctgtg ggagatcgag cgggccctgg acagcaccga gaagaacgac ctgcccaagt 1320
acctgctgat ggagaacgtg ggcgccctgc tgcacaagaa gaacgaggag gagctgaacc 1380
agtggaagca gaagctggag agcctgggct accagaacag catcgaggtg ctgaacgccg 1440
ccgacttcgg cagcagccag gcccggcggc gggtgttcat gatcagcacc ctgaacgagt 1500
tcgtggagct gcccaagggc gacaagaagc ccaagagcat caagaaggtg ctgaacaaga 1560
tcgtgagcga gaaggacatc ctgaacaacc tgctgaagta caacctgacc gagttcaaga 1620
aaaccaagag caacatcaac aaggccagcc tgatcggcta cagcaagttc aacagcgagg 1680
gctacgtgta cgaccccgag ttcaccggcc ccaccctgac cgccagcggc gccaacagcc 1740
ggatcaagat caaggacggc agcaacatcc ggaagatgaa cagcgacgag accttcctgt 1800
acatcggctt cgacagccag gacggcaagc gggtgaacga gatcgagttc ctgaccgaga 1860
accagaagat cttcgtgtgc ggcaacagca tcagcgtgga ggtgctggag gccatcatcg 1920
acaagatcgg cggccccagc ggcggcaagc ggcccgccgc caccaagaag gccggccagg 1980
ccaagaagaa gaagggcagc tacccctacg acgtgcccga ctacgccggg tccgccacca 2040
acttctcgct gctgaagcag gccggagacg tggaagaaaa ccctggacct accagtgccg 2100
gaaagctcgg tagcggagag ggtcggggaa gcctgcttac ttgcggcgac gtggaagaga 2160
accccggtcc gctggagggt tcgtccggct ccggatcccc caagaagaag cggaaggtgg 2220
gcatccacgg cgtgcccgcc gccggcagca gcggatccct ggagcctgga gagaaaccct 2280
acaaatgccc ggaatgcggg aagtcctttt ccgaacgctc gcacctgagg gaacaccaga 2340
gaactcacac cggcgaaaaa ccctataagt gcccagaatg cggaaagagc ttttcacggt 2400
cggacaacct cgtgcggcac caacgcactc ataccggaga gaagccgtac aagtgtcctg 2460
agtgcggaaa gtcattctcc gactgccggg atttggcccg ccaccaaaga acacacactg 2520
gcgaaaagcc ctacaagtgc ccggagtgtg gaaagtcctt cagcacttcc ggagagctgg 2580
tccggcacca gaggacccac accggggaga agccttacaa atgtccagag tgcggtaaaa 2640
gcttctccac caccggcaac ctcaccgtgc accagcggac ccacactgga gaaaagccgt 2700
ataaatgccc cgaatgcggc aagagcttct cgcgatccga taagcttgtg cggcatcaga 2760
gaacgcacac tggggaaaag ccttataagt gtccggagtg cggcaaatcc ttctcccgca 2820
ctgacaccct gcgggaccat cagcgcaccc ataccggcaa aaagacctct gctagcggca 2880
gcggcggcgg cagcggcggc gcccgggacg acgccaagag cctgaccgcc tggagccgga 2940
ccctggtgac cttcaaggac gtgttcgtgg acttcacccg ggaggagtgg aagctgctgg 3000
acaccgccca gcagatcctg taccggaacg tgatgctgga gaactacaag aacctggtga 3060
gcctgggcta ccagctgacc aagcccgacg tgatcctgcg gctggagaag ggcgaggagc 3120
cctggctggt ggagcgggag atccaccagg aaacccaccc cgacagcgaa accgccttcg 3180
agatcaagag cagcgtgccc agcagcggcg gcaagcggcc cgccgccacc aagaaggccg 3240
gccaggccaa gaagaagaag ggcagctacc cctacgacgt gcccgactac gcctgagcgg 3300
ccgcttaatt aagctgcctt ctgcggggct tgccttctgg ccatgccctt cttctctccc 3360
ttgcacctgt acctcttggt ctttgaataa agcctgagta ggaagtctag aaaaaaaaaa 3420
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3480
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a 3521
<![CDATA[<210> 197]]>
<![CDATA[<211> 3521]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 197]]>
gggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gccccaaaga 60
agaagagaaa ggtcggaatt catggcgtgc ccgcagccgg cagcagcggt tccctggagc 120
ctggagagaa accctacaaa tgcccggaat gcgggaagtc cttttccgaa cgctcgcacc 180
tgagggaaca ccagagaact cacaccggcg aaaaacccta taagtgccca gaatgcggaa 240
agagcttttc acggtcggac aacctcgtgc ggcaccaacg cactcatacc ggagagaagc 300
cgtacaagtg tcctgagtgc ggaaagtcat tctccgactg ccgggatttg gcccgccacc 360
aaagaacaca cactggcgaa aagccctaca agtgcccgga gtgtggaaag tccttcagca 420
cttccggaga gctggtccgg caccagagga cccacaccgg ggagaagcct tacaaatgtc 480
cagagtgcgg taaaagcttc tccaccaccg gcaacctcac cgtgcaccag cggacccaca 540
ctggagaaaa gccgtataaa tgccccgaat gcggcaagag cttctcgcga tccgataagc 600
ttgtgcggca tcagagaacg cacactgggg aaaagcctta taagtgtccg gagtgcggca 660
aatccttctc ccgcactgac accctgcggg accatcagcg cacccatacc ggcaaaaaga 720
cctctgctag cggcagcggc ggcggcagcg gcggcgcccg ggacgacgcc aagagcctga 780
ccgcctggag ccggaccctg gtgaccttca aggacgtgtt cgtggacttc acccgggagg 840
agtggaagct gctggacacc gcccagcaga tcctgtaccg gaacgtgatg ctggagaact 900
acaagaacct ggtgagcctg ggctaccagc tgaccaagcc cgacgtgatc ctgcggctgg 960
agaagggcga ggagccctgg ctggtggagc gggagatcca ccaggaaacc caccccgaca 1020
gcgaaaccgc cttcgagatc aagagcagcg tgcccagcag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgccgg gtccgccacc aacttctcgc tgctgaagca ggccggagac gtggaagaaa 1200
accctggacc taccagtgcc ggaaagctcg gtagcggaga gggtcgggga agcctgctta 1260
cttgcggcga cgtggaagag aaccccggtc cgctggaggg ttcgtccggc tccggatccc 1320
ccaagaagaa gcggaaggtg ggcatccacg gcgtgcccgc cgccggcagc agcggatccc 1380
tggagcccgg cgagaaacct tacaaatgcc ccgagtgcgg caagagcttc agcagaagcg 1440
acgatctggt gaggcaccaa agaacccaca ccggcgaaaa accttacaag tgtcccgaat 1500
gcggaaagtc cttcagcaga gaggacaatc tgcacaccca ccagagaaca cacaccggag 1560
aaaagcctta caagtgcccc gaatgcggca aatccttttc tagaagcgat catctgacca 1620
cccaccaaag aacacatacc ggcgagaagc cttacaaatg tcccgagtgc ggaaagtcct 1680
tctcccagag agccaatctg agggctcatc aaaggaccca taccggcgaa aagccctaca 1740
aatgccccga gtgcggaaaa tccttcagcc agctggccca tctgagagcc caccaaagga 1800
cacacaccgg agagaaaccc tataagtgcc ccgagtgtgg aaaaagcttt tcccagaggg 1860
ccaatctgag ggcccatcag aggacccata ccggagagaa gccttataaa tgtcccgagt 1920
gcggaaaaag cttcagcgag aggagccatc tgagggaaca tcaaagaacc cacaccggcg 1980
aaaaacccac cggaaaaaag accagcgcta gcggcagcgg cggcggcagc ggcggcgccc 2040
gggacagcaa ggtggagaac aagaccaaga agctgcgggt gttcgaggcc ttcgccggca 2100
tcggcgccca gcggaaggcc ctggagaagg tgcggaagga cgagtacgag atcgtgggcc 2160
tggccgagtg gtacgtgccc gccatcgtga tgtaccaggc catccacaac aacttccaca 2220
ccaagctgga gtacaagagc gtgagccggg aggagatgat cgactacctg gagaacaaga 2280
ccctgagctg gaacagcaag aaccccgtga gcaacggcta ctggaagcgg aagaaggacg 2340
acgagctgaa gatcatctac aacgccatca agctgagcga gaaggagggc aacatcttcg 2400
acatccggga cctgtacaag cggaccctga agaacatcga cctgctgacc tacagcttcc 2460
cctgccagga cctgagccag cagggcatcc agaagggcat gaagcggggc agcggcaccc 2520
ggagcggcct gctgtgggag atcgagcggg ccctggacag caccgagaag aacgacctgc 2580
ccaagtacct gctgatggag aacgtgggcg ccctgctgca caagaagaac gaggaggagc 2640
tgaaccagtg gaagcagaag ctggagagcc tgggctacca gaacagcatc gaggtgctga 2700
acgccgccga cttcggcagc agccaggccc ggcggcgggt gttcatgatc agcaccctga 2760
acgagttcgt ggagctgccc aagggcgaca agaagcccaa gagcatcaag aaggtgctga 2820
acaagatcgt gagcgagaag gacatcctga acaacctgct gaagtacaac ctgaccgagt 2880
tcaagaaaac caagagcaac atcaacaagg ccagcctgat cggctacagc aagttcaaca 2940
gcgagggcta cgtgtacgac cccgagttca ccggccccac cctgaccgcc agcggcgcca 3000
acagccggat caagatcaag gacggcagca acatccggaa gatgaacagc gacgagacct 3060
tcctgtacat cggcttcgac agccaggacg gcaagcgggt gaacgagatc gagttcctga 3120
ccgagaacca gaagatcttc gtgtgcggca acagcatcag cgtggaggtg ctggaggcca 3180
tcatcgacaa gatcggcggc cccagcggcg gcaagcggcc cgccgccacc aagaaggccg 3240
gccaggccaa gaagaagaag ggcagctacc cctacgacgt gcccgactac gcctgagcgg 3300
ccgcttaatt aagctgcctt ctgcggggct tgccttctgg ccatgccctt cttctctccc 3360
ttgcacctgt acctcttggt ctttgaataa agcctgagta ggaagtctag aaaaaaaaaa 3420
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3480
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a 3521
<![CDATA[<210> 198]]>
<![CDATA[<211> 1385]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 198]]>
gaaataagag agaaaagaag agtaagaaga aatataagac ctaatagtaa tagtaatagc 60
ccaagaagaa gcgcaaggtc gggatccacg gagtcccggc agcaggatcc tcaggctcac 120
tggaaccggg ggaaaaaccc tacaagtgcc cggagtgcgg caagagcttc tcgacctccg 180
ggaacctgac cgagcaccag cgcacccaca ccggagagaa accgtacaag tgccccgagt 240
gcgggaaatc gttctcagac tcgggaaacc tcagggtgca ccagcggacc cacacggggg 300
aaaagccgta caagtgcccg gagtgcggga agtcattctc ccacaagaac gcgctgcaga 360
accaccaaag aacccacacc ggcgaaaaac cgtacaagtg ccccgagtgc ggaaagtcct 420
tctcccgcaa cgacaccctc accgaacacc aacgcaccca caccggagaa aagccctaca 480
agtgcccgga gtgcggaaag agcttcagcc agagggccca cctggaaaga caccagagaa 540
cccacaccgg cgaaaagccg tacaagtgcc cggagtgcgg gaagtccttc agccggtcag 600
acaagctggt ccgccaccaa aggacccaca caggagaaaa gccctacaag tgcccggagt 660
gcggaaaatc gttcagcgac cccggacacc tggtccggca ccagaggacc cacaccgggg 720
agaagccgac cggcaaaaag acctcagcaa gcgggagcgg aggaggaagc ggaggggacg 780
ccaagagcct gaccgcgtgg agcaggaccc tcgtgacctt caaagacgtg ttcgtggact 840
tcacccgcga agagtggaag ctgctggaca ccgcgcagca aatcctgtac cggaacgtga 900
tgctggaaaa ctacaagaac ctcgtcagcc tcggctacca actgaccaag cccgacgtga 960
tcctgcggct ggaaaagggc gaagaaccct ggctcgtgga gcgggagatc caccaggaaa 1020
cccacccgga cagcgaaacc gcgttcgaaa tcaagagcag cgtcagcgga ggaaaaagac 1080
cagccgccac caagaaggcc ggccaagcca agaagaaaaa gggcagctac ccctacgacg 1140
tgccggacta cgcatgagcg gccgcttaat taagctgcct tctgcggggc ttgccttctg 1200
gccatgccct tcttctctcc cttgcacctg tacctcttgg tctttgaata aagcctgagt 1260
aggaagtcta gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1320
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1380
aaaaa 1385
<![CDATA[<210> 199]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 199]]>
acggggaatg ctgccgagag c 21
<![CDATA[<210> 200]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 200]]>
acctgaaccc tggaaattat a 21
<![CDATA[<210> 201]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 201]]>
tagacgggga atgctgccga g 21
<![CDATA[<210> 202]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 202]]>
tgacattgag caatgatatg g 21
<![CDATA[<210> 203]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 203]]>
agcaaaagaa aatggtaggc g 21
<![CDATA[<210> 204]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 204]]>
acggggaatg ctgccgagag c 21
<![CDATA[<210> 205]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 205]]>
acggggaatg ctgccgagag c 21
<![CDATA[<210> 206]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 智人]]>
<![CDATA[<400> 206]]>
agcaaaagaa aatggtaggc g 21
<![CDATA[<210> 207]]>
<![CDATA[<211> 5]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成肽]]>
<![CDATA[<400> 207]]>
Gly Gly Gly Gly Ser
1 5
<![CDATA[<210> 208]]>
<![CDATA[<211> 830]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 208]]>
gatgttcatt agcagtggtg ataggttaat tttcaccatc tcttatgcgg ttgaatagtc 60
acctctgaac cactttttcc tccagtaact cctctttctt cggaccttct gcagccaacc 120
tgaaagaata acaaggaggt ggctggaaac ttgttttaag gaaccgcctg tccttccccc 180
gctggaaacc ttgcacctcg gacgctcctg ctcctgcccc cacctgaccc ccgccctcgt 240
tgacatccag gcgcgatgat ctctgctgcc agtagagggc acacttactt tactttcgca 300
aacctgaacg cgggtgctgc ccagagaggg ggcggaggga aagacgcttt gcagcaaaat 360
ccagcatagc gattggttgc tccccgcgtt tgcggcaaag gcctggaggc aggagtaatt 420
tgcaatcctt aaagctgaat tgtgcagtgc atcggatttg gaagctacta tattcactta 480
acacttgaac gctgagctgc aaactcaacg ggtaataacc catcttgaac agcgtacatg 540
ctatacacgc acccctttcc cccgaattgt tttctctttt ggaggtggtg gagggagaga 600
aaagtttact taaaatgcct ttgggtgagg gaccaaggat gagaagaatg ttttttgttt 660
ttcatgccgt ggaataacac aaaataaaaa atcccgaggg aatatacatt atatattaaa 720
tatagatcat ttcagggagc aaacaaatca tgtgtggggc tgggcaacta gctaagtcga 780
agcgtaaata aaatgtgaat acacgtttgc gggttacata cagtgcactt 830
<![CDATA[<210> 209]]>
<![CDATA[<211> 830]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成聚核苷酸]]>
<![CDATA[<400> 209]]>
taggcatcgt tttcctcctt atgcctctat cattcctccc tatctacact aacatcccac 60
gctctgaacg cgcgcccatt aatacccttc tttcctccac tctccctggg actcttgatc 120
aaagcgcggc cctttcccca gccttagcga ggcgccctgc agcctggtac gcgcgtggcg 180
tggcggtggg cgcgcagtgc gttctcggtg tggagggcag ctgttccgcc tgcgatgatt 240
tatactcaca ggacaaggat gcggtttgtc aaacagtact gctacggagg agcagcagag 300
aaagggagag ggtttgagag ggagcaaaag aaaatggtag gcgcgcgtag ttaattcatg 360
cggctctctt actctgttta catcctagag ctagagtgct cggctgcccg gctgagtctc 420
ctccccacct tccccaccct ccccaccctc cccataagcg cccctcccgg gttcccaaag 480
cagagggcgt gggggaaaag aaaaaagatc ctctctcgct aatctccgcc caccggccct 540
ttataatgcg agggtctgga cggctgagga cccccgagct gtgctgctcg cggccgccac 600
cgccgggccc cggccgtccc tggctcccct cctgcctcga gaagggcagg gcttctcaga 660
ggcttggcgg gaaaaagaac ggagggaggg atcgcgctga gtataaaagc cggttttcgg 720
ggctttatct aactcgctgt agtaattcca gcgagaggca gagggagcga gcgggcggcc 780
ggctagggtg gaagagccgg gcgagcagag ctgcgctgcg ggcgtcctgg 830
<![CDATA[<210> 210]]>
<![CDATA[<211> 7]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 人工序列之描述:合成寡核苷酸]]>
<![CDATA[<400> 210]]>
tatawaw 7
<![CDATA[ <110> FLAGSHIP PIONEERING INNOVATIONS V, INC.]]>
<![CDATA[ <120> Compositions and methods for modulating MYC expression]]>
<![CDATA[ <130> O2057-7029WO]]>
<![CDATA[ <140> TW 111112954]]>
<![CDATA[ <141> 2022-04-01]]>
<![CDATA[ <150> 63/281,022]]>
<![CDATA[ <151> 2021-11-18]]>
<![CDATA[ <150> 63/212,991]]>
<![CDATA[ <151> 2021-06-21]]>
<![CDATA[ <150>PCT/US21/10059]]>
<![CDATA[ <151> 2021-12-15]]>
<![CDATA[ <160> 210 ]]>
<![CDATA[ <170> PatentIn Version 3.5]]>
<![CDATA[ <210> 1]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 1]]>
atgatctctg ctgccagtag 20
<![CDATA[ <210> 2]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 2]]>
atcgcgcctg gatgtcaacg 20
<![CDATA[ <210> 3]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 3]]>
attgtgcagt gcatcggatt 20
<![CDATA[ <210> 4]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 4]]>
gtcaaacagt actgctacgg 20
<![CDATA[ <210> 5]]>
<![CDATA[ <211> 209]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 5]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Arg Ser Asp Lys Leu Thr Glu His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Lys
35 40 45
Asn Ser Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Thr Thr Gly Ala Leu Thr Glu His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Asp Ser Gly Asn Leu Arg Val His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg
145 150 155 160
Ala His Leu Glu Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu Leu Val
180 185 190
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[ <210> 6]]>
<![CDATA[ <21]]>1> 209]]>
<br/> <![CDATA[ <212>PRT]]>
<br/> <![CDATA[ <213> Artificial Sequence]]>
<br/>
<br/> <![CDATA[ <220>]]>
<br/> <![CDATA[ <223> Description of artificial sequence: synthetic polypeptide]]>
<br/>
<br/> <![CDATA[ <400>6]]>
<br/> <![CDATA[Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Ser Pro Ala Asp Leu Thr Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro
35 40 45
Gly His Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asn Leu Val
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Gln Ser Ser Ser Leu Val Arg His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Thr Ser His Ser Leu Thr Glu His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn
145 150 155 160
Asp Ala Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Ala Leu Thr
180 185 190
Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[ <210> 7]]>
<![CDATA[ <211> 209]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 7]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg
35 40 45
Ala His Leu Glu Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro Gly His Leu Val
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Gln Leu Ala His Leu Arg Ala His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ala
145 150 155 160
Asp Asn Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala
180 185 190
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[ <210> 8]]>
<![CDATA[ <211> 209]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> ]]>8
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser His Thr Gly His Leu Leu Glu His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Thr
35 40 45
Gly Asn Leu Thr Val His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Lys Lys Asp Leu Thr
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Gln Arg Ala His Leu Glu Arg His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser
145 150 155 160
Asp Lys Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Thr Asp Thr Leu Arg
180 185 190
Asp His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[ <210> 9]]>
<![CDATA[ <211> 209]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 9]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser His Thr Gly His Leu Leu Glu His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser
35 40 45
Gly Asn Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Asn Leu Val
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro
145 150 155 160
Gly His Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr His Leu Asp Leu Ile
180 185 190
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[ <210> 10]]>
<![CDATA[ <211> 209]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 10]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Asp Pro Gly Ala Leu Val Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg
35 40 45
Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr
65 70 75 80
Asn His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Arg Ser Asp His Leu Thr Asn His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro
145 150 155 160
Gly His Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val
180 185 190
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[ <210> 11]]>
<![CDATA[ <211> 209]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 11]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Arg Asn Asp Ala Leu Thr Glu His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Cys
35 40 45
Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro Gly His Leu Val
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Gln Ser Gly His Leu Thr Glu His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Lys
145 150 155 160
Asn Ser Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr
180 185 190
Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[ <210> 12]]>
<![CDATA[ <211> 209]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 12]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Arg Ser Asp Lys Leu Thr Glu His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Arg
35 40 45
Asp Glu Leu Asn Val His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr
65 70 75 80
Asn His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Ser Pro Ala Asp Leu Thr Arg His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Arg Ser Asp His Leu Thr Asn His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Ser Ser Lys
145 150 155 160
Lys Ala Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr His Leu Asp Leu Ile
180 185 190
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[ <210> 13]]>
<![CDATA[ <211> 209]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 13]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Arg Ser Asp Asp Leu Val Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Glu
35 40 45
Asp Asn Leu His Thr His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr
65 70 75 80
Thr His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Gln Leu Ala His Leu Arg Ala His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg
145 150 155 160
Ala Asn Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg
180 185 190
Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[ <210> 14]]>
<![CDATA[ <211> 209]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 14]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Gln Ser Gly Asp Leu Arg Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser
35 40 45
Gly His Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg
65 70 75 80
Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Arg Asn Asp Thr Leu Thr Glu His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ala
145 150 155 160
Asp Asn Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr His Leu Asp Leu Ile
180 185 190
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[ <210> 15]]>
<![CDATA[ <211> 209]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 15]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser His Thr Gly His Leu Leu Glu His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Ser Ser Lys
35 40 45
Lys Ala Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser His Thr Gly His Leu Leu Glu His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Arg Asn Asp Ala Leu Thr Glu His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser
145 150 155 160
Gly Asp Leu Arg Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Ser Gly Asn Leu Arg
180 185 190
Val His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[ <210> 16]]>
<![CDATA[ <211> 209]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 16]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Gln Ser Ser Ser Leu Val Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser
35 40 45
Asp His Leu Thr Asn His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg
65 70 75 80
Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Gln Ser Ser Asn Leu Val Arg His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Arg Ser Asp Asn Leu Val Arg His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser
145 150 155 160
Asp Glu Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg
180 185 190
Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[ <210> 17]]>
<![CDATA[ <211> 1367]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 17]]>
Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val Gly
1 5 10 15
Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe Lys
20 25 30
Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile Gly
35 40 45
Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu Lys
50 55 60
Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys Tyr
65 70 75 80
Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser Phe
85 90 95
Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys His
100 105 110
Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr His
115 120 125
Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp Ser
130 135 140
Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His Met
145 150 155 160
Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro Asp
165 170 175
Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr Asn
180 185 190
Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala Lys
195 200 205
Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn Leu
210 215 220
Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn Leu
225 230 235 240
Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe Asp
245 250 255
Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp Asp
260 265 270
Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp Leu
275 280 285
Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp Ile
290 295 300
Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser Met
305 310 315 320
Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys Ala
325 330 335
Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe Asp
340 345 350
Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser Gln
355 360 365
Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp Gly
370 375 380
Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg Lys
385 390 395 400
Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu Gly
405 410 415
Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe Leu
420 425 430
Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile Pro
435 440 445
Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp Met
450 455 460
Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu Val
465 470 475 480
Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr Asn
485 490 495
Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser Leu
500 505 510
Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys Tyr
515 520 525
Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln Lys
530 535 540
Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr Val
545 550 555 560
Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp Ser
565 570 575
Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly Thr
580 585 590
Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp Asn
595 600 605
Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr Leu
610 615 620
Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala His
625 630 635 640
Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Arg Tyr Thr
645 650 655
Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp Lys
660 665 670
Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe Ala
675 680 685
Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe Lys
690 695 700
Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu His
705 710 715 720
Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly Ile
725 730 735
Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly Arg
740 745 750
His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln Thr
755 760 765
Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile Glu
770 775 780
Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro Val
785 790 795 800
Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu Gln
805 810 815
Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg Leu
820 825 830
Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys Asp
835 840 845
Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Ala Arg Gly
850 855 860
Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys Asn
865 870 875 880
Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys Phe
885 890 895
Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp Lys
900 905 910
Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr Lys
915 920 925
His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp Glu
930 935 940
Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser Lys
945 950 955 960
Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg Glu
965 970 975
Ile Asn Asn Tyr His His His Ala His Asp Ala Tyr Leu Asn Ala Val Val
980 985 990
Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe Val
995 1000 1005
Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala Lys
1010 1015 1020
Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe Tyr
1025 1030 1035
Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala Asn
1040 1045 1050
Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu Thr
1055 1060 1065
Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val Arg
1070 1075 1080
Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr Glu
1085 1090 1095
Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro Lys Arg
1100 1105 1110
Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro Lys
1115 1120 1125
Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val Leu
1130 1135 1140
Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys Ser
1145 1150 1155
Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser Phe
1160 1165 1170
Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys Glu
1175 1180 1185
Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu Phe
1190 1195 1200
Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Gly Glu
1205 1210 1215
Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val Asn
1220 1225 1230
Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser Pro
1235 1240 1245
Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys His
1250 1255 1260
Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys Arg
1265 1270 1275
Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala Tyr
1280 1285 1290
Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn Ile
1295 1300 1305
Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala Phe
1310 1315 1320
Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr Ser Thr
1325 1330 1335
Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr Gly
1340 1345 1350
Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
<![CDATA[ <210> 18]]>
<![CDATA[ <211> 96]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 18]]>
Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys
1 5 10 15
Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr
20 25 30
Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn
35 40 45
Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg
50 55 60
Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln
65 70 75 80
Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val
85 90 95
<![CDATA[ <210> 19]]>
<![CDATA[ <211> 385]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 19]]>
Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val Phe Glu Ala Phe
1 5 10 15
Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys Val Arg Lys Asp
20 25 30
Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val Pro Ala Ile Val
35 40 45
Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys Leu Glu Tyr Lys
50 55 60
Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu Asn Lys Thr Leu
65 70 75 80
Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr Trp Lys Arg Lys
85 90 95
Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile Lys Leu Ser Glu
100 105 110
Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr Lys Arg Thr Leu
115 120 125
Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys Gln Asp Leu Ser
130 135 140
Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser Gly Thr Arg Ser
145 150 155 160
Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser Thr Glu Lys Asn
165 170 175
Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly Ala Leu Leu His
180 185 190
Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln Lys Leu Glu Ser
195 200 205
Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala Ala Asp Phe Gly
210 215 220
Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser Thr Leu Asn Glu
225 230 235 240
Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys Ser Ile Lys Lys
245 250 255
Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu Asn Asn Leu Leu
260 265 270
Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser Asn Ile Asn Lys
275 280 285
Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu Gly Tyr Val Tyr
290 295 300
Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser Gly Ala Asn Ser
305 310 315 320
Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys Met Asn Ser Asp
325 330 335
Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp Gly Lys Arg Val
340 345 350
Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile Phe Val Cys Gly
355 360 365
Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile Asp Lys Ile Gly
370 375 380
Gly
385
<![CDATA[ <210> 20]]>
<![CDATA[ <211> 480]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 20]]>
Val Asp Leu Arg Thr Leu Asp Val Phe Ser Gly Cys Gly Gly Leu Ser
1 5 10 15
Glu Gly Phe His Gln Ala Gly Ile Ser Asp Thr Leu Trp Ala Ile Glu
20 25 30
Met Trp Asp Pro Ala Ala Gln Ala Phe Arg Leu Asn Asn Pro Gly Ser
35 40 45
Thr Val Phe Thr Glu Asp Cys Asn Ile Leu Leu Lys Leu Val Met Ala
50 55 60
Gly Glu Thr Thr Asn Ser Arg Gly Gln Arg Leu Pro Gln Lys Gly Asp
65 70 75 80
Val Glu Met Leu Cys Gly Gly Pro Pro Cys Gln Gly Phe Ser Gly Met
85 90 95
Asn Arg Phe Asn Ser Arg Thr Tyr Ser Lys Phe Lys Asn Ser Leu Val
100 105 110
Val Ser Phe Leu Ser Tyr Cys Asp Tyr Tyr Arg Pro Arg Phe Phe Leu
115 120 125
Leu Glu Asn Val Arg Asn Phe Val Ser Phe Lys Arg Ser Met Val Leu
130 135 140
Lys Leu Thr Leu Arg Cys Leu Val Arg Met Gly Tyr Gln Cys Thr Phe
145 150 155 160
Gly Val Leu Gln Ala Gly Gln Tyr Gly Val Ala Gln Thr Arg Arg Arg
165 170 175
Ala Ile Ile Leu Ala Ala Ala Pro Gly Glu Lys Leu Pro Leu Phe Pro
180 185 190
Glu Pro Leu His Val Phe Ala Pro Arg Ala Cys Gln Leu Ser Val Val
195 200 205
Val Asp Asp Lys Lys Phe Val Ser Asn Ile Thr Arg Leu Ser Ser Gly
210 215 220
Pro Phe Arg Thr Ile Thr Val Arg Asp Thr Met Ser Asp Leu Pro Glu
225 230 235 240
Val Arg Asn Gly Ala Ser Ala Leu Glu Ile Ser Tyr Asn Gly Glu Pro
245 250 255
Gln Ser Trp Phe Gln Arg Gln Leu Arg Gly Ala Gln Tyr Gln Pro Ile
260 265 270
Leu Arg Asp His Ile Cys Lys Asp Met Ser Ala Leu Val Ala Ala Arg
275 280 285
Met Arg His Ile Pro Leu Ala Pro Gly Ser Asp Trp Arg Asp Leu Pro
290 295 300
Asn Ile Glu Val Arg Leu Ser Asp Gly Thr Met Ala Arg Lys Leu Arg
305 310 315 320
Tyr Thr His His Asp Arg Lys Asn Gly Arg Ser Ser Ser Ser Gly Ala Leu
325 330 335
Arg Gly Val Cys Ser Cys Val Glu Ala Gly Lys Ala Cys Asp Pro Ala
340 345 350
Ala Arg Gln Phe Asn Thr Leu Ile Pro Trp Cys Leu Pro His Thr Gly
355 360 365
Asn Arg His Asn His Trp Ala Gly Leu Tyr Gly Arg Leu Glu Trp Asp
370 375 380
Gly Phe Phe Ser Thr Thr Val Thr Asn Pro Glu Pro Met Gly Lys Gln
385 390 395 400
Gly Arg Val Leu His Pro Glu Gln His Arg Val Val Ser Val Arg Glu
405 410 415
Cys Ala Arg Ser Gln Gly Phe Pro Asp Thr Tyr Arg Leu Phe Gly Asn
420 425 430
Ile Leu Asp Lys His Arg Gln Val Gly Asn Ala Val Pro Pro Pro Pro Leu
435 440 445
Ala Lys Ala Ile Gly Leu Glu Ile Lys Leu Cys Met Leu Ala Lys Ala
450 455 460
Arg Glu Ser Ala Ser Ala Lys Ile Lys Glu Glu Glu Ala Ala Lys Asp
465 470 475 480
<![CDATA[ <210> 21]]>
<![CDATA[ <211> 457]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 21]]>
Glu Trp Gly Pro Phe Asp Leu Val Ile Gly Gly Ser Pro Cys Asn Asp
1 5 10 15
Leu Ser Ile Val Asn Pro Ala Arg Lys Gly Leu Tyr Glu Gly Thr Gly
20 25 30
Arg Leu Phe Phe Glu Phe Tyr Arg Leu Leu His Asp Ala Arg Pro Lys
35 40 45
Glu Gly Asp Asp Arg Pro Phe Phe Trp Leu Phe Glu Asn Val Val Ala
50 55 60
Met Gly Val Ser Asp Lys Arg Asp Ile Ser Arg Phe Leu Glu Ser Asn
65 70 75 80
Pro Val Met Ile Asp Ala Lys Glu Val Ser Ala Ala His Arg Ala Arg
85 90 95
Tyr Phe Trp Gly Asn Leu Pro Gly Met Asn Arg Pro Leu Ala Ser Thr
100 105 110
Val Asn Asp Lys Leu Glu Leu Gln Glu Cys Leu Glu His Gly Arg Ile
115 120 125
Ala Lys Phe Ser Lys Val Arg Thr Ile Thr Thr Arg Ser Asn Ser Ile
130 135 140
Lys Gln Gly Lys Asp Gln His Phe Pro Val Phe Met Asn Glu Lys Glu
145 150 155 160
Asp Ile Leu Trp Cys Thr Glu Met Glu Arg Val Phe Gly Phe Pro Val
165 170 175
His Tyr Thr Asp Val Ser Asn Met Ser Arg Leu Ala Arg Gln Arg Leu
180 185 190
Leu Gly Arg Ser Trp Ser Val Pro Val Ile Arg His Leu Phe Ala Pro
195 200 205
Leu Lys Glu Tyr Phe Ala Cys Val Ser Ser Gly Asn Ser Asn Ala Asn
210 215 220
Ser Arg Gly Pro Ser Phe Ser Ser Ser Gly Leu Val Pro Leu Ser Leu Arg
225 230 235 240
Gly Ser His Met Asn Pro Leu Glu Met Phe Glu Thr Val Pro Val Trp
245 250 255
Arg Arg Gln Pro Val Arg Val Leu Ser Leu Phe Glu Asp Ile Lys Lys
260 265 270
Glu Leu Thr Ser Leu Gly Phe Leu Glu Ser Gly Ser Asp Pro Gly Gln
275 280 285
Leu Lys His Val Val Asp Val Thr Asp Thr Val Arg Lys Asp Val Glu
290 295 300
Glu Trp Gly Pro Phe Asp Leu Val Tyr Gly Ala Thr Pro Pro Leu Gly
305 310 315 320
His Thr Cys Asp Arg Pro Pro Ser Trp Tyr Leu Phe Gln Phe His Arg
325 330 335
Leu Leu Gln Tyr Ala Arg Pro Lys Pro Gly Ser Pro Arg Pro Phe Phe
340 345 350
Trp Met Phe Val Asp Asn Leu Val Leu Asn Lys Glu Asp Leu Asp Val
355 360 365
Ala Ser Arg Phe Leu Glu Met Glu Pro Val Thr Ile Pro Asp Val His
370 375 380
Gly Gly Ser Leu Gln Asn Ala Val Arg Val Trp Ser Asn Ile Pro Ala
385 390 395 400
Ile Arg Ser Arg His Trp Ala Leu Val Ser Glu Glu Glu Leu Ser Leu
405 410 415
Leu Ala Gln Asn Lys Gln Ser Ser Lys Leu Ala Ala Lys Trp Pro Thr
420 425 430
Lys Leu Val Lys Asn Cys Phe Leu Pro Leu Arg Glu Tyr Phe Lys Tyr
435 440 445
Phe Ser Thr Glu Leu Thr Ser Ser Leu
450 455
<![CDATA[ <210> 22]]>
<![CDATA[ <211> 367]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 22]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Thr Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Thr Lys Asn Ser Leu Thr Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Gln Ser Gly Asp Leu Arg Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Ala
115 120 125
Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Asp Ser Gly Asn Leu Arg Val His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Thr Ser Gly Glu Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[ <210> 23]]>
<![CDATA[ <211> 367]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 23]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Ser Pro Ala Asp Leu Thr Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp Asn Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Ser Ser
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser His Ser Leu Thr Glu His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Asn Asp Ala Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Asn Asp Ala Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[ <210> 24]]>
<![CDATA[ <211> 367]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 24]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Pro Gly His Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[ <210> 25]]>
<![CDATA[ <211> 367]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 25]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser His Thr Gly His Leu Leu Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Lys Lys Asp Leu Thr Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp
115 120 125
Leu Arg Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ser Asp Lys Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Thr Asp Thr Leu Arg Asp His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[ <210> 26]]>
<![CDATA[ <211> 367]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 26]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser His Thr Gly His Leu Leu Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Thr Ser Gly Asn Leu Thr Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Thr Ser Gly Asn Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His
115 120 125
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Thr His Leu Asp Leu Ile Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[ <210> 27]]>
<![CDATA[ <211> 367]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 27]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro Gly Ala Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Asn His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His
115 120 125
Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr Asn His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[ <210> 28]]>
<![CDATA[ <211> 661]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 28]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Ala Leu Thr Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Pro Gly His Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly His
115 120 125
Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Thr Lys Asn Ser Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp
645 650 655
Val Pro Asp Tyr Ala
660
<![CDATA[ <210> 29]]>
<![CDATA[ <211> 661]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 29]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Thr Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Arg Asp Glu Leu Asn Val His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Asn His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Ser Pro Ala Asp
115 120 125
Leu Thr Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr Asn His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Ser Lys Lys Ala Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Thr His Leu Asp Leu Ile Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp
645 650 655
Val Pro Asp Tyr Ala
660
<![CDATA[ <210> 30]]>
<![CDATA[ <211> 661]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 30]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asp Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn
115 120 125
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp
645 650 655
Val Pro Asp Tyr Ala
660
<![CDATA[ <210> 31]]>
<![CDATA[ <211> 661]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 31]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Ser Gly His Leu Thr Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro Gly His
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Thr Leu Thr Glu His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Thr His Leu Asp Leu Ile Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp
645 650 655
Val Pro Asp Tyr Ala
660
<![CDATA[ <210> 32]]>
<![CDATA[ <211> 661]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 32]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser His Thr Gly His Leu Leu Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Ser Lys Lys Ala Leu Thr Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser His Thr Gly His
115 120 125
Leu Leu Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Ala Leu Thr Glu His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Ser Gly Asn Leu Arg Val His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp
645 650 655
Val Pro Asp Tyr Ala
660
<![CDATA[ <210> 33]]>
<![CDATA[ <211> 661]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 33]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Ser Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Ser Asp His Leu Thr Asn His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Gln Leu Ala His Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Asn
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asn Leu Val Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ser Asp Glu Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Gln Leu Ala His Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp
645 650 655
Val Pro Asp Tyr Ala
660
<![CDATA[ <210> 34]]>
<![CDATA[ <211> 1528]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 34]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val
20 25 30
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
35 40 45
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
50 55 60
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
65 70 75 80
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
85 90 95
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
100 105 110
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
115 120 125
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
130 135 140
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
145 150 155 160
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
165 170 175
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
180 185 190
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
195 200 205
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
210 215 220
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
225 230 235 240
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
245 250 255
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
260 265 270
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
275 280 285
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
290 295 300
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
305 310 315 320
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
325 330 335
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
340 345 350
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
355 360 365
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
370 375 380
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
385 390 395 400
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
405 410 415
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
420 425 430
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
435 440 445
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
450 455 460
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
465 470 475 480
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
485 490 495
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
500 505 510
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
515 520 525
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
530 535 540
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
545 550 555 560
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
565 570 575
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
580 585 590
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
595 600 605
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
610 615 620
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
625 630 635 640
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
645 650 655
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
660 665 670
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
675 680 685
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
690 695 700
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
705 710 715 720
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
725 730 735
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
740 745 750
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
755 760 765
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
770 775 780
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
785 790 795 800
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
805 810 815
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
820 825 830
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
835 840 845
Leu Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys
850 855 860
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Ala Arg
865 870 875 880
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
885 890 895
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
900 905 910
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
915 920 925
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
930 935 940
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
945 950 955 960
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
965 970 975
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
980 985 990
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
995 1000 1005
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu
1010 1015 1020
Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile
1025 1030 1035
Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe
1040 1045 1050
Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu
1055 1060 1065
Ala Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly
1070 1075 1080
Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr
1085 1090 1095
Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys
1100 1105 1110
Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro
1115 1120 1125
Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp
1130 1135 1140
Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser
1145 1150 1155
Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu
1160 1165 1170
Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser
1175 1180 1185
Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr
1190 1195 1200
Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser
1205 1210 1215
Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala
1220 1225 1230
Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr
1235 1240 1245
Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly
1250 1255 1260
Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His
1265 1270 1275
Lys His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser
1280 1285 1290
Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser
1295 1300 1305
Ala Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu
1310 1315 1320
Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala
1325 1330 1335
Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr
1340 1345 1350
Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile
1355 1360 1365
Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly
1370 1375 1380
Asp Lys Arg Pro Ala Ala Thr Lys Lys Lys Ala Gly Gln Ala Lys Lys
1385 1390 1395
Lys Lys Ala Ser Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr
1400 1405 1410
Leu Val Thr Phe Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Glu
1415 1420 1425
Trp Lys Leu Leu Asp Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val
1430 1435 1440
Met Leu Glu Asn Tyr Lys Asn Leu Val Ser Leu Gly Tyr Gln Leu
1445 1450 1455
Thr Lys Pro Asp Val Ile Leu Arg Leu Glu Lys Gly Glu Glu Pro
1460 1465 1470
Trp Leu Val Glu Arg Glu Ile His Gln Glu Thr His Pro Asp Ser
1475 1480 1485
Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Ser Gly Gly Lys Arg
1490 1495 1500
Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly
1505 1510 1515
Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1520 1525
<![CDATA[ <210> 35]]>
<![CDATA[ <211> 1820]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 35]]> Met Ala Pro Lys Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala 1 5 10 15 Ala Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val 20 25 30 Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe 35 40 45 Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile 50 55 60 Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu 65 70 75 80 Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys 85 90 95 Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser 100 105 110 Phe Phe His Arg Leu Glu Glu Ser PHE Leu Val Glu Glu Glu ASP LYS LYS LYS LYS LYS LYS LYS LYS 115 120 HIS GLU ARG HIS Pro Ile PHE GLE Val ASP GLU Val Ala Tyr 130 135 His Glu Lys Tyr His Leu ARG LYS LYS LYS LYS LYS LYS Leu Val ASP 145 150 155 160 Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His 165 170 175 Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro 180 185 190 Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr 195 200 205 Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala 210 215 220 Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn 225 230 235 240 Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn 245 250 255 Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe 260 265 270 Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp 275 280 285 Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp 290 295 300 Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp 305 310 315 320 Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser 325 330 335 Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys 340 345 350 Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe 355 360 365 Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser 370 375 380 Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp 385 390 395 400 Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu ASP Leu Leu ARG 410 415 LYS GLN ARG THR PHR PHE Asn Gly Serle Pro His Gln Ile His Leu 420 425 430 GLU HIS ALA ILE Leu ARG Gln Gln Glu Asr Pro PHE 4 35 440 445 leu lys asp asn arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile 450 455 460 Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp 465 470 475 480 Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu 485 490 495 Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr 500 505 510 Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser 515 520 525 Leu Leu Tyr Glu Tyr Phe Thr Val Ty r Asn Glu Leu Thr Lys Val Lys 530 535 540 Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln 545 550 555 560 Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr 565 570 575 Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp 580 585 590 Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly 595 600 605 Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp 610 615 620 Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr 625 630 635 640 Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala 645 650 655 His Leu Phe Asp A sp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr 660 665 670 Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp 675 680 685 Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe 690 695 700 Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe 705 710 715 720 Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu 725 730 735 His Glu His Ile Ala Asn Leu A la Gly Ser Pro Ala Ile Lys Lys Gly 740 745 750 Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly 755 760 765 Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln 770 775 780 Thr Thr GLN LYS GLY GLN LSN Serg Glu ARG MET LYS ARG ILE 785 795 800 GLU GLY ILE LYS GLU Leu GLN ILE Leu Ly Glu His Pro 805 Val Gl Gln Leu GLN Asn Glu LYS Leu Tyr Leu Tyr Tyr Leu 820 825 830 Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg 835 840 845 Leu Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys 850 855 860 Asp Asp Ser Ile Asp Asn LYS VAL Leu Thr ASP LYS ALA ALA ALA ARG 865 870 880 Gly LYS Sern Val Pro Serg Val Val Val Lys Met Lys 885 895 ARG Gln Leu Leu asn Ala LYS Leu Ile Thr Gln ARG LYS 900 905 910 Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp 915 920 925 Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr 930 935 940 Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp 945 950 955 960 Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser 965 970 975 Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg 980 985 990 G lu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val 995 1000 1005 Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu 1010 1015 1020 Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys M et Ile 1025 1030 1035 Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe 1040 1045 1050 Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu 1055 1060 1065 Ala Asn Gly Glu Ile Ar g Lys Arg Pro Leu Ile Glu Thr Asn Gly 1070 1075 1080 Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr 1085 1090 1095 Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys 1100 1105 1110 Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro 1115 1120 1125 Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp 1130 1135 1140 Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser 1145 1150 1155 Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu 1160 1165 1170 Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser 1175 1180 1185 Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr 1190 1 195 1200 Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser 1205 1210 1215 Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala 1220 1225 1230 Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro S er Lys Tyr 1235 1240 1245 Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly 1250 1255 1260 Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His 1265 1270 1275 Lys His Ty r Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser 1280 1285 1290 Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser 1295 1300 1305 Ala Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu 1310 1315 1320 Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala 1325 1330 1335 Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr 1340 1345 1350 Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile 1355 136 0 1365 Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly 1370 1375 1380 Asp Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys 1385 1390 1395 Lys Lys Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Lys Leu 1400 1405 1410 Arg Val Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala 1415 1420 1425 Leu Glu Lys Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala 1430 1435 1440 Glu Trp Tyr Val Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn 1445 1450 1455 Asn Phe His Thr Lys Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu 1460 1465 1470 Met Ile Asp Tyr Leu Glu Asn Lys Thr Leu Ser Trp Asn Ser Lys 1475 1480 1485 As n Pro Val Ser Asn Gly Tyr Trp Lys Arg Lys Lys Asp Asp Glu 1490 1495 1500 Leu Lys Ile Ile Tyr Asn Ala Ile Lys Leu Ser Glu Lys Glu Gly 1505 1510 1515 Asn Ile Phe Asp Ile Arg Asp Leu Tyr Lys Arg Thr Leu Lys Asn 1520 1525 1530 Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys Gln Asp Leu Ser Gln 1535 1540 1545 Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser Gly Thr Arg Ser 1550 1555 1560 Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser Thr Glu Lys 15 65 1570 1575 Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly Ala Leu 1580 1585 1590 Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln Lys 1595 1600 1605 Leu Glu Ser Leu Gly Tyr Gln Asn S er Ile Glu Val Leu Asn Ala 1610 1615 1620 Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile 1625 1630 1635 Ser Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys 1640 1645 1650 Pro Lys Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys 1655 1660 1665 Asp Ile Leu Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys 1670 1675 1680 Lys Thr Lys Ser Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser 1685 1690 16 95 Lys Phe Asn Ser Glu Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly 1700 1705 1710 Pro Thr Leu Thr Ala Ser Gly Ala Asn Ser Arg Ile Lys Ile Lys 1715 1720 1725 Asp Gly Ser Asn Ile Arg Lys Met Asn Ser Asp Glu Thr Phe Leu 1730 173 5 1740 Tyr Ile Gly Phe Asp Ser Gln Asp Gly Lys Arg Val Asn Glu Ile 1745 1750 1755 Glu Phe Leu Thr Glu Asn Gln Lys Ile Phe Val Cys Gly Asn Ser 1760 1765 1770 Ile Ser Val Glu Val Leu Glu Ala Ile Ile Asp Lys Ile Gly G ly 1775 1780 1785 Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly 1790 1795 1800 Gln Ala Lys Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp 1805 1810 1815 Tyr Ala 1820 <![CDATA[ <210> 36]]>
<![CDATA[ <211> 1922]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 36]]> Met Ala Pro Lys Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala 1 5 10 15 Ala Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val 20 25 30 Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe 35 40 45 Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile 50 55 60 Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu 65 70 75 80 Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys 85 90 95 Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser 100 105 110 Phe Phe His Arg Leu Glu Glu Ser PHE Leu Val Glu Glu Glu ASP LYS LYS LYS LYS LYS LYS LYS LYS 115 120 HIS GLU ARG HIS Pro Ile PHE GLE Val ASP GLU Val Ala Tyr 130 135 His Glu Lys Tyr His Leu ARG LYS LYS LYS LYS LYS LYS Leu Val ASP 145 150 155 160 Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His 165 170 175 Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro 180 185 190 Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr 195 200 205 Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala 210 215 220 Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn 225 230 235 240 Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn 245 250 255 Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe 260 265 270 Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp 275 280 285 Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp 290 295 300 Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp 305 310 315 320 Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser 325 330 335 Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys 340 345 350 Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe 355 360 365 Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser 370 375 380 Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp 385 390 395 400 Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu ASP Leu Leu ARG 410 415 LYS GLN ARG THR PHR PHE Asn Gly Serle Pro His Gln Ile His Leu 420 425 430 GLU HIS ALA ILE Leu ARG Gln Gln Glu Asr Pro PHE 4 35 440 445 leu lys asp asn arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile 450 455 460 Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp 465 470 475 480 Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu 485 490 495 Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr 500 505 510 Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser 515 520 525 Leu Leu Tyr Glu Tyr Phe Thr Val Ty r Asn Glu Leu Thr Lys Val Lys 530 535 540 Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln 545 550 555 560 Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr 565 570 575 Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp 580 585 590 Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly 595 600 605 Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp 610 615 620 Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr 625 630 635 640 Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala 645 650 655 His Leu Phe Asp A sp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr 660 665 670 Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp 675 680 685 Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe 690 695 700 Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe 705 710 715 720 Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu 725 730 735 His Glu His Ile Ala Asn Leu A la Gly Ser Pro Ala Ile Lys Lys Gly 740 745 750 Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly 755 760 765 Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln 770 775 780 Thr Thr GLN LYS GLY GLN LSN Serg Glu ARG MET LYS ARG ILE 785 795 800 GLU GLY ILE LYS GLU Leu GLN ILE Leu Ly Glu His Pro 805 Val Gl Gln Leu GLN Asn Glu LYS Leu Tyr Leu Tyr Tyr Leu 820 825 830 Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg 835 840 845 Leu Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys 850 855 860 Asp Asp Ser Ile Asp Asn LYS VAL Leu Thr ASP LYS ALA ALA ALA ARG 865 870 880 Gly LYS Sern Val Pro Serg Val Val Val Lys Met Lys 885 895 ARG Gln Leu Leu asn Ala LYS Leu Ile Thr Gln ARG LYS 900 905 910 Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp 915 920 925 Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr 930 935 940 Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp 945 950 955 960 Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser 965 970 975 Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg 980 985 990 G lu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val 995 1000 1005 Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu 1010 1015 1020 Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys M et Ile 1025 1030 1035 Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe 1040 1045 1050 Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu 1055 1060 1065 Ala Asn Gly Glu Ile Ar g Lys Arg Pro Leu Ile Glu Thr Asn Gly 1070 1075 1080 Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr 1085 1090 1095 Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys 1100 1105 1110 Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro 1115 1120 1125 Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp 1130 1135 1140 Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser 1145 1150 1155 Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu 1160 1165 1170 Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser 1175 1180 1185 Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr 1190 1 195 1200 Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser 1205 1210 1215 Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala 1220 1225 1230 Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro S er Lys Tyr 1235 1240 1245 Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly 1250 1255 1260 Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His 1265 1270 1275 Lys His Ty r Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser 1280 1285 1290 Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser 1295 1300 1305 Ala Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu 1310 1315 1320 Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala 1325 1330 1335 Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr 1340 1345 1350 Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile 1355 136 0 1365 Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly 1370 1375 1380 Asp Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln 1385 1390 1395 Ala Lys Lys Lys Lys Ser Gly Gly Gly Gly Ser Val Asp Leu Arg 1400 1405 1410 Thr Leu Asp Val Phe Ser Gly Cys Gly Gly Leu Ser Glu Gly Phe 1415 1420 1425 His Gln Ala Gly Ile Ser Asp Thr Leu Trp Ala Ile Glu Met Trp 1430 1435 1440 Asp Pro Ala Ala Gln Ala Phe Arg Leu Asn Asn Pro Gly Ser Thr 1445 1450 1455 Val Phe Thr Glu Asp Cys Asn Ile Leu Leu Lys Leu Val Met Ala 1460 1465 1470 Gly Glu Thr Asn Ser Arg Gly Gln Arg Leu Pro Gln Lys Gly 1475 1480 1485 Asp Val Glu M et Leu Cys Gly Gly Pro Pro Cys Gln Gly Phe Ser 1490 1495 1500 Gly Met Asn Arg Phe Asn Ser Arg Thr Tyr Ser Lys Phe Lys Asn 1505 1510 1515 Ser Leu Val Val Ser Phe Leu Ser Tyr Cys Asp Tyr Tyr Arg Pro 1520 1525 1530 Arg Phe Phe Leu Leu Glu Asn Val Arg Asn Phe Val Ser Phe Lys 1535 1540 1545 Arg Ser Met Val Leu Lys Leu Thr Leu Arg Cys Leu Val Arg Met 1550 1555 1560 Gly Tyr Gln Cys Thr Phe Gly Val Leu Gln Ala Gly Gln Tyr Gly 1565 1570 1575 Val Ala Gln Thr Arg Arg Arg Ala Ile Ile Ile Leu Ala Ala Ala Pro 1580 1585 1590 Gly Glu Lys Leu Pro Leu Phe Pro Glu Pro Leu His Val Phe Ala 1595 1600 1605 Pro Arg Ala Cys Gln Leu Ser Val Val Val Asp Asp Lys Lys Phe 1610 1615 1620 Val Ser Asn Ile Thr Arg Leu Ser Ser Gly Pro Phe Arg Thr Ile 1625 1630 1635 Thr Val Arg Asp Thr Met Ser Asp Leu Pro Glu Val Arg Asn Gly 1640 1645 1650 Ala Ser Ala Leu Glu Ile S er Tyr Asn Gly Glu Pro Gln Ser Trp 1655 1660 1665 Phe Gln Arg Gln Leu Arg Gly Ala Gln Tyr Gln Pro Ile Leu Arg 1670 1675 1680 Asp His Ile Cys Lys Asp Met Ser Ala Leu Val Ala Ala Arg Met 1685 1690 1695 Arg His Ile Pro Leu Ala Pro Gly Ser Asp Trp Arg Asp Leu Pro 1700 1705 1710 Asn Ile Glu Val Arg Leu Ser Asp Gly Thr Met Ala Arg Lys Leu 1715 1720 1725 Arg Tyr Thr His His Asp Arg Lys Asn Gly Arg Ser Ser Ser Ser Gly 1730 1735 1740 Ala. Leu ARG GLY VAL CYS VAL GLU Ala Gly Lys Ala Cys 1745 1755 ASP Pro Ala Ala ARG Gln PHR Leu Ile Pro TRP CYS Leu 1760 1770 PRO His Thr Gly ass n aRG HIS Asn His Tyr Gly Leu Tyr Gly 1775 1780 1785 Arg Leu Glu Trp Asp Gly Phe Phe Ser Thr Thr Val Thr Asn Pro 1790 1795 1800 Glu Pro Met Gly Lys Gln Gly Arg Val Leu His Pro Glu Gln His 1805 1810 1815 Arg Val Val Ser Val Arg Glu Cys Ala Arg Ser Gln Gly Phe Pro 1820 1825 1830 Asp Thr Tyr Arg Leu Phe Gly Asn Ile Leu Asp Lys His Arg Gln 1835 1840 1845 Val Gly Asn Ala Val Pro Pro Pro Leu Ala Lys Ala Ile Gly Leu 1850 1855 1860 Glu Ile Lys Leu Cys Met Leu Ala Lys Ala Arg Glu Ser Ala Ser 1865 1870 1875 Ala Lys Ile Lys Glu Glu Glu Ala Ala Lys Asp Gly Gly Gly Gly 1880 1885 1890 Ser Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys 1895 1900 1905 L ys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 1910 1915 1920 <![CDATA[ <210> 37]]>
<![CDATA[ <211> 1986]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 37]]> Met Ala Pro Lys Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala 1 5 10 15 Ala Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val 20 25 30 Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe 35 40 45 Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile 50 55 60 Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu 65 70 75 80 Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys 85 90 95 Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser 100 105 110 Phe Phe His Arg Leu Glu Glu Ser PHE Leu Val Glu Glu Glu ASP LYS LYS LYS LYS LYS LYS LYS LYS 115 120 HIS GLU ARG HIS Pro Ile PHE GLE Val ASP GLU Val Ala Tyr 130 135 His Glu Lys Tyr His Leu ARG LYS LYS LYS LYS LYS LYS Leu Val ASP 145 150 155 160 Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His 165 170 175 Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro 180 185 190 Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr 195 200 205 Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala 210 215 220 Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn 225 230 235 240 Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn 245 250 255 Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe 260 265 270 Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp 275 280 285 Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp 290 295 300 Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp 305 310 315 320 Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser 325 330 335 Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys 340 345 350 Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe 355 360 365 Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser 370 375 380 Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp 385 390 395 400 Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu ASP Leu Leu ARG 410 415 LYS GLN ARG THR PHR PHE Asn Gly Serle Pro His Gln Ile His Leu 420 425 430 GLU HIS ALA ILE Leu ARG Gln Gln Glu Asr Pro PHE 4 35 440 445 leu lys asp asn arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile 450 455 460 Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp 465 470 475 480 Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu 485 490 495 Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr 500 505 510 Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser 515 520 525 Leu Leu Tyr Glu Tyr Phe Thr Val Ty r Asn Glu Leu Thr Lys Val Lys 530 535 540 Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln 545 550 555 560 Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr 565 570 575 Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp 580 585 590 Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly 595 600 605 Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp 610 615 620 Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr 625 630 635 640 Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala 645 650 655 His Leu Phe Asp A sp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr 660 665 670 Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp 675 680 685 Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe 690 695 700 Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe 705 710 715 720 Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu 725 730 735 His Glu His Ile Ala Asn Leu A la Gly Ser Pro Ala Ile Lys Lys Gly 740 745 750 Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly 755 760 765 Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln 770 775 780 Thr Thr GLN LYS GLY GLN LSN Serg Glu ARG MET LYS ARG ILE 785 795 800 GLU GLY ILE LYS GLU Leu GLN ILE Leu Ly Glu His Pro 805 Val Gl Gln Leu GLN Asn Glu LYS Leu Tyr Leu Tyr Tyr Leu 820 825 830 Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg 835 840 845 Leu Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys 850 855 860 Asp Asp Ser Ile Asp Asn LYS VAL Leu Thr ASP LYS ALA ALA ALA ARG 865 870 880 Gly LYS Sern Val Pro Serg Val Val Val Lys Met Lys 885 895 ARG Gln Leu Leu asn Ala LYS Leu Ile Thr Gln ARG LYS 900 905 910 Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp 915 920 925 Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr 930 935 940 Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp 945 950 955 960 Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser 965 970 975 Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg 980 985 990 G lu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val 995 1000 1005 Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu 1010 1015 1020 Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys M et Ile 1025 1030 1035 Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe 1040 1045 1050 Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu 1055 1060 1065 Ala Asn Gly Glu Ile Ar g Lys Arg Pro Leu Ile Glu Thr Asn Gly 1070 1075 1080 Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr 1085 1090 1095 Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys 1100 1105 1110 Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro 1115 1120 1125 Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp 1130 1135 1140 Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser 1145 1150 1155 Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu 1160 1165 1170 Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser 1175 1180 1185 Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr 1190 1 195 1200 Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser 1205 1210 1215 Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala 1220 1225 1230 Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro S er Lys Tyr 1235 1240 1245 Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly 1250 1255 1260 Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His 1265 1270 1275 Lys His Ty r Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser 1280 1285 1290 Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser 1295 1300 1305 Ala Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu 1310 1315 1320 Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala 1325 1330 1335 Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr 1340 1345 1350 Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile 1355 136 0 1365 Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly 1370 1375 1380 Asp Ser Ala Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 1385 1390 1395 Gly Gly Ser Gly Pro Lys Lys Lys Lys Arg Lys Val Ala Ala Ala Gly 1400 1405 1410 Ser Asn His Asp Gln Glu Phe Asp Pro Pro Lys Val Tyr Pro Pro 1415 1420 1425 Val Pro Ala Glu Lys Arg Lys Pro Ile Arg Val Leu Ser Leu Phe 1430 1435 1440 Asp Gly Ile Ala Thr Gly Leu Leu Val Leu Lys Asp Leu Gly Ile 1445 1450 1455 Gln Val Asp Arg Tyr Ile Ala Ser Glu Val Cys Glu Asp Ser Ile 1460 1465 1470 Thr Val Gly Met Val Arg His Gln Gly Lys Ile Met Tyr Val Gly 1475 1480 1485 Asp Val Ar g Ser Val Thr Gln Lys His Ile Gln Glu Trp Gly Pro 1490 1495 1500 Phe Asp Leu Val Ile Gly Gly Ser Pro Cys Asn Asp Leu Ser Ile 1505 1510 1515 Val Asn Pro Ala Arg Lys Gly Leu Tyr Glu Gly Thr Gly Arg Leu 1520 1525 153 0 Phe Phe Glu Phe Tyr Arg Leu Leu His Asp Ala Arg Pro Lys Glu 1535 1540 1545 Gly Asp Asp Arg Pro Phe Phe Trp Leu Phe Glu Asn Val Val Ala 1550 1555 1560 Met Gly Val Ser Asp Lys Arg Asp Ile Ser Arg Phe Leu Glu Ser 1565 1 570 1575 Asn Pro Val Met Ile Asp Ala Lys Glu Val Ser Ala Ala His Arg 1580 1585 1590 Ala Arg Tyr Phe Trp Gly Asn Leu Pro Gly Met Asn Arg Pro Leu 1595 1600 1605 Ala Ser Thr Val Asn Asp Lys Leu Glu Leu Gln G lu Cys Leu Glu 1610 1615 1620 His Gly Arg Ile Ala Lys Phe Ser Lys Val Arg Thr Ile Thr Thr 1625 1630 1635 Arg Ser Asn Ser Ile Lys Gln Gly Lys Asp Gln His Phe Pro Val 1640 1645 1650 Phe Met Asn Glu Lys Glu Asp Ile Leu Trp Cys Thr Glu Met Glu 1655 1660 1665 Arg Val Phe Gly Phe Pro Val His Tyr Thr Asp Val Ser Asn Met 1670 1675 1680 Ser Arg Leu Ala Arg Gln Arg Leu Leu Gly Arg Ser Trp Ser Val 1685 1690 1695 Pro Val Ile Arg His Leu Phe Ala Pro Leu Lys Glu Tyr Phe Ala 1700 1705 1710 Cys Val Ser Ser Gly Asn Ser Asn Ala Asn Ser Arg Gly Pro Ser 1715 1720 1725 Phe Ser Ser Gly Leu Val Pro Leu Ser Leu Arg Gly Ser His Met 1730 1735 1740 Asn Pro Leu Glu Met Phe Glu Thr Val Pro Val Trp Arg Arg Gln 1745 1750 1755 Pro Val Arg Val Leu Ser Leu Phe Glu Asp Ile Lys Lys Glu Leu 1760 1765 1770 Thr Ser Leu Gly Phe Leu Glu Ser Gly Ser Asp Pro Gly Gln Leu 1775 1780 1785 Lys His Val Asp Val Thr Asp Thr Val Arg Lys Asp Val Glu 1790 1795 1800 Glu Trp Gly Pro Phe Asp Leu Val Tyr Gly Ala Thr Pro Pro Leu 1805 1810 1815 Gly His Thr Cys Asp Arg Pro Pro Ser Tr p Tyr Leu Phe Gln Phe 1820 1825 1830 His Arg Leu Leu Gln Tyr Ala Arg Pro Lys Pro Gly Ser Pro Arg 1835 1840 1845 Pro Phe Phe Trp Met Phe Val Asp Asn Leu Val Leu Asn Lys Glu 1850 1855 1860 Asp Leu Asp Val Ala Ser Arg Phe Leu Glu Met Glu Pro Val Thr 1865 1870 1875 Ile Pro Asp Val His Gly Gly Ser Leu Gln Asn Ala Val Arg Val 1880 1885 1890 Trp Ser Asn Ile Pro Ala Ile Arg Ser Arg His Trp Ala Leu Val 1895 1900 190 5 Ser Glu Glu Glu Leu Ser Leu Leu Ala Gln Asn Lys Gln Ser Ser 1910 1915 1920 Lys Leu Ala Ala Lys Trp Pro Thr Lys Leu Val Lys Asn Cys Phe 1925 1930 1935 Leu Pro Leu Arg Glu Tyr Phe Lys Tyr Phe Ser Thr Glu Leu Thr 19 40 1945 1950 Ser Ser Leu Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala 1955 1960 1965 Gly Gln Ala Lys Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro 1970 1975 1980 Asp Tyr Ala 1985 <![CDATA[ <210> 38]]>
<![CDATA[ <211> 627]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 38]]>
ctggagcccg gcgagaaacc ctacaagtgc cccgagtgcg gcaaatcctt ctctagaagc 60
gacaaactga ccgaacatca gaggacccac accggcgaga agccttataa gtgtcccgaa 120
tgcggcaaat ccttcagcac caagaactct ctgacagaac accagagaac acataccgga 180
gagaaacctt ataaatgccc cgagtgcggc aagtccttct cccagtccgg cgatctgagg 240
agacaccaaa gaacacatac cggcgaaaag ccttacaagt gccccgagtg tggaaagagc 300
ttctccacca ccggcgctct gaccgagcac cagagaacac acaccggcga gaaaccctat 360
aaatgtcccg agtgtggcaa atccttcagc gacagcggca atctgagagt gcaccaaaga 420
acccataccg gcgaaaaacc ctacaaatgc cccgagtgcg gcaaatcctt cagccagagg 480
gcccatctgg agaggcacca aaggacacac accggagaaa agccctacaa gtgtcccgag 540
tgtggaaaaa gctttagcac aagcggcgag ctggtgaggc atcaaaggac ccacaccggc 600
gaaaagccca ccggcaaaaa gaccagc 627
<![CDATA[ <210> 39]]>
<![CDATA[ <211> 627]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial]]> sequence
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 39]]>
ctggagcccg gcgagaagcc ctacaagtgc cccgagtgcg gaaagtcctt cagctccccc 60
gccgatctga caagacatca gagaacccat accggcgaga aaccttacaa atgccccgaa 120
tgtggcaagt cctttagcga tcccggacat ctggtgaggc accagaggac acaccaccggc 180
gaaaagccct ataaatgtcc cgagtgtgga aagagctttt ctagaagcga caatctcgtg 240
agacaccaga gaacccacac cggagagaag ccttacaagt gccccgagtg cggcaaatcc 300
ttcagccaga gctcctctct ggtgaggcac caaaggaccc acaccggcga gaaaccttat 360
aagtgtcccg agtgtggcaa aagcttcagc acctcccact ctctgaccga gcatcaaaga 420
accccacaccg gcgaaaaacc ttataaatgc cccgagtgtg gcaaatcctt cagcagaaat 480
gacgctctga cagagcacca aagaacacat accggagaaa agccctaca atgccccgag 540
tgtggaaaat ccttttctag aaacgatgct ctgaccgaac accaaagaac acacaccggc 600
gaaaagccta ccggaaaaaa gaccagc 627
<![CDATA[ <210> 40]]>
<![CDATA[ <211> 627]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 40]]>
ctggagcccg gcgaaaaacc ttacaagtgc cccgagtgcg gaaagagctt cagcagaagc 60
gacaaactgg tgaggcatca aaggacacat accggagaga agccctataa gtgccccgaa 120
tgtggcaaat ccttttccca gagggctcat ctggaaagac accagaggac ccataccggc 180
gaaaaaccct acaaatgtcc cgagtgtgga aagagctttt ccgatcccgg ccatctggtc 240
agacatcaga ggacacatac cggcgaaaag ccttacaagt gtcccgaatg cggaaaatcc 300
ttctccagaa gcgacaagct ggtgaggcac caaagaaccc acaccggcga aaaaccctat 360
aaatgccccg agtgcggcaa gtcctttagc cagctggccc atctgagagc ccaccagaga 420
acacacaccg gagagaagcc ttataagtgt cccgagtgcg gaaagtcctt ctctagagcc 480
gacaatctga ccgaacatca aaggacacac accggcgaga aaccttataa atgccccgag 540
tgcggaaaaa gcttttccga ctgcagagat ctggctagac accagagaac ccacaccggc 600
gagaaaccca ccggcaaaaa gaccagc 627
<![CDATA[ <210> 41]]>
<![CDATA[ <211> 627]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 41]]>
ctggagcccg gcgaaaagcc ttataaatgt cccgaatgcg gcaagagctt tagccacacc 60
ggccatctgc tggaacacca aaggacccat accggcgaaa agccctataa gtgccccgag 120
tgtggcaaga gcttcagcac caccggcaat ctgacagtcc atcagaggac ccacaccgga 180
gagaaaccct ataaatgccc cgagtgtgga aagtccttct ccgacaagaa ggatctgaca 240
agacaccaga ggacccatac cggcgagaaa ccctacaaat gccccgagtg cggcaaatcc 300
ttctccccaga gcggcgatct gaggagacat caaagaacac ataccggcga aaaaccctat 360
aagtgccccg aatgcggcaa gtccttcagc cagagggccc atctggaaag gcatcagagg 420
acacacaccg gcgagaagcc ttacaaatgt cccgagtgcg gaaagagctt ctctagaagc 480
gacaagctga ccgagcatca gaggacccac accggagaaa aaccttacaa gtgccccgag 540
tgcggcaaaa gcttcagcag aaccgacaca ctgagagatc accaaaggac acacaccggc 600
gagaaaccca ccggcaaaaa gaccagc 627
<![CDATA[ <210> 42]]>
<![CDATA[ <211> 627]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 42]]>
ctggagcccg gcgagaagcc ttataagtgc cccgagtgtg gcaagagctt tagccacacc 60
ggccatctgc tggagcatca aaggacacac accggagaaa agccctataa gtgccccgag 120
tgtggcaaat ccttcagcac ctccggcaat ctcaccgaac accagagaac acacaccgga 180
gaaaaacctt acaaatgtcc cgagtgtgga aagagctttt ccaccagcgg caatctggtg 240
agacatcaaa gaacacatac cggcgaaaaa ccctataaat gccccgagtg tggaaaatcc 300
ttctcccaac tggcccatct gagggcccac cagaggacac ataccggaga aaaaccctac 360
aaatgccccg aatgcggaaa aagcttctcc gagagaagcc atctgagaga gcaccaaagg 420
acccataccg gagaaaagcc ttacaagtgt cccgagtgcg gaaaaagctt tagcgatccc 480
ggacatctgg tgagacatca gagaacccac accggcgaaa agccttataa atgtcccgaa 540
tgtggcaagt cctttagcac ccatctggat ctgattagac accaaagaac ccacaccggc 600
gagaaaccca ccggaaaaaa gaccagc 627
<![CDATA[ <210> 43]]>
<![CDATA[ <211> 627]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 43]]>
ctggagcccg gcgaaaagcc ttacaaatgt cccgagtgcg gaaagtcctt cagcgacccc 60
ggcgctctgg tgagacatca aagaacacat accggcgaga aaccttataa atgccccgaa 120
tgtggaaaat ccttcagcga aagaagccat ctgagggaac accagaggac ccacaccggc 180
gaaaaacctt ataagtgccc cgaatgcgga aaaagctttt ctagaagcga tcatctgacc 240
aaccatcaga gaacacacac cggcgaaaag ccctataaat gtcccgagtg tggcaaatcc 300
tttagcgaga ggtcccatct gagagagcac cagaggacac ataccggaga gaagccctac 360
aagtgccccg agtgtggcaa gagctttagc agaagcgacc atctgaccaa tcatcaaagg 420
accccacaccg gagagaagcc ttacaagtgt cccgagtgcg gaaagtcctt ttccgatccc 480
ggccacctcg tgaggcacca aagaacccat accggcgaga aaccctacaa atgccccgag 540
tgtggaaaga gcttctccag aagcgacaag ctggtgaggc atcagaggac acacaccggc 600
gaaaaaccca ccggcaagaa aaccagc 627
<![CDATA[ <210> 44]]>
<![CDATA[ <211> 627]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 44]]>
ctggagcccg gagagaagcc ctacaaatgc cccgagtgtg gaaagagctt ctctagaaat 60
gacgctctga cagaaccacca gaggaccccat accggcgaga aaccttacaa atgccccgag 120
tgcggaaaaa gctttagcga ttgcagagat ctggctagac atcagagaac acacaccggc 180
gagaagccct ataagtgccc cgaatgcggc aagagcttta gcgaccccgg ccatctggtg 240
agacatcaaa ggacacatac cggagaaaaa ccttacaagt gccccgagtg cggaaagtcc 300
ttctccccaga gcggccatct caccgagcat caaaggaccc acaccggcga aaagccttat 360
aaatgtcccg aatgtggcaa gtccttctct agagaggata atctgcacac ccatcagagg 420
accccacaccg gcgaaaagcc ttataaatgc cccgaatgtg gaaagtcctt ttccaccaag 480
aactctctga ccgagcatca gaggacacac accggagaga aaccctataa atgtcccgag 540
tgtggcaaga gcttcagcag agccgacaat ctgacagagc accaaagaac acataccggc 600
gaaaagccca ccggcaaaaa gaccagc 627
<![CDATA[ <210> 45]]>
<![CDATA[ <211> 627]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 45]]>
ctggagcccg gcgagaaacc ctacaagtgc cccgagtgtg gcaaatcctt ctctagatcc 60
gacaaactga ccgaacatca gaggacccat accggcgaaa aaccttataa atgtcccgag 120
tgcggaaagt ccttctctag aagggacgag ctgaacgtgc atcagagaac acataccggc 180
gagaagccct ataaatgccc cgaatgcggc aaaagcttct ctagaagcga tcatctgacc 240
aaccaccaga gaacccatac cggagaaaag ccttacaagt gtcccgaatg tggaaaatcc 300
ttcagctccc ccgccgatct gaccagacac caaaggaccc acaccggcga gaagccctat 360
aaatgccccg agtgcggcaa gagcttttcc agatccgacc atctgaccaa tcatcaaaga 420
accccacaccg gcgaaaagcc ttataaatgt cccgagtgcg gcaaatcctt ttccagcaag 480
aaggctctga ccgagcatca aaggacccat accggcgaga agccttacaa atgccccgag 540
tgtggaaagt cctttagcac ccatctggat ctgattagac accagaggac acacaccgga 600
gagaaaccca ccggcaaaaa gaccagc 627
<![CDATA[ <210> 46]]>
<![CDATA[ <211> 627]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 46]]>
ctggagcccg gcgagaaacc ttacaaatgc cccgagtgcg gcaagagctt cagcagaagc 60
gacgatctgg tgaggcacca aagaacccac accggcgaaa aaccttacaa gtgtcccgaa 120
tgcggaaagt ccttcagcag agaggacaat ctgcacaccc accagagaacacacaccgga 180
gaaaagcctt acaagtgccc cgaatgcggc aaatcctttt ctagaagcga tcatctgacc 240
accccaccaaa gaacacatac cggcgagaag ccttacaaat gtcccgagtg cggaaagtcc 300
ttctccccaga gagccaatct gagggctcat caaaggaccc ataccggcga aaagccctac 360
aaatgccccg agtgcggaaa atccttcagc cagctggccc atctgagagc ccaccaaagg 420
acacacaccg gagagaaacc ctataagtgc cccgagtgtg gaaaaagctt ttcccagagg 480
gccaatctga gggcccatca gaggaccccat accggagaga agccttataa atgtcccgag 540
tgcggaaaaa gcttcagcga gaggagccat ctgagggaac atcaaagaac ccacaccggc 600
gaaaaaccca ccggaaaaaa gaccagc 627
<![CDATA[ <210> 47]]>
<![CDATA[ <211> 627]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223]]>> Description of artificial sequences: synthetic polynucleotides]]>
<br/>
<br/> <![CDATA[ <400>47]]>
<br/> <![CDATA[ctggagcccg gcgagaaacc ctacaagtgc cccgagtgtg gaaaaagctt tagccaaagc 60
ggcgatctga ggagacacca aagaacacac accggcgaga agccctacaa atgtcccgag 120
tgcggaaaga gcttcagcca gagcggccat ctgaccgagc atcagagaac ccataccggc 180
gaaaaacctt ataagtgccc cgagtgtgga aagtccttct ccgagagatc ccatctgaga 240
gaacaccaga ggacacacac cggcgaaaaa ccttataagt gtcccgagtg cggaaagtcc 300
ttcagcgatc ccggccatct ggtgagacat caaaggacac ataccggcga aaaaccttat 360
aagtgtcccg aatgcggcaa gagctttagc agaaacgaca cactcaccga acaccagagg 420
accccacaccg gcgagaaacc ctacaaatgc cccgagtgcg gcaaatcctt ttctagagcc 480
gacaatctga ccgaaccacca gaggacccat accggagaaa agccttacaa atgtcccgag 540
tgtggcaaat ccttctccac ccatctggat ctgattagac accaaagaac acataccgga 600
gaaaagccca ccggaaaaaa gaccagc 627
<![CDATA[ <210> 48]]>
<![CDATA[ <211> 627]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 48]]>
ctggagcccg gcgaaaaacc ctataagtgc cccgaatgtg gaaagagctt cagccatacc 60
ggccatctgc tggaacacca aaggacacac accggcgaga aaccttacaa gtgtcccgag 120
tgcggaaaaa gcttctcctc caaaaaggct ctcaccgagc accagagaac acataccggc 180
gaaaagcctt ataagtgccc cgagtgtggc aaatcctttt ccgactgtag agatctggcc 240
agacatcaaa gaacccacac cggagagaaa ccttataaat gccccgagtg cggcaagtcc 300
tttagccata ccggccatct gctggagcac cagaggaccc ataccggcga gaagccttac 360
aaatgccccg agtgcggcaa aagcttcagc agaaatgacg ctctgaccga gcatcaaagg 420
acccataccg gcgaaaagcc ctacaagtgt cccgagtgtg gaaagtcctt ctcccagagc 480
ggcgatctga ggagacacca gagaacacac accggcgaga aaccctataa atgtcccgag 540
tgcggaaaga gctttagcga cagcggcaat ctgagggtgc atcaaagaac acacaccggc 600
gaaaaaccca ccggaaaaaa gacaagc 627
<![CDATA[ <210> 49]]>
<![CDATA[ <211> 538]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 49]]>
caccggcgaa aagccttata agtgccccga gtgcggcaag tccttctcta gaagcgatca 60
cctcaccaat catcagagga cacataccgg agagaagccc tataagtgcc ccgagtgcgg 120
caagagcttt agccagctgg ctcatctgag agctcaccaa agaacccata ccggcgagaa 180
gccttacaaa tgccccgagt gtggaaaatc cttttccccag tccagcaacc tcgtcagaca 240
tcaaaggacc cataccggcg aaaagcctta caagtgtccc gagtgcggaa agtccttctc 300
tagatccgac aacctcgtga ggcaccagag aacccacacc ggcgagaaac cttacaaatg 360
tcccgagtgt ggcaaaagct tttctagaag cgacgagctg gtgagacatc aaagaaccca 420
taccggcgaa aaaccttata agtgtcccga gtgcggcaaa tcctttagcc agctggccca 480
tctgagggcc caccagagaa cacataccgg cgaaaaaccc accggcaaaa agacaagc 538
<![CDATA[ <210> 50]]>
<![CDATA[ <211> 4101]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 50]]>
gacaagaagt acagcatcgg cctggccatc ggcaccaaca gcgtgggctg ggccgtgatc 60
accgacgagt acaaggtgcc cagcaagaag ttcaaggtgc tgggcaacac cgaccggcac 120
agcatcaaga agaacctgat cggcgccctg ctgttcgaca gcggcgagac cgccgaggcc 180
acccggctga agcggaccgc ccggcggcgg taacacccggc ggaagaaccg gatctgctac 240
ctgcaggaga tcttcagcaa cgagatggcc aaggtggacg acagcttctt ccaccggctg 300
gaggagagct tcctggtgga ggaggacaag aagcacgagc ggcaccccat cttcggcaac 360
atcgtggacg aggtggccta ccacgagaag taccccacca tctaccacct gcggaagaag 420
ctggtggaca gcaccgacaa ggccgacctg cggctgatct acctggccct ggcccacatg 480
atcaagttcc ggggccactt cctgatcgag ggcgacctga accccgacaa cagcgacgtg 540
gacaagctgt tcatccagct ggtgcagacc tacaaccagc tgttcgagga gaaccccatc 600
aacgccagcg gcgtggacgc caaggccatc ctgagcgccc ggctgagcaa gagccggcgg 660
ctggagaacc tgatcgccca gctgcccggc gagaagaaga acggcctgtt cggcaacctg 720
atcgccctga gcctggggcct gacccccaac ttcaagagca acttcgacct ggccgaggac 780
gccaagctgc agctgagcaa ggacacctac gacgacgacc tggacaacct gctggcccag 840
atcggcgacc agtacgccga cctgttcctg gccgccaaga acctgagcga cgccatcctg 900
ctgagcgaca tcctgcgggt gaacaccgag atcaccaagg cccccctgag cgccagcatg 960
atcaagcggt acgacgagca ccaccaggac ctgaccctgc tgaaggccct ggtgcggcag 1020
cagctgcccg agaagtacaa ggagatcttc ttcgaccaga gcaagaacgg ctacgccggc 1080
tacatcgacg gcggcgccag ccaggagggag ttctacaagt tcatcaagcc catcctggag 1140
aagatggacg gcaccgagga gctgctggtg aagctgaacc gggaggacct gctgcggaag 1200
cagcggacct tcgacaacgg cagcatcccc caccagatcc acctgggcga gctgcacgcc 1260
atcctgcggc ggcaggagga cttctacccc ttcctgaagg acaaccggga gaagatcgag 1320
aagatcctga ccttccggat cccctactac gtgggccccc tggcccgggg caacagccgg 1380
ttcgcctgga tgacccggaa atccgaggag accatcaccc cctggaactt cgaggaggtg 1440
gtggacaagg gcgccagcgc ccagagcttc atcgagcgga tgaccaactt cgacaagaac 1500
ctgcccaacg agaaggtgct gcccaagcac agcctgctgt acgagtactt caccgtgtac 1560
aacgagctga ccaaggtgaa gtacgtgacc gagggcatgc ggaagcccgc cttcctgagc 1620
ggcgagcaga agaaggccat cgtggacctg ctgttcaaga ccaaccggaa ggtgaccgtg 1680
aagcagctga aggaggacta cttcaagaag atcgagtgct tcgacagcgt ggagatcagc 1740
ggcgtggagg accggttcaa cgccagcctg ggcacctacc acgacctgct gaagatcatc 1800
aaggacaagg acttcctgga caacgaggag aacgaggaca tcctggagga catcgtgctg 1860
accctgaccc tgttcgagga ccgggagatg atcgaggagc ggctgaaaac ctacgcccac 1920
ctgttcgacg acaaggtgat gaagcagctg aagcggcggc ggtacaccgg ctggggccgg 1980
ctgagccgga agctgatcaa cggcatccgg gacaagcaga gcggcaagac catcctggac 2040
ttcctgaaat ccgacggctt cgccaaccgg aacttcatgc agctgatcca cgacgacagc 2100
ctgaccttca aggaggacat ccagaaggcc caggtgagcg gccagggcga cagcctgcac 2160
gagcacatcg ccaacctggc cggcagcccc gccatcaaga agggcatcct gcagaccgtg 2220
aaggtggtgg acgagctggt gaaggtgatg ggccggcaca agcccgagaa catcgtgatc 2280
gagatggccc gggagaacca gaccacccag aagggccaga agaacagccg ggagcggatg 2340
aagcggatcg aggagggcat caaggagctg ggcagccaga tcctgaagga gcaccccgtg 2400
gagaacaccc agctgcagaa cgagaagctg tacctgtact acctgcagaa cggccgggac 2460
atgtacgtgg accagagct ggacatcaac cggctgagcg actacgacgt ggccgccatc 2520
gtgccccaga gcttcctgaa ggacgacagc atcgacaaca aggtgctgac ccggagcgac 2580
aaggcccggg gcaagagcga caacgtgccc agcgaggagg tggtgaagaa gatgaagaac 2640
tactggcggc agctgctgaa cgccaagctg atcacccagc ggaagttcga caacctgacc 2700
aaggccgagc ggggcggcct gagcgagctg gacaaggccg gcttcatcaa gcggcagctg 2760
gtggagaccc ggcagatcac caagcacgtg gccccagatcc tggacagccg gatgaacacc 2820
aagtacgacg agaacgacaa gctgatccgg gaggtgaagg tgatcaccct gaaatccaag 2880
ctggtgagcg acttccggaa ggacttccag ttctacaagg tgcggggagat caacaactac 2940
caccacgccc acgacgccta cctgaacgcc gtggtgggca ccgccctgat caagaagtac 3000
cccaagctgg agagcgagtt cgtgtacggc gactacaagg tgtacgacgt gcggaagatg 3060
atcgccaaga gcgagcagga gatcggcaag gccaccgcca agtacttctt ctacagcaac 3120
atcatgaact tcttcaagac cgagatcacc ctggccaacg gcgagatccg gaagcggccc 3180
ctgatcgaga ccaacggcga gaccggcgag atcgtgtggg acaagggccg ggacttcgcc 3240
accgtgcgga aggtgctgag catgccccag gtgaacatcg tgaagaaaac cgaggtgcag 3300
accggcggct tcagcaagga gagcatcctg cccaagcgga acagcgacaa gctgatcgcc 3360
cggaagaagg actgggaccc caagaagtac ggcggcttcg acagccccac cgtggcctac 3420
agcgtgctgg tggtggccaa ggtggagaag ggcaagagca agaagctgaa atccgtgaag 3480
gagctgctgg gcatcaccat catggagcgg agcagcttcg agaagaaccc catcgacttc 3540
ctggaggcca agggctacaa ggaggtgaag aaggacctga tcatcaagct gcccaagtac 3600
agcctgttcg agctggagaa cggccggaag cggatgctgg ccagcgccgg cgagctgcag 3660
aagggcaacg agctggccct gcccagcaag tacgtgaact tcctgtacct ggccagccac 3720
tacgagaagc tgaagggcag ccccgaggac aacgagcaga agcagctgtt cgtggagcag 3780
cacaagcact acctggacga gatcatcgag cagatcagcg agttcagcaa gcgggtgatc 3840
ctggccgacg ccaacctgga caaggtgctg agcgcctaca acaagcaccg ggacaagccc 3900
atccgggagc aggccgagaa catcatccac ctgttcaccc tgaccaacct gggcgccccc 3960
gccgccttca agtacttcga caccaccatc gaccggaagc ggtacaccag caccaaggag 4020
gtgctggacg ccaccctgat ccaccagagc atcaccggcc tgtacgagac ccggatcgac 4080
ctgagccagc tgggcggcga c 4101
<![CDATA[ <210> 51]]>
<![CDATA[ <211> 288]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 51]]>
gacgccaaga gcctgaccgc ctggagccgg accctggtga ccttcaagga cgtgttcgtg 60
gacttcaccc gggaggagtg gaagctgctg gacaccgccc agcagatcct gtaccggaac 120
gtgatgctgg agaactacaa gaacctggtg agcctgggct accagctgac caagcccgac 180
gtgatcctgc ggctggagaa gggcgaggag ccctggctgg tggagcggga gatccaccag 240
gagacccacc ccgacagcga gaccgccttc gagatcaaga gcagcgtg 288
<![CDATA[ <210> 52]]>
<![CDATA[ <211> 1155]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 52]]>
agcaaggtgg agaacaagac caagaagctg cgggtgttcg aggccttcgc cggcatcggc 60
gcccagcgga aggccctgga gaaggtgcgg aaggacgagt acgagatcgt gggcctggcc 120
gagtggtacg tgcccgccat cgtgatgtac caggccatcc acaacaactt ccacaccaag 180
ctggagtaca agagcgtgag ccgggaggag atgatcgact acctggagaa caagaccctg 240
agctggaaca gcaagaaccc cgtgagcaac ggctactgga agcggaagaa ggacgacgag 300
ctgaagatca tctacaacgc catcaagctg agcgagaagg agggcaacat cttcgacatc 360
cgggacctgt acaagcggac cctgaagaac atcgacctgc tgacctacag cttcccctgc 420
caggacctga gccagcaggg catccagaag ggcatgaagc ggggcagcgg cacccggagc 480
ggcctgctgt gggagatcga gcgggccctg gacagcaccg agaagaacga cctgcccaag 540
tacctgctga tggagaacgt gggcgccctg ctgcacaaga agaacgagga ggagctgaac 600
cagtggaagc agaagctgga gagcctgggc taccagaaca gcatcgaggt gctgaacgcc 660
gccgacttcg gcagcagcca ggcccggcgg cgggtgttca tgatcagcac cctgaacgag 720
ttcgtggagc tgcccaaggg cgacaagaag cccaagagca tcaagaaggt gctgaacaag 780
atcgtgagcg agaaggacat cctgaacaac ctgctgaagt acaacctgac cgagttcaag 840
aaaaccaaga gcaacatcaa caaggccagc ctgatcggct acaagcaagtt caacagcgag 900
ggctacgtgt acgaccccga gttcaccggc cccaccctga ccgccagcgg cgccaacagc 960
cggatcaaga tcaaggacgg cagcaacatc cggaagatga acagcgacga gaccttcctg 1020
tacatcggct tcgacagcca ggacggcaag cgggtgaacg agatcgagtt cctgaccgag 1080
aaccagaaga tcttcgtgtg cggcaacagc atcagcgtgg aggtgctgga ggccatcatc 1140
gacaagatcg gcggc 1155
<![CDATA[ <210> 53]]>
<![CDATA[ <211> 1440]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 53]]>
gtggatctga ggacactcga cgtgtttagc ggatgcggcg gactctccga aggcttccac 60
caagccggaa tttccgacac actctgggcc attgagatgt gggaccccgc cgctcaagcc 120
ttcagactga ataatcccgg ctccaccgtg ttcaccgagg actgcaacat tctgctgaag 180
ctggtgatgg ctggcgaaac caccaactct agaggccaga ggctgcccca gaagggagat 240
gtggaaatgc tctgtggagg ccctccttgc caaggcttct ccggcatgaa caggttcaac 300
tctagaacat acaagcaagtt caagaactct ctggtcgtga gctttctgag ctactgcgac 360
tactatagac ctaggttctt tctgctggag aacgtgagaa atttcgtgtc cttcaagagg 420
agcatggtgc tgaagctgac actgaggtgt ctggtgagga tgggctacca gtgcacattc 480
ggagtgctgc aagctggcca gtacggcgtg gcccagacca gaaggagggc catcattctg 540
gctgctgccc ccggcgagaa actccctctg ttccccgagc ccctccacgt gttcgcccct 600
agagcttgcc agctgagcgt ggtggtcgac gataagaagt tcgtgagcaa catcacaagg 660
ctgtccagcg gacccttcag aaccattacc gtgagggata ccatgtccga cctccccgag 720
gtgaggaatg gcgccagcgc tctggagatt tcctacaacg gcgaacctca gagctggttc 780
caaaggcagc tgagaggcgc tcagtatcag cccattctga gggaccacat ctgcaaagat 840
atgagcgctc tggtggccgc tagaatgaga catattcctc tggcccccgg cagcgactgg 900
agagatctgc ccaatattga ggtgagactc agcgacggaa caatggctag aaaactgagg 960
tacacccatc atgatagaaa gaacggaagg agcagcagcg gcgctctgag aggagtgtgt 1020
agctgcgtgg aagctggcaa ggcttgcgat cccgccgcta ggcagttcaa taccctcatc 1080
ccttggtgtc tgcctcacac cggcaacaga cacaatcatt gggctggact gtatggaagg 1140
ctcgaatggg acggcttttt cagcaccacc gtgaccaatc ccgaacctat gggcaagcaa 1200
ggaagggtgc tccaccccga gcagcataga gtcgtgtccg tgagagaatg cgctagaagc 1260
caaggcttcc ccgacaccta tagactgttc ggcaacattc tggataagca cagacaagtg 1320
ggaaatgctg tccctcctcc tctggccaag gctatcggac tggagatcaa gctgtgtatg 1380
ctcgccaaag ctagggagag cgcttccgcc aagattaagg aggaggaggc cgccaaggac 1440
<![CDATA[ <210> 54]]>
<![CDATA[ <211> 1626]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 54]]>
aaccacgacc aggagttcga cccccccaag gtgtaccccc ccgtgcccgc cgagaagcgg 60
aagcccatcc gggtgctgag cctgttcgac ggcatcgcca ccggcctgct ggtgctgaag 120
gacctgggca tccaggtgga ccggtacatc gccagcgagg tgtgcgagga cagcatcacc 180
gtgggcatgg tgcggcacca gggcaagatc atgtacgtgg gcgacgtgcg gagcgtgacc 240
cagaagcaca tccaggagtg gggccccttc gacctggtga tcggcggcag cccctgcaac 300
gacctgagca tcgtgaaccc cgcccggaag ggcctgtacg agggcaccgg ccggctgttc 360
ttcgagttct accggctgct gcacgacgcc cggcccaagg agggcgacga ccggcccttc 420
ttctggctgt tcgagaacgt ggtggccatg ggcgtgagcg acaagcggga catcagccgg 480
ttcctggaga gcaaccccgt gatgatcgac gccaaggagg tgagcgccgc ccaccgggcc 540
cggtacttct ggggcaacct gcccggcatg aaccggcccc tggccagcac cgtgaacgac 600
aagctggagc tgcaggagtg cctggagcac ggccggatcg ccaagttcag caaggtgcgg 660
accatcacca cccggagcaa cagcatcaag cagggcaagg accagcactt ccccgtgttc 720
atgaacgaga aggaggacat cctgtggtgc accgagatgg agcgggtgtt cggcttcccc 780
gtgcactaca ccgacgtgag caacatgagc cggctggccc ggcagcggct gctgggccgg 840
agctggagcg tgcccgtgat ccggcacctg ttcgcccccc tgaaggagta cttcgcctgc 900
gtgagcagcg gcaacagcaa cgccaacagc cggggcccca gcttcagcag cggcctggtg 960
cccctgagcc tgcggggcag ccacatgaat cctctggaga tgttcgagac agtgcccgtg 1020
tggagaaggc aacccgtgag ggtgctgagc ctcttcgagg acattaagaa ggagctgacc 1080
tctctgggct ttctggaatc cggcagcgac cccggccagc tgaaacacgt ggtggacgtg 1140
accgacacag tgaggaagga cgtggaagag tggggcccct ttgacctcgt gtatggagcc 1200
acacctcctc tcggccacac atgcgatagg cctcccagct ggtatctctt ccagttccac 1260
agactgctcc agtacgccag acctaagccc ggcagcccca gacccttctt ctggatgttc 1320
gtggacaatc tggtgctgaa caaggaggat ctggatgtgg ccagcagatt tctggagatg 1380
gaacccgtga caatccccga cgtgcatggc ggctctctgc agaacgccgt gagagtgtgg 1440
tccaacatcc ccgccattag aagcagacac tgggctctgg tgagcgagga ggaactgtct 1500
ctgctggccc agaataagca gtcctccaag ctggccgcca agtggcccac caagctggtg 1560
aagaactgct ttctgcctct gagggagtat ttcaagtatt tcagcaccga actgaccagc 1620
agcctg 1626
<![CDATA[ <210> 55]]>
<![CDATA[ <211> 1345]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 55]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgagaa accctacaag tgccccgagt gcggcaaatc cttctctaga agcgacaaac 180
tgaccgaaca tcagaggacc cacaccggcg agaagcctta taagtgtccc gaatgcggca 240
aatccttcag caccaagaac tctctgacag aacaccagag aacacatacc ggagagaaac 300
ccttataaatg ccccgagtgc ggcaagtcct tctcccagtc cggcgatctg aggagacacc 360
aaagaacaca taccggcgaa aagccttaca agtgccccga gtgtggaaag agcttctcca 420
ccaccggcgc tctgaccgag caccagagaa cacacaccgg cgagaaaccc tataaatgtc 480
ccgagtgtgg caaatccttc agcgacagcg gcaatctgag agtgcaccaa agaacccata 540
ccggcgaaaa accctacaaa tgccccgagt gcggcaaatc cttcagccag agggcccatc 600
tggagaggca ccaaaggaca cacaccggag aaaagcccta caagtgtccc gagtgtggaa 660
aaagctttag cacaagcggc gagctggtga ggcatcaaag gacccacacc ggcgaaaagc 720
ccaccggcaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gacgccaaga 780
gcctgaccgc ctggagccgg accctggtga ccttcaagga cgtgttcgtg gacttcaccc 840
gggaggagtg gaagctgctg gacaccgccc agcagatcct gtaccggaac gtgatgctgg 900
agaactacaa gaacctggtg agcctgggct accagctgac caagcccgac gtgatcctgc 960
ggctggagaa gggcgaggag ccctggctgg tggagcggga gatccaccag gagacccacc 1020
ccgacagcga gaccgccttc gagatcaaga gcagcgtgag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1320
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1345
<![CDATA[ <210> 56]]>
<![CDATA[ <211> 1345]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 56]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgagaa gccctacaag tgccccgagt gcggaaagtc cttcagctcc cccgccgatc 180
tgacaagaca tcagagaacc cataccggcg agaaacctta caaatgcccc gaatgtggca 240
agtcctttag cgatcccgga catctggtga ggcaccagag gacacaccc ggcgaaaagc 300
cctataaatg tcccgagtgt ggaaagagct tttctagaag cgacaatctc gtgagacacc 360
agagaaccca caccggagag aagccttaca agtgccccga gtgcggcaaa tccttcagcc 420
agagctcctc tctggtgagg caccaaagga cccacaccgg cgagaaacct tataagtgtc 480
ccgagtgtgg caaaagcttc agcacctccc actctctgac cgagcatcaa agaacccaca 540
ccggcgaaaa accttataaa tgccccgagt gtggcaaatc cttcagcaga aatgacgctc 600
tgacagagca ccaaagaaca cataccggag aaaagcccta caaatgcccc gagtgtggaa 660
aatccttttc tagaaacgat gctctgaccg aacaccaaag aacacacacc ggcgaaaagc 720
ctaccggaaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gacgccaaga 780
gcctgaccgc ctggagccgg accctggtga ccttcaagga cgtgttcgtg gacttcaccc 840
gggaggagtg gaagctgctg gacaccgccc agcagatcct gtaccggaac gtgatgctgg 900
agaactacaa gaacctggtg agcctgggct accagctgac caagcccgac gtgatcctgc 960
ggctggagaa gggcgaggag ccctggctgg tggagcggga gatccaccag gagacccacc 1020
ccgacagcga gaccgccttc gagatcaaga gcagcgtgag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1320
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1345
<![CDATA[ <210> 57]]>
<![CDATA[ <211> 1345]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 57]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgaaaa accttacaag tgccccgagt gcggaaagag cttcagcaga agcgacaaac 180
tggtgaggca tcaaaggaca cataccggag agaagcccta taagtgcccc gaatgtggca 240
aatccttttc ccagagggct catctggaaa gacaccagag gacccatacc ggcgaaaaac 300
cctacaaatg tcccgagtgt ggaaagagct tttccgatcc cggccatctg gtcagacatc 360
agaggacaca taccggcgaa aagccttaca agtgtcccga atgcggaaaa tccttctcca 420
gaagcgacaa gctggtgagg caccaaagaa cccacaccgg cgaaaaaccc tataaatgcc 480
ccgagtgcgg caagtccttt agccagctgg cccatctgag agcccaccag agaacacaca 540
ccggagagaa gccttataag tgtcccgagt gcggaaagtc cttctctaga gccgacaatc 600
tgaccgaaca tcaaaggaca cacaccggcg agaaacctta taaatgcccc gagtgcggaa 660
aaagcttttc cgactgcaga gatctggcta gacaccagag aacccacacc ggcgagaaac 720
ccaccggcaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gacgccaaga 780
gcctgaccgc ctggagccgg accctggtga ccttcaagga cgtgttcgtg gacttcaccc 840
gggaggagtg gaagctgctg gacaccgccc agcagatcct gtaccggaac gtgatgctgg 900
agaactacaa gaacctggtg agcctgggct accagctgac caagcccgac gtgatcctgc 960
ggctggagaa gggcgaggag ccctggctgg tggagcggga gatccaccag gagacccacc 1020
ccgacagcga gaccgccttc gagatcaaga gcagcgtgag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1320
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1345
<![CDATA[ <210> 58]]>
<![CDATA[ <211> 1345]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223]]>> Description of artificial sequences: synthetic polynucleotides]]>
<br/>
<br/> <![CDATA[ <400>58]]>
<br/> <![CDATA[aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgaaaa gccttataaa tgtcccgaat gcggcaagag ctttagccac accggccatc 180
tgctggaaca ccaaaggacc cataccggcg aaaagcccta taagtgcccc gagtgtggca 240
agagcttcag caccaccggc aatctgacag tccatcagag gacccacacc ggagagaaac 300
cctataaatg ccccgagtgt ggaaagtcct tctccgacaa gaaggatctg acaagacacc 360
agaggaccca taccggcgag aaaccctaca aatgccccga gtgcggcaaa tccttctccc 420
agagcggcga tctgaggaga catcaaagaa cacataccgg cgaaaaaccc tataagtgcc 480
ccgaatgcgg caagtccttc agccagaggg cccatctgga aaggcatcag aggacacaca 540
ccggcgagaa gccttacaaa tgtcccgagt gcggaaagag cttctctaga agcgacaagc 600
tgaccgagca tcagaggacc cacaccggag aaaaacctta caagtgcccc gagtgcggca 660
aaagcttcag cagaaccgac acactgagag atcaccaaag gacacacacc ggcgagaaac 720
ccaccggcaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gacgccaaga 780
gcctgaccgc ctggagccgg accctggtga ccttcaagga cgtgttcgtg gacttcaccc 840
gggaggagtg gaagctgctg gacaccgccc agcagatcct gtaccggaac gtgatgctgg 900
agaactacaa gaacctggtg agcctgggct accagctgac caagcccgac gtgatcctgc 960
ggctggagaa gggcgaggag ccctggctgg tggagcggga gatccaccag gagacccacc 1020
ccgacagcga gaccgccttc gagatcaaga gcagcgtgag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1320
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1345
<![CDATA[ <210> 59]]>
<![CDATA[ <211> 1345]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 59]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgagaa gccttataag tgccccgagt gtggcaagag ctttagccac accggccatc 180
tgctggagca tcaaaggaca cacaccggag aaaagcccta taagtgcccc gagtgtggca 240
aatccttcag cacctccggc aatctcaccg aacaccagag aacacacacc ggagaaaaac 300
cttacaaatg tcccgagtgt ggaaagagct tttccaccag cggcaatctg gtgagacatc 360
aaagaacaca taccggcgaa aaaccctata aatgccccga gtgtggaaaa tccttctccc 420
aactggccca tctgagggcc caccagagga cacataccgg agaaaaaccc tacaaatgcc 480
ccgaatgcgg aaaaagcttc tccgagagaa gccatctgag agagcaccaa aggacccata 540
ccggagaaaa gccttacaag tgtcccgagt gcggaaaaag ctttagcgat cccggacatc 600
tggtgagaca tcagagaacc cacaccggcg aaaagcctta taaatgtccc gaatgtggca 660
agtcctttag cacccatctg gatctgatta gacaccaaag aacccacacc ggcgagaaac 720
ccaccggaaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gacgccaaga 780
gcctgaccgc ctggagccgg accctggtga ccttcaagga cgtgttcgtg gacttcaccc 840
gggaggagtg gaagctgctg gacaccgccc agcagatcct gtaccggaac gtgatgctgg 900
agaactacaa gaacctggtg agcctgggct accagctgac caagcccgac gtgatcctgc 960
ggctggagaa gggcgaggag ccctggctgg tggagcggga gatccaccag gagacccacc 1020
ccgacagcga gaccgccttc gagatcaaga gcagcgtgag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1320
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1345
<![CDATA[ <210> 60]]>
<![CDATA[ <211> 1345]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 60]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgaaaa gccttacaaa tgtcccgagt gcggaaagtc cttcagcgac cccggcgctc 180
tggtgagaca tcaaagaaca cataccggcg agaaacctta taaatgcccc gaatgtggaa 240
aatccttcag cgaaagaagc catctgaggg aacaccagag gacccacacc ggcgaaaaac 300
cttataagtg ccccgaatgc ggaaaaagct tttctagaag cgatcatctg accaaccatc 360
agagaacaca caccggcgaa aagccctata aatgtcccga gtgtggcaaa tcctttagcg 420
agaggtccca tctgagagag caccagagga cacataccgg agagaagccc tacaagtgcc 480
ccgagtgtgg caagagcttt agcagaagcg accatctgac caatcatcaa aggacccaca 540
ccggagagaa gccttacaag tgtcccgagt gcggaaagtc cttttccgat cccggccacc 600
tcgtgaggca ccaaagaacc cataccggcg agaaacccta caaatgcccc gagtgtggaa 660
agagcttctc cagaagcgac aagctggtga ggcatcagag gacacaccc ggcgaaaaac 720
ccaccggcaa gaaaaccagc gctagcggca gcggcggcgg cagcggcggc gacgccaaga 780
gcctgaccgc ctggagccgg accctggtga ccttcaagga cgtgttcgtg gacttcaccc 840
gggaggagtg gaagctgctg gacaccgccc agcagatcct gtaccggaac gtgatgctgg 900
agaactacaa gaacctggtg agcctgggct accagctgac caagcccgac gtgatcctgc 960
ggctggagaa gggcgaggag ccctggctgg tggagcggga gatccaccag gagacccacc 1020
ccgacagcga gaccgccttc gagatcaaga gcagcgtgag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1320
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1345
<![CDATA[ <210> 61]]>
<![CDATA[ <211> 2227]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 61]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggagagaa gccctacaaa tgccccgagt gtggaaagag cttctctaga aatgacgctc 180
tgacagaaca ccagaggacc cataccggcg agaaacctta caaatgcccc gagtgcggaa 240
aaagctttag cgattgcaga gatctggcta gacatcagag aacacacacc ggcgagaagc 300
cctataagtg ccccgaatgc ggcaagagct ttagcgaccc cggccatctg gtgagacatc 360
aaaggacaca taccggagaa aaaccttaca agtgccccga gtgcggaaag tccttctccc 420
agagcggcca tctcaccgag catcaaagga cccacaccgg cgaaaagcct tataaatgtc 480
ccgaatgtgg caagtccttc tctagagagg ataatctgca cacccatcag aggacccaca 540
ccggcgaaaa gccttataaa tgccccgaat gtggaaagtc cttttccacc aagaactctc 600
tgaccgagca tcagaggaca cacaccggag agaaacccta taaatgtccc gagtgtggca 660
agagcttcag cagagccgac aatctgacag agcaccaaag aacacatacc ggcgaaaagc 720
ccaccggcaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gcccgggaca 780
gcaaggtgga gaacaagacc aagaagctgc gggtgttcga ggccttcgcc ggcatcggcg 840
cccagcggaa ggccctggag aaggtgcgga aggacgagta cgagatcgtg ggcctggccg 900
agtggtacgt gcccgccatc gtgatgtacc aggccatcca caacaacttc cacaccaagc 960
tggagtacaa gagcgtgagc cgggaggaga tgatcgacta cctggagaac aagaccctga 1020
gctggaacag caagaaccccc gtgagcaacg gctactggaa gcggaagaag gacgacgagc 1080
tgaagatcat ctacaacgcc atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctgta caagcggacc ctgaagaaca tcgacctgct gacctacagc ttcccctgcc 1200
aggacctgag ccagcagggc atccagaagg gcatgaagcg gggcagcggc acccggagcg 1260
gcctgctgtg ggagatcgag cgggccctgg acagcaccga gaagaacgac ctgcccaagt 1320
acctgctgat ggagaacgtg ggcgccctgc tgcacaagaa gaacgaggag gagctgaacc 1380
agtggaagca gaagctggag agcctgggct accagaacag catcgaggtg ctgaacgccg 1440
ccgacttcgg cagcagccag gcccggcggc gggtgttcat gatcagcacc ctgaacgagt 1500
tcgtggagct gcccaagggc gacaagaagc ccaagagcat caagaaggtg ctgaacaaga 1560
tcgtgagcga gaaggacatc ctgaacaacc tgctgaagta caacctgacc gagttcaaga 1620
aaaccaagag caacatcaac aaggccagcc tgatcggcta cagcaagttc aacagcgagg 1680
gctacgtgta cgaccccgag ttcaccggcc ccaccctgac cgccagcggc gccaacagcc 1740
ggatcaagat caaggacggc agcaacatcc ggaagatgaa cagcgacgag accttcctgt 1800
acatcggctt cgacagccag gacggcaagc gggtgaacga gatcgagttc ctgaccgaga 1860
accagaagat cttcgtgtgc ggcaacagca tcagcgtgga ggtgctggag gccatcatcg 1920
acaagatcgg cggccccagc agcggcggca agcggcccgc cgccaccaag aaggccggcc 1980
aggccaagaa gaagaagggc agctacccct acgacgtgcc cgactacgcc tgagcggccg 2040
cttaattaag ctgccttctg cggggcttgc cttctggcca tgcccttctt ctctcccttg 2100
cacctgtacc tcttggtctt tgaataaagc ctgagtagga agtctagaaaaaaaaaaaaa 2160
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2220
aaaaaaa 2227
<![CDATA[ <210> 62]]>
<![CDATA[ <211> 2227]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 62]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgagaa accctacaag tgccccgagt gtggcaaatc cttctctaga tccgacaaac 180
tgaccgaaca tcagaggacc cataccggcg aaaaacctta taaatgtccc gagtgcggaa 240
agtccttctc tagaagggac gagctgaacg tgcatcagag aacacatacc ggcgagaagc 300
cctataaatg ccccgaatgc ggcaaaagct tctctagaag cgatcatctg accaaccacc 360
agagaaccca taccggagaa aagccttaca agtgtcccga atgtggaaaa tccttcagct 420
cccccgccga tctgaccaga caccaaagga cccacaccgg cgagaagccc tataaatgcc 480
ccgagtgcgg caagagcttt tccagatccg accatctgac caatcatcaa agaacccaca 540
ccggcgaaaa gccttataaa tgtcccgagt gcggcaaatc cttttccagc aagaaggctc 600
tgaccgagca tcaaaggacc cataccggcg agaagcctta caaatgcccc gagtgtggaa 660
agtcctttag cacccatctg gatctgatta gacaccagag gacacacacc ggagagaaac 720
ccaccggcaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gcccgggaca 780
gcaaggtgga gaacaagacc aagaagctgc gggtgttcga ggccttcgcc ggcatcggcg 840
cccagcggaa ggccctggag aaggtgcgga aggacgagta cgagatcgtg ggcctggccg 900
agtggtacgt gcccgccatc gtgatgtacc aggccatcca caacaacttc cacaccaagc 960
tggagtacaa gagcgtgagc cgggaggaga tgatcgacta cctggagaac aagaccctga 1020
gctggaacag caagaaccccc gtgagcaacg gctactggaa gcggaagaag gacgacgagc 1080
tgaagatcat ctacaacgcc atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctgta caagcggacc ctgaagaaca tcgacctgct gacctacagc ttcccctgcc 1200
aggacctgag ccagcagggc atccagaagg gcatgaagcg gggcagcggc acccggagcg 1260
gcctgctgtg ggagatcgag cgggccctgg acagcaccga gaagaacgac ctgcccaagt 1320
acctgctgat ggagaacgtg ggcgccctgc tgcacaagaa gaacgaggag gagctgaacc 1380
agtggaagca gaagctggag agcctgggct accagaacag catcgaggtg ctgaacgccg 1440
ccgacttcgg cagcagccag gcccggcggc gggtgttcat gatcagcacc ctgaacgagt 1500
tcgtggagct gcccaagggc gacaagaagc ccaagagcat caagaaggtg ctgaacaaga 1560
tcgtgagcga gaaggacatc ctgaacaacc tgctgaagta caacctgacc gagttcaaga 1620
aaaccaagag caacatcaac aaggccagcc tgatcggcta cagcaagttc aacagcgagg 1680
gctacgtgta cgaccccgag ttcaccggcc ccaccctgac cgccagcggc gccaacagcc 1740
ggatcaagat caaggacggc agcaacatcc ggaagatgaa cagcgacgag accttcctgt 1800
acatcggctt cgacagccag gacggcaagc gggtgaacga gatcgagttc ctgaccgaga 1860
accagaagat cttcgtgtgc ggcaacagca tcagcgtgga ggtgctggag gccatcatcg 1920
acaagatcgg cggccccagc agcggcggca agcggcccgc cgccaccaag aaggccggcc 1980
aggccaagaa gaagaagggc agctacccct acgacgtgcc cgactacgcc tgagcggccg 2040
cttaattaag ctgccttctg cggggcttgc cttctggcca tgcccttctt ctctcccttg 2100
cacctgtacc tcttggtctt tgaataaagc ctgagtagga agtctagaaaaaaaaaaaaa 2160
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2220
aaaaaaa 2227
<![CDATA[ <210> 63]]>
<![CDATA[ <211> 2227]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 63]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgagaa accttacaaa tgccccgagt gcggcaagag cttcagcaga agcgacgatc 180
tggtgaggca ccaaagaacc cacaccggcg aaaaacctta caagtgtccc gaatgcggaa 240
agtccttcag cagagaggac aatctgcaca cccaccagag aacacacacc ggagaaaagc 300
cttacaagtg ccccgaatgc ggcaaatcct tttctagaag cgatcatctg accacccacc 360
aaagaacaca taccggcgag aagccttaca aatgtcccga gtgcggaaag tccttctccc 420
agagagccaa tctgagggct catcaaagga cccataccgg cgaaaagccc tacaaatgcc 480
ccgagtgcgg aaaatccttc agccagctgg cccatctgag agcccaccaa aggacacaca 540
ccggagagaa accctataag tgccccgagt gtggaaaaag cttttcccag agggccaatc 600
tgagggccca tcagaggacc cataccggag agaagcctta taaatgtccc gagtgcggaa 660
aaagcttcag cgagaggagc catctgaggg aacatcaaag aacccacacc ggcgaaaaac 720
ccaccggaaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gcccgggaca 780
gcaaggtgga gaacaagacc aagaagctgc gggtgttcga ggccttcgcc ggcatcggcg 840
cccagcggaa ggccctggag aaggtgcgga aggacgagta cgagatcgtg ggcctggccg 900
agtggtacgt gcccgccatc gtgatgtacc aggccatcca caacaacttc cacaccaagc 960
tggagtacaa gagcgtgagc cgggaggaga tgatcgacta cctggagaac aagaccctga 1020
gctggaacag caagaaccccc gtgagcaacg gctactggaa gcggaagaag gacgacgagc 1080
tgaagatcat ctacaacgcc atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctgta caagcggacc ctgaagaaca tcgacctgct gacctacagc ttcccctgcc 1200
aggacctgag ccagcagggc atccagaagg gcatgaagcg gggcagcggc acccggagcg 1260
gcctgctgtg ggagatcgag cgggccctgg acagcaccga gaagaacgac ctgcccaagt 1320
acctgctgat ggagaacgtg ggcgccctgc tgcacaagaa gaacgaggag gagctgaacc 1380
agtggaagca gaagctggag agcctgggct accagaacag catcgaggtg ctgaacgccg 1440
ccgacttcgg cagcagccag gcccggcggc gggtgttcat gatcagcacc ctgaacgagt 1500
tcgtggagct gcccaagggc gacaagaagc ccaagagcat caagaaggtg ctgaacaaga 1560
tcgtgagcga gaaggacatc ctgaacaacc tgctgaagta caacctgacc gagttcaaga 1620
aaaccaagag caacatcaac aaggccagcc tgatcggcta cagcaagttc aacagcgagg 1680
gctacgtgta cgaccccgag ttcaccggcc ccaccctgac cgccagcggc gccaacagcc 1740
ggatcaagat caaggacggc agcaacatcc ggaagatgaa cagcgacgag accttcctgt 1800
acatcggctt cgacagccag gacggcaagc gggtgaacga gatcgagttc ctgaccgaga 1860
accagaagat cttcgtgtgc ggcaacagca tcagcgtgga ggtgctggag gccatcatcg 1920
acaagatcgg cggccccagc agcggcggca agcggcccgc cgccaccaag aaggccggcc 1980
aggccaagaa gaagaagggc agctacccct acgacgtgcc cgactacgcc tgagcggccg 2040
cttaattaag ctgccttctg cggggcttgc cttctggcca tgcccttctt ctctcccttg 2100
cacctgtacc tcttggtctt tgaataaagc ctgagtagga agtctagaaaaaaaaaaaaa 2160
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2220
aaaaaaa 2227
<![CDATA[ <210> 64]]>
<![CDATA[ <211> 2080]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 64]]>
agtgtggaaa aagctttagc caaagcggcg atctgaggag acaccaaaga acacacaccg 60
gcgagaagcc ctacaaatgt cccgagtgcg gaaagagctt cagccagagc ggccatctga 120
ccgagcatca gagaacccat accggcgaaa aaccttataa gtgccccgag tgtggaaagt 180
ccttctccga gagatcccat ctgagagaac accagaggac acacaccggc gaaaaacctt 240
ataagtgtcc cgagtgcgga aagtccttca gcgatcccgg ccatctggtg agacatcaaa 300
ggacacatac cggcgaaaaa ccttataagt gtcccgaatg cggcaagagc tttagcagaa 360
acgacacact caccgaacac cagaggaccc acaccggcga gaaaccctac aaatgccccg 420
agtgcggcaa atccttttct agagccgaca atctgaccga acaccagagg acccataccg 480
gagaaaagcc ttacaaatgt cccgagtgtg gcaaatcctt ctccacccat ctggatctga 540
ttagacacca aagaacacat accggagaaa agcccaccgg aaaaaagacc agcgctagcg 600
gcagcggcgg cggcagcggc ggcgcccggg acaagcaaggt ggagaacaag accaagaagc 660
tgcgggtgtt cgaggccttc gccggcatcg gcgcccagcg gaaggccctg gagaaggtgc 720
ggaaggacga gtacgagatc gtgggcctgg ccgagtggta cgtgcccgcc atcgtgatgt 780
accaggccat ccacaacaac ttccacacca agctggagta caagagcgtg agccgggagg 840
agatgatcga ctacctggag aacaagaccc tgagctggaa cagcaagaac cccgtgagca 900
acggctactg gaagcggaag aaggacgacg agctgaagat catctacaac gccatcaagc 960
tgagcgagaa ggagggcaac atcttcgaca tccgggacct gtacaagcgg accctgaaga 1020
acatcgacct gctgacctac agcttcccct gccaggacct gagccagcag ggcatccaga 1080
agggcatgaa gcggggcagc ggcacccgga gcggcctgct gtgggagatc gagcgggccc 1140
tggacagcac cgagaagaac gacctgccca agtacctgct gatggagaac gtgggcgccc 1200
tgctgcacaa gaagaacgag gaggagctga accagtggaa gcagaagctg gagagcctgg 1260
gctaccagaa cagcatcgag gtgctgaacg ccgccgactt cggcagcagc caggcccggc 1320
ggcgggtgtt catgatcagc accctgaacg agttcgtgga gctgcccaag ggcgacaaga 1380
agcccaagag catcaagaag gtgctgaaca agatcgtgag cgagaaggac atcctgaaca 1440
acctgctgaa gtacaacctg accgagttca agaaaaccaa gagcaacatc aacaaggcca 1500
gcctgatcgg ctacagcaag ttcaacagcg agggctacgt gtacgacccc gagttcaccg 1560
gccccaccct gaccgccagc ggcgccaaca gccggatcaa gatcaaggac ggcagcaaca 1620
tccggaagat gaacagcgac gagaccttcc tgtacatcgg cttcgacagc caggacggca 1680
agcgggtgaa cgagatcgag ttcctgaccg agaaccagaa gatcttcgtg tgcggcaaca 1740
gcatcagcgt ggaggtgctg gaggccatca tcgacaagat cggcggcccc agcagcggcg 1800
gcaagcggcc cgccgccacc aagaaggccg gccaggccaa gaagaagaag ggcagctacc 1860
cctacgacgt gcccgactac gcctgagcgg ccgcttaatt aagctgcctt ctgcggggct 1920
tgccttctgg ccatgccctt cttctctccc ttgcacctgt acctcttggt ctttgaataa 1980
agcctgagta ggaagtctag aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2040
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2080
<![CDATA[ <210> 65]]>
<![CDATA[ <211> 1929]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 65]]>
gccttataag tgccccgagt gtggcaaatc cttttccgac tgtagagatc tggccagaca 60
tcaaagaacc cacaccggag agaaacctta taaatgcccc gagtgcggca agtcctttag 120
ccataccggc catctgctgg agcaccagag gacccatacc ggcgagaagc cttacaaatg 180
ccccgagtgc ggcaaaagct tcagcagaaa tgacgctctg accgagcatc aaaggaccca 240
taccggcgaa aagccctaca agtgtcccga gtgtggaaag tccttctccc agagcggcga 300
tctgaggaga caccagagaa cacacaccgg cgagaaaccc tataaatgtc ccgagtgcgg 360
aaagagcttt agcgacagcg gcaatctgag ggtgcatcaa agaacacaca ccggcgaaaa 420
accacccgga aaaaagacaa gcgctagcgg cagcggcggc ggcagcggcg gcgcccggga 480
cagcaaggtg gagaacaaga ccaagaagct gcgggtgttc gaggccttcg ccggcatcgg 540
cgcccagcgg aaggccctgg agaaggtgcg gaaggacgag tacgagatcg tgggcctggc 600
cgagtggtac gtgcccgcca tcgtgatgta ccaggccatc cacaacaact tccacaccaa 660
gctggagtac aagagcgtga gccgggagga gatgatcgac tacctggaga acaagaccct 720
gagctggaac agcaagaacc ccgtgagcaa cggctactgg aagcggaaga aggacgacga 780
gctgaagatc atctacaacg ccatcaagct gagcgagaag gagggcaaca tcttcgacat 840
ccgggacctg tacaagcgga ccctgaagaa catcgacctg ctgacctaca gcttcccctg 900
ccaggacctg agccagcagg gcatccagaa gggcatgaag cggggcagcg gcacccggag 960
cggcctgctg tgggagatcg agcgggccct ggacagcacc gagaagaacg acctgcccaa 1020
gtacctgctg atggagaacg tgggcgccct gctgcacaag aagaacgagg aggagctgaa 1080
ccagtggaag cagaagctgg agagcctggg ctaccagaac agcatcgagg tgctgaacgc 1140
cgccgacttc ggcagcagcc aggcccggcg gcgggtgttc atgatcagca ccctgaacga 1200
gttcgtggag ctgcccaagg gcgacaagaa gcccaagagc atcaagaagg tgctgaacaa 1260
gatcgtgagc gagaaggaca tcctgaacaa cctgctgaag tacaacctga ccgagttcaa 1320
gaaaaccaag agcaacatca acaaggccag cctgatcggc tacagcaagt tcaacagcga 1380
gggctacgtg tacgaccccg agttcaccgg ccccaccctg accgccagcg gcgccaacag 1440
ccggatcaag atcaaggacg gcagcaacat ccggaagatg aacagcgacg agaccttcct 1500
gtacatcggc ttcgacagcc aggacggcaa gcgggtgaac gagatcgagt tcctgaccga 1560
gaaccagaag atcttcgtgt gcggcaacag catcagcgtg gaggtgctgg aggccatcat 1620
cgacaagatc ggcggcccca gcagcggcgg caagcggccc gccgccacca agaaggccgg 1680
ccaggccaag aagaagaagg gcagctaccc ctacgacgtg cccgactacg cctgagcggc 1740
cgcttaatta agctgccttc tgcggggctt gccttctggc catgcccttc ttctctccct 1800
tgcacctgta cctcttggtc tttgaataaa gcctgagtag gaagtctaga aaaaaaaaaa 1860
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1920
aaaaaaaaa 1929
<![CDATA[ <210> 66]]>
<![CDATA[ <211> 1779]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 66]]>
gacccatacc ggcgaaaagc cttacaagtg tcccgagtgc ggaaagtcct tctctagatc 60
cgacaacctc gtgaggcacc agagaaccca caccggcgag aaaccttaca aatgtcccga 120
gtgtggcaaa agcttttcta gaagcgacga gctggtgaga catcaaagaa cccataccgg 180
cgaaaaacct tataagtgtc ccgagtgcgg caaatccttt agccagctgg cccatctgag 240
ggcccaccag agaacacata ccggcgaaaa accccaccggc aaaaagacaa gcgctagcgg 300
cagcggcggc ggcagcggcg gcgcccggga cagcaaggtg gagaacaaga ccaagaagct 360
gcgggtgttc gaggccttcg ccggcatcgg cgcccagcgg aaggccctgg agaaggtgcg 420
gaaggacgag tacgagatcg tgggcctggc cgagtggtac gtgcccgcca tcgtgatgta 480
ccaggccatc cacaacaact tccacaccaa gctggagtac aagagcgtga gccgggagga 540
gatgatcgac tacctggaga acaagaccct gagctggaac agcaagaacc ccgtgagcaa 600
cggctactgg aagcggaaga aggacgacga gctgaagatc atctacaacg ccatcaagct 660
gagcgagaag gagggcaaca tcttcgacat ccgggacctg tacaagcgga ccctgaagaa 720
catcgacctg ctgacctaca gcttcccctg ccaggacctg agccagcagg gcatccagaa 780
gggcatgaag cggggcagcg gcacccggag cggcctgctg tgggagatcg agcgggccct 840
ggacagcacc gagaagaacg acctgcccaa gtacctgctg atggagaacg tgggcgccct 900
gctgcacaag aagaacgagg aggagctgaa ccagtggaag cagaagctgg agagcctggg 960
ctaccagaac agcatcgagg tgctgaacgc cgccgacttc ggcagcagcc aggcccggcg 1020
gcgggtgttc atgatcagca ccctgaacga gttcgtggag ctgcccaagg gcgacaagaa 1080
gcccaagagc atcaagaagg tgctgaacaa gatcgtgagc gagaaggaca tcctgaacaa 1140
cctgctgaag tacaacctga ccgagttcaa gaaaaccaag agcaacatca acaaggccag 1200
cctgatcggc tacagcaagt tcaacagcga gggctacgtg tacgaccccg agttcaccgg 1260
ccccaccctg accgccagcg gcgccaacag ccggatcaag atcaaggacg gcagcaacat 1320
ccggaagatg aacagcgacg agaccttcct gtacatcggc ttcgacagcc aggacggcaa 1380
gcgggtgaac gagatcgagt tcctgaccga gaaccagaag atcttcgtgt gcggcaacag 1440
catcagcgtg gaggtgctgg aggccatcat cgacaagatc ggcggcccca gcagcggcgg 1500
caagcggccc gccgccacca agaaggccgg ccaggccaag aagaagaagg gcagctaccc 1560
ctacgacgtg cccgactacg cctgagcggc cgcttaatta agctgccttc tgcggggctt 1620
gccttctggc catgcccttc ttctctccct tgcacctgta cctcttggtc tttgaataaa 1680
gcctgagtag gaagtctaga aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1740
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1779
<![CDATA[ <210> 67]]>
<![CDATA[ <211> 4829]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 67]]>
aaggaaataa gagagaaaag aagagtaaga agaaatataa gagccaccat ggcccccaag 60
aagaagcgga aggtgggcat ccacggcgtg cccgccgccg acaagaagta cagcatcggc 120
ctggccatcg gcaccaacag cgtgggctgg gccgtgatca ccgacgagta caaggtgccc 180
agcaagaagt tcaaggtgct gggcaacacc gaccggcaca gcatcaagaa gaacctgatc 240
ggcgccctgc tgttcgacag cggcgagacc gccgaggcca cccggctgaa gcggaccgcc 300
cggcggcggt acacccggcg gaagaaccgg atctgctacc tgcaggagat cttcagcaac 360
gagatggcca aggtggacga cagcttcttc caccggctgg aggagagctt cctggtggag 420
gaggacaaga agcacgagcg gcacccccatc ttcggcaaca tcgtggacga ggtggcctac 480
cacgagaagt accccaccat ctaccacctg cggaagaagc tggtggacag caccgacaag 540
gccgacctgc ggctgatcta cctggccctg gcccacatga tcaagttccg gggccacttc 600
ctgatcgagg gcgacctgaa ccccgacaac agcgacgtgg acaagctgtt catccagctg 660
gtgcagacct acaaccagct gttcgaggag aaccccatca acgccagcgg cgtggacgcc 720
aaggccatcc tgagcgcccg gctgagcaag agccggcggc tggagaacct gatcgcccag 780
ctgcccggcg agaagaagaa cggcctgttc ggcaacctga tcgccctgag cctgggcctg 840
accccccaact tcaagagcaa cttcgacctg gccgaggacg ccaagctgca gctgagcaag 900
gacacctacg acgacgacct ggacaacctg ctggcccaga tcggcgacca gtacgccgac 960
ctgttcctgg ccgccaagaa cctgagcgac gccatcctgc tgagcgacat cctgcgggtg 1020
aacaccgaga tcaccaaggc ccccctgagc gccagcatga tcaagcggta cgacgagcac 1080
caccaggacc tgaccctgct gaaggccctg gtgcggcagc agctgcccga gaagtacaag 1140
gagatcttct tcgaccagag caagaacggc tacgccggct acatcgacgg cggcgccagc 1200
caggagggagt tctacaagtt catcaagccc atcctggaga agatggacgg caccgaggag 1260
ctgctggtga agctgaaccg ggaggacctg ctgcggaagc agcggacctt cgacaacggc 1320
agcatccccc accagatcca cctgggcgag ctgcacgcca tcctgcggcg gcaggaggac 1380
ttctacccct tcctgaagga caaccgggag aagatcgaga agatcctgac cttccggatc 1440
ccctactacg tgggccccct ggcccggggc aacagccggt tcgcctggat gacccggaaa 1500
tccgaggaga ccatcacccc ctggaacttc gaggaggtgg tggacaaggg cgccagcgcc 1560
cagagcttca tcgagcggat gaccaacttc gacaagaacc tgcccaacga gaaggtgctg 1620
cccaagcaca gcctgctgta cgagtacttc accgtgtaca acgagctgac caaggtgaag 1680
tacgtgaccg agggcatgcg gaagcccgcc ttcctgagcg gcgagcagaa gaaggccatc 1740
gtggacctgc tgttcaagac caaccggaag gtgaccgtga agcagctgaa ggaggactac 1800
ttcaagaagaga tcgagtgctt cgacagcgtg gagatcagcg gcgtggagga ccggttcaac 1860
gccagcctgg gcacctacca cgacctgctg aagatcatca aggacaagga cttcctggac 1920
aacgaggaga acgaggacat cctggaggac atcgtgctga ccctgaccct gttcgaggac 1980
cgggagatga tcgaggagcg gctgaaaacc tacgcccacc tgttcgacga caaggtgatg 2040
aagcagctga agcggcggcg gtacaccggc tggggccggc tgagccggaa gctgatcaac 2100
ggcatccggg acaagcagag cggcaagacc atcctggact tcctgaaatc cgacggcttc 2160
gccaaccgga acttcatgca gctgatccac gacgacagcc tgaccttcaa ggaggacatc 2220
cagaaggccc aggtgagcgg ccagggcgac agcctgcacg agcacatcgc caacctggcc 2280
ggcagccccg ccatcaagaa gggcatcctg cagaccgtga aggtggtgga cgagctggtg 2340
aaggtgatgg gccggcacaa gcccgagaac atcgtgatcg agatggcccg ggagaaccag 2400
accacccaga agggccagaa gaacagccgg gagcggatga agcggatcga ggagggcatc 2460
aaggagctgg gcagccagat cctgaaggag caccccgtgg agaacaccca gctgcagaac 2520
gagaagctgt acctgtacta cctgcagaac ggccgggaca tgtacgtgga ccaggagctg 2580
gacatcaacc ggctgagcga ctacgacgtg gccgccatcg tgccccagag cttcctgaag 2640
gacgacagca tcgacaacaa ggtgctgacc cggagcgaca aggcccgggg caagagcgac 2700
aacgtgccca gcgaggaggt ggtgaagaag atgaagaact actggcggca gctgctgaac 2760
gccaagctga tcacccagcg gaagttcgac aacctgacca aggccgagcg gggcggcctg 2820
agcgagctgg acaaggccgg cttcatcaag cggcagctgg tggagacccg gcagatcacc 2880
aagcacgtgg cccagatcct ggacagccgg atgaacacca agtacgacga gaacgacaag 2940
ctgatccggg aggtgaaggt gatcaccctg aaatccaagc tggtgagcga cttccggaag 3000
gacttccagt tctacaaggt gcgggagatc aacaactacc accacgccca cgacgcctac 3060
ctgaacgccg tggtgggcac cgccctgatc aagaagtacc ccaagctgga gagcgagttc 3120
gtgtacggcg actacaaggt gtacgacgtg cggaagatga tcgccaagag cgagcaggag 3180
atcggcaagg ccaccgccaa gtacttcttc tacagcaaca tcatgaactt cttcaagacc 3240
gagatcaccc tggccaacgg cgagatccgg aagcggcccc tgatcgagac caacggcgag 3300
accggcgaga tcgtgtggga caagggccgg gacttcgcca ccgtgcggaa ggtgctgagc 3360
atgccccagg tgaacatcgt gaagaaaacc gaggtgcaga ccggcggctt cagcaaggag 3420
agcatcctgc ccaagcggaa cagcgacaag ctgatcgccc ggaagaagga ctgggacccc 3480
aagaagtacg gcggcttcga cagccccacc gtggcctaca gcgtgctggt ggtggccaag 3540
gtggagaagg gcaagagcaa gaagctgaaa tccgtgaagg agctgctggg catcaccatc 3600
atggagcgga gcagcttcga gaagaacccc atcgacttcc tggaggccaa gggctacaag 3660
gaggtgaaga aggacctgat catcaagctg cccaagtaca gcctgttcga gctggagaac 3720
ggccggaagc ggatgctggc cagcgccggc gagctgcaga agggcaacga gctggccctg 3780
cccagcaagt acgtgaactt cctgtacctg gccagccact acgagaagct gaagggcagc 3840
cccgaggaca acgagcagaa gcagctgttc gtggagcagc acaagcacta cctggacgag 3900
atcatcgagc agatcagcga gttcagcaag cgggtgatcc tggccgacgc caacctggac 3960
aaggtgctga gcgcctacaa caagcaccgg gacaagccca tccgggagca ggccgagaac 4020
atcatccacc tgttcaccct gaccaacctg ggcgcccccg ccgccttcaa gtacttcgac 4080
accaccatcg accggaagcg gtacaccagc accaaggagg tgctggacgc caccctgatc 4140
caccagagca tcaccggcct gtacgagacc cggatcgacc tgagccagct gggcggcgac 4200
aagcggcccg ccgccaccaa gaaggccggc caggccaaga agaagaaggc cagcgacgcc 4260
aagagcctga ccgcctggag ccggaccctg gtgaccttca aggacgtgtt cgtggacttc 4320
acccgggagg agtggaagct gctggacacc gcccagcaga tcctgtaccg gaacgtgatg 4380
ctggagaact acaagaacct ggtgagcctg ggctaccagc tgaccaagcc cgacgtgatc 4440
ctgcggctgg agaagggcga ggagccctgg ctggtggagc gggagatcca ccaggagacc 4500
caccccgaca gcgagaccgc cttcgagatc aagagcagcg tgagcggcgg caagcggccc 4560
gccgccacca agaaggccgg ccaggccaag aagaagaagg gcagctaccc ctacgacgtg 4620
cccgactacg cctgagcggc cgcttaatta agctgccttc tgcggggctt gccttctggc 4680
catgcccttc ttctctccct tgcacctgta cctcttggtc tttgaataaa gcctgagtag 4740
gaagtctaga aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 4800
aaaaaaaaaaaaaaaaaaaaaaaaaaaaa 4829
<![CDATA[ <210> 68]]>
<![CDATA[ <211> 2591]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 68]]>
gagttcgtgt acggcgacta caaggtgtac gacgtgcgga agatgatcgc caagagcgag 60
caggagatcg gcaaggccac cgccaagtac ttcttctaca gcaacatcat gaacttcttc 120
aagaccgaga tcaccctggc caacggcgag atccggaagc ggcccctgat cgagaccaac 180
ggcgagaccg gcgagatcgt gtgggacaag ggccgggact tcgccaccgt gcggaaggtg 240
ctgagcatgc cccaggtgaa catcgtgaag aaaaccgagg tgcagaccgg cggcttcagc 300
aaggagagca tcctgcccaa gcggaacagc gacaagctga tcgcccggaa gaaggactgg 360
gaccccaaga agtacggcgg cttcgacagc cccaccgtgg cctacagcgt gctggtggtg 420
gccaaggtgg agaagggcaa gagcaagaag ctgaaatccg tgaaggagct gctgggcatc 480
accatcatgg agcggagcag cttcgagaag aaccccatcg acttcctgga ggccaagggc 540
tacaaggagg tgaagaagga cctgatcatc aagctgccca agtacagcct gttcgagctg 600
gagaacggcc ggaagcggat gctggccagc gccggcgagc tgcagaaggg caacgagctg 660
gccctgccca gcaagtacgt gaacttcctg tacctggcca gccactacga gaagctgaag 720
ggcagccccg aggacaacga gcagaagcag ctgttcgtgg agcagcacaa gcactacctg 780
gacgagatca tcgagcagat cagcgagttc agcaagcggg tgatcctggc cgacgccaac 840
ctggacaagg tgctgagcgc ctacaacaag caccgggaca agcccatccg ggagcaggcc 900
gagaacatca tccacctgtt caccctgacc aacctgggcg cccccgccgc cttcaagtac 960
ttcgacacca ccatcgaccg gaagcggtac accagcacca aggaggtgct ggacgccacc 1020
ctgatccacc agagcatcac cggcctgtac gagacccgga tcgacctgag ccagctgggc 1080
ggcgacaagc ggcccgccgc caccaagaag gccggccagg ccaagaagaa gaaggcccgg 1140
gacagcaagg tggagaacaa gaccaagaag ctgcgggtgt tcgaggcctt cgccggcatc 1200
ggcgcccagc ggaaggccct ggagaaggtg cggaaggacg agtacgagat cgtgggcctg 1260
gccgagtggt acgtgcccgc catcgtgatg taccaggcca tccacaacaa cttccacacc 1320
aagctggagt acaagagcgt gagccggggag gagatgatcg actacctgga gaacaagacc 1380
ctgagctgga acagcaagaa ccccgtgagc aacggctact ggaagcggaa gaaggacgac 1440
gagctgaaga tcatctacaa cgccatcaag ctgagcgaga aggagggcaa catcttcgac 1500
atccgggacc tgtacaagcg gaccctgaag aacatcgacc tgctgaccta cagcttcccc 1560
tgccaggacc tgagccagca gggcatccag aagggcatga agcggggcag cggcacccgg 1620
agcggcctgc tgtggggagat cgagcggggcc ctggacagca ccgagaagaa cgacctgccc 1680
aagtacctgc tgatggagaa cgtgggcgcc ctgctgcaca agaagaacga ggaggagctg 1740
aaccagtgga agcagaagct ggagagcctg ggctaccaga acagcatcga ggtgctgaac 1800
gccgccgact tcggcagcag ccaggcccgg cggcgggtgttcatgatcag caccctgaac 1860
gagttcgtgg agctgcccaa gggcgacaag aagcccaaga gcatcaagaa ggtgctgaac 1920
aagatcgtga gcgagaagga catcctgaac aacctgctga agtacaacct gaccgagttc 1980
aagaaaacca agagcaacat caacaaggcc agcctgatcg gctacagcaa gttcaacagc 2040
gagggctacg tgtacgaccc cgagttcacc ggccccaccc tgaccgccag cggcgccaac 2100
agccggatca agatcaagga cggcagcaac atccggaaga tgaacagcga cgagaccttc 2160
ctgtacatcg gcttcgacag ccaggacggc aagcgggtga acgagatcga gttcctgacc 2220
gagaaccaga agatcttcgt gtgcggcaac agcatcagcg tggaggtgct ggaggccatc 2280
atcgacaaga tcggcggccc cagcagcggc ggcaagcggc ccgccgccac caagaaggcc 2340
ggccaggcca agaagaagaa gggcagctac ccctacgacg tgcccgacta cgcctgagcg 2400
gccgcttaat taagctgcct tctgcggggc ttgccttctg gccatgccct tcttctctcc 2460
cttgcacctg tacctcttgg tctttgaata aagcctgagt aggaagtcta gaaaaaaaaa 2520
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2580
aaaaaaaaaa a 2591
<![CDATA[ <210> 69]]>
<![CDATA[ <211> 6010]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 69]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccga caagaagtac agcatcggcc 120
tggccatcgg caccaacagc gtgggctggg ccgtgatcac cgacgagtac aaggtgccca 180
gcaagaagtt caaggtgctg ggcaacaccg accggcacag catcaagaag aacctgatcg 240
gcgccctgct gttcgacagc ggcgagaccg ccgaggccac ccggctgaag cggaccgccc 300
ggcggcggta cacccggcgg aagaaccgga tctgctacct gcaggagatc ttcagcaacg 360
agatggccaa ggtggacgac agcttcttcc accggctgga ggagagcttc ctggtggagg 420
aggacaagaa gcacgagcgg caccccatct tcggcaacat cgtggacgag gtggcctacc 480
acgagaagta ccccaccatc taccacctgc ggaagaagct ggtggacagc accgacaagg 540
ccgacctgcg gctgatctac ctggccctgg cccacatgat caagttccgg ggccacttcc 600
tgatcgaggg cgacctgaac cccgacaaca gcgacgtgga caagctgttc atccagctgg 660
tgcagaccta caaccagctg ttcgaggaga accccatcaa cgccagcggc gtggacgcca 720
aggccatcct gagcgcccgg ctgagcaaga gccggcggct ggagaacctg atcgcccagc 780
tgcccggcga gaagaagaac ggcctgttcg gcaacctgat cgccctgagc ctgggcctga 840
cccccaactt caagagcaac ttcgacctgg ccgaggacgc caagctgcag ctgagcaagg 900
acacctacga cgacgacctg gacaacctgc tggcccagat cggcgaccag tacgccgacc 960
tgttcctggc cgccaagaac ctgagcgacg ccatcctgct gagcgacatc ctgcgggtga 1020
acaccgagat caccaaggcc cccctgagcg ccagcatgat caagcggtac gacgagcacc 1080
accaggacct gaccctgctg aaggccctgg tgcggcagca gctgcccgag aagtacaagg 1140
agatcttctt cgaccagagc aagaacggct acgccggcta catcgacggc ggcgccagcc 1200
aggagggagtt ctacaagttc atcaagccca tcctggagaa gatggacggc accgaggagc 1260
tgctggtgaa gctgaaccgg gaggacctgc tgcggaagca gcggaccttc gacaacggca 1320
gcatccccca ccagatccac ctgggcgagc tgcacgccat cctgcggcgg caggaggact 1380
tctacccctt cctgaaggac aaccgggaga agatcgagaa gatcctgacc ttccggatcc 1440
cctactacgt gggccccctg gcccggggca acagccggtt cgcctggatg acccggaaat 1500
ccgaggagac catcacccccc tggaacttcg aggaggtggt ggacaagggc gccagcgccc 1560
agagcttcat cgagcggatg accaacttcg acaagaacct gcccaacgag aaggtgctgc 1620
ccaagcacag cctgctgtac gagtacttca ccgtgtacaa cgagctgacc aaggtgaagt 1680
acgtgaccga gggcatgcgg aagcccgcct tcctgagcgg cgagcagaag aaggccatcg 1740
tggacctgct gttcaagacc aaccggaagg tgaccgtgaa gcagctgaag gaggactact 1800
tcaagaagat cgagtgcttc gacagcgtgg agatcagcgg cgtggaggac cggttcaacg 1860
ccagcctggg cacctaccac gacctgctga agatcatcaa ggacaaggac ttcctggaca 1920
acgaggagaa cgaggacatc ctggaggaca tcgtgctgac cctgaccctg ttcgaggacc 1980
gggagatgat cgaggagcgg ctgaaaacct acgcccacct gttcgacgac aaggtgatga 2040
agcagctgaa gcggcggcgg tacaccggct ggggccggct gagccggaag ctgatcaacg 2100
gcatccggga caagcagagc ggcaagacca tcctggactt cctgaaatcc gacggcttcg 2160
ccaaccggaa cttcatgcag ctgatccacg acgacagcct gaccttcaag gaggacatcc 2220
agaaggccca ggtgagcggc cagggcgaca gcctgcacga gcacatcgcc aacctggccg 2280
gcagccccgc catcaagaag ggcatcctgc agaccgtgaa ggtggtggac gagctggtga 2340
aggtgatggg ccggcacaag cccgagaaca tcgtgatcga gatggcccgg gagaaccaga 2400
ccacccagaa gggccagaag aacagccggg agcggatgaa gcggatcgag gagggcatca 2460
aggagctggg cagccagatc ctgaaggagc accccgtgga gaacacccag ctgcagaacg 2520
agaagctgta cctgtactac ctgcagaacg gccgggacat gtacgtggac caggagctgg 2580
acatcaaccg gctgagcgac tacgacgtgg ccgccatcgt gccccagagc ttcctgaagg 2640
acgacagcat cgacaacaag gtgctgaccc ggagcgacaa ggcccggggc aagagcgaca 2700
acgtgcccag cgaggaggtg gtgaagaaga tgaagaacta ctggcggcag ctgctgaacg 2760
ccaagctgat cacccagcgg aagttcgaca acctgaccaa ggccgagcgg ggcggcctga 2820
gcgagctgga caaggccggc ttcatcaagc ggcagctggt ggagacccgg cagatcacca 2880
agcacgtggc ccagatcctg gacagccgga tgaacaccaa gtacgacgag aacgacaagc 2940
tgatccggga ggtgaaggtg atcaccctga aatccaagct ggtgagcgac ttccggaagg 3000
acttccagtt ctacaaggtg cgggagatca acaactacca ccacgcccac gacgcctacc 3060
tgaacgccgt ggtgggcacc gccctgatca agaagtaccc caagctggag agcgagttcg 3120
tgtacggcga ctacaaggtg tacgacgtgc ggaagatgat cgccaagagc gagcaggaga 3180
tcggcaaggc caccgccaag tacttcttct acagcaacat catgaacttc ttcaagaccg 3240
agatcaccct ggccaacggc gagatccgga agcggcccct gatcgagacc aacggcgaga 3300
ccggcgagat cgtgtgggac aagggccggg acttcgccac cgtgcggaag gtgctgagca 3360
tgccccaggt gaacatcgtg aagaaaaccg aggtgcagac cggcggcttc agcaaggaga 3420
gcatcctgcc caagcggaac agcgacaagc tgatcgcccg gaagaaggac tgggacccca 3480
agaagtacgg cggcttcgac agccccaccg tggcctacag cgtgctggtg gtggccaagg 3540
tggagaaggg caagagcaag aagctgaaat ccgtgaagga gctgctgggc atcaccatca 3600
tggagcggag cagcttcgag aagaacccca tcgacttcct ggaggccaag ggctacaagg 3660
aggtgaagaa ggacctgatc atcaagctgc ccaagtacag cctgttcgag ctggagaacg 3720
gccggaagcg gatgctggcc agcgccggcg agctgcagaa gggcaacgag ctggccctgc 3780
ccagcaagta cgtgaacttc ctgtacctgg ccagccacta cgagaagctg aagggcagcc 3840
ccgaggacaa cgagcagaag cagctgttcg tggagcagca caagcactac ctggacgaga 3900
tcatcgagca gatcagcgag ttcagcaagc gggtgatcct ggccgacgcc aacctggaca 3960
aggtgctgag cgcctacaac aagcaccggg acaagcccat ccgggagcag gccgagaaca 4020
tcatccacct gttcaccctg accaacctgg gcgcccccgc cgccttcaag tacttcgaca 4080
ccaccatcga ccggaagcgg tacaccagca ccaaggaggt gctggacgcc accctgatcc 4140
accagagcat caccggcctg tacgagaccc ggatcgacct gagccagctg ggcggcgaca 4200
gcggcggcaa gcggcccgcc gccaccaaga aggccggcca ggccaagaag aagaagtcgg 4260
gcgggggtgg ctcagtggat ctgaggacac tcgacgtgtt tagcggatgc ggcggactct 4320
ccgaaggctt ccaccaagcc ggaatttccg acacactctg ggccatgag atgtgggacc 4380
ccgccgctca agccttcaga ctgaataatc ccggctccac cgtgttcacc gaggactgca 4440
acattctgct gaagctggtg atggctggcg aaaccaccaa ctctagaggc cagaggctgc 4500
cccagaaggg agatgtggaa atgctctgtg gaggccctcc ttgccaaggc ttctccggca 4560
tgaacaggtt caactctaga acatacagca agttcaagaa ctctctggtc gtgagctttc 4620
tgagctactg cgactactat agacctagt tctttctgct ggagaacgtg agaaatttcg 4680
tgtccttcaa gaggagcatg gtgctgaagc tgacactgag gtgtctggtg aggatgggct 4740
accagtgcac attcggagtg ctgcaagctg gccagtacgg cgtggcccag accagaagga 4800
gggccatcat tctggctgct gcccccggcg agaaactccc tctgttcccc gagcccctcc 4860
acgtgttcgc ccctagagct tgccagctga gcgtggtggt cgacgataag aagttcgtga 4920
gcaacatcac aaggctgtcc agcggaccct tcagaaccat taccgtgagg gataccatgt 4980
ccgacctccc cgaggtgagg aatggcgcca gcgctctgga gatttcctac aacggcgaac 5040
ctcagagctg gttccaaagg cagctgagag gcgctcagta tcagcccatt ctgagggacc 5100
acatctgcaa agatatgagc gctctggtgg ccgctagaat gagacatatt cctctggccc 5160
ccggcagcga ctggagagat ctgcccaata ttgaggtgag actcagcgac ggaacaatgg 5220
ctagaaaact gaggtacacc catcatgata gaaagaacgg aaggagcagc agcggcgctc 5280
tgagaggagt gtgtagctgc gtggaagctg gcaaggcttg cgatcccgcc gctaggcagt 5340
tcaataccct catcccttgg tgtctgcctc acaccggcaa cagacacaat cattgggctg 5400
gactgtatgg aaggctcgaa tgggacggct ttttcagcac caccgtgacc aatcccgaac 5460
ctatgggcaa gcaaggaagg gtgctccacc ccgagcagca tagagtcgtg tccgtgagag 5520
aatgcgctag aagccaaggc ttccccgaca cctatagact gttcggcaac attctggata 5580
agcacagaca agtgggaaat gctgtccctc ctcctctggc caaggctatc ggactggaga 5640
tcaagctgtg tatgctcgcc aaagctaggg agagcgcttc cgccaagatt aaggaggagg 5700
aggccgccaa ggacggaggt ggcggatcgg gaaagcggcc cgccgccacc aagaaggccg 5760
gtcaggccaa gaagaagaag ggcagctacc cctacgacgt gcccgactac gcctgagcgg 5820
ccgcttaatt aagctgcctt ctgcggggct tgccttctgg ccatgccctt cttctctccc 5880
ttgcacctgt acctcttggt ctttgaataa agcctgagta ggaagtctag aaaaaaaaaa 5940
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 6000
aaaaaaaaaa 6010
<![CDATA[ <210> 70]]>
<![CDATA[ <211> 6202]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 70]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccga caagaagtac agcatcggcc 120
tggccatcgg caccaacagc gtgggctggg ccgtgatcac cgacgagtac aaggtgccca 180
gcaagaagtt caaggtgctg ggcaacaccg accggcacag catcaagaag aacctgatcg 240
gcgccctgct gttcgacagc ggcgagaccg ccgaggccac ccggctgaag cggaccgccc 300
ggcggcggta cacccggcgg aagaaccgga tctgctacct gcaggagatc ttcagcaacg 360
agatggccaa ggtggacgac agcttcttcc accggctgga ggagagcttc ctggtggagg 420
aggacaagaa gcacgagcgg caccccatct tcggcaacat cgtggacgag gtggcctacc 480
acgagaagta ccccaccatc taccacctgc ggaagaagct ggtggacagc accgacaagg 540
ccgacctgcg gctgatctac ctggccctgg cccacatgat caagttccgg ggccacttcc 600
tgatcgaggg cgacctgaac cccgacaaca gcgacgtgga caagctgttc atccagctgg 660
tgcagaccta caaccagctg ttcgaggaga accccatcaa cgccagcggc gtggacgcca 720
aggccatcct gagcgcccgg ctgagcaaga gccggcggct ggagaacctg atcgcccagc 780
tgcccggcga gaagaagaac ggcctgttcg gcaacctgat cgccctgagc ctgggcctga 840
cccccaactt caagagcaac ttcgacctgg ccgaggacgc caagctgcag ctgagcaagg 900
acacctacga cgacgacctg gacaacctgc tggcccagat cggcgaccag tacgccgacc 960
tgttcctggc cgccaagaac ctgagcgacg ccatcctgct gagcgacatc ctgcgggtga 1020
acaccgagat caccaaggcc cccctgagcg ccagcatgat caagcggtac gacgagcacc 1080
accaggacct gaccctgctg aaggccctgg tgcggcagca gctgcccgag aagtacaagg 1140
agatcttctt cgaccagagc aagaacggct acgccggcta catcgacggc ggcgccagcc 1200
aggagggagtt ctacaagttc atcaagccca tcctggagaa gatggacggc accgaggagc 1260
tgctggtgaa gctgaaccgg gaggacctgc tgcggaagca gcggaccttc gacaacggca 1320
gcatccccca ccagatccac ctgggcgagc tgcacgccat cctgcggcgg caggaggact 1380
tctacccctt cctgaaggac aaccgggaga agatcgagaa gatcctgacc ttccggatcc 1440
cctactacgt gggccccctg gcccggggca acagccggtt cgcctggatg acccggaaat 1500
ccgaggagac catcacccccc tggaacttcg aggaggtggt ggacaagggc gccagcgccc 1560
agagcttcat cgagcggatg accaacttcg acaagaacct gcccaacgag aaggtgctgc 1620
ccaagcacag cctgctgtac gagtacttca ccgtgtacaa cgagctgacc aaggtgaagt 1680
acgtgaccga gggcatgcgg aagcccgcct tcctgagcgg cgagcagaag aaggccatcg 1740
tggacctgct gttcaagacc aaccggaagg tgaccgtgaa gcagctgaag gaggactact 1800
tcaagaagat cgagtgcttc gacagcgtgg agatcagcgg cgtggaggac cggttcaacg 1860
ccagcctggg cacctaccac gacctgctga agatcatcaa ggacaaggac ttcctggaca 1920
acgaggagaa cgaggacatc ctggaggaca tcgtgctgac cctgaccctg ttcgaggacc 1980
gggagatgat cgaggagcgg ctgaaaacct acgcccacct gttcgacgac aaggtgatga 2040
agcagctgaa gcggcggcgg tacaccggct ggggccggct gagccggaag ctgatcaacg 2100
gcatccggga caagcagagc ggcaagacca tcctggactt cctgaaatcc gacggcttcg 2160
ccaaccggaa cttcatgcag ctgatccacg acgacagcct gaccttcaag gaggacatcc 2220
agaaggccca ggtgagcggc cagggcgaca gcctgcacga gcacatcgcc aacctggccg 2280
gcagccccgc catcaagaag ggcatcctgc agaccgtgaa ggtggtggac gagctggtga 2340
aggtgatggg ccggcacaag cccgagaaca tcgtgatcga gatggcccgg gagaaccaga 2400
ccacccagaa gggccagaag aacagccggg agcggatgaa gcggatcgag gagggcatca 2460
aggagctggg cagccagatc ctgaaggagc accccgtgga gaacacccag ctgcagaacg 2520
agaagctgta cctgtactac ctgcagaacg gccgggacat gtacgtggac caggagctgg 2580
acatcaaccg gctgagcgac tacgacgtgg ccgccatcgt gccccagagc ttcctgaagg 2640
acgacagcat cgacaacaag gtgctgaccc ggagcgacaa ggcccggggc aagagcgaca 2700
acgtgcccag cgaggaggtg gtgaagaaga tgaagaacta ctggcggcag ctgctgaacg 2760
ccaagctgat cacccagcgg aagttcgaca acctgaccaa ggccgagcgg ggcggcctga 2820
gcgagctgga caaggccggc ttcatcaagc ggcagctggt ggagacccgg cagatcacca 2880
agcacgtggc ccagatcctg gacagccgga tgaacaccaa gtacgacgag aacgacaagc 2940
tgatccggga ggtgaaggtg atcaccctga aatccaagct ggtgagcgac ttccggaagg 3000
acttccagtt ctacaaggtg cgggagatca acaactacca ccacgcccac gacgcctacc 3060
tgaacgccgt ggtgggcacc gccctgatca agaagtaccc caagctggag agcgagttcg 3120
tgtacggcga ctacaaggtg tacgacgtgc ggaagatgat cgccaagagc gagcaggaga 3180
tcggcaaggc caccgccaag tacttcttct acagcaacat catgaacttc ttcaagaccg 3240
agatcaccct ggccaacggc gagatccgga agcggcccct gatcgagacc aacggcgaga 3300
ccggcgagat cgtgtgggac aagggccggg acttcgccac cgtgcggaag gtgctgagca 3360
tgccccaggt gaacatcgtg aagaaaaccg aggtgcagac cggcggcttc agcaaggaga 3420
gcatcctgcc caagcggaac agcgacaagc tgatcgcccg gaagaaggac tgggacccca 3480
agaagtacgg cggcttcgac agccccaccg tggcctacag cgtgctggtg gtggccaagg 3540
tggagaaggg caagagcaag aagctgaaat ccgtgaagga gctgctgggc atcaccatca 3600
tggagcggag cagcttcgag aagaacccca tcgacttcct ggaggccaag ggctacaagg 3660
aggtgaagaa ggacctgatc atcaagctgc ccaagtacag cctgttcgag ctggagaacg 3720
gccggaagcg gatgctggcc agcgccggcg agctgcagaa gggcaacgag ctggccctgc 3780
ccagcaagta cgtgaacttc ctgtacctgg ccagccacta cgagaagctg aagggcagcc 3840
ccgaggacaa cgagcagaag cagctgttcg tggagcagca caagcactac ctggacgaga 3900
tcatcgagca gatcagcgag ttcagcaagc gggtgatcct ggccgacgcc aacctggaca 3960
aggtgctgag cgcctacaac aagcaccggg acaagcccat ccgggagcag gccgagaaca 4020
tcatccacct gttcaccctg accaacctgg gcgcccccgc cgccttcaag tacttcgaca 4080
ccaccatcga ccggaagcgg tacaccagca ccaaggaggt gctggacgcc accctgatcc 4140
accagagcat caccggcctg tacgagaccc ggatcgacct gagccagctg ggcggcgaca 4200
gcgccggcgg cggcggcagc ggcggcggcg gcagcggcgg cggcggcagc ggccccaaga 4260
agaagcggaa ggtggccgcc gccggcagca accacgacca ggagttcgac ccccccaagg 4320
tgtaccccccc cgtgcccgcc gagaagcgga agcccatccg ggtgctgagc ctgttcgacg 4380
gcatcgccac cggcctgctg gtgctgaagg acctgggcat ccaggtggac cggtacatcg 4440
ccagcgaggt gtgcgaggac agcatcaccg tgggcatggt gcggcaccag ggcaagatca 4500
tgtacgtggg cgacgtgcgg agcgtgaccc agaagcacat ccaggagtgg ggccccttcg 4560
acctggtgat cggcggcagc ccctgcaacg acctgagcat cgtgaaccccc gcccggaagg 4620
gcctgtacga gggcaccggc cggctgttct tcgagttcta ccggctgctg cacgacgccc 4680
ggcccaagga gggcgacgac cggcccttct tctggctgtt cgagaacgtg gtggccatgg 4740
gcgtgagcga caagcgggac atcagccggt tcctggagag caaccccgtg atgatcgacg 4800
ccaaggaggt gagcgccgcc caccgggccc ggtacttctg gggcaacctg cccggcatga 4860
accggcccct ggccagcacc gtgaacgaca agctggagct gcaggagtgc ctggagcacg 4920
gccggatcgc caagttcagc aaggtgcgga ccatcaccac ccggagcaac agcatcaagc 4980
agggcaagga ccagcacttc cccgtgttca tgaacgagaa gggaggacatc ctgtggtgca 5040
ccgagatgga gcgggtgttc ggcttccccg tgcactacac cgacgtgagc aacatgagcc 5100
ggctggcccg gcagcggctg ctgggccgga gctggagcgt gcccgtgatc cggcacctgt 5160
tcgcccccct gaaggagtac ttcgcctgcg tgagcagcgg caacagcaac gccaacagcc 5220
ggggccccag cttcagcagc ggcctggtgc ccctgagcct gcggggcagc cacatgaatc 5280
ctctggagat gttcgagaca gtgcccgtgt ggagaaggca acccgtgagg gtgctgagcc 5340
tcttcgagga cattaagaag gagctgacct ctctgggctt tctggaatcc ggcagcgacc 5400
ccggccagct gaaacacgtg gtggacgtga ccgacacagt gaggaaggac gtggaagagt 5460
ggggcccctt tgacctcgtg tatggagcca cacctcctct cggccacaca tgcgataggc 5520
ctcccagctg gtatctcttc cagttccaca gactgctcca gtacgccaga cctaagcccg 5580
gcagccccag acccttcttc tggatgttcg tggacaatct ggtgctgaac aaggaggatc 5640
tggatgtggc cagcagattt ctggagatgg aacccgtgac aatccccgac gtgcatggcg 5700
gctctctgca gaacgccgtg agagtgtggt ccaacatccc cgccattaga agcagacact 5760
gggctctggt gagcgaggag gaactgtctc tgctggccca gaataagcag tcctccaagc 5820
tggccgccaa gtggcccacc aagctggtga agaactgctt tctgcctctg agggagtatt 5880
tcaagtattt cagcaccgaa ctgaccagca gcctgagcgg cggcaagcgg cccgccgcca 5940
ccaagaaggc cggccaggcc aagaagaaga agggcagcta cccctacgac gtgcccgact 6000
acgcctgagc ggccgcttaa ttaagctgcc ttctgcgggg cttgccttct ggccatgccc 6060
ttcttctctc ccttgcacct gtacctcttg gtctttgaat aaagcctgag taggaagtct 6120
agaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 6180
aaaaaaaaaaaaaaaaaaaaaa 6202
<![CDATA[ <210> 71]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic oligonucleotides]]>
<![CDATA[ <220>]]>
<![CDATA[ <221> modified_base]]>
<![CDATA[ <222> (]]>1)..(1)
<![CDATA[ <223> a, c, t, g, unknown or other]]>
<![CDATA[ <220>]]>
<![CDATA[ <221> modified_base]]>
<![CDATA[ <222> (3)..(3)]]>
<![CDATA[ <223> a, c, t, g, unknown or other]]>
<![CDATA[ <400> 71]]>
nbndccdsha grkgghrshv 20
<![CDATA[ <210> 72]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic oligonucleotides]]>
<![CDATA[ <220]]>>]]>
<br/> <![CDATA[ <221>modified_base]]>
<br/> <![CDATA[ <222>(18)..(18)]]>
<br/> <![CDATA[ <223> a, c, t, g, unknown or other]]>
<br/>
<br/> <![CDATA[ <220>]]>
<br/> <![CDATA[ <221>modified_base]]>
<br/> <![CDATA[ <222>(20)..(20)]]>
<br/> <![CDATA[ <223> a, c, t, g, unknown or other]]>
<br/>
<br/> <![CDATA[ <400>72]]>
<br/> <![CDATA[vhsrhggkrg ahsdccdnbn 20
<![CDATA[ <210> 73]]>
<![CDATA[ <211> 10]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic oligonucleotides]]>
<![CDATA[ <220>]]>
<![CDATA[ <221> modified_base]]>
<![CDATA[ <222> (8)..(8)]]>
<![CDATA[ <223> a, c, t, g, unknown or other]]>
<![CDATA[ <400> 73]]>
ccgccatntt 10
<![CDATA[ <210> 74]]>
<![CDATA[ <211> 10]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic oligonucleotides]]>
<![CDATA[ <220>]]>
<![CDATA[ <221> modified_base]]>
<![CDATA[ <222> (3)..(3)]]>
<![CDATA[ <223> a, c, t, g, unknown or other]]>
<![CDATA[ <400> 74]]>
aanatggcgg10
<![CDATA[ <210> 75]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 75]]>
gctggaaacc ttgcacctcg g 21
<![CDATA[ <210> 76]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 76]]>
ctgctgccag tagagggcac a 21
<![CDATA[ <210> 77]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 77]]>
gcccagagag ggggcggagg g 21
<![CDATA[ <210> 78]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 78]]>
acgcggggag caaccaatcg c 21
<![CDATA[ <210> 79]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 79]]>
actggcagca gagatcatcg c 21
<![CDATA[ <210> 80]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 80]]>
gggggcagga gcaggagcgt c 21
<![CDATA[ <210> 81]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 81]]>
cagccttagc gaggcgccct g 21
<![CDATA[ <210> 82]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 82]]>
actcacagga caaggatgcg g 21
<![CDATA[ <210> 83]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 83]]>
agcaaaagaa aatggtaggc g 21
<![CDATA[ <210> 84]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 84]]>
actcagccgg gcagccgagc a 21
<![CDATA[ <210> 85]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 85]]>
cgtaccaggc tgcagggcgc c 21
<![CDATA[ <210> 86]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213]]>> Homo sapiens]]>
<br/>
<br/> <![CDATA[ <400>86]]>
<br/> <![CDATA[agagtggagg aaagaagggt a 21
<![CDATA[ <210> 87]]>
<![CDATA[ <211> 386]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 87]]>
Met Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val Phe Glu Ala
1 5 10 15
Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys Val Arg Lys
20 25 30
Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val Pro Ala Ile
35 40 45
Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys Leu Glu Tyr
50 55 60
Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu Asn Lys Thr
65 70 75 80
Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr Trp Lys Arg
85 90 95
Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile Lys Leu Ser
100 105 110
Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr Lys Arg Thr
115 120 125
Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys Gln Asp Leu
130 135 140
Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser Gly Thr Arg
145 150 155 160
Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser Thr Glu Lys
165 170 175
Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly Ala Leu Leu
180 185 190
His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln Lys Leu Glu
195 200 205
Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala Ala Asp Phe
210 215 220
Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser Thr Leu Asn
225 230 235 240
Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys Ser Ile Lys
245 250 255
Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu Asn Asn Leu
260 265 270
Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser Asn Ile Asn
275 280 285
Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu Gly Tyr Val
290 295 300
Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser Gly Ala Asn
305 310 315 320
Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys Met Asn Ser
325 330 335
Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp Gly Lys Arg
340 345 350
Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile Phe Val Cys
355 360 365
Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile Asp Lys Ile
370 375 380
Gly Gly
385
<![CDATA[ <210> 88]]>
<![CDATA[ <211> 22]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 88]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser
20
<![CDATA[ <210> 89]]>
<![CDATA[ <211> 29]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 89]]>
Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys
1 5 10 15
Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
20 25
<![CDATA[ <210> 90]]>
<![CDATA[ <211> 9]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 90]]>
Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1 5
<![CDATA[ <210> 91]]>
<![CDATA[ <211> 1123]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 91]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asp Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn
115 120 125
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp
645 650 655
Val Pro Asp Tyr Ala Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
660 665 670
Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn
675 680 685
Pro Gly Pro Thr Ser Ala Gly Lys Leu Gly Ser Gly Glu Gly Arg Gly
690 695 700
Ser Leu Leu Thr Cys Gly Asp Val Glu Glu Asn Pro Gly Pro Leu Glu
705 710 715 720
Gly Ser Ser Gly Ser Gly Ser Pro Lys Lys Lys Arg Lys Val Gly Ile
725 730 735
His Gly Val Pro Ala Ala Gly Ser Ser Ser Gly Ser Leu Glu Pro Gly Glu
740 745 750
Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp
755 760 765
Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys
770 775 780
Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg
785 790 795 800
His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys
805 810 815
Gly Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His Gln Arg Thr
820 825 830
His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe
835 840 845
Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu
850 855 860
Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala
865 870 875 880
His Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys
885 890 895
Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr Glu
900 905 910
His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys
915 920 925
Gly Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr
930 935 940
His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser
945 950 955 960
Gly Gly Gly Ser Gly Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg
965 970 975
Thr Leu Val Thr Phe Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Glu
980 985 990
Trp Lys Leu Leu Asp Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met
995 1000 1005
Leu Glu Asn Tyr Lys Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr
1010 1015 1020
Lys Pro Asp Val Ile Leu Arg Leu Glu Lys Gly Glu Glu Pro Trp
1025 1030 1035
Leu Val Glu Arg Glu Ile His Gln Glu Thr His Pro Asp Ser Glu
1040 1045 1050
Thr Ala Phe Glu Ile Lys Ser Ser Val Ser Gly Gly Lys Arg Pro
1055 1060 1065
Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly Ser
1070 1075 1080
Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Ser Gly Gly Lys Arg
1085 1090 1095
Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly
1100 1105 1110
Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1115 1120
<![CDATA[ <210> 92]]>
<![CDATA[ <211> 1123]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 92]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Pro Gly His Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln
340 345 350
Ala Lys Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ala Thr Asn Phe Ser
370 375 380
Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn Pro Gly Pro Thr Ser
385 390 395 400
Ala Gly Lys Leu Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys
405 410 415
Gly Asp Val Glu Glu Asn Pro Gly Pro Leu Glu Gly Ser Ser Gly Ser
420 425 430
Gly Ser Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
435 440 445
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
450 455 460
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asp Leu Val Arg His
465 470 475 480
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
485 490 495
Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His
500 505 510
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
515 520 525
Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly Glu Lys
530 535 540
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn
545 550 555 560
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
565 570 575
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
580 585 590
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
595 600 605
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
610 615 620
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
625 630 635 640
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
645 650 655
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
660 665 670
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
675 680 685
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
690 695 700
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
705 710 715 720
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
725 730 735
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
740 745 750
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
755 760 765
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
770 775 780
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
785 790 795 800
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
805 810 815
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
820 825 830
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
835 840 845
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
850 855 860
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
865 870 875 880
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
885 890 895
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
900 905 910
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
915 920 925
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
930 935 940
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
945 950 955 960
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
965 970 975
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
980 985 990
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
995 1000 1005
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln
1010 1015 1020
Asp Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln
1025 1030 1035
Lys Ile Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu
1040 1045 1050
Ala Ile Ile Asp Lys Ile Gly Gly Pro Ser Gly Gly Lys Arg Pro
1055 1060 1065
Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly Ser
1070 1075 1080
Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Ser Gly Gly Lys Arg
1085 1090 1095
Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly
1100 1105 1110
Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1115 1120
<![CDATA[ <210> 93]]>
<![CDATA[ <211> 3332]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 93]]>
gcggcaaatc cttttctaga agcgatcatc tgaccaccca ccaaagaaca cataccggcg 60
agaagcctta caaatgtccc gagtgcggaa agtccttctc ccagagagcc aatctgaggg 120
ctcatcaaag gacccatacc ggcgaaaagc cctacaaatg ccccgagtgc ggaaaatcct 180
tcagccagct ggcccatctg agagcccacc aaaggacaca caccggagag aaaccctata 240
agtgccccga gtgtggaaaa agcttttccc agagggccaa tctgagggcc catcagagga 300
cccataccgg agagaagcct tataaatgtc ccgagtgcgg aaaaagcttc agcgagagga 360
gccatctgag ggaacatcaa agaacccaca ccggcgaaaa accccaccgga aaaaagacca 420
gcgctagcgg cagcggcggc ggcagcggcg gcgcccggga cagcaaggtg gagaacaaga 480
ccaagaagct gcgggtgttc gaggccttcg ccggcatcgg cgcccagcgg aaggccctgg 540
agaaggtgcg gaaggacgag tacgagatcg tgggcctggc cgagtggtac gtgcccgcca 600
tcgtgatgta ccaggccatc cacaacaact tccacaccaa gctggagtac aagagcgtga 660
gccggggagga gatgatcgac tacctggaga acaagaccct gagctggaac agcaagaacc 720
ccgtgagcaa cggctactgg aagcggaaga aggacgacga gctgaagatc atctacaacg 780
ccatcaagct gagcgagaag gagggcaaca tcttcgacat ccgggacctg tacaagcgga 840
ccctgaagaa catcgacctg ctgacctaca gcttcccctg ccaggacctg agccagcagg 900
gcatccagaa gggcatgaag cggggcagcg gcacccggag cggcctgctg tgggagatcg 960
agcgggccct ggacagcacc gagaagaacg acctgcccaa gtacctgctg atggagaacg 1020
tgggcgccct gctgcacaag aagaacgagg aggagctgaa ccagtggaag cagaagctgg 1080
agagcctggg ctaccagaac agcatcgagg tgctgaacgc cgccgacttc ggcagcagcc 1140
aggcccggcg gcgggtgttc atgatcagca ccctgaacga gttcgtggag ctgcccaagg 1200
gcgacaagaa gcccaagagc atcaagaagg tgctgaacaa gatcgtgagc gagaaggaca 1260
tcctgaacaa cctgctgaag tacaacctga ccgagttcaa gaaaaccaag agcaacatca 1320
acaaggccag cctgatcggc tacagcaagt tcaacagcga gggctacgtg tacgaccccg 1380
agttcaccgg ccccaccctg accgccagcg gcgccaacag ccggatcaag atcaaggacg 1440
gcagcaacat ccggaagatg aacagcgacg agaccttcct gtacatcggc ttcgacagcc 1500
aggacggcaa gcgggtgaac gagatcgagt tcctgaccga gaaccagaag atcttcgtgt 1560
gcggcaacag catcagcgtg gaggtgctgg aggccatcat cgacaagatc ggcggcccca 1620
gcagcggcgg caagcggccc gccgccacca agaaggccgg ccaggccaag aagaagaagg 1680
gcagctaccc ctacgacgtg cccgactacg ccgggtccta cccgtacgac gtgcccgatt 1740
acgccgccac caacttctcg ctgctgaagc aggccggaga tgtggaagaa aaccctggac 1800
ctaccagtgc cggaaagctc ggtagcggag agggtcgggg aagcctgctt acttgcggcg 1860
acgtggaaga gaacccccggt ccgctggagg gttcgtccgg ctccggatcc cccaagaaga 1920
agcggaaggt gggcatccac ggcgtgcccg ccgccggcag cagcggatcc ctggagcccg 1980
gcgaaaaacc ttacaagtgc cccgagtgcg gaaagagctt cagcagaagc gacaaactgg 2040
tgaggcatca aaggacacat accggagaga agccctataa gtgccccgaa tgtggcaaat 2100
ccttttccca gagggctcat ctggaaagac accagaggac ccataccggc gaaaaaccct 2160
acaaatgtcc cgagtgtgga aagagctttt ccgatcccgg ccatctggtc agacatcaga 2220
ggacacatac cggcgaaaag ccttacaagt gtcccgaatg cggaaaatcc ttctccagaa 2280
gcgacaagct ggtgaggcac caaagaaccc acaccggcga aaaaccctat aaatgccccg 2340
agtgcggcaa gtcctttagc cagctggccc atctgagagc ccaccagaga acacacaccg 2400
gagagaagcc ttataagtgt cccgagtgcg gaaagtcctt ctctagagcc gacaatctga 2460
ccgaacatca aaggacacac accggcgaga aaccttataa atgccccgag tgcggaaaaa 2520
gcttttccga ctgcagagat ctggctagac accagagaac ccacaccggc gagaaaccca 2580
ccggcaaaaa gaccagcgct agcggcagcg gcggcggcag cggcggcgac gccaagagcc 2640
tgaccgcctg gagccggacc ctggtgacct tcaaggacgt gttcgtggac ttcacccggg 2700
aggagtggaa gctgctggac accgcccagc agatcctgta ccggaacgtg atgctggaga 2760
actacaagaa cctggtgagc ctgggctacc agctgaccaa gcccgacgtg atcctgcggc 2820
tggagaaggg cgaggagccc tggctggtgg agcggggagat ccaccaggag accccaccccg 2880
acagcgagac cgccttcgag atcaagagca gcgtgagcgg cggcaagcgg cccgccgcca 2940
ccaagaaggc cggccaggcc aagaagaaga agggcagcta cccctacgac gtgcccgact 3000
acgcctgaag cagcggcggc aagcggcccg ccgccaccaa gaaggccggc caggccaaga 3060
agaagaaggg cagctacccc tacgacgtgc ccgactacgc ctgagcggcc gcttaattaa 3120
gctgccttct gcggggcttg ccttctggcc atgcccttct tctctccctt gcacctgtac 3180
ctcttggtct ttgaataaag cctgagtagg aagtctagaa aaaaaaaaaaaaaaaaaaaa 3240
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 3300
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 3332
<![CDATA[ <210> 94]]>
<![CDATA[ <211> 3012]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 94]]>
tataaatgcc ccgagtgcgg aaaaagcttt tccgactgca gagatctggc tagacaccag 60
agaacccaca ccggcgagaa acccaccggc aaaaagacca gcgctagcgg cagcggcggc 120
ggcagcggcg gcgacgccaa gagcctgacc gcctggagcc ggaccctggt gaccttcaag 180
gacgtgttcg tggacttcac ccgggaggag tggaagctgc tggacaccgc ccagcagatc 240
ctgtaccgga acgtgatgct ggagaactac aagaacctgg tgagcctggg ctaccagctg 300
accaagcccg acgtgatcct gcggctggag aagggcgagg agccctggct ggtggagcgg 360
gagatccacc aggagaccca ccccgacagc gagaccgcct tcgagatcaa gagcagcgtg 420
agcagcggcg gcaagcggcc cgccgccacc aagaaggccg gccaggccaa gaagaagaag 480
ggcagctacc cctacgacgt gcccgactac gccgggtcct acccgtacga cgtgcccgat 540
tacgccgcca ccaacttctc gctgctgaag caggccggag atgtggaaga aaaccctgga 600
cctaccagtg ccggaaagct cggtagcgga gagggtcggg gaagcctgct tacttgcggc 660
gacgtggaag agaacccccgg tccgctggag ggttcgtccg gctccggatc ccccaagaag 720
aagcggaagg tgggcatcca cggcgtgccc gccgccggca gcagcggatc cctggagccc 780
ggcgagaaac cttacaaatg ccccgagtgc ggcaagagct tcagcagaag cgacgatctg 840
gtgaggcacc aaagaaccca caccggcgaa aaaccttaca agtgtcccga atgcggaaag 900
tccttcagca gagaggacaa tctgcacacc caccagagaa cacacaccgg agaaaagcct 960
tacaagtgcc ccgaatgcgg caaatccttt tctagaagcg atcatctgac cacccaccaa 1020
agaacacata ccggcgagaa gccttacaaa tgtcccgagt gcggaaagtc cttctcccag 1080
agagccaatc tgagggctca tcaaaggacc cataccggcg aaaagcccta caaatgcccc 1140
gagtgcggaa aatccttcag ccagctggcc catctgagag cccaccaaag gacacacacc 1200
ggagagaaac cctataagtg ccccgagtgt ggaaaaagct tttcccagag ggccaatctg 1260
agggcccatc agaggaccca taccggagag aagccttata aatgtcccga gtgcggaaaa 1320
agcttcagcg agaggagcca tctgagggaa catcaaagaa cccacaccgg cgaaaaaccc 1380
accggaaaaa agaccagcgc tagcggcagc ggcggcggca gcggcggcgc ccgggacagc 1440
aaggtggaga acaagaccaa gaagctgcgg gtgttcgagg ccttcgccgg catcggcgcc 1500
cagcggaagg ccctggagaa ggtgcggaag gacgagtacg agatcgtggg cctggccgag 1560
tggtacgtgc ccgccatcgt gatgtaccag gccatccaca acaacttcca caccaagctg 1620
gagtacaaga gcgtgagccg ggaggagatg atcgactacc tggagaacaa gaccctgagc 1680
tggaacagca agaaccccgt gagcaacggc tactggaagc ggaagaagga cgacgagctg 1740
aagatcatct acaacgccat caagctgagc gagaaggagg gcaacatctt cgacatccgg 1800
gacctgtaca agcggaccct gaagaacatc gacctgctga cctacagctt cccctgccag 1860
gacctgagcc agcagggcat ccagaagggc atgaagcggg gcagcggcac ccggagcggc 1920
ctgctgtggg agatcgagcg ggccctggac agcaccgaga agaacgacct gcccaagtac 1980
ctgctgatgg agaacgtggg cgccctgctg cacaagaaga acgaggagga gctgaaccag 2040
tggaagcaga agctggagag cctgggctac cagaacagca tcgaggtgct gaacgccgcc 2100
gacttcggca gcagccaggc ccggcggcgg gtgttcatga tcagcaccct gaacgagttc 2160
gtggagctgc ccaagggcga caagaagccc aagagcatca agaaggtgct gaacaagatc 2220
gtgagcgaga aggacatcct gaacaacctg ctgaagtaca acctgaccga gttcaagaaa 2280
accaagagca acatcaacaa ggccagcctg atcggctaca gcaagttcaa cagcgagggc 2340
tacgtgtacg accccgagtt caccggcccc accctgaccg ccagcggcgc caacagccgg 2400
atcaagatca aggacggcag caacatccgg aagatgaaca gcgacgagac cttcctgtac 2460
atcggcttcg acagccagga cggcaagcgg gtgaacgaga tcgagttcct gaccgagaac 2520
cagaagatct tcgtgtgcgg caacagcatc agcgtggagg tgctggaggc catcatcgac 2580
aagatcggcg gccccagcgg cggcaagcgg cccgccgcca ccaagaaggc cggccaggcc 2640
aagaagaagaaga agggcagcta cccctacgac gtgcccgact acgcctgaag cagcggcggc 2700
aagcggcccg ccgccaccaa gaaggccggc caggccaaga agaagaaggg cagctacccc 2760
tacgacgtgc ccgactacgc ctgagcggcc gcttaattaa gctgccttct gcggggcttg 2820
ccttctggcc atgcccttct tctctccctt gcacctgtac ctcttggtct ttgaataaag 2880
cctgagtagg aagtctagaa aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2940
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 3000
aaaaaaaaaaaa 3012
<![CDATA[ <210> 95]]>
<![CDATA[ <211> 118]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 95]]>
Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln
1 5 10 15
Ala Lys Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
20 25 30
Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ala Thr Asn Phe Ser
35 40 45
Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn Pro Gly Pro Thr Ser
50 55 60
Ala Gly Lys Leu Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys
65 70 75 80
Gly Asp Val Glu Glu Asn Pro Gly Pro Leu Glu Gly Ser Ser Gly Ser
85 90 95
Gly Ser Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
100 105 110
Ala Gly Ser Ser Gly Ser
115
<![CDATA[ <210> 96]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 96]]>
tgccacttcc ccactaaccc 20
<![CDATA[ <210> 97]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 97]]>
ggccacacaa ggaagctgca 20
<![CDATA[ <210> 98]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 98]]>
ccacacaagg aagctgcagg 20
<![CDATA[ <210> 99]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 99]]>
tgattggaat gcaacccgaa 20
<![CDATA[ <210> 100]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 100]]>
ttttgccctt gctaccccaa 20
<![CDATA[ <210> 101]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 101]]>
agctgatggt atccactagg 20
<![CDATA[ <210> 102]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 102]]>
cacatccaag aatgtagtgg 20
<![CDATA[ <210> 103]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 103]]>
gatacagcca caaagctcac 20
<![CDATA[ <210> 104]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 104]]>
attacataac agaatccagg 20
<![CDATA[ <210> 105]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 105]]>
cccttgactg tgctgccacc 20
<![CDATA[ <210> 106]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 106]]>
cagacgagga acctgaaccc 20
<![CDATA[ <210> 107]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 107]]>
agaatccctt ggggtagcaa 20
<![CDATA[ <210> 108]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 108]]>
cagcactctc gctgaccgca 20
<![CDATA[ <210> 109]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 109]]>
gttgagtcat gtgtactctg 20
<![CDATA[ <210> 110]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 110]]>
aggaacagga tgttacaact 20
<![CDATA[ <210> 111]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 111]]>
ggggccacta gggacaggat 20
<![CDATA[ <210> 112]]>
<![CDATA[ <211> 3650]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 112]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgagaa accttacaaa tgccccgagt gcggcaagag cttcagcaga agcgacgatc 180
tggtgaggca ccaaagaacc cacaccggcg aaaaacctta caagtgtccc gaatgcggaa 240
agtccttcag cagagaggac aatctgcaca cccaccagag aacacacacc ggagaaaagc 300
cttacaagtg ccccgaatgc ggcaaatcct tttctagaag cgatcatctg accacccacc 360
aaagaacaca taccggcgag aagccttaca aatgtcccga gtgcggaaag tccttctccc 420
agagagccaa tctgagggct catcaaagga cccataccgg cgaaaagccc tacaaatgcc 480
ccgagtgcgg aaaatccttc agccagctgg cccatctgag agcccaccaa aggacacaca 540
ccggagagaa accctataag tgccccgagt gtggaaaaag cttttcccag agggccaatc 600
tgagggccca tcagaggacc cataccggag agaagcctta taaatgtccc gagtgcggaa 660
aaagcttcag cgagaggagc catctgaggg aacatcaaag aacccacacc ggcgaaaaac 720
ccaccggaaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gcccgggaca 780
gcaaggtgga gaacaagacc aagaagctgc gggtgttcga ggccttcgcc ggcatcggcg 840
cccagcggaa ggccctggag aaggtgcgga aggacgagta cgagatcgtg ggcctggccg 900
agtggtacgt gcccgccatc gtgatgtacc aggccatcca caacaacttc cacaccaagc 960
tggagtacaa gagcgtgagc cgggaggaga tgatcgacta cctggagaac aagaccctga 1020
gctggaacag caagaaccccc gtgagcaacg gctactggaa gcggaagaag gacgacgagc 1080
tgaagatcat ctacaacgcc atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctgta caagcggacc ctgaagaaca tcgacctgct gacctacagc ttcccctgcc 1200
aggacctgag ccagcagggc atccagaagg gcatgaagcg gggcagcggc acccggagcg 1260
gcctgctgtg ggagatcgag cgggccctgg acagcaccga gaagaacgac ctgcccaagt 1320
acctgctgat ggagaacgtg ggcgccctgc tgcacaagaa gaacgaggag gagctgaacc 1380
agtggaagca gaagctggag agcctgggct accagaacag catcgaggtg ctgaacgccg 1440
ccgacttcgg cagcagccag gcccggcggc gggtgttcat gatcagcacc ctgaacgagt 1500
tcgtggagct gcccaagggc gacaagaagc ccaagagcat caagaaggtg ctgaacaaga 1560
tcgtgagcga gaaggacatc ctgaacaacc tgctgaagta caacctgacc gagttcaaga 1620
aaaccaagag caacatcaac aaggccagcc tgatcggcta cagcaagttc aacagcgagg 1680
gctacgtgta cgaccccgag ttcaccggcc ccaccctgac cgccagcggc gccaacagcc 1740
ggatcaagat caaggacggc agcaacatcc ggaagatgaa cagcgacgag accttcctgt 1800
acatcggctt cgacagccag gacggcaagc gggtgaacga gatcgagttc ctgaccgaga 1860
accagaagat cttcgtgtgc ggcaacagca tcagcgtgga ggtgctggag gccatcatcg 1920
acaagatcgg cggccccagc agcggcggca agcggcccgc cgccaccaag aaggccggcc 1980
aggccaagaa gaagaagggc agctacccct acgacgtgcc cgactacgcc gggtcctacc 2040
cgtacgacgt gcccgattac gccgccacca acttctcgct gctgaagcag gccggagatg 2100
tggaagaaaa ccctggacct accagtgccg gaaagctcgg tagcggagag ggtcggggaa 2160
gcctgcttac ttgcggcgac gtggaagaga accccggtcc gctggagggt tcgtccggct 2220
ccggatcccc caagaagaag cggaaggtgg gcatccacgg cgtgcccgcc gccggcagca 2280
gcggatccct ggagcccggc gaaaaacctt acaagtgccc cgagtgcgga aagagcttca 2340
gcagaagcga caaactggtg aggcatcaaa ggacacatac cggagagaag ccctataagt 2400
gccccgaatg tggcaaatcc ttttccccaga gggctcatct ggaaagacac cagaggaccc 2460
ataccggcga aaaaccctac aaatgtcccg agtgtggaaa gagcttttcc gatcccggcc 2520
atctggtcag acatcagagg acacataccg gcgaaaagcc ttacaagtgt cccgaatgcg 2580
gaaaatcctt ctccagaagc gacaagctgg tgaggcacca aagaacccac accggcgaaa 2640
aaccctataa atgccccgag tgcggcaagt cctttagcca gctggcccat ctgagagccc 2700
accagagaac acacaccgga gagaagcctt ataagtgtcc cgagtgcgga aagtccttct 2760
ctagagccga caatctgacc gaacatcaaa ggacacacac cggcgagaaa ccttataaat 2820
gccccgagtg cggaaaaagc ttttccgact gcagagatct ggctagacac cagagaaccc 2880
acaccggcga gaaacccacc ggcaaaaaga ccagcgctag cggcagcggc ggcggcagcg 2940
gcggcgacgc caagagcctg accgcctgga gccggaccct ggtgaccttc aaggacgtgt 3000
tcgtggactt cacccggggag gagtggaagc tgctggacac cgcccagcag atcctgtacc 3060
ggaacgtgat gctggagaac tacaagaacc tggtgagcct gggctaccag ctgaccaagc 3120
ccgacgtgat cctgcggctg gagaagggcg aggagccctg gctggtggag cgggagatcc 3180
accaggagac ccaccccgac agcgagaccg ccttcgagat caagagcagc gtgagcggcg 3240
gcaagcggcc cgccgccacc aagaaggccg gccaggccaa gaagaagaag ggcagctacc 3300
cctacgacgt gcccgactac gcctgaagca gcggcggcaa gcggcccgcc gccaccaaga 3360
aggccggcca ggccaagaag aagaagggca gctaccccta cgacgtgccc gactacgcct 3420
gagcggccgc ttaattaagc tgccttctgc ggggcttgcc ttctggccat gcccttcttc 3480
tctcccttgc acctgtacct cttggtcttt gaataaagcc tgagtaggaa gtctagaaaa 3540
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 3600
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 3650
<![CDATA[ <210> 113]]>
<![CDATA[ <211> 3650]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 113]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgaaaa accttacaag tgccccgagt gcggaaagag cttcagcaga agcgacaaac 180
tggtgaggca tcaaaggaca cataccggag agaagcccta taagtgcccc gaatgtggca 240
aatccttttc ccagagggct catctggaaa gacaccagag gacccatacc ggcgaaaaac 300
cctacaaatg tcccgagtgt ggaaagagct tttccgatcc cggccatctg gtcagacatc 360
agaggacaca taccggcgaa aagccttaca agtgtcccga atgcggaaaa tccttctcca 420
gaagcgacaa gctggtgagg caccaaagaa cccacaccgg cgaaaaaccc tataaatgcc 480
ccgagtgcgg caagtccttt agccagctgg cccatctgag agcccaccag agaacacaca 540
ccggagagaa gccttataag tgtcccgagt gcggaaagtc cttctctaga gccgacaatc 600
tgaccgaaca tcaaaggaca cacaccggcg agaaacctta taaatgcccc gagtgcggaa 660
aaagcttttc cgactgcaga gatctggcta gacaccagag aacccacacc ggcgagaaac 720
ccaccggcaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gacgccaaga 780
gcctgaccgc ctggagccgg accctggtga ccttcaagga cgtgttcgtg gacttcaccc 840
gggaggagtg gaagctgctg gacaccgccc agcagatcct gtaccggaac gtgatgctgg 900
agaactacaa gaacctggtg agcctgggct accagctgac caagcccgac gtgatcctgc 960
ggctggagaa gggcgaggag ccctggctgg tggagcggga gatccaccag gagacccacc 1020
ccgacagcga gaccgccttc gagatcaaga gcagcgtgag cagcggcggc aagcggcccg 1080
ccgccaccaa gaaggccggc caggccaaga agaagaaggg cagctacccc tacgacgtgc 1140
ccgactacgc cgggtcctac ccgtacgacg tgcccgatta cgccgccacc aacttctcgc 1200
tgctgaagca ggccggagat gtggaagaaa accctggacc taccagtgcc ggaaagctcg 1260
gtagcggaga gggtcgggga agcctgctta cttgcggcga cgtggaagag aaccccggtc 1320
cgctggaggg ttcgtccggc tccggatccc ccaagaagaa gcggaaggtg ggcatccacg 1380
gcgtgcccgc cgccggcagc agcggatccc tggagcccgg cgagaaacct tacaaatgcc 1440
ccgagtgcgg caagagcttc agcagaagcg acgatctggt gaggcaccaa agaacccaca 1500
ccggcgaaaa accttacaag tgtcccgaat gcggaaagtc cttcagcaga gaggacaatc 1560
tgcacaccca ccagagaaca cacaccggag aaaagcctta caagtgcccc gaatgcggca 1620
aatccttttc tagaagcgat catctgacca cccaccaaag aacacatacc ggcgagaagc 1680
cttacaaatg tcccgagtgc ggaaagtcct tctcccagag agccaatctg agggctcatc 1740
aaaggaccca taccggcgaa aagccctaca aatgccccga gtgcggaaaa tccttcagcc 1800
agctggccca tctgagagcc caccaaagga cacacaccgg agagaaaccc tataagtgcc 1860
ccgagtgtgg aaaaagcttt tccccagagggg ccaatctgag ggcccatcag aggacccata 1920
ccggagagaa gccttataaa tgtcccgagt gcggaaaaag cttcagcgag aggagccatc 1980
tgagggaaca tcaaagaacc cacaccggcg aaaaacccac cggaaaaaag accagcgcta 2040
gcggcagcgg cggcggcagc ggcggcgccc gggacagcaa ggtggagaac aagaccaaga 2100
agctgcgggt gttcgaggcc ttcgccggca tcggcgccca gcggaaggcc ctggagaagg 2160
tgcggaagga cgagtacgag atcgtgggcc tggccgagtg gtacgtgccc gccatcgtga 2220
tgtaccaggc catccacaac aacttccaca ccaagctgga gtacaagagc gtgagccggg 2280
aggagatgat cgactacctg gagaacaaga ccctgagctg gaacagcaag aacccccgtga 2340
gcaacggcta ctggaagcgg aagaaggacg acgagctgaa gatcatctac aacgccatca 2400
agctgagcga gaaggagggc aacatcttcg acatccggga cctgtacaag cggaccctga 2460
agaacatcga cctgctgacc tacagcttcc cctgccagga cctgagccag cagggcatcc 2520
agaagggcat gaagcggggc agcggcaccc ggagcggcct gctgtggggag atcgagcggg 2580
ccctggacag caccgagaag aacgacctgc ccaagtacct gctgatggag aacgtgggcg 2640
ccctgctgca caagaagaac gaggaggagc tgaaccagtg gaagcagaag ctggagagcc 2700
tgggctacca gaacagcatc gaggtgctga acgccgccga cttcggcagc agccaggccc 2760
ggcggcgggt gttcatgatc agcaccctga acgagttcgt ggagctgccc aagggcgaca 2820
agaagcccaa gagcatcaag aaggtgctga acaagatcgt gagcgagaag gacatcctga 2880
acaacctgct gaagtacaac ctgaccgagt tcaagaaaac caagagcaac atcaacaagg 2940
ccagcctgat cggctacagc aagttcaaca gcgagggcta cgtgtacgac cccgagttca 3000
ccggccccac cctgaccgcc agcggcgcca acagccggat caagatcaag gacggcagca 3060
acatccggaa gatgaacagc gacgagacct tcctgtacat cggcttcgac agccaggacg 3120
gcaagcgggt gaacgagatc gagttcctga ccgagaacca gaagatcttc gtgtgcggca 3180
acagcatcag cgtggaggtg ctggaggcca tcatcgacaa gatcggcggc cccagcggcg 3240
gcaagcggcc cgccgccacc aagaaggccg gccaggccaa gaagaagaag ggcagctacc 3300
cctacgacgt gcccgactac gcctgaagca gcggcggcaa gcggcccgcc gccaccaaga 3360
aggccggcca ggccaagaag aagaagggca gctaccccta cgacgtgccc gactacgcct 3420
gagcggccgc ttaattaagc tgccttctgc ggggcttgcc ttctggccat gcccttcttc 3480
tctcccttgc acctgtacct cttggtcttt gaataaagcc tgagtaggaa gtctagaaaa 3540
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 3600
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 3650
<![CDATA[ <210> 114]]>
<![CDATA[ <211> 542]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 114]]>
Asn His Asp Gln Glu Phe Asp Pro Pro Lys Val Tyr Pro Pro Val Pro
1 5 10 15
Ala Glu Lys Arg Lys Pro Ile Arg Val Leu Ser Leu Phe Asp Gly Ile
20 25 30
Ala Thr Gly Leu Leu Val Leu Lys Asp Leu Gly Ile Gln Val Asp Arg
35 40 45
Tyr Ile Ala Ser Glu Val Cys Glu Asp Ser Ile Thr Val Gly Met Val
50 55 60
Arg His Gln Gly Lys Ile Met Tyr Val Gly Asp Val Arg Ser Val Thr
65 70 75 80
Gln Lys His Ile Gln Glu Trp Gly Pro Phe Asp Leu Val Ile Gly Gly
85 90 95
Ser Pro Cys Asn Asp Leu Ser Ile Val Asn Pro Ala Arg Lys Gly Leu
100 105 110
Tyr Glu Gly Thr Gly Arg Leu Phe Phe Glu Phe Tyr Arg Leu Leu His
115 120 125
Asp Ala Arg Pro Lys Glu Gly Asp Asp Arg Pro Phe Phe Trp Leu Phe
130 135 140
Glu Asn Val Val Ala Met Gly Val Ser Asp Lys Arg Asp Ile Ser Arg
145 150 155 160
Phe Leu Glu Ser Asn Pro Val Met Ile Asp Ala Lys Glu Val Ser Ala
165 170 175
Ala His Arg Ala Arg Tyr Phe Trp Gly Asn Leu Pro Gly Met Asn Arg
180 185 190
Pro Leu Ala Ser Thr Val Asn Asp Lys Leu Glu Leu Gln Glu Cys Leu
195 200 205
Glu His Gly Arg Ile Ala Lys Phe Ser Lys Val Arg Thr Ile Thr Thr
210 215 220
Arg Ser Asn Ser Ile Lys Gln Gly Lys Asp Gln His Phe Pro Val Phe
225 230 235 240
Met Asn Glu Lys Glu Asp Ile Leu Trp Cys Thr Glu Met Glu Arg Val
245 250 255
Phe Gly Phe Pro Val His Tyr Thr Asp Val Ser Asn Met Ser Arg Leu
260 265 270
Ala Arg Gln Arg Leu Leu Gly Arg Ser Trp Ser Val Pro Val Ile Arg
275 280 285
His Leu Phe Ala Pro Leu Lys Glu Tyr Phe Ala Cys Val Ser Ser Gly
290 295 300
Asn Ser Asn Ala Asn Ser Arg Gly Pro Ser Phe Ser Ser Gly Leu Val
305 310 315 320
Pro Leu Ser Leu Arg Gly Ser His Met Asn Pro Leu Glu Met Phe Glu
325 330 335
Thr Val Pro Val Trp Arg Arg Gln Pro Val Arg Val Leu Ser Leu Phe
340 345 350
Glu Asp Ile Lys Lys Glu Leu Thr Ser Leu Gly Phe Leu Glu Ser Gly
355 360 365
Ser Asp Pro Gly Gln Leu Lys His Val Val Asp Val Thr Asp Thr Val
370 375 380
Arg Lys Asp Val Glu Glu Trp Gly Pro Phe Asp Leu Val Tyr Gly Ala
385 390 395 400
Thr Pro Pro Leu Gly His Thr Cys Asp Arg Pro Pro Ser Trp Tyr Leu
405 410 415
Phe Gln Phe His Arg Leu Leu Gln Tyr Ala Arg Pro Lys Pro Gly Ser
420 425 430
Pro Arg Pro Phe Phe Trp Met Phe Val Asp Asn Leu Val Leu Asn Lys
435 440 445
Glu Asp Leu Asp Val Ala Ser Arg Phe Leu Glu Met Glu Pro Val Thr
450 455 460
Ile Pro Asp Val His Gly Gly Ser Leu Gln Asn Ala Val Arg Val Trp
465 470 475 480
Ser Asn Ile Pro Ala Ile Arg Ser Arg His Trp Ala Leu Val Ser Glu
485 490 495
Glu Glu Leu Ser Leu Leu Ala Gln Asn Lys Gln Ser Ser Lys Leu Ala
500 505 510
Ala Lys Trp Pro Thr Lys Leu Val Lys Asn Cys Phe Leu Pro Leu Arg
515 520 525
Glu Tyr Phe Lys Tyr Phe Ser Thr Glu Leu Thr Ser Ser Leu
530 535 540
<![CDATA[ <210> 115]]>
<![CDATA[ <211> 627]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 115]]>
ctggagcccg gcgagaaacc ctataaatgc cccgaatgcg gaaaaagctt cagccagtcc 60
agctctctgg tgagacatca gaggacacac accggcgaaa agccttataa gtgccccgag 120
tgcggcaagt ccttctctag aagcgatcac ctcaccaatc atcagaggac acataccgga 180
gagaagccct ataagtgccc cgagtgcggc aagagcttta gccagctggc tcatctgaga 240
gctcaccaaa gaacccatac cggcgagaag ccttacaaat gccccgagtg tggaaaatcc 300
ttttcccagt ccagcaacct cgtcagacat caaaggaccc ataccggcga aaagccttac 360
aagtgtcccg agtgcggaaa gtccttctct agatccgaca acctcgtgag gcaccagaga 420
accccacaccg gcgagaaacc ttacaaatgt cccgagtgtg gcaaaagctt ttctagaagc 480
gacgagctgg tgagacatca aagaacccat accggcgaaa aaccttataa gtgtcccgag 540
tgcggcaaat cctttagcca gctggcccat ctgagggccc accagagaac acataccggc 600
gaaaaaccca ccggcaaaaa gacaagc 627
<![CDATA[ <210> 116]]>
<![CDATA[ <211> 2227]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 116]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgagaa accctacaag tgccccgagt gtggaaaaag ctttagccaa agcggcgatc 180
tgaggagaca ccaaagaaca cacaccggcg agaagcccta caaatgtccc gagtgcggaa 240
agagcttcag ccagagcggc catctgaccg agcatcagag aacccatacc ggcgaaaaac 300
cttataagtg ccccgagtgt ggaaagtcct tctccgagag atcccatctg agagaacacc 360
agaggacaca caccggcgaa aaaccttata agtgtcccga gtgcggaaag tccttcagcg 420
atcccggcca tctggtgaga catcaaagga cacataccgg cgaaaaacct tataagtgtc 480
ccgaatgcgg caagagcttt agcagaaacg acacactcac cgaacaccag aggacccaca 540
ccggcgagaa accctacaaa tgccccgagt gcggcaaatc cttttctaga gccgacaatc 600
tgaccgaaca ccagaggacc cataccggag aaaagcctta caaatgtccc gagtgtggca 660
aatccttctc cacccatctg gatctgatta gacaccaaag aacacatacc ggagaaaagc 720
ccaccggaaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gcccgggaca 780
gcaaggtgga gaacaagacc aagaagctgc gggtgttcga ggccttcgcc ggcatcggcg 840
cccagcggaa ggccctggag aaggtgcgga aggacgagta cgagatcgtg ggcctggccg 900
agtggtacgt gcccgccatc gtgatgtacc aggccatcca caacaacttc cacaccaagc 960
tggagtacaa gagcgtgagc cgggaggaga tgatcgacta cctggagaac aagaccctga 1020
gctggaacag caagaaccccc gtgagcaacg gctactggaa gcggaagaag gacgacgagc 1080
tgaagatcat ctacaacgcc atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctgta caagcggacc ctgaagaaca tcgacctgct gacctacagc ttcccctgcc 1200
aggacctgag ccagcagggc atccagaagg gcatgaagcg gggcagcggc acccggagcg 1260
gcctgctgtg ggagatcgag cgggccctgg acagcaccga gaagaacgac ctgcccaagt 1320
acctgctgat ggagaacgtg ggcgccctgc tgcacaagaa gaacgaggag gagctgaacc 1380
agtggaagca gaagctggag agcctgggct accagaacag catcgaggtg ctgaacgccg 1440
ccgacttcgg cagcagccag gcccggcggc gggtgttcat gatcagcacc ctgaacgagt 1500
tcgtggagct gcccaagggc gacaagaagc ccaagagcat caagaaggtg ctgaacaaga 1560
tcgtgagcga gaaggacatc ctgaacaacc tgctgaagta caacctgacc gagttcaaga 1620
aaaccaagag caacatcaac aaggccagcc tgatcggcta cagcaagttc aacagcgagg 1680
gctacgtgta cgaccccgag ttcaccggcc ccaccctgac cgccagcggc gccaacagcc 1740
ggatcaagat caaggacggc agcaacatcc ggaagatgaa cagcgacgag accttcctgt 1800
acatcggctt cgacagccag gacggcaagc gggtgaacga gatcgagttc ctgaccgaga 1860
accagaagat cttcgtgtgc ggcaacagca tcagcgtgga ggtgctggag gccatcatcg 1920
acaagatcgg cggccccagc agcggcggca agcggcccgc cgccaccaag aaggccggcc 1980
aggccaagaa gaagaagggc agctacccct acgacgtgcc cgactacgcc tgagcggccg 2040
cttaattaag ctgccttctg cggggcttgc cttctggcca tgcccttctt ctctcccttg 2100
cacctgtacc tcttggtctt tgaataaagc ctgagtagga agtctagaaaaaaaaaaaaa 2160
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2220
aaaaaaa 2227
<![CDATA[ <210> 117]]>
<![CDATA[ <211> 2227]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 117]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgaaaa accctataag tgccccgaat gtggaaagag cttcagccat accggccatc 180
tgctggaaca ccaaaggaca cacaccggcg agaaacctta caagtgtccc gagtgcggaa 240
aaagcttctc ctccaaaaag gctctcaccg agcaccagag aacacatacc ggcgaaaagc 300
cttataagtg ccccgagtgt ggcaaatcct tttccgactg tagagatctg gccagacatc 360
aaagaaccca caccggagag aaaccttata aatgccccga gtgcggcaag tcctttagcc 420
ataccggcca tctgctggag caccagagga cccataccgg cgagaagcct tacaaatgcc 480
ccgagtgcgg caaaagcttc agcagaaatg acgctctgac cgagcatcaa aggacccata 540
ccggcgaaaa gccctacaag tgtcccgagt gtggaaagtc cttctcccag agcggcgatc 600
tgaggagaca ccagagaaca cacaccggcg agaaacccta taaatgtccc gagtgcggaa 660
agagctttag cgacagcggc aatctgaggg tgcatcaaag aacacacacc ggcgaaaaac 720
ccaccggaaa aaagacaagc gctagcggca gcggcggcgg cagcggcggc gcccgggaca 780
gcaaggtgga gaacaagacc aagaagctgc gggtgttcga ggccttcgcc ggcatcggcg 840
cccagcggaa ggccctggag aaggtgcgga aggacgagta cgagatcgtg ggcctggccg 900
agtggtacgt gcccgccatc gtgatgtacc aggccatcca caacaacttc cacaccaagc 960
tggagtacaa gagcgtgagc cgggaggaga tgatcgacta cctggagaac aagaccctga 1020
gctggaacag caagaaccccc gtgagcaacg gctactggaa gcggaagaag gacgacgagc 1080
tgaagatcat ctacaacgcc atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctgta caagcggacc ctgaagaaca tcgacctgct gacctacagc ttcccctgcc 1200
aggacctgag ccagcagggc atccagaagg gcatgaagcg gggcagcggc acccggagcg 1260
gcctgctgtg ggagatcgag cgggccctgg acagcaccga gaagaacgac ctgcccaagt 1320
acctgctgat ggagaacgtg ggcgccctgc tgcacaagaa gaacgaggag gagctgaacc 1380
agtggaagca gaagctggag agcctgggct accagaacag catcgaggtg ctgaacgccg 1440
ccgacttcgg cagcagccag gcccggcggc gggtgttcat gatcagcacc ctgaacgagt 1500
tcgtggagct gcccaagggc gacaagaagc ccaagagcat caagaaggtg ctgaacaaga 1560
tcgtgagcga gaaggacatc ctgaacaacc tgctgaagta caacctgacc gagttcaaga 1620
aaaccaagag caacatcaac aaggccagcc tgatcggcta cagcaagttc aacagcgagg 1680
gctacgtgta cgaccccgag ttcaccggcc ccaccctgac cgccagcggc gccaacagcc 1740
ggatcaagat caaggacggc agcaacatcc ggaagatgaa cagcgacgag accttcctgt 1800
acatcggctt cgacagccag gacggcaagc gggtgaacga gatcgagttc ctgaccgaga 1860
accagaagat cttcgtgtgc ggcaacagca tcagcgtgga ggtgctggag gccatcatcg 1920
acaagatcgg cggccccagc agcggcggca agcggcccgc cgccaccaag aaggccggcc 1980
aggccaagaa gaagaagggc agctacccct acgacgtgcc cgactacgcc tgagcggccg 2040
cttaattaag ctgccttctg cggggcttgc cttctggcca tgcccttctt ctctcccttg 2100
cacctgtacc tcttggtctt tgaataaagc ctgagtagga agtctagaaaaaaaaaaaaa 2160
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2220
aaaaaaa 2227
<![CDATA[ <210> 118]]>
<![CDATA[ <211> 2227]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 118]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgagaa accctataaa tgccccgaat gcggaaaaag cttcagccag tccagctctc 180
tggtgagaca tcagaggaca cacaccggcg aaaagcctta taagtgcccc gagtgcggca 240
agtccttctc tagaagcgat cacctcacca atcatcagag gacacatacc ggagagaagc 300
cctataagtg ccccgagtgc ggcaagagct ttagccagct ggctcatctg agagctcacc 360
aaagaaccca taccggcgag aagccttaca aatgccccga gtgtggaaaa tccttttccc 420
agtccagcaa cctcgtcaga catcaaagga cccataccgg cgaaaagcct tacaagtgtc 480
ccgagtgcgg aaagtccttc tctagatccg acaacctcgt gaggcaccag agaacccaca 540
ccggcgagaa accttacaaa tgtcccgagt gtggcaaaag cttttctaga agcgacgagc 600
tggtgagaca tcaaagaacc cataccggcg aaaaacctta taagtgtccc gagtgcggca 660
aatcctttag ccagctggcc catctgaggg cccaccagag aacacatacc ggcgaaaaac 720
ccaccggcaa aaagacaagc gctagcggca gcggcggcgg cagcggcggc gcccgggaca 780
gcaaggtgga gaacaagacc aagaagctgc gggtgttcga ggccttcgcc ggcatcggcg 840
cccagcggaa ggccctggag aaggtgcgga aggacgagta cgagatcgtg ggcctggccg 900
agtggtacgt gcccgccatc gtgatgtacc aggccatcca caacaacttc cacaccaagc 960
tggagtacaa gagcgtgagc cgggaggaga tgatcgacta cctggagaac aagaccctga 1020
gctggaacag caagaaccccc gtgagcaacg gctactggaa gcggaagaag gacgacgagc 1080
tgaagatcat ctacaacgcc atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctgta caagcggacc ctgaagaaca tcgacctgct gacctacagc ttcccctgcc 1200
aggacctgag ccagcagggc atccagaagg gcatgaagcg gggcagcggc acccggagcg 1260
gcctgctgtg ggagatcgag cgggccctgg acagcaccga gaagaacgac ctgcccaagt 1320
acctgctgat ggagaacgtg ggcgccctgc tgcacaagaa gaacgaggag gagctgaacc 1380
agtggaagca gaagctggag agcctgggct accagaacag catcgaggtg ctgaacgccg 1440
ccgacttcgg cagcagccag gcccggcggc gggtgttcat gatcagcacc ctgaacgagt 1500
tcgtggagct gcccaagggc gacaagaagc ccaagagcat caagaaggtg ctgaacaaga 1560
tcgtgagcga gaaggacatc ctgaacaacc tgctgaagta caacctgacc gagttcaaga 1620
aaaccaagag caacatcaac aaggccagcc tgatcggcta cagcaagttc aacagcgagg 1680
gctacgtgta cgaccccgag ttcaccggcc ccaccctgac cgccagcggc gccaacagcc 1740
ggatcaagat caaggacggc agcaacatcc ggaagatgaa cagcgacgag accttcctgt 1800
acatcggctt cgacagccag gacggcaagc gggtgaacga gatcgagttc ctgaccgaga 1860
accagaagat cttcgtgtgc ggcaacagca tcagcgtgga ggtgctggag gccatcatcg 1920
acaagatcgg cggccccagc agcggcggca agcggcccgc cgccaccaag aaggccggcc 1980
aggccaagaa gaagaagggc agctacccct acgacgtgcc cgactacgcc tgagcggccg 2040
cttaattaag ctgccttctg cggggcttgc cttctggcca tgcccttctt ctctcccttg 2100
cacctgtacc tcttggtctt tgaataaagc ctgagtagga agtctagaaaaaaaaaaaaa 2160
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2220
aaaaaaa 2227
<![CDATA[ <210> 119]]>
<![CDATA[ <211> 5705]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 119]]>
aaggaaataa gagagaaaag aagagtaaga agaaatataa gagccaccat ggcccccaag 60
aagaagcgga aggtgggcat ccacggcgtg cccgccgccg acaagaagta cagcatcggc 120
ctggccatcg gcaccaacag cgtgggctgg gccgtgatca ccgacgagta caaggtgccc 180
agcaagaagt tcaaggtgct gggcaacacc gaccggcaca gcatcaagaa gaacctgatc 240
ggcgccctgc tgttcgacag cggcgagacc gccgaggcca cccggctgaa gcggaccgcc 300
cggcggcggt acacccggcg gaagaaccgg atctgctacc tgcaggagat cttcagcaac 360
gagatggcca aggtggacga cagcttcttc caccggctgg aggagagctt cctggtggag 420
gaggacaaga agcacgagcg gcacccccatc ttcggcaaca tcgtggacga ggtggcctac 480
cacgagaagt accccaccat ctaccacctg cggaagaagc tggtggacag caccgacaag 540
gccgacctgc ggctgatcta cctggccctg gcccacatga tcaagttccg gggccacttc 600
ctgatcgagg gcgacctgaa ccccgacaac agcgacgtgg acaagctgtt catccagctg 660
gtgcagacct acaaccagct gttcgaggag aaccccatca acgccagcgg cgtggacgcc 720
aaggccatcc tgagcgcccg gctgagcaag agccggcggc tggagaacct gatcgcccag 780
ctgcccggcg agaagaagaa cggcctgttc ggcaacctga tcgccctgag cctgggcctg 840
accccccaact tcaagagcaa cttcgacctg gccgaggacg ccaagctgca gctgagcaag 900
gacacctacg acgacgacct ggacaacctg ctggcccaga tcggcgacca gtacgccgac 960
ctgttcctgg ccgccaagaa cctgagcgac gccatcctgc tgagcgacat cctgcgggtg 1020
aacaccgaga tcaccaaggc ccccctgagc gccagcatga tcaagcggta cgacgagcac 1080
caccaggacc tgaccctgct gaaggccctg gtgcggcagc agctgcccga gaagtacaag 1140
gagatcttct tcgaccagag caagaacggc tacgccggct acatcgacgg cggcgccagc 1200
caggagggagt tctacaagtt catcaagccc atcctggaga agatggacgg caccgaggag 1260
ctgctggtga agctgaaccg ggaggacctg ctgcggaagc agcggacctt cgacaacggc 1320
agcatccccc accagatcca cctgggcgag ctgcacgcca tcctgcggcg gcaggaggac 1380
ttctacccct tcctgaagga caaccgggag aagatcgaga agatcctgac cttccggatc 1440
ccctactacg tgggccccct ggcccggggc aacagccggt tcgcctggat gacccggaaa 1500
tccgaggaga ccatcacccc ctggaacttc gaggaggtgg tggacaaggg cgccagcgcc 1560
cagagcttca tcgagcggat gaccaacttc gacaagaacc tgcccaacga gaaggtgctg 1620
cccaagcaca gcctgctgta cgagtacttc accgtgtaca acgagctgac caaggtgaag 1680
tacgtgaccg agggcatgcg gaagcccgcc ttcctgagcg gcgagcagaa gaaggccatc 1740
gtggacctgc tgttcaagac caaccggaag gtgaccgtga agcagctgaa ggaggactac 1800
ttcaagaagaga tcgagtgctt cgacagcgtg gagatcagcg gcgtggagga ccggttcaac 1860
gccagcctgg gcacctacca cgacctgctg aagatcatca aggacaagga cttcctggac 1920
aacgaggaga acgaggacat cctggaggac atcgtgctga ccctgaccct gttcgaggac 1980
cgggagatga tcgaggagcg gctgaaaacc tacgcccacc tgttcgacga caaggtgatg 2040
aagcagctga agcggcggcg gtacaccggc tggggccggc tgagccggaa gctgatcaac 2100
ggcatccggg acaagcagag cggcaagacc atcctggact tcctgaaatc cgacggcttc 2160
gccaaccgga acttcatgca gctgatccac gacgacagcc tgaccttcaa ggaggacatc 2220
cagaaggccc aggtgagcgg ccagggcgac agcctgcacg agcacatcgc caacctggcc 2280
ggcagccccg ccatcaagaa gggcatcctg cagaccgtga aggtggtgga cgagctggtg 2340
aaggtgatgg gccggcacaa gcccgagaac atcgtgatcg agatggcccg ggagaaccag 2400
accacccaga agggccagaa gaacagccgg gagcggatga agcggatcga ggagggcatc 2460
aaggagctgg gcagccagat cctgaaggag caccccgtgg agaacaccca gctgcagaac 2520
gagaagctgt acctgtacta cctgcagaac ggccgggaca tgtacgtgga ccaggagctg 2580
gacatcaacc ggctgagcga ctacgacgtg gccgccatcg tgccccagag cttcctgaag 2640
gacgacagca tcgacaacaa ggtgctgacc cggagcgaca aggcccgggg caagagcgac 2700
aacgtgccca gcgaggaggt ggtgaagaag atgaagaact actggcggca gctgctgaac 2760
gccaagctga tcacccagcg gaagttcgac aacctgacca aggccgagcg gggcggcctg 2820
agcgagctgg acaaggccgg cttcatcaag cggcagctgg tggagacccg gcagatcacc 2880
aagcacgtgg cccagatcct ggacagccgg atgaacacca agtacgacga gaacgacaag 2940
ctgatccggg aggtgaaggt gatcaccctg aaatccaagc tggtgagcga cttccggaag 3000
gacttccagt tctacaaggt gcgggagatc aacaactacc accacgccca cgacgcctac 3060
ctgaacgccg tggtgggcac cgccctgatc aagaagtacc ccaagctgga gagcgagttc 3120
gtgtacggcg actacaaggt gtacgacgtg cggaagatga tcgccaagag cgagcaggag 3180
atcggcaagg ccaccgccaa gtacttcttc tacagcaaca tcatgaactt cttcaagacc 3240
gagatcaccc tggccaacgg cgagatccgg aagcggcccc tgatcgagac caacggcgag 3300
accggcgaga tcgtgtggga caagggccgg gacttcgcca ccgtgcggaa ggtgctgagc 3360
atgccccagg tgaacatcgt gaagaaaacc gaggtgcaga ccggcggctt cagcaaggag 3420
agcatcctgc ccaagcggaa cagcgacaag ctgatcgccc ggaagaagga ctgggacccc 3480
aagaagtacg gcggcttcga cagccccacc gtggcctaca gcgtgctggt ggtggccaag 3540
gtggagaagg gcaagagcaa gaagctgaaa tccgtgaagg agctgctggg catcaccatc 3600
atggagcgga gcagcttcga gaagaacccc atcgacttcc tggaggccaa gggctacaag 3660
gaggtgaaga aggacctgat catcaagctg cccaagtaca gcctgttcga gctggagaac 3720
ggccggaagc ggatgctggc cagcgccggc gagctgcaga agggcaacga gctggccctg 3780
cccagcaagt acgtgaactt cctgtacctg gccagccact acgagaagct gaagggcagc 3840
cccgaggaca acgagcagaa gcagctgttc gtggagcagc acaagcacta cctggacgag 3900
atcatcgagc agatcagcga gttcagcaag cgggtgatcc tggccgacgc caacctggac 3960
aaggtgctga gcgcctacaa caagcaccgg gacaagccca tccgggagca ggccgagaac 4020
atcatccacc tgttcaccct gaccaacctg ggcgcccccg ccgccttcaa gtacttcgac 4080
accaccatcg accggaagcg gtacaccagc accaaggagg tgctggacgc caccctgatc 4140
caccagagca tcaccggcct gtacgagacc cggatcgacc tgagccagct gggcggcgac 4200
aagcggcccg ccgccaccaa gaaggccggc caggccaaga agaagaaggc ccgggacagc 4260
aaggtggaga acaagaccaa gaagctgcgg gtgttcgagg ccttcgccgg catcggcgcc 4320
cagcggaagg ccctggagaa ggtgcggaag gacgagtacg agatcgtggg cctggccgag 4380
tggtacgtgc ccgccatcgt gatgtaccag gccatccaca acaacttcca caccaagctg 4440
gagtacaaga gcgtgagccg ggaggagatg atcgactacc tggagaacaa gaccctgagc 4500
tggaacagca agaacccccgt gagcaacggc tactggaagc ggaagaagga cgacgagctg 4560
aagatcatct acaacgccat caagctgagc gagaaggagg gcaacatctt cgacatccgg 4620
gacctgtaca agcggaccct gaagaacatc gacctgctga cctacagctt cccctgccag 4680
gacctgagcc agcagggcat ccagaagggc atgaagcggg gcagcggcac ccggagcggc 4740
ctgctgtggg agatcgagcg ggccctggac agcaccgaga agaacgacct gcccaagtac 4800
ctgctgatgg agaacgtggg cgccctgctg cacaagaaga acgaggagga gctgaaccag 4860
tggaagcaga agctggagag cctgggctac cagaacagca tcgaggtgct gaacgccgcc 4920
gacttcggca gcagccaggc ccggcggcgg gtgttcatga tcagcaccct gaacgagttc 4980
gtggagctgc ccaagggcga caagaagccc aagagcatca agaaggtgct gaacaagatc 5040
gtgagcgaga aggacatcct gaacaacctg ctgaagtaca acctgaccga gttcaagaaa 5100
accaagagca acatcaacaa ggccagcctg atcggctaca gcaagttcaa cagcgagggc 5160
tacgtgtacg accccgagtt caccggcccc accctgaccg ccagcggcgc caacagccgg 5220
atcaagatca aggacggcag caacatccgg aagatgaaca gcgacgagac cttcctgtac 5280
atcggcttcg acagccagga cggcaagcgg gtgaacgaga tcgagttcct gaccgagaac 5340
cagaagatct tcgtgtgcgg caacagcatc agcgtggagg tgctggaggc catcatcgac 5400
aagatcggcg gccccagcag cggcggcaag cggcccgccg ccaccaagaa ggccggccag 5460
gccaagaaga agaagggcag ctacccctac gacgtgcccg actacgcctg agcggccgct 5520
taattaagct gccttctgcg gggcttgcct tctggccatg cccttcttct ctcccttgca 5580
cctgtacctc ttggtctttg aataaagcct gagtaggaag tctagaaaaaaaaaaaaaaa 5640
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 5700
aaaaa 5705
<![CDATA[ <210> 120]]>
<![CDATA[ <211> 18]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 120]]>
Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu Asn Pro
1 5 10 15
GlyPro
<![CDATA[ <210> 121]]>
<![CDATA[ <211> 1092]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 121]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asp Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn
115 120 125
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp
645 650 655
Val Pro Asp Tyr Ala Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
660 665 670
Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn
675 680 685
Pro Gly Pro Thr Ser Ala Gly Lys Leu Gly Ser Gly Glu Gly Arg Gly
690 695 700
Ser Leu Leu Thr Cys Gly Asp Val Glu Glu Asn Pro Gly Pro Leu Glu
705 710 715 720
Gly Ser Ser Gly Ser Gly Ser Pro Lys Lys Lys Arg Lys Val Gly Ile
725 730 735
His Gly Val Pro Ala Ala Gly Ser Ser Ser Gly Ser Leu Glu Pro Gly Glu
740 745 750
Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp
755 760 765
Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys
770 775 780
Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg
785 790 795 800
His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys
805 810 815
Gly Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His Gln Arg Thr
820 825 830
His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe
835 840 845
Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu
850 855 860
Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala
865 870 875 880
His Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys
885 890 895
Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr Glu
900 905 910
His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys
915 920 925
Gly Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr
930 935 940
His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser
945 950 955 960
Gly Gly Gly Ser Gly Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg
965 970 975
Thr Leu Val Thr Phe Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Glu
980 985 990
Trp Lys Leu Leu Asp Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met
995 1000 1005
Leu Glu Asn Tyr Lys Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr
1010 1015 1020
Lys Pro Asp Val Ile Leu Arg Leu Glu Lys Gly Glu Glu Pro Trp
1025 1030 1035
Leu Val Glu Arg Glu Ile His Gln Glu Thr His Pro Asp Ser Glu
1040 1045 1050
Thr Ala Phe Glu Ile Lys Ser Ser Val Ser Gly Gly Lys Arg Pro
1055 1060 1065
Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly Ser
1070 1075 1080
Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1085 1090
<![CDATA[ <210> 122]]>
<![CDATA[ <211> 1092]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 122]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Pro Gly His Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln
340 345 350
Ala Lys Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ala Thr Asn Phe Ser
370 375 380
Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn Pro Gly Pro Thr Ser
385 390 395 400
Ala Gly Lys Leu Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys
405 410 415
Gly Asp Val Glu Glu Asn Pro Gly Pro Leu Glu Gly Ser Ser Gly Ser
420 425 430
Gly Ser Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
435 440 445
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
450 455 460
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asp Leu Val Arg His
465 470 475 480
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
485 490 495
Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His
500 505 510
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
515 520 525
Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly Glu Lys
530 535 540
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn
545 550 555 560
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
565 570 575
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
580 585 590
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
595 600 605
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
610 615 620
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
625 630 635 640
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
645 650 655
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
660 665 670
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
675 680 685
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
690 695 700
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
705 710 715 720
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
725 730 735
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
740 745 750
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
755 760 765
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
770 775 780
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
785 790 795 800
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
805 810 815
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
820 825 830
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
835 840 845
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
850 855 860
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
865 870 875 880
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
885 890 895
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
900 905 910
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
915 920 925
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
930 935 940
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
945 950 955 960
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
965 970 975
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
980 985 990
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
995 1000 1005
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln
1010 1015 1020
Asp Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln
1025 1030 1035
Lys Ile Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu
1040 1045 1050
Ala Ile Ile Asp Lys Ile Gly Gly Pro Ser Gly Gly Lys Arg Pro
1055 1060 1065
Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly Ser
1070 1075 1080
Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1085 1090
<![CDATA[ <210> 123]]>
<![CDATA[ <211> 355]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 123]]>
cagagaagga ggaagttaat tcacattctt aattttttct aagggcaaaaaaaaaaaaaa 60
aatgcaccag ctcattttcc atctctgctt gggtcatcag tgtgcattgt gagcctgtac 120
aaaggcctta gacggggaat gctgccgaga gcatcacctt ttatgtcttc ttttatatga 180
aatgtgccac ttccccacta accctggctc tgggctctgc ctctgctctc ctgatggtgt 240
gtttatggtg gattcagcat tctgggccac acaaggaagc tgcagggggt gtccaagttc 300
acatgtcccc gcattccagg cgaatgtttc tgacattgag caatgatatg gctct 355
<![CDATA[ <210> 124]]>
<![CDATA[ <211> 46]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 124]]>
Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn
1 5 10 15
Pro Gly Pro Thr Ser Ala Gly Lys Leu Gly Ser Gly Glu Gly Arg Gly
20 25 30
Ser Leu Leu Thr Cys Gly Asp Val Glu Glu Asn Pro Gly Pro
35 40 45
<![CDATA[ <210> 125]]>
<![CDATA[ <211> 19]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 125]]>
Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn
1 5 10 15
Pro Gly Pro
<![CDATA[ <210> 126]]>
<![CDATA[ <211> 18]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 126]]>
Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn
1 5 10 15
Pro Gly
<![CDATA[ <210> 127]]>
<![CDATA[ <211> 27]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 127]]>
Pro Thr Ser Ala Gly Lys Leu Gly Ser Gly Glu Gly Arg Gly Ser Leu
1 5 10 15
Leu Thr Cys Gly Asp Val Glu Glu Asn Pro Gly
20 25
<![CDATA[ <210> 128]]>
<![CDATA[ <211> 1]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 128]]>
Pro
1
<![CDATA[ <210> 129]]>
<![CDATA[ <211> 652]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 129]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asp Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn
115 120 125
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly Ser
645 650
<![CDATA[ <210> 130]]>
<![CDATA[ <211> 2234]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 130]]>
gggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ccggcgagaa accttacaaa tgccccgagt gcggcaagag cttcagcaga agcgacgatc 180
tggtgaggca ccaaagaacc cacaccggcg aaaaacctta caagtgtccc gaatgcggaa 240
agtccttcag cagagaggac aatctgcaca cccaccagag aacacacacc ggagaaaagc 300
cttacaagtg ccccgaatgc ggcaaatcct tttctagaag cgatcatctg accacccacc 360
aaagaacaca taccggcgag aagccttaca aatgtcccga gtgcggaaag tccttctccc 420
agagagccaa tctgagggct catcaaagga cccataccgg cgaaaagccc tacaaatgcc 480
ccgagtgcgg aaaatccttc agccagctgg cccatctgag agcccaccaa aggacacaca 540
ccggagagaa accctataag tgccccgagt gtggaaaaag cttttcccag agggccaatc 600
tgagggccca tcagaggacc cataccggag agaagcctta taaatgtccc gagtgcggaa 660
aaagcttcag cgagaggagc catctgaggg aacatcaaag aacccacacc ggcgaaaaac 720
ccaccggaaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gcccgggaca 780
gcaaggtgga gaacaagacc aagaagctgc gggtgttcga ggccttcgcc ggcatcggcg 840
cccagcggaa ggccctggag aaggtgcgga aggacgagta cgagatcgtg ggcctggccg 900
agtggtacgt gcccgccatc gtgatgtacc aggccatcca caacaacttc cacaccaagc 960
tggagtacaa gagcgtgagc cgggaggaga tgatcgacta cctggagaac aagaccctga 1020
gctggaacag caagaaccccc gtgagcaacg gctactggaa gcggaagaag gacgacgagc 1080
tgaagatcat ctacaacgcc atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctgta caagcggacc ctgaagaaca tcgacctgct gacctacagc ttcccctgcc 1200
aggacctgag ccagcagggc atccagaagg gcatgaagcg gggcagcggc acccggagcg 1260
gcctgctgtg ggagatcgag cgggccctgg acagcaccga gaagaacgac ctgcccaagt 1320
acctgctgat ggagaacgtg ggcgccctgc tgcacaagaa gaacgaggag gagctgaacc 1380
agtggaagca gaagctggag agcctgggct accagaacag catcgaggtg ctgaacgccg 1440
ccgacttcgg cagcagccag gcccggcggc gggtgttcat gatcagcacc ctgaacgagt 1500
tcgtggagct gcccaagggc gacaagaagc ccaagagcat caagaaggtg ctgaacaaga 1560
tcgtgagcga gaaggacatc ctgaacaacc tgctgaagta caacctgacc gagttcaaga 1620
aaaccaagag caacatcaac aaggccagcc tgatcggcta cagcaagttc aacagcgagg 1680
gctacgtgta cgaccccgag ttcaccggcc ccaccctgac cgccagcggc gccaacagcc 1740
ggatcaagat caaggacggc agcaacatcc ggaagatgaa cagcgacgag accttcctgt 1800
acatcggctt cgacagccag gacggcaagc gggtgaacga gatcgagttc ctgaccgaga 1860
accagaagat cttcgtgtgc ggcaacagca tcagcgtgga ggtgctggag gccatcatcg 1920
acaagatcgg cggccccagc agcggcggca agcggcccgc cgccaccaag aaggccggcc 1980
aggccaagaa gaagaagggc agctgagcgg ccgcttaatt aagctgcctt ctgcggggct 2040
tgccttctgg ccatgccctt cttctctccc ttgcacctgt acctcttggt ctttgaataa 2100
agcctgagta ggaagtctag aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2160
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2220
aaaaaaaaaaaaaa 2234
<![CDATA[ <210> 131]]>
<![CDATA[ <211> 627]]>
<![CDATA[ <212> RNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 131]]>
cuggagcccg gcgagaaacc uuacaaaugc cccgagugcg gcaagagcuu cagcagaagc 60
gacgaucugg ugaggcacca aagaacccac accggcgaaa aaccuuacaa gugucccgaa 120
ugcggaaagu ccuucagcag agaggacaau cugcacaccc accagagaac acacaccgga 180
gaaaagccuu acaagugccc cgaaugcggc aaauccuuuu cuagaagcga ucaucugacc 240
accccaccaaa gaacacauac cggcgagaag ccuuacaaau guccccgagug cggaaagucc 300
uucucccaga gagccaaucu gagggcucau caaaggaccc auaccggcga aaagcccuac 360
aaaugccccg agugcggaaa auccuucagc cagcuggccc aucugagagc ccaccaaagg 420
acacacaccg gagagaaacc cuauaagugc cccgagugug gaaaaagcuu uucccagagg 480
gccaaucuga gggcccauca gaggacccau accggagaga agccuuauaa augucccgag 540
ugcggaaaaa gcuucagcga gaggagccau cugagggaac aucaaagaac ccacaccggc 600
gaaaaaccca ccggaaaaaa gaccagc 627
<![CDATA[ <210> ]]>132
<![CDATA[ <211> 1155]]>
<![CDATA[ <212> RNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 132]]>
agcaaggugg agaacaagac caagaagcug cggguguucg aggccuucgc cggcaucggc 60
gcccagcgga aggcccugga gaaggugcgg aaggacgagu acgagaucgu gggccuggcc 120
gaguguacg ugcccgccau cgugauguac caggccaucc acaacaacuu ccacaccaag 180
cuggaguaca agagcgugag ccgggaggag augaucgacu accuggagaa caagacccug 240
agcuggaaca gcaagaaccc cgugagcaac ggcuacugga agcggaagaa ggacgacgag 300
cugaagauca ucuacaacgc caucaagcug agcgagaagg agggcaacau cuucgacauc 360
cgggaccugu acaagcggac ccugaagaac aucgaccugc ugaccuacag cuuccccugc 420
caggaccuga gccagcaggg cauccagaag ggcaugaagc ggggcagcgg cacccggagc 480
ggccugcugu gggagaucga gcgggcccug gacagcaccg agaagaacga ccugcccaag 540
uaccugcuga uggagaacgu gggcgcccug cugcacaaga agaacgagga ggagcugaac 600
caguggaagc agaagcugga gagccugggc uaccagaaca gcaucgaggu gcugaacgcc 660
gccgacuucg gcagcagcca ggcccggcgg cggguguuca ugaucagcac ccugaacgag 720
uucguggagc ugcccaaggg cgacaagaag cccaagagca ucaagaaggu gcugaacaag 780
aucgugagcg agaaggacau ccugaacaac cugcugaagu acaaccugac cgaguucaag 840
aaaaccaaga gcaacaucaa caaggccagc cugaucggcu acagcaaguu caacagcgag 900
ggcuacgugu acgaccccga guucaccggc cccacccuga ccgccagcgg cgccaacagc 960
cggaucaaga ucaaggacgg cagcaacauc cggaagauga acagcgacga gaccuuccug 1020
uacaucggcu ucgacagcca ggacggcaag cgggugaacg agaucgaguu ccugaccgag 1080
aaccagaaga ucuucgugug cggcaacagc aucagcgugg aggugcugga ggccaucauc 1140
gacaagaucg gcggc 1155
<![CDATA[ <210> 133]]>
<![CDATA[ <211> 690]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 133]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asp Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn
115 120 125
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp
645 650 655
Val Pro Asp Tyr Ala Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
660 665 670
Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn
675 680 685
Pro Gly
690
<![CDATA[ <210> 134]]>
<![CDATA[ <211> 366]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 134]]>
Pro Leu Glu Gly Ser Ser Ser Gly Ser Gly Ser Pro Lys Lys Lys Lys Arg Lys
1 5 10 15
Val Gly Ile His Gly Val Pro Ala Ala Gly Ser Ser Gly Ser Leu Glu
20 25 30
Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
35 40 45
Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
50 55 60
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala His
65 70 75 80
Leu Glu Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
85 90 95
Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His
100 105 110
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
115 120 125
Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His
130 135 140
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
145 150 155 160
Gln Leu Ala His Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys
165 170 175
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ala Asp Asn
180 185 190
Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
195 200 205
Pro Glu Cys Gly Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala Arg His
210 215 220
Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr Ser Ala
225 230 235 240
Ser Gly Ser Gly Gly Gly Ser Gly Gly Asp Ala Lys Ser Leu Thr Ala
245 250 255
Trp Ser Arg Thr Leu Val Thr Phe Lys Asp Val Phe Val Asp Phe Thr
260 265 270
Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala Gln Gln Ile Leu Tyr Arg
275 280 285
Asn Val Met Leu Glu Asn Tyr Lys Asn Leu Val Ser Leu Gly Tyr Gln
290 295 300
Leu Thr Lys Pro Asp Val Ile Leu Arg Leu Glu Lys Gly Glu Glu Pro
305 310 315 320
Trp Leu Val Glu Arg Glu Ile His Gln Glu Thr His Pro Asp Ser Glu
325 330 335
Thr Ala Phe Glu Ile Lys Ser Ser Val Ser Gly Gly Lys Arg Pro Ala
340 345 350
Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly Ser
355 360 365
<![CDATA[ <210> 135]]>
<![CDATA[ <211> 6]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 135]]>
Pro Lys Lys Lys Arg Lys
1 5
<![CDATA[ <210> 136]]>
<![CDATA[ <211> 15]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 136]]>
Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys
1 5 10 15
<![CDATA[ <210> 137]]>
<![CDATA[ <211> 13]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 137]]>
Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp
1 5 10
<![CDATA[ <210> 138]]>
<![CDATA[ <211> 10]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 138]]>
Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly
1 5 10
<![CDATA[ <210> 139]]>
<![CDATA[ <211> 358]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 139]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Thr Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Thr Lys Asn Ser Leu Thr Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Gln Ser Gly Asp Leu Arg Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Ala
115 120 125
Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Asp Ser Gly Asn Leu Arg Val His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Thr Ser Gly Glu Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[ <210> 140]]>
<![CDATA[ <211> 358]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 140]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Ser Pro Ala Asp Leu Thr Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp Asn Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Ser Ser
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser His Ser Leu Thr Glu His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Asn Asp Ala Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Asn Asp Ala Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[ <210> 141]]>
<![CDATA[ <211> 358]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 141]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Pro Gly His Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[ <210> 142]]>
<![CDATA[ <211> 358]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 142]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser His Thr Gly His Leu Leu Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Lys Lys Asp Leu Thr Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp
115 120 125
Leu Arg Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala His Leu Glu Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ser Asp Lys Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Thr Asp Thr Leu Arg Asp His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[ <210> 143]]>
<![CDATA[ <211> 358]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 143]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser His Thr Gly His Leu Leu Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Thr Ser Gly Asn Leu Thr Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Thr Ser Gly Asn Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His
115 120 125
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Thr His Leu Asp Leu Ile Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[ <210> 144]]>
<![CDATA[ <211> 358]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 144]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro Gly Ala Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Asn His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His
115 120 125
Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr Asn His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Asp Pro Gly His Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe
245 250 255
Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp
260 265 270
Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys
275 280 285
Asn Leu Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu
290 295 300
Arg Leu Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His
305 310 315 320
Gln Glu Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser
325 330 335
Val Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[ <210> 145]]>
<![CDATA[ <211> 652]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 145]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Ala Leu Thr Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Pro Gly His Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly His
115 120 125
Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Thr Lys Asn Ser Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly Ser
645 650
<![CDATA[ <210> 146]]>
<![CDATA[ <211> 652]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 146]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Thr Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Arg Asp Glu Leu Asn Val His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Asn His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Ser Pro Ala Asp
115 120 125
Leu Thr Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr Asn His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Ser Lys Lys Ala Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Thr His Leu Asp Leu Ile Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly Ser
645 650
<![CDATA[ <210> 147]]>
<![CDATA[ <211> 652]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 147]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Ser Gly His Leu Thr Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Pro Gly His
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Thr Leu Thr Glu His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Thr His Leu Asp Leu Ile Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly Ser
645 650
<![CDATA[ <210> 148]]>
<![CDATA[ <211> 652]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 148]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser His Thr Gly His Leu Leu Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Ser Lys Lys Ala Leu Thr Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser His Thr Gly His
115 120 125
Leu Leu Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Ala Leu Thr Glu His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Ser Gly Asn Leu Arg Val His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly Ser
645 650
<![CDATA[ <210> 149]]>
<![CDATA[ <211> 652]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 149]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Ser Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Ser Asp His Leu Thr Asn His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Gln Leu Ala His Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Asn
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asn Leu Val Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ser Asp Glu Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Gln Leu Ala His Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr
625 630 635 640
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly Ser
645 650
<![CDATA[ <210> 150]]>
<![CDATA[ <211> 1519]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 150]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val
20 25 30
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
35 40 45
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
50 55 60
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
65 70 75 80
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
85 90 95
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
100 105 110
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
115 120 125
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
130 135 140
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
145 150 155 160
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
165 170 175
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
180 185 190
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
195 200 205
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
210 215 220
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
225 230 235 240
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
245 250 255
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
260 265 270
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
275 280 285
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
290 295 300
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
305 310 315 320
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
325 330 335
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
340 345 350
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
355 360 365
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
370 375 380
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
385 390 395 400
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
405 410 415
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
420 425 430
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
435 440 445
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
450 455 460
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
465 470 475 480
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
485 490 495
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
500 505 510
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
515 520 525
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
530 535 540
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
545 550 555 560
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
565 570 575
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
580 585 590
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
595 600 605
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
610 615 620
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
625 630 635 640
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
645 650 655
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
660 665 670
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
675 680 685
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
690 695 700
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
705 710 715 720
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
725 730 735
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
740 745 750
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
755 760 765
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
770 775 780
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
785 790 795 800
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
805 810 815
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
820 825 830
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
835 840 845
Leu Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys
850 855 860
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Ala Arg
865 870 875 880
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
885 890 895
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
900 905 910
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
915 920 925
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
930 935 940
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
945 950 955 960
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
965 970 975
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
980 985 990
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
995 1000 1005
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu
1010 1015 1020
Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile
1025 1030 1035
Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe
1040 1045 1050
Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu
1055 1060 1065
Ala Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly
1070 1075 1080
Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr
1085 1090 1095
Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys
1100 1105 1110
Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro
1115 1120 1125
Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp
1130 1135 1140
Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser
1145 1150 1155
Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu
1160 1165 1170
Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser
1175 1180 1185
Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr
1190 1195 1200
Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser
1205 1210 1215
Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala
1220 1225 1230
Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr
1235 1240 1245
Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly
1250 1255 1260
Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His
1265 1270 1275
Lys His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser
1280 1285 1290
Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser
1295 1300 1305
Ala Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu
1310 1315 1320
Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala
1325 1330 1335
Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr
1340 1345 1350
Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile
1355 1360 1365
Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly
1370 1375 1380
Asp Lys Arg Pro Ala Ala Thr Lys Lys Lys Ala Gly Gln Ala Lys Lys
1385 1390 1395
Lys Lys Ala Ser Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr
1400 1405 1410
Leu Val Thr Phe Lys Asp Val Phe Val Asp Phe Thr Arg Glu Glu Glu
1415 1420 1425
Trp Lys Leu Leu Asp Thr Ala Gln Gln Ile Leu Tyr Arg Asn Val
1430 1435 1440
Met Leu Glu Asn Tyr Lys Asn Leu Val Ser Leu Gly Tyr Gln Leu
1445 1450 1455
Thr Lys Pro Asp Val Ile Leu Arg Leu Glu Lys Gly Glu Glu Pro
1460 1465 1470
Trp Leu Val Glu Arg Glu Ile His Gln Glu Thr His Pro Asp Ser
1475 1480 1485
Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Ser Gly Gly Lys Arg
1490 1495 1500
Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly
1505 1510 1515
Ser
<![CDATA[ <210> 151]]>
<![CDATA[ <211> 1811]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 151]]> Met Ala Pro Lys Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala 1 5 10 15 Ala Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val 20 25 30 Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe 35 40 45 Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile 50 55 60 Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu 65 70 75 80 Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys 85 90 95 Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser 100 105 110 Phe Phe His Arg Leu Glu Glu Ser PHE Leu Val Glu Glu Glu ASP LYS LYS LYS LYS LYS LYS LYS LYS 115 120 HIS GLU ARG HIS Pro Ile PHE GLE Val ASP GLU Val Ala Tyr 130 135 His Glu Lys Tyr His Leu ARG LYS LYS LYS LYS LYS LYS Leu Val ASP 145 150 155 160 Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His 165 170 175 Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro 180 185 190 Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr 195 200 205 Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala 210 215 220 Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn 225 230 235 240 Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn 245 250 255 Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe 260 265 270 Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp 275 280 285 Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp 290 295 300 Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp 305 310 315 320 Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser 325 330 335 Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys 340 345 350 Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe 355 360 365 Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser 370 375 380 Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp 385 390 395 400 Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu ASP Leu Leu ARG 410 415 LYS GLN ARG THR PHR PHE Asn Gly Serle Pro His Gln Ile His Leu 420 425 430 GLU HIS ALA ILE Leu ARG Gln Gln Glu Asr Pro PHE 4 35 440 445 leu lys asp asn arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile 450 455 460 Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp 465 470 475 480 Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu 485 490 495 Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr 500 505 510 Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser 515 520 525 Leu Leu Tyr Glu Tyr Phe Thr Val Ty r Asn Glu Leu Thr Lys Val Lys 530 535 540 Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln 545 550 555 560 Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr 565 570 575 Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp 580 585 590 Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly 595 600 605 Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp 610 615 620 Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr 625 630 635 640 Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala 645 650 655 His Leu Phe Asp A sp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr 660 665 670 Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp 675 680 685 Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe 690 695 700 Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe 705 710 715 720 Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu 725 730 735 His Glu His Ile Ala Asn Leu A la Gly Ser Pro Ala Ile Lys Lys Gly 740 745 750 Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly 755 760 765 Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln 770 775 780 Thr Thr GLN LYS GLY GLN LSN Serg Glu ARG MET LYS ARG ILE 785 795 800 GLU GLY ILE LYS GLU Leu GLN ILE Leu Ly Glu His Pro 805 Val Gl Gln Leu GLN Asn Glu LYS Leu Tyr Leu Tyr Tyr Leu 820 825 830 Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg 835 840 845 Leu Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys 850 855 860 Asp Asp Ser Ile Asp Asn LYS VAL Leu Thr ASP LYS ALA ALA ALA ARG 865 870 880 Gly LYS Sern Val Pro Serg Val Val Val Lys Met Lys 885 895 ARG Gln Leu Leu asn Ala LYS Leu Ile Thr Gln ARG LYS 900 905 910 Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp 915 920 925 Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr 930 935 940 Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp 945 950 955 960 Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser 965 970 975 Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg 980 985 990 G lu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val 995 1000 1005 Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu 1010 1015 1020 Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys M et Ile 1025 1030 1035 Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe 1040 1045 1050 Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu 1055 1060 1065 Ala Asn Gly Glu Ile Ar g Lys Arg Pro Leu Ile Glu Thr Asn Gly 1070 1075 1080 Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr 1085 1090 1095 Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys 1100 1105 1110 Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro 1115 1120 1125 Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp 1130 1135 1140 Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser 1145 1150 1155 Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu 1160 1165 1170 Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser 1175 1180 1185 Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr 1190 1 195 1200 Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser 1205 1210 1215 Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala 1220 1225 1230 Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro S er Lys Tyr 1235 1240 1245 Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly 1250 1255 1260 Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His 1265 1270 1275 Lys His Ty r Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser 1280 1285 1290 Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser 1295 1300 1305 Ala Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu 1310 1315 1320 Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala 1325 1330 1335 Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr 1340 1345 1350 Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile 1355 136 0 1365 Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly 1370 1375 1380 Asp Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys 1385 1390 1395 Lys Lys Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Lys Leu 1400 1405 1410 Arg Val Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala 1415 1420 1425 Leu Glu Lys Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala 1430 1435 1440 Glu Trp Tyr Val Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn 1445 1450 1455 Asn Phe His Thr Lys Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu 1460 1465 1470 Met Ile Asp Tyr Leu Glu Asn Lys Thr Leu Ser Trp Asn Ser Lys 1475 1480 1485 As n Pro Val Ser Asn Gly Tyr Trp Lys Arg Lys Lys Asp Asp Glu 1490 1495 1500 Leu Lys Ile Ile Tyr Asn Ala Ile Lys Leu Ser Glu Lys Glu Gly 1505 1510 1515 Asn Ile Phe Asp Ile Arg Asp Leu Tyr Lys Arg Thr Leu Lys Asn 1520 1525 1530 Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys Gln Asp Leu Ser Gln 1535 1540 1545 Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser Gly Thr Arg Ser 1550 1555 1560 Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser Thr Glu Lys 15 65 1570 1575 Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly Ala Leu 1580 1585 1590 Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln Lys 1595 1600 1605 Leu Glu Ser Leu Gly Tyr Gln Asn S er Ile Glu Val Leu Asn Ala 1610 1615 1620 Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile 1625 1630 1635 Ser Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys 1640 1645 1650 Pro Lys Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys 1655 1660 1665 Asp Ile Leu Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys 1670 1675 1680 Lys Thr Lys Ser Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser 1685 1690 16 95 Lys Phe Asn Ser Glu Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly 1700 1705 1710 Pro Thr Leu Thr Ala Ser Gly Ala Asn Ser Arg Ile Lys Ile Lys 1715 1720 1725 Asp Gly Ser Asn Ile Arg Lys Met Asn Ser Asp Glu Thr Phe Leu 1730 173 5 1740 Tyr Ile Gly Phe Asp Ser Gln Asp Gly Lys Arg Val Asn Glu Ile 1745 1750 1755 Glu Phe Leu Thr Glu Asn Gln Lys Ile Phe Val Cys Gly Asn Ser 1760 1765 1770 Ile Ser Val Glu Val Leu Glu Ala Ile Ile Asp Lys Ile Gly G ly 1775 1780 1785 Pro Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly 1790 1795 1800 Gln Ala Lys Lys Lys Lys Lys Gly Ser 1805 1810 <![CDATA[ <210> 152]]>
<![CDATA[ <211> 1913]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 152]]> Met Ala Pro Lys Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala 1 5 10 15 Ala Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val 20 25 30 Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe 35 40 45 Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile 50 55 60 Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu 65 70 75 80 Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys 85 90 95 Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser 100 105 110 Phe Phe His Arg Leu Glu Glu Ser PHE Leu Val Glu Glu Glu ASP LYS LYS LYS LYS LYS LYS LYS LYS 115 120 HIS GLU ARG HIS Pro Ile PHE GLE Val ASP GLU Val Ala Tyr 130 135 His Glu Lys Tyr His Leu ARG LYS LYS LYS LYS LYS LYS Leu Val ASP 145 150 155 160 Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His 165 170 175 Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro 180 185 190 Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr 195 200 205 Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala 210 215 220 Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn 225 230 235 240 Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn 245 250 255 Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe 260 265 270 Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp 275 280 285 Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp 290 295 300 Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp 305 310 315 320 Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser 325 330 335 Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys 340 345 350 Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe 355 360 365 Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser 370 375 380 Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp 385 390 395 400 Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu ASP Leu Leu ARG 410 415 LYS GLN ARG THR PHR PHE Asn Gly Serle Pro His Gln Ile His Leu 420 425 430 GLU HIS ALA ILE Leu ARG Gln Gln Glu Asr Pro PHE 4 35 440 445 leu lys asp asn arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile 450 455 460 Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp 465 470 475 480 Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu 485 490 495 Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr 500 505 510 Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser 515 520 525 Leu Leu Tyr Glu Tyr Phe Thr Val Ty r Asn Glu Leu Thr Lys Val Lys 530 535 540 Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln 545 550 555 560 Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr 565 570 575 Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp 580 585 590 Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly 595 600 605 Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp 610 615 620 Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr 625 630 635 640 Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala 645 650 655 His Leu Phe Asp A sp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr 660 665 670 Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp 675 680 685 Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe 690 695 700 Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe 705 710 715 720 Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu 725 730 735 His Glu His Ile Ala Asn Leu A la Gly Ser Pro Ala Ile Lys Lys Gly 740 745 750 Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly 755 760 765 Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln 770 775 780 Thr Thr GLN LYS GLY GLN LSN Serg Glu ARG MET LYS ARG ILE 785 795 800 GLU GLY ILE LYS GLU Leu GLN ILE Leu Ly Glu His Pro 805 Val Gl Gln Leu GLN Asn Glu LYS Leu Tyr Leu Tyr Tyr Leu 820 825 830 Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg 835 840 845 Leu Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys 850 855 860 Asp Asp Ser Ile Asp Asn LYS VAL Leu Thr ASP LYS ALA ALA ALA ARG 865 870 880 Gly LYS Sern Val Pro Serg Val Val Val Lys Met Lys 885 895 ARG Gln Leu Leu asn Ala LYS Leu Ile Thr Gln ARG LYS 900 905 910 Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp 915 920 925 Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr 930 935 940 Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp 945 950 955 960 Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser 965 970 975 Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg 980 985 990 G lu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val 995 1000 1005 Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu 1010 1015 1020 Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys M et Ile 1025 1030 1035 Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe 1040 1045 1050 Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu 1055 1060 1065 Ala Asn Gly Glu Ile Ar g Lys Arg Pro Leu Ile Glu Thr Asn Gly 1070 1075 1080 Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr 1085 1090 1095 Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys 1100 1105 1110 Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro 1115 1120 1125 Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp 1130 1135 1140 Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser 1145 1150 1155 Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu 1160 1165 1170 Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser 1175 1180 1185 Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr 1190 1 195 1200 Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser 1205 1210 1215 Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala 1220 1225 1230 Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro S er Lys Tyr 1235 1240 1245 Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly 1250 1255 1260 Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His 1265 1270 1275 Lys His Ty r Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser 1280 1285 1290 Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser 1295 1300 1305 Ala Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu 1310 1315 1320 Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala 1325 1330 1335 Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr 1340 1345 1350 Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile 1355 136 0 1365 Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly 1370 1375 1380 Asp Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln 1385 1390 1395 Ala Lys Lys Lys Lys Ser Gly Gly Gly Gly Ser Val Asp Leu Arg 1400 1405 1410 Thr Leu Asp Val Phe Ser Gly Cys Gly Gly Leu Ser Glu Gly Phe 1415 1420 1425 His Gln Ala Gly Ile Ser Asp Thr Leu Trp Ala Ile Glu Met Trp 1430 1435 1440 Asp Pro Ala Ala Gln Ala Phe Arg Leu Asn Asn Pro Gly Ser Thr 1445 1450 1455 Val Phe Thr Glu Asp Cys Asn Ile Leu Leu Lys Leu Val Met Ala 1460 1465 1470 Gly Glu Thr Asn Ser Arg Gly Gln Arg Leu Pro Gln Lys Gly 1475 1480 1485 Asp Val Glu M et Leu Cys Gly Gly Pro Pro Cys Gln Gly Phe Ser 1490 1495 1500 Gly Met Asn Arg Phe Asn Ser Arg Thr Tyr Ser Lys Phe Lys Asn 1505 1510 1515 Ser Leu Val Val Ser Phe Leu Ser Tyr Cys Asp Tyr Tyr Arg Pro 1520 1525 1530 Arg Phe Phe Leu Leu Glu Asn Val Arg Asn Phe Val Ser Phe Lys 1535 1540 1545 Arg Ser Met Val Leu Lys Leu Thr Leu Arg Cys Leu Val Arg Met 1550 1555 1560 Gly Tyr Gln Cys Thr Phe Gly Val Leu Gln Ala Gly Gln Tyr Gly 1565 1570 1575 Val Ala Gln Thr Arg Arg Arg Ala Ile Ile Ile Leu Ala Ala Ala Pro 1580 1585 1590 Gly Glu Lys Leu Pro Leu Phe Pro Glu Pro Leu His Val Phe Ala 1595 1600 1605 Pro Arg Ala Cys Gln Leu Ser Val Val Val Asp Asp Lys Lys Phe 1610 1615 1620 Val Ser Asn Ile Thr Arg Leu Ser Ser Gly Pro Phe Arg Thr Ile 1625 1630 1635 Thr Val Arg Asp Thr Met Ser Asp Leu Pro Glu Val Arg Asn Gly 1640 1645 1650 Ala Ser Ala Leu Glu Ile S er Tyr Asn Gly Glu Pro Gln Ser Trp 1655 1660 1665 Phe Gln Arg Gln Leu Arg Gly Ala Gln Tyr Gln Pro Ile Leu Arg 1670 1675 1680 Asp His Ile Cys Lys Asp Met Ser Ala Leu Val Ala Ala Arg Met 1685 1690 1695 Arg His Ile Pro Leu Ala Pro Gly Ser Asp Trp Arg Asp Leu Pro 1700 1705 1710 Asn Ile Glu Val Arg Leu Ser Asp Gly Thr Met Ala Arg Lys Leu 1715 1720 1725 Arg Tyr Thr His His Asp Arg Lys Asn Gly Arg Ser Ser Ser Ser Gly 1730 1735 1740 Ala. Leu ARG GLY VAL CYS VAL GLU Ala Gly Lys Ala Cys 1745 1755 ASP Pro Ala Ala ARG Gln PHR Leu Ile Pro TRP CYS Leu 1760 1770 PRO His Thr Gly ass n aRG HIS Asn His Tyr Gly Leu Tyr Gly 1775 1780 1785 Arg Leu Glu Trp Asp Gly Phe Phe Ser Thr Thr Val Thr Asn Pro 1790 1795 1800 Glu Pro Met Gly Lys Gln Gly Arg Val Leu His Pro Glu Gln His 1805 1810 1815 Arg Val Val Ser Val Arg Glu Cys Ala Arg Ser Gln Gly Phe Pro 1820 1825 1830 Asp Thr Tyr Arg Leu Phe Gly Asn Ile Leu Asp Lys His Arg Gln 1835 1840 1845 Val Gly Asn Ala Val Pro Pro Pro Leu Ala Lys Ala Ile Gly Leu 1850 1855 1860 Glu Ile Lys Leu Cys Met Leu Ala Lys Ala Arg Glu Ser Ala Ser 1865 1870 1875 Ala Lys Ile Lys Glu Glu Glu Ala Ala Lys Asp Gly Gly Gly Gly 1880 1885 1890 Ser Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys 1895 1900 1905 L ys Lys Lys Gly Ser 1910 <![CDATA[ <210> 153]]>
<![CDATA[ <211> 1977]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 153]]> Met Ala Pro Lys Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala 1 5 10 15 Ala Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val 20 25 30 Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe 35 40 45 Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile 50 55 60 Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu 65 70 75 80 Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys 85 90 95 Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser 100 105 110 Phe Phe His Arg Leu Glu Glu Ser PHE Leu Val Glu Glu Glu ASP LYS LYS LYS LYS LYS LYS LYS LYS 115 120 HIS GLU ARG HIS Pro Ile PHE GLE Val ASP GLU Val Ala Tyr 130 135 His Glu Lys Tyr His Leu ARG LYS LYS LYS LYS LYS LYS Leu Val ASP 145 150 155 160 Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His 165 170 175 Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro 180 185 190 Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr 195 200 205 Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala 210 215 220 Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn 225 230 235 240 Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn 245 250 255 Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe 260 265 270 Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp 275 280 285 Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp 290 295 300 Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp 305 310 315 320 Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser 325 330 335 Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys 340 345 350 Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe 355 360 365 Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser 370 375 380 Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp 385 390 395 400 Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu ASP Leu Leu ARG 410 415 LYS GLN ARG THR PHR PHE Asn Gly Serle Pro His Gln Ile His Leu 420 425 430 GLU HIS ALA ILE Leu ARG Gln Gln Glu Asr Pro PHE 4 35 440 445 leu lys asp asn arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile 450 455 460 Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp 465 470 475 480 Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu 485 490 495 Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr 500 505 510 Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser 515 520 525 Leu Leu Tyr Glu Tyr Phe Thr Val Ty r Asn Glu Leu Thr Lys Val Lys 530 535 540 Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln 545 550 555 560 Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr 565 570 575 Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp 580 585 590 Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly 595 600 605 Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp 610 615 620 Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr 625 630 635 640 Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala 645 650 655 His Leu Phe Asp A sp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr 660 665 670 Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp 675 680 685 Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe 690 695 700 Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe 705 710 715 720 Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu 725 730 735 His Glu His Ile Ala Asn Leu A la Gly Ser Pro Ala Ile Lys Lys Gly 740 745 750 Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly 755 760 765 Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln 770 775 780 Thr Thr GLN LYS GLY GLN LSN Serg Glu ARG MET LYS ARG ILE 785 795 800 GLU GLY ILE LYS GLU Leu GLN ILE Leu Ly Glu His Pro 805 Val Gl Gln Leu GLN Asn Glu LYS Leu Tyr Leu Tyr Tyr Leu 820 825 830 Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg 835 840 845 Leu Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys 850 855 860 Asp Asp Ser Ile Asp Asn LYS VAL Leu Thr ASP LYS ALA ALA ALA ARG 865 870 880 Gly LYS Sern Val Pro Serg Val Val Val Lys Met Lys 885 895 ARG Gln Leu Leu asn Ala LYS Leu Ile Thr Gln ARG LYS 900 905 910 Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp 915 920 925 Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr 930 935 940 Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp 945 950 955 960 Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser 965 970 975 Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg 980 985 990 G lu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val 995 1000 1005 Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu 1010 1015 1020 Phe Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys M et Ile 1025 1030 1035 Ala Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe 1040 1045 1050 Phe Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu 1055 1060 1065 Ala Asn Gly Glu Ile Ar g Lys Arg Pro Leu Ile Glu Thr Asn Gly 1070 1075 1080 Glu Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr 1085 1090 1095 Val Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys 1100 1105 1110 Thr Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro 1115 1120 1125 Lys Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp 1130 1135 1140 Pro Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser 1145 1150 1155 Val Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu 1160 1165 1170 Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser 1175 1180 1185 Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr 1190 1 195 1200 Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser 1205 1210 1215 Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala 1220 1225 1230 Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro S er Lys Tyr 1235 1240 1245 Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly 1250 1255 1260 Ser Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His 1265 1270 1275 Lys His Ty r Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser 1280 1285 1290 Lys Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser 1295 1300 1305 Ala Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu 1310 1315 1320 Asn Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala 1325 1330 1335 Ala Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr 1340 1345 1350 Ser Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile 1355 136 0 1365 Thr Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly 1370 1375 1380 Asp Ser Ala Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 1385 1390 1395 Gly Gly Ser Gly Pro Lys Lys Lys Lys Arg Lys Val Ala Ala Ala Gly 1400 1405 1410 Ser Asn His Asp Gln Glu Phe Asp Pro Pro Lys Val Tyr Pro Pro 1415 1420 1425 Val Pro Ala Glu Lys Arg Lys Pro Ile Arg Val Leu Ser Leu Phe 1430 1435 1440 Asp Gly Ile Ala Thr Gly Leu Leu Val Leu Lys Asp Leu Gly Ile 1445 1450 1455 Gln Val Asp Arg Tyr Ile Ala Ser Glu Val Cys Glu Asp Ser Ile 1460 1465 1470 Thr Val Gly Met Val Arg His Gln Gly Lys Ile Met Tyr Val Gly 1475 1480 1485 Asp Val Ar g Ser Val Thr Gln Lys His Ile Gln Glu Trp Gly Pro 1490 1495 1500 Phe Asp Leu Val Ile Gly Gly Ser Pro Cys Asn Asp Leu Ser Ile 1505 1510 1515 Val Asn Pro Ala Arg Lys Gly Leu Tyr Glu Gly Thr Gly Arg Leu 1520 1525 153 0 Phe Phe Glu Phe Tyr Arg Leu Leu His Asp Ala Arg Pro Lys Glu 1535 1540 1545 Gly Asp Asp Arg Pro Phe Phe Trp Leu Phe Glu Asn Val Val Ala 1550 1555 1560 Met Gly Val Ser Asp Lys Arg Asp Ile Ser Arg Phe Leu Glu Ser 1565 1 570 1575 Asn Pro Val Met Ile Asp Ala Lys Glu Val Ser Ala Ala His Arg 1580 1585 1590 Ala Arg Tyr Phe Trp Gly Asn Leu Pro Gly Met Asn Arg Pro Leu 1595 1600 1605 Ala Ser Thr Val Asn Asp Lys Leu Glu Leu Gln G lu Cys Leu Glu 1610 1615 1620 His Gly Arg Ile Ala Lys Phe Ser Lys Val Arg Thr Ile Thr Thr 1625 1630 1635 Arg Ser Asn Ser Ile Lys Gln Gly Lys Asp Gln His Phe Pro Val 1640 1645 1650 Phe Met Asn Glu Lys Glu Asp Ile Leu Trp Cys Thr Glu Met Glu 1655 1660 1665 Arg Val Phe Gly Phe Pro Val His Tyr Thr Asp Val Ser Asn Met 1670 1675 1680 Ser Arg Leu Ala Arg Gln Arg Leu Leu Gly Arg Ser Trp Ser Val 1685 1690 1695 Pro Val Ile Arg His Leu Phe Ala Pro Leu Lys Glu Tyr Phe Ala 1700 1705 1710 Cys Val Ser Ser Gly Asn Ser Asn Ala Asn Ser Arg Gly Pro Ser 1715 1720 1725 Phe Ser Ser Gly Leu Val Pro Leu Ser Leu Arg Gly Ser His Met 1730 1735 1740 Asn Pro Leu Glu Met Phe Glu Thr Val Pro Val Trp Arg Arg Gln 1745 1750 1755 Pro Val Arg Val Leu Ser Leu Phe Glu Asp Ile Lys Lys Glu Leu 1760 1765 1770 Thr Ser Leu Gly Phe Leu Glu Ser Gly Ser Asp Pro Gly Gln Leu 1775 1780 1785 Lys His Val Asp Val Thr Asp Thr Val Arg Lys Asp Val Glu 1790 1795 1800 Glu Trp Gly Pro Phe Asp Leu Val Tyr Gly Ala Thr Pro Pro Leu 1805 1810 1815 Gly His Thr Cys Asp Arg Pro Pro Ser Tr p Tyr Leu Phe Gln Phe 1820 1825 1830 His Arg Leu Leu Gln Tyr Ala Arg Pro Lys Pro Gly Ser Pro Arg 1835 1840 1845 Pro Phe Phe Trp Met Phe Val Asp Asn Leu Val Leu Asn Lys Glu 1850 1855 1860 Asp Leu Asp Val Ala Ser Arg Phe Leu Glu Met Glu Pro Val Thr 1865 1870 1875 Ile Pro Asp Val His Gly Gly Ser Leu Gln Asn Ala Val Arg Val 1880 1885 1890 Trp Ser Asn Ile Pro Ala Ile Arg Ser Arg His Trp Ala Leu Val 1895 1900 190 5 Ser Glu Glu Glu Leu Ser Leu Leu Ala Gln Asn Lys Gln Ser Ser 1910 1915 1920 Lys Leu Ala Ala Lys Trp Pro Thr Lys Leu Val Lys Asn Cys Phe 1925 1930 1935 Leu Pro Leu Arg Glu Tyr Phe Lys Tyr Phe Ser Thr Glu Leu Thr 19 40 1945 1950 Ser Ser Leu Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala 1955 1960 1965 Gly Gln Ala Lys Lys Lys Lys Lys Gly Ser 1970 1975 <![CDATA[ <210> 154]]>
<![CDATA[ <211> 209]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 154]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Thr Ser Gly Glu Leu Val Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg
35 40 45
Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Thr Leu Thr
65 70 75 80
Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr Glu His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Thr Ser Gly Ser Leu Val Arg His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Ser Pro
145 150 155 160
Ala Asp Leu Thr Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Ser Gly Asn Leu Arg
180 185 190
Val His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[ <210> 155]]>
<![CDATA[ <211> 209]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 155]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg
35 40 45
Ala Asn Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ala Asp Asn Leu Thr
65 70 75 80
Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Gln Ser Ser Ser Leu Val Arg His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Asp Lys Lys Asp Leu Thr Arg His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Ser Pro
145 150 155 160
Ala Asp Leu Thr Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly His Leu Thr
180 185 190
Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[ <210> 156]]>
<![CDATA[ <211> 209]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 156]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Gln Ser Gly Asp Leu Arg Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser
35 40 45
Gly Asp Leu Arg Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg
65 70 75 80
Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser His Arg Thr Thr Leu Thr Asn His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Ser
145 150 155 160
Gly Asn Leu Arg Val His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Ser Leu Val
180 185 190
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr
195 200 205
Ser
<![CDATA[ <210> 157]]>
<![CDATA[ <211> 627]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 157]]>
cttgagcccg gagaaaagcc atacaaatgt cctgaatgcg gaaagtcatt ttctacgagc 60
ggcgaactcg tgcggcacca aaggactcat accggcgaaa agccttacaa atgcccggag 120
tgcggaaaaa gcttctccga gcgctcgcac ttgcgggaac accagcgaac ccacacaggg 180
gagaaaccgt ataagtgccc agagtgcggc aaatcgttct cccggaacga caccctgacc 240
gaacaccaac gcactcatac tggcgaaaaa ccttacaagt gccctgagtg tggaaagagc 300
ttctcccgcg ccgataacct gaccgagcac cagcggaccc ataccgggga aaagccgtac 360
aagtgtccgg aatgcggcaa aagcttcagc acctcgggtt ccctggtccg gcatcagaga 420
actcacaccg gagagaaacc ctataagtgt cctgagtgcg ggaagtcctt ttcatcgccc 480
gcggacctga ctagacacca gaggacccac accggggaga agccctacaa gtgccccgaa 540
tgtggaaagt ccttctccga ctccggcaac ctccgggtgc accagcgcac ccacactgga 600
gagaagccga ccggaaagaa aacttcc 627
<![CDATA[ <210> 158]]>
<![CDATA[ <211> 627]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 158]]>
ctggaacccg gagaaaaacc ttataagtgc cctgaatgcg gaaagtcatt ctcgaggtcg 60
gacaagctcg tgcggcacca gaggacacac accggggaaa agccatacaa gtgtcccgaa 120
tgtggaaagt ccttcagcca acgcgccaac ctgagagctc atcagcggac tcacactggc 180
gaaaaaccgt acaaatgccc cgaatgcggc aaaagcttct cccgcgccga caacttgacc 240
gagcaccagc ggacccatac cggcgaaaag ccgtacaagt gcccggagtg tgggaagtcg 300
ttcagccagt cctcttccct cgtgcgccac caacgcaccc atactgggga gaagccctat 360
aagtgtcctg agtgtggcaa atcattcagc gataagaagg atcttacccg gcaccaacgg 420
actcataccg gagagaagcc ttacaagtgc cccgagtgcg gaaagagctt ctcgtccccg 480
gcggacctga ctagacacca gcgcacccac accggagaaa agccctacaa gtgcccagag 540
tgcgggaagt ccttttccca atccggtcac ctgactgagc accagagaac ccacacggga 600
gagaaaccga ccggaaagaa aacctcc 627
<![CDATA[ <210> 159]]>
<![CDATA[ <211> 627]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 159]]>
ttggaacccg gagaaaagcc atacaaatgc cccgaatgcg gaaagtcgtt cagccagtcc 60
ggcgacctca gacggcacca acggactcac accggcgaaa aaccgtacaa gtgcccagag 120
tgcggcaaaa gctttagcca gtcgggcgat ctgcggagac atcagcgcac tcacactggt 180
gaaaagccct acaagtgtcc tgagtgcggg aagtccttca gcgagcgctc ccatcttcgc 240
gagcaccaga gaacccacac tggagaaaaa ccttataagt gccctgagtg tggcaaatcc 300
ttctcaacca ccggcaacct gactgtgcac cagcggaccc acacaggggga gaagccttac 360
aagtgcccgg agtgtgggaa gtcattctcc catcggacga ccctgaccaa ccaccagagg 420
acccatactg gcgaaaagcc gtataagtgt ccggagtgcg gaaagagctt ctccgactcc 480
ggaaacctca gggtgcacca acgcacccac accggagaga agccgtacaa atgtcccgaa 540
tgtggaaagt ccttctccca atcctcttcg ctggtccggc accagcgaac tcataccggg 600
gaaaagccca ccggaaagaa aacctcg 627
<![CDATA[ <210> 160]]>
<![CDATA[ <211> 659]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 160]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Asn Asp Thr Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ala Asp Asn
115 120 125
Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Ser Leu Val Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Ser Pro Ala Asp Leu Thr Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Ser Gly Asn Leu Arg Val His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys
625 630 635 640
Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro
645 650 655
Asp Tyr Ala
<![CDATA[ <210> 161]]>
<![CDATA[ <211> 659]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 161]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Ser Ser
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Asp Lys Lys Asp Leu Thr Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Ser Pro Ala Asp Leu Thr Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Gln Ser Gly His Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys
625 630 635 640
Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro
645 650 655
Asp Tyr Ala
<![CDATA[ <210> 162]]>
<![CDATA[ <211> 659]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 162]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn
115 120 125
Leu Thr Val His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser His Arg Thr Thr Leu Thr Asn His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Asp Ser Gly Asn Leu Arg Val His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Gln Ser Ser Ser Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys
625 630 635 640
Ala Gly Gln Ala Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro
645 650 655
Asp Tyr Ala
<![CDATA[ <210> 163]]>
<![CDATA[ <211> 650]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 163]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Asn Asp Thr Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ala Asp Asn
115 120 125
Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Ser Leu Val Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Ser Pro Ala Asp Leu Thr Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Ser Gly Asn Leu Arg Val His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys
625 630 635 640
Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly Ser
645 650
<![CDATA[ <210> 164]]>
<![CDATA[ <211> 650]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 164]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ala Asp Asn Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Ser Ser
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Asp Lys Lys Asp Leu Thr Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Ser Pro Ala Asp Leu Thr Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Gln Ser Gly His Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys
625 630 635 640
Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly Ser
645 650
<![CDATA[ <210> 165]]>
<![CDATA[ <211> 650]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 165]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn
115 120 125
Leu Thr Val His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser His Arg Thr Thr Leu Thr Asn His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Asp Ser Gly Asn Leu Arg Val His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Gln Ser Ser Ser Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys
625 630 635 640
Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly Ser
645 650
<![CDATA[ <210> 166]]>
<![CDATA[ <211> 2255]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 166]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tcccttgagc 120
ccggagaaaa gccatacaaa tgtcctgaat gcggaaagtc attttctacg agcggcgaac 180
tcgtgcggca ccaaaggact cataccggcg aaaagcctta caaatgcccg gagtgcggaa 240
aaagcttctc cgagcgctcg cacttgcggg aacaccagcg aacccacaca ggggagaaac 300
cgtataagtg cccagagtgc ggcaaatcgt tctcccggaa cgacaccctg accgaacacc 360
aacgcactca tactggcgaa aaaccttaca agtgccctga gtgtggaaag agcttctccc 420
gcgccgataa cctgaccgag caccagcgga cccataccgg ggaaaagccg tacaagtgtc 480
cggaatgcgg caaaagcttc agcacctcgg gttccctggt ccggcatcag agaactcaca 540
ccggagagaa accctataag tgtcctgagt gcgggaagtc cttttcatcg cccgcggacc 600
tgactagaca ccagaggacc cacaccgggg agaagcccta caagtgcccc gaatgtggaa 660
agtccttctc cgactccggc aacctccggg tgcaccagcg cacccacact ggagagaagc 720
cgaccggaaa gaaaacttcc gcctccggtt cgggaggagg ctcaggagga gcgagagatt 780
ccaaggtcga gaacaagacc aagaagctgc gggtgttcga ggcctttgct ggcatcggag 840
cccagaggaa ggccctcgag aaggtccgca aggatgagta cgagatcgtg ggactcgcgg 900
agtggtacgt gcccgccatt gtgatgtacc aggccatcca taacaacttc cacactaagc 960
tggagtacaa gtccgtgtcc cgggaggaaa tgattgacta cctggagaat aagaccctgt 1020
catggaactc taagaaccccc gtgtcgaacg gttactggaa gagaaagaag gatgacgaac 1080
tgaagattat ctacaacgcg atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctcta caagcgcacc ttgaagaaca tcgatctgct gacctactcc ttcccgtgcc 1200
aagacctgag ccagcagggc atccagaagg ggatgaaacg gggctccggt actcgcagcg 1260
gcttgctgtg ggaaattgag cgggccctgg atagcaccga gaagaacgac ttgccgaagt 1320
atcttctcat ggaaaacgtc ggggctctcc ttcacaagaa gaacgaggaa gaactgaacc 1380
agtggaagca aaagctggaa tccctcggat accagaactc cattgaggtc ctgaacgccg 1440
ccgacttcgg atcgtcgcaa gccagacgga gggtgttcat gattagcact ctgaacgaat 1500
tcgtggaact gccgaagggc gacaagaagc ctaagtccat caagaaggtg ctgaacaaga 1560
tcgtgtccga gaaggacatt ctcaacaatc tgctgaagta caacctgaca gagttcaaga 1620
aaaccaagtc caacatcaac aaggcctcct tgattggtta ctcaaagttc aacagcgagg 1680
gatacgtgta cgaccccgaa ttcactggac ccactctgac cgcctccgga gcaaactcta 1740
ggattaagat caaggacggc tccaacatcc ggaagatgaa ctccgacgaa acctttctgt 1800
acatcggctt cgactcgcaa gacggaaagc gcgtgaacga gatcgaattt cttaccgaaa 1860
accagaagat cttcgtgtgc ggcaattcaa tctccgtgga agtcctggaa gcgattatcg 1920
acaagatcgg aggcagtggt ggaaagcgcc cagcagccac taagaaggcc ggacaggcca 1980
agaagaagaagaa gggatcctac ccttacgatg tgccggatta cgcttgagcg gccgcttaat 2040
taagctgcct tctgcggggc ttgccttctg gccatgccct tcttctctcc cttgcacctg 2100
tacctcttgg tctttgaata aagcctgagt aggaagtcta gaaaaaaaaaaaaaaaaaaa 2160
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2220
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2255
<![CDATA[ <210> 167]]>
<![CDATA[ <211> 2255]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 167]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggaac 120
ccggagaaaa accttataag tgccctgaat gcggaaagtc attctcgagg tcggacaagc 180
tcgtgcggca ccagaggaca cacaccgggg aaaagccata caagtgtccc gaatgtggaa 240
agtccttcag ccaacgcgcc aacctgagag ctcatcagcg gactcacact ggcgaaaaac 300
cgtacaaatg ccccgaatgc ggcaaaagct tctcccgcgc cgacaacttg accgagcacc 360
agcggaccca taccggcgaa aagccgtaca agtgcccgga gtgtgggaag tcgttcagcc 420
agtcctcttc cctcgtgcgc caccaacgca cccatactgg ggagaagccc tataagtgtc 480
ctgagtgtgg caaatcattc agcgataaga aggatcttac ccggcaccaa cggactcata 540
ccggagagaa gccttacaag tgccccgagt gcggaaagag cttctcgtcc ccggcggacc 600
tgactagaca ccagcgcacc cacaccggag aaaagcccta caagtgccca gagtgcggga 660
agtccttttc ccaatccggt cacctgactg agcaccagag aacccacacg ggagagaaac 720
cgaccggaaa gaaaacctcc gcctccggtt cgggaggagg ctcaggagga gcgagagatt 780
ccaaggtcga gaacaagacc aagaagctgc gggtgttcga ggcctttgct ggcatcggag 840
cccagaggaa ggccctcgag aaggtccgca aggatgagta cgagatcgtg ggactcgcgg 900
agtggtacgt gcccgccatt gtgatgtacc aggccatcca taacaacttc cacactaagc 960
tggagtacaa gtccgtgtcc cgggaggaaa tgattgacta cctggagaat aagaccctgt 1020
catggaactc taagaaccccc gtgtcgaacg gttactggaa gagaaagaag gatgacgaac 1080
tgaagattat ctacaacgcg atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctcta caagcgcacc ttgaagaaca tcgatctgct gacctactcc ttcccgtgcc 1200
aagacctgag ccagcagggc atccagaagg ggatgaaacg gggctccggt actcgcagcg 1260
gcttgctgtg ggaaattgag cgggccctgg atagcaccga gaagaacgac ttgccgaagt 1320
atcttctcat ggaaaacgtc ggggctctcc ttcacaagaa gaacgaggaa gaactgaacc 1380
agtggaagca aaagctggaa tccctcggat accagaactc cattgaggtc ctgaacgccg 1440
ccgacttcgg atcgtcgcaa gccagacgga gggtgttcat gattagcact ctgaacgaat 1500
tcgtggaact gccgaagggc gacaagaagc ctaagtccat caagaaggtg ctgaacaaga 1560
tcgtgtccga gaaggacatt ctcaacaatc tgctgaagta caacctgaca gagttcaaga 1620
aaaccaagtc caacatcaac aaggcctcct tgattggtta ctcaaagttc aacagcgagg 1680
gatacgtgta cgaccccgaa ttcactggac ccactctgac cgcctccgga gcaaactcta 1740
ggattaagat caaggacggc tccaacatcc ggaagatgaa ctccgacgaa acctttctgt 1800
acatcggctt cgactcgcaa gacggaaagc gcgtgaacga gatcgaattt cttaccgaaa 1860
accagaagat cttcgtgtgc ggcaattcaa tctccgtgga agtcctggaa gcgattatcg 1920
acaagatcgg aggcagtggt ggaaagcgcc cagcagccac taagaaggcc ggacaggcca 1980
agaagaagaagaa gggatcctac ccttacgatg tgccggatta cgcttgagcg gccgcttaat 2040
taagctgcct tctgcggggc ttgccttctg gccatgccct tcttctctcc cttgcacctg 2100
tacctcttgg tctttgaata aagcctgagt aggaagtcta gaaaaaaaaaaaaaaaaaaa 2160
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2220
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2255
<![CDATA[ <210> 168]]>
<![CDATA[ <211> 2255]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 168]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccttggaac 120
ccggagaaaa gccatacaaa tgccccgaat gcggaaagtc gttcagccag tccggcgacc 180
tcagacggca ccaacggact cacaccggcg aaaaaccgta caagtgccca gagtgcggca 240
aaagctttag ccagtcgggc gatctgcgga gacatcagcg cactcacact ggtgaaaagc 300
cctacaagtg tcctgagtgc gggaagtcct tcagcgagcg ctcccatctt cgcgagcacc 360
agagaaccca cactggagaa aaaccttata agtgccctga gtgtggcaaa tccttctcaa 420
ccaccggcaa cctgactgtg caccagcgga cccacacagg ggagaagcct tacaagtgcc 480
cggagtgtgg gaagtcattc tcccatcgga cgaccctgac caaccaccag aggacccata 540
ctggcgaaaa gccgtataag tgtccggagt gcggaaagag cttctccgac tccggaaacc 600
tcagggtgca ccaacgcacc cacaccggag agaagccgta caaatgtccc gaatgtggaa 660
agtccttctc ccaatcctct tcgctggtcc ggcaccagcg aactcatacc ggggaaaagc 720
ccaccggaaa gaaaacctcg gcctccggtt cgggaggagg ctcaggagga gcgagagatt 780
ccaaggtcga gaacaagacc aagaagctgc gggtgttcga ggcctttgct ggcatcggag 840
cccagaggaa ggccctcgag aaggtccgca aggatgagta cgagatcgtg ggactcgcgg 900
agtggtacgt gcccgccatt gtgatgtacc aggccatcca taacaacttc cacactaagc 960
tggagtacaa gtccgtgtcc cgggaggaaa tgattgacta cctggagaat aagaccctgt 1020
catggaactc taagaaccccc gtgtcgaacg gttactggaa gagaaagaag gatgacgaac 1080
tgaagattat ctacaacgcg atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctcta caagcgcacc ttgaagaaca tcgatctgct gacctactcc ttcccgtgcc 1200
aagacctgag ccagcagggc atccagaagg ggatgaaacg gggctccggt actcgcagcg 1260
gcttgctgtg ggaaattgag cgggccctgg atagcaccga gaagaacgac ttgccgaagt 1320
atcttctcat ggaaaacgtc ggggctctcc ttcacaagaa gaacgaggaa gaactgaacc 1380
agtggaagca aaagctggaa tccctcggat accagaactc cattgaggtc ctgaacgccg 1440
ccgacttcgg atcgtcgcaa gccagacgga gggtgttcat gattagcact ctgaacgaat 1500
tcgtggaact gccgaagggc gacaagaagc ctaagtccat caagaaggtg ctgaacaaga 1560
tcgtgtccga gaaggacatt ctcaacaatc tgctgaagta caacctgaca gagttcaaga 1620
aaaccaagtc caacatcaac aaggcctcct tgattggtta ctcaaagttc aacagcgagg 1680
gatacgtgta cgaccccgaa ttcactggac ccactctgac cgcctccgga gcaaactcta 1740
ggattaagat caaggacggc tccaacatcc ggaagatgaa ctccgacgaa acctttctgt 1800
acatcggctt cgactcgcaa gacggaaagc gcgtgaacga gatcgaattt cttaccgaaa 1860
accagaagat cttcgtgtgc ggcaattcaa tctccgtgga agtcctggaa gcgattatcg 1920
acaagatcgg aggcagtggt ggaaagcgcc cagcagccac taagaaggcc ggacaggcca 1980
agaagaagaagaa gggatcctac ccttacgatg tgccggatta cgcttgagcg gccgcttaat 2040
taagctgcct tctgcggggc ttgccttctg gccatgccct tcttctctcc cttgcacctg 2100
tacctcttgg tctttgaata aagcctgagt aggaagtcta gaaaaaaaaaaaaaaaaaaa 2160
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2220
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 2255
<![CDATA[ <210> 169]]>
<![CDATA[ <211> 204]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 169]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser
35 40 45
Asp Asn Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu Leu Val Arg His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Thr Thr Gly Asn Leu Thr Val His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser
145 150 155 160
Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Thr Asp Thr Leu Arg
180 185 190
Asp His Gln Arg Thr His Thr Gly Lys Lys Thr Ser
195 200
<![CDATA[ <210> 170]]>
<![CDATA[ <211> 204]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 170]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Gln Lys Ser Ser Leu Ile Ala His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser His Lys
35 40 45
Asn Ala Leu Gln Asn His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Ser Asn Leu Val
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Arg Asn Asp Ala Leu Thr Glu His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Asp Lys Lys Asp Leu Thr Arg His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ala
145 150 155 160
Gly His Leu Ala Ser His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Lys Lys Asp Leu Thr
180 185 190
Arg His Gln Arg Thr His Thr Gly Lys Lys Thr Ser
195 200
<![CDATA[ <210> 171]]>
<![CDATA[ <211> 204]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400]]>> 171]]>
<br/> <![CDATA[Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Arg Ser Asp Asn Leu Val Arg His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Cys
35 40 45
Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu Leu Val
65 70 75 80
Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Thr
145 150 155 160
Asp Thr Leu Arg Asp His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Glu Asp Asn Leu His
180 185 190
Thr His Gln Arg Thr His Thr Gly Lys Lys Thr Ser
195 200
<![CDATA[ <210> 172]]>
<![CDATA[ <211> 204]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 172]]>
Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
1 5 10 15
Phe Ser Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly
20 25 30
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Lys
35 40 45
Ser Ser Leu Ile Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
50 55 60
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Arg Asp Glu Leu Asn
65 70 75 80
Val His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
85 90 95
Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp Leu Arg Arg His Gln Arg
100 105 110
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
115 120 125
Phe Ser Gln Ala Gly His Leu Ala Ser His Gln Arg Thr His Thr Gly
130 135 140
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser
145 150 155 160
Gly Asn Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
165 170 175
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ala Gly His Leu Ala
180 185 190
Ser His Gln Arg Thr His Thr Gly Lys Lys Thr Ser
195 200
<![CDATA[ <210> 173]]>
<![CDATA[ <211> 612]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 173]]>
ctggagcctg gagagaaacc ctacaaatgc ccggaatgcg ggaagtcctt ttccgaacgc 60
tcgcacctga gggaaccacca gagaactcac accggcgaaa aaccctataa gtgcccagaa 120
tgcggaaaga gcttttcacg gtcggacaac ctcgtgcggc accaacgcac tcataccgga 180
gagaagccgt acaagtgtcc tgagtgcgga aagtcattct ccgactgccg ggatttggcc 240
cgccaccaaa gaacacacac tggcgaaaag ccctacaagt gcccggagtg tggaaagtcc 300
ttcagcactt ccggagagct ggtccggcac cagaggaccc acaccggggga gaagccttac 360
aaatgtccag agtgcggtaa aagcttctcc accacccggca acctcaccgt gcaccagcgg 420
accacacactg gagaaaagcc gtataaatgc cccgaatgcg gcaagagctt ctcgcgatcc 480
gataagcttg tgcggcatca gagaacgcac actggggaaa agccttataa gtgtccggag 540
tgcggcaaat ccttctcccg cactgacacc ctgcgggacc atcagcgcac ccataccggc 600
aaaaagacct ct 612
<![CDATA[ <210> 174]]>
<![CDATA[ <211> 61]]>2
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 174]]>
cttgaacccg gggagaagcc ctacaagtgc ccggaatgcg gaaagagctt cagccagaag 60
tcctcgctga tcgcgcacca gaggactcac accggcgaaa agccatacaa gtgtcctgag 120
tgtggcaaat ccttctcgca caagaacgca ctgcagaacc accagagaac ccacaccggg 180
gaaaagccgt ataagtgccc cgaatgtgga aagtcgttca gccagtcatc caacctcgtg 240
cgccatcaaa ggactcatac cggagagaaa ccttacaaat gccctgaatg cggcaaatct 300
ttctcccgga atgatgccct gaccgagcac cagcgcactc acacgggaga gaagccgtac 360
aaatgtccgg agtgcggaaa gtccttctcc gacaagaagg acttgaccag acaccagcgg 420
acccatactg gcgaaaaacc ctataagtgt ccagagtgcg ggaagtcctt tagccaagcc 480
ggtcacctcg cttccccacca acggacccac acaggagaaa agccttataa atgccccgag 540
tgcggcaaaa gcttctccga taagaaggac ctgactcggc atcagcgcac ccataccgga 600
aagaaaacct ca 612
<![CDATA[ <210> 175]]>
<![CDATA[ <211> 612]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 175]]>
ctggagcctg gcgaaaaacc ctataagtgc ccagaatgcg gaaagagctt ttcacggtcg 60
gacaacctcg tgcggcacca acgcactcat accggagaga agccgtacaa gtgtcctgag 120
tgcggaaagt cattctccga ctgccgggat ttggcccgcc accaaagaac acacactggc 180
gaaaagccct acaagtgccc ggagtgtgga aagtccttca gcacttccgg agagctggtc 240
cggcaccaga ggacccacac cggggagaag ccttacaaat gtccagagtg cggtaaaagc 300
ttctccacca ccggcaacct caccgtgcac cagcggaccc aacactggaga aaagccgtat 360
aaatgccccg aatgcggcaa gagcttctcg cgatccgata agcttgtgcg gcatcagaga 420
acgcacactg gggaaaagcc ttataagtgt ccggagtgcg gcaaatcctt ctcccgcact 480
gacaccctgc gggaccacca gagaacccat actggcgaga agccatacaa atgcccggaa 540
tgtggaaaga gtttctcgcg cgaggacaac ctccacaccc atcagcgcac ccataccggc 600
aaaaagacct ct 612
<![CDATA[ <210> 176]]>
<![CDATA[ <211> 612]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 176]]>
ctggaacccg gagagaaacc ctacaaatgc ccagagtgcg gcaaatcgtt ctcacggtcc 60
gatcacctca ccaccccacca gaggaccccat accggggaga agccttacaa gtgtcctgag 120
tgtggaaagt ccttcagcca aaagtcctcg ctgatcgcac accagcgcac gcacactggg 180
gaaaagccat ataaatgccc ggagtgtggc aaatccttct cccgccgcga cgaactgaac 240
gtgcaccaga gaacccacac tggagagaag ccgtataagt gtccggagtg cggaaagagc 300
ttctcgcaat ccggggacct tcggagacat cagaggacac acactggcga aaagccctat 360
aagtgccctg agtgcgggaa gtcctttagc caagccggtc acctggcctc ccaccaacgg 420
actcacaccg gcgaaaaacc gtacaagtgc cccgaatgcg gaaagtcgtt ctctacctcc 480
ggaaacttga ccgaaccacca gcggacccac accggagaaa agccgtacaa atgtcctgaa 540
tgcggcaaaa gcttcagcca ggccggtcat ctcgcgagcc atcagcggac tcatactggc 600
aaaaagacct ca 612
<![CDATA[ <210> 177]]>
<![CDATA[ <211> 367]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 177]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Ser Asp Asn Leu Val Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Thr Asp Thr Leu Arg Asp His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[ <210> 178]]>
<![CDATA[ <211> 367]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 178]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Gln Lys Ser Ser Leu Ile Ala His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser His Lys Asn Ala Leu Gln Asn His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Gln Ser Ser Asn Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Ala
115 120 125
Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Asp Lys Lys Asp Leu Thr Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Ala Gly His Leu Ala Ser His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Lys Lys Asp Leu Thr Arg His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[ <210> 179]]>
<![CDATA[ <211> 367]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 179]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asn Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Thr Ser Gly Glu Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn
115 120 125
Leu Thr Val His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Thr Asp Thr Leu Arg Asp His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[ <210> 180]]>
<![CDATA[ <211> 367]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 180]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr Thr His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Lys Ser Ser Leu Ile Ala His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Arg Asp Glu Leu Asn Val His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp
115 120 125
Leu Arg Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Ala Gly His Leu Ala Ser His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Thr Ser Gly Asn Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Gln Ala Gly His Leu Ala Ser His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
355 360 365
<![CDATA[ <210> 181]]>
<![CDATA[ <211> 1082]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 181]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asp Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn
115 120 125
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys
625 630 635 640
Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val
645 650 655
Pro Asp Tyr Ala Gly Ser Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala
660 665 670
Gly Asp Val Glu Glu Asn Pro Gly Pro Thr Ser Ala Gly Lys Leu Gly
675 680 685
Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu
690 695 700
Asn Pro Gly Pro Leu Glu Gly Ser Ser Gly Ser Gly Ser Pro Lys Lys
705 710 715 720
Lys Arg Lys Val Gly Ile His Gly Val Pro Ala Ala Gly Ser Ser Gly
725 730 735
Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys
740 745 750
Ser Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr
755 760 765
Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg
770 775 780
Ser Asp Asn Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro
785 790 795 800
Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Cys Arg Asp Leu
805 810 815
Ala Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro
820 825 830
Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu Leu Val Arg His Gln
835 840 845
Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys
850 855 860
Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His Gln Arg Thr His Thr
865 870 875 880
Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg
885 890 895
Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro
900 905 910
Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Thr Asp Thr Leu
915 920 925
Arg Asp His Gln Arg Thr His Thr Gly Lys Lys Thr Ser Ala Ser Gly
930 935 940
Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp Ala Lys Ser Leu Thr
945 950 955 960
Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp Val Phe Val Asp Phe
965 970 975
Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala Gln Gln Ile Leu Tyr
980 985 990
Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu Val Ser Leu Gly Tyr
995 1000 1005
Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu Glu Lys Gly Glu
1010 1015 1020
Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu Thr His Pro
1025 1030 1035
Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro Ser Ser
1040 1045 1050
Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys
1055 1060 1065
Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1070 1075 1080
<![CDATA[ <210> 182]]>
<![CDATA[ <211> 1082]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 182]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Ser Asp Asn Leu Val Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Thr Asp Thr Leu Arg Asp His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly
355 360 365
Ser Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu
370 375 380
Asn Pro Gly Pro Thr Ser Ala Gly Lys Leu Gly Ser Gly Glu Gly Arg
385 390 395 400
Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu Asn Pro Gly Pro Leu
405 410 415
Glu Gly Ser Ser Gly Ser Gly Ser Pro Lys Lys Lys Arg Lys Val Gly
420 425 430
Ile His Gly Val Pro Ala Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly
435 440 445
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser
450 455 460
Asp Asp Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
465 470 475 480
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Glu Asp Asn Leu His
485 490 495
Thr His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
500 505 510
Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr Thr His Gln Arg
515 520 525
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
530 535 540
Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His Thr Gly
545 550 555 560
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu
565 570 575
Ala His Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
580 585 590
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg
595 600 605
Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
610 615 620
Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg
625 630 635 640
Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly
645 650 655
Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Ser Lys Val Glu Asn Lys
660 665 670
Thr Lys Lys Leu Arg Val Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln
675 680 685
Arg Lys Ala Leu Glu Lys Val Arg Lys Asp Glu Tyr Glu Ile Val Gly
690 695 700
Leu Ala Glu Trp Tyr Val Pro Ala Ile Val Met Tyr Gln Ala Ile His
705 710 715 720
Asn Asn Phe His Thr Lys Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu
725 730 735
Met Ile Asp Tyr Leu Glu Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn
740 745 750
Pro Val Ser Asn Gly Tyr Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys
755 760 765
Ile Ile Tyr Asn Ala Ile Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe
770 775 780
Asp Ile Arg Asp Leu Tyr Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu
785 790 795 800
Thr Tyr Ser Phe Pro Cys Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys
805 810 815
Gly Met Lys Arg Gly Ser Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile
820 825 830
Glu Arg Ala Leu Asp Ser Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu
835 840 845
Leu Met Glu Asn Val Gly Ala Leu Leu His Lys Lys Asn Glu Glu Glu Glu
850 855 860
Leu Asn Gln Trp Lys Gln Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser
865 870 875 880
Ile Glu Val Leu Asn Ala Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg
885 890 895
Arg Val Phe Met Ile Ser Thr Leu Asn Glu Phe Val Glu Leu Pro Lys
900 905 910
Gly Asp Lys Lys Pro Lys Ser Ile Lys Lys Val Leu Asn Lys Ile Val
915 920 925
Ser Glu Lys Asp Ile Leu Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu
930 935 940
Phe Lys Lys Thr Lys Ser Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr
945 950 955 960
Ser Lys Phe Asn Ser Glu Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly
965 970 975
Pro Thr Leu Thr Ala Ser Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp
980 985 990
Gly Ser Asn Ile Arg Lys Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile
995 1000 1005
Gly Phe Asp Ser Gln Asp Gly Lys Arg Val Asn Glu Ile Glu Phe
1010 1015 1020
Leu Thr Glu Asn Gln Lys Ile Phe Val Cys Gly Asn Ser Ile Ser
1025 1030 1035
Val Glu Val Leu Glu Ala Ile Ile Asp Lys Ile Gly Gly Pro Ser
1040 1045 1050
Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys
1055 1060 1065
Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1070 1075 1080
<![CDATA[ <210> 183]]>
<![CDATA[ <211> 358]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 183]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Ser Asp Asn Leu Val Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Thr Asp Thr Leu Arg Asp His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[ <210> 184]]>
<![CDATA[ <211> 358]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 184]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Gln Lys Ser Ser Leu Ile Ala His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser His Lys Asn Ala Leu Gln Asn His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Gln Ser Ser Asn Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Asn Asp Ala
115 120 125
Leu Thr Glu His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Asp Lys Lys Asp Leu Thr Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Ala Gly His Leu Ala Ser His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Asp Lys Lys Asp Leu Thr Arg His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[ <210> 185]]>
<![CDATA[ <211> 358]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 185]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asn Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Thr Ser Gly Glu Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn
115 120 125
Leu Thr Val His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Thr Asp Thr Leu Arg Asp His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[ <210> 186]]>
<![CDATA[ <211> 358]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 186]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr Thr His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Gln Lys Ser Ser Leu Ile Ala His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Arg Asp Glu Leu Asn Val His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Ser Gly Asp
115 120 125
Leu Arg Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Ala Gly His Leu Ala Ser His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Thr Ser Gly Asn Leu Thr Glu His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Gln Ala Gly His Leu Ala Ser His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser
355
<![CDATA[ <210> 187]]>
<![CDATA[ <211> 1073]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 187]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Asp Leu Val Arg His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Glu Asp Asn Leu His Thr His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Arg Ser Asp His Leu Thr Thr His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn
115 120 125
Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu Ala His Leu Arg Ala His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr Gly Glu Lys
210 215 220
Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly
225 230 235 240
Gly Ala Arg Asp Ser Lys Val Glu Asn Lys Thr Lys Lys Leu Arg Val
245 250 255
Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln Arg Lys Ala Leu Glu Lys
260 265 270
Val Arg Lys Asp Glu Tyr Glu Ile Val Gly Leu Ala Glu Trp Tyr Val
275 280 285
Pro Ala Ile Val Met Tyr Gln Ala Ile His Asn Asn Phe His Thr Lys
290 295 300
Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu Met Ile Asp Tyr Leu Glu
305 310 315 320
Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn Pro Val Ser Asn Gly Tyr
325 330 335
Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys Ile Ile Tyr Asn Ala Ile
340 345 350
Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe Asp Ile Arg Asp Leu Tyr
355 360 365
Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu Thr Tyr Ser Phe Pro Cys
370 375 380
Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys Gly Met Lys Arg Gly Ser
385 390 395 400
Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile Glu Arg Ala Leu Asp Ser
405 410 415
Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu Leu Met Glu Asn Val Gly
420 425 430
Ala Leu Leu His Lys Lys Asn Glu Glu Glu Leu Asn Gln Trp Lys Gln
435 440 445
Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser Ile Glu Val Leu Asn Ala
450 455 460
Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg Arg Val Phe Met Ile Ser
465 470 475 480
Thr Leu Asn Glu Phe Val Glu Leu Pro Lys Gly Asp Lys Lys Pro Lys
485 490 495
Ser Ile Lys Lys Val Leu Asn Lys Ile Val Ser Glu Lys Asp Ile Leu
500 505 510
Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu Phe Lys Lys Thr Lys Ser
515 520 525
Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr Ser Lys Phe Asn Ser Glu
530 535 540
Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly Pro Thr Leu Thr Ala Ser
545 550 555 560
Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp Gly Ser Asn Ile Arg Lys
565 570 575
Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile Gly Phe Asp Ser Gln Asp
580 585 590
Gly Lys Arg Val Asn Glu Ile Glu Phe Leu Thr Glu Asn Gln Lys Ile
595 600 605
Phe Val Cys Gly Asn Ser Ile Ser Val Glu Val Leu Glu Ala Ile Ile
610 615 620
Asp Lys Ile Gly Gly Pro Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys
625 630 635 640
Lys Ala Gly Gln Ala Lys Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val
645 650 655
Pro Asp Tyr Ala Gly Ser Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala
660 665 670
Gly Asp Val Glu Glu Asn Pro Gly Pro Thr Ser Ala Gly Lys Leu Gly
675 680 685
Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu
690 695 700
Asn Pro Gly Pro Leu Glu Gly Ser Ser Gly Ser Gly Ser Pro Lys Lys
705 710 715 720
Lys Arg Lys Val Gly Ile His Gly Val Pro Ala Ala Gly Ser Ser Gly
725 730 735
Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys
740 745 750
Ser Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg Thr His Thr
755 760 765
Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg
770 775 780
Ser Asp Asn Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro
785 790 795 800
Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Asp Cys Arg Asp Leu
805 810 815
Ala Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro
820 825 830
Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu Leu Val Arg His Gln
835 840 845
Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys
850 855 860
Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His Gln Arg Thr His Thr
865 870 875 880
Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg
885 890 895
Ser Asp Lys Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro
900 905 910
Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Thr Asp Thr Leu
915 920 925
Arg Asp His Gln Arg Thr His Thr Gly Lys Lys Thr Ser Ala Ser Gly
930 935 940
Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp Ala Lys Ser Leu Thr
945 950 955 960
Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp Val Phe Val Asp Phe
965 970 975
Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala Gln Gln Ile Leu Tyr
980 985 990
Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu Val Ser Leu Gly Tyr
995 1000 1005
Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu Glu Lys Gly Glu
1010 1015 1020
Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu Thr His Pro
1025 1030 1035
Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro Ser Ser
1040 1045 1050
Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys
1055 1060 1065
Lys Lys Lys Gly Ser
1070
<![CDATA[ <210> 188]]>
<![CDATA[ <211> 1073]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 188]]>
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly Glu Lys Pro Tyr Lys Cys
20 25 30
Pro Glu Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His
35 40 45
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
50 55 60
Lys Ser Phe Ser Arg Ser Asp Asn Leu Val Arg His Gln Arg Thr His
65 70 75 80
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
85 90 95
Asp Cys Arg Asp Leu Ala Arg His Gln Arg Thr His Thr Gly Glu Lys
100 105 110
Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Thr Ser Gly Glu
115 120 125
Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys
130 135 140
Pro Glu Cys Gly Lys Ser Phe Ser Thr Thr Gly Asn Leu Thr Val His
145 150 155 160
Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly
165 170 175
Lys Ser Phe Ser Arg Ser Asp Lys Leu Val Arg His Gln Arg Thr His
180 185 190
Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser
195 200 205
Arg Thr Asp Thr Leu Arg Asp His Gln Arg Thr His Thr Gly Lys Lys
210 215 220
Thr Ser Ala Ser Gly Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Asp
225 230 235 240
Ala Lys Ser Leu Thr Ala Trp Ser Arg Thr Leu Val Thr Phe Lys Asp
245 250 255
Val Phe Val Asp Phe Thr Arg Glu Glu Trp Lys Leu Leu Asp Thr Ala
260 265 270
Gln Gln Ile Leu Tyr Arg Asn Val Met Leu Glu Asn Tyr Lys Asn Leu
275 280 285
Val Ser Leu Gly Tyr Gln Leu Thr Lys Pro Asp Val Ile Leu Arg Leu
290 295 300
Glu Lys Gly Glu Glu Pro Trp Leu Val Glu Arg Glu Ile His Gln Glu
305 310 315 320
Thr His Pro Asp Ser Glu Thr Ala Phe Glu Ile Lys Ser Ser Val Pro
325 330 335
Ser Ser Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala
340 345 350
Lys Lys Lys Lys Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly
355 360 365
Ser Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu
370 375 380
Asn Pro Gly Pro Thr Ser Ala Gly Lys Leu Gly Ser Gly Glu Gly Arg
385 390 395 400
Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu Asn Pro Gly Pro Leu
405 410 415
Glu Gly Ser Ser Gly Ser Gly Ser Pro Lys Lys Lys Arg Lys Val Gly
420 425 430
Ile His Gly Val Pro Ala Ala Gly Ser Ser Gly Ser Leu Glu Pro Gly
435 440 445
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Ser
450 455 460
Asp Asp Leu Val Arg His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
465 470 475 480
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Arg Glu Asp Asn Leu His
485 490 495
Thr His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
500 505 510
Cys Gly Lys Ser Phe Ser Arg Ser Asp His Leu Thr Thr His Gln Arg
515 520 525
Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser
530 535 540
Phe Ser Gln Arg Ala Asn Leu Arg Ala His Gln Arg Thr His Thr Gly
545 550 555 560
Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Leu
565 570 575
Ala His Leu Arg Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr
580 585 590
Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gln Arg Ala Asn Leu Arg
595 600 605
Ala His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu
610 615 620
Cys Gly Lys Ser Phe Ser Glu Arg Ser His Leu Arg Glu His Gln Arg
625 630 635 640
Thr His Thr Gly Glu Lys Pro Thr Gly Lys Lys Thr Ser Ala Ser Gly
645 650 655
Ser Gly Gly Gly Ser Gly Gly Ala Arg Asp Ser Lys Val Glu Asn Lys
660 665 670
Thr Lys Lys Leu Arg Val Phe Glu Ala Phe Ala Gly Ile Gly Ala Gln
675 680 685
Arg Lys Ala Leu Glu Lys Val Arg Lys Asp Glu Tyr Glu Ile Val Gly
690 695 700
Leu Ala Glu Trp Tyr Val Pro Ala Ile Val Met Tyr Gln Ala Ile His
705 710 715 720
Asn Asn Phe His Thr Lys Leu Glu Tyr Lys Ser Val Ser Arg Glu Glu
725 730 735
Met Ile Asp Tyr Leu Glu Asn Lys Thr Leu Ser Trp Asn Ser Lys Asn
740 745 750
Pro Val Ser Asn Gly Tyr Trp Lys Arg Lys Lys Asp Asp Glu Leu Lys
755 760 765
Ile Ile Tyr Asn Ala Ile Lys Leu Ser Glu Lys Glu Gly Asn Ile Phe
770 775 780
Asp Ile Arg Asp Leu Tyr Lys Arg Thr Leu Lys Asn Ile Asp Leu Leu
785 790 795 800
Thr Tyr Ser Phe Pro Cys Gln Asp Leu Ser Gln Gln Gly Ile Gln Lys
805 810 815
Gly Met Lys Arg Gly Ser Gly Thr Arg Ser Gly Leu Leu Trp Glu Ile
820 825 830
Glu Arg Ala Leu Asp Ser Thr Glu Lys Asn Asp Leu Pro Lys Tyr Leu
835 840 845
Leu Met Glu Asn Val Gly Ala Leu Leu His Lys Lys Asn Glu Glu Glu Glu
850 855 860
Leu Asn Gln Trp Lys Gln Lys Leu Glu Ser Leu Gly Tyr Gln Asn Ser
865 870 875 880
Ile Glu Val Leu Asn Ala Ala Asp Phe Gly Ser Ser Gln Ala Arg Arg
885 890 895
Arg Val Phe Met Ile Ser Thr Leu Asn Glu Phe Val Glu Leu Pro Lys
900 905 910
Gly Asp Lys Lys Pro Lys Ser Ile Lys Lys Val Leu Asn Lys Ile Val
915 920 925
Ser Glu Lys Asp Ile Leu Asn Asn Leu Leu Lys Tyr Asn Leu Thr Glu
930 935 940
Phe Lys Lys Thr Lys Ser Asn Ile Asn Lys Ala Ser Leu Ile Gly Tyr
945 950 955 960
Ser Lys Phe Asn Ser Glu Gly Tyr Val Tyr Asp Pro Glu Phe Thr Gly
965 970 975
Pro Thr Leu Thr Ala Ser Gly Ala Asn Ser Arg Ile Lys Ile Lys Asp
980 985 990
Gly Ser Asn Ile Arg Lys Met Asn Ser Asp Glu Thr Phe Leu Tyr Ile
995 1000 1005
Gly Phe Asp Ser Gln Asp Gly Lys Arg Val Asn Glu Ile Glu Phe
1010 1015 1020
Leu Thr Glu Asn Gln Lys Ile Phe Val Cys Gly Asn Ser Ile Ser
1025 1030 1035
Val Glu Val Leu Glu Ala Ile Ile Asp Lys Ile Gly Gly Pro Ser
1040 1045 1050
Gly Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys
1055 1060 1065
Lys Lys Lys Gly Ser
1070
<![CDATA[ <210> 189]]>
<![CDATA[ <211> 1376]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 189]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ctggagagaa accctacaaa tgcccggaat gcgggaagtc cttttccgaa cgctcgcacc 180
tgagggaaca ccagagaact cacaccggcg aaaaacccta taagtgccca gaatgcggaa 240
agagcttttc acggtcggac aacctcgtgc ggcaccaacg cactcatacc ggagagaagc 300
cgtacaagtg tcctgagtgc ggaaagtcat tctccgactg ccggatttg gcccgccacc 360
aaagaacaca cactggcgaa aagccctaca agtgcccgga gtgtggaaag tccttcagca 420
cttccggaga gctggtccgg caccagagga cccacaccgg ggagaagcct tacaaatgtc 480
cagagtgcgg taaaagcttc tccaccaccg gcaacctcac cgtgcaccag cggacccaca 540
ctggagaaaa gccgtataaa tgccccgaat gcggcaagag cttctcgcga tccgataagc 600
ttgtgcggca tcagagaacg cacactgggg aaaagcctta taagtgtccg gagtgcggca 660
aatccttctc ccgcactgac accctgcggg accatcagcg cacccatacc ggcaaaaaga 720
cctctgctag cggcagcggc ggcggcagcg gcggcgcccg ggacgacgcc aagagcctga 780
ccgcctggag ccggaccctg gtgaccttca aggacgtgtt cgtggacttc acccgggagg 840
agtggaagct gctggacacc gcccagcaga tcctgtaccg gaacgtgatg ctggagaact 900
acaagaacct ggtgagcctg ggctaccagc tgaccaagcc cgacgtgatc ctgcggctgg 960
agaagggcga ggagccctgg ctggtggagc gggagatcca ccaggaaacc caccccgaca 1020
gcgaaaccgc cttcgagatc aagagcagcg tgcccagcag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1320
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1376
<![CDATA[ <210> 190]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 190]]>
aacacagttc agccgagcgc t 21
<![CDATA[ <210> 191]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 191]]>
cgaacaaccg tacagaaagg g 21
<![CDATA[ <210> 192]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 192]]>
gtaaacagta atagcgcagc a 21
<![CDATA[ <210> 193]]>
<![CDATA[ <211> 1376]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 193]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tcccttgaac 120
ccggggagaa gccctacaag tgcccggaat gcggaaagag cttcagccag aagtcctcgc 180
tgatcgcgca ccagaggact cacaccggcg aaaagccata caagtgtcct gagtgtggca 240
aatccttctc gcacaagaac gcactgcaga accacccagag aacccacacc ggggaaaagc 300
cgtataagtg ccccgaatgt ggaaagtcgt tcagccagtc atccaacctc gtgcgccatc 360
aaaggactca taccggagag aaaccttaca aatgccctga atgcggcaaa tctttctccc 420
ggaatgatgc cctgaccgag caccagcgca ctcacacggg agagaagccg tacaaatgtc 480
cggagtgcgg aaagtccttc tccgacaaga aggacttgac cagacaccag cggacccata 540
ctggcgaaaa accctataag tgtccagagt gcgggaagtc ctttagccaa gccggtcacc 600
tcgcttccca ccaacggacc cacacaggag aaaagcctta taaatgcccc gagtgcggca 660
aaagcttctc cgataagaag gacctgactc ggcatcagcg cacccatacc ggaaagaaaa 720
cctcagctag cggcagcggc ggcggcagcg gcggcgcccg ggacgacgcc aagagcctga 780
ccgcctggag ccggaccctg gtgaccttca aggacgtgtt cgtggacttc acccgggagg 840
agtggaagct gctggacacc gcccagcaga tcctgtaccg gaacgtgatg ctggagaact 900
acaagaacct ggtgagcctg ggctaccagc tgaccaagcc cgacgtgatc ctgcggctgg 960
agaagggcga ggagccctgg ctggtggagc gggagatcca ccaggaaacc caccccgaca 1020
gcgaaaccgc cttcgagatc aagagcagcg tgcccagcag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1320
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1376
<![CDATA[ <210> 194]]>
<![CDATA[ <211> 1376]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 194]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggagc 120
ctggcgaaaa accctataag tgccccagaat gcggaaagag cttttcacgg tcggacaacc 180
tcgtgcggca ccaacgcact cataccggag agaagccgta caagtgtcct gagtgcggaa 240
agtcattctc cgactgccgg gatttggccc gccaccaaag aacacacact ggcgaaaagc 300
cctacaagtg cccggagtgt ggaaagtcct tcagcacttc cggagagctg gtccggcacc 360
agaggaccca caccggggag aagccttaca aatgtccaga gtgcggtaaa agcttctcca 420
ccaccggcaa cctcaccgtg caccagcgga cccaacactgg agaaaagccg tataaatgcc 480
ccgaatgcgg caagagcttc tcgcgatccg ataagcttgt gcggcatcag agaacgcaca 540
ctggggaaaa gccttataag tgtccggagt gcggcaaatc cttctcccgc actgacaccc 600
tgcgggacca ccagagaacc catactggcg agaagccata caaatgcccg gaatgtggaa 660
agagtttctc gcgcgaggac aacctccaca cccatcagcg cacccatacc ggcaaaaaga 720
cctctgctag cggcagcggc ggcggcagcg gcggcgcccg ggacgacgcc aagagcctga 780
ccgcctggag ccggaccctg gtgaccttca aggacgtgtt cgtggacttc acccgggagg 840
agtggaagct gctggacacc gcccagcaga tcctgtaccg gaacgtgatg ctggagaact 900
acaagaacct ggtgagcctg ggctaccagc tgaccaagcc cgacgtgatc ctgcggctgg 960
agaagggcga ggagccctgg ctggtggagc gggagatcca ccaggaaacc caccccgaca 1020
gcgaaaccgc cttcgagatc aagagcagcg tgcccagcag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1320
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1376
<![CDATA[ <210> 195]]>
<![CDATA[ <211> 1376]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 195]]>
aggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gcccccaaga 60
agaagcggaa ggtgggcatc cacggcgtgc ccgccgccgg cagcagcgga tccctggaac 120
ccggagagaa accctacaaa tgcccagagt gcggcaaatc gttctcacgg tccgatcacc 180
tcaccaccca ccagaggacc cataccgggg agaagcctta caagtgtcct gagtgtggaa 240
agtccttcag ccaaaagtcc tcgctgatcg cacaccagcg cacgcacact ggggaaaagc 300
catataaatg cccggagtgt ggcaaatcct tctcccgccg cgacgaactg aacgtgcacc 360
agagaaccca cactggagag aagccgtata agtgtccgga gtgcggaaag agcttctcgc 420
aatccgggga ccttcggaga catcagagga cacacactgg cgaaaagccc tataagtgcc 480
ctgagtgcgg gaagtccttt agccaagccg gtcacctggc ctccaccaa cggactcaca 540
ccggcgaaaa accgtacaag tgccccgaat gcggaaagtc gttctctacc tccggaaact 600
tgaccgaaca ccagcggacc cacaccggag aaaagccgta caaatgtcct gaatgcggca 660
aaagcttcag ccaggccggt catctcgcga gccatcagcg gactcatact ggcaaaaaga 720
cctcagctag cggcagcggc ggcggcagcg gcggcgcccg ggacgacgcc aagagcctga 780
ccgcctggag ccggaccctg gtgaccttca aggacgtgtt cgtggacttc acccgggagg 840
agtggaagct gctggacacc gcccagcaga tcctgtaccg gaacgtgatg ctggagaact 900
acaagaacct ggtgagcctg ggctaccagc tgaccaagcc cgacgtgatc ctgcggctgg 960
agaagggcga ggagccctgg ctggtggagc gggagatcca ccaggaaacc caccccgaca 1020
gcgaaaccgc cttcgagatc aagagcagcg tgcccagcag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgcctg agcggccgct taattaagct gccttctgcg gggcttgcct tctggccatg 1200
cccttcttct ctcccttgca cctgtacctc ttggtctttg aataaagcct gagtaggaag 1260
tctagaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1320
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1376
<![CDATA[ <210> 196]]>
<![CDATA[ <211> 3521]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 196]]>
gggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gccccaaaga 60
agaagagaaa ggtcggaatt catggcgtgc ccgcagccgg cagcagcggt tccctggagc 120
ccggcgagaa accttacaaa tgccccgagt gcggcaagag cttcagcaga agcgacgatc 180
tggtgaggca ccaaagaacc cacaccggcg aaaaacctta caagtgtccc gaatgcggaa 240
agtccttcag cagagaggac aatctgcaca cccaccagag aacacacacc ggagaaaagc 300
cttacaagtg ccccgaatgc ggcaaatcct tttctagaag cgatcatctg accacccacc 360
aaagaacaca taccggcgag aagccttaca aatgtcccga gtgcggaaag tccttctccc 420
agagagccaa tctgagggct catcaaagga cccataccgg cgaaaagccc tacaaatgcc 480
ccgagtgcgg aaaatccttc agccagctgg cccatctgag agcccaccaa aggacacaca 540
ccggagagaa accctataag tgccccgagt gtggaaaaag cttttcccag agggccaatc 600
tgagggccca tcagaggacc cataccggag agaagcctta taaatgtccc gagtgcggaa 660
aaagcttcag cgagaggagc catctgaggg aacatcaaag aacccacacc ggcgaaaaac 720
ccaccggaaa aaagaccagc gctagcggca gcggcggcgg cagcggcggc gcccgggaca 780
gcaaggtgga gaacaagacc aagaagctgc gggtgttcga ggccttcgcc ggcatcggcg 840
cccagcggaa ggccctggag aaggtgcgga aggacgagta cgagatcgtg ggcctggccg 900
agtggtacgt gcccgccatc gtgatgtacc aggccatcca caacaacttc cacaccaagc 960
tggagtacaa gagcgtgagc cgggaggaga tgatcgacta cctggagaac aagaccctga 1020
gctggaacag caagaaccccc gtgagcaacg gctactggaa gcggaagaag gacgacgagc 1080
tgaagatcat ctacaacgcc atcaagctga gcgagaagga gggcaacatc ttcgacatcc 1140
gggacctgta caagcggacc ctgaagaaca tcgacctgct gacctacagc ttcccctgcc 1200
aggacctgag ccagcagggc atccagaagg gcatgaagcg gggcagcggc acccggagcg 1260
gcctgctgtg ggagatcgag cgggccctgg acagcaccga gaagaacgac ctgcccaagt 1320
acctgctgat ggagaacgtg ggcgccctgc tgcacaagaa gaacgaggag gagctgaacc 1380
agtggaagca gaagctggag agcctgggct accagaacag catcgaggtg ctgaacgccg 1440
ccgacttcgg cagcagccag gcccggcggc gggtgttcat gatcagcacc ctgaacgagt 1500
tcgtggagct gcccaagggc gacaagaagc ccaagagcat caagaaggtg ctgaacaaga 1560
tcgtgagcga gaaggacatc ctgaacaacc tgctgaagta caacctgacc gagttcaaga 1620
aaaccaagag caacatcaac aaggccagcc tgatcggcta cagcaagttc aacagcgagg 1680
gctacgtgta cgaccccgag ttcaccggcc ccaccctgac cgccagcggc gccaacagcc 1740
ggatcaagat caaggacggc agcaacatcc ggaagatgaa cagcgacgag accttcctgt 1800
acatcggctt cgacagccag gacggcaagc gggtgaacga gatcgagttc ctgaccgaga 1860
accagaagat cttcgtgtgc ggcaacagca tcagcgtgga ggtgctggag gccatcatcg 1920
acaagatcgg cggccccagc ggcggcaagc ggcccgccgc caccaagaag gccggccagg 1980
ccaagaagaagaa gaagggcagc tacccctacg acgtgcccga ctacgccggg tccgccacca 2040
acttctcgct gctgaagcag gccggagacg tggaagaaaa ccctggacct accagtgccg 2100
gaaagctcgg tagcggagag ggtcggggaa gcctgcttac ttgcggcgac gtggaagaga 2160
accccggtcc gctggagggt tcgtccggct ccggatcccc caagaagaag cggaaggtgg 2220
gcatccacgg cgtgcccgcc gccggcagca gcggatccct ggagcctgga gagaaaccct 2280
acaaatgccc ggaatgcggg aagtcctttt ccgaacgctc gcacctgagg gaacaccaga 2340
gaactcacac cggcgaaaaa ccctataagt gcccagaatg cggaaagagc ttttcacggt 2400
cggacaacct cgtgcggcac caacgcactc ataccggaga gaagccgtac aagtgtcctg 2460
agtgcggaaa gtcattctcc gactgccggg atttggcccg ccaccaaaga aacacacactg 2520
gcgaaaagcc ctacaagtgc ccggagtgtg gaaagtcctt cagcacttcc ggagagctgg 2580
tccggcacca gaggacccac accggggaga agccttacaa atgtccagag tgcggtaaaa 2640
gcttctccac caccggcaac ctcaccgtgc accagcggac ccaacactgga gaaaagccgt 2700
ataaatgccc cgaatgcggc aagagcttct cgcgatccga taagcttgtg cggcatcaga 2760
gaacgcacac tggggaaaag ccttataagt gtccggagtg cggcaaatcc ttctcccgca 2820
ctgacaccct gcgggaccat cagcgcaccc ataccggcaa aaagaccctct gctagcggca 2880
gcggcggcgg cagcggcggc gcccgggacg acgccaagag cctgaccgcc tggagccgga 2940
ccctggtgac cttcaaggac gtgttcgtgg acttcacccg ggaggagtgg aagctgctgg 3000
acaccgccca gcagatcctg taccggaacg tgatgctgga gaactacaag aacctggtga 3060
gcctgggcta ccagctgacc aagcccgacg tgatcctgcg gctggagaag ggcgaggagc 3120
cctggctggt ggagcggggag atccaccagg aaacccaccc cgacagcgaa accgccttcg 3180
agatcaagag cagcgtgccc agcagcggcg gcaagcggcc cgccgccacc aagaaggccg 3240
gccaggccaa gaagaagaag ggcagctacc cctacgacgt gcccgactac gcctgagcgg 3300
ccgcttaatt aagctgcctt ctgcggggct tgccttctgg ccatgccctt cttctctccc 3360
ttgcacctgt acctcttggt ctttgaataa agcctgagta ggaagtctag aaaaaaaaaa 3420
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 3480
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa a 3521
<![CDATA[ <210> 197]]>
<![CDATA[ <211> 3521]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 197]]>
gggaaataag agagaaaaga agagtaagaa gaaatataag agccaccatg gccccaaaga 60
agaagagaaa ggtcggaatt catggcgtgc ccgcagccgg cagcagcggt tccctggagc 120
ctggagagaa accctacaaa tgcccggaat gcgggaagtc cttttccgaa cgctcgcacc 180
tgagggaaca ccagagaact cacaccggcg aaaaacccta taagtgccca gaatgcggaa 240
agagcttttc acggtcggac aacctcgtgc ggcaccaacg cactcatacc ggagagaagc 300
cgtacaagtg tcctgagtgc ggaaagtcat tctccgactg ccggatttg gcccgccacc 360
aaagaacaca cactggcgaa aagccctaca agtgcccgga gtgtggaaag tccttcagca 420
cttccggaga gctggtccgg caccagagga cccacaccgg ggagaagcct tacaaatgtc 480
cagagtgcgg taaaagcttc tccaccaccg gcaacctcac cgtgcaccag cggacccaca 540
ctggagaaaa gccgtataaa tgccccgaat gcggcaagag cttctcgcga tccgataagc 600
ttgtgcggca tcagagaacg cacactgggg aaaagcctta taagtgtccg gagtgcggca 660
aatccttctc ccgcactgac accctgcggg accatcagcg cacccatacc ggcaaaaaga 720
cctctgctag cggcagcggc ggcggcagcg gcggcgcccg ggacgacgcc aagagcctga 780
ccgcctggag ccggaccctg gtgaccttca aggacgtgtt cgtggacttc acccgggagg 840
agtggaagct gctggacacc gcccagcaga tcctgtaccg gaacgtgatg ctggagaact 900
acaagaacct ggtgagcctg ggctaccagc tgaccaagcc cgacgtgatc ctgcggctgg 960
agaagggcga ggagccctgg ctggtggagc gggagatcca ccaggaaacc caccccgaca 1020
gcgaaaccgc cttcgagatc aagagcagcg tgcccagcag cggcggcaag cggcccgccg 1080
ccaccaagaa ggccggccag gccaagaaga agaagggcag ctacccctac gacgtgcccg 1140
actacgccgg gtccgccacc aacttctcgc tgctgaagca ggccggagac gtggaagaaa 1200
accctggacc taccagtgcc ggaaagctcg gtagcggaga gggtcgggga agcctgctta 1260
cttgcggcga cgtggaagag aaccccggtc cgctggaggg ttcgtccggc tccggatccc 1320
ccaagaagaa gcggaaggtg ggcatccacg gcgtgcccgc cgccggcagc agcggatccc 1380
tggagcccgg cgagaaacct tacaaatgcc ccgagtgcgg caagagcttc agcagaagcg 1440
acgatctggt gaggcaccaa agaacccaca ccggcgaaaa accttacaag tgtcccgaat 1500
gcggaaagtc cttcagcaga gaggacaatc tgcacaccca ccagagaaca cacaccggag 1560
aaaagcctta caagtgcccc gaatgcggca aatccttttc tagaagcgat catctgacca 1620
cccaccaaag aacacatacc ggcgagaagc cttacaaatg tcccgagtgc ggaaagtcct 1680
tctccccagag agccaatctg agggctcatc aaaggaccca taccggcgaa aagccttaca 1740
aatgccccga gtgcggaaaa tccttcagcc agctggccca tctgagagcc caccaaagga 1800
cacacaccgg agagaaaccc tataagtgcc ccgagtgtgg aaaaagcttt tcccagaggg 1860
ccaatctgag ggcccatcag aggacccata ccggagagaa gccttataaa tgtcccgagt 1920
gcggaaaaag cttcagcgag aggagccatc tgagggaaca tcaaagaacc cacaccggcg 1980
aaaaacccac cggaaaaaag accagcgcta gcggcagcgg cggcggcagc ggcggcgccc 2040
gggacagcaa ggtggagaac aagaccaaga agctgcgggt gttcgaggcc ttcgccggca 2100
tcggcgccca gcggaaggcc ctggagaagg tgcggaagga cgagtacgag atcgtgggcc 2160
tggccgagtg gtacgtgccc gccatcgtga tgtaccaggc catccacaac aacttccaca 2220
ccaagctgga gtacaagagc gtgagccggg aggagatgat cgactacctg gagaacaaga 2280
ccctgagctg gaacagcaag aacccccgtga gcaacggcta ctggaagcgg aagaaggacg 2340
acgagctgaa gatcatctac aacgccatca agctgagcga gaaggagggc aacatcttcg 2400
acatccggga cctgtacaag cggaccctga agaacatcga cctgctgacc tacagcttcc 2460
cctgccagga cctgagccag cagggcatcc agaagggcat gaagcggggc agcggcaccc 2520
ggagcggcct gctgtggggag atcgagcggg ccctggacag caccgagaag aacgacctgc 2580
ccaagtacct gctgatggag aacgtgggcg ccctgctgca caagaagaac gaggaggagc 2640
tgaaccagtg gaagcagaag ctggagagcc tgggctacca gaacagcatc gaggtgctga 2700
acgccgccga cttcggcagc agccaggccc ggcggcgggt gttcatgatc agcaccctga 2760
acgagttcgt ggagctgccc aagggcgaca agaagcccaa gagcatcaag aaggtgctga 2820
acaagatcgt gagcgagaag gacatcctga acaacctgct gaagtacaac ctgaccgagt 2880
tcaagaaaac caagagcaac atcaacaagg ccagcctgat cggctacagc aagttcaaca 2940
gcgagggcta cgtgtacgac cccgagttca ccggccccac cctgaccgcc agcggcgcca 3000
acagccggat caagatcaag gacggcagca acatccggaa gatgaacagc gacgagacct 3060
tcctgtacat cggcttcgac agccaggacg gcaagcgggt gaacgagatc gagttcctga 3120
ccgagaacca gaagatcttc gtgtgcggca acagcatcag cgtggaggtg ctggaggcca 3180
tcatcgacaa gatcggcggc cccagcggcg gcaagcggcc cgccgccacc aagaaggccg 3240
gccaggccaa gaagaagaag ggcagctacc cctacgacgt gcccgactac gcctgagcgg 3300
ccgcttaatt aagctgcctt ctgcggggct tgccttctgg ccatgccctt cttctctccc 3360
ttgcacctgt acctcttggt ctttgaataa agcctgagta ggaagtctag aaaaaaaaaa 3420
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 3480
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa a 3521
<![CDATA[ <210> 198]]>
<![CDATA[ <211> 1385]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 198]]>
gaaataagag agaaaagaag agtaagaaga aatataagac ctaatagtaa tagtaatagc 60
ccaagaagaaa gcgcaaggtc gggatccacg gagtcccggc agcaggatcc tcaggctcac 120
tggaaccggg ggaaaaaccc tacaagtgcc cggagtgcgg caagagcttc tcgacctccg 180
ggaacctgac cgagcaccag cgcacccaca ccggagagaa accgtacaag tgccccgagt 240
gcgggaaatc gttctcagac tcgggaaacc tcagggtgca ccagcgggacc cacacgggggg 300
aaaagccgta caagtgcccg gagtgcggga agtcattctc ccacaagaac gcgctgcaga 360
accaccaaag aacccacacc ggcgaaaaac cgtacaagtg ccccgagtgc ggaaagtcct 420
tctcccgcaa cgacaccctc accgaacacc aacgcaccca caccggagaa aagccctaca 480
agtgcccgga gtgcggaaag agcttcagcc agagggccca cctggaaaga caccagagaa 540
cccacaccgg cgaaaagccg tacaagtgcc cggagtgcgg gaagtccttc agccggtcag 600
acaagctggt ccgccaccaa aggacccaca caggagaaaa gccctacaag tgcccggagt 660
gcggaaaatc gttcagcgac cccggacacc tggtccggca ccagaggacc cacaccgggg 720
agaagccgac cggcaaaaag acctcagcaa gcgggagcgg aggaggaagc ggaggggacg 780
ccaagagcct gaccgcgtgg agcaggaccc tcgtgacctt caaagacgtg ttcgtggact 840
tcacccgcga agagtggaag ctgctggaca ccgcgcagca aatcctgtac cggaacgtga 900
tgctggaaaa ctacaagaac ctcgtcagcc tcggctacca actgaccaag cccgacgtga 960
tcctgcggct ggaaaagggc gaagaaccct ggctcgtgga gcgggagatc caccaggaaa 1020
cccaccccgga cagcgaaacc gcgttcgaaa tcaagagcag cgtcagcgga ggaaaaagac 1080
cagccgccac caagaaggcc ggccaagcca agaagaaaaa gggcagctac ccctacgacg 1140
tgccggacta cgcatgagcg gccgcttaat taagctgcct tctgcggggc ttgccttctg 1200
gccatgccct tcttctctcc cttgcacctg tacctcttgg tctttgaata aagcctgagt 1260
aggaagtcta gaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1320
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1380
aaaaa 1385
<![CDATA[ <210> 199]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 199]]>
acggggaatg ctgccgagag c 21
<![CDATA[ <210> 200]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 200]]>
acctgaaccc tggaaattat a 21
<![CDATA[ <210> 201]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 201]]>
tagacgggga atgctgccga g 21
<![CDATA[ <210> 202]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 202]]>
tgacattgag caatgatatg g 21
<![CDATA[ <210> 203]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 203]]>
agcaaaagaa aatggtaggc g 21
<![CDATA[ <210> 204]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 204]]>
acggggaatg ctgccgagag c 21
<![CDATA[ <210> 205]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 205]]>
acggggaatg ctgccgagag c 21
<![CDATA[ <210> 206]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Sapiens]]>
<![CDATA[ <400> 206]]>
agcaaaagaa aatggtaggc g 21
<![CDATA[ <210> 207]]>
<![CDATA[ <211> 5]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of Artificial Sequences: Synthetic Peptides]]>
<![CDATA[ <400> 207]]>
Gly Gly Gly Gly Ser
1 5
<![CDATA[ <210> 208]]>
<![CDATA[ <211> 830]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 208]]>
gatgttcatt agcagtggtg ataggttaat tttcaccatc tcttatgcgg ttgaatagtc 60
acctctgaac cactttttcc tccagtaact cctctttctt cggaccttct gcagccaacc 120
tgaaagaata acaaggaggt ggctggaaac ttgttttaag gaaccgcctg tccttccccc 180
gctggaaacc ttgcacctcg gacgctcctg ctcctgcccc cacctgaccc ccgccctcgt 240
tgacatccag gcgcgatgat ctctgctgcc agtagagggc acacttactt tactttcgca 300
aacctgaacg cgggtgctgc ccagagagggg ggcggaggga aagacgcttt gcagcaaaat 360
ccagcatagc gattggttgc tccccgcgtt tgcggcaaag gcctggaggc aggagtaatt 420
tgcaatcctt aaagctgaat tgtgcagtgc atcggatttg gaagctacta tattcactta 480
acacttgaac gctgagctgc aaactcaacg ggtaataacc catcttgaac agcgtacatg 540
ctatacacgc acccctttcc cccgaattgt tttctctttt ggaggtggtg gagggagaga 600
aaagtttact taaaatgcct ttgggtgagg gaccaaggat gagaagaatg ttttttgttt 660
ttcatgccgt ggaataacac aaaataaaaa atcccgaggg aatatcatatt atatattaaa 720
tatagatcat ttcagggagc aaacaaatca tgtgtggggc tgggcaacta gctaagtcga 780
agcgtaaata aaatgtgaat acacgtttgc gggttacatacagtgcactt 830
<![CDATA[ <210> 209]]>
<![CDATA[ <211> 830]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic polynucleotides]]>
<![CDATA[ <400> 209]]>
taggcatcgt tttcctcctt atgcctctat cattcctccc tatctacact aacatcccac 60
gctctgaacg cgcgcccatt aatacccttc tttcctccac tctccctggg actcttgatc 120
aaagcgcggc cctttcccca gccttagcga ggcgccctgc agcctggtac gcgcgtggcg 180
tggcggtggg cgcgcagtgc gttctcggtg tggagggcag ctgttccgcc tgcgatgatt 240
tatactcaca ggacaaggat gcggtttgtc aaacagtact gctacggagg agcagcagag 300
aaagggagag ggtttgagag ggagcaaaag aaaatggtag gcgcgcgtag ttaattcatg 360
cggctctctt actctgttta catcctagag ctagagtgct cggctgcccg gctgagtctc 420
ctccccacct tccccaccct ccccaccctc cccataagcg cccctcccgg gttcccaaag 480
cagagggcgt gggggaaaag aaaaaagatc ctctctcgct aatctccgcc caccggccct 540
ttataatgcg agggtctgga cggctgagga cccccgagct gtgctgctcg cggccgccac 600
cgccgggccc cggccgtccc tggctcccct cctgcctcga gaagggcagg gcttctcaga 660
ggcttggcgg gaaaaagaac ggagggaggg atcgcgctga gtataaaagc cggttttcgg 720
ggctttatct aactcgctgt agtaattcca gcgagaggca gagggagcga gcgggcggcc 780
ggctagggtg gaagagccgg gcgagcagag ctgcgctgcg ggcgtcctgg 830
<![CDATA[ <210> 210]]>
<![CDATA[ <211> 7]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Description of artificial sequences: synthetic oligonucleotides]]>
<![CDATA[ <400> 210]]>
tatawaw 7
Claims (57)
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202163212991P | 2021-06-21 | 2021-06-21 | |
US63/212,991 | 2021-06-21 | ||
US202163281022P | 2021-11-18 | 2021-11-18 | |
US63/281,022 | 2021-11-18 | ||
PCT/US2021/010059 WO2022132195A2 (en) | 2020-12-15 | 2021-12-15 | Compositions and methods for modulation myc expression |
WOPCT/US21/10059 | 2021-12-15 |
Publications (1)
Publication Number | Publication Date |
---|---|
TW202317601A true TW202317601A (en) | 2023-05-01 |
Family
ID=87378760
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW111112954A TW202317601A (en) | 2021-06-21 | 2022-04-01 | Compositions and methods for modulating myc expression |
Country Status (1)
Country | Link |
---|---|
TW (1) | TW202317601A (en) |
-
2022
- 2022-04-01 TW TW111112954A patent/TW202317601A/en unknown
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP2023119040A (en) | Methods and compositions for modulating gene expression | |
US20230399640A1 (en) | Compositions and methods for inhibiting gene expression | |
US20210322577A1 (en) | Methods and systems for modifying dna | |
US20240132559A1 (en) | Compositions and methods for modulating myc expression | |
WO2020081598A1 (en) | Disrupting genomic complex assembly in fusion genes | |
US20140248254A1 (en) | The use of ache as nuclease | |
US20220403387A1 (en) | Modulating genomic complexes | |
TW202317601A (en) | Compositions and methods for modulating myc expression | |
US20220348893A1 (en) | Methods and compositions for modulating frataxin expression and treating friedrich's ataxia | |
US20220267756A1 (en) | Compositions and methods for modulating genomic complex integrity index | |
CN116829175A (en) | Compositions and methods for modulating MYC expression | |
TW202408541A (en) | Formulations for modulating myc expression | |
WO2024040229A2 (en) | Combination therapies comprising myc modulators and checkpoint inhibitors | |
CA3196827A1 (en) | Compositions and methods for inhibiting gene expression | |
US20230374549A1 (en) | Compositions and methods for inhibiting the expression of multiple genes | |
TW202417012A (en) | Combination therapies comprising myc modulators and checkpoint inhibitors | |
CN116635086A (en) | Compositions and methods for inhibiting expression of multiple genes | |
CN116615539A (en) | Compositions and methods for inhibiting gene expression | |
WO2010043243A1 (en) | Novel diagnostic and therapeutic agents |